Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
-
Updated
Apr 14, 2024 - Python
Tika-Python is a Python binding to the Apache Tika™ REST services allowing Tika to be called natively in the Python community.
A windows service wrapper for the tika JSR 311 network server.
📄🚀 Unleash a powerful Document Search Engine with Apache NiFi for lightning-fast, comprehensive text indexing and search.
Our project is a testament to this need, offering a comprehensive solution that combines modern technologies and architectures to create a powerful document search engine. This engine is not just a tool but a sophisticated ecosystem designed to handle complex data processing and retrieval tasks.
Contains a custom tika 2.x server docker image.
If you are too lazy to read the whole document then generate wordart and keywords.
Extract and Visualize location from any file
Apache Tika Server as Debian GNU/Linux and Ubuntu Linux package
Web crawler with search indexing
A Windows Installer (MSI) for the windows service wrapper of the tika JSR 311 network server.
Tesseract OCR wrapper for Apache Tika and/or Open Semantic ETL caching the OCR results, so Tika-Server or Open Semantic ETL has not to reprocess slow and expensive OCR on same images again
Text extraction from scanned pdf documents in java
Container-ized (Docker) GeoTopicParser-Enabled Apache Tika Server with Lucene Geo Gazetteer.
A doc searcher of the documents on the local host that is based on: Tika+OCR, ElasticSearch and Kibana
Configurable Tika Server docker image. https://hub.docker.com/repository/docker/kujira/tika
Application in php to test load of pdf files, using docker-compose and apache-tika.
Polymer 3.0 app for Apache Tika.
Add a description, image, and links to the tika-server topic page so that developers can more easily learn about it.
To associate your repository with the tika-server topic, visit your repo's landing page and select "manage topics."