Found 6 repositories(showing 6)
vpnry
Convert documents to txt with tika-python
scottcode
An example of using the Python tika package to extract text and metadata from files. The tika package wraps the Apache Tika Java library.
Mikardis1
Convert PDF text to speech with a simple Python GUI using Tkinter, gTTS, and Tika.
reuladair
An example of using the tika-python api to convert assorted document types to text for NLP processing
felipepov
A document processing tool using Apache Tika to extract metadata and analyze Zipf's Law distributions across multilingual texts (Java/Python).
veerakumar01
AI Document Companion An AI-powered document assistant built with Python and FastAPI. Users can upload PDF, DOCX, or TXT files, extract text using Apache Tika, and get intelligent summaries and answers to questions using the LLAMA3 language model.
All 6 repositories loaded