Found 15,149 repositories(showing 30)
tesseract-ocr
Tesseract Open Source OCR Engine (main repository)
naptha
Pure Javascript OCR for more than 100 Languages ๐๐๐ฅ
ocrmypdf
OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them to be searched
pymupdf
PyMuPDF is a high performance Python library for data extraction, analysis, conversion & manipulation of PDF (and other) documents.
tesseract-ocr
Trained models with fast variant of the "best" LSTM models + legacy models
kreuzberg-dev
A polyglot document intelligence framework with a Rust core. Extract text, metadata, images, and structured information from PDFs, Office documents, images, and 91+ formats. Available for Rust, Python, Ruby, Java, Go, PHP, Elixir, C#, R, C, TypeScript (Node/Bun/Wasm/Deno)- or use via CLI, REST API, or MCP server.
madmaze
A Python wrapper for Google Tesseract
aisingapore
Free RPA tool by AI Singapore
tebelorg
Python package for doing RPA
python็ฌ่ซๆ็จ๏ผๅธฆไฝ ไป้ถๅฐไธ๏ผๅ ๅซjs้ๅ๏ผselenium, tesseract OCR่ฏๅซ,mongodb็ไฝฟ็จ๏ผไปฅๅscrapyๆกๆถ
gali8
Tesseract OCR iOS is a Framework for iOS7+, compiled also for armv7s and arm64.
rmtheis
Fork of Tesseract Tools for Android
otiai10
Go package for OCR (Optical Character Recognition), by using Tesseract C++ library
thiagoalessio
A wrapper to work with Tesseract OCR inside PHP.
Dicklesworthstone
Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs
charlesw
A .Net wrapper for tesseract-ocr
tesseract-ocr
Tesseract documentation
rmtheis
Experimental optical character recognition app
sirfz
A Python wrapper for the tesseract-ocr API
Akylas
Document scanning app
Pulover
Automation Utility - Recorder & Script Generator
manisandro
A Gtk/Qt front-end to tesseract-ocr.
ianzhao
Python tool for grabbing text via screenshot
nguyenq
Java JNA wrapper for Tesseract OCR API
JinpengLI
make a better chinese character recognition OCR than tesseract
tleyden
Run your own OCR-as-a-Service using Tesseract and Docker
ryfeus
Precompiled packages for AWS Lambda
GauravSingh9356
Personal Assistant built using python libraries. It does almost anything which includes sending emails, Optical Text Recognition, Dynamic News Reporting at any time with API integration, Todo list generator, Opens any website with just a voice command, Plays Music, Wikipedia searching, Dictionary with Intelligent Sensing i.e. auto spell checking, Weather Reporting i.e. temp, wind speed, humidity, YouTube searching, Google Map searching, Youtube Downloading, etc.
openpaperwork
A Python wrapper for Tesseract and Cuneiform -- Moved to Gnome's Gitlab
adaptech-cz
Fork of tess-two rewritten from scratch to support latest version of Tesseract OCR.