Search Results

Found 99 repositories(showing 30)

llm_aided_ocr

Dicklesworthstone

💛75

Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs

2.9k

204

NOASSERTION

Python

Updated 2 days ago

ai-assistllama2llm+3

ipcams_and_webcams_licence_plate_reader

Roee-BY

❤️40

Using yolov4 object detection and image processing with OpenCV, the project enables detecting and reading licence plates and storing a log plates detected. The detection is achieved by detecing all the vehicles in the photo using a yolov4 tiny model that was converted to tflite and for each of the detected vehicles using image processing the lince plate is detected and after enhancing the contrast between the letters and the background the letters the licence plate is being processed by tesseract-ocr and we get the licence plate number.

MIT

Jupyter Notebook

Updated 1 year ago

garage-doorgarage-door-openeripcam+3

Fine-Tuning-an-Arabic-OCR-Model-using-Tesseract-5.0

OmarSamirz

❤️40

This research aims to fine-tune an Arabic OCR model using Tesseract 5.0, enhancing text recognition accuracy through extensive data collection, preprocessing, and image generation. By leveraging advanced training techniques and data augmentation, we achieve significant improvements in word error rates (WER).

MIT

Jupyter Notebook

Updated 6 months ago

arabic-ocrarabic-ocr-modelarabic-tesseract-ocr+8

tesseract-ocr-enhanced

jo-valer

❤️40

Preprocessing methods to enhance Tesseract-OCR in the case of printed text on difficult background, or handwritten text on lined/squared paper.

MIT

Jupyter Notebook

Updated 1 year ago

handwritten-character-recognitionhandwritten-text-recognitionhtr+6

Text-Recognition-System

VitaliyDatsyshyn

❤️25

The aim of project is the development of an automated text recognition system with the support of the Ukrainian language, which focuses on the pre-processing of images and processing of recognized text. Two models of machine learning have been developed: to determine the angle at which the image is rotated and to determine the type of document. Algorithms for image enhancement (binarization, noise removal, contrast adjustment) and an algorithm for correcting errors in text using fuzzy string logic and Levenshtein distance have also been developed. The system is presented as a desktop application. The server part was developed using the C# programming language and ASP.NET Core, ML.NET frameworks, client part - WPF (Windows Presentation Foundation), text recognition - Tesseract OCR.

Apache-2.0

Updated 1 year ago

Newspaper-OCR-V3

PapaKaffey

❤️45

GPU-enhanced OCR pipeline for historical newspapers using Google Vision or Tesseract

Python

Updated 1 month ago

google-vision-ocrhistorical-documentsnewspapers+2

TesseractCSharp

Dynaruid

❤️20

This project is a modified version of the charlesw/tesseract library, designed to enhance OCR functionalities within the .NET ecosystem by improving cross-platform support. It aims to make the Tesseract OCR engine more effective in .NET environments.

Apache-2.0

Updated 9 months ago

portfolio-website

ShibaniJeevanandham

❤️35

An image processing project that captures vehicle images and extracts the license plate using OpenCV, enhancing the clarity of the plate using image enhancement techniques. Tech Stack: Python, OpenCV, NumPy, Tesseract OCR Skills Highlighted: Computer Vision, Python, Image Processing

Updated 10 months ago

Pharmacist-Assistant

KristaChauhan

❤️35

Pharmacist Assistant - AI-Powered Prescription Reader An AI-driven tool that extracts handwritten prescription text using OCR (Tesseract) and validates medicine names. Built with Flask, OpenCV, and AI, it enhances accuracy, reduces errors, and streamlines pharmacy workflows by automating prescription processing

Python

Updated 4 months ago

Image-Enhancement-And-Digitization-of-Distorted-Documents

BSK99

❤️35

Enhancing Image quality for better understanding by Tesseract OCR

Python

Updated 3 years ago

Doc-Scanner-Project

krushangptl

❤️40

A lightweight, end-to-end document scanner built using OpenCV and Tesseract OCR. This project detects documents (like receipts, paper, forms), extracts the perspective-warped version, enhances it, and finally uses OCR to extract the textual content from the image.

MIT

Jupyter Notebook

Updated 4 months ago

documentscannermachine-learningpython

Automatic-Number-Plate-recognition-ANPR-

Daxx25

🧡55

Developed an OCR-based ANPR system using Python, OpenCV, and Tesseract, achieving 90% text extraction accuracy. Integrated a SQL database for real-time license plate data storage and retrieval, enhancing traffic monitoring.

Python

Updated 1 week ago

Text-Extraction

HBX814

❤️35

This project automates text extraction from Hindi/Sanskrit PDFs using a pipeline that converts pages to images with pdf2image and applies OCR via Tesseract. It cleans and segments text based on Devanagari punctuation, preserving key elements like dates. The output, structured for NLP tasks, enhances accessibility to Indian language documents.

Python

Updated 11 months ago

SmartOCR

georgecpp

❤️35

Android Native app that demonstrates a heavy using of NDK (C++) for performing advanced image preprocessing (Sauvola Thresholding / Binarization) for enhancing Tesseract OCR engine results.

Updated 1 year ago

OCR_postprocessing

SiweiLiuCB

❤️35

In this project, I created an OCR post-processing procedure to enhance Tesseract OCR output

Updated 6 years ago

Bank_cheque_OCR

srushtikandagal

❤️35

Bank Cheque OCR automates cheque data extraction using OpenCV, PIL, and Tesseract OCR, enhancing accuracy in retrieving key details like cheque number, account number, date, and amount.

Jupyter Notebook

Updated 9 months ago

Dotslash5.0HackAttack

DarkKnightSgh

❤️35

Team HackAttack:Our solution combines state-of-the-art technologies to enhance web accessibility for visually impaired users. Leveraging the Vision Transformer model, we achieve real-time image recognition, accurately identifying objects and text within images. The integration of the Tesseract OCR API enables dynamic text extraction from images, pr

Python

Updated 1 year ago

deep-learningflaskgtts+7

Text-Detection-and-Recognition

MohabEldemery

❤️35

This Python code leverages the EAST text detection model and Tesseract OCR to detect and recognize text in images, enhancing text extraction for various applications.

Python

Updated 6 months ago

hackathon-backend

kchanda24

❤️40

Enterprise Content Management MVP with semantic search capabilities. Upload PDFs and images OCR text extraction from images using Tesseract LLM-assisted metadata suggestion using Google Gemini (enhanced with OCR text) Semantic and exact metadata search Image similarity search using OpenCLIP Vector embeddings with Qdrant SQLite database

NOASSERTION

Python

Updated 6 months ago

aiaiengineeringaiml+2

OCR-PDF-to-Text-Converter

abishekA7

❤️35

A Python-based OCR tool for extracting text from PDF and image files using Tesseract OCR. The script includes advanced image preprocessing techniques to enhance text recognition accuracy and outputs extracted text as .txt files. Ideal for processing scanned or handwritten documents.

Python

Updated 1 year ago

Printchakra-AI

chaman2003

🧡55

Al-powered document scanning and processing system with real-time desktop-mobile synchronization. Built with Flask (Python) backend, React + TypeScript frontend, OpenCV image enhancement, Tesseract OCR and Socket.IO WebSockets for seamless printing and workflow management.

Jupyter Notebook

Updated 3 weeks ago

computer-visionflaskhtml-css-javascript+5

license-plate-recognition

sakthi-da

❤️35

Made enhancements in the process of detection and recognition of license plate images with YOLOv8 algorithm in an object detection system and the classified images sent through CNN and Tesseract OCR based combined algorithms for the character recognition process.

Updated 1 year ago

wins_cc

snow030

🧡50

wins_cc is a lightweight, real-time English-to-Chinese interpreting tool designed for Windows 11, built with Python. It enhances the native Live Caption feature by adding automatic OCR (via Tesseract) and seamless machine translation (via Chrome’s built-in translation API).

MIT

Python

Updated 2 months ago

Image-Text-to-Speech-Conversion

Reddiar890

❤️35

Developed an innovative real-time Image Text to Speech (ITTS) system using Python, OpenCV, Tesseract OCR, and Google Text-to-Speech API to enhance accessibility for visually impaired individuals. The system accurately extracts text from live video streams and converts it into natural speech.

Jupyter Notebook

Updated 1 year ago

MedXpert-Backend-FastAPI

JaspreetSingh-exe

❤️40

AI-powered medical report analyzer that extracts text from PDFs/images, summarizes reports, detects abnormalities, and provides a chatbot for medical queries. Built with FastAPI, OCR (Tesseract, pdfplumber), OpenAI GPT-3.5, and deployed on Google Cloud. Future enhancements include medical image classification and predictions. Contributions Welcome!

Apache-2.0

Python

Updated 1 year ago

artificial-intelligencedockerfastapi+10

License_Plate_Identification_and_Feeing_System

Kybfl

❤️35

This project is a YOLOv8n-based license plate recognition system using transfer learning and CUDA for fast detection. OpenCV enhances plate visibility, Tesseract OCR extracts alphanumeric characters, and text is cleaned for accuracy. A dark-themed Tkinter interface enables easy model loading, image selection, and result display.

Python

Updated 3 months ago

ocrify

koligaurav462

❤️40

A powerful web-based OCR application built with Flask, supporting both Tesseract and EasyOCR engines. It provides high-accuracy text extraction with multi-language support, GPU acceleration via PyTorch and CUDA, and image preprocessing using CLAHE to enhance recognition. User-friendly interface for real-time image-to-text conversion.

Apache-2.0

Python

Updated 5 months ago

Voter-List-Hindi-Scanned-PDF-OCR-and-Extraction-to-Excel

almaas21

❤️35

This Python project enables the extraction of data from scanned PDFs of voter lists in Hindi, sourced from the official government electoral roll website. Using Tesseract OCR and PDF enhancement tools, it converts scanned PDFs into structured Excel files with columns for Name, Father/Husband Name, House Number, Age, and Gender.

Jupyter Notebook

Updated 12 months ago

tesseract-ocr-EnhancedDataFiles-Persian

MahdiRasaeizadeh77

❤️40

Developed and enhanced .NET-based components for document management systems, including OCR integration for Persian documents, Enahanced Persian Data Files include more efficiency according to Default tesseract files.

Apache-2.0

Updated 7 months ago

Sinhala-OCR-Converter

RavinAr1

❤️45

A simple OCR tool that converts Sinhala PDFs and images into editable Word documents using enhanced Tesseract OCR

TypeScript

Updated 2 months ago

GitHub Explorer

Search Results

llm_aided_ocr

ipcams_and_webcams_licence_plate_reader

Fine-Tuning-an-Arabic-OCR-Model-using-Tesseract-5.0

tesseract-ocr-enhanced

Text-Recognition-System

Newspaper-OCR-V3

TesseractCSharp

portfolio-website

Pharmacist-Assistant

Image-Enhancement-And-Digitization-of-Distorted-Documents

Doc-Scanner-Project

Automatic-Number-Plate-recognition-ANPR-

Text-Extraction

SmartOCR

OCR_postprocessing

Bank_cheque_OCR

Dotslash5.0HackAttack

Text-Detection-and-Recognition

hackathon-backend

OCR-PDF-to-Text-Converter

Printchakra-AI

license-plate-recognition

wins_cc

Image-Text-to-Speech-Conversion

MedXpert-Backend-FastAPI

License_Plate_Identification_and_Feeing_System

ocrify

Voter-List-Hindi-Scanned-PDF-OCR-and-Extraction-to-Excel

tesseract-ocr-EnhancedDataFiles-Persian

Sinhala-OCR-Converter

llm_aided_ocr

ipcams_and_webcams_licence_plate_reader

Fine-Tuning-an-Arabic-OCR-Model-using-Tesseract-5.0

tesseract-ocr-enhanced

Text-Recognition-System

Newspaper-OCR-V3

TesseractCSharp

portfolio-website

Pharmacist-Assistant

Image-Enhancement-And-Digitization-of-Distorted-Documents

Doc-Scanner-Project

Automatic-Number-Plate-recognition-ANPR-

Text-Extraction

SmartOCR

OCR_postprocessing

Bank_cheque_OCR

Dotslash5.0HackAttack

Text-Detection-and-Recognition

hackathon-backend

OCR-PDF-to-Text-Converter

Printchakra-AI

license-plate-recognition

wins_cc

Image-Text-to-Speech-Conversion

MedXpert-Backend-FastAPI

License_Plate_Identification_and_Feeing_System

ocrify

Voter-List-Hindi-Scanned-PDF-OCR-and-Extraction-to-Excel

tesseract-ocr-EnhancedDataFiles-Persian

Sinhala-OCR-Converter