Search Results

Found 495 repositories(showing 30)

OCR-Invoice

robela

❤️36

a console application that would run on Windows server to scan user’s Bill and Receipts, which are either captured by camera or in form of an electronic file like pdf etc. 1. All the invoices/receipts will be uploaded on server in a folder 2. The uploaded invoices/receipts will be scanned by OCR app and extract following information from the file and put them in database table - Vendor/Party Name - Invoice date - Tax amount - Total amount - Line items(Item Name, Item Qty, Item rate, Item Tax & Item Amount) 3. The processing of OCR should be done with 90% of accuracy 4. Application designed be able to handle the noise & quality of the uploaded invoice images.

149

Updated 5 months ago

german-ocr

Keyvanhardani

🧡60

German-OCR is specifically trained to extract text from German documents including invoices, receipts, forms, and other business documents.

Apache-2.0

Python

Updated 2 weeks ago

fine-tuninggerman-ocrllm+2

laravel-ocr

mayaramyadav

🧡55

Laravel OCR & Document Data Extractor – A powerful OCR and document parsing engine for Laravel. It provides intelligent text extraction, structured data parsing, and AI-powered cleanup for documents like invoices, receipts, and PDFs.

PHP

Updated 1 week ago

laravellaravel-packageocr+2

Table_Recognition_Project

ARUN-S-CODER

❤️45

Extract tables from invoice images, process text using OCR, extract entities and relationships using LLM and traditional methods, and construct a visual knowledge graph.

MIT

Python

Updated 1 month ago

invoice-scanner

dinispeixoto

🧡50

System that uses OCR (Optical Character Recognition) to extract data from invoice photos (e.g. products, supermarket, prices), and displays it in a dashboard.

Shell

Updated 4 weeks ago

Receipt-Scanner-in-OpenCV

agrawalamod

🧡65

Image Analysis Project: Built a system to take images of receipts or invoices as input and segment the text on the receipt. Using Tesseract OCR, we extracted the text from the receipts and stored it in a file. Future work: Use NLP libraries to get structured data.

C++

Updated 5 days ago

Invoice-Data-Extraction-System

mjawadshahid

❤️45

Automate the extraction of key data fields from invoice images using YOLOv8 and OCR. Train custom models to detect fields like invoice ID, total amount, and address, then extract text and export to Excel. Ideal for streamlining data entry and reducing manual effort.

Jupyter Notebook

Updated 1 month ago

Receipt-Scanner-for-Android

agrawalamod

❤️35

Ported OpevCV version to Android. Image Analysis Project: Built a system to take images of receipts or invoices as input and segment the text on the receipt. Using Tesseract OCR, we extracted the text from the receipts and stored it in a file.

Java

Updated 10 months ago

OCR-Net-MAUI

Bliitze

❤️40

Optical character recognition (OCR) allows you to extract printed or handwritten text from images, such as photos of street signs and products, as well as from documents—invoices, bills, financial reports, articles, and more. Microsoft's OCR technologies support extracting printed text in several languages.

MIT

Updated 2 years ago

Extracting-information-from-PDF-files-using-OCR-and-NLP

archowdhury

❤️35

Extracting relevant information like invoice number, date, amount etc. from PDF files using OCR and NLP techniques

Jupyter Notebook

Updated 6 months ago

TesserXtract.AI

giruu

❤️35

This Flask application empowers users to seamlessly upload image files like invoices or receipts, extract text using robust OCR technologies, and efficiently isolate key fields using precise regular expressions and multiprocessing to streamline data extraction and enhance productivity.

Python

Updated 5 months ago

aicssflask+16

TensorFlow-OCR-Invoice-Extractor

hrushikesh009

❤️35

A TensorFlow OCR solution,Leveraging advanced object detection models like EfficientDet, this tool simplifies date retrieval, streamlining restaurant management processes. Enhance accuracy and efficiency in financial record-keeping with InvoiceDateExtractor.

Jupyter Notebook

Updated 10 months ago

artificial-intelligenceocr-recognitionpython3+1

ParseExtract

ai92-github

🧡65

Extract data from images, pdf, invoices, receipts | Extract tables from pdf, images and convert to Excel/CSV | OCR complex pdfs, images.

Updated 3 days ago

extract-dataextract-data-from-imageextract-data-from-pdf+17

Invoice-OCR-and-Data-Extraction-System

khadibd

❤️40

A professional OCR automation web application built with Streamlit and EasyOCR, designed to extract structured invoice data from PDFs and images.

Python

Updated 1 month ago

OPTICAL-CHARACTER-RECOGNITION

satyamaditya

❤️20

Assignment By - MASTERS INDIA Project - Extract invoice number, invoice date, line items from invoice images. Project Details - MySelf Aditya, After my research toward this assignment, i found, this is the problem of OCR (OPTICAL CHARACTER RECOGNITION) Basically the work of OCR is to transform & extract the data from semi-structured(BILLs, INVOICES) or un-structured(CONTRACT, LEGAL DOCUMENTS) to structured format(CSV, EXCEL, XML, DATABASES). By this project i got idea about REAL-LIFE Problem, the problem of entire enterprise are data entry. Data entry is preety expensive & time consuming So, basically OCR creates an environment without manual data entry. 90% of stuffs can be configure by OCR & humans will only interrupt when accuracy is not good or some intervention is required. TOOLS of OCR are available in the market like- ABBYY, ROSSUM, AUTOMATION ANYWHERE, XTRACTA for the sake of Assignment, I did this Project with the help of CV2 (Computer Vision) and PYTESSERACT (python library for OCR). ''' ''' the main problem with this project is that we dont have any similar kind of format or structure, [things can be done- REGEX or ROI] initially i was looking for regex to solve this problem, but again due to variation in every format, i can just fetch the invoice date automatically by regex. if i show you the images, we can fetch the each given date by REGEX then store it in list, we know that invoice date will be store firstly & be the index 0. And hence we can easily fetch 0th index element every time & append it in CSV. But the problem arrives in remains 2 that we don't have any common starting point or end point hence we can't apply regex here......

Python

Updated 1 year ago

Invoice-OCR-Extractor

khadibd

🧡50

A Python-based invoice OCR extractor that automatically reads invoices from a folder (PDFs or images), extracts key fields and line items, and exports the results to a formatted Excel report.

NOASSERTION

Python

Updated 2 months ago

Invoice_Data_Extractor_using_OCR

JAYASIMMA

❤️35

No description available

Updated 2 months ago

OCR

alexngun

❤️40

An OCR - Optical Character Recognition that can extract financial data from images of invoices.

Apache-2.0

Python

Updated 10 months ago

invoice-pdfinvoice-recognitionocr+1

DeepVision-Tesseract-OCR-InvoiceScanner

carlosrod723

❤️40

A Python analysis using YOLO V4 for object detection and Tesseract OCR for text recognition. This custom OCR system automates invoice scanning by detecting and extracting key fields like Invoice number, Billing Date, and Total amount, streamlining the process of digital invoice reconciliation.

MIT

Jupyter Notebook

Updated 4 months ago

Invoice-Extraction-OCR-Challenge-RPA

MaxineXiong

❤️40

This repository houses a UiPath automation solution tailored for Invoice Extraction in RPA workflows. Tasked with reading each table row and extracting invoice details from each invoice photo through OCR, the solution outputs a CSV file with the extracted data alongside the table's ID and Due Date, specifically for two invoice suppliers.

MIT

Updated 10 months ago

invoiceinvoice-extractionocr+7

anj-dual-ocr-parser

atorhub

❤️35

ANJ Dual OCR Parser — AI-powered invoice/bill extractor featuring dual-pass OCR (quick + enhanced), smart parsing, automatic field detection, and multi-format export (JSON, CSV, XLSX, PDF, ZIP). 100% client-side, secure, lightweight, and deployable on GitHub Pages or Netlify.

JavaScript

Updated 3 months ago

pdf_analyzer

IbrahimShadi

❤️35

Rules-driven PDF analyzer & auto-renamer. Classifies (invoice / flight ticket / passport / other) with per-class probabilities, extracts key fields, optional OCR for scans, and renames files based on metadata.

Python

Updated 3 months ago

educational-toolflightticketinvoice-parser+4

AIDL

RittickSR

❤️35

An neural network and OCR based approach to extract data from invoices

Jupyter Notebook

Updated 5 years ago

zycus-assignment-textextraction-invoice

suhasdatascientist

❤️35

Given a set of invoice pdfs and OCR output, you need to extract important business fields and invoice line items from the OCR output.

Python

Updated 5 years ago

invoice-gemini-extracter

AmmarAhm3d

❤️45

Invoice-Gemini-Extracter: Python tool to extract structured invoice data (fields and line items) from PDFs/images using OCR, preprocessing, and Google Gemini-powered extraction/normalization.

Python

Updated 1 month ago

automationcomputer-visiondocument-extraction+11

invoice-ai-extractor

emredeveloper

❤️45

Invoice AI Extractor is an AI-powered tool that extracts structured data from invoices automatically. It combines OCR and modern language models to transform unstructured invoice documents into clean, machine-readable formats such as JSON and CSV.

Python

Updated 2 months ago

aiinvoicejson+1

Invoice_OCR_Detection

Remon128

❤️35

This is a Notebook which detects Invoice Images Data and Extracting records like Invoice Date and Items Description based on tesseract OCR Engine.

Jupyter Notebook

Updated 3 years ago

bajaj-ocr-api

Ramakm

❤️40

A robust API to extract line items and totals from different invoices using Tesseract OCR and FastAPI.

MIT

Python

Updated 3 months ago

apifastapifinance+4

Document-Processing-Agent-Project

Govindcoderr

❤️40

Intelligent Document Processing Agent – Extracts invoices using OCR + LLM, validates data, and syncs with ERP (zoho) automatically.

MIT

Python

Updated 5 months ago

Yavar-Hackathon-2025-Mohamed-Aashir-S

mdaashir

❤️35

A robust system for extracting and validating data from invoice PDFs using advanced OCR and computer vision techniques.

Python

Updated 10 months ago

GitHub Explorer

Search Results

OCR-Invoice

german-ocr

laravel-ocr

Table_Recognition_Project

invoice-scanner

Receipt-Scanner-in-OpenCV

Invoice-Data-Extraction-System

Receipt-Scanner-for-Android

OCR-Net-MAUI

Extracting-information-from-PDF-files-using-OCR-and-NLP

TesserXtract.AI

TensorFlow-OCR-Invoice-Extractor

ParseExtract

Invoice-OCR-and-Data-Extraction-System

OPTICAL-CHARACTER-RECOGNITION

Invoice-OCR-Extractor

Invoice_Data_Extractor_using_OCR

OCR

DeepVision-Tesseract-OCR-InvoiceScanner

Invoice-Extraction-OCR-Challenge-RPA

anj-dual-ocr-parser

pdf_analyzer

AIDL

zycus-assignment-textextraction-invoice

invoice-gemini-extracter

invoice-ai-extractor

Invoice_OCR_Detection

bajaj-ocr-api

Document-Processing-Agent-Project

Yavar-Hackathon-2025-Mohamed-Aashir-S

OCR-Invoice

german-ocr

laravel-ocr

Table_Recognition_Project

invoice-scanner

Receipt-Scanner-in-OpenCV

Invoice-Data-Extraction-System

Receipt-Scanner-for-Android

OCR-Net-MAUI

Extracting-information-from-PDF-files-using-OCR-and-NLP

TesserXtract.AI

TensorFlow-OCR-Invoice-Extractor

ParseExtract

Invoice-OCR-and-Data-Extraction-System

OPTICAL-CHARACTER-RECOGNITION

Invoice-OCR-Extractor

Invoice_Data_Extractor_using_OCR

OCR

DeepVision-Tesseract-OCR-InvoiceScanner

Invoice-Extraction-OCR-Challenge-RPA

anj-dual-ocr-parser

pdf_analyzer

AIDL

zycus-assignment-textextraction-invoice

invoice-gemini-extracter

invoice-ai-extractor

Invoice_OCR_Detection

bajaj-ocr-api

Document-Processing-Agent-Project

Yavar-Hackathon-2025-Mohamed-Aashir-S