Search Results

Found 238 repositories(showing 30)

docext

NanoNets

💛73

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

2.0k

140

Apache-2.0

Python

Updated 3 hours ago

documentdocument-analysisdocument-data-extraction+17

OpenOCR: An Open-Source Toolkit for General-OCR Research and Applications, integrates a unified training and evaluation benchmark, commercial-grade OCR and Document Parsing systems, and faithful reproductions of the core implementations from a wide range of academic papers.

1.3k

119

Apache-2.0

Python

Updated 24 minutes ago

chineseocrdocument-analysisdocument-parsing+5

benchmark

getomni-ai

💛71

OCR Benchmark

631

MIT

TypeScript

Updated 1 day ago

Awesome-Generative-Models-for-OCR

NiceRingNode

💛70

[arXiv 25] OCRGenBench: A Comprehensive Benchmark for Evaluating OCR Generative Capabilities

261

Apache-2.0

Python

Updated 1 day ago

OCR-Reasoning

SCUT-DLVCLab

🧡55

[ICLR 2026] OCR-Reasoning Benchmark: Unveiling the True Capabilities of MLLMs in Complex Text-Rich Image Reasoning

Apache-2.0

Python

Updated 3 weeks ago

KITAB-Bench

mbzuai-oryx

🧡65

[ACL 2025 🔥] A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

MIT

Python

Updated 4 days ago

arabicbenchmarklayout-detection+5

ocr-benchmark

video-db

🧡50

Benchmarking Vision-Language Models on OCR tasks in Dynamic Video Environments

MIT

Python

Updated 2 months ago

arxivbenchmarkeasyocr+6

khmer-ocr-benchmark-dataset

EKYCSolutions

🧡50

A standardized benchmark dataset for Khmer Optical Character Recognition (OCR) engine.

MIT

Python

Updated 3 weeks ago

datasetkhmerocr

benchmarking-ocr-gepa

Studio-Intrinsic

❤️40

No description available

Apache-2.0

Python

Updated 1 week ago

docext

datalab-to

🧡60

An on-premises, OCR-free unstructured data extraction, markdown conversion and benchmarking toolkit. (https://idp-leaderboard.org/)

Apache-2.0

Updated 2 weeks ago

pe-ocr-sanskrit

ayushbits

❤️35

Source and Data of our EMNLP Paper 'A Benchmark and Dataset for Post-OCR text correction in Sanskrit'

Python

Updated 11 months ago

ocr_benchmark

andyhuo520

🧡65

OCR Benchmark: GLM-OCR vs PaddleOCR-VL-1.5 on OmniDocBench — comprehensive evaluation across text, tables, formulas, and handwriting

Python

Updated 4 days ago

paper-ner-bench-das22

soduco

❤️40

All the material (paper, code, dataset, results) of our DAS 2022 paper (OCR+NER benchmark)

Jupyter Notebook

Updated 1 month ago

benchmarkdatasetdocument-analysis+2

Korean-OCR-based-on-Clova-AI-Deep-Text-Recognition-using-Text-in-the-Wild-Image-Data

HJK02130

🧡55

In this project, we implemented an Korean optical character regocnition(OCR) algorithm that can detect and recognize Korean text in images such as signboards, book covers and etc. using the pre-trained model of deep text recognition benchmark provided by Clova AI. We used the data of the DACON SW-central university joint AI contest and AI Hub.

Jupyter Notebook

Updated 1 week ago

noisy-ocr-benchmark

Hegghammer

❤️35

Replication materials for the article "OCR with Tesseract, Amazon Textract, and Google Document AI: A Benchmarking Experiment"

TeX

Updated 8 months ago

bio_ocr_minibenchmark

hgbrian

❤️35

a small OCR benchmark for biological sequences

MIT

Python

Updated 2 months ago

KORIE

MahmoudSalah

❤️45

KORIE: A Multi-Task Benchmark for Detection, OCR, and IE on Korean Retail Receipts

Updated 1 month ago

ocr_benchmarking

errajibadr

❤️25

No description available

Python

Updated 3 months ago

ocrsynth

axeld5

❤️35

From Text Dataset to OCR Dataset / Benchmark

Python

Updated 1 year ago

OCR-Benchmark

A9T9

❤️45

OCR benchmark test images (English, Chinese, Number detection)

Updated 1 month ago

Surya-OCR-Hardware-Benchmarking

Jl16ExA

❤️40

Surya-OCR-Hardware-Benchmarking is a repository dedicated to evaluating and analyzing the performance of the Surya OCR model across different hardware configurations. It provides tools and scripts for benchmarking GPU and CPU performance with various batch sizes, aiming to optimize OCR tasks for efficiency and speed in real-world applications.

Apache-2.0

Jupyter Notebook

Updated 1 year ago

Kannada-OCR-test-images-with-ground-truth

MILE-IISc

❤️35

This Kannada OCR benchmarking dataset contains 250 images, carefully chosen to have various kinds of recognition challenges. Some of the pages have italics and bold characters. Some of them have Halegannada poems and text; others are letterpress-printed pages, where the vowel modifiers appear as separate symbols and do not touch the consonants they go with. Some pages have interspersed English words; still others have tables with a lot of numeric data. In addition, there are old pages containing either a lot of broken characters or many words with two or more characters merged into a single connected component.

Shell

Updated 4 months ago

ocr-benchmark

johnidm

❤️20

This repository includes the results of my latest OCR study

Jupyter Notebook

Updated 1 year ago

MaViLS

andererka

🧡55

Code repository for the paper: 'MaViLS, a Benchmark Dataset for Video-to-Slide Alignment, Assessing Baseline Accuracy with a Multimodal Alignment Algorithm Leveraging Speech, OCR, and Visual Features'

Apache-2.0

Jupyter Notebook

Updated 3 weeks ago

fast360

shijincai

❤️35

The industry's first "Open Source OCR Arena," a free, no-login utility for one-click benchmarking of 7 top-tier models (Marker, MinerU, MonkeyOCR, Docling, Dolphin, OCRFlux, PP-StructureV3) on your PDF/image files, specializing in PDF-to-Markdown conversion.

Updated 6 months ago

benchmarkcomputer-visiondata-extraction+16

Recognition

Aryia-Behroziuan

❤️20

The classical problem in computer vision, image processing, and machine vision is that of determining whether or not the image data contains some specific object, feature, or activity. Different varieties of the recognition problem are described in the literature:[citation needed] Object recognition (also called object classification) – one or several pre-specified or learned objects or object classes can be recognized, usually together with their 2D positions in the image or 3D poses in the scene. Blippar, Google Goggles and LikeThat provide stand-alone programs that illustrate this functionality. Identification – an individual instance of an object is recognized. Examples include identification of a specific person's face or fingerprint, identification of handwritten digits, or identification of a specific vehicle. Detection – the image data are scanned for a specific condition. Examples include detection of possible abnormal cells or tissues in medical images or detection of a vehicle in an automatic road toll system. Detection based on relatively simple and fast computations is sometimes used for finding smaller regions of interesting image data which can be further analyzed by more computationally demanding techniques to produce a correct interpretation. Currently, the best algorithms for such tasks are based on convolutional neural networks. An illustration of their capabilities is given by the ImageNet Large Scale Visual Recognition Challenge; this is a benchmark in object classification and detection, with millions of images and 1000 object classes used in the competition.[29] Performance of convolutional neural networks on the ImageNet tests is now close to that of humans.[29] The best algorithms still struggle with objects that are small or thin, such as a small ant on a stem of a flower or a person holding a quill in their hand. They also have trouble with images that have been distorted with filters (an increasingly common phenomenon with modern digital cameras). By contrast, those kinds of images rarely trouble humans. Humans, however, tend to have trouble with other issues. For example, they are not good at classifying objects into fine-grained classes, such as the particular breed of dog or species of bird, whereas convolutional neural networks handle this with ease[citation needed]. Several specialized tasks based on recognition exist, such as: Content-based image retrieval – finding all images in a larger set of images which have a specific content. The content can be specified in different ways, for example in terms of similarity relative a target image (give me all images similar to image X), or in terms of high-level search criteria given as text input (give me all images which contain many houses, are taken during winter, and have no cars in them). Computer vision for people counter purposes in public places, malls, shopping centres Pose estimation – estimating the position or orientation of a specific object relative to the camera. An example application for this technique would be assisting a robot arm in retrieving objects from a conveyor belt in an assembly line situation or picking parts from a bin. Optical character recognition (OCR) – identifying characters in images of printed or handwritten text, usually with a view to encoding the text in a format more amenable to editing or indexing (e.g. ASCII). 2D code reading – reading of 2D codes such as data matrix and QR codes. Facial recognition Shape Recognition Technology (SRT) in people counter systems differentiating human beings (head and shoulder patterns) from objects

Updated 1 year ago

artificial-intelligencearya-behroozianaryia-behroziuan+1

ocr_correction_benchmark

FastAccounting

❤️35

OCR correction benchmark

Apache-2.0

Updated 1 year ago

quiver-benchmarks

OCR-D

❤️25

Benchmarking OCR-D workflows in Docker

MIT

HTML

Updated 2 years ago

MOTBench

gitwzl

❤️20

Menu OCR and translation evaluation benchmark for VLLMs

Python

Updated 9 months ago

app-benchmark

taggun

🧡60

Desktop app to perform tests and benchmark for receipt OCR

MIT

JavaScript

Updated 1 week ago

GitHub Explorer

Search Results

docext

OpenOCR

benchmark

Awesome-Generative-Models-for-OCR

OCR-Reasoning

KITAB-Bench

ocr-benchmark

khmer-ocr-benchmark-dataset

benchmarking-ocr-gepa

docext

pe-ocr-sanskrit

ocr_benchmark

paper-ner-bench-das22

Korean-OCR-based-on-Clova-AI-Deep-Text-Recognition-using-Text-in-the-Wild-Image-Data

noisy-ocr-benchmark

bio_ocr_minibenchmark

KORIE

ocr_benchmarking

ocrsynth

OCR-Benchmark

Surya-OCR-Hardware-Benchmarking

Kannada-OCR-test-images-with-ground-truth

ocr-benchmark

MaViLS

fast360

Recognition

ocr_correction_benchmark

quiver-benchmarks

MOTBench

app-benchmark

docext

OpenOCR

benchmark

Awesome-Generative-Models-for-OCR

OCR-Reasoning

KITAB-Bench

ocr-benchmark

khmer-ocr-benchmark-dataset

benchmarking-ocr-gepa

docext

pe-ocr-sanskrit

ocr_benchmark

paper-ner-bench-das22

Korean-OCR-based-on-Clova-AI-Deep-Text-Recognition-using-Text-in-the-Wild-Image-Data

noisy-ocr-benchmark

bio_ocr_minibenchmark

KORIE

ocr_benchmarking

ocrsynth

OCR-Benchmark

Surya-OCR-Hardware-Benchmarking

Kannada-OCR-test-images-with-ground-truth

ocr-benchmark

MaViLS

fast360

Recognition

ocr_correction_benchmark

quiver-benchmarks

MOTBench

app-benchmark