Search Results

Found 85 repositories(showing 30)

dots.ocr

rednote-hilab

💛86

Multilingual Document Layout Parsing in a Single Vision-Language Model

8.2k

733

MIT

Python

Updated 1 hour ago

dots.ocr.ne

FL33TW00D

❤️35

No description available

Python

Updated 2 months ago

dots.ocr-finetune

akam-ot

🧡65

Training and Fine-tuning code for DotsOCR

Jupyter Notebook

Updated 5 days ago

dots-ocr-client

sljeff

❤️30

Python client for dots.ocr with vLLM and Replicate backend support. Minimal deps, no file I/O.

MIT

Python

Updated 3 months ago

dots-ocr-editor

wjbmattingly

❤️25

No description available

HTML

Updated 2 months ago

Our project is based on one of the most important application of machine learning i.e. pattern recognition. Optical character recognition or optical character reader is the electronic or mechanical conversion of images of typed, handwritten or printed text into machine-encoded text, whether from a scanned document, a photo of a document, a scene-photo or from subtitle text superimposed on an image. We are working on developing an OCR for URDU. We studied a couple of research papers related to our project. So far, we have found that Both Arabic and Urdu are written in Perso-Arabic script; at the written level, therefore, they share similarities. The styles of Arabic and Persian writing have a heavy influence on the Urdu script. There are 6 major styles for writing Arabic, Persian and Pashto as well. Urdu is written in Naskh writing style which is most famous of all. Optical character recognition (OCR) is the process of converting an image of text, such as a scanned paper document or electronic fax file, into computer-editable text [1]. The text in an image is not editable: the letters are made of tiny dots (pixels) that together form a picture of text. During OCR, the software analyzes an image and converts the pictures of the characters to editable text based on the patterns of the pixels in the image. After OCR, the converted text can be exported and used with a variety of word-processing, page layout and spreadsheet applications [2]. One of the main aims of OCR is to emulate the human ability to read at a much faster rate by associating symbolic identities with images of characters. Its potential applications include Screen Readers, Refreshable Braille Displays [3], reading customer filled forms, reading postal address off envelops, archiving and retrieving text etc. OCR’s ultimate goal is to develop a communication interface between the computer and its potential users. Urdu is the national language of Pakistan. It is a language that is understood by over 300 million people belonging to Pakistan, India and Bangladesh. Due to its historical database of literature, there is definitely a need to devise automatic systems for conversion of this literature into electronic form that may be accessible on the worldwide web. Although much work has been done in the field of OCR, Urdu and other languages using the Arabic script like Farsi, Urdu and Arabic, have received least attention. This is due in part to a lack of interest in the field and in part to the intricacies of the Arabic script. Owing to this state of indifference, there remains a huge amount of Urdu and Arabic literature unattended and rotting away on some old shelves. The proposed research aims to develop workable solutions to many of the problems faced in realization of an OCR designed specifically for Urdu Noori Nastaleeq Script, which is widely used in Urdu newspapers, governmental documents and books. The underlying processes first isolate and classify ligatures based on certain carefully chosen special, contour and statistical features and eventually recognize them with the aid of Feed-Forward Back Propagation Neural Networks. The input to the system is a monochrome bitmap image file of Urdu text written in Noori Nastaleeq and the output is the equivalent text converted to an editable text file.

Jupyter Notebook

Updated 11 months ago

dots_ocr_suite

tmzncty

🧡60

这是一个基于 DotsOCR 库开发的 OCR（光学字符识别）处理工具箱，包含 PDF 转 Word (DOCX) 的完整应用。本项目旨在提供简单易用的工具，帮助用户将 PDF 文档或图片转换为可编辑的 Word 文档或 Markdown 格式，支持复杂的版面分析（如表格、公式、图片等）。

Python

Updated 1 day ago

Super-OCRs-Demo

PRITHIVSAKTHIUR

❤️40

A Gradio-based demo application for comparing state-of-the-art OCR models: DeepSeek-OCR, Dots.OCR, HunyuanOCR, and Nanonets-OCR2-3B.

Apache-2.0

Python

Updated 3 months ago

acceleratedeepseek-ocrdots-ocr+16

ocr-mcp

sandraschi

🧡60

FastMCP server providing advanced OCR capabilities with current state-of-the-art models (DeepSeek-OCR, Florence-2, DOTS.OCR, PP-OCRv5, Qwen-Image-Layered decomposition), WIA scanner control, and multi-format document processing for PDFs, CBZ comics, and images.

MIT

Python

Updated 5 days ago

agentic-workflowfastmcpmcp+2

DOTs_OCR

thaoluon

❤️35

No description available

MIT

Python

Updated 1 month ago

ComfyUI-Easy-DotsOCR

yolain

❤️45

A custom node for ComfyUI that provides text extraction via the DotsOCR engine.

Python

Updated 2 months ago

dots.ocr-cpu

Amir-Alipour

❤️45

Run the dots.ocr AI model fully on CPU

Jupyter Notebook

Updated 2 months ago

dots-ocr-docker-ready-setup

ChaosAIs

❤️35

A demo application connected with local dots.ocr service

Dockerfile

Updated 6 months ago

app.dots.ocr.runner

jason-ni

❤️30

Documents and publications of the Dots.OCR.Runner App

Python

Updated 2 months ago

finetuned-dots.ocr

Thangtran27

🧡65

Finetuning dots.ocr with LoRa and QLoRa

Python

Updated 1 day ago

DotsOcr

LuSrackhall

❤️35

一个使用Pixi管理依赖的dots.ocr快速部署方案

Dockerfile

Updated 7 months ago

dots-ocr-modal

chriscarrollsmith

❤️35

Run the open-source dots.ocr OCR model as a Modal function

Python

Updated 5 months ago

dots.ocr-fix-demo

PRITHIVSAKTHIUR

❤️40

This Gradio application demonstrates the capabilities of the "dots.ocr" model, a powerful multilingual document parser.

Apache-2.0

Jupyter Notebook

Updated 3 months ago

document-parsinggradiohtml+9

dots_ocr

as49537023

❤️35

dots_ocr

MIT

Python

Updated 6 months ago

DotsOcr-datapipeline

SilviyaDangol

❤️35

simplified workflow for creating dataset for finetuning dotsocr

Python

Updated 4 months ago

nlptepe_dotsOCR

bariscelikW

❤️40

dotsOCR and LLM integration for exam grading

Apache-2.0

Jupyter Notebook

Updated 7 months ago

dots.ocr-docker

manzolo

❤️45

Dockerized OCR with vLLM - dots.ocr vision-language model for document text extraction

Shell

Updated 1 month ago

dots.ocr

knightofcookies

🧡60

No description available

MIT

Python

Updated 6 days ago

dots-ocr

neosun100

❤️15

No description available

MIT

Python

Updated 5 months ago

dots.ocr-docker

sljeff

❤️35

No description available

Shell

Updated 2 months ago

dots.ocr-api

CesarPetrescu

❤️10

No description available

Python

Updated 7 months ago

arabic-chrono-dots-ocr

HassanMSh

❤️35

An open-source tool to OCR Arabic scanned books and organize historical records into a searchable chronological database.

Python

Updated 6 months ago

doc-parser-system

EEJQX

❤️45

文档图像解析，集成mineru和dots.ocr，用于构建文档图像ocr数据集pipline

Python

Updated 2 months ago

E-MailAutomation-2

hbksilver

❤️40

Create a workflow that: Reads the 6th page of "Session 11 - exercise 2 - UiPathOrchestratorAzureInstallationGuide2016.1" (in the attachment) Reads the 2nd page of "Session 11 - exercise 2 - ScannedDoc" (in the attachment) Sends an email with both documents attached and with the text from point 1 and text from point 2 as Body. Practice 2 - Walkthrough The first PDF document we are trying to extract text from is a native pdf, (the text is selectable, the doc is created from a Word document probably, etc) so we will use the dedicated activity for this case, Read PDF Text. We click on the dots next to File Name property and browse to the location we have the Orchestrator Installation Guide document saved at. We replace the Range property value from “All” to 6, in order to read just the page 6 from the document. We create a variable in the Output, to save the retrieved text for later. The second pdf document we are trying to extract from is a scanned pdf (everything is a picture, you cannot select any element, etc) so we will use the dedicated activity for this case, Read PDF With OCR. @e click on the dots next to File Name property and browse to the location we have the “Scanned.pdf” document saved at. We replace the Range property value from “All” to 6, in order to read just the page 6 from the document. We create a variable in the Output, to save the retrieved text for later. We add a Send Outlook Mail Message (can be any Send XXX Mail Message activity)set the To property. Set the Subject property. Set the Body property to the concatenated variables that keep the text pieces read from the two documents installationPDFText + invoicePDFText. To open the workflow with UiPath Studio, follow these steps: Click on the Download button below and save the archive on your local machine Extract the files Open UiPath Studio From the Start tab, click Open Browse for the extracted files Click on the xaml fileFrom the Design tab, hit Run

BSD-2-Clause

Updated 2 years ago

PDF_OCR_DOTS

babara6666

❤️45

PDF ocr with dots.ocr

Updated 2 months ago

GitHub Explorer

Search Results

dots.ocr

dots.ocr.ne

dots.ocr-finetune

dots-ocr-client

dots-ocr-editor

Urdu-OCR

dots_ocr_suite

Super-OCRs-Demo

ocr-mcp

DOTs_OCR

ComfyUI-Easy-DotsOCR

dots.ocr-cpu

dots-ocr-docker-ready-setup

app.dots.ocr.runner

finetuned-dots.ocr

DotsOcr

dots-ocr-modal

dots.ocr-fix-demo

dots_ocr

DotsOcr-datapipeline

nlptepe_dotsOCR

dots.ocr-docker

dots.ocr

dots-ocr

dots.ocr-docker

dots.ocr-api

arabic-chrono-dots-ocr

doc-parser-system

E-MailAutomation-2

PDF_OCR_DOTS

dots.ocr

dots.ocr.ne

dots.ocr-finetune

dots-ocr-client

dots-ocr-editor

Urdu-OCR

dots_ocr_suite

Super-OCRs-Demo

ocr-mcp

DOTs_OCR

ComfyUI-Easy-DotsOCR

dots.ocr-cpu

dots-ocr-docker-ready-setup

app.dots.ocr.runner

finetuned-dots.ocr

DotsOcr

dots-ocr-modal

dots.ocr-fix-demo

dots_ocr

DotsOcr-datapipeline

nlptepe_dotsOCR

dots.ocr-docker

dots.ocr

dots-ocr

dots.ocr-docker

dots.ocr-api

arabic-chrono-dots-ocr

doc-parser-system

E-MailAutomation-2

PDF_OCR_DOTS