Search Results

Found 59 repositories (showing 30)

llm_aided_ocr

Dicklesworthstone

💛75

Enhances Tesseract OCR output using LLMs (local or API) for error correction, smart chunking, and markdown formatting of scanned PDFs

2.9k

204

NOASSERTION

Python

Updated 2 days ago

ai-assist, llama2, llm, +3

local-llm-pdf-ocr

ahnafnafee

💛70

Convert scanned PDFs into searchable text locally using Vision LLMs (olmOCR). 100% private, offline, and free. Features a modern Web UI & CLI.

MIT

Python

Updated 3 days ago

document-processing, fastapi, local-llm, +11

DocInferX

DocInferX is a fully local, privacy-focused document intelligence system. It ingests PDFs and images, performs OCR, cleans text, chunks content, embeds it into a vector database, and lets you chat with your documents offline using a lightweight LLM (Phi-2).

MIT

Python

Updated 4 months ago

DOCUMIND-AI

Edge-Explorer

💛70

Documind is an intelligent desktop assistant that allows users to upload documents (PDF, DOCX, TXT) and ask natural language questions about their contents. It leverages local LLMs via Ollama, integrates advanced OCR for scanned files, and uses semantic indexing with LangChain + FAISS to deliver fast, context-aware answers.

MIT

Python

Updated 5 days ago

docker, faiss, flask, +9

pdf-processor

vlow

🧡60

A Python tool to automatically process, categorize, rename, and organize scanned PDF documents using OCR and a local LLM.

MIT

Python

Updated 2 weeks ago

AI_Research_Paper_Classification_Agent

RohanV01

❤️35

This repository features a Python script that automates the organization of PDF documents by leveraging a locally hosted Language Model (LLM). The script efficiently extracts text from PDFs using PyMuPDF, with an OCR fallback for more complex files, and classifies them based on their relevance to biological life sciences. By utilizing a local LLM,

Python

Updated 3 months ago

rag-docq-ollama

caglarldemir

❤️40

A performant Retrieval-Augmented Generation (RAG) pipeline using Ollama to run local LLMs like gpt-oss:20b, llama3:8b, qwen3:4b, and gemma3:4b. Supports PDFs, JPGs, PNGs via OCR, uses Sentence-Transformers for embeddings and ChromaDB for vector storage. Includes benchmarking tools for speed, memory, and answer quality across models.

MIT

Jupyter Notebook

Updated 4 months ago

local_pdf_ocr_vision_llm

chilang

❤️45

Streamlit PDF OCR app using Qwen3-VL vision models running locally on Apple Silicon via MLX

Python

Updated 1 month ago

docai

novatechflow

❤️40

Local-first OCR → Markdown → RAG toolkit with optional Hugging Face/custom endpoints. Tk UI for viewing, OCR-ing, and chatting with PDFs; Docker/pyproject ready; pluggable OCR/embeddings/LLM.

NOASSERTION

Python

Updated 4 months ago

huggingface, markdown, ocr, +3

AI_Invoice_Analyzer

Methila-Meem

🧡55

Automatically extracts, validates, and structures invoice data from images and PDFs using OCR + a local LLM—eliminating manual data entry.

Jupyter Notebook

Updated 3 weeks ago

applied-ai, document-ai, llm-systems, +2

paperRead

Skywarder0409

🧡55

An AI-powered local research paper analyzer. PDF to structured Markdown via Ollama (OCR + LLM) with a built-in Web UI.

Python

Updated 1 week ago

Multi-Modal-AI-Assistant-Jarvis-style-

subikshan2006

❤️35

Built a fully offline AI Assistant combining voice commands, local LLMs (LLaMA), vision (OCR, image captioning), and system control. Enabled natural voice Q&A from documents/screenshots, app launcher, and PDF search without API usage. Stack: Python, LangChain, LLaMA.cpp, OCR, Whisper, TTS, FAISS

Python

Updated 9 months ago

kazakh-history-qa-bot

naziraSuleimenova

❤️45

End-to-end pipeline for building a Kazakh history Q&A bot that extracts text from PDFs using OCR, generates training data with GPT-4-mini, and fine-tunes a local LLM (Qwen 2.5) with LoRA on Mac M4

Jupyter Notebook

Updated 1 month ago

Offline-MultiModal-RAG-System

AdityaWagh19

❤️40

Offline Multimodal RAG System: Ingest, index, and query DOC, PDF, images, and audio using a local LLM. Provides semantic search and context-aware responses entirely offline, supporting OCR, speech-to-text, and vector-based retrieval for versatile, private data access.

MIT

Python

Updated 5 months ago

plantdeck_rag

EzioDEVio

❤️35

PlantDeck is an offline herbal RAG that indexes your PDF books and monographs, extracts text/images with OCR, and answers questions with page-level citations using a local LLM via Ollama. Runs on your machine; no cloud. Field guide only; not medical advice.

Python

Updated 7 months ago

computer-vision, docker, docker-compose, +14

chat-with-files

manasbhansali27

❤️35

A lightweight local AI assistant that lets you chat with your files — PDFs, documents, images, videos, and code — using semantic search, embeddings, OCR, and multimodal LLMs. Optimized to run on modest GPUs (e.g., RTX 3050 4GB) without requiring heavy VRAM like ChatRTX.

Python

Updated 5 months ago

chatbot, document-processing, embeddings, +6

Vedavault-AI-Powered-Offline-AI-Tutor

anmol-pandey-2007

❤️35

Vedavault is an offline AI tutor that learns from PDFs using OCR and RAG, answers questions in multiple languages, supports Whisper-based speech-to-text and TTS voice responses, and generates topic-wise quizzes. Runs fully on local LLMs via Ollama with complete privacy and no internet needed.

JavaScript

Updated 3 months ago

RenAIme

tomgoeck

❤️25

This tool automatically detects new scanned PDFs (or images) in a specified folder, extracts text using OCR (Optical Character Recognition), and intelligently renames each file based on its content. A local LLM (Large Language Model) then analyzes the extracted text to create meaningful filenames (including dates) and moves to folders

MIT

Shell

Updated 10 months ago

local-llm-pdf-ocr

juanso123

🧡60

No description available

MIT

Python

Updated 2 hours ago

docker-compose, embeddings, feature-extraction, +12

llm-ocr

ijaureguialzo

🧡60

PDF-to-Markdown OCR with a local LLM.

Apache-2.0

Python

Updated 3 weeks ago

2026

offline-invoice-pdf-to-json

coolgigi

❤️35

Fully offline extraction of structured JSON from invoice PDFs using OCR and a local LLM

Python

Updated 3 months ago

IDP

biztalk72

🧡65

Intelligent Document Processing — Local LLM-powered PDF ingestion, OCR, RAG Q&A, and multimodal chatbot

Python

Updated 5 days ago

aurora-os

CheekyCodexConjurer

❤️35

Aurora OS · Local PDF-to-audiobook engine with OCR, LLM text cleanup and Coqui TTS.

Python

Updated 3 months ago

ai-extract

adraeger

🧡65

Extract payment data from PDF invoices using a local LLM (Ollama) with automatic scan detection and macOS Vision OCR

Python

Updated 1 day ago

Math-Question-Extractor

sarveshgoswami1104

❤️45

Local LLM–powered Streamlit app to extract algebra and calculus math questions from text, PDFs, and scanned documents using OCR.

Python

Updated 2 months ago

pdf-rag-system

NectarScript

❤️45

AI-powered PDF question-answering system using RAG, FAISS, OCR, and flan-t5-base. Upload PDFs and ask questions to get detailed answers from a local LLM.

TypeScript

Updated 1 month ago

image-analyzer

alexandradew

❤️35

RESTful Spring Boot service that extracts and analyzes text from uploaded images or PDF documents using OCR and a local LLM

Java

Updated 9 months ago

ai-rename

adraeger

🧡65

Automatically rename PDF files based on their content using a local LLM (Ollama) with automatic scan detection and macOS Vision OCR

Python

Updated 1 day ago

aibank-statement-analyzer

AbhishekJha3511

🧡50

A powerful, local-first financial dashboard that uses AI (LLMs) and OCR to turn messy PDF bank statements into beautiful, interactive insights.

Unlicense

JavaScript

Updated 2 months ago

invoice-parser-w-qwen-public

cankayikci0

❤️45

A lightweight pipeline to extract structured invoice fields from PDFs and images, using RapidOCR for robust OCR and Ollama to run a local Qwen Instruct LLM.

Python

Updated 1 month ago
