GitHub Explorer

by Alexey Ratnikov

vision-parse

iamarunbrahma•PUBLIC

Parse PDFs into markdown using Vision LLMs

document-parser, pdf-parser, pdf-to-markdown, text-extraction

MIT License

Created on Dec 16, 2024

Updated on Apr 5, 2026

Stars

465

Forks

Watchers

465

Open Issues

Repository Health Score

💛

71/100

Good

Overall repository health assessment

Score Breakdown

Activity

Active development - updated this week

30/30

100%

Issues Analytics

Total Issues

All time

Open

0% of total

Closed

Recent Commits

Merge pull request #48 from iamarunbrahma/refactor_v13

Arun Brahma•6 months ago

d286105

Refactor whitespace in parser.py for improved readability; streamline image processing code and async calls.

Arun Brahma•6 months ago

4f70377

Update .gitignore to include .venv and uv.lock; remove uv.lock file as it is no longer needed.

Arun Brahma•6 months ago

3a5cee3

Add Pillow dependency for image processing; refactor whitespace in various files for consistency and readability.

Arun Brahma•6 months ago

af8a759

Refactor LLM class to improve Gemini client integration; streamline template accessors and update test cases for markdown generation.

Arun Brahma•6 months ago

d2cfbe9

Update uv.lock

Arun Brahma•7 months ago

2205eaf

Add support for dynamic vision model discovery and refactor constants; introduce ModelDetector class for improved model capability detection.

Arun Brahma•7 months ago

950c232

Update dependencies and documentation for Gemini model; remove DeepSeek references and examples.

Arun Brahma•7 months ago

81dc0c2

Refactor LLM class to streamline Ollama client initialization and enhance concurrency support.

Arun Brahma•7 months ago

c3580db

Replace PyMuPDF with pypdfium2 for PDF processing and update related code in vision_parse module.

Arun Brahma•7 months ago

8b733f3

Merge pull request #32 from iamarunbrahma/add_streamlit_app

Arun Brahma•1 year ago

b862daf

Remove unnecessary blank line in Streamlit app

Arun Brahma•1 year ago

4a690bf

Add Streamlit demo application for Vision Parse PDF extraction

Arun Brahma•1 year ago

6d93f52

Merge pull request #31 from iamarunbrahma/updated_markdown_prompt

Arun Brahma•1 year ago

512bee0

Refine markdown prompt template text extraction instructions

Arun Brahma•1 year ago

d3fa683
