Found 327 repositories(showing 30)
modesty
converts binary PDF to JSON and text, for server-side PDF processing and command-line use. Zero dependency.
mehmet-kozan
Pure TypeScript, cross-platform module for extracting text, images, and tabular data from PDFs. Run 🤗 directly in your browser or in Node.js
racosa
Simple tool for converting PDF to text using OCR
shahrukhx01
A python library for extracting text from PDFs without losing the formatting of the PDF content.
seinecle
The code base of the front-end of nocodefunctions.com
syllabs
A PDFMiner wrapper to ease the text extraction from pdf files.
yakovypg
We present Ypdf, a PDF document processing application that combines the best features of existing solutions and provides the most popular and requested functionality for free to its users.
ZA3karia
PDF2TEXT & EBOOK2TEXT
zhuweijun1003-source
一个强大的Python Web应用程序,可以从PDF中提取内容,使用DeepSeek AI进行优化,并导出为多种格式。
TheLime1
A collection of scripts to "help" you with your programming exams and assignments.
KaniyamFoundation
Project to convert PDF files to Text files using google OCR
pdfliberation
experimenting with pdf2text and python pdf-table-extract
chiraag-kakar
Simple and Useful Automation Tools built with the help of modules available with Python published at PyPI.
cpierce
PDF to Text Library
worldbank
Natural language processing tools developed by the World Bank's DECAT unit. A suite of text preprocessing and cleaning algorithms for NLP analysis and modeling.
andrealenzi11
Python library and Web service based on Poppler Pdftotext utility and Tesseract OCR for extracting text from PDF documents
Minion-Lover
No description available
StephanyBatista
A API in .Net Core to extract documents OCR with many libs linux
Pdf2TextLibrary is a Robot Framework library for read the pdf file as text data.
saubhagya
No description available
AzozzALFiras
A simple, free tool for extracting text from scanned PDFs and images using OCR, and converting images to PDFs. It processes files locally in the browser, ensuring privacy and security while enabling users to effortlessly convert documents and images into editable text or PDF format.
robgraeber
Extract an array of pages/text from a pdf.
0LL13
A PDF-to-text converter based on pdfminer2
juu7g
Python app to extract text from pdf
Minion-Lover
No description available
TanishqChamoli
Newspaper mining and the analysis of the results using python. Cleaning the text using OCR.
blackforestboi
Extract all text from PDF, works for extensions, pure Javascript
ViswanathaReddyGajjala
This repo contains the code to extract text from pdf/picture/scanned document.
sawyer-shi
Local File Converter: Supports PDF2Image, Image2PDF, Word2PDF, PDF2Word, PPT2PDF, Excel2PDF, CSV2Excel, Excel2CSV, CSV2PDF, Word2Text, Text2Word, Text2PDF, PDF2Text.
imesut
PdfReg is a web tool, which gets text at selected regions of pdf document.