Found 499 repositories(showing 30)
lukas-blecher
pix2tex: Using a ViT to convert images of equations into LaTeX code.
zai-org
GLM-OCR: Accurate × Fast × Comprehensive
zai-org
GLM-4.6V/4.5V/4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning
OleehyO
TexTeller can convert image to latex formulas (image2latex, latex OCR) with higher accuracy and exhibits superior generalization ability, enabling it to cover most usage scenarios.
prabhakar267
:clipboard: Python wrapper to grab text from images and save as text files using Tesseract Engine
shuoGG1239
截取图片并识别出图片的文字
SakuraMathcraft
A Windows math workspace for screenshot OCR, handwriting-to-LaTeX, editing, preview, and symbolic computation, powered by pix2text and MathLive.
wangleihitcs
读过的CV方向的一些论文,图像生成文字、弱监督分割等
zhongpei
image2text or chinese text prompt generator
KleinYuan
A deep learning project to tell a story with an image or a video.
Hangover3832
Various nodes for ComfyUI
ekiim
Vim commands to use mathpix from your screen
bestcondition
It's not an OCR project! It use 255 unicode characters like ⣻⣼⣽⣾⣿ show image, even play video. 这不是一个OCR项目,本项目可以用255个像 ⣻⣼⣽⣾⣿ 这样的unicode字符表示图片,甚至还能播放视频。
LeafYeeXYZ
A free WebApp for Text2Image and Image2Text, supporting Multiple Models and both Chinese&English Prompt / 一个零成本的 AI 绘画 WebApp, 支持文生图, 图生文, 多模型, 中英双语提示词
yuanxiaosc
CNN-Encoder and RNN-Decoder (Bahdanau Attention) for image caption or image to text on MS-COCO dataset. 图片描述
etosworld
Deep Extreme Cut http://www.vision.ee.ethz.ch/~cvlsegmentation/dextr . a tool to do automatically object segmentation from extreme points.
ponpaku
GLM-OCRを使ったローカルOCRサーバー(FastAPI + Web UI / 画像・PDF対応)
JulioPeixoto
Minimal local-first multimodal RAG library powered by SQLite + sqlite-vec.
TheLime1
A collection of scripts to "help" you with your programming exams and assignments.
enrico310786
Experiments with LAVIS library to perform image2text and text2image retrieval with BLIP and BLIP2 models
MurageKabui
A AutoIT 3 wrapper library around the OCRSpace API.
GINK03
No description available
ivanp7
An image-to-text converter, written in Common Lisp.
amrrs
demo of 🤗 spaces deployment of a streamlit python app
liuwons
Python tool that converts images to plain text
zhongpei
No description available
henryli2002
BUPT神经网络与深度学习课设
thefcraft
Civitai Stable Diffusion 337k Dataset; dataset of ai generated image
sorelyss
Implementation of Google's im2txt model for tensorflow (Updated for Python 3.5.2 and TensorFlow 1.0.1). Bazel is not necessary.
DavidYang347
Creat texture for 3d models use maya and stable diffusion webui