Found 1,404 repositories (showing 30)
killkimno
MORT translator project - Real-time game translator with OCR
ihatecsv
A real-time Electron-based desktop GUI for DeepSeek-OCR
thanhkeke97
🎮 Real-time Game Translation Tool | OCR + AI Translation | Windows Gaming | Open Source
neosun100
🎨 Ready-to-use DeepSeek-OCR Web UI | Modern Interface | 7 Recognition Modes | Batch Processing | Real-time Logging | Fully Responsive
arturaugusto
Real-time image preprocess and OCR.
aarongrider
VisionCamera Frame Processor Plugin to detect text in real time using MLKit Text Detector (OCR)
TeXPen
Handwritten LaTeX drawing website with real-time local OCR detection inference
nathanaday
Perform text detection in a variety of languages with your computer webcam using Google Tesseract OCR and OpenCV. This script achieves a real-time OCR effect via multi-threading.
michaelzhiluo
Convolutional Neural Network for Realtime Digit Recognition on Webcam
liabru
a real-time OCR, computer vision and machine learning experiment
pedrol2b
A powerful React Native Vision Camera plugin delivering high-performance Google ML Kit frame processor features—including text recognition (OCR), face detection, barcode scanning, pose detection, and more. Seamlessly bridges native ML Kit capabilities for real-time, on-device computer vision in your React Native apps.
tomkam1702
🎮 Real-time game subtitle translator with AI-powered OCR. Context-aware translation for 20+ languages. Free offline models + dirt cheap APIs. Perfect for gaming in foreign languages!
LaggyHammer
Real-time OCR with TensorFlow, OpenCV & Tesseract
Scoreboard OCR with a webcam and telephoto lens to read digits in real time from an in-venue scoreboard.
FarzadNekouee
An urban traffic violation detection system using classical image processing techniques. Features include real-time traffic light recognition, adaptive night-time stop line detection, robust license plate extraction, PyTesseract OCR for text recognition, dynamic penalized plate display, and MySQL logging.
spider863644
Forensight is a powerful Image OSINT + Real-time video call inspection tool for digital investigations. It automates image, metadata, and network intelligence gathering with precision tools like facial recognition, EXIF recovery, object detection, OCR, and footprint tracing — giving analysts hacker-grade insight into digital evidence.
The-Assembly
Tesseract is a cross-OS optical character recognition (OCR) engine developed by HP in the 1980s and, since 2006, maintained by Google as an open-source project with high marks for accuracy in reading raw image data into digital characters. The project has been continuously developed and now offers OCR supported by LSTM neural networks for highly improved results. In this session, we'll use the Python wrapper for Tesseract to first test drive OCR on images through code before connecting our solution to a live IP video feed from your smartphone processed through OpenCV, and then translating the resultant text stream into audible form with gTTS (Google Text-To-Speech), enabling our mashup program to automatically read out loud from any script it 'sees'. Prerequisites: a Python IDE such as PyCharm (https://www.jetbrains.com/pycharm); the Tesseract engine (https://tesseract-ocr.github.io/tessdoc/Home.html); a smartphone configured as an IP Webcam (https://www.makeuseof.com/tag/ip-webcam-android-phone-as-a-web-cam/). To learn more about The Assembly's workshops, visit http://theassembly.ae or email workshops@theassembly.ae.
akarindt
Windows desktop application that assists players of Umamusume Pretty Derby by providing a real-time event tracker and choice assistant through OCR (Optical Character Recognition) technology.
MyRockae
an asynchronous service that processes file uploads, extracts text content using OCR, and interfaces with external LLM APIs to generate quizzes, flashcards, and other interactive educational content, ensuring efficient file handling and reliable data transfer to third-party AI services for real-time content generation.
Testing out HTR-OCR-Text translation using Google's Tesseract engine in real time.
NitishKumar-ai
PersonalLearningPro is an open‑source, AI-powered school learning platform that offers intelligent test creation, adaptive AI tutoring, OCR-based test scanning, real-time chat, and role-based dashboards for students, teachers, principals, admins, and parents.
grasp-pixel
Real-time AI voice dubbing for Arknights (명일방주) stories. Clones character voices with GPT-SoVITS and auto-plays TTS by recognizing dialogues via screen capture + OCR.
iseahound
Real-time Optical Character Recognition (OCR) Wrapper in AHK.
iFleey
Real-time OCR app for Android with PP-OCRv5 and LiteRT.
CodeEngineTechnology
Real Time OCR Web App (React, NodeJS, Python and AWS)
KiranThomasCherian
An optical character recognition (OCR) application made with Flutter. The user can scan text from images in the gallery or scan in real time using the phone's back camera.
ganeshshejul
Real-time deep learning system that detects emergency vehicles (ambulances, fire trucks) using SSD MobileNetV2 and OCR, automatically triggering traffic signal control to prioritize emergency vehicle passage. Built with TensorFlow, OpenCV, and Tesseract OCR.
qq751220449
DB, OCR, PyTorch, text detection algorithm. A PyTorch implementation of "Real-time Scene Text Detection with Differentiable Binarization".
reuAC
A Windows-based screenshot OCR utility powered by DeepSeek-OCR. This tool allows users to quickly capture screen regions and perform high-accuracy Optical Character Recognition (OCR) directly on the captured image, leveraging the powerful DeepSeek-OCR model. It supports local model deployment and features real-time model output streaming.
Bbalduzz
Real-time OCR and translation on a selectable region of the screen.