Found 5 repositories (showing 5)
sujan22359
No description available
Handwritten OCR + PII Extraction pipeline using OpenCV, Tesseract and EasyOCR. Includes image preprocessing, tilt correction, text extraction, PII detection and optional redaction for medical-style handwritten documents.
YIHAO0225
A multimodal PII redaction system that detects and removes sensitive information from video, audio, and text. Uses AWS Textract, Transcribe, Comprehend, and Rekognition. Features OCR, face detection, speech-to-text, audio PII detection, and automated redaction pipelines.
Automated PDF redaction pipeline using Python, AI (Gemini), and OCR. Permanently removes sensitive data (PII) from text, images, and metadata instead of just hiding it.
Prakash-sa
Async document intake and redaction pipeline, built with FastAPI, Python multiprocessing, and Tesseract OCR to automatically extract text, detect PII (emails, phone numbers, SSNs, credit cards), and generate redacted PDFs in production-ready fashion.