Found 2 repositories(showing 2)
psdwizzard
This Chrome extension integrates screen reader functionality using the XttS-webui API. Currently in beta and using the XttS Server API backend, it will soon move to AllTalk. It enhances web accessibility with seamless text-to-speech capabilities. Licensed under the MIT License for unrestricted and commercial use
kirollos2001
An end-to-end modular NLP pipeline that transforms raw Arabic audio and video recordings into structured, speaker-attributed transcripts — then summarizes them with Google Gemini and reads the summary aloud using neural Arabic TTS (XTTS). Built for researchers, journalists, and developers working with Arabic speech data.
All 2 repositories loaded