Found 42,342 repositories (showing 30)
openai
Robust Speech Recognition via Large-Scale Weak Supervision
ggml-org
Port of OpenAI's Whisper model in C/C++
SYSTRAN
Faster Whisper transcription with CTranslate2
m-bain
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
chidiwilliams
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
modelscope
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
jamiepine
The open-source voice synthesis studio
PaddlePaddle
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Vaibhavs10
No description available
Zackriya-Solutions
Privacy-first AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization, built on Rust. 100% local processing, no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 self-hosted, open-source AI meeting note taker for macOS & Windows.
sashabaranov
OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go
Const-me
High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model
QuentinFuxa
Simultaneous speech-to-text models
niedev
Open source real-time translation app for Android that runs locally
xorbitsai
Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.
abus-aikorea
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
Shaunwei
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
ddean2009
Batch-generate all kinds of short videos with AI in one click, automatically batch-remix short videos, and automatically publish them to Douyin, Kuaishou, Xiaohongshu, and WeChat Channels. Making money has never been this easy! Supports the local speech models chatTTS, faster-whisper, and GPTSoVITS, and cloud speech services from Azure, Alibaba Cloud, and Tencent Cloud. Supports direct AI image generation with Stable Diffusion and ComfyUI.
argmaxinc
On-device Speech Recognition for Apple Silicon
thewh1teagle
Transcribe on your own!
ConnectAI-E
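🎒 Feishu × (GPT-4 + GPT-4V + DALL·E-3 + Whisper) = a work experience that flies 🚀 Voice conversation, role play, multi-topic discussion, image creation, spreadsheet analysis, document export 🚀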
MahmoudAshraf97
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
wenet-e2e
Production First and Production Ready End-to-End Speech Recognition Toolkit
sanchit-gandhi
JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.
WhisperSpeech
An Open Source text-to-speech system built by inverting Whisper.
cactus-compute
Low-latency AI engine for mobile devices & wearables
leetcode-mafia
Mac app for crushing tech interviews with AI
huggingface
Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.
collabora
A nearly-live implementation of OpenAI's Whisper.
embarklabs
Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms