Found 2,521 repositories(showing 30)
abus-aikorea
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
hexgrad
https://hf.co/hexgrad/Kokoro-82M
santinic
Generate audiobooks from e-books
remsky
Dockerized FastAPI wrapper for Kokoro-82M text-to-speech model w/CPU ONNX and NVIDIA GPU PyTorch support, handling, and auto-stitching
denizsafak
Generate audiobooks from EPUBs, PDFs and text with synchronized captions.
rsxdalv
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
thewh1teagle
TTS with kokoro and onnx runtime
sauravpanda
Run local LLMs like llama, deepseek-distill, kokoro and more inside your browser
nazdridoy
A CLI text-to-speech tool using the Kokoro model, supporting multiple languages, voices (with blending), and various input formats including EPUB books and PDF documents.
fikrikarim
On-device, real-time multimodal AI. Have natural voice and vision conversations with an AI that runs entirely on your machine. Powered by Gemma 4 E2B and Kokoro.
mbailey
Natural (2-way) voice conversations with Claude Code
lucasjinreal
🔥🔥 Kokoro in Rust. https://huggingface.co/hexgrad/Kokoro-82M Insanely fast, realtime TTS with high quality you ever have.
eduardolat
🔊 Kokoro Web: Free AI text-to-speech, online or self-hosted, OpenAI compatible!
prakharsr
Audiobook Creator is an app that converts books (EPUB, PDF, TXT etc.) into fully voiced audiobooks with intelligent character voice attribution. It uses LLMs and Kokoro/Orpheus TTS to generate engaging, multi-voice audiobooks. Features include emotion tag addition, character identification, and customizable narration. Licensed under GPL-3.0
ShayneP
Local voice AI powered by Ollama, Kokoro, Nemotron STT, and LiveKit.
CodeUpdaterBot
The best way to use AI is on your own computer. Use local or paid API models, and ctrl+k to show/hide the chat UI. Experience the future of AI, and help build it too!
bigsk1
🎙️ Speak with AI - Run locally using Ollama, OpenAI, Anthropic or xAI - Speech uses SparkTTS, OpenAI, ElevenLabs, Kokoro or Typecast
SearchSavior
Inference engine for Intel devices. Serve LLMs, VLMs, Whisper, Kokoro-TTS, Embedding and Rerank models over OpenAI endpoints.
rhulha
Unlimited text-to-speech in the Browser using Kokoro-JS, 100% local, 100% open source
PierrunoYT
A local implementation of the Kokoro Text-to-Speech model, featuring dynamic module loading, automatic dependency management, and a web interface.
amanvirparhar
A real-time speech-to-speech chatbot powered by Whisper Small, Llama 3.2, and Kokoro-82M.
RobViren
A random walk voice style cloning application for Kokoro text to speech
ddxfish
She's the AI agent you come home to.
mlalma
Kokoro TTS for iOS and macOSX
SUP3RMASS1VE
🟢 NVIDIA ONLY – All-in-One TTS App with Kokoro, KittenTTS, Higgs audio, Chatterbox, Fish-Speech, F5 & index-tts & indextts2, Supports Conversation Mode & eBook-to-Audiobook. All features work across all engines in a unified interface except vibe voice which is it's own app panel.
Lyrcaxis
Fast local TTS inference engine in C# with ONNX runtime. Multi-speaker, multi-platform and multilingual. Integrate on your .NET projects using a plug-and-play NuGet package, complete with all voices.
xenova
ML-powered speech synthesis directly in your browser
tarun7r
Cascading voice assistant combining real-time speech recognition, AI reasoning, and neural text-to-speech capabilities.
Xerophayze
TTS-Story is a web-based multi‑voice TTS studio for turning tagged scripts into audiobooks—featuring full speaker management, chunk review/regeneration, a job queue and library system, and local GPU or API backends including Kokoro, Chatterbox, VOX CPM, Pocket-TTS, Kitten-TTS, IndexTTS and QWEN3 TTS.
met4citizen
HeadTTS: Free neural text-to-speech (Kokoro) with timestamps and visemes for lip-sync. Runs in-browser (WebGPU/WASM) or on local Node.js WebSocket/REST server (CPU).