Found 55,056 repositories (showing 30)
CorentinJ
Clone a voice in 5 seconds to generate arbitrary speech in real-time
unslothai
Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.
RVC-Boss
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
coqui-ai
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
mudler
LocalAI is the open-source AI engine. Run any model - LLMs, vision, voice, image, video - on any hardware. No GPU required.
2noise
A generative speech model for daily dialogue.
babysor
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
myshell-ai
Instant voice cloning by MIT and MyShell. Audio foundation model.
fishaudio
SOTA Open Source TTS
resemble-ai
SoTA open-source TTS
mastra-ai
From the team behind Gatsby, Mastra is a framework for building AI-powered applications and agents with a modern TypeScript stack.
FunAudioLLM
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
index-tts
An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System
readest
Readest is a modern, feature-rich ebook reader designed for avid readers offering seamless cross-platform access, powerful tools, and an intuitive interface to elevate your reading experience.
nari-labs
A TTS model capable of generating ultra-realistic dialogue in one pass.
DrewThomasson
Generate audiobooks from e-books, voice cloning & 1158+ languages!
pot-app
🌈 A cross-platform application for select-to-translate text translation and OCR.
NVIDIA-NeMo
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
neonbjb
A multi-voice TTS system trained with an emphasis on quality
SWivid
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
KittenML
State-of-the-art TTS model under 25MB 😻
PaddlePaddle
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
SparkAudio
Spark-TTS Inference Code
rhasspy
A fast, local neural text to speech system
rany2
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
QwenLM
Qwen3-TTS is an open-source series of TTS models developed by the Qwen team at Alibaba Cloud, supporting stable, expressive, and streaming speech generation, free-form voice design, and vivid voice cloning.
mozilla
🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)
krillinai
Video translation and dubbing tool powered by LLMs. It offers bidirectional translation across 100 languages and one-click full-process deployment. The output is adapted for platforms such as YouTube, TikTok, Douyin, Xiaohongshu, Bilibili, and WeChat Channels.
jianchang512
A voice cloning tool with a web interface that uses your own voice or any other sound to record audio
fishaudio
vits2 backbone with multilingual-bert