Search Results

Found 72,989 repositories (showing 30)

unsloth

unslothai

💚95

Unsloth Studio is a web UI for training and running open models like Qwen3.5, Gemma 4, DeepSeek, gpt-oss locally.

59.8k

5.1k

Apache-2.0

Python

Updated 2 minutes ago

agent, deepseek, fine-tuning, +16

GPT-SoVITS

RVC-Boss

💚100

1 min voice data can also be used to train a good TTS model! (few shot voice cloning)

56.4k

6.2k

MIT

Python

Updated 1 minute ago

text-to-speech, tts, vits, +3

TTS

coqui-ai

💚100

🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production

45.0k

6.0k

MPL-2.0

Python

Updated 52 minutes ago

deep-learning, glow-tts, hifigan, +16

ChatTTS

2noise

💚100

A generative speech model for daily dialogue.

39.0k

4.2k

AGPL-3.0

Python

Updated 3 hours ago

agent, chat, chatgpt, +14

MockingBird

babysor

💚100

🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time

36.9k

5.2k

NOASSERTION

Python

Updated 44 minutes ago

ai, deep-learning, pytorch, +3

OpenVoice

myshell-ai

💚100

Instant voice cloning by MIT and MyShell. Audio foundation model.

36.2k

4.0k

MIT

Python

Updated 1 hour ago

text-to-speech, tts, voice-clone, +1

DeepSpeech

mozilla

💚100

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

26.8k

4.1k

MPL-2.0

C++

Updated 14 hours ago

deep-learning, deepspeech, embedded, +7

CosyVoice

FunAudioLLM

💚95

Multi-lingual large voice generation model, providing full-stack inference, training, and deployment capabilities.

20.4k

2.3k

Apache-2.0

Python

Updated 38 minutes ago

audio-generation, cantonese, chatbot, +16

index-tts

💚100

An Industrial-Level Controllable and Efficient Zero-Shot Text-To-Speech System

19.8k

2.4k

NOASSERTION

Python

Updated 5 minutes ago

bigvgan, cross-lingual, indextts, +4

Handy

cjpais

💚100

A free, open source, and extensible speech-to-text application that works completely offline.

19.4k

1.6k

MIT

Rust

Updated 4 minutes ago

accessibility, cross-platform, speech-to-text, +1

dia

nari-labs

💚95

A TTS model capable of generating ultra-realistic dialogue in one pass.

19.2k

1.7k

Apache-2.0

Python

Updated 7 hours ago

ai, open-weight, text-to-speech

leon

leon-ai

💚94

🧠 Leon is your open-source personal assistant.

17.1k

1.4k

MIT

TypeScript

Updated 5 minutes ago

ai, ai-assistant, artificial-intelligence, +17

NeMo

NVIDIA-NeMo

💚100

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

17.1k

3.4k

Apache-2.0

Python

Updated 3 minutes ago

asr, deeplearning, generative-ai, +7

pyvideotrans

jianchang512

💚95

Translate the video from one language to another and embed dubbing & subtitles.

16.8k

2.0k

GPL-3.0

Python

Updated 2 minutes ago

speech-to-text, text-to-speech, video-transition

FunASR

modelscope

💚100

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

15.6k

1.6k

MIT

Python

Updated 45 minutes ago

audio-visual-speech-recognition, conformer, dfsmn, +12

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

12.6k

2.0k

Apache-2.0

Python

Updated 11 hours ago

asr, code-switch, conformer, +17

sherpa-onnx

k2-fsa

💛89

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

11.4k

1.3k

Apache-2.0

C++

Updated 33 minutes ago

aarch64, android, arm32, +17

piper

rhasspy

💚90

A fast, local neural text to speech system

10.8k

943

MIT

C++

Updated 1 hour ago

speech-synthesis, text-to-speech, tts

edge-tts

rany2

💚90

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

10.5k

989

NOASSERTION

Python

Updated 1 hour ago

speech-synthesis, text-to-speech, tts

TTS

mozilla

💚93

🤖 💬 Deep learning for Text to Speech (Discussion forum: https://discourse.mozilla.org/c/tts)

10.1k

1.3k

MPL-2.0

Jupyter Notebook

Updated 2 hours ago

dataset-analysis, deep-learning, gantts, +13

WhisperLiveKit

QuentinFuxa

💚90

Simultaneous speech-to-text models

10.1k

1.0k

Apache-2.0

Python

Updated 2 hours ago

espnet

💚95

End-to-End Speech Processing Toolkit

9.8k

2.4k

Apache-2.0

Python

Updated 5 hours ago

chainer, deep-learning, end-to-end, +13

Amphion

open-mmlab

💛88

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

9.7k

805

MIT

Python

Updated 5 hours ago

audio-generation, audio-synthesis, audioldm, +14

RealtimeSTT

KoljaB

💛88

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

9.6k

834

MIT

Python

Updated 9 hours ago

python, realtime, speech-to-text

VoiceCraft

jasonppy

💛86

Zero-Shot Speech Editing and Text-to-Speech in the Wild

8.5k

797

NOASSERTION

Jupyter Notebook

Updated 18 hours ago

EmotiVoice

netease-youdao

💛86

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

8.5k

749

Apache-2.0

Python

Updated 10 hours ago

ai, deep-learning, emotion, +10

VALL-E-X

Plachtaa

💛81

An open source implementation of Microsoft's VALL-E X zero-shot TTS model. A demo is available at https://plachtaa.github.io/vallex/

8.0k

779

MIT

Python

Updated 21 hours ago

emotional-speech, gpt, text-to-speech, +4

vits

jaywalnut310

💚92

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

7.8k

1.4k

MIT

Python

Updated 2 hours ago

deep-learning, pytorch, speech-synthesis, +2

ChatTTS-ui

jianchang512

💛87

A simple local web interface that uses ChatTTS to synthesize text into speech, and also exposes an API for external use.

7.5k

908

NOASSERTION

Python

Updated 5 hours ago

chattts, tts

MeloTTS

myshell-ai

💛88

High-quality multi-lingual text-to-speech library by MyShell.ai. Support English, Spanish, French, Chinese, Japanese and Korean.

7.3k

1.0k

MIT

Python

Updated 1 hour ago

chinese, english, french, +6