Found 368 repositories (showing 30)
FunAudioLLM
Multilingual large voice generation model with full-stack support for inference, training, and deployment.
abus-aikorea
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
modstart-lib
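AIGCPanel is a simple, easy-to-use, all-in-one AI digital human system. It supports video synthesis, voice synthesis, and voice cloning, and simplifies local model management with one-click import and use of AI models.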
rsxdalv
A single Gradio + React WebUI with extensions for ACE-Step, OmniVoice, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!
lenML
🍦 Speech-AI-Forge is a project built around TTS generation models, implementing an API server and a Gradio-based WebUI.
ABexit
This is a speech interaction system built on open-source models, integrating ASR, LLM, and TTS in sequence. The ASR model is SenseVoice, the LLM models are QWen2.5-0.5B/1.5B, and there are three TTS models: CosyVoice, Edge-TTS, and pyttsx3.
A version of CosyVoice for use on Windows.
xingchensong
Reverse Engineering of Supervised Semantic Speech Tokenizer (S3Tokenizer) proposed in CosyVoice
qi-hua
Accelerates CosyVoice2 inference using vLLM.
Agents365-ai
AI-powered video podcast creation skill for coding agents. Supports Bilibili & YouTube, multi-language (zh-CN/en-US), 6 TTS engines (Edge/Azure/ElevenLabs/OpenAI/Doubao/CosyVoice), 4K Remotion rendering.
jianchang512
An API interface project for CosyVoice.
AIFSH
A ComfyUI custom node for CosyVoice.
xingchensong
FlashCosyVoice: A lightweight vLLM implementation built from scratch for CosyVoice.
gpustack
A text-to-speech and speech-to-text server compatible with the OpenAI API, supporting Whisper, FunASR, Bark, and CosyVoice backends.
journey-ad
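CosyVoice2 feature extensions (pretrained voice inference / 3-second rapid cloning / natural-language control / automatic recognition / voice model saving / API)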
JLW-7
Cute voice assistant built on ESP32 to help users with reminders, productivity, and daily conversations.
xiesx123
🚀🎬 A flexible, efficient, and scalable dedicated toolbox for editing and dubbing, unleashing creative potential.
v3ucn
A version of CosyVoice for use on Apple macOS.
Moeary
An audio novel generator that supports rapid voice setup from 10 seconds of speech and multi-character management.
ScottishFold007
CosyVoice_DPO_NOTES: Supercharge Your CosyVoice Model with Cutting-Edge DPO Fine-Tuning!
zaigie
An out-of-the-box speech service for local, private deployment; quickly set up FunASR and CosyVoice2/3 backends.
filliptm
No description available
devsapp
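Intelligent, automated generation of children's audiobooks. Combines the Tongyi Qianwen large model, CosyVoice voice synthesis, Flux image generation, and Paraformer speech recognition to produce production-ready children's audiobooks.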
SpenserCai
ComfyUI custom node for FunAudioLLM, including CosyVoice and SenseVoice.
A simplified deployment solution for the Fun-CosyVoice3-0.5B-2512 speech synthesis service, with quick testing and deployment so that applications can call it.
This is a multi-character, ultra-personalized StoryTeller. It includes: 1) efficient and accurate construction of a multi-character voice library; 2) effective prompts that let a large model automatically distinguish roles; 3) ultra-personalized voice cloning with CosyVoice.
jingzhunxue
FlowMirror-HydraVox — A natively accelerated multi-head autoregressive TTS system derived from CosyVoice 3.0. It predicts multiple tokens per step for faster, high-quality speech synthesis, featuring a WebUI, FAST API, and full training pipeline compatible with CosyVoice.
zhaoyun0071
For Windows: no environment setup needed, only an NVIDIA GPU is required. Just unzip and run!
neosun100
🎙️ CosyVoice All-in-One Docker - Production-ready TTS with Web UI, REST API & Voice Cloning
ceasarXuu
A voice-cloning e-bookshelf based on CosyVoice.