Found 321 repositories(showing 30)
microsoft
Open-Source Frontier Voice AI
Enemyx-net
A comprehensive ComfyUI integration for Microsoft's VibeVoice text-to-speech model, enabling high-quality single and multi-speaker voice synthesis directly within your ComfyUI workflows.
vibevoice-community
VibeVoice: Expressive, longform conversational speech synthesis. (Community fork)
diodiogod
A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools
wildminder
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
zhao-kun
VibeVoiceFusion is a full-stack, multi-speaker voice generation web system featuring LoRA fine-tuning, batch generation, and VRAM optimization. Based on Microsoft's VibeVoice (AR + diffusion architecture)
FranckyB
A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
homelab-00
A fully local and private Speech-To-Text app with cross-platform support, speaker diarization, Audio Notebook mode, LM Studio integration, and both longform and live transcription.
voicepowered-ai
Unofficial WIP LoRa Finetuning repository for VibeVoice
mpaepper
Fast local speech-to-text for any app using faster-whisper
zeropointnine
Audiobook creation tool with support for multiple TTS models (Qwen3-TTS, IndexTTS2, VibeVoice, Chatterbox, Fish S2-Pro, Higgs Audio V2, etc), focused on high-quality output. Plus player/reader web app.
DigiJoe79
A modern desktop application built with Tauri 2.0 for creating professional audiobooks using advanced text-to-speech and voice cloning technology (XTTS, Chatterbox, VibeVoice). Features drag & drop organization, multi-language support (17+ languages), smart text segmentation with NLP, and export to MP3/M4A/WAV formats.
marhensa
OpenAI API-compatible text-to-speech server using Microsoft VibeVoice-Realtime-0.5B. Docker or Python venv support, multiple voices with OpenAI aliases, CUDA-optimized.
shamspias
Beautiful voice app: record or upload to train a voice, generate speech from text or files, save & download voices.
danielclough
Rust implementation of VibeVoice text-to-speech with voice cloning and multi-speaker synthesis.
akadoubleone
No description available
SanDiegoDude
Frontier Open-Source Text-to-Speech
is a powerful, locally-hosted Text-to-Speech (TTS) application designed to provide high-quality voice synthesis. It leverages the Microsoft VibeVoice model as its core engine, seamlessly integrating a fast Python FastAPI backend and a modern React/TypeScript frontend.
shijincai
Archive of the official Microsoft VibeVoice repository (7B & 1.5B). Backup of the deleted source code for the open-source TTS models, including the removed 7B version. Try the VibeVoice online service
ncoder-ai
FastAPI wrapper around original Vibevoice 1.5B and 7B models
mzbac
vibevoice real time 0.5B swift port
timoncool
No description available
vibevoice-community
API server for VibeVoice
vorojar
Open-source AI audiobook studio. A free, private alternative to ElevenLabs. 3 voice modes, per-sentence voice & emotion control, LLM smart character analysis, mixed-voice generation. Runs 100% locally on your GPU with zero API costs.
mypapit
VibeVoice cloned
Deveraux-Parker
Getting VibeVoice 7b working with 10 gb of vram.
VibeVoice nodes with exl3 support. Realtime inference speed on 3090
0seba
No description available
BumpyClock
No description available
elbruno
No description available