Found 249 repositories(showing 30)
smthemex
Sonic is a method about ' Shifting Focus to Global Audio Perception in Portrait Animation',you can use it in comfyUI
ShmuelRonen
This node provides lip-sync capabilities in ComfyUI using ByteDance's LatentSync model. It allows you to synchronize video lips with audio input.
diodiogod
A ComfyUI custom node integration for local multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Echo-TTS, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools
ryanontheinside
Everything-Reactivity in ComfyUI (audio, MIDI, motion, proximity, and more). Animate and manipulate images, masks, videos, audio, and more. Native ACEStep extensions
yvann-ba
Audio Reactivity Nodes for ComfyUI 🔊 Create AI generated audio-driven animations
wildminder
ComfyUI custom node for the VibeVoice TTS. Expressive, long-form, multi-speaker conversational audio
christian-byrne
Separate stems (vocals, bass, drums, other) from audio. Recombine, tempo match, slice/crop audio
ShmuelRonen
A ComfyUI custom node that integrates Google's Gemini Flash 2.0 Experimental model, enabling multimodal analysis of text, images, video frames, and audio directly within ComfyUI workflows.
The New Stable Diffusion Audio Sampler 1.0 In a ComfyUI Node. Make some beats!
yuvraj108c
Transcribe audio and add subtitles to videos using Whisper in ComfyUI
Saganaki22
ComfyUI custom nodes for Fish Audio S2-Pro TTS — voice clone, multi-speaker, and text-to-speech
billwuhao
A Text To Speech node using Step-Audio-TTS in ComfyUI. Can speak, rap, sing, or clone voice.
ShmuelRonen
A custom node for ComfyUI that allows you to perform lip-syncing on videos using the Wav2Lip model. It takes an input video and an audio file and generates a lip-synced output video.
DarioFT
ComfyUI custom nodes for Qwen3-ASR (Automatic Speech Recognition) - audio-to-text transcription supporting 52 languages and dialects.
JiSenHua
ComfyUI to TouchDesigner custom node for real-time streaming of images, video, 3D models, audio, and text.
Firetheft
The Ultimate Local File Manager for Images, Videos, and Audio in ComfyUI
snicolast
Custom nodes that bring Character.AI's Ovi video+audio generator to ComfyUI with streamlined setup, selectable precision, attention-backend control, and per-node device targeting for multi-GPU rigs.
a1lazydog
No description available
LucipherDev
ComfyUI Custom Nodes for "TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching". This generates high-quality 44.1kHz audio up to 30 seconds using just a text prompt.
princepainter
ComfyUI custom nodes for LTXV audio-video separation sampling and latent preparation. PainterSamplerLTXV: Advanced sampler with external sigmas support - PainterLTXVtoVideo: LTXV latent preparation with audio/video separation
eigenpunk
some generative audio tools for ComfyUI
jags111
A collection amazing audio tools for working with audio and sound files in comfyUI
ID-LoRA
Custom ComfyUI node for generating videos with audio-visual identity based on a reference voice and image
billwuhao
A ComfyUI node containing multiple audio processing tools.
princepainter
A comprehensive ComfyUI toolkit for video generation, image editing, and audio-driven lip‑sync, featuring Flux, LTXV, Wan2.2 and advanced batch workflows.
SpenserCai
Comfyui custom node for FunAudioLLM include CosyVoice and SenseVoice
smthemex
HunyuanVideo-Avatar: High-Fidelity Audio-Driven Human Animation for Multiple Characters,try it in comfyUI ,if your VRAM >24G
Natural language → ComfyUI workflow JSON. 34 built-in templates, 360+ node definitions, auto model download. Supports txt2img, img2img, txt2vid, img2vid, audio, 3D generation across SD1.5/SDXL/SD3/FLUX/Wan2.2/HunyuanVideo/LTXV/Mochi/Cosmos + LLM integration. Works as a skill for Claude Code, Cursor, and other AI coding agents.
Saganaki22
ComfyUI node for AudioSR - Versatile Audio Super Resolution upscales audio to 48kHz using latent diffusion
smthemex
ComfyUI_Hallo2: Long-Duration and High-Resolution Audio-driven Portrait Image Animation