Found 4 repositories(showing 4)
filliptm
ComfyUI nodes for Mistral's Voxtral-4B text-to-speech model with direct PyTorch inference. Supports 20 preset voices across 9 languages on CUDA, MPS, and CPU.
debrockb
Audio transcription tool using Mistral AI Voxtral-Mini-3B model. Supports WAV, MP3, FLAC, M4A formats with automatic device detection (MPS/CUDA/CPU).
ffaerber
text to speech
snuri00
Voxtral 4B TTS inference engine with quantized CUDA kernels for low-VRAM GPUs
All 4 repositories loaded