Found 649 repositories(showing 30)
antirez
Pure C inference of Mistral Voxtral Realtime 4B speech to text model
TrevorS
Voxtral ASR & TTS running natively and in the browser. A Rust implementation of Mistral's Voxtral mini realtime ASR / TTS using the Burn ML framework
peteonrails
Voice-to-text with push-to-talk for Wayland compositors
hehehai
🎙️Voice input and translation app for macOS. Press to talk, release to paste.
herimor
VoXtream is a Full-Stream Zero-shot TTS model with Extremely Low Latency and Speaking rate Control
MIC-DKFZ
Free-Text Promptable Universal 3D Medical Image Segmentation
Al0olo
Training the missing codec encoder for Mistral's Voxtral-4B-TTS, enabling zero-shot voice cloning
Zarbuz
Import a MagicaVoxel project to Unity using the new VFX Graph
mudler
Pure C implementation of Voxtral-4B-TTS-2603
IDRnD
The VoxTube dataset official repository
elyxlz
Voxtral: Convert Mistral into a end2end SpeechLM. No information bottleneck, preserves prosody, learns interruptions from data. Unlike GPT4o (closed) or Moshi (complex), it's open, simple, natural.
Deep-unlearning
No description available
khalooei
Voxtral is a state-of-the-art model developed to handle both speech transcription and audio understanding with remarkable accuracy and efficiency. This demo interface lets you run the Voxtral model on powerful GPUs to evaluate its performance and see how it can be used for transcription and deeper analysis.
Johnson145
Offline Speech-to-Text (STT) service using Mistral's Voxtral model with Wyoming protocol compatibility for Home Assistant Assist integration.
andrijdavid
Port of Mistral's Voxtral model in C/C++
Innovative-Digitale-Medizin-IDM
This repository contains a fine-tuning script for the transcription task of Mistral's Voxtral model.
gustavhartz
Collaborative transcription service that keeps getting better
MIC-DKFZ
No description available
mzbac
No description available
gomesgustavoo
No description available
mzbac
No description available
coezbek
My investigation of Voxtral Mini-3B's capabilities.
rishikksh20
Voxtral Codec : Combining Semantic VQ and Acoustic FSQ for Ultra-Low Bitrate Speech Generation (Voxtral TTS Backbone)
AIAnytime
An AI-Powered Sales Call Analyzer with Mistral's Voxtral model built in Streamlit.
abrightmoore
MCEdit filter to parse VOX (MagicaVoxel) format to MCEdit block schematics
dmarzzz
No description available
hongchengzhu
Official Implementation of VoxTracer (MM' 23)
filliptm
ComfyUI nodes for Mistral's Voxtral-4B text-to-speech model with direct PyTorch inference. Supports 20 preset voices across 9 languages on CUDA, MPS, and CPU.
HexExecute
A sparse voxel octree in rust.
A custom implementation of Canopy Labs Orpheus server, for experimentation purposes