Found 6 repositories(showing 6)
HaujetZhao
将 Qwen3-ASR 的 LLM 部分导出为 GGUF,用 llama.cpp 进行加速推理。后者支持 Vulkan 和 Cuda 加速。
predict-woo
Implementation of Qwen3-ASR-0.6B in GGML
dolphin-creator
Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR + Qwen3-VL. Optimized for Apple Silicon (MLX) & Windows/Linux (Llama.cpp).
shershah1024
Qwen3-ASR speech-to-text for llama.cpp — patch, GGUF models, and benchmarks
femelo
Python bindings for Qwen3ASR.cpp
vieenrose
No description available
All 6 repositories loaded