Found 211 repositories (showing 30)
QwenLM
Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.
meizhong986
ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV
QwenLM
Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support.
antirez
C inference for Qwen3-ASR 0.6B and 1.7B transcription models
FranckyB
A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.
nexmoe
Eve Recorder: A cross-platform long-running microphone recorder with real-time transcription. It uses Qwen3-ASR by default. VAD keeps only speech segments and transcribes speech-only chunks.
yeahhe365
A modern web UI for the Qwen ASR model, featuring audio recording, PWA support, Picture-in-Picture mode, and local caching for fast, accurate transcriptions.
Quantatirsk
Speech recognition API service powered by FunASR and Qwen3-ASR, supporting 52 languages, compatible with the OpenAI API and the Alibaba Cloud Speech API.
second-state
Rust implementation of Qwen3-ASR automatic speech recognition
DarioFT
ComfyUI custom nodes for Qwen3-ASR (Automatic Speech Recognition) - audio-to-text transcription supporting 52 languages and dialects.
HaujetZhao
Exports the LLM part of Qwen3-ASR to GGUF and runs accelerated inference with llama.cpp, which supports Vulkan and CUDA acceleration.
predict-woo
Implementation of Qwen3-ASR-0.6B in GGML
destinyfrancis
Open CLAW Knowledge Distiller · 龍蝦知識蒸餾器 — Turn YouTube/Bilibili videos into structured knowledge articles. Local Qwen3-ASR MLX + AI summarization. MCP server for Claude Code / Open CLAW agents.
moona3k
Qwen3-ASR speech recognition on Apple Silicon via MLX
1038lab
A lightweight ComfyUI custom node pack for Qwen3-ASR, providing simple speech‑to‑text workflows with local model caching and optional timestamp output. Supports Qwen/Qwen3‑ASR‑1.7B and 0.6B, with HuggingFace/ModelScope download options and clean integration for ComfyUI pipelines.
dolphin-creator
Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR + Qwen3-VL. Optimized for Apple Silicon (MLX) & Windows/Linux (Llama.cpp).
uaysk
Qwen3-ASR server with an OpenAI-compatible API
shumoLR
A ComfyUI speech recognition plugin based on [Qwen3-ASR](https://github.com/QwenLM/Qwen3-ASR).
alan890104
Pure-Rust inference engine for Qwen3-ASR speech recognition models (0.6B & 1.7B) using candle with Metal/CUDA acceleration
andrewleech
No description available
kaushiknishchay
ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports high-accuracy ASR and language identification for 52 languages/dialects, including 22 Chinese dialects and various English accents. Features word-level timestamps, long audio transcription, and VRAM-optimized inference.
s1916
Generates movie subtitles using the Qwen3-ASR model
homestoo
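A Qwen3 speech recognition (ASR) service compatible with the OpenAI API, supporting multiple deployment methods. It can be called directly from spokenly and provides complete speech-to-text functionality, including multilingual support, smart punctuation formatting, and an interface fully compatible with the OpenAI Whisper API.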
lgy1027
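Real-time speech transcription and speaker identification system built on Qwen3-ASR, supporting WebSocket streaming and switching between multiple voiceprint engines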
shershah1024
Qwen3-ASR speech-to-text for llama.cpp — patch, GGUF models, and benchmarks
hanxiao
MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon
Wasser1462
A small and simple example showing how to run Qwen3-ASR with ONNX Runtime.
appautomaton
TNT 🧨, powered by Qwen3-ASR
leeyc09
Silenci — AI-powered silence removal & subtitle generator for Final Cut Pro. macOS native app, Qwen3-ASR + MLX, 100% local & free.
neosun100
Qwen3-ASR Production Docker: 52-language ASR with dark-theme UI + REST API + MCP. All-in-One, zero runtime download.