Search Results

Found 211 repositories (showing 30)

Qwen3-ASR

QwenLM

💛70

Qwen3-ASR is an open-source series of ASR models developed by the Qwen team at Alibaba Cloud, supporting stable multilingual speech/music/song recognition, language detection and timestamp prediction.

2.3k

225

Apache-2.0

Python

Updated 1 hour ago

WhisperJAV

meizhong986

💛73

ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV

1.4k

124

MIT

Python

Updated 48 minutes ago

ai, translate, hallucination, japanese, +10

Qwen3-ASR-Toolkit

QwenLM

🧡67

Official Python toolkit for the Qwen3-ASR API. Parallel high‑throughput calls, robust long‑audio transcription, multi‑sample‑rate support.

924

MIT

Python

Updated 4 minutes ago

qwen-asr

antirez

💛71

C inference for Qwen3-ASR 0.6B and 1.7B transcription models

506

MIT

Updated 6 hours ago

Voice-Clone-Studio

FranckyB

🧡66

A Gradio-based web UI for voice cloning and voice design, powered by Qwen3-TTS & VibeVoice. Can use Whisper or VibeVoice-ASR for automatic transcription.

386

Apache-2.0

Python

Updated 14 hours ago

eve

nexmoe

🧡61

Eve Recorder: A cross-platform long-running microphone recorder with real-time transcription. It uses Qwen3-ASR by default. VAD keeps only speech segments and transcribes speech-only chunks.

315

TypeScript

Updated 22 hours ago

Qwen3-ASR-Studio

yeahhe365

💛71

A modern web UI for the Qwen ASR model, featuring audio recording, PWA support, Picture-in-Picture mode, and local caching for fast, accurate transcriptions.

256

MIT

TypeScript

Updated 16 hours ago

gradio, qwen-asr, react, +3

funasr-api

Quantatirsk

🧡66

Speech recognition API service powered by FunASR and Qwen3-ASR, supporting 52 languages and compatible with the OpenAI API and the Alibaba Cloud Speech API.

226

Python

Updated 17 hours ago

asr, qwen3

qwen3_asr_rs

second-state

💛70

Rust implementation of Qwen3-ASR automatic speech recognition

215

Apache-2.0

Rust

Updated 4 hours ago

ComfyUI-Qwen3-ASR

DarioFT

🧡65

ComfyUI custom nodes for Qwen3-ASR (Automatic Speech Recognition) - audio-to-text transcription supporting 52 languages and dialects.

159

Python

Updated 3 days ago

Qwen3-ASR-GGUF

HaujetZhao

🧡60

Exports the LLM part of Qwen3-ASR to GGUF and runs accelerated inference with llama.cpp, which supports Vulkan and CUDA acceleration.

100

C++

Updated 17 hours ago

qwen3-asr.cpp

predict-woo

🧡60

Implementation of Qwen3-ASR-0.6B in GGML

C++

Updated 6 hours ago

openclaw-knowledge-distiller

destinyfrancis

🧡55

Open CLAW Knowledge Distiller · 龍蝦知識蒸餾器 — Turn YouTube/Bilibili videos into structured knowledge articles. Local Qwen3-ASR MLX + AI summarization. MCP server for Claude Code / Open CLAW agents.

Python

Updated 1 week ago

mlx-qwen3-asr

moona3k

💛70

Qwen3-ASR speech recognition on Apple Silicon via MLX

Apache-2.0

Python

Updated 16 hours ago

ComfyUI-QwenASR

1038lab

🧡60

A lightweight ComfyUI custom node pack for Qwen3-ASR, providing simple speech‑to‑text workflows with local model caching and optional timestamp output. Supports Qwen/Qwen3‑ASR‑1.7B and 0.6B, with HuggingFace/ModelScope download options and clean integration for ComfyUI pipelines.

GPL-3.0

Python

Updated 4 days ago

VideoContext-Engine

dolphin-creator

🧡65

Local Video RAG Engine. A FastAPI microservice for video understanding: Scene Detection + Whisper ASR + Qwen3-VL. Optimized for Apple Silicon (MLX) & Windows/Linux (Llama.cpp).

Python

Updated 5 hours ago

apple-silicon, fastapi, llama-cpp, +9

qwen3-asr-openai

uaysk

🧡65

Qwen3-ASR server with an OpenAI-compatible API

Python

Updated 6 days ago

Comfyui_SynVow_Qwen3ASR

shumoLR

🧡65

A ComfyUI speech recognition plugin based on [Qwen3-ASR](https://github.com/QwenLM/Qwen3-ASR).

Python

Updated 1 day ago

qwen3-asr-rs

alan890104

🧡65

Pure-Rust inference engine for Qwen3-ASR speech recognition models (0.6B & 1.7B) using candle with Metal/CUDA acceleration

MIT

Rust

Updated 5 days ago

candle, cuda, metal, +3

qwen3-asr-onnx

andrewleech

🧡60

No description available

Apache-2.0

Python

Updated 3 days ago

ComfyUI-Qwen3-ASR

kaushiknishchay

💛70

ComfyUI nodes for Qwen3-ASR (0.6B/1.7B) and ForcedAligner. Supports high-accuracy ASR and language identification for 52 languages/dialects, including 22 Chinese dialects and various English accents. Features word-level timestamps, long audio transcription, and VRAM-optimized inference.

MIT

Python

Updated 5 days ago

asr, asr-model, qwen3, +3

Qwen3-ASR-src

s1916

❤️45

Generates movie subtitles using the Qwen3-ASR model.

Python

Updated 1 month ago

qwen3-asr-EdgeOne

homestoo

🧡60

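An OpenAI-compatible Qwen3 speech recognition (ASR) service that supports multiple deployment options. It can be called directly from Spokenly and provides full speech-to-text functionality, including multilingual support, smart punctuation formatting, and an interface fully compatible with the OpenAI Whisper API.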

MIT

JavaScript

Updated 1 week ago

matrix-live-diarizer

lgy1027

🧡65

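Real-time speech transcription and speaker identification system built on Qwen3-ASR, with WebSocket streaming and switching between multiple voiceprint engines.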

Python

Updated 2 days ago

qwen3-asr-llamacpp

shershah1024

🧡55

Qwen3-ASR speech-to-text for llama.cpp — patch, GGUF models, and benchmarks

Python

Updated 5 days ago

mls

hanxiao

🧡60

MLX Local Serving (MLS) - Unified ASR, TTS, and Translation on Apple Silicon

Apache-2.0

HTML

Updated 1 week ago

apple-silicon, asr, mlx, +3

Qwen3-ASR-onnx

Wasser1462

🧡65

A small and simple example showing how to run Qwen3-ASR with ONNX Runtime.

Python

Updated 1 hour ago

tnt-asr

appautomaton

💛70

TNT 🧨, powered by Qwen3-ASR

NOASSERTION

Updated 1 day ago

Silence-Cutter

leeyc09

💛70

Silenci — AI-powered silence removal & subtitle generator for Final Cut Pro. macOS native app, Qwen3-ASR + MLX, 100% local & free.

Apache-2.0

Swift

Updated 1 day ago

apple-silicon, fcpxml, final-cut-pro, +9

Qwen3-ASR

neosun100

❤️45

Qwen3-ASR Production Docker: 52-language ASR with dark-theme UI + REST API + MCP. All-in-One, zero runtime download.

Apache-2.0

Python

Updated 1 week ago
