Speech recognition API service powered by FunASR and Qwen3-ASR. It supports 52 languages and is compatible with the OpenAI API and the Alibaba Cloud Speech API.
Stars: 226
Forks: 36
Watchers: 226
Open Issues: 10
303 commits
c79ebf3 feat: add MPS-aware device pipeline and improve ASR timestamp handling
ac9682f fix(docker): add setuptools to CPU requirements to resolve pkg_resources import error
3584253 fix(docker): add nvcc for FlashInfer JIT compilation on older GPUs
121dd0e refactor(deploy): use CUDA_VISIBLE_DEVICES for multi-GPU instance control
8567ac5 feat(deploy): unify internal nginx startup for cpu/gpu and auto multi-gpu
91f812e Revert "fix(asr): serialize full long-audio requests to prevent cross-request mixing"
ad05bc4 fix(asr): serialize full long-audio requests to prevent cross-request mixing
2680ae6 fix(asr): serialize main inference to avoid cross-request mixing
29f9e6e refactor(auth): centralize token extraction and validation
826cad0 fix(auth): treat empty API_KEY as optional and support ws query token
cd51b21 refactor(audio-splitter): merge only short segments after VAD
a6d2c7a tune(audio-splitter): reduce max silence merge threshold to 2s