A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Stars
15.5k
Forks
1.6k
Watchers
15.5k
Open Issues
546
Overall repository health assessment
No package.json found
This might not be a Node.js project
Fix #2815: apply punc_model in generate() when vad_model is not configured (#2816)
43e05d4View on GitHubFix a compatibility issue with the timestamp format for Fun-ASR-Nano-2512 (#2814)
45c74cfView on GitHubFix #2809: remove unnecessary GPU-to-CPU transfer in VAD ComputeScores (#2817)
b6c29a6View on GitHubFix #2793: convert stereo/multi-channel audio to mono in extract_fbank (#2819)
a1bc5b5View on GitHubFix #2787: model.generate时,传入的是音频字节流,会对采样率校验耗时200ms。 (#2820)
1045123View on GitHubFix #2782: convert timestamp to int before offset addition in inference_with_vad (#2821)
5850915View on GitHubfix(dataset): fix dynamic masking error in input construction (#2801)
81d96c9View on GitHub1.9k
commits
643
commits
637
commits
311
commits
291
commits
108
commits
104
commits
65
commits
62
commits
41
commits