Found 3,207 repositories(showing 30)
kaldi-asr
kaldi-asr/kaldi is the official location of the Kaldi project.
alphacep
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
k2-fsa
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Axera NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages
espnet
End-to-End Speech Processing Toolkit
mravanelli
pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.
MontrealCorpusTools
Command line utility for forced alignment using Kaldi
k2-fsa
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.
DragonComputer
the open-source virtual assistant for Ubuntu based Linux distributions
alphacep
WebSocket, gRPC and WebRTC speech recognition server based on Vosk and Kaldi libraries
lhotse-speech
Tools for handling multimodal data in machine learning projects.
alumae
Real-time full-duplex speech recognition server, based on the Kaldi toolkit and the GStreamer framwork.
pykaldi
A Python wrapper for Kaldi
alphacep
Offline speech recognition for Android with Vosk library.
freewym
Espresso: A Fast End-to-End Neural Speech Recognition Toolkit
k2-fsa
Speech-to-text server framework with next-gen Kaldi
srvk
The official repository of the Eesen project
zw76859420
语音识别理论、论文和PPT
A React component to make correcting automated transcriptions of audio and video easier and faster. By BBC News Labs. - Work in progress
YoavRamon
This is a list of features, scripts, blogs and resources for better using Kaldi ( http://kaldi-asr.org/ )
ccoreilly
A speech recognition library running in the browser thanks to a WebAssembly build of Vosk
gooofy
Open tools and data for cloudless automatic speech recognition
funcwj
Tools for Speech Enhancement integrated with Kaldi
hitachi-speech
End-to-End Neural Diarization
dictation-toolbox
Speech recognition framework allowing powerful Python-based scripting and extension of Dragon NaturallySpeaking (DNS), Windows Speech Recognition (WSR), Kaldi and CMU Pocket Sphinx
open-speech
speech-aligner,是一个从“人声语音”及其“语言文本”,产生音素级别时间对齐标注的工具。speech-aligner, is a tool that generate phoneme-level alignment between human speech and its transcription
KarelVesely84
Python functions for reading kaldi data formats. Useful for rapid prototyping with python.
goodatlas
Kaldi-based Korean ASR (한국어 음성인식) open-source project
tencent-ailab
a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
daanzu
Python Kaldi speech recognition with grammars that can be set active/inactive dynamically at decode-time
SergeyShk
Проект для распознавания речи на русском языке на основе pykaldi.