Found 328 repositories (showing 30)
speechbrain
A PyTorch-based Speech Toolkit
speechbrain
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems, including speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
speechbrain
This repository contains the SpeechBrain Benchmarks
askrella
Transcription and TTS REST API (OpenAI Whisper, SpeechBrain)
speechbrain
Extensions to YAML syntax for better python interaction
ns2250225
A fast target speaker extraction (TSE) service based on SpeechBrain
guxm2021
[ISMIR 2022] Transfer Learning of wav2vec 2.0 for Automatic Lyric Transcription
guxm2021
[TOMM 2024] Automatic Lyric Transcription and Automatic Music Transcription from Multimodal Singing
maximus-choi
Real-time speaker diarization using straightforward, intuitive logic, with high accuracy thanks to SpeechBrain, Pyannote, and WeSpeaker models
jordicapde
StutterFormer is an AI model that takes a speech sample containing stuttering disfluencies and returns it with the disfluencies attenuated or removed.
sangramsingnk
Text-to-speech recipe: users can create speech signals from input text using text-to-speech (TTS), also referred to as speech synthesis. SpeechBrain supports popular TTS models, such as Tacotron 2, and vocoders, such as HiFi-GAN.
nuaazs
Backend of an anti-fraud system based on speaker identification (voiceprint) technology
benluks
Low-latency ASR using SpeechBrain StreamingASR and torchaudio StreamReader.
luomingshuang
In this repository, I combine k2 with SpeechBrain for accurate and fast decoding.
alumae
VoxLingua107 recipe for SpeechBrain
lucadellalib
Target speaker automatic speech recognition (TS-ASR)
JusperLee
Chinese-language documentation for SpeechBrain
yan-gao-GY
No description available
SELMA-project
audio, NLP, ML with huggingface, nvidia/nemo, speechbrain
OSU-slatelab
A recipe for disfluency detection on the LibriStutter dataset using SpeechBrain
caizexin
No description available
shahad-mahmud
Incremental learning for automatic speech recognition (ASR)
Processing EEG data using SpeechBrain-MOABB, with model tuning to get the best results
A Streamlit web app for speaker diarization and identification in audio files. Upload or record audio, transcribe conversations, and automatically segment and label speakers using reference samples. This app makes it easy to analyze multi-speaker audio, export transcripts, and identify "who spoke when" for meetings, interviews, and more.
amitpuri
Record voice, transcribe a prompt, generate an image from the prompt, create variations, get a description of a celebrity and upload it, plus other use cases on KB
aalto-speech
Implementation of different curriculum learning (CL) methods for SpeechBrain's ASR recipes
Hguimaraes
[Research] 2nd-place solution for Task 1 of the L3DAS21 challenge, using an FCN architecture and perceptual losses. Implemented with the SpeechBrain toolkit
progressionnetwork
Attempting to build a custom pipeline using 100k hours of Russian speech data, leveraging Wav2Vec2 and speechbrain/spkrec-ecapa-voxceleb for embedding extraction, combined with non-standard clustering approaches.
lgpearson1771
Train custom wake word models with openWakeWord. A granular 13-step pipeline with compatibility patches for torchaudio 2.10+, Piper TTS, and SpeechBrain. Generates tiny ONNX models (~200 KB) for real-time keyword detection, like building your own "Hey Siri" trigger. WSL2/Linux + CUDA required.
Audio source separation with a Whisper/ECAPA-TDNN speaker counter and the pre-trained speechbrain/sepformer-libri3mix and speechbrain/sepformer-wsj02mix models for speech separation, implemented with SpeechBrain.