Search Results

Found 42,342 repositories (showing 30)

whisper

openai

💚95

Robust Speech Recognition via Large-Scale Weak Supervision

97.2k

12.0k

MIT

Python

Updated 12 minutes ago

whisper.cpp

ggml-org

💚100

Port of OpenAI's Whisper model in C/C++

48.3k

5.4k

MIT

C++

Updated 7 minutes ago

inference, openai, speech-recognition, +3

faster-whisper

SYSTRAN

💚95

Faster Whisper transcription with CTranslate2

22.0k

1.8k

MIT

Python

Updated 1 hour ago

deep-learning, inference, openai, +5

whisperX

m-bain

💚95

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

21.1k

2.2k

BSD-2-Clause

Python

Updated 1 hour ago

asr, speech, speech-recognition, +2

buzz

chidiwilliams

💚94

Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.

18.5k

1.4k

MIT

Python

Updated 2 hours ago

whisper

FunASR

modelscope

💚100

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

15.5k

1.6k

MIT

Python

Updated 1 hour ago

audio-visual-speech-recognition, conformer, dfsmn, +12

voicebox

jamiepine

💚99

The open-source voice synthesis studio

14.5k

1.7k

MIT

TypeScript

Updated 2 hours ago

ai, cuda, mlx, +5

PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

12.6k

2.0k

Apache-2.0

Python

Updated 7 hours ago

asr, code-switch, conformer, +17

insanely-fast-whisper

Vaibhavs10

💛81

No description available

12.4k

908

Apache-2.0

Jupyter Notebook

Updated 7 minutes ago

meetily

Zackriya-Solutions

💚91

Privacy first, AI meeting assistant with 4x faster Parakeet/Whisper live transcription, speaker diarization, and Ollama summarization built on Rust. 100% local processing. no cloud required. Meetily (Meetly Ai - https://meetily.ai) is the #1 Self-hosted, Open-source Ai meeting note taker for macOS & Windows.

10.9k

1.0k

MIT

Rust

Updated 1 hour ago

ai, ai-meeting-assistant, llm, +16

go-openai

sashabaranov

💚96

OpenAI ChatGPT, GPT-5, GPT-Image-1, Whisper API clients for Go

10.6k

1.7k

Apache-2.0

Updated 12 hours ago

chatgpt, chatgpt-api, dall-e, +8

Whisper

Const-me

💚90

High-performance GPGPU inference of OpenAI's Whisper automatic speech recognition (ASR) model

10.3k

925

MPL-2.0

C++

Updated 2 hours ago

WhisperLiveKit

QuentinFuxa

💚90

Simultaneous speech-to-text models

10.0k

1.0k

Apache-2.0

Python

Updated 49 minutes ago

RTranslator

niedev

💛88

Open source real-time translation app for Android that runs locally

9.8k

873

Apache-2.0

C++

Updated 10 hours ago

android, android-app, bluetooth-le, +11

inference

xorbitsai

💛87

Swap GPT for any LLM by changing a single line of code. Xinference lets you run open-source, speech, and multimodal models on cloud, on-prem, or your laptop — all through one unified, production-ready inference API.

9.2k

814

Apache-2.0

Python

Updated 31 minutes ago

artificial-intelligence, chatglm, deployment, +17

voice-pro

abus-aikorea

💛79

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

6.6k

714

GPL-3.0

Python

Updated 6 minutes ago

audiobook, faster-whisper, gradio, +16

RealChar

Shaunwei

💛79

🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime (All in One Codebase!). Have a natural seamless conversation with AI everywhere (mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖

6.2k

780

MIT

JavaScript

Updated 1 day ago

MoneyPrinterPlus

ddean2009

💛87

Generate all kinds of short videos in bulk with one click using AI, automatically batch-remix short videos, and automatically publish them to Douyin, Kuaishou, Xiaohongshu, and WeChat Channels. Making money has never been this easy! Supports the local speech models chatTTS, fasterwhisper, and GPTSoVITS, and cloud speech services: Azure, Alibaba Cloud, Tencent Cloud. Supports direct AI image generation with Stable Diffusion and comfyUI.

6.0k

1.1k

GPL-3.0

Python

Updated 4 hours ago

WhisperKit

argmaxinc

💛81

On-device Speech Recognition for Apple Silicon

5.9k

539

MIT

Swift

Updated 2 hours ago

inference, ios, macos, +6

vibe

thewh1teagle

💛74

Transcribe on your own!

5.7k

369

MIT

TypeScript

Updated 5 hours ago

ai, cross-platform, desktop, +4

feishu-openai

ConnectAI-E

💛80

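🎒 Feishu × (GPT-4 + GPT-4V + DALL·E-3 + Whisper) = a work experience that flies 🚀 Voice conversation, role play, multi-topic discussion, image creation, spreadsheet analysis, document export 🚀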

5.6k

934

GPL-3.0

Updated 5 hours ago

chatgpt, chatgpt-api, chatgpt-bot, +5

whisper-diarization

MahmoudAshraf97

💛75

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

5.5k

500

BSD-2-Clause

Jupyter Notebook

Updated 1 hour ago

asr, speaker-diarization, speech, +3

wenet

wenet-e2e

💛87

Production First and Production Ready End-to-End Speech Recognition Toolkit

5.1k

1.2k

Apache-2.0

Python

Updated 1 day ago

asr, automatic-speech-recognition, conformer, +6

whisper-jax

sanchit-gandhi

💛79

JAX implementation of OpenAI's Whisper model for up to 70x speed-up on TPU.

4.7k

414

Apache-2.0

Jupyter Notebook

Updated 1 day ago

deep-learning, jax, speech-recognition, +2

WhisperSpeech

💛77

An Open Source text-to-speech system built by inverting Whisper.

4.6k

270

MIT

Jupyter Notebook

Updated 21 hours ago

pytorch, speech-synthesis, tts

cactus

cactus-compute

💛78

Low-latency AI engine for mobile devices & wearables

4.6k

344

NOASSERTION

Updated 3 hours ago

ai, android, arm, +17

cheetah

leetcode-mafia

💛72

Mac app for crushing tech interviews with AI

4.3k

304

CC0-1.0

Swift

Updated 2 days ago

ai, chatgpt, gpt, +6

distil-whisper

huggingface

💛78

Distilled variant of Whisper for speech recognition. 6x faster, 50% smaller, within 1% word error rate.

4.1k

350

MIT

Python

Updated 8 hours ago

audio, speech-recognition, whisper

WhisperLive

collabora

💛79

A nearly-live implementation of OpenAI's Whisper.

3.9k

541

MIT

Python

Updated 1 day ago

dictation, obs, openai, +9

embark

embarklabs

💛79

Framework for serverless Decentralized Applications using Ethereum, IPFS and other platforms

3.8k

486

MIT

JavaScript

Updated 1 day ago

blockchain, dapp, decentralized, +7
