Search Results

Found 4,965 repositories(showing 30)

mlx-audio

Blaizzy

💛77

A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.

6.6k

540

MIT

Python

Updated 5 hours ago

apple-siliconaudio-processingmlx+6

STT

coqui-ai

💛76

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

2.6k

301

MPL-2.0

C++

Updated 2 days ago

asrautomatic-speech-recognitiondeep-learning+7

WhisperJAV

meizhong986

💛73

ASR/STT subtitle generator. Uses Qwen3-ASR, local LLM, Whisper, TEN-VAD. Noise-robust for JAV

1.4k

124

MIT

Python

Updated 2 hours ago

aitranslatehallucinationjapanese+10

review_object_detection_metrics

rafaelpadilla

💛73

Object Detection Metrics. 14 object detection metrics: mean Average Precision (mAP), Average Recall (AR), Spatio-Temporal Tube Average Precision (STT-AP). This project supports different bounding box formats as in COCO, PASCAL, Imagenet, etc.

1.2k

228

NOASSERTION

Python

Updated 1 day ago

average-precisionbounding-boxescoco-api+8

mlx-tune

ARahim3

🧡67

Fine-tune LLMs on your Mac with Apple Silicon. SFT, DPO, GRPO, Vision, TTS, STT, Embedding, and OCR fine-tuning — natively on MLX. Unsloth-compatible API.

1.0k

Apache-2.0

Python

Updated 5 hours ago

apple-silicondeep-learninghuggingface+17

Complete-Ecommerce-in-laravel-10

Prajwal100

🧡67

Complete E-commerce Website in Laravel 10 - Full-featured eCommerce solution with modern UI, admin panel, PayPal integration, and powered by NepVox AI (TTS, STT, TTI)

1.0k

563

MIT

Blade

Updated 1 week ago

advance-ecommerce-projecte-commerceecommerce+8

open_stt

snakers4

💛72

Open STT

820

NOASSERTION

Python

Updated 21 hours ago

asrautomatic-speech-recognitiondataset+3

lobe-tts

lobehub

💛72

🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser

781

MIT

TypeScript

Updated 1 day ago

auzrebunedge+10

TTS-Voice-Wizard

VRCWizard

💛72

Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)

778

MIT

Updated 1 day ago

chatboxdiscordfree+11

voice-ai

rapidaai

💛73

Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.

713

179

NOASSERTION

Updated 13 minutes ago

agent-frameworkai-voiceai-voice-agent+17

sonus

evancohen

🧡51

:speech_balloon: /so.nus/ STT (speech to text) for Node with offline hotword detection

635

MIT

JavaScript

Updated 2 weeks ago

alexahotword-detectionkeyword-spotting+7

stts

inket

🧡61

A simple macOS app for monitoring the status of cloud services

560

MIT

Swift

Updated 1 week ago

appcloudmacos+2

Starmoon

StarmoonAI

💛71

A conversational, AI device + software framework for companionship, entertainment, education, healthcare, IoT applications, and DIY robotics. Built with Python, NextJS, Arduino, ESP32, LLMs (GPT-4o), Deepgram STT and Azure TTS 🤖

543

GPL-3.0

TypeScript

Updated 3 days ago

esp32gptiot+6

Awesome-Korean-Speech-Recognition

rtzr

🧡66

한국어 음성인식 STT API 리스트. 각 성능 벤치마크.

501

CC0-1.0

Updated 4 hours ago

awesomekoreanspeech-recognition+3

willow-inference-server

toverainc

🧡56

Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS

499

Apache-2.0

Python

Updated 1 week ago

cudadeep-learningllama+9

local-voice-ai

ShayneP

💛72

Local voice AI powered by Ollama, Kokoro, Nemotron STT, and LiveKit.

473

145

MIT

TypeScript

Updated 1 day ago

RealtimeSTT_LLM_TTS

Ikaros-521

🧡66

实时STT，连接OpenAI接口/智谱AI（流式LLM）和GPT-SOVITS/Edge-TTS，通过网页的方式，进行跨网络的服务调用，实现实时对话的效果

436

MIT

Python

Updated 6 days ago

llmpythonstt+1

LangHelper

NsLearning

🧡61

Striving to create a great Application with full functions of learning languages by ChatGPT, TTS, STT and other awesome AI models, supports talking, speaking assessment, memorizing words with contexts, Listening test, so on.

348

MIT

Rust

Updated 2 weeks ago

aiasrassessment+10

ToolNeuron

Siddhesh2377

🧡66

On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.

334

Apache-2.0

Kotlin

Updated 12 hours ago

ai-personasandroidgguf-models+13

ChatGPT-OpenAI-Smart-Speaker

Olney1

💛71

This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.

311

MIT

Python

Updated 15 hours ago

agentsaiartificial-intelligence+14

twelvet

twelvet-projects

💛71

（Spring Boot 3. X Microservices framework）基于Spring Boot 3.X 的 Spring Cloud Alibaba / Spring Cloud Tencent + React的微服务框架。🔝 🔝 点个starrred 关注更新。Chat GPT(RAG、TTS、STT、LLM)

258

Apache-2.0

Java

Updated 1 day ago

javajava17jdk17+9

interview-helper

JasonJarvan

🧡66

开源的AI面试助手，使用OpenAI Whipser模型进行STT（Speak to Text 语音转文字）转录，然后将问题交给ChatGPT回答。

204

Python

Updated 13 hours ago

whisper-live-transcription

gaborvecsei

🧡66

Live-Transcription (STT) with Whisper PoC

200

Python

Updated 1 day ago

aiapplied-machine-learninggradio+5

personal-ai-starter-pack

disler

🧡56

Fast STT, LLM, and TTS for personal AI assistants using OpenAI, Groq, AssemblyAI and ElevenLabs.

195

Python

Updated 3 weeks ago

gst-deepspeech

Elleo

🧡65

NOTE: This plugin is now deprecated in favour of the coqui-stt branch in gst-plugins-bad: https://gitlab.freedesktop.org/philn/gstreamer/-/tree/coqui-stt/subprojects/gst-plugins-bad/ext/coqui

170

NOASSERTION

C++

Updated 2 days ago

webai-example-realtime-voice-chat

proj-airi

🧡65

🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one file!

165

MIT

TypeScript

Updated 1 day ago

chatgpt-voiceproj-airiproject-airi+2

STT-models

coqui-ai

🧡51

Open models for Coqui STT

154

Updated 1 week ago

deep-learningmodelsspeech-to-text

selfservicekiosk-audio-streaming

dialogflow

❤️26

A best practice for streaming audio from a browser microphone to Dialogflow or Google Cloud STT by using websockets.

146

Apache-2.0

JavaScript

Updated 10 months ago

ZZZ-RETIRED__openstt

MycroftAI

❤️35

RETIRED - OpenSTT is now retired. If you would like more information on Mycroft AI's open source STT projects, please visit:

140

Updated 5 months ago

nlpnlp-machine-learningnlu+6

STT-examples

coqui-ai

🧡66

🐸STT integration examples

132

MPL-2.0

Python

Updated 6 days ago

GitHub Explorer

Search Results

mlx-audio

STT

WhisperJAV

review_object_detection_metrics

mlx-tune

Complete-Ecommerce-in-laravel-10

open_stt

lobe-tts

TTS-Voice-Wizard

voice-ai

sonus

stts

Starmoon

Awesome-Korean-Speech-Recognition

willow-inference-server

local-voice-ai

RealtimeSTT_LLM_TTS

LangHelper

ToolNeuron

ChatGPT-OpenAI-Smart-Speaker

twelvet

interview-helper

whisper-live-transcription

personal-ai-starter-pack

gst-deepspeech

webai-example-realtime-voice-chat

STT-models

selfservicekiosk-audio-streaming

ZZZ-RETIRED__openstt

STT-examples

mlx-audio

STT

WhisperJAV

review_object_detection_metrics

mlx-tune

Complete-Ecommerce-in-laravel-10

open_stt

lobe-tts

TTS-Voice-Wizard

voice-ai

sonus

stts

Starmoon

Awesome-Korean-Speech-Recognition

willow-inference-server

local-voice-ai

RealtimeSTT_LLM_TTS

LangHelper

ToolNeuron

ChatGPT-OpenAI-Smart-Speaker

twelvet

interview-helper

whisper-live-transcription

personal-ai-starter-pack

gst-deepspeech

webai-example-realtime-voice-chat

STT-models

selfservicekiosk-audio-streaming

ZZZ-RETIRED__openstt

STT-examples