Found 3,170 repositories (showing 30)
Blaizzy
A text-to-speech (TTS), speech-to-text (STT) and speech-to-speech (STS) library built on Apple's MLX framework, providing efficient speech analysis on Apple Silicon.
Prajwal100
Complete E-commerce Website in Laravel 10 - Full-featured eCommerce solution with modern UI, admin panel, PayPal integration, and powered by NepVox AI (TTS, STT, TTI)
lobehub
🎤 Lobe TTS - A high-quality & reliable TTS/STT library for Server and Browser
VRCWizard
Speech to Text to Speech. Song now playing. Sends text as OSC messages to VRChat to display on avatar. (STTTS) (Speech to TTS) (VRC STT System) (VTuber TTS)
rapidaai
Rapida is an open-source, end-to-end voice AI orchestration platform for building real-time conversational voice agents with audio streaming, STT, TTS, VAD, multi-channel integration, agent state management, and observability.
StarmoonAI
A conversational, AI device + software framework for companionship, entertainment, education, healthcare, IoT applications, and DIY robotics. Built with Python, NextJS, Arduino, ESP32, LLMs (GPT-4o), Deepgram STT and Azure TTS 🤖
toverainc
Open source, local, and self-hosted highly optimized language inference server supporting ASR/STT, TTS, and LLM across WebRTC, REST, and WS
Ikaros-521
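Real-time STT connected to the OpenAI API or Zhipu AI (streaming LLM) and to GPT-SOVITS/Edge-TTS, making cross-network service calls through a web page to deliver real-time conversation.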
NsLearning
An effort to build a full-featured language-learning application powered by ChatGPT, TTS, STT, and other AI models. Supports conversation, speaking assessment, memorizing words in context, listening tests, and so on.
Siddhesh2377
On-device AI for Android — LLM chat (GGUF/llama.cpp), vision models (VLM), image generation (Stable Diffusion), tool calling, AI personas, RAG knowledge packs, TTS/STT. Fully offline, zero subscriptions, open-source.
This AI Smart Speaker uses speech recognition, TTS (text-to-speech), and STT (speech-to-text) to enable voice and vision-driven conversations, with additional web search capabilities via OpenAI and Langchain agents.
twelvet-projects
A microservices framework based on Spring Boot 3.X with Spring Cloud Alibaba / Spring Cloud Tencent + React. 🔝 🔝 Star the repo to follow updates. ChatGPT (RAG, TTS, STT, LLM)
disler
Fast STT, LLM, and TTS for personal AI assistants using OpenAI, Groq, AssemblyAI and ElevenLabs.
proj-airi
🎤💬 Full example of implementing ChatGPT's realtime voice from scratch with VAD + STT + LLM + TTS technology stack within almost one file!
hiteshsahu
One-line solution for the Android Text to Speech (TTS) & Speech to Text (STT) translation problem
DePasqualeOrg
Swift tools for text to speech (TTS) and speech to text (STT) powered by MLX
akazwz
Full-stack AI chat platform built on Cloudflare using Workers, Durable Objects, KV, and AI Gateway. Features AI chat, Text-to-Speech (TTS), and Speech-to-Text (STT).
tylike
OpenAI ChatGPT or local LLM (llama.cpp GGUF format) + TTS + STT + Word + Excel
Purple-Horizons
🦞 Open-source browser-based voice chat for AI assistants. Self-hosted, private, free. Whisper STT + ElevenLabs TTS. Works with OpenAI, Claude, or custom agents.
karim23657
A collection of inspiring lists, repos, datasets, models, tools and more for Persian language speech to text (STT) and text to speech (TTS).
Nighthawk42
Whisper STT + Orpheus TTS + Gemma 3 using LM Studio to create a virtual assistant.
ramanujammv1988
On-device AI SDK for Flutter — LLM inference, vision, STT, TTS, image generation, embeddings, RAG, and function calling. Metal GPU on iOS/macOS.
OpenReplicant
AI 3D avatar voice interface in browser. VAD -> STT -> LLM -> TTS -> VRM (Prototype/Proof-of-Concept)
tochilkinva
Telegram bot with voice message recognition and generation. Speech to Text and Text to Speech
kaloprojects
ESP32-based voice device for chatting with multiple custom AI bots. Records questions with an I2S microphone, transcribes them via ElevenLabs or Deepgram STT, and creates responses with a Groq or OpenAI LLM. TTS audio output with custom AI voices via I2S and speaker. Supports ongoing dialogues, calling bots 'by name', and real-time web search via keyword.
Azure-Samples
Build, test, and ship omnichannel voice agents on Azure—ACS telephony, custom STT→LLM→TTS pipeline, Voice Live API (voice-to-voice), and Foundry Agents.
rtk-ai
A universal AI toolkit for high-performance Speech-to-Text (STT) and Text-to-Speech (TTS) processing, designed for low-latency and easy model integration.
iamZhaoHang
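To achieve a true All in Local setup, I deployed the Llava vision model, the QWen2.5-VL multimodal model, and the STT and TTS models entirely on a local computer, building a fully offline robot visual interaction system. The robot perceives its surroundings through a camera, LLaVA and QWen2.5-VL perform visual analysis, STT handles speech recognition, and TTS handles speech output, with the whole process completed locally.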
Sharan-Kumar-R
Real-time AI ChatBot and voice-enabled AI VoiceBot using Deepgram (STT ↔ TTS) and Groq LLM for natural conversations.
mallahyari
A library for real-time Speech to Text (STT) and Text to Speech (TTS) capability