Found 3,652 repositories(showing 30)
kvcache-ai
Mooncake is the serving platform for Kimi, a leading LLM service provided by Moonshot AI.
JetBrains
Koog is a JVM (Java and Kotlin) framework for building predictable, fault-tolerant and enterprise-ready AI agents across all platforms – from backend services to Android and iOS, JVM, and even in-browser environments. Koog is based on our AI products expertise and provides proven solutions for complex LLM and AI problems
deep-diver
LLM as a Chatbot Service
Kedreamix
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
av
One command brings a complete pre-wired LLM stack with hundreds of services to explore.
yym68686
This is a project that unifies the management of LLM APIs. It can call multiple backend services through a unified API interface, convert them to the OpenAI format uniformly, and support load balancing. Currently supported backend services include: OpenAI, Anthropic, DeepBricks, OpenRouter, Gemini, Vertex, etc.
jackdark425
AI Group is a powerful mobile intelligent assistant application that integrates multiple large language models (LLMs) and AI services, providing you with a convenient intelligent interaction experience.
KenyonY
🚀 大语言模型高效转发服务 · An efficient forwarding service designed for LLMs. · OpenAI API Reverse Proxy
transcriptionstream
turnkey self-hosted offline transcription and diarization service with llm summary
FudanDISC
[中文法律大模型] DISC-LawLLM: an intelligent legal system powered by large language models (LLMs) to provide a wide range of legal services.
FudanDISC
DISC-FinLLM,中文金融大语言模型(LLM),旨在为用户提供金融场景下专业、智能、全面的金融咨询服务。DISC-FinLLM, a Chinese financial large language model (LLM) designed to provide users with professional, intelligent, and comprehensive financial consulting services in financial scenarios.
DeMCP is the first Decentralized MCP network, offering SSE proxies for MCP services and mainstream LLMs, tackling trust and security with TEE and blockchain.
FudanDISC
Repository of DISC-MedLLM, it is a comprehensive solution that leverages Large Language Models (LLMs) to provide accurate and truthful medical response in end-to-end conversational healthcare services.
vietanhdev
The best free SaaS base for GenAI, LLM, and other AI Services. Ship your next Generative AI startups in days.
apocas
RESTai is an AIaaS (AI as a Service) open-source platform. Supports many public and local LLM suported by Ollama/vLLM/etc. Precise embeddings usage, tuning, analytics etc. Built-in image/audio generation with dynamic loading generators. Live chat deployment. Built-in block based graphical language. Prompt versioning and much more...
pseudo-r
The Public ESPN API documentation provides a complete list of undocumented API endpoints, URL parameters, and JSON response formats across 20+ sports (NFL, NBA, MLB, NHL). Includes live curl examples, routing guides, and a Django REST API service for live scores, standings, and news. Optimized for search, developers, AI, and LLMs.
tgoai
Open-source AI Agent Customer Service Platform. Build AI agent teams with LLM orchestration, RAG knowledge base, multi-channel support, and human collaboration.
thoughtbot
Filter sensitive information from free text before sending it to external services or APIs, such as chatbots and LLMs.
shroominic
👾📦 CodeBoxAPI is the simplest sandboxing infrastructure for your LLM Apps and Services.
deep-diver
[ACL'25] Official Code for LlamaDuo: LLMOps Pipeline for Seamless Migration from Service LLMs to Small-Scale Local LLMs
Lex-au
Speech-to-speech AI assistant with natural conversation flow, mid-speech interruption, vision capabilities and AI-initiated follow-ups. Features low-latency audio streaming, dynamic visual feedback, and works with local LLM/TTS services via OpenAI-compatible endpoints.
matlab-deep-learning
Connect MATLAB to LLM APIs, including OpenAI® Chat Completions, Azure® OpenAI Services, and Ollama™
567-labs
A collection of LLM services you can self host via docker or modal labs to support your applications development
containers
Examples for building and running LLM services and applications locally with Podman
sgl-project
A workload for deploying LLM inference services on Kubernetes
MigoXLab
A general-purpose API load testing platform that supports LLM services and business HTTP interfaces, enabling one-click performance testing, result comparison, and AI-powered intelligent analysis and summarization. 一站式通用 API 压测平台,支持大模型推理与业务 HTTP 接口,一键完成性能测试、结果对比与 AI 智能分析总结
jagan-shanmugam
An OpenStreetMap MCP server implementation that enhances LLM capabilities with location-based services and geospatial data.
NetEase-Media
Higher performance OpenAI LLM service than vLLM serve: A pure C++ high-performance OpenAI LLM service implemented with GPRS+TensorRT-LLM+Tokenizers.cpp, supporting chat and function call, AI agents, distributed multi-GPU inference, multimodal capabilities, and a Gradio chat interface.
Emerging-AI
A deployment, monitoring and autoscaling service towards serverless LLM serving.
abdullahtarek
An AI-powered chatbot for coffee shops, integrating LLMs, NLP, and a React Native app for enhanced customer service