Found 7,594 repositories(showing 30)
pathwaycom
Python ETL framework for stream processing, real-time analytics, LLM pipelines, and RAG.
BlinkDL
RWKV (pronounced RwaKuv) is an RNN with great LLM performance, which can also be directly trained like a GPT transformer (parallelizable). We are at RWKV-7 "Goose". So it's combining the best of RNN and transformer - great performance, linear time, constant space (no kv-cache), fast training, infinite ctx_len, and free sentence embedding.
fixie-ai
A fast multimodal LLM for real-time voice
QwenLM
Qwen3-omni is a natively end-to-end, omni-modal LLM developed by the Qwen team at Alibaba Cloud, capable of understanding text, audio, images, and video, as well as generating speech in real time.
iusztinpaul
🦖 𝗟𝗲𝗮𝗿𝗻 about 𝗟𝗟𝗠𝘀, 𝗟𝗟𝗠𝗢𝗽𝘀, and 𝘃𝗲𝗰𝘁𝗼𝗿 𝗗𝗕𝘀 for free by designing, training, and deploying a real-time financial advisor LLM system ~ 𝘴𝘰𝘶𝘳𝘤𝘦 𝘤𝘰𝘥𝘦 + 𝘷𝘪𝘥𝘦𝘰 & 𝘳𝘦𝘢𝘥𝘪𝘯𝘨 𝘮𝘢𝘵𝘦𝘳𝘪𝘢𝘭𝘴
KimMeen
[ICLR 2024] Official implementation of " 🦙 Time-LLM: Time Series Forecasting by Reprogramming Large Language Models"
haykgrigo3
A LLM trained only on data from certain time periods to reduce modern bias
MicrosoftDocs
Official Microsoft Learn MCP Server and CLI tool – powering LLMs and AI agents with real-time, trusted Microsoft docs & code samples.
Capsize-Games
Offline inference engine for art, real-time voice conversations, LLM powered chatbots and automated workflows
vava-nessa
Find, benchmark and install in CLI 200+ FREE coding LLM models across 20+ providers in real time
qingsongedu
A professional list on Large (Language) Models and Foundation Models (LLM, LM, FM) for Time Series, Spatiotemporal, and Event Data.
SakanaAI
A Self-adaptation Framework🐙 that adapts LLMs for unseen tasks in real-time!
Scale3-Labs
Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorDBs and more.. Integrate using Typescript, Python. 🚀💻📊
EmbeddedLLM
The collaborative spreadsheet for AI. Chain cells into powerful pipelines, experiment with prompts and models, and evaluate LLM responses in real-time. Work together seamlessly to build and iterate on AI applications.
codefuse-ai
A LLM semantic caching system aiming to enhance user experience by reducing response time via cached query-result pairs.
hollobit
ChatGPT, GenerativeAI and LLMs Timeline
bgauryy
MCP server for semantic code research and context generation on real-time using LLM patterns | Search naturally across public & private repos based on your permissions | Transform any accessible codebase/s into AI-optimized knowledge on simple and complex flows | Find real implementations and live docs from anywhere
jmuncor
Intercept LLM API traffic and visualize token usage in a real-time terminal dashboard. Track costs, debug prompts, and monitor context window usage across your AI development sessions.
qixucen
[NeurIPS 2025] Atom of Thoughts for Markov LLM Test-Time Scaling
apirrone
Memento is a Python app that records everything you do on your computer and lets you go back in time, search, and chat with a LLM (Large Language Model) to find back information about what you did.
nashsu
LLM Wiki is a cross-platform desktop application that turns your documents into an organized, interlinked knowledge base — automatically. Instead of traditional RAG (retrieve-and-answer from scratch every time), the LLM incrementally builds and maintains a persistent wiki from your sources。
jofizcd
🌌 Give a soul to your digital waifu. Soul of Waifu is an immersive desktop roleplay & AI companion engine with Live2D/VRM avatars, real-time voice chat, and local LLM support. Watch your characters come to life.
SakanaAI
A Tree Search Library with Flexible API for LLM Inference-Time Scaling
xiyuanzh
tracking papers, datasets, and models of "large language model (LLM) for time series"
riccardomusmeci
Large Language Models (LLMs) applications and tools running on Apple Silicon in real-time with Apple MLX.
winstonkoh87
The Linux OS for AI Agents — Persistent memory, autonomy, and time-awareness for any LLM. Own the state. Rent the intelligence.
guidewire-oss
Unified test intelligence platform with multi-format ingestion, real-time analytics, and AI-powered insights via LLM integration
TimeCopilot
TimeCopilot: the GenAI Forecasting Agent. Built on LLMs and Time Series Foundation Models, it lets you forecast, cross-validate, and detect anomalies using multiple foundation models through a single API. From finance and energy to web analytics, TimeCopilot turns natural-language queries into production-ready forecasts.
liyucheng09
Compress your input to ChatGPT or other LLMs, to let them process 2x more content and save 40% memory and GPU time.
backnotprop
Context management for long-context LLMs, agents, and vibe coding. Instantly build context for an entire repo, selected files, folders, and GitHub issues to generate structured AI-XML context with real-time token counting.