Search Results

Found 99 repositories (showing 30)

beyondllm

aiplanethub

🧡51

Build, evaluate and observe LLM apps

292

Apache-2.0

Jupyter Notebook

Updated 1 month ago

ai, artificial-intelligence, embeddings, +10

Rope_with_LLM

[ICML'25] Our study systematically investigates massive values in LLMs' attention mechanisms. First, we observe massive values are concentrated in low-frequency dimensions across different attention heads exclusively in attention queries (Q) and keys (K) while absent in values (V).

Python

Updated 1 month ago

invariant-gateway

invariantlabs-ai

🧡60

LLM proxy to observe and debug what your AI agents are doing.

Apache-2.0

Python

Updated 1 week ago

ai-agents, debugging, guardrails, +3

paraview_mcp

llnl

🧡65

ParaView-MCP integrates multimodal LLMs with ParaView via Model Context Protocol, enabling natural language control of scientific visualizations. The agent observes the viewport for visual feedback, making complex visualization tools accessible to all users while providing intelligent automation for experts.

BSD-3-Clause

Python

Updated 6 days ago

mcp-mesh

dhyansraj

🧡60

MIT

Python

Updated 1 day ago

agentic-ai, ai-agents, ai-agents-framework, +8

cdpilot

mehmetnadir

🧡55

Zero-dependency browser automation CLI. 70+ commands, 10 test assertions, smart commands (click/fill by text — no LLM needed). MCP server for AI agents with 500x fewer tokens. Extract, observe, script runner. 50KB, pure CDP.

MIT

Python

Updated 2 days ago

ai-agent, assertions, automation, +17

sim-cli

svd-ai-lab

🧡55

sim — a CLI runtime that lets LLM agents launch, drive, and observe CAD/CAE simulators through one protocol

Apache-2.0

Python

Updated 9 hours ago

agent-runtime, ai-agents, ansys-fluent, +12

Would-You-Kindly

user1342

🧡50

A security testing tool designed to evaluate the effectiveness of large language models (LLMs) in protecting secrets and preventing security breaches. With customisable LLM options, the tool allows you to simulate attacks on LLMs using various techniques and observe their defence capabilities.

GPL-3.0

Python

Updated 2 months ago

llm-observe-hub

ra189zor

❤️30

Real-time observability and analytics platform for local LLMs, with dashboard and API.

MIT

HTML

Updated 4 months ago

llm_quest_benchmark

yourconscience

❤️45

Observe and analyze LLM agents' decision-making through Space Rangers text adventures! 👾🚀📊

MIT

Python

Updated 1 month ago

guardix

maltyxx

🧡60

An autonomous Web Application Firewall (WAF) that uses a Large Language Model (LLM) to learn and adapt its security rules automatically based on observed traffic.

MIT

Rust

Updated 4 weeks ago

http-proxy, llm, ollama, +1

tao-llm

solzilberman

❤️35

Minimal implementation of thought-act-observe design pattern for LLMs (gpt-3.5-turbo).

Python

Updated 1 year ago

LLM_Trend_Observer

TengJiao33

🧡65

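Automated AI leaderboard notifications. Pushes real-time leaderboard data from HF, OpenRouter, LMSYS, and Artificial Analysis every day, so you can keep up with LLM developments.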

Python

Updated 19 hours ago

llm-usage-monitoring

kyyasdev

❤️35

A small project captures everything our LLM traffic touches: FastAPI intercepted each prompt, Postgres archived the full exchange, and the React dashboard replayed token counts like telemetry. It wasn’t just a proxy—it was proof we could observe any model in real time, down to the user label and individual completion.

Python

Updated 4 months ago

machiave-llm

JoNeedsSleep

❤️45

LLMs play Diplomacy testing out their Machiavellian prowess, and we get to observe them.

Python

Updated 1 month ago

loop-llm

azank1

🧡50

MCP server that observes every prompt, scores quality in real time, and closes the loop with iterative refinement. Built on FastMCP, SQLite, and Bayesian priors — no extra LLM required.

MIT

Python

Updated 1 month ago

LLM_OBSERVE

sfc-gh-sdickson

❤️35

Testing Tool for LLM Observability

Python

Updated 6 months ago

LLM-Quality-Observer

dongkoony

❤️45

Production-ready MLOps platform for monitoring and evaluating LLM response quality with automated alerts and real-time analytics

TypeScript

Updated 1 month ago

ai-ops, analytics, docker, +8

wheel

kunish

🧡55

Wheel. LLM API Gateway — Aggregate, Balance, Observe.

MIT

Updated 6 days ago

claude-code-helicone

kernel-systems

🧡65

Observe the Claude Code agent's LLM calls

Python

Updated 1 day ago

llm-guardrails

logsv

🧡50

An open-source platform to govern, evaluate, observe, and control LLMs in production.

MIT

JavaScript

Updated 2 months ago

llmarena

kevinsze1996

❤️35

An interface that lets people choose different LLMs and observe them talking to each other

Python

Updated 5 months ago

binance_project

PoiName1923

❤️45

Streaming pipeline to observe trades on Binance and build an LLM and a dashboard based on the data.

Python

Updated 1 month ago

V.I.G.I.L

cruz209

❤️45

VIGIL: A reflective runtime for LLM agents that observes behavior, appraises failures, and proposes its own fixes (even to itself)

Python

Updated 2 months ago

build-your-own-coding-agent

vijayashankar-g

🧡55

A minimal AI coding agent that observes terminal output, thinks using an LLM, fixes broken code with tools, and loops until the program runs successfully.

Python

Updated 3 weeks ago

ambientghost

spyrae

🧡60

Local AI agent framework for macOS. Observes work patterns via native APIs, analyzes with local LLM (Ollama), stores everything locally. Tauri 2.0 + Swift + React + SQLite.

MIT

Rust

Updated 2 weeks ago

local-ai, macos, menu-bar-app, +7

PropInsight

inchara23

❤️35

PropInsight is an AI-powered property inspection report generator that utilizes LLM models to analyze property types and observed issues, generating comprehensive and data-driven reports for smarter decision-making.

Python

Updated 5 months ago

fastapi, llm, natural-language-processing-nlp, +1

webdoc

Soham041201

💛70

A Claude-Code–style CLI built with Bun, TypeScript, React Ink, and Playwright that observes UI interactions and network calls, correlates them using LLM + Vision, and generates safe, structured API documentation.

MIT

TypeScript

Updated 5 days ago

rl-explorations

Spartan-71

🧡60

A learning-in-public repo. I'm going from zero RL knowledge → fine-tuning LLMs with reinforcement learning. Every folder is a phase. Every experiment has notes on what I tried and what I observed.

MIT

Updated 3 weeks ago

Screen-Mate-A-Context-Aware-Intelligent-Screen-Assistant

Pranav0402

❤️35

Screen-Mate is an AI-powered desktop assistant that observes your screen, understands context, and provides real-time, proactive help using OCR, YOLO, and LLMs — offering smart suggestions and debugging support through a minimal floating overlay.

Updated 4 months ago
