Search Results

Found 10 repositories(showing 10)

Eval-ai-library

meshkovQA

💛70

Comprehensive AI Model Evaluation Framework with advanced techniques including Temperature-Controlled Verdict Aggregation via Generalized Power Mean. Support for multiple LLM providers and 15+ evaluation metrics for RAG systems and AI agents.

Apache-2.0

Python

Updated 2 days ago

ai-evaluationai-evaluation-frameworkai-evaluation-metrics+3

eval-ai-library

firstlinesoftware

🧡50

Comprehensive AI Evaluation Framework with advanced techniques including Temperature-Controlled Verdict Aggregation via Generalized Power Mean. Support for multiple LLM providers and 15+ evaluation metrics for RAG systems and AI agents.

Apache-2.0

Python

Updated 1 month ago

ai-evaluationai-evaluation-frameworkai-evaluation-metrics+3

ai-agent-eval-scenario-library

microsoft

❤️35

No description available

MIT

Updated 1 week ago

eval-guide

microsoft

💛70

A plugin for AI agent evaluation. Plan evals, generate test cases, interpret results for Copilot Studio agents. Grounded in Microsoft's Eval Scenario Library & Triage Playbook.

MIT

HTML

Updated 1 day ago

EveAI

Dmunch04

❤️40

A Python library for interacting, and creating your own AI, with Eve

MIT

Updated 6 years ago

eval-ai-test

AirVetra

🧡55

Automated LLM testing pipeline for LM Studio using Eval AI Library. Features dynamic model loading/unloading, interactive CLI, multiple metrics (RAG, Security, Deterministic), and integrated web dashboard.

Python

Updated 2 weeks ago

GTeam

gcampton

🧡65

AI professional firm for Claude Code — 29 eval-tested specialists (lawyers, accountants, designers, SEO, copywriters, and more) with real methodologies and reference libraries. Lightweight coordinator loads skills on demand.

Go Template

Updated 3 days ago

docx-to-json-test-case-converter

arjunghosh

💛70

A Python CLI tool and a library to convert word docx file into .JSON and .JSONL (e.g.: For AI Foundry Eval upload) file

MIT

Python

Updated 2 days ago

ai-builder-kit

cylijinpeng

🧡60

A practical builder library for AI agents: prompts, skills, MCP, frameworks, RAG, evals, and starter packs.

MIT

JavaScript

Updated 1 week ago

agent-workflow-studio

trehansalil

🧡55

AI Agent Workflow Studio — paste a business process, auto-generate prompts/tool schemas/evals, red-team for prompt injection, data leakage & tool misuse, with pass/fail traces, attack libraries, regression tests, and a live hardening checklist.

Updated 1 week ago

All 10 repositories loaded

GitHub Explorer

Search Results

Eval-ai-library

eval-ai-library

ai-agent-eval-scenario-library

eval-guide

EveAI

eval-ai-test

GTeam

docx-to-json-test-case-converter

ai-builder-kit

agent-workflow-studio

Eval-ai-library

eval-ai-library

ai-agent-eval-scenario-library

eval-guide

EveAI

eval-ai-test

GTeam

docx-to-json-test-case-converter

ai-builder-kit

agent-workflow-studio