Back to search
Comprehensive AI Evaluation Framework with advanced techniques including Temperature-Controlled Verdict Aggregation via Generalized Power Mean. Support for multiple LLM providers and 15+ evaluation metrics for RAG systems and AI agents.
Stars
11
Forks
0
Watchers
11
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
4
commits