A simple, powerful LLM benchmarking tool built with PraisonAI Agents
Stars: 3
Forks: 2
Watchers: 3
Open Issues: 1
Commits: 26
87e8e53 Add comprehensive feature documentation for Test Suites, Evaluation, HTML Extraction, Cost Tracking, Reports, Plugins
d60324b feat: Add SVG generation benchmark with multi-metric evaluation
2b7888e feat: introduce plugin system for extensible evaluators
270779b feat: enhance reporting and output options for benchmark results
1b349c1 feat: add cost and token usage tracking for benchmark tests
28a6b23 docs: update evaluation system to v4.0 with hybrid scoring
1d8bd47 feat: add retry logic with exponential backoff for benchmark tests
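
The commit list mentions retry logic with exponential backoff for benchmark tests. The repository's own implementation is not shown on this page, so the following is only a minimal sketch of the general technique in Python. The names `backoff_delays` and `run_with_retry`, and the parameters `max_retries`, `base_delay`, `factor`, and `max_delay`, are hypothetical and chosen for illustration; they are not taken from the project.

```python
import time
from typing import Callable, List, TypeVar

T = TypeVar("T")


def backoff_delays(
    max_retries: int,
    base_delay: float = 1.0,
    factor: float = 2.0,
    max_delay: float = 30.0,
) -> List[float]:
    """Return the wait time before each retry: base_delay * factor**n, capped at max_delay."""
    return [min(base_delay * factor ** n, max_delay) for n in range(max_retries)]


def run_with_retry(
    test_fn: Callable[[], T],
    max_retries: int = 3,
    base_delay: float = 1.0,
    factor: float = 2.0,
    max_delay: float = 30.0,
    sleep: Callable[[float], None] = time.sleep,
) -> T:
    """Call test_fn, retrying on any exception with exponentially growing waits.

    Hypothetical helper: the first call plus up to max_retries retries.
    The last failure is re-raised once the retries are used up.
    """
    delays = backoff_delays(max_retries, base_delay, factor, max_delay)
    for attempt in range(max_retries + 1):
        try:
            return test_fn()
        except Exception:
            if attempt == max_retries:
                raise  # retries exhausted, surface the original error
            sleep(delays[attempt])
    raise RuntimeError("unreachable")  # keeps type checkers satisfied
```

The `sleep` parameter is injectable so that the wait schedule can be checked without actually pausing. A real benchmark runner would probably retry only on transient errors (rate limits, timeouts) rather than on every exception, and might add random jitter to the delays.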