Sharing both practical insights and theoretical knowledge about LLM evaluation that we gathered while managing the Open LLM Leaderboard and designing lighteval!
Stars
2.1k
Forks
122
Watchers
2.1k
Open Issues
5
Overall repository health assessment
No package.json found
This might not be a Node.js project
61
commits
9
commits
2
commits
2
commits
1
commits
1
commits
1
commits
1
commits
1
commits
1
commits
Update README with community translations and yearly dives
2060dfcView on GitHubupdate: translation of automated-benchmarks/some-evaluation-datasets.md (#33)
ff92847View on GitHub