Rank LLMs, RAG systems, and prompts using automated head-to-head evaluation
Stars: 108
Forks: 10
Watchers: 108
Open Issues: 4
554ed02 Update model recommendations to include recent releases (#105)
a3b7745 Attribute human votes to the user who provided them (#101)
470f368 Treat empty prompts and responses as empty strings (#100)
ae9adbf Enable WAL `journal_mode` with better write serialization (#99)
d330860 Handle time zones in timestamps retrieved from SQLite (#96)