Automated system for LLM evaluation via agents.
Stars: 48
Forks: 7
Watchers
Open Issues: 2
55 commits
bf09eaf  Add api model evaluation options. (#31)
a0cd8f2  add agent alpha logo
2f769e6  update qrcode
ce0516f  fix bug in readme
2649bcc
da3e8ec
15fca72
a6acfe8  add slogan in readme
1fab142  demo for en&zh version
d2e9d15  fix bug in eval.tsx
16f9c67  fix the front for report radar, bench scores and visualization (#28)
bc760e8  Feat/reclassify bench categories (#26)
876db8b  fix change eval type bug
a9d9672  fix readme py version
c4bbc57  fix readme license logo