Compare agents pairwise via multi‑round evaluations for SE tasks.
Stars
7
Forks
0
Watchers
Open Issues
Overall repository health assessment
^1.x
^21.x
^40.x
^2.x
^3.x
^16.x
^5.x
^4.x
User
44
commits
add
5760454
c3f1086
f84a816
85ae2d7
9318990
9c174e2
960ca92
44283b2
3f7b762
12ca0e9
1260246
5b7a58a
4a25776
e158963
fd89470