A benchmark for LLMs on complicated tasks in the terminal
Stars
1.9k
Forks
500
Watchers
1.9k
Open Issues
292
Overall repository health assessment
No package.json found
This might not be a Node.js project
214
commits
153
commits
60
commits
55
commits
55
commits
19
commits
17
commits
16
commits
15
commits
12
commits
Update goose_agent.py to enable built in todo extension (#1344)
7623dbfView on GitHubUpdate goose_agent.py to explicitly include needed developer extension (#1341)
d877022View on GitHubPort enhancements and fixes from terminal-bench-1.5 (#1322)
bdd1201View on GitHubIncrease max agent timeout from 360 to 750 seconds (#1321)
34cd20fView on GitHub