The first benchmark to evaluate LLMs' usability as programming-language interpreters
Stars
12
Forks
0
Watchers
Open Issues
Overall repository health assessment
No package.json found
This might not be a Node.js project
User
commits
Adding conda as a system requirement in README
3e37b8a
Clarifying the method to implement for BaseRunner
23aa137
Clarifying extending BaseRunner
2537fb0
Adding bash script for end-to-end run
1c7a337
Correcting typo
2ef5e61
Removing badge
cb2a95d
Adding HF link
a8d1c0e
Adding arXiv link
ea7b17f
Updating paper title in README
7f0d10c
Updating License
fb61ff0
Making ready for release
fbd0b72
Initial commit
6349a9d