Back to search
ToolQA, a new dataset to evaluate the capabilities of LLMs in answering challenging questions with external tools. It offers two levels (easy/hard) across eight real-life scenarios.
Stars
285
Forks
15
Watchers
285
Open Issues
5
Overall repository health assessment
No package.json found
This might not be a Node.js project
21
commits
2
commits