Benchmarking the capabilities of LLM agents across the scientific research lifecycle: from replication to peer review and research design.
Stars
4
Forks
1
Watchers
4
Open Issues
1
Overall repository health assessment
No package.json found
This might not be a Node.js project
Merge branch 'main' of github.com:CenterForOpenScience/llm-benchmarking
12052a2View on GitHubMerge branch 'main' of https://github.com/CenterForOpenScience/llm-benchmarking
2e3e3a2View on GitHub