OpenCompass is an LLM evaluation platform, supporting a wide range of models (Llama3, Mistral, InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Stars
6.8k
Forks
756
Watchers
6.8k
Open Issues
443
Overall repository health assessment
No package.json found
This might not be a Node.js project
110
commits
82
commits
82
commits
81
commits
60
commits
56
commits
50
commits
46
commits
23
commits
17
commits
[Update] Add repeat config for HMMT2025 and UGD_hard (#2425)
fcbb26eView on GitHub[Refactor] Support concurrent inference acorss tasks. (#2403)
3cdd4c2View on GitHub[Update] Add support for RawPromptTemplate in OpenAISDKStreaming and ChatMLDatasets (#2414)
1799dcfView on GitHub[Fix] Fix the tag match in generic_llmjudge_postprocess (#2417)
3d68e28View on GitHub[Feautre] Add RawPromptTemplate and new dataset configs (#2407)
e39144fView on GitHub