Found 3 repositories(showing 3)
princeton-nlp
[ICLR 2024] Evaluating Large Language Models at Evaluating Instruction Following
project2429
No description available
CarenceLiu
NLP Project, tested on PandaLM testset, Auto-J testset, MT-bench and LLMBar benchmark dataset
All 3 repositories loaded