Found 4 repositories (showing 4)
centminmod
No description available
dickman419
🔍 Evaluate Claude Haiku 4.5's code analysis against 27 LLM models to enhance understanding, documentation, and troubleshooting of your nftables wrapper.
Systematic benchmark comparing Claude Haiku 4.5 vs MiniMax M2.1 on agentic coding tasks. Includes full audit trails, LLM-as-judge evaluation, and path divergence analysis.
reginapichler
For the seminar "Natural Language Processing Meets Computational Social Science" at LMU Munich, this project investigated fact checking with LLMs. This repository includes the code, data, and results (plots and reports) of the evaluation of political statements from the 2024 U.S. presidential debate with Claude 3.5 Haiku and GPT-4o.