Search Results

Found 4 repositories(showing 4)

claude-haiku-4.5-evaluation

centminmod

❤️25

No description available

Updated 5 months ago

anthropicchatgptclaude+17

claude-haiku-4.5-evaluation

dickman419

🧡65

🔍 Evaluate Claude Haiku 4.5's code analysis against 27 LLM models to enhance understanding, documentation, and troubleshooting of your nftables wrapper.

Updated 1 hour ago

anthropicchatgptclaude+14

Haiku-4.5-vs-Minimax-2.1

EmZod

❤️40

Systematic benchmark comparing Claude Haiku 4.5 vs MiniMax M2.1 on agentic coding tasks. Includes full audit trails, LLM-as-judge evaluation, and path divergence analysis.

Shell

Updated 2 months ago

agentic-aiai-agentsanthropic+7

Detecting-misinformation-in-political-statements

reginapichler

❤️35

For the seminar "Natural Language Processing Meets Computational Social Science" at LMU Munich, an investigation about fact checking with LLMs has been done. This repository includes the Code, data and results (plots and reports) of the evaluation of political statements of the 2024 presidential debate in the U.S. with Claude 3.5-Haiku and GPT 4.o

Python

Updated 1 year ago

All 4 repositories loaded

GitHub Explorer

Search Results

claude-haiku-4.5-evaluation

claude-haiku-4.5-evaluation

Haiku-4.5-vs-Minimax-2.1

Detecting-misinformation-in-political-statements

claude-haiku-4.5-evaluation

claude-haiku-4.5-evaluation

Haiku-4.5-vs-Minimax-2.1

Detecting-misinformation-in-political-statements