Aidan Bench attempts to measure <big_model_smell> in LLMs.
Stars
318
Forks
14
Watchers
318
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
97
commits
49
commits
19
commits
3
commits
2
commits
fixes to load env vars properly, bugged out before and got annoying
79ec81aView on GitHubAdd latest AI models: Claude 4 (Opus/Sonnet + thinking), OpenAI (o3/o3-pro/o4-mini), Gemini 2.0/2.5, Grok 3 thinking variants, DeepSeek R1-0528, new Mistral models
82ee00fView on GitHubfix import errors when API keys are missing, add clearer error messages
36fdd60View on GitHubtimeout and token limit research; empirical analysis of processing times and token usage patterns, timeout testing experiments, discovery that historical slow scenarios now complete quickly
ed92c9cView on GitHubtime limit experiment analysis and results; statistical experiment comparing model performance over time, a pilot thinking experiment with timing baselines, results dashboard and analysis docs, timeaware model configs and prompts
8cba33cView on GitHub