Found 9 repositories(showing 9)
stephenleo
Benchmark various LLM Structured Output frameworks: Instructor, Mirascope, Langchain, LlamaIndex, Fructose, Marvin, Outlines, etc on tasks like multi-label classification, named entity recognition, synthetic data generation, etc.
cleanlab
A Structured Output Benchmark whose 'ground-truth' is actually right
Syndicats
No description available
Bae-ChangHyun
Benchmark tool for comparing LLM structured output frameworks (Instructor, LangChain, Marvin, PydanticAI, Mirascope, Guardrails)
matthieudelsart
No description available
bendechrai
Benchmark tool for testing LLM structured JSON response adherence across providers (OpenAI, Anthropic, Google, Groq, OpenRouter). Tests one-shot vs sequential prompting and strict vs non-strict modes with retry handling.
codeboratory
No description available
Structured Output envoluation with Gemini
LLM Structured Output Benchmark: JSON extraction from text via OpenRouter with Instructor + Zod
All 9 repositories loaded