Search Results

Found 3 repositories(showing 3)

frai-benchmark

sebuzdugan

🧡50

FRAI Benchmark: Future Responsible AI Evaluation A consensus-based safety and compliance benchmark for SOTA LLMs (DeepSeek, Grok, GPT-5). Uses a "Panel of Experts" methodology where multiple frontier models judge response quality to ensure non-biased, high-fidelity safety scores

NOASSERTION

Python

Updated 2 months ago

fraig-bmc_benchmark

ChangyuanYU

❤️30

AIGER format benchmarks for Fraig-BMC

Updated 7 months ago

SmartReceipt-Benchmark

AmauryRoub

🧡55

Conception d’un pipeline Python (OCR & LLMs multimodaux) pour extraire les données de justificatifs de frais, incluant un outil de benchmark comparant la précision, la latence et les coûts d’API de chaque méthode.

Python

Updated 3 weeks ago

All 3 repositories loaded

GitHub Explorer

Search Results

frai-benchmark

fraig-bmc_benchmark

SmartReceipt-Benchmark

frai-benchmark

fraig-bmc_benchmark

SmartReceipt-Benchmark