Found 21 repositories(showing 21)
4kaws
Evalflow — Terminal UI for pulling Kaggle Community Benchmark output CSVs, merging them into SFT and preference-pair datasets, and publishing to Kaggle Datasets.
emartai
pytest for LLMs — catch prompt regressions before they reach production
jojojoy
No description available
SharwillKhisti
Virtual Lab Assistant
one-aalam
TypeScript-first LLM evaluation library built on Autoevals with Vitest integration
snpgps
No description available
karthikbadam
No description available
shiveshsood
Visual prompt eval workbench — dark canvas for evaluating LLM prompts against golden datasets with LLM-as-a-Judge scoring
zhuwenyu
评务通
mittalbacancy
No description available
XylonFu
EvalFlow is a plugin for EvalScope that simplifies model deployment and evaluation through a unified command-line interface. It handles both model serving and benchmark evaluation in one workflow.
SaiNihal2622
EvalFlow Pro - AI Evaluation & Data Quality Platform
saikiranpulagalla
No description available
PonmadhanD
No description available
dongwookim80
No description available
sap04
No description available
franekp
No description available
OnWikApp
No description available
dongwookim80
No description available
No description available
saikiranpulagalla
No description available
All 21 repositories loaded