Fast and intuitive evals on any LLM
Stars
19
Forks
0
Watchers
Open Issues
21
Overall repository health assessment
^8.1.0
^30.2.0
^30.0.0
^4.0.9
^11.1.0
^20.10.0
^14.0.10
^29.4.5
^5.3.3
User
83
commits
Update README to reflect agent evaluation focus
8185579
Add Claude Code Skills for CLI and Eval Writing (#31)
280c37b
docs: reorganize README and create docs folder (#24)
4dc786f
feat: add @vibecheck/runner package for programmatic API (#23)
fe6ca1a
Merge pull request #22 from hev/fix-tests
f7a591d
fix ci
8d8bc27
reorg
073f41a
39864f6
test exit
6e7967a
fix tests
6f18ed0
issues
ca3f17c
Merge pull request #11 from hev/remove-interactive
6695e01
publsh script
f2f5453
publish script update
0cefab1
remove interactive
e5ed040