[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.
Stars
191
Forks
11
Watchers
191
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
16
commits
2
commits
1
commits
1
commits
1
commits
fix(CLI): Fix #24 by only shows agreement results when `label` column is present (#26)
fa70a11View on GitHubbuild(release): disable attestations for publish workflow (#23)
8d3719aView on GitHubbuild(release): bump pypa/gh-action-pypi-publish version to v1.12.4 (#22)
a624f55View on GitHubfix(evaluator): Handle expection when input length is too long (#19)
4cce5d2View on GitHubfix(HFChat): Update HFChat for compatibility with transformers>=1.45.0
13b5821View on GitHubfix(tests): Update unit tests for compatibility with openai>=1.49.0 (#10)
13a10adView on GitHubfix(TextClassificationJailbreakEvaluator): Fix typo in text_classification_service (#7)
6f025b0View on GitHub