A trajectory-level benchmark for detecting prompt injection, policy erosion, intent drift, and coordinated misuse in agentic LLM systems.
Stars
1
Forks
1
Watchers
1
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
21
commits
Add formal boundary statement for release authority
b40f25eView on GitHubAdd benchmark protocol and leaderboard infrastructure
bf9d3f4View on GitHubAdd benchmark versioning, eval protocol, blind spots, and scale report
683e326View on GitHubAdd professional demo walkthrough and boundary disclaimers
d60bc8cView on GitHubUpgrade completeness and limitations to research-grade
107b68bView on GitHubAdd completeness and limitations section, fix contact email
965ac8aView on GitHubAdd ceiling analysis, distribution shift protocol, and lifecycle policy
38b0237View on GitHubAdd adaptive attacker with bandit + genetic mutation hybrid
1cf401bView on GitHubAdd reproducibility, negative results, and advanced documentation
5b3b3b2View on GitHubAdd production-facing documentation addressing interviewer scrutiny points
92257aeView on GitHub