GitHub Explorer

by Alexey Ratnikov

GitHub Explorer

GitHub Explorer|TRENDING COMPARE|FEEDBACK

Back to search

yingchen-coding/agentic-misuse-benchmark - GitHub Explorer | GitHub Explorer | Trending | Compare

Back to search

agentic-misuse-benchmark

yingchen-coding•PUBLIC

View on GitHub

A trajectory-level benchmark for detecting prompt injection, policy erosion, intent drift, and coordinated misuse in agentic LLM systems.

Other

Created on Jan 31, 2026

Updated on Mar 19, 2026

Stars

Forks

Watchers

Open Issues

Repository Health Score

🧡

60/100

Fair

Overall repository health assessment

Score Breakdown

Activity

Regular updates - updated this month

20/30

67%

Recent Commits

Add formal boundary statement for release authority

yingchen-coding•2 months ago

b40f25eView on GitHub

Add benchmark protocol and leaderboard infrastructure

yingchen-coding•2 months ago

bf9d3f4View on GitHub

Add machine-readable scenario output schema

yingchen-coding•2 months ago

fa502d2View on GitHub

Add benchmark versioning, eval protocol, blind spots, and scale report

yingchen-coding•2 months ago

683e326View on GitHub

Add professional demo walkthrough and boundary disclaimers

yingchen-coding•2 months ago

d60bc8cView on GitHub

Upgrade completeness and limitations to research-grade

yingchen-coding•2 months ago

107b68bView on GitHub

Add completeness and limitations section, fix contact email

yingchen-coding•2 months ago

965ac8aView on GitHub

Add ceiling analysis, distribution shift protocol, and lifecycle policy

yingchen-coding•2 months ago

38b0237View on GitHub

Add editor config to gitignore

yingchen-coding•2 months ago

a02a915View on GitHub

Add adaptive attacker with bandit + genetic mutation hybrid

yingchen-coding•2 months ago

1cf401bView on GitHub

Add reproducibility, negative results, and advanced documentation

yingchen-coding•2 months ago

5b3b3b2View on GitHub

Add production-facing documentation addressing interviewer scrutiny points

yingchen-coding•2 months ago

92257aeView on GitHub

Add CC BY-NC 4.0 license (non-commercial)

yingchen-coding•2 months ago

58fcd2dView on GitHub

Add agentic-safety-incident-lab to portfolio

yingchen-coding•2 months ago

718279fView on GitHub

Add model-safety-regression-suite to portfolio

yingchen-coding•2 months ago

2fad1a6View on GitHub

View all commits