60 self-contained, Dockerized vulnerable web applications for evaluating AI-powered penetration testing agents. Covers modern tech stacks (Node.js, Python, Go, Java, PHP, Ruby) across diverse vulnerability classes from basic injection to multi-step exploit chains.
Stars
21
Forks
3
Watchers
21
Open Issues
0
Overall repository health assessment
No package.json found
This might not be a Node.js project
9
commits
2
commits
Consolidate 60 benchmark branches into benchmarks/ directory
978b38aView on GitHubPosition as premier open pentest benchmark, Apache 2.0 license
4abfb42View on GitHubAdd README.md with project overview and benchmark inventory
f0e0ac4View on GitHubAdd implementation plans for APEX-041-25 through APEX-060-25 chain benchmarks
c1e387aView on GitHubUpdate all implementation plans with src/ directory structure and expected_results
68253b8View on GitHubInitial commit: Add project documentation and implementation plans
1b2aaa9View on GitHub