Back to search
Contains random samples referenced in the paper "Sleeper Agents: Training Robustly Deceptive LLMs that Persist Through Safety Training".
Stars
142
Forks
19
Watchers
142
Open Issues
2
Overall repository health assessment
No language data available
No package.json found
This might not be a Node.js project