Search Results

Found 523 repositories(showing 30)

Awesome_GPT_Super_Prompting

CyberAlbSecOP

💛79

ChatGPT Jailbreaks, GPT Assistants Prompt Leaks, GPTs Prompt Injection, LLM Prompt Security, Super Prompts, Prompt Hack, Prompt Security, Ai Prompt Engineering, Adversarial Machine Learning.

3.8k

470

GPL-3.0

HTML

Updated 30 minutes ago

adversarial-machine-learningagentai+14

AI-Infra-Guard

Tencent

💛77

A full-stack AI Red Teaming platform securing AI ecosystems via OpenClaw Security Scan, Agent Scan, Skills Scan, MCP scan, AI Infra scan and LLM jailbreak evaluation.

3.4k

337

Apache-2.0

Python

Updated 23 minutes ago

agentaibenchmark+13

agentic_security

msoedov

🧡69

Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪

1.8k

246

Apache-2.0

Python

Updated 8 hours ago

agent-frameworkagent-securityai-red-team+11

FuzzyAI

cyberark

💛73

A powerful tool for automated LLM fuzzing. It is designed to help developers and security researchers identify and mitigate potential jailbreaks in their LLM APIs.

1.3k

182

Apache-2.0

Jupyter Notebook

Updated 9 hours ago

aiai-red-teamfuzzing+7

Awesome-Jailbreak-on-LLMs

yueliu1999

🧡67

Awesome-Jailbreak-on-LLMs is a collection of state-of-the-art, novel, exciting jailbreak methods on LLMs. It contains papers, codes, datasets, evaluations, and analyses.

1.3k

109

MIT

Updated 9 hours ago

aijailbreakllm+6

Spiritual-Spell-Red-Teaming

Goochbeater

🧡67

A repo for jailbreaking various LLMs, mainly Claude

716

164

Updated 2 hours ago

LLM-Jailbreaks

langgptai

💛71

LLM Jailbreaks, ChatGPT, Claude, Llama, DAN Prompts, Prompt Leaking

582

Apache-2.0

Updated 11 hours ago

aichatgptclaude+6

vigil-llm

deadbits

🧡66

⚡ Vigil ⚡ Detect prompt injections, jailbreaks, and other potentially risky Large Language Model (LLM) inputs

470

Apache-2.0

Python

Updated 23 hours ago

adversarial-attacksadversarial-machine-learninglarge-language-models+5

awesome-gemini-prompts

langgptai

💛71

Gemini Prompts, Gemini 3 Prompts, jailbreak, LLM Prompts, LangGPT —— by 云中江树

436

CC0-1.0

Updated 10 hours ago

awesome-llm-promptschatgptgemini+5

JamesGPT

jconorgrogan

🧡56

Jailbreak for ChatGPT: Predict the future, opine on politics and controversial topics, and assess what is true. May help us understand more about LLM Bias

395

Updated 3 weeks ago

llm-adaptive-attacks

tml-epfl

🧡56

Jailbreaking Leading Safety-Aligned LLMs with Simple Adaptive Attacks [ICLR 2025]

380

MIT

Shell

Updated 1 week ago

AutoDAN-Turbo

SaFo-Lab

💛71

[ICLR 2025 Spotlight] The official implementation of our ICLR2025 paper "AutoDAN-Turbo: A Lifelong Agent for Strategy Self-Exploration to Jailbreak LLMs".

359

MIT

Python

Updated 6 hours ago

persuasive_jailbreaker

CHATS-lab

🧡66

Persuasive Jailbreaker: we can persuade LLMs to jailbreak them!

355

Apache-2.0

HTML

Updated 23 hours ago

UltraBr3aks

SlowLow999

🧡66

sharing NEW strong AI jailbreaks of multiple vendors (LLMs)

266

Updated 1 day ago

Chatgpt-DAN

alexisvalentino

🧡66

DAN - The ‘JAILBREAK’ Version of ChatGPT and How to Use it. (update: this was 3 years ago, might not work at the current state of LLMs check OBLITERATUS)

227

Updated 12 hours ago

TAP

RICommunity

🧡56

TAP: An automated jailbreaking method for black-box LLMs

226

MIT

Python

Updated 1 week ago

JailbreakEval

CryptoAILab

🧡65

[NDSS'25 Best Technical Poster] A collection of automated evaluators for assessing jailbreak attempts.

191

MIT

Python

Updated 3 days ago

llm-jailbreaksllm-safety

COLD-Attack

Yu-Fangxu

🧡65

[ICML 2024] COLD-Attack: Jailbreaking LLMs with Stealthiness and Controllability

177

Python

Updated 5 days ago

augustus

praetorian-inc

🧡65

LLM security testing framework for detecting prompt injection, jailbreaks, and adversarial attacks — 190+ probes, 28 providers, single Go binary

174

Apache-2.0

Updated 11 hours ago

ai-securitycapability

FlipAttack

yueliu1999

🧡65

[ICML 2025] An official source code for paper "FlipAttack: Jailbreak LLMs via Flipping".

171

Python

Updated 2 days ago

aijailbreakllm+3

spikee

ReversecLabs

🧡66

Simple Prompt Injection Kit for Evaluation and Exploitation

164

Apache-2.0

HTML

Updated 1 day ago

genaillm-jailbreaksllm-red-teaming+3

JailTrickBench

usail-hkust

💛70

Bag of Tricks: Benchmarking of Jailbreak Attacks on LLMs. Empirical tricks for LLM Jailbreaking. (NeurIPS 2024)

162

MIT

Python

Updated 3 days ago

AwesomeLLMJailBreakPapers

WhileBug

🧡60

Awesome LLM Jailbreak academic papers

134

Updated 2 days ago

last_layer

arekusandr

🧡50

Ultra-fast, low latency LLM prompt injection/jailbreak detection ⛓️

125

MIT

Python

Updated 2 weeks ago

chatgpt-promptsjailbreaklarge-language-models+6

Agent-Smith

sail-sg

🧡55

[ICML 2024] Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast

117

MIT

Python

Updated 1 week ago

wildguard

allenai

🧡60

Open One-Stop Moderation Tools for Safety Risks, Jailbreaks, and Refusals of LLMs

113

NOASSERTION

Python

Updated 2 weeks ago

ArtPrompt

uw-nsl

💛70

[ACL24] Official Repo of Paper `ArtPrompt: ASCII Art-based Jailbreak Attacks against Aligned LLMs`

MIT

Python

Updated 1 day ago

[COLM 2024] JailBreakV-28K: A comprehensive benchmark designed to evaluate the transferability of LLM jailbreak attacks to MLLMs, and further assess the robustness and safety of MLLMs against a variety of jailbreak attacks.

Python

Updated 1 month ago

jailbreakv-28k

CJA_Comprehensive_Jailbreak_Assessment

Junjie-Chu

❤️35

This is the public code repository of paper 'Comprehensive Assessment of Jailbreak Attacks Against LLMs'

Python

Updated 7 months ago

LLM-Jailbreaking-Guide

gally16

❤️45

LLM Jailbreaking Guide主流大语言模型越狱指南

Updated 1 month ago

GitHub Explorer

Search Results

Awesome_GPT_Super_Prompting

AI-Infra-Guard

agentic_security

FuzzyAI

Awesome-Jailbreak-on-LLMs

Spiritual-Spell-Red-Teaming

LLM-Jailbreaks

vigil-llm

awesome-gemini-prompts

JamesGPT

llm-adaptive-attacks

AutoDAN-Turbo

persuasive_jailbreaker

UltraBr3aks

Chatgpt-DAN

TAP

JailbreakEval

COLD-Attack

augustus

FlipAttack

spikee

JailTrickBench

AwesomeLLMJailBreakPapers

last_layer

Agent-Smith

wildguard

ArtPrompt

JailBreakV_28K

CJA_Comprehensive_Jailbreak_Assessment

LLM-Jailbreaking-Guide

Awesome_GPT_Super_Prompting

AI-Infra-Guard

agentic_security

FuzzyAI

Awesome-Jailbreak-on-LLMs

Spiritual-Spell-Red-Teaming

LLM-Jailbreaks

vigil-llm

awesome-gemini-prompts

JamesGPT

llm-adaptive-attacks

AutoDAN-Turbo

persuasive_jailbreaker

UltraBr3aks

Chatgpt-DAN

TAP

JailbreakEval

COLD-Attack

augustus

FlipAttack

spikee

JailTrickBench

AwesomeLLMJailBreakPapers

last_layer

Agent-Smith

wildguard

ArtPrompt

JailBreakV_28K

CJA_Comprehensive_Jailbreak_Assessment

LLM-Jailbreaking-Guide