Found 10 repositories (showing 10)
gally16
LLM Jailbreaking Guide: a jailbreaking guide for mainstream large language models
thu-coai
[ACL 2025] Guiding not Forcing: Enhancing the Transferability of Jailbreaking Attacks on LLMs via Removing Superfluous Constraints
Bowen1911
Code for the paper "xJailbreak: Representation Space Guided Reinforcement Learning for Interpretable LLM Jailbreaking"
XuanChen-xc
Code for "When LLM Meets DRL: Advancing Jailbreaking Efficiency via DRL-guided Search" (NeurIPS 2024)
ZJU-LLM-Safety
[AAAI-2026] MAJIC: Markovian Adaptive Jailbreaking. An automated black-box attack framework against LLMs that iteratively selects and fuses disguise strategies, guided by a dynamically updated Markov transition matrix.
SlowLow999
A guide for every LLM jailbreaker. Learn, Test and Break!
Miabeyefendi
A collection of prompts for LLMs such as GPT-4, Gemini, and Claude. Includes prompt-engineering guides, productivity templates, developer modes, jailbreaks (JB), and system overrides for testing AI safety. Educational purposes only!
capetron
LLM security threats and mitigations: prompt injection, data leakage, model poisoning, and jailbreaking. Includes an enterprise AI security checklist and an on-premises deployment guide.
VVVI5HNU
Defensive guide for testing and securing LLM-integrated applications against prompt injection, API misuse, data leakage, and jailbreak attempts.
coollane925
A beginner-to-intermediate report for readers interested in LLM conditioning, probing, and a general understanding of the fundamentals. This is NOT a guide on how to jailbreak LLMs. A synopsis at the top of the report gives a more detailed description.