Found 1,500 repositories (showing 30)
cleverhans-lab
An adversarial example library for constructing attacks, building defenses, and benchmarking both
makcedward
Data augmentation for NLP
bethgelab
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
advboxes
Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle, PyTorch, Caffe2, MXNet, Keras, and TensorFlow. Advbox can also benchmark the robustness of machine learning models, and it provides a command-line tool to generate adversarial examples with zero coding.
BorealisAI
A Toolbox for Adversarial Robustness Research
anishathalye
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples
carlini
Robust evasion attacks against neural networks to find adversarial examples
thunlp
An Open-Source Package for Textual Adversarial Attack.
hendrycks
A Harder ImageNet Test Set (CVPR 2021)
airbnb
🗣️ Tool to generate adversarial text examples and test machine learning models against them
sarathknv
Implementation of Papers on Adversarial Examples
utkuozbulak
PyTorch implementation of convolutional neural network adversarial attack techniques
LLM-Tuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
Trustworthy-AI-Group
A list of recent papers about adversarial learning
coeff-giving
Contest Proposal and infrastructure for the Unrestricted Adversarial Examples Challenge
carlini
Targeted Adversarial Examples on Speech-to-Text systems
1Konny
Simple PyTorch implementation of FGSM and I-FGSM
mathcbc
A PyTorch implementation of the paper "Generating Adversarial Examples with Adversarial Networks" (advGAN).
duoergun0729
Adversarial examples
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
A curated list of awesome resources for adversarial examples in deep learning
dongyp13
A non-targeted adversarial attack method that won first place in the NIPS 2017 non-targeted adversarial attacks competition
A reading list, mainly on adversarial examples (attacks, defenses, etc.), that I try to keep updated regularly.
eth-sri
A certifiable defense against adversarial examples by training neural networks to be provably robust
tao-bai
A curated list of papers on adversarial machine learning (adversarial examples and defense methods).
MadryLab
Datasets for the paper "Adversarial Examples are not Bugs, They Are Features"
davidiommi
PyTorch pipeline for 3D image domain translation using Cycle-Generative-Adversarial-networks, without paired examples.
ashafahi
Official TensorFlow implementation of "Adversarial Training for Free!", which trains robust models at no extra cost compared to natural training.
jsikyoon
TensorFlow implementation of adversarial attacks on capsule networks
Implementation code for the paper "Generating Natural Language Adversarial Examples"