Found 1,098 repositories (showing 30)
bethgelab
A Python toolbox to create adversarial examples that fool neural networks in PyTorch, TensorFlow, and JAX
advboxes
Advbox is a toolbox to generate adversarial examples that fool neural networks in PaddlePaddle, PyTorch, Caffe2, MxNet, Keras, and TensorFlow. Advbox can also benchmark the robustness of machine learning models, and it provides a command-line tool to generate adversarial examples with zero coding.
anishathalye
Obfuscated Gradients Give a False Sense of Security: Circumventing Defenses to Adversarial Examples
carlini
Robust evasion attacks against neural networks to find adversarial examples
airbnb
🗣️ Tool to generate adversarial text examples and test machine learning models against them
sarathknv
Implementation of Papers on Adversarial Examples
LLM-Tuning-Safety
We jailbreak GPT-3.5 Turbo’s safety guardrails by fine-tuning it on only 10 adversarially designed examples, at a cost of less than $0.20 via OpenAI’s APIs.
Trustworthy-AI-Group
A list of recent papers about adversarial learning
coeff-giving
Contest Proposal and infrastructure for the Unrestricted Adversarial Examples Challenge
carlini
Targeted Adversarial Examples on Speech-to-Text systems
mathcbc
A PyTorch implementation of the paper "Generating Adversarial Examples with Adversarial Networks" (advGAN).
duoergun0729
Adversarial examples
Repository for the Paper (AAAI 2024, Oral) --- Visual Adversarial Examples Jailbreak Large Language Models
A curated list of awesome resources for adversarial examples in deep learning
A reading list, mainly on adversarial examples (attacks, defenses, etc.), that I try to keep updated regularly.
eth-sri
A certifiable defense against adversarial examples by training neural networks to be provably robust
tao-bai
A curated list of papers on adversarial machine learning (adversarial examples and defense methods).
MadryLab
Datasets for the paper "Adversarial Examples are not Bugs, They Are Features"
davidiommi
PyTorch pipeline for 3D image domain translation using Cycle-Generative-Adversarial-networks, without paired examples.
Implementation code for the paper "Generating Natural Language Adversarial Examples"
cihangxie
Improving Transferability of Adversarial Examples with Input Diversity
YisenWang
Code for ICLR 2020 "Improving Adversarial Robustness Requires Revisiting Misclassified Examples"
tinapan-pt
Official PyTorch implementation of the paper "VideoMoCo: Contrastive Video Representation Learning with Temporally Adversarial Examples" (CVPR 2021).
LOLESXi-Project
LOLESXi is a curated compilation of binaries/scripts available in VMware ESXi that were used by adversaries in their intrusions. This project gathers procedural examples from public reports of adversarial activities targeting ESXi hosts.
pfnet-research
Submission to Kaggle NIPS'17 competition on adversarial examples (non-targeted adversarial attack track)
zhengliz
Generating Natural Adversarial Examples, ICLR 2018
dongyp13
The translation-invariant adversarial attack method to improve the transferability of adversarial examples.
BruceMacD
Testing the effectiveness of practical implementations of adversarial examples against facial recognition.
yanminglai
Implementation of the paper "Generating Adversarial Malware Examples for Black-Box Attacks Based on GAN" (2017)
cihangxie
Adversarial Examples for Semantic Segmentation and Object Detection