Trainwreck: A damaging adversarial attack on image classifiers

Jan Zah\'alka

arXiv:2311.14772·cs.CV·June 12, 2024·1 cites

Trainwreck: A damaging adversarial attack on image classifiers

Jan Zah\'alka

PDF

Open Access 1 Repo

TL;DR

This paper introduces Trainwreck, a novel train-time adversarial attack that damages image classifiers by poisoning training data stealthily, demonstrating high effectiveness across multiple models and proposing data redundancy as a defense.

Contribution

It formalizes damaging adversarial attacks and proposes Trainwreck, a black-box, transferable train-time attack that degrades model performance using stealthy data poisoning.

Findings

01

Trainwreck effectively damages models on CIFAR datasets.

02

It achieves comparable or better potency than existing data poisoning methods.

03

Data redundancy with hashing can defend against Trainwreck.

Abstract

Adversarial attacks are an important security concern for computer vision (CV). As CV models are becoming increasingly valuable assets in applied practice, disrupting them is emerging as a form of economic sabotage. This paper opens up the exploration of damaging adversarial attacks (DAAs) that seek to damage target CV models. DAAs are formalized by defining the threat model, the cost function DAAs maximize, and setting three requirements for success: potency, stealth, and customizability. As a pioneer DAA, this paper proposes Trainwreck, a train-time attack that conflates the data of similar classes in the training data using stealthy ( $ϵ \leq 8/255$ ) class-pair universal perturbations obtained from a surrogate model. Trainwreck is a black-box, transferable attack: it requires no knowledge of the target architecture, and a single poisoned dataset degrades the performance of any…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

janzahalka/trainwreck
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Anomaly Detection Techniques and Applications

MethodsDepthwise Convolution · Pointwise Convolution · 1x1 Convolution · Depthwise Separable Convolution · Batch Normalization · Inverted Residual Block · EfficientNetV2