Adversarially Robust Classification by Conditional Generative Model Inversion
Mitra Alirezaei, Tolga Tasdizen

TL;DR
This paper introduces a new classification approach using conditional generative model inversion that is inherently robust to adversarial attacks without gradient obfuscation or prior attack assumptions.
Contribution
The proposed method employs a conditional generator inversion for classification, avoiding gradient obfuscation and assumptions about attack magnitude, enhancing robustness against adversarial attacks.
Findings
Highly robust against black-box attacks
Improved robustness against white-box attacks
Does not obfuscate gradients, unlike Defense-GAN
Abstract
Most adversarial attack defense methods rely on obfuscating gradients. These methods are successful in defending against gradient-based attacks; however, they are easily circumvented by attacks which either do not use the gradient or by attacks which approximate and use the corrected gradient. Defenses that do not obfuscate gradients such as adversarial training exist, but these approaches generally make assumptions about the attack such as its magnitude. We propose a classification model that does not obfuscate gradients and is robust by construction without assuming prior knowledge about the attack. Our method casts classification as an optimization problem where we "invert" a conditional generator trained on unperturbed, natural images to find the class that generates the closest sample to the query image. We hypothesize that a potential source of brittleness against adversarial…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Bacillus and Francisella bacterial research · Forensic and Genetic Research
