Assessing Neural Network Robustness via Adversarial Pivotal Tuning
Peter Ebert Christensen, V\'esteinn Sn{\ae}bjarnarson, Andrea Dittadi,, Serge Belongie, Sagie Benaim

TL;DR
This paper introduces Adversarial Pivotal Tuning (APT), a method that uses pretrained generative models to create semantic, class-preserving manipulations of images to evaluate and improve classifier robustness.
Contribution
The paper proposes APT, a novel technique that leverages pretrained generators for detailed, semantic manipulations to assess and enhance neural network robustness against adversarial attacks.
Findings
APT can generate diverse, photorealistic, class-preserving manipulations.
Classifiers robust to benchmark attacks are vulnerable to APT manipulations.
APT can be used to improve classifier robustness through targeted training.
Abstract
The robustness of image classifiers is essential to their deployment in the real world. The ability to assess this resilience to manipulations or deviations from the training data is thus crucial. These modifications have traditionally consisted of minimal changes that still manage to fool classifiers, and modern approaches are increasingly robust to them. Semantic manipulations that modify elements of an image in meaningful ways have thus gained traction for this purpose. However, they have primarily been limited to style, color, or attribute changes. While expressive, these manipulations do not make use of the full capabilities of a pretrained generative model. In this work, we aim to bridge this gap. We show how a pretrained image generator can be used to semantically manipulate images in a detailed, diverse, and photorealistic way while still preserving the class of the original…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Assessing Neural Network Robustness via Adversarial Pivotal Tuning· youtube
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Adversarial Robustness in Machine Learning
