Assessing Neural Network Robustness via Adversarial Pivotal Tuning

Peter Ebert Christensen; V\'esteinn Sn{\ae}bjarnarson; Andrea Dittadi,; Serge Belongie; Sagie Benaim

arXiv:2211.09782·cs.CV·January 9, 2024

Assessing Neural Network Robustness via Adversarial Pivotal Tuning

Peter Ebert Christensen, V\'esteinn Sn{\ae}bjarnarson, Andrea Dittadi,, Serge Belongie, Sagie Benaim

PDF

Open Access 1 Video

TL;DR

This paper introduces Adversarial Pivotal Tuning (APT), a method that uses pretrained generative models to create semantic, class-preserving manipulations of images to evaluate and improve classifier robustness.

Contribution

The paper proposes APT, a novel technique that leverages pretrained generators for detailed, semantic manipulations to assess and enhance neural network robustness against adversarial attacks.

Findings

01

APT can generate diverse, photorealistic, class-preserving manipulations.

02

Classifiers robust to benchmark attacks are vulnerable to APT manipulations.

03

APT can be used to improve classifier robustness through targeted training.

Abstract

The robustness of image classifiers is essential to their deployment in the real world. The ability to assess this resilience to manipulations or deviations from the training data is thus crucial. These modifications have traditionally consisted of minimal changes that still manage to fool classifiers, and modern approaches are increasingly robust to them. Semantic manipulations that modify elements of an image in meaningful ways have thus gained traction for this purpose. However, they have primarily been limited to style, color, or attribute changes. While expressive, these manipulations do not make use of the full capabilities of a pretrained generative model. In this work, we aim to bridge this gap. We show how a pretrained image generator can be used to semantically manipulate images in a detailed, diverse, and photorealistic way while still preserving the class of the original…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Assessing Neural Network Robustness via Adversarial Pivotal Tuning· youtube

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Cell Image Analysis Techniques · Adversarial Robustness in Machine Learning