Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial   Training

Jiacheng Zhang; Feng Liu; Dawei Zhou; Jingfeng Zhang; Tongliang Liu

arXiv:2406.00685·cs.CV·June 4, 2024

Improving Accuracy-robustness Trade-off via Pixel Reweighted Adversarial Training

Jiacheng Zhang, Feng Liu, Dawei Zhou, Jingfeng Zhang, Tongliang Liu

PDF

Open Access 1 Repo

TL;DR

This paper introduces Pixel-reweighted Adversarial Training (PART), a novel approach that allocates different perturbation budgets to pixel regions based on their importance, improving accuracy and robustness in adversarial training.

Contribution

The paper proposes a new pixel-reweighted adversarial training framework that emphasizes key regions, enhancing model accuracy without sacrificing robustness.

Findings

01

PART improves accuracy on CIFAR-10, SVHN, and TinyImagenet-200.

02

It maintains robustness while increasing accuracy.

03

Pixel importance guides perturbation allocation effectively.

Abstract

Adversarial training (AT) trains models using adversarial examples (AEs), which are natural images modified with specific perturbations to mislead the model. These perturbations are constrained by a predefined perturbation budget $ϵ$ and are equally applied to each pixel within an image. However, in this paper, we discover that not all pixels contribute equally to the accuracy on AEs (i.e., robustness) and accuracy on natural images (i.e., accuracy). Motivated by this finding, we propose Pixel-reweighted AdveRsarial Training (PART), a new framework that partially reduces $ϵ$ for less influential pixels, guiding the model to focus more on key regions that affect its outputs. Specifically, we first use class activation mapping (CAM) methods to identify important pixel regions, then we keep the perturbation budget for these regions while lowering it for the remaining regions…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

tmlr-group/PART
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage Processing Techniques and Applications · Integrated Circuits and Semiconductor Failure Analysis · Adversarial Robustness in Machine Learning

MethodsFocus