Enhancing Robust Representation in Adversarial Training: Alignment and   Exclusion Criteria

Nuoyan Zhou; Nannan Wang; Decheng Liu; Dawei Zhou; Xinbo Gao

arXiv:2310.03358·cs.CV·November 21, 2023

Enhancing Robust Representation in Adversarial Training: Alignment and Exclusion Criteria

Nuoyan Zhou, Nannan Wang, Decheng Liu, Dawei Zhou, Xinbo Gao

PDF

Open Access 1 Repo

TL;DR

This paper proposes a novel framework for adversarial training that enhances robust feature learning through alignment and exclusion criteria, significantly improving adversarial robustness on benchmark datasets.

Contribution

It introduces a generic adversarial training framework using asymmetric negative contrast and reverse attention to better learn robust features.

Findings

01

Achieves state-of-the-art robustness on three benchmark datasets.

02

Effectively separates classes in feature space for improved robustness.

03

Enhances alignment between natural and adversarial examples.

Abstract

Deep neural networks are vulnerable to adversarial noise. Adversarial Training (AT) has been demonstrated to be the most effective defense strategy to protect neural networks from being fooled. However, we find AT omits to learning robust features, resulting in poor performance of adversarial robustness. To address this issue, we highlight two criteria of robust representation: (1) Exclusion: \emph{the feature of examples keeps away from that of other classes}; (2) Alignment: \emph{the feature of natural and corresponding adversarial examples is close to each other}. These motivate us to propose a generic framework of AT to gain robust representation, by the asymmetric negative contrast and reverse attention. Specifically, we design an asymmetric negative contrast based on predicted probabilities, to push away examples of different classes in the feature space. Moreover, we propose to…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

changzhang777/ancra
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications