Improving Adversarial Robustness via Channel-wise Activation Suppressing

Yang Bai; Yuyuan Zeng; Yong Jiang; Shu-Tao Xia; Xingjun Ma; Yisen Wang

arXiv:2103.08307·cs.LG·January 19, 2022·59 cites

Improving Adversarial Robustness via Channel-wise Activation Suppressing

Yang Bai, Yuyuan Zeng, Yong Jiang, Shu-Tao Xia, Xingjun Ma, Yisen Wang

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a channel-wise activation suppressing strategy to improve the robustness of deep neural networks against adversarial attacks by reducing redundant activations caused by adversarial perturbations.

Contribution

The paper proposes a novel Channel-wise Activation Suppressing (CAS) method that inherently suppresses adversarial activations and enhances existing defense techniques.

Findings

01

CAS improves adversarial robustness when integrated with existing defenses.

02

Adversarial examples exhibit higher and more uniformly distributed channel activations.

03

CAS effectively suppresses redundant activations caused by adversarial perturbations.

Abstract

The study of adversarial examples and their activation has attracted significant attention for secure and robust learning with deep neural networks (DNNs). Different from existing works, in this paper, we highlight two new characteristics of adversarial examples from the channel-wise activation perspective: 1) the activation magnitudes of adversarial examples are higher than that of natural examples; and 2) the channels are activated more uniformly by adversarial examples than natural examples. We find that the state-of-the-art defense adversarial training has addressed the first issue of high activation magnitudes via training on adversarial examples, while the second issue of uniform activation remains. This motivates us to suppress redundant activation from being activated by adversarial perturbations via a Channel-wise Activation Suppressing (CAS) strategy. We show that CAS can…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

bymavis/CAS_ICLR2021
pytorchOfficial

Videos

Improving Adversarial Robustness via Channel-wise Activation Suppressing· slideslive

Taxonomy

TopicsAdversarial Robustness in Machine Learning