Masking Adversarial Damage: Finding Adversarial Saliency for Robust and   Sparse Network

Byung-Kwan Lee; Junho Kim; Yong Man Ro

arXiv:2204.02738·cs.CV·April 7, 2022

Masking Adversarial Damage: Finding Adversarial Saliency for Robust and Sparse Network

Byung-Kwan Lee, Junho Kim, Yong Man Ro

PDF

Open Access 1 Repo

TL;DR

This paper introduces MAD, a novel adversarial pruning method that uses second-order information to identify and remove non-essential parameters, maintaining robustness while reducing model size.

Contribution

MAD is the first adversarial pruning technique leveraging second-order loss information to preserve robustness and sparsity simultaneously.

Findings

01

MAD effectively prunes adversarially trained networks without losing robustness.

02

Model parameters in the initial layer are highly sensitive to adversarial examples.

03

Compressed feature representations retain semantic information for target objects.

Abstract

Adversarial examples provoke weak reliability and potential security issues in deep neural networks. Although adversarial training has been widely studied to improve adversarial robustness, it works in an over-parameterized regime and requires high computations and large memory budgets. To bridge adversarial robustness and model compression, we propose a novel adversarial pruning method, Masking Adversarial Damage (MAD) that employs second-order information of adversarial loss. By using it, we can accurately estimate adversarial saliency for model parameters and determine which parameters can be pruned without weakening adversarial robustness. Furthermore, we reveal that model parameters of initial layer are highly sensitive to the adversarial examples and show that compressed feature representation retains semantic information for the target objects. Through extensive experiments on…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ByungKwanLee/Masking-Adversarial-Damage
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Advanced Neural Network Applications · Anomaly Detection Techniques and Applications

MethodsPruning