FACM: Intermediate Layer Still Retain Effective Features against   Adversarial Examples

Xiangyuan Yang; Jie Lin; Hanlin Zhang; Xinyu Yang; Peng Zhao

arXiv:2206.00924·cs.CV·April 4, 2023

FACM: Intermediate Layer Still Retain Effective Features against Adversarial Examples

Xiangyuan Yang, Jie Lin, Hanlin Zhang, Xinyu Yang, Peng Zhao

PDF

Open Access

TL;DR

This paper introduces FACM, a novel approach that leverages effective features from intermediate neural network layers to improve robustness against adversarial attacks, combining feature analysis and conditional matching modules.

Contribution

The paper proposes a new model utilizing intermediate layer features for adversarial robustness, including correction modules and a decision module, which can be fine-tuned and combined with existing defenses.

Findings

01

Intermediate layers retain effective features for classification.

02

FACM improves robustness by reducing adversarial subspace.

03

Model can be fine-tuned and combined with other defenses.

Abstract

In strong adversarial attacks against deep neural networks (DNN), the generated adversarial example will mislead the DNN-implemented classifier by destroying the output features of the last layer. To enhance the robustness of the classifier, in our paper, a \textbf{F}eature \textbf{A}nalysis and \textbf{C}onditional \textbf{M}atching prediction distribution (FACM) model is proposed to utilize the features of intermediate layers to correct the classification. Specifically, we first prove that the intermediate layers of the classifier can still retain effective features for the original category, which is defined as the correction property in our paper. According to this, we propose the FACM model consisting of \textbf{F}eature \textbf{A}nalysis (FA) correction module, \textbf{C}onditional \textbf{M}atching \textbf{P}rediction \textbf{D}istribution (CMPD) correction module and decision…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications

MethodsFeedback Alignment