FACM: Intermediate Layer Still Retain Effective Features against Adversarial Examples
Xiangyuan Yang, Jie Lin, Hanlin Zhang, Xinyu Yang, Peng Zhao

TL;DR
This paper introduces FACM, a novel approach that leverages effective features from intermediate neural network layers to improve robustness against adversarial attacks, combining feature analysis and conditional matching modules.
Contribution
The paper proposes a new model utilizing intermediate layer features for adversarial robustness, including correction modules and a decision module, which can be fine-tuned and combined with existing defenses.
Findings
Intermediate layers retain effective features for classification.
FACM improves robustness by reducing adversarial subspace.
Model can be fine-tuned and combined with other defenses.
Abstract
In strong adversarial attacks against deep neural networks (DNN), the generated adversarial example will mislead the DNN-implemented classifier by destroying the output features of the last layer. To enhance the robustness of the classifier, in our paper, a \textbf{F}eature \textbf{A}nalysis and \textbf{C}onditional \textbf{M}atching prediction distribution (FACM) model is proposed to utilize the features of intermediate layers to correct the classification. Specifically, we first prove that the intermediate layers of the classifier can still retain effective features for the original category, which is defined as the correction property in our paper. According to this, we propose the FACM model consisting of \textbf{F}eature \textbf{A}nalysis (FA) correction module, \textbf{C}onditional \textbf{M}atching \textbf{P}rediction \textbf{D}istribution (CMPD) correction module and decision…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
MethodsFeedback Alignment
