CE-based white-box adversarial attacks will not work using super-fitting

Youhuan Yang; Lei Sun; Leyu Dai; Song Guo; Xiuqing Mao; Xiaoqin Wang; and Bayi Xu

arXiv:2205.02741·cs.LG·May 17, 2022

CE-based white-box adversarial attacks will not work using super-fitting

Youhuan Yang, Lei Sun, Leyu Dai, Song Guo, Xiuqing Mao, Xiaoqin Wang, and Bayi Xu

PDF

Open Access

TL;DR

This paper introduces a super-fitting technique that mathematically guarantees resistance against all current and future CE-based white-box adversarial attacks, significantly enhancing neural network robustness.

Contribution

It proposes a novel super-fitting approach with a mathematical proof of effectiveness, enabling models to resist all CE-based white-box adversarial attacks.

Findings

01

Super-fitting improves adversarial robustness against various attack algorithms.

02

The method outperforms nearly 50 recent defense models in robustness.

03

Super-fitting achieves the highest robustness in experimental evaluations.

Abstract

Deep neural networks are widely used in various fields because of their powerful performance. However, recent studies have shown that deep learning models are vulnerable to adversarial attacks, i.e., adding a slight perturbation to the input will make the model obtain wrong results. This is especially dangerous for some systems with high-security requirements, so this paper proposes a new defense method by using the model super-fitting state to improve the model's adversarial robustness (i.e., the accuracy under adversarial attacks). This paper mathematically proves the effectiveness of super-fitting and enables the model to reach this state quickly by minimizing unrelated category scores (MUCS). Theoretically, super-fitting can resist any existing (even future) CE-based white-box adversarial attacks. In addition, this paper uses a variety of powerful attack algorithms to evaluate the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications