On Model Robustness Against Adversarial Examples
Shufei Zhang, Kaizhu Huang, Zenglin Xu

TL;DR
This paper introduces a new theoretical framework based on energy functions to analyze and improve the robustness of deep learning models against adversarial examples, outperforming existing methods on multiple datasets.
Contribution
The paper proposes a novel energy-based theory for model robustness and develops a new regularization method that enhances adversarial resistance beyond traditional training approaches.
Findings
Energy regularization improves robustness against adversarial attacks.
The proposed method outperforms previous adversarial training techniques.
Models trained with energy regularization show superior performance on MNIST, CIFAR-10, and SVHN.
Abstract
We study the model robustness against adversarial examples, referred to as small perturbed input data that may however fool many state-of-the-art deep learning models. Unlike previous research, we establish a novel theory addressing the robustness issue from the perspective of stability of the loss function in the small neighborhood of natural examples. We propose to exploit an energy function to describe the stability and prove that reducing such energy guarantees the robustness against adversarial examples. We also show that the traditional training methods including adversarial training with the norm constraint (AT) and Virtual Adversarial Training (VAT) tend to minimize the lower bound of our proposed energy function. We make an analysis showing that minimization of such lower bound can however lead to insufficient robustness within the neighborhood around the input sample.…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Security and Verification in Computing · Advanced Malware Detection Techniques
