Toward Adversarial Robustness via Semi-supervised Robust Training
Yiming Li, Baoyuan Wu, Yan Feng, Yanbo Fan, Yong Jiang, Zhifeng Li,, Shutao Xia

TL;DR
This paper introduces a semi-supervised robust training method that jointly minimizes standard and robust risks to improve adversarial robustness of deep neural networks, extending to various perturbation types.
Contribution
The work proposes a novel robust training approach that combines standard and robust risk minimization, with a semi-supervised extension and theoretical guarantees, covering diverse perturbation types.
Findings
SRT outperforms state-of-the-art defenses against pixel-wise and spatial perturbations.
Theoretical proof shows R_adv is upper-bounded by R_stand + R_rob.
Method demonstrates robustness to combined perturbations.
Abstract
Adversarial examples have been shown to be the severe threat to deep neural networks (DNNs). One of the most effective adversarial defense methods is adversarial training (AT) through minimizing the adversarial risk , which encourages both the benign example and its adversarially perturbed neighborhoods within the -ball to be predicted as the ground-truth label. In this work, we propose a novel defense method, the robust training (RT), by jointly minimizing two separated risks ( and ), which is with respect to the benign example and its neighborhoods respectively. The motivation is to explicitly and jointly enhance the accuracy and the adversarial robustness. We prove that is upper-bounded by , which implies that RT has similar effect as AT. Intuitively, minimizing the standard risk enforces the benign example to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Integrated Circuits and Semiconductor Failure Analysis · Anomaly Detection Techniques and Applications
