Improving Adversarial Robustness by Putting More Regularizations on Less Robust Samples
Dongyoon Yang, Insung Kong, Yongdai Kim

TL;DR
This paper introduces a new adversarial training method that applies more regularization to less robust samples, leading to improved robustness and generalization in neural networks.
Contribution
It proposes a theoretically motivated adversarial training algorithm that emphasizes regularization on vulnerable samples, outperforming existing methods.
Findings
Achieves state-of-the-art robustness against adversarial attacks.
Improves generalization accuracy on clean data.
Demonstrates superior empirical performance through numerical experiments.
Abstract
Adversarial training, which is to enhance robustness against adversarial attacks, has received much attention because it is easy to generate human-imperceptible perturbations of data to deceive a given deep neural network. In this paper, we propose a new adversarial training algorithm that is theoretically well motivated and empirically superior to other existing algorithms. A novel feature of the proposed algorithm is to apply more regularization to data vulnerable to adversarial attacks than other existing regularization algorithms do. Theoretically, we show that our algorithm can be understood as an algorithm of minimizing the regularized empirical risk motivated from a newly derived upper bound of the robust risk. Numerical experiments illustrate that our proposed algorithm improves the generalization (accuracy on examples) and robustness (accuracy on adversarial attacks)…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications · COVID-19 diagnosis using AI
MethodsBalanced Selection
