Improving Adversarial Robustness with Self-Paced Hard-Class Pair Reweighting
Pengyue Hou, Jie Han, Xingyu Li

TL;DR
This paper introduces a self-paced reweighting strategy in adversarial training that emphasizes hard-class pairs, improving neural network robustness against adversarial attacks by leveraging class similarity and consistency.
Contribution
It proposes a novel self-paced reweighting method focusing on hard-class pairs to enhance adversarial robustness in neural networks.
Findings
Outperforms state-of-the-art defenses in robustness.
Effectively leverages hard-class pair information.
Boosts model discriminability and consistency.
Abstract
Deep Neural Networks are vulnerable to adversarial attacks. Among many defense strategies, adversarial training with untargeted attacks is one of the most effective methods. Theoretically, adversarial perturbation in untargeted attacks can be added along arbitrary directions and the predicted labels of untargeted attacks should be unpredictable. However, we find that the naturally imbalanced inter-class semantic similarity makes those hard-class pairs become virtual targets of each other. This study investigates the impact of such closely-coupled classes on adversarial attacks and develops a self-paced reweighting strategy in adversarial training accordingly. Specifically, we propose to upweight hard-class pair losses in model optimization, which prompts learning discriminative features from hard classes. We further incorporate a term to quantify hard-class pair consistency in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Anomaly Detection Techniques and Applications
