Hard Adversarial Example Mining for Improving Robust Fairness
Chenhao Lin, Xiang Ji, Yulong Yang, Qian Li, Chao Shen, Run Wang,, Liming Fang

TL;DR
This paper introduces HAM, a framework that improves the fairness of adversarial training by focusing on hard adversarial examples, reducing overconfidence issues, and enhancing robustness across multiple datasets.
Contribution
HAM is a novel adaptive hard adversarial example mining method that enhances robustness and fairness in adversarial training by selectively focusing on challenging examples.
Findings
HAM improves robust fairness significantly.
HAM reduces computational cost of adversarial training.
Experimental results outperform existing methods on CIFAR-10, SVHN, and Imagenette.
Abstract
Adversarial training (AT) is widely considered the state-of-the-art technique for improving the robustness of deep neural networks (DNNs) against adversarial examples (AE). Nevertheless, recent studies have revealed that adversarially trained models are prone to unfairness problems, restricting their applicability. In this paper, we empirically observe that this limitation may be attributed to serious adversarial confidence overfitting, i.e., certain adversarial examples with overconfidence. To alleviate this problem, we propose HAM, a straightforward yet effective framework via adaptive Hard Adversarial example Mining.HAM concentrates on mining hard adversarial examples while discarding the easy ones in an adaptive fashion. Specifically, HAM identifies hard AEs in terms of their step sizes needed to cross the decision boundary when calculating loss value. Besides, an early-dropping…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications
MethodsAutoencoders
