Hard Adversarial Example Mining for Improving Robust Fairness

Chenhao Lin; Xiang Ji; Yulong Yang; Qian Li; Chao Shen; Run Wang,; Liming Fang

arXiv:2308.01823·cs.LG·August 4, 2023·1 cites

Hard Adversarial Example Mining for Improving Robust Fairness

Chenhao Lin, Xiang Ji, Yulong Yang, Qian Li, Chao Shen, Run Wang,, Liming Fang

PDF

Open Access

TL;DR

This paper introduces HAM, a framework that improves the fairness of adversarial training by focusing on hard adversarial examples, reducing overconfidence issues, and enhancing robustness across multiple datasets.

Contribution

HAM is a novel adaptive hard adversarial example mining method that enhances robustness and fairness in adversarial training by selectively focusing on challenging examples.

Findings

01

HAM improves robust fairness significantly.

02

HAM reduces computational cost of adversarial training.

03

Experimental results outperform existing methods on CIFAR-10, SVHN, and Imagenette.

Abstract

Adversarial training (AT) is widely considered the state-of-the-art technique for improving the robustness of deep neural networks (DNNs) against adversarial examples (AE). Nevertheless, recent studies have revealed that adversarially trained models are prone to unfairness problems, restricting their applicability. In this paper, we empirically observe that this limitation may be attributed to serious adversarial confidence overfitting, i.e., certain adversarial examples with overconfidence. To alleviate this problem, we propose HAM, a straightforward yet effective framework via adaptive Hard Adversarial example Mining.HAM concentrates on mining hard adversarial examples while discarding the easy ones in an adaptive fashion. Specifically, HAM identifies hard AEs in terms of their step sizes needed to cross the decision boundary when calculating loss value. Besides, an early-dropping…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Advanced Neural Network Applications

MethodsAutoencoders