Robustness May Be at Odds with Fairness: An Empirical Study on   Class-wise Accuracy

Philipp Benz; Chaoning Zhang; Adil Karjauv; In So Kweon

arXiv:2010.13365·cs.LG·October 12, 2021·20 cites

Robustness May Be at Odds with Fairness: An Empirical Study on Class-wise Accuracy

Philipp Benz, Chaoning Zhang, Adil Karjauv, In So Kweon

PDF

Open Access

TL;DR

This paper empirically investigates class-wise accuracy and robustness in adversarially trained CNNs, revealing inter-class discrepancies that are consistent across datasets and models, and exploring potential mitigation strategies.

Contribution

It provides the first comprehensive analysis of class-wise robustness discrepancies in adversarial training, highlighting their universality and potential causes.

Findings

01

Inter-class accuracy and robustness vary significantly even with balanced datasets.

02

Adversarial training tends to increase inter-class robustness discrepancies.

03

Discrepancies are consistent across different datasets, models, and hyper-parameters.

Abstract

Convolutional neural networks (CNNs) have made significant advancement, however, they are widely known to be vulnerable to adversarial attacks. Adversarial training is the most widely used technique for improving adversarial robustness to strong white-box attacks. Prior works have been evaluating and improving the model average robustness without class-wise evaluation. The average evaluation alone might provide a false sense of robustness. For example, the attacker can focus on attacking the vulnerable class, which can be dangerous, especially, when the vulnerable class is a critical one, such as "human" in autonomous driving. We propose an empirical study on the class-wise accuracy and robustness of adversarially trained models. We find that there exists inter-class discrepancy for accuracy and robustness even when the training dataset has an equal number of samples for each class. For…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsQualitative Comparative Analysis Research