Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation
Shiji Zhao, Chi Chen, Ranjie Duan, Xizhe Wang, Xingxing Wei

TL;DR
This paper introduces Anti-Bias Soft Label Distillation (ABSLD), a novel method that improves adversarial robustness fairness across classes by adaptively adjusting soft label smoothness during training, outperforming existing methods.
Contribution
The paper proposes ABSLD, a new knowledge distillation approach that enhances adversarial robustness fairness by class-wise soft label adjustment, supported by empirical and theoretical analysis.
Findings
ABSLD improves robustness and fairness simultaneously.
Class-wise soft label smoothness significantly impacts robustness fairness.
ABSLD outperforms state-of-the-art methods in experiments.
Abstract
Adversarial Training (AT) is widely recognized as an effective approach to enhance the adversarial robustness of Deep Neural Networks. As a variant of AT, Adversarial Robustness Distillation (ARD) has shown outstanding performance in enhancing the robustness of small models. However, both AT and ARD face robust fairness issue: these models tend to display strong adversarial robustness against some classes (easy classes) while demonstrating weak adversarial robustness against others (hard classes). This paper explores the underlying factors of this problem and points out the smoothness degree of soft labels for different classes significantly impacts the robust fairness from both empirical observation and theoretical analysis. Based on the above exploration, we propose Anti-Bias Soft Label Distillation (ABSLD) within the Knowledge Distillation framework to enhance the adversarial robust…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)
MethodsKnowledge Distillation
