Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation

Shiji Zhao; Chi Chen; Ranjie Duan; Xizhe Wang; Xingxing Wei

arXiv:2506.08611·cs.CV·June 11, 2025

Towards Class-wise Fair Adversarial Training via Anti-Bias Soft Label Distillation

Shiji Zhao, Chi Chen, Ranjie Duan, Xizhe Wang, Xingxing Wei

PDF

Open Access 1 Repo

TL;DR

This paper introduces Anti-Bias Soft Label Distillation (ABSLD), a novel method that improves adversarial robustness fairness across classes by adaptively adjusting soft label smoothness during training, outperforming existing methods.

Contribution

The paper proposes ABSLD, a new knowledge distillation approach that enhances adversarial robustness fairness by class-wise soft label adjustment, supported by empirical and theoretical analysis.

Findings

01

ABSLD improves robustness and fairness simultaneously.

02

Class-wise soft label smoothness significantly impacts robustness fairness.

03

ABSLD outperforms state-of-the-art methods in experiments.

Abstract

Adversarial Training (AT) is widely recognized as an effective approach to enhance the adversarial robustness of Deep Neural Networks. As a variant of AT, Adversarial Robustness Distillation (ARD) has shown outstanding performance in enhancing the robustness of small models. However, both AT and ARD face robust fairness issue: these models tend to display strong adversarial robustness against some classes (easy classes) while demonstrating weak adversarial robustness against others (hard classes). This paper explores the underlying factors of this problem and points out the smoothness degree of soft labels for different classes significantly impacts the robust fairness from both empirical observation and theoretical analysis. Based on the above exploration, we propose Anti-Bias Soft Label Distillation (ABSLD) within the Knowledge Distillation framework to enhance the adversarial robust…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

zhaoshiji123/absld
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Ethics and Social Impacts of AI · Explainable Artificial Intelligence (XAI)

MethodsKnowledge Distillation