TRIX- Trading Adversarial Fairness via Mixed Adversarial Training

Tejaswini Medi; Steffen Jung; Margret Keuper

arXiv:2507.07768·cs.LG·July 11, 2025

TRIX- Trading Adversarial Fairness via Mixed Adversarial Training

Tejaswini Medi, Steffen Jung, Margret Keuper

PDF

Open Access

TL;DR

TRIX introduces a class-aware adversarial training method that adaptively applies different attack strengths to strong and weak classes, significantly improving fairness and robustness in image classification.

Contribution

The paper proposes TRIX, a novel adversarial training framework that dynamically adjusts attack strategies per class to enhance fairness and robustness against adversarial attacks.

Findings

01

Improves worst-case class accuracy on clean and adversarial data.

02

Reduces inter-class robustness disparities.

03

Maintains overall accuracy while enhancing fairness.

Abstract

Adversarial Training (AT) is a widely adopted defense against adversarial examples. However, existing approaches typically apply a uniform training objective across all classes, overlooking disparities in class-wise vulnerability. This results in adversarial unfairness: classes with well distinguishable features (strong classes) tend to become more robust, while classes with overlapping or shared features(weak classes) remain disproportionately susceptible to adversarial attacks. We observe that strong classes do not require strong adversaries during training, as their non-robust features are quickly suppressed. In contrast, weak classes benefit from stronger adversaries to effectively reduce their vulnerabilities. Motivated by this, we introduce TRIX, a feature-aware adversarial training framework that adaptively assigns weaker targeted adversaries to strong classes, promoting feature…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Domain Adaptation and Few-Shot Learning · Ethics and Social Impacts of AI