Towards Fair Class-wise Robustness: Class Optimal Distribution Adversarial Training
Hongxin Zhi, Hongtao Yu, Shaome Li, Xiuming Zhao, Yiteng Wu

TL;DR
This paper introduces CODAT, a novel adversarial training framework that optimizes class-wise robustness fairly by leveraging distributionally robust optimization with theoretical guarantees, improving robustness and fairness.
Contribution
It proposes a min-max training framework with a closed-form solution for class-wise weight optimization, ensuring consistent joint optimization of weights and model parameters.
Findings
Improves robust fairness across classes in neural networks.
Outperforms state-of-the-art methods in robustness and fairness metrics.
Provides theoretical guarantees for class weight optimization.
Abstract
Adversarial training has proven to be a highly effective method for improving the robustness of deep neural networks against adversarial attacks. Nonetheless, it has been observed to exhibit a limitation in terms of robust fairness, characterized by a significant disparity in robustness across different classes. Recent efforts to mitigate this problem have turned to class-wise reweighted methods. However, these methods suffer from a lack of rigorous theoretical analysis and are limited in their exploration of the weight space, as they mainly rely on existing heuristic algorithms or intuition to compute weights. In addition, these methods fail to guarantee the consistency of the optimization direction due to the decoupled optimization of weights and the model parameters. They potentially lead to suboptimal weight assignments and consequently, a suboptimal model. To address these…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Advanced Statistical Methods and Models · Anomaly Detection Techniques and Applications
