Improving Fair Training under Correlation Shifts
Yuji Roh, Kangwook Lee, Steven Euijong Whang, Changho Suh

TL;DR
This paper addresses the challenge of maintaining fairness in AI models under correlation shifts between labels and sensitive groups, proposing a pre-processing method to improve fairness and accuracy when data distributions change.
Contribution
It introduces the concept of correlation shifts and a pre-processing technique to mitigate their impact, enhancing existing fairness algorithms' effectiveness.
Findings
Pre-processing reduces correlation shifts effectively.
Improved fairness and accuracy on synthetic and real datasets.
Decouples pre-processing from in-processing fairness methods.
Abstract
Model fairness is an essential element for Trustworthy AI. While many techniques for model fairness have been proposed, most of them assume that the training and deployment data distributions are identical, which is often not true in practice. In particular, when the bias between labels and sensitive groups changes, the fairness of the trained model is directly influenced and can worsen. We make two contributions for solving this problem. First, we analytically show that existing in-processing fair algorithms have fundamental limits in accuracy and group fairness. We introduce the notion of correlation shifts, which can explicitly capture the change of the above bias. Second, we propose a novel pre-processing step that samples the input data to reduce correlation shifts and thus enables the in-processing approaches to overcome their limitations. We formulate an optimization problem for…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI) · Privacy-Preserving Technologies in Data
