De-biased Representation Learning for Fairness with Unreliable Labels
Yixuan Zhang, Feng Zhou, Zhidong Li, Yang Wang, Fang Chen

TL;DR
This paper introduces DBRF, a novel framework for fair representation learning that disentangles sensitive information from non-sensitive attributes using an information-theoretic approach, aiming to predict ideal fair labels from unreliable observed labels.
Contribution
The paper proposes a de-biased representation learning method that leverages mutual information and information bottleneck to learn fair representations aligned with latent ideal labels, despite unreliable observed labels.
Findings
DBRF effectively disentangles sensitive information from non-sensitive attributes.
The method predicts ideal fair labels better than existing approaches.
Experimental results validate the effectiveness of DBRF on synthetic and real-world data.
Abstract
Removing bias while keeping all task-relevant information is challenging for fair representation learning methods since they would yield random or degenerate representations w.r.t. labels when the sensitive attributes correlate with labels. Existing works proposed to inject the label information into the learning procedure to overcome such issues. However, the assumption that the observed labels are clean is not always met. In fact, label bias is acknowledged as the primary source inducing discrimination. In other words, the fair pre-processing methods ignore the discrimination encoded in the labels either during the learning procedure or the evaluation stage. This contradiction puts a question mark on the fairness of the learned representations. To circumvent this issue, we explore the following question: \emph{Can we learn fair representations predictable to latent ideal fair labels…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning
