TL;DR
This paper introduces a variational autoencoder-based method that utilizes both labeled and unlabeled data to improve fairness and stability in decision-making algorithms, addressing biases and data scarcity issues.
Contribution
It proposes a novel approach that leverages unlabeled data for fair decision-making, enhancing stability and fairness over existing methods that only use labeled data.
Findings
Converges to the optimal fair policy with low variance on synthetic data.
Achieves higher fairness and utility in real-world experiments.
Provides a more stable learning process compared to previous approaches.
Abstract
Decision making algorithms, in practice, are often trained on data that exhibits a variety of biases. Decision-makers often aim to take decisions based on some ground-truth target that is assumed or expected to be unbiased, i.e., equally distributed across socially salient groups. In many practical settings, the ground-truth cannot be directly observed, and instead, we have to rely on a biased proxy measure of the ground-truth, i.e., biased labels, in the data. In addition, data is often selectively labeled, i.e., even the biased labels are only observed for a small fraction of the data that received a positive decision. To overcome label and selection biases, recent work proposes to learn stochastic, exploring decision policies via i) online training of new policies at each time-step and ii) enforcing fairness as a constraint on performance. However, the existing approach uses only…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
