Don't Throw it Away! The Utility of Unlabeled Data in Fair Decision   Making

Miriam Rateike; Ayan Majumdar; Olga Mineeva; Krishna P. Gummadi,; Isabel Valera

arXiv:2205.04790·stat.ML·July 5, 2022

Don't Throw it Away! The Utility of Unlabeled Data in Fair Decision Making

Miriam Rateike, Ayan Majumdar, Olga Mineeva, Krishna P. Gummadi,, Isabel Valera

PDF

1 Repo

TL;DR

This paper introduces a variational autoencoder-based method that utilizes both labeled and unlabeled data to improve fairness and stability in decision-making algorithms, addressing biases and data scarcity issues.

Contribution

It proposes a novel approach that leverages unlabeled data for fair decision-making, enhancing stability and fairness over existing methods that only use labeled data.

Findings

01

Converges to the optimal fair policy with low variance on synthetic data.

02

Achieves higher fairness and utility in real-world experiments.

03

Provides a more stable learning process compared to previous approaches.

Abstract

Decision making algorithms, in practice, are often trained on data that exhibits a variety of biases. Decision-makers often aim to take decisions based on some ground-truth target that is assumed or expected to be unbiased, i.e., equally distributed across socially salient groups. In many practical settings, the ground-truth cannot be directly observed, and instead, we have to rely on a biased proxy measure of the ground-truth, i.e., biased labels, in the data. In addition, data is often selectively labeled, i.e., even the biased labels are only observed for a small fraction of the data that received a positive decision. To overcome label and selection biases, recent work proposes to learn stochastic, exploring decision policies via i) online training of new policies at each time-step and ii) enforcing fairness as a constraint on performance. However, the existing approach uses only…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

ayanmaj92/fairall
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.