Fairness Without Demographics in Repeated Loss Minimization

Tatsunori B. Hashimoto; Megha Srivastava; Hongseok Namkoong and; Percy Liang

arXiv:1806.08010·stat.ML·August 1, 2018·108 cites

Fairness Without Demographics in Repeated Loss Minimization

Tatsunori B. Hashimoto, Megha Srivastava, Hongseok Namkoong and, Percy Liang

PDF

Open Access 1 Repo

TL;DR

This paper addresses fairness in machine learning without using demographic data by proposing a distributionally robust optimization method that prevents disparity amplification over time, improving minority group outcomes.

Contribution

It introduces a DRO-based approach that controls minority risk without requiring demographic information, preventing fairness degradation over time.

Findings

01

DRO prevents disparity amplification where ERM fails

02

Improves minority group user satisfaction in text autocomplete

03

Controls risk of minority groups over time

Abstract

Machine learning models (e.g., speech recognizers) are usually trained to minimize average loss, which results in representation disparity---minority groups (e.g., non-native speakers) contribute less to the training objective and thus tend to suffer higher loss. Worse, as model accuracy affects user retention, a minority group can shrink over time. In this paper, we first show that the status quo of empirical risk minimization (ERM) amplifies representation disparity over time, which can even make initially fair models unfair. To mitigate this, we develop an approach based on distributionally robust optimization (DRO), which minimizes the worst case risk over all distributions close to the empirical distribution. We prove that this approach controls the risk of the minority group at each time step, in the spirit of Rawlsian distributive justice, while remaining oblivious to the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

https://worksheets.codalab.org/worksheets/0x17a501d37bbe49279b0c70ae10813f4c
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRetirement, Disability, and Employment · Global Health Care Issues · Advanced Causal Inference Techniques