Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data   and Bayesian Inference

Disi Ji; Padhraic Smyth; Mark Steyvers

arXiv:2010.09851·stat.ML·October 21, 2020·22 cites

Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference

Disi Ji, Padhraic Smyth, Mark Steyvers

PDF

Open Access 1 Video

TL;DR

This paper introduces a Bayesian framework that leverages unlabeled data to improve the accuracy and confidence in fairness metric estimation, especially when labeled data is scarce.

Contribution

It presents a novel hierarchical Bayesian model that combines labeled and unlabeled data to produce more reliable fairness assessments with quantified uncertainty.

Findings

01

Significant reduction in estimation error across multiple datasets

02

Improved fairness metric estimates with lower variance

03

Effective use of unlabeled data enhances fairness evaluation

Abstract

We investigate the problem of reliably assessing group fairness when labeled examples are few but unlabeled examples are plentiful. We propose a general Bayesian framework that can augment labeled data with unlabeled data to produce more accurate and lower-variance estimates compared to methods based on labeled data alone. Our approach estimates calibrated scores for unlabeled examples in each group using a hierarchical latent variable model conditioned on labeled examples. This in turn allows for inference of posterior distributions with associated notions of uncertainty for a variety of group fairness metrics. We demonstrate that our approach leads to significant and consistent reductions in estimation error across multiple well-known fairness datasets, sensitive attributes, and predictive models. The results show the benefits of using both unlabeled data and Bayesian inference in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Can I Trust My Fairness Metric? Assessing Fairness with Unlabeled Data and Bayesian Inference· slideslive

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Adversarial Robustness in Machine Learning