Distribution-Free Statistical Dispersion Control for Societal   Applications

Zhun Deng; Thomas P. Zollo; Jake C. Snell; Toniann Pitassi; Richard; Zemel

arXiv:2309.13786·cs.LG·March 7, 2024

Distribution-Free Statistical Dispersion Control for Societal Applications

Zhun Deng, Thomas P. Zollo, Jake C. Snell, Toniann Pitassi, Richard, Zemel

PDF

Open Access 1 Video

TL;DR

This paper introduces a new framework for controlling the dispersion of loss distributions in machine learning, ensuring fairness and reliability in societal applications without relying on specific data distributions.

Contribution

It proposes a flexible, distribution-free approach to controlling statistical dispersion measures, extending beyond previous methods focused on expected loss or prediction intervals.

Findings

01

Effective in toxic comment detection

02

Applicable to medical imaging

03

Useful for fair film recommendation

Abstract

Explicit finite-sample statistical guarantees on model performance are an important ingredient in responsible machine learning. Previous work has focused mainly on bounding either the expected loss of a predictor or the probability that an individual prediction will incur a loss value in a specified range. However, for many high-stakes applications, it is crucial to understand and control the dispersion of a loss distribution, or the extent to which different members of a population experience unequal effects of algorithmic decisions. We initiate the study of distribution-free control of statistical dispersion measures with societal implications and propose a simple yet flexible framework that allows us to handle a much richer class of statistical functionals beyond previous work. Our methods are verified through experiments in toxic comment detection, medical imaging, and film…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Distribution-Free Statistical Dispersion Control for Societal Applications· slideslive

Taxonomy

TopicsMisinformation and Its Impacts