Model Mis-specification and Algorithmic Bias

Runshan Fu; Yangfan Liang; Peter Zhang

arXiv:2105.15182·cs.LG·June 1, 2021·1 cites

Model Mis-specification and Algorithmic Bias

Runshan Fu, Yangfan Liang, Peter Zhang

PDF

Open Access

TL;DR

This paper investigates how model mis-specification can lead to significant algorithmic bias, even with unbiased data, by analyzing group-level prediction errors and their dependence on feature distribution moments.

Contribution

It provides a theoretical analysis of bias caused by model mis-specification, revealing how group errors can differ despite similar overall error levels.

Findings

01

Group-level errors can be large despite negligible population error

02

Bias can be maximized with errors of opposite signs for different groups

03

Errors depend on the first and second moments of feature distributions

Abstract

Machine learning algorithms are increasingly used to inform critical decisions. There is a growing concern about bias, that algorithms may produce uneven outcomes for individuals in different demographic groups. In this work, we measure bias as the difference between mean prediction errors across groups. We show that even with unbiased input data, when a model is mis-specified: (1) population-level mean prediction error can still be negligible, but group-level mean prediction errors can be large; (2) such errors are not equal across groups; and (3) the difference between errors, i.e., bias, can take the worst-case realization. That is, when there are two groups of the same size, mean prediction errors for these two groups have the same magnitude but opposite signs. In closed form, we show such errors and bias are functions of the first and second moments of the joint distribution of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsEthics and Social Impacts of AI · Explainable Artificial Intelligence (XAI) · Privacy-Preserving Technologies in Data