Debiasing classifiers: is reality at variance with expectation?

Ashrya Agrawal; Florian Pfisterer; Bernd Bischl; Francois; Buet-Golfouse; Srijan Sood; Jiahao Chen; Sameena Shah; Sebastian; Vollmer

arXiv:2011.02407·cs.LG·June 1, 2021

Debiasing classifiers: is reality at variance with expectation?

Ashrya Agrawal, Florian Pfisterer, Bernd Bischl, Francois, Buet-Golfouse, Srijan Sood, Jiahao Chen, Sameena Shah, Sebastian, Vollmer

PDF

TL;DR

This paper empirically investigates debiasing classifiers, revealing that they often fail to generalize and can worsen fairness, due to bias-variance trade-offs and the influence of base rates, emphasizing the need for extensive validation.

Contribution

It provides a rigorous empirical analysis of debiasing methods, highlighting their limitations and explaining failures through bias-variance trade-offs and base rate effects.

Findings

01

Debiasing methods often do not generalize out-of-sample.

02

Imposing fairness constraints increases variance, affecting performance.

03

Partial debiasing can improve out-of-sample results due to bias-variance trade-offs.

Abstract

We present an empirical study of debiasing methods for classifiers, showing that debiasers often fail in practice to generalize out-of-sample, and can in fact make fairness worse rather than better. A rigorous evaluation of the debiasing treatment effect requires extensive cross-validation beyond what is usually done. We demonstrate that this phenomenon can be explained as a consequence of bias-variance trade-off, with an increase in variance necessitated by imposing a fairness constraint. Follow-up experiments validate the theoretical prediction that the estimation variance depends strongly on the base rates of the protected class. Considering fairness--performance trade-offs justifies the counterintuitive notion that partial debiasing can actually yield better results in practice on out-of-sample data.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.