Adapting Fairness Interventions to Missing Values
Raymond Feng, Flavio P. Calmon, Hao Wang

TL;DR
This paper investigates how missing data impacts algorithmic fairness, showing that imputation can worsen fairness and accuracy, and proposes adaptive algorithms that improve fairness outcomes in the presence of missing values.
Contribution
The paper introduces scalable adaptive algorithms for fair classification with missing data, preserving missing pattern information and outperforming traditional impute-then-classify methods.
Findings
Imputing data can significantly reduce group fairness and accuracy.
Adaptive algorithms improve fairness and accuracy across datasets.
Handling missing patterns explicitly enhances fairness interventions.
Abstract
Missing values in real-world data pose a significant and unique challenge to algorithmic fairness. Different demographic groups may be unequally affected by missing data, and the standard procedure for handling missing values where first data is imputed, then the imputed data is used for classification -- a procedure referred to as "impute-then-classify" -- can exacerbate discrimination. In this paper, we analyze how missing values affect algorithmic fairness. We first prove that training a classifier from imputed data can significantly worsen the achievable values of group fairness and average accuracy. This is because imputing data results in the loss of the missing pattern of the data, which often conveys information about the predictive label. We present scalable and adaptive algorithms for fair classification with missing values. These algorithms can be combined with any…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsInsurance, Mortality, Demography, Risk Management
