Quantifying and mitigating the impact of label errors on model disparity metrics
Julius Adebayo, Melissa Hall, Bowen Yu, Bobbie Chern

TL;DR
This paper investigates how label errors in training and testing data influence model disparity metrics, revealing their sensitivity especially for minority groups, and proposes methods to identify and correct these errors to improve fairness.
Contribution
It introduces an influence estimation approach for training labels affecting disparity metrics and an automatic relabeling scheme to enhance model fairness.
Findings
Group calibration metrics are sensitive to label errors, especially for minority groups.
The proposed influence-based method improves identification of training inputs affecting disparity.
Automatic relabel-and-finetune scheme reduces group calibration error.
Abstract
Errors in labels obtained via human annotation adversely affect a model's performance. Existing approaches propose ways to mitigate the effect of label error on a model's downstream accuracy, yet little is known about its impact on a model's disparity metrics. Here we study the effect of label error on a model's disparity metrics. We empirically characterize how varying levels of label error, in both training and test data, affect these disparity metrics. We find that group calibration and other metrics are sensitive to train-time and test-time label error -- particularly for minority groups. This disparate effect persists even for models trained with noise-aware algorithms. To mitigate the impact of training-time label error, we present an approach to estimate the influence of a training input's label on a model's group disparity metric. We empirically assess the proposed approach on a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMachine Learning and Data Classification · Imbalanced Data Classification Techniques · Music and Audio Processing
