Exploring the interplay of label bias with subgroup size and separability: A case study in mammographic density classification
Emma A.M. Stanley, Raghav Mehta, M\'elanie Roschewitz, Nils D. Forkert, Ben Glocker

TL;DR
This study examines how label bias in medical imaging datasets affects deep learning models, revealing that subgroup size and separability significantly influence learned features and fairness outcomes.
Contribution
It provides the first detailed analysis of how label bias impacts feature representations and subgroup performance in medical imaging AI.
Findings
Label bias causes significant shifts in learned feature representations.
Subgroup size and separability affect the extent of bias impact.
Validation set label quality influences subgroup performance metrics.
Abstract
Systematic mislabelling affecting specific subgroups (i.e., label bias) in medical imaging datasets represents an understudied issue concerning the fairness of medical AI systems. In this work, we investigated how size and separability of subgroups affected by label bias influence the learned features and performance of a deep learning model. Therefore, we trained deep learning models for binary tissue density classification using the EMory BrEast imaging Dataset (EMBED), where label bias affected separable subgroups (based on imaging manufacturer) or non-separable "pseudo-subgroups". We found that simulated subgroup label bias led to prominent shifts in the learned feature representations of the models. Importantly, these shifts within the feature space were dependent on both the relative size and the separability of the subgroup affected by label bias. We also observed notable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGlobal Cancer Incidence and Screening
