Exploring the interplay of label bias with subgroup size and separability: A case study in mammographic density classification

Emma A.M. Stanley; Raghav Mehta; M\'elanie Roschewitz; Nils D. Forkert; Ben Glocker

arXiv:2507.17996·cs.CV·July 25, 2025

Exploring the interplay of label bias with subgroup size and separability: A case study in mammographic density classification

Emma A.M. Stanley, Raghav Mehta, M\'elanie Roschewitz, Nils D. Forkert, Ben Glocker

PDF

Open Access

TL;DR

This study examines how label bias in medical imaging datasets affects deep learning models, revealing that subgroup size and separability significantly influence learned features and fairness outcomes.

Contribution

It provides the first detailed analysis of how label bias impacts feature representations and subgroup performance in medical imaging AI.

Findings

01

Label bias causes significant shifts in learned feature representations.

02

Subgroup size and separability affect the extent of bias impact.

03

Validation set label quality influences subgroup performance metrics.

Abstract

Systematic mislabelling affecting specific subgroups (i.e., label bias) in medical imaging datasets represents an understudied issue concerning the fairness of medical AI systems. In this work, we investigated how size and separability of subgroups affected by label bias influence the learned features and performance of a deep learning model. Therefore, we trained deep learning models for binary tissue density classification using the EMory BrEast imaging Dataset (EMBED), where label bias affected separable subgroups (based on imaging manufacturer) or non-separable "pseudo-subgroups". We found that simulated subgroup label bias led to prominent shifts in the learned feature representations of the models. Importantly, these shifts within the feature space were dependent on both the relative size and the separability of the subgroup affected by label bias. We also observed notable…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGlobal Cancer Incidence and Screening