TL;DR
This study systematically evaluates feature disentanglement methods to reduce shortcut learning in medical imaging, improving model robustness and generalization across confounded datasets.
Contribution
It provides a comprehensive benchmark of disentanglement techniques, comparing their effectiveness and efficiency in mitigating shortcuts in medical imaging models.
Findings
Disentanglement methods improve classification under strong spurious correlations.
Latent space analysis reveals differences in representation quality.
Combining data rebalancing with disentanglement yields the best results.
Abstract
Although deep learning models in medical imaging often achieve excellent classification performance, they can rely on shortcut learning, exploiting spurious correlations or confounding factors that are not causally related to the target task. This poses risks in clinical settings, where models must generalize across institutions, populations, and acquisition conditions. Feature disentanglement is a promising approach to mitigate shortcut learning by separating task-relevant information from confounder-related features in latent representations. In this study, we systematically evaluated feature disentanglement methods for mitigating shortcuts in medical imaging, including adversarial learning and latent space splitting based on dependence minimization. We assessed classification performance and disentanglement quality using latent space analyses across one artificial and two medical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
