Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers
Christopher Boland, Sotirios Tsaftaris, Sonia Dahdouh

TL;DR
This paper introduces a knowledge distillation method from specialist teachers to prevent shortcut learning in medical image analysis, improving robustness and accuracy especially in out-of-distribution scenarios.
Contribution
It proposes a novel intermediate layer knowledge distillation framework targeting shortcut features, enhancing model robustness in medical imaging tasks.
Findings
Consistent performance improvements over traditional bias mitigation methods.
Achieves baseline-like accuracy on bias-free and out-of-distribution data.
Effective across multiple datasets and neural network architectures.
Abstract
Deep learning models are prone to learning shortcut solutions to problems using spuriously correlated yet irrelevant features of their training data. In high-risk applications such as medical image analysis, this phenomenon may prevent models from using clinically meaningful features when making predictions, potentially leading to poor robustness and harm to patients. We demonstrate that different types of shortcuts (those that are diffuse and spread throughout the image, as well as those that are localized to specific areas) manifest distinctly across network layers and can, therefore, be more effectively targeted through mitigation strategies that target the intermediate layers. We propose a novel knowledge distillation framework that leverages a teacher network fine-tuned on a small subset of task-relevant data to mitigate shortcut learning in a student network trained on a large…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · COVID-19 diagnosis using AI
