Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers

Christopher Boland; Sotirios Tsaftaris; Sonia Dahdouh

arXiv:2511.17421·cs.CV·November 25, 2025

Preventing Shortcut Learning in Medical Image Analysis through Intermediate Layer Knowledge Distillation from Specialist Teachers

Christopher Boland, Sotirios Tsaftaris, Sonia Dahdouh

PDF

Open Access

TL;DR

This paper introduces a knowledge distillation method from specialist teachers to prevent shortcut learning in medical image analysis, improving robustness and accuracy especially in out-of-distribution scenarios.

Contribution

It proposes a novel intermediate layer knowledge distillation framework targeting shortcut features, enhancing model robustness in medical imaging tasks.

Findings

01

Consistent performance improvements over traditional bias mitigation methods.

02

Achieves baseline-like accuracy on bias-free and out-of-distribution data.

03

Effective across multiple datasets and neural network architectures.

Abstract

Deep learning models are prone to learning shortcut solutions to problems using spuriously correlated yet irrelevant features of their training data. In high-risk applications such as medical image analysis, this phenomenon may prevent models from using clinically meaningful features when making predictions, potentially leading to poor robustness and harm to patients. We demonstrate that different types of shortcuts (those that are diffuse and spread throughout the image, as well as those that are localized to specific areas) manifest distinctly across network layers and can, therefore, be more effectively targeted through mitigation strategies that target the intermediate layers. We propose a novel knowledge distillation framework that leverages a teacher network fine-tuned on a small subset of task-relevant data to mitigate shortcut learning in a student network trained on a large…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Adversarial Robustness in Machine Learning · COVID-19 diagnosis using AI