Training with Explanations Alone: A New Paradigm to Prevent Shortcut Learning
Pedro R. A. S. Bassi, Haydr A. H. Ali, Andrea Cavalli, Sergio Decherchi

TL;DR
This paper introduces Training with Explanations Alone (TEA), a novel method where a student model learns to ignore biases by matching explanation heatmaps from a teacher, improving bias resistance and generalization in AI models.
Contribution
The paper proposes TEA, a new training paradigm that uses explanation heatmaps for bias mitigation without needing a segmenter or direct output loss.
Findings
TEA outperforms 14 state-of-the-art bias mitigation methods.
TEA improves generalization to unseen hospitals and datasets.
TEA effectively reduces background and foreground bias influence.
Abstract
Application of Artificial Intelligence (AI) in critical domains, like the medical one, is often hampered by shortcut learning, which hinders AI generalization to diverse hospitals and patients. Shortcut learning can be caused, for example, by background biases -- features in image backgrounds that are spuriously correlated to classification labels (e.g., words in X-rays). To mitigate the influence of image background and foreground bias on AI, we introduce a new training paradigm, dubbed Training with Explanations Alone (TEA). TEA trains a classifier (TEA student) only by making its explanation heatmaps match target heatmaps from a larger teacher model. By learning from its explanation heatmaps, the TEA student pays attention to the same image features as the teacher. For example, a teacher uses a large segmenter to remove image backgrounds before classification, thus ignoring…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEducation and Critical Thinking Development
