Mitigating Shortcut Learning with InterpoLated Learning

Michalis Korakakis; Andreas Vlachos; Adrian Weller

arXiv:2507.05527·cs.LG·July 9, 2025

Mitigating Shortcut Learning with InterpoLated Learning

Michalis Korakakis, Andreas Vlachos, Adrian Weller

PDF

Open Access 1 Video

TL;DR

InterpoLated Learning (InterpoLL) is a novel method that interpolates representations to reduce shortcut reliance, improving minority group generalization in NLP tasks without sacrificing overall accuracy.

Contribution

This paper introduces InterpoLated Learning, a representation interpolation technique that mitigates shortcuts and enhances minority group generalization across various NLP architectures.

Findings

01

InterpoLL outperforms ERM and existing shortcut mitigation methods.

02

It improves minority generalization without harming majority accuracy.

03

Effective across multiple NLP model architectures.

Abstract

Empirical risk minimization (ERM) incentivizes models to exploit shortcuts, i.e., spurious correlations between input attributes and labels that are prevalent in the majority of the training data but unrelated to the task at hand. This reliance hinders generalization on minority examples, where such correlations do not hold. Existing shortcut mitigation approaches are model-specific, difficult to tune, computationally expensive, and fail to improve learned representations. To address these issues, we propose InterpoLated Learning (InterpoLL) which interpolates the representations of majority examples to include features from intra-class minority examples with shortcut-mitigating patterns. This weakens shortcut influence, enabling models to acquire features predictive across both minority and majority examples. Experimental results on multiple natural language understanding tasks…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Mitigating Shortcut Learning with InterpoLated Learning· underline

Taxonomy

TopicsTopic Modeling · Domain Adaptation and Few-Shot Learning · Machine Learning and Data Classification