Mitigating Shortcuts in Language Models with Soft Label Encoding
Zirui He, Huiqi Deng, Haiyan Zhao, Ninghao Liu, Mengnan Du

TL;DR
This paper introduces Soft Label Encoding (SoftLE), a debiasing method that reduces reliance on spurious correlations in language models by smoothing labels based on shortcut reliance, improving out-of-distribution generalization.
Contribution
The paper proposes a novel SoftLE framework that encodes shortcut reliance into soft labels, enhancing model robustness against spurious correlations in NLU tasks.
Findings
SoftLE improves out-of-distribution generalization significantly.
SoftLE maintains competitive in-distribution accuracy.
Extensive experiments validate SoftLE's effectiveness on benchmark tasks.
Abstract
Recent research has shown that large language models rely on spurious correlations in the data for natural language understanding (NLU) tasks. In this work, we aim to answer the following research question: Can we reduce spurious correlations by modifying the ground truth labels of the training data? Specifically, we propose a simple yet effective debiasing framework, named Soft Label Encoding (SoftLE). We first train a teacher model with hard labels to determine each sample's degree of relying on shortcuts. We then add one dummy class to encode the shortcut degree, which is used to smooth other dimensions in the ground truth label to generate soft labels. This new ground truth label is used to train a more robust student model. Extensive experiments on two NLU benchmark tasks demonstrate that SoftLE significantly improves out-of-distribution generalization while maintaining…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis
