Mitigating Shortcuts in Language Models with Soft Label Encoding

Zirui He; Huiqi Deng; Haiyan Zhao; Ninghao Liu; Mengnan Du

arXiv:2309.09380·cs.CL·September 19, 2023·2 cites

Mitigating Shortcuts in Language Models with Soft Label Encoding

Zirui He, Huiqi Deng, Haiyan Zhao, Ninghao Liu, Mengnan Du

PDF

Open Access

TL;DR

This paper introduces Soft Label Encoding (SoftLE), a debiasing method that reduces reliance on spurious correlations in language models by smoothing labels based on shortcut reliance, improving out-of-distribution generalization.

Contribution

The paper proposes a novel SoftLE framework that encodes shortcut reliance into soft labels, enhancing model robustness against spurious correlations in NLU tasks.

Findings

01

SoftLE improves out-of-distribution generalization significantly.

02

SoftLE maintains competitive in-distribution accuracy.

03

Extensive experiments validate SoftLE's effectiveness on benchmark tasks.

Abstract

Recent research has shown that large language models rely on spurious correlations in the data for natural language understanding (NLU) tasks. In this work, we aim to answer the following research question: Can we reduce spurious correlations by modifying the ground truth labels of the training data? Specifically, we propose a simple yet effective debiasing framework, named Soft Label Encoding (SoftLE). We first train a teacher model with hard labels to determine each sample's degree of relying on shortcuts. We then add one dummy class to encode the shortcut degree, which is used to smooth other dimensions in the ground truth label to generate soft labels. This new ground truth label is used to train a more robust student model. Extensive experiments on two NLU benchmark tasks demonstrate that SoftLE significantly improves out-of-distribution generalization while maintaining…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis