Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious   Feature-Label Correlation

Yanrui Du; Jing Yan; Yan Chen; Jing Liu; Sendong Zhao; Qiaoqiao She,; Hua Wu; Haifeng Wang; Bing Qin

arXiv:2205.12593·cs.CL·June 23, 2023

Less Learn Shortcut: Analyzing and Mitigating Learning of Spurious Feature-Label Correlation

Yanrui Du, Jing Yan, Yan Chen, Jing Liu, Sendong Zhao, Qiaoqiao She,, Hua Wu, Haifeng Wang, Bing Qin

PDF

Open Access 1 Repo

TL;DR

This paper investigates how deep neural networks learn spurious word-label correlations due to dataset biases, and proposes a training strategy to reduce reliance on these shortcuts, improving robustness across tasks.

Contribution

The study introduces Less-Learn-Shortcut (LLS), a novel training method that quantifies and down-weights biased examples to mitigate shortcut learning in neural models.

Findings

01

LLS reduces model reliance on spurious correlations.

02

LLS improves performance on adversarial data.

03

LLS maintains in-domain accuracy.

Abstract

Recent research has revealed that deep neural networks often take dataset biases as a shortcut to make decisions rather than understand tasks, leading to failures in real-world applications. In this study, we focus on the spurious correlation between word features and labels that models learn from the biased data distribution of training data. In particular, we define the word highly co-occurring with a specific label as biased word, and the example containing biased word as biased example. Our analysis shows that biased examples are easier for models to learn, while at the time of prediction, biased words make a significantly higher contribution to the models' predictions, and models tend to assign predicted labels over-relying on the spurious correlation between words and labels. To mitigate models' over-reliance on the shortcut (i.e. spurious correlation), we propose a training…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

PaddlePaddle/TrustAI/tree/main/tutorials/data_bias_identification/less_learn_shortcut
paddleOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Adversarial Robustness in Machine Learning