Fighting Spurious Correlations in Text Classification via a Causal   Learning Perspective

Yuqing Zhou; Ziwei Zhu

arXiv:2411.01045·cs.LG·February 4, 2025

Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective

Yuqing Zhou, Ziwei Zhu

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces CCR, a causal learning-based method for text classification that reduces reliance on spurious correlations, thereby enhancing robustness and generalization, especially in out-of-distribution scenarios.

Contribution

The paper proposes a novel causal feature selection and weighting approach, improving robustness without requiring group labels, and provides theoretical and empirical validation.

Findings

01

CCR achieves state-of-the-art results without group labels.

02

CCR outperforms existing methods on robustness metrics.

03

In some cases, CCR rivals models with group labels.

Abstract

In text classification tasks, models often rely on spurious correlations for predictions, incorrectly associating irrelevant features with the target labels. This issue limits the robustness and generalization of models, especially when faced with out-of-distribution data where such spurious correlations no longer hold. To address this challenge, we propose the Causally Calibrated Robust Classifier (CCR), which aims to reduce models' reliance on spurious correlations and improve model robustness. Our approach integrates a causal feature selection method based on counterfactual reasoning, along with an unbiased inverse propensity weighting (IPW) loss function. By focusing on selecting causal features, we ensure that the model relies less on spurious features during prediction. We theoretically justify our approach and empirically show that CCR achieves state-of-the-art performance among…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

yuqing-zhou/causal-learning-for-robust-classifier
pytorchOfficial

Videos

Fighting Spurious Correlations in Text Classification via a Causal Learning Perspective· underline

Taxonomy

TopicsText and Document Classification Technologies

MethodsFeature Selection