Unlock the Potential of Counterfactually-Augmented Data in   Out-Of-Distribution Generalization

Caoyun Fan; Wenqing Chen; Jidong Tian; Yitian Li; Hao He; Yaohui Jin

arXiv:2310.06666·cs.CL·October 11, 2023

Unlock the Potential of Counterfactually-Augmented Data in Out-Of-Distribution Generalization

Caoyun Fan, Wenqing Chen, Jidong Tian, Yitian Li, Hao He, Yaohui Jin

PDF

Open Access

TL;DR

This paper enhances the effectiveness of Counterfactually-Augmented Data (CAD) for out-of-distribution generalization in language models by addressing the myopia phenomenon through additional constraints, leading to improved performance on sentiment analysis and natural language inference tasks.

Contribution

It introduces two constraints based on CAD's structural properties to mitigate the myopia phenomenon, enabling models to extract more complete causal features and improve OOD generalization.

Findings

01

Improved OOD performance by 1.0% to 5.9% on two tasks.

02

Identified the myopia phenomenon as a key issue in CAD effectiveness.

03

Proposed constraints effectively mitigate myopia and enhance causal feature extraction.

Abstract

Counterfactually-Augmented Data (CAD) -- minimal editing of sentences to flip the corresponding labels -- has the potential to improve the Out-Of-Distribution (OOD) generalization capability of language models, as CAD induces language models to exploit domain-independent causal features and exclude spurious correlations. However, the empirical results of CAD's OOD generalization are not as efficient as anticipated. In this study, we attribute the inefficiency to the myopia phenomenon caused by CAD: language models only focus on causal features that are edited in the augmentation operation and exclude other non-edited causal features. Therefore, the potential of CAD is not fully exploited. To address this issue, we analyze the myopia phenomenon in feature space from the perspective of Fisher's Linear Discriminant, then we introduce two additional constraints based on CAD's structural…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Explainable Artificial Intelligence (XAI) · Machine Learning in Healthcare

MethodsFLIP · Focus