Automatic Counterfactual Augmentation for Robust Text Classification Based on Word-Group Search
Rui Song, Fausto Giunchiglia, Yingji Li, Hao Xu

TL;DR
This paper introduces a novel word-group based counterfactual augmentation method to improve the robustness of text classification models by mitigating shortcut learning and emphasizing causal features.
Contribution
It proposes a new word-group mining approach using beam search and a counterfactual augmentation technique with adaptive voting, addressing limitations of previous single-word focus methods.
Findings
Enhanced robustness against shortcut learning in text classification
Improved performance on cross-domain and fairness tasks
Effective identification of causal word-groups
Abstract
Despite large-scale pre-trained language models have achieved striking results for text classificaion, recent work has raised concerns about the challenge of shortcut learning. In general, a keyword is regarded as a shortcut if it creates a superficial association with the label, resulting in a false prediction. Conversely, shortcut learning can be mitigated if the model relies on robust causal features that help produce sound predictions. To this end, many studies have explored post-hoc interpretable methods to mine shortcuts and causal features for robustness and generalization. However, most existing methods focus only on single word in a sentence and lack consideration of word-group, leading to wrong causal features. To solve this problem, we propose a new Word-Group mining approach, which captures the causal effect of any keyword combination and orders the combinations that most…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Text and Document Classification Technologies
MethodsFocus
