Robustifying Sentiment Classification by Maximally Exploiting Few   Counterfactuals

Maarten De Raedt; Fr\'ederic Godin; Chris Develder; Thomas Demeester

arXiv:2210.11805·cs.CL·October 24, 2022

Robustifying Sentiment Classification by Maximally Exploiting Few Counterfactuals

Maarten De Raedt, Fr\'ederic Godin, Chris Develder, Thomas Demeester

PDF

Open Access 1 Repo

TL;DR

This paper introduces a method to improve sentiment classification robustness by automatically generating counterfactual samples from a small manually annotated subset, significantly enhancing out-of-distribution performance.

Contribution

The paper presents a novel approach that leverages automatic counterfactual generation in encoding space, reducing manual annotation needs while improving model robustness.

Findings

01

Achieved +3% accuracy with only 1% manual counterfactuals

02

Outperformed methods using 100% in-distribution data

03

Enhanced OOD test performance on multiple datasets

Abstract

For text classification tasks, finetuned language models perform remarkably well. Yet, they tend to rely on spurious patterns in training data, thus limiting their performance on out-of-distribution (OOD) test data. Among recent models aiming to avoid this spurious pattern problem, adding extra counterfactual samples to the training data has proven to be very effective. Yet, counterfactual data generation is costly since it relies on human annotation. Thus, we propose a novel solution that only requires annotation of a small fraction (e.g., 1%) of the original training data, and uses automatic generation of extra counterfactuals in an encoding vector space. We demonstrate the effectiveness of our approach in sentiment classification, using IMDb data for training and other sets for OOD tests (i.e., Amazon, SemEval and Yelp). We achieve noticeable accuracy improvements by adding only 1%…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maarten-deraedt/emnlp2022-robustifying-sentiment-classification
noneOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining

MethodsCounterfactuals Explanations · Test