Zero-Shot Cross-Lingual Sentiment Classification under Distribution   Shift: an Exploratory Study

Maarten De Raedt; Semere Kiros Bitew; Fr\'ederic Godin; Thomas; Demeester; Chris Develder

arXiv:2311.06549·cs.CL·November 14, 2023·1 cites

Zero-Shot Cross-Lingual Sentiment Classification under Distribution Shift: an Exploratory Study

Maarten De Raedt, Semere Kiros Bitew, Fr\'ederic Godin, Thomas, Demeester, Chris Develder

PDF

Open Access

TL;DR

This study investigates zero-shot cross-lingual sentiment classification under distribution shifts, analyzing the effects of language and domain changes, and proposes cost-effective methods to improve out-of-distribution generalization using large language models.

Contribution

It provides the first analysis of OOD generalization in multilingual models, evaluates the impact of counterfactual data, and introduces new LLM-based approaches that outperform CAD without costly annotations.

Findings

01

OOD performance declines with distribution shifts.

02

Counterfactuals from high-resource languages help low-resource languages.

03

Proposed LLM-based methods improve accuracy by up to 3.1%.

Abstract

The brittleness of finetuned language model performance on out-of-distribution (OOD) test samples in unseen domains has been well-studied for English, yet is unexplored for multi-lingual models. Therefore, we study generalization to OOD test data specifically in zero-shot cross-lingual transfer settings, analyzing performance impacts of both language and domain shifts between train and test data. We further assess the effectiveness of counterfactually augmented data (CAD) in improving OOD generalization for the cross-lingual setting, since CAD has been shown to benefit in a monolingual English setting. Finally, we propose two new approaches for OOD generalization that avoid the costly annotation process associated with CAD, by exploiting the power of recent large language models (LLMs). We experiment with 3 multilingual models, LaBSE, mBERT, and XLM-R trained on English IMDb movie…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Natural Language Processing Techniques · Speech Recognition and Synthesis

MethodsmBERT · Counterfactuals Explanations · XLM-R