Few-Shot Cross-Lingual Stance Detection with Sentiment-Based   Pre-Training

Momchil Hardalov; Arnav Arora; Preslav Nakov; Isabelle Augenstein

arXiv:2109.06050·cs.CL·December 22, 2021

Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training

Momchil Hardalov, Arnav Arora, Preslav Nakov, Isabelle Augenstein

PDF

Open Access 1 Repo 1 Video

TL;DR

This paper introduces a sentiment-based pre-training approach for cross-lingual stance detection, demonstrating significant improvements in low-resource settings across diverse languages and datasets.

Contribution

It proposes a novel sentiment-based data generation method and a label encoder to enhance cross-lingual stance detection with limited labeled data.

Findings

01

Sentiment-based pre-training improves F1 scores by over 6% in low-resource settings.

02

The approach is effective across 12 languages and 15 datasets.

03

The proposed method outperforms several strong baselines.

Abstract

The goal of stance detection is to determine the viewpoint expressed in a piece of text towards a target. These viewpoints or contexts are often expressed in many different languages depending on the user and the platform, which can be a local news outlet, a social media platform, a news forum, etc. Most research in stance detection, however, has been limited to working with a single language and on a few limited targets, with little work on cross-lingual stance detection. Moreover, non-English sources of labelled data are often scarce and present additional challenges. Recently, large multilingual language models have substantially improved the performance on many non-English tasks, especially such with limited numbers of examples. This highlights the importance of model pre-training and its ability to learn from few examples. In this paper, we present the most comprehensive study of…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

checkstep/senti-stance
pytorchOfficial

Videos

Few-Shot Cross-Lingual Stance Detection with Sentiment-Based Pre-Training· underline

Taxonomy

TopicsSentiment Analysis and Opinion Mining · Topic Modeling · Text and Document Classification Technologies