X-Stance: A Multilingual Multi-Target Dataset for Stance Detection
Jannis Vamvas, Rico Sennrich

TL;DR
This paper introduces X-Stance, a large multilingual dataset for stance detection across three languages, enabling cross-lingual and cross-target transfer learning, and demonstrates baseline results with multilingual BERT.
Contribution
It provides a new multilingual, multi-target stance detection dataset and explores zero-shot transfer learning across languages and issues.
Findings
Moderate success in zero-shot cross-lingual transfer
Effective training of a single model on multiple issues
Baseline results establish a benchmark for future work
Abstract
We extract a large-scale stance detection dataset from comments written by candidates of elections in Switzerland. The dataset consists of German, French and Italian text, allowing for a cross-lingual evaluation of stance detection. It contains 67 000 comments on more than 150 political issues (targets). Unlike stance detection models that have specific target issues, we use the dataset to train a single model on all the issues. To make learning across targets possible, we prepend to each instance a natural question that represents the target (e.g. "Do you support X?"). Baseline results from multilingual BERT show that zero-shot cross-lingual and cross-target transfer of stance detection is moderately successful with this approach.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Natural Language Processing Techniques · Sentiment Analysis and Opinion Mining
MethodsLinear Layer · Residual Connection · Attention Dropout · Linear Warmup With Linear Decay · Weight Decay · Refunds@Expedia|||How do I get a full refund from Expedia? · Dense Connections · Adam · WordPiece · Softmax
