Cross-lingual Transfer Learning for Check-worthy Claim Identification   over Twitter

Maram Hasanain; Tamer Elsayed

arXiv:2211.05087·cs.CL·November 10, 2022·1 cites

Cross-lingual Transfer Learning for Check-worthy Claim Identification over Twitter

Maram Hasanain, Tamer Elsayed

PDF

Open Access

TL;DR

This paper explores cross-lingual transfer learning using Multilingual BERT to identify check-worthy claims on Twitter across multiple languages, demonstrating that zero-shot transfer can be effective even with limited target language data.

Contribution

It systematically compares six cross-lingual approaches for claim verification in social media, showing zero-shot transfer can match or surpass monolingual models in some cases.

Findings

01

Zero-shot transfer performs comparably to monolingual models.

02

Some language pairs benefit significantly from cross-lingual transfer.

03

Cross-lingual methods outperform or match state-of-the-art models in certain languages.

Abstract

Misinformation spread over social media has become an undeniable infodemic. However, not all spreading claims are made equal. If propagated, some claims can be destructive, not only on the individual level, but to organizations and even countries. Detecting claims that should be prioritized for fact-checking is considered the first step to fight against spread of fake news. With training data limited to a handful of languages, developing supervised models to tackle the problem over lower-resource languages is currently infeasible. Therefore, our work aims to investigate whether we can use existing datasets to train models for predicting worthiness of verification of claims in tweets in other languages. We present a systematic comparative study of six approaches for cross-lingual check-worthiness estimation across pairs of five diverse languages with the help of Multilingual BERT (mBERT)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Hate Speech and Cyberbullying Detection · Topic Modeling

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Layer Normalization · Residual Connection · Dropout · WordPiece · Attention Dropout · Weight Decay · Dense Connections