XFEVER: Exploring Fact Verification across Languages
Yi-Chen Chang, Canasai Kruengkrai, Junichi Yamagishi

TL;DR
This paper presents XFEVER, a multilingual fact verification dataset derived from FEVER, enabling cross-lingual verification research, and demonstrates baseline models' effectiveness and limitations across languages.
Contribution
Introduces the XFEVER dataset for cross-lingual fact verification, along with baseline models for zero-shot and translate-train scenarios.
Findings
Multilingual models can perform fact verification across languages.
Performance varies significantly by language.
Model calibration can be improved by analyzing prediction similarities.
Abstract
This paper introduces the Cross-lingual Fact Extraction and VERification (XFEVER) dataset designed for benchmarking the fact verification models across different languages. We constructed it by translating the claim and evidence texts of the Fact Extraction and VERification (FEVER) dataset into six languages. The training and development sets were translated using machine translation, whereas the test set includes texts translated by professional translators and machine-translated texts. Using the XFEVER dataset, two cross-lingual fact verification scenarios, zero-shot learning and translate-train learning, are defined, and baseline models for each scenario are also proposed in this paper. Experimental results show that the multilingual language model can be used to build fact verification models in different languages efficiently. However, the performance varies by language and is…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Computational and Text Analysis Methods · Biomedical Text Mining and Ontologies
MethodsSparse Evolutionary Training
