XFEVER: Exploring Fact Verification across Languages

Yi-Chen Chang; Canasai Kruengkrai; Junichi Yamagishi

arXiv:2310.16278·cs.CL·October 26, 2023·2 cites

XFEVER: Exploring Fact Verification across Languages

Yi-Chen Chang, Canasai Kruengkrai, Junichi Yamagishi

PDF

Open Access 1 Repo

TL;DR

This paper presents XFEVER, a multilingual fact verification dataset derived from FEVER, enabling cross-lingual verification research, and demonstrates baseline models' effectiveness and limitations across languages.

Contribution

Introduces the XFEVER dataset for cross-lingual fact verification, along with baseline models for zero-shot and translate-train scenarios.

Findings

01

Multilingual models can perform fact verification across languages.

02

Performance varies significantly by language.

03

Model calibration can be improved by analyzing prediction similarities.

Abstract

This paper introduces the Cross-lingual Fact Extraction and VERification (XFEVER) dataset designed for benchmarking the fact verification models across different languages. We constructed it by translating the claim and evidence texts of the Fact Extraction and VERification (FEVER) dataset into six languages. The training and development sets were translated using machine translation, whereas the test set includes texts translated by professional translators and machine-translated texts. Using the XFEVER dataset, two cross-lingual fact verification scenarios, zero-shot learning and translate-train learning, are defined, and baseline models for each scenario are also proposed in this paper. Experimental results show that the multilingual language model can be used to build fact verification models in different languages efficiently. However, the performance varies by language and is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nii-yamagishilab/xfever
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Computational and Text Analysis Methods · Biomedical Text Mining and Ontologies

MethodsSparse Evolutionary Training