Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity   Classification

Xia Zeng; Arkaitz Zubiaga

arXiv:2205.05646·cs.CL·May 12, 2022·1 cites

Aggregating Pairwise Semantic Differences for Few-Shot Claim Veracity Classification

Xia Zeng, Arkaitz Zubiaga

PDF

Open Access

TL;DR

This paper introduces SEED, a vector-based method for few-shot claim veracity classification that aggregates semantic differences, outperforming baselines on FEVER and SCIFACT datasets.

Contribution

SEED is a novel approach that simulates class representative vectors to improve few-shot claim veracity classification performance.

Findings

01

SEED outperforms fine-tuned BERT/RoBERTa baselines.

02

SEED surpasses state-of-the-art perplexity-based methods.

03

Consistent improvements observed on FEVER and SCIFACT datasets.

Abstract

As part of an automated fact-checking pipeline, the claim veracity classification task consists in determining if a claim is supported by an associated piece of evidence. The complexity of gathering labelled claim-evidence pairs leads to a scarcity of datasets, particularly when dealing with new domains. In this paper, we introduce SEED, a novel vector-based method to few-shot claim veracity classification that aggregates pairwise semantic differences for claim-evidence pairs. We build on the hypothesis that we can simulate class representative vectors that capture average semantic differences for claim-evidence pairs in a class, which can then be used for classification of new instances. We compare the performance of our method with competitive baselines including fine-tuned BERT/RoBERTa models, as well as the state-of-the-art few-shot veracity classification method that leverages…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Data Quality and Management · Biomedical Text Mining and Ontologies