Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024

Christopher Malon

arXiv:2411.05762·cs.CL·November 11, 2024

Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024

Christopher Malon

PDF

Open Access 1 Video

TL;DR

This paper presents a multi-hop evidence pursuit system combining large language models and search engines to verify claims on the web, improving accuracy and evidence adequacy in the FEVER 2024 shared task.

Contribution

It introduces a novel iterative question generation and evidence retrieval approach that enhances claim verification performance over traditional single-step methods.

Findings

01

Achieved 0.477 AVeriTeC score on the FEVER 2024 test set.

02

Improved label accuracy by 0.045 over all-at-once question generation strategy.

03

Highlighted the effectiveness of various design choices like question formulation and metadata inclusion.

Abstract

Separating disinformation from fact on the web has long challenged both the search and the reasoning powers of humans. We show that the reasoning power of large language models (LLMs) and the retrieval power of modern search engines can be combined to automate this process and explainably verify claims. We integrate LLMs and search under a multi-hop evidence pursuit strategy. This strategy generates an initial question based on an input claim using a sequence to sequence model, searches and formulates an answer to the question, and iteratively generates follow-up questions to pursue the evidence that is missing using an LLM. We demonstrate our system on the FEVER 2024 (AVeriTeC) shared task. Compared to a strategy of generating all the questions at once, our method obtains .045 higher label accuracy and .155 higher AVeriTeC score (evaluating the adequacy of the evidence). Through…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

Multi-hop Evidence Pursuit Meets the Web: Team Papelo at FEVER 2024· underline

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Scientific Computing and Data Management

MethodsSparse Evolutionary Training