Use of Retrieval-Augmented Large Language Model Agent for Long-Form COVID-19 Fact-Checking

Jingyi Huang; Yuyi Yang; Mengmeng Ji; Charles Alba; Sheng Zhang; Ruopeng An

arXiv:2512.00007·cs.IR·December 2, 2025

Use of Retrieval-Augmented Large Language Model Agent for Long-Form COVID-19 Fact-Checking

Jingyi Huang, Yuyi Yang, Mengmeng Ji, Charles Alba, Sheng Zhang, Ruopeng An

PDF

Open Access

TL;DR

This paper introduces SAFE, a retrieval-augmented system combining large language models with document retrieval to improve long-form COVID-19 misinformation fact-checking, showing significant performance gains over baseline models.

Contribution

The study presents SAFE, a novel agent system integrating retrieval-augmented generation with large language models for scalable, accurate long-form COVID-19 fact-checking, including an enhanced variant with Self-RAG.

Findings

01

SAFE outperforms baseline LLMs in accuracy and consistency.

02

SAFE achieves higher usefulness, clearness, and authenticity ratings.

03

LOTR-RAG design is more effective than SRAG-augmented variant.

Abstract

The COVID-19 infodemic calls for scalable fact-checking solutions that handle long-form misinformation with accuracy and reliability. This study presents SAFE (system for accurate fact extraction and evaluation), an agent system that combines large language models with retrieval-augmented generation (RAG) to improve automated fact-checking of long-form COVID-19 misinformation. SAFE includes two agents - one for claim extraction and another for claim verification using LOTR-RAG, which leverages a 130,000-document COVID-19 research corpus. An enhanced variant, SAFE (LOTR-RAG + SRAG), incorporates Self-RAG to refine retrieval via query rewriting. We evaluated both systems on 50 fake news articles (2-17 pages) containing 246 annotated claims (M = 4.922, SD = 3.186), labeled as true (14.1%), partly true (14.4%), false (27.0%), partly false (2.2%), and misleading (21.0%) by public health…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMisinformation and Its Impacts · Topic Modeling · Artificial Intelligence in Healthcare and Education