Use of Retrieval-Augmented Large Language Model Agent for Long-Form COVID-19 Fact-Checking
Jingyi Huang, Yuyi Yang, Mengmeng Ji, Charles Alba, Sheng Zhang, Ruopeng An

TL;DR
This paper introduces SAFE, a retrieval-augmented system combining large language models with document retrieval to improve long-form COVID-19 misinformation fact-checking, showing significant performance gains over baseline models.
Contribution
The study presents SAFE, a novel agent system integrating retrieval-augmented generation with large language models for scalable, accurate long-form COVID-19 fact-checking, including an enhanced variant with Self-RAG.
Findings
SAFE outperforms baseline LLMs in accuracy and consistency.
SAFE achieves higher usefulness, clearness, and authenticity ratings.
LOTR-RAG design is more effective than SRAG-augmented variant.
Abstract
The COVID-19 infodemic calls for scalable fact-checking solutions that handle long-form misinformation with accuracy and reliability. This study presents SAFE (system for accurate fact extraction and evaluation), an agent system that combines large language models with retrieval-augmented generation (RAG) to improve automated fact-checking of long-form COVID-19 misinformation. SAFE includes two agents - one for claim extraction and another for claim verification using LOTR-RAG, which leverages a 130,000-document COVID-19 research corpus. An enhanced variant, SAFE (LOTR-RAG + SRAG), incorporates Self-RAG to refine retrieval via query rewriting. We evaluated both systems on 50 fake news articles (2-17 pages) containing 246 annotated claims (M = 4.922, SD = 3.186), labeled as true (14.1%), partly true (14.4%), false (27.0%), partly false (2.2%), and misleading (21.0%) by public health…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Artificial Intelligence in Healthcare and Education
