Beyond Retrieval: Improving Evidence Quality for LLM-based Multimodal Fact-Checking

Haoran Ou; Gelei Deng; Xingshuo Han; Jie Zhang; Han Qiu; Shangwei Guo; Tianwei Zhang

arXiv:2505.03135·cs.AI·January 9, 2026

Beyond Retrieval: Improving Evidence Quality for LLM-based Multimodal Fact-Checking

Haoran Ou, Gelei Deng, Xingshuo Han, Jie Zhang, Han Qiu, Shangwei Guo, Tianwei Zhang

PDF

Open Access

TL;DR

This paper introduces Aletheia, an end-to-end multimodal fact-checking framework that enhances evidence retrieval quality, leading to significant improvements in verifying disinformation using LLMs.

Contribution

The paper presents a novel evidence retrieval strategy within Aletheia that improves coverage and filters irrelevant information, advancing multimodal fact-checking accuracy.

Findings

01

Achieves 88.3% accuracy on public datasets

02

Improves verification accuracy by up to 30.8%

03

Enhances evidence quality for LLM-based disinformation detection

Abstract

The increasing multimodal disinformation, where deceptive claims are reinforced through coordinated text and visual content, poses significant challenges to automated fact-checking. Recent efforts leverage Large Language Models (LLMs) for this task, capitalizing on their strong reasoning and multimodal understanding capabilities. Emerging retrieval-augmented frameworks further equip LLMs with access to open-domain external information, enabling evidence-based verification beyond their internal knowledge. Despite their promising gains, our empirical study reveals notable shortcomings in the external search coverage and evidence quality evaluation. To mitigate those limitations, we propose Aletheia, an end-to-end framework for automated multimodal fact-checking. It introduces a novel evidence retrieval strategy that improves evidence coverage and filters useless information from…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Software Engineering Research · Natural Language Processing Techniques