Beyond Retrieval: Improving Evidence Quality for LLM-based Multimodal Fact-Checking
Haoran Ou, Gelei Deng, Xingshuo Han, Jie Zhang, Han Qiu, Shangwei Guo, Tianwei Zhang

TL;DR
This paper introduces Aletheia, an end-to-end multimodal fact-checking framework that enhances evidence retrieval quality, leading to significant improvements in verifying disinformation using LLMs.
Contribution
The paper presents a novel evidence retrieval strategy within Aletheia that improves coverage and filters irrelevant information, advancing multimodal fact-checking accuracy.
Findings
Achieves 88.3% accuracy on public datasets
Improves verification accuracy by up to 30.8%
Enhances evidence quality for LLM-based disinformation detection
Abstract
The increasing multimodal disinformation, where deceptive claims are reinforced through coordinated text and visual content, poses significant challenges to automated fact-checking. Recent efforts leverage Large Language Models (LLMs) for this task, capitalizing on their strong reasoning and multimodal understanding capabilities. Emerging retrieval-augmented frameworks further equip LLMs with access to open-domain external information, enabling evidence-based verification beyond their internal knowledge. Despite their promising gains, our empirical study reveals notable shortcomings in the external search coverage and evidence quality evaluation. To mitigate those limitations, we propose Aletheia, an end-to-end framework for automated multimodal fact-checking. It introduces a novel evidence retrieval strategy that improves evidence coverage and filters useless information from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Software Engineering Research · Natural Language Processing Techniques
