When retrieval outperforms generation: Dense evidence retrieval for scalable fake news detection
Alamgir Munir Qazi, John P. McCrae, Jamal Abdul Nasir

TL;DR
This paper introduces DeReC, a dense retrieval-based fact verification system that outperforms LLM-based methods in accuracy and efficiency, making fake news detection more practical for real-world use.
Contribution
DeReC demonstrates that general-purpose text embeddings combined with classification can replace costly LLMs in fact verification, achieving higher accuracy and efficiency.
Findings
DeReC reduces runtime by over 90% compared to LLM-based methods.
DeReC achieves an F1 score of 65.58%, surpassing the previous state-of-the-art.
DeReC is effective across datasets of varying sizes.
Abstract
The proliferation of misinformation necessitates robust yet computationally efficient fact verification systems. While current state-of-the-art approaches leverage Large Language Models (LLMs) for generating explanatory rationales, these methods face significant computational barriers and hallucination risks in real-world deployments. We present DeReC (Dense Retrieval Classification), a lightweight framework that demonstrates how general-purpose text embeddings can effectively replace autoregressive LLM-based approaches in fact verification tasks. By combining dense retrieval with specialized classification, our system achieves better accuracy while being significantly more efficient. DeReC outperforms explanation-generating LLMs in efficiency, reducing runtime by 95% on RAWFC (23 minutes 36 seconds compared to 454 minutes 12 seconds) and by 92% on LIAR-RAW (134 minutes 14 seconds…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Big Data and Digital Economy
