Generating Grounded Responses to Counter Misinformation via Learning Efficient Fine-Grained Critiques
Xiaofei Xu, Xiuzhen Zhang, Ke Deng

TL;DR
This paper introduces MisMitiFact, an efficient framework that uses lightweight critique models to generate fact-grounded counter-responses to misinformation, significantly improving scalability and cost-effectiveness over existing methods.
Contribution
It presents a novel, scalable approach for misinformation mitigation by training lightweight critique models to refine LLM outputs with evidence-based feedback.
Findings
Achieves comparable response quality to LLM self-feedback.
Increases feedback generation throughput by approximately 5 times.
Demonstrates effectiveness in correcting key factual errors in responses.
Abstract
Fake news and misinformation poses a significant threat to society, making efficient mitigation essential. However, manual fact-checking is costly and lacks scalability. Large Language Models (LLMs) offer promise in automating counter-response generation to mitigate misinformation, but a critical challenge lies in their tendency to hallucinate non-factual information. Existing models mainly rely on LLM self-feedback to reduce hallucination, but this approach is computationally expensive. In this paper, we propose MisMitiFact, Misinformation Mitigation grounded in Facts, an efficient framework for generating fact-grounded counter-responses at scale. MisMitiFact generates simple critique feedback to refine LLM outputs, ensuring responses are grounded in evidence. We develop lightweight, fine-grained critique models trained on data sourced from readily available fact-checking sites to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Topic Modeling · Hate Speech and Cyberbullying Detection
