FKA-Owl: Advancing Multimodal Fake News Detection through Knowledge-Augmented LVLMs
Xuannan Liu, Peipei Li, Huaibo Huang, Zekun Li, Xing Cui, and Jiahao Liang, Lixiong Qin, Weihong Deng, Zhaofeng He

TL;DR
FKA-Owl enhances multimodal fake news detection by integrating forgery-specific knowledge into large vision-language models, improving their ability to identify manipulated content across diverse domains.
Contribution
The paper introduces a novel framework that incorporates forgery-specific semantic and artifact knowledge into LVLMs for better fake news detection.
Findings
FKA-Owl outperforms previous methods on public benchmarks.
The framework effectively models text-image correlations and image artifacts.
Extensive experiments validate its superior cross-domain performance.
Abstract
The massive generation of multimodal fake news involving both text and images exhibits substantial distribution discrepancies, prompting the need for generalized detectors. However, the insulated nature of training restricts the capability of classical detectors to obtain open-world facts. While Large Vision-Language Models (LVLMs) have encoded rich world knowledge, they are not inherently tailored for combating fake news and struggle to comprehend local forgery details. In this paper, we propose FKA-Owl, a novel framework that leverages forgery-specific knowledge to augment LVLMs, enabling them to reason about manipulations effectively. The augmented forgery-specific knowledge includes semantic correlation between text and images, and artifact trace in image manipulation. To inject these two kinds of knowledge into the LVLM, we design two specialized modules to establish their…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · Spam and Phishing Detection · Topic Modeling
