KEN: Knowledge Augmentation and Emotion Guidance Network for Multimodal Fake News Detection
Peican Zhu, Yubo Jing, Le Cheng, Keke Tang, Yangming Guo

TL;DR
This paper introduces KEN, a multimodal fake news detection model that combines semantic understanding, knowledge augmentation, and emotion-guided learning to improve accuracy on social media misinformation.
Contribution
The paper presents a novel KEN framework that leverages LVLM for semantic understanding and incorporates emotion guidance for fine-grained fake news detection.
Findings
KEN outperforms existing methods on two real-world datasets.
Knowledge augmentation improves semantic comprehension of images and text.
Emotion-guided modeling enhances detection accuracy across different emotional news types.
Abstract
In recent years, the rampant spread of misinformation on social media has made accurate detection of multimodal fake news a critical research focus. However, previous research has not adequately understood the semantics of images, and models struggle to discern news authenticity with limited textual information. Meanwhile, treating all emotional types of news uniformly without tailored approaches further leads to performance degradation. Therefore, we propose a novel Knowledge Augmentation and Emotion Guidance Network (KEN). On the one hand, we effectively leverage LVLM's powerful semantic understanding and extensive world knowledge. For images, the generated captions provide a comprehensive understanding of image content and scenes, while for text, the retrieved evidence helps break the information silos caused by the closed and limited text and context. On the other hand, we consider…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
