Improving Fake News Detection by Using an Entity-enhanced Framework to   Fuse Diverse Multimodal Clues

Peng Qi; Juan Cao; Xirong Li; Huan Liu; Qiang Sheng; Xiaoyue Mi; Qin; He; Yongbiao Lv; Chenyang Guo; Yingchao Yu

arXiv:2108.10509·cs.MM·August 25, 2021

Improving Fake News Detection by Using an Entity-enhanced Framework to Fuse Diverse Multimodal Clues

Peng Qi, Juan Cao, Xirong Li, Huan Liu, Qiang Sheng, Xiaoyue Mi, Qin, He, Yongbiao Lv, Chenyang Guo, Yingchao Yu

PDF

TL;DR

This paper introduces an entity-enhanced multimodal framework that effectively models high-level semantic correlations between text and images, significantly improving fake news detection accuracy.

Contribution

It proposes a novel framework that captures entity inconsistency, mutual enhancement, and text complementation in multimodal fake news detection, leveraging visual entities for better semantic understanding.

Findings

01

Outperforms existing models in fake news detection accuracy

02

Effectively models three key cross-modal correlations

03

Demonstrates robustness across diverse datasets

Abstract

Recently, fake news with text and images have achieved more effective diffusion than text-only fake news, raising a severe issue of multimodal fake news detection. Current studies on this issue have made significant contributions to developing multimodal models, but they are defective in modeling the multimodal content sufficiently. Most of them only preliminarily model the basic semantics of the images as a supplement to the text, which limits their performance on detection. In this paper, we find three valuable text-image correlations in multimodal fake news: entity inconsistency, mutual enhancement, and text complementation. To effectively capture these multimodal clues, we innovatively extract visual entities (such as celebrities and landmarks) to understand the news-related high-level semantics of images, and then model the multimodal entity inconsistency and mutual enhancement…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.