LEMMA: Towards LVLM-Enhanced Multimodal Misinformation Detection with External Knowledge Augmentation

Keyang Xuan; Li Yi; Fan Yang; Ruochen Wu; Yi R. Fung; Heng Ji

arXiv:2402.11943 · cs.CL · June 24, 2024 · 3 cites

TL;DR

This paper introduces LEMMA, a method that enhances multimodal misinformation detection by augmenting LVLMs with external knowledge, significantly improving accuracy on social media datasets.

Contribution

The paper proposes a novel external knowledge augmentation technique for LVLMs to improve multimodal misinformation detection accuracy.

Findings

01

LEMMA improves detection accuracy by 7% on the Twitter dataset.

02

LEMMA improves detection accuracy by 13% on the Fakeddit dataset.

03

LVLMs have strong reasoning skills but need external knowledge for better verification.

Abstract

The rise of multimodal misinformation on social platforms poses significant challenges for individuals and societies. Its increased credibility and broader impact compared to textual misinformation make detection complex, requiring robust reasoning across diverse media types and profound knowledge for accurate verification. The emergence of Large Vision Language Model (LVLM) offers a potential solution to this problem. Leveraging their proficiency in processing visual and textual information, LVLM demonstrates promising capabilities in recognizing complex information and exhibiting strong reasoning skills. In this paper, we first investigate the potential of LVLM on multimodal misinformation detection. We find that even though LVLM has a superior performance compared to LLMs, its profound reasoning may present limited power with a lack of evidence. Based on these observations, we…

Code & Models

Repositories

fan19-hub/LEMMA (Official)

Taxonomy

Topics: Misinformation and Its Impacts