Similarity over Factuality: Are we making progress on multimodal out-of-context misinformation detection?
Stefanos-Iordanis Papadopoulos, Christos Koutlis, Symeon Papadopoulos,, Panagiotis C. Petrantonakis

TL;DR
This paper introduces MUSE, a simple multimodal similarity measure for out-of-context misinformation detection, which competes with complex models and raises questions about evaluation and progress in the field.
Contribution
The study proposes MUSE, a straightforward similarity-based baseline, and demonstrates its effectiveness and limitations compared to complex models in multimodal misinformation detection.
Findings
MUSE with conventional classifiers can outperform state-of-the-art methods.
Integrating MUSE into AITR improves performance significantly.
Surface-level patterns in MUSE highlight challenges in assessing factuality.
Abstract
Out-of-context (OOC) misinformation poses a significant challenge in multimodal fact-checking, where images are paired with texts that misrepresent their original context to support false narratives. Recent research in evidence-based OOC detection has seen a trend towards increasingly complex architectures, incorporating Transformers, foundation models, and large language models. In this study, we introduce a simple yet robust baseline, which assesses MUltimodal SimilaritiEs (MUSE), specifically the similarity between image-text pairs and external image and text evidence. Our results demonstrate that MUSE, when used with conventional classifiers like Decision Tree, Random Forest, and Multilayer Perceptron, can compete with and even surpass the state-of-the-art on the NewsCLIPpings and VERITE datasets. Furthermore, integrating MUSE in our proposed "Attentive Intermediate Transformer…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMisinformation and Its Impacts · User Authentication and Security Systems · Deception detection and forensic psychology
MethodsResidual Connection · Byte Pair Encoding · Layer Normalization · Label Smoothing · Linear Layer · Adam · Dropout · Multi-Head Attention · Dense Connections · Softmax
