NewsCLIPpings: Automatic Generation of Out-of-Context Multimodal Media
Grace Luo, Trevor Darrell, Anna Rohrbach

TL;DR
This paper introduces NewsCLIPpings, a large-scale dataset of news images and captions that are unmanipulated but mismatched, built to study and detect out-of-context media used in misinformation. It shows that machine-driven image repurposing is a realistic threat.
Contribution
The paper presents a novel dataset of unmanipulated yet mismatched image-caption pairs for misinformation detection and benchmarks state-of-the-art models on this challenging dataset.
Findings
Machine-driven image repurposing is a realistic threat.
Models show varied performance across different pretraining domains.
The dataset contains challenging instances that can mislead humans.
Abstract
Online misinformation is a prevalent societal issue, with adversaries relying on tools ranging from cheap fakes to sophisticated deep fakes. We are motivated by the threat scenario where an image is used out of context to support a certain narrative. While some prior datasets for detecting image-text inconsistency generate samples via text manipulation, we propose a dataset where both image and text are unmanipulated but mismatched. We introduce several strategies for automatically retrieving convincing images for a given caption, capturing cases with inconsistent entities or semantic context. Our large-scale automatically generated NewsCLIPpings Dataset: (1) demonstrates that machine-driven image repurposing is now a realistic threat, and (2) provides samples that represent challenging instances of mismatch between text and image in news that are able to mislead humans. We benchmark…
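The abstract mentions automatically retrieving a convincing image for a given caption but does not spell out the mechanism here. As a rough illustration only, the idea can be sketched as a nearest-neighbor search over embeddings that skips the caption's own image. The function name, the use of cosine similarity, and the assumption that caption and image embeddings are precomputed in a shared space are all hypothetical and are not taken from the paper.

```python
import numpy as np

def retrieve_mismatched_image(caption_vecs, image_vecs, query_idx):
    """Return the index of the image most similar to caption `query_idx`,
    excluding the image originally paired with that caption.

    Hypothetical helper: assumes row i of `caption_vecs` and row i of
    `image_vecs` describe the same original news item.
    """
    captions = np.asarray(caption_vecs, dtype=float)
    images = np.asarray(image_vecs, dtype=float)

    # Normalize rows so that dot products equal cosine similarities.
    captions = captions / np.linalg.norm(captions, axis=1, keepdims=True)
    images = images / np.linalg.norm(images, axis=1, keepdims=True)

    # Similarity of every image to the query caption.
    sims = images @ captions[query_idx]

    # Exclude the true pair so the result is unmanipulated but mismatched.
    sims[query_idx] = -np.inf
    return int(np.argmax(sims))

# Toy 2-D embeddings (made up for illustration, not real data).
toy_captions = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
toy_images = [[1.0, 0.0], [0.9, 0.1], [0.0, 1.0]]
match = retrieve_mismatched_image(toy_captions, toy_images, query_idx=0)
```

In this toy setup the caption at index 0 is paired with the image at index 1, the closest image that is not its own. A real pipeline would need embeddings from an actual encoder and likely additional filtering, which this sketch leaves out.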
Taxonomy
Topics: Multimodal Machine Learning Applications · Misinformation and Its Impacts · Viral Infections and Outbreaks Research
