TL;DR
This paper introduces a new challenging dataset and a deep multimodal model for detecting image repurposing by analyzing images and related metadata, significantly improving detection accuracy over existing methods.
Contribution
The paper presents the MEIR dataset for image repurposing detection and a novel end-to-end deep multimodal model that outperforms state-of-the-art techniques.
Findings
The proposed model achieves up to 0.23 higher AUC on MEIR.
It outperforms existing methods on multiple datasets.
The dataset includes real-world manipulations of location, person, and organization.
Abstract
Nefarious actors on social media and other platforms often spread rumors and falsehoods through images whose metadata (e.g., captions) have been modified to provide visual substantiation of the rumor/falsehood. This type of modification is referred to as image repurposing, in which often an unmanipulated image is published along with incorrect or manipulated metadata to serve the actor's ulterior motives. We present the Multimodal Entity Image Repurposing (MEIR) dataset, a substantially challenging dataset over that which has been previously available to support research into image repurposing detection. The new dataset includes location, person, and organization manipulations on real-world data sourced from Flickr. We also present a novel, end-to-end, deep multimodal learning model for assessing the integrity of an image by combining information extracted from the image with related…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
