SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model

Zhenglin Huang; Jinwei Hu; Xiangtai Li; Yiwei He; Xingyu Zhao; Bei Peng; Baoyuan Wu; Xiaowei Huang; Guangliang Cheng

arXiv:2412.04292·cs.CV·June 27, 2025

SIDA: Social Media Image Deepfake Detection, Localization and Explanation with Large Multimodal Model

Zhenglin Huang, Jinwei Hu, Xiangtai Li, Yiwei He, Xingyu Zhao, Bei Peng, Baoyuan Wu, Xiaowei Huang, Guangliang Cheng

PDF

Open Access 1 Repo 4 Models

TL;DR

This paper introduces SIDA, a multimodal model for detecting, localizing, and explaining deepfakes on social media, supported by a large, diverse, and realistic dataset called SID-Set.

Contribution

The paper presents a new large-scale dataset SID-Set and a multimodal framework SIDA for deepfake detection, localization, and explanation, advancing current capabilities in social media image authenticity assessment.

Findings

01

SIDA outperforms existing models on SID-Set and other benchmarks.

02

SID-Set contains 300K diverse, realistic images with comprehensive annotations.

03

SIDA provides detection, localization, and textual explanations of deepfakes.

Abstract

The rapid advancement of generative models in creating highly realistic images poses substantial risks for misinformation dissemination. For instance, a synthetic image, when shared on social media, can mislead extensive audiences and erode trust in digital content, resulting in severe repercussions. Despite some progress, academia has not yet created a large and diversified deepfake detection dataset for social media, nor has it devised an effective solution to address this issue. In this paper, we introduce the Social media Image Detection dataSet (SID-Set), which offers three key advantages: (1) extensive volume, featuring 300K AI-generated/tampered and authentic images with comprehensive annotations, (2) broad diversity, encompassing fully synthetic and tampered images across various classes, and (3) elevated realism, with images that are predominantly indistinguishable from genuine…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hzlsaber/sida
pytorch

Models

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Anomaly Detection Techniques and Applications