So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Zhenglin Huang; Tianxiao Li; Xiangtai Li; Haiquan Wen; Yiwei He; Jiangning Zhang; Hao Fei; Xi Yang; Xiaowei Huang; Bei Peng; Guangliang Cheng

arXiv:2505.18660·cs.CV·October 6, 2025

So-Fake: Benchmarking and Explaining Social Media Image Forgery Detection

Zhenglin Huang, Tianxiao Li, Xiangtai Li, Haiquan Wen, Yiwei He, Jiangning Zhang, Hao Fei, Xi Yang, Xiaowei Huang, Bei Peng, Guangliang Cheng

PDF

3 Datasets

TL;DR

This paper introduces So-Fake, a large-scale social media image forgery dataset, a challenging out-of-domain benchmark, and an advanced detection framework that significantly improves accuracy and localization, advancing research in AI-generated image detection.

Contribution

The paper presents a comprehensive social media-oriented forgery dataset, a large out-of-domain benchmark, and a reinforcement learning-based detection framework, addressing current limitations in diversity, scale, and generalization.

Findings

01

So-Fake-Set contains over 2 million images from 35 generative models.

02

So-Fake-OOD provides a challenging out-of-domain test for real-world scenarios.

03

So-Fake-R1 outperforms previous methods with a 1.3% higher detection accuracy and 4.5% better localization IoU.

Abstract

Recent advances in AI-powered generative models have enabled the creation of increasingly realistic synthetic images, posing significant risks to information integrity and public trust on social media platforms. While robust detection frameworks and diverse, large-scale datasets are essential to mitigate these risks, existing academic efforts remain limited in scope: current datasets lack the diversity, scale, and realism required for social media contexts, while detection methods struggle with generalization to unseen generative technologies. To bridge this gap, we introduce So-Fake-Set, a comprehensive social media-oriented dataset with over 2 million high-quality images, diverse generative sources, and photorealistic imagery synthesized using 35 state-of-the-art generative models. To rigorously evaluate cross-domain robustness, we establish a novel and large-scale (100K)…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Datasets

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.