GenImage: A Million-Scale Benchmark for Detecting AI-Generated Image
Mingjian Zhu, Hanting Chen, Qiangyu Yan, Xudong Huang, Guanyu Lin, Wei, Li, Zhijun Tu, Hailin Hu, Jie Hu, Yunhe Wang

TL;DR
This paper introduces the GenImage dataset, a large-scale benchmark with over one million AI-generated and real images, designed to improve detection of AI-generated images and evaluate detector robustness across diverse scenarios.
Contribution
The paper presents the GenImage dataset, the largest of its kind, including images from advanced generators, and proposes evaluation tasks for detector performance in real-world conditions.
Findings
Detectors trained on GenImage show strong generalization across different generators.
The dataset enables comprehensive evaluation of detection methods under various image degradations.
GenImage accelerates development of more robust AI-generated image detectors.
Abstract
The extraordinary ability of generative models to generate photographic images has intensified concerns about the spread of disinformation, thereby leading to the demand for detectors capable of distinguishing between AI-generated fake images and real images. However, the lack of large datasets containing images from the most advanced image generators poses an obstacle to the development of such detectors. In this paper, we introduce the GenImage dataset, which has the following advantages: 1) Plenty of Images, including over one million pairs of AI-generated fake images and collected real images. 2) Rich Image Content, encompassing a broad range of image classes. 3) State-of-the-art Generators, synthesizing images with advanced diffusion models and GANs. The aforementioned advantages allow the detectors trained on GenImage to undergo a thorough evaluation and demonstrate strong…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection · Cell Image Analysis Techniques
