GIM: A Million-scale Benchmark for Generative Image Manipulation Detection and Localization
Yirui Chen, Xudong Huang, Quan Zhang, Wei Li, Mingjian Zhu, Qiangyu, Yan, Simiao Li, Hanting Chen, Hailin Hu, Jie Yang, Wei Liu, Jie Hu

TL;DR
This paper introduces GIM, a large-scale benchmark dataset with over one million manipulated images, to advance the detection and localization of generative image manipulations, and proposes a new framework, GIMFormer, that outperforms existing methods.
Contribution
The paper creates the GIM dataset, the largest of its kind, and proposes GIMFormer, a novel IMDL framework that achieves superior performance on this benchmark.
Findings
GIM dataset contains over one million manipulated images.
GIMFormer outperforms previous state-of-the-art methods.
The benchmark enables comprehensive evaluation of IMDL techniques.
Abstract
The extraordinary ability of generative models emerges as a new trend in image editing and generating realistic images, posing a serious threat to the trustworthiness of multimedia data and driving the research of image manipulation detection and location (IMDL). However, the lack of a large-scale data foundation makes the IMDL task unattainable. In this paper, we build a local manipulation data generation pipeline that integrates the powerful capabilities of SAM, LLM, and generative models. Upon this basis, we propose the GIM dataset, which has the following advantages: 1) Large scale, GIM includes over one million pairs of AI-manipulated images and real images. 2) Rich image content, GIM encompasses a broad range of image classes. 3) Diverse generative manipulation, the images are manipulated images with state-of-the-art generators and various manipulation tasks. The aforementioned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
Taxonomy
TopicsAdvanced Vision and Imaging · Image Processing Techniques and Applications · Image and Object Detection Techniques
MethodsSegment Anything Model
