WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection
Yan Hong, Jianfu Zhang

TL;DR
WildFake is a large, diverse dataset designed to improve the detection of AI-generated images, addressing challenges of generalization and robustness in real-world scenarios.
Contribution
This paper introduces WildFake, a comprehensive dataset with diverse, hierarchical AI-generated images to enhance detection methods' robustness and generalizability.
Findings
WildFake improves detection robustness across various generative models
Detectors trained on WildFake outperform those trained on smaller datasets
Hierarchical structure aids in understanding different generative techniques
Abstract
The extraordinary ability of generative models enabled the generation of images with such high quality that human beings cannot distinguish Artificial Intelligence (AI) generated images from real-life photographs. The development of generation techniques opened up new opportunities but concurrently introduced potential risks to privacy, authenticity, and security. Therefore, the task of detecting AI-generated imagery is of paramount importance to prevent illegal activities. To assess the generalizability and robustness of AI-generated image detection, we present a large-scale dataset, referred to as WildFake, comprising state-of-the-art generators, diverse object categories, and real-world applications. WildFake dataset has the following advantages: 1) Rich Content with Wild collection: WildFake collects fake images from the open-source community, enriching its diversity with a broad…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Neural Network Applications
MethodsDiffusion
