WildFake: A Large-scale Challenging Dataset for AI-Generated Images   Detection

Yan Hong; Jianfu Zhang

arXiv:2402.11843·cs.CV·February 20, 2024·3 cites

WildFake: A Large-scale Challenging Dataset for AI-Generated Images Detection

Yan Hong, Jianfu Zhang

PDF

Open Access

TL;DR

WildFake is a large, diverse dataset designed to improve the detection of AI-generated images, addressing challenges of generalization and robustness in real-world scenarios.

Contribution

This paper introduces WildFake, a comprehensive dataset with diverse, hierarchical AI-generated images to enhance detection methods' robustness and generalizability.

Findings

01

WildFake improves detection robustness across various generative models

02

Detectors trained on WildFake outperform those trained on smaller datasets

03

Hierarchical structure aids in understanding different generative techniques

Abstract

The extraordinary ability of generative models enabled the generation of images with such high quality that human beings cannot distinguish Artificial Intelligence (AI) generated images from real-life photographs. The development of generation techniques opened up new opportunities but concurrently introduced potential risks to privacy, authenticity, and security. Therefore, the task of detecting AI-generated imagery is of paramount importance to prevent illegal activities. To assess the generalizability and robustness of AI-generated image detection, we present a large-scale dataset, referred to as WildFake, comprising state-of-the-art generators, diverse object categories, and real-world applications. WildFake dataset has the following advantages: 1) Rich Content with Wild collection: WildFake collects fake images from the open-source community, enriching its diversity with a broad…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Neural Network Applications

MethodsDiffusion