Towards the Detection of Diffusion Model Deepfakes
Jonas Ricker, Simon Damm, Thorsten Holz, Asja Fischer

TL;DR
This paper investigates the detectability of images generated by diffusion models (DMs), revealing that while existing GAN detectors fail on DMs, retraining them enables effective detection and highlighting fundamental differences in artifact presence.
Contribution
It demonstrates that retrained GAN detectors can reliably identify DM-generated images and provides insights into the distinct artifact patterns of DMs versus GANs.
Findings
State-of-the-art GAN detectors fail on DM images.
Retraining detectors on DM images achieves near-perfect detection.
DM images lack grid-like frequency artifacts and tend to underestimate high frequencies.
Abstract
In the course of the past few years, diffusion models (DMs) have reached an unprecedented level of visual quality. However, relatively little attention has been paid to the detection of DM-generated images, which is critical to prevent adverse impacts on our society. In contrast, generative adversarial networks (GANs), have been extensively studied from a forensic perspective. In this work, we therefore take the natural next step to evaluate whether previous methods can be used to detect images generated by DMs. Our experiments yield two key findings: (1) state-of-the-art GAN detectors are unable to reliably distinguish real from DM-generated images, but (2) re-training them on DM-generated images allows for almost perfect detection, which remarkably even generalizes to GANs. Together with a feature space analysis, our results lead to the hypothesis that DMs produce fewer detectable…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis
