Who Made This? Fake Detection and Source Attribution with Diffusion Features
Simone Bonechi, Paolo Andreini, Barbara Toniella Corradini

TL;DR
FRIDA is a lightweight, data-efficient framework that uses diffusion features from a pre-trained model to detect and attribute AI-generated images, achieving state-of-the-art results across unseen generators.
Contribution
Introduces FRIDA, a novel diffusion feature-based method for robust fake image detection and source attribution, capable of generalizing to unseen generators with limited data.
Findings
State-of-the-art cross-generator detection accuracy
Effective source model attribution with a neural classifier
Robust detection with minimal training data
Abstract
The rapid rise of generative models has yielded synthetic images of striking realism, blurring the line between real and fake content. As novel models proliferate, detectors must go beyond mere fake identification to robustly generalise across unseen generators and synthetic content. We introduce FRIDA (Fake image Recognition and source Identification via Diffusion features Analysis), a lightweight, data-efficient framework that uses features from a pre-trained Stable Diffusion Model to detect and attribute AI-generated images. Through an in-depth analysis of how data from different generators are encoded across diffusion U-Net layers, we propose a method that (i) detects synthetic images using a training-free -Nearest Neighbour approach and (ii) performs source model attribution via a compact neural classifier. On the GenImage benchmark, FRIDA achieves state-of-the-art…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Face Recognition and Perception
