Supervised Contrastive Learning for Few-Shot AI-Generated Image Detection and Attribution
Jaime \'Alvarez Urue\~na, David Camacho, Javier Huertas Tato

TL;DR
This paper introduces a two-stage framework using supervised contrastive learning and few-shot k-NN classification to detect and attribute AI-generated images, achieving high accuracy and robustness against unseen generators.
Contribution
It presents a novel contrastive learning-based detection method combined with few-shot learning for source attribution, improving generalization to unseen generative models.
Findings
Achieves 91.3% detection accuracy with only 150 images per class.
Outperforms existing methods by 5.2 percentage points in detection accuracy.
Improves AUC and OSCR by 14.70% and 4.27% respectively in attribution tasks.
Abstract
The rapid advancement of generative artificial intelligence has enabled the creation of synthetic images that are increasingly indistinguishable from authentic content, posing significant challenges for digital media integrity. This problem is compounded by the accelerated release cycle of novel generative models, which renders traditional detection approaches (reliant on periodic retraining) computationally infeasible and operationally impractical. This work proposes a novel two-stage detection framework designed to address the generalization challenge inherent in synthetic image detection. The first stage employs a vision deep learning model trained via supervised contrastive learning to extract discriminative embeddings from input imagery. Critically, this model was trained on a strategically partitioned subset of available generators, with specific architectures withheld from…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Media Forensic Detection · Generative Adversarial Networks and Image Synthesis · Law in Society and Culture
