Unknown Aware AI-Generated Content Attribution
Ellie Thieu, Jifan Zhang, Haoyue Bai

TL;DR
This paper addresses the challenge of attributing AI-generated images to specific models by leveraging limited labeled data and unlabeled wild data, improving identification of unseen generators in open-world scenarios.
Contribution
It introduces a constrained optimization method that effectively uses unlabeled wild data to enhance attribution accuracy for unseen generative models.
Findings
Wild data improves attribution to unseen generators
Baseline struggles with new, unseen models
Proposed method outperforms traditional classifiers
Abstract
The rapid advancement of photorealistic generative models has made it increasingly important to attribute the origin of synthetic content, moving beyond binary real or fake detection toward identifying the specific model that produced a given image. We study the problem of distinguishing outputs from a target generative model (e.g., OpenAI Dalle 3) from other sources, including real images and images generated by a wide range of alternative models. Using CLIP features and a simple linear classifier, shown to be effective in prior work, we establish a strong baseline for target generator attribution using only limited labeled data from the target model and a small number of known generators. However, this baseline struggles to generalize to harder, unseen, and newly released generators. To address this limitation, we propose a constrained optimization approach that leverages unlabeled…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenerative Adversarial Networks and Image Synthesis · Domain Adaptation and Few-Shot Learning · Face recognition and analysis
