SPRINT: Robust Model Attribution of Generated Images via Secret Pixel Reconstruction
Kai Yao, Marc Juarez

TL;DR
SPRINT introduces a private, secret pixel reconstruction method for robust model attribution of AI-generated images, significantly improving resistance to adaptive attacks while maintaining high accuracy.
Contribution
It proposes a novel fingerprinting approach that privatizes verification, making adaptive evasion attacks ineffective without sacrificing detection performance.
Findings
Achieves 99.17% accuracy on clean images in closed-world settings.
Reduces adaptive attack success rates to 1% or below.
Maintains high AUROC of 99.30% in open-world scenarios.
Abstract
Detecting the source model of AI-generated images is a growing accountability problem. AI fingerprinting techniques address this by detecting imperceptible patterns in the images that are unique to each model, achieving high detection accuracy under ideal conditions. However, recent research has shown that image fingerprints are extremely brittle to adaptive attacks, where knowledge of the technique can be exploited to perturb the fingerprints and evade detection. We present SPRINT (Secret Pixel Reconstruction fingerprinting), a novel model attribution method specifically designed to provide robustness to adaptive attacks. As opposed to existing fingerprinting, which focuses on publicly discoverable patterns in the image, SPRINT relies on a secret to define hidden reconstruction targets, thus keeping the verification task itself private. As a result, the attacker can no longer see the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
