PALADIN : Robust Neural Fingerprinting for Text-to-Image Diffusion Models

Murthy L; Subarna Tripathi

arXiv:2506.03170·cs.CV·July 25, 2025

PALADIN : Robust Neural Fingerprinting for Text-to-Image Diffusion Models

Murthy L, Subarna Tripathi

PDF

Open Access

TL;DR

This paper introduces PALADIN, a neural fingerprinting method for text-to-image diffusion models that achieves near-perfect attribution accuracy, addressing security concerns in open-source generative models.

Contribution

The paper proposes a novel neural fingerprinting technique using cyclic error correcting codes to improve attribution accuracy in diffusion models.

Findings

01

Achieves near 100% attribution accuracy.

02

Balances attribution accuracy with generation quality.

03

Introduces a coding theory-based approach for neural fingerprinting.

Abstract

The risk of misusing text-to-image generative models for malicious uses, especially due to the open-source development of such models, has become a serious concern. As a risk mitigation strategy, attributing generative models with neural fingerprinting is emerging as a popular technique. There has been a plethora of recent work that aim for addressing neural fingerprinting. A trade-off between the attribution accuracy and generation quality of such models has been studied extensively. None of the existing methods yet achieved 100% attribution accuracy. However, any model with less than cent percent accuracy is practically non-deployable. In this work, we propose an accurate method to incorporate neural fingerprinting for text-to-image diffusion models leveraging the concepts of cyclic error correcting codes from the literature of coding theory.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGenerative Adversarial Networks and Image Synthesis · Adversarial Robustness in Machine Learning · Digital Media Forensic Detection