CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models
Junhoo Lee, Mijin Koo, Nojun Kwak

TL;DR
CSF is a novel black-box fingerprinting method for text-to-image models that uses compositional prompts to attribute models to their lineages without internal access.
Contribution
It introduces the first black-box approach for model attribution using compositional semantics, applicable to commercial API deployments without internal model access.
Findings
CSF successfully attributes models across 6 families and 13 variants.
The Bayesian framework enables controlled-risk lineage decisions.
CSF outperforms existing methods in black-box attribution scenarios.
Abstract
Text-to-image models are commercially valuable assets often distributed under restrictive licenses, but such licenses are enforceable only when violations can be detected. Existing methods require pre-deployment watermarking or internal model access, which are unavailable in commercial API deployments. We present Compositional Semantic Fingerprinting (CSF), the first black-box method for attributing fine-tuned text-to-image models to protected lineages using only query access. CSF treats models as semantic category generators and probes them with compositional underspecified prompts that remain rare under fine-tuning. This gives IP owners an asymmetric advantage: new prompt compositions can be generated after deployment, while attackers must anticipate and suppress a much broader space of fingerprints. Across 6 model families (FLUX, Kandinsky, SD1.5/2.1/3.0/XL) and 13 fine-tuned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
