CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models

Junhoo Lee; Mijin Koo; Nojun Kwak

arXiv:2604.16363·cs.CR·April 21, 2026

CSF: Black-box Fingerprinting via Compositional Semantics for Text-to-Image Models

Junhoo Lee, Mijin Koo, Nojun Kwak

PDF

TL;DR

CSF is a novel black-box fingerprinting method for text-to-image models that uses compositional prompts to attribute models to their lineages without internal access.

Contribution

It introduces the first black-box approach for model attribution using compositional semantics, applicable to commercial API deployments without internal model access.

Findings

01

CSF successfully attributes models across 6 families and 13 variants.

02

The Bayesian framework enables controlled-risk lineage decisions.

03

CSF outperforms existing methods in black-box attribution scenarios.

Abstract

Text-to-image models are commercially valuable assets often distributed under restrictive licenses, but such licenses are enforceable only when violations can be detected. Existing methods require pre-deployment watermarking or internal model access, which are unavailable in commercial API deployments. We present Compositional Semantic Fingerprinting (CSF), the first black-box method for attributing fine-tuned text-to-image models to protected lineages using only query access. CSF treats models as semantic category generators and probes them with compositional underspecified prompts that remain rare under fine-tuning. This gives IP owners an asymmetric advantage: new prompt compositions can be generated after deployment, while attackers must anticipate and suppress a much broader space of fingerprints. Across 6 model families (FLUX, Kandinsky, SD1.5/2.1/3.0/XL) and 13 fine-tuned…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.