SemBind: Binding Diffusion Watermarks to Semantics Against Black-Box Forgery Attacks

Xin Zhang; Zijin Yang; Kejiang Chen; Linfeng Ma; Weiming Zhang; Nenghai Yu

arXiv:2601.20310·cs.CR·January 29, 2026

SemBind: Binding Diffusion Watermarks to Semantics Against Black-Box Forgery Attacks

Xin Zhang, Zijin Yang, Kejiang Chen, Linfeng Ma, Weiming Zhang, Nenghai Yu

PDF

Open Access

TL;DR

SemBind is a novel framework that enhances the security of latent diffusion model watermarks by binding them to image semantics, effectively resisting black-box forgery attacks without compromising image quality.

Contribution

It introduces a semantic masker trained with contrastive learning to bind watermarks to semantics, providing a flexible defense against black-box forgery in latent-based watermarking.

Findings

01

Significantly reduces false acceptance in black-box forgery scenarios

02

Maintains high image quality while enhancing watermark security

03

Compatible with existing latent-based watermarking methods

Abstract

Latent-based watermarks, integrated into the generation process of latent diffusion models (LDMs), simplify detection and attribution of generated images. However, recent black-box forgery attacks, where an attacker needs at least one watermarked image and black-box access to the provider's model, can embed the provider's watermark into images not produced by the provider, posing outsized risk to provenance and trust. We propose SemBind, the first defense framework for latent-based watermarks that resists black-box forgery by binding latent signals to image semantics via a learned semantic masker. Trained with contrastive learning, the masker yields near-invariant codes for the same prompt and near-orthogonal codes across prompts; these codes are reshaped and permuted to modulate the target latent before any standard latent-based watermark. SemBind is generally compatible with existing…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Generative Adversarial Networks and Image Synthesis · Digital Media Forensic Detection