TL;DR
This paper introduces SemanticRegen, a novel attack that effectively removes semantic watermarks from AI-generated images by selectively inpainting backgrounds, exposing vulnerabilities in current watermarking defenses.
Contribution
The paper presents SemanticRegen, a three-stage, label-free attack that erases state-of-the-art semantic watermarks while preserving image content, revealing a critical weakness in current watermarking schemes.
Findings
SemanticRegen defeats the TreeRing watermark (p=0.10)
Reduces bit-accuracy below 0.75 for other schemes
Maintains high perceptual quality with masked SSIM of 0.94
Abstract
As AI-generated imagery becomes ubiquitous, invisible watermarks have emerged as a primary line of defense for copyright and provenance. The newest watermarking schemes embed semantic signals - content-aware patterns that are designed to survive common image manipulations - yet their true robustness against adaptive adversaries remains under-explored. We expose a previously unreported vulnerability and introduce SemanticRegen, a three-stage, label-free attack that erases state-of-the-art semantic and invisible watermarks while leaving an image's apparent meaning intact. Our pipeline (i) uses a vision-language model to obtain fine-grained captions, (ii) extracts foreground masks with zero-shot segmentation, and (iii) inpaints only the background via an LLM-guided diffusion model, thereby preserving salient objects and style cues. Evaluated on 1,000 prompts across four watermarking…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsDiffusion
