BitMark: Watermarking Bitwise Autoregressive Image Generative Models
Louis Kerner, Michel Meintz, Bihe Zhao, Franziska Boenisch, Adam Dziedzic

TL;DR
BitMark introduces a novel bitwise watermarking framework for autoregressive image generative models, embedding detectable signals at the bit level to prevent model collapse and enable reliable identification of generated images.
Contribution
This work presents the first robust bitwise watermarking method integrated into autoregressive image models, ensuring high fidelity, robustness, and radioactive traceability.
Findings
Watermark remains detectable after fine-tuning diffusion models.
Watermarking does not significantly affect image quality or generation speed.
The method effectively prevents model collapse by enabling detection of generated content.
Abstract
State-of-the-art text-to-image models generate photorealistic images at an unprecedented speed. This work focuses on models that operate in a bitwise autoregressive manner over a discrete set of tokens that is practically infinite in size. However, their impressive generative power comes with a growing risk: as their outputs increasingly populate the Internet, they are likely to be scraped and reused as training data-potentially by the very same models. This phenomenon has been shown to lead to model collapse, where repeated training on generated content, especially from the models' own previous versions, causes a gradual degradation in performance. A promising mitigation strategy is watermarking, which embeds human-imperceptible yet detectable signals into generated images-enabling the identification of generated content. In this work, we introduce BitMark, a robust bitwise…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Chaos-based Image/Signal Encryption · Digital Media Forensic Detection
MethodsSparse Evolutionary Training · Diffusion · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
