IConMark: Robust Interpretable Concept-Based Watermark For AI Images
Vinu Sankar Sadasivan, Mehrdad Saberi, Soheil Feizi

TL;DR
IConMark introduces a semantic, interpretable watermarking method for AI-generated images that is robust against adversarial attacks and augmentations, enhancing digital authenticity verification.
Contribution
The paper presents IConMark, a novel semantic watermarking technique that embeds interpretable concepts into AI images, improving robustness and human interpretability over traditional methods.
Findings
IConMark outperforms baselines in watermark detection accuracy.
Hybrid approaches further improve robustness against manipulations.
Watermarks remain human-readable and verifiable.
Abstract
With the rapid rise of generative AI and synthetic media, distinguishing AI-generated images from real ones has become crucial in safeguarding against misinformation and ensuring digital authenticity. Traditional watermarking techniques have shown vulnerabilities to adversarial attacks, undermining their effectiveness in the presence of attackers. We propose IConMark, a novel in-generation robust semantic watermarking method that embeds interpretable concepts into AI-generated images, as a first step toward interpretable watermarking. Unlike traditional methods, which rely on adding noise or perturbations to AI-generated images, IConMark incorporates meaningful semantic attributes, making it interpretable to humans and hence, resilient to adversarial manipulation. This method is not only robust against various image augmentations but also human-readable, enabling manual verification of…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
