Seeds Don't Lie: An Adaptive Watermarking Framework for Computer Vision Models
Jacob Shams, Ben Nassi, Ikuya Morikawa, Toshiya Shimizu, Asaf Shabtai,, Yuval Elovici

TL;DR
This paper introduces an adaptive watermarking framework for computer vision models that leverages unique random seed behaviors during training to detect unauthorized model extraction, demonstrating high robustness against various attacks.
Contribution
The paper proposes the RAW framework that uses seed-induced model behaviors as a watermark, providing a novel, robust method for IP protection in model extraction scenarios.
Findings
Achieves >0.9 AUC in detection accuracy.
Robust against unseen extraction attacks and model pruning.
Effective across different model architectures.
Abstract
In recent years, various watermarking methods were suggested to detect computer vision models obtained illegitimately from their owners, however they fail to demonstrate satisfactory robustness against model extraction attacks. In this paper, we present an adaptive framework to watermark a protected model, leveraging the unique behavior present in the model due to a unique random seed initialized during the model training. This watermark is used to detect extracted models, which have the same unique behavior, indicating an unauthorized usage of the protected model's intellectual property (IP). First, we show how an initial seed for random number generation as part of model training produces distinct characteristics in the model's decision boundaries, which are inherited by extracted models and present in their decision boundaries, but aren't present in non-extracted models trained on…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Advanced Malware Detection Techniques · Adversarial Robustness in Machine Learning
Methodsfail
