SoK: How Robust is Image Classification Deep Neural Network Watermarking? (Extended Version)
Nils Lukas, Edward Jiang, Xinda Li, Florian Kerschbaum

TL;DR
This paper systematically evaluates the robustness of existing DNN watermarking schemes against a comprehensive set of removal attacks, revealing that none are truly robust in practice, highlighting flaws in current evaluation methods.
Contribution
It provides the first extensive empirical evaluation of watermarking schemes against diverse removal attacks, including novel methods, and proposes taxonomies for better understanding.
Findings
All surveyed schemes fail under comprehensive attacks
Adaptive attacks and untested removal methods compromise robustness
Current evaluation practices are insufficient and flawed
Abstract
Deep Neural Network (DNN) watermarking is a method for provenance verification of DNN models. Watermarking should be robust against watermark removal attacks that derive a surrogate model that evades provenance verification. Many watermarking schemes that claim robustness have been proposed, but their robustness is only validated in isolation against a relatively small set of attacks. There is no systematic, empirical evaluation of these claims against a common, comprehensive set of removal attacks. This uncertainty about a watermarking scheme's robustness causes difficulty to trust their deployment in practice. In this paper, we evaluate whether recently proposed watermarking schemes that claim robustness are robust against a large set of removal attacks. We survey methods from the literature that (i) are known removal attacks, (ii) derive surrogate models but have not been evaluated…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning
