SoK: How Robust is Image Classification Deep Neural Network   Watermarking? (Extended Version)

Nils Lukas; Edward Jiang; Xinda Li; Florian Kerschbaum

arXiv:2108.04974·cs.CR·August 12, 2021·5 cites

SoK: How Robust is Image Classification Deep Neural Network Watermarking? (Extended Version)

Nils Lukas, Edward Jiang, Xinda Li, Florian Kerschbaum

PDF

Open Access 1 Repo

TL;DR

This paper systematically evaluates the robustness of existing DNN watermarking schemes against a comprehensive set of removal attacks, revealing that none are truly robust in practice, highlighting flaws in current evaluation methods.

Contribution

It provides the first extensive empirical evaluation of watermarking schemes against diverse removal attacks, including novel methods, and proposes taxonomies for better understanding.

Findings

01

All surveyed schemes fail under comprehensive attacks

02

Adaptive attacks and untested removal methods compromise robustness

03

Current evaluation practices are insufficient and flawed

Abstract

Deep Neural Network (DNN) watermarking is a method for provenance verification of DNN models. Watermarking should be robust against watermark removal attacks that derive a surrogate model that evades provenance verification. Many watermarking schemes that claim robustness have been proposed, but their robustness is only validated in isolation against a relatively small set of attacks. There is no systematic, empirical evaluation of these claims against a common, comprehensive set of removal attacks. This uncertainty about a watermarking scheme's robustness causes difficulty to trust their deployment in practice. In this paper, we evaluate whether recently proposed watermarking schemes that claim robustness are robust against a large set of removal attacks. We survey methods from the literature that (i) are known removal attacks, (ii) derive surrogate models but have not been evaluated…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dnn-security/Watermark-Robustness-Toolbox
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Adversarial Robustness in Machine Learning