Have You Stolen My Model? Evasion Attacks Against Deep Neural Network Watermarking Techniques
Dorjan Hitaj, Luigi V. Mancini

TL;DR
This paper investigates the robustness of deep neural network watermarking techniques, revealing that adversaries can evade detection even when watermark removal is difficult, thus challenging current copyright protection methods.
Contribution
It critically evaluates state-of-the-art DNN watermarking schemes and demonstrates their vulnerabilities to evasion attacks, highlighting the need for more robust protection methods.
Findings
Adversaries can evade watermark detection despite removal difficulty.
Current watermarking schemes are vulnerable to evasion attacks.
Watermarking may not reliably prevent model theft detection.
Abstract
Deep neural networks have had enormous impact on various domains of computer science, considerably outperforming previous state of the art machine learning techniques. To achieve this performance, neural networks need large quantities of data and huge computational resources, which heavily increases their construction costs. The increased cost of building a good deep neural network model gives rise to a need for protecting this investment from potential copyright infringements. Legitimate owners of a machine learning model want to be able to reliably track and detect a malicious adversary that tries to steal the intellectual property related to the model. Recently, this problem was tackled by introducing in deep neural networks the concept of watermarking, which allows a legitimate owner to embed some secret information(watermark) in a given model. The watermark allows the legitimate…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Digital Media Forensic Detection · Advanced Steganography and Watermarking Techniques
