A Game Between the Defender and the Attacker for Trigger-based Black-box Model Watermarking

Chaoyue Huang; Hanzhou Wu

arXiv:2501.01194·cs.CR·January 14, 2025

Open Access

TL;DR

This paper introduces a game-theoretic framework for trigger-based black-box watermarking of DNNs, providing a theoretical foundation for designing more robust watermarking schemes against attackers.

Contribution

It formulates a game between attacker and defender, defining payoff functions and optimal responses to advance the theoretical understanding of model watermarking.
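
To make the notions of payoff function and optimal (best) response concrete, here is a minimal sketch of a two-player game in matrix form. It is only an illustration: the strategy labels and every payoff value are hypothetical and are not taken from the paper, whose actual payoff functions are not reproduced on this page. The helper names (`best_rows`, `best_cols`, `pure_nash`) are likewise assumptions made for this example.

```python
# Toy illustration only: the strategies and payoff numbers are hypothetical
# and are NOT the payoff functions defined in the paper.
from typing import List, Tuple

Matrix = List[List[float]]


def best_rows(payoff: Matrix, col: int) -> List[int]:
    """Rows that maximize the row player's payoff against column `col`."""
    column = [row[col] for row in payoff]
    best = max(column)
    return [i for i, value in enumerate(column) if value == best]


def best_cols(payoff: Matrix, row: int) -> List[int]:
    """Columns that maximize the column player's payoff against row `row`."""
    best = max(payoff[row])
    return [j for j, value in enumerate(payoff[row]) if value == best]


def pure_nash(defender: Matrix, attacker: Matrix) -> List[Tuple[int, int]]:
    """Strategy pairs where each player best-responds to the other."""
    equilibria = []
    for i in range(len(defender)):
        for j in range(len(defender[0])):
            if i in best_rows(defender, j) and j in best_cols(attacker, i):
                equilibria.append((i, j))
    return equilibria


# Rows: defender embeds a weak (0) or strong (1) trigger-based watermark.
# Columns: attacker applies a mild (0) or aggressive (1) removal attack.
DEFENDER = [[3, 0],
            [2, 1]]
ATTACKER = [[1, 2],
            [0, 1]]

print(pure_nash(DEFENDER, ATTACKER))  # [(1, 1)]
```

In this made-up game the aggressive attack is the attacker's best response to either defender strategy, and the strong watermark is the defender's best response to it, so (strong, aggressive) is the only pure-strategy equilibrium.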

Findings

01. Constructed payoff functions for attacker and defender

02. Determined optimal strategies for both players

03. Enriched the theoretical foundation of watermarking

Abstract

Watermarking deep neural network (DNN) models has attracted a great deal of attention and interest in recent years because of the increasing demand to protect the intellectual property of DNN models. Many practical algorithms have been proposed by covertly embedding a secret watermark into a given DNN model through either parametric/structural modulation or backdooring against intellectual property infringement from the attacker while preserving the model performance on the original task. Despite the performance of these approaches, the lack of basic research restricts the algorithmic design to either a trial-based method or a data-driven technique. This has motivated the authors in this paper to introduce a game between the model attacker and the model defender for trigger-based black-box model watermarking. For each of the two players, we construct the payoff function and determine…

Taxonomy

Topics: Advanced Steganography and Watermarking Techniques · Digital Media Forensic Detection · Internet Traffic Analysis and Secure E-voting