MEA-Defender: A Robust Watermark against Model Extraction Attack

Peizhuo Lv; Hualong Ma; Kai Chen; Jiachen Zhou; Shengzhi Zhang,; Ruigang Liang; Shenchen Zhu; Pan Li; and Yingjun Zhang

arXiv:2401.15239·cs.CR·January 30, 2024·1 cites

MEA-Defender: A Robust Watermark against Model Extraction Attack

Peizhuo Lv, Hualong Ma, Kai Chen, Jiachen Zhou, Shengzhi Zhang,, Ruigang Liang, Shenchen Zhu, Pan Li, and Yingjun Zhang

PDF

Open Access 1 Repo

TL;DR

This paper introduces MEA-Defender, a novel watermarking technique designed to protect DNN models from model extraction attacks by embedding a watermark that remains intact during such attacks.

Contribution

The paper proposes a new watermarking method that combines input samples from different classes and ensures the watermark's robustness against model extraction attacks.

Findings

01

MEA-Defender is highly robust against multiple model extraction attacks.

02

The watermark remains detectable even after various removal and detection attempts.

03

Extensive experiments validate the effectiveness across different models and datasets.

Abstract

Recently, numerous highly-valuable Deep Neural Networks (DNNs) have been trained using deep learning algorithms. To protect the Intellectual Property (IP) of the original owners over such DNN models, backdoor-based watermarks have been extensively studied. However, most of such watermarks fail upon model extraction attack, which utilizes input samples to query the target model and obtains the corresponding outputs, thus training a substitute model using such input-output pairs. In this paper, we propose a novel watermark to protect IP of DNN models against model extraction, named MEA-Defender. In particular, we obtain the watermark by combining two samples from two source classes in the input domain and design a watermark loss function that makes the output domain of the watermark within that of the main task samples. Since both the input domain and the output domain of our watermark…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

lvpeizhuo/mea-defender
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Vehicle License Plate Recognition