MEA-Defender: A Robust Watermark against Model Extraction Attack
Peizhuo Lv, Hualong Ma, Kai Chen, Jiachen Zhou, Shengzhi Zhang,, Ruigang Liang, Shenchen Zhu, Pan Li, and Yingjun Zhang

TL;DR
This paper introduces MEA-Defender, a novel watermarking technique designed to protect DNN models from model extraction attacks by embedding a watermark that remains intact during such attacks.
Contribution
The paper proposes a new watermarking method that combines input samples from different classes and ensures the watermark's robustness against model extraction attacks.
Findings
MEA-Defender is highly robust against multiple model extraction attacks.
The watermark remains detectable even after various removal and detection attempts.
Extensive experiments validate the effectiveness across different models and datasets.
Abstract
Recently, numerous highly-valuable Deep Neural Networks (DNNs) have been trained using deep learning algorithms. To protect the Intellectual Property (IP) of the original owners over such DNN models, backdoor-based watermarks have been extensively studied. However, most of such watermarks fail upon model extraction attack, which utilizes input samples to query the target model and obtains the corresponding outputs, thus training a substitute model using such input-output pairs. In this paper, we propose a novel watermark to protect IP of DNN models against model extraction, named MEA-Defender. In particular, we obtain the watermark by combining two samples from two source classes in the input domain and design a watermark loss function that makes the output domain of the watermark within that of the main task samples. Since both the input domain and the output domain of our watermark…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Steganography and Watermarking Techniques · Internet Traffic Analysis and Secure E-voting · Vehicle License Plate Recognition
