Attention Distraction: Watermark Removal Through Continual Learning with   Selective Forgetting

Qi Zhong; Leo Yu Zhang; Shengshan Hu; Longxiang Gao; Jun; Zhang; Yong Xiang

arXiv:2204.01934·cs.CV·April 6, 2022

Attention Distraction: Watermark Removal Through Continual Learning with Selective Forgetting

Qi Zhong, Leo Yu Zhang, Shengshan Hu, Longxiang Gao, Jun, Zhang, Yong Xiang

PDF

Open Access

TL;DR

This paper introduces Attention Distraction, a novel source data-free watermark removal method that uses continual learning and selective forgetting to effectively erase watermarks without harming model performance.

Contribution

It proposes a new watermark removal attack that leverages continual learning and unlabeled data to selectively forget watermarks without source data.

Findings

01

Successfully removes watermarks across various datasets and models.

02

Achieves watermark removal with minimal resource usage.

03

Outperforms existing state-of-the-art watermark removal methods.

Abstract

Fine-tuning attacks are effective in removing the embedded watermarks in deep learning models. However, when the source data is unavailable, it is challenging to just erase the watermark without jeopardizing the model performance. In this context, we introduce Attention Distraction (AD), a novel source data-free watermark removal attack, to make the model selectively forget the embedded watermarks by customizing continual learning. In particular, AD first anchors the model's attention on the main task using some unlabeled data. Then, through continual learning, a small number of \textit{lures} (randomly selected natural images) that are assigned a new label distract the model's attention away from the watermarks. Experimental results from different datasets and networks corroborate that AD can thoroughly remove the watermark with a small resource budget without compromising the model's…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsGeophysical Methods and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications