Attention Distraction: Watermark Removal Through Continual Learning with Selective Forgetting
Qi Zhong, Leo Yu Zhang, Shengshan Hu, Longxiang Gao, Jun, Zhang, Yong Xiang

TL;DR
This paper introduces Attention Distraction, a novel source data-free watermark removal method that uses continual learning and selective forgetting to effectively erase watermarks without harming model performance.
Contribution
It proposes a new watermark removal attack that leverages continual learning and unlabeled data to selectively forget watermarks without source data.
Findings
Successfully removes watermarks across various datasets and models.
Achieves watermark removal with minimal resource usage.
Outperforms existing state-of-the-art watermark removal methods.
Abstract
Fine-tuning attacks are effective in removing the embedded watermarks in deep learning models. However, when the source data is unavailable, it is challenging to just erase the watermark without jeopardizing the model performance. In this context, we introduce Attention Distraction (AD), a novel source data-free watermark removal attack, to make the model selectively forget the embedded watermarks by customizing continual learning. In particular, AD first anchors the model's attention on the main task using some unlabeled data. Then, through continual learning, a small number of \textit{lures} (randomly selected natural images) that are assigned a new label distract the model's attention away from the watermarks. Experimental results from different datasets and networks corroborate that AD can thoroughly remove the watermark with a small resource budget without compromising the model's…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeophysical Methods and Applications · Adversarial Robustness in Machine Learning · Advanced Neural Network Applications
