Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design

Yuhao Sun; Yihua Zhang; Gaowen Liu; Hongtao Xie; Sijia Liu

arXiv:2508.10065·cs.CR·August 15, 2025

Invisible Watermarks, Visible Gains: Steering Machine Unlearning with Bi-Level Watermarking Design

Yuhao Sun, Yihua Zhang, Gaowen Liu, Hongtao Xie, Sijia Liu

PDF

TL;DR

This paper introduces Water4MU, a novel watermarking-based approach for machine unlearning that strategically modifies data content to enable precise removal of sensitive data, improving unlearning efficiency and model utility.

Contribution

The paper proposes a bi-level optimization framework, Water4MU, integrating digital watermarking with machine unlearning to enhance data removal effectiveness and model performance.

Findings

01

Water4MU effectively improves unlearning in image classification and generation.

02

Watermarking facilitates precise data removal without degrading unrelated model tasks.

03

Outperforms existing unlearning methods in challenging forget scenarios.

Abstract

With the increasing demand for the right to be forgotten, machine unlearning (MU) has emerged as a vital tool for enhancing trust and regulatory compliance by enabling the removal of sensitive data influences from machine learning (ML) models. However, most MU algorithms primarily rely on in-training methods to adjust model weights, with limited exploration of the benefits that data-level adjustments could bring to the unlearning process. To address this gap, we propose a novel approach that leverages digital watermarking to facilitate MU by strategically modifying data content. By integrating watermarking, we establish a controlled unlearning mechanism that enables precise removal of specified data while maintaining model utility for unrelated tasks. We first examine the impact of watermarked data on MU, finding that MU effectively generalizes to watermarked data. Building on this, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.