Machine Unlearning for Masked Diffusion Language Models

Georu Lee; Seungwon Jeong; Hoki Kim; Jinseong Park; Woojin Lee

arXiv:2605.18253·cs.CL·May 19, 2026

Machine Unlearning for Masked Diffusion Language Models

Georu Lee, Seungwon Jeong, Hoki Kim, Jinseong Park, Woojin Lee

PDF

1 Repo

TL;DR

This paper introduces Masked Diffusion Unlearning (MDU), a novel framework for unlearning specific knowledge in masked diffusion language models, demonstrating superior performance on benchmarks.

Contribution

The paper presents the first unlearning method tailored for MDLMs, utilizing diffusion process insights to effectively unlearn knowledge while balancing privacy and utility.

Findings

01

MDU achieves high unlearning performance compared to existing methods.

02

Empirical results on benchmarks validate MDU's effectiveness.

03

Code is publicly available at the provided GitHub link.

Abstract

Recent masked diffusion language models (MDLMs), such as LLaDA and Dream, have achieved performance comparable to autoregressive large language models. Unlike autoregressive models, which generate text sequentially, MDLMs generate text by iteratively denoising masked positions in parallel. During fine-tuning, MDLMs learn to recover responses from masked response states conditioned on a prompt, thereby shifting their predictions from a prompt-masked unconditional distribution toward a prompt-conditional distribution. Despite this distinct generative and fine-tuning mechanism, machine unlearning for MDLMs remains largely unexplored. In this paper, we propose Masked Diffusion Unlearning (MDU), the first unlearning framework for MDLMs, by revisiting the process of learning specific knowledge in terms of diffusion. Specifically, MDU minimizes a forward KL divergence from the…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

leegeoru/MDU
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.