On the Limitations and Prospects of Machine Unlearning for Generative AI
Shiji Zhou, Lianzhe Wang, Jiangnan Ye, Yongliang Wu, Heng Chang

TL;DR
This paper discusses the challenges and future prospects of applying machine unlearning techniques to generative AI models, focusing on large language models and diffusion models, to enhance safety and alignment.
Contribution
It formulates the unlearning problem for GenAI, analyzes current limitations, and proposes future directions including benchmarks, metrics, and utility-unlearning trade-offs.
Findings
Identifies key limitations of machine unlearning in GenAI.
Highlights the need for standardized benchmarks and evaluation metrics.
Discusses the trade-off between utility and unlearning effectiveness.
Abstract
Generative AI (GenAI), which aims to synthesize realistic and diverse data samples from latent variables or other data modalities, has achieved remarkable results in various domains, such as natural language, images, audio, and graphs. However, they also pose challenges and risks to data privacy, security, and ethics. Machine unlearning is the process of removing or weakening the influence of specific data samples or features from a trained model, without affecting its performance on other data or tasks. While machine unlearning has shown significant efficacy in traditional machine learning tasks, it is still unclear if it could help GenAI become safer and aligned with human desire. To this end, this position paper provides an in-depth discussion of the machine unlearning approaches for GenAI. Firstly, we formulate the problem of machine unlearning tasks on GenAI and introduce the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Data Processing Techniques
