Null Space Constrained Contrastive Visual Forgetting for MLLM Unlearning
Yuhang Wang, Zhenxing Niu, Haoxuan Ji, Guangyu He, Linlin Zhang, Haichang Gao

TL;DR
This paper presents a novel method for unlearning specific visual knowledge in Multimodal Large Language Models by using contrastive mechanisms and null space constraints to balance forgetting and retention.
Contribution
It introduces a contrastive visual forgetting mechanism and null space constraints for effective unlearning in MLLMs, extending to continual unlearning scenarios.
Findings
Achieves effective target visual knowledge removal while preserving non-target knowledge.
Significantly reduces degradation of retained knowledge during unlearning.
Demonstrates strong performance across diverse benchmarks.
Abstract
The core challenge of machine unlearning is to strike a balance between target knowledge removal and non-target knowledge retention. In the context of Multimodal Large Language Models (MLLMs), this challenge becomes even more pronounced, as knowledge is further divided into visual and textual modalities that are tightly intertwined. In this paper, we introduce an MLLM unlearning approach that aims to forget target visual knowledge while preserving non-target visual knowledge and all textual knowledge. Specifically, we freeze the LLM backbone and achieve unlearning by fine-tuning the visual module. First, we propose a Contrastive Visual Forgetting (CVF) mechanism to separate target visual knowledge from retained visual knowledge, guiding the representations of target visual concepts toward appropriate regions in the feature space. Second, we identify the null space associated with…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
