AUVIC: Adversarial Unlearning of Visual Concepts for Multi-modal Large Language Models
Haokun Chen, Jianing Li, Yao Zhang, Jinhe Bi, Yan Xia, Jindong Gu, Volker Tresp

TL;DR
AUVIC introduces an adversarial unlearning framework for visual concepts in multimodal large language models, enabling precise removal of sensitive data with minimal impact on related knowledge, and provides a new benchmark for evaluation.
Contribution
The paper presents AUVIC, a novel adversarial unlearning method for visual concepts in MLLMs, and introduces VCUBench, the first benchmark for evaluating visual concept unlearning.
Findings
AUVIC achieves state-of-the-art target forgetting rates.
Minimal performance degradation on non-target concepts.
Effective isolation of target visual concepts during unlearning.
Abstract
Multimodal Large Language Models (MLLMs) achieve impressive performance once optimized on massive datasets. Such datasets often contain sensitive or copyrighted content, raising significant data privacy concerns. Regulatory frameworks mandating the 'right to be forgotten' drive the need for machine unlearning. This technique allows for the removal of target data without resource-consuming retraining. However, while well-studied for text, visual concept unlearning in MLLMs remains underexplored. A primary challenge is precisely removing a target visual concept without disrupting model performance on related entities. To address this, we introduce AUVIC, a novel visual concept unlearning framework for MLLMs. AUVIC applies adversarial perturbations to enable precise forgetting. This approach effectively isolates the target concept while avoiding unintended effects on similar entities. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
Taxonomy
TopicsMultimodal Machine Learning Applications · Domain Adaptation and Few-Shot Learning · Topic Modeling
