Which Concepts to Forget and How to Refuse? Decomposing Concepts for Continual Unlearning in Large Vision-Language Models
Hyundong Jin, Dongyoon Han, Eunwoo Kim

TL;DR
This paper introduces a novel framework for continual unlearning in large vision-language models, enabling them to selectively refuse specific image-instruction pairs by decomposing concepts and using specialized refusal modules, improving precision and utility.
Contribution
The paper proposes a concept-grounded continual unlearning framework with a multimodal routing scheme, enhancing refusal accuracy and utility preservation in vision-language models.
Findings
Outperforms existing methods in concept-grounded refusal responses
Preserves general utility across unlearning sequences
Effectively decomposes visual and textual concepts for targeted unlearning
Abstract
Continual unlearning poses the challenge of enabling large vision-language models to selectively refuse specific image-instruction pairs in response to sequential deletion requests, while preserving general utility. However, sequential unlearning updates distort shared representations, creating spurious associations between vision-language pairs and refusal behaviors that hinder precise identification of refusal targets, resulting in inappropriate refusals. To address this challenge, we propose a novel continual unlearning framework that grounds refusal behavior in fine-grained descriptions of visual and textual concepts decomposed from deletion targets. We first identify which visual-linguistic concept combinations characterize each forget category through a concept modulator, then determine how to generate appropriate refusal responses via a mixture of refusal experts, termed…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Topic Modeling · Domain Adaptation and Few-Shot Learning
