Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models

Hongji Li; Junchi yao; Manjiang Yu; Priyanka Singh; Xue Li; Di Wang; Lijie Hu

arXiv:2512.17911·cs.CL·December 23, 2025

Towards Reasoning-Preserving Unlearning in Multimodal Large Language Models

Hongji Li, Junchi yao, Manjiang Yu, Priyanka Singh, Xue Li, Di Wang, Lijie Hu

PDF

Open Access

TL;DR

This paper introduces RMLLMU-Bench, a benchmark for evaluating unlearning in multimodal large language models, and proposes R-MUSE, a method that effectively forgets sensitive information while preserving reasoning ability.

Contribution

The paper presents the first benchmark for reasoning-preserving unlearning in RMLLMs and a novel inference-time intervention method that balances forgetting with reasoning retention.

Findings

01

Existing methods either leak reasoning information or degrade reasoning ability.

02

R-MUSE outperforms baselines in balancing forgetting and reasoning retention.

03

Benchmark results highlight the need for specialized unlearning approaches.

Abstract

Machine unlearning aims to erase requested data from trained models without full retraining. For Reasoning Multimodal Large Language Models (RMLLMs), this is uniquely challenging: intermediate chain-of-thought steps can still leak sensitive information even when final answers are forgotten, and overly aggressive interventions easily damage general reasoning ability. Yet no benchmark jointly evaluates how well unlearning methods suppress reasoning-level leakage while preserving reasoning competence. We address this gap with RMLLMU-Bench, the first benchmark for RMLLM unlearning that extends standard forgetting metrics with dedicated measures of reasoning leakage and reasoning retention. A systematic evaluation on RMLLMU-Bench reveals that existing unlearning methods for MLLMs and Large (Language) Reasoning Models (LRMs) either leave substantial leakage in the reasoning process or…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Multimodal Machine Learning Applications · Explainable Artificial Intelligence (XAI)