From Learning to Unlearning: Biomedical Security Protection in Multimodal Large Language Models
Dunyuan Xu, Xikai Yang, Yaoqian Li, Jinpeng Li, Pheng-Ann Heng

TL;DR
This paper introduces a new benchmark and evaluation method for unlearning in biomedical multimodal large language models, addressing privacy and correctness issues with limited effectiveness of current approaches.
Contribution
It presents the first benchmark dataset for evaluating unlearning in biomedical MLLMs and proposes a novel Unlearning Efficiency Score for comprehensive assessment.
Findings
Current unlearning methods show limited effectiveness in biomedical contexts.
The benchmark effectively simulates private data and factual errors.
The proposed score provides a new metric for unlearning performance.
Abstract
The security of biomedical Multimodal Large Language Models (MLLMs) has attracted increasing attention. However, training samples easily contain private information and incorrect knowledge that are difficult to detect, potentially leading to privacy leakage or erroneous outputs after deployment. An intuitive idea is to reprocess the training set to remove unwanted content and retrain the model from scratch. Yet, this is impractical due to significant computational costs, especially for large language models. Machine unlearning has emerged as a solution to this problem, which avoids complete retraining by selectively removing undesired knowledge derived from harmful samples while preserving required capabilities on normal cases. However, there exist no available datasets to evaluate the unlearning quality for security protection in biomedical MLLMs. To bridge this gap, we propose the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
