Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench

Zheyuan Liu; Guangyao Dou; Mengzhao Jia; Zhaoxuan Tan; Qingkai Zeng,; Yongle Yuan; Meng Jiang

arXiv:2410.22108·cs.CL·February 18, 2025

Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench

Zheyuan Liu, Guangyao Dou, Mengzhao Jia, Zhaoxuan Tan, Qingkai Zeng,, Yongle Yuan, Meng Jiang

PDF

Open Access 1 Repo 2 Datasets 1 Video

TL;DR

This paper introduces MLLMU-Bench, a new benchmark for evaluating the effectiveness of unlearning algorithms in multimodal large language models, addressing privacy concerns by assessing how models forget sensitive data.

Contribution

The paper presents MLLMU-Bench, the first comprehensive benchmark for multimodal model unlearning, including diverse profiles and evaluation metrics for efficacy, generalizability, and utility.

Findings

01

Unimodal unlearning algorithms perform well in generation and cloze tasks.

02

Multimodal unlearning approaches excel in classification tasks with multimodal inputs.

03

Baseline results demonstrate varying effectiveness across different unlearning algorithms.

Abstract

Generative models such as Large Language Models (LLM) and Multimodal Large Language models (MLLMs) trained on massive web corpora can memorize and disclose individuals' confidential and private data, raising legal and ethical concerns. While many previous works have addressed this issue in LLM via machine unlearning, it remains largely unexplored for MLLMs. To tackle this challenge, we introduce Multimodal Large Language Model Unlearning Benchmark (MLLMU-Bench), a novel benchmark aimed at advancing the understanding of multimodal machine unlearning. MLLMU-Bench consists of 500 fictitious profiles and 153 profiles for public celebrities, each profile feature over 14 customized question-answer pairs, evaluated from both multimodal (image+text) and unimodal (text) perspectives. The benchmark is divided into four sets to assess unlearning algorithms in terms of efficacy, generalizability,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

franciscoliu/MLLMU-Bench
pytorchOfficial

Datasets

Videos

Protecting Privacy in Multimodal Large Language Models with MLLMU-Bench· underline

Taxonomy

TopicsPrivacy-Preserving Technologies in Data