CLEAR: Character Unlearning in Textual and Visual Modalities

Alexey Dontsov; Dmitrii Korzh; Alexey Zhavoronkin; Boris Mikheev; Denis Bobkov; Aibek Alanov; Oleg Y. Rogov; Ivan Oseledets; Elena Tutubalina

arXiv:2410.18057·cs.CV·August 18, 2025

CLEAR: Character Unlearning in Textual and Visual Modalities

Alexey Dontsov, Dmitrii Korzh, Alexey Zhavoronkin, Boris Mikheev, Denis Bobkov, Aibek Alanov, Oleg Y. Rogov, Ivan Oseledets, Elena Tutubalina

PDF

Open Access 1 Repo 1 Datasets 1 Video

TL;DR

CLEAR introduces the first open benchmark for multimodal unlearning, enabling evaluation of methods that remove private information from combined text and visual data, highlighting the importance of joint modality unlearning.

Contribution

This paper presents CLEAR, the first open-source benchmark for multimodal unlearning, facilitating systematic evaluation and comparison of unlearning methods across text and visual modalities.

Findings

01

Joint unlearning of text and images outperforms single-modality approaches.

02

Evaluation of 11 MU methods across four datasets shows varying effectiveness.

03

CLEAR enables comprehensive assessment of multimodal unlearning techniques.

Abstract

Machine Unlearning (MU) is critical for removing private or hazardous information from deep learning models. While MU has advanced significantly in unimodal (text or vision) settings, multimodal unlearning (MMU) remains underexplored due to the lack of open benchmarks for evaluating cross-modal data removal. To address this gap, we introduce CLEAR, the first open-source benchmark designed specifically for MMU. CLEAR contains 200 fictitious individuals and 3,700 images linked with corresponding question-answer pairs, enabling a thorough evaluation across modalities. We conduct a comprehensive analysis of 11 MU methods (e.g., SCRUB, gradient ascent, DPO) across four evaluation sets, demonstrating that jointly unlearning both modalities outperforms single-modality approaches. The dataset is available at https://huggingface.co/datasets/therem/CLEAR

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

somvy/multimodal_unlearning
pytorch

Datasets

therem/CLEAR
dataset· 930 dl
930 dl

Videos

CLEAR: Character Unlearning in Textual and Visual Modalities· underline

Taxonomy

TopicsSecond Language Acquisition and Learning