Metric Unreliability in Multimodal Machine Unlearning: A Systematic Analysis and Principled Unified Score

Abdullah Ahmad Khan; Hamid Laga; Ferdous Sohel

arXiv:2605.02206·cs.CV·May 11, 2026

Metric Unreliability in Multimodal Machine Unlearning: A Systematic Analysis and Principled Unified Score

Abdullah Ahmad Khan, Hamid Laga, Ferdous Sohel

PDF

1 Repo

TL;DR

This paper systematically analyzes the reliability of various metrics used in evaluating multimodal machine unlearning in vision-language models, proposing a unified score to improve consistency.

Contribution

It identifies conflicting metric rankings in multimodal unlearning and introduces the Unified Quality Score (UQS) for more stable evaluation.

Findings

01

Five metrics yield conflicting rankings across benchmarks.

02

Agreement among metrics is lower in multimodal VQA than unimodal classification.

03

UQS provides stable and reliable model rankings.

Abstract

Machine unlearning in Vision-Language Models (VLMs) is required for compliance with the General Data Protection Regulation (GDPR), yet current evaluation practices are inconsistent. We present the first systematic study of metric reliability in multimodal unlearning. Five standard metrics, Forget Accuracy (FA), Retain Accuracy (RA), Membership Inference Attack (MIA), Activation Distance (AD), and JS divergence (JS), yield conflicting method rankings across three VQA benchmarks (MLLMU-Bench, UnLOK-VQA, MMUBench). Kendall tau analysis over 36 unlearned LLaVA-1.5-7B models reveals two opposing clusters, {FA, RA, MIA} and {AD, JS}, with tau_FA_AD = -0.26, reproduced on BLIP-2 OPT-2.7B. Agreement is lower in multimodal VQA (average tau = 0.086) than in unimodal classification (average tau = 0.158; difference = 0.072), indicating that dual image-and-text pathways amplify inconsistency. We…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

neurips26/UnifiedUnl
github

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.