FiMMIA: scaling semantic perturbation-based membership inference across modalities

Anton Emelyanov; Sergei Kudriashov; Alena Fenogenova

arXiv:2512.02786·cs.LG·December 3, 2025

FiMMIA: scaling semantic perturbation-based membership inference across modalities

Anton Emelyanov, Sergei Kudriashov, Alena Fenogenova

PDF

Open Access 1 Video

TL;DR

This paper introduces FiMMIA, a modular framework that extends perturbation-based membership inference attacks to multimodal large language models, addressing distribution shifts and improving detection of data membership across modalities.

Contribution

The work generalizes membership inference methods to multimodal models and releases a comprehensive framework with extended baseline pipelines for better detection of data contamination.

Findings

01

Effective in detecting data membership in multimodal models

02

Addresses distribution shifts in datasets

03

Provides publicly available code and framework

Abstract

Membership Inference Attacks (MIAs) aim to determine whether a specific data point was included in the training set of a target model. Although there are have been numerous methods developed for detecting data contamination in large language models (LLMs), their performance on multimodal LLMs (MLLMs) falls short due to the instabilities introduced through multimodal component adaptation and possible distribution shifts across multiple inputs. In this work, we investigate multimodal membership inference and address two issues: first, by identifying distribution shifts in the existing datasets, and second, by releasing an extended baseline pipeline to detect them. We also generalize the perturbation-based membership inference methods to MLLMs and release \textbf{FiMMIA} -- a modular \textbf{F}ramework for \textbf{M}ultimodal \textbf{MIA}.\footnote{The source code and framework have been…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

FiMMIA: scaling semantic perturbation-based membership inference across modalities· underline

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Topic Modeling · Advanced Graph Neural Networks