EL-MIA: Quantifying Membership Inference Risks of Sensitive Entities in LLMs

Ali Satvaty; Suzan Verberne; Fatih Turkmen

arXiv:2511.00192·cs.LG·November 4, 2025

EL-MIA: Quantifying Membership Inference Risks of Sensitive Entities in LLMs

Ali Satvaty, Suzan Verberne, Fatih Turkmen

PDF

Open Access

TL;DR

This paper introduces EL-MIA, a framework for assessing the risk of sensitive entity membership inference in large language models, revealing current methods' limitations and the need for stronger adversaries.

Contribution

The paper proposes a novel entity-level membership inference framework, constructs a benchmark dataset, and systematically evaluates existing and new MIA methods for LLMs.

Findings

01

Existing MIA methods have limited effectiveness at entity-level inference.

02

Entity membership susceptibility correlates with model scale and training epochs.

03

Simple methods can outline entity-level risks, indicating a need for stronger adversarial testing.

Abstract

Membership inference attacks (MIA) aim to infer whether a particular data point is part of the training dataset of a model. In this paper, we propose a new task in the context of LLM privacy: entity-level discovery of membership risk focused on sensitive information (PII, credit card numbers, etc). Existing methods for MIA can detect the presence of entire prompts or documents in the LLM training data, but they fail to capture risks at a finer granularity. We propose the ``EL-MIA'' framework for auditing entity-level membership risks in LLMs. We construct a benchmark dataset for the evaluation of MIA methods on this task. Using this benchmark, we conduct a systematic comparison of existing MIA techniques as well as two newly proposed methods. We provide a comprehensive analysis of the results, trying to explain the relation of the entity level MIA susceptability with the model scale,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Advanced Graph Neural Networks