Free Record-Level Privacy Risk Evaluation Through Artifact-Based Methods

Joseph Pollock; Igor Shilov; Euodia Dodd; Yves-Alexandre de Montjoye

arXiv:2411.05743·cs.LG·June 13, 2025

Free Record-Level Privacy Risk Evaluation Through Artifact-Based Methods

Joseph Pollock, Igor Shilov, Euodia Dodd, Yves-Alexandre de Montjoye

PDF

Open Access 1 Repo

TL;DR

This paper introduces LT-IQR, a novel artifact-based method that efficiently identifies training samples vulnerable to membership inference attacks during model training without additional model training, achieving high precision across datasets.

Contribution

The paper presents LT-IQR, a new approach that analyzes loss trajectories during training to identify privacy-sensitive samples without extra computational cost.

Findings

01

LT-IQR achieves 92% precision@k=1% in identifying vulnerable samples.

02

Outperforms traditional vulnerability metrics and lightweight MIAs.

03

Works across datasets and model architectures.

Abstract

Membership inference attacks (MIAs) are widely used to empirically assess privacy risks in machine learning models, both providing model-level vulnerability metrics and identifying the most vulnerable training samples. State-of-the-art methods, however, require training hundreds of shadow models with the same architecture as the target model. This makes the computational cost of assessing the privacy of models prohibitive for many practical applications, particularly when used iteratively as part of the model development process and for large models. We propose a novel approach for identifying the training samples most vulnerable to membership inference attacks by analyzing artifacts naturally available during the training process. Our method, Loss Trace Interquartile Range (LT-IQR), analyzes per-sample loss trajectories collected during model training to identify high-risk samples…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

computationalprivacy/loss_traces
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data