Free Record-Level Privacy Risk Evaluation Through Artifact-Based Methods
Joseph Pollock, Igor Shilov, Euodia Dodd, Yves-Alexandre de Montjoye

TL;DR
This paper introduces LT-IQR, a novel artifact-based method that efficiently identifies training samples vulnerable to membership inference attacks during model training without additional model training, achieving high precision across datasets.
Contribution
The paper presents LT-IQR, a new approach that analyzes loss trajectories during training to identify privacy-sensitive samples without extra computational cost.
Findings
LT-IQR achieves 92% precision@k=1% in identifying vulnerable samples.
Outperforms traditional vulnerability metrics and lightweight MIAs.
Works across datasets and model architectures.
Abstract
Membership inference attacks (MIAs) are widely used to empirically assess privacy risks in machine learning models, both providing model-level vulnerability metrics and identifying the most vulnerable training samples. State-of-the-art methods, however, require training hundreds of shadow models with the same architecture as the target model. This makes the computational cost of assessing the privacy of models prohibitive for many practical applications, particularly when used iteratively as part of the model development process and for large models. We propose a novel approach for identifying the training samples most vulnerable to membership inference attacks by analyzing artifacts naturally available during the training process. Our method, Loss Trace Interquartile Range (LT-IQR), analyzes per-sample loss trajectories collected during model training to identify high-risk samples…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data
