PRIVET: Privacy Metric Based on Extreme Value Theory

Antoine Szatkownik (TAU; BioInfo); Aur\'elien Decelle; Beatriz Seoane (TAU); Nicolas Bereux (TAU); L\'eo Planche (BioInfo); Guillaume Charpiat (TAU); Burak Yelmen; Flora Jay (BioInfo; TAU); Cyril Furtlehner (TAU)

arXiv:2510.24233·cs.LG·October 29, 2025

PRIVET: Privacy Metric Based on Extreme Value Theory

Antoine Szatkownik (TAU, BioInfo), Aur\'elien Decelle, Beatriz Seoane (TAU), Nicolas Bereux (TAU), L\'eo Planche (BioInfo), Guillaume Charpiat (TAU), Burak Yelmen, Flora Jay (BioInfo, TAU), Cyril Furtlehner (TAU)

PDF

Open Access 3 Reviews

TL;DR

PRIVET is a novel privacy evaluation method using extreme value theory to assign individual privacy leak scores to synthetic data samples, improving detection of privacy risks across diverse data types.

Contribution

It introduces a sample-based, modality-agnostic algorithm for privacy assessment that outperforms existing global criteria methods.

Findings

01

Reliably detects memorization and privacy leakage in high-dimensional data.

02

Provides both dataset-level and sample-level privacy assessments.

03

Effective even with limited data or underfitting regimes.

Abstract

Deep generative models are often trained on sensitive data, such as genetic sequences, health data, or more broadly, any copyrighted, licensed or protected content. This raises critical concerns around privacy-preserving synthetic data, and more specifically around privacy leakage, an issue closely tied to overfitting. Existing methods almost exclusively rely on global criteria to estimate the risk of privacy failure associated to a model, offering only quantitative non interpretable insights. The absence of rigorous evaluation methods for data privacy at the sample-level may hinder the practical deployment of synthetic data in real-world applications. Using extreme value statistics on nearest-neighbor distances, we propose PRIVET, a generic sample-based, modality-agnostic algorithm that assigns an individual privacy leak score to each synthetic sample. We empirically demonstrate that…

Peer Reviews

Decision·Submitted to ICLR 2026

Reviewer 01Rating 4Confidence 4

Strengths

Another metric that addresses an important privacy problem. Authors provide some interesting examples based on pseudo-synthetic data that suits their distance metrics and embeddings.

Weaknesses

Only a single experiment using an RBM-generated synthetic dataset is presented, which limits the generality of the findings. Including additional datasets would strengthen the empirical validation. Moreover, the current results do not clearly demonstrate whether the proposed approach detects any actual privacy risk. One potential improvement would be to employ a data generator with a controlled privacy parameter—for example, a differentially private generator with varying epsilon values—and ev

Reviewer 02Rating 4Confidence 2

Strengths

1) Broad empirical coverage. The study spans high-dimensional genetics, computer vision with modern embeddings, and a membership-attack scenario, which together demonstrate versatility across modalities and training regimes, including underfitting. 2) Interpretability and actionable outputs. The ability to flag specific synthetic samples as probable leaks and to summarize with an estimated number of leaks is useful in governance workflows. The method surfaces when embedding choices compromise de

Weaknesses

1) Embedding dependence undermines "modality-agnostic" positioning. Results show that detectability varies substantially with the representation, with failures for certain image transforms under DINOv2 and improvements with wavelets. The method is only as good as the embedding and distance, which weakens the claim of generality. A guidance section on selecting or learning privacy-aware embeddings, or an adaptive metric learning step, would be valuable. 2) Figure~1 readability. The numerical anno

Reviewer 03Rating 4Confidence 3

Strengths

1. The paper applies extreme value theory to model the nearest-neighbor distance tails, providing a statistically grounded method for privacy leakage detection. 2. PRIVET can generate interpretable, per-sample privacy scores while remaining consistent with dataset-level statistics, which bridges the gap between global metrics (like AATS, FLD) and local boolean methods (like AUTH). 3. Experiments on both genetic and image datasets demonstrate the versatility of the method, showing good scalabil

Weaknesses

1. Table 1’s distinction between “interpretable” and “non-interpretable” metrics is not clearly defined. It appears that *interpretability* is equated with having a boolean “leak / not leak” label, but the table also lists a “real value” column without clear justification. Moreover, the rationale for classifying methods such as AUTH as interpretable only at the sample level, but not at the dataset level, is insufficiently explained. 2. The paper claims that prior metrics like FLD and PQMass fai

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Advanced Malware Detection Techniques