TL;DR
This paper introduces a novel method for measuring privacy risks in machine learning models by analyzing the perturbation needed to generate adversarial examples, effectively inferring membership information without extra training data.
Contribution
The authors propose a new approach to membership inference using adversarial example perturbations, outperforming existing methods without additional training data.
Findings
Method performs comparably or better than state-of-the-art approaches.
No additional training samples needed for the proposed method.
Effective across various models and multivariate data.
Abstract
The use of personal data for training machine learning systems comes with a privacy threat and measuring the level of privacy of a model is one of the major challenges in machine learning today. Identifying training data based on a trained model is a standard way of measuring the privacy risks induced by the model. We develop a novel approach to address the problem of membership inference in pattern recognition models, relying on information provided by adversarial examples. The strategy we propose consists of measuring the magnitude of a perturbation necessary to build an adversarial example. Indeed, we argue that this quantity reflects the likelihood of belonging to the training data. Extensive numerical experiments on multivariate data and an array of state-of-the-art target models show that our method performs comparable or even outperforms state-of-the-art strategies, but without…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
