Machine Learning with Membership Privacy using Adversarial Regularization
Milad Nasr, Reza Shokri, Amir Houmansadr

TL;DR
This paper proposes an adversarial regularization method to train machine learning models that protect against membership inference attacks, ensuring predictions on training data are indistinguishable from other data, thus enhancing privacy without significant utility loss.
Contribution
It introduces a min-max adversarial training framework that provably achieves membership privacy and improves model robustness against inference attacks.
Findings
Mitigates membership inference attack risks close to random guessing
Maintains high classification accuracy with negligible utility loss
Provides a formal privacy guarantee through adversarial training
Abstract
Machine learning models leak information about the datasets on which they are trained. An adversary can build an algorithm to trace the individual members of a model's training dataset. As a fundamental inference attack, he aims to distinguish between data points that were part of the model's training set and any other data points from the same distribution. This is known as the tracing (and also membership inference) attack. In this paper, we focus on such attacks against black-box models, where the adversary can only observe the output of the model, but not its parameters. This is the current setting of machine learning as a service in the Internet. We introduce a privacy mechanism to train machine learning models that provably achieve membership privacy: the model's predictions on its training data are indistinguishable from its predictions on other data points from the same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques
