A Blessing of Dimensionality in Membership Inference through Regularization
Jasper Tan, Daniel LeJeune, Blake Mason, Hamid Javadi, Richard G., Baraniuk

TL;DR
This paper reveals that, contrary to common belief, increasing model parameters with proper regularization can enhance both privacy and utility, demonstrating a 'blessing of dimensionality' in membership inference vulnerability.
Contribution
It introduces the concept that regularization can turn overparameterization into a privacy-preserving advantage, supported by theoretical analysis and empirical validation.
Findings
Increasing parameters can lower privacy without regularization.
Proper regularization can improve both privacy and performance.
Empirical results on neural networks confirm the 'blessing of dimensionality'.
Abstract
Is overparameterization a privacy liability? In this work, we study the effect that the number of parameters has on a classifier's vulnerability to membership inference attacks. We first demonstrate how the number of parameters of a model can induce a privacy--utility trade-off: increasing the number of parameters generally improves generalization performance at the expense of lower privacy. However, remarkably, we then show that if coupled with proper regularization, increasing the number of parameters of a model can actually simultaneously increase both its privacy and performance, thereby eliminating the privacy--utility trade-off. Theoretically, we demonstrate this curious phenomenon for logistic regression with ridge regularization in a bi-level feature ensemble setting. Pursuant to our theoretical exploration, we develop a novel leave-one-out analysis tool to precisely…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Machine Learning and Data Classification
MethodsEarly Stopping · Logistic Regression
