On the Effectiveness of Regularization Against Membership Inference Attacks
Yigitcan Kaya, Sanghyun Hong, Tudor Dumitras

TL;DR
This paper systematically evaluates how various regularization techniques affect the privacy of deep learning models against membership inference attacks, revealing that some mechanisms may inadvertently aid attacks and that existing defenses may offer a false sense of security.
Contribution
It provides a comprehensive analysis of 8 regularization methods against recent MIAs, introduces a new white-box metric for privacy risk, and highlights limitations of current defenses.
Findings
Certain regularization methods like label smoothing may aid MIAs.
Some mechanisms do not significantly reduce information leakage.
Existing defenses may give a false sense of privacy.
Abstract
Deep learning models often raise privacy concerns as they leak information about their training data. This enables an adversary to determine whether a data point was in a model's training set by conducting a membership inference attack (MIA). Prior work has conjectured that regularization techniques, which combat overfitting, may also mitigate the leakage. While many regularization mechanisms exist, their effectiveness against MIAs has not been studied systematically, and the resulting privacy properties are not well understood. We explore the lower bound for information leakage that practical attacks can achieve. First, we evaluate the effectiveness of 8 mechanisms in mitigating two recent MIAs, on three standard image classification tasks. We find that certain mechanisms, such as label smoothing, may inadvertently help MIAs. Second, we investigate the potential of improving the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Advanced Neural Network Applications
