Overconfidence is a Dangerous Thing: Mitigating Membership Inference   Attacks by Enforcing Less Confident Prediction

Zitao Chen; Karthik Pattabiraman

arXiv:2307.01610·cs.CR·July 6, 2023·1 cites

Overconfidence is a Dangerous Thing: Mitigating Membership Inference Attacks by Enforcing Less Confident Prediction

Zitao Chen, Karthik Pattabiraman

PDF

Open Access 1 Repo

TL;DR

This paper introduces HAMP, a novel training framework that reduces overconfidence in ML models to defend against membership inference attacks, achieving high privacy and accuracy without extra data.

Contribution

HAMP is a new method that enforces less confident predictions through high-entropy labels and regularization, improving privacy protection against MIAs while maintaining model accuracy.

Findings

01

HAMP outperforms seven state-of-the-art defenses in privacy-utility trade-offs.

02

HAMP maintains high accuracy across five benchmark datasets.

03

HAMP effectively obscures differences between training and testing predictions.

Abstract

Machine learning (ML) models are vulnerable to membership inference attacks (MIAs), which determine whether a given input is used for training the target model. While there have been many efforts to mitigate MIAs, they often suffer from limited privacy protection, large accuracy drop, and/or requiring additional data that may be difficult to acquire. This work proposes a defense technique, HAMP that can achieve both strong membership privacy and high accuracy, without requiring extra data. To mitigate MIAs in different forms, we observe that they can be unified as they all exploit the ML model's overconfidence in predicting training samples through different proxies. This motivates our design to enforce less confident prediction by the model, hence forcing the model to behave similarly on the training and testing samples. HAMP consists of a novel training framework with high-entropy…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

dependablesystemslab/mia_defense_hamp
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data