Anonymizing Machine Learning Models

Abigail Goldsteen; Gilad Ezov; Ron Shmelkin; Micha Moffie; Ariel; Farkash

arXiv:2007.13086·cs.CR·February 2, 2022

Anonymizing Machine Learning Models

Abigail Goldsteen, Gilad Ezov, Ron Shmelkin, Micha Moffie, Ariel, Farkash

PDF

1 Repo

TL;DR

This paper introduces a novel accuracy-guided anonymization method for machine learning models that enhances privacy protection while maintaining high model utility, outperforming existing k-anonymity techniques and rivaling differential privacy in preventing inference attacks.

Contribution

It proposes a new anonymization approach that leverages model knowledge to preserve accuracy, offering a practical alternative to differential privacy with improved utility and attack resistance.

Findings

01

Outperforms state-of-the-art k-anonymity in utility, especially with high k and many quasi-identifiers.

02

Achieves comparable or better resistance to membership inference attacks than differential privacy.

03

Provides a practical, less complex method for privacy-preserving machine learning models.

Abstract

There is a known tension between the need to analyze personal data to drive business and privacy concerns. Many data protection regulations, including the EU General Data Protection Regulation (GDPR) and the California Consumer Protection Act (CCPA), set out strict restrictions and obligations on the collection and processing of personal data. Moreover, machine learning models themselves can be used to derive personal information, as demonstrated by recent membership and attribute inference attacks. Anonymized data, however, is exempt from the obligations set out in these regulations. It is therefore desirable to be able to create models that are anonymized, thus also exempting them from those obligations, in addition to providing better protection against attacks. Learning on anonymized data typically results in significant degradation in accuracy. In this work, we propose a method…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

IBM/ai-privacy-toolkit
tfOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.