Class Clown: Data Redaction in Machine Unlearning at Enterprise Scale

Daniel L. Felps; Amelia D. Schwickerath; Joyce D. Williams; Trung N.; Vuong; Alan Briggs; Matthew Hunt; Evan Sakmar; David D. Saranchak; Tyler; Shumaker

arXiv:2012.04699·cs.CR·December 10, 2020

Class Clown: Data Redaction in Machine Unlearning at Enterprise Scale

Daniel L. Felps, Amelia D. Schwickerath, Joyce D. Williams, Trung N., Vuong, Alan Briggs, Matthew Hunt, Evan Sakmar, David D. Saranchak, Tyler, Shumaker

PDF

Open Access

TL;DR

This paper presents a scalable method for data redaction in deep neural networks that complies with privacy laws by minimizing retraining through membership inference attacks and incremental model updates.

Contribution

It introduces a novel lifecycle process for handling data redaction requests in enterprise-scale DNNs, reducing retraining needs while ensuring legal compliance.

Findings

01

Effective data redaction reduces retraining time by 50%.

02

Membership inference attacks accurately identify sensitive data points.

03

Incremental updates maintain model accuracy after redaction.

Abstract

Individuals are gaining more control of their personal data through recent data privacy laws such the General Data Protection Regulation and the California Consumer Privacy Act. One aspect of these laws is the ability to request a business to delete private information, the so called "right to be forgotten" or "right to erasure". These laws have serious financial implications for companies and organizations that train large, highly accurate deep neural networks (DNNs) using these valuable consumer data sets. However, a received redaction request poses complex technical challenges on how to comply with the law while fulfilling core business operations. We introduce a DNN model lifecycle maintenance process that establishes how to handle specific data redaction requests and minimize the need to completely retrain the model. Our process is based upon the membership inference attack as a…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdversarial Robustness in Machine Learning · Privacy-Preserving Technologies in Data · Advanced Neural Network Applications