InfoShape: Task-Based Neural Data Shaping via Mutual Information
Homa Esfahanizadeh, William Wu, Manya Ghobadi, Regina, Barzilay, Muriel Medard

TL;DR
InfoShape introduces a neural encoder that selectively removes sensitive data while preserving task-relevant information, using mutual information estimators and Lagrangian optimization to balance privacy and utility.
Contribution
It presents a novel task-based data shaping method that effectively balances privacy and utility using neural mutual information estimators.
Findings
Effective removal of sensitive info while maintaining task performance.
Downstream classification accuracy correlates with privacy and utility metrics.
Demonstrates practical neural data shaping for privacy-preserving machine learning.
Abstract
The use of mutual information as a tool in private data sharing has remained an open challenge due to the difficulty of its estimation in practice. In this paper, we propose InfoShape, a task-based encoder that aims to remove unnecessary sensitive information from training data while maintaining enough relevant information for a particular ML training task. We achieve this goal by utilizing mutual information estimators that are based on neural networks, in order to measure two performance metrics, privacy and utility. Using these together in a Lagrangian optimization, we train a separate neural network as a lossy encoder. We empirically show that InfoShape is capable of shaping the encoded samples to be informative for a specific downstream task while eliminating unnecessary sensitive information. Moreover, we demonstrate that the classification accuracy of downstream models has a…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Advanced Neural Network Applications
