Preserving Differential Privacy Between Features in Distributed Estimation
Christina Heinze-Deml, Brian McWilliams, Nicolai Meinshausen

TL;DR
This paper introduces PriDE, a scalable framework for distributed estimation that preserves differential privacy across features held by different data owners, enabling privacy-aware machine learning without a trusted curator.
Contribution
The paper proposes PriDE, a novel method for distributed estimation that maintains differential privacy in vertically-partitioned data settings, with theoretical guarantees and empirical validation.
Findings
PriDE preserves $(\epsilon,\delta)$-distributed differential privacy.
PriDE achieves bounded estimation error compared to non-private estimates.
Empirical results confirm PriDE's effectiveness on real and synthetic datasets.
Abstract
Privacy is crucial in many applications of machine learning. Legal, ethical and societal issues restrict the sharing of sensitive data making it difficult to learn from datasets that are partitioned between many parties. One important instance of such a distributed setting arises when information about each record in the dataset is held by different data owners (the design matrix is "vertically-partitioned"). In this setting few approaches exist for private data sharing for the purposes of statistical estimation and the classical setup of differential privacy with a "trusted curator" preparing the data does not apply. We work with the notion of -distributed differential privacy which extends single-party differential privacy to the distributed, vertically-partitioned case. We propose PriDE, a scalable framework for distributed estimation where each party…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
See pages 1-last of pride.pdf
