Differentially Private Database Release via Kernel Mean Embeddings

Matej Balog; Ilya Tolstikhin; Bernhard Sch\"olkopf

arXiv:1710.01641·stat.ML·June 1, 2018·ICML

Differentially Private Database Release via Kernel Mean Embeddings

Matej Balog, Ilya Tolstikhin, Bernhard Sch\"olkopf

PDF

1 Repo

TL;DR

This paper introduces a new method for privately releasing database summaries using kernel mean embeddings, enabling accurate population statistic estimation while protecting individual privacy.

Contribution

It proposes a novel framework that releases kernel mean embeddings for differentially private data sharing, with theoretical guarantees and two practical instantiations.

Findings

01

Guarantees differential privacy of the released embeddings.

02

Ensures the consistency of estimators derived from the embeddings.

03

Provides two implementations suitable for different scenarios.

Abstract

We lay theoretical foundations for new database release mechanisms that allow third-parties to construct consistent estimators of population statistics, while ensuring that the privacy of each individual contributing to the database is protected. The proposed framework rests on two main ideas. First, releasing (an estimate of) the kernel mean embedding of the data generating random variable instead of the database itself still allows third-parties to construct consistent estimators of a wide class of population statistics. Second, the algorithm can satisfy the definition of differential privacy by basing the released kernel mean embedding on entirely synthetic data points, while controlling accuracy through the metric available in a Reproducing Kernel Hilbert Space. We describe two instantiations of the proposed framework, suitable under different scenarios, and prove theoretical…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

matejbalog/RKHS-private-database
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.