Preventing Adversarial Use of Datasets through Fair Core-Set Construction

Benjamin Spector; Ravi Kumar; Andrew Tomkins

arXiv:1910.10871 · cs.LG · October 25, 2019

TL;DR

This paper proposes publishing only a core-set, a strategically chosen subset of a dataset's instances, that preserves strong performance on the desired primary tasks while forcing poor performance on unwanted tasks, thereby improving the dataset's privacy properties.

Contribution

The paper presents core-set construction methods for both linear models and neural networks, intended to improve a dataset's privacy properties by limiting its usefulness for unwanted tasks.

Findings

01. Core-sets allow strong performance on primary tasks.

02. Core-sets force poor performance on unwanted tasks.

03. The methods are effective on various data types.

Abstract

We propose improving the privacy properties of a dataset by publishing only a strategically chosen "core-set" of the data containing a subset of the instances. The core-set allows strong performance on primary tasks, but forces poor performance on unwanted tasks. We give methods for both linear models and neural networks and demonstrate their efficacy on data.
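The objective described in the abstract can be illustrated with a small sketch. It is not the authors' construction: it is a naive random-search baseline over candidate subsets, with least-squares linear classifiers standing in for the linear-model setting. The function names (`fit_linear`, `accuracy`, `score_subset`, `random_search_core_set`), the synthetic two-feature data, and the "primary accuracy minus unwanted accuracy" score are all assumptions made for illustration. Accuracy is measured on the full toy dataset as a stand-in for a held-out evaluation.

```python
import numpy as np


def fit_linear(X, y):
    """Fit a least-squares linear classifier (with bias) for labels in {-1, +1}."""
    A = np.hstack([X, np.ones((X.shape[0], 1))])
    w, *_ = np.linalg.lstsq(A, y, rcond=None)
    return w


def accuracy(w, X, y):
    """Fraction of points whose predicted sign matches the label."""
    A = np.hstack([X, np.ones((X.shape[0], 1))])
    return float(np.mean(np.sign(A @ w) == y))


def score_subset(idx, X, y_primary, y_unwanted):
    """Train on the candidate subset only, then evaluate both tasks on all data.

    Returns (gap, primary accuracy, unwanted accuracy); a larger gap means the
    subset is more useful for the primary task than for the unwanted one.
    """
    w_p = fit_linear(X[idx], y_primary[idx])
    w_u = fit_linear(X[idx], y_unwanted[idx])
    acc_p = accuracy(w_p, X, y_primary)
    acc_u = accuracy(w_u, X, y_unwanted)
    return acc_p - acc_u, acc_p, acc_u


def random_search_core_set(X, y_primary, y_unwanted, size, trials, seed=0):
    """Hypothetical baseline: sample random subsets and keep the best-scoring one."""
    rng = np.random.default_rng(seed)
    best_idx, best_result = None, None
    for _ in range(trials):
        idx = rng.choice(X.shape[0], size=size, replace=False)
        result = score_subset(idx, X, y_primary, y_unwanted)
        if best_result is None or result[0] > best_result[0]:
            best_idx, best_result = idx, result
    return best_idx, best_result


# Synthetic data (an assumption): feature 0 determines the primary label,
# feature 1 determines the unwanted label.
data_rng = np.random.default_rng(1)
X = data_rng.normal(size=(200, 2))
y_primary = np.where(X[:, 0] > 0, 1.0, -1.0)
y_unwanted = np.where(X[:, 1] > 0, 1.0, -1.0)

core_idx, (gap, acc_p, acc_u) = random_search_core_set(
    X, y_primary, y_unwanted, size=10, trials=200
)
print(f"primary accuracy: {acc_p:.2f}, unwanted accuracy: {acc_u:.2f}")
```

Random search only makes the trade-off concrete: it scores a published subset by what a downstream learner could achieve on each task. A practical method would need a far more efficient selection procedure, and a nonlinear learner may still recover the unwanted labels from a subset that defeats a linear one.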

Taxonomy

Topics: Privacy-Preserving Technologies in Data · Adversarial Robustness in Machine Learning · Explainable Artificial Intelligence (XAI)