Personalized Privacy Auditing and Optimization at Test Time

Cuong Tran; Ferdinando Fioretto

arXiv:2302.00077·cs.LG·February 2, 2023

Personalized Privacy Auditing and Optimization at Test Time

Cuong Tran, Ferdinando Fioretto

PDF

Open Access

TL;DR

This paper investigates personalized privacy-preserving inference, demonstrating that individuals can often provide only a small subset of features without sacrificing model accuracy, thereby enhancing privacy and reducing verification efforts.

Contribution

It introduces a personalized, sequential feature selection algorithm that minimizes data sharing at test time while maintaining prediction accuracy.

Findings

01

Individuals can report as little as 10% of features without accuracy loss.

02

The proposed method reduces privacy risks and verification efforts.

03

Effective across multiple learning tasks.

Abstract

A number of learning models used in consequential domains, such as to assist in legal, banking, hiring, and healthcare decisions, make use of potentially sensitive users' information to carry out inference. Further, the complete set of features is typically required to perform inference. This not only poses severe privacy risks for the individuals using the learning systems, but also requires companies and organizations massive human efforts to verify the correctness of the released information. This paper asks whether it is necessary to require \emph{all} input features for a model to return accurate predictions at test time and shows that, under a personalized setting, each individual may need to release only a small subset of these features without impacting the final decisions. The paper also provides an efficient sequential algorithm that chooses which attributes should be…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPrivacy-Preserving Technologies in Data · Machine Learning and Data Classification · Data Quality and Management

MethodsTest