The Challenge of Differentially Private Screening Rules
Amol Khanna, Fred Lu, Edward Raff

TL;DR
This paper introduces the first differentially private screening rule for linear and logistic regression, highlighting the challenges and limitations due to noise addition, and emphasizes the open problem of developing effective private screening methods.
Contribution
It develops the first differentially private screening rule for linear and logistic regression models, revealing fundamental difficulties in private screening due to noise.
Findings
Private screening rules face significant noise-related challenges.
The difficulty arises from the screening step, not the optimizer.
Developing effective private screening methods remains an open problem.
Abstract
Linear -regularized models have remained one of the simplest and most effective tools in data analysis, especially in information retrieval problems where n-grams over text with TF-IDF or Okapi feature values are a strong and easy baseline. Over the past decade, screening rules have risen in popularity as a way to reduce the runtime for producing the sparse regression weights of models. However, despite the increasing need of privacy-preserving models in information retrieval, to the best of our knoweledge, no differentially private screening rule exists. In this paper, we develop the first differentially private screening rule for linear and logistic regression. In doing so, we discover difficulties in the task of making a useful private screening rule due to the amount of noise added to ensure privacy. We provide theoretical arguments and experimental evidence that this…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Cryptography and Data Security · Mobile Crowdsensing and Crowdsourcing
