Striking a Balance: An Optimal Mechanism Design for Heterogenous Differentially Private Data Acquisition for Logistic Regression
Ameya Anjarlekar, Rasoul Etesami, R. Srikant

TL;DR
This paper proposes an optimal mechanism for acquiring privacy-sensitive data from heterogeneous sellers to train logistic regression models, balancing model accuracy, privacy guarantees, and payment costs.
Contribution
It introduces a novel two-step optimization framework and an online algorithm for privacy-aware data acquisition in machine learning.
Findings
Effective logistic regression with heterogeneous differential privacy guarantees.
A two-step optimization framework for balancing privacy, accuracy, and payments.
An online algorithm for sequential seller data acquisition.
Abstract
We address the challenge of solving machine learning tasks using data from privacy-sensitive sellers. Since the data is private, we design a data market that incentivizes sellers to provide their data in exchange for payments. Therefore our objective is to design a mechanism that optimizes a weighted combination of test loss, seller privacy, and payment, striking a balance between building a good privacy-preserving ML model and minimizing payments to the sellers. To achieve this, we first propose an approach to solve logistic regression with known heterogeneous differential privacy guarantees. Building on these results and leveraging standard mechanism design theory, we develop a two-step optimization framework. We further extend this approach to an online algorithm that handles the sequential arrival of sellers.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Advanced Causal Inference Techniques · Statistical Methods and Bayesian Inference
MethodsLogistic Regression
