Privacy Budget Scheduling
Tao Luo, Mingen Pan, Pierre Tholoniat, Asaf Cidon, Roxana Geambasu,, Mathias L\'ecuyer

TL;DR
This paper introduces PrivateKube, a Kubernetes extension that manages privacy budgets as a resource, and proposes DPF, a scheduler that optimizes privacy budget allocation for multiple ML models, improving the number of models trained under privacy constraints.
Contribution
The paper presents a novel privacy resource management system in Kubernetes and a new scheduling algorithm tailored for non-replenishable privacy budgets, enhancing privacy-aware ML training.
Findings
DPF enables training more models under the same privacy guarantee.
DPF performs well with Rényi Differential Privacy, a highly composable DP variant.
PrivateKube effectively manages privacy budgets alongside traditional resources.
Abstract
Machine learning (ML) models trained on personal data have been shown to leak information about users. Differential privacy (DP) enables model training with a guaranteed bound on this leakage. Each new model trained with DP increases the bound on data leakage and can be seen as consuming part of a global privacy budget that should not be exceeded. This budget is a scarce resource that must be carefully managed to maximize the number of successfully trained models. We describe PrivateKube, an extension to the popular Kubernetes datacenter orchestrator that adds privacy as a new type of resource to be managed alongside other traditional compute resources, such as CPU, GPU, and memory. The abstractions we design for the privacy resource mirror those defined by Kubernetes for traditional resources, but there are also major differences. For example, traditional compute resources are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPrivacy-Preserving Technologies in Data · Stochastic Gradient Optimization Techniques · Age of Information Optimization
