Near-Optimal Algorithms for Private Online Optimization in the Realizable Regime
Hilal Asi, Vitaly Feldman, Tomer Koren, Kunal Talwar

TL;DR
This paper introduces new differentially private algorithms for online learning in the realizable setting, achieving near-optimal regret bounds that significantly improve over previous non-realizable results.
Contribution
It presents the first near-optimal DP algorithms for online prediction and convex optimization in the realizable regime, with improved regret bounds.
Findings
Expert prediction regret: O(ε^{-1} log^{1.5} d)
Adaptive small-loss regret: O(L* log d + ε^{-1} log^{1.5} d)
Convex optimization regret: O(ε^{-1} d^{1.5}) and O(ε^{-2/3} (dT)^{1/3})
Abstract
We consider online learning problems in the realizable setting, where there is a zero-loss solution, and propose new Differentially Private (DP) algorithms that obtain near-optimal regret bounds. For the problem of online prediction from experts, we design new algorithms that obtain near-optimal regret where is the number of experts. This significantly improves over the best existing regret bounds for the DP non-realizable setting which are . We also develop an adaptive algorithm for the small-loss setting with regret where is the total loss of the best expert. Additionally, we consider DP online convex optimization in the realizable setting and propose an algorithm with near-optimal regret $O…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Bandit Algorithms Research · Stochastic Gradient Optimization Techniques · Sparse and Compressive Sensing Techniques
\addparagraphcolumntypes
C
Near-Optimal Algorithms for Private Online Optimization
in the Realizable Regime
Hilal Asi Apple; [email protected]
Vitaly Feldman Apple; [email protected].
Tomer Koren Blavatnik School of Computer Science, Tel Aviv University; [email protected].
Kunal Talwar Apple; [email protected].
Abstract
We consider online learning problems in the realizable setting, where there is a zero-loss solution, and propose new Differentially Private (DP) algorithms that obtain near-optimal regret bounds. For the problem of online prediction from experts, we design new algorithms that obtain near-optimal regret \widetilde{O}\big{(}\varepsilon^{-1}\log^{1.5}{d}\big{)} where is the number of experts. This significantly improves over the best existing regret bounds for the DP non-realizable setting which are \widetilde{O}\big{(}\varepsilon^{-1}\min\big{\{}d,T^{1/3}\log d\big{\}}\big{)}. We also develop an adaptive algorithm for the small-loss setting with regret where is the total loss of the best expert. Additionally, we consider DP online convex optimization in the realizable setting and propose an algorithm with near-optimal regret \widetilde{O}\big{(}\varepsilon^{-1}d^{1.5}\big{)}, as well as an algorithm for the smooth case with regret \widetilde{O}\big{(}\varepsilon^{-2/3}(dT)^{1/3}\big{)}, both significantly improving over existing bounds in the non-realizable regime.
1 Introduction
We study the problem of private online optimization in the realizable setting where there is a zero-loss solution. In this problem, an online algorithm interacts with an adversary over rounds. The adversary picks a (non-negative) loss function at round and simultaneously the algorithm picks a response , suffering loss . The algorithm aims to minimize the regret, which is the loss compared to the best solution in hindsight, while at the same time keeping the sequence of predictions differentially private with respect to individual loss functions.
In this paper, we focus on two well-studied instances of this problem. In differentially private online prediction from experts (DP-OPE), we have experts and the adversary chooses a loss function . Our second setting is differentially private online convex optimization (DP-OCO) where is a convex set with bounded diameter, and the adversary chooses convex and -Lipschitz loss functions .
Several papers have recently studied DP-OPE and DP-OCO in the general non-realizable setting [JKT12, ST13, JT14, AS17, KMSTTX21]. These papers have resulted in different algorithms with sub-linear regret for both problems. For DP-OPE, [AS17, JT14] developed private versions of follow-the-regularized-leader (FTRL) obtaining (normalized) regret \min\big{\{}{d}/{T\varepsilon},{\sqrt{T\log d}}/{T\varepsilon}\big{\}}. More recently, [AFKT22] developed low-switching algorithms for DP-OPE with oblivious adversaries, obtaining normalized regret roughly . Additionally, for the problem of DP-OCO, [KMSTTX21] have recently proposed a DP-FTRL algorithm based on the binary tree mechanism which obtains (normalized) regret \big{(}{\sqrt{d}}/{T\varepsilon}\big{)}^{1/2}.
Despite this progress, the regret bounds of existing algorithms are still polynomially worse than existing lower bounds. Currently, the only existing lower bounds for oblivious adversaries are the trivial bounds from the non-online versions of the same problems: for DP-OPE, lower bounds for private selection [SU17] imply a (normalized) regret lower bound of , while existing lower bounds for DP-SCO [FKT20] give a (normalized) regret lower bound of for DP-OCO.
Practical optimization problems arising from over-parameterized models often lead to instances that additionally satisfy realizability, i.e. that the optimal loss is zero or close to zero. This motivates the study of designing algorithms that can do better under this assumption. Realizability has been studied since the early days of learning theory and ubiquitous in the non-private online optimization literature [SST10, SS12, Haz16]. It has proven useful for improving regret bounds in non-private OPE and OCO [SS12, SST10] and in the closely related problem of differentially private stochastic convex optimization (DP-SCO) [ACCD22]. In this work we study DP-OPE and DP-OCO in the realizable setting and develop new algorithms that obtain near-optimal regret bounds in several settings.
1.1 Contributions
We propose new algorithms and lower bounds for the problems of differentially private online prediction from experts (DP-OPE) and differentially private online convex optimization (DP-OCO) in the realizable setting. The following are our primary contributions:
- •
Near-optimal algorithms for DP-OPE. We design new algorithms that obtain near-optimal regret for DP-OPE with experts when there is a zero-loss expert. The best existing algorithms for non-realizable DP-OPE obtain significantly worse regret bounds \min\big{\{}{d}/{\varepsilon},T^{1/3}\log d/{\varepsilon}\big{\}} [AS17, AFKT22], which have a polynomial dependence on either or the number of experts . Our algorithms build on sequential applications of the exponential mechanism to pick a good expert, and the sparse-vector-technique to identify when the current expert is no longer a good expert (with near-zero loss). Crucially, an oblivious adversary cannot identify which expert the algorithm has picked, resulting in a small number of switches. We deploy a potential-based proof strategy to show that this algorithm have logarithmic number of switches. We also show that a lower bound of holds for any -DP algorithm even in the realizable case.
- •
Adaptive algorithms for DP-OPE with low-loss experts. We also develop an algorithm that adapts to the setting where there is an expert with low loss, that is, . Our algorithms are adaptive to the value of and obtain total regret of .
- •
Near-optimal regret for low-dimensional DP-OCO. Building on our algorithms for DP-OPE, we propose a new algorithm for DP-OCO that obtains regret . This is near-optimal for low-dimensional problems where and improves over the best existing algorithm which obtains a normalized regret [KMSTTX21].
- •
Improved regret for smooth DP-OCO. When the loss function is smooth, we show that DP-FTRL [KMSTTX21] with certain parameters obtains an improved normalized regret of if there is a zero-loss expert.
1.2 Related work
Several works have studied online optimization in the realizable setting, developing algorithms with better regret bounds [SS12, SST10]. For online prediction from experts, the weighted majority algorithm obtains a regret bound of compared to in the non-realizable setting. Moreover, for online convex optimization, [SST10] show that online mirror descent achieves regret compared to in the general case.
On the other hand, the private online optimization literature has mainly studied the general non-realizable case [JKT12, ST13, JT14, AS17, KMSTTX21]. For online prediction from experts, the best existing regret bounds for -DP are [JT14] and [AS17]. [AFKT22] show that these rates can be improved using a private version of the shrinking dartboard algorithm, obtaining regret roughly . For online convex optimization, [KMSTTX21] developed a private follow-the-regularized-leader algorithm using the binary tree mechanism that obtains normalized regret bound \widetilde{O}\big{(}{\sqrt{d}}/{T\varepsilon}\big{)}^{1/2}.
The realizable setting has recently been studied in the different but related problem of differentially private stochastic convex optimization (DP-SCO) [ACCD22]. DP-SCO and DP-OCO are closely related as one can convert an OCO algorithm into an SCO algorithm using standard online-to-batch transformations [Haz16] [ACCD22] study DP-SCO problems in the interpolation regime where there exists a minimizer that minimizes all loss functions, and propose algorithms that improve the regret over the general setting if the functions satisfy certain growth conditions.
2 Preliminaries
In online optimization, we have an interactive -round game between an adversary and an online algorithm. In this paper, we focus on oblivious adversaries that choose in advance a sequence of loss functions where . Then, at round , the adversary releases a loss function and simultaneously the algorithm plays a solution . The algorithm then suffers loss at this round. The regret of the online algorithm is
[TABLE]
For ease of notation, for an oblivious adversary that chooses a loss sequence , we let denote the output of the interaction between the online algorithm and the adversary.
In this work, we are mainly interested in two instances of the above general online optimization problem:
- •
Online prediction from experts (OPE). In this problem, we have a set of experts , and the adversary chooses loss functions .
- •
Online convex optimization (OCO). In OCO, we are optimizing over a convex set with bounded diameter ,111The diameter of a set (in Euclidean geometry) is defined as . and the adversary chooses loss functions that are convex and -Lipschitz.
We are mainly interested in the so-called realizable setting. More precisely, we say than an OPE (or OCO) problem is realizable if there exists a feasible solution such that . We also extend some of our results to the near-realizable setting where .
The main goal of this paper is to study both of these problems under the restriction of differential privacy.
Definition 2.1** (Differential Privacy).**
A randomized algorithm is -differentially private against oblivious adversaries (-DP) if, for all sequences and that differ in a single element, and for all events in the output space of , we have
[TABLE]
We note that our algorithms satisfy a stronger privacy guarantee against adaptive adversaries (see for example the privacy definition in [JRSS21]). However, we choose to focus solely on oblivious adversaries for ease of presentation and readability.
2.1 Background on Differential Privacy
In our analysis, we require the following standard privacy composition result.
Lemma 2.1** (Advanced composition [DR14]).**
If are randomized algorithms that each is -DP, then their composition is -DP.
In addition to basic facts about differential privacy such as composition and post-processing, our development uses two key techniques from the privacy literature: the Sparse-vector-technique and the binary tree mechanism, which we now describe.
Sparse vector technique.
We recall the sparse-vector-technique [DR14] which we use for the realizable setting in Section 3. Given an input , the algorithm takes a stream of queries in an online manner. We assume that each is -sensitive, that is, for neighboring datasets that differ in a single element. We have the following guarantee.
Lemma 2.2** ([DR14], Theorem 3.24).**
Let . For a threshold and , there is an -DP algorithm () that halts at time such that for with probability at least ,
- •
For all , ;
- •
* or .*
To facilitate the notation for using in our algorithms, we assume that it has the following components:
: initializes a new instance of with privacy parameter , threshold , and probability parameter . This returns an instance (data structure) that supports the following two functions. 2. 2.
: adds a new query to . 3. 3.
: tests if the last query that was added to was above threshold. In that case, the algorithm stops and does not accept more queries.
The binary tree mechanism.
We also build on the binary tree mechanism [DNPR10, CSS11] which allows to privately estimate the running sum of a sequence of numbers .
Lemma 2.3** ([DNPR10], Theorem 4.1).**
Let . There is an -DP algorithm (BinaryTree) that takes a stream of numbers and outputs such that for all with probability at least ,
[TABLE]
The same approach extends to the case when ’s are vectors in with . In this case, the error vector is distributed at and the mechanism satisfies -DP.
Additional notation.
For a positive integer , we let . Moreover, for a sequence , we use the shorthand .
3 Near-optimal regret for online prediction from experts
In this section, we consider the online prediction from experts problem in the near-realizable regime, where the best expert achieves small loss . Under this setting, we develop a new private algorithm that achieves regret . For the realizable setting where , this algorithm obtains near-optimal regret .
The algorithm builds on the fact that an oblivious adversary cannot know which expert the algorithm picks. Therefore, if the algorithm picks a random good expert with loss smaller than , the adversary has to increase the loss for many experts before identifying the expert chosen by the algorithm. The algorithm will therefore proceed as follows: at each round, privately check using sparse-vector-technique whether the previous expert is still a good expert (has loss nearly ). If not, randomly pick (privately) a new expert from the set of remaining good experts. The full details are in Algorithm 1.
The following theorem summarizes the performance of Algorithm 1.
Theorem 1**.**
Let be chosen by an oblivious adversary such that there is such that . Let , , , and . If then Algorithm 1 is -DP and with probability at least has regret
[TABLE]
Further, if and then Algorithm 1 is -DP and with probability at least has regret
[TABLE]
While Algorithm 1 requires the knowledge of , we also design an adaptive version that does not require in the next section. Note that the algorithm obtains regret roughly for the realizable setting where .
Proof.
First, we prove the privacy guarantees of the algorithm using privacy composition results: there are applications of the exponential mechanism with privacy parameter . Moreover, sparse-vector is applied over each user’s data only once, hence the applications of sparse-vector are -DP. Overall, the algorithm is -DP and -DP (using advanced compositions; see Lemma 2.1). Setting results in -DP and results in -DP.
We proceed to analyze utility. First, note that the guarantees of the sparse-vector algorithm (Lemma 2.2) imply that with probability at least for each time-step , if sparse-vector identifies above threshold query then s_{t}(x)\geq\underaccent{\bar}{\Delta}\coloneqq L-\frac{8B}{\varepsilon}\geq 4/\eta. Otherwise, . In the remainder of the proof, we condition on this event. The idea is to show that the algorithm has logarithmic number of switches, and each switch the algorithm pays roughly regret.
To this end, we define a potential at time :
[TABLE]
where . Note that and for all as there is such that . We split the iterates to rounds where is the largest such that . Let be the number of switches in (number of times the exponential mechanism is used to pick ). The following key lemma shows that cannot be too large.
Lemma 3.1**.**
Fix . Then for any , it holds that
[TABLE]
Proof.
Let be a time-step where a switch happens (exponential mechanism is used to pick ). Note that . We prove that the probability that is switched between and is at most . To this end, note that if is switched before then \sum_{i=t}^{t_{i+1}}\ell_{i}(x)\geq\underaccent{\bar}{\Delta} as sparse-vector identifies , and therefore L_{t_{i+1}}(x)-L_{t}(x)\geq\underaccent{\bar}{\Delta}-L^{\star}\geq 4/\eta. Thus we have that
[TABLE]
where the second inequality follows the fact that for , and the last inequality since . This argument shows that after the first switch inside the range , each additional switch happens with probability at most . The claim follows. ∎
We now proceed with the proof. Let be the total number of switches. Note that and Lemma 3.1 implies is upper bounded by a geometric random variable with success probability . Therefore, using concentration of geometric random variables (Lemma A.2), we get that
[TABLE]
Noting that , this shows that the algorithm does not reach the switching budget with probability . Thus, the guarantees of the sparse-vector algorithm imply that the algorithm pays regret at most for each switch, hence the total regret of the algorithm is at most . The claim follows as . ∎
3.1 Adaptive algorithms for DP experts
While Algorithm 1 achieves near-optimal loss for settings with low-loss experts, it requires the knowledge of the value of . As is not always available in practice, our goal in this section is to develop an adaptive version of Algorithm 1 which obtains similar regret without requiring the knowledge of . Similarly to other online learning problems, we propose to use the doubling trick [KV05] to design our adaptive algorithms. We begin with an estimate of . Then we apply Algorithm 1 using until the exponential mechanism picks an expert that contradicts the current estimate of , that is, . We use the Laplace mechanism to check this privately. Noting that this happens with small probability if , we conclude that our estimate of was too small and set a new estimate and repeat the same steps. As , this process will stop in at most phases, hence we can divide the privacy budget equally among phases while losing at most a factor of . We present the full details in Algorithm 2.
We have the following guarantees for the adaptive algorithm.
Theorem 2**.**
Let be chosen by an oblivious adversary such that there is such that . Let . Then Algorithm 2 is -DP and with probability at least has regret
[TABLE]
Proof.
First we prove privacy. Note that can change at most times as . Therefore, we have at most applications of Algorithm 1. Each one of these is -DP. Moreover, since we have at most applications of the exponential mechanism in Algorithm 1, we have at most applications of the Laplace mechanism in Algorithm 2. Each of these is -DP. Overall, privacy composition implies that the final privacy is -DP.
Now we prove utility. Algorithm 2 consists of at most applications of Algorithm 1 with different values of . We will show that each of these applications incurrs low regret. Consider an application of Algorithm 1 with . If , then Theorem 1 implies that the regret is at most
[TABLE]
Now consider the case where . We will show that Algorithm 2 will double and that the regret of Algorithm 1 up to that time-step is not too large. Let be the largest such that . Note that up to time , the best expert had loss at most hence the regret up to time is
[TABLE]
Now let denote the next time-step when Algorithm 1 applies the exponential mechanism. Sparse-vector guarantees that in the range the algorithm suffers regret at most . Moreover, the guarantees of the Laplace mechanism imply that at this time-step, with probability , hence Algorithm 2 will double and run a new application of Algorithm 1. Overall, an application of Algorithm 1 with results in regret and doubles . Finally, note that if then with probability the algorithm will not double the value of . As each application of Algorithm 1 has regret
[TABLE]
and is bounded by with high probability, this proves the claim. ∎
3.2 A binary-tree based algorithm
In this section, we present another algorithm which achieves the optimal regret for settings with zero-expert loss. Instead of using sparse-vector, this algorithm builds on the binary tree mechanism. The idea is to repetitively select random good experts and apply the binary tree to calculate a private version of their aggregate losses. Whenever all of the chosen experts are detected to have non-zero loss, we choose a new set of good experts. Similarly to Algorithm 1, we can show that each new phase reduces the number of good experts by a constant factor as an oblivious adversary does not know the choices of the algorithm, hence there are only phases.
We provide a somewhat informal description of the algorithm in Algorithm 3. This algorithm also achieves regret in the realizable case. We do not provide a proof as it is somewhat similar to that of Theorem 1.
4 Faster rates for DP-OCO
In this section we study differentially private online convex optimization (DP-OCO) and propose new algorithms with faster rates in the realizable setting. In Section 4.1, we develop an algorithm that reduces the OCO problem to an experts problem (by discretizing the space) and then uses our procedure for experts. In Section 4.2, we show that follow-the-regularized-leader (FTRL) using the binary tree mechanism results in faster rates in the realizable setting for smooth functions.
4.1 Experts-based algorithm for DP-OCO
The algorithm in this section essentially reduces the problem of DP-OCO to DP-OPE by discretizing the space into sufficiently many experts. In particular, we consider a -net of the space , that is, a set such that for all there is such that . Such a set exists if ([Duc19], Lemma 7.6). Given a loss function , we define the loss of expert to be . Then, we run Algorithm 1 for the given DP-OPE problem. This algorithm has the following guarantees.
Theorem 3**.**
Let and be non-negative, convex and -Lipschitz functions chosen by an oblivious adversary. Then running Algorithm 1 over with is -DP and with probability at least has regret
[TABLE]
Proof.
Let be the experts chosen by the algorithm. First, Theorem 1 implies that this algorithm obtains the following regret with respect to the best expert
[TABLE]
where . Since is -Lipschitz for each , we obtain that
[TABLE]
Overall this gives
[TABLE]
Setting proves the claim. ∎
These results demonstrates that existing algorithms which achieve normalized regret roughly are not optimal for the realizable setting. Moreover, in the low-dimensional regime (constant ), the above bound is nearly-optimal up to logarithmic factors as we have a lower bound of from the stochastic setting of this problem (see discussion in the introduction).
Finally, while the algorithm we presented in Theorem 3 has exponential runtime due to discretizing the space, we note that applying Algorithm 1 over the unit ball results in similar rates and polynomial runtime. Recall that this algorithm only accesses the loss functions to sample from the exponential mechanism, and uses sparse-vector over the running loss. Both of these can be implemented in polynomial time—since the losses are convex—using standard techniques from log-concave sampling.
4.2 Binary-tree based FTRL
In this section, we consider DP-OCO with smooth loss functions and show that DP-FTRL [KMSTTX21, Algorithm 1] with modified parameters obtains improved normalized regret in the realizable setting, compared to in the non-realizable setting.
We present the details in Algorithm 4. Appendix B.1 in [KMSTTX21] has more detailed information about the implementation of the binary tree mechanism in DP-FTRL.
We have the following guarantees for DP-FTRL in the realizable and smooth setting.
Theorem 4**.**
*Let and be non-negative, convex, -Lipschitz, and -smooth functions chosen by an oblivious adversary. DP-FTRL with is -DP and generates that has regret
[TABLE]
For the proof, we use the following property for smooth non-negative functions.
Lemma 4.1** ([Nes04]).**
Let be non-negative and -smooth function. Then .
Proof.
The proof follows similar arguments to the proof of Theorem 5.1 in [KMSTTX21]. Let
[TABLE]
be the iteration of DP-FTRL where is the noise added by the binary tree mechanism. Moreover, let be the non-private solution, that is,
[TABLE]
Lemma C.2 in [KMSTTX21] states that . Therefore, we have
[TABLE]
where the second inequality follows from the Fenchel-Young inequality. We can now upper bound the right term. Indeed, Theorem 5.2 in [Haz16] implies that FTRL has
[TABLE]
Overall we now get
[TABLE]
The binary tree mechanism also guarantees that for all ,
[TABLE]
(see Appendix B.1 in [KMSTTX21]). Thus, taking expectation and setting the regularization parameter to \lambda=32\beta+\big{(}\frac{\beta}{\varepsilon^{2}}(L/D)^{2}Td\log(T)\log(1/\delta)\big{)}^{1/3}, we have
[TABLE]
∎
5 Lower bounds
In this section, we prove lower bounds for private experts in the realizable setting which show that our upper bounds are nearly-optimal up to logarithmic factors. The lower bound demonstrates that a logarithmic dependence on is necessary even in the realizable setting. Note that for DP-OCO in the realizable setting, a lower bound of for pure DP follows from known lower bounds for DP-SCO in the interpolation regime [ACCD22] using online-to-batch conversions [Haz16].
The following theorem states our lower bound for DP-OPE.
Theorem 5**.**
Let and . If is -DP then there is an oblivious adversary such that and
[TABLE]
Proof.
Let for all and for let be the function that has for and otherwise . The oblivious adversary picks one of the following sequences uniformly at random: where and . Assume towards a contradiction that the algorithm obtains regret . This implies that there exists sequences such that the algorithm obtains expected regret where the expectation is only over the randomness of the algorithm. Assume without loss of generality these sequences are . Let be the set of outputs that has low regret on , that is,
[TABLE]
Note that since if then at least of the last outputs must be equal to . Now Markov inequality implies that
[TABLE]
Moreover, group privacy gives
[TABLE]
where the last inequality follows since . Overall we get that
[TABLE]
which is a contradiction for . ∎
6 Conclusion
In this work, we studied differentially private online learning problems in the realizable setting, and developed algorithms with improved rates compared to the non-realizable setting. However, several questions remain open in this domain. First, our near-optimal algorithms for DP-OPE obtain regret, whereas the lower bound we have is . Hence, perhaps there are better algorithms with tighter logarithmic factors than our sparse-vector based algorithms. Additionally, for DP-OCO, our algorithms are optimal only for low-dimensional setting, and there remains polynomial gaps in the high-dimensional setting. Finally, optimal rates for both problems (DP-OPE and DP-OCO) are still unknown in the general non-realizable setting.
Acknowledgements
This work has received support from the Israeli Science Foundation (ISF) grant no. 2549/19 and the Len Blavatnik and the Blavatnik Family foundation.
Appendix A Concentration for sums of geometric variables
In this section, we proof a concentration result for the sum of geometric random variables, which allows us to upper bound the number of switches in the sparse-vector based algorithm. We say that is geometric random variable with success probability if for . To this end, we use the following Chernoff bound.
Lemma A.1** ([MU05], Ch. 4.2.1).**
Let for . Then for ,
[TABLE]
The following lemma demonstrates that the sum of geometric random variables concentrates around its mean with high probability.
Lemma A.2**.**
Let be iid geometric random variables with success probability . Let . Then for any
[TABLE]
Proof.
Notice that is distributed according to the negative binomial distribution where we can think of as the number of Bernoulli trials until we get successes. More precisely, let for be Bernoulli random variables with probability . Then the event has the same probability as . Thus we have that
[TABLE]
We can now use Chernoff inequality (Lemma A.1) to get that for
[TABLE]
This proves that
[TABLE]
The claim now follows by noticing that for iid geometric random variable when , thus
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[ACCD 22] Hilal Asi, Karan Chadha, Gary Cheng and John Duchi “Private optimization in the interpolation regime: faster rates and hardness results” In Proceedings of the 39th International Conference on Machine Learning , 2022
- 2[AFKT 22] Hilal Asi, Vitaly Feldman, Tomer Koren and Kunal Talwar “Private Online Prediction from Experts: Separations and Faster Rates” In ar Xiv:2210.13537 [cs.LG] , 2022
- 3[AS 17] Naman Agarwal and Karan Singh “The price of differential privacy for online learning” In Proceedings of the 34th International Conference on Machine Learning , 2017, pp. 32–40
- 4[CSS 11] T-H Hubert Chan, Elaine Shi and Dawn Song “Private and continual release of statistics” In ACM Transactions on Information and System Security (TISSEC) 14.3 , 2011, pp. 1–24
- 5[DNPR 10] Cynthia Dwork, Moni Naor, Toniann Pitassi and Guy N Rothblum “Differential privacy under continual observation” In Proceedings of the Forty-Second Annual ACM Symposium on the Theory of Computing , 2010, pp. 715–724
- 6[DR 14] Cynthia Dwork and Aaron Roth “The Algorithmic Foundations of Differential Privacy” In Foundations and Trends in Theoretical Computer Science 9.3 & 4 Hanover, MA, USA: Now Publishers Inc., 2014, pp. 211–407
- 7[Duc 19] John C. Duchi “Information Theory and Statistics” Accessed May 2019, Lecture Notes for Statistics 311/EE 377, Stanford University, 2019 URL: http://web.stanford.edu/class/stats 311/lecture-notes.pdf
- 8[FKT 20] Vitaly Feldman, Tomer Koren and Kunal Talwar “Private stochastic convex optimization: optimal rates in linear time” In Proceedings of the 52nd Annual ACM on the Theory of Computing , 2020, pp. 439–449
