A Constructive Proof of a Concentration Bound for Real-Valued Random Variables
Wolfgang Mulzer, Natalia Shenkman

TL;DR
This paper extends a constructive proof of a concentration bound originally for Boolean variables to real-valued variables, improving the bound and providing an efficient algorithm for identifying dependent variables.
Contribution
It generalizes the Impagliazzo-Kabanets algorithm to real-valued variables and enhances the concentration bound by a constant factor.
Findings
Extended the constructive proof to real-valued variables.
Improved the concentration bound by a constant factor.
Provided an efficient randomized algorithm for dependency detection.
Abstract
Almost 10 years ago, Impagliazzo and Kabanets (2010) gave a new combinatorial proof of Chernoff's bound for sums of bounded independent random variables. Unlike previous methods, their proof is constructive. This means that it provides an efficient randomized algorithm for the following task: given a set of Boolean random variables whose sum is not concentrated around its expectation, find a subset of statistically dependent variables. However, the algorithm of Impagliazzo and Kabanets (2010) is given only for the Boolean case. On the other hand, the general proof technique works also for real-valued random variables, even though for this case, Impagliazzo and Kabanets (2010) obtain a concentration bound that is slightly suboptimal. Herein, we revisit both these issues and show that it is relatively easy to extend the Impagliazzo-Kabanets algorithm to real-valued random variables and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Constructive Proof of a Concentration
Bound for Real-Valued Random Variables††thanks: This research was supported in part by ERC StG 757609. It is based on the Master’s thesis of the second author that was defended on 10. January 2019 at Freie Universität Berlin.
Wolfgang Mulzerlabel=e1][email protected],[email protected] label=e2 [[
url]http://www.mi.fu-berlin.de/inf/groups/ag-ti
Natalia Shenkman**]a
[
Wolfgang Mulzer and Natalia Shenkman
Department of Computer Science, Freie Universität Berlin
Takustraße 9, 14195 Berlin, Germany
Abstract
Almost 10 years ago, Impagliazzo and Kabanets (2010) gave a new combinatorial proof of Chernoff’s bound for sums of bounded independent random variables. Unlike previous methods, their proof is constructive. This means that it provides an efficient randomized algorithm for the following task: given a set of Boolean random variables whose sum is not concentrated around its expectation, find a subset of statistically dependent variables. However, the algorithm of Impagliazzo and Kabanets (2010) is given only for the Boolean case. On the other hand, the general proof technique works also for real-valued random variables, even though for this case, Impagliazzo and Kabanets (2010) obtain a concentration bound that is slightly suboptimal.
Herein, we revisit both these issues and show that it is relatively easy to extend the Impagliazzo-Kabanets algorithm to real-valued random variables and to improve the corresponding concentration bound by a constant factor.
generalized Chernoff-Hoeffding bound,
concentration bound,
randomized algorithm.,
keywords:
and
[**]aCorresponding author
]Freie Universität Berlin
1 Introduction
The weak law of large numbers is a central pillar of modern probability theory: any sample average of independent random variables converges in probability to its expected value. This qualitative statement is made more precise by concentration bounds, which quantify the speed of convergence for certain prominent special cases. Due to their wide applicability in mathematics, statistics, and computer science, a whole industry of concentration bounds has developed over the last decades. By now, the literature contains myriads of different bounds, satisfying various needs and proved in numerous ways; see, e.g., Chernoff (1952), Hoeffding (1963), Schmidt et al. (1995), Panconesi and Srinivasan (1997), or the interesting and extensive textbooks and surveys by McDiarmid (1998), Chung and Lu (2006), Alon and Spencer (2008), and Mulzer (2018).
In theoretical computer science, a central application area of concentration bounds lies in the design and analysis of randomized algorithms. About 10 years ago, Impagliazzo and Kabanets (2010) went in the other direction, showing that methods from theoretical computer science can be useful in obtaining new proofs for concentration bounds. This led to a new—algorithmic—proof of a generalized Chernoff bound for Boolean random variables, which the authors “consider more revealing and intuitive than the standard Bernstein-style proofs, and hope that its constructiveness will have other applications in computer science”.
Impagliazzo and Kabanets were able to extend their combinatorial approach to real-valued bounded random variables. However, in order to do this, they had to use a slightly different argument. This came at the cost of a sub-optimal multiplicative constant in the bound. Furthermore, with the new argument, it was not clear how to generalize the main randomized algorithm to the real-valued case. Here, we present a constructive proof of Chernoff’s bound that remedies both these issues, giving the same bound and the same algorithmic result as are known for the Boolean case.
2 The Real-Valued Impagliazzo-Kabanets Theorem
The main result of Impagliazzo and Kabanets for real-valued bounded random variables is stated as Theorem 2.1 below. Essentially, this theorem can be seen as a very simple adaptation of the famous Chernoff-Hoeffding bound (Shenkman, 2018, Computation (4.6)). In their paper, Impagliazzo and Kabanets (2010) proved the bound with a sub-optimal multiplicative constant. Very recently, Pelekis and Ramon (2017) claimed the bound with an optimal constant, but, unfortunately, their argument is flawed (Shenkman, 2018, Remark 5.10). We present a new proof of Theorem 2.1 that leads to an optimal multiplicative constant and a randomized algorithm for real-valued bounded random variables. Remarkably, our result is obtained by following the original approach of Impagliazzo and Kabanets for the Boolean case, and pushing through the calculation to the end thanks to Lemma 2.6.
In what follows, we set , for , and use for the expectation operator. Moreover, for , we denote by the binary relative entropy with the conventions and , for all .
Theorem 2.1**.**
Suppose we are given a sequence of random variables, and real constants such that , , and almost surely, for all , and
[TABLE]
for all . Set , , and . Then, for any , we have
[TABLE]
Remark 2.2**.**
The condition , for , in Theorem 2.1 can be overcome by imposing a stronger condition of dependence than (2.1); see (Shenkman, 2018, Theorem 3.11).
The remainder of this section is dedicated to the proof of Theorem 2.1, which follows the ideas found in Impagliazzo and Kabanets (2010). We first deal with the case where and . We fix a parameter , and we consider the following random process: for , we sample the random variable . Then, we normalize the resulting values to obtain a sequence of probabilities. We use these probabilities to sample conditionally independent Boolean random variables . Finally, we go through the , and for each , we set to with probability and we keep it unchanged with probability . Now, the aim is to bound the expected value \mathbb{E}\big{[}\prod_{i=1}^{n}Y_{i}\big{]} in two different ways. On the one hand, it will turn out that (2.1) implies that \mathbb{E}\big{[}\prod_{i=1}^{n}Y_{i}\big{]} can be upper-bounded by , the expectation of the product of independent Boolean random variables that are set to with probability , where is the normalized average of the . On the other hand, the expectation \mathbb{E}\big{[}\prod_{i=1}^{n}Y_{i}\big{]} can be lower-bounded by \mathbb{P}\big{(}X\geq(c+t)n\big{)} times the conditional expectation given the event , which turns out to be at least , where is the normalized deviation parameter . Combining the two bounds and optimizing for will then lead to (2.2).
We now proceed with the details. For , we define the normalized variables and the normalized constants , as well as and . We define Boolean random variables Y_{i}\sim\text{Bernoulli}\big{(}\widetilde{X}_{i}\big{)}, for , that are conditionally independent given . In other words, are independent on the -algebra generated by the set \big{\{}\widetilde{X}_{i}\mid i\in[n]\big{\}}. Furthermore, let be fixed, and let be a random variable, independent of and of , taking values in with the probability mass function
[TABLE]
for all . If , then (2.2) holds trivially, so from now on we assume that . As mentioned, our goal is to bound the expectation \mathbb{E}\big{[}\prod_{i\in\mathcal{I}}Y_{i}\big{]} in two different ways, and we start with the upper bound. The first lemma shows that the condition (2.1) on the moments of the carries over to the normalized variables .
Lemma 2.3**.**
For any , we have
[TABLE]
Proof.
The lemma follows quickly by plugging in the definitions. More precisely, we have
[TABLE]
as claimed. ∎
The normalized moment condition from Lemma 2.3 now shows that expectation of can be upper-bounded by the expectation of a product of independent Boolean random variables with success probabilities .
Lemma 2.4**.**
For any , we have
[TABLE]
Proof.
The lemma follows by the conditional independence of the . We have
[TABLE]
as desired. ∎
To achieve an upper bound for \mathbb{E}\big{[}\prod_{i\in\mathcal{I}}Y_{i}\big{]}, we must still account for the random subset . Essentially, it says that we can think of the expected product of independent Boolean random variables with success probability .
Lemma 2.5**.**
We have
[TABLE]
Proof.
We proceed as follows:
[TABLE]
as stated. ∎
We turn to the lower bound for \mathbb{E}\big{[}\prod_{i\in\mathcal{I}}Y_{i}\big{]}. For this, we first bound the conditional expectation given the event assuming that \mathbb{P}\big{(}X\geq(c+t)n\big{)}>0.
Lemma 2.6**.**
We have
[TABLE]
Proof.
First, we note that for and , the binomial series expansion gives
[TABLE]
since , and thus . The derivation proceeds as follows:
[TABLE]
as desired. ∎
Now, combining Lemmas 2.5 and 2.6 with the law of total expectation, we obtain that for any ,
[TABLE]
and hence, for any ,
[TABLE]
A straightforward calculation shows that is minimized at and that
[TABLE]
To complete the proof, it remains to consider the cases where or . First, observe that implies , for all , which, in turn, gives almost surely for all . Consequently, we have \mathbb{P}\big{(}X\geq(a+t)n\big{)}=1=e^{-D(0\,\|\,0)n}, if , and \mathbb{P}\big{(}X\geq(a+t)n\big{)}=0=e^{-D(\tilde{t}\,\|\,0)n}, if . Second, if and , we have that
[TABLE]
This concludes the proof of Theorem 2.1.
3 Algorithmic Implications
We provide a generalization of Theorem in Impagliazzo and Kabanets (2010).
Theorem 3.1**.**
There is a randomized algorithm such that the following holds. Let be -valued random variables. Let and be such that
[TABLE]
for some . Then, on inputs , the algorithm , using oracle access to the distribution of , runs in time and outputs a set such that, with probability at least , one has
[TABLE]
Proof.
We follow the argument of Impagliazzo and Kabanets: for , let the random variables be conditionally independent given the . Furthermore, for , let be a random set that is independent of the and of the . Using the law of total expectation and Lemma 2.6, we infer that
[TABLE]
Moreover, by proceeding as in the proof of Lemma 2.5, we can show that \mathbb{E}\big{[}c^{|\mathcal{I}|}\big{]}\leq(\lambda c+1-\lambda)^{n}. Hence, we obtain
[TABLE]
The rest of the proof is completely analogous to (Impagliazzo and Kabanets, 2010, Theorem 4.1). ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alon and Spencer (2008) Alon, N. and Spencer, J.H. (2008). The probabilistic method, 3rd ed. Wiley-Interscience, New York.
- 2Chernoff (1952) Chernoff, H. (1952). A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations , Annals of Mathematical Statistics 23 , pp. 493–507.
- 3Chung and Lu (2006) Chung, F. and Lu, L. (2006). Complex Graphs and Networks, CBMS Regional Conference Series in Mathematics 107 . American Mathematical Society.
- 4Hoeffding (1963) Hoeffding, W. (1963). Probability inequalities for sums of bounded random variables , Journal of the American Statistical Association 58 , pp. 13–30.
- 5Impagliazzo and Kabanets (2010) Impagliazzo, R. and Kabanets, V. (2010). Constructive proofs of concentration bounds. In: Serna, M., Shaltie, R., Jansen, K., Rolim, J. (eds) Approximation, Randomization, and Combinatorial Optimization. Algorithms and Techniques. RANDOM 2010, APPROX 2010. Lecture Notes in Computer Science 6302 , pp. 617–631. Springer, Berlin, Heidelberg.
- 6Mc Diarmid (1998) Mc Diarmid, C. (1998). Concentration. In: Habib, M., Mc Diarmid, C., Ramirez-Alfonsin, J., Reed, B. (eds) Probabilistic Methods for Algorithmic Discrete Mathematics. Algorithms and Combinatorics 16 , pp. 195–248. Springer, Berlin, Heidelberg.
- 7Mulzer (2018) Mulzer, W. (2018). Five proofs of Chernoff’s bound with applications , Bulletin of the EATCS (BEATCS) 124 , (February 2018).
- 8Panconesi and Srinivasan (1997) Panconesi, A. and Srinivasan, A. (1997). Randomized distributed edge coloring via an extension of the Chernoff-Hoeffding bounds , SIAM Journal on Computing 26:2 , pp. 350–368.
