Decoupling Maximal Inequalities
Aryeh Kontorovich

TL;DR
This paper investigates how maximal inequalities behave under different dependence structures among non-negative random variables, showing that pairwise independence and certain negative dependencies still allow for effective bounds.
Contribution
It demonstrates that pairwise independence and specific negative dependence conditions are sufficient for maximal inequalities to hold similarly to the independent case.
Findings
Pairwise independence suffices for maximal inequalities to behave like the independent case.
Negative dependence conditions can also ensure similar maximal inequality bounds.
Violations of negative dependence can be tolerated if properly quantified.
Abstract
A {\em maximal inequality} seeks to estimate in terms of properties of the . When the latter are independent, the union bound (in its various guises) can yield tight upper bounds. If, however, the are strongly dependent, the estimates provided by the union bound will be rather loose. In this note, we show that for non-negative random variables, pairwise independence suffices for the maximal inequality to behave comparably to its independent version. The condition of pairwise independence may be relaxed to a kind of negative dependence, and even the latter admits violations -- provided these are properly quantified.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Risk and Portfolio Optimization
Decoupling Maximal Inequalities
Aryeh Kontorovich
Abstract
A maximal inequality seeks to estimate in terms of properties of the . When the latter are independent, the union bound (in its various guises) can yield tight upper bounds. If, however, the are strongly dependent, the estimates provided by the union bound will be rather loose. In this note, we show that for non-negative random variables, pairwise independence suffices for the maximal inequality to behave comparably to its independent version. The condition of pairwise independence may be relaxed to a kind of negative dependence, and even the latter admits violations — provided these are properly quantified.
1 Motivation
Maximal inequalities are at the heart of empirical process theory (van Handel, 2014). The case of Gaussian processes is well-understood via the celebrated generic chaining technique (Talagrand, 2016). There, a key role in the lower bounds is played Slepian’s inequality, which allows one to approximate a Gaussian process by an appropriate uncorrelated one. The absence of a generic analog of Slepian’s inequality — say, for the kind of Binomal process considered in Cohen and Kontorovich (2022) — can be a major obstruction in obtaining tight lower bounds. Indeed, as Proposition 3 below shows, for nonnegative , any upper bound on , where is the “the independent version” of , automatically yields an upper bound on . The reverse direction, of course, fails without additional structural assumptions. We discover that pairwise independence suffices for the reverse direction, and that this condition can be relaxed further.
2 The Bernoulli case
Let and be two collections of Bernoulli random variables, where the s are mutually independent (and independent of the s), with . Letting and , we have
[TABLE]
Decoupling from above.
An elegant result of Pinelis (2022) (answering our question) shows that ; his proof provided for completeness:
Proposition 1** (Pinelis).**
For and as above, we have
[TABLE]
Proof.
Put , and , and observe that
[TABLE]
On the other hand,
[TABLE]
whence . ∎
Further, we note that Pinelis’s constant is optimal. Indeed, consider the case where , , and . This makes as .
Despite its elegance, Proposition 1 will likely have limited applications, since in practice, the techniques for upper-bounding rely on the union bound and are insensitive to the dependence structure of — in which case the technique employed in upper-bounding automatically upper-bounds as well.
Decoupling from below.
A more interesting and useful direction would be to obtain an estimate of the form . Clearly, no such dimension-free estimate can hold without further assumptions on the . Indeed, for a small , let and . In this case, . On the other hand, , and so as . Nor can the ratio exceed , since
[TABLE]
Let us recall the notion of pairwise independence. For the Bernoulli case, it means that for each , we have . The main result of this note is that pairwise independence suffices for .
Proposition 2**.**
Let be as above, and assume additionally that the are pairwise independent. Then
[TABLE]
Proof.
By the Paley-Zygmund inequality,111 We thank Ron Peled for the suggestion of applying Paley-Zygmund to .
[TABLE]
Now and, by pairwise independence,
[TABLE]
Hence,
[TABLE]
On the other hand, is readily computed:
[TABLE]
Therefore, to prove the claim, it suffices to show that
[TABLE]
To this end,222 This elegant proof that is due to D. Berend, who also corrected a mistake in an earlier, clunkier proof of ours. we factorize , where , and . Thus, and in particular, it suffices to verify the latter. Now if then obviously and we are done. Otherwise, since trivially holds, we have . In this case, . ∎
We conjecture that the constant in Proposition 2 is not optimal. For a fixed , define the joint pairwise independent distribution on — conjecturally, an extremal one for minimizing — as follows: , , , and . This makes as . If our conjecture is correct, the optimal constant for the lower bound is , or exactly twice Pinelis’s constant.333 We thank Daniel Berend, Alexander Goldenshluger, and Yuval Peres for raising the question of the constants. AG (and also Omer Ben-Porat) pointed out a possible connection to prophet inequalities — and in particular, the Bernoulli selection lemma in Correa et al. (2017)
Relaxing pairwise independence.
An inspection of the proof shows that we do not actually need , but rather only . This condition is called negative (pairwise) covariance (Dubhashi and Ranjan, 1998).
3 Positive real case
In this section, we assume that are nonnegative integrable random variables and the are their independent copies: each is distributed identically to and the are mutually independent.
As a warmup, let us see how Proposition 1 yields :
Proposition 3**.**
Let be nonnegative and integrable with independent copies as above. For , we have
[TABLE]
Proof.
For and , put , and , . Then
[TABLE]
∎
For pairwise independent , we have a reverse inequality:
Proposition 4**.**
Let be nonnegative and integrable with independent copies as above. If additionally the are pairwise independent, then
[TABLE]
Proof.
The proof is entirely analogous to that of Proposition 3, except that Proposition 2 is invoked in the inequality step. ∎
Relaxing pairwise independence.
As before, the full strength of pairwise independence of the is not needed. The condition for all and would suffice; it is weaker than pairwise negative upper orthant dependence (Joag-Dev and Proschan, 1983).444 Thanks to Murat Kocaoglu for this reference.
4 Back to Bernoulli: beyond negative covariance
What if the Bernoulli do not satisfy the negative covariance condition ? Proposition 2 is not directly inapplicable, but not all is lost. For , define by
[TABLE]
and put . Thus, , and, repeating the calculation in Eq. (1),
[TABLE]
Let us put , , , , and . Now, for , we have
[TABLE]
and so we obtain a generalization of Proposition 2:
Proposition 5**.**
Let be as above. Then
[TABLE]
When , Proposition 5 yields useful estimates.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Cohen and Kontorovich [2022] Doron Cohen and Aryeh Kontorovich. Local glivenko-cantelli, 2022. URL https://arxiv.org/abs/2209.04054 .
- 2Correa et al. [2017] José Correa, Patricio Foncea, Ruben Hoeksma, Tim Oosterwijk, and Tjark Vredeveld. Posted price mechanisms for a random stream of customers. In Proceedings of the 2017 ACM Conference on Economics and Computation , EC ’17, page 169–186, New York, NY, USA, 2017. Association for Computing Machinery. ISBN 9781450345279. doi: 10.1145/3033274.3085137 . URL https://doi.org/10.1145/3033274.3085137 . · doi ↗
- 3Dubhashi and Ranjan [1998] Devdatt Dubhashi and Desh Ranjan. Balls and bins: a study in negative dependence. Random Struct. Algorithms , 13(2):99–124, September 1998. ISSN 1042-9832. doi: 10.1002/(SICI)1098-2418(199809)13:2¡99::AID-RSA 1¿3.0.CO;2-M . URL http://dx.doi.org/10.1002/(SICI)1098-2418(199809)13:2<99::AID-RSA 1>3.0.CO;2-M . · doi ↗
- 4Joag-Dev and Proschan [1983] Kumar Joag-Dev and Frank Proschan. Negative Association of Random Variables with Applications. The Annals of Statistics , 11(1):286 – 295, 1983. doi: 10.1214/aos/1176346079 . URL https://doi.org/10.1214/aos/1176346079 . · doi ↗
- 5Pinelis [2022] Iosif Pinelis. Max decoupling inequality. Math Overflow, 2022. URL https://mathoverflow.net/q/422636 .
- 6Talagrand [2016] Michel Talagrand. Upper and lower bounds for stochastic processes . Springer, 2016.
- 7van Handel [2014] Ramon van Handel. Probability in high dimension. Technical report, PRINCETON UNIV NJ, 2014.
