A Natural Extension of the BK Inequality
Jacob D. Baron, Jeff Kahn

TL;DR
This paper generalizes the van den Berg-Kesten Inequality to multiple events, providing a new tool for bounding probabilities of disjoint event occurrences in complex probability spaces.
Contribution
It introduces an extension of the BK inequality to handle an arbitrary number of events, enhancing probabilistic bounds for event counts.
Findings
Extended BK inequality to multiple events
Provides bounds for upper tail probabilities
Applicable to complex product spaces
Abstract
We extend the seminal van den Berg-Kesten Inequality on disjoint occurrence of two events to a setting with arbitrarily many events, where the quantity of interest is the maximum number that occur disjointly. This provides a handy tool for bounding upper tail probabilities for event counts in a product probability space.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProbability and Risk Models · Statistical Distribution Estimation and Applications · Random Matrices and Applications
A Natural Extension of the BK Inequality
Jacob D. Baron Department of Mathematics, Rutgers University, Piscataway, NJ. Supported by the U.S. Department of Homeland Security under Grant Award 2012-ST-104-000044. The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either express or implied, of the U.S. Department of Homeland Security.
Jeff Kahn Department of Mathematics, Rutgers University, Piscataway, NJ. Supported by the National Science Foundation under Grant Awards DMS1201337 and DMS1501962.
(Sept 2016)
Abstract
We extend the seminal van den Berg–Kesten Inequality [2] on disjoint occurrence of two events to a setting with arbitrarily many events, where the quantity of interest is the maximum number that occur disjointly. This provides a handy tool for bounding upper tail probabilities for event counts in a product probability space.
1 Introduction
The purpose of this note is to prove a natural stochastic domination result that greatly extends a fundamental inequality on disjoint occurrence of events.
To begin we recall a few definitions. For (real-valued) random variables and , stochastically dominates (written ) if . An event in a partially ordered is increasing if its indicator is a nondecreasing function, and decreasing if its complement is increasing. A probability measure on a partially ordered is positively associated (PA) if whenever both and are increasing (or, equivalently, whenever both are decreasing), and note that any probability measure on a linearly ordered is PA. We write for .
Our setting is a finite product probability space with each partially ordered. Events () are said to occur disjointly at if there are disjoint such that for each and , we have whenever agrees with on . We write
[TABLE]
The study of disjoint occurrence was initiated by van den Berg and Kesten [2], who showed what is now called the “BK Inequality”:
[TABLE]
for increasing (see also e.g. [3, Section 2.3]). The following (substantial) extension of this seminal result is apparently new [1].
Theorem 1**.**
Let be a finite product probability space with the ’s partially ordered and the ’s PA. Given , let
[TABLE]
Let be independent Bernoullis with , , and . If the ’s are all increasing, or all decreasing, then
[TABLE]
Remarks.
- (i)
Taking , and in the definition of “” recovers (1) from (2).
- (ii)
The most spectacular of the developments growing out of [2] is Reimer’s proof [7] of the “BK Conjecture” (of [2]) which says that (1) doesn’t require that be increasing. In contrast, trivial examples show this requirement (or some requirement) to be necessary in (2); for instance if with uniform measure, , and , then
- (iii)
As a consequence of (2), the Chernoff Bound (e.g. [6, Theorem 2.1]) applied to yields, for ,
[TABLE]
(where for , and ). This looks similar to a lemma of Janson, proved (in slightly restricted form) in [5, Lemma 2] or [6, Lemma 2.46]:
Lemma 2**.**
For events in a probability space, and , letting
[TABLE]
[TABLE]
But there are two big differences between (3) and (4). On one hand, (4) clearly applies more broadly. On the other hand, (3) implies (4) when it applies, since independent increasing (or decreasing) events, if they occur, necessarily occur disjointly (a standard observation easily extracted from the usual proof of Harris’s Inequality [4]). In fact when (3) applies it can be much stronger than (4), because dependent events can easily occur disjointly—so can be much larger than , even though the bounds given for their upper tails are the same. For example, if are distinct vertices of the Erdős–Rényi random graph and, for , , then but can be large.
- (iv)
It is not true that in the generality of Lemma 2, as the example in Remark (ii) also shows.
For (3), we can trade the requirement that the ’s be all increasing (or all decreasing) for the requirement that the ’s be all linearly ordered:
Theorem 3**.**
In the setting of Theorem 1, with arbitrary ’s, (3) holds if each is linearly ordered.
Unlike (3), this is neither stronger nor weaker than Lemma 2 even when it appiles, because arbitrary independent events need not occur disjointly. For example, if with uniform measure and, for , is the event that , then but can be large.
Historical Note. We learned of Lemma 2 only after proving Theorem 1; in fact our motivation for the theorem was to obtain something like the lemma, as in Remark (iii). Upon learning of the lemma, we realized its proof could be tweaked to give Theorem 3.
2 Proofs
The proof of Theorem 1, which is similar to the original proof of (1) in [2], is not hard but is a little awkward to write, and a few additional definitions will be helpful. We prove it for increasing ’s; the decreasing case is of course analogous.
For and , we take and, for , . For and for some , is said to witness if whenever and . (This is of course abusive since we can’t have unless .) We then (that is, for ) say () occur disjointly at if there are disjoint such that witnesses and, for , set
[TABLE]
Thus the of Theorem 1 is evaluated at a random .
Proof of Theorem 1.
Say affects if there are and with , and for a collection of events in , let be the number of that affect at least two members of .
We proceed by induction on . If this number is zero then the laws of and agree (since the ’s are independent). So we may assume , say (without loss of generality) the index 1 affects at least two of the ’s.
Let , , be copies of , independent of each other and of . Let and (for )
[TABLE]
Thus, apart from irrelevant variables, is a copy of gotten by replacing by . In particular and, with , we have (since affects iff it affects , and affects iff and 1 affects ). So by the inductive hypothesis it is enough to show
[TABLE]
for each positive integer . Here it’s convenient to work with the stronger conditional version:
Claim. For each (with ),
[TABLE]
Proof of Claim. Since, for any and with ,
[TABLE]
we need only show (6) for with (since the left hand side of (6) is zero if and both sides are 1 if ).
Given such a , set and, for , let consist of those ’s for which there are containing and disjoint ’s in () such that witnesses (for ) and . Then, evidently,
each is increasing,
,
for with , iff , and
for with , iff for some ,
whence
[TABLE]
where the inequality follows from that assumption that is PA. ∎
For the proof of Theorem 3 we need just one little observation, which follows immediately from Reimer’s Theorem [7] by induction: for events in a product probability space with each factor linearly ordered,
[TABLE]
Proof of Theorem 3.
For some to-be-determined integer and each of size , let be the indicator of . Let , so that
[TABLE]
(by (7)).
The rest of the proof follows [6, Lemma 2.46] verbatim, so we will be brief. If then , so by Markov,
[TABLE]
Setting (to minimize the right hand side) yields
[TABLE]
which, with calculus, gives the stronger bound in (3). ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. van den Berg. Personal communication, Oct 2015.
- 2[2] J. van den Berg and H. Kesten. Inequalities with applications to percolation and reliability. J. Appl. Probab. , 22(3):556–569, Sept 1985.
- 3[3] Geoffrey R. Grimmett. Percolation , volume 321 of Grundlehren der mathematischen Wissenschaften . Springer-Verlag Berlin Heidelberg, Berlin, 2nd edition, 1999.
- 4[4] T. E. Harris. A lower bound on the critical probability in a certain percolation process. Math. Proc. Cambridge Phil. Soc. , 56(1):13–20, Jan 1960.
- 5[5] Svante Janson. Poisson approximation for large deviations. Random Structures Algorithms , 1(2):221–229, June 1990.
- 6[6] Svante Janson, Tomasz Łuczak, and Andrzej Ruciński. Random Graphs . Wiley-Interscience Series in Discrete Mathematics and Optimization. Wiley, New York, 2000.
- 7[7] David Reimer. Proof of the Van den Berg–Kesten conjecture. Combin. Probab. Comput. , 9(1):27–32, Jan 2000.
