A Natural Extension of the BK Inequality

Jacob D. Baron; Jeff Kahn

arXiv:1905.02883 · math.CO · May 9, 2019

TL;DR

This paper generalizes the van den Berg-Kesten (BK) Inequality from two events to arbitrarily many, giving a tool for bounding upper tail probabilities of the maximum number of events that occur disjointly in a finite product probability space.

Contribution

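It proves a stochastic domination result extending the BK inequality to an arbitrary number of events: for events that are all increasing or all decreasing, the maximum number occurring disjointly is dominated by a sum of independent Bernoullis with the same means.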

Findings

01 Extends the BK inequality from two events to arbitrarily many

02 Provides bounds for upper tail probabilities of event counts

03 Applies to finite product probability spaces with partially ordered factors

Abstract

We extend the seminal van den Berg-Kesten Inequality on disjoint occurrence of two events to a setting with arbitrarily many events, where the quantity of interest is the maximum number that occur disjointly. This provides a handy tool for bounding upper tail probabilities for event counts in a product probability space.

Equations

$$\square_{i=1}^{k}A_{i}=\{\omega\in\Omega:A_{1},\ldots,A_{k}\text{ occur disjointly at }\omega\}.$$

$$\Pr(A\,\square\,B)\leq\Pr(A)\Pr(B)$$

$$X=\max\{|I|:I\subseteq[k]\text{ and }\square_{i\in I}A_{i}\text{ occurs}\}.$$

$$X\preccurlyeq Y.$$

$$\Pr(X\geq\lambda+t)\leq\exp[-\lambda\varphi(t/\lambda)]\quad\left(\leq\exp\left[-\frac{t^{2}}{2(\lambda+t/3)}\right]\right)$$

$$Z=\max\{|I|:I\subseteq[k],\ \ldots\}$$

$$\Pr(Z\geq\lambda+t)\leq\exp[-\lambda\varphi(t/\lambda)].$$

$$X_{\cal A}(\omega)=\max\{|R|:R\subseteq[k],\ \text{the $A_{j}$'s indexed by $R$ occur disjointly at $\omega$}\}.$$

$$B_{j}=\{\omega\in\Omega^{*}:(\omega_{n+j},\omega_{2},\ldots,\omega_{n})\in A_{j}\}.$$

$$\mu(X_{\cal A}\geq r)\leq\mu^{*}(X_{\cal B}\geq r)$$

$$\mu(X_{\cal A}(\omega)\geq r\mid\omega_{[2,n]}=y)\leq\mu^{*}(X_{\cal B}(\omega)\geq r\mid\omega_{[2,n]}=y).$$

$$X_{\cal B}(y)=X_{\cal A}(y)\leq X_{\cal A}(\omega)\leq X_{\cal A}(y)+1$$

$$\mu(X_{\cal A}(\omega)\geq r\mid\omega_{[2,n]}=y)\leq 1-\prod_{j\in[k]}\mu_{1}(\overline{{\cal F}}_{j})=\mu^{*}(X_{\cal B}(\omega)\geq r\mid\omega_{[2,n]}=y),$$

$$\Pr(\square_{i\in I}A_{i})\leq\prod_{i\in I}\Pr(A_{i}).$$

$$\mathbb{E}\chi=r!\sum_{|I|=r}\Pr(\square_{i\in I}A_{i})\leq r!\sum_{|I|=r}\prod_{i\in I}\Pr(A_{i})\leq\lambda^{r}$$

$$\Pr(X\geq\lambda+t)\leq\Pr(\chi\geq(\lambda+t)_{r})\leq\frac{\lambda^{r}}{(\lambda+t)_{r}}=\prod_{i=0}^{r-1}\frac{\lambda}{\lambda+t-i}.$$

$$\log\Pr(X\geq\lambda+t)\leq\sum_{i=0}^{t-1}\log\frac{\lambda}{\lambda+t-i}\leq\int_{0}^{t}\log\frac{\lambda}{\lambda+t-x}\,dx,$$
Taxonomy

Topics: Probability and Risk Models · Statistical Distribution Estimation and Applications · Random Matrices and Applications

Full text

Jacob D. Baron Department of Mathematics, Rutgers University, Piscataway, NJ. Supported by the U.S. Department of Homeland Security under Grant Award 2012-ST-104-000044. The views and conclusions contained in this document are those of the authors and should not be interpreted as necessarily representing the official policies, either express or implied, of the U.S. Department of Homeland Security.

Jeff Kahn Department of Mathematics, Rutgers University, Piscataway, NJ. Supported by the National Science Foundation under Grant Awards DMS1201337 and DMS1501962.

(Sept 2016)

Abstract

We extend the seminal van den Berg–Kesten Inequality [2] on disjoint occurrence of two events to a setting with arbitrarily many events, where the quantity of interest is the maximum number that occur disjointly. This provides a handy tool for bounding upper tail probabilities for event counts in a product probability space.

1 Introduction

The purpose of this note is to prove a natural stochastic domination result that greatly extends a fundamental inequality on disjoint occurrence of events.

To begin we recall a few definitions. For (real-valued) random variables $X$ and $Y$, $Y$ stochastically dominates $X$ (written $X\preccurlyeq Y$) if $\Pr(Y\geq r)\geq\Pr(X\geq r)\;\forall\,r\in\mathbb{R}$. An event $A$ in a partially ordered $\Gamma$ is increasing if its indicator is a nondecreasing function, and decreasing if its complement is increasing. A probability measure $m$ on a partially ordered $\Gamma$ is positively associated (PA) if $m(A\cap B)\geq m(A)m(B)$ whenever both $A$ and $B\subseteq\Gamma$ are increasing (or, equivalently, whenever both are decreasing). Note that any probability measure on a linearly ordered $\Gamma$ is PA. We write $[n]$ for $\{1,2,\ldots,n\}$.

Our setting is a finite product probability space $(\Omega,\mu)=\prod_{i=1}^{n}(\Omega_{i},\mu_{i})$ with each $\Omega_{i}$ partially ordered. Events $A_{1},A_{2},\ldots,A_{k}$ ( $\subseteq\Omega$ ) are said to occur disjointly at $\omega\in\Omega$ if there are disjoint $S_{1},\ldots,S_{k}\subseteq[n]$ such that for each $i\in[k]$ and $\omega^{\prime}\in\Omega$ , we have $\omega^{\prime}\in A_{i}$ whenever $\omega^{\prime}$ agrees with $\omega$ on $S_{i}$ . We write

$$\square_{i=1}^{k}A_{i}=\{\omega\in\Omega:A_{1},\ldots,A_{k}\text{ occur disjointly at }\omega\}.$$

The study of disjoint occurrence was initiated by van den Berg and Kesten [2], who showed what is now called the “BK Inequality”:

$$\Pr(A\,\square\,B)\leq\Pr(A)\Pr(B)\tag{1}$$

for increasing $A,B\subseteq\{0,1\}^{n}$ (see also e.g. [3, Section 2.3]). The following (substantial) extension of this seminal result is apparently new [1].

Theorem 1.

Let $(\Omega,\mu)=\prod_{i=1}^{n}(\Omega_{i},\mu_{i})$ be a finite product probability space with the $\Omega_{i}$ ’s partially ordered and the $\mu_{i}$ ’s PA. Given $A_{1},A_{2},\ldots,A_{k}\subseteq\Omega$ , let

$$X=\max\{|I|:I\subseteq[k]\text{ and }\square_{i\in I}A_{i}\text{ occurs}\}.$$

Let $Y_{1},\ldots,Y_{k}$ be independent Bernoullis with $\mathbb{E}Y_{i}=\Pr(A_{i})$ , $Y=\sum Y_{i}$ , and $\lambda=\sum{\mathbb{E}}Y_{i}$ . If the $A_{i}$ ’s are all increasing, or all decreasing, then

$$X\preccurlyeq Y.\tag{2}$$

Remarks.

(i) Taking $\Omega=\{0,1\}^{n}$, $k=2$ and $r=2$ in the definition of “$X\preccurlyeq Y$” recovers (1) from (2).

(ii) The most spectacular of the developments growing out of [2] is Reimer’s proof [7] of the “BK Conjecture” (of [2]) which says that (1) doesn’t require that $A,B$ be increasing. In contrast, trivial examples show this requirement (or some requirement) to be necessary in (2); for instance if $\Omega=\{0,1\}$ with uniform measure, $k=2$, $A_{1}=\{0\}$ and $A_{2}=\{1\}$, then $\Pr(X\geq 1)=1>3/4=\Pr(Y\geq 1).$

(iii) As a consequence of (2), the Chernoff Bound (e.g. [6, Theorem 2.1]) applied to $Y$ yields, for $t\geq 0$,

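$$\Pr(X\geq\lambda+t)\leq\exp[-\lambda\varphi(t/\lambda)]\quad\left(\leq\exp\left[-\frac{t^{2}}{2(\lambda+t/3)}\right]\right)\tag{3}$$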

(where $\varphi(x)=(1+x)\log(1+x)-x$ for $x>-1$ , and $\varphi(-1)=1$ ). This looks similar to a lemma of Janson, proved (in slightly restricted form) in [5, Lemma 2] or [6, Lemma 2.46]:

Lemma 2.

For events $A_{1},\ldots,A_{k}$ in a probability space, $\lambda=\sum\Pr(A_{i})$ and $t\geq 0$ , letting

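$$Z=\max\{|I|:I\subseteq[k],\ \ldots\},$$

we have

$$\Pr(Z\geq\lambda+t)\leq\exp[-\lambda\varphi(t/\lambda)].\tag{4}$$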

But there are two big differences between (3) and (4). On one hand, (4) clearly applies more broadly. On the other hand, (3) implies (4) when it applies, since independent increasing (or decreasing) events, if they occur, necessarily occur disjointly (a standard observation easily extracted from the usual proof of Harris’s Inequality [4]). In fact when (3) applies it can be much stronger than (4), because dependent events can easily occur disjointly—so $X$ can be much larger than $Z$ , even though the bounds given for their upper tails are the same. For example, if $x_{1},\ldots,x_{k},y_{1},\ldots,y_{k}$ are distinct vertices of the Erdős–Rényi random graph $G_{n,p}$ and, for $i\in[k]$ , $A_{i}=\{\text{there is an }x_{i}y_{i}\text{-path}\}$ , then $Z\leq 1$ but $X$ can be large.

(iv) It is not true that $Z\preccurlyeq Y$ in the generality of Lemma 2, as the example in Remark (ii) also shows.
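The computations behind Remarks (i) and (ii) can be written out explicitly. The following is a sketch in the notation of Theorem 1; the shorthand $p_{i}=\Pr(A_{i})$ is introduced here for convenience and is not part of the original text.

```latex
% Remark (i): k = 2 and r = 2. Here X >= 2 means that A_1 and A_2 occur
% disjointly, and Y >= 2 means that both independent Bernoullis equal 1.
\begin{align*}
\Pr(A_1 \,\square\, A_2) = \Pr(X \geq 2) \leq \Pr(Y \geq 2)
  = \Pr(Y_1 = 1)\Pr(Y_2 = 1) = p_1 p_2 .
\end{align*}

% Remark (ii): Omega = {0,1} with uniform measure, A_1 = {0}, A_2 = {1}.
% Exactly one of A_1, A_2 occurs at every omega, so X = 1 always,
% while p_1 = p_2 = 1/2.
\begin{align*}
\Pr(X \geq 1) = 1, \qquad
\Pr(Y \geq 1) = 1 - (1 - p_1)(1 - p_2) = 1 - \tfrac{1}{4} = \tfrac{3}{4} .
\end{align*}
```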

For (3), we can trade the requirement that the $A_{i}$ ’s be all increasing (or all decreasing) for the requirement that the $\Omega_{i}$ ’s be all linearly ordered:

Theorem 3.

In the setting of Theorem 1, with arbitrary $A_{i}$ ’s, (3) holds if each $\Omega_{i}$ is linearly ordered.

Unlike (3), this is neither stronger nor weaker than Lemma 2 even when it applies, because arbitrary independent events need not occur disjointly. For example, if $\Omega=\{0,1\}^{n}$ with uniform measure and, for $i\in[n-1]$, $A_{i}$ is the event that $\{\omega_{i},\omega_{n}\}=\{0,1\}$, then $X\leq 1$ but $Z$ can be large.
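This example can be checked by hand. The sketch below is not from the original. It assumes, as the surrounding discussion indicates, that $Z$ counts mutually independent events that all occur.

```latex
% Setting: Omega = {0,1}^n uniform, A_i = { omega : omega_i != omega_n }, i in [n-1].
%
% (a) Any witness S of "omega in A_i" contains both i and n: if i or n were
%     missing from S, flipping that coordinate would give a point outside A_i
%     that still agrees with omega on S. Any two witnesses share the index n,
%     so no two of the A_i occur disjointly:
\[ X \leq 1 . \]
% (b) For any I subset of [n-1], conditioning on omega_n leaves |I| independent
%     uniform bits, each of which must differ from omega_n:
\[ \Pr\Big(\bigcap_{i \in I} A_i\Big) = 2^{-|I|} = \prod_{i \in I} \Pr(A_i),
   \quad\text{so the } A_i \text{ are mutually independent.} \]
% (c) At omega = (1, ..., 1, 0) all n-1 events occur, so (under the assumed
%     reading of Z) at this point
\[ Z = n - 1 . \]
```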

Historical Note. We learned of Lemma 2 only after proving Theorem 1; in fact our motivation for the theorem was to obtain something like the lemma, as in Remark (iii). Upon learning of the lemma, we realized its proof could be tweaked to give Theorem 3.

2 Proofs

The proof of Theorem 1, which is similar to the original proof of (1) in [2], is not hard but is a little awkward to write, and a few additional definitions will be helpful. We prove it for increasing $A_{i}$ ’s; the decreasing case is of course analogous.

For $\Omega=\prod_{i\in I}\Omega_{i}$ and $S\subseteq I$ , we take $\Omega_{S}=\prod_{i\in S}\Omega_{i}$ and, for $\omega\in\Omega$ , $\omega_{S}=(\omega_{i}:i\in S)$ . For $A\subseteq\Omega$ and $\omega\in\Omega_{J}$ for some $J\subseteq I$ , $S\subseteq J$ is said to witness $\omega\in A$ if $\omega^{\prime}\in A$ whenever $\omega^{\prime}\in\Omega$ and $\omega^{\prime}_{S}=\omega_{S}$ . (This is of course abusive since we can’t have $\omega\in A$ unless $J=I$ .) We then (that is, for $\omega\in\Omega_{J}$ ) say $A_{1},\ldots,A_{k}$ ( $\subseteq\Omega$ ) occur disjointly at $\omega$ if there are disjoint $S_{1},\ldots,S_{k}\subseteq J$ such that $S_{j}$ witnesses $\omega\in A_{j}$ $\forall j$ and, for ${\cal A}=\{A_{1},\ldots,A_{k}\}$ , set

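$$X_{\cal A}(\omega)=\max\{|R|:R\subseteq[k],\ \text{the $A_{j}$'s indexed by $R$ occur disjointly at $\omega$}\}.$$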

Thus the $X$ of Theorem 1 is $X_{\cal A}$ evaluated at a random $\omega\in\Omega$ .

Proof of Theorem 1.

Say $i\in[n]$ affects $A\subseteq\Omega$ if there are $\omega\in A$ and $\omega^{\prime}\in\Omega\setminus A$ with $\omega_{[n]\setminus\{i\}}=\omega^{\prime}_{[n]\setminus\{i\}}$ , and for a collection ${\cal B}$ of events in $\Omega$ , let $\psi({\cal B})$ be the number of $i\in[n]$ that affect at least two members of ${\cal B}$ .

We proceed by induction on $\psi({\cal A})$ . If this number is zero then the laws of $X$ and $Y$ agree (since the $A_{j}$ ’s are independent). So we may assume $\psi({\cal A})\neq 0$ , say (without loss of generality) the index 1 affects at least two of the $A_{j}$ ’s.
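The base case can be unpacked as follows. This is a sketch of the reasoning rather than the authors' wording, and the symbol $T_{j}$ (the set of indices affecting $A_{j}$) is hypothetical notation introduced only for this illustration.

```latex
% T_j := { i in [n] : i affects A_j }.  psi(A) = 0 says the T_j are pairwise disjoint.
%
% Changing a single coordinate outside T_j never changes membership in A_j, so,
% changing the finitely many coordinates outside T_j one at a time, membership
% in A_j depends only on omega_{T_j}. Hence T_j witnesses "omega in A_j"
% whenever omega is in A_j, and all events that occur do so disjointly
% (take S_j = T_j):
\[ X = \sum_{j \in [k]} \mathbf{1}_{A_j} . \]
% The indicators depend on disjoint sets of independent coordinates, so they are
% independent Bernoullis with means Pr(A_j), which is the law of Y:
\[ \Pr(X \geq r) = \Pr(Y \geq r) \qquad \text{for all } r \in \mathbb{R} . \]
```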

Let $(\Omega_{n+j},\mu_{n+j})$ , $j\in[k]$ , be copies of $(\Omega_{1},\mu_{1})$ , independent of each other and of $(\Omega_{1},\mu_{1}),\ldots,(\Omega_{n},\mu_{n})$ . Let $(\Omega^{*},\mu^{*})=\prod_{i=2}^{n+k}(\Omega_{i},\mu_{i})$ and (for $j\in[k]$ )

$$B_{j}=\{\omega\in\Omega^{*}:(\omega_{n+j},\omega_{2},\ldots,\omega_{n})\in A_{j}\}.$$

Thus, apart from irrelevant variables, $B_{j}$ is a copy of $A_{j}$ gotten by replacing $(\Omega_{1},\mu_{1})$ by $(\Omega_{n+j},\mu_{n+j})$ . In particular $\Pr(B_{j})=\Pr(A_{j})$ and, with ${\cal B}=\{B_{1},\ldots,B_{k}\}$ , we have $\psi({\cal B})=\psi({\cal A})-1$ (since $i\in[2,n]$ affects $B_{j}$ iff it affects $A_{j}$ , and $n+i$ affects $B_{j}$ iff $j=i$ and 1 affects $A_{i}$ ). So by the inductive hypothesis it is enough to show

$$\mu(X_{\cal A}\geq r)\leq\mu^{*}(X_{\cal B}\geq r)\tag{5}$$

for each positive integer $r$ . Here it’s convenient to work with the stronger conditional version:

Claim. For each $y\in\Omega_{[2,n]}$ (with $\mu_{i}(y_{i})>0$ $\forall\,i\in[2,n]$ ),

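$$\mu(X_{\cal A}(\omega)\geq r\mid\omega_{[2,n]}=y)\leq\mu^{*}(X_{\cal B}(\omega)\geq r\mid\omega_{[2,n]}=y).\tag{6}$$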

Proof of Claim. Since, for any $y\in\Omega_{[2,n]}$ and $\omega\in\Omega$ with $\omega_{[2,n]}=y$ ,

$$X_{\cal B}(y)=X_{\cal A}(y)\leq X_{\cal A}(\omega)\leq X_{\cal A}(y)+1,$$

we need only show (6) for $y$ with $X_{\cal A}(y)=r-1$ (since the left hand side of (6) is zero if $X_{\cal A}(y)\leq r-2$ and both sides are 1 if $X_{\cal A}(y)\geq r$ ).

Given such a $y$ , set ${\cal F}=\{x\in\Omega_{1}:X_{\cal A}(x,y)=r\}$ and, for $i\in[k]$ , let ${\cal F}_{i}\subseteq\Omega_{1}$ consist of those $x$ ’s for which there are $I\in\binom{[k]}{r}$ containing $i$ and disjoint $S_{j}$ ’s in $[n]$ ( $j\in I$ ) such that $S_{j}$ witnesses $(x,y)\in A_{j}$ (for $j\in I$ ) and $1\in S_{i}$ . Then, evidently,

$\circ$ each ${\cal F}_{i}$ is increasing,

$\circ$ ${\cal F}=\cup_{i\in[k]}{\cal F}_{i}$,

$\circ$ for $\omega\in\Omega$ with $\omega_{[2,n]}=y$, $X_{\cal A}=r$ iff $\omega_{1}\in{\cal F}$, and

$\circ$ for $\omega\in\Omega^{*}$ with $\omega_{[2,n]}=y$, $X_{\cal B}\geq r$ iff $\omega_{n+j}\in{\cal F}_{j}$ for some $j\in[k]$,

whence

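$$\begin{aligned}\mu(X_{\cal A}(\omega)\geq r\mid\omega_{[2,n]}=y)&=\mu_{1}({\cal F})=1-\mu_{1}\Big(\bigcap_{j\in[k]}\overline{{\cal F}}_{j}\Big)\\&\leq 1-\prod_{j\in[k]}\mu_{1}(\overline{{\cal F}}_{j})=\mu^{*}(X_{\cal B}(\omega)\geq r\mid\omega_{[2,n]}=y),\end{aligned}$$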

where the inequality follows from the assumption that $\mu_{1}$ is PA. ∎
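The appeal to positive association can be spelled out. The following is a sketch that uses only the definitions above. Each $\overline{{\cal F}}_{j}$ is decreasing because ${\cal F}_{j}$ is increasing, and the two-event PA inequality extends to $k$ events by induction, since an intersection of decreasing events is again decreasing.

```latex
\begin{align*}
% PA applied repeatedly to the decreasing events \overline{F}_j
\mu_1\Big(\bigcap_{j \in [k]} \overline{\mathcal{F}}_j\Big)
  &\geq \mu_1\Big(\bigcap_{j \in [k-1]} \overline{\mathcal{F}}_j\Big)\,
        \mu_1(\overline{\mathcal{F}}_k)
  \geq \cdots \geq \prod_{j \in [k]} \mu_1(\overline{\mathcal{F}}_j), \\
% left side of (6): omega_1 must land in F, the union of the F_j
\mu_1(\mathcal{F})
  &= 1 - \mu_1\Big(\bigcap_{j \in [k]} \overline{\mathcal{F}}_j\Big)
  \leq 1 - \prod_{j \in [k]} \mu_1(\overline{\mathcal{F}}_j), \\
% right side of (6): the omega_{n+j} are independent copies of omega_1
\mu^{*}\big(\omega_{n+j} \in \mathcal{F}_j \text{ for some } j \in [k]\big)
  &= 1 - \prod_{j \in [k]} \mu_{n+j}(\overline{\mathcal{F}}_j)
  = 1 - \prod_{j \in [k]} \mu_1(\overline{\mathcal{F}}_j).
\end{align*}
```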

For the proof of Theorem 3 we need just one little observation, which follows immediately from Reimer’s Theorem [7] by induction: for events $\{A_{i}\}_{i\in I}$ in a product probability space with each factor linearly ordered,

$$\Pr(\square_{i\in I}A_{i})\leq\prod_{i\in I}\Pr(A_{i}).\tag{7}$$

Proof of Theorem 3.

For some to-be-determined integer $r\leq k$ and each $I\subseteq[k]$ of size $r$ , let $B_{I}$ be the indicator of $\square_{i\in I}A_{i}$ . Let $\chi=r!\sum B_{I}$ , so that

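$$\mathbb{E}\chi=r!\sum_{|I|=r}\Pr(\square_{i\in I}A_{i})\leq r!\sum_{|I|=r}\prod_{i\in I}\Pr(A_{i})\leq\lambda^{r}$$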

(by (7)).
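The final inequality in the bound on $\mathbb{E}\chi$, namely $r!\sum_{|I|=r}\prod_{i\in I}\Pr(A_{i})\leq\lambda^{r}$, is a counting step. As a sketch, write $p_{i}=\Pr(A_{i})$ (shorthand assumed here, not in the original): expanding $\lambda^{r}$ gives a sum over all ordered $r$-tuples of indices, and each $r$-set $I$ accounts for exactly $r!$ of the tuples with distinct entries.

```latex
\begin{align*}
\lambda^{r} = \Big(\sum_{i \in [k]} p_i\Big)^{r}
  &= \sum_{(i_1, \ldots, i_r) \in [k]^{r}} p_{i_1} \cdots p_{i_r} \\
  % drop the (nonnegative) terms with a repeated index
  &\geq \sum_{\substack{(i_1, \ldots, i_r) \in [k]^{r} \\ i_1, \ldots, i_r \text{ distinct}}}
        p_{i_1} \cdots p_{i_r}
  = r! \sum_{|I| = r} \prod_{i \in I} p_i .
\end{align*}
```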

The rest of the proof follows [6, Lemma 2.46] verbatim, so we will be brief. If $X\geq\lambda+t$ then $\chi\geq(\lambda+t)_{r}=\prod_{i=0}^{r-1}(\lambda+t-i)$ , so by Markov,

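$$\Pr(X\geq\lambda+t)\leq\Pr(\chi\geq(\lambda+t)_{r})\leq\frac{\lambda^{r}}{(\lambda+t)_{r}}=\prod_{i=0}^{r-1}\frac{\lambda}{\lambda+t-i}.$$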

Setting $r=t$ (to minimize the right hand side) yields

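$$\log\Pr(X\geq\lambda+t)\leq\sum_{i=0}^{t-1}\log\frac{\lambda}{\lambda+t-i}\leq\int_{0}^{t}\log\frac{\lambda}{\lambda+t-x}\,dx,$$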

which, with calculus, gives the stronger bound in (3). ∎
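The calculus behind this last step can be made explicit. The sketch below evaluates $\int_{0}^{t}\log(\lambda/(\lambda+t-x))\,dx$ with the substitution $u=\lambda+t-x$ (the variable $u$ is introduced here) and recovers the exponent $-\lambda\varphi(t/\lambda)$ of (3), where $\varphi(x)=(1+x)\log(1+x)-x$.

```latex
\begin{align*}
\int_{0}^{t} \log\frac{\lambda}{\lambda + t - x}\,dx
  &= t\log\lambda - \int_{\lambda}^{\lambda + t} \log u \,du \\
  &= t\log\lambda - \big[\,u\log u - u\,\big]_{\lambda}^{\lambda + t} \\
  &= t\log\lambda - (\lambda + t)\log(\lambda + t) + \lambda\log\lambda + t \\
  &= t - (\lambda + t)\log\Big(1 + \frac{t}{\lambda}\Big) \\
  &= -\lambda\Big[\Big(1 + \frac{t}{\lambda}\Big)\log\Big(1 + \frac{t}{\lambda}\Big)
       - \frac{t}{\lambda}\Big]
   = -\lambda\,\varphi(t/\lambda).
\end{align*}
```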

Bibliography

[1] J. van den Berg. Personal communication, Oct 2015.
[2] J. van den Berg and H. Kesten. Inequalities with applications to percolation and reliability. J. Appl. Probab., 22(3):556–569, Sept 1985.
[3] Geoffrey R. Grimmett. Percolation, volume 321 of Grundlehren der mathematischen Wissenschaften. Springer-Verlag Berlin Heidelberg, Berlin, 2nd edition, 1999.
[4] T. E. Harris. A lower bound on the critical probability in a certain percolation process. Math. Proc. Cambridge Phil. Soc., 56(1):13–20, Jan 1960.
[5] Svante Janson. Poisson approximation for large deviations. Random Structures Algorithms, 1(2):221–229, June 1990.
[6] Svante Janson, Tomasz Łuczak, and Andrzej Ruciński. Random Graphs. Wiley-Interscience Series in Discrete Mathematics and Optimization. Wiley, New York, 2000.
[7] David Reimer. Proof of the Van den Berg–Kesten conjecture. Combin. Probab. Comput., 9(1):27–32, Jan 2000.
