On a contraction property of Bernoulli canonical processes

Witold Bednorz; Rafa{\l} Martynek

arXiv:1812.04399·math.PR·April 3, 2019

TL;DR

This paper advances Bernoulli comparison by relaxing the contraction condition on functions, enabling comparison of Rademacher sums through a new inequality involving Gaussian increments and subset sums.

Contribution

It introduces a generalized comparison inequality for Bernoulli processes that relaxes the contraction assumption to a condition based on Gaussian increments and subset sums.

Findings

1. Improved Bernoulli comparison under relaxed conditions
2. Established a new inequality involving subset sums and Gaussian increments
3. Applicable to independent Rademacher variables and functions with certain properties

Abstract

In this paper we improve Bernoulli comparison. The result works for independent Rademacher random variables $(\varepsilon_{i})_{i\geq 1}$ and states that we can compare $\mathbf{E}\sup_{t\in T}\sum_{i\geq 1}\varphi_{i}(t)\varepsilon_{i}$ with $\mathbf{E}\sup_{t\in T}\sum_{i\geq 1}t_{i}\varepsilon_{i}$, where a function $\varphi=(\varphi_{i})_{i\geq 1}:\ell^{2}\supset T\rightarrow\ell^{2}$ satisfies certain conditions. Originally, it is assumed that each of $\varphi_{i}$ is a contraction. We relax this assumption towards comparison of Gaussian parts of increments, which can be described in the following way. For all $s,t\in T$, $p\geq 0$,

$$\inf_{|I^{c}|\leq Cp}\sum_{i\in I}|\varphi_{i}(t)-\varphi_{i}(s)|^{2}\leq C^{2}\inf_{|I^{c}|\leq p}\sum_{i\in I}|t_{i}-s_{i}|^{2},$$

where $C\geq 1$ is an absolute constant and $I\subset\mathbb{N}$, $I^{c}=\mathbb{N}\backslash I$.

Equations

$$\inf_{|I^{c}|\leqslant Cp}\sum_{i\in I}|\varphi_{i}(t)-\varphi_{i}(s)|^{2}\leqslant C^{2}\inf_{|I^{c}|\leqslant p}\sum_{i\in I}|t_{i}-s_{i}|^{2},$$

$$X_{t}=\sum_{i=1}^{\infty}t_{i}\xi_{i},$$

$$\lim_{n\rightarrow\infty}\Big\|\sum_{i=1}^{n}t_{i}\xi_{i}-X_{t}\Big\|_{2}=0.$$

$$\|X_{t}-X_{s}\|_{2}=\|t-s\|_{2},\quad\mbox{for }s,t\in T.$$

$$S_{X}(T)=\sup_{F\subset T}\mathbf{E}\sup_{t\in F}X_{t},$$

$$S_{X}(T)=\sup_{F\subset T}\mathbf{E}\sup_{t\in F}X_{t}=\mathbf{E}\sup_{t\in T}X_{t}.$$

$$\gamma_{X}(T)=\inf\sup_{t\in T}\sum_{n\geqslant 1}\|X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}\|_{2^{n}},$$

$$X_{t}=X_{\pi_{0}(t)}+\sum_{n\geqslant 1}(X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}).$$

$$\|X_{t}\|_{2^{n+1}}\leqslant C_{1}\|X_{t}\|_{2^{n}}.$$

$$\gamma_{X}(T_{1}+T_{2})\leqslant C_{1}(\gamma_{X}(T_{1})+\gamma_{X}(T_{2})).$$

$$\|X_{\pi_{n+1}(t+s)}-X_{\pi_{n}(t+s)}\|_{2^{n+1}}\leqslant\|X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}\|_{2^{n+1}}+\|X_{\pi_{n}(s)}-X_{\pi_{n-1}(s)}\|_{2^{n+1}}.$$

$$\|X_{\pi_{n+1}(t+s)}-X_{\pi_{n}(t+s)}\|_{2^{n+1}}\leqslant C_{1}(\|X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}\|_{2^{n}}+\|X_{\pi_{n}(s)}-X_{\pi_{n-1}(s)}\|_{2^{n}}).$$

$$S_{X}(T)\leqslant 4\gamma_{X}(T)$$

$$\begin{aligned}
\mathbf{E}\sup_{A\in{\cal A}_{N}}|X_{\pi_{N}(A)}-X_{\pi_{0}(A)}| &\leqslant\mathbf{E}\sup_{A\in{\cal A}_{N}}\sum_{k=1}^{N}|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}| \\
&\leqslant\mathbf{E}\sup_{A\in{\cal A}_{N}}\sum_{k=1}^{N}2\|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}\|_{2^{k}}\Big(1+\mathbf{E}\sup_{A\in{\cal A}_{N}}\Big(\frac{|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}|}{2\|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}\|_{2^{k}}}-1\Big)_{+}\Big) \\
&\leqslant 2\gamma_{X}(T)\Big(1+\sum_{k=1}^{N}\sum_{B\in{\cal A}_{k}}\mathbf{E}\Big(\frac{|X_{\pi_{k}(B)}-X_{\pi_{k-1}(B_{k-1})}|}{2\|X_{\pi_{k}(B)}-X_{\pi_{k-1}(B_{k-1})}\|_{2^{k}}}-1\Big)_{+}\Big).
\end{aligned}$$

$$\mathbf{E}\Big(\frac{|X_{t}|}{2\|X_{t}\|_{2^{k}}}-1\Big)_{+}\leqslant\frac{1}{2^{k}N_{k}}.$$

$$\begin{aligned}
\mathbf{E}((|Y_{t}|/2)-1)_{+} &\leqslant\int_{2}^{\infty}\frac{v^{2^{k}-1}}{2^{2^{k}}}\mathbf{P}(|Y_{t}|\geqslant v)dv\leqslant\frac{1}{2^{k}N_{k}}\int_{0}^{\infty}2^{k}v^{2^{k}-1}\mathbf{P}(|Y_{t}|\geqslant v)dv \\
&=\frac{\mathbf{E}|Y_{t}|^{2^{k}}}{2^{k}N_{k}}=\frac{1}{2^{k}N_{k}}.
\end{aligned}$$

$$\mathbf{E}\sup_{A\in{\cal A}_{N}}|X_{\pi_{N}(A)}-X_{\pi_{0}(A)}|\leqslant 2\gamma_{X}(T)\Big(1+\sum_{k=1}^{N}\frac{N_{k}}{2^{k}N_{k}}\Big)\leqslant 4\gamma_{X}(T).$$

$$\|B_{t}\|_{p}\leqslant\sum_{i=1}^{p}|t^{\ast}_{i}|+\sqrt{p}\Big(\sum_{i>p}|t^{\ast}_{i}|^{2}\Big)^{\frac{1}{2}}\leqslant 4\|B_{t}\|_{p},$$

$$\sum_{i=1}^{p}|t^{\ast}_{i}|=\sup_{|I^{c}|\leqslant p}\sum_{i\in I^{c}}|t_{i}|$$

$$\sqrt{p}\Big(\sum_{i>p}|t^{\ast}_{i}|^{2}\Big)^{\frac{1}{2}}=\sqrt{p}\inf_{|I^{c}|\leqslant p}\Big(\sum_{i\in I}|t_{i}|^{2}\Big)^{\frac{1}{2}}.$$

$$K^{-1}\gamma_{B}(T_{1}+T_{2})\leqslant S_{B}(T)\leqslant K\gamma_{B}(T_{1}+T_{2}),$$

$$S_{B}(T)\geqslant K^{-1}\Big(\sup_{t\in T_{1}}\|t\|_{1}+S_{G}(T_{2})\Big),$$

$$\sqrt{\frac{2}{\pi}}\|B_{t}-B_{s}\|_{p}=\mathbf{E}|g|\,\|B_{t}-B_{s}\|_{p}\leqslant\|G_{t}-G_{s}\|_{p}$$

$$\pi_{m}(t)=\left\{\begin{array}[]{lll}t_{0}&\mbox{if}&t\in T\backslash T^{m}\\ t&\mbox{if}&t\in T^{m}\backslash T^{m-1}.\end{array}\right.$$

$$\sum_{n=1}^{\infty}\|B_{\pi_{n}(t)}-B_{\pi_{n-1}(t)}\|_{2^{n}}=\|B_{t}-B_{t_{0}}\|_{2^{m}}\leqslant\|t-t_{0}\|_{1}.$$

$$\sup_{t\in T}\sum_{n=1}^{\infty}\|B_{\pi_{n}(t)}-B_{\pi_{n-1}(t)}\|_{2^{n}}\leqslant 2\sup_{t\in T_{1}}\|t\|_{1}.$$

$$S_{B}(T)\geqslant K^{-1}(\gamma_{B}(T_{1})+\gamma_{B}(T_{2}))\geqslant(KC_{1})^{-1}\gamma_{B}(T_{1}+T_{2}).$$

$$S_{B}(T)\leqslant S_{B}(T_{1})+S_{B}(T_{2})=S_{B}(T_{1}+T_{2})\leqslant 4\gamma_{B}(T_{1}+T_{2}),$$

Full text

On a contraction property of Bernoulli canonical processes

Witold Bednorz & Rafał Martynek

Subject classification: 60G15, 60G17
Keywords and phrases: VC classes, inequality
Research partially supported by MNiSW Grant N N201 608740.
Institute of Mathematics, University of Warsaw, Banacha 2, 02-097 Warszawa, Poland

Abstract

In this paper we give several results concerning the supremum of canonical processes. The main theorem concerns a contraction property of Bernoulli canonical process which generalizes the one proved by Talagrand (Theorem 2.1 in [16]). The result works for independent Rademacher random variables $(\varepsilon_{i})_{i\geq 1}$ and states that we can compare $\mathbf{E}\sup_{t\in T}\sum_{i\geq 1}\varphi_{i}(t)\varepsilon_{i}$ with $\mathbf{E}\sup_{t\in T}\sum_{i\geq 1}t_{i}\varepsilon_{i}$ , where a function $\varphi=(\varphi_{i})_{i\geq 1}:\ell^{2}\supset T\rightarrow\ell^{2}$ , satisfies certain conditions. Originally, it is assumed that each of $\varphi_{i}$ is a contraction. We relax this assumption towards comparison of Gaussian parts of increments, which can be described in the following way. For all $s,t\in T$ , $p\geqslant 0$

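$$\inf_{|I^{c}|\leqslant Cp}\sum_{i\in I}|\varphi_{i}(t)-\varphi_{i}(s)|^{2}\leqslant C^{2}\inf_{|I^{c}|\leqslant p}\sum_{i\in I}|t_{i}-s_{i}|^{2},$$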

where $C\geqslant 1$ is an absolute constant and $I\subset{\mathbb{N}}$ , $I^{c}={\mathbb{N}}\backslash I$ .

1 Introduction and notation

Throughout this paper we use the following notation. For a set $A$, the number of elements of $A$ is denoted by $|A|$. If $t=(t_{n})_{n\geqslant 1}$ is a sequence of real numbers and $p\geqslant 1$, then $\|t\|_{p}=(\sum_{n=1}^{\infty}|t_{n}|^{p})^{\frac{1}{p}}$ and $\ell^{p}$ is the space of all sequences $t$ with $\|t\|_{p}<\infty$. If $S,T\subset\ell^{p}$ then $S+T=\{s+t:s\in S,t\in T\}$. For a random variable $\xi$ and $p>0$ we put $\|\xi\|_{p}=(\mathbf{E}|\xi|^{p})^{\frac{1}{p}}$. If $(\xi_{i})_{i\geqslant 1}$ is a sequence of independent, identically distributed random variables such that $\mathbf{E}\xi_{i}=0$, $\mathbf{E}\xi_{i}^{2}=1$ and $t=(t_{n})\in\ell^{2}$, then the random variable

$$X_{t}=\sum_{i=1}^{\infty}t_{i}\xi_{i},\tag{1}$$

is well-defined. For each $T\subset\ell^{2}$ with $0\in T$ the process $X_{T}=(X_{t})_{t\in T}$ is called canonical. The convergence of the above series holds in the sense of $\|\cdot\|_{2}$ which means that

$$\lim_{n\rightarrow\infty}\Big\|\sum_{i=1}^{n}t_{i}\xi_{i}-X_{t}\Big\|_{2}=0.$$

Clearly,

$$\|X_{t}-X_{s}\|_{2}=\|t-s\|_{2},\quad\mbox{for }s,t\in T.$$

Remark 1

The almost sure convergence in (1) may also be guaranteed when the independence assumption on the $\xi_{i}$’s is dropped. In that case we may consider a finite-dimensional version of (1), where $T\subset{\mathbb{R}}^{d}$. The most studied example is when the $\xi_{i}$’s, which may be dependent, have log-concave tails, i.e. $\mathbf{P}(|\xi_{i}|>t)=\exp(-N_{i}(t))$ for convex $N_{i}:[0,\infty]\rightarrow[0,\infty]$.

We want to distinguish two types of canonical processes which will be of special interest. If $(\xi_{n})=(\varepsilon_{n})$ and $\mathbf{P}(\varepsilon_{n}=1)=\mathbf{P}(\varepsilon_{n}=-1)=\frac{1}{2}$, then the process $X_{T}$ is called canonical Bernoulli and is denoted by $B_{T}=(B_{t})_{t\in T}$. This class of processes is important for various applications, e.g. infinitely divisible processes [16] and empirical processes (see [17] for a comprehensive study). If $(\xi_{n})=(g_{n})$ and the $g_{n}$ are distributed according to the normal law $\mathcal{N}(0,1)$, then the process $X_{T}$ is called canonical Gaussian and is denoted by $G_{T}=(G_{t})_{t\in T}$. In fact, canonical Gaussian processes can be seen as a motivation to study canonical processes in general, because the Karhunen-Loève representation expresses a separable Gaussian process as a canonical Gaussian process (see e.g. [10], Corollary 5.3.4).

The main objects of study are the suprema of canonical processes. For any set $T$ and a stochastic process $(X_{t})_{t\in T}$ we define

$$S_{X}(T)=\sup_{F\subset T}\mathbf{E}\sup_{t\in F}X_{t},$$

where the supremum is taken over all finite subsets $F$ of $T$. Usually, by considering the separable modification of $X_{t}$, $t\in T$, it is possible to guarantee that $\sup_{t\in T}X_{t}$ is a well-defined random variable (for the definition of a separable version of the process and a discussion of the measurability of the supremum in the general setting of a Banach space which is not necessarily separable, see Ch. 2 in [9]). In this case $S_{X}(T)$ coincides with the usual expectation of the supremum of $X_{t}$, namely

$$S_{X}(T)=\sup_{F\subset T}\mathbf{E}\sup_{t\in F}X_{t}=\mathbf{E}\sup_{t\in T}X_{t}.$$

Let us finish this section with a few important technicalities which will be helpful in dealing with canonical processes. We have that $S_{X}(T)=S_{X}(T-t)$ , where $T-t=\{s-t:\;s\in T\}$ so we may always require that $0\in T$ . Moreover, $S_{X}(T)=S_{X}(\mathrm{Conv}T)$ and $S_{X}(T)=S_{X}(\mathrm{cl}T)$ , where $\mathrm{Conv}T$ is the convex hull of the set $T$ and $\mathrm{cl}T$ is the closure of $T$ in $\ell^{2}$ .
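For a finite set $T\subset{\mathbb{R}}^{d}$ the quantity $S_{B}(T)$ can be computed exactly by averaging over all $2^{d}$ sign vectors, which gives a quick way to check the translation invariance $S_{X}(T)=S_{X}(T-t)$ on a toy example. The following Python sketch is only an illustration; the function name `bernoulli_sup_expectation`, the set `T` and the vector `shift` are hypothetical choices made here and do not come from the paper.

```python
from itertools import product


def bernoulli_sup_expectation(T):
    """Exact E sup_{t in T} sum_i t_i * eps_i for a finite T in R^d.

    The expectation is the average over all 2^d sign vectors eps.
    """
    d = len(T[0])
    total = 0.0
    for eps in product((-1, 1), repeat=d):
        # Supremum of the canonical Bernoulli process for this realization.
        total += max(sum(ti * ei for ti, ei in zip(t, eps)) for t in T)
    return total / 2 ** d


# Hypothetical toy data: two points in R^2 and an arbitrary shift vector.
T = [(1.0, 0.0), (0.0, 1.0)]
shift = (0.5, -2.0)
T_shifted = [tuple(ti - si for ti, si in zip(t, shift)) for t in T]

s_T = bernoulli_sup_expectation(T)
s_shifted = bernoulli_sup_expectation(T_shifted)
print(s_T, s_shifted)
```

Here $S_{B}(T)=\mathbf{E}\max(\varepsilon_{1},\varepsilon_{2})=\frac{1}{2}$, and the shifted set gives the same value because the shift contributes a centered term to every $B_{t}$.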

We follow the convention that numerical constants denoted by the same letter might vary from line to line. Constants that stay the same will carry a subscript, e.g. $C_{1},C_{2}$, etc.

2 Suprema of canonical processes via chaining

First, we recall the basics of the chaining approach to upper bounds for stochastic processes. We say that a sequence ${\cal A}=({\cal A}_{n})_{n\geqslant 0}$ of partitions of $T$ is an admissible partition of $T$ if ${\cal A}_{0}=\{T\}$ and $|{\cal A}_{n}|\leqslant N_{n}=2^{2^{n}}$ for $n\geqslant 1$ (usually it is also required that these partitions are nested, i.e. for any set $A\in{\cal A}_{n}$, $n\geqslant 1$, there is a set $B\in{\cal A}_{n-1}$ such that $A\subset B$). For $t\in T$ we denote by $A_{n}(t)$ the unique element of the partition ${\cal A}_{n}$ which contains $t$. Let $\pi_{n}:T\rightarrow T$, $n\geqslant 0$, be a sequence of maps such that $\pi_{n}(t)=\pi_{n}(s)$ for all $s,t\in A\in{\cal A}_{n}$; we denote this common value by $\pi_{n}(A)$. Let

$$\gamma_{X}(T)=\inf\sup_{t\in T}\sum_{n\geqslant 1}\|X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}\|_{2^{n}},$$

where the infimum is taken over all admissible sequences of partitions. We denote by $T_{n}$ the family of all $\pi_{n}(t)$, $t\in T$. In words, at each step of partitioning we choose some point $\pi_{n}(t)$ which belongs to the same partition set as $t$. Clearly, $|T_{n}|\leqslant 2^{2^{n}}$ and $|T_{0}|=1$. By a chain we mean writing $X_{t}$ as the sum of consecutive approximations, i.e.

$$X_{t}=X_{\pi_{0}(t)}+\sum_{n\geqslant 1}(X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}).$$

Let us observe the following property of $\gamma_{X}(T)$ .

Lemma 1

Let $T_{1}$ and $T_{2}$ be some index sets. Suppose that for some stochastic process $(X_{t})$ the quantities $\gamma_{X}(T_{1})$, $\gamma_{X}(T_{2})$ and $\gamma_{X}(T_{1}+T_{2})$ are well-defined and that for $n\geqslant 1$

$$\|X_{t}\|_{2^{n+1}}\leqslant C_{1}\|X_{t}\|_{2^{n}}.\tag{2}$$

Then,

$$\gamma_{X}(T_{1}+T_{2})\leqslant C_{1}(\gamma_{X}(T_{1})+\gamma_{X}(T_{2})).$$

In particular, for canonical Bernoulli and canonical Gaussian processes the above inequality holds with $C_{1}=\sqrt{3}$ .

Proof. Let $t\in T_{1}$ and $s\in T_{2}$ . Let $({\cal A}_{n}(T_{1}))_{n\geqslant 0}$ and $({\cal A}_{n}(T_{2}))_{n\geqslant 0}$ be admissible partitions of $T_{1}$ and $T_{2}$ respectively. Define ${\cal A}_{n+1}(T_{1}+T_{2})$ as all possible sums of these partitions i.e. ${\cal A}_{n+1}(T_{1}+T_{2})=\{A+B:A\in{\cal A}_{n}(T_{1}),B\in{\cal A}_{n}(T_{2})\}$ and ${\cal A}_{0}(T_{1}+T_{2})=\{T_{1}+T_{2}\}$ . It is obviously admissible since $N_{n}\cdot N_{n}=N_{n+1}$ . Moreover, for $A\in{\cal A}_{n}(T_{1})$ and $B\in{\cal A}_{n}(T_{2})$ let $\pi_{n+1}(A+B)=\pi_{n}(A)+\pi_{n}(B)$ . We also put $\pi_{0}(T_{1}+T_{2})=\pi_{0}(T_{1})+\pi_{0}(T_{2})$ . Clearly, for $t\in A$ and $s\in B$

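$$\|X_{\pi_{n+1}(t+s)}-X_{\pi_{n}(t+s)}\|_{2^{n+1}}\leqslant\|X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}\|_{2^{n+1}}+\|X_{\pi_{n}(s)}-X_{\pi_{n-1}(s)}\|_{2^{n+1}}.$$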

So, by (2) we obtain

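$$\|X_{\pi_{n+1}(t+s)}-X_{\pi_{n}(t+s)}\|_{2^{n+1}}\leqslant C_{1}(\|X_{\pi_{n}(t)}-X_{\pi_{n-1}(t)}\|_{2^{n}}+\|X_{\pi_{n}(s)}-X_{\pi_{n-1}(s)}\|_{2^{n}}).$$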

The conclusion follows since $t\in T_{1}$, $s\in T_{2}$ were arbitrary, as were the partitions ${\cal A}_{n}(T_{1})$ and ${\cal A}_{n}(T_{2})$. That inequality (2) holds with constant $\sqrt{3}$ for canonical Gaussian processes is a straightforward consequence of the fact that for $p\in{\mathbb{N}}$ we have $\|G_{t}\|_{2p}=\|t\|_{2}((2p-1)!!)^{\frac{1}{2p}}$, where $(2p-1)!!=(2p-1)(2p-3)\dots 1$. The result for canonical Bernoulli processes follows from the general Kahane’s inequality (see e.g. [3], Theorem 13.2.1).

$\blacksquare$
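The claim that (2) holds with $C_{1}=\sqrt{3}$ can be checked numerically. The Python sketch below uses the standard moment identity $\mathbf{E}g^{2p}=(2p-1)!!$ for a standard normal $g$ to compute the ratios $\|g\|_{2^{n+1}}/\|g\|_{2^{n}}$. For the Bernoulli case it evaluates the hypercontractive constants $\sqrt{(q-1)/(p-1)}$ with $q=2^{n+1}$, $p=2^{n}$; using this particular form of the Khintchine-Kahane constant is an assumption made here for illustration, as the text only cites Kahane's inequality. All function and variable names are hypothetical.

```python
import math


def gaussian_norm(q: int) -> float:
    """Return ||g||_q for a standard normal g and even q, via E g^q = (q-1)!!."""
    if q <= 0 or q % 2 != 0:
        raise ValueError("q must be a positive even integer")
    # log((q-1)!!) = log 1 + log 3 + ... + log(q-1); logarithms avoid huge integers.
    log_double_factorial = sum(math.log(k) for k in range(1, q, 2))
    return math.exp(log_double_factorial / q)


def consecutive_ratios(n_max: int) -> list:
    """Ratios ||g||_{2^{n+1}} / ||g||_{2^n} for n = 1, ..., n_max."""
    return [gaussian_norm(2 ** (n + 1)) / gaussian_norm(2 ** n)
            for n in range(1, n_max + 1)]


ratios = consecutive_ratios(10)

# Assumed hypercontractive constants for Rademacher sums between L^{2^n} and L^{2^{n+1}}.
bernoulli_constants = [math.sqrt((2 ** (n + 1) - 1) / (2 ** n - 1))
                       for n in range(1, 11)]

print(max(ratios), max(bernoulli_constants), math.sqrt(3))
```

In both cases the largest value does not exceed $\sqrt{3}$; for the assumed Bernoulli constants the maximum is attained at $n=1$.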

In [7], [11] it was proved that under suitable regularity assumptions $S_{X}(T)\leqslant K\gamma_{X}(T)$, where $K$ is a universal constant. Let us give a short argument for a similar upper bound.

Theorem 1

For a stochastic process $(X_{t})_{t\in T}$ for which $S_{X}(T)$ is well-defined we have that

$$S_{X}(T)\leqslant 4\gamma_{X}(T).$$

Proof. Let $({\cal A}_{n})_{n\geqslant 0}$ be any admissible partition of $T$. For any set $A\in{\cal A}_{n}$ and $k\leqslant n$ we denote by $A_{k}$ its $k$-parent, i.e. $A_{k}\in{\cal A}_{k}$ and $A\subset A_{k}$. Consequently, if $t\in A\in{\cal A}_{n}$ then $\pi_{k}(t)=\pi_{k}(A_{k})$. The proof is based on the analysis of the partition sequence. Let $N$ be fixed and consider ${\cal A}_{N}$. The chaining argument gives

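$$\begin{aligned}
\mathbf{E}\sup_{A\in{\cal A}_{N}}|X_{\pi_{N}(A)}-X_{\pi_{0}(A)}| &\leqslant\mathbf{E}\sup_{A\in{\cal A}_{N}}\sum_{k=1}^{N}|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}| \\
&\leqslant\mathbf{E}\sup_{A\in{\cal A}_{N}}\sum_{k=1}^{N}2\|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}\|_{2^{k}}\Big(1+\mathbf{E}\sup_{A\in{\cal A}_{N}}\Big(\frac{|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}|}{2\|X_{\pi_{k}(A_{k})}-X_{\pi_{k-1}(A_{k-1})}\|_{2^{k}}}-1\Big)_{+}\Big) \\
&\leqslant 2\gamma_{X}(T)\Big(1+\sum_{k=1}^{N}\sum_{B\in{\cal A}_{k}}\mathbf{E}\Big(\frac{|X_{\pi_{k}(B)}-X_{\pi_{k-1}(B_{k-1})}|}{2\|X_{\pi_{k}(B)}-X_{\pi_{k-1}(B_{k-1})}\|_{2^{k}}}-1\Big)_{+}\Big).
\end{aligned}$$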

We show that

$$\mathbf{E}\Big(\frac{|X_{t}|}{2\|X_{t}\|_{2^{k}}}-1\Big)_{+}\leqslant\frac{1}{2^{k}N_{k}}.$$

Indeed, denoting $Y_{t}=X_{t}/\|X_{t}\|_{2^{k}}$ we have $\mathbf{E}|Y_{t}|^{2^{k}}=1$ and hence

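$$\begin{aligned}
\mathbf{E}((|Y_{t}|/2)-1)_{+} &\leqslant\int_{2}^{\infty}\frac{v^{2^{k}-1}}{2^{2^{k}}}\mathbf{P}(|Y_{t}|\geqslant v)dv\leqslant\frac{1}{2^{k}N_{k}}\int_{0}^{\infty}2^{k}v^{2^{k}-1}\mathbf{P}(|Y_{t}|\geqslant v)dv \\
&=\frac{\mathbf{E}|Y_{t}|^{2^{k}}}{2^{k}N_{k}}=\frac{1}{2^{k}N_{k}}.
\end{aligned}$$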

Therefore,

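$$\mathbf{E}\sup_{A\in{\cal A}_{N}}|X_{\pi_{N}(A)}-X_{\pi_{0}(A)}|\leqslant 2\gamma_{X}(T)\Big(1+\sum_{k=1}^{N}\frac{N_{k}}{2^{k}N_{k}}\Big)\leqslant 4\gamma_{X}(T).$$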

This ends the proof.

$\blacksquare$

The question of lower bounds for the suprema of canonical processes is much more involved. Let us summarize the classes of processes for which a full characterization of the supremum (i.e. a lower and an upper bound) can be provided in terms of $\gamma_{X}(T)$. The seminal result of Fernique and Talagrand, known as the Majorizing Measure Theorem (see [2], [14] or [17] for the modern formulation), is equivalent to the statement that $S_{G}(T)$ is comparable with $\gamma_{G}(T)$ up to a numerical constant. In [15] it was proved that $S_{X}(T)$ is comparable with a quantity which, in a sense, is equivalent to $\gamma_{X}(T)$ for canonical processes generated by $\xi$’s which are symmetric and satisfy $\mathbf{P}(|\xi|>t)=\exp(-c_{p}t^{p})$ for a fixed $p\in[1,2]$. A similar result holds for $p>2$, yet it is only possible to show that there exists a set $T^{\prime}\subset\ell^{2}$ (which may significantly differ from $T$) such that $S_{X}(T)$ is comparable with $\gamma_{X}(T^{\prime})$ up to a numerical constant. Note that the limiting case $p\rightarrow\infty$ is the question about canonical Bernoulli processes. Later, the idea of [15] was slightly generalized by R. Latała in [6] to canonical processes generated by $\xi$ with log-concave tails, yet under specific regularity assumptions. Finally, in [8] it was proved that it suffices to assume only certain conditions on the moment growth of $\xi$. Unfortunately, this result still does not apply to Bernoulli processes. The characterization of $S_{B}(T)$ was a long-standing problem posed by M. Talagrand and known as the Bernoulli conjecture. It was finally proved in [1]. In order to explain this result we need to introduce a family of distances relevant to canonical Bernoulli processes, which follow from some properties of Bernoulli-type random variables. By known results (see [4], [12], and [5] for the formulation below), for any $p\in{\mathbb{N}}$, $p\geqslant 1$,

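$$\|B_{t}\|_{p}\leqslant\sum_{i=1}^{p}|t^{\ast}_{i}|+\sqrt{p}\Big(\sum_{i>p}|t^{\ast}_{i}|^{2}\Big)^{\frac{1}{2}}\leqslant 4\|B_{t}\|_{p},\tag{4}$$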

where $(t^{\ast}_{i})_{i\geqslant 1}$ is the rearrangement of $(t_{i})_{i\geqslant 1}$ such that $|t^{\ast}_{1}|\geqslant|t^{\ast}_{2}|\geqslant\ldots$ . Now, if we denote by $I\subset{\mathbb{N}}$ some index set, we can think of (4) as a decomposition of the norm $\|B_{t}\|_{p}$ into the $\ell^{1}$ part

$$\sum_{i=1}^{p}|t^{\ast}_{i}|=\sup_{|I^{c}|\leqslant p}\sum_{i\in I^{c}}|t_{i}|$$

and the Gaussian part

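$$\sqrt{p}\Big(\sum_{i>p}|t^{\ast}_{i}|^{2}\Big)^{\frac{1}{2}}=\sqrt{p}\inf_{|I^{c}|\leqslant p}\Big(\sum_{i\in I}|t_{i}|^{2}\Big)^{\frac{1}{2}}.$$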
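The two parts of the decomposition are easy to compute: the $\ell^{1}$ part is the sum of the $p$ largest absolute values, and the Gaussian part is $\sqrt{p}$ times the $\ell^{2}$ norm of what remains after removing them. The Python sketch below compares this sorted-rearrangement computation with a brute-force search over all index sets $I^{c}$ with $|I^{c}|\leqslant p$ on a small vector. It is only an illustration; the function names and the test vector are hypothetical.

```python
import math
from itertools import combinations


def l1_part(t, p):
    """Sum of the p largest |t_i| (the l^1 part of the norm of B_t)."""
    s = sorted((abs(x) for x in t), reverse=True)
    return sum(s[:p])


def gaussian_part(t, p):
    """sqrt(p) times the l^2 norm of t after removing its p largest entries."""
    s = sorted((abs(x) for x in t), reverse=True)
    return math.sqrt(p) * math.sqrt(sum(x * x for x in s[p:]))


def l1_part_bruteforce(t, p):
    """sup over index sets I^c with |I^c| <= p of sum_{i in I^c} |t_i|."""
    n = len(t)
    best = 0.0
    for k in range(min(p, n) + 1):
        for removed in combinations(range(n), k):
            best = max(best, sum(abs(t[i]) for i in removed))
    return best


def gaussian_part_bruteforce(t, p):
    """sqrt(p) * inf over |I^c| <= p of (sum_{i in I} t_i^2)^(1/2)."""
    n = len(t)
    best = math.inf
    for k in range(min(p, n) + 1):
        for removed in combinations(range(n), k):
            kept = set(range(n)) - set(removed)
            best = min(best, sum(t[i] ** 2 for i in kept))
    return math.sqrt(p) * math.sqrt(best)


# Hypothetical test vector.
t = [3, -1, 4, 1, -5, 9, 2, 6]
p = 3
print(l1_part(t, p), l1_part_bruteforce(t, p))
print(gaussian_part(t, p), gaussian_part_bruteforce(t, p))
```

For this vector and $p=3$ the three largest absolute values are $9,6,5$, so the $\ell^{1}$ part equals $20$ and the Gaussian part equals $\sqrt{3}\cdot\sqrt{31}$.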

In fact, a characterization similar to (4) can be formulated for a broad class of processes, including processes with log-concave distributions. In particular, in [7] there is a characterization of $\|X_{t}-X_{s}\|_{p}$ for canonical processes based on one-unconditional log-concave random variables. As we have mentioned, the characterization of $S_{B}(T)$ was known as the Bernoulli conjecture and was finally proved in [1]. It states that, similarly to (4), the understanding of $S_{B}(T)$ can be decomposed into the Gaussian and the $\ell^{1}$ part. More precisely, there must exist a decomposition of $T$ into $T_{1},T_{2}\subset\ell^{2}$ such that $T_{1}+T_{2}\supset T$ and, moreover, $S_{B}(T)$ dominates up to a universal constant both $\sup_{t\in T_{1}}\|t\|_{1}$ and $S_{G}(T_{2})$. Usually such a decomposition is formulated in the language of the existence of a mapping $\pi:T\rightarrow\ell^{2}$ which defines $T_{1}=\{t-\pi(t):t\in T\}$ and $T_{2}=\{\pi(t):t\in T\}$. Recall that we can always assume that $0\in T$ and $\pi(0)=0$. We now turn to prove that the Bernoulli Theorem [1] implies that there must exist a subset $T^{\prime}\subset\ell^{2}$ such that $\gamma_{B}(T^{\prime})$ is comparable to $S_{B}(T)$. The idea of the proof works also for other classes of canonical processes for which we can characterize $S_{X}(T)$ in terms of increments, see Remark 2 below.

Theorem 2

There exists a function $\pi:T\rightarrow\ell^{2}$ such that

$$K^{-1}\gamma_{B}(T_{1}+T_{2})\leqslant S_{B}(T)\leqslant K\gamma_{B}(T_{1}+T_{2}),\tag{5}$$

where $K$ is a universal constant, $T_{1}=\{t-\pi(t):t\in T\}$ and $T_{2}=\{\pi(t):t\in T\}$ .

Proof. First, we notice that it suffices to prove the result for countable sets $T$. Indeed, for any dense countable subset $\bar{T}$ of $T$ it is true that $S_{B}(T)=S_{B}(\bar{T})$. Suppose we have a decomposition of $\bar{T}$ into $\bar{T}_{1}$ and $\bar{T}_{2}$ so that (5) holds. It is easy to observe that $\gamma_{B}(\bar{T}_{1})=\gamma_{B}(\mathrm{cl}\bar{T}_{1})$ and $\gamma_{B}(\bar{T}_{2})=\gamma_{B}(\mathrm{cl}\bar{T}_{2})$. Moreover, $T_{1}$ and $T_{2}$ must be bounded since otherwise $\gamma_{B}(T_{1})$ or $\gamma_{B}(T_{2})$ is infinite and hence also $S_{B}(T)$. Therefore, $\mathrm{cl}\bar{T}_{1}+\mathrm{cl}\bar{T}_{2}$ is compact and contains $\mathrm{cl}T$. Consequently, with no loss of generality we can assume that $T$ is countable. Then, by the main result of [1] we get the existence of $\pi:T\rightarrow\ell^{2}$ and consequently the existence of a decomposition into countable sets $T_{1},T_{2}\subset\ell^{2}$ such that $T\subset T_{1}+T_{2}$ and

$$S_{B}(T)\geqslant K^{-1}\Big(\sup_{t\in T_{1}}\|t\|_{1}+S_{G}(T_{2})\Big),\tag{6}$$

where $K$ is a universal constant. By the theorems of Pisier [13] and Talagrand [17] we have that $S_{G}(T)$ is comparable with $\gamma_{G}(T)$. Let $g$ be a standard normal variable independent of $B_{t}$, $t\in T$. Observe that for any $p\geqslant 1$

$$\sqrt{\frac{2}{\pi}}\|B_{t}-B_{s}\|_{p}=\mathbf{E}|g|\,\|B_{t}-B_{s}\|_{p}\leqslant\|G_{t}-G_{s}\|_{p}$$

and hence $\frac{\sqrt{\pi}}{\sqrt{2}}\gamma_{G}(T_{2})\geqslant\gamma_{B}(T_{2})$ . On the other hand, we can choose an admissible sequence $(T^{n}_{1})^{\infty}_{n=0}$ such that $\bigcup^{\infty}_{n=0}T^{n}_{1}=T_{1}$ . Fix any given point $t_{0}$ in $T$ . Define

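$$\pi_{m}(t)=\left\{\begin{array}[]{lll}t_{0}&\mbox{if}&t\in T\backslash T^{m}\\ t&\mbox{if}&t\in T^{m}\backslash T^{m-1}.\end{array}\right.$$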

If $t\in T^{m}\backslash T^{m-1}$

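$$\sum_{n=1}^{\infty}\|B_{\pi_{n}(t)}-B_{\pi_{n-1}(t)}\|_{2^{n}}=\|B_{t}-B_{t_{0}}\|_{2^{m}}\leqslant\|t-t_{0}\|_{1}.$$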

Therefore, by the triangle inequality

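$$\sup_{t\in T}\sum_{n=1}^{\infty}\|B_{\pi_{n}(t)}-B_{\pi_{n-1}(t)}\|_{2^{n}}\leqslant 2\sup_{t\in T_{1}}\|t\|_{1}.$$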

In this way we have proved that

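$$S_{B}(T)\geqslant K^{-1}(\gamma_{B}(T_{1})+\gamma_{B}(T_{2}))\geqslant(KC_{1})^{-1}\gamma_{B}(T_{1}+T_{2}).$$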

On the other hand, we have a trivial upper bound

$$S_{B}(T)\leqslant S_{B}(T_{1})+S_{B}(T_{2})=S_{B}(T_{1}+T_{2})\leqslant 4\gamma_{B}(T_{1}+T_{2}),$$

by Theorem 1.

$\blacksquare$

Let us also observe that for $\mathbf{P}(|\xi_{i}|>t)=\exp(-c_{p}t^{p})$, $p\geqslant 2$, we could give a similar proof. It is based on the fact that for any such $p$ Talagrand’s characterization of $S_{X}(T)$ [17] is available.

Remark 2

For the class of canonical processes based on independent symmetric $\xi_{i}$ such that $\mathbf{P}(|\xi_{i}|>t)=\exp(-c_{p}t^{p})$, $p\geqslant 2$, $S_{X}(T)$ is comparable with $\gamma_{X}(T_{1}+T_{2})$ up to a constant for some $T_{1}+T_{2}\subset\ell^{2}$ that contains $T$. The set $T_{2}$ again accounts for the Gaussian part, whereas $T_{1}\subset\ell^{p^{\ast}}$ for $p^{\ast}=\frac{p}{p-1}$.

In general, we conjecture that the same is true for canonical processes based on log-concave random variables.

Conjecture 1

If $(\xi_{i})$ , $i\geqslant 1$ is a sequence of independent log-concave random variables with mean 0 and variance 1 then there exists $\pi:T\rightarrow\ell^{2}$ and sets $T_{1}=\{t-\pi(t)\in\ell^{2}:\;t\in T\}$ , $T_{2}=\{\pi(t)\in\ell^{2}:\;t\in T\}$ such that

[TABLE]

where $K$ is a universal constant.

3 Contractions of canonical Bernoulli processes

Suppose we have a map $\varphi:T\rightarrow\ell^{2}$ . The main question we treat in this paper is under what assumptions on $X_{t}$ , $T$ and $\varphi$ we can show that $S_{X}(\varphi(T))$ is bounded by $S_{X}(T)$ up to a numerical constant. In particular we are interested in the case of canonical Bernoulli processes. Let’s start with classic results concerning comparison of Gaussian processes. It is well-known that if $G_{t}$ and $G^{\prime}_{t}$ , $t\in T$ are centered Gaussian processes and $\mathbf{E}|G_{t}-G_{s}|^{2}\leqslant\mathbf{E}|G^{\prime}_{t}-G^{\prime}_{s}|^{2}$ , then for each finite subset $F\subset T$

[TABLE]

This comparison is a consequence of Slepian’s Lemma (Corollary 3.14 in [9] provides the proof with constant 2, the proof with the best possible constant $1$ is Corollary 2.1.3 in [2]). Note also that by the Majorizing Measure Theorem the result can be generalized to the case when we compare a centered Gaussian process with a centered process for which we only require sub-gaussianity property, see Theorem 12.16 in [9]. We start with a discussion on possible extensions of this result. It is natural to ask for other cases when similar comparison results hold. From Theorem 1 it can be easily deduced that if we can compare moments then we can compare $\gamma$ -type upper bounds.

Corollary 1

Suppose that $(X_{t})_{t\in T}$ is a canonical process, $\varphi:T\rightarrow\ell^{2}$, and $C$ is a constant such that for each $n\geqslant 1$

[TABLE]

then $S_{X}(\varphi(T))\leqslant 4C\gamma_{X}(T)$ .

Proof. Clearly, by Theorem 1 we have $S_{X}(\varphi(T))\leqslant 4\gamma_{X}(\varphi(T))\leqslant 4C\gamma_{X}(T)$ .

$\blacksquare$

This means that if we could show that $S_{X}(T)\geqslant K^{-1}\gamma_{X}(T)$, then by Corollary 1 we would be able to prove that $S_{X}(\varphi(T))\leqslant 4CKS_{X}(T)$. Unfortunately, in general, there is no proof that $\gamma_{X}(T)$ is comparable with $S_{X}(T)$. On the other hand, as discussed before, there are cases where the idea works. In particular, we could use Corollary 1 in order to recover the Gaussian comparison result with some absolute constant. However, in the Gaussian setting, one can simply refer to (7), rewriting it in the following way

[TABLE]

We now move to the case of canonical Bernoulli processes. The only known comparison results are Theorem 2.1 in [16] and Theorem 4.12 in [9]. They state that if $\varphi=(\varphi_{i})_{i\geqslant 1}:T\rightarrow\ell^{2}$, where $\varphi_{i}:{\mathbb{R}}\rightarrow{\mathbb{R}}$ are contractions, then $S_{B}(T)$ dominates $S_{B}(\varphi(T))$ with the constant $1$, namely

[TABLE]
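The coordinatewise contraction principle can be observed on a small finite example by computing both suprema exactly over all sign vectors. The Python sketch below is only an illustration under assumptions made here: the set `T`, the choice of the $1$-Lipschitz maps $\tanh$, $\sin$ and $|\cdot|$ (each vanishing at $0$) as the coordinate contractions, and all function names are hypothetical and do not come from the paper.

```python
import math
from itertools import product


def bernoulli_sup_expectation(T):
    """Exact E sup_{t in T} sum_i t_i * eps_i, averaging over all sign vectors."""
    d = len(T[0])
    total = 0.0
    for eps in product((-1, 1), repeat=d):
        total += max(sum(ti * ei for ti, ei in zip(t, eps)) for t in T)
    return total / 2 ** d


def apply_coordinatewise(phis, T):
    """Map each t in T to (phi_1(t_1), ..., phi_d(t_d))."""
    return [tuple(phi(ti) for phi, ti in zip(phis, t)) for t in T]


# Hypothetical finite index set in R^3 containing 0.
T = [(0.0, 0.0, 0.0), (1.0, -2.0, 0.5), (-1.5, 0.3, 2.0), (0.7, 0.7, -0.7)]

# Assumed coordinate maps: each is 1-Lipschitz and vanishes at 0.
phis = (math.tanh, math.sin, abs)

s_T = bernoulli_sup_expectation(T)
s_phi_T = bernoulli_sup_expectation(apply_coordinatewise(phis, T))
print(s_phi_T, s_T)
```

The printed pair illustrates the comparison with constant $1$: the first number does not exceed the second.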

Note that if we are interested in the comparison up to a numerical constant (not necessarily equal $1$ ) then the requirement of coordinate contractions is too demanding. However, it is known that the result analogous to (7), where we assume that $\varphi:\ell^{2}\rightarrow\ell^{2}$ is a Lipschitz contraction does not hold for Bernoulli processes. Therefore some additional assumptions on $\varphi$ or $T$ are required. As we show in this paper, the comparison for canonical Bernoulli processes should depend on a suitable family of distances already presented in (4). The straightforward consequence of Theorem 2 is the following comparison result.

Corollary 2

Suppose that $\varphi:T\rightarrow\ell^{2}$ can be extended to $T_{1}+T_{2}$ in such a way that for any $p\geqslant 1$

[TABLE]

then $S_{B}(\varphi(T))\leqslant KS_{B}(T)$ , where $K$ is a universal constant.

Proof. Clearly, by Theorem 1 we have $S_{B}(\varphi(T))\leqslant 4\gamma_{B}(\varphi(T))$ . Hence, by Theorem 2

[TABLE]

$\blacksquare$

Note that the trouble with application of the above result is that $T_{1}+T_{2}$ may be much larger than $T$ . We conjecture the following generalization of the above result.

Conjecture 2

Let $\varphi=(\varphi_{i})_{i\geqslant 1}:T\rightarrow\ell^{2}$ . If

[TABLE]

then $S_{B}(\varphi(T))\leqslant KS_{B}(T)$ , for an absolute constant $K$ .

Towards this aim we prove a weaker form of the conjecture. As we have explained, the norm $\|B_{t}-B_{s}\|_{p}$ can be decomposed into the Gaussian and the $\ell^{1}$ part. Our condition states that if the Gaussian part of $\|B_{t}-B_{s}\|_{p}$ dominates the Gaussian part of $\|B_{\varphi(t)}-B_{\varphi(s)}\|_{p}$ for all $s,t\in T$ and $p\geqslant 1$, then $S_{B}(T)$ dominates $S_{B}(\varphi(T))$ up to an absolute constant.

Theorem 3

Suppose that for all $s,t\in T$ and all integers $p\geqslant 0$ we have

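$$\inf_{|I^{c}|\leqslant Cp}\sum_{i\in I}|\varphi_{i}(t)-\varphi_{i}(s)|^{2}\leqslant C^{2}\inf_{|I^{c}|\leqslant p}\sum_{i\in I}|t_{i}-s_{i}|^{2},\tag{12}$$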

for an absolute constant $C\geqslant 1$ . Then $S_{B}(\varphi(T))\leqslant KS_{B}(T)$ , where $K$ is a universal constant.
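For a finite set $T\subset{\mathbb{R}}^{d}$ the hypothesis of Theorem 3, as described in the abstract, can be verified directly: the infimum over index sets $I$ with $|I^{c}|\leqslant m$ of $\sum_{i\in I}v_{i}^{2}$ is obtained by discarding the $m$ largest squares. The Python sketch below is only an illustration; the function names, the rounding of $Cp$ down to an integer, the numerical tolerance and the test maps are assumptions made here.

```python
import math
from itertools import combinations


def gaussian_tail(v, m):
    """inf over |I^c| <= m of sum_{i in I} v_i^2, i.e. drop the m largest squares."""
    squares = sorted((x * x for x in v), reverse=True)
    return sum(squares[m:])


def satisfies_condition(T, phi, C):
    """Check the Gaussian-part comparison for all pairs s, t in T and 0 <= p <= d."""
    d = len(T[0])
    for s, t in combinations(T, 2):
        diff = [a - b for a, b in zip(t, s)]
        phi_diff = [a - b for a, b in zip(phi(t), phi(s))]
        for p in range(d + 1):
            # Assumption: C * p is rounded down when it is not an integer.
            lhs = gaussian_tail(phi_diff, math.floor(C * p))
            rhs = C ** 2 * gaussian_tail(diff, p)
            if lhs > rhs + 1e-12:
                return False
    return True


# Hypothetical finite index set and test maps.
T = [(0.0, 0.0, 0.0), (1.0, -2.0, 0.5), (-1.5, 0.3, 2.0)]


def identity(t):
    return tuple(t)


def double(t):
    return tuple(2 * x for x in t)


def tanh_map(t):
    return tuple(math.tanh(x) for x in t)


print(satisfies_condition(T, identity, 1))
print(satisfies_condition(T, tanh_map, 1))
print(satisfies_condition(T, double, 1), satisfies_condition(T, double, 2))
```

Coordinatewise contractions such as `tanh_map` pass the check with $C=1$, in line with the remark that the condition is weaker than coordinatewise contraction, whereas doubling every coordinate fails for $C=1$ and passes for $C=2$.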

The result is stronger than the comparison for Bernoulli processes (10). In this way Theorem 3 supports the conjecture that (11) suffices to prove that $S_{B}(\varphi(T))\leqslant KS_{B}(T)$. Note that there is an important case for which the conjecture is true, namely when all supports $J(t)=\{i\geqslant 1:\;|t_{i}|>0\}$ of $t\in T$ are disjoint. It is crucial to understand that in this case the decomposition postulated in the Bernoulli Theorem can have a special form: $\pi(t)=t_{J^{1}(t)}$ and $t-\pi(t)=t_{J^{2}(t)}$, where $J^{1}(t)$ and $J^{2}(t)$ are disjoint and $J^{1}(t)\cup J^{2}(t)=J(t)$. We show this fact when proving the following result.

Theorem 4

Suppose that (11) is satisfied and the supports $J(t)=\{i\geqslant 1:\;|t_{i}|>0\}$ are disjoint for all $t\in T$. Then $S_{B}(\varphi(T))\leqslant KS_{B}(T)$, where $K$ is a universal constant.

As we show in the last section, results of this type are of interest when one wants to compare weak and strong moments for random series in a Banach space. The question was proposed by K. Oleszkiewicz in private communication.

4 Proof of the main result

In this section we prove Theorem 3 and Theorem 4.

Proof of Theorem 3. The main step in the proof of the Bernoulli theorem (Proposition 6.2 in [1]) is to show the existence of a suitable admissible sequence of partitions. Consequently, if $S_{B}(T)<\infty$ and $0\in T$ then it is possible to define nested partitions ${\cal A}_{n}$ of $T$ such that $|{\cal A}_{n}|\leqslant N_{n}$. Moreover, for each $A\in{\cal A}_{n}$ it is possible to find $j_{n}(A)\in{\mathbb{Z}}$ and $\pi_{n}(A)\in T$ (we use the notation $j_{n}(t)=j_{n}(A_{n}(t))$ and $\pi_{n}(t)=\pi_{n}(A_{n}(t))$, where $t\in A_{n}(t)\in{\cal A}_{n}$) which satisfy the following conditions:

(i) $\|t-s\|_{2}\leqslant\sqrt{M}r^{-j_{0}(T)}$, for $s,t\in T$;

(ii) if $n\geqslant 1$, ${\cal A}_{n}\ni A\subset A^{\prime}\in{\cal A}_{n-1}$ then

(a) either $j_{n}(A)=j_{n-1}(A^{\prime})$ and $\pi_{n}(A)=\pi_{n-1}(A^{\prime})$,

(b) or $j_{n}(A)>j_{n-1}(A^{\prime})$, $\pi_{n}(A)\in A^{\prime}$ and

[TABLE]

where for any $t\in A$

[TABLE]

(iii) Moreover, numbers $j_{n}(A)$, $A\in{\cal A}_{n}$, $n\geqslant 0$ satisfy

[TABLE]

where $L$ is an absolute constant.

As proved in Theorem 3.1 in [1] the existence of the quantities ${\cal A}_{n},j_{n}(A),\pi_{n}(A),I_{n}(A)$ that satisfy conditions (i) and (ii) formulated above implies the existence of a decomposition $T_{1},T_{2}\subset\ell^{2}$ , $T_{1}+T_{2}\supset T$ such that

[TABLE]

Together with condition (iii) we get (6). Our aim is to use the mapping $\varphi$ to transport all the required quantities to $\varphi(T)$. Before we do so, we formulate an auxiliary fact about the sets $I_{n}(A)$: we show that we can get rid of the truncation in (13) if we skip a well-controlled number of coordinates. We observe that for each $t\in A\in{\cal A}_{n}$ there must exist a set $J_{n}(t)$ such that $|J^{c}_{n}(t)|\leqslant M2^{n+1}$ and

[TABLE]

The fact will be proved in two steps. First, we show that $|I_{n}(t)^{c}|\leqslant M2^{n}$ . We may only prove that $|I_{n}(t)|=|I_{n}(A_{n}(t))|\leqslant 2^{n}$ , if $\pi_{n-1}(t)\neq\pi_{n}(t)$ , which implies $j_{n-1}(t)\neq j_{n}(t)$ and $\pi_{n}(t)\in A_{n-1}(t)$ . Therefore, there exists $k\in\{1,\ldots,n\}$ such that

[TABLE]

and hence $\pi_{n}(t)\in A_{n-1}(t)\subset A_{n-k}(t)$ and $\pi_{n-1}(t)=\pi_{n-k}(t)$ , $j_{n-1}(t)=j_{n-k}(t)$ so by the construction of $({\cal A}_{n})_{n\geqslant 0}$

[TABLE]

Consequently,

[TABLE]

Obviously,

[TABLE]

Therefore by the induction, $|I_{n}(t)^{c}|\leqslant M\sum^{n}_{k=1}2^{n-k}\leqslant M2^{n}$ . Let

[TABLE]

The second step is to establish that $|I_{n}(t)\backslash J_{n}(t)|\leqslant M2^{n}$ . Again it suffices to prove the result only for $n$ such that $j_{n}(t)>j_{n-1}(t)$ . Note that by (13)

[TABLE]

and hence the result holds. It remains to observe that

[TABLE]

We now construct an admissible partition sequence together with all the supporting quantities for the set $\varphi(T)$. Let ${\cal B}_{n}$ consist of the sets $\varphi(A)$, $A\in{\cal A}_{n}$. Obviously the partitions ${\cal B}_{n}$ are admissible, nested and ${\cal B}_{0}=\{\varphi(T)\}$. Moreover, for each $n\geqslant 0$ and $A\in{\cal A}_{n}$ we define

[TABLE]

and obviously

[TABLE]
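The transport of partitions from $T$ to $\varphi(T)$ described above can be illustrated on a finite toy example. The following is only a sketch under stated assumptions: the set `T`, the map `phi` and the cardinality bound $|{\cal A}_{n}|\leqslant 2^{2^{n}}$ used in `is_admissible` are hypothetical choices made for illustration (the bound is the usual one in generic chaining), and `phi` is taken injective so that the images of disjoint cells stay disjoint.

```python
from typing import Callable, FrozenSet, List, Tuple

Point = Tuple[float, ...]
Partition = List[FrozenSet[Point]]


def image_partition(partition: Partition, phi: Callable[[Point], Point]) -> Partition:
    """Return {phi(A) : A in partition}, the candidate partition of phi(T)."""
    return [frozenset(phi(t) for t in A) for A in partition]


def is_nested(finer: Partition, coarser: Partition) -> bool:
    """Every cell of the finer partition lies inside some cell of the coarser one."""
    return all(any(A <= B for B in coarser) for A in finer)


def is_admissible(partitions: List[Partition]) -> bool:
    """Check |A_0| = 1 and |A_n| <= 2^(2^n) for n >= 1 (assumed cardinality bound)."""
    if len(partitions[0]) != 1:
        return False
    return all(len(P) <= 2 ** (2 ** n) for n, P in enumerate(partitions) if n >= 1)


def phi(t: Point) -> Point:
    # Hypothetical map: halves every coordinate (injective and linear).
    return tuple(0.5 * x for x in t)


# Toy index set and a nested sequence of partitions A_0, A_1, A_2 of T.
T = [(0.0, 0.0), (1.0, 0.0), (0.0, 1.0), (1.0, 1.0)]
A = [
    [frozenset(T)],
    [frozenset(T[:2]), frozenset(T[2:])],
    [frozenset([t]) for t in T],
]

# B_n consists of the images phi(A), A in A_n.
B = [image_partition(P, phi) for P in A]
print([len(P) for P in B], is_admissible(B))
```

Since each ${\cal B}_{n}$ has at most as many cells as ${\cal A}_{n}$, admissibility and nestedness are inherited directly from $({\cal A}_{n})_{n\geqslant 0}$; the substantial work in the proof lies in verifying conditions (i) and (ii), which this sketch does not attempt.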

As we have mentioned at the beginning of this proof, in order to use Theorem 3.1 in [1] we have to verify conditions (i) and (ii) for the new sequence ${\cal B}=({\cal B}_{n})_{n\geqslant 0}$ as well as $j_{n}(B),\pi_{n}(B),I_{n}(B)$ for $B\in{\cal B}_{n}$ , $n\geqslant 0$ . For this purpose we need our main condition (12). First, it is obvious that (12) implies for $p=0$ that

[TABLE]

If $A\in{\cal A}_{n}$ and $\varphi(A)\subset\varphi(A^{\prime})\in{\cal B}_{n-1}$ then either

[TABLE]

and

[TABLE]

or $j_{n}(\varphi(A))=j_{n}(A)>j_{n-1}(A^{\prime})=j_{n-1}(\varphi(A^{\prime}))$ . In this case we have $\pi_{n}(\varphi(A))=\varphi(\pi_{n}(A))\in\varphi(A^{\prime})$ and it suffices to show that

[TABLE]

Obviously, the problem now is that we know little about the structure of the set $I_{n}(\varphi(A))$ . Therefore, we simply prove that

[TABLE]

It is obvious that

[TABLE]

We can choose $C_{2}\geqslant 2CM$ in a way that by (12) we get

[TABLE]

Hence, by (15) and (17)

[TABLE]

which proves (16) with $C_{3}=C_{2}+C^{2}M$ . We have proved that assumptions required in Theorem 3.1 in [1] are satisfied for $({\cal B}_{n})_{n\geqslant 0}$ and the supporting quantities. Consequently, there exists a decomposition $S_{1},S_{2}\subset\ell^{2}$ such that $S_{1}+S_{2}\supset\varphi(T)$ and

[TABLE]

Since $j_{n}(\varphi(t))=j_{n}(t)$ and we have (14) for $({\cal A}_{n})_{n\geqslant 0}$ we obtain that

[TABLE]

It implies that

[TABLE]

for a universal constant $K$ and ends the proof.

$\blacksquare$

The second case we consider is when for all $t\in T$ supports $J(t)=\{i\geqslant 1:\;|t_{i}|>0\}$ are disjoint. The proof requires the following notation. For any $t\in\ell^{2}$ and $J\subset\{1,2,\ldots\}$ we define $t1_{J}\in\ell^{2}$ such that $(t1_{J})_{i}=t_{i}$ for $i\in J$ and $(t1_{J})_{i}=0$ otherwise.
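The notation $J(t)$ and $t1_{J}$ can be made concrete for finitely supported vectors. The following is a minimal sketch, assuming vectors are stored as dictionaries from coordinate index to value; the names `support`, `restrict` and `pairwise_disjoint_supports` are hypothetical helpers introduced only for this illustration.

```python
from typing import Dict, Iterable, Set

Vector = Dict[int, float]  # finitely supported vector: index i -> t_i


def support(t: Vector) -> Set[int]:
    """J(t) = {i >= 1 : |t_i| > 0}."""
    return {i for i, v in t.items() if abs(v) > 0}


def restrict(t: Vector, J: Set[int]) -> Vector:
    """t 1_J: keep the coordinates of t with index in J, set the others to zero."""
    return {i: v for i, v in t.items() if i in J and v != 0}


def pairwise_disjoint_supports(T: Iterable[Vector]) -> bool:
    """Check that the supports J(t), t in T, are pairwise disjoint."""
    seen: Set[int] = set()
    for t in T:
        J = support(t)
        if seen & J:
            return False
        seen |= J
    return True


# Toy set T with pairwise disjoint supports.
T = [{1: 0.5, 2: -2.0}, {3: 1.0}, {4: 0.25, 7: 3.0}]
print(pairwise_disjoint_supports(T))   # all supports are disjoint
print(restrict(T[0], {2, 5}))          # only coordinate 2 survives
```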

Proof of Theorem 4. Obviously, we may require that $b(T)<\infty$ . We additionally assume that $0\in T$ . This simplifies the proof, which also works in the general case, as we will point out at the end. Recall that by the Bernoulli Theorem [1] there exists a decomposition $T_{1}+T_{2}\supset T$ such that

[TABLE]

where $K$ is an absolute constant. Obviously, we may think of $K$ as suitably large. We can represent the decomposition by $\pi:T\rightarrow\ell^{2}$ in a way that $T_{2}=\{\pi(t):\;t\in T\}$ and $T_{1}=\{t-\pi(t):\;t\in T\}$ . We show that under the disjoint supports assumption we may additionally require that $\pi(t)=t1_{J^{2}(t)}$ and $t-\pi(t)=t1_{J^{1}(t)}$ where $J^{1}(t)$ and $J^{2}(t)$ are disjoint subsets of $J(t)$ such that $J^{1}(t)\cup J^{2}(t)=J(t)$ . Moreover, $J^{2}(t)=\{i\in J(t):\;|t_{i}|\leqslant p(t)\}$ , for some suitably chosen $p(t)\geqslant 0$ .
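The threshold form of the decomposition claimed above, $\pi(t)=t1_{J^{2}(t)}$ with $J^{2}(t)=\{i\in J(t):\;|t_{i}|\leqslant p(t)\}$ , can be sketched for a single finitely supported vector. This is only an illustration under assumptions: the threshold `p` is supplied by hand rather than chosen as in the proof, and measuring the large part in $\ell^{1}$ and the small part in $\ell^{2}$ is our choice for the example.

```python
import math
from typing import Dict, Tuple

Vector = Dict[int, float]  # finitely supported vector: index i -> t_i


def threshold_split(t: Vector, p: float) -> Tuple[Vector, Vector]:
    """Return (t 1_{J^1}, t 1_{J^2}) where J^2 = {i in J(t) : |t_i| <= p}
    and J^1 = J(t) \\ J^2 collects the coordinates with |t_i| > p."""
    large = {i: v for i, v in t.items() if abs(v) > p}
    small = {i: v for i, v in t.items() if 0 < abs(v) <= p}
    return large, small


def l1_norm(t: Vector) -> float:
    return sum(abs(v) for v in t.values())


def l2_norm(t: Vector) -> float:
    return math.sqrt(sum(v * v for v in t.values()))


# Hypothetical vector and threshold.
t = {1: 3.0, 2: -0.5, 3: 0.25, 4: -2.0}
large, small = threshold_split(t, p=1.0)

# The two parts have disjoint supports and add up to t.
print(large, l1_norm(large))
print(small, l2_norm(small))
```

Because the two parts live on disjoint coordinate sets, $t=t1_{J^{1}(t)}+t1_{J^{2}(t)}$ holds coordinate by coordinate, which is the structural feature the proof exploits.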

In order to prove the result we have to look closer into the definition of $\pi(t)$ in the proof of Theorem 3.1 in [1]. The definition is based on the construction of admissible partitions we have described in the proof of Theorem 3 above. Using the notation introduced there let

[TABLE]

Note that $S_{B}(T)$ is comparable with $\sup_{t\in T}\sum_{n\geqslant 0}2^{n}r^{-j_{n}(t)}$ . Therefore, if $S_{B}(T)$ is finite then necessarily $\lim_{n\rightarrow\infty}j_{n}(t)=\infty$ for all $t\in T$ . From the partition construction used in Section 6 in [1] we know that we can additionally assume a regularity condition on $j_{n}(t)$ , $n\geqslant 0$ , namely

[TABLE]

and for technical purpose we take $j_{-1}(t)=-\infty$ . As in the proof of Theorem 3.1 in [1] the Bernoulli decomposition $\pi(t)$ is given by $\pi(t)_{i}=\pi_{m(t,i)}(t)_{i}$ , where if $m(t,i)=\infty$ the definition means that $\pi(t)_{i}=\lim_{n\rightarrow\infty}\pi_{n}(t)_{i}$ and the limit exists. Consequently, denoting $J_{n}(t)=\{i\geqslant 1:\;m(t,i)=n\}$ and $J_{\infty}(t)=\{i\geqslant 1:\;m(t,i)=\infty\}$ we get

[TABLE]

Clearly, $J_{n}(t)$ , $n\geqslant 0$ and $J_{\infty}(t)$ are disjoint. Note also that if $m(t,i)=\infty$ and $i\in J(\pi(t))$ , then there must exist $n\geqslant 0$ such that $|\pi_{k}(t)_{i}|>0$ for all $k\geqslant n$ . Due to the disjoint supports assumption it is only possible if there exists $n\geqslant 0$ such that $\pi_{n}(t)_{i}=\pi_{n+1}(t)_{i}=\ldots$ . Now, if there exists $m\geqslant 0$ such that $A_{m}(t)=\{t\}$ we define

[TABLE]

The moment $\tau(t)$ is of a special nature in the sense that without loss of generality we may assume that for $n\geqslant\tau(t)$ it is true that $j_{n}(t)=j_{n-1}(t)+2$ . This is due to the fact that the partitioning stops after this moment. Now, we define

[TABLE]

We can now introduce the improved version of $\pi$ denoted by $\bar{\pi}$ and given by

[TABLE]

It is clear that

[TABLE]

For $n\geqslant 0$ let

[TABLE]

Observe that $J^{1}(t)=\bigcup_{n<\tau(t)}L_{n}(t)$ . If $i\in L_{n}(t)$ , $n\geqslant 0$ , then we may find $0\leqslant m\leqslant n$ such that $j_{m-1}(t)<j_{m}(t)=j_{m+1}(t)=\ldots=j_{n}(t)$ . Consequently, using the definition (13) of $I_{n}(t)$ for all $s\in A_{m}(t)$

[TABLE]

We need to show that the decomposition $\bar{\pi}$ is of the right form i.e. satisfies (18). For this aim we need to investigate a few cases following from different possible paths of approximations $\pi$ . First suppose that $t\neq\pi_{n}(t)$ . Then we may use the above inequality for $s=t$ and due to the disjoint supports we have

[TABLE]

The same inequality holds if $t=\pi_{n}(t)$ but $A_{m}(t)\neq\{t\}$ . We show that $L_{n}(t)\subset I_{n}(t)$ . Indeed, suppose that $i\not\in I_{n}(t)$ . It means that for some $k\in\{0,1,\ldots,n-1\}$ we have $|\pi_{k+1}(t)_{i}-\pi_{k}(t)_{i}|>r^{-j_{k}(t)}$ . This may concern $i\in J(t)$ only if $\pi_{k+1}(t)=t,\pi_{k}(t)\neq t$ or $\pi_{k}(t)=t$ and $\pi_{k+1}(t)\neq t$ , but then it means that $|t_{i}|>r^{-j_{k}(t)}\geqslant r^{-j_{n-1}(t)}$ i.e. $i\not\in L_{n}(t)$ . It concludes the argument that $L_{n}(t)\subset I_{n}(t)$ . For $1\leqslant n<\tau(t)$ it implies that

[TABLE]

For $n=0$ we use simply that $|t_{i}|\leqslant 2S_{B}(T)$ and hence

[TABLE]

Now suppose that $t=\pi_{n}(t)=\pi_{m}(t)$ and $A_{m}(t)=\{t\}$ . If either $t\neq\pi_{m-1}(t)$ or $\{t\}\neq A_{m-1}(t)$ , then $\tau(t)=m$ . Otherwise $\tau(t)<m$ . If $\tau(t)=m$ , then by the above argument

[TABLE]

and thus using that $|t_{i}|\geqslant r^{-j_{m}(t)-1}$ and $j_{m}(t)=j_{m-1}(t)+2$ , we have

[TABLE]

We have the remaining bound

[TABLE]

Combining (20), (21) and (22) we conclude by (14)

[TABLE]

where $L$ is an absolute constant.

Now consider $s,t\in T$ , $s\neq t$ . In order to prove that

[TABLE]

we have to argue that $J^{2}(t)\cap J(\pi(s))=\emptyset$ , $J^{2}(s)\cap J(\pi(t))=\emptyset$ for all $n\geqslant 0$ . Note that $J^{2}(t)\subset J_{\infty}(t)$ and $J^{2}(s)\subset J_{\infty}(s)$ . Moreover, $J_{\infty}(t)$ and $J_{\infty}(s)$ are disjoint. Obviously, by symmetry it suffices to show that $J^{2}(t)\cap J(\pi(s))=\emptyset$ .

First, note that $J^{2}(t)\cap J_{\infty}(s)=\emptyset$ . Indeed if the set was non-empty then for a given $n\geqslant 0$ we would have $t=\pi_{n}(s)=\pi_{n+1}(s)=\ldots$ , but then $s\in A_{n}(t)$ for all $n\geqslant 0$ and therefore $\tau(t)=\infty$ . This would imply $J^{2}(t)=\emptyset$ which is a contradiction. Suppose that $i\in J^{2}(t)$ and $i\in J_{n}(s)$ . This is only possible if $\pi_{n}(s)=t$ and $\pi_{n+1}(s)\neq\pi_{n}(s)=t$ and $r^{-j_{n}(s)}<|\pi_{n}(s)_{i}|$ . Let $m\geqslant 0$ be such that $j_{m-1}(s)<j_{m}(s)=j_{m+1}(s)=\ldots=j_{n}(s)$ , then either $m=0$ or $m\geqslant 1$ and $t=\pi_{n}(s)=\pi_{m}(s)\in A_{m-1}(s)$ , which means that $A_{m-1}(s)=A_{m-1}(t)$ and $j_{m-1}(s)=j_{m-1}(t)$ . Therefore, $\tau(t)\geqslant m$ and $j_{\tau(t)}(t)>j_{m-1}(t)$ . If $i\in J^{2}(t)\cap J_{n}(s)$ , then

[TABLE]

which is a contradiction. If $m=0$ , then the argument is trivial.

Summing up, by (23) we have

[TABLE]

and by (24) and the Gaussian comparison we have $\gamma_{G}(\pi(T))\leqslant\gamma_{G}(\bar{\pi}(T))$ , which means that our improved version of $\pi$ satisfies

[TABLE]

where $K$ is a universal constant. In this way we have proved that we may additionally require that $\pi(t)=t1_{J^{2}(t)}$ and $t-\pi(t)=t1_{J^{1}(t)}$ for some disjoint $J^{1}(t),J^{2}(t)$ such that $J^{1}(t)\cup J^{2}(t)=J(t)$ . Recall that $J^{2}(t)$ in each case is of the form $\{i\in J(t):\;|t_{i}|\leqslant r(t)\}$ , for a given $r(t)\geqslant 0$ .

We turn to the main part of the proof. Let $p(t)$ be the smallest positive integer such that

[TABLE]

Note that it is possible that $J^{2}(t)=\emptyset$ , in which case we may think of $p(t)$ as equal to $\infty$ . Since $K$ is large enough and $S_{B}(T)\geqslant\frac{1}{2}\sup_{t\in T}\|t\|_{2}$ it is clear that $p(t)$ must be greater than, say, $2$ . Consequently, by the choice of $p(t)$

[TABLE]

The last step is to define a suitable decomposition for $\varphi(T)$ . For each $t\in T$ we define $\pi(\varphi(t))=t_{J^{2}(\varphi(t))}$ and $\varphi(t)-\pi(\varphi(t))=t_{J^{1}(\varphi(t))}$ , where $J^{2}(\varphi(t))$ and $J^{1}(\varphi(t))$ are defined by the decomposition of the norm $\|B_{\varphi(t)}\|_{p(t)}$ i.e.

[TABLE]

and

[TABLE]

Consequently by the decomposition (4) and the main assumption (11),

[TABLE]

Therefore, using (25), (26)

[TABLE]

Moreover, by (25)

[TABLE]

It implies that

[TABLE]

Therefore, by the Gaussian comparison, we get $\gamma_{G}(\pi(\varphi(T)))\leqslant K\gamma_{G}(\pi(T))$ and hence finally

[TABLE]

This ends the proof in the case when $0\in T$ . For the general case the proof follows the same lines, with $t-\pi_{0}(t)$ in place of $t$ . Notice that formally this may not obey the disjoint supports assumption, but it does not qualitatively affect the argument presented above.

$\blacksquare$

Note that the above proof works since in the case of disjoint supports we have almost perfect knowledge about the decomposition in Bernoulli Theorem. On the other hand, it is not difficult to give an alternative proof based on the independence of variables $B_{t}$ , $t\in T$ , but it is worth seeing what the decomposition in Theorem 3.1 in [1] should be in order to make Bernoulli comparison possible.

5 The Oleszkiewicz problem

In this section we give an example of how to apply our result to compare expectations of norms of random series in a Banach space. First, we prove a general result which concerns $\varphi:T\rightarrow\ell^{2}$ where $\varphi$ is linear, $T$ is convex and $T=-T$ . Then, the assumption (8) becomes

[TABLE]

where $\mathrm{Lin}(T)$ is the linear space spanned by the set $T$ . This is because, by the assumptions on $T$ , any point $u\in\mathrm{Lin}(T)$ can be represented as $c\cdot t$ , where $c\in{\mathbb{R}}$ and $t\in T$ . By the linearity of $\varphi$

[TABLE]

On the other hand, we can easily extend the condition (27) to the closure of $\mathrm{Lin}(T)$ . We now prove that if $\mathrm{cl}(\mathrm{Lin}(T))=\ell^{2}$ then (27) implies that $S_{B}(T)$ dominates $S_{B}(\varphi(T))$ .

Theorem 5

Suppose that $T=-T$ , $T$ is convex and $\mathrm{cl}(\mathrm{Lin}(T))=\ell^{2}$ . If $\varphi$ is linear and satisfies (8), then $S_{B}(\varphi(T))\leqslant KS_{B}(T)$ , where $K$ is a universal constant.

Proof. By the Bernoulli theorem [1] we have that there exist $T_{1},T_{2}$ such that $T\subset T_{1}+T_{2}$ and

[TABLE]

Since $\varphi$ is linear it can be easily extended to $\mathrm{cl}(\mathrm{Lin}(T))=\ell^{2}$ and thus we can define $S_{i}=\varphi(T_{i})$ , $i\in\{1,2\}$ . Obviously $S_{1}+S_{2}\supset\varphi(T)$ . Moreover, (27) implies in particular that

[TABLE]

and

[TABLE]

Consequently

[TABLE]

and

[TABLE]

Therefore

[TABLE]

It ends the proof.

$\blacksquare$

We aim to study the question posed by Oleszkiewicz that concerns comparability of weak and strong moments for Bernoulli series in a Banach space. Let $x_{i}$ , $y_{i}$ , $i\geqslant 1$ be vectors in a Banach space $(B,\|\cdot\|)$ . Suppose that for all $x^{\ast}\in B^{\ast}$ and $u\geqslant 0$

[TABLE]

This property is called weak tail domination. As we have explained in the introduction the weak tail domination can be understood in terms of comparability of weak moments, i.e. for any integer $p\geqslant 1$ and $x^{\ast}\in B^{\ast}$

[TABLE]

Oleszkiewicz asked whether or not it implies the comparability of strong moments. Namely whether (28) or rather (29) implies that

[TABLE]

where $K$ is an absolute constant. Note that in the Oleszkiewicz problem one may assume that $B$ is a separable space since we can easily restrict $B$ to the closure of $\mathrm{Lin}(y_{1},x_{1},y_{2},x_{2},\ldots)$ . Therefore we have that

[TABLE]

where the supremum is taken over all finite sets $F$ contained in $B^{\ast}_{1}=\{x^{\ast}\in B^{\ast}:\;\|x^{\ast}\|\leqslant 1\}$ . We may assume that $\mathbf{E}\|\sum_{i\geqslant 1}y_{i}\varepsilon_{i}\|<\infty$ since otherwise there is nothing to prove. Consequently, for each $x^{\ast}\in B^{\ast}$ the series $\sum_{i\geqslant 1}x^{\ast}(y_{i})\varepsilon_{i}$ is convergent, which is equivalent to $\sum_{i\geqslant 1}(x^{\ast}(y_{i}))^{2}<\infty$ . Let $Q:B^{\ast}\rightarrow\ell^{2}$ be defined by $Q(x^{\ast})=(x^{\ast}(y_{i}))_{i\geqslant 1}$ . It is clear that $Q:B^{\ast}/\ker Q\rightarrow\ell^{2}$ is a linear isomorphism on the closed linear subspace of $\ell^{2}$ . We apply Theorem 5 to get the following result.

Corollary 3

Suppose that $Q$ is onto $\ell^{2}$ . Then (28) implies (30).

Unfortunately, if $Q$ is not onto $\ell^{2}$ then the above argument fails. Still, it is believed that the comparison holds. A partial result can be deduced from Theorem 3, namely:

Corollary 4

Suppose that for each $x^{\ast}\in B^{\ast}$ and $p\geqslant 0$

[TABLE]

Then (30) holds, i.e.

[TABLE]

Proof. It suffices to notice that (31) implies (12) and then apply Theorem 3.

$\blacksquare$
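For a finite number of vectors in a finite-dimensional space, the weak and strong moments compared in this section can be computed exactly by enumerating all sign patterns. The following is a minimal sketch, not part of the argument: the space ${\mathbb{R}}^{2}$ with the maximum norm, the vectors `xs` and `ys`, and the helper names are hypothetical choices, and the enumeration is feasible only for a small number of summands.

```python
from itertools import product
from typing import Callable, List, Sequence

Vec = Sequence[float]


def Q(x_star: Vec, vectors: List[Vec]) -> List[float]:
    """Coefficient sequence (x*(v_i))_i for a functional given by a coordinate vector."""
    return [sum(a * b for a, b in zip(x_star, v)) for v in vectors]


def weak_moment(coeffs: Sequence[float], p: float) -> float:
    """(E |sum_i a_i eps_i|^p)^(1/p), averaged exactly over all 2^n sign patterns."""
    n = len(coeffs)
    total = sum(
        abs(sum(a * e for a, e in zip(coeffs, eps))) ** p
        for eps in product((-1.0, 1.0), repeat=n)
    )
    return (total / 2 ** n) ** (1.0 / p)


def strong_moment(vectors: List[Vec], norm: Callable[[Vec], float]) -> float:
    """E || sum_i v_i eps_i ||, averaged exactly over all 2^n sign patterns."""
    n, d = len(vectors), len(vectors[0])
    total = 0.0
    for eps in product((-1.0, 1.0), repeat=n):
        s = [sum(e * v[k] for e, v in zip(eps, vectors)) for k in range(d)]
        total += norm(s)
    return total / 2 ** n


def max_norm(v: Vec) -> float:
    return max(abs(c) for c in v)


# Hypothetical example in (R^2, max norm): x_i = y_i / 2.
ys = [(1.0, 0.0), (0.0, 1.0)]
xs = [(0.5, 0.0), (0.0, 0.5)]

print(strong_moment(xs, max_norm), strong_moment(ys, max_norm))
print(weak_moment(Q((1.0, 1.0), xs), 2), weak_moment(Q((1.0, 1.0), ys), 2))
```

In this toy case every weak and strong moment of the $x_{i}$ series is exactly half of the corresponding moment of the $y_{i}$ series, so both comparisons hold trivially; the example is meant only to fix the objects being compared, not to test the conjecture.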

Acknowledgments

We would like to thank Prof. Kwapień for comments on the shape of this paper and helpful discussion about Theorem 1.

Bibliography


[1] Bednorz, W and Latała, R: On the boundedness of Bernoulli processes, Ann. Math., 180, (2014), 1167–1203.
[2] Fernique, X: Régularité des trajectoires des fonctions aléatoires gaussiennes. (French) École d’Été de Probabilités de Saint-Flour, IV-1974. Lecture Notes in Math. 480, (1975), 1–96, Springer, Berlin.
[3] Garling, D. J. H.: Inequalities: A Journey into Linear Analysis, (2007), Cambridge University Press, Cambridge.
[4] Hitczenko, P: Domination inequality for martingale transforms of a Rademacher sequence, Israel J. Math., 84, (1993), 161–178.
[5] Hitczenko, P and Kwapień, S: On the Rademacher Series. In: Hoffmann-Jørgensen J., Kuelbs J., Marcus M.B. (eds) Probability in Banach Spaces, 9. Progress in Probability, 35. (1994) Birkhäuser, Boston, MA.
[6] Latała, R: Sudakov minoration principle and supremum of some processes. Geom. Funct. Anal., 7(5), (1997), 936–953.
[7] Latała, R: Moments of unconditional logarithmically concave vectors, in Geometric Aspects of Functional Analysis, Israel Seminar 2006-2010, Lecture Notes in Math. 2050, (2012), 301–315, Springer.
[8] Latała, R and Tkocz, T: A note on suprema of canonical processes based on random variables with regular moments. Electron. J. Probab., 20(36), (2015), 1–17.