Subgaussianity is hereditarily determined

Pandelis Dodos; Konstantinos Tyros

arXiv:1902.05297·math.PR·January 29, 2021

Subgaussianity is hereditarily determined

Pandelis Dodos, Konstantinos Tyros

PDF

TL;DR

This paper demonstrates that the subgaussian properties of linear combinations of bounded random vectors are fundamentally linked to the subgaussian behavior of their random subsets, revealing a hereditary structure.

Contribution

It establishes that subgaussianity of a linear combination is essentially determined by the subgaussianity of its random subsets, highlighting a hereditary property.

Findings

01

Subgaussianity is hereditarily determined by random subsets.

02

The behavior of the entire sum is linked to its parts.

03

Subgaussian properties can be inferred from subset behaviors.

Abstract

Let $n$ be a positive integer, let $X = (X_{1}, \dots, X_{n})$ be a random vector in $R^{n}$ with bounded entries, and let $(θ_{1}, \dots, θ_{n})$ be a vector in $R^{n}$ . We show that the subgaussian behavior of the random variable $θ_{1} X_{1} + \dots + θ_{n} X_{n}$ is essentially determined by the subgaussian behavior of the random variables $\sum_{i \in H} θ_{i} X_{i}$ where $H$ is a random subset of ${1, \dots, n}$ .

Equations220

\|X\|_{\psi_{2}}\coloneqq\inf\big{\{}s>0:\mathbb{E}\big{[}e^{(X/s)^{2}}\big{]}\leqslant 2\big{\}}

\|X\|_{\psi_{2}}\coloneqq\inf\big{\{}s>0:\mathbb{E}\big{[}e^{(X/s)^{2}}\big{]}\leqslant 2\big{\}}

∥ ⟨ θ, X ⟩ ∥_{ψ_{2}} ⩽ K ∥ θ ∥_{2}

∥ ⟨ θ, X ⟩ ∥_{ψ_{2}} ⩽ K ∥ θ ∥_{2}

⟨ θ, X ⟩ = i = 1 \sum n θ_{i} X_{i}

⟨ θ, X ⟩ = i = 1 \sum n θ_{i} X_{i}

θ_{H} = (θ_{1}^{'}, \dots, θ_{n}^{'}) : = {θ_{i}^{'} = θ_{i} θ_{i}^{'} = 0 if i \in H, otherwise .

θ_{H} = (θ_{1}^{'}, \dots, θ_{n}^{'}) : = {θ_{i}^{'} = θ_{i} θ_{i}^{'} = 0 if i \in H, otherwise .

\mu_{p}\big{(}\{H:\bm{X}\text{ is $C$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant p-o_{C\to\infty;K,p}(1).

\mu_{p}\big{(}\{H:\bm{X}\text{ is $C$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant p-o_{C\to\infty;K,p}(1).

\mu_{p}\big{(}\{H:\bm{X}\text{ is $K$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant\gamma,

\mu_{p}\big{(}\{H:\bm{X}\text{ is $K$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant\gamma,

μ_{p} ({H}) = p^{∣ H ∣} (1 - p)^{n - ∣ H ∣}

μ_{p} ({H}) = p^{∣ H ∣} (1 - p)^{n - ∣ H ∣}

\mu_{p}\Big{(}\Big{\{}H:\Big{|}\sum_{i\in H}c_{i}-p\sum_{i=1}^{n}c_{i}\Big{|}\geqslant t\Big{\}}\Big{)}\leqslant 2\exp\Big{(}-\frac{2t^{2}}{\|\bm{c}\|_{2}^{2}}\Big{)}.

\mu_{p}\Big{(}\Big{\{}H:\Big{|}\sum_{i\in H}c_{i}-p\sum_{i=1}^{n}c_{i}\Big{|}\geqslant t\Big{\}}\Big{)}\leqslant 2\exp\Big{(}-\frac{2t^{2}}{\|\bm{c}\|_{2}^{2}}\Big{)}.

∣ f (H \cup {i}) - f (H) ∣ ⩽ c_{i} .

∣ f (H \cup {i}) - f (H) ∣ ⩽ c_{i} .

\mu_{p}\big{(}\big{\{}H:|f(H)-M|\geqslant t\big{\}}\big{)}\leqslant 2\exp\Big{(}-\frac{2t^{2}}{\|\bm{c}\|^{2}_{2}}\Big{)}.

\mu_{p}\big{(}\big{\{}H:|f(H)-M|\geqslant t\big{\}}\big{)}\leqslant 2\exp\Big{(}-\frac{2t^{2}}{\|\bm{c}\|^{2}_{2}}\Big{)}.

C=C(K,p,\gamma)\coloneqq p^{-1}\,\big{(}K+\sqrt{\ln(2/\gamma)}\big{)}.

C=C(K,p,\gamma)\coloneqq p^{-1}\,\big{(}K+\sqrt{\ln(2/\gamma)}\big{)}.

\mu_{p}\big{(}\{H:\bm{X}\text{ is $K$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant\gamma,

\mu_{p}\big{(}\{H:\bm{X}\text{ is $K$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant\gamma,

⟨ θ, X ⟩ = p^{- 1} H \subseteq [n] \sum μ_{p} ({H}) ⟨ θ_{H}, X ⟩ .

⟨ θ, X ⟩ = p^{- 1} H \subseteq [n] \sum μ_{p} ({H}) ⟨ θ_{H}, X ⟩ .

∥ ⟨ θ, X ⟩ ∥_{ψ_{2}} ⩽ p^{- 1} H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}} .

∥ ⟨ θ, X ⟩ ∥_{ψ_{2}} ⩽ p^{- 1} H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}} .

\sum_{H\subseteq[n]}\mu_{p}(\{H\})\,\langle\bm{\theta}_{H},\bm{X}\rangle=\sum_{i=1}^{n}\theta_{i}X_{i}\Big{(}\sum_{i\in H\subseteq[n]}\mu_{p}(\{H\})\Big{)}=p\,\langle\bm{\theta},\bm{X}\rangle.

\sum_{H\subseteq[n]}\mu_{p}(\{H\})\,\langle\bm{\theta}_{H},\bm{X}\rangle=\sum_{i=1}^{n}\theta_{i}X_{i}\Big{(}\sum_{i\in H\subseteq[n]}\mu_{p}(\{H\})\Big{)}=p\,\langle\bm{\theta},\bm{X}\rangle.

M : = H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}},

M : = H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}},

\mu_{p}\big{(}\big{\{}H:\big{|}\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}-M\big{|}\geqslant t\big{\}}\big{)}\leqslant 2\exp\Big{(}-\frac{2t^{2}}{\|\bm{\theta}\|^{2}_{2}}\Big{)}.

\mu_{p}\big{(}\big{\{}H:\big{|}\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}-M\big{|}\geqslant t\big{\}}\big{)}\leqslant 2\exp\Big{(}-\frac{2t^{2}}{\|\bm{\theta}\|^{2}_{2}}\Big{)}.

\big{|}\|\langle\bm{\theta}_{H\cup\{i\}},\bm{X}\rangle\|_{\psi_{2}}-\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\big{|}\leqslant\|\theta_{i}X_{i}\|_{\psi_{2}}\leqslant\theta_{i}.

\big{|}\|\langle\bm{\theta}_{H\cup\{i\}},\bm{X}\rangle\|_{\psi_{2}}-\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\big{|}\leqslant\|\theta_{i}X_{i}\|_{\psi_{2}}\leqslant\theta_{i}.

\mu_{p}\big{(}\big{\{}H:\big{|}\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}-M\big{|}\geqslant t_{0}\big{\}}\big{)}\leqslant\frac{\gamma}{2}.

\mu_{p}\big{(}\big{\{}H:\big{|}\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}-M\big{|}\geqslant t_{0}\big{\}}\big{)}\leqslant\frac{\gamma}{2}.

C=C(K,p,\eta)\coloneqq 18\,\frac{(K+1)}{p}\,\log_{2}\Big{(}\frac{4}{\eta}\Big{)}.

C=C(K,p,\eta)\coloneqq 18\,\frac{(K+1)}{p}\,\log_{2}\Big{(}\frac{4}{\eta}\Big{)}.

\mu_{p}\big{(}\{H:\bm{X}\text{ is $C$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant p-\eta.

\mu_{p}\big{(}\{H:\bm{X}\text{ is $C$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\geqslant p-\eta.

θ : = (n, n - times 1, \dots, 1) \in R^{n + 1} .

θ : = (n, n - times 1, \dots, 1) \in R^{n + 1} .

H : = {H \subseteq [n + 1] : 1 \in / H and ∣ H \cap {2, \dots, n + 1} ∣ ⩾ p n /2} .

H : = {H \subseteq [n + 1] : 1 \in / H and ∣ H \cap {2, \dots, n + 1} ∣ ⩾ p n /2} .

\mu_{p}\big{(}\{H:\bm{X}\text{ is $C$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\leqslant p+o_{n\to\infty;p,C}(1).

\mu_{p}\big{(}\{H:\bm{X}\text{ is $C$-subgaussian at the direction }\bm{\theta}_{H}\}\big{)}\leqslant p+o_{n\to\infty;p,C}(1).

\mu_{p}\big{(}\big{\{}H:\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\geqslant\lambda K\|\bm{\theta}\|_{2}\big{\}}\big{)}\leqslant 3\exp\Big{(}-\frac{\ln 2}{32}\lambda^{2}\Big{)}.

\mu_{p}\big{(}\big{\{}H:\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\geqslant\lambda K\|\bm{\theta}\|_{2}\big{\}}\big{)}\leqslant 3\exp\Big{(}-\frac{\ln 2}{32}\lambda^{2}\Big{)}.

H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}} ⩽ 12 K ∥ θ ∥_{2} .

H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}} ⩽ 12 K ∥ θ ∥_{2} .

H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}} ⩽ 12 ∥ ⟨ θ, X ⟩ ∥_{ψ_{2}} .

H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}} ⩽ 12 ∥ ⟨ θ, X ⟩ ∥_{ψ_{2}} .

H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}}

H \sim μ_{p} E ∥ ⟨ θ_{H}, X ⟩ ∥_{ψ_{2}}

\mu_{p}\big{(}\big{\{}H:\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\geqslant(12+\lambda)K\|\bm{\theta}\|_{2}\big{\}}\big{)}\leqslant 2\exp(-2\ln 2\,\lambda^{2}K^{2}).

\mu_{p}\big{(}\big{\{}H:\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\geqslant(12+\lambda)K\|\bm{\theta}\|_{2}\big{\}}\big{)}\leqslant 2\exp(-2\ln 2\,\lambda^{2}K^{2}).

\mu_{p}\Big{(}\Big{\{}H:\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\leqslant\sqrt{\frac{3}{\ln 2}}\,M\|\bm{\theta}\|_{2}\Big{\}}\Big{)}\geqslant 1-3\exp\Big{(}-\frac{M^{2}}{2Q^{2}}\Big{)}.

\mu_{p}\Big{(}\Big{\{}H:\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\leqslant\sqrt{\frac{3}{\ln 2}}\,M\|\bm{\theta}\|_{2}\Big{\}}\Big{)}\geqslant 1-3\exp\Big{(}-\frac{M^{2}}{2Q^{2}}\Big{)}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Subgaussianity is hereditarily determined

Pandelis Dodos and Konstantinos Tyros

Department of Mathematics, University of Athens, Panepistimiopolis 157 84, Athens, Greece

[email protected]

Department of Mathematics, University of Athens, Panepistimiopolis 157 84, Athens, Greece

[email protected]

Abstract.

Let $n$ be a positive integer, let $\bm{X}=(X_{1},\dots,X_{n})$ be a random vector in $\mathbb{R}^{n}$ with bounded entries, and let $(\theta_{1},\dots,\theta_{n})$ be a vector in $\mathbb{R}^{n}$ . We show that the subgaussian behavior of the random variable $\theta_{1}X_{1}+\dots+\theta_{n}X_{n}$ is essentially determined by the subgaussian behavior of the random variables $\sum_{i\in H}\theta_{i}X_{i}$ where $H$ is a random subset of $\{1,\dots,n\}$ .

2010 Mathematics Subject Classification: 60E15, 60G99.

Key words: subgaussian random variable, subgaussian random vector, subvector.

1. Introduction

1.1. Subgaussianity

Recall that a real-valued random variable $X$ is called subgaussian if its tails are dominated by (that is, they decay at least as fast as) the tails of a gaussian. One of the several equivalent ways to quantify this property is using the Orlicz norm for the function $\psi_{2}(x)=e^{x^{2}}-1$ . Specifically, the random variable $X$ is subgaussian if its Orlicz norm

[TABLE]

is finite.

Next, let $n$ be a positive integer, and let $\bm{X}=(X_{1},\dots,X_{n})$ be a random vector in $\mathbb{R}^{n}$ , that is, $\bm{X}$ is a finite sequence of real-valued random variables defined on a common probability space. Also let $K>0$ and $\bm{\theta}=(\theta_{1},\dots,\theta_{n})\in\mathbb{R}^{n}$ , and recall that the random vector $\bm{X}$ is said to be $K$ -subgaussian at the direction $\bm{\theta}$ provided that

[TABLE]

where

[TABLE]

is the inner product of $\bm{\theta}$ and $\bm{X}$ , and $\|\bm{\theta}\|_{2}=(\theta_{1}^{2}+\dots+\theta_{n}^{2})^{1/2}$ is the euclidean norm of the vector $\bm{\theta}$ .

1.2. The problem

Let $\bm{X}=(X_{1},\dots,X_{n})$ be a random vector with $[-1,1]\text{-valued}$ entries, and fix $\bm{\theta}=(\theta_{1},\dots,\theta_{n})\in\mathbb{R}^{n}$ . For every subset $H$ of $[n]\coloneqq\{1,\dots,n\}$ let $\bm{\theta}_{H}\in\mathbb{R}^{n}$ denote the vector defined by

[TABLE]

In this paper we address the question whether the subgaussian behavior of the random vector $\bm{X}$ at the direction $\bm{\theta}$ is reflected to (and, conversely, whether it is characterized by) the typical subgaussian behavior of $\bm{X}$ at the direction $\bm{\theta}_{H}$ where $H$ is a random subset of $[n]$ distributed according to the uniform probability measure on $\{0,1\}^{n}$ or, more generally, according to the $p$ -biased measure111The $p$ -biased measure $\mu_{p}$ is defined by $\mu_{p}(\{H\})=p^{|H|}(1-p)^{n-|H|}$ for every $H\subseteq[n]$ . (Here, and in the rest of this paper, we identify every $H\subseteq[n]$ with its indicator function $\mathbf{1}_{H}\in\{0,1\}^{n}$ .) $\mu_{p}$ ( $0<p<1$ ).

This question was motivated by a problem in density Ramsey theory; see Subsection 5.2 for more details. Related questions—though of a somewhat different nature—have been studied in high-dimensional probability and asymptotic convex geometry (see, e.g., [BN]), as well as in the study of thin sets in harmonic analysis (see [Pi]). It is important to note that the main point in our approach lies in the fact that, apart from the boundedness condition on $\bm{X}$ , we make no further assumptions on the distributions of the random variables $X_{1},\dots,X_{n}$ and on their correlation. (This level of generality is actually necessary for certain applications in combinatorics.)

1.3. Examples

At this point it is useful to give examples of bounded random vectors which are subgaussian at a given direction. For concreteness we will restrict our discussion to the direction $\bm{\sigma}=(1,\dots,1)\in\mathbb{R}^{n}$ , but corresponding examples can be given for any other direction.

Undoubtedly, the most important examples are random vectors with independent entries and, more generally, random vectors which are bounded martingale difference sequences. Another interesting class of examples consists of Sidon sets of characters in a compact abelian group $G$ . (Here, we view $G$ as a probability space equipped with the Haar probability measure, and we view every character as a complex-valued random variable on $G$ ; see [Pi] for details). Note, however, that all these examples are subgaussian at every direction.

A different—but quite relevant—example is a random vector whose entries exhibit high cancellation. More precisely, fix a $[-1,1]$ -valued random variable $Z$ . Assume for simplicity that $n$ is even, say $n=2k$ , and fix a subset $T$ of $[n]$ with $|T|=k$ . We define $\bm{X}=(X_{1},\dots,X_{n})$ by setting $X_{i}=Z$ if $i\in T$ , and $X_{i}=-Z$ if $i\notin T$ . Notice that $\langle\bm{\sigma},\bm{X}\rangle=0$ , and so $\bm{X}$ is $K$ -subgaussian at the direction $\bm{\sigma}$ for any $K>0$ . On the other hand, observe that $\langle\bm{\sigma}_{T},\bm{X}\rangle=(n/2)Z$ ; consequently, if $\bm{X}$ is $K$ -subgaussian at the direction $\bm{\sigma}_{T}$ , then $K\geqslant(\|Z\|_{\psi_{2}}/\sqrt{2})\,n^{1/2}$ . Nevertheless, it is easy to see that we may select, with positive probability, a subset $H$ of $[n]$ such that $\bm{X}$ is $O(1)$ -subgaussian at the direction $\bm{\sigma}_{H}$ .

All the above examples can be combined together by taking convex combinations. Precisely, let $J$ be a nonempty finite set, and for every $j\in J$ let $\bm{X}_{j}$ be a random vector in $\mathbb{R}^{n}$ whose entries are either independent, or exhibit high cancellation in the sense we described above. If $\bm{X}$ is any convex combination of $(\bm{X}_{j}:j\in J)$ , then clearly $\bm{X}$ is $O(1)\text{-subgaussian}$ at the direction $\bm{\sigma}$ , but it is already not quite straightforward to find a subset $H$ of $[n]$ with $|H|=n/2+O(\sqrt{n})$ such that $\bm{X}$ is $O(1)$ -subgaussian at the direction $\bm{\sigma}_{H}$ .

1.4. The main result

Our main result shows that such a selection is possible in full generality. Specifically, we have the following theorem; more precise quantitative versions are given in Proposition 3.1 and Theorem 4.1 in the main text. (For our conventions for asymptotic notation see Subsection 2.2; recall that by $\mu_{p}$ we denote the $p$ -biased measure on $\{0,1\}^{n}$ .)

Theorem 1.1.

The following hold.

(1)

Let $K>0$ , and let $0<p<1$ . Also let $n$ be a positive integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $\bm{\theta}\in\mathbb{R}^{n}$ . If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , then for every $C>0$

[TABLE]

$($ Thus, the error term in (1.5) does not dependent on the dimension $n$ , the random vector $\bm{X}$ , and the direction $\bm{\theta}$ . $)$ ** 2. (2)

Conversely, let $K>0$ , let $0<p<1$ , and let $0<\gamma\leqslant 1$ . Also let $n$ be a positive integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $\bm{\theta}\in\mathbb{R}^{n}$ . If

[TABLE]

then $\bm{X}$ is $O_{K,p,\gamma}(1)$ -subgaussian at the direction $\bm{\theta}$ .

1.5. Sharpness of the probability

Although the lower bound in (1.5) is independent of the direction $\bm{\theta}$ , we note that the probability appearing on the left-hand side of (1.5) does depend upon the choice of $\bm{\theta}$ . Indeed, if $\bm{\theta}=(1,\dots,1)\in\mathbb{R}^{n}$ , then this probability is $1-o_{C\to\infty;K,p}(1)-o_{n\to\infty;p}(1)$ . (See Corollary 4.11 in the main text.) At the other extreme, there exist random vectors and directions in $\mathbb{R}^{n}$ for which the corresponding probability is at most $p+o_{n\to\infty;p,C}(1)$ for any fixed $C>0$ . (See Example 4.2.) In particular, the lower bound in (1.5) is optimal.

1.6. Related results/Outline of the argument

Beyond its probabilistic content, Theorem 1.1 can also be placed in the general context of property testing (see, e.g., [G]). Indeed, Theorem 1.1 essentially asserts that subgaussianity, at any given direction, is testable.

Theorem 1.1 can also be viewed as a partial unconditionality result, in the spirit of the work of Elton [E1, E2] and Pajor [Pa]. In fact, this is more than an analogy since part (1) of Theorem 1.1 for $p=1/2$ and the direction $(1,\dots,1)\in\mathbb{R}^{n}$ can be proved using the Sauer–Shelah lemma which is a main tool in the proof of the Elton–Pajor theorem.

That said, the proof of the general case of Theorem 1.1 is quite intrinsic and, apart from a couple of basic tools, it relies exclusively on properties of subgaussian random variables.

The first part is based on a large deviation inequality for the $\psi_{2}\text{-norm}$ of the random variables $\langle\bm{\theta}_{H},\bm{X}\rangle$ which can be seen as a reverse triangle inequality; this is the content of Proposition 4.3 in the main text. With this inequality at our disposal, we detect the behavior of the probability in (1.5) using the $\ell_{\infty}$ -norm $\|\bm{\theta}\|_{\infty}$ of the direction $\bm{\theta}$ . Specifically, if $\|\bm{\theta}\|_{2}=1$ and $\|\bm{\theta}\|_{\infty}$ is sufficiently small, say $\|\bm{\theta}\|_{\infty}\leqslant 1/L$ , then we may select $C=O_{K,p,L}(1)$ such that the corresponding probability is $1-o_{L\to\infty;K}(1)-o_{n\to\infty,p}(1)$ . On the other hand, if $\|\bm{\theta}\|_{\infty}\geqslant 1/L$ , then we fix a coordinate $i_{0}\in[n]$ such that $|\theta_{i_{0}}|\geqslant 1/L$ and we proceed by conditioning on the set of all $H\subseteq[n]$ such that $i_{0}\in H$ .

*Remark 1.2**.*

The argument is roughly analogous to the proof of Roth’s theorem [Ro]. Indeed, the case where the $\ell_{\infty}$ -norm is small corresponds to case of small Fourier bias and it implies pseudorandomness. On the other hand, the case where the $\ell_{\infty}$ -norm is non-negligible corresponds to the case of correlation with a character, and the proof takes advantage of this structural information.

The proof of the second part of Theorem 1.1 is quite simple, and it follows from a standard application of the bounded differences inequality.

1.7. Structure of the paper

We close this introduction by briefly discussing the contents of this paper. In Section 2, we fix our notation (which is mostly standard), and we recall some basic material which is needed for the proof of our main result. In Section 3 we give the proof of part (2) of Theorem 1.1, and in Section 4 we give the proof of part (1). Finally, in Section 5 we present and we comment on various extensions of Theorem 1.1.

Acknowledgments

We would like to thank the anonymous referee for carefully reading the paper and for several helpful suggestions.

2. Background material

2.1.

By $\mathbb{N}=\{0,1,\dots\}$ we denote the set of all natural numbers. Recall that for every positive integer $n$ we set $[n]\coloneqq\{1,\dots,n\}$ . Moreover, for every finite set $H$ by $|H|$ we denote its cardinality.

2.2.

We use the following $o(\cdot)$ and $O(\cdot)$ notation. If $a_{1},\dots,a_{k}$ are parameters and $C$ is a positive real/integer, then we write $o_{C\to\infty;a_{1},\dots,a_{k}}(X)$ to denote a quantity bounded in magnitude by $XF_{a_{1},\dots,a_{k}}(C)$ where $F_{a_{1},\dots,a_{k}}$ is a function which depends on $a_{1},\dots,a_{k}$ and goes to zero as $C\to\infty$ . Similarly, by $O_{a_{1},\dots,a_{k}}(X)$ we denote a quantity bounded in magnitude by $XC_{a_{1},\dots,a_{k}}$ where $C_{a_{1},\dots,a_{k}}$ is a positive constant depending on the parameters $a_{1},\dots,a_{k}$ .

2.3.

As we have mentioned, for every positive integer $n$ and every $0<p<1$ by $\mu_{p}$ we denote the $p$ -biased measure on $\{0,1\}^{n}$ , that is, the probability measure on $\{0,1\}^{n}$ which is defined by setting

[TABLE]

for every $H\subseteq[n]$ . In particular, $\mu_{1/2}$ is the uniform probability measure on $\{0,1\}^{n}$ .

2.4.

For every vector $\bm{c}=(c_{1},\dots,c_{n})$ in $\mathbb{R}^{n}$ and every $1\leqslant p\leqslant\infty$ by $\|\bm{c}\|_{p}$ we shall denote the $\ell_{p}$ -norm of $\bm{c}$ , that is, $\|\bm{c}\|_{p}=(|c_{1}|^{p}+\dots+|c_{n}|^{p})^{1/p}$ if $1\leqslant p<\infty$ , and $\|\bm{c}\|_{\infty}=\max\{|c_{1}|,\dots,|c_{n}|\}$ .

2.5. Properties of subgaussian random variables

We will need the following properties of subgaussian random variables. For a proof, as well as for a detailed discussion of related material, see [V, Chapter 2].

Proposition 2.1.

Let $X$ be a real-valued random variable.

(a)

If $X$ is subgaussian, then we have $\mathbb{P}(\{|X|\geqslant t\})\leqslant 2\exp(-t^{2}/\|X\|_{\psi_{2}}^{2})$ for every $t>0$ . 2. (b)

Conversely, let $K>0$ and assume that $\mathbb{P}(\{|X|\geqslant t\})\leqslant 2\exp(-t^{2}/K^{2})$ for every $t>0$ . Then, $X$ is subgaussian and, moreover, $\|X\|_{\psi_{2}}\leqslant\sqrt{3}\,K$ .

2.6. Hoeffding’s inequality and the bounded differences inequality

In various places in the paper, we will apply Hoeffding’s inequality and the bounded differences inequality. We will use these basic inequalities in a form which, although less general, is better suited to our needs. (The standard forms of these inequalities and their proofs can be found, e.g., in [BLM, Theorem 2.8] and [BLM, Theorem 6.2] respectively.)

Precisely, we will need the following consequence of Hoeffding’s inequality.

Proposition 2.2.

Let $n$ be a positive integer, and let $\bm{c}=(c_{1},\dots,c_{n})\in\mathbb{R}^{n}\setminus\{0\}$ . Also let $0<p<1$ . Then for any $t>0$ we have

[TABLE]

We will also need the following special case of the bounded differences inequality.

Proposition 2.3.

Let $n$ be a positive integer, let $f\colon\{0,1\}^{n}\to\mathbb{R}$ be a function, and let $\bm{c}=(c_{1},\dots,c_{n})\in\mathbb{R}^{n}\setminus\{0\}$ such that for every $i\in[n]$ and every $H\subseteq[n]\setminus\{i\}$

[TABLE]

Also let $0<p<1$ . Then, setting $M\coloneqq\underset{H\sim\mu_{p}}{\mathbb{E}}f(H)$ , for any $t>0$ we have

[TABLE]

3. Proof of Theorem 1.1: part (2)

We have the following, more informative, version of part (2) of Theorem 1.1.

Proposition 3.1.

Let $K>0$ , let $0<p<1$ , let $0<\gamma\leqslant 1$ , and set

[TABLE]

Also let $n$ be a positive integer, let $\bm{X}=(X_{1},\dots,X_{n})$ be a random vector in $\mathbb{R}^{n}$ with $\|X_{i}\|_{\psi_{2}}\leqslant 1$ for every $i\in[n]$ , and let $\bm{\theta}\in\mathbb{R}^{n}\setminus\{0\}$ . If

[TABLE]

then $\bm{X}$ is $C$ -subgaussian at the direction $\bm{\theta}$ .

*Remark 3.2**.*

We do not know which is the optimal dependence of the constant $C(K,p,\gamma)$ with respect to the parameters $K,p$ and $\gamma$ . The referee noted that the dependence on $p$ could be improved; observe that the parameter $p$ is important in the sparse regime, that is, when $p=o_{n\to\infty}(1)$ .

Proposition 3.1 is based on two auxiliary results. The first one is an elementary identity which expresses the random variable $\langle\bm{\theta},\bm{X}\rangle$ as a linear combination of the random variables $\langle\bm{\theta}_{H},\bm{X}\rangle$ .

Fact 3.3.

Let $p,n,\bm{X},\bm{\theta}$ be as in Proposition 3.1. Then we have

[TABLE]

In particular,

[TABLE]

Proof.

Observe that

[TABLE]

The estimate in (3.4) follows from this identity and the triangle inequality. ∎

The second auxiliary result is the following, fairly straightforward, consequence of the bounded differences inequality; we isolate this consequence for future use.

Lemma 3.4.

Let $p,n,\bm{X},\bm{\theta}$ be as in Proposition 3.1. Then, setting

[TABLE]

for any $t>0$ we have

[TABLE]

Proof.

By the triangle inequality, for every $i\in[n]$ and every $H\subseteq[n]\setminus\{i\}$ we have

[TABLE]

Using this observation, the result follows from Proposition 2.3. ∎

We are now ready to proceed to the proof of Proposition 3.1.

Proof of Proposition 3.1.

Setting $t_{0}\coloneqq\sqrt{\ln(2/\gamma)}\,\,\|\bm{\theta}\|_{2}>0$ , by (3.6), we have

[TABLE]

Thus, by (3.2), we may select $H\subseteq[n]$ such that

$\bullet$

$\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\leqslant K\|\bm{\theta}_{H}\|_{2}\leqslant K\|\bm{\theta}\|_{2}$ , and 2. $\bullet$

$M\leqslant\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}+t_{0}$ .

Therefore, $M\leqslant K\|\bm{\theta}\|_{2}+t_{0}$ . By (3.4), (3.5) and the choice of $C$ in (3.1), we conclude that $\|\langle\bm{\theta},\bm{X}\rangle\|_{\psi_{2}}\leqslant C\|\bm{\theta}\|_{2}$ , as desired. ∎

4. Proof of Theorem 1.1: part (1)

4.1.

This section is devoted to the proof of the following theorem.

Theorem 4.1.

Let $K>0$ , let $0<p<1$ , let $0<\eta<p$ , and set

[TABLE]

Also let $n$ be a positive integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $\bm{\theta}\in\mathbb{R}^{n}\setminus\{0\}$ . If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , then

[TABLE]

It is clear that Theorem 4.1 yields part (1) of Theorem 1.1. As we have already pointed out in the introduction, the lower bound in (4.2) is optimal.

*Example 4.2**.*

Let $n$ be an arbitrary positive integer, and set

[TABLE]

We fix a $[-1,1]$ -valued random variable $Z$ and, as in Subsection 1.3, we define the (high cancellation) random vector $\bm{X}=(X_{1},\dots,X_{n+1})$ in $\mathbb{R}^{n+1}$ by setting $X_{1}=-Z$ , and $X_{i}=Z$ if $i\in\{2,\dots,n+1\}$ . Since $\langle\bm{\theta},\bm{X}\rangle=0$ , the random vector $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ for any $K>0$ . Next, let $0<p<1$ be arbitrary, and set

[TABLE]

By Proposition 2.2, we see that $\mu_{p}(\mathcal{H})=1-p-o_{n\to\infty;p}(1)$ . Moreover, if $H\in\mathcal{H}$ , then $\langle\bm{\theta}_{H},\bm{X}\rangle=|H|\,Z$ and, therefore, if $K$ is any positive real such that $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}_{H}$ , then $K\geqslant(\sqrt{p/2}\,\,\|Z\|_{\psi_{2}})\,n^{1/2}$ . Thus, we conclude that for any $C>0$ ,

[TABLE]

4.2. A large deviation inequality for the $\psi_{2}$ -norm

The first step of the proof of Theorem 4.1 is the following large deviation inequality.

Proposition 4.3.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $n,\bm{X},\bm{\theta}$ be as in Theorem 4.1. If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , then for any $\lambda\geqslant 8\sqrt{2}$ we have

[TABLE]

In order to put Proposition 4.3 in a proper context recall that, by (3.3) and the triangle inequality, we have $p\,\|\langle\bm{\theta},\bm{X}\rangle\|_{\psi_{2}}\leqslant\underset{H\sim\mu_{p}}{\mathbb{E}}\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}$ . The next corollary shows that this estimate can actually be reversed. Thus, we may view Proposition 4.3 as a reverse triangle inequality.

Corollary 4.4.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $n,\bm{X},\bm{\theta}$ be as in Theorem 4.1. If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , then

[TABLE]

In particular, if $\|\langle\bm{\theta},\bm{X}\rangle\|_{\psi_{2}}\geqslant\|\bm{\theta}\|_{2}/\sqrt{2}$ , then

[TABLE]

Proof.

It is a straightforward consequence of Proposition 4.3. Indeed,

[TABLE]

as desired. ∎

Corollary 4.4 can be used, in turn, to upgrade Proposition 4.3 and provide finer information for the distribution of the $\psi_{2}$ -norm of the random variables $\langle\bm{\theta}_{H},\bm{X}\rangle$ . Specifically, we have the following corollary; it follows immediately by Lemma 3.4, Corollary 4.4, and taking into account the fact that $\|X\|_{\psi_{2}}\leqslant 1/\sqrt{\ln 2}$ for every $[-1,1]$ -valued random variable $X$ .

Corollary 4.5.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $n,\bm{X},\bm{\theta}$ be as in Theorem 4.1. If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , then for any $\lambda>0$

[TABLE]

4.3. Proof of Proposition 4.3

It is based on the following lemma.

Lemma 4.6.

Let $K>0$ , and let $0<p<1$ . Also let $n,\bm{X},\bm{\theta}$ be as in Theorem 4.1. If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , then, setting $Q\coloneqq\max\{2pK,\sqrt{2}\}$ , for every $M\geqslant\max\{4\sqrt{2\ln 2}\,pK,4\sqrt{\ln 2}\}$ we have

[TABLE]

It is easy to see that Proposition 4.3 follows from Lemma 4.6. Indeed, let $\lambda\geqslant 8\sqrt{2}$ be arbitrary, and set $M\coloneqq(\sqrt{\ln 2}/2)\,\lambda K$ . It is easy to see that with this choice we have that $M\geqslant\max\{4\sqrt{2\ln 2}\,pK,4\sqrt{\ln 2}\}$ . Noticing that $2K\geqslant\max\{2pK,\sqrt{2}\}$ , by Lemma 4.6, we conclude that (4.5) is satisfied.

Thus, it is enough to prove Lemma 4.6. To this end, we need the following sublemma.

Sublemma 4.7.

Let $X$ be a real-valued random variable, let $R,C>0$ , and assume that $\mathbb{P}(\{|X|\geqslant 2^{j}R\})\leqslant 2\exp(-(2^{j}R)^{2}/C^{2})$ for every $j\in\mathbb{N}$ . Then we have

[TABLE]

Proof.

Set $N\coloneqq\max\{2C,R/\sqrt{\ln 2}\}$ . By Proposition 2.1, it suffices to show that for every $t>0$ we have $\mathbb{P}(\{|X|\geqslant t\})\leqslant 2\exp(-t^{2}/N^{2})$ .

Indeed, notice first that, since $N\geqslant R/\sqrt{\ln 2}$ , we have $2\exp(-R^{2}/N^{2})\geqslant 1$ . This, in turn, implies that $\mathbb{P}(\{|X|\geqslant t\})\leqslant 2\exp(-t^{2}/N^{2})$ if $0<t\leqslant R$ .

The remaining cases (that is, when $t\geqslant R$ ) follow from our hypothesis and a standard dyadic pigeonholing. Specifically, for every $j\in\mathbb{N}$ set $t_{j}\coloneqq 2^{j}R$ and observe that

[TABLE]

Let $t\geqslant R$ be arbitrary and let $j_{0}\in\mathbb{N}$ be such that $t_{j_{0}}\leqslant t<t_{j_{0}+1}=2t_{j_{0}}$ . Then we have

[TABLE]

and the proof is completed. ∎

We are ready to proceed to the proof of Lemma 4.6.

Proof of Lemma 4.6.

The left-hand side of (4.9) is scale-invariant; thus we may assume that $\|\bm{\theta}\|_{2}=1$ , and it is enough to prove that

[TABLE]

for every $M\geqslant\max\{4\sqrt{2\ln 2}\,pK,4\sqrt{\ln 2}\}$ .

Step 1. We will show that for every $t>0$ we have

[TABLE]

Fix $t>0$ . Let $(\Omega,\mathcal{F},\mathbb{P})$ denote the underlying probability space. Let $\omega\in\Omega$ be arbitrary; since $\bm{X}(\omega)\in[-1,1]^{n}$ and $\|\bm{\theta}\|_{2}=1$ , by Proposition 2.2, we have

[TABLE]

(We note that here is the only place in the argument where the boundedness of the random vector $\bm{X}$ is used.) Next, observe that the event

[TABLE]

contains the event

[TABLE]

Finally, notice that $\|\langle\bm{\theta},\bm{X}\rangle\|_{\psi_{2}}\leqslant K$ since $\|\bm{\theta}\|_{2}=1$ and $\bm{X}$ is $K$ -subgaussian at the direction $\bm{\theta}$ . Thus, by Proposition 2.1 applied to the fixed $t$ , we have

[TABLE]

Let $\mu_{p}\times\mathbb{P}$ denote the product probability measure of $\mu_{p}$ and $\mathbb{P}$ . Then using: (i) the estimates in (4.13) and (4.16), (ii) the inclusion of the events in (4.14) and (4.15), (iii) the choice of the constant $Q$ , and (iv) Fubini’s theorem, we obtain that

[TABLE]

or, equivalently,

[TABLE]

By (4.18) and Markov’s inequality, we conclude that

[TABLE]

which is clearly equivalent to (4.12).

Step 2. We will estimate the probability in (4.11) using a discretization argument, (4.12) and Sublemma 4.7. We proceed to the details.

Let $M\geqslant\max\{4\sqrt{2\ln 2}\,pK,4\sqrt{\ln 2}\}$ be arbitrary. For every $j\in\mathbb{N}$ set

[TABLE]

and observe that, by (4.12), we have $\mu_{p}(\mathcal{C}_{M}^{j})\geqslant 1-2\exp\Big{(}-\frac{2^{2j}M^{2}}{2Q^{2}}\Big{)}$ . Therefore, setting

[TABLE]

we have

[TABLE]

where the last inequality holds true since $M\geqslant 2\sqrt{2\ln 2}\,Q\geqslant\sqrt{2\ln 2}\,Q$ . Moreover, for every $H\in\mathcal{C}_{M}$ , by Sublemma 4.7 applied for “ $X=\langle\bm{\theta}_{H},\bm{X}\rangle$ ”, “ $R=M$ ” and “ $C=\sqrt{2}\,Q$ ” and using again the fact that $M\geqslant 2\sqrt{2\ln 2}\,Q$ , we see that $\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\leqslant\sqrt{3/\ln 2}\,M$ . This shows that (4.11) is satisfied, and the proof of Lemma 4.6 is completed. ∎

4.4. The main dichotomy

The next, and last, step of the proof of Theorem 4.1 is the following proposition which relates the probability on the left-hand side of (4.2) with the $\ell_{\infty}$ -norm of the direction $\bm{\theta}$ . In particular, this probability gets bigger as $\|\bm{\theta}\|_{\infty}$ gets smaller.

Proposition 4.8.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $n,\bm{X},\bm{\theta}$ be as in Theorem 4.1. Assume that $\|\bm{\theta}\|_{2}=1$ and that $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ . Finally, let $0<\alpha\leqslant 1$ . Then, for every $\lambda>0$ , the following hold.

(i)

If $\|\bm{\theta}\|_{\infty}\leqslant\alpha$ , then

[TABLE] 2. (ii)

If $\|\bm{\theta}\|_{\infty}\geqslant\alpha$ , then

[TABLE]

*Remark 4.9**.*

Note that the lower bound in (4.23) depends upon the choice of $\alpha$ (thus, it is not uniform) but this is offset by making the subgaussianity constant of $\bm{X}$ at the direction $\bm{\theta}_{H}$ independent of $\alpha$ . In (4.24), this phenomenon is reversed.

*Remark 4.10**.*

The dependence on $p$ in (4.23) is tight up to a logarithmic factor. This can be seen by considering the diagonal direction of a random vector $\bm{X}$ whose entries are truncated independent exponential random variables. We are grateful to the referee for pointing this out.

Proof of Proposition 4.8.

Fix $\lambda>0$ , and set

[TABLE]

Since $\|\bm{\theta}\|_{2}=1$ and $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\theta}$ , by Corollary 4.5, we have

[TABLE]

Also write $\bm{\theta}=(\theta_{1},\dots,\theta_{n})$ .

Part (i): Assume that $\|\bm{\theta}\|_{\infty}\leqslant\alpha$ , and set

[TABLE]

Notice that for every $H\in\mathcal{H}_{1}\cap\mathcal{H}_{2}$ we have $\|\langle\bm{\theta}_{H},\bm{X}\rangle\|_{\psi_{2}}\leqslant\sqrt{2/p}\,(12+\lambda)K\,\|\bm{\theta}_{H}\|_{2}$ , that is, the random vector $\bm{X}$ is $\big{(}\sqrt{2/p}\,(12+\lambda)K\big{)}$ -subgaussian at the direction $\bm{\theta}_{H}$ . Also observe that

[TABLE]

Thus, by Proposition 2.2 applied for the vector “ $\bm{c}=(\theta_{1}^{2},\dots,\theta_{n}^{2})$ ” and “ $t=p/2$ ”, we obtain that

[TABLE]

Combining (4.26) and (4.29), we see that (4.23) is satisfied.

Part (ii): Now assume that $\|\bm{\theta}\|_{\infty}\geqslant\alpha$ . Fix $i_{0}\in[n]$ such that $|\theta_{i_{0}}|\geqslant\alpha$ , and set

[TABLE]

Observe that for every $H\in\mathcal{H}_{3}$ we have $\alpha\leqslant\|\bm{\theta}_{H}\|_{\infty}\leqslant\|\bm{\theta}_{H}\|_{2}$ . Consequently, for every $H\in\mathcal{H}_{1}\cap\mathcal{H}_{3}$ the random vector $\bm{X}$ is $\big{(}(12+\lambda)K\alpha^{-1}\big{)}$ -subgaussian at the direction $\bm{\theta}_{H}$ . Since $\mu_{p}(\mathcal{H}_{3})=p$ , the result follows. ∎

We close this subsection with the following consequence of Proposition 4.8 which complements Example 4.2 and concerns the behavior of the probability in (4.2) for the “flat” vector $(1,\dots,1)\in\mathbb{R}^{n}$ .

Corollary 4.11.

Let $K>0$ , and let $0<p<1$ . Also let $n,\bm{X}$ be as in Theorem 4.1, and set $\bm{\sigma}\coloneqq(1,\dots,1)\in\mathbb{R}^{n}$ . If $\bm{X}$ is $K\text{-subgaussian}$ at the direction $\bm{\sigma}$ , then for every $\lambda>0$ we have

[TABLE]

Proof.

It follows by part (i) of Proposition 4.8 applied to the vector “ $\bm{\theta}=\bm{\sigma}/\sqrt{n}$ ” (notice that $\|\bm{\theta}\|_{2}=1$ ), the constant “ $K=K+1$ ” and “ $\alpha=1/\sqrt{n}$ ”. ∎

4.5. Proof of Theorem 4.1

The result follows by applying Proposition 4.8 for

[TABLE]

and observing that

[TABLE]

by (4.32) and the choice of $C(K,p,\eta)$ in (4.1). Indeed, clearly we may assume that $\|\bm{\theta}\|_{2}=1$ . Therefore, if $\|\bm{\theta}\|_{\infty}\leqslant\alpha$ , then, by (4.23) and the previous observation,

[TABLE]

while if $\|\bm{\theta}\|_{\infty}\geqslant\alpha$ , then, by (4.24),

[TABLE]

*Remark 4.12**.*

Note that the lower bound in (4.2) can be proved without invoking Proposition 4.8. Indeed, one can proceed using Corollary 4.5, the elementary identity

[TABLE]

and Markov’s inequality. However, this approach yields a weaker estimate for the constant $C(K,p,\eta)$ in (4.1) and, more importantly, it provides no information on the behavior of the probability appearing on the left-hand side of (4.2).

5. Comments

5.1. Extension to non-linear functions

Beyond the class of linear functions, Theorem 1.1 can be extended to certain chaoses which have a natural combinatorial interpretation: they are the homomorphism densities associated with weighted uniform hypergraphs (see, e.g., [L, Chapter 7]). Of course, in order to be meaningful such an extension, one has to select an appropriate normalization. We will adopt the scaling which appears in the bounded differences inequality222This choice is not optimal for certain classes of functions, but it appears to be the right choice at this level of generality..

5.1.1.

Specifically, let $n$ be a positive integer, and let $f\colon[-1,1]^{n}\to\mathbb{R}$ be a bounded measurable function. For every $i\in[n]$ set

[TABLE]

and define

[TABLE]

Notice that: (i) the quantity $\|\cdot\|_{\Delta}$ is a semi-norm, (ii) $\|f+c\|_{\Delta}=\|f\|_{\Delta}$ for every $c\in\mathbb{R}$ , (iii) $\|f\|_{\Delta}=0$ if and only if the function $f$ is constant, and (iv) if $f$ is linear, that is, $f(x_{1},\dots,x_{n})=\theta_{1}x_{1}+\dots+\theta_{n}x_{n}$ , then $\|f\|_{\Delta}=2\|(\theta_{1},\dots,\theta_{n})\|_{2}$ .

5.1.2.

Next, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]$ -valued entries. Given $K>0$ , we say that $\bm{X}$ is $K$ -subgaussian with respect to $f$ if

[TABLE]

Observe that if $f(x_{1},\dots,x_{n})=\theta_{1}x_{1}+\dots+\theta_{n}x_{n}$ is linear, then this is equivalent to saying that $\bm{X}$ is $K$ -subgaussian at the direction $(\theta_{1},\dots,\theta_{n})$ . Also note that if the random vector $\bm{X}$ has independent entries, then the bounded differences inequality yields that $\bm{X}$ is $O(1)\text{-subgaussian}$ with respect to $f-\mathbb{E}[f(\bm{X})]$ .

5.1.3.

It is also straightforward to extend (1.4). Precisely, for every subset $H$ of $[n]$ let $f_{H}\colon[-1,1]^{n}\to\mathbb{R}$ denote the function defined by

[TABLE]

where $\pi_{H}(x_{1},\dots,x_{n})=(x^{\prime}_{1},\dots,x^{\prime}_{n})$ with $x^{\prime}_{i}=x_{i}$ if $i\in H$ , and $x^{\prime}_{i}=0$ otherwise.

Thus, the non-linear version of the question discussed in the introduction is whether the subgaussian behavior of the random vector $\bm{X}$ with respect to the function $f$ is reflected to/characterized by the typical subgaussian behavior of $\bm{X}$ with respect to $f_{H}$ where $H$ is random subset of $[n]$ .

5.1.4.

It is likely that this problem is rather delicate. As we have mentioned, we will consider the case where the function $f$ is the homomorphism density associated with a weighted uniform hypergraph.

More precisely, let $d$ be a positive integer. For every integer $n\geqslant d$ and every $A\subseteq[n]$ by ${A\choose d}$ we denote the set of all subsets of $A$ of cardinality $d$ . Let $\mathcal{W}$ be a weighted $d\text{-uniform}$ hypergraph, that is, $\mathcal{W}$ is a map which assigns to every hyperedge $e\in{[n]\choose d}$ a weight $\mathcal{W}(e)\in\mathbb{R}$ . The homomorphism density function associated with $\mathcal{W}$ is the map $\hom_{\mathcal{W}}\colon[-1,1]^{n}\to\mathbb{R}$ defined by

[TABLE]

Note that if $H$ is a subset of $[n]$ , then the restriction $(\hom_{\mathcal{W}})_{H}$ of $\hom_{\mathcal{W}}$ defined in (5.4) is naturally identified with the homomorphism density function $\hom_{\mathcal{W}[H]}$ associated with the induced on $H$ sub-hypergraph $\mathcal{W}[H]$ of $\mathcal{W}$ .

5.1.5.

We have the following theorem.

Theorem 5.1.

The following hold.

(1)

Let $K>0$ , let $0<p<1$ , and let $d$ be a positive integer. Also let $n\geqslant d$ be an integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $\mathcal{W}$ be a weighted $d$ -uniform hypergraph on $[n]$ . If $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{W}}$ , then for every $C>0$

[TABLE] 2. (2)

Conversely, let $K>0$ , let $0<p<1$ , let $0<\gamma\leqslant 1$ , and let $d$ be a positive integer. Also let $n\geqslant d$ be an integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $\mathcal{W}$ be a weighted $d\text{-uniform}$ hypergraph on $[n]$ . If

[TABLE]

then $\bm{X}$ is $O_{K,p,\gamma,d}(1)$ -subgaussian with respect to $\hom_{\mathcal{W}}$ .

The proof of Theorem 5.1 is similar to the proof of Theorem 1.1; for the convenience of the reader we present the details in the Appendix.

We also note that the lower bound in (5.6) is optimal. Specifically, we have the following analogue of Example 4.2.

*Example 5.2**.*

Fix a positive integer $d$ , and let $n\geqslant d$ be an arbitrary integer. We define a weighted $d$ -uniform hypergraph $\mathcal{E}$ on $[n+d]$ by the rule

[TABLE]

Also fix a $[-1,1]$ -valued random variable $Z$ , and let $\bm{X}=(X_{1},\dots,X_{n+d})$ be the random vector in $\mathbb{R}^{n+d}$ defined by setting $X_{i}=Z$ for every $i\in[n+d]$ . Observe that $\hom_{\mathcal{E}}(\bm{X})=0$ , and so $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{E}}$ for any $K>0$ . Next, let $0<p<1$ be arbitrary, and set

[TABLE]

By Proposition 2.2, we see that $\mu_{p}(\mathcal{H})=1-p^{d}-o_{n\to\infty;p,d}(1)$ . Fix $H\in\mathcal{H}$ and set $G\coloneqq H\cap\{d+1,\dots,n+d\}$ . Since $\hom_{\mathcal{E}[H]}(\bm{X})=-{G\choose d}Z^{d}$ , we have

[TABLE]

On the other hand, note that

$\bullet$

$\Delta_{i}(\hom_{\mathcal{E}[H]})=0$ if $i\in\{1,\dots,d\}$ (this is because $\{1,\dots,d\}\nsubseteq H$ ), 2. $\bullet$

$\Delta_{i}(\hom_{\mathcal{E}[H]})=0$ if $i\in\{d+1,\dots,n+d\}\setminus H$ , and 3. $\bullet$

$|\Delta_{i}(\hom_{\mathcal{E}[H]})|\leqslant 2{G\choose d-1}\leqslant 2n^{d-1}$ if $i\in H\cap\{d+1,\dots,n+d\}$

which implies that $\|\hom_{\mathcal{E}[H]}\|_{\Delta}\leqslant 2n^{d-1/2}$ . Therefore, if $K$ is any positive real such that $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{E}[H]}$ , then

[TABLE]

Thus, for any $C>0$ we have

[TABLE]

5.2. Extension to partially subgaussian random vectors

Let $n$ be a positive integer, and let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ . Given $K,\tau>0$ and $\bm{\theta}\in\mathbb{R}^{n}$ , we say333This terminology is not standard. that $\bm{X}$ is $(K,\tau)$ -partially subgaussian at the direction $\bm{\theta}$ provided that

[TABLE]

Notice that if $\tau=O(\|\bm{\theta}\|_{2})$ , then this is equivalent to saying that the random vector $\bm{X}$ is $O_{K}(1)$ -subgaussian at the direction $\bm{\theta}$ . Thus, this notion is of interest when $\tau$ is significantly larger than $\|\bm{\theta}\|_{2}$ . Examples of random vectors which are partially subgaussian with parameters in this regime appear frequently in combinatorics, most notably in various density increment strategies. Specifically, one encounters random vectors in $\mathbb{R}^{n}$ which are $(K,\tau)$ -partially subgaussian at the direction $(1,\dots,1)\in\mathbb{R}^{n}$ with $K=O(1)$ and $\tau=\eta n$ where $\eta>0$ is a very small constant; see [DK, Part 2]. The understanding of the statistical/concentration properties of these examples was the starting point of the present paper.

5.2.1.

It is not hard to see that Theorem 1.1 can be extended to $(K,\tau)$ -partially subgaussian random vectors, but of course one is also interested in determining the quantitative dependence on the parameter $\tau$ . In this direction we have the following analogue of Proposition 4.8.

Proposition 5.3.

Let $K\geqslant 1/\sqrt{2}$ , let $0<p<1$ , and let $\tau\geqslant\max\{p^{-1},\sqrt{2}K\}$ . Also let $n$ be a positive integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $\bm{\theta}\in\mathbb{R}^{n}$ with $\|\bm{\theta}\|_{2}=1$ . Assume that $\bm{X}$ is $(K,\tau)\text{-partially}$ subgaussian at the direction $\bm{\theta}$ . Finally, let $0<\alpha\leqslant 1$ . Then the following hold.

(i)

If $\|\bm{\theta}\|_{\infty}\leqslant\alpha$ , then

[TABLE] 2. (ii)

If $\|\bm{\theta}\|_{\infty}\geqslant\alpha$ , then

[TABLE]

In particular, Proposition 5.3 yields that if $K\geqslant 1/\sqrt{2}$ , $\tau=\eta n$ for some $\eta>0$ , $n\geqslant\max\{2K^{2},p^{-2}\}/\eta^{2}$ , and the random vector $\bm{X}$ is $(K,\tau)\text{-partially}$ subgaussian at the direction $(1,\dots,1)\in\mathbb{R}^{n}$ , then the probability on the left-hand side of (5.8) is at least

[TABLE]

that is, we have an exponential improvement upon (4.31).

5.2.2.

Not surprisingly, the proof of Proposition 5.3 follows the lines of the proof of Proposition 4.8. The only difference is that, instead of Corollary 4.5, it uses a straightforward variant of Lemma 4.6 for partially subgaussian random vectors. (In particular, the exponential gain in (5.10) comes from the fact that we need to control the tails up to $\tau$ .) We leave the details to the interested reader.

5.3. Extension to not necessarily bounded random vectors

It is open to us whether part (1) of Theorem 1.1 can be extended to random vectors with subgaussian, but not necessarily bounded, entries. Although the boundedness of $\bm{X}$ is used only in (4.13), the strategy of our proof uses this property in an essential way and it cannot be dropped by merely optimizing the argument.

Appendix A Proof of Theorem 5.1

A.1. Preliminary tools

We begin by observing the following two simple facts; they will be used in the proofs of both parts of Theorem 5.1.

Fact A.1.

Let $0<p<1$ , let $d,n$ be positive integers with $d\leqslant n$ , and let $\mathcal{W}$ be a weighted $d$ -uniform hypergraph on $[n]$ . Then, for any $\bm{x}\in[-1,1]^{n}$ we have

[TABLE]

In particular, if $\bm{X}$ is a random vector in $\mathbb{R}^{n}$ with $[-1,1]$ -valued entries, then

[TABLE]

Proof.

Write $\bm{x}=(x_{1},\dots,x_{n})$ and notice that

[TABLE]

The estimate in (A.2) follows from (A.1) and the triangle inequality. ∎

Fact A.2.

Let $n$ be a positive integer, let $\bm{X}$ be a random vector in $\mathbb{R}^{n}$ with $[-1,1]\text{-valued}$ entries, and let $f\colon[-1,1]^{n}\to\mathbb{R}$ be a bounded measurable function. Define $g\colon\{0,1\}^{n}\to\mathbb{R}$ by setting $g(H)=\|f_{H}(\bm{X})\|_{\psi_{2}}$ for every $H\subseteq[n]$ , where $f_{H}$ is as in (5.4). Then we have

[TABLE]

Proof.

The desired estimate is a consequence of the fact that for every bounded random variable $Y$ we have

[TABLE]

Indeed, fix $i\in[n]$ and $H\subseteq[n]\setminus\{i\}$ , and observe that

[TABLE]

Thus, we have $\Delta_{i}(g)\leqslant\Delta_{i}(f)/\sqrt{\ln 2}$ for every $i\in[n]$ . This, in turn, implies inequality (A.3). ∎

A.2. Proof of part (2)

Let $K,p,\gamma,d,n,\bm{X},\mathcal{W}$ be as in part (2) of Theorem 5.1, and set

[TABLE]

We will show that if

[TABLE]

then $\bm{X}$ is $C$ -subgaussian with respect to $\hom_{\mathcal{W}}$ . To this end we need the following lemma.

Lemma A.3.

Let $p,d,n,\bm{X},\mathcal{W}$ be as in part (2) of Theorem 5.1. Then, setting

[TABLE]

for any $t>0$ we have

[TABLE]

Proof.

Define $g\colon\{0,1\}^{n}\to\mathbb{R}$ by setting $g(H)=\|\hom_{\mathcal{W}[H]}(\bm{X})\|_{\psi_{2}}$ for every $H\subseteq[n]$ , and observe that $\underset{H\sim\mu_{p}}{\mathbb{E}}g(H)=M$ . By Fact A.2 applied for the function “ $f=\hom_{\mathcal{W}}$ ”, we see that $\|g\|_{\Delta}\leqslant\|\hom_{\mathcal{W}}\|_{\Delta}/\sqrt{\ln 2}$ . Hence, by Proposition 2.3, for any $t>0$ we have

[TABLE]

as desired. ∎

Now set $t_{0}\coloneqq\sqrt{1-\log_{2}(\gamma)}\,\|\hom_{\mathcal{W}}\|_{\Delta}$ , and let $M$ be as in (A.7). By (A.6) and Lemma A.3, there exists $H_{0}\subseteq[n]$ such that

$\bullet$

$M\leqslant t_{0}+\|\hom_{\mathcal{W}[H_{0}]}(\bm{X})\|_{\psi_{2}}$ , and 2. $\bullet$

$\|\hom_{\mathcal{W}[H_{0}]}(\bm{X})\|_{\psi_{2}}\leqslant K\|\hom_{\mathcal{W}[H_{0}]}\|_{\Delta}\leqslant K\|\hom_{\mathcal{W}}\|_{\Delta}$ .

(The last inequality follows from the definition of the semi-norm $\|\cdot\|_{\Delta}$ and (5.4).) Using these estimates, the result follows by (A.2) and the choice of $C$ in (A.5).

A.3. Proof of part (1)

The proof of this part is more involved. As we have already noted, the argument is similar to that of the proof of Theorem 4.1.

A.3.1. A large deviation inequality

The first step is the following analogue of Proposition 4.3.

Proposition A.4.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $d,n,\bm{X},\mathcal{W}$ be as in part (1) of Theorem 5.1. If $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{W}}$ , then for any $\lambda\geqslant 8\sqrt{2}$ ,

[TABLE]

Proof.

Note that, arguing as in Subsection 4.3, it is enough to show the following.

*Let $K>0$ , and let $0<p<1$ . Also let $d,n,\bm{X},\mathcal{W}$ be as in part (1) of Theorem 5.1. If $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{W}}$ , then, setting $Q\coloneqq\max\{2p^{d}K,\sqrt{2}\}$ , for every $M\geqslant\max\{4\sqrt{2\ln 2}\,p^{d}K,4\sqrt{\ln 2}\}$ we have *

[TABLE]

The left-hand side of (A.10) is scale-invariant, and so we may assume that the weighted hypergraph $\mathcal{W}$ satisfies $\|\hom_{\mathcal{W}}\|_{\Delta}=1$ . Thus, it is enough to prove that

[TABLE]

for every $M\geqslant\max\{4\sqrt{2\ln 2}\,p^{d}K,4\sqrt{\ln 2}\}$ .

As in Lemma 4.6, we start by showing that for any $t>0$ we have

[TABLE]

Fix $t>0$ and let $(\Omega,\mathcal{F},\mathbb{P})$ denote the underlying probability space. Let $\omega\in\Omega$ be arbitrary, and recall that $\bm{X}(\omega)\in[-1,1]^{n}$ . We define the map $\zeta\colon\{0,1\}^{n}\to\mathbb{R}$ by setting $\zeta(H)=\hom_{\mathcal{W}[H]}\big{(}\bm{X}(\omega)\big{)}$ for every $H\subseteq[n]$ ; observe that $\Delta_{i}(\zeta)\leqslant\Delta_{i}(\hom_{\mathcal{W}})$ for every $i\in[n]$ . Since $\|\hom_{\mathcal{W}}\|_{\Delta}=1$ , by Proposition 2.3 and identity (A.1),

[TABLE]

(Note that (A.13) is the analogue of (4.13). We point out that this is, essentially, the only step of the proof which differs from that of Proposition 4.3.) Also observe that the event

[TABLE]

contains the event

[TABLE]

On the other hand, we have $\|\hom_{\mathcal{W}}(\bm{X})\|_{\psi_{2}}\leqslant K$ since $\|\hom_{\mathcal{W}}\|_{\Delta}=1$ and the random vector $\bm{X}$ is $K$ -subgaussian with respect to $\hom_{\mathcal{W}}$ . Thus, by Proposition 2.1,

[TABLE]

Denoting by $\mu_{p}\times\mathbb{P}$ the product probability measure of $\mu_{p}$ and $\mathbb{P}$ , the previous discussion yields that

[TABLE]

The estimate in (A.12) now follows from (A.17) and Markov’s inequality.

With inequality (A.12) at our disposal, we will estimate the probability in (A.11) using Sublemma 4.7. Precisely, fix $M\geqslant\max\{4\sqrt{2\ln 2}\,p^{d}K,4\sqrt{\ln 2}\}$ , and for every $j\in\mathbb{N}$ set

[TABLE]

Also set

[TABLE]

By (A.12), we have $\mu_{p}(\mathcal{C}_{M}^{j})\geqslant 1-2\exp\Big{(}-\frac{2^{2j}M^{2}}{2Q^{2}}\Big{)}$ for every $j\in\mathbb{N}$ . This estimate and the fact that $M\geqslant 2\sqrt{2\ln 2}\,Q\geqslant\sqrt{2\ln 2}\,Q$ are easily seen to imply that

[TABLE]

For every $H\in\mathcal{C}_{M}$ , by Sublemma 4.7 applied for the random variable “ $X=\hom_{\mathcal{W}[H]}(\bm{X})$ ”, “ $R=M$ ” and “ $C=\sqrt{2}\,Q$ ” and using again the fact that $M\geqslant 2\sqrt{2\ln 2}\,Q$ , we obtain that $\|\hom_{\mathcal{W}[H]}(\bm{X})\|_{\psi_{2}}\leqslant\sqrt{3/\ln 2}\,M$ . That is, (A.11) is satisfied, as desired. ∎

A.3.2. Consequences

We will need two consequences of Proposition A.4. The first one is the analogue of Corollary 4.4; its proof is identical to that of Corollary 4.4.

Corollary A.5.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $d,n,\bm{X},\mathcal{W}$ be as in part (1) of Theorem 5.1. If $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{W}}$ , then

[TABLE]

The second corollary is the analogue of Corollary 4.5.

Corollary A.6.

Let $K\geqslant 1/\sqrt{2}$ , and let $0<p<1$ . Also let $d,n,\bm{X},\mathcal{W}$ be as in part (1) of Theorem 5.1. If $\bm{X}$ is $K\text{-subgaussian}$ with respect to $\hom_{\mathcal{W}}$ , then for any $\lambda>0$ ,

[TABLE]

Proof.

As in the proof of Lemma A.3, define the function $g\colon\{0,1\}^{n}\to\mathbb{R}$ by setting $g(H)=\|\hom_{\mathcal{W}[H]}(\bm{X})\|_{\psi_{2}}$ for every $H\subseteq[n]$ . Recall that, by Fact A.2, we have

[TABLE]

Using Corollary A.5 and (A.23), the result follows by applying Proposition 2.3 to the function $g$ and the vector “ $\mathbf{c}=\big{(}\Delta_{1}(g),\dots,\Delta_{n}(g)\big{)}$ ”. ∎

A.3.3. Completion of the proof

Notice that part (1) of Theorem 5.1 follows from the following, more informative, theorem.

Theorem A.7.

Let $K,p,d,n,\bm{X},\mathcal{W}$ be as in part (1) of Theorem 5.1. Also let $0<\eta<p^{d}$ , and set

[TABLE]

If $\bm{X}$ is $K$ -subgaussian with respect to $\hom_{\mathcal{W}}$ , then

[TABLE]

Proof.

Set

[TABLE]

and observe that $2\exp(-2\ln 2\lambda^{2}(K+1)^{2})=\eta/2$ . Also set

[TABLE]

and

[TABLE]

By Corollary A.6, we have

[TABLE]

On the other hand, by identity (A.1), the fact that $\|\cdot\|_{\Delta}$ is a semi-norm, and the triangle inequality, we have $p^{d}\,\|\hom_{\mathcal{W}}\|_{\Delta}\leqslant\underset{H\sim\mu_{p}}{\mathbb{E}}\|\hom_{\mathcal{W}[H]}\|_{\Delta}$ . Moreover, notice that $\|\hom_{\mathcal{W}[H]}\|_{\Delta}\leqslant\|\hom_{\mathcal{W}}\|_{\Delta}$ for every $H\subseteq[n]$ . Using these observations, we obtain that

[TABLE]

Therefore, by (A.29) and (A.30), we see that

[TABLE]

Finally observe that, by the choice of $C$ in (A.24), for every $H\in\mathcal{H}_{1}\cap\mathcal{H}_{2}$ we have $\|\hom_{\mathcal{W}[H]}(\bm{X})\|_{\psi_{2}}\leqslant C\|\hom_{\mathcal{W}[H]}\|_{\Delta}$ . The proof is completed. ∎

*Remark A.8**.*

We note that it is also possible to obtain a partial extension of part (i) of Proposition 4.8. More precisely, if $\mathcal{W}$ is the complete $d$ -uniform hypergraph on $n$ vertices—that is, if $\mathcal{W}(e)=1$ for every $e\in{[n]\choose d}$ —or, more generally, if the weighted hypergraph $\mathcal{W}$ is sufficiently pseudorandom444The notion of pseudorandomness which is needed in our setting is the following requirement: for every $0<p<1$ , if $n$ is sufficiently large (depending only on $p$ ), then for every $i\in[n]$ and every $H\subseteq[n]$ with $|H|\geqslant pn$ we have

$\sum_{i\in e\in{H\choose d}}\!\!\!\mathcal{W}(e)\geqslant(p^{d-1}/2)\!\!\sum_{i\in e\in{[n]\choose d}}\!\!\!\mathcal{W}(e).$

, then the probability on the left-hand side of (5.6) is $1-o_{C\to\infty;K,p,d}(1)-o_{n\to\infty;p,d}(1)$ .

Bibliography11

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[BN] S. G. Bobkov, and F. L. Nazarov, Large deviations of typical linear functionals on a convex body with unconditional basis , in “Stochastic inequalities and applications”, Progress in Probability, Vol. 56, Birkhäuser, Basel, 2003, 3–13.
2[BLM] S. Boucheron, G. Lugosi and P. Massart, Concentration inequalities. A nonasymptotic theory of independence , Oxford University Press, 2013.
3[DK] P. Dodos and V. Kanellopoulos, Ramsey Theory for Product Spaces , Mathematical Surveys and Monographs, Vol. 212, American Mathematical Society, 2016.
4[E 1] J. Elton, Weakly null normalized sequences in Banach spaces , Ph.D. Thesis, Yale University, 1978.
5[E 2] J. Elton, Sign-embeddings of ℓ 1 n superscript subscript ℓ 1 𝑛 \ell_{1}^{n} , Trans. Amer. Math. Soc. 279 (1983), 113–124.
6[G] O. Goldreich (editor), Property Testing: Current Research and Surveys , Lecture Notes in Computer Science, Vol. 6390, Springer, 2010.
7[L] L. Lovász, Large Networks and Graph Limits , American Mathematical Society Colloquium Publications, Vol. 60, American Mathematical Society, 2012.
8[Pa] A. Pajor, Sous espaces ℓ 1 n subscript superscript ℓ 𝑛 1 \ell^{n}_{1} des espaces de Banach , Travaux en cours 16, Herman, Paris, 1985.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Subgaussianity is hereditarily determined

Abstract.

1. Introduction

1.1. Subgaussianity

1.2. The problem

1.3. Examples

1.4. The main result

Theorem 1.1**.**

1.5. Sharpness of the probability

1.6. Related results/Outline of the argument

Remark 1.2*.*

1.7. Structure of the paper

Acknowledgments

2. Background material

2.1.

2.2.

2.3.

2.4.

2.5. Properties of subgaussian random variables

Proposition 2.1**.**

2.6. Hoeffding’s inequality and the bounded differences inequality

Proposition 2.2**.**

Proposition 2.3**.**

3. Proof of Theorem 1.1: part (2)

Proposition 3.1**.**

Remark 3.2*.*

Fact 3.3**.**

Proof.

Lemma 3.4**.**

Proof.

Proof of Proposition 3.1.

4. Proof of Theorem 1.1: part (1)

4.1.

Theorem 4.1**.**

Example 4.2*.*

4.2. A large deviation inequality for the ψ2\psi_{2}ψ2​-norm

Proposition 4.3**.**

Corollary 4.4**.**

Proof.

Corollary 4.5**.**

4.3. Proof of Proposition 4.3

Lemma 4.6**.**

Sublemma 4.7**.**

Proof.

Proof of Lemma 4.6.

4.4. The main dichotomy

Proposition 4.8**.**

Remark 4.9*.*

Remark 4.10*.*

Proof of Proposition 4.8.

Corollary 4.11**.**

Proof.

4.5. Proof of Theorem 4.1

Remark 4.12*.*

5. Comments

5.1. Extension to non-linear functions

5.1.1.

5.1.2.

5.1.3.

5.1.4.

5.1.5.

Theorem 5.1**.**

Example 5.2*.*

5.2. Extension to partially subgaussian random vectors

5.2.1.

Proposition 5.3**.**

5.2.2.

5.3. Extension to not necessarily bounded random vectors

Appendix A Proof of Theorem 5.1

A.1. Preliminary tools

Fact A.1**.**

Proof.

Fact A.2**.**

Proof.

Theorem 1.1.

*Remark 1.2**.*

Proposition 2.1.

Proposition 2.2.

Proposition 2.3.

Proposition 3.1.

*Remark 3.2**.*

Fact 3.3.

Lemma 3.4.

Theorem 4.1.

*Example 4.2**.*

4.2. A large deviation inequality for the $\psi_{2}$ -norm

Proposition 4.3.

Corollary 4.4.

Corollary 4.5.

Lemma 4.6.

Sublemma 4.7.

Proposition 4.8.

*Remark 4.9**.*

*Remark 4.10**.*

Corollary 4.11.

*Remark 4.12**.*

Theorem 5.1.

*Example 5.2**.*

Proposition 5.3.

Fact A.1.

Fact A.2.

Lemma A.3.

Proposition A.4.

Corollary A.5.

Corollary A.6.

Theorem A.7.

*Remark A.8**.*