The variance of the $\ell_p^n$-norm of the Gaussian vector, and   Dvoretzky's theorem

Anna Lytova; Konstantin Tikhomirov

arXiv:1705.05052·math.FA·July 11, 2017

The variance of the $\ell_p^n$-norm of the Gaussian vector, and Dvoretzky's theorem

Anna Lytova, Konstantin Tikhomirov

PDF

TL;DR

This paper provides a complete characterization of the variance of the $ ext{l}_p^n$-norm of Gaussian vectors for all $p$, revealing two transition points and implications for Dvoretzky's theorem.

Contribution

It fully determines the variance of the $ ext{l}_p^n$-norm of Gaussian vectors across all $p$, including the logarithmic regime, and identifies two key transition points.

Findings

01

Variance behavior changes at two transition points in $p$.

02

Complete characterization of variance for all $p$ in relation to Gaussian vectors.

03

Implications for random Dvoretzky's theorem in $ ext{l}_p^n$ spaces.

Abstract

Let $n$ be a large integer, and let $G$ be the standard Gaussian vector in $R^{n}$ . Paouris, Valettas and Zinn (2015) showed that for all $p \in [1, c lo g n]$ , the variance of the $ℓ_{p}^{n}$ --norm of $G$ is equivalent, up to a constant multiple, to $\frac{2 ^{p}}{p} n^{2/ p - 1}$ , and for $p \in [C lo g n, \infty]$ , $V a r ∥ G ∥_{p} ≃ (lo g n)^{- 1}$ . Here, $C, c > 0$ are universal constants. That result left open the question of estimating the variance for $p$ logarithmic in $n$ . In this note, we resolve the question by providing a complete characterization of $V a r ∥ G ∥_{p}$ for all $p$ . We show that there exist two transition points (windows) in which behavior of $V a r ∥ G ∥_{p}$ , viewed as a function of $p$ , significantly changes. We also discuss some implications of our result in context of random Dvoretzky's theorem for $ℓ_{p}^{n}$ .

Equations412

Var (∥ G ∥_{p}) ≃ \frac{2 ^{p}}{p} n^{2/ p - 1};

Var (∥ G ∥_{p}) ≃ \frac{2 ^{p}}{p} n^{2/ p - 1};

{\mathbf{Var}}(\|G\|_{p})\simeq\frac{\exp\big{(}-\frac{p}{2e}n^{2/p}+\log n\big{)}}{\sqrt{\log n}\,\,\big{(}\sqrt{\log n}+p-\frac{2\log n}{\log(2e)}\big{)}};

{\mathbf{Var}}(\|G\|_{p})\simeq\frac{\exp\big{(}-\frac{p}{2e}n^{2/p}+\log n\big{)}}{\sqrt{\log n}\,\,\big{(}\sqrt{\log n}+p-\frac{2\log n}{\log(2e)}\big{)}};

{\mathbf{Var}}(\|G\|_{p})\simeq\frac{1}{\log n}\Big{(}1-\frac{\xi^{2}-\xi}{p}\Big{)}.

{\mathbf{Var}}(\|G\|_{p})\simeq\frac{1}{\log n}\Big{(}1-\frac{\xi^{2}-\xi}{p}\Big{)}.

{\mathbf{P}}\big{\{}E\mbox{ is $(1+\varepsilon)$--spherical subspace of }\ell_{(2-\delta)\log n}^{n}\big{\}}\geq 1-n^{-w_{\delta}}.

{\mathbf{P}}\big{\{}E\mbox{ is $(1+\varepsilon)$--spherical subspace of }\ell_{(2-\delta)\log n}^{n}\big{\}}\geq 1-n^{-w_{\delta}}.

{\mathbf{P}}\big{\{}E\mbox{ is not $(1+\frac{w_{\delta}}{\log n})$--spherical subspace of }\ell_{(2+\delta)\log n}^{n}\big{\}}\geq w_{\delta}.

{\mathbf{P}}\big{\{}E\mbox{ is not $(1+\frac{w_{\delta}}{\log n})$--spherical subspace of }\ell_{(2+\delta)\log n}^{n}\big{\}}\geq w_{\delta}.

f(G):=\bigg{(}\sum_{i=1}^{n}\min(M,|g_{i}|)^{p}\bigg{)}^{1/p},

f(G):=\bigg{(}\sum_{i=1}^{n}\min(M,|g_{i}|)^{p}\bigg{)}^{1/p},

\|x\|_{p}:=\Big{(}\sum_{i=1}^{n}|x_{i}|^{p}\Big{)}^{1/p}.

\|x\|_{p}:=\Big{(}\sum_{i=1}^{n}|x_{i}|^{p}\Big{)}^{1/p}.

∥ x ∥_{q} \leq ∥ x ∥_{p} \leq n^{1/ p - 1/ q} ∥ x ∥_{q}, 1 \leq p \leq q \leq \infty.

∥ x ∥_{q} \leq ∥ x ∥_{p} \leq n^{1/ p - 1/ q} ∥ x ∥_{q}, 1 \leq p \leq q \leq \infty.

\sqrt{\frac{2}{\pi}}\Big{(}\frac{1}{t}-\frac{1}{t^{3}}\Big{)}e^{-t^{2}/2}<{\mathbf{P}}\big{\{}|g|\geq t\big{\}}<\sqrt{\frac{2}{\pi}}\frac{1}{t}e^{-t^{2}/2},\quad\quad t>0.

\sqrt{\frac{2}{\pi}}\Big{(}\frac{1}{t}-\frac{1}{t^{3}}\Big{)}e^{-t^{2}/2}<{\mathbf{P}}\big{\{}|g|\geq t\big{\}}<\sqrt{\frac{2}{\pi}}\frac{1}{t}e^{-t^{2}/2},\quad\quad t>0.

{\mathbf{E}}|g|^{p}=\frac{1}{\sqrt{\pi}}2^{p/2}\,\Gamma\Big{(}\frac{p+1}{2}\Big{)},\quad\quad p>-1.

{\mathbf{E}}|g|^{p}=\frac{1}{\sqrt{\pi}}2^{p/2}\,\Gamma\Big{(}\frac{p+1}{2}\Big{)},\quad\quad p>-1.

{\mathbf{Var}}\big{(}f(G)\big{)}\leq C\sum\limits_{i=1}^{n}\frac{{\mathbf{E}}|\partial_{i}f(G)|^{2}}{1+\log\big{(}\sqrt{{\mathbf{E}}|\partial_{i}f(G)|^{2}}/{\mathbf{E}}|\partial_{i}f(G)|\big{)}},

{\mathbf{Var}}\big{(}f(G)\big{)}\leq C\sum\limits_{i=1}^{n}\frac{{\mathbf{E}}|\partial_{i}f(G)|^{2}}{1+\log\big{(}\sqrt{{\mathbf{E}}|\partial_{i}f(G)|^{2}}/{\mathbf{E}}|\partial_{i}f(G)|\big{)}},

\displaystyle x_{\ell}:=\min\big{\{}y\in[0,x_{\max}]:\,f(y)\geq f(x_{\max})/2\big{\}},

\displaystyle x_{\ell}:=\min\big{\{}y\in[0,x_{\max}]:\,f(y)\geq f(x_{\max})/2\big{\}},

\displaystyle x_{r}:=\max\big{\{}y\in[x_{\max},a]:\,f(y)\geq f(x_{\max})/2\big{\}}.

\frac{1}{2} (x_{r} - x_{ℓ}) f (x_{m a x}) \leq \int_{0}^{a} f (x) d x \leq 2 (x_{r} - x_{ℓ}) f (x_{m a x}) .

\frac{1}{2} (x_{r} - x_{ℓ}) f (x_{m a x}) \leq \int_{0}^{a} f (x) d x \leq 2 (x_{r} - x_{ℓ}) f (x_{m a x}) .

(q / e)^{q /2} ≲ \int_{0}^{a} x^{q} e^{- x^{2} /2} d x ≲ (q / e)^{q /2};

(q / e)^{q /2} ≲ \int_{0}^{a} x^{q} e^{- x^{2} /2} d x ≲ (q / e)^{q /2};

\frac{a ^{q + 1} e ^{- a^{2} /2}}{a + q - a ^{2}} ≲ \int_{0}^{a} x^{q} e^{- x^{2} /2} d x ≲ \frac{a ^{q + 1} e ^{- a^{2} /2}}{a + q - a ^{2}} .

\frac{a ^{q + 1} e ^{- a^{2} /2}}{a + q - a ^{2}} ≲ \int_{0}^{a} x^{q} e^{- x^{2} /2} d x ≲ \frac{a ^{q + 1} e ^{- a^{2} /2}}{a + q - a ^{2}} .

lo g (z + 1) + \frac{lo g 4}{q} = z, z = x^{2} / q - 1.

lo g (z + 1) + \frac{lo g 4}{q} = z, z = x^{2} / q - 1.

z ≃ \pm \frac{1}{q},

z ≃ \pm \frac{1}{q},

x_{r}^{2} - q ≃ q, x_{ℓ}^{2} - q ≃ - q .

x_{r}^{2} - q ≃ q, x_{ℓ}^{2} - q ≃ - q .

\frac{c _{1} a}{a + q - a ^{2}} \leq a - x_{ℓ} \leq \frac{c _{2} a}{a + q - a ^{2}},

\frac{c _{1} a}{a + q - a ^{2}} \leq a - x_{ℓ} \leq \frac{c _{2} a}{a + q - a ^{2}},

lo g f (x)

lo g f (x)

\displaystyle=q\log a+q\log\Big{(}1-\frac{a-x}{a}\Big{)}-\frac{a^{2}}{2}+a(a-x)-\frac{(a-x)^{2}}{2}

\displaystyle\geq\log f(a)+q\Big{(}-\frac{a-x}{a}-\frac{(a-x)^{2}}{a^{2}}\Big{)}+a(a-x)-\frac{(a-x)^{2}}{2}

\geq lo g f (a) - \frac{c ( q - a ^{2} )}{a + q - a ^{2}} - \frac{c ^{2} q}{( a + q - a ^{2} ) ^{2}} - \frac{1}{2} c^{2}

> lo g f (a) - lo g 2,

lo g f (x)

lo g f (x)

\leq lo g f (a) - \frac{( q - a ^{2} ) ( a - x )}{a} - \frac{( a - x ) ^{2}}{2}

= lo g f (a) - \frac{C ( q - a ^{2} )}{a + q - a ^{2}} - \frac{C ^{2}}{2} \frac{a ^{2}}{( a + q - a ^{2} ) ^{2}} .

(q/e)^{q/2}\lesssim{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\lesssim(q/e)^{q/2};

(q/e)^{q/2}\lesssim{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\lesssim(q/e)^{q/2};

\frac{a^{q+1}e^{-a^{2}/2}}{a+q-a^{2}}\lesssim{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\lesssim\frac{qa^{q-1}e^{-a^{2}/2}}{a+q-a^{2}}.

\frac{a^{q+1}e^{-a^{2}/2}}{a+q-a^{2}}\lesssim{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\lesssim\frac{qa^{q-1}e^{-a^{2}/2}}{a+q-a^{2}}.

\frac{a^{q+1}e^{-a^{2}/2}}{a+q-a^{2}}\lesssim{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\lesssim\frac{1}{\tau}\frac{a^{q+1}e^{-a^{2}/2}}{a+q-a^{2}}.

\frac{a^{q+1}e^{-a^{2}/2}}{a+q-a^{2}}\lesssim{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\lesssim\frac{1}{\tau}\frac{a^{q+1}e^{-a^{2}/2}}{a+q-a^{2}}.

\displaystyle{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}=\sqrt{2/\pi}\int_{0}^{a}t^{q}e^{-t^{2}/2}\,dt\quad\mbox{and}\quad

\displaystyle{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}=\sqrt{2/\pi}\int_{0}^{a}t^{q}e^{-t^{2}/2}\,dt\quad\mbox{and}\quad

\displaystyle{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}=\sqrt{2/\pi}\int_{0}^{a}t^{q}e^{-t^{2}/2}\,dt+\sqrt{2/\pi}a^{q}\int_{a}^{\infty}e^{-t^{2}/2}\,dt,

{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\leq{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}+\sqrt{2/\pi}a^{q-1}e^{-a^{2}/2}.

{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}\leq{\mathbf{E}}\min\big{(}|g|,a\big{)}^{q}\leq{\mathbf{E}}\big{(}|g|\chi_{\{|g|\leq a\}}\big{)}^{q}+\sqrt{2/\pi}a^{q-1}e^{-a^{2}/2}.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The variance of the $\ell_{p}^{n}$ –norm of the Gaussian vector, and Dvoretzky’s theorem

Anna Lytova111University of Opole, Poland; email: [email protected]. A significant part of this work was done when A.L. was visiting Princeton University in January–February, 2017 and Konstantin Tikhomirov222Princeton University, NJ; email: [email protected]. The research is partially supported by the Simons Foundation.

Abstract

Let $n$ be a large integer, and let $G$ be the standard Gaussian vector in ${\mathbb{R}}^{n}$ . Paouris, Valettas and Zinn (2015) showed that for all $p\in[1,c\log n]$ , the variance of the $\ell_{p}^{n}$ –norm of $G$ is equivalent, up to a constant multiple, to $\frac{2^{p}}{p}n^{2/p-1}$ , and for $p\in[C\log n,\infty]$ , ${\mathbf{Var}}\|G\|_{p}\simeq(\log n)^{-1}$ . Here, $C,c>0$ are universal constants. That result left open the question of estimating the variance for $p$ logarithmic in $n$ . In this note, we resolve the question by providing a complete characterization of ${\mathbf{Var}}\|G\|_{p}$ for all $p$ . We show that there exist two transition points (windows) in which behavior of ${\mathbf{Var}}\|G\|_{p}$ significantly changes. We also discuss some implications of our result in context of random Dvoretzky’s theorem for $\ell_{p}^{n}$ .

MSC 2010: 46B06, 46B09, 52A21, 60E15, 60G15

Keywords and phrases: $\ell_{p}^{n}$ spaces, variance of $\ell_{p}$ norm, Dvoretzky’s theorem, order statistics

1 Introduction

Let $n$ be a large integer, $p$ be a number in $[1,\infty]$ , and denote by $\|\cdot\|_{p}$ the standard $\ell_{p}^{n}$ –norm in ${\mathbb{R}}^{n}$ . Let $G$ be the standard $n$ -dimensional Gaussian vector. Variance of the $\|\cdot\|_{p}$ –norm of $G$ may serve as a basic example of the concentration of measure phenomenon (most of the Gaussian mass is located in a thin shell of an appropriately rescaled $\ell_{p}^{n}$ –ball). It is well known that for a fixed $p<\infty$ , ${\mathbf{Var}}\|G\|_{p}\simeq v_{p}n^{2/p-1}$ , where the quantity $v_{p}$ depends only on $p$ and not on $n$ (see, in particular, [17] and [21]), whereas the variance of the $\|\cdot\|_{\infty}$ –norm of $G$ is of order $(\log n)^{-1}$ (see, for example, [4, p. 47–48] and [21]). At the same time, for $p$ growing to infinity with $n$ , no sharp results were available until quite recently. In [21], Paouris, Valettas and Zinn showed that ${\mathbf{Var}}\|G\|_{p}\simeq\frac{2^{p}}{p}n^{2/p-1}$ for $p\leq c\log n$ and ${\mathbf{Var}}\|G\|_{p}\simeq(\log n)^{-1}$ for $p\geq C\log n$ ( $C,c>0$ being universal constants). This result of [21] leaves the gap $c\log n\leq p\leq C\log n$ in which the behavior of the variance was not clarified. The authors of [21] conjectured that the variance changes from polynomially small in $n$ to logarithmic around $\widetilde{p}=\log_{2}(n)$ . This conjecture was the starting point of our work.

The question of computing the Gaussian variance of the $\ell_{p}^{n}$ –norm seems natural on its own right; nevertheless, it gains more sense in the context of asymptotic geometric analysis. Since the fundamental discovery of Milman [13], it is known that Gaussian concentration properties of a norm $\|\cdot\|$ in ${\mathbb{R}}^{n}$ are strongly connected with geometry of random subspaces of $({\mathbb{R}}^{n},\|\cdot\|)$ . The classical theorem of Dvoretzky [8] asserts that every infinite-dimensional Banach space contains finite subspaces of arbitrarily large dimension which are arbitrarily close to Euclidean (in the Banach–Mazur metric). Milman showed in [13] that a stronger result takes place. Given a norm $\|\cdot\|$ in ${\mathbb{R}}^{n}$ , a subspace $E\subset{\mathbb{R}}^{n}$ and a real number $K\geq 1$ , we will (rather, unconventionally) call the subspace $K$ -spherical if $\sup_{x\in E,\|x\|_{2}=1}\|x\|/\inf_{x\in E,\|x\|_{2}=1}\|x\|\leq K$ . The theorem of Milman states that for any norm $\|\cdot\|$ in ${\mathbb{R}}^{n}$ with the Lipschitz constant $L$ and any $\varepsilon\in(0,1/2)$ , the random $\frac{c\varepsilon^{2}}{\log(1/\varepsilon)}\big{(}\frac{{\mathbf{E}}\|G\|}{L}\big{)}^{2}$ –dimensional subspace of $({\mathbb{R}}^{n},\|\cdot\|)$ with uniform (rotation-invariant) distribution is $(1+\varepsilon)$ –spherical with probability close to $1$ . In particular, the Dvoretzky–Rogers lemma implies that for any norm $\|\cdot\|$ with the unit ball in John’s position, the random $\frac{c\varepsilon^{2}\log n}{\log(1/\varepsilon)}$ –dimensional subspace is $(1+\varepsilon)$ –spherical with large probability. We refer to monographs and surveys [14, 22, 26, 1] for more information as well as to papers [21, 18, 19, 29, 20] for some recent developments of the subject. In this text, we leave out any discussion of the existential Dvoretzky theorem which is concerned with finding at least one large almost Euclidean subspace (the best known general result in this direction is due to Schechtman [24]) as well as the isomorphic Dvoretzky theorem which deals with the regime when distortion $\varepsilon$ grows to infinity with $n$ (see, in particular, [15]).

In the regime of “constant distortion” (say, when $1+\varepsilon=2$ ) the result of Milman is sharp, that is, if a random $k$ -dimensional subspace is $2$ –spherical with high probability then necessarily $k\leq C\big{(}\frac{{\mathbf{E}}\|G\|}{L}\big{)}^{2}$ (see Milman–Schechtman [16] and Huang–Wei [11] for reverse estimates matching Milman’s bound). However, when $\varepsilon$ tends to zero with $n\to\infty$ , the original estimate is suboptimal. Gordon [10] and later Schechtman [23] improved the dependence on $\varepsilon$ from $\frac{\varepsilon^{2}}{\log(1/\varepsilon)}$ to $\varepsilon^{2}$ , which is sharp for some norms but not in general. For example, it was shown in [25] and [28] that a random $k$ –dimensional subspace of $\ell_{\infty}^{n}$ is $(1+\varepsilon)$ –spherical with probability close to one if and only if $k\lesssim\frac{\varepsilon\log n}{\log(1/\varepsilon)}$ . Moreover, for $1$ -unconditional norms in the $\ell$ -position, it was proved in [29] that random $\frac{c\varepsilon\log n}{\log(1/\varepsilon)}$ –dimensional subspaces are $(1+\varepsilon)$ –spherical with high probability. For arbitrary norms, the problem of interdependence between $\varepsilon$ and the dimension in the random Dvoretzky theorem is wide open, and even in the class of $\ell_{p}^{n}$ –spaces there is no complete solution as of this writing.

A considerable progress in estimating the distortion (in the “almost isometric” regime) of uniform random subspaces of $\ell_{p}^{n}$ for all $p$ was due to Naor [17] and Paouris, Valettas and Zinn [21]. For a fixed $2<p<\infty$ , Naor [17] obtained concentration inequalities which, in particular, can be employed to show that random $u_{n,p}(\varepsilon n)^{2/p}$ –dimensional sections of the $\ell_{p}^{n}$ –ball are $(1+\varepsilon)$ –spherical with probability close to one whenever $\varepsilon\geq n^{-v_{p}}$ (where $v_{p}>0$ depends only on $p$ and $u_{n,p}$ is a quantity of order polylogarithmic in $n$ arising from the application of the covering argument). The bound $w_{p}(\varepsilon n)^{2/p}$ on the dimension of typical $(1+\varepsilon)$ -Euclidean subspaces of $\ell_{p}^{n}$ (for $w_{p}>0$ depending only on $p>2$ ) was confirmed by Paouris, Valettas and Zinn [21] in the range $\varepsilon\geq n^{-v_{p}}$ , and it was shown that for a fixed $p$ the estimate is close to optimal. The paper [21] provides bounds (upper and lower) for the Dvoretzky dimension, as well as concentration inequalities for the standard Gaussian vector and the Gaussian variance in different regimes giving an emphasis to the case when $p$ grows with $n$ . However, for $p$ logarithmic in $n$ , the results are not sharp.

In the context of Dvoretzky’s theorem, the $\ell_{p}^{n}$ –spaces for logarithmic $p$ supply rather interesting geometric examples. As was observed in [21], there are universal constants $c,C>0$ such that, say, ${\mathbf{Var}}\|G\|_{c\log n}\leq n^{-1/2}$ , whereas ${\mathbf{Var}}\|G\|_{C\log n}\gtrsim\frac{1}{\log n}$ ; thus, the variance can be quite sensitive to replacing a norm with an equivalent norm. Note that the bounds for the variance immediately imply that, for example, the random $3$ -dimensional subspace of $\ell_{c\log n}^{n}$ is $(1+n^{-c})$ –spherical with probability at least $1-n^{-c}$ for a universal constant $c>0$ (and instead of $3$ we can take any constant dimension). At the same time, most of $3$ –dimensional subspaces of $\ell_{\infty}^{n}$ (which is a constant Banach–Mazur distance away from $\ell_{c\log n}^{n}$ ) are not even $(1+\frac{1}{\log n})$ –spherical [28]. The result of [21] leaves open the question whether there is a “phase transition” point $\widetilde{p}=\widetilde{p}(n)$ such that for any $\delta>0$ and all sufficiently large $n$ we have ${\mathbf{Var}}\|G\|_{(1-\delta)\widetilde{p}}\leq n^{-v_{\delta}}$ and ${\mathbf{Var}}\|G\|_{(1+\delta)\widetilde{p}}\geq\frac{v_{\delta}}{\log n}$ , where $v_{\delta}>0$ depends only on $\delta$ . Our result answers this question and completely settles the problem of computing the Gaussian variance of $\|\cdot\|_{p}$ –norms. Below, for any two quantities $a,b$ we write “ $a\simeq b$ ” if $C^{-1}a\leq b\leq Ca$ for a universal constant $C>0$ .

Theorem A.

There is a universal constant $n_{0}>0$ with the following property. Let $n\geq n_{0}$ and let $G$ be the standard Gaussian vector in ${\mathbb{R}}^{n}$ . Further, denote by $\xi$ the quantile of order $1-\frac{1}{n}$ with respect to the distribution of the absolute value of a standard Gaussian variable $|g|$ , i.e. such that ${\mathbf{P}}\{|g|\leq\xi\}=1-\frac{1}{n}$ . Then

•

For all $p$ in the range $1\leq p\leq\frac{2\log n}{\log(2e)}$ we have

[TABLE]

•

For $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ we have

[TABLE]

•

For $p\geq\xi^{2}$ we have

[TABLE]

Everywhere in this note, “ $\log$ ” stands for the natural logarithm. As we mentioned before, the above estimates in the regimes $p\leq c\log n$ and $p\geq C\log n$ were previously derived in [21]. The variance of $\|G\|_{p}$ , the way we represent it, is a piece-wise function, with the pieces equivalent at respective boundary points. The points $\frac{2\log n}{\log(2e)}$ and $\xi^{2}=2\log n-o(\sqrt{\log n})$ (see (8)) are chosen rather arbitrarily in a sense that each one can be shifted to the right or to the left by a small constant multiple of $\sqrt{\log n}$ , which would change the estimates only by a multiplicative constant. In this connection, we prefer to speak about “transition windows” rather than “transition points”.

To have a better picture of how the variance changes with $p$ , it may be useful to consider its logarithm $\log{\mathbf{Var}}\|G\|_{p}$ in the range $c\log n\leq p<\infty$ for some fixed small constant $c$ , so that the term $n^{2/p}$ is bounded. For $c\log n\leq p\leq\frac{2\log n}{\log(2e)}$ , $\log{\mathbf{Var}}\|G\|_{p}$ is an almost linear function of $p$ . In the range $p\geq\xi^{2}$ , $\log{\mathbf{Var}}\|G\|_{p}$ is essentially of order $-\log\log n$ (up to a bounded multiple). In the intermediate regime $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ , disregarding additive terms double logarithmic in $n$ , $\log{\mathbf{Var}}\|G\|_{p}$ behaves as $-\frac{p}{2e}n^{2/p}+\log n$ , which is a convex function close to parabola $-\frac{(2\log n-p)^{2}}{4\log n}$ near the point $2\log n$ .

Our result implies, in particular, that for any fixed $\delta\in(0,1)$ and all sufficiently large $n$ , we have ${\mathbf{Var}}\|G\|_{(2-\delta)\log n}\leq n^{-v_{\delta}}$ whereas ${\mathbf{Var}}\|G\|_{(2+\delta)\log n}\geq\frac{v_{\delta}}{\log n}$ for some $v_{\delta}>0$ depending only on $\delta$ . Observe that the Banach–Mazur distance between $\ell_{(2-\delta)\log n}^{n}$ and $\ell_{(2+\delta)\log n}^{n}$ is of order $1+O(\delta)$ , so the “power of $n$ to logarithmic” transition happens at an almost isometric scale. In the context of the random Dvoretzky theorem, this implies

Corollary B.

For any $\delta\in(0,1)$ , there are $w_{\delta},n_{\delta}>0$ depending on $\delta$ with the following property. For any $\varepsilon\in(0,1)$ and $n\geq n_{\delta}$ , let $2\leq k\leq\lceil w_{\delta}\log n/\log(2/\varepsilon)\rceil$ , and let $E$ be a uniformly distributed random $k$ -dimensional subspace of ${\mathbb{R}}^{n}$ . Then

[TABLE]

At the same time,

[TABLE]

Our result highlights an interesting characteristic of the order statistics of the standard Gaussian vector $G=(g_{1},g_{2},\dots,g_{n})$ . Let $(g_{1}^{*},g_{2}^{*},\dots,g_{n}^{*})$ be the non-increasing rearrangement of the vector of absolute values $(|g_{1}|,|g_{2}|,\dots,|g_{n}|)$ . Then Chernoff–type estimates imply that order statistics $g_{i}^{*}$ for $i$ relatively large, say, at least a positive constant power of $n$ , are strongly concentrated, so that their typical fluctuations are small (at most a negative constant power of $n$ ). Thus, the large (logarithmic in $n$ ) fluctuation of $\|G\|_{p}$ for $p\geq(2+\delta)\log n$ is due to the fact that the $p$ -th powers of the first few order statistics comprise a relatively large portion of the sum $\|G\|_{p}^{p}=\sum_{i=1}^{n}(g_{i}^{*})^{p}$ with a significant probability, whereas for $p\leq(2-\delta)\log n$ the $p$ -th powers of the first order statistics are typically hugely dominated by the total sum $\|G\|_{p}^{p}$ .

Our technique of proving Theorem A is in certain aspects similar to [21]. As in [21], a crucial role in our argument is played by Talagrand’s $L_{1}-L_{2}$ bound (see Theorem 2.1 in the next section), which allows to get sharper estimates for the variance than the Poincaré inequality. Another important step, also presented in [21], consists in obtaining strong upper bounds for negative moments of $\ell_{p}^{n}$ –norms. Our approach to bounding the moments is completely different from the one used in [21], as, instead of relying on general Gaussian inequalities, we employ a rather elementary but efficient technique involving lower deviation estimates for the order statistics of random vectors. This allows us to get strong estimates including the case $p\approx 2\log n$ i.e. in the range not treated in [21]. A principal new ingredient to our proof, compared to [21], is the use of truncated Gaussians. For a number $M>0$ , we consider an auxiliary function

[TABLE]

and use a trivial inequality ${\mathbf{Var}}\|G\|_{p}\leq 2{\mathbf{E}}(\|G\|_{p}-f(G))^{2}+2{\mathbf{Var}}f(G)$ . It turns out that, for a carefully chosen truncation level $M$ (the right choice is not straightforward), both terms in the last inequality can be estimated in an optimal way by combining Talagrand’s $L_{1}-L_{2}$ theorem with rather elementary probabilistic arguments and bounds for truncated moments of Gaussian variables. The truncation technique is also used to obtain matching lower bounds for the variance. We will discuss this approach in more detail at the beginning of Section 4.

The organization of the rest of the paper is as follows. In Section 2 we discuss notation and state several facts important for our work as well as provide a detailed derivation of upper and lower bounds for truncated Gaussian moments (Section 2.1) and lower deviation estimates for Gaussian order statistics, using Chernoff’s inequality (Section 2.2). In Section 3 we provide upper bounds for the negative moments of $\ell_{p}^{n}$ –norms in terms of quantiles of the Gaussian distribution. In Section 4 we obtain upper bounds for the variance, and in Section 5 derive matching lower bounds. For reader’s convenience, we give a proof of Corollary B in Section 6.

2 Preliminaries

Let us start with notation and some basic facts that will be useful for us. The canonical inner product in ${\mathbb{R}}^{n}$ is denoted by $\langle\cdot,\cdot\rangle$ . Given a vector $x=(x_{1},x_{2}\dots,x_{n})\in{\mathbb{R}}^{n}$ and a real number $1\leq p<\infty$ , the standard $\ell_{p}^{n}$ –norm of $x$ is defined as

[TABLE]

Additionally, the $\ell_{\infty}^{n}$ –norm $\|x\|_{\infty}:=\max_{i\leq n}|x_{i}|$ . The following relation is true for any $x\in{\mathbb{R}}^{n}$ :

[TABLE]

Given a real number $t$ , $\lfloor t\rfloor$ is the largest integer not exceeding $t$ . Universal constants are denoted by $C,c,\widetilde{c}$ , etc. and their value may be different on different occasions. Given two quantities $a$ and $b$ , we write $a\simeq b$ whenever there is a universal constant $C\neq 0$ with $C^{-1}a\leq b\leq Ca$ . Further, for two non-negative quantities $a,b$ we write $a\lesssim b$ ( $a\gtrsim b$ ) if there is a universal constant $C>0$ with $a\leq Cb$ (respectively, $Ca\geq b$ ). Sometimes it will be convenient for us to write the relation $a\lesssim b$ as $a=O(b)$ .

The expectation of a random variable $Z$ will be denoted by ${\mathbf{E}}Z$ , the variance — by ${\mathbf{Var}}Z$ , and the median — by ${\mathbf{Med}}Z$ . Given an event ${\mathcal{E}}$ , by $\chi_{{\mathcal{E}}}$ we denote the indicator function of ${\mathcal{E}}$ . Throughout the text, standard Gaussian variables will be denoted by $g,g_{1},g_{2},\dots$ and the standard Gaussian vector in ${\mathbb{R}}^{n}$ — by $G$ . It is well known (see, for example, [9, Chapter 7]) that the Gaussian distribution satisfies the relations

[TABLE]

The absolute moments of a standard Gaussian variable are given by

[TABLE]

The next theorem is a basis for our analysis; its “discrete” version was proved by M. Talagrand in [27].

Theorem 2.1 (Talagrand’s $L_{1}\text{--}L_{2}$ bound; see

[6], [4, Chapter 5]).

Suppose $f$ is an absolutely continuous function in ${\mathbb{R}}^{n}$ and let $\partial_{i}f$ ( $i\leq n$ ) be the partial derivatives of $f$ . Then we have

[TABLE]

where $C>0$ is a universal constant.

2.1 Bounds for truncated moments of Gaussian variables

In this subsection, we derive rather elementary upper and lower bounds for high moments of random variables of the form $|g|\chi_{\{|g|\leq a\}}$ and $\min(|g|,a)$ for a fixed $a>0$ . The results presented here are by no means new, but may be hard to locate in literature. For reader’s convenience, we provide proofs.

Let us start with a simple calculus lemma.

Lemma 2.2.

Fix $0<a<\infty$ . Let $f$ be a positive log-concave function on $[0,a]$ , and let $x_{\max}\in[0,a]$ be a point of global maximum for $f$ . Define

[TABLE]

Then

[TABLE]

As a consequence of the above statement, we get

Lemma 2.3.

Let $q,a\geq 1$ be some real numbers.

•

If $q\leq a^{2}$ then

[TABLE]

•

If $q\geq a^{2}$ then

[TABLE]

Proof.

It is easy to see that $\sqrt{q}$ is the point of global maximum of the log-concave function $f(x):=x^{q}e^{-x^{2}/2}$ on $[0,\infty)$ , and $f(\sqrt{q})=(q/e)^{q/2}$ . We will use the last lemma to evaluate the integrals.

The case $q\leq a^{2}$ . We can assume without loss of generality that $q$ is large (greater than a large absolute constant). To get the desired bound it is enough to show that $x_{r}-x_{\ell}\simeq 1$ , where $x_{r}>x_{\ell}$ are the two solutions of the equation $2f(x)=f(\sqrt{q})=(q/e)^{q/2}$ . We can rewrite the equation in the form

[TABLE]

Since $q$ is large, we can assume that all solutions of the last equation satisfy $z\in[-c,c]$ for a small constant $c>0$ . Then, using Taylor’s expansion for the logarithm, we obtain

[TABLE]

for the two solutions of (6). Hence,

[TABLE]

The result follows.

The case $q\geq a^{2}$ . Let $x_{\ell}$ , $x_{r}$ , $x_{\max}$ be defined as in Lemma 2.2. We have $x_{\max}=x_{r}=a$ , and $f(x_{\max})=f(a)=a^{q}\exp(-a^{2}/2)$ . To get the desired bounds, it suffices to show that there exist constants $c_{1}$ , $c_{2}$ such that

[TABLE]

where $2{x_{\ell}}^{q}\exp(-{x_{\ell}}^{2}/2)=f(a)$ , and then apply Lemma 2.2. We will rely on the fact that $f(x)$ is strictly increasing on $[0,a]$ .

For a sufficiently small universal constant $\widetilde{c}>0$ we have $\log(1-z)>-z-z^{2}$ , $|z|<\widetilde{c}$ . Hence, for $x:=a-\frac{\widetilde{c}a}{a+q-a^{2}}$ we obtain

[TABLE]

where in the last inequality we used the condition $q\geq a^{2}$ and the fact that $\widetilde{c}$ is small. Thus, $f(x)\geq\frac{1}{2}f(a)$ whence $x_{\ell}\leq a-\frac{\widetilde{c}a}{a+q-a^{2}}$ .

Now, choose $x:=a-\frac{\widetilde{C}a}{a+q-a^{2}}$ , where $\widetilde{C}>0$ is a large enough universal constant (say, $\widetilde{C}=10$ definitely suffices). If $x<0$ then obviously $x_{\ell}\geq x$ , and we are done. Otherwise, we use the trivial relation $\log(1-z)\leq-z$ , $z\in(-\infty,1)$ , to obtain

[TABLE]

When $q\leq a+a^{2}$ , the last term is less than $-\frac{1}{8}\widetilde{C}^{2}$ , whereas for $q\geq a+a^{2}$ , the second term is less than $-\frac{1}{2}\widetilde{C}$ . In any case, we get $f(x)\leq\frac{1}{2}f(a)$ , whence $x_{\ell}\geq a-\frac{\widetilde{C}a}{a+q-a^{2}}$ . The result follows. ∎

Corollary 2.4.

Let $q,a\geq 1$ be some real numbers.

(i) If $q\leq a^{2}$ then

[TABLE]

(ii) If $q\geq a^{2}$ then

[TABLE]

(iii) In particular, if $\tau q\leq a^{2}\leq q$ for some $\tau\in(0,1)$ , then

[TABLE]

Proof.

Since

[TABLE]

then, applying (4), we get

[TABLE]

This and the second part of Lemma 2.3 yield the assertion for $q\geq a^{2}$ and for $\tau q\leq a^{2}\leq q$ . The case $q\leq a^{2}$ follows from the first part of Lemma 2.3 and the fact that $\max_{\mathbb{R}}t^{q}e^{-t^{2}/2}=q^{q/2}e^{-q/2}$ . ∎

Remark 2.5.

The last statement asserts that, for $a^{2}\geq q$ , the $q$ -th moments of the truncated variables $\min(|g|,a)$ and $|g|\chi_{\{|g|\leq a\}}$ are equivalent, with a constant multiple, to the (not truncated) absolute moment ${\mathbf{E}}|g|^{q}$ (see 5).

2.2 Chernoff–type bounds for order statistics

Given any number $\alpha\in[0,1)$ , the quantile of order $\alpha$ with respect to the distribution of $|g|$ is the number $\xi_{\alpha}$ satisfying ${\mathbf{P}}\{|g|\leq\xi_{\alpha}\}=\alpha$ . It follows from (4) that

[TABLE]

Standard estimates for quantiles of the Gaussian distribution (see, for example, [7, p. 264]) imply that for $1\leq i\leq n/2$ we have

[TABLE]

Further, for the standard Gaussian vector $G=(g_{1},g_{2},\dots,g_{n})$ in ${\mathbb{R}}^{n}$ , the order statistics of $G$ , denoted by $g_{1}^{*},g_{2}^{*},\dots,g_{n}^{*}$ , are the non-increasing rearrangement of the vector of absolute values $(|g_{1}|,|g_{2}|,\dots,|g_{n}|)$ . Given $\beta\in(0,1)$ , we have

[TABLE]

It follows from Chernoff’s theorem for the partial binomial sums (see [5] for the original result, or [2, p. 24] as a modern reference) that for $i\leq\beta n$ we have

[TABLE]

Applying the relation $\log(1+t)\leq t-t^{2}/(2+2t)$ ( $t\geq 0$ ), we get

[TABLE]

The relation (9) allows to derive deviation inequalities for order statistics. Let us remark at this point that, although order statistics are systematically studied in literature (see classical book [7], or paper [3] as an example of recent developments), we were not able to locate results in a form convenient for us. For completeness, we provide proofs of next three lemmas.

Lemma 2.6 (Lower deviation for large order statistics).

There are universal constants $C,c>0$ with the following property. Assume that $n$ is large, and that $1\leq i\leq\sqrt{n}$ . Let $\frac{1}{\sqrt{\log n}}\leq u\leq 1-\frac{C}{\log n}$ . Then

[TABLE]

Proof.

Let $n,i,u$ satisfy the assumptions and let $s\in(0,1-i/n)$ be such that $\xi_{s}=u\,\xi_{1-i/n}$ . Observe that, in view of the lower bound on $u$ and the approximation formula (8), we have $\xi_{s}\gtrsim 1$ . Then, applying (7) twice, we get

[TABLE]

where, by (8), we have $\xi_{1-i/n}\simeq\sqrt{\log(n/i)}$ . Thus,

[TABLE]

The assumptions $u\leq 1-\frac{C}{\log n}$ and $i\leq\sqrt{n}$ imply that

[TABLE]

which is bigger than a large absolute constant if $C$ is large enough, whence $(1-s)n\gg i$ . Applying (9), we get

[TABLE]

It remains to reuse (10). ∎

Lemma 2.7 (Lower deviation for intermediate order statistics).

There is a universal constant $c>0$ with the following property. Let $n$ be large, let $i\leq n/2$ and $u\in(0,1)$ . Then

[TABLE]

Proof.

As in the proof of the above lemma, we let $s\in(0,1-i/n)$ be such that $\xi_{s}=u\,\xi_{1-i/n}$ . Denoting by $F$ the cdf of $|g|$ , we have

[TABLE]

whence, applying (7) and (8),

[TABLE]

and

[TABLE]

for a sufficiently small universal constant $\widetilde{c}>0$ . Finally, in view of (9),

[TABLE]

∎

The two lemmas above need to be complemented with the following crude bound for probability of very large deviations.

Lemma 2.8.

Let $u\geq 0$ and $i\leq n/2$ . Then

[TABLE]

Proof.

We have

[TABLE]

∎

3 Negative truncated moments of $\ell_{p}^{n}$ –norms

In this section, we derive upper bounds for expressions of the form

[TABLE]

where the numbers $q\geq 1$ and $L>0$ are such that $qL=O(\log n)$ , and $T$ is a truncation level which can take any value in the range $[\xi_{1-1/n},\infty]$ . In particular, for $T=\infty$ the above quantity is the $-Lq$ -th moment of the $\|\cdot\|_{q}$ –norm — ${\mathbf{E}}\|G\|_{q}^{-Lq}$ . Negative moments of arbitrary norms were considered in [12], where, in particular, bounds for quantities of the form $({\mathbf{E}}\|G\|^{-q})^{1/q}$ were derived for $q$ less than $d(\|\cdot\|)$ , the “lower Dvoretzky dimension” of a norm $\|\cdot\|$ . In [21], negative $r$ -th moments of $\|\cdot\|_{q}$ –norms were considered in the same context as our note; however, the relations derived in [21] (see, in particular [21, Lemma 3.6]) do not extend to the case when both $q$ and $r$ are greater than $\log n$ . Finally, let us mention a recent work [19] where a strong upper bound on $({\mathbf{E}}\|G\|^{-q})^{1/q}$ was obtained in terms of the positive moment ${\mathbf{E}}\|G\|$ and the variance ${\mathbf{Var}}\|G\|$ for any norm in ${\mathbb{R}}^{n}$ . On the other hand, applying this result of [19] would require extra care because of absence of a truncation level in the statement of [19], and the necessity to have precise lower bounds for ${\mathbf{E}}\|G\|_{q}$ . The approach we take here is relatively elementary and based on the Chernoff inequality which we used in Section 2.2.

We start with the following small ball probability estimate:

Lemma 3.1.

Let $n$ be a large integer, $G=(g_{1},g_{2},\dots,g_{n})$ be the standard Gaussian vector, and let $T\in[\xi_{1-1/n},\infty]$ and $q\geq 1$ . Then for any number $\tau\in(0,1/2)$ we have

[TABLE]

where $C^{\prime},c>0$ are universal constants.

Proof.

Obviously,

[TABLE]

so that for any $\tau\in(0,1/2)$ we have

[TABLE]

First, assume that $(2\tau)^{1/q}\leq 1-\frac{C}{\log n}$ , where the constant $C>0$ comes from Lemma 2.6. We will divide the above sum into two parts corresponding to large and “intermediate” order statistics. For every $i\leq\sqrt{n}$ , using the notation $r:={\max((2\tau)^{1/q},(\log n)^{-1/2})}^{2}$ , we get, in view of Lemmas 2.6 and 2.8,

[TABLE]

Further, for all $\sqrt{n}<i\leq n/2$ we have, by Lemmas 2.7 and 2.8,

[TABLE]

Combining the estimates (note that the first term in the first minimum form a geometric sum), we get

[TABLE]

Finally, observe that for $(2\tau)^{1/q}\geq 1-\frac{C}{\log n}$ , we have that $n^{(1-(2\tau)^{2/q})/4}$ is bounded from above by an absolute constant, so the last estimate is trivially satisfied as long as $C^{\prime}$ is chosen sufficiently large. ∎

As a consequence, we obtain

Proposition 3.2.

For any $K>0$ there are $n_{K},v_{K}>0$ depending only on $K$ with the following property. Let $n\geq n_{K}$ , let $q\geq 1$ and $0<L\leq K$ be such that $qL\leq K\log n$ , and let $g_{1},g_{2},\dots,g_{n}$ be i.i.d. standard Gaussians. Then for any $T\in[\xi_{1-1/n},\infty]$ we have

[TABLE]

Proof.

Fix admissible parameters $K,L,q,T$ . We will assume that $n$ is large. For any integer $m\geq 1$ , we have

[TABLE]

Applying Lemma 3.1, we obtain for all $m\geq 2L$ :

[TABLE]

In the range $2L\leq m\leq Lq$ , we have

[TABLE]

for a sufficiently small universal constant $c^{\prime\prime}>0$ . In particular, for all such $m$ the probability in (11) is bounded from above by $w_{K}4^{-m}$ , where $w_{K}>0$ may only depend on $K$ . Further, for $Lq<m\leq 10Lq\log n$ , we have

[TABLE]

Finally, for $m>10Lq\log n$ the probability in (11) is bounded by

[TABLE]

Combining the estimates, we get for $h:=(\sum\nolimits_{i=1}^{n}\min(|g_{i}|,T)^{q})^{-L}$ and $\zeta:=(\sum\nolimits_{i=1}^{n}{\xi^{q}_{1-i/n}})^{-L}$ ,

[TABLE]

Hence,

[TABLE]

and the result follows. ∎

Remark 3.3.

Note that for any $1\leq q\leq K\log n$ , we have

[TABLE]

where the symbol “ $\simeq_{K}$ ” means that the quantities are equivalent up to a multiple depending only on parameter $K>0$ . To see this, observe that for any $i\in\{1,2,\dots,n-1\}$ we have ${\mathbf{P}}\{\min(|g|,\xi_{1-1/n})\in(\xi_{1-i/n},\xi_{1-(i+1)/n}]\}=\frac{1}{n}$ , whence

[TABLE]

In remains to apply (8) to compare ${\xi^{q}_{1-1/n}}$ with the power of the second quantile ${\xi^{q}_{1-2/n}}$ .

4 Upper bounds for the variance

In this section we obtain upper bounds for ${\mathbf{Var}}\|G\|_{p}$ , $p\geq C$ . Before we proceed with the proofs, let us provide some motivation for the strategy we have taken. As we mentioned in the introduction, the basic tool for estimating the variance from above is Talagrand’s $L_{1}-L_{2}$ bound (Theorem 2.1). In [21], the theorem was directly applied to the norm $\|\cdot\|_{p}$ , which gives the estimate

[TABLE]

where $\partial_{i}\|G\|_{p}$ denotes the $i$ -th partial derivative of the norm (viewed as a function in ${\mathbb{R}}^{n}$ ) evaluated at $G$ . An elementary computation then leads to an equivalent inequality

[TABLE]

where $B=1+\log\big{(}\sqrt{{\mathbf{E}}|\partial_{i}\|G\|_{p}|^{2}}/{\mathbf{E}}|\partial_{i}\|G\|_{p}|\big{)}$ , and so $B$ can be at most logarithmic in $n$ . A natural approach to estimating the expectation in the last formula would be to remove $g_{1}$ from the denominator and use independence:

[TABLE]

However, this approach fails for all $p>\log_{2}n$ : the upper bound for the variance we get this way is worse than the bound ${\mathbf{Var}}\|G\|_{p}\lesssim 1$ that follows from $1$ –Lipschitzness of $\|\cdot\|_{p}$ –norm. To see this, observe that

[TABLE]

whence, applying standard estimates for absolute moments of Gaussian variables, we get that the expression on the right hand side of (13) is at least of order $\frac{2^{p}}{B}n^{2/p-1}$ .

In fact, as we show later, the estimate (13) is not sharp for all $p>\frac{2\log n}{\log(2e)}$ . Clearly, the problem with the above argument lies in the fact that, for large $p$ , the input of the individual coordinate $|g_{1}|^{p}$ to the total sum can be huge, and removing the term from the denominator in (12) alters the expectation.

As a way to resolve the issue, we will consider truncated Gaussian variables. Given $p\in[1,\infty)$ and a truncation level $T>0$ , we introduce an auxiliary function

[TABLE]

so that

[TABLE]

and then treat the two terms on the r.h.s. separately (the parameter $p$ shall always be clear from the context). Determining the right truncation level $T$ (when both terms admit satisfactory upper estimates) is not straightforward. We prefer to postpone the actual definition of the truncation level, and consider first some general estimates when $p$ is arbitrary and $T\geq\xi_{1-1/n}$ .

We start with ${\mathbf{E}}(\|G\|_{p}-f_{T}(G))^{2}$ .

Lemma 4.1.

For any large integer $n$ , any $1\leq p<\infty$ and any truncation level $T\geq\xi_{1-1/n}$ we have

[TABLE]

Proof.

Define a random set $I=I(G):=\{i\leq n:\,|g_{i}|>T\}$ . Since for any concave function $h$ in ${\mathbb{R}}$ and any $t\geq 0$ and $x\geq y$ , we have

[TABLE]

then, taking $h(r):=r^{1/p}$ and $t:=\sum_{i\notin I}|g_{i}|^{p}$ , we get

[TABLE]

For every $m\geq 1$ , let $\chi_{\{|I(G)|=m\}}$ be the indicator of the event that exactly $m$ coordinates of $G$ are greater (in absolute value) than $T$ . It follows from the above that for every $m\geq 1$ we have

[TABLE]

where, in view of (17),

[TABLE]

Hence,

[TABLE]

In view of (4), we have ${\mathbf{P}}\{|g|>T\}\leq\sqrt{2/\pi}\,T^{-1}\exp(-T^{2}/2)$ , and

[TABLE]

Summarizing, we get

[TABLE]

It is easy to show that for any number $a\in{\mathbb{R}}$ we have

[TABLE]

Since in our case $a=\sqrt{{2}/{\pi}}{T}^{-1}\exp(-T^{2}/2)$ , relation (7) implies that $(1+a)^{n-1}\lesssim 1$ , and

[TABLE]

The result follows. ∎

As the next step, we consider the variance of $f_{T}(G)$ .

Lemma 4.2.

Let $n$ be a large integer, let $p\in[1,3\log n]$ and let $T\geq\xi_{1-1/n}$ . Then

[TABLE]

where

[TABLE]

Proof.

It follows from Theorem 2.1 that

[TABLE]

where $|\partial_{1}f_{T}|=f_{T}^{1-p}|g_{1}|^{p-1}\chi_{\{|g_{1}|\leq T\}}$ . First we estimate the numerator. By Proposition 3.2 applied with a constant parameter $K\geq 6$ to the standard $(n-1)$ –dimensional truncated Gaussian vector, we have

[TABLE]

Next, observe that

[TABLE]

(this can be easily verified using relation (8)). Then, in view of Remark 3.3,

[TABLE]

It remains to estimate from below the denominator in (19). Essentially repeating the above computations, we get

[TABLE]

Further,

[TABLE]

and the statement follows. ∎

For shortness, in what follows we denote

[TABLE]

Note that $\xi\exp(\xi^{2}/2)\simeq n$ and by (8)

[TABLE]

Let us state a combination of the last two lemmas as a corollary:

Corollary 4.3.

Let $n$ be a large integer, let $p\in[1,3\log n]$ , and let $T\geq\xi$ . Then

[TABLE]

where $A$ is defined by (18).

Essentially, our work consists in optimizing the above expression over admissible $T$ . It turns out that taking the truncation level close to

[TABLE]

produces optimal upper bounds for the variance. Observe that the quantity in (22) is greater than $\xi$ . Indeed,

[TABLE]

Thus, (22) may serve as an admissible truncation level in (21). The following estimates are implied by Corollary 2.4 and relation (7).

Lemma 4.4 (Estimates for ${\mathbf{E}}\min(\xi,|g|)^{p}$ ).

Let $n$ be a large integer and let $p\geq 1$ . Then

•

For $1\leq p\leq\xi^{2}$ , we have

[TABLE]

•

For $p\geq\xi^{2}$ , we have

[TABLE]

While working with expression (22) directly may be complicated, the above lemma allows somewhat simpler (equivalent) definition. For $p\geq 1$ , we define a truncation level $M$ as follows

[TABLE]

In the next statement we collect some simple properties of $M$ .

Lemma 4.5.

Provided that $n$ is sufficiently large, we have:

•

$M\geq\xi$ * for all $p\geq 1$ ;*

•

If $1\leq p\leq\frac{2\log n}{\log(2e)}$ then $2p-2\leq M^{2}$ ;

•

If $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ then $p\leq M^{2}\leq 2p$ ;

•

If $\xi^{2}<p$ then $M^{2}\leq p^{1+\frac{1}{p}}$ .

Proof.

First, taking into account that

[TABLE]

we get $M\geq\xi$ for all $p\geq 1$ . In the range $1\leq p\leq\frac{2\log n}{\log(2e)}$ , the assertion trivially follows from (20) and the estimate $M\geq\xi$ . For $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ , we have $2p\geq n^{2/p}{p}/{e}=M^{2}$ , and as $n^{2/p}>e$ , we get $M^{2}>p$ . In the interval $\xi^{2}<p$ the statement follows from the definition of $M$ . ∎

Lemma 4.6.

We have $\exp(-n^{2/p}p/(2e))\leq n^{-2}2^{p}$ for all $p\in[1,2\log n]$ .

Proof.

It is enough to show that

[TABLE]

The derivative of the right hand side with respect to $p$ is

[TABLE]

which is less than zero if and only if $p\leq\frac{2\log n}{\log(2e)}$ . Thus, the minimum of $p\log 2+n^{2/p}p/(2e)$ on $[1,2\log n]$ is attained at $p=\frac{2\log n}{\log(2e)}$ , and at the point the expression is equal to $2\log n$ . ∎

Lemma 4.7 (Estimates for $M^{-1}\exp(-M^{2}/2)$ ).

Let $n$ be a large integer, $p\geq 1$ , and let $M=M(p)$ be defined as before. Then

•

For $1\leq p\leq\xi^{2}$ , we have

[TABLE]

•

For $\xi^{2}<p$ , we have

[TABLE]

Proof.

For $1\leq p\leq\xi^{2}$ the statement follows directly from the definition of $M$ . suppose that $\xi^{2}<p$ . Then $M\simeq\xi$ , and, applying (7), we get

[TABLE]

We will use the fact that $z-1=\log z+O((z-1)^{2})$ for all $z\geq 1$ . Note that

[TABLE]

Hence, the previous estimate implies

[TABLE]

where in the last relation we used the fact that $t^{t}\geq e^{-1/e}$ for any $t>0$ . This proves the lemma. ∎

The last lemma obviously provides upper bounds for the first term in (21) (for $T=M$ ).

Lemma 4.8.

Let $n$ be a large integer and let $p\geq 1$ . Then

•

If $1\leq p\leq\frac{2\log n}{\log(2e)}$ then

[TABLE]

•

If $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ then

[TABLE]

•

If $\xi^{2}<p$ then

[TABLE]

Proof.

First, consider the range $1\leq p\leq\frac{2\log n}{\log(2e)}$ . We have $2p-2\leq n^{2/p}{p}/{e}=M^{2}$ , whence, by Corollary 2.4,

[TABLE]

Next, assume that $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ . Here, $2p\geq n^{2/p}{p}/{e}=M^{2}$ , and in view of Corollary 2.4,

[TABLE]

Hölder’s inequality then implies

[TABLE]

On the other hand

[TABLE]

whence

[TABLE]

Applying the definition of $M$ and the estimates for $M^{-1}\exp(-M^{2}/2)$ from Lemma 4.7, we get

[TABLE]

Finally, note that, in the given range for $p$ , we have

[TABLE]

Now, we consider the interval $\xi^{2}<p$ . In this range we have $2p-2\geq 1.5M^{2}$ , so

[TABLE]

It remains to apply Lemma 4.7. ∎

Lemma 4.9.

Let $n$ be a large integer and let $p\in[1,3\log n]$ . Then, with $A$ defined by formula (18) with $T=M(p)$ , we have $1+\log A\gtrsim p$ .

Proof.

Since $M\geq\xi$ for all $p\geq 1$ , we have in view of Lemma 4.4 and the definition of $M$ :

[TABLE]

Further, ${\mathbf{E}}(|g|^{p-1}\chi_{\{|g|\leq M\}})\leq{\mathbf{E}}\min(M,\,|g|)^{p-1}\leq({\mathbf{E}}\min(M,\,|g|)^{p})^{1-1/p}$ . This leads to

[TABLE]

If $1\leq p\leq\frac{2\log n}{\log(2e)}$ , then $2p-2\leq M^{2}=n^{2/p}p/e$ , and by Corollary 2.4, we have

[TABLE]

Hence,

[TABLE]

If $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ , then $p\leq M^{2}=n^{2/p}p/e\leq 2p$ , and by Corollary 2.4 and Lemma 4.8, we get

[TABLE]

In the range under consideration, the minimum of $\exp\big{(}-\frac{p}{2e}n^{2/p}\big{)}$ is attained at $p=\frac{2\log n}{\log(2e)}$ , whence $B\gtrsim n^{2\log 2/\log(2e)}(\log n)^{-1/2}$ , and the statement follows.

Finally, if $\xi^{2}\leq p\leq 3\log\,n$ , then $p^{1+1/p}\geq M^{2}$ and $p\simeq\xi^{2}$ . Denote $q:=p^{1+1/p}$ . By Hölder’s inequality and Corollary 2.4 we have

[TABLE]

On the other hand, Lemma 4.8 gives

[TABLE]

Together with Lemma 4.7 the estimates imply

[TABLE]

This completes the proof of the lemma. ∎

A combination of Lemmas 4.7, 4.8, and 4.9 with Corollary 4.3 gives

Proposition 4.10.

Let $n$ be a large integer and let $p\in[1,3\log n]$ . Then

•

For $1\leq p\leq\frac{2\log n}{\log(2e)}$ we have

[TABLE]

•

For $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ we have

[TABLE]

•

For $\xi^{2}<p\leq 3\log n$ we have

[TABLE]

Proof.

First, assume that $1\leq p\leq\frac{2\log n}{\log(2e)}$ . By Corollary 4.3, Lemmas 4.7, 4.8, and 4.9 and Corollary 2.4 we have

[TABLE]

It remains to apply Lemma 4.6 to the first term.

Next, we treat the case $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ . Using the same argument as above and the fact that $n^{1/p}=O(1)$ , we obtain

[TABLE]

Hence,

[TABLE]

Finally, we consider the range $\xi^{2}<p\leq 3\log n$ . We have, in view of Lemma 4.8, Corollary 2.4 and relation (7):

[TABLE]

Thus,

[TABLE]

∎

Note that in Proposition 4.10 we treat the cae $p<3\log n$ . In the regime $p>3\log n$ , we will rely on the following result from [21]:

Lemma 4.11 ([21, Section 3]).

We have ${\mathbf{Var}}\|G\|_{p}\lesssim\frac{1}{\log n}$ for all $n>1$ and $p\geq 2.01$ .

5 Lower bounds for the variance

Let us start with a useful auxiliary result from [21]. We provide a proof for the reader’s convenience.

Lemma 5.1 ([21, Section 3]).

Let $p\geq 1$ and $n>1$ . Then

[TABLE]

where $G=(g_{1},g_{2},\dots,g_{n})$ and $G^{\prime}=(g_{1}^{\prime},g_{2}^{\prime},\dots,g_{n}^{\prime})$ are independent standard Gaussian vectors in ${\mathbb{R}}^{n}$ .

Proof.

First, clearly ${\mathbf{Var}}(\|G\|_{p})=\frac{1}{2}{\mathbf{E}}(\|G\|_{p}-\|G^{\prime}\|_{p})^{2}$ . Next, it can be checked, using elementary convexity properties, that for any two positive real numbers $a,b$ we have

[TABLE]

Applying the above inequality for $a:=\|G\|_{p}^{p}$ and $b:=\|G^{\prime}\|_{p}^{p}$ , we obtain

[TABLE]

It is easy to see that, for $i\neq j$ , the terms in the above sum are equal zero, whence

[TABLE]

The result follows. ∎

As a simple corollary, we obtain the main technical element of the section:

Lemma 5.2.

There is a universal constant $C$ with the following property. Assume that $n>1$ and $p\geq C$ . Further, let $T\geq 2$ and $\tau\in(0,1)$ be any numbers such that

[TABLE]

where $g_{1},g_{2},\dots,g_{2n-2}$ are i.i.d. standard Gaussians. Then for the standard Gaussian vector $G$ in ${\mathbb{R}}^{n}$ we have

[TABLE]

Proof.

In view of Lemma 5.1, we have

[TABLE]

where $G^{\prime}=(g_{1}^{\prime},g_{2}^{\prime},\dots,g_{n}^{\prime})$ is an independent copy of $G=(g_{1},g_{2},\dots,g_{n})$ . By the assumptions on $T$ we have ${\mathbf{P}}\{\sum_{i=2}^{n}|g_{i}|^{p}+\sum_{i=2}^{n}|g_{i}^{\prime}|^{p}\leq T^{p}\}\geq\tau$ , whence

[TABLE]

Further, observe that for any two numbers $a\geq 0$ and $0\leq b\leq 1$ we have $(a-b)^{2}>a^{2}/4-1/2$ , whence, in particular,

[TABLE]

Together with the above inequalities, it gives

[TABLE]

where in the last step we used that, by Corollary 2.4,

[TABLE]

if $p$ is big enough. ∎

Naturally, we would like to apply the above lemma with $T$ close to $M(p)$ where the truncation level $M(p)$ was defined in Section 4. We have

Lemma 5.3.

Let $n$ be a large integer and let $C$ be the constant from Lemma 5.2. Then for any $p\geq C$ we have

[TABLE]

where $M=M(p)$ is defined by formula (23).

Proof.

As it was observed back in Lemma 4.4, we have

[TABLE]

In particular, there is a universal constant $C_{1}>0$ such that

[TABLE]

By Markov’s inequality, given $2n-2$ i.i.d. Gaussian variables $g_{1},g_{2},\dots,g_{2n-2}$ , we have

[TABLE]

Further, we observe that

[TABLE]

whence

[TABLE]

Thus, $T:=C_{1}^{1/p}M$ and $\tau:=e^{-2}-e^{-3}$ satisfy conditions of Lemma 5.2. Applying Lemma 5.2, we obtain

[TABLE]

∎

As a corollary of Lemma 5.3 and bounds on truncated moments from Lemma 4.8, we obtain

Proposition 5.4.

Let $n$ be a large integer and let $p\geq C$ , where $C$ is defined in Lemma 5.2. Then

•

For $1\leq p\leq\frac{2\log n}{\log(2e)}$ we have

[TABLE]

•

For $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ we have

[TABLE]

•

For $\xi^{2}<p$ we have

[TABLE]

Proof.

First, assume that $1\leq p\leq\frac{2\log n}{\log(2e)}$ . Then a combination of Lemmas 5.3 and 4.8 and the definition of $M$ gives

[TABLE]

Next, assume that $\frac{2\log n}{\log(2e)}\leq p\leq\xi^{2}$ . Again, combining Lemmas 5.3 and 4.8, we get

[TABLE]

Finally, consider the case $\xi^{2}<p$ . We have $M^{p}=\frac{p\xi^{p}}{\xi+p-\xi^{2}}$ , and

[TABLE]

The statement follows. ∎

Note that in the regime $p>\xi^{2}$ the above estimate gives the right order for ${\mathbf{Var}}\|G\|_{p}$ only if $p=O(\log n)$ . For $p\gg\log n$ we will use the following estimate from [21]:

Lemma 5.5 ([21]).

There is a universal constant $C>0$ such that for $n>1$ and all $p\geq C\log n$ we have

[TABLE]

Together Proposition 4.10, Lemma 4.11, Proposition 5.4 and Lemma 5.5 imply Theorem A from the introduction in the regime $p\geq C$ . For $1\leq p\leq C$ , we refer to [21].

6 Proof of Corollary B

Let $0<\varepsilon,\delta<1$ be given. It follows from Theorem A that there exists $v_{\delta}>0$ depending on $\delta$ such that for all sufficiently large $n$ , we have

[TABLE]

Let $w_{\delta}:=v_{\delta}/40$ and $1<k<w_{\delta}\log n/\log(2/\varepsilon)$ . Construct an $n\times k$ Gaussian matrix $\mathcal{G}$ whose columns are jointly independent standard Gaussian vectors $G_{1},G_{2},\dots,G_{k}$ in ${\mathbb{R}}^{n}$ . Then a uniform random $k$ -dimensional subspace can be defined as

[TABLE]

The subspace $E$ is $(1+\varepsilon)$ –spherical in $\ell_{p}^{n}$ for $p:=(2-\delta)\log n$ if

[TABLE]

The last inequality holds whenever for all $x\in S^{k-1}$ we have

[TABLE]

Let $a:={\mathbf{E}}\|G\|_{p}$ . Note that if $x\in S^{k-1}$ then $\mathcal{G}x$ is a standard Gaussian vector and we have

[TABLE]

Let $\mathcal{N}$ be an $\varepsilon^{\prime}/5$ -net of minimal cardinality in $\ell_{2}$ -metric in $S^{k-1}$ . We have $|\mathcal{N}|<(15/\varepsilon^{\prime})^{k}$ and

[TABLE]

Conditioning on the event that $\big{|}\|\mathcal{G}y\|_{p}-a\big{|}\leq\varepsilon^{\prime}a/4$ for all $y\in\mathcal{N}$ we have

[TABLE]

(see e.g. Lemma 3.2 of [20]). Thus if there exists $x\in S^{k-1}$ such that $\big{|}\|\mathcal{G}x\|_{p}-a\big{|}>\varepsilon^{\prime}a/2$ then there exists $y\in\mathcal{N}$ such that $\|x-y\|_{2}\leq\varepsilon^{\prime}/5$ and

[TABLE]

This leads to

[TABLE]

To pass from $\|\mathcal{G}x\|_{p}$ to $\|\mathcal{G}x\|_{p}/\|\mathcal{G}x\|_{2}$ , note that given $\widetilde{\varepsilon}<1/3$ and non-negative random variables $\xi_{1}$ , $\xi_{2}$ , the event that

[TABLE]

is contained inside the event

[TABLE]

By the standard concentration estimates, we have

[TABLE]

hence, taking $\xi_{1}=\|G\|_{p}$ , $\xi_{2}=\|G\|_{2}$ , and $\widetilde{\varepsilon}=\varepsilon^{\prime}/2$ , we get

[TABLE]

provided that $n$ is big enough. Thus (1) is proved.

To prove the second part of Corollary B, it is enough to consider the case $k=2$ . Then $E=\mathrm{span}\,\{G,G^{\prime}\}$ , where $G$ and $G^{\prime}$ are two independent standard Gaussian vectors in ${\mathbb{R}}^{n}$ , and

[TABLE]

Thus it is enough to show that for $p:=(2+\delta)\log n$ we have

[TABLE]

for some $w_{\delta}>0$ depending only on $\delta$ . Observe that standard concentration estimates imply

[TABLE]

whence it is enough to show that

[TABLE]

for some $\widetilde{w}_{\delta}>0$ . Recall that ${\mathbf{Var}}\|G\|_{p}\geq\frac{v_{\delta}}{\log n}$ for some $v_{\delta}>0$ . Hence,

[TABLE]

Next, observe that ${\mathbf{E}}\|G\|_{p}\simeq\sqrt{\log n}$ , and in view of $1$ -symmetry of the $\|\cdot\|_{p}$ –norm and by a result from [29], we have

[TABLE]

This, together with the above relation, implies

[TABLE]

whence

[TABLE]

This, and the fact that $\|G\|_{p}\simeq\sqrt{\log n}$ with very large probability, implies the statement.

Bibliography29

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] S. Artstein-Avidan, A. Giannopoulos and V. D. Milman, Asymptotic geometric analysis. Part I, Mathematical Surveys and Monographs, 202, American Mathematical Society, Providence, RI(2015). MR 3331351
2[2] S. Boucheron, G. Lugosi and P. Massart, Concentration inequalities , Oxford University Press, Oxford, 2013. MR 3185193
3[3] S. Boucheron and M. Thomas, Concentration inequalities for order statistics, Electron. Commun. Probab. 17 (2012), no. 51, 12 pp. MR 2994876
4[4] S. Chatterjee, Superconcentration and related topics, Springer Monographs in Mathematics (2013).
5[5] H. Chernoff, A measure of asymptotic efficiency for tests of a hypothesis based on the sum of observations, Ann. Math. Statistics 23 (1952), 493–507. MR 0057518
6[6] D. Cordero-Erausquin and M. Ledoux, Hypercontractive measures, Talagrand’s inequality, and influences, in Geometric aspects of functional analysis , 169–189, Lecture Notes in Math., 2050, Springer, Heidelberg. MR 2985132
7[7] H. A. David, Order statistics , second edition, Wiley, New York, 1981. MR 0597893
8[8] A. Dvoretzky, Some results on convex bodies and Banach spaces, in Proc. Internat. Sympos. Linear Spaces (Jerusalem, 1960) , 123–160, Jerusalem Academic Press, Jerusalem. MR 0139079

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

The variance of the ℓpn\ell_{p}^{n}ℓpn​–norm of the Gaussian vector, and Dvoretzky’s theorem

Abstract

1 Introduction

Theorem A**.**

Corollary B**.**

2 Preliminaries

Theorem 2.1** **(Talagrand’s L1–L2L_{1}\text{--}L_{2}L1​–L2​ bound; see

2.1 Bounds for truncated moments of Gaussian variables

Lemma 2.2**.**

Lemma 2.3**.**

Proof.

Corollary 2.4**.**

Proof.

Remark 2.5**.**

2.2 Chernoff–type bounds for order statistics

Lemma 2.6** (Lower deviation for large order statistics).**

Proof.

Lemma 2.7** (Lower deviation for intermediate order statistics).**

Proof.

Lemma 2.8**.**

Proof.

3 Negative truncated moments of ℓpn\ell_{p}^{n}ℓpn​–norms

Lemma 3.1**.**

Proof.

Proposition 3.2**.**

Proof.

Remark 3.3**.**

4 Upper bounds for the variance

Lemma 4.1**.**

Proof.

Lemma 4.2**.**

Proof.

Corollary 4.3**.**

Lemma 4.4** (Estimates for Emin⁡(ξ,∣g∣)p{\mathbf{E}}\min(\xi,|g|)^{p}Emin(ξ,∣g∣)p).**

Lemma 4.5**.**

Proof.

Lemma 4.6**.**

Proof.

Lemma 4.7** (Estimates for M−1exp⁡(−M2/2)M^{-1}\exp(-M^{2}/2)M−1exp(−M2/2)).**

Proof.

Lemma 4.8**.**

Proof.

Lemma 4.9**.**

Proof.

Proposition 4.10**.**

Proof.

Lemma 4.11** ([21, Section 3]).**

5 Lower bounds for the variance

Lemma 5.1** ([21, Section 3]).**

Proof.

Lemma 5.2**.**

Proof.

Lemma 5.3**.**

Proof.

Proposition 5.4**.**

Proof.

Lemma 5.5** ([21]).**

6 Proof of Corollary B

The variance of the $\ell_{p}^{n}$ –norm of the Gaussian vector, and Dvoretzky’s theorem

Theorem A.

Corollary B.

Theorem 2.1 (Talagrand’s $L_{1}\text{--}L_{2}$ bound; see

Lemma 2.2.

Lemma 2.3.

Corollary 2.4.

Remark 2.5.

Lemma 2.6 (Lower deviation for large order statistics).

Lemma 2.7 (Lower deviation for intermediate order statistics).

Lemma 2.8.

3 Negative truncated moments of $\ell_{p}^{n}$ –norms

Lemma 3.1.

Proposition 3.2.

Remark 3.3.

Lemma 4.1.

Lemma 4.2.

Corollary 4.3.

Lemma 4.4 (Estimates for ${\mathbf{E}}\min(\xi,|g|)^{p}$ ).

Lemma 4.5.

Lemma 4.6.

Lemma 4.7 (Estimates for $M^{-1}\exp(-M^{2}/2)$ ).

Lemma 4.8.

Lemma 4.9.

Proposition 4.10.

Lemma 4.11 ([21, Section 3]).

Lemma 5.1 ([21, Section 3]).

Lemma 5.2.

Lemma 5.3.

Proposition 5.4.

Lemma 5.5 ([21]).