High-dimensional limit theorems for random vectors in $\ell_p^n$-balls.   II

Zakhar Kabluchko; Joscha Prochno; Christoph Thaele

arXiv:1906.03599·math.PR·June 11, 2019

High-dimensional limit theorems for random vectors in $\ell_p^n$-balls. II

Zakhar Kabluchko, Joscha Prochno, Christoph Thaele

PDF

Open Access

TL;DR

This paper establishes central limit, moderate deviations, and large deviations theorems for the q-norms of high-dimensional random vectors in p^n-balls, extending previous work with new applications to projections.

Contribution

It introduces a unified framework for limit theorems for p^n-ball vectors under general distributions, including new applications to projections.

Findings

01

Proved a central limit theorem for p^n-ball vectors.

02

Established moderate deviations principles.

03

Derived large deviations results.

Abstract

In this article we prove three fundamental types of limit theorems for the $q$ -norm of random vectors chosen at random in an $ℓ_{p}^{n}$ -ball in high dimensions. We obtain a central limit theorem, a moderate deviations as well as a large deviations principle when the underlying distribution of the random vectors belongs to a general class introduced by Barthe, Gu\'edon, Mendelson, and Naor. It includes the normalized volume and the cone probability measure as well as projections of these measures as special cases. Two new applications to random and non-random projections of $ℓ_{p}^{n}$ -balls to lower-dimensional subspaces are discussed as well. The text is a continuation of [Kabluchko, Prochno, Th\"ale: High-dimensional limit theorems for random vectors in $ℓ_{p}^{n}$ -balls, Commun. Contemp. Math. (2019)].

Equations268

P_{W_{n}, n, p} := W_{n} ({0}) C_{n, p} + H U_{n, p},

P_{W_{n}, n, p} := W_{n} ({0}) C_{n, p} + H U_{n, p},

h(r)={1\over p^{n/p}\Gamma\big{(}1+{n\over p}\big{)}}{1\over(1-r^{p})^{1+n/p}}\int_{0}^{\infty}s^{n/p}e^{-{1\over p}{sr^{p}(1-r^{p})^{-1}}}\,\mathbf{W}_{n}(\textup{d}s),\qquad r\in[0,1].

h(r)={1\over p^{n/p}\Gamma\big{(}1+{n\over p}\big{)}}{1\over(1-r^{p})^{1+n/p}}\int_{0}^{\infty}s^{n/p}e^{-{1\over p}{sr^{p}(1-r^{p})^{-1}}}\,\mathbf{W}_{n}(\textup{d}s),\qquad r\in[0,1].

\int_{B_{p}^{n}} f (x) P_{W_{n}, n, p} (d x)

\int_{B_{p}^{n}} f (x) P_{W_{n}, n, p} (d x)

= W_{n} ({0}) \int_{S_{p}^{n - 1}} f (x) C_{n, p} (d x) + \int_{B_{p}^{n}} f (x) h (∥ x ∥_{p}) U_{n, p} (d x)

x\mapsto{\Gamma\big{(}\alpha+{n\over p}\big{)}\over\Gamma(\alpha)\big{(}2\Gamma\big{(}1+{1\over p}\big{)}\big{)}^{n}}\,\big{(}1-\|x\|_{p}^{p}\big{)}^{\alpha-1},\qquad x\in{\mathbb{B}}_{p}^{n}\,.

x\mapsto{\Gamma\big{(}\alpha+{n\over p}\big{)}\over\Gamma(\alpha)\big{(}2\Gamma\big{(}1+{1\over p}\big{)}\big{)}^{n}}\,\big{(}1-\|x\|_{p}^{p}\big{)}^{\alpha-1},\qquad x\in{\mathbb{B}}_{p}^{n}\,.

M_{p} (q) := \frac{p ^{q / p}}{q + 1} \frac{Γ ( 1 + \frac{q + 1}{p} )}{Γ ( 1 + \frac{1}{p} )} = p^{q / p} \frac{Γ ( \frac{q + 1}{p} )}{Γ ( \frac{1}{p} )} .

M_{p} (q) := \frac{p ^{q / p}}{q + 1} \frac{Γ ( 1 + \frac{q + 1}{p} )}{Γ ( 1 + \frac{1}{p} )} = p^{q / p} \frac{Γ ( \frac{q + 1}{p} )}{Γ ( \frac{1}{p} )} .

\frac{W _{n}}{n} n \to \infty ⟶ P 0 .

\frac{W _{n}}{n} n \to \infty ⟶ P 0 .

\displaystyle\sqrt{n}\bigg{(}{n^{{1\over p}-{1\over q}}\over M_{p}(q)^{1/q}}\|Z_{n}\|_{q}-1\bigg{)}\,\,\overset{d}{\underset{n\to\infty}{\longrightarrow}}\,\,N,

\displaystyle\sqrt{n}\bigg{(}{n^{{1\over p}-{1\over q}}\over M_{p}(q)^{1/q}}\|Z_{n}\|_{q}-1\bigg{)}\,\,\overset{d}{\underset{n\to\infty}{\longrightarrow}}\,\,N,

\displaystyle\sigma^{2}={1\over q^{2}}\bigg{(}{\Gamma({1\over p})\Gamma({2q+1\over p})\over\Gamma({q+1\over p})^{2}}-1\bigg{)}-{1\over p}.

\displaystyle\sigma^{2}={1\over q^{2}}\bigg{(}{\Gamma({1\over p})\Gamma({2q+1\over p})\over\Gamma({q+1\over p})^{2}}-1\bigg{)}-{1\over p}.

W_{n}^{*} := \frac{W _{n} - μ _{n}}{n} n \to \infty ⟶ d ζ

W_{n}^{*} := \frac{W _{n} - μ _{n}}{n} n \to \infty ⟶ d ζ

\displaystyle\sqrt{n}\Bigg{(}{n^{{1\over p}-{1\over q}}\frac{(1+\frac{\mu_{n}}{n})^{1/p}}{M_{p}(q)^{1/q}}}\|Z_{n}\|_{q}-1\Bigg{)}\,\,\overset{d}{\underset{n\to\infty}{\longrightarrow}}\,\,\tilde{N}\,,

\displaystyle\sqrt{n}\Bigg{(}{n^{{1\over p}-{1\over q}}\frac{(1+\frac{\mu_{n}}{n})^{1/p}}{M_{p}(q)^{1/q}}}\|Z_{n}\|_{q}-1\Bigg{)}\,\,\overset{d}{\underset{n\to\infty}{\longrightarrow}}\,\,\tilde{N}\,,

\tilde{σ}^{2} = \frac{Γ ( \frac{1}{p} ) Γ ( \frac{2 q + 1}{p} )}{q ^{2} Γ ( \frac{q + 1}{p} ) ^{2}} - \frac{1}{q ^{2}} + \frac{1}{p ( 1 + μ ) ^{2}} - \frac{2}{p ( 1 + μ )} + \frac{τ ^{2}}{p ^{2} ( 1 + μ ) ^{2}} .

\tilde{σ}^{2} = \frac{Γ ( \frac{1}{p} ) Γ ( \frac{2 q + 1}{p} )}{q ^{2} Γ ( \frac{q + 1}{p} ) ^{2}} - \frac{1}{q ^{2}} + \frac{1}{p ( 1 + μ ) ^{2}} - \frac{2}{p ( 1 + μ )} + \frac{τ ^{2}}{p ^{2} ( 1 + μ ) ^{2}} .

- x \in A^{\circ} in f I (x) \leq n \to \infty lim inf s_{n}^{- 1} lo g P [X_{n} \in A] \leq n \to \infty lim sup s_{n}^{- 1} lo g P [X_{n} \in A] \leq - x \in \overline{A} in f I (x)

- x \in A^{\circ} in f I (x) \leq n \to \infty lim inf s_{n}^{- 1} lo g P [X_{n} \in A] \leq n \to \infty lim sup s_{n}^{- 1} lo g P [X_{n} \in A] \leq - x \in \overline{A} in f I (x)

\displaystyle\limsup_{n\to\infty}{1\over b_{n}^{2}}\log\mathbf{W}_{n}\big{(}(\delta b_{n}\sqrt{n},\infty)\big{)}=-\infty.

\displaystyle\limsup_{n\to\infty}{1\over b_{n}^{2}}\log\mathbf{W}_{n}\big{(}(\delta b_{n}\sqrt{n},\infty)\big{)}=-\infty.

\displaystyle{\sqrt{n}\over b_{n}}\bigg{(}{n^{{1/p}-{1/q}}\over M_{p}(q)^{1/q}}\|Z_{n}\|_{q}-1\bigg{)}

\displaystyle{\sqrt{n}\over b_{n}}\bigg{(}{n^{{1/p}-{1/q}}\over M_{p}(q)^{1/q}}\|Z_{n}\|_{q}-1\bigg{)}

\lim_{n\to\infty}{1\over b_{n}^{2}}\log\mathbb{P}\Bigg{[}{\sqrt{n}\over b_{n}}\bigg{(}{n^{{1/p}-{1/q}}\over M_{p}(q)^{1/q}}\|Z_{n}\|_{q}-1\bigg{)}\geq t\Bigg{]}=-{t^{2}\over 2\sigma^{2}}.

\lim_{n\to\infty}{1\over b_{n}^{2}}\log\mathbb{P}\Bigg{[}{\sqrt{n}\over b_{n}}\bigg{(}{n^{{1/p}-{1/q}}\over M_{p}(q)^{1/q}}\|Z_{n}\|_{q}-1\bigg{)}\geq t\Bigg{]}=-{t^{2}\over 2\sigma^{2}}.

Λ (t_{1}, t_{2}) = lo g \int_{0}^{\infty} e^{t_{1} x^{q} + (t_{2} - 1/ p) x^{p}} \frac{d x}{p ^{1/ p} Γ ( 1 + 1/ p )} .

Λ (t_{1}, t_{2}) = lo g \int_{0}^{\infty} e^{t_{1} x^{q} + (t_{2} - 1/ p) x^{p}} \frac{d x}{p ^{1/ p} Γ ( 1 + 1/ p )} .

\displaystyle\limsup_{n\to\infty}{1\over n^{p/q}}\log\mathbb{P}\bigg{[}{W_{n}\over n}>\delta\bigg{]}=-\infty

\displaystyle\limsup_{n\to\infty}{1\over n^{p/q}}\log\mathbb{P}\bigg{[}{W_{n}\over n}>\delta\bigg{]}=-\infty

\mathbb{I}_{{\bf Z},2}(x)=\begin{cases}\frac{1}{p}\big{(}x^{q}-M_{p}(q)\big{)}^{p/q}&:x\geq M_{p}(q)^{1/q}\\ +\infty&:\text{otherwise}.\end{cases}

\mathbb{I}_{{\bf Z},2}(x)=\begin{cases}\frac{1}{p}\big{(}x^{q}-M_{p}(q)\big{)}^{p/q}&:x\geq M_{p}(q)^{1/q}\\ +\infty&:\text{otherwise}.\end{cases}

I_{W} (x) = {0 + \infty : x = 0 : x \neq = 0.

I_{W} (x) = {0 + \infty : x = 0 : x \neq = 0.

I_{W} (x) = {+ \infty \frac{x}{p} : x < 0 : x \geq 0.

I_{W} (x) = {+ \infty \frac{x}{p} : x < 0 : x \geq 0.

Λ (t) = lo g \int_{0}^{\infty} e^{(t - 1/ p) x^{p}} \frac{d x}{p ^{1/ p} Γ ( 1 + 1/ p )}

Λ (t) = lo g \int_{0}^{\infty} e^{(t - 1/ p) x^{p}} \frac{d x}{p ^{1/ p} Γ ( 1 + 1/ p )}

\frac{n ^{1/ p}}{M _{p} ( 2 )} ∥ P_{E_{n}} X_{n} ∥_{2} - k_{n} n \to \infty ⟶ d N,

\frac{n ^{1/ p}}{M _{p} ( 2 )} ∥ P_{E_{n}} X_{n} ∥_{2} - k_{n} n \to \infty ⟶ d N,

\sigma^{2}(p,\lambda)={\lambda\over 4}{\Gamma({1\over p})\Gamma({5\over p})\over\Gamma({3\over p})^{2}}-\lambda\Big{(}{3\over 4}+{1\over p}\Big{)}+{1\over 2}.

\sigma^{2}(p,\lambda)={\lambda\over 4}{\Gamma({1\over p})\Gamma({5\over p})\over\Gamma({3\over p})^{2}}-\lambda\Big{(}{3\over 4}+{1\over p}\Big{)}+{1\over 2}.

\displaystyle\mathbb{I}(y)=\begin{cases}{1\over p}\Big{(}{y^{2}\over\lambda}-M_{p}(2)\Big{)}^{p/2}&:y\geq\sqrt{\lambda M_{p}(2)}\\ +\infty&:\text{otherwise},\end{cases}

\displaystyle\mathbb{I}(y)=\begin{cases}{1\over p}\Big{(}{y^{2}\over\lambda}-M_{p}(2)\Big{)}^{p/2}&:y\geq\sqrt{\lambda M_{p}(2)}\\ +\infty&:\text{otherwise},\end{cases}

\frac{n ^{1/ p}}{M _{p} ( 2 )} ∥ Π_{k_{n}} X_{n} ∥_{2} - k_{n} n \to \infty ⟶ d \tilde{N},

\frac{n ^{1/ p}}{M _{p} ( 2 )} ∥ Π_{k_{n}} X_{n} ∥_{2} - k_{n} n \to \infty ⟶ d \tilde{N},

\tilde{\sigma}^{2}(p,\lambda)=\frac{1}{4}\bigg{(}{\Gamma({1\over p})\Gamma({5\over p})\over\Gamma({3\over p})^{2}}-1\bigg{)}-\frac{\lambda}{p}.

\tilde{\sigma}^{2}(p,\lambda)=\frac{1}{4}\bigg{(}{\Gamma({1\over p})\Gamma({5\over p})\over\Gamma({3\over p})^{2}}-1\bigg{)}-\frac{\lambda}{p}.

μ_{k_{n}} := E W_{k_{n}} = n - k_{n} + p and μ := n \to \infty lim \frac{μ _{k_{n}}}{k _{n}} = \frac{1 - λ}{λ} \geq 0 .

μ_{k_{n}} := E W_{k_{n}} = n - k_{n} + p and μ := n \to \infty lim \frac{μ _{k_{n}}}{k _{n}} = \frac{1 - λ}{λ} \geq 0 .

{\mathrm{Var}\,W_{k_{n}}\over k_{n}}=p\Big{(}{n\over k_{n}}-1\Big{)}+{p^{2}\over k_{n}}\to p\,{1-\lambda\over\lambda}=:\tau^{2}\geq 0\,,

{\mathrm{Var}\,W_{k_{n}}\over k_{n}}=p\Big{(}{n\over k_{n}}-1\Big{)}+{p^{2}\over k_{n}}\to p\,{1-\lambda\over\lambda}=:\tau^{2}\geq 0\,,

W_{k_{n}}^{*} := \frac{W _{k_{n}} - μ _{k_{n}}}{k _{n}} n \to \infty ⟶ d ζ \sim N (0, τ^{2}) .

W_{k_{n}}^{*} := \frac{W _{k_{n}} - μ _{k_{n}}}{k _{n}} n \to \infty ⟶ d ζ \sim N (0, τ^{2}) .

\sqrt{k_{n}}\Bigg{(}k_{n}^{{1\over p}-{1\over 2}}{\big{(}1+{\mu_{k_{n}}\over k_{n}}\big{)}^{1/p}\over\sqrt{M_{p}(2)}}\|\Pi_{k_{n}}X_{n}\|_{2}-1\Bigg{)}\overset{d}{\underset{n\to\infty}{\longrightarrow}}\tilde{N},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStochastic processes and statistical mechanics · Geometry and complex manifolds · Point processes and geometric inequalities

Full text

High-dimensional limit theorems

for random vectors in $\ell_{p}^{n}$ -balls. II

Zakhar Kabluchko

Institut für Mathematische Stochastik, Westfälische Wilhelms-Universität Münster, Germany

[email protected]

,

Joscha Prochno

Institut für Mathematik & Wissenschaftliches Rechnen, Karl-Franzens-Universität Graz, Austria

[email protected]

and

Christoph Thäle

Faculty of Mathematics, Ruhr University Bochum, Germany

[email protected]

Abstract.

In this article we prove three fundamental types of limit theorems for the $q$ -norm of random vectors chosen at random in an $\ell_{p}^{n}$ -ball in high dimensions. We obtain a central limit theorem, a moderate deviations as well as a large deviations principle when the underlying distribution of the random vectors belongs to a general class introduced by Barthe, Guédon, Mendelson, and Naor. It includes the normalized volume and the cone probability measure as well as projections of these measures as special cases. Two new applications to random and non-random projections of $\ell_{p}^{n}$ -balls to lower-dimensional subspaces are discussed as well. The text is a continuation of [Kabluchko, Prochno, Thäle: High-dimensional limit theorems for random vectors in $\ell_{p}^{n}$ -balls, Commun. Contemp. Math. (2019)].

Key words and phrases:

Asymptotic geometric analysis, central limit theorem, convex bodies, $\ell_{p}^{n}$ -balls, large deviations principle, moderate deviations principle, stochastic geometry

2010 Mathematics Subject Classification:

Primary: 60F10, 52A23 Secondary: 60D05, 46B09

1. Introduction and main results

The study of high-dimensional geometric structures and particularly of convex bodies has received considerable attention in the last decade. In parts, this was triggered by modern applications in high-dimensional statistics, machine learning, and numerical analysis. Many of the deep discoveries are of a probabilistic flavor or have been obtained by means of novel and powerful probabilistic methods. It therefore comes as no surprise that (central) limit theorems have been obtained for various quantities that appear in high-dimensional stochastic geometry or the asymptotic theory of convex bodies. Probably the first high-dimensional central limit theorem is known as the Poincaré-Maxwell-Borel Lemma (see, e.g., [9, 23]). It shows that the distribution of the first $k$ coordinates of a point chosen uniformly at random from the $n$ -dimensional Euclidean ball or sphere converges to a $k$ -dimensional Gaussian distribution, as the dimension $n$ of the ambient space tends to infinity. The most prominent result of the past $15$ years is arguably Klartag’s central limit theorem for isotropic convex bodies [15], showing that most $k$ -dimensional marginals of random points chosen uniformly at random from a convex body are approximately Gaussian. Many more deep central limit phenomena have been discovered in the recent past. Among others, there is a central limit theorem for the volume of convex hulls of Gaussian random vectors obtained by Bárány and Vu in [5] or Reitzner’s central limit theorems for the volume and the number of $i$ -dimensional faces of random polytopes in smooth convex bodies [20] that were obtained when the number of random points tends to infinity (see also Bárány and Thäle [4] and Thäle, Turchi, and Wespi [24] for results about general intrinsic volumes). There is a central limit theorem due to Paouris, Pivovarov, and Zinn [18] for the volume of $k$ -dimensional random projections of the $n$ -dimensional cube when $n\to\infty$ , a result that had previously been obtained by Kabluchko, Litvak, and Zaporozhets [12] in the special case $k=1$ . Alonso-Gutiérrez, Prochno, and Thäle [1] proved a central limit theorem and Berry-Esseen bounds for the Euclidean norm of random orthogonal projections of points chosen uniformly at random from the unit ball of $\ell_{p}^{n}$ , as $n\to\infty$ , and Kabluchko, Prochno, and Thäle [13] obtained a multivariate central limit theorem for the $q$ -norm of random vectors chosen uniformly at random in the unit $p$ -ball of $\mathbb{R}^{n}$ , which extended the corresponding $1$ -dimensional result obtained by Schmuckenschläger [22].

While the results in the previous paragraph describe central limit phenomena for several geometry related quantities, there is considerably less known about the large deviations behavior. Large deviations principles, which appear on the scale of a law of large numbers, have only recently been introduced in geometric functional analysis by Gantert, Kim, and Ramanan [11], who obtained a large deviations principle for $1$ -dimensional random projections of $\ell_{p}^{n}$ -balls in $\mathbb{R}^{n}$ , as the space dimension tends to infinity. Subsequent work of Alonso-Gutiérrez, Prochno, and Thäle [1] provided a description of the large deviations behavior for the Euclidean norm of projections of $\ell_{p}^{n}$ -balls to high-dimensional random subspaces (the so-called annealed case), and Kabluchko, Prochno, and Thäle [13] obtained a complete description of the large deviations behavior of $\ell_{q}$ -norms of high-dimensional random vectors that are chosen uniformly at random in an $\ell_{p}^{n}$ -ball, which can be seen as an asymptotic version of a result of Schechtman and Zinn [21].

The motivation for this manuscript is essentially three-fold and we shall discuss the details in the following subsections together with our corresponding results. The first is the aim for an extension of the (multivariate) central limit theorems obtained in [13, Theorem 1.1] and [22, Proposition 2.4] and the large deviations principles [13, Theorems 1.2 and 1.3] to a considerably wider class of distributions on $\ell_{p}^{n}$ -balls. The second aim is to go between the Gaussian fluctuations described by the central limit theorem and the large deviations and to describe the moderate deviations behavior of the random variables studied there. Moderate deviations are typically non-parametric (in contrast to large deviations) and consider probabilities on scales between those of a law of large numbers and a central limit theorem. These new findings for the moderate scaling therefore complement and refine both the new central limit theorems (Theorem A and Theorem B) as well the new large deviations principle (Theorem D). For a variety of applications of such results, despite the once presented below, we refer the reader to [13].

Before we present our results, let us explain the distributional set-up of this manuscript. As already mentioned, we consider a much more general class of distributions compared to [13] and [22]. Those have been introduced and studied by Barthe, Guédon, Mendelson, and Naor [6], and are closely related to the geometry of $\ell_{p}^{n}$ -balls. This class contains the uniform distribution considered in [1, 11, 13], the cone probability measure on the $\ell_{p}^{n}$ -unit ball ${\mathbb{B}}_{p}^{n}:=\{x\in\mathbb{R}^{n}:\|x\|_{p}\leq 1\}$ as special cases, and many more (see below). As usual, $\|x\|_{p}=(|x_{1}|^{p}+\ldots+|x_{n}|^{p})^{1/p}$ denotes the $\ell_{p}$ -norm of the vector $x=(x_{1},\ldots,x_{n})$ , and the parameter $p$ satisifes $0<p<\infty$ . For every $n\in\mathbb{N}$ , we let $\mathbf{W}_{n}$ be any Borel probability measure on $[0,\infty)$ , $\mathbf{U}_{n,p}$ be the uniform distribution, and $\mathbf{C}_{n,p}$ be the cone probability measure on ${\mathbb{B}}_{p}^{n}$ . The distributions we consider are of the form

[TABLE]

where the function $H:{\mathbb{B}}_{p}^{n}\to\mathbb{R}$ is given by $H(x)=h(\|x\|_{p})$ with

[TABLE]

In other words this means that

[TABLE]

for all non-negative measurable functions $f:{\mathbb{B}}_{p}^{n}\to\mathbb{R}$ , where ${\mathbb{S}}_{p}^{n-1}=\{x\in\mathbb{R}^{n}:\|x\|_{p}=1\}$ denotes the $\ell_{p}^{n}$ -sphere. The class of measures of the form $\mathbf{P}_{\mathbf{W}_{n},n,p}$ contains the following important cases, which are of particular interest (see Theorem 1, Theorem 3, Corollary 3, and Corollary 4 in [6]):

(i)

If $\mathbf{W}_{n}$ is the exponential distribution with rate $1/p$ (and mean $p$ ), then $\mathbf{W}_{n}(\{0\})=0$ , $H\equiv 1$ , and $\mathbf{P}_{\mathbf{W}_{n},n,p}$ reduces to the uniform distribution $\mathbf{U}_{n,p}$ on ${\mathbb{B}}_{p}^{n}$ .

(ii)

If $\mathbf{W}_{n}=\delta_{0}$ is the Dirac measure concentrated at [math], then $\mathbf{W}_{n}(\{0\})=1$ , $H\equiv 0$ , and $\mathbf{P}_{\mathbf{W}_{n},n,p}$ is just the cone probability measure on ${\mathbb{B}}_{p}^{n}$ .

(iii)

If $\mathbf{W}_{n}={\rm Gamma}(\alpha,1/p)$ is a gamma distribution with shape parameter $\alpha>0$ and rate $1/p$ , then $\mathbf{P}_{\mathbf{W}_{n},n,p}$ is the beta-type probability measure on ${\mathbb{B}}_{p}^{n}$ with Lebesgue density given by

[TABLE]

In particular, if $\alpha=m/p$ for some $m\in\mathbb{N}$ , this is the image of the cone probability measure $\mathbf{C}_{n+m,p}$ on ${\mathbb{B}}_{p}^{n+m}$ under the orthogonal projection onto the first $n$ coordinates. Similarly, if $\alpha=1+m/p$ , this distribution arises as the image of the uniform distribution $\mathbf{U}_{n+m,p}$ on ${\mathbb{B}}_{p}^{n+m}$ under the same orthogonal projection.

After having discussed the class of distributions we consider, we now turn to our main results.

Remark 1.

Note that although for $0<p<1$ the unit balls ${\mathbb{B}}_{p}^{n}$ are not convex, we decided to include them into our analysis, simply because our results are valid in this regime as well. On the other hand, we leave out the case $p=\infty$ , since in this case we can only treat the uniform distribution on ${\mathbb{B}}_{\infty}^{n}$ and this was already studied in [13].

1.1. Central limit theorems

The first result in this manuscript is a generalization of the central limit theorems [13, Theorem 1.1] and [22, Proposition 2.4] to the broader class of distributions presented above. While the result can in principle be proved in a multivariate form, we prefer to stay in the one-dimensional setting for clarity and for ease of comparison with the moderate and large deviations principles discussed in the next subsections. The theorem below describes the Gaussian fluctuations of the $q$ -norm of vectors chosen at random from the balls ${\mathbb{B}}_{p}^{n}$ according to the measures $\mathbf{P}_{\mathbf{W}_{n},n,p}$ . In this paper, we denote by $\overset{\mathbb{P}}{\longrightarrow}$ and $\overset{d}{\longrightarrow}$ convergence in probability and in distribution, respectively. Moreover, we put

[TABLE]

for any $q>0$ .

Theorem A (Central limit theorem).

Fix $0<p<\infty$ and $0<q<\infty$ . Let $(\mathbf{W}_{n})_{n\in\mathbb{N}}$ be a sequence of Borel probability measures on $[0,\infty)$ . For each $n\in\mathbb{N}$ let $Z_{n}\in{\mathbb{B}}_{p}^{n}$ be distributed according to $\mathbf{P}_{\mathbf{W}_{n},n,p}$ and $W_{n}$ according to $\mathbf{W}_{n}$ . Assume that

[TABLE]

Then

[TABLE]

where $N\sim\mathcal{N}(0,\sigma^{2})$ is a centered Gaussian random variable with variance

[TABLE]

Let us return to the situations (i)–(iii) described above and discuss some special cases of Theorem A. If for each $n$ , $\mathbf{W}_{n}=\mathbf{W}$ for some fixed Borel probability measure $\mathbf{W}$ on $[0,\infty)$ , then assumption (3) is clearly satisfied. In particular, taking $\mathbf{W}$ to be the Dirac measure at zero (recall (ii) above) or the exponential distribution with rate $1/p$ (recall (i) above), we recover the central limit theorem of Schmuckenschläger [22], see also Kabluchko, Prochno, and Thäle [13]. As another example, we fix a sequence of positive real numbers $(a_{n})_{n\in\mathbb{N}}$ such that $a_{n}/\sqrt{n}\to 0$ , as $n\to\infty$ , and let for each $n\in\mathbb{N}$ , $\mathbf{W}_{n}=\Gamma(a_{n},b)$ be the gamma distribution with shape parameter $a_{n}$ and some fixed rate $b\in(0,\infty)$ . Markov’s inequality implies that (3) is satisfied in this case, from which the central limit theorem follows. In particular, taking $b=1/p$ we cover the situation discussed under (iii) above.

Remark 2.

In the case when $p=q$ , the asymptotic variance $\sigma^{2}$ vanishes. In this case, Theorem A just states the distributional convergence of $\sqrt{n}(\|Z_{n}\|_{p}-1)$ to [math].

Remark 3.

Theorem A should be compared with the (multivariate) central limit theorem for $\|Z_{n}\|_{q}$ proved in [19]. The latter is valid under the condition that $\sqrt{n}(1-\|Z_{n}\|_{p})\overset{\mathbb{P}}{\underset{n\to\infty}{\longrightarrow}}0$ , as $n\to\infty$ . One can in fact show (with some efforts, see the previous remark) that our condition (3) implies the one in [19]. However, we prefer to give an alternative and separate argument, since it can be developed further to give a proof of our MDP.

In one of our applications we present in Section 2 below, a slight generalization of Theorem A is needed, where we allow the random variables $W_{n}$ to converge to a non-trivial limiting distribution after a suitable centering and rescaling by $\sqrt{n}$ .

Theorem B (Generalized central limit theorem).

Fix $0<p<\infty$ and $0<q<\infty$ . Let $(\mathbf{W}_{n})_{n\in\mathbb{N}}$ be a sequence of Borel probability measures on $[0,\infty)$ . For each $n\in\mathbb{N}$ let $Z_{n}\in{\mathbb{B}}_{p}^{n}$ be distributed according to $\mathbf{P}_{\mathbf{W}_{n},n,p}$ and $W_{n}$ according to $\mathbf{W}_{n}$ . Assume that

[TABLE]

with $\zeta\sim\mathcal{N}(0,\tau^{2})$ for some $\tau^{2}\geq 0$ , where $(\mu_{n})_{n\in\mathbb{N}}$ is a sequence of non-negative real numbers satisfying $\mu_{n}/n\to\mu\in[0,\infty)$ , as $n\to\infty$ . Then

[TABLE]

where $\tilde{N}\sim\mathcal{N}(0,\tilde{\sigma}^{2})$ is a centered Gaussian random variable with variance

[TABLE]

We emphasize that Theorem B is indeed a generalization of Theorem A. Namely, if (4) is satisfied with $\mu_{n}=0$ for all $n\in\mathbb{N}$ and $\tau^{2}=0$ then $W_{n}/\sqrt{n}\overset{d}{\underset{n\to\infty}{\longrightarrow}}0$ and hence $W_{n}/\sqrt{n}\overset{\mathbb{P}}{\underset{n\to\infty}{\longrightarrow}}0$ , as $n\to\infty$ , so that (3) is satisfied. Moreover, let us briefly mention that Theorem B allows us to consider, for example, a gamma distribution $\Gamma(a_{n},b)$ for $\mathbf{W}_{n}$ with constant rate $b\in(0,\infty)$ and shape parameter $a_{n}\in(0,\infty)$ satisfying $a_{n}/n\to a\in[0,\infty)$ , as $n\to\infty$ . We take advantage of this flexibility in Section 2 below.

1.2. Moderate deviations principle

We will next describe the moderate deviations. A moderate deviations principle (MDP) is formally nothing else than a large deviations principle (LDP) but with important differences in the behavior of the two principles. For instance, while LDPs provide estimates on the scale of a law of large numbers, MDPs describe the probabilities at scales between a law of large numbers and a distributional limit theorem (like a central limit theorem). Moreover, while the rate function in an LDP depends in a subtle way on the distribution of the underlying random variables, the rate function in an MDP in typical situations is non-parametric and given by the Gaussian one inherited from a central limit theorem. Let us recall that a sequence $(X_{n})_{n\in\mathbb{N}}$ of random vectors in $\mathbb{R}^{d}$ ( $d\in\mathbb{N}$ ) satisfies an LDP with speed $s_{n}$ and *‘*good rate function’ $\mathbb{I}:\mathbb{R}^{d}\to[0,\infty]$ if

[TABLE]

for all measurable $A\subseteq\mathbb{R}^{d}$ ( $A^{\circ}$ being the interior and $\overline{A}$ the closure of $A$ ), where $\mathbb{I}$ is lower semi-continuous and has compact level sets $\{x\in\mathbb{R}^{d}\,:\,\mathbb{I}(x)\leq\alpha\}$ , $\alpha\in\mathbb{R}$ . We say in this paper that a sequence $(X_{n})_{n\in\mathbb{N}}$ satisfies an MDP if the speed sequence $(s_{n})_{n\in\mathbb{N}}$ is given by $s_{n}=b_{n}\sqrt{n}$ with a positive sequence $(b_{n})_{n\in\mathbb{N}}$ satisfying $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ , where for two sequences $(x_{n})_{n\in\mathbb{N}}$ and $(y_{n})_{n\in\mathbb{N}}$ we use the Landau notation $x_{n}=o(y_{n})$ if $\lim_{n\to\infty}\frac{x_{n}}{y_{n}}=0$ and $x_{n}=\omega(y_{n})$ if $\lim_{n\to\infty}|\frac{x_{n}}{y_{n}}|=+\infty$ . In our case, the random variables $X_{n}$ are suitably scaled versions of the $q$ -norm of random points in ${\mathbb{B}}_{p}^{n}$ .

The following MDP complements both the central limit theorems (Theorem A, Theorem B and also [13, Theorem 1.1]) as well as the large deviations result proved in [13, Theorem 1.2] and Theorem D below.

Theorem C (Moderate deviations principle).

Fix $0<p<\infty$ and $0<q<\infty$ with $q<p$ . Let $(\mathbf{W}_{n})_{n\in\mathbb{N}}$ be a sequence of Borel probability measures on $[0,\infty)$ and $(b_{n})_{n\in\mathbb{N}}$ be a sequence of positive real numbers satisfying $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ . For each $n\in\mathbb{N}$ let $Z_{n}\in{\mathbb{B}}_{p}^{n}$ be distributed according to $\mathbf{P}_{\mathbf{W}_{n},n,p}$ . Assume that, for all $\delta>0$ ,

[TABLE]

Then the sequence of random variables

[TABLE]

satisfies an MDP with speed $b_{n}^{2}$ and good rate function $\mathbb{I}(t)={t^{2}\over 2\sigma^{2}}$ , $t\in\mathbb{R}$ , where $\sigma^{2}$ is the variance from Theorem A.

In particular, Theorem C implies that, for all $t\in\mathbb{R}$ ,

[TABLE]

Let us briefly return to the special cases (i)–(iii). Clearly, if $\mathbf{W}_{n}$ is the Dirac measure at zero, Assumption (5) is satisfied. This covers case (ii) from above. On the other hand, if for each $n\in\mathbb{N}$ , $\mathbf{W}_{n}=\Gamma(a_{n},b)$ is a gamma distribution with shape parameter $a_{n}\in(0,\infty)$ and rate $b>0$ , we can use the MDP for sums of independent random variables (see Lemma 11 below) to conclude that Assumption (5) is satisfied if $a_{n}=\omega(\sqrt{n}b_{n})$ . Especially, taking $b=1/p$ , this covers cases (i) and (iii).

Remark 4.

If $p=q$ , then the core term for the MDP that we study in Lemma 16 below vanishes and therefore, we do not obtain an MDP with a non-trivial rate function.

1.3. Large deviations principle

The third type of limit theorem we obtain is a large deviations principle. As we shall see in a moment, contrary to the quadratic and non-parametric rate function in the MDP, the LDP is more sensitive to the underlying distribution and displays a significant difference in behavior depending on the parameter $p$ and its relative position with respect to the parameter $q$ .

Theorem D (Large deviations principle).

Fix $0<p<\infty$ and $0<q<\infty$ with $p\neq q$ . Let $(\mathbf{W}_{n})_{n\in\mathbb{N}}$ be a sequence of Borel probability measures on $[0,\infty)$ and for each $n\in\mathbb{N}$ let $W_{n}$ be distributed according to $\mathbf{W}_{n}$ . For each $n\in\mathbb{N}$ let $Z_{n}\in{\mathbb{B}}_{p}^{n}$ be distributed according to $\mathbf{P}_{\mathbf{W}_{n},n,p}$ . Then the sequence of random variables $n^{{1/p}-{1/q}}\|Z_{n}\|_{q}$ satisfies the following LDPs:

(1)

If $q<p$ we assume that the sequence $(W_{n}/n)_{n\in\mathbb{N}}$ satisfies an LDP with speed $n$ and good rate function $\mathbb{I}_{\mathbf{W}}$ . Then the LDP is with speed $n$ and good rate function $\mathbb{I}_{{\bf Z},1}=(\mathbb{I}_{1}+\mathbb{I}_{\mathbf{W}})\circ F^{-1}$ , where $F(x,y,z)=x^{1/q}(y+z)^{-1/p}$ and $\mathbb{I}_{1}=\Lambda^{*}$ is the Legendre-Fenchel transform of the function

[TABLE] 2. (2)

If $q>p$ we assume that sequence $(W_{n}/n)_{n\in\mathbb{N}}$ is exponentially equivalent to [math] in the sense that

[TABLE]

for all $\delta>0$ . Then the LDP is with speed $n^{p/q}$ and good rate function

[TABLE]

We emphasize that while the rate function $\mathbb{I}_{{\bf Z},2}$ for $q>p$ is universal in the sense that it does not depend on $\mathbb{I}_{\mathbf{W}}$ (provided that $\mathbb{I}_{\mathbf{W}}$ does not vanish in a neighborhood of $1$ ), this is not the case for the rate function $\mathbb{I}_{{\bf Z},1}$ for $q<p$ , which in a subtle way depends on $\mathbb{I}_{\mathbf{W}}$ . As examples we consider the special cases (i) and (ii) above. If for each $n\in\mathbb{N}$ , $\mathbf{W}_{n}$ is the Dirac measure at zero, the function $\mathbb{I}_{\mathbf{W}}$ is given by

[TABLE]

Moreover, if $\mathbf{W}_{n}$ is the exponential distribution with parameter $1/p$ for each $n\in\mathbb{N}$ , then

[TABLE]

Remark 5.

If $p=q$ , then the LDP of Theorem D (1) remains valid in a modified form. In fact, it still holds with speed $n$ , but the rate function is then given by $(\widetilde{\Lambda}^{*}+\mathbb{I}_{\mathbf{W}})\circ\widetilde{F}^{-1}$ , where $\widetilde{\Lambda}^{*}$ is the Legendre-Fenchel transform of

[TABLE]

and $\widetilde{F}$ is the function $\widetilde{F}:(t_{1},t_{2})\mapsto t_{1}^{1/p}/(t_{1}+t_{2})^{1/p}$ .

1.4. Structure

The remaining parts of this text are structured as follows. Two applications of our results to random and non-random projections of $\ell_{p}^{n}$ -balls are discussed in Section 2. In Section 3 we rephrase some preliminary results, which are used in proofs of Theorems A, B, C, and D. The latter are contained in Section 4. More precisely, we develop a crucial probabilistic representation for the involved random variables in Section 4.1 and then prove Theorem A in Section 4.2, Theorem B in Section 4.3, Theorem C in Section 4.4, and Theorem D in Section 4.5.

2. Application to projections of $\ell_{p}^{n}$ -balls

2.1. Random versus non-random subspaces

Projections of $\ell_{p}^{n}$ -balls to lower-dimensional subspaces were subject of a number of studies, see, e.g., [1, 2, 11, 14, 16, 17]. In these works two different set-ups were studied, one in which the subspace one projects onto is random, and another one, in which the choice of the subspace is deterministic (for an extensive comparison of both situations for one-dimensional projections we refer the reader to [10, 11]). We shall use the limit theory for the general distributions $\mathbf{P}_{\mathbf{W}_{n},n,p}$ on $\ell_{p}^{n}$ -balls presented in the previous section to compare both approaches. We start by recalling the framework for projections onto random subspaces taken from [1, 2]. We let $(k_{n})_{n\in\mathbb{N}}$ be a sequence of integers satisfying $k_{n}\in\{1,\ldots,n\}$ and $k_{n}/n\to\lambda\in[0,1]$ , as $n\to\infty$ . Moreover, for each $n\in\mathbb{N}$ , let $X_{n}$ be uniformly distributed on ${\mathbb{B}}_{p}^{n}$ and let $E_{n}$ be a uniformly distributed $k_{n}$ -dimensional random subspace (where the uniform distribution refers to the Haar probability measure on the Grassmannian of all $k_{n}$ -dimensional linear subspaces in $\mathbb{R}^{n}$ ). We assume that the two sequences $(X_{n})_{n\in\mathbb{N}}$ and $(E_{n})_{n\in\mathbb{N}}$ are independent. Moreover, we denote by $P_{E_{n}}X_{n}$ the orthogonal projection of $X_{n}$ onto $E_{n}$ . The quantity studied in [1, 2] is the Euclidean norm of the projection of the random vector $X_{n}$ onto the random subspace $E_{n}$ , i.e., $\|P_{E_{n}}X_{n}\|_{2}$ .

We first rephrase the central limit theorem [2, Theorem 1.1]. It says that if $k_{n}\to\infty$ , as $n\to\infty$ , then

[TABLE]

where $N$ is a centered Gaussian random variable with variance

[TABLE]

Observe that taking $\lambda=1$ the constant $\sigma^{2}(p,1)$ coincides with $\sigma^{2}$ from Theorem A if we take $q=2$ there.

Next, we recall the LDP for the same quantities from [1, Theorem 1.2] (for simplicity we restrict ourselves to the case $p<2$ , since only in this case an explicit form of the rate function is available). Using the same notation as before, it says that for any $p\in[1,2)$ the sequence of random variables $n^{{1\over p}-{1\over 2}}\|P_{E_{n}}X_{n}\|_{2}$ satisfies an LDP with speed $n^{p/2}$ and good rate function

[TABLE]

whenever $\lambda:=\lim\limits_{n\to\infty}{k_{n}\over n}\in(0,1]$ .

The projections onto random subspaces as just described can be compared with projections onto sequences of deterministic subspaces. In fact, our distributional framework allows to deal with projections onto coordinate subspaces. Namely, let the sequence $(k_{n})_{n\in\mathbb{N}}$ be as above and let, for each $n\in\mathbb{N}$ , $X_{n}$ be uniformly distributed in the $n$ -dimensional $\ell_{p}^{n}$ -ball ${\mathbb{B}}_{p}^{n}$ with $0<p<\infty$ . We denote by $\Pi_{k_{n}}X_{n}$ the orthogonal projection of $X_{n}$ onto the first $k_{n}$ coordinates. Thus, $\Pi_{k_{n}}$ is the projection from $\mathbb{R}^{n}$ to $\{x=(x_{1},\ldots,x_{n})\in\mathbb{R}^{n}:x_{i}=0\text{ for }i>k_{n}\}$ , which in turn can be identified with $\mathbb{R}^{k_{n}}$ .

Theorem E (Central limit theorem for deterministic projections).

Assume that $k_{n}/n\to\lambda\in(0,1]$ , as $n\to\infty$ . Then,

[TABLE]

where $\tilde{N}$ is a centered Gaussian random variable with variance

[TABLE]

Proof.

Recalling the special case (iii) for $\mathbf{P}_{\mathbf{W}_{k_{n}},k_{n},p}$ from the previous section, we see that the projected random vector $\Pi_{k_{n}}X_{n}$ has distribution $\mathbf{P}_{\mathbf{W}_{k_{n}},k_{n},p}$ on ${\mathbb{B}}_{p}^{k_{n}}$ , where $\mathbf{W}_{k_{n}}=\Gamma({n-k_{n}\over p}+1,{1\over p})$ is a gamma distribution with shape parameter ${n-k_{n}\over p}+1$ and rate $1/p$ . We are going to apply the central limit theorem to the gamma distribution with the aim of verifying condition (4) of Theorem B. Keeping in mind that $k_{n}$ is now the dimension parameter of the projection, we define

[TABLE]

In addition, we have that

[TABLE]

as $n\to\infty$ . Assume, for a moment, that $n-k_{n}\to\infty$ . Then, even though $\tau^{2}$ can vanish, we have $\mathrm{Var}\,W_{k_{n}}\to\infty$ . Under these circumstances, the central limit theorem is applicable to the gamma distribution and yields that

[TABLE]

On the other hand, if $n-k_{n}$ stays bounded, then $\mu_{k_{n}}$ stays bounded, hence the sequence $(W_{k_{n}})_{n\in\mathbb{N}}$ is tight, and since $k_{n}\to\infty$ (recall that $\lambda\neq 0$ ), we conclude that (9) still holds with $\tau^{2}=0$ . Summarizing, we conclude that (9) always holds under the assumptions of the theorem. Indeed assume that (9) is violated. Since the sequence $(W_{k_{n}}^{*})_{n\in\mathbb{N}}$ has uniformly bounded variances, we could pass to a subsequence for which $W_{k_{n}}^{*}$ converges weakly to some distribution different from $\mathcal{N}(0,\tau^{2})$ . Passing one more time to a subsequence, we could assume that either $n-k_{n}\to\infty$ or $n-k_{n}$ is bounded. However, as was explained above, this would lead to a contradiction.

We can thus apply Theorem B with $q=2$ and dimension parameter $k_{n}$ instead of $n$ to conclude that

[TABLE]

where $\tilde{N}$ is a centered normal random variable with variance

[TABLE]

After recalling that $\mu_{k_{n}}=n-k_{n}+p$ , this can be written in the form

[TABLE]

To complete the proof, we need to replace the factor $(n+p)^{1/p}$ by $n^{1/p}$ . That this is always possible can be seen as follows. For $n\in\mathbb{N}$ we define

[TABLE]

Then (10) reads as $a_{n}\xi_{n}-b_{n}\overset{d}{\underset{n\to\infty}{\longrightarrow}}\tilde{N}$ , and our aim is to show that the same is true with $a_{n}$ replaced by $a_{n}^{\prime}$ . To this end we write

[TABLE]

Since $a_{n}^{\prime}/a_{n}\to 1$ , as $n\to\infty$ , the first term converges in distribution to $\tilde{N}$ by Slutsky’s theorem, and it remains to prove that $b_{n}\big{(}1-{a_{n}^{\prime}\over a_{n}}\big{)}\to 0$ . This is done as follows:

[TABLE]

Summarizing, we have shown that (10) is in fact equivalent to

[TABLE]

thus completing the proof. ∎

Theorem E, together with (7), leads us to the remarkable observation that we have the same central limit behavior regardless of whether we project onto uniform random subspaces of dimensions $k_{n}$ or onto deterministic coordinate subspaces of the same dimension, provided their dimension is sufficiently large, i.e., if $k_{n}/n\to 1$ as $n\to\infty$ . Indeed, the centering in both results is the same, and it is easy to check that $\sigma^{2}(p,1)=\tilde{\sigma}^{2}(p,1)$ . On the other hand, if $k_{n}/n\to\lambda\in(0,1)$ , we still have a central limit theorem for the (suitably centered and rescaled) quantities $\|P_{E_{n}}X_{n}\|_{2}$ and $\|\Pi_{k_{n}}X_{n}\|_{2}$ , with the same centering, but this time with different limiting variances $\sigma^{2}(p,\lambda)$ and $\tilde{\sigma}^{2}(p,\lambda)$ , respectively.

A similar comparison as for the central limit theorem can be made on the large deviations scale. We restrict ourselves to the case $1\leq p<2$ and $k_{n}/n\to 1$ , that is $\lambda=1$ . We are interested in large deviations of $\|\Pi_{k_{n}}X_{n}\|_{2}$ , which is distributed as the $2$ -norm of a random vector with the probability law $\mathbf{P}_{\mathbf{W}_{k_{n}},k_{n},p}$ on ${\mathbb{B}}_{p}^{k_{n}}$ , where $\mathbf{W}_{k_{n}}$ is the gamma distribution $\Gamma({n-k_{n}\over p}+1,{1\over p})$ , as above. Let us check that the sequence of random variables $W_{k_{n}}/k_{n}$ with $W_{k_{n}}$ having distribution $\mathbf{W}_{k_{n}}$ is exponentially equivalent to [math] in the sense of (6). Fix some $\delta>0$ . Since $n-k_{n}=o(n)$ , the convolution property of the gamma distribution in its shape parameter entails that, for large $n$ , the random variable $W_{k_{n}}$ is stochastically dominated by a sum $S_{[\delta n/4]}$ of $[\delta n/4]$ i.i.d. $\Gamma(1,{1\over p})$ -distributed random variables. Note that $\mathbb{E}S_{[\delta n/4]}=p[\delta n/4]$ . Moreover, again for $n$ sufficiently large, $\frac{k_{n}}{n}>\frac{p}{2}$ . We deduce from this and Cramér’s theorem (see Lemma 8 below) that for large $n\in\mathbb{N}$

[TABLE]

where $c=c(\delta,p)\in(0,\infty)$ is some constant depending on $\delta$ and $p$ , but since $p\in[1,2)$ the dependence on $p$ can be omitted. Note that the above argument would fail if $k_{n}/n\to\lambda<1$ . Thus,

[TABLE]

since $p<2$ . In this case, Theorem D can be applied with $q=2$ and we obtain an LDP for $k_{n}^{1/p-1/2}\|\Pi_{k_{n}}X_{n}\|_{2}$ with speed $k_{n}^{p/2}$ and the rate function given in Theorem D. Since $k_{n}/n\to 1$ , we conclude that $n^{1/p-1/2}\|\Pi_{k_{n}}X_{n}\|_{2}$ satisfies an LDP with speed $n^{p/2}$ and the same rate function $\mathbb{I}$ as in (8) with $\lambda=1$ there. Again, this shows that the same large deviations behavior is present regardless of whether we project onto uniform random subspaces of dimensions $k_{n}$ or onto deterministic coordinate subspaces of the same dimension, again provided their dimension is sufficiently large in the sense that $k_{n}/n\to 1$ , as $n\to\infty$ .

2.2. $1$ -dimensional random projections of $\ell_{p}^{n}$ -balls

In this section we present another application of our main results demonstrating the advantage of studying the more general distributions $\mathbf{P}_{\mathbf{W},n,p}$ on the $\ell_{p}^{n}$ -balls. In [13, Corollary 2.6], we proved a generalization to $\ell_{p}^{n}$ -balls of a central limit theorem obtained by Paouris, Pivovarov, and Zinn [18, p. 703] and Kabluchko, Litvak, and Zaporozhets [12, Theorem 3.6] for the width of orthogonal projections of the $n$ -dimensional cube ${\mathbb{B}}_{\infty}^{n}$ onto a uniformly distributed random direction. For $1<q<\infty$ with $q\neq 2$ and a random vector chosen from ${\mathbb{S}}^{n-1}$ with respect to the cone probability measure (which in this case coincides with the normalized spherical Lebesgue measure), it was shown in [13] that, as $n\to\infty$ ,

[TABLE]

where $N$ is a centered Gaussian random variable with variance

[TABLE]

Here, $q^{*}$ denotes the Hölder conjugate of $q$ satisfying $\frac{1}{q}+\frac{1}{q^{*}}=1$ , $P_{\theta}$ denotes the orthogonal projection onto the line spanned by $\theta$ , and

[TABLE]

While the argument to obtain this central limit theorem had to be extracted from the proof of the main result [13, Theorem 1.1], it is in our set-up a direct consequence of Theorem A, since we study more general distributions for which the choice $\mathbf{W}_{n}=\delta_{0}$ and $p=2$ yields that $\mathbf{P}_{\mathbf{W}_{n},n,2}$ is just the cone probability measure on ${\mathbb{B}}_{2}^{n}$ . More precisely, to obtain the central limit theorem above, we use the representation (12) and apply Theorem A with the choice $\mathbf{W}_{n}=\delta_{0}$ , $p=2$ , $q$ replaced by $q^{*}$ , and take $Z_{n}=\theta$ .

Beyond the Gaussian fluctuations just described, our results in Theorems C and D concerning moderate and large deviations allow us to deduce the complementing MDPs and LDPs for the length of the orthogonal projection of ${\mathbb{B}}_{q}^{n}$ onto a random direction as well. We start with the description of the moderate deviations behaviour. Using Theorem C with the choice $p=2$ , $\mathbf{W}_{n}=\delta_{0}$ , and $q$ replaced by $q^{*}$ with $q^{*}<p$ , we obtain that the sequence of random variables

[TABLE]

satisfies an MDP with speed $b_{n}^{2}$ and good rate function $\mathbb{I}(t)=t^{2}/(2\sigma^{2}(q))$ , where $(b_{n})_{n\in\mathbb{N}}$ is a sequence of positive real numbers satisfying $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ , and the constant $\sigma^{2}(q)$ is as in (11).

The large deviations are obtained similarly. Using Theorem C with the choice $p=2$ , $\mathbf{W}_{n}=\delta_{0}$ , and $q$ replaced by $q^{*}$ with $q^{*}>p$ (we restrict ourselves to this case, since only in this case we have a closed form expression for the rate function), we obtain that the sequence of random variables

[TABLE]

satisfies an LDP with speed $n^{2/q^{*}}=n^{2-2/q}$ and good rate function

[TABLE]

Finally, we mention that the constant $M_{2}(q^{*})^{1/q^{*}}$ can be explicitly expressed as

[TABLE]

in terms of the parameter $q$ .

3. Preliminaries

In this section we briefly present some background material used throughout the rest of this text. For convenience of the reader, we split this into different subsections that may be skipped depending on the reader’s background.

3.1. Generalized Gaussian random variables

Let us denote, for $0<p<\infty$ , by $(Y_{i})_{i\in\mathbb{N}}$ a sequence of independent copies of a $p$ -generalized Gaussian random variable with Lebesgue density

[TABLE]

where the normalization constant $c_{p}$ is given by $c_{p}:=2p^{1/p}\Gamma(1+\frac{1}{p})$ . Next, recall the definition of the constant $M_{p}(q)$ from (2). It can be used to express first- and second-order moments of $p$ -generalized Gaussian random variables as follows. Namely, for $q,r,s>0$ we have that

[TABLE]

see [2, Lemma 3.1]. Note that $M_{p}(p)=1$ .

The family of $p$ -generalized Gaussian random variables can be used to describe a probabilistic interpretation of the distributions $\mathbf{P}_{\mathbf{W}_{n},n,p}$ that were defined in the introduction. This interpretation is one of the key devices in the proofs of Theorems A, C, and D.

Lemma 6 (Probabilistic interpretation, Theorem 3 in [6]).

Let $0<p<\infty$ , $Y^{(n)}=(Y_{1},\dots,Y_{n})$ be a random vector of independent and $p$ -generalized coordinates, and assume that $W_{n}$ is a non-negative random variable with distribution ${\bf W}_{n}$ , which is independent of $Y^{(n)}$ . Then the random vector

[TABLE]

is distributed according to the measure $\mathbf{P}_{\mathbf{W}_{n},n,p}$ .

3.2. Moderate and large deviations

Let $(X_{n})_{n\in\mathbb{N}}$ be a sequence of random vectors on some probability space $(\Omega,\mathcal{A},\mathbb{P})$ taking values in a Hausdorff topological space $\mathbb{X}$ . Further, let $(s_{n})_{n\in\mathbb{N}}$ be an increasing sequence of real numbers and $\mathbb{I}:\mathbb{X}\to[0,\infty]$ be a lower semi-continuous function with compact level sets $\{x\in\mathbb{R}^{d}\,:\,\mathbb{I}(x)\leq\alpha\}$ for all $\alpha\in\mathbb{R}$ . One says that $(X_{n})_{n\in\mathbb{N}}$ satisfies a large deviations principle (LDP) on $\mathbb{X}$ with speed $s_{n}$ and good rate function $\mathbb{I}$ , provided that

[TABLE]

for all Borel sets $A\subseteq\mathbb{X}$ , where $A^{\circ}$ denotes the interior and $\overline{A}$ the closure of $A$ . As already discussed in the introduction, a moderate deviations principle (MDP) is formally the same as an LDP, but on a different rage of scales.

We shall now present a few basic results from large deviations theory which are needed below. Assume that a sequence $(X_{n})_{n\in\mathbb{N}}$ of random variables satisfies an LDP with speed $s_{n}$ and rate function $I$ . Suppose now that $(Y_{n})_{n\in\mathbb{N}}$ is a sequence of random variables that are ‘close’ to the ones from the first sequence. The next result provides conditions under which in such a situation an LDP from the first can be transferred to the second sequence.

Lemma 7 (Exponential equivalence, Theorem 4.2.13 in [8]).

Let $(X_{n})_{n\in\mathbb{N}}$ and $(Y_{n})_{n\in\mathbb{N}}$ be two sequence of $\mathbb{R}^{d}$ -valued random vectors and assume that $(X_{n})_{n\in\mathbb{N}}$ satisfies an LDP on $\mathbb{R}^{d}$ with speed $s_{n}$ and rate function $\mathbb{I}$ . Further, suppose that the two sequences $(X_{n})_{n\in\mathbb{N}}$ and $(Y_{n})_{n\in\mathbb{N}}$ are exponentially equivalent, which is to say that

[TABLE]

for any $\delta>0$ . Then $(Y_{n})_{n\in\mathbb{N}}$ satisfies an LDP on $\mathbb{R}^{d}$ with the same speed and the same rate function.

Next, we recall what is known as Cramér’s theorem. It provides an LDP for sequences of independent and identically distributed random variables.

Lemma 8 (Cramér’s theorem, Theorem 2.2.3 in [8]).

Let $(X_{n})_{n\in\mathbb{N}}$ be a sequence of i. i. d. random variables. Assume that $\mathbb{E}e^{\lambda X_{1}}<\infty$ for all $|\lambda|<\lambda_{0}$ for some $\lambda_{0}>0$ . Then the sequence of random variables ${1\over n}\sum_{i=1}^{n}X_{i}$ satisfies an LDP on $\mathbb{R}$ with speed $n$ and good rate function $\mathbb{I}(x)=\sup\big{\{}\lambda x-\log\mathbb{E}e^{\lambda X_{1}}:\lambda\in\mathbb{R}\big{\}}$ , i.e., $\mathbb{I}$ is the Legendre-Fenchel transform of the log-moment generating function $\log\mathbb{E}e^{\lambda X_{1}}$ .

Let $d_{1},d_{2}\in\mathbb{N}$ and suppose that $(X_{n})_{n\in\mathbb{N}}$ is a sequence of $\mathbb{R}^{d_{1}}$ -valued random vectors and that $(Y_{n})_{n\in\mathbb{N}}$ is a sequence of $\mathbb{R}^{d_{2}}$ -random vectors. We assume that both sequences satisfy LDPs with the same speed. The next result, taken from [1, Proposition 2.4], yields that also the sequence of $\mathbb{R}^{d_{1}+d_{2}}$ -valued random vectors $(X_{n},Y_{n})$ satisfies an LDP and provides the form of the rate function.

Lemma 9.

Assume that $(X_{n})_{n\in\mathbb{N}}$ satisfies an LDP on $\mathbb{R}^{d_{1}}$ with speed $s_{n}$ and good rate function $\mathbb{I}_{\bf X}$ and that $(Y_{n})_{n\in\mathbb{N}}$ satisfies an LDP on $\mathbb{R}^{d_{2}}$ with speed $s_{n}$ and good rate function $\mathbb{I}_{\bf Y}$ . Then, if $X_{n}$ and $Y_{n}$ are independent for each $n\in\mathbb{N}$ , the sequence of random vectors $(X_{n},Y_{n})$ satisfies an LDP on $\mathbb{R}^{d_{1}+d_{2}}$ with speed $s_{n}$ and good rate function $\mathbb{I}$ given by $\mathbb{I}(x):=\mathbb{I}_{\mathbf{X}}(x_{1})+\mathbb{I}_{\mathbf{Y}}(x_{2})$ , $x=(x_{1},x_{2})\in\mathbb{R}^{d_{1}}\times\mathbb{R}^{d_{2}}$ .

Finally, we consider the possibility to transport a large deviations principle to another one by means of a continuous function, a result which is known as the so-called contraction principle.

Lemma 10 (Contraction principle, Theorem 4.2.1 in [8]).

Let $\mathbb{X}$ and $\mathbb{Y}$ be two Hausdorff topological space and let let $F:\mathbb{X}\to\mathbb{Y}$ be a continuous function. Further, let $(X_{n})_{n\in\mathbb{N}}$ be a sequence of $\mathbb{X}$ -valued random elements that satisfies an LDP with speed $s_{n}$ and good rate function $\mathbb{I}_{\mathbf{X}}$ . Then the sequence $(F(X_{n}))_{n\in\mathbb{N}}$ of $\mathbb{Y}$ -valued random elements satisfies an LDP with the same speed and with good rate function $\mathbb{I}=\mathbb{I}_{\mathbf{X}}\circ F^{-1}$ , i.e.,

[TABLE]

with the convention that $\mathbb{I}(y)=+\infty$ if $F^{-1}(\{y\})=\varnothing$ .

As explained before, a moderate deviations principle is formally nothing else than a large deviations principle and describes (in our set-up) the deviation probabilities at scales between a law of large numbers and a central limit theorem. An important tool for us will be the following MDP for sums of independent and identically distributed random vectors.

Lemma 11 (MDP for sums of random vectors, Theorem 3.7.1 in [8]).

Let $(X_{n})_{n\in\mathbb{N}}$ be a sequence of independent and identically distributed random vectors in $\mathbb{R}^{d}$ and let $(s_{n})_{n\in\mathbb{N}}$ be sequence of positive real numbers such that $s_{n}=\omega(\sqrt{n})$ and $s_{n}=o(n)$ . We assume that $X_{1}$ is centered, its covariance matrix $\mathbf{C}=\mathrm{Cov}(X_{1})$ is invertible, and $\log\mathbb{E}\,e^{\langle\lambda,X_{1}\rangle}<\infty$ for all $\lambda$ in a ball around the origin having positive radius. Then the sequence of random vectors $\frac{1}{s_{n}}\sum_{i=1}^{n}X_{i}$ , $n\in\mathbb{N}$ , satisfies an LDP with speed $s_{n}^{2}/n$ (i.e., an MDP) and good rate function $\mathbb{I}(x)={1\over 2}\langle x,\mathbf{C}^{-1}x\rangle$ , $x\in\mathbb{R}^{d}$ .

Remark 12.

There exist versions of Lemma 11 under less restrictive assumptions on the (exponential) moments of the involved random vectors, see [3], for example. However, such results do not lead to simplifications or improvements in our situation.

4. Proof of the main results

4.1. A probabilistic representation for the $q$ -norm

In a first step we develop a probabilistic representation for the random variables $\|Z_{n}\|_{q}$ , which will turn out to be useful for both, the proof of the central limit theorems and the moderate deviations principle. In what follows we let $Y_{1},Y_{2},\ldots$ be a sequence of independent $p$ -generalized Gaussian random variables and define, for each $n\in\mathbb{N}$ ,

[TABLE]

where $0<p,q<\infty$ .

Lemma 13 (Probabilistic interpretation).

Fix $0<p<\infty$ , $0<q<\infty$ and $n\in\mathbb{N}$ . Let $\mathbf{W}_{n}$ be a Borel probability measure on $[0,\infty)$ . Let $Z_{n}\in{\mathbb{B}}_{p}^{n}$ be distributed according to $\mathbf{P}_{\mathbf{W}_{n},n,p}$ and $W_{n}$ be distributed according to $\mathbf{W}_{n}$ and independent of $Y_{1},Y_{2},\ldots$ . Then

[TABLE]

where $\Psi_{p}:\mathbb{R}^{3}\to\mathbb{R}$ is such that, for some $M,\delta>0$ , we have $|\Psi_{p}(x,y,z)|\leq M\|(x,y,z)\|_{2}^{2}$ whenever $\|(x,y,z)\|_{2}^{2}<\delta$ .

Proof.

We first observe that as a consequence of Lemma 6 the random vector $Z_{n}$ has the probabilistic representation

[TABLE]

where $Y^{(n)}=(Y_{1},\ldots,Y_{n})$ is a vector of independent $p$ -generalized Gaussian random variables and $W_{n}$ is a random variable with distribution $\mathbf{W}_{n}$ , which is independent of $Y^{(n)}$ . Thus

[TABLE]

Recalling the definitions of the random variables $S_{n}^{(1)}$ and $S_{n}^{(2)}$ , we can rewrite the last expression as

[TABLE]

Next, we define the function

[TABLE]

where $D_{F}$ stands for the domain of $F$ . Clearly, some open neighborhood of $(0,0,0)$ is contained in $D_{F}$ , and a Taylor expansion of $F$ around $(0,0,0)$ shows that for all $(x,y,z)\in D_{F}$ ,

[TABLE]

where the function $\Psi_{p}:D_{F}\to\mathbb{R}$ is such that, for some $M,\delta>0$ , we have $|\Psi_{p}(x,y,z)|\leq M\|(x,y,z)\|_{2}^{2}$ whenever $\|(x,y,z)\|_{2}^{2}<\delta$ . Combining this with the representation (14) for $\|Z_{n}\|_{q}$ proves the claim. ∎

4.2. Proof of the central limit theorem (Theorem A)

For each $n\in\mathbb{N}$ let us define the random variable

[TABLE]

It follows from Lemma 13 that

[TABLE]

For any $n\in\mathbb{N}$ , we decompose $V_{n}$ into the random variables

[TABLE]

Slutsky’s theorem (see [7, Proposition A.42 (b)]) completes the proof of Theorem A once we show that

[TABLE]

where $N\sim\mathcal{N}(0,\sigma^{2})$ is the centered Gaussian random variable as in Theorem A.

Assumption (3) says that $W_{n}/\sqrt{n}$ converges in distribution to [math], as $n\to\infty$ . Therefore, the multivariate central limit theorem applied to $(S_{n}^{(1)},S_{n}^{(2)})$ and the continuous mapping theorem yield

[TABLE]

where $(\xi,\eta)$ is a centered Gaussian random vector in $\mathbb{R}^{2}$ with covariance matrix $\Sigma$ given by

[TABLE]

As a consequence, ${\xi\over qM_{p}(q)}-{\eta\over p}$ is a centered Gaussian random variable $N$ with variance

[TABLE]

Finally, we shall argue that $R_{n}\overset{\mathbb{P}}{\underset{n\to\infty}{\longrightarrow}}0$ . To this end, we write

[TABLE]

Since there exist $\delta,M>0$ such that $|\Psi_{p}(x,y,z)|\leq M\|(x,y,z)\|_{2}^{2}$ whenever $\|(x,y,z)\|_{2}^{2}<\delta$ , we obtain

[TABLE]

The weak law of large numbers ensures that, as $n\to\infty$ , the first two probabilities converge to zero, while our assumption (3) on the random variables $W_{n}$ ensures that the last probability tends to zero as well. Thus, for any $\varepsilon>0$ , we have that

[TABLE]

Again by the weak law of large numbers we have that $S_{n}^{(1)}/\sqrt{n}$ and $S_{n}^{(2)}/\sqrt{n}$ both converge to zero in probability, as $n\to\infty$ . Moreover, the central limit theorem implies that $S_{n}^{(1)}$ and $S_{n}^{(2)}$ converge in distribution to non-degenerate Gaussian random variables. Hence, Slutsky’s theorem implies that $(S_{n}^{(1)})^{2}/\sqrt{n}$ and $(S_{n}^{(2)})^{2}/\sqrt{n}$ both converge to zero in distribution. Since the random variables are defined on the same probability space and because the limit is (almost surely) constant, we even have that $(S_{n}^{(1)})^{2}/\sqrt{n}$ and $(S_{n}^{(2)})^{2}/\sqrt{n}$ converge to zero in probability. Finally, $W_{n}^{2}/n^{3/2}$ also converges to zero in probability by our assumption (3). Thus, the first probability in the last expression also converges to zero, while the second summand has already been treated before. As a consequence, we conclude that indeed

[TABLE]

which completes the argument. $\Box$

4.3. Proof of the generalized central limit theorem (Theorem B)

Since the proof of Theorem B is very similar to the one of Theorem A, we restrict ourselves to the details that need to be adapted.

First of all, we recall that

[TABLE]

Then, following with minimal changes the proof of Lemma 13, we obtain

[TABLE]

We define for each $n\in\mathbb{N}$ the random variable

[TABLE]

In the same way as in the proof of Lemma 13, one shows that $V_{n}\stackrel{{\scriptstyle d}}{{=}}T_{n}+R_{n}$ with

[TABLE]

Thus, using Slutsky’s theorem, we conclude the result of Theorem B once we have shown that

[TABLE]

where $\tilde{N}\sim\mathcal{N}(0,\tilde{\sigma}^{2})$ is the Gaussian random variable from the statement of Theorem B.

We start with the assertion on the sequence $T_{n}$ . First of all, we notice that by Assumption (4), the multivariate central limit theorem applied to $(S_{n}^{(1)},S_{n}^{(2)})$ , and the continuous mapping theorem,

[TABLE]

where $\zeta\sim\mathcal{N}(0,\tau^{2})$ is independent of the centered Gaussian random vector $(\xi,\eta)$ in $\mathbb{R}^{2}$ with covariance matrix $\Sigma$ given by

[TABLE]

The limiting variable $\tilde{N}$ is centered Gaussian. To compute its variance, observe that

[TABLE]

Thus, the limiting variance is given by

[TABLE]

where the second line follows by recalling (2) and performing computations with gamma functions.

To show that $R_{n}\overset{\mathbb{P}}{\underset{n\to\infty}{\longrightarrow}}0$ , as $n\to\infty$ , we can in principle follow the lines of the proof of Theorem A, but we have to replace the terms $S_{n}^{(2)}\over\sqrt{n}$ and $W_{n}\over n$ there by $S_{n}^{(2)}\over\sqrt{n}(1+{\mu_{n}\over n})$ and $W_{n}^{*}\over\sqrt{n}(1+{\mu_{n}\over n})$ , respectively. In particular, in a first step this results in showing that both sequences converge in distribution to [math], that is for every fixed $\delta>0$ ,

[TABLE]

as $n\to\infty$ . Both claims easily follow from the Slutsky theorem after recalling that both $S_{n}^{(2)}$ and $W_{n}^{*}$ converge in distribution to normal random variables, and that $1+\frac{\mu_{n}}{n}\to 1+\mu$ .

Moreover, in a second step one needs to argue that for any fixed $\varepsilon,M>0$ ,

[TABLE]

as $n\to\infty$ . Recall that all three sequences $S_{n}^{(1)},S_{n}^{(2)},W_{n}^{*}$ converge in distribution to normal random variables. For the former two sequences, this follows from the central limit theorem, whereas the claim for $W_{n}^{*}$ is a consequence of our assumption (4). Again by a Slutsky-type argument, the sequences $(S_{n}^{(1)})^{2}/\sqrt{n}$ , $(S_{n}^{(2)})^{2}/(\sqrt{n}(1+\frac{\mu_{n}}{n})^{2})$ and $(W_{n}^{*})^{2}/(\sqrt{n}(1+\frac{\mu_{n}}{n})^{2})$ converge to zero in probability, hence so does their sum. This establishes (17) and hence (16), which completes the proof of Theorem B. $\Box$

4.4. Proof of the moderate deviations principle (Theorem C)

Let $(b_{n})_{n\in\mathbb{N}}$ be a sequence of positive real numbers such that $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ . As in the proof of the central limit theorem, we consider the sequence of random variables

[TABLE]

and observe that Lemma 13 implies

[TABLE]

where $\Psi_{p}:\mathbb{R}^{3}\to\mathbb{R}$ is such that $|\Psi_{p}(x,y,z)|\leq M\|(x,y,z)\|_{2}^{2}$ whenever $\|(x,y,z)\|_{2}^{2}<\delta$ for some $M,\delta>0$ .

Our strategy to prove the moderate deviations principle of Theorem C is as follows:

We prove a bivariate moderate deviations principle for the sequence of rescaled random vectors $b_{n}^{-1}(S_{n}^{(1)},S_{n}^{(2)})$ in $\mathbb{R}^{2}$ .

2.

We apply the contraction principle to deduce a moderate deviations principle for the linear combination $S_{n}^{(1)}/(b_{n}\,qM_{p}(q))-S_{n}^{(2)}/(p\,b_{n})$ .

3.

We show that the sequence of random variables $V_{n}/b_{n}$ is exponentially equivalent to the sequence formed in step 2.

We start with the first step of the proof.

Lemma 14 (Bivariate MDP).

Fix $0<p<\infty$ and $0<q<\infty$ with $q<p$ . Let $(b_{n})_{n\in\mathbb{N}}$ be a sequence of positive real numbers such that $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ and consider the random vectors

[TABLE]

Then the sequence of random vectors $S_{n}/b_{n}$ satisfies an MDP on $\mathbb{R}^{2}$ with speed $b_{n}^{2}$ and good rate function

[TABLE]

where $c_{p,q}:=(p+q^{2})\Gamma({1+q\over p})^{2}-p\Gamma({1\over p})\Gamma({1+2q\over p})$ .

Proof.

First, we observe that $S_{n}$ is a sum of centered i. i. d. random vectors in $\mathbb{R}^{2}$ with covariance matrix

[TABLE]

given by

[TABLE]

The moment generating function of the random vector $(|Y_{1}|^{q}-M_{p}(q),|Y_{1}|^{p}-1)$ on $\mathbb{R}^{2}$ is given by

[TABLE]

Since $q<p$ , the function $M$ is finite on $\mathbb{R}\times(-\infty,1/p)$ , a set which contains the origin $(0,0)\in\mathbb{R}^{2}$ in its interior. Therefore, Lemma 11 (with the choice $s_{n}=\sqrt{n}b_{n}$ there) implies that the sequence of random variables $S_{n}/b_{n}$ satisfies an MDP on $\mathbb{R}^{2}$ with speed $b_{n}^{2}$ and good rate function

[TABLE]

Inserting the values for $c_{11},c_{22}$ and $c_{12}=c_{21}$ , and simplifying the resulting expression proves the claim. ∎

Remark 15.

In the previous proof we used our assumption that $q<p$ in order to verify the finiteness of certain exponential moments. As already discussed in Remark 12 above, there exist version of the MDP for sums of independent random vectors not requiring the finiteness of such exponential moments. However, also when applying such weaker versions from [3], for example, the assumption that $q<p$ is in fact needed.

We continue with the second step and use the contraction principle to obtain an MDP the linear combinations of $S_{n}^{(1)}$ and $S_{n}^{(2)}$ .

Lemma 16 (MDP for the core term).

Let $(b_{n})_{n\in\mathbb{N}}$ be s sequence of positive real numbers such that $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ . Then the sequence of random variables

[TABLE]

satisfies an MDP on $\mathbb{R}$ with speed $b_{n}^{2}$ and good rate function $\mathbb{I}_{2}(t)=t^{2}/(2\sigma^{2})$ , where $\sigma^{2}$ is the constant in Theorem A.

Proof.

Consider the continuous function

[TABLE]

and observe that, for each $n\in\mathbb{N}$ , the random variable ${S_{n}^{(1)}\over b_{n}\,qM_{p}(q)}-{S_{n}^{(2)}\over p\,b_{n}}$ has the same distribution as $G(S_{n}/b_{n})$ , where $S_{n}$ was defined in Lemma 14. Thus, the contraction principle (see Lemma 10) implies the desired MDP with speed $b_{n}^{2}$ and good rate function

[TABLE]

This optimization problem leads us to the Lagrangian

[TABLE]

and the Lagrange multiplier equations

(i)

$\frac{c_{22}}{c_{11}c_{22}-c_{12}^{2}}\,x-\frac{c_{12}}{c_{11}c_{22}-c_{12}^{2}}\,y+\frac{\lambda}{qM_{p}(q)}=0$ , 2. (ii)

$\frac{c_{11}}{c_{11}c_{22}-c_{12}^{2}}\,y-\frac{c_{12}}{c_{11}c_{22}-c_{12}^{2}}\,x-\frac{\lambda}{p}=0$ , 3. (iii)

$\frac{x}{qM_{p}(q)}-\frac{y}{p}-t=0$ ,

where $c_{11}$ , $c_{22}$ , and $c_{12}$ are the entries of the covariance matrix given by (19). This yields the critical value

[TABLE]

and from a direct (but tedious) computation, we obtain the explicit quadratic form of the rate function. We refrain from providing the details of the computation. ∎

We will now proceed with the third step and prove the exponential equivalence. In what follows, we let the random vectors $S_{n}$ be as in (18), the random variables $V_{n}$ as in (15), and the function $G$ be given by (20).

Lemma 17 (Exponential equivalence - MDP).

Let $(b_{n})_{n\in\mathbb{N}}$ be s sequence of positive real numbers such that $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ . Then the sequences of random variables $G(S_{n}/b_{n})$ and $V_{n}/b_{n}$ are exponentially equivalent.

Proof.

We start by recalling that, for each $n\in\mathbb{N}$ ,

[TABLE]

and

[TABLE]

Let us fix $\varepsilon>0$ . We observe that

[TABLE]

where we used that $W_{n}$ is a non-negative random variable for each $n\in\mathbb{N}$ . The function $\Psi_{p}$ is the same as in Lemma 13. Assumption (5) (with $\delta=p\frac{\varepsilon}{2}$ there) implies

[TABLE]

To discuss the second term, we first write

[TABLE]

where $M\in(0,\infty)$ is the parameter from Lemma 13. For the first summand in the previous expression, we obtain the estimate

[TABLE]

The first two terms both decay like $e^{-cn}$ for a suitable $c\in(0,\infty)$ by Cramér’s theorem (see Lemma 8). For the last term, we use again condition (5) (with $\delta=\sqrt{\varepsilon/(6M)}$ there) and obtain

[TABLE]

where we also used that $b_{n}=o(\sqrt{n})$ . As a consequence,

[TABLE]

since $b_{n}=\omega(1)$ and $b_{n}=o(\sqrt{n})$ . Recalling the definition and the properties of the function $\Psi_{p}$ from Lemma 13, we obtain for sufficiently large $n$

[TABLE]

where we also used that $b_{n}=o(\sqrt{n})$ . Again by Cramér’s theorem (see Lemma 8), the first two terms decay like $e^{-cn}$ for suitable $c\in(0,\infty)$ and their sum is bounded by $2e^{-cn}$ for sufficiently large $n$ . Using this together with assumption (5), we obtain

[TABLE]

where we used that $b_{n}=o(\sqrt{n})$ . Putting everything together and using [8, Lemma 1.2.15], we get

[TABLE]

Since $\varepsilon>0$ was arbitrary, this shows the exponential equivalence that was claimed in the lemma. ∎

Proof of Theorem C.

The MDP is now a direct consequence of Lemma 7 together with the MDP for the core term (see Lemma 16) and the exponential equivalence (see Lemma 17). ∎

4.5. Proof of the large deviations principles (Theorem D)

In this last section we present the proof of the large deviations principles in Theorem D. On the way, we shall use some results we have obtained in [13]. In what follows, we assume that for each $n\in\mathbb{N}$ , $Y^{(n)}=(Y_{1},\dots,Y_{n})$ is a vector of independent $p$ -generalized Gaussian random variables, and we assume that $(Y^{(n)})_{n\in\mathbb{N}}$ and $(W_{n})_{n\in\mathbb{N}}$ are independent.

4.5.1. The case $q<p$

We start by recalling that, for each $n\in\mathbb{N}$ , we have the distributional equality

[TABLE]

see the proof of Lemma 13. In the proof of Theorem 1.2 in [13], we have already seen that the sequence of random vectors

[TABLE]

satisfies an LDP on $\mathbb{R}^{2}$ with speed $n$ and a good rate function $\mathbb{I}_{1}(t_{1},t_{2})$ . More precisely, thanks to Cramér’s theorem (see Lemma 8) $\mathbb{I}_{1}$ can be identified as the Legendre-Fenchel transform $\Lambda^{*}$ of the function

[TABLE]

Since $(Y^{(n)})_{n\in\mathbb{N}}$ and $(W_{n})_{n\in\mathbb{N}}$ are assumed to be independent and since $(W_{n}/n)_{n\in\mathbb{N}}$ satisfies an LDP with speed $n$ and good rate function $\mathbb{I}_{\mathbf{W}}$ , the sequence of random vectors

[TABLE]

satisfies an LDP on $\mathbb{R}^{3}$ with good rate function $\mathbb{I}_{2}$ given by

[TABLE]

where we used Lemma 9. Next, we consider the mapping

[TABLE]

which is continuous on its domain. Clearly,

[TABLE]

for each $n\in\mathbb{N}$ . Therefore, we can apply the contraction principle (see Lemma 10) to conclude that $(n^{1/p-1/q}\|Z_{n}\|_{q})_{n\in\mathbb{N}}$ satisfies an LDP with speed $n$ and good rate function $\mathbb{I}_{{\bf Z},1}=\mathbb{I}_{2}\circ F^{-1}$ . This completes the argument. $\Box$

Remark 18.

The assumption that $q<p$ was used only in disguise above and is behind the LDP for the sequence of random vectors in (21). Indeed and as indicated above, the proof of this LDP is based on Cramér’s theorem, which in turn requires finiteness of some exponential moments, or equivalently, that the origin is an interior point of the domain of the function $\Lambda$ defined in (22). However, from the definition of this function it is clear that this can only be the case if $q<p$ .

4.5.2. The case $q>p$

As was shown in the proof of [13, Theorem 1.3], the sequence of random variables

[TABLE]

satisfies an LDP with speed $n^{p/q}$ and good rate function

[TABLE]

We will now prove that the two sequences $(U_{n})_{n\in\mathbb{N}}$ and $(n^{1/p-1/q}\|Z_{n}\|_{q})_{n\in\mathbb{N}}$ are exponentially equivalent.

Lemma 19 (Exponential equivalence - LDP).

The sequences $(U_{n})_{n\in\mathbb{N}}$ and $(n^{1/p-1/q}\|Z_{n}\|_{q})_{n\in\mathbb{N}}$ are exponentially equivalent with rate $n^{p/q}$ .

Proof.

As we have seen in the proof of Lemma 13, one has that

[TABLE]

for each $n\in\mathbb{N}$ . Let $\eta\in(0,\infty)$ . Then, for every $\varepsilon\in(0,1)$ , we obtain

[TABLE]

Let us consider the second term. Write

[TABLE]

and note that $A_{1}(\varepsilon)>1$ and $A_{2}(\varepsilon)>0$ for all $\varepsilon\in(0,1)$ . This leads to the estimate

[TABLE]

By Cramér’s theorem, the first term in the previous line decays exponentially like $e^{-c_{1}n}$ , since $A_{1}(\varepsilon)>1$ for all $\varepsilon\in(0,1)$ . In fact, the rate function in the corresponding LDP does not vanish in $O\setminus\{1\}$ , where $O\subset\mathbb{R}$ is an open neighborhood of $1$ , which implies that the constant $c_{1}$ stays strictly positive when letting $\varepsilon\to 0$ . In combination with our Assumption (6) (applied with $\delta=A_{2}(\varepsilon)$ ) this shows that

[TABLE]

In the first inequality above, we have used the elementary fact (see, e.g., [8, Lemma 1.2.15]) that for families of non-negative real numbers $a_{1}(\delta),a_{2}(\delta)$ , $\delta>0$ one has that

[TABLE]

The remaining term

[TABLE]

can be treated in the same way.

Putting everything together, we obtain from the LDP for the sequence $(U_{n})_{n\in\mathbb{N}}$ that

[TABLE]

When $\varepsilon\to 0$ , the expression above tends to $-\infty$ . Hence, the two sequences $(U_{n})_{n\in\mathbb{N}}$ and $(n^{1/p-1/q}\|Z_{n}\|_{q})_{n\in\mathbb{N}}$ are indeed exponentially equivalent. ∎

Proof of Theorem D, part (2) .

The proof of is now a direct consequence of Lemma 19 combined with the fact that $(U_{n})_{n\in\mathbb{N}}$ satisfies an LDP with speed $n^{p/q}$ and good rate function $\mathbb{I}_{\bf U}$ given by (23). ∎

Acknowledgement

We would also like to thank Nicola Turchi for exchanges about the topics of this paper.

ZK has been supported by the German Research Foundation under Germany’s Excellence Strategy EXC 2044 – 390685587, Mathematics Münster: Dynamics - Geometry - Structure. JP has been supported by a Visiting International Professor Fellowship from the Ruhr University Bochum and its Research School PLUS, by the Austrian Science Fund (FWF) Project F5508-N26, which is part of the Special Research Program “Quasi-Monte Carlo Methods: Theory and Applications”, and by the FWF Project P32405 “Asymptotic Geometric Analysis and Applications”. ZK and CT have been supported by the DFG Scientific Network Cumulants, Concentration and Superconcentration.

Bibliography24

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] D. Alonso-Gutiérrez, J. Prochno, and C. Thäle. Large deviations for high-dimensional random projections of ℓ p n superscript subscript ℓ 𝑝 𝑛 \ell_{p}^{n} -balls. Adv. in Appl. Math. , 99:1–35, 2018.
2[2] D. Alonso-Gutierrez, J. Prochno, and C. Thäle. Gaussian fluctuations for high-dimensional random projections of ℓ p n superscript subscript ℓ 𝑝 𝑛 \ell_{p}^{n} -balls. Bernoulli , 2019+.
3[3] M.A. Arcones. Moderate deviations of empirical processes. In Stochastic inequalities and applications , volume 56 of Progr. Probab. , pages 189–212. Birkhäuser, Basel, 2003.
4[4] I. Bárány and C. Thäle. Intrinsic volumes and Gaussian polytopes: the missing piece of the jigsaw. Doc. Math. , 22:1323–1335, 2017.
5[5] I. Bárány and V. Vu. Central limit theorems for Gaussian polytopes. Ann. Probab. , 35(4):1593–1621, 2007.
6[6] F. Barthe, O. Guédon, S. Mendelson, and A. Naor. A probabilistic approach to the geometry of the ℓ p n subscript superscript ℓ 𝑛 𝑝 \ell^{n}_{p} -ball. Ann. Probab. , 33(2):480–513, 2005.
7[7] R.F. Bass. Stochastic Processes , volume 33 of Cambridge Series in Statistical and Probabilistic Mathematics . Cambridge University Press, Cambridge, 2011.
8[8] A. Dembo and O. Zeitouni. Large Deviations. Techniques and Applications , volume 38 of Stochastic Modelling and Applied Probability . Springer-Verlag, Berlin, 2010. Corrected reprint of the second (1998) edition.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

High-dimensional limit theorems

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction and main results

Remark 1**.**

1.1. Central limit theorems

Theorem A** (Central limit theorem).**

Remark 2**.**

Remark 3**.**

Theorem B** (Generalized central limit theorem).**

1.2. Moderate deviations principle

Theorem C** (Moderate deviations principle).**

Remark 4**.**

1.3. Large deviations principle

Theorem D** (Large deviations principle).**

Remark 5**.**

1.4. Structure

2. Application to projections of ℓpn\ell_{p}^{n}ℓpn​-balls

2.1. Random versus non-random subspaces

Theorem E** (Central limit theorem for deterministic projections).**

Proof.

2.2. 111-dimensional random projections of ℓpn\ell_{p}^{n}ℓpn​-balls

3. Preliminaries

3.1. Generalized Gaussian random variables

Lemma 6** (Probabilistic interpretation, Theorem 3 in [6]).**

3.2. Moderate and large deviations

Lemma 7** (Exponential equivalence, Theorem 4.2.13 in [8]).**

Lemma 8** (Cramér’s theorem, Theorem 2.2.3 in [8]).**

Lemma 9**.**

Lemma 10** (Contraction principle, Theorem 4.2.1 in [8]).**

Lemma 11** (MDP for sums of random vectors, Theorem 3.7.1 in [8]).**

Remark 12**.**

4. Proof of the main results

4.1. A probabilistic representation for the qqq-norm

Lemma 13** (Probabilistic interpretation).**

Proof.

4.2. Proof of the central limit theorem (Theorem A)

4.3. Proof of the generalized central limit theorem (Theorem B)

4.4. Proof of the moderate deviations principle (Theorem C)

Lemma 14** (Bivariate MDP).**

Proof.

Remark 15**.**

Lemma 16** (MDP for the core term).**

Proof.

Lemma 17** (Exponential equivalence - MDP).**

Proof.

Proof of Theorem C.

4.5. Proof of the large deviations principles (Theorem D)

4.5.1. The case q<pq<pq<p

Remark 18**.**

4.5.2. The case q>pq>pq>p

Lemma 19** (Exponential equivalence - LDP).**

Proof.

Proof of Theorem D, part (2) .

Acknowledgement

Remark 1.

Theorem A (Central limit theorem).

Remark 2.

Remark 3.

Theorem B (Generalized central limit theorem).

Theorem C (Moderate deviations principle).

Remark 4.

Theorem D (Large deviations principle).

Remark 5.

2. Application to projections of $\ell_{p}^{n}$ -balls

Theorem E (Central limit theorem for deterministic projections).

2.2. $1$ -dimensional random projections of $\ell_{p}^{n}$ -balls

Lemma 6 (Probabilistic interpretation, Theorem 3 in [6]).

Lemma 7 (Exponential equivalence, Theorem 4.2.13 in [8]).

Lemma 8 (Cramér’s theorem, Theorem 2.2.3 in [8]).

Lemma 9.

Lemma 10 (Contraction principle, Theorem 4.2.1 in [8]).

Lemma 11 (MDP for sums of random vectors, Theorem 3.7.1 in [8]).

Remark 12.

4.1. A probabilistic representation for the $q$ -norm

Lemma 13 (Probabilistic interpretation).

Lemma 14 (Bivariate MDP).

Remark 15.

Lemma 16 (MDP for the core term).

Lemma 17 (Exponential equivalence - MDP).

4.5.1. The case $q<p$

Remark 18.

4.5.2. The case $q>p$

Lemma 19 (Exponential equivalence - LDP).