On the geometry of random polytopes

Shahar Mendelson

arXiv:1902.01664 · math.FA · February 6, 2019


TL;DR

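This paper gives a simple proof of a recent result: with high probability, the absolute convex hull of the rows of an $N\times n$ random matrix with independent, symmetric, variance $1$ entries contains a constant multiple of $B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}$, under minimal assumptions on the entries.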

Contribution

It offers a short proof, based on the small-ball method, of this inclusion result for random polytopes generated by the rows of a random matrix with independent symmetric entries.

Findings

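01. When $N\geq c_{1}n$, the absolute convex hull of the rows contains $c_{2}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)$.

02. The inclusion holds with probability at least $1-2\exp(-cN^{1-\alpha}n^{\alpha})$ for $0<\alpha<1$.

03. Beyond symmetry and variance $1$, the only assumption on the entries is a small-ball condition $Pr(|\xi|\geq\kappa)\geq\delta$.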

Abstract

We present a simple proof of a fact recently established in [5]: let $\xi$ be a symmetric random variable that has variance $1$, let $\Gamma=(\xi_{ij})$ be an $N\times n$ random matrix whose entries are independent copies of $\xi$, and set $X_{1},...,X_{N}$ to be the rows of $\Gamma$. Then under minimal assumptions on $\xi$ and as long as $N\geq c_{1}n$, $c_{2}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N})$ with high probability.

Equations

$$c_{2}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N})$$

$$K={\rm absconv}(X_{1},...,X_{N})=\Gamma^{*}B_{1}^{N};$$

$$c_{1}(\alpha)\sqrt{\log(eN/n)}B_{2}^{n}\subset{\rm absconv}(X_{1},...,X_{N})$$

$$c_{2}(\alpha)\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N}),$$

$$c_{2}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N})$$

$$Pr(|\xi|\geq\kappa)\geq\delta.$$

$$c_{3}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N}).$$

$$K={\rm absconv}(X_{1},...,X_{N})=\Gamma^{*}B_{1}^{N}$$

$$L=\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr).$$

$$Pr\bigl(\exists z\in\partial L^{\circ}:\ \|\Gamma z\|_{\infty}\leq c_{0}\bigr)\leq 2\exp(-c_{1}N^{1-\alpha}n^{\alpha}).$$

$$\inf_{z\in\partial L^{\circ}}\|\Gamma z\|_{\infty}\geq c_{0},$$

$$\inf_{z\in\partial L^{\circ}}\bigl|\{i:|\left\langle z,X_{i}\right\rangle|\geq c_{0}\}\bigr|$$

$$Pr\bigl(|\left\langle z,X\right\rangle|\geq 2c_{0}\bigr)\geq 4\Bigl(\frac{n}{N}\Bigr)^{\alpha}.$$

$$\bigl|\{i:|\left\langle z,X_{i}\right\rangle|\geq 2c_{0}\}\bigr|\geq 2N^{1-\alpha}n^{\alpha}.$$

$$\sup_{z\in\partial L^{\circ}}\bigl|\{i:|\left\langle z-\pi z,X_{i}\right\rangle|\geq c_{0}\}\bigr|\leq N^{1-\alpha}n^{\alpha}$$

$$1-2\exp(-c(\alpha)N^{1-\alpha}n^{\alpha}),$$

$$\bigl|\{i:|\left\langle\pi z,X_{i}\right\rangle|\geq 2c_{0}\}\bigr|\geq 2N^{1-\alpha}n^{\alpha}$$

$$\bigl|\{i:|\left\langle z-\pi z,X_{i}\right\rangle|\geq c_{0}\}\bigr|\leq N^{1-\alpha}n^{\alpha}.$$

$$|\left\langle z,X_{i}\right\rangle|\geq|\left\langle\pi z,X_{i}\right\rangle|-|\left\langle z-\pi z,X_{i}\right\rangle|\geq c_{0},$$

$$\inf_{z\in\partial L^{\circ}}\bigl|\{i:|\left\langle z,X_{i}\right\rangle|\geq c_{0}\}\bigr|\geq N^{1-\alpha}n^{\alpha};$$

$$Pr\Bigl(\Bigl|\sum_{i=1}^{n}\varepsilon_{i}z_{i}\Bigr|>t\Bigr)$$

$$Pr\bigl(|\left\langle z,X\right\rangle|\geq c^{\prime}\bigr)\geq 2\exp(-c^{\prime\prime}r).$$

$$\|z\|_{L_{r}^{\circ}}\leq\sum_{i=1}^{r}z_{i}^{*}+\sqrt{r}\bigl(\sum_{i>r}(z_{i}^{2})^{*}\bigr)^{1/2}\leq c_{0}\|z\|_{L_{r}^{\circ}},$$

$$\frac{\|z\|_{L_{r}^{\circ}}}{\sqrt{2}}\leq\sum_{j=1}^{r}\bigl(\sum_{i\in I_{j}}z_{i}^{2}\bigr)^{1/2}\leq\|z\|_{L_{r}^{\circ}}.$$

$$\mathbb{E}|Y|\geq c(\kappa,\delta)\bigl(\sum_{j\in J}z_{j}^{2}\bigr)^{1/2},$$

$$\mathbb{E}|Y|=\mathbb{E}_{\xi}\mathbb{E}_{\varepsilon}\bigl|\sum_{j\in J}\varepsilon_{j}z_{j}\xi_{j}\bigr|\gtrsim\mathbb{E}_{\xi}\bigl(\sum_{j\in J}z_{j}^{2}\xi_{j}^{2}\bigr)^{1/2}.$$

$$\bigl(\sum_{j\in J}z_{j}^{2}\xi_{j}^{2}\bigr)^{1/2}\geq\kappa\bigl(\sum_{j\in J}\eta_{j}z_{j}^{2}\bigr)^{1/2}.$$

$$\mathbb{E}\bigl(\sum_{j\in J}\eta_{j}z_{j}^{2}\bigr)^{1/2}\geq c(\delta)\bigl(\sum_{j\in J}z_{j}^{2}\bigr)^{1/2}.$$

$$\mathbb{E}\bigl(\sum_{j=1}^{\ell}\eta_{j}a_{j}\bigr)^{1/2}\geq\sqrt{\gamma}p^{3/2}\geq\sqrt{\gamma}\delta^{3/2}.$$

$$A=\sum_{j=1}^{\ell}a_{j}^{2}\leq a_{1}\sum_{j=1}^{\ell}a_{j}\leq\gamma p$$


Taxonomy

Topics: Point processes and geometric inequalities · Advanced Combinatorial Mathematics · Random Matrices and Applications

Full text

On the geometry of random polytopes

Shahar Mendelson (LPSM, Sorbonne University, and Mathematical Sciences Institute, The Australian National University)

Abstract

We present a simple proof of a fact recently established in [5]: let $\xi$ be a symmetric random variable that has variance $1$, let $\Gamma=(\xi_{ij})$ be an $N\times n$ random matrix whose entries are independent copies of $\xi$, and set $X_{1},...,X_{N}$ to be the rows of $\Gamma$. Then under minimal assumptions on $\xi$ and as long as $N\geq c_{1}n$,

$$c_{2}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N})$$

with high probability.

1 Introduction

Let $\xi$ be a symmetric random variable that has variance $1$ and let $X=(\xi_{1},...,\xi_{n})$ be the random vector whose coordinates are independent copies of $\xi$ . Consider a random matrix $\Gamma$ whose rows $X_{1},...,X_{N}$ are independent copies of $X$ . In this note we explore the geometry of the random polytope

$$K={\rm absconv}(X_{1},...,X_{N})=\Gamma^{*}B_{1}^{N};$$

specifically, we study whether $K$ is likely to contain a large canonical convex body.

One of the first results in this direction is from [4], where it is shown that if $\xi$ is the standard gaussian random variable, $0<\alpha<1$ and $N\geq c_{0}(\alpha)n$ , then

$$c_{1}(\alpha)\sqrt{\log(eN/n)}B_{2}^{n}\subset{\rm absconv}(X_{1},...,X_{N})\tag{1.1}$$

with probability at least $1-2\exp(-c_{2}N^{1-\alpha}n^{\alpha})$ . It should be noted that this estimate cannot be improved—up to the dependence of the constants on $\alpha$ (see, for example, the discussion in Section 4 of [9]).

The proof of (1.1) relies heavily on the tail behaviour of the gaussian random variable. It is therefore natural to try and extend (1.1) beyond the gaussian case, to random polytopes generated by more general random variables that still have ‘well-behaved’ tails. The optimal subgaussian estimate was established in [9]:

Theorem 1.1.

Let $\xi$ be a mean-zero random variable that has variance $1$ and is $L$-subgaussian (a centred random variable is $L$-subgaussian if for every $p\geq 2$, $\|\xi\|_{L_{p}}\leq L\sqrt{p}\|\xi\|_{L_{2}}$). Let $0<\alpha<1$ and set $N\geq c_{0}(\alpha)n$. Then with probability at least $1-2\exp(-c_{1}N^{1-\alpha}n^{\alpha})$,

$$c_{2}(\alpha)\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N}),\tag{1.2}$$

where $c_{0}$ and $c_{2}$ are constants that depend on $\alpha$ and $c_{1}$ is an absolute constant.

Remark 1.2.

Note that the body that ${\rm absconv}(X_{1},...,X_{N})$ contains in (1.2) is slightly smaller than the one in (1.1), as one has to intersect the Euclidean ball from (1.1) with the unit cube.

While Theorem 1.1 resolves the problem when $\xi$ is subgaussian, the situation is less clear when $\xi$ is heavy-tailed. That naturally leads to the following question:

Question 1.3.

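Under what conditions on $\xi$ does one still have that for $N\geq c_{1}n$,

$$c_{2}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N})\tag{1.3}$$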

with high probability?

Following the progress in [7], where Question 1.3 had been studied under milder moment assumptions on $\xi$ than in Theorem 1.1, Question 1.3 was answered in [5] under a minimal small-ball condition on $\xi$ .

Definition 1.4.

A mean-zero random variable $\xi$ satisfies a small-ball condition with constants $\kappa$ and $\delta$ if

$$Pr(|\xi|\geq\kappa)\geq\delta.\tag{1.4}$$
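To make the role of the constants $\kappa$ and $\delta$ concrete, here is a minimal sketch that estimates $Pr(|\xi|\geq\kappa)$ by Monte Carlo for two symmetric, variance $1$ variables. The function names and the two example distributions are illustrative choices, not taken from the paper.

```python
import random

def small_ball_estimate(sample, kappa, trials=10000, seed=0):
    """Monte Carlo estimate of Pr(|xi| >= kappa) for a sampler sample(rng)."""
    rng = random.Random(seed)
    hits = sum(1 for _ in range(trials) if abs(sample(rng)) >= kappa)
    return hits / trials

def rademacher(rng):
    # Symmetric, variance 1, |xi| = 1 always.
    return 1.0 if rng.random() < 0.5 else -1.0

def sparse_symmetric(rng, q=0.01):
    # Symmetric, variance 1: equals +-1/sqrt(q) with probability q/2 each, else 0.
    u = rng.random()
    if u < q / 2:
        return q ** -0.5
    if u < q:
        return -(q ** -0.5)
    return 0.0

rademacher_delta = small_ball_estimate(rademacher, kappa=1.0)
sparse_delta = small_ball_estimate(sparse_symmetric, kappa=1.0)
print(rademacher_delta, sparse_delta)
```

Both variables are symmetric with variance $1$, yet the second one only satisfies the condition with $\delta$ of order $q$, which illustrates why the constants in the results below are allowed to depend on $\kappa$ and $\delta$.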

Theorem 1.5.

([5]) Let $\xi$ be a symmetric, variance $1$ random variable that satisfies (1.4) with constants $\kappa$ and $\delta$. For $0<\alpha<1$ there are constants $c_{1},c_{2}$ and $c_{3}$ that depend on $\kappa,\delta$ and $\alpha$ for which the following holds. If $N\geq c_{1}n$ then with probability at least $1-2\exp(-c_{2}N^{1-\alpha}n^{\alpha})$,

$$c_{3}\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr)\subset{\rm absconv}(X_{1},...,X_{N}).$$

Remark 1.6.

The assumption made in [5] is slightly stronger than in Theorem 1.5; namely, that for every $x\in\mathbb{R}$ , $Pr(|\xi-x|\geq\kappa)\geq\delta$ . However, (1.4) suffices for the proof. At the same time, in [5] the random variables $(\xi_{ij})$ are only assumed to be independent, symmetric and variance $1$ , with each one of the $\xi_{ij}$ ’s satisfying (1.4) with the same constants $\kappa$ and $\delta$ . In what follows we consider only the case in which $\xi_{ij}$ are independent copies of a single random variable $\xi$ —though extending the presentation to the independent case is straightforward.

The original proof of Theorem 1.5 is based on the construction of a well-chosen net, and that construction is rather involved. Here we present a much simpler argument that is based on the small-ball method (see, e.g., [10, 11, 12]). As an added value, the method presented here gives more information than the assertion of Theorem 1.5, as is explained in what follows.

The starting point of the proof of Theorem 1.5 is straightforward: let

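$$K={\rm absconv}(X_{1},...,X_{N})=\Gamma^{*}B_{1}^{N}$$

and set

$$L=\bigl(B_{\infty}^{n}\cap\sqrt{\log(eN/n)}B_{2}^{n}\bigr).$$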

By comparing the support functions of $L$ and of $K$ , one has to show that with the wanted probability, for every $z\in\mathbb{R}^{n}$ , $h_{L}(z)\leq h_{cK}(z)$ . And, since $h_{cK}(z)=c\|\Gamma z\|_{\infty}$ , Theorem 1.5 can be established by showing that for suitable constants $c_{0}$ and $c_{1}$ ,

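$$Pr\bigl(\exists z\in\partial L^{\circ}:\ \|\Gamma z\|_{\infty}\leq c_{0}\bigr)\leq 2\exp(-c_{1}N^{1-\alpha}n^{\alpha}).\tag{1.5}$$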
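The identity $h_{K}(z)=\|\Gamma z\|_{\infty}$ for $K=\Gamma^{*}B_{1}^{N}$ can be checked numerically. The following is a small sketch under illustrative assumptions (Rademacher entries, tiny dimensions, helper names invented here): it samples points $\Gamma^{*}\lambda$ with $\|\lambda\|_{1}\leq 1$ and confirms that their pairing with $z$ never exceeds $\max_{i}|\left\langle z,X_{i}\right\rangle|$.

```python
import random

def gamma_times(rows, z):
    """Return Gamma z, i.e. the inner products <z, X_i> over the rows X_i."""
    return [sum(x_j * z_j for x_j, z_j in zip(x, z)) for x in rows]

def support_absconv(rows, z):
    """Support function of absconv(rows) at z, which equals ||Gamma z||_infty."""
    return max(abs(v) for v in gamma_times(rows, z))

def random_point_of_K(rows, rng):
    """Return Gamma^* lam for a random lam with ||lam||_1 = 1."""
    lam = [rng.uniform(-1.0, 1.0) for _ in rows]
    scale = sum(abs(l) for l in lam) or 1.0  # guard against an all-zero draw
    lam = [l / scale for l in lam]
    n = len(rows[0])
    return [sum(l * x[j] for l, x in zip(lam, rows)) for j in range(n)]

rng = random.Random(0)
N, n = 12, 4
rows = [[rng.choice((-1.0, 1.0)) for _ in range(n)] for _ in range(N)]
z = [rng.gauss(0.0, 1.0) for _ in range(n)]

h = support_absconv(rows, z)
largest_pairing = max(
    sum(y_j * z_j for y_j, z_j in zip(random_point_of_K(rows, rng), z))
    for _ in range(1000)
)
print(h, largest_pairing)
```

The sampled pairings stay below $h$ because $\left\langle z,\Gamma^{*}\lambda\right\rangle=\sum_{i}\lambda_{i}\left\langle z,X_{i}\right\rangle\leq\|\lambda\|_{1}\|\Gamma z\|_{\infty}$, with equality attained at a signed row.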

What we actually show is a stronger statement than (1.5): not only is there a high probability event on which

$$\inf_{z\in\partial L^{\circ}}\|\Gamma z\|_{\infty}\geq c_{0},$$

but in fact, on that “good event”, for each $z\in\partial L^{\circ}$ , $\Gamma z$ has $\sim N^{1-\alpha}n^{\alpha}$ large coordinates, with each one of these coordinates satisfying that $|\left\langle z,X_{i}\right\rangle|\geq c_{0}$ . Thus, the fact that $\|\Gamma z\|_{\infty}\geq c_{0}$ is exhibited by many coordinates and not just by a single one.

Proving that indeed, with high probability the smallest cardinality

$$\inf_{z\in\partial L^{\circ}}\bigl|\{i:|\left\langle z,X_{i}\right\rangle|\geq c_{0}\}\bigr|$$

is large is carried out in two steps:

Controlling a single point. For $0<\alpha<1$ and a well chosen $c_{0}=c_{0}(\alpha)$ one establishes an individual estimate: that for every fixed $z\in\partial L^{\circ}$ ,

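$$Pr\bigl(|\left\langle z,X\right\rangle|\geq 2c_{0}\bigr)\geq 4\Bigl(\frac{n}{N}\Bigr)^{\alpha}.$$

In particular, if $X_{1},...,X_{N}$ are independent copies of $X$ then with probability at least $1-2\exp(-c_{2}N^{1-\alpha}n^{\alpha})$,

$$\bigl|\{i:|\left\langle z,X_{i}\right\rangle|\geq 2c_{0}\}\bigr|\geq 2N^{1-\alpha}n^{\alpha}.\tag{1.6}$$

From a single function to uniform control. Thanks to the high probability estimate with which (1.6) holds, it is possible to control uniformly any subset of $\partial L^{\circ}$ whose cardinality is at most $\exp(c_{2}N^{1-\alpha}n^{\alpha}/2)$. Let ${\cal T}$ be a minimal $\rho$-cover of $\partial L^{\circ}$ with respect to the $\ell_{2}$ norm, with $\rho$ chosen so that ${\cal T}$ has the allowed cardinality. For every $z\in\partial L^{\circ}$, let $\pi z\in{\cal T}$ be a point that satisfies $\|z-\pi z\|_{2}\leq\rho$. The wanted uniform control is achieved by showing that

$$\sup_{z\in\partial L^{\circ}}\bigl|\{i:|\left\langle z-\pi z,X_{i}\right\rangle|\geq c_{0}\}\bigr|\leq N^{1-\alpha}n^{\alpha}$$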

with probability at least $1-2\exp(-c_{3}(\alpha)N^{1-\alpha}n^{\alpha})$ .
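The cardinality restriction on ${\cal T}$ comes from a union bound. A sketch of that arithmetic, writing $m=N^{1-\alpha}n^{\alpha}$ (shorthand introduced here, not the paper's notation) and assuming that each fixed point of ${\cal T}$ fails the individual estimate with probability at most $2\exp(-c_{2}m)$:

```latex
\[
Pr\bigl(\text{the individual estimate fails for some } v\in{\cal T}\bigr)
\leq |{\cal T}|\cdot 2\exp(-c_{2}m)
\leq \exp(c_{2}m/2)\cdot 2\exp(-c_{2}m)
= 2\exp(-c_{2}m/2).
\]
```

So a cover of that size costs only a factor of $2$ in the exponent of the probability estimate.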

Indeed, combining the two estimates it follows that with probability at least

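$$1-2\exp(-c(\alpha)N^{1-\alpha}n^{\alpha}),$$

for every $z\in\partial L^{\circ}$, one has that

$$\bigl|\{i:|\left\langle\pi z,X_{i}\right\rangle|\geq 2c_{0}\}\bigr|\geq 2N^{1-\alpha}n^{\alpha}$$

and

$$\bigl|\{i:|\left\langle z-\pi z,X_{i}\right\rangle|\geq c_{0}\}\bigr|\leq N^{1-\alpha}n^{\alpha}.$$

Hence, on that event, for every $z\in\partial L^{\circ}$ there is $J_{z}\subset\{1,...,N\}$ of cardinality at least $N^{1-\alpha}n^{\alpha}$, and for every $i\in J_{z}$,

$$|\left\langle z,X_{i}\right\rangle|\geq|\left\langle\pi z,X_{i}\right\rangle|-|\left\langle z-\pi z,X_{i}\right\rangle|\geq c_{0},$$

implying that

$$\inf_{z\in\partial L^{\circ}}\bigl|\{i:|\left\langle z,X_{i}\right\rangle|\geq c_{0}\}\bigr|\geq N^{1-\alpha}n^{\alpha};$$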

in particular, $\inf_{z\in\partial L^{\circ}}\|\Gamma z\|_{\infty}\geq c_{0}$ as required.
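The counting step that combines the two estimates is elementary, and a toy sketch may help: coordinates that are large for $\pi z$ and not spoiled by the oscillation $z-\pi z$ remain large for $z$. The numbers and the function name below are made up for illustration.

```python
def surviving_indices(net_values, oscillation_values, c0):
    """Indices i with |<pi z, X_i>| >= 2*c0 that are not spoiled by |<z - pi z, X_i>| >= c0."""
    big = {i for i, v in enumerate(net_values) if abs(v) >= 2 * c0}
    spoiled = {i for i, w in enumerate(oscillation_values) if abs(w) >= c0}
    return big - spoiled

c0 = 1.0
net_values = [2.5, -3.0, 0.1, 2.0, -2.2, 0.4]          # stand-ins for <pi z, X_i>
oscillation_values = [0.2, 1.5, 0.3, -0.9, 0.1, 2.0]   # stand-ins for <z - pi z, X_i>

kept = surviving_indices(net_values, oscillation_values, c0)
# On the kept indices the triangle inequality gives |<z, X_i>| >= c0.
values_for_z = [a + b for a, b in zip(net_values, oscillation_values)]
print(sorted(kept), [values_for_z[i] for i in sorted(kept)])
```

If at least $2m$ indices are large for $\pi z$ and at most $m$ are spoiled, at least $m$ indices survive, which is exactly how the set $J_{z}$ is obtained.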

In the next section this line of reasoning is used to prove Theorem 1.5.

2 Proof of Theorem 1.5

Before we begin the proof, let us introduce some notation. Throughout, absolute constants are denoted by $c,c_{1},c^{\prime}$ etc. Unless specified otherwise, the value of these constants may change from line to line. Constants that depend on some parameter $\alpha$ are denoted by $c(\alpha)$. We write $a\lesssim b$ if there is an absolute constant $c$ such that $a\leq cb$; $a\lesssim_{\alpha}b$ means that $a\leq c(\alpha)b$; and $a\sim b$ if both $a\lesssim b$ and $b\lesssim a$.

The required estimate for a single point follows very closely ideas from [13], which had been developed for obtaining lower estimates on the tails of marginals of the Rademacher vector $(\varepsilon_{i})_{i=1}^{n}$ , that is, on

$$Pr\Bigl(\Bigl|\sum_{i=1}^{n}\varepsilon_{i}z_{i}\Bigr|>t\Bigr)$$

as a function of the ‘location’ in $\mathbb{R}^{n}$ of $(z_{i})_{i=1}^{n}$ .

Fix $1\leq r\leq n$ and consider the interpolation body $L_{r}=B_{\infty}^{n}\cap\sqrt{r}B_{2}^{n}$ and its dual $L_{r}^{\circ}={\rm conv}(B_{1}^{n}\cup(1/\sqrt{r})B_{2}^{n})$ . The key estimate one needs to establish the wanted individual control is:

Theorem 2.1.

There exist constants $c^{\prime}$ and $c^{\prime\prime}$ that depend only on the small-ball constants of $\xi$ ( $\kappa$ and $\delta$ ) such that if $z\in\partial L_{r}^{\circ}$ then

$$Pr\bigl(|\left\langle z,X\right\rangle|\geq c^{\prime}\bigr)\geq 2\exp(-c^{\prime\prime}r).$$

The proof of Theorem 2.1 is based on some well-known facts on the interpolation norm $\|\ \|_{L_{r}^{\circ}}$ .

Lemma 2.2.

There exists an absolute constant $c_{0}$ such that for every $z\in\mathbb{R}^{n}$ ,

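$$\|z\|_{L_{r}^{\circ}}\leq\sum_{i=1}^{r}z_{i}^{*}+\sqrt{r}\bigl(\sum_{i>r}(z_{i}^{2})^{*}\bigr)^{1/2}\leq c_{0}\|z\|_{L_{r}^{\circ}},$$

where $(z_{i}^{*})_{i=1}^{n}$ is the nonincreasing rearrangement of $(|z_{i}|)_{i=1}^{n}$.

Moreover, for every $z\in\mathbb{R}^{n}$ there is a partition of $\{1,...,n\}$ into $r$ disjoint blocks $I_{1},...,I_{r}$ such that

$$\frac{\|z\|_{L_{r}^{\circ}}}{\sqrt{2}}\leq\sum_{j=1}^{r}\bigl(\sum_{i\in I_{j}}z_{i}^{2}\bigr)^{1/2}\leq\|z\|_{L_{r}^{\circ}}.$$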

The first part of Lemma 2.2 is due to Holmstedt (see Theorem 4.1 in [6]) and it gives useful intuition on the nature of the norm $\|\ \|_{L_{r}^{\circ}}$ . The second part is Lemma 2 from [13] and it plays an essential role in what follows.
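For intuition, the quantity in the first part of Lemma 2.2 is easy to evaluate: sort the absolute values, add up the $r$ largest, and add $\sqrt{r}$ times the $\ell_{2}$ norm of the rest. The sketch below computes only that functional (the helper name is invented here); it does not compute $\|z\|_{L_{r}^{\circ}}$ itself, which the lemma says is equivalent up to an absolute constant.

```python
import math

def holmstedt_functional(z, r):
    """Sum of the r largest |z_i| plus sqrt(r) times the l2 norm of the remaining coordinates."""
    rearranged = sorted((abs(v) for v in z), reverse=True)  # nonincreasing rearrangement
    head = sum(rearranged[:r])
    tail = math.sqrt(sum(v * v for v in rearranged[r:]))
    return head + math.sqrt(r) * tail

# A single spike: only the l1-type part contributes.
spike = holmstedt_functional([1.0, 0.0, 0.0], r=2)
# One coordinate in the head, one in the tail: 4 + sqrt(1) * 3.
mixed = holmstedt_functional([3.0, 4.0], r=1)
# A flat vector with r = 1: one coordinate in the head, the other three in the tail.
flat = holmstedt_functional([0.5, 0.5, 0.5, 0.5], r=1)
print(spike, mixed, flat)
```

The two regimes visible here (large coordinates measured in $\ell_{1}$, the remainder in a scaled $\ell_{2}$) reflect that $L_{r}^{\circ}$ is the convex hull of $B_{1}^{n}$ and $(1/\sqrt{r})B_{2}^{n}$.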

Before proving Theorem 2.1, we require an additional observation that is based on the small-ball condition satisfied by $\xi$.

Lemma 2.3.

Let $J\subset\{1,...,n\}$ and set $Y=\sum_{j\in J}z_{j}\xi_{j}$ . Then

$$\mathbb{E}|Y|\geq c(\kappa,\delta)\bigl(\sum_{j\in J}z_{j}^{2}\bigr)^{1/2},$$

where $c(\kappa,\delta)<1$ is a constant that depends only on $\xi$'s small-ball constants $\kappa$ and $\delta$.

Proof. Let $(\varepsilon_{j})_{j\in J}$ be independent, symmetric, $\{-1,1\}$ -valued random variables that are also independent of $(\xi_{j})_{j\in J}$ . Recall that $\xi$ is symmetric and therefore $(\xi_{j})_{j\in J}$ has the same distribution as $(\varepsilon_{j}\xi_{j})_{j\in J}$ . By Khintchine’s inequality it is straightforward to verify that

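$$\mathbb{E}|Y|=\mathbb{E}_{\xi}\mathbb{E}_{\varepsilon}\bigl|\sum_{j\in J}\varepsilon_{j}z_{j}\xi_{j}\bigr|\gtrsim\mathbb{E}_{\xi}\bigl(\sum_{j\in J}z_{j}^{2}\xi_{j}^{2}\bigr)^{1/2}.$$

For $j\in J$ let $\eta_{j}=\mathbbm{1}_{\{|\xi_{j}|\geq\kappa\}}$; thus, the $\eta_{j}$'s are iid $\{0,1\}$-valued random variables whose mean is at least $\delta$, and point-wise

$$\bigl(\sum_{j\in J}z_{j}^{2}\xi_{j}^{2}\bigr)^{1/2}\geq\kappa\bigl(\sum_{j\in J}\eta_{j}z_{j}^{2}\bigr)^{1/2}.$$

Hence, all that is left to complete the proof is to show that

$$\mathbb{E}\bigl(\sum_{j\in J}\eta_{j}z_{j}^{2}\bigr)^{1/2}\geq c(\delta)\bigl(\sum_{j\in J}z_{j}^{2}\bigr)^{1/2}.$$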

Let $a_{j}=z_{j}^{2}/(\sum_{j\in J}z_{j}^{2})$ and in particular, $\|(a_{j})_{j\in J}\|_{1}=1$ . Assume without loss of generality that $J=\{1,...,\ell\}$ and that the $a_{j}$ ’s are non-increasing, let $\gamma>0$ be a parameter to be specified in what follows, and set $p=\mathbb{E}\eta_{1}\geq\delta$ .

Consider two cases:

$\bullet$ If $a_{1}\geq\gamma p$ then with probability at least $p$ , $\sum_{j=1}^{\ell}\eta_{j}a_{j}\geq a_{1}\geq\gamma p$ . In that case

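$$\mathbb{E}\bigl(\sum_{j=1}^{\ell}\eta_{j}a_{j}\bigr)^{1/2}\geq\sqrt{\gamma}p^{3/2}\geq\sqrt{\gamma}\delta^{3/2}.$$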

$\bullet$ Alternatively, $a_{1}\leq\gamma p$ , implying that

$$A=\sum_{j=1}^{\ell}a_{j}^{2}\leq a_{1}\sum_{j=1}^{\ell}a_{j}\leq\gamma p$$

because $\|(a_{j})_{j=1}^{\ell}\|_{1}=1$ .

By Bernstein’s inequality,

[TABLE]

provided that $\gamma$ is a small-enough absolute constant. Using, once again, that $\|(a_{j})_{j=1}^{\ell}\|_{1}=1$ it is evident that with probability $1/2$ , $\sum_{j=1}^{\ell}\eta_{j}a_{j}\geq(1/2)p$ and therefore

[TABLE]

Thus, setting $c(\kappa,\delta)\sim\kappa\delta^{3/2}$ one has that

[TABLE]

as claimed.
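The statement of Lemma 2.3 can be sanity-checked by simulation in the simplest case. The sketch below assumes Rademacher coordinates (so $\kappa=\delta=1$) and an arbitrary test vector $z$; both are illustrative choices, and the helper names are not from the paper. It estimates $\mathbb{E}|Y|$ and compares it with $(\sum_{j}z_{j}^{2})^{1/2}$.

```python
import math
import random

def mean_abs_marginal(z, sample, trials=20000, seed=0):
    """Monte Carlo estimate of E|sum_j z_j xi_j| with iid xi_j drawn by sample(rng)."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        total += abs(sum(z_j * sample(rng) for z_j in z))
    return total / trials

def rademacher(rng):
    return 1.0 if rng.random() < 0.5 else -1.0

z = [0.5, -1.0, 2.0, 0.25, 1.5]
l2_norm = math.sqrt(sum(v * v for v in z))
ratio = mean_abs_marginal(z, rademacher) / l2_norm
print(ratio)
```

For Rademacher coordinates the ratio is bounded below by an absolute constant (Khintchine's inequality) and above by $1$ (Cauchy-Schwarz); for a general $\xi$ the lemma replaces the lower constant by $c(\kappa,\delta)$.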

Proof of Theorem 2.1. Fix $z\in\partial L_{r}^{\circ}$ and recall that by Lemma 2.2 there is a decomposition of $\{1,...,n\}$ into disjoint blocks $(I_{j})_{j=1}^{r}$ such that

[TABLE]

Let $Y_{j}=\sum_{i\in I_{j}}z_{i}\xi_{i}$ ; observe that $Y_{1},...,Y_{r}$ are independent random variables and that by Lemma 2.3,

[TABLE]

for a constant $0<c(\kappa,\delta)<1$ .

At the same time,

[TABLE]

Therefore, by the Paley-Zygmund inequality (see, e.g., [2]), for any $0<\theta<1$ ,

[TABLE]

Setting $\theta=1/2$ ,

[TABLE]

and since $Y_{j}$ is a symmetric random variable (because the $\xi_{i}$ ’s are symmetric), it follows that

[TABLE]
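For reference, the general form of the Paley-Zygmund inequality and the way it combines with Lemma 2.3 can be sketched as follows. This is a standard derivation written under the assumptions of this section (independent, symmetric, variance $1$ coordinates); the constants in the paper's own displays may be written differently.

```latex
% Paley-Zygmund: for a nonnegative random variable Z with 0 < E Z^2 < \infty and 0 < \theta < 1,
\[
Pr\bigl(Z\geq\theta\,\mathbb{E}Z\bigr)\geq(1-\theta)^{2}\frac{(\mathbb{E}Z)^{2}}{\mathbb{E}Z^{2}}.
\]
% Apply it to Z = |Y_j|, using E Y_j^2 = \sum_{i \in I_j} z_i^2 and the lower bound on E|Y_j| from Lemma 2.3:
\[
Pr\bigl(|Y_{j}|\geq\theta\,\mathbb{E}|Y_{j}|\bigr)
\geq(1-\theta)^{2}\frac{(\mathbb{E}|Y_{j}|)^{2}}{\mathbb{E}Y_{j}^{2}}
\geq(1-\theta)^{2}c^{2}(\kappa,\delta).
\]
% With \theta = 1/2, and then using the symmetry of Y_j to pass to a one-sided event:
\[
Pr\Bigl(|Y_{j}|\geq\tfrac{1}{2}\mathbb{E}|Y_{j}|\Bigr)\geq\frac{c^{2}(\kappa,\delta)}{4},
\qquad
Pr\Bigl(Y_{j}\geq\tfrac{1}{2}\mathbb{E}|Y_{j}|\Bigr)\geq\frac{c^{2}(\kappa,\delta)}{8}.
\]
```

The point of the one-sided version is that the events for different blocks can then be intersected without cancellations between the $Y_{j}$'s.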

For $1\leq j\leq r$ let

[TABLE]

which are independent events. Hence,

[TABLE]

Thus, by (2.1), if $c^{\prime}=\frac{1}{4}c(\kappa,\delta)$ and $c^{\prime\prime}=\log(1/c_{1}(\kappa,\delta))>0$ , one has

[TABLE]

From here on, the constants $c^{\prime}$ and $c^{\prime\prime}$ denote the constants from Theorem 2.1.

Corollary 2.4.

For $0<\alpha<1$ , $\kappa$ and $\delta$ there are constants $c_{0}$ and $c_{1}$ that depend on $\alpha$ , $\kappa$ and $\delta$ , and an absolute constant $c_{2}$ for which the following holds. If $N\geq c_{0}n$ , $r\leq c_{1}\sqrt{\log(eN/n)}$ and $z\in\partial L_{r}^{\circ}$ then with probability at least $1-2\exp(-c_{2}N^{1-\alpha}n^{\alpha})$ ,

[TABLE]

Proof. Let $z\in\partial L_{r}^{\circ}$ , and invoking Theorem 2.1,

$$Pr\bigl(|\left\langle z,X\right\rangle|\geq c^{\prime}\bigr)\geq 2\exp(-c^{\prime\prime}r),$$

where $c^{\prime}$ and $c^{\prime\prime}$ depend only on $\kappa$ and $\delta$ .

Let $r_{0}=c_{1}\log(eN/n)$ be such that $\exp(-c^{\prime\prime}r_{0})\geq 4(n/N)^{\alpha}$; thus, $c_{1}=c_{1}(\alpha,\kappa,\delta)$. If $r\leq r_{0}$, $X_{1},...,X_{N}$ are independent copies of $X$ and $\eta_{i}=\mathbbm{1}_{\{|\left\langle z,X_{i}\right\rangle|\geq c^{\prime}\}}$, then $\mathbb{E}\eta_{i}\geq 4(n/N)^{\alpha}$. Hence, by a standard concentration argument, with probability at least $1-2\exp(-c_{2}N^{1-\alpha}n^{\alpha})$,

[TABLE]

where $c_{2}$ is an absolute constant.
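One way to read the "standard concentration argument" is through the multiplicative Chernoff bound for a sum of independent indicators. The sketch below assumes that route (the paper does not say which inequality is used) and only evaluates the resulting numbers: the mean count $4N^{1-\alpha}n^{\alpha}$, the threshold at half the mean, and the bound $\exp(-\mu/8)$ on the probability of falling below it. The function name and the sample values of $N$, $n$ and $\alpha$ are illustrative.

```python
import math

def count_threshold_and_failure_bound(N, n, alpha):
    """Return (m, threshold, failure_bound) for m = N^(1-alpha) * n^alpha.

    Assumes each indicator has mean at least p = 4 * (n/N)^alpha (which needs p <= 1).
    Chernoff lower tail with deviation 1/2: Pr(count <= mu/2) <= exp(-mu/8), mu = N*p = 4m.
    """
    m = N ** (1 - alpha) * n ** alpha
    p = 4 * (n / N) ** alpha
    if p > 1:
        raise ValueError("need N >= 4**(1/alpha) * n so that p is a probability")
    mean_count = N * p            # equals 4 * m
    threshold = mean_count / 2    # equals 2 * m
    failure_bound = math.exp(-mean_count / 8)  # equals exp(-m / 2)
    return m, threshold, failure_bound

m, threshold, failure_bound = count_threshold_and_failure_bound(N=10000, n=100, alpha=0.5)
print(m, threshold, failure_bound)
```

With these values the failure bound is $\exp(-m/2)$, which has the form $2\exp(-c_{2}N^{1-\alpha}n^{\alpha})$ with an absolute constant in the exponent, in line with the statement of Corollary 2.4.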

Thanks to the high probability estimate with which Corollary 2.4 holds, one can control uniformly all the elements of a set ${\cal T}\subset\partial L_{r}^{\circ}$ as long as $|{\cal T}|\leq\exp(c_{0}N^{1-\alpha}n^{\alpha})$ for a suitable absolute constant $c_{0}$ , and as long as $r\leq c(\alpha,\kappa,\delta)\log(eN/n)$ . In that case, there is an event of probability at least $1-2\exp(-c_{1}N^{1-\alpha}n^{\alpha})$ such that for every $z\in{\cal T}$ ,

[TABLE]

The natural choice of a set ${\cal T}$ is a minimal $\rho$ -cover of $\partial L_{r}^{\circ}$ with respect to the $\ell_{2}$ norm. Note that $L_{r}^{\circ}={\rm absconv}(B_{1}^{n}\cup r^{-1/2}B_{2}^{n})\subset B_{2}^{n}$ , and so there is a $\rho$ -cover of the allowed cardinality for

[TABLE]

where $c_{2}$ is an absolute constant.

Clearly, $\{z-\pi z:z\in\partial L_{r}^{\circ}\}\subset\rho B_{2}^{n}$ , and to complete the proof of Theorem 1.5 it suffices to show that with probability at least $1-2\exp(-c_{3}N^{1-\alpha}n^{\alpha})$

[TABLE]

To prove (2.3), observe that $Q$ is the supremum of an empirical process indexed by a class of binary valued functions

[TABLE]

in particular, for every $f_{z}\in F$ ,

[TABLE]

By Talagrand’s concentration inequality for bounded empirical processes ([14], see also [1]), with probability at least $1-2\exp(-t)$ ,

[TABLE]

Let us show that for the right choice of $t$ and $N$ large enough, $Q\leq N^{1-\alpha}n^{\alpha}$ .

The required estimate on $(2)$ and $(3)$ clearly holds as long as

[TABLE]

As for $\mathbb{E}Q$ , note that point-wise

[TABLE]

Let $(\varepsilon_{i})_{i=1}^{N}$ be independent, symmetric, $\{-1,1\}$ -valued random variables that are independent of $(X_{i})_{i=1}^{N}$ . By the Giné-Zinn symmetrization theorem [3] and the contraction inequality for Bernoulli processes [8],

[TABLE]

which is sufficiently small as long as $N\gtrsim_{\alpha,\kappa,\delta}n$ .

3 Concluding Remarks

This proof of Theorem 1.5 is based on the small-ball method and follows an almost identical path to previous results that use the method: first, one obtains an individual estimate that implies that for each $v$ in a fine-enough net, many of the values $(|\left\langle X_{i},v\right\rangle|)_{i=1}^{N}$ are in the ‘right range’; and then, that the ‘oscillation vector’ $(|\left\langle X_{i},z-v\right\rangle|)_{i=1}^{N}$ does not spoil too many coordinates when $v$ is ‘close enough’ to $z$ . Thus, with high probability and uniformly in $z$ , many of the values $(|\left\langle X_{i},z\right\rangle|)_{i=1}^{N}$ are in the right range.

Having said that, there is one substantial difference between this proof and other instances in which the small-ball method had been used. Previously, individual estimates had been obtained in the small-ball regime; here the necessary regime is different: one requires a lower estimate on the tails of marginals of $X=(\xi_{i})_{i=1}^{n}$. And indeed, the core of the proof is the individual estimate from Theorem 2.1, where one shows that if $\xi$ satisfies a small-ball condition and $X$ has iid coordinates distributed as $\xi$ then its marginals exhibit a ‘super-gaussian’ behaviour at the right level.

Bibliography

[1] S. Boucheron, G. Lugosi, and P. Massart. Concentration inequalities. Oxford University Press, Oxford, 2013. A nonasymptotic theory of independence.
[2] V.H. de la Peña and E. Giné. Decoupling: from Dependence to Independence. Springer, New York, 1999.
[3] E. Giné and J. Zinn. Some limit theorems for empirical processes. Ann. Probab., 12(4):929–998, 1984.
[4] E. D. Gluskin. Extremal properties of orthogonal parallelepipeds and their applications to the geometry of Banach spaces. Mat. Sb. (N.S.), 136(178)(1):85–96, 1988.
[5] O. Guédon, A.E. Litvak, and K. Tatarko. Random polytopes obtained by matrices with heavy tailed entries. manuscript, available at arXiv:1811.12007, 2018.
[6] T. Holmstedt. Interpolation of quasi-normed spaces. Math. Scand., 26:177–199, 1970.
[7] F. Krahmer, C. Kümmerle, and H. Rauhut. A quotient property for matrices with heavy-tailed entries and its application to noise-blind compressed sensing. manuscript, available at arXiv:1806.04261, 2018.
[8] M. Ledoux and M. Talagrand. Probability in Banach spaces. Classics in Mathematics. Springer-Verlag, Berlin, 2011.
