Random polytopes and the wet part for arbitrary probability   distributions

Imre B\'ar\'any; Matthieu Fradelizi; Xavier Goaoc; Alfredo Hubard,; G\"unter Rote

arXiv:1902.06519·math.PR·October 13, 2020

Random polytopes and the wet part for arbitrary probability distributions

Imre B\'ar\'any, Matthieu Fradelizi, Xavier Goaoc, Alfredo Hubard,, G\"unter Rote

PDF

TL;DR

This paper extends classical geometric probability results to arbitrary distributions, analyzing how the convex hull's properties relate to the measure's wet part, with bounds that depend on sample size.

Contribution

It generalizes bounds on the measure and vertices of convex hulls from uniform to arbitrary distributions, showing tightness of the bounds.

Findings

01

Lower bounds from uniform case hold generally

02

Upper bounds require a logarithmic factor adjustment

03

Example demonstrates the bounds' tightness

Abstract

We examine how the measure and the number of vertices of the convex hull of a random sample of $n$ points from an arbitrary probability measure in $R^{d}$ relates to the wet part of that measure. This extends classical results for the uniform distribution from a convex set [B\'ar\'any and Larman 1988]. The lower bound of B\'ar\'any and Larman continues to hold in the general setting, but the upper bound must be relaxed by a factor of $lo g n$ . We show by an example that this is tight.

Figures1

Click any figure to enlarge with its caption.

Equations115

K_{t} = {x \in K :

K_{t} = {x \in K :

\mbox an d Vol (K \cap h) \leq t Vol K}

\frac{1}{4} Vol K_{1/ n} \leq E [Vol (K ∖ P_{n})] \leq Vol K_{c / n} .

\frac{1}{4} Vol K_{1/ n} \leq E [Vol (K ∖ P_{n})] \leq Vol K_{c / n} .

W_{t}^{μ} = {x \in R^{d} : \mbox t h er e i s aha l f s p a ce h \mbox w i t h x \in h \mbox an d μ (h) \leq t} .

W_{t}^{μ} = {x \in R^{d} : \mbox t h er e i s aha l f s p a ce h \mbox w i t h x \in h \mbox an d μ (h) \leq t} .

\frac{1}{4} w^{μ} (\frac{1}{n}) \leq E [1 - μ (P_{n}^{μ})] \leq w^{μ} ((d + 2) \frac{l n n}{n}) + \frac{ε _{d} ( n )}{n},

\frac{1}{4} w^{μ} (\frac{1}{n}) \leq E [1 - μ (P_{n}^{μ})] \leq w^{μ} ((d + 2) \frac{l n n}{n}) + \frac{ε _{d} ( n )}{n},

E [1 - μ (P_{n}^{ν})] > \frac{1}{2} \cdot w^{ν} (lo g_{2} n / n)

E [1 - μ (P_{n}^{ν})] > \frac{1}{2} \cdot w^{ν} (lo g_{2} n / n)

E [f_{0} (P_{n}^{μ})]

E [f_{0} (P_{n}^{μ})]

= n \cdot \int_{x} Pr [x \in / P_{n - 1}^{μ}] d μ (x) = n (1 - E [μ (P_{n - 1}^{μ})])

E [f_{0} (P_{n}^{μ})] \geq i = 1 \sum n Pr [x_{i} \in / conv (X_{n} ∖ {x_{i}})] = n (1 - E [μ (P_{n - 1}^{μ})])

E [f_{0} (P_{n}^{μ})] \geq i = 1 \sum n Pr [x_{i} \in / conv (X_{n} ∖ {x_{i}})] = n (1 - E [μ (P_{n - 1}^{μ})])

\frac{1}{e}nw^{\mu}(\tfrac{1}{n})\leq\mathbb{E}\left[f_{0}(P_{n}^{\mu})\right]\leq nw^{\mu}\bigl{(}{(d+2)\tfrac{\ln n}{n}}\bigr{)}+\varepsilon_{d}(n),

\frac{1}{e}nw^{\mu}(\tfrac{1}{n})\leq\mathbb{E}\left[f_{0}(P_{n}^{\mu})\right]\leq nw^{\mu}\bigl{(}{(d+2)\tfrac{\ln n}{n}}\bigr{)}+\varepsilon_{d}(n),

E [f_{0} (P_{n}^{ν})] > \frac{1}{2} n \cdot w^{ν} (lo g_{2} n / n)

E [f_{0} (P_{n}^{ν})] > \frac{1}{2} n \cdot w^{ν} (lo g_{2} n / n)

Vol K_{t} \leq Vol K_{c t} \leq c^{'} Vol K_{t}

Vol K_{t} \leq Vol K_{c t} \leq c^{'} Vol K_{t}

E [Vol (K ∖ P_{n})] \leq c^{'} Vol K_{1/ n} .

E [Vol (K ∖ P_{n})] \leq c^{'} Vol K_{1/ n} .

w^{μ} (t) = {p, 1, if t < τ if t \geq τ

w^{μ} (t) = {p, 1, if t < τ if t \geq τ

w^{μ} (t) \leq w^{μ} (c t) \leq c^{'} \cdot w^{μ} (t) .

w^{μ} (t) \leq w^{μ} (c t) \leq c^{'} \cdot w^{μ} (t) .

Pr [x \in / P_{n}] \geq Pr [h \cap P_{n} = \emptyset] = (1 - μ (h))^{n} \geq (1 - t)^{n} .

Pr [x \in / P_{n}] \geq Pr [h \cap P_{n} = \emptyset] = (1 - μ (h))^{n} \geq (1 - t)^{n} .

1 - E [μ (P_{n})]

1 - E [μ (P_{n})]

\geq \int_{x \in W_{t}} Pr [x \in / P_{n}] d μ (x)

\geq \int_{x \in W_{t}} (1 - t)^{n} d μ (x) = (1 - t)^{n} w (t) .

E [f_{0} (P_{n})] = n E [1 - μ (P_{n - 1})] \geq n (1 - t)^{n - 1} w (t) .

E [f_{0} (P_{n})] = n E [1 - μ (P_{n - 1})] \geq n (1 - t)^{n - 1} w (t) .

E [f_{0} (P_{n})] \geq n (1 - \frac{1}{n})^{n - 1} w (\frac{1}{n}) \geq \frac{1}{e} n w (\frac{1}{n}),

E [f_{0} (P_{n})] \geq n (1 - \frac{1}{n})^{n - 1} w (\frac{1}{n}) \geq \frac{1}{e} n w (\frac{1}{n}),

E [1 - μ (P_{n})] \leq \frac{1}{n + 1} E [f_{0} (P_{n + 1})] \leq \frac{2}{n + 1} \leq w (\frac{1}{n + 1}) \leq w (3 \frac{ln n}{n}) .

E [1 - μ (P_{n})] \leq \frac{1}{n + 1} E [f_{0} (P_{n + 1})] \leq \frac{2}{n + 1} \leq w (\frac{1}{n + 1}) \leq w (3 \frac{ln n}{n}) .

π_{H} (N) = X \subseteq U, ∣ X ∣ \leq N max ∣ {X \cap h : h \in H} ∣,

π_{H} (N) = X \subseteq U, ∣ X ∣ \leq N max ∣ {X \cap h : h \in H} ∣,

2 π_{H} (N) \cdot (1 - \frac{s}{N})^{(N - s) ε - 1} .

2 π_{H} (N) \cdot (1 - \frac{s}{N})^{(N - s) ε - 1} .

π_{H} (N) \leq 2 i = 0 \sum d (i N - 1) .

π_{H} (N) \leq 2 i = 0 \sum d (i N - 1) .

E [1 - μ (P_{n})]

E [1 - μ (P_{n})]

= \int_{R^{d} ∖ W_{ε}} Pr [x \in / P_{n}] d μ (x) + \int_{W_{ε}} Pr [x \in / P_{n}] d μ (x)

\leq \int_{R^{d} ∖ W_{ε}} Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] d μ (x) + \int_{W_{ε}} d μ (x)

\leq Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] + w (ε) .

E [1 - μ (P_{n})] \leq w (ε) + Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] .

E [1 - μ (P_{n})] \leq w (ε) + Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] .

ln Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] \leq ln π_{H} (N) + ((N - n) ε - 1) ln (1 - \frac{n}{N}) + ln 2.

ln Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] \leq ln π_{H} (N) + ((N - n) ε - 1) ln (1 - \frac{n}{N}) + ln 2.

π_{H} (N) \leq 2 i = 0 \sum d (i N - 1) \leq N^{d} and ln π_{H} (N) \leq d ln N .

π_{H} (N) \leq 2 i = 0 \sum d (i N - 1) \leq N^{d} and ln π_{H} (N) \leq d ln N .

ln Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] \leq d ln n

ln Pr [R^{d} ∖ W_{ε} \neq \subseteq P_{n}] \leq d ln n

+ ((N - n) ε - 1) ln (1 - \frac{n}{N}) + ln 2.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Random polytopes and the wet part for arbitrary probability distributions

Imre Bárány, Matthieu Fradelizi, Xavier Goaoc, Alfredo Hubard, Günter Rote

Abstract.

We examine how the measure and the number of vertices of the convex hull of a random sample of an arbitrary probability measure in $\mathbb{R}^{d}$ relates to the wet part of that measure.

1. Introduction and Main Results

Let $K$ be a convex body (convex compact set with non-empty interior) in $\mathbb{R}^{d}$ , and let $X_{n}=\{x_{1},\ldots,x_{n}\}$ be a random sample of $n$ uniform independent points from $K$ . The set $P_{n}=\operatorname{conv}X_{n}$ is a random polytope in $K$ . For $t\in[0,1)$ we define the wet part $K_{t}$ of $K$ :

[TABLE]

The name “wet part” comes from the mental picture when $K$ is in $\mathbb{R}^{3}$ and contains water of volume $t\operatorname{Vol}K$ . Bárány and Larman [2] proved that the measure of the wet part captures how well $P_{n}$ approximates $K$ in the following sense:

Theorem 1 ([2, Theorem 1]).

There are constants $c$ and $N_{0}$ depending only on $d$ such that for every convex body $K$ in $\mathbb{R}^{d}$ and for every $n>N_{0}$

[TABLE]

By Efron’s formula (see (2) below), this directly translates into bounds for the expected number of vertices of $P_{n}$ , see Section 1.2.

1.1. Results for general measures.

The notions of random polytope and wet part extend to a general probability measure $\mu$ defined on the Borel sets of $\mathbb{R}^{d}$ . The definition of a $\mu$ -random polytope $P_{n}^{\mu}$ is clear: $X_{n}$ is a sample of $n$ random independent points chosen according to $\mu$ , and $P_{n}^{\mu}=\operatorname{conv}X_{n}$ . The wet part $W_{t}^{\mu}$ is defined as

[TABLE]

The $\mu$ -measure of the wet part is denoted by $w^{\mu}(t):=\mu(W_{t}^{\mu})$ . Here is an extension of Theorem 1 to general measures:

Theorem 2.

For any probability measure $\mu$ in $\mathbb{R}^{d}$ and $n\geq 2$ ,

[TABLE]

where $\varepsilon_{d}(n)\to 0$ as $n\to+\infty$ and is independent of $\mu$ .

A similar upper bound, albeit with worse constants, follows from a result of Vu [13, Lemma 4.2], which states that $P_{n}^{\mu}$ contains $\mathbb{R}^{d}\setminus W_{c\ln n/n}^{\mu}$ with high-probability. Since a containment with high probability is usually stronger than an upper bound in expectation, one may have hoped that the $\log n/n$ in the upper bound of Theorem 2 can be reduced. Our main result shows that this is not possible, not even in the plane:

Theorem 3.

There exists a probability measure $\nu$ on $\mathbb{R}^{2}$ such that

[TABLE]

for infinitely many $n$ .

The measure that we construct actually has compact support and can be embedded into $\mathbb{R}^{d}$ for any $d\geq 2$ . It will be apparent from the proof that the same construction has the stronger property that for every constant $C>0$ , the inequality $\mathbb{E}[1-\mu(P_{n}^{\nu})]>\frac{1}{2}\cdot w^{\nu}({C\log_{2}n/n})$ holds for infinitely many values $n$ .

1.2. Consequences for $\mathbf{f}$ -vectors

Let $f_{0}(P_{n}^{\mu})$ denote the number of vertices of $P_{n}^{\mu}$ . For non-atomic measures (measures where no single point has positive probability), Efron’s formula [7] relates $E\left[f_{0}(P_{n}^{\mu})\right]$ and $\mathbb{E}\left[\mu(P_{n}^{\mu})\right]$ :

[TABLE]

For any measure, this still holds as an inequality:

[TABLE]

The measure that is constructed in Theorem 3 is non-atomic. As a consequence, Theorems 2 and 3 give the following bounds for the number of vertices:

Theorem 4.

(i)

For any non-atomic probability measure $\mu$ in $\mathbb{R}^{d}$ ,

[TABLE]

where $\varepsilon_{d}(n)\to 0$ as $n\to+\infty$ and is independent of $\mu$ .

(ii)

There exists a non-atomic probability measure $\nu$ on $\mathbb{R}^{2}$ such that

[TABLE]

for infinitely many $n$ .

Theorem 4 follows from Theorems 2 and 3 except that Efron’s Formula (2) induces a shift in indices, as it relates $f_{0}(P_{n}^{\mu})$ to $\mu(P_{n-1}^{\mu})$ . This shift affects only the constant in the lower bound of Theorem 4(i), which goes from $\frac{1}{4}$ to $\frac{1}{e}$ , see Section 3.1.

The upper bound of Theorem 4(i) fails for general distributions. For instance, if $\mu$ is a discrete distribution on a finite set, then $w^{\mu}(t)=0$ for any $t$ smaller than the mass of any single point and the upper bound cannot hold uniformly as $n\to\infty$ . Of course, in that case Inequality (3) is strict.

For convex bodies, the number $f_{i}(P_{n})$ of $i$ -dimensional faces of $P_{n}$ can also be controlled via the measure of the wet part since Bárány [1] proved that $\mathbb{E}[f_{i}(P_{n})]=\Theta(n\operatorname{Vol}K_{1/n})$ for every $0\leq i\leq d-1$ . No similar generalization is possible for Theorem 2. Indeed, consider a measure $\mu$ in $\mathbb{R}^{4}$ supported on two circles, one on the $(x_{1},x_{2})$ -plane, the other in the $(x_{3},x_{4})$ -plane, and uniform on each circle; $P_{n}^{\mu}$ has $\Omega(n^{2})$ edges almost surely.

Before we get to the proofs of Theorems 2 (Section 3.2) and 3 (Section 4), we discuss in Section 2 a key difference between the wet parts of convex bodies and of general measures.

2. Wet part: convex sets versus general measures

A key ingredient in the proof of the upper bound of Theorem 1 in [2] is that for a convex body $K$ in $\mathbb{R}^{d}$ , the measure of the wet part $K_{t}$ cannot change too abruptly as a function of $t$ : If $c\geq 1$ , then

[TABLE]

where $c^{\prime}$ is a constant that depends only on $c$ and $d$ [2, Theorem 7]. In particular, a multiplicative factor can be taken out of the volume parameter of the wet part and the upper bound in Theorem 1 can be equivalently expressed as

[TABLE]

(This is in fact how of the upper bound of Theorem 1 is actually formulated in [2, Theorem 1].) This alternative formulation shows immediately that the lower bound of Theorem 1 (and hence also of Theorem 2) cannot be improved by more than a constant.

2.1. Two circles and a sharp drop

The right inequality in (4) does not extend to general measures. An easy example showing this is the following “drop construction”. It is a probability measure $\mu$ in the plane supported on two concentric circles, uniform on each of them, and with measure $p$ on the outer circle. Let $\tau$ denote the measure of a halfplane externally tangent to the inner circle; remark that $\tau<p/2$ . The measure $w^{\mu}(t)$ of the wet part drops at $t=\tau$ :

[TABLE]

We can make this drop arbitrarily sharp by choosing a small $p$ . In particular, for any given $c^{\prime}$ , setting $p<\frac{1}{c^{\prime}}$ makes it impossible to fulfill the right inequality in (4) for $t<\tau<ct$ .

This example also challenges Inequality (5). As shown in Figure 1 (top), the function $w^{\mu}(1/n)$ has a sharp drop, while $\mathbb{E}\left[1-\mu(P_{n}^{\mu})\right]$ shifts from the higher to the lower branch of the step in a gradual way. For this construction, the straightforward extension of Theorem 1 would imply that $\mathbb{E}[1-\mu(P_{n}^{\mu})]$ remains within a constant multiplicative factor of $w^{\mu}(1/n)$ . Thus, $\mathbb{E}[1-\mu(P_{n}^{\mu})]$ would have to follow the steep drop.

2.2. A drop for the number of vertices.

The fact that $\mathbb{E}[1-\mu(P_{n}^{\mu})]$ cannot drop too sharply is more easily seen by examining $\mathbb{E}\left[f_{0}(P_{n}^{\mu})\right]$ . Since the measure defined in Equation (6) is non-atomic, Efron’s Formula (2) applies, so let us compare $\mathbb{E}\left[f_{0}(P_{n}^{\mu})\right]$ and $n\cdot nw^{\mu}(1/n)$ . As illustrated in Figure 1 (bottom), $n\cdot w^{\mu}(1/n)$ has a sawtooth shape with a sharp drop from 300 to 3 at $n=300$ , and $\mathbb{E}[f_{0}(P_{n})]$ does actually shift from the higher to the lower branch of the sawtooth, in a gradual way.

The fact that $\mathbb{E}[f_{0}(P_{n}^{\mu})]$ can decrease is perhaps surprising at first sight, but this phenomenom is easy to explain: We pick random points one by one. As long as all points lie on the inner circle, $f_{0}(P_{n}^{\mu})=n$ . The first point to fall on the outer circle swallows a constant fraction of the points into the interior of $P_{n}^{\mu}$ , while adding only a single new point on the convex hull, causing a big drop. This happens around $n\approx 1/p$ .

Again, the straightforward extension of Theorem 1 would imply that $\mathbb{E}[f_{0}(P_{n})]$ follows the steep drop. Yet, on average, a single additional point can reduce $f_{0}(P_{n})$ by a factor of at most $1/2$ . Hence, the drop of $\mathbb{E}[f_{0}(P_{n})]$ cannot be so abrupt as the drop of $n\cdot w^{\mu}(1/n)$ , for $p$ small enough.

2.3. A sequence of drops

We prove Theorem 3 in Section 4 by an explicit construction that sets up a sequence of such drops. The function $n\cdot w^{\mu}(1/n)$ reaches larger and larger peaks as $n$ increases, while dropping down more and more steeply between those peaks. Our proof of Theorem 3 will not actually refer to any drop or oscillating behavior. We will simply identify a sequence of values $n=n_{1},n_{2},\ldots$ for which $\mathbb{E}[1-\mu(P_{n}^{\mu})]$ is larger than $\frac{1}{2}w^{\mu}(\log_{2}n/n)$ .

2.4. Open questions

It is an outstanding open problem whether a drop as exhibited by our two-circle construction can occur for the uniform selection from a convex body: Can the expectation of the number of vertices of a random polytope decrease in such a setting? This is impossible in the plane [6] or for the three dimensional ball [4], but open in general. See [5] and the discussion therein.

Perhaps Theorem 1 remains valid for some restricted class of measures $\mu$ , for instance, logconcave measures. One approach to circumvent the “impossibility result” of Theorem 3 would be to first extend (4) and establish that for $c>1$ there is $c^{\prime}$ such that for all $t>0$

[TABLE]

The second step would derive from this property the extension of Theorem 1. We don’t know if any of these two steps is valid.

We can weaken the claim of Theorem 1 in a different way, while maintaining it for all measures. For example, it is plausible that the upper bound in the theorem holds for a subset of numbers $n\in\mathbb{N}$ of positive density. On the other hand we do not know if there is a measure for which the bound of Theorem 1 is valid only for a finite number of natural numbers.

3. Proof of Theorem 2

Let $\mu$ be a probability measure in $\mathbb{R}^{d}$ . For better readability we drop all superscripts $\mu$ .

3.1. Lower bound

The proof of the lower bound is similar to the one in the convex-body case. For every fixed point $x\in W_{t}$ , by definition, there exists a half-space $h$ with $x\in h$ and $\mu(h)\leq t$ . If $h\cap P_{n}$ is empty, then $x$ is not in $P_{n}$ , and therefore, for $x\in W_{t}$ ,

[TABLE]

Then, for any $t$ ,

[TABLE]

We choose $t=1/n$ . Since the sequence $(1-\frac{1}{n})^{n}$ is increasing, for $n\geq 2$ we have $1-\mathbb{E}[\mu(P_{n})]\geq\frac{1}{4}w(\tfrac{1}{n})$ .∎

To obtain the analogous lower bound from Theorem 4(i), we write

[TABLE]

Again, choosing $t=1/n$ yields the claimed lower bound

[TABLE]

since the sequence $(1-\frac{1}{n})^{n-1}$ is now decreasing to ${\frac{1}{e}}$ .

3.2. Floating bodies and $\varepsilon$ -nets

Before we turn our attention to the upper bound, we will point out a connection to $\varepsilon$ -nets. Consider a probability space $(U,\mu)$ and a family $\mathcal{H}$ of measurable subsets of $U$ . An $\varepsilon$ -net for $(U,\mu,\mathcal{H})$ is a set $S\subseteq U$ that intersects every $h\in\mathcal{H}$ with $\mu(h)\geq\varepsilon$ [10, $\mathsection 10.2$ ]. In the special case where $U=(\mathbb{R}^{d},\mu)$ and $\mathcal{H}$ consists of all half-spaces, if a set $S$ is an $\varepsilon$ -net, then the convex hull $P$ of $S$ contains $\mathbb{R}^{d}\setminus W_{\varepsilon}$ . Indeed, assume that there exists a point $x$ in $\mathbb{R}^{d}\setminus W_{\varepsilon}$ and not in $P$ . Consider a closed halfspace $h$ that contains $x$ and is disjoint from $P$ . Since $x\notin W_{\varepsilon}$ we must have $\mu(h)>\varepsilon$ and $S$ cannot be an $\varepsilon$ -net.

We call the region $\mathbb{R}^{d}\setminus W_{\varepsilon}$ the floating body of the measure $\mu$ with parameter $\varepsilon$ , by analogy to the case of convex bodies. The relation between floating bodies and $\varepsilon$ -nets was first observed by Van Vu, who used the $\varepsilon$ -net Theorem to prove that $P_{n}^{\mu}$ contains $\mathbb{R}^{d}\setminus W_{c\log n/n}$ with high probability [13, Lemma 4.2] (a fact previously established by Bárány and Dalla [3] when $\mu$ is the normalized Lebesgue measure on a convex body). This implies that, with high probability, $1-\mu(P_{n})\leq w(c\log n/n)$ . The analysis we give in Section 3.3 refines Vu’s analysis to sharpen the constant. Note that Theorem 3 shows that Vu’s result is already asymptotically best possible.

3.3. Upper bound

For $d=1$ , the proof of the upper bound is straightforward and may actually be improved. Indeed, we have $w(t)=\min\{2t,1\}$ , and Efron’s Formula (3) yields

[TABLE]

We will therefore assume $d\geq 2$ .

We use a lower bound on the probability of a random sample of $U$ to be an $\varepsilon$ -net for $(U,\mu,\mathcal{H})$ . We define the shatter function (or growth function) of the family $\mathcal{H}$ as

[TABLE]

Lemma 5 ([12, Theorem 3.2]).

Let $(U,\mu)$ be a probability space and $\mathcal{H}$ a family of measurable subsets of $U$ . Let $X_{s}$ be a sample of $s$ random independent elements chosen according to $\mu$ . For any integer $N>s$ , the probability that $X_{s}$ is not a $\varepsilon$ -net for $(U,\mu,\mathcal{H})$ is at most

[TABLE]

Lemma 5 is a quantitative refinement of a foundational result in learning theory [14, Theorem 2]. It is commonly used to prove that small $\varepsilon$ -nets exist for range spaces of bounded Vapnik-Chervonenkis dimension [9], see also [12, Theorem 3.1] or [11, Theorem 15.5]. For that application, it is sufficient to show that the probability of failure is less than 1; This works for $\varepsilon\approx d\ln n/n$ (with appropriate lower-order terms), where $d$ is the Vapnik-Chervonenkis dimension. In our proof, we will need a smaller failure probability of order $o(1/n)$ , and we will achieve this by setting $\varepsilon\approx(d+2)\ln n/n$ . We will apply the lemma in the case where $U=\mathbb{R}^{d}$ and $\mathcal{H}$ is the set of halfspaces in $\mathbb{R}^{d}$ . We mention that by increasing $\varepsilon$ more agressively, the probability of failure can be made exponentially small.

For the family $\mathcal{H}$ of halfspaces in $\mathbb{R}^{d}$ , we have the following sharp bound on the shatter function [8]:

[TABLE]

The proof of the upper bound of Theorem 2 starts by remarking that for any $\varepsilon\in[0,1]$ we have:

[TABLE]

Here, the first inequality between the probabilities holds since the event $x\notin P_{n}$ trivially implies that $\mathbb{R}^{d}\setminus W_{\varepsilon}\not\subseteq P_{n}$ when $x\in\mathbb{R}^{d}\setminus W_{\varepsilon}$ . We thus have

[TABLE]

We now want to set $\varepsilon$ so that $\Pr[\mathbb{R}^{d}\setminus W_{\varepsilon}\not\subseteq P_{n}]$ is $\frac{\varepsilon_{d}(n)}{n}$ with $\varepsilon_{d}(n)\to 0$ as $n\to\infty$ . As shown in Section 3.2, the event $\mathbb{R}^{d}\setminus W_{\varepsilon}\not\subseteq P_{n}$ implies that $P_{n}$ fails to be an $\varepsilon$ -net. The probability can thus be bounded from above using Lemma 5 with $s=n$ . Taking logarithms, for any $N>n$ ,

[TABLE]

Since we assume that $d\geq 2$ , we have

[TABLE]

We set $N=n\lceil\ln n\rceil$ , so that:

[TABLE]

We then set $\varepsilon=\delta\frac{\ln n}{n}$ , with $\delta\approx d$ to be fine-tuned later. If $n$ is large enough, the factor $\left({(N-n)\varepsilon-1}\right)\approx\delta\ln^{2}n$ is nonnegative, and we can use the inequality $\ln(1-x)\leq-x$ for $x\in[0,1)$ in order to bound the second term:

[TABLE]

Altogether, we get

[TABLE]

so for every $\delta>d+1$ we have $\Pr[\mathbb{R}^{d}\setminus W_{\varepsilon}\not\subseteq P_{n}]=\frac{\varepsilon_{d}(n)}{n}$ with $\varepsilon_{d}(n)\to 0$ as $n\to\infty$ . Setting $\delta=d+2$ yields the claimed bound. ∎

4. Proof of Theorem 3

In this section, logarithms are base $2$ . For better readability we drop the superscripts $\nu$ .

4.1. The construction

The measure $\nu$ is supported on a sequence of concentric circles $C_{1},C_{2},\ldots$ , where $C_{i}$ has radius

[TABLE]

On each $C_{i}$ , $\nu$ is uniform, implying that $\nu$ is rotationally invariant. We let $D_{i}=\bigcup_{j\geq i}C_{j}$ . For $i\geq 1$ we put

[TABLE]

and remark that $\nu(\mathbb{R}^{2})=s_{1}=1$ , so $\nu$ is a probability measure. The sequence $\{s_{i}\}_{i\in\mathbb{N}}$ decreases very rapidly. The probabilities of the individual circles are

[TABLE]

for $i\geq 1$ .

The infinite sequence of values $n$ for which we claim the inequality of Theorem 3 is

[TABLE]

In Section 4.2, we examine the wet part and prove that $w(\tfrac{\log n_{i}}{n_{i}})\leq s_{i}$ . We then want to establish the complementary bound $\mathbb{E}\left[1-\nu(P_{n_{i}})\right]>s_{i}/2$ . Since $\nu$ is non-atomic, Efron’s formula yields

[TABLE]

and it suffices to establish that $\mathbb{E}\left[f_{0}(P_{n_{i}+1})\right]>(n_{i}+1)s_{i}/2$ . This is what we do in Section 4.3.

4.2. The wet part

Let us again drop the superscript $\nu$ . Let $h_{i}$ be a closed halfplane that has a single point in common with $C_{i}$ , so its bounding line is tangent to $C_{i}$ . We have

[TABLE]

So, as $t$ decreases, $w(t)$ drops step by step, each step being from $s_{i}$ to $s_{i+1}$ . In particular,

[TABLE]

For $j>i$ , the portion of $C_{j}$ contained in $h_{i}$ is equal to $2\arccos(r_{i}/r_{j})$ . Hence,

[TABLE]

We will bound the term $\arccos(r_{i}/r_{j})$ by a more explicit expression in terms of $i$ . To get rid of the $\arccos$ function, we use the fact that $\cos x\geq 1-x^{2}/2$ for all $x\in\mathbb{R}$ . We obtain, for $0\leq y\leq 1$ ,

[TABLE]

Moreover, the ratio ${r_{i}}/{r_{j}}$ can be bounded as follows:

[TABLE]

Thus we deduce that

[TABLE]

We have established a bound on ${\arccos(r_{i}/r_{j})}/{\pi}$ , which is the fraction of a single circle $C_{j}$ that is contained in $h_{i}$ . Hence, considering all circles $C_{j}$ with $j>i$ together, we get

[TABLE]

We check that for $i\geq 4$ ,

[TABLE]

because $2^{-i}(1+2^{1-i}i)<\frac{\sqrt{2}}{\pi i}$ for all $i\geq 4$ . Using (8), this gives our desired bound:

[TABLE]

for all $i\geq 4$ . With little effort, one can show that actually $w(\tfrac{\log n_{i}}{n_{i}})=s_{i}$ . One can also see that, for any $C>0$ , the condition $w(C\tfrac{\log n_{i}}{n_{i}})\leq s_{i}$ holds if $i$ is large enough, because the exponential factor $2^{-i}$ dominates any constant factor $C$ in the last chain of inequalities. This justifies the remark that we made after the statement of Theorem 3.

4.3. The random polytope

Assume now that $X_{n}$ is a set of $n$ points sampled independently from $\nu$ . We intend to bound from below the expectation $\mathbb{E}\left[f_{0}(\operatorname{conv}X_{n_{i}+1})\right]$ . Observe that for any $n\in\mathbb{N}$ one has

[TABLE]

Intuitively, as $n$ varies in the range near $n_{i}$ , many points of $X_{n}$ lie on $C_{i}$ and yet no point of $X_{n}$ lies in $D_{i+1}$ . So $P_{n}$ has, in expectation, at least $np_{i}\approx ns_{i}$ vertices. At the same time, the term $w(\log n/n)$ in the claimed lower bound drops to $s_{i}$ . So the expected number of vertices is about $ns_{i}$ which is larger than $\frac{1}{2}ns_{i}=\frac{n}{2}w(\log n/n)$ .

Formally, we estimate the expected number of vertices when $n=n_{i}+1$ :

[TABLE]

The last square bracket tends to 1 as $i\to\infty$ . In particular, it is larger than $\frac{1}{2}$ for $i\geq 4$ . This shows that for all $i\geq 4$

[TABLE]

4.4. Higher dimension

We can embed the plane containing $\nu$ in $\mathbb{R}^{d}$ for $d\geq 3$ . The analysis remains true but the random polytope is of course flat with probability 1. To get a full-dimensional example, we can replace each circle by a $(d-1)$ -dimensional sphere, all other parameters being kept identical: all spheres are centered in the same point, $C_{i}$ has radius $1-\frac{1}{i+1}$ , the measure is uniform on each $C_{i}$ and the measure of $\cup_{j\geq i}C_{i}$ is $4\cdot 2^{-2^{i}}$ . The analysis holds mutatis mutandis.

As another example, which does not require new calculations, we can combine $\nu$ with the uniform distribution on the edges of a regular $(d-2)$ -dimensional simplex in the $(d-2)$ -dimensional subspace orthogonal to the plane that contains the circles, mixing the two distributions in the ratio $50:50$ .

In all our constructions, the measure is concentrated on lower-dimensional manifolds of $\mathbb{R}^{d}$ , circles, spheres, or line segments. If a continuous distribution is desired, one can replace each circle in the plane by a narrow annulus and each sphere by a thin spherical shell, without changing the characteristic behaviour.

5. An alternative treatment of atomic measures

Even for measures with atoms, one can give a precise meaning to Efron’s formula: The expression in (1) counts the expected number of convex hull vertices of $P_{n}$ that are unique in the sample $X_{n}$ . From this, it is obvious that Efron’s formula (2) is a lower bound on $\mathbb{E}[f_{0}(P_{n})]$ (3).

For dealing with atomic measures, there is alternative possibility. The resulting statements involve different quantities than our original results, but they have the advantage of holding for every measure. We denote by $\bar{f}_{0}(X_{n})$ the number of points of the sample $X_{n}$ that lie on the boundary of their convex hull $P_{n}$ , counted with multiplicity in case of coincident points. We denote by $\breve{P}_{n}$ the interior of $P_{n}$ . Then a derivation analogous to 1–2 leads to the following variation of Efron’s formula:

[TABLE]

We emphasize that we mean the boundary and interior with respect to the ambient space $\mathbb{R}^{d}$ , not the relative boundary or interior.

Even for some non-atomic measures, this gives different results. Consider the uniform distribution on the boundary of an equilateral triangle. Then $\mathbb{E}[\bar{f}_{0}(X_{n})]=n$ , while $\mathbb{E}[f_{0}(P_{n})]\leq 6$ . Accordingly, $\mathbb{E}[\mu(\breve{P}_{n})]=0$ , while $\mathbb{E}[\mu(P_{n})]$ converges to $1$ .

We denote the closure of the wet part $W_{t}^{\mu}$ by $\bar{W}_{t}^{\mu}$ and its measure by $\bar{w}^{\mu}(t):=\mu(\bar{W}_{t}^{\mu})$ .

With these concepts, we can prove the following analogs of Theorems 2–4. Observe that for a measure $\mu$ for which for every hyperplane $H$ , $\mu(H)=0$ the content of this theorem is the same as the previous ones.

Theorem 6.

(i)

For any probability measure $\mu$ in $\mathbb{R}^{d}$ and $n\geq 2$ ,

[TABLE]

and

[TABLE]

where $\varepsilon_{d}(n)\to 0$ as $n\to+\infty$ and is independent of $\mu$ . 2. (ii)

There is a non-atomic probability measure $\nu$ on $\mathbb{R}^{2}$ such that

[TABLE]

and

[TABLE]

for infinitely many $n$ .

Proof sketch.

Since the derivation is parallel to the proofs in Sections 3–4, we only sketch a few crucial points.

(i) For proving the lower bound in (10), we modify the initial argument leading to (7): For every fixed $x\in W_{t}$ , there is a closed half-space $h$ with $x\in h$ whose corresponding open halfspace $\breve{h}$ has measure $\mu(\breve{h})\leq t$ . Therefore,

[TABLE]

The remainder of the proof can be adapted in a straightforward way.

In Section 3.2, we have established that for an $\varepsilon$ -net $S$ , its convex hull $P$ contains $\mathbb{R}^{d}\setminus W_{\varepsilon}$ . Since the interior operator is monotone, this implies that $\mathbb{R}^{d}\setminus\bar{W}_{\varepsilon}\subseteq\breve{P}$ . Therefore, the $\varepsilon$ -net argument of Section 3.3 applies to the modified setting and establishes the upper bound in (10).

Finally, by Efron’s modified formula (9), the result (10) carries over to (11) as in our original derivation.

(ii) The lower-bound construction of Theorem 3 gives zero measure to every hyperplane, and therefore all quantities in part (ii) are equal to the corresponding quantites in Theorem 3 and Theorem 4(ii). ∎

Acknowledgements.

I. B. was supported by the Hungarian National Research, Development and Innovation Office NKFIH Grants K 111827 and K 116769, and by the Bézout Labex (ANR-10-LABX-58). X. G. was supported by Institut Universitaire de France. The authors are grateful for the hospitality during the ASPAG (ANR-17-CE40-0017) workshop on geometry, probability, and algorithms in Arcachon in April 2018.

Bibliography14

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] I. Bárány, Intrinsic volumes and f 𝑓 f -vectors of random polytopes, Mathematische Annalen 285 (1989), 671–699.
2[2] I. Bárány, D. G. Larman, Convex bodies, economic cap coverings, random polytopes, Mathematika 35 (1988), 274–291.
3[3] I. Bárány, L. Dalla, Few points to generate a random polytope, Mathematika 44 (1997), 325–331.
4[4] M. Beermann, Random polytopes . Ph. D. thesis, University of Osnabrück, 2015. Available at https://repositorium.ub.uni-osnabrueck.de/bitstream/urn:nbn:de:gbv:700-2015062313276/1/thesis_beermann.pdf .
5[5] M. Beermann, M. Reitzner, Monotonicity of functionals of random polytopes . Preprint ar Xiv:1706.08342, (2017).
6[6] O. Devillers, M. Glisse, X. Goaoc, G. Moroz, M. Reitzner, The monotonicity of f 𝑓 f -vectors of random polytopes. Electron. Commun. Probab. 18 (2013), 1–8.
7[7] B. Efron, The convex hull of a random set of points. Biometrika 52 (1965), 331–343.
8[8] E. F. Harding, The number of partitions of a set of n 𝑛 n points in k 𝑘 k dimensions induced by hyperplanes. Proceedings of the Edinburgh Mathematical Society 15(4) (1967), 285–289.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Random polytopes and the wet part for arbitrary probability distributions

Abstract.

1. Introduction and Main Results

Theorem 1** ([2, Theorem 1]).**

1.1. Results for general measures.

Theorem 2**.**

Theorem 3**.**

1.2. Consequences for f\mathbf{f}f-vectors

Theorem 4**.**

2. Wet part: convex sets versus general measures

2.1. Two circles and a sharp drop

2.2. A drop for the number of vertices.

2.3. A sequence of drops

2.4. Open questions

3. Proof of Theorem 2

3.1. Lower bound

3.2. Floating bodies and ε\varepsilonε-nets

3.3. Upper bound

Lemma 5** ([12, Theorem 3.2]).**

4. Proof of Theorem 3

4.1. The construction

4.2. The wet part

4.3. The random polytope

4.4. Higher dimension

5. An alternative treatment of atomic measures

Theorem 6**.**

Proof sketch.

Acknowledgements.

Theorem 1 ([2, Theorem 1]).

Theorem 2.

Theorem 3.

1.2. Consequences for $\mathbf{f}$ -vectors

Theorem 4.

3.2. Floating bodies and $\varepsilon$ -nets

Lemma 5 ([12, Theorem 3.2]).

Theorem 6.