An Upper Bound of the Minimal Dispersion via Delta Covers

Daniel Rudolf

arXiv:1701.06430·cs.CG·October 3, 2017

An Upper Bound of the Minimal Dispersion via Delta Covers

Daniel Rudolf

PDF

Open Access

TL;DR

This paper establishes an upper bound on the largest empty test set volume for point sets in high-dimensional cubes, using delta covers, with specific bounds for axis-parallel boxes and toroidal cases.

Contribution

It introduces a new upper bound on minimal dispersion based on delta covers, applicable to various geometric test sets in high dimensions.

Findings

01

Bound of (log |Γ_δ|)/n + δ for minimal dispersion

02

Specific bounds for axis-parallel boxes: (4d/n) log(9n/d)

03

Specific bounds for torus: (4d/n) log(2n)

Abstract

For a point set of $n$ elements in the $d$ -dimensional unit cube and a class of test sets we are interested in the largest volume of a test set which does not contain any point. For all natural numbers $n$ , $d$ and under the assumption of a $d e l t a$ -cover with cardinality $∣ Γ_{δ} ∣$ we prove that there is a point set, such that the largest volume of such a test set without any point is bounded by $\frac{l o g ∣ Γ _{δ} ∣}{n} + δ$ . For axis-parallel boxes on the unit cube this leads to a volume of at most $\frac{4 d}{n} lo g (\frac{9 n}{d})$ and on the torus to $\frac{4 d}{n} lo g (2 n)$ .

Equations80

disp (P, B) := P \cap B = \emptyset, B \in B sup λ_{d} (B) .

disp (P, B) := P \cap B = \emptyset, B \in B sup λ_{d} (B) .

disp_{B} (n, d) := P \subset [0, 1]^{d}, ∣ P ∣ = n in f disp (P, B),

disp_{B} (n, d) := P \subset [0, 1]^{d}, ∣ P ∣ = n in f disp (P, B),

N_{B} (d, ε) = min {n \in N ∣ disp_{B} (n, d) \leq ε} .

N_{B} (d, ε) = min {n \in N ∣ disp_{B} (n, d) \leq ε} .

{\rm disp}_{\mathcal{B}}(n,d)\leq\frac{2d_{\mathcal{B}}}{n}\log_{2}\Big{(}\frac{6n}{d_{\mathcal{B}}}\Big{)}\quad\text{for}\quad n\geq d_{\mathcal{B}},

{\rm disp}_{\mathcal{B}}(n,d)\leq\frac{2d_{\mathcal{B}}}{n}\log_{2}\Big{(}\frac{6n}{d_{\mathcal{B}}}\Big{)}\quad\text{for}\quad n\geq d_{\mathcal{B}},

N_{B} (d, ε) \leq 8 d_{B} ε^{- 1} lo g_{2} (13 ε^{- 1}),

N_{B} (d, ε) \leq 8 d_{B} ε^{- 1} lo g_{2} (13 ε^{- 1}),

B_{ex} = {Π_{k = 1}^{d} [x_{k}, y_{k}) \subseteq [0, 1]^{d} ∣ x_{k} < y_{k}, k = 1, \dots, d},

B_{ex} = {Π_{k = 1}^{d} [x_{k}, y_{k}) \subseteq [0, 1]^{d} ∣ x_{k} < y_{k}, k = 1, \dots, d},

B_{per} = {Π_{k = 1}^{d} I_{k} (x, y) ∣ x = (x_{1}, \dots, x_{d}), y = (y_{1}, \dots, y_{d}) \in [0, 1]^{d}}

B_{per} = {Π_{k = 1}^{d} I_{k} (x, y) ∣ x = (x_{1}, \dots, x_{d}), y = (y_{1}, \dots, y_{d}) \in [0, 1]^{d}}

I_{k} (x, y) = {(x_{k}, y_{k}) [0, 1] ∖ [y_{k}, x_{k}] x_{k} < y_{k} y_{k} \leq x_{k},

I_{k} (x, y) = {(x_{k}, y_{k}) [0, 1] ∖ [y_{k}, x_{k}] x_{k} < y_{k} y_{k} \leq x_{k},

\forall B \in B \exists L_{B}, U_{B} \in Γ_{δ} with L_{B} \subseteq B \subseteq U_{B}

\forall B \in B \exists L_{B}, U_{B} \in Γ_{δ} with L_{B} \subseteq B \subseteq U_{B}

disp_{B} (n, d) \leq \frac{lo g ∣ Γ _{δ} ∣}{n} + δ .

disp_{B} (n, d) \leq \frac{lo g ∣ Γ _{δ} ∣}{n} + δ .

{\rm disp}_{\mathcal{B}_{\rm ex}}(n,d)\leq\frac{4d}{n}\log\Big{(}\frac{9n}{d}\Big{)}.

{\rm disp}_{\mathcal{B}_{\rm ex}}(n,d)\leq\frac{4d}{n}\log\Big{(}\frac{9n}{d}\Big{)}.

N_{B_{ex}} (ε, d) \leq 8 d ε^{- 1} lo g (33 ε^{- 1}) .

N_{B_{ex}} (ε, d) \leq 8 d ε^{- 1} lo g (33 ε^{- 1}) .

\frac{\log_{2}d}{4(n+\log_{2}d)}\leq{\rm disp}_{\mathcal{B}_{\rm ex}}(n,d)\leq\frac{1}{n}\min\Big{\{}2^{7d+1},2^{d-1}\Pi_{i=1}^{d-1}p_{i}\Big{\}},

\frac{\log_{2}d}{4(n+\log_{2}d)}\leq{\rm disp}_{\mathcal{B}_{\rm ex}}(n,d)\leq\frac{1}{n}\min\Big{\{}2^{7d+1},2^{d-1}\Pi_{i=1}^{d-1}p_{i}\Big{\}},

N_{B_{ex}} (ε, d) \leq 2^{7 d + 1} ε^{- 1} .

N_{B_{ex}} (ε, d) \leq 2^{7 d + 1} ε^{- 1} .

N_{B_{ex}} (ε, d) \leq c_{ε} lo g_{2} d

N_{B_{ex}} (ε, d) \leq c_{ε} lo g_{2} d

(1/4 - ε) ε^{- 1} lo g_{2} d \leq N_{B_{ex}} (ε, d) \leq 8 d ε^{- 1} lo g (33 ε^{- 1}) .

(1/4 - ε) ε^{- 1} lo g_{2} d \leq N_{B_{ex}} (ε, d) \leq 8 d ε^{- 1} lo g (33 ε^{- 1}) .

disp_{B_{per}} (n, d) \leq \frac{4 d}{n} lo g (2 n) .

disp_{B_{per}} (n, d) \leq \frac{4 d}{n} lo g (2 n) .

N_{B_{per}} (ε, d) \leq 8 d ε^{- 1} [lo g (8 d) + lo g ε^{- 1}] .

N_{B_{per}} (ε, d) \leq 8 d ε^{- 1} [lo g (8 d) + lo g ε^{- 1}] .

min {1, d / n} \leq disp_{B_{per}} (n, d) \leq \frac{4 d}{n} lo g (2 n),

min {1, d / n} \leq disp_{B_{per}} (n, d) \leq \frac{4 d}{n} lo g (2 n),

d ε^{- 1} \leq N_{B_{per}} (ε, d) \leq 8 d ε^{- 1} [lo g (8 d) + lo g ε^{- 1}] .

d ε^{- 1} \leq N_{B_{per}} (ε, d) \leq 8 d ε^{- 1} [lo g (8 d) + lo g ε^{- 1}] .

disp (P, B) \leq δ + A \cap P = \emptyset, A \in Γ_{δ} max λ_{d} (A) .

disp (P, B) \leq δ + A \cap P = \emptyset, A \in Γ_{δ} max λ_{d} (A) .

λ_{d} (B ∖ L_{B}) \leq λ_{d} (U_{B} ∖ L_{B}) \leq δ .

λ_{d} (B ∖ L_{B}) \leq λ_{d} (U_{B} ∖ L_{B}) \leq δ .

disp (P, B) \leq P \cap B = \emptyset, B \in B sup (λ_{d} (U_{B} ∖ L_{B}) + λ_{d} (L_{B})) \leq δ + A \cap P = \emptyset, A \in Γ_{δ} max λ_{d} (A) .

disp (P, B) \leq P \cap B = \emptyset, B \in B sup (λ_{d} (U_{B} ∖ L_{B}) + λ_{d} (L_{B})) \leq δ + A \cap P = \emptyset, A \in Γ_{δ} max λ_{d} (A) .

A \cap P = \emptyset, A \in Γ_{δ} max λ_{d} (A) \leq \frac{lo g ∣ Γ _{δ} ∣}{n} .

A \cap P = \emptyset, A \in Γ_{δ} max λ_{d} (A) \leq \frac{lo g ∣ Γ _{δ} ∣}{n} .

\displaystyle\mathbb{P}\Big{(}

\displaystyle\mathbb{P}\Big{(}

\displaystyle=1-\mathbb{P}\Big{(}\bigcup_{A\in\Gamma_{\delta}}\{\mathbf{1}_{A\cap\{X_{1},\dots,X_{n}\}=\emptyset}\cdot\lambda_{d}(A)>c_{n}\}\Big{)}

\displaystyle\geq 1-\sum_{A\in\Gamma_{\delta}}\mathbb{P}\Big{(}\mathbf{1}_{A\cap\{X_{1},\dots,X_{n}\}=\emptyset}\cdot\lambda_{d}(A)>c_{n}\Big{)}

> 1 - ∣ Γ_{δ} ∣ (1 - c_{n})^{n} .

\mathbb{P}\Big{(}\max_{A\in\Gamma_{\delta},\;A\cap\{X_{1},\dots,X_{n}\}=\emptyset}\lambda_{d}(A)\leq\frac{\log|{\Gamma_{\delta}}|}{n}\Big{)}>0.

\mathbb{P}\Big{(}\max_{A\in\Gamma_{\delta},\;A\cap\{X_{1},\dots,X_{n}\}=\emptyset}\lambda_{d}(A)\leq\frac{\log|{\Gamma_{\delta}}|}{n}\Big{)}>0.

P (disp ({X_{1}, \dots, X_{n}}, B) \leq 2 δ)

P (disp ({X_{1}, \dots, X_{n}}, B) \leq 2 δ)

> 1 - ∣ Γ_{δ} ∣ (1 - δ)^{n} .

n := \frac{lo g ( ∣ Γ _{δ} ∣ α ^{- 1} )}{δ} \geq \frac{lo g ( ∣ Γ _{δ} ∣ α ^{- 1} )}{lo g ( 1 - δ ) ^{- 1}}

n := \frac{lo g ( ∣ Γ _{δ} ∣ α ^{- 1} )}{δ} \geq \frac{lo g ( ∣ Γ _{δ} ∣ α ^{- 1} )}{lo g ( 1 - δ ) ^{- 1}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsPoint processes and geometric inequalities · Computational Geometry and Mesh Generation · Limits and Structures in Graph Theory

Full text

11institutetext: Daniel Rudolf

Institut für Mathematische Stochastik, University of Goettingen, Goldschmidtstraße 7, 37077 Göttingen, Germany

11email: [email protected]

An Upper Bound of the Minimal Dispersion via Delta Covers

Daniel Rudolf

Abstract

For a point set of $n$ elements in the $d$ -dimensional unit cube and a class of test sets we are interested in the largest volume of a test set which does not contain any point. For all natural numbers $n$ , $d$ and under the assumption of the existence of a $\delta$ -cover with cardinality $|\Gamma_{\delta}|$ we prove that there is a point set, such that the largest volume of such a test set without any point is bounded above by $\frac{\log|\Gamma_{\delta}|}{n}+\delta$ . For axis-parallel boxes on the unit cube this leads to a volume of at most $\frac{4d}{n}\log(\frac{9n}{d})$ and on the torus to $\frac{4d}{n}\log(2n)$ .

Dedicated to Ian H. Sloan on the occasion of his 80th birthday.

1 Introduction and Main Results

For a point set $P$ of $n$ elements in the unit cube $[0,1]^{d}$ and for a set $\mathcal{B}$ of measurable subsets of $[0,1]^{d}$ the quantity of interest is the dispersion, given by

[TABLE]

Here $\lambda_{d}$ denotes the $d$ -dimensional Lebesgue measure and $\mathcal{B}$ is called set of test sets. The dispersion measures the size of the largest hole which does not contain any point of $P$ . The shape of the hole is specified by the set of test sets. We are interested in point sets with best possible upper bounds of the dispersion, which thus allow only small holes without any point. Of course, any estimate of ${\rm disp}(P,\mathcal{B})$ depends on $n$ , $d$ and $\mathcal{B}$ .

Classically, the dispersion of a point set $P$ was introduced by Hlawka Hl76 as the radius of the largest ball, with respect to some metric, which does not contain any point of $P$ . This quantity appears in the setting of quasi-Monte Carlo methods for optimization, see Ni83 and (Ni92, , Chapter 6). The notion of the dispersion from (1) was introduced by Rote and Tichy in RoTi96 to allow more general test sets. There the focus is on the dependence of $n$ (the cardinality of the point set) of ${\rm disp}(P,\mathcal{B})$ . In contrast to that, we are also interested in the behavior with respect to the dimension.

There is a well known relation to the star-discrepancy, namely, the dispersion is a lower bound of this quantity. For further literature, open problems, recent developments and applications related to this topic we refer to DiPi10 ; DiRuZh13 ; Ni92 ; No15 ; NoWo10 .

For the test sets we focus on axis-parallel boxes. Point sets with small dispersion with respect to such axis-parallel boxes are useful for the approximation of rank-one tensors, see BaDaDeGr14 ; NoRu16 . In computational geometry, given a point configuration the problem of finding the largest empty axis-parallel box is well studied. Starting with NaLeHs84 for $d=2$ , there is a considerable amount of work for $d>2$ , see DuJi13 ; DuJi16 and the references therein. Given a large dataset of points, the search for empty axis-parallel boxes is motivated by the fact that such boxes may reveal natural constraints in the data and thus unknown correlations, see EdGrLiMi03 .

The minimal dispersion, given by

[TABLE]

quantifies the best possible behavior of the dispersion with respect to $n$ , $d$ and $\mathcal{B}$ . Another significant quantity is the inverse of the minimal dispersion, that is, the minimal number of points $N_{\mathcal{B}}(d,\varepsilon)$ with minimal dispersion at most $\varepsilon\in(0,1)$ , i.e.,

[TABLE]

By virtue of a result of Blumer, Ehrenfeucht, Haussler and Warmuth (BlEhHaWa89, , Lemma A2.1, Lemma A2.2 and Lemma A2.4) one obtains

[TABLE]

or stated differently

[TABLE]

where $\log_{2}$ is the dyadic logarithm and $d_{\mathcal{B}}$ denotes the VC-dimension111The VC-dimension is the cardinality of the largest subset $T$ of $[0,1]^{d}$ such that the set system $\{T\cap B\mid B\in\mathcal{B}\}$ contains all subsets of $T$ . of $\mathcal{B}$ . The dependence on $d$ is hidden in the VC-dimension $d_{\mathcal{B}}$ . For example, for the set of test sets of axis-parallel boxes

[TABLE]

it is well known that $d_{\mathcal{B}_{\rm ex}}=2d$ . However, the concept of VC-dimension is not as easy to grasp as it might seem on the first glance and it is also not trivial to prove upper bounds on $d_{\mathcal{B}}$ depending on $\mathcal{B}$ . For instance, for periodic axis-parallel boxes, which coincide with the interpretation of considering the torus instead of the unit cube, given by

[TABLE]

with

[TABLE]

the dependence on $d$ in $d_{\mathcal{B}_{\rm per}}$ is not obvious. The conjecture here is that $d_{\mathcal{B}_{\rm per}}$ behaves similar as $d_{\mathcal{B}_{\rm ex}}$ , i.e., linear in $d$ , but we do not have a proof for this fact.

The aim of this paper is to prove an estimate similar to (2) based on the concept of a $\delta$ -cover of $\mathcal{B}$ . For a discussion about $\delta$ -covers, bracketing numbers and VC-dimension we refer to Gn08 . Let $\mathcal{B}$ be a set of measurable subsets of $[0,1]^{d}$ . A $\delta$ -cover for $\mathcal{B}$ with $\delta>0$ is a finite set $\Gamma_{\delta}\subseteq\mathcal{B}$ which satisfies

[TABLE]

such that $\lambda_{d}(U_{B}\setminus L_{B})\leq\delta$ . The main abstract theorem is as follows.

Theorem 1.1

For a set of test sets $\mathcal{B}$ assume that for $\delta>0$ the set $\Gamma_{\delta}$ is a $\delta$ -cover of $\mathcal{B}$ . Then

[TABLE]

The cardinality of the $\delta$ -cover plays a crucial role in the upper bound of the minimal dispersion. Thus, to apply the theorem to concrete sets of test sets one has to construct suitable, not too large, $\delta$ -covers.

For $\mathcal{B}_{\rm ex}$ the best results on $\delta$ -covers we know are due to Gnewuch, see Gn08 . As a consequence of the theorem and a combination of (Gn08, , Formula (1), Theorem 1.15, Lemma 1.18) one obtains

Corollary 1

For $\mathcal{B}_{\rm ex}$ and $n>2d$ we have

[TABLE]

(For $n\leq 2d$ the trivial estimate ${\rm disp}_{\mathcal{B}_{\rm ex}}(n,d)\leq 1$ applies.) In particular,

[TABLE]

Obviously, this is essentially the same as the estimates (2) and (3) in the setting of $\mathcal{B}_{\rm ex}$ . Let us discuss how those estimates fit into the literature. From (AiHiRu15, , Theorem 1 and (4)) we know that

[TABLE]

where $p_{i}$ denotes the $i$ th prime. The upper bound $2^{7d+1}/n$ is due to Larcher based on suitable $(t,m,d)$ -nets and for $d\geq 54$ improves the super-exponential estimate $2^{d-1}\Pi_{i=1}^{d-1}p_{i}/n$ of Rote and Tichy (RoTi96, , Proposition 3.1) based on the Halton sequence. The order of convergence with respect to $n$ is optimal, but the dependence on $d$ in the upper bound is exponential. In the estimate of Corollary 1 the optimal order in $n$ is not achieved, but the dependence on $d$ is much better. Already for $d=5$ it is required that $n$ must be larger than $5\cdot 10^{72}$ to obtain a smaller upper bound from (7) than from (5). By rewriting the result of Larcher in terms of $N_{\mathcal{B}_{\rm ex}}(\varepsilon,d)$ the dependence on $d$ can be very well illustrated, one obtains

[TABLE]

Here, for fixed $\varepsilon$ there is an exponential dependence on $d$ , whereas in the estimate of (6) there is a linear dependence on $d$ . Summarizing, according to $N_{\mathcal{B}_{\rm ex}}(\varepsilon,d)$ the result of Corollary 1 reduces the gap with respect to $d$ , we obtain222After acceptance of the current paper a new upper bound of $N_{\mathcal{B}_{\rm ex}}(\varepsilon,d)$ was proven in So17 . From So17 one obtains for $\varepsilon\in(0,1/4)$ that

$N_{\mathcal{B}_{\rm ex}}(\varepsilon,d)\leq c_{\varepsilon}\log_{2}d$

with $c_{\varepsilon}=\varepsilon^{-(\varepsilon^{-2}+2)}(4\log\varepsilon^{-1}+1)$ for $\varepsilon^{-1}\in\mathbb{N}$ . In particular, it shows that the lower bound cannot be improved with respect to the dimension. Note that the dependence on $\varepsilon^{-1}$ is not as good as in (6).

[TABLE]

As already mentioned for $\mathcal{B}_{\rm per}$ the estimates (2) and (3) are not applicable, since we do not know the VC-dimension. We construct a $\delta$ -cover in Lemma 2 below and obtain the following estimate as a consequence of the theorem. Note that, since $\mathcal{B}_{\rm ex}\subset\mathcal{B}_{\mathcal{\rm per}}$ , we cannot expect something better than in Corollary 1.

Corollary 2

For $\mathcal{B}_{\rm per}$ and $n\geq 2$ we have

[TABLE]

*In particular, *

[TABLE]

Indeed, the estimates of Corollary 2 are not as good as the estimates of Corollary 1. By adding the result of Ullrich (Ul15, , Theorem 1) one obtains

[TABLE]

or stated differently,

[TABLE]

In particular, (10) illustrates the dependence on the dimension, namely, for fixed $\varepsilon\in(0,1)$ Corollary 2 gives, except of a $\log d$ term, the right dependence on $d$ .

In the rest of the paper we prove the stated results and provide a conclusion.

2 Auxiliary Results, Proofs and Remarks

For the proof of Theorem 1.1 we need the following lemma.

Lemma 1

For $\delta>0$ let $\Gamma_{\delta}$ be a $\delta$ -cover of $\mathcal{B}$ . Then, for any point set $P\subset[0,1]^{d}$ with $n$ elements we have

[TABLE]

Proof

Let $B\in\mathcal{B}$ with $B\cap P=\emptyset$ . Then, there are $L_{B},U_{B}\in\Gamma_{\delta}$ with $L_{B}\subseteq B\subseteq U_{B}$ such that

[TABLE]

In particular, $L_{B}\cap P=\emptyset$ and

[TABLE]

Remark 1

In the proof we actually only used that there is a set $L_{B}\subseteq B$ with $\lambda_{d}(B\setminus L_{B})\leq\delta$ . Thus, instead of considering $\delta$ -covers it would be enough to work with set systems which approximate $B$ from below up to $\delta$ .

By probabilistic arguments similar to those of (BeCh87, , Section 8.1) we prove the main theorem. As in (HeNoWaWo01, , Theorem 1 and Theorem 3) for the star-discrepancy, it also turns out that such arguments are useful for studying the dependence on the dimension of the dispersion.

Proof of Theorem 1.1. By Lemma 1 it is enough to show that there is a point set $P$ which satisfies

[TABLE]

Let $(\Omega,\mathcal{F},\mathbb{P})$ be a probability space and $(X_{i})_{1\leq i\leq n}$ be an iid sequence of uniformly distributed random variables mapping from $(\Omega,\mathcal{F},\mathbb{P})$ into $[0,1]^{d}$ . We consider the sequence of random variables as “point set” and prove that with high probability the desired property (11) is satisfied. For $(c_{n})_{n\in\mathbb{N}}\subset(0,1)$ we have

[TABLE]

By the fact that $1-|{\Gamma_{\delta}}|^{-1/n}\leq\frac{\log|{\Gamma_{\delta}}|}{n}$ and by choosing $c_{n}=\frac{\log|{\Gamma_{\delta}}|}{n}$ we obtain

[TABLE]

Thus, there exists a realization of $(X_{i})_{1\leq i\leq n}$ , say $({{\bm{x}}}_{i})_{1\leq i\leq n}\subset[0,1]^{d}$ , so that for $P=\{{\bm{x}}_{1},\dots,{\bm{x}}_{n}\}$ the inequality (11) is satisfied. ∎∎

Remark 2

By Lemma 1 and the same arguments as in the proof of the theorem one can see that a point set of iid uniformly distributed random variables $X_{1},\dots,X_{n}$ satisfies a “good dispersion bound” with high probability. In detail,

[TABLE]

In particular, for confidence level $\alpha\in(0,1]$ and

[TABLE]

the probability that the random point set has dispersion smaller than $2\delta$ is strictly larger than $1-\alpha$ . This implies

[TABLE]

where the dependence on $d$ is hidden in $|{\Gamma_{\varepsilon/2}}|$ .

In the spirit of NoWo08 ; NoWo10 ; NoWo12 we are interested in polynomial tractability of the minimal dispersion, that is, $N_{\mathcal{B}}(d,\varepsilon)$ may not grow faster than polynomial in $\varepsilon^{-1}$ and $d$ . The following corollary is a consequence of the theorem and provides a condition on the $\delta$ -cover for such polynomial tractability.

Corollary 3

For $\delta\in(0,1)$ and the set of test sets $\mathcal{B}$ let $\Gamma_{\delta}$ be a $\delta$ -cover satisfying

[TABLE]

Then, for $n>c_{3}d$ one has

[TABLE]

Proof

Set $\delta=c_{3}d/n$ in (4) and the assertion follows.

This implies the result of Corollary 1.

Proof of Corollary 1. By (Gn08, , Formula (1), Theorem 1.15, Lemma 1.18) one has

[TABLE]

Here the last inequality follows mainly by $d!>\sqrt{2\pi d}(d/\mathrm{e})^{d}$ and the assertion is proven by Corollary 3 with $c_{1}=6\mathrm{e}$ , $c_{2}=0$ , $c_{3}=2$ . ∎∎

For $\mathcal{B}_{\rm per}$ we need to construct a $\delta$ -cover.

Lemma 2

For $\mathcal{B}_{\rm per}$ with $\delta>0$ and $m=\lceil 2d/\delta\rceil$ the set

[TABLE]

with

[TABLE]

is a $\delta$ -cover and satisfies $|{\Gamma_{\delta}}|=(m+1)^{2d}$ .

Proof

For arbitrary ${\bm{x}},{\bm{y}}\in[0,1]^{d}$ with ${\bm{x}}=(x_{1},\dots,x_{d})$ and ${\bm{y}}=(y_{1},\dots,y_{d})$ there are

[TABLE]

such that

[TABLE]

Define $B({\bm{x}},{\bm{y}})=\Pi_{k=1}^{d}I_{k}({\bm{x}},{\bm{y}})$ and note that it is enough to find $L_{B},U_{B}\in\Gamma_{\delta}$ with $L_{B}\subseteq B({\bm{x}},{\bm{y}})\subseteq U_{B}$ and $\lambda_{d}(U_{B}\setminus L_{B})\leq\delta$ . For any coordinate $k\in\{1,\dots,d\}$ we distinguish four cases illustrated in Figure 1:

Case: $|{x_{k}-y_{k}}|\leq 1/m$ and $x_{k}<y_{k}$ :

Define $I_{k}^{L}=\emptyset$ and $I_{k}^{U}=(a_{k},\bar{b}_{k})$ . (Here $I_{k}^{L}=[0,1]\setminus[0,1]=\emptyset$ .) 2. 2.

Case: $|{x_{k}-y_{k}}|\leq 1/m$ and $x_{k}\geq y_{k}$ :

Define $I_{k}^{L}=[0,1]\setminus[b_{k},\bar{a}_{k}]$ and $I_{k}^{U}=[0,1]\setminus[a_{k},a_{k}]$ . (Here $I_{k}^{U}=[0,1]\setminus\{a_{k}\}$ .) 3. 3.

Case: $|{x_{k}-y_{k}}|>1/m$ and $x_{k}<y_{k}$ :

Define $I_{k}^{L}=(\bar{a}_{k},b_{k})$ and $I_{k}^{U}=(a_{k},\bar{b}_{k})$ . 4. 4.

Case: $|{x_{k}-y_{k}}|>1/m$ and $x_{k}\geq y_{k}$ :

Define $I_{k}^{L}=[0,1]\setminus[b_{k},\bar{a}_{k}]$ and $I_{k}^{U}=[0,1]\setminus[\bar{b}_{k},a_{k}]$ .

In all cases we have $I_{k}^{L}\subseteq I_{k}({\bm{x}},{\bm{y}})\subseteq I_{k}^{U}$ as well as $\lambda_{1}(I_{k}^{U}\setminus I_{k}^{L})\leq 2/m$ . For $L_{B}=\Pi_{i=1}^{d}I_{i}^{L}\in\Gamma_{\delta}$ and $U_{B}=\Pi_{i=1}^{d}I_{i}^{U}\in\Gamma_{\delta}$ the inclusion property with respect to $B(x,y)$ does hold and

[TABLE]

By the choice of $m$ the right-hand side $2d/m$ is bounded by $\delta$ and the assertion is proven.

Now we easily can prove an upper bound of the minimal dispersion according to $\mathcal{B}_{\rm per}$ as formulated in Corollary 2.

Proof of Corollary 2. By the previous lemma we know that there is a $\delta$ -cover with cardinality bounded by $(4d\delta^{-1})^{2d}$ . Then by Corollary 3 with $c_{1}=4$ , $c_{2}=1$ and $c_{3}=2$ the proof is finished. ∎∎

3 Conclusion

Based on $\delta$ -covers we provide in the main theorem an estimate of the minimal dispersion similar to the one of (2). In the case where the VC-dimension of the set of test sets is not known, but a suitable $\delta$ -cover can be constructed our Theorem 1.1 leads to new results, as illustrated for $\mathcal{B}_{\rm per}$ . One might argue, that we only show existence of “good” point sets. However, Remark 2 tells us that a uniformly distributed random point set has small dispersion with high probability. As far as we know, an explicit construction of such point sets is not known.

Acknowledgements.

The author thanks Aicke Hinrichs, David Krieg, Erich Novak and Mario Ullrich for fruitful discussions to this topic.

Bibliography23

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) Aistleitner, C., Hinrichs, A., Rudolf, D.: On the size of the largest empty box amidst a point set. Discrete Appl. Math. 230 , 146–150 (2017)
2(2) Bachmayr, M., Dahmen, W., De Vore, R., Grasedyck, L.: Approximation of high-dimensional rank one tensors. Constr. Approx. 39 (2), 385–395 (2014)
3(3) Beck, J., Chen, W.: Irregularities of Distribution (Cambridge Tracts in Mathematics). Cambridge University Press (1987)
4(4) Blumer, A., Ehrenfeucht, A., Haussler, D., Warmuth, M.: Learnability and the Vapnik-Chervonenkis dimension. J. Assoc. Comput. Mach. 36 (4), 929–965 (1989)
5(5) Dick, J., Pillichshammer, F.: Digital nets and sequences: Discrepancy Theory and Quasi-Monte Carlo Integration. Cambridge University Press, Cambridge (2010)
6(6) Dick, J., Rudolf, D., Zhu, H.: Discrepancy bounds for uniformly ergodic Markov chain quasi-Monte Carlo. Ann. Appl. Probab. 26 , 3178–3205 (2016)
7(7) Dumitrescu, A., Jiang, M.: On the largest empty axis-parallel box amidst n 𝑛 n points. Algorithmica 66 (2), 225–248 (2013)
8(8) Dumitrescu, A., Jiang, M.: Perfect vector sets, properly overlapping partitions, and largest empty box. Preprint, Available at https://arxiv.org/abs/1608.06874 (2016)