Poisson fluctuations for edge counts in high-dimensional random   geometric graphs

Jens Grygierek

arXiv:1905.11221·math.PR·May 28, 2019

Poisson fluctuations for edge counts in high-dimensional random geometric graphs

Jens Grygierek

PDF

Open Access

TL;DR

This paper establishes a Poisson limit theorem for edge counts in high-dimensional random geometric graphs, demonstrating phase transition phenomena as dimension and intensity grow, using the Malliavin-Stein method.

Contribution

It introduces a novel Poisson approximation result for geometric graph edge counts in high dimensions, extending previous normal approximation bounds.

Findings

01

Poisson limit theorem for edge counts in high-dimensional graphs

02

Quantitative bounds involving first and second order difference operators

03

Phase transition phenomenon confirmed in high-dimensional setting

Abstract

We prove a Poisson limit theorem in the total variation distance of functionals of a general Poisson point process using the Malliavin-Stein method. Our estimates only involve first and second order difference operators and are closely related to the corresponding bounds for the normal approximation in the Wasserstein distance by Last, Peccati and Schulte (2016). As an application of this Poisson limit theorem, we consider a stationary Poisson point process in $R^{d}$ and connect any two points whenever their distance is less than or equal to a prescribed distance parameter. This construction gives rise to the well known random geometric graph. The number of edges of this graph is counted that have a midpoint in the $d$ -dimensional unit ball. A quantitative Poisson limit theorem for this counting statistic is derived, as the space dimension $d$ and the intensity of the Poisson…

Equations172

γ_{1} (F)

γ_{1} (F)

γ_{2} (F)

γ_{3, N} (F)

d_{W} (F, Z) \leq 2 γ_{1} (F) + γ_{2} (F) + γ_{3, N} (F),

d_{W} (F, Z) \leq 2 γ_{1} (F) + γ_{2} (F) + γ_{3, N} (F),

γ_{3, P} (F)

γ_{3, P} (F)

d_{T V} (F, P (θ)) \leq \frac{1 - e ^{- θ}}{θ} (2 γ_{1} (F) + γ_{2} (F) + \frac{γ _{3, P} ( F )}{θ} + ∣ \mathds E [F] - θ ∣ + ∣ \mathds V [F] - θ ∣),

d_{T V} (F, P (θ)) \leq \frac{1 - e ^{- θ}}{θ} (2 γ_{1} (F) + γ_{2} (F) + \frac{γ _{3, P} ( F )}{θ} + ∣ \mathds E [F] - θ ∣ + ∣ \mathds V [F] - θ ∣),

δ_{d} = \frac{1}{d},

δ_{d} = \frac{1}{d},

E (λ_{d}, δ_{d}, d) := \frac{1}{2} (y_{1}, y_{2}) \in (η_{λ_{d}})_{\neq =}^{2} \sum \mathds 1 {∥ y_{1} - y_{2} ∥ \leq δ_{d}, \frac{y _{1} + y _{2}}{2} \in \mathds B^{d}} .

E (λ_{d}, δ_{d}, d) := \frac{1}{2} (y_{1}, y_{2}) \in (η_{λ_{d}})_{\neq =}^{2} \sum \mathds 1 {∥ y_{1} - y_{2} ∥ \leq δ_{d}, \frac{y _{1} + y _{2}}{2} \in \mathds B^{d}} .

\mathds E [E_{d}] = \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d}

\mathds E [E_{d}] = \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d}

\frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d} + (1 - \frac{δ _{d}}{2})^{d} κ_{d}^{3} λ_{d}^{3} δ_{d}^{2 d} \leq \mathds V [E_{d}] \leq \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d} + (1 + \frac{δ _{d}}{2})^{d} κ_{d}^{3} λ_{d}^{3} δ_{d}^{2 d} .

\frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d} + (1 - \frac{δ _{d}}{2})^{d} κ_{d}^{3} λ_{d}^{3} δ_{d}^{2 d} \leq \mathds V [E_{d}] \leq \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d} + (1 + \frac{δ _{d}}{2})^{d} κ_{d}^{3} λ_{d}^{3} δ_{d}^{2 d} .

d \to \infty lim \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d}

d \to \infty lim \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d}

d \to \infty lim \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d}

d \to \infty lim \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d}

d_{T V} (E_{d}, P (θ)) \leq C_{1} (κ_{d} λ_{d} δ_{d}^{d})^{\frac{1}{2}} + C_{2} \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d} - θ,

d_{T V} (E_{d}, P (θ)) \leq C_{1} (κ_{d} λ_{d} δ_{d}^{d})^{\frac{1}{2}} + C_{2} \frac{1}{2} κ_{d}^{2} λ_{d}^{2} δ_{d}^{d} - θ,

E_{d} ⟶ D P (θ), as d \to \infty.

E_{d} ⟶ D P (θ), as d \to \infty.

\mathds B_{r}^{d} (z) := {x \in \mathds R^{d} : ∥ x - z ∥ \leq r},

\mathds B_{r}^{d} (z) := {x \in \mathds R^{d} : ∥ x - z ∥ \leq r},

κ_{d} := Λ_{d} (\mathds B^{d}) = \frac{π ^{\frac{d}{2}}}{Γ [ 1 + \frac{d}{2} ]}

κ_{d} := Λ_{d} (\mathds B^{d}) = \frac{π ^{\frac{d}{2}}}{Γ [ 1 + \frac{d}{2} ]}

d_{W} (X, Y) := h \in Lip (1) sup ∣ \mathds E [h (X)] - \mathds E [h (Y)] ∣ .

d_{W} (X, Y) := h \in Lip (1) sup ∣ \mathds E [h (X)] - \mathds E [h (Y)] ∣ .

d_{T V} (X, Y) := A \subseteq \mathds N sup ∣ \mathds P (X \in A) - \mathds P (Y \in A) ∣ .

d_{T V} (X, Y) := A \subseteq \mathds N sup ∣ \mathds P (X \in A) - \mathds P (Y \in A) ∣ .

\mathds P (η (B) = k) = \frac{μ ( B ) ^{k}}{k !} e^{- μ (B)},

\mathds P (η (B) = k) = \frac{μ ( B ) ^{k}}{k !} e^{- μ (B)},

D_{x} F := D_{x} f (η) := f (η + δ_{x}) - f (η),

D_{x} F := D_{x} f (η) := f (η + δ_{x}) - f (η),

D_{x_{1}, \dots, x_{m}}^{m} F

D_{x_{1}, \dots, x_{m}}^{m} F

\displaystyle\begin{array}[]{rrcl}DF:\mathds{X}\rightarrow\mathds{R},&\quad x&\overset{DF}{\longmapsto}&D_{x}f(\eta),\\ D^{m}F:\mathds{X}\rightarrow\mathds{R},&\quad(x_{1},\ldots,x_{m})&\overset{D^{m}F}{\longmapsto}&D_{x_{1},\ldots,x_{m}}^{m}f(\eta).\end{array}

\displaystyle\begin{array}[]{rrcl}DF:\mathds{X}\rightarrow\mathds{R},&\quad x&\overset{DF}{\longmapsto}&D_{x}f(\eta),\\ D^{m}F:\mathds{X}\rightarrow\mathds{R},&\quad(x_{1},\ldots,x_{m})&\overset{D^{m}F}{\longmapsto}&D_{x_{1},\ldots,x_{m}}^{m}f(\eta).\end{array}

F = \mathds E [F] + n = 1 \sum \infty I_{n} (f_{n})

F = \mathds E [F] + n = 1 \sum \infty I_{n} (f_{n})

n = 1 \sum \infty nn! ∥ f_{n} ∥_{L^{2} (μ^{n})}^{2} < \infty.

n = 1 \sum \infty nn! ∥ f_{n} ∥_{L^{2} (μ^{n})}^{2} < \infty.

D_{x} F = n = 1 \sum \infty n I_{n - 1} (f_{n} (x, \cdot)),

D_{x} F = n = 1 \sum \infty n I_{n - 1} (f_{n} (x, \cdot)),

\mathds E \mathds X \int (f (η + δ_{x}) - f (η))^{2} μ (d x) < \infty.

\mathds E \mathds X \int (f (η + δ_{x}) - f (η))^{2} μ (d x) < \infty.

n = 1 \sum \infty n^{2} n! ∥ f_{n} ∥_{L^{2} (μ^{n})}^{2} < \infty,

n = 1 \sum \infty n^{2} n! ∥ f_{n} ∥_{L^{2} (μ^{n})}^{2} < \infty,

L F = - n = 1 \sum \infty n I_{n} (f_{n}),

L F = - n = 1 \sum \infty n I_{n} (f_{n}),

L^{- 1} F := - n = 1 \sum \infty \frac{1}{n} I_{n} (f_{n}) .

L^{- 1} F := - n = 1 \sum \infty \frac{1}{n} I_{n} (f_{n}) .

P_{s} F := \int \mathds E [f (η^{(s)} + χ) η] Π_{(1 - s) μ} (d χ),

P_{s} F := \int \mathds E [f (η^{(s)} + χ) η] Π_{(1 - s) μ} (d χ),

L^{- 1} F = - 0 \int 1 s^{- 1} P_{s} F d s .

L^{- 1} F = - 0 \int 1 s^{- 1} P_{s} F d s .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRandom Matrices and Applications · Stochastic processes and statistical mechanics · Point processes and geometric inequalities

Full text

Poisson fluctuations for edge counts in high-dimensional random geometric graphs

Jens Grygierek111Institute of Mathematics, Osnabrück University, Germany. Email: [email protected]

Abstract

We prove a Poisson limit theorem in the total variation distance of functionals of a general Poisson point process using the Malliavin-Stein method. Our estimates only involve first and second order difference operators and are closely related to the corresponding bounds for the normal approximation in the Wasserstein distance by Last, Peccati and Schulte, see [LPS16]. As an application of this Poisson limit theorem, we consider a stationary Poisson point process in $\mathds{R}^{d}$ and connect any two points whenever their distance is less than or equal to a prescribed distance parameter. This construction gives rise to the well known random geometric graph. The number of edges of this graph is counted that have a midpoint in the $d$ -dimensional unit ball. A quantitative Poisson limit theorem for this counting statistic is derived, as the space dimension $d$ and the intensity of the Poisson point process tend to infinity simultaneously, extending our previous work, [GT16] where we derived a central limit theorem, showing that the phase transition phenomenon holds also in the high-dimensional set-up.

Keywords. Poisson limit theorem, edge counting statistic, high dimensional random geometric graph, Poisson point process, second-order Poincaré inequality, stochastic geometry, Mehler’s formula, Stein’s method, Malliavin calculus, phase transition

MSC (2010). 60D05, 60F05

1 Introduction and main results

Fix an intensity $\lambda\in(0,\infty)$ and a distance parameter $\delta\in(0,\infty)$ and let $\eta_{\lambda}$ be a stationary Poisson point process in $\mathds{R}^{d}$ , $d\in\mathds{N}$ with intensity $\lambda$ . The points of $\eta_{\lambda}$ are taken as the vertices of a random graph and we connect any two distinct vertices by an edge provided that their distance is less than or equal to $\delta$ . By this construction the random geometric graph in $\mathds{R}^{d}$ arises.

This paper is a direct continuation of [GT16], where we have derived a quantitative central limit theorem for the number of edges that have their midpoint in the $d$ -dimensional unit ball $\mathds{B}^{d}$ as the space dimension $d$ and the intensity $\lambda$ tend to infinity simultaneously such that the expectation of the considered edge counting statistic tends to infinity. In this paper we derive the corresponding Poisson limit theorem in the case that the expectation tends to a positive but finite constant by first proving a general Poisson limit theorem for Poisson functionals using the Malliavin-Stein method, that comes in the taste of the remarkable central limit theorem [LPS16, Theorem 1.1].

1.1 Poisson approximation for $\mathds{N}$ -valued Poisson functionals

We first rephrase a version of the main result from [LPS16], a so-called second order Poincaré inequality for Poisson functionals, see also [LP18, Theorem 2.13], that only involves moments of first and second order difference operators.

Theorem 1.1.

Let $F\in\operatorname{dom}(D)$ be a Poisson functional such that ${\mathds{E}}\left[F\right]=0$ and ${\mathds{V}}\left[F\right]=1$ . Define

[TABLE]

and let $Z$ be a standard Gaussian random variable, then

[TABLE]

where $\operatorname{d}_{W}$ denotes the Wasserstein-distance, see Definition 2.1.

Replacing the third term in the approximation bound with

[TABLE]

we can formulate the analogue of Theorem 1.1 for Poisson approximation, which is our first main result and will be used later to derive Theorem 1.4.

Theorem 1.2 (Poisson Approximation).

Let $\eta$ be a Poisson point process on $\mathds{X}$ with $\sigma$ -finite non-atomic intensity measure $\mu$ and let $F$ be an $\mathds{N}$ -valued Poisson functional satisfying $F\in\operatorname{dom}(D)$ . Further, let $\mathcal{P}(\theta)$ be a Poisson distributed random variable with parameter $\theta>0$ . Then

[TABLE]

where $\operatorname{d}_{TV}$ denotes the total variation distance, see Definition 2.2.

Note that $\gamma_{1}(F)$ and $\gamma_{2}(F)$ were also used before in the central limit theorem, which will be useful in the proof of our second main result, since it allows us to reuse some of the calculations we did in the previous work [GT16].

1.2 Poisson fluctuations for edge counts in high-dimensional random geometric graphs

Let $\eta_{d}$ be a stationary Poisson point process on $\mathds{R}^{d}$ with dimension-dependent intensity $\lambda_{d}\in(0,\infty)$ , i.e. the intensity measure is given by $\mu_{d}=\lambda_{d}\Lambda_{d}$ , where $\Lambda_{d}$ denotes the $d$ -dimensional Lebesgue measure. We choose a dimension-dependent distance parameter $\delta_{d}$ with $\delta_{d}\rightarrow 0$ for $d\rightarrow\infty$ , namely we take

[TABLE]

which implies that $\delta_{d}\in(0,1)$ for all $d\geq 2$ . The motivation for our choice is explained in Remark 5.2 below, where we also give the precise conditions for $\delta_{d}$ to allow for more general choices. We notice that $\delta_{d}\rightarrow 0$ . Finally we choose the dimension-dependent intensity $\lambda_{d}$ such that $\lambda_{d}\rightarrow\infty$ for $d\rightarrow\infty$ .

Let $\mathcal{E}(\lambda_{d},\delta_{d},d)$ denote the number of edges of the random geometric graph that have their midpoint in the $d$ -dimensional unit ball $\mathds{B}^{d}$ , that is the edge-counting statistic given by

[TABLE]

To simplify our notation we shall use the abbreviation $\mathcal{E}_{d}$ for $\mathcal{E}(\lambda_{d},\delta_{d},d)$ . The expectation and the variance of $\mathcal{E}_{d}$ was already derived in our previous work [GT16, eq. 4, eq. 5, Lemma 7], namely:

[TABLE]

and

[TABLE]

Here and below, $\kappa_{d}:=\Lambda_{d}(\mathds{B}^{d})$ denotes the volume of the $d$ -dimensional unit ball. Note that the exponential decay of $\kappa_{d}$ behaves like $\frac{1}{\sqrt{\pi d}}\left({\frac{2\pi e}{d}}\right)^{\frac{d}{2}}$ , as $d\rightarrow\infty$ , according to Stirling’s formula.

We investigate the asymptotic distributional behavior of $\mathcal{E}_{d}$ as $\delta_{d}\rightarrow 0$ and the intensity $\lambda_{d}$ as well as the space dimension $d$ tend to infinity simultaneously. This set-up is opposed to the most of the existing literature in which the focus lies on random geometric graphs in $\mathds{R}^{d}$ with some fixed space dimension $d$ , see [Bub+16] and [Dev+11] for notable exceptions, where, however, questions concerning the high-dimensional fluctuations are not touched.

The asymptotic behavior of $\mathcal{E}_{d}$ depends on how fast the sequence $(\lambda_{d})_{d\in\mathds{N}}$ increases as $d\rightarrow\infty$ . This phenomenon is quite common for asymptotic results related to edge counts (or more generally subgraph counts) and component counts. In particular, here, one has to distinguishes the following phases, determined by the limit of the expectation ${\mathds{E}}\left[\mathcal{E}_{d}\right]$ :

[TABLE]

Remark 1.3.

If the expectation tends to infinity (1) the edge-counting statistic satisfies a central limit theorem, see [GT16, Theorem 1].

In this paper, we obtain a Poisson limit theorem for a finite non-zero limit (2) showing that the phase-transition phenomenon for the edge-counting statistic holds also in the high-dimensional set-up:

Theorem 1.4.

Assume $\frac{1}{2}\kappa_{d}^{2}\lambda_{d}^{2}\delta_{d}^{d}\rightarrow\theta\in(0,\infty)$ for $d\rightarrow\infty$ and let $\mathcal{P}(\theta)$ be a Poisson distributed random variable with parameter $\theta$ . Then one can find absolute constants $\mathbf{C}_{1},\mathbf{C}_{2},\mathbf{D}\in(0,\infty)$ such that

[TABLE]

whenever $d\geq\mathbf{D}$ . In particular, one has that

[TABLE]

Remark 1.5.

If the expectation tends to zero, (3), we also have ${\mathds{V}}\left[\mathcal{E}_{d}\right]\rightarrow 0$ , indicating that the edge-counting statistic vanishes in the limit, since the random graph contains almost surely no edges.

The rest of this text is structured as follows. In Section 2 we recall some necessary background material on Poisson functionals and the Malliavin-Stein method. In particular we introduce Mehler’s formula that will be the core ingredient in the proof of Theorem 1.2 in Section 3. In Section 4 we derive a general bound for second order $U$ -statistics. The final Section 5 contains the proof of Theorem 1.4.

2 Preliminaries

The $d$ -dimensional Euclidean space is denoted by $\mathds{R}^{d}$ and we let $\mathscr{B}^{d}$ be the Borel $\sigma$ -field on $\mathds{R}^{d}$ . The Lebesgue measure on $\mathds{R}^{d}$ is indicated by $\Lambda_{d}$ . A $d$ -dimensional ball with radius $r>0$ and center in $z\in\mathds{R}^{d}$ is defined by

[TABLE]

where $\lVert\cdot\rVert$ stands for the usual Euclidean norm. We shall write $\mathds{B}^{d}$ instead of $\mathds{B}^{d}_{1}(0)$ and denote by

[TABLE]

the volume of the $d$ -dimensional unit ball $\mathds{B}^{d}$ , where $\Gamma\left[{\cdot}\right]$ is Euler’s gamma function.

We will use the Wasserstein-distance for the normal approximation and the total variation distance for the Poisson approximation, see for instance [BP16, Section 2.1].

Definition 2.1.

We denote by $\operatorname{Lip}(1)$ the class of Lipschitz functions $h:\mathds{R}\rightarrow\mathds{R}$ with Lipschitz constant less or equal to one, i.e. $h$ is absolutely continuous and almost everywhere differentiable with $\lVert h^{\prime}\rVert_{\infty}\leq 1$ . Given two $\mathds{R}$ -valued random variables $X,Y$ , with ${\mathds{E}}\left\lvert X\right\rvert<\infty$ and ${\mathds{E}}\left\lvert Y\right\rvert<\infty$ the Wasserstein distance between the laws of $X$ and $Y$ , written $\operatorname{d}_{W}(X,Y)$ is defined as

[TABLE]

Definition 2.2.

Given two $\mathds{N}$ -valued random variables $X,Y$ , the total variation distance between the laws of $X$ and $Y$ , written $\operatorname{d}_{TV}(X,Y)$ is defined as

[TABLE]

2.1 Poisson functionals and Malliavin-Stein Method

Let $(\mathds{X},\mathscr{X},\mu)$ be a Borel measure space with $\sigma$ -finite and non-atomic measure $\mu$ such that $\mu(\mathds{X})>0$ . For $p>0$ and $n\in\mathds{N}$ we denote by $L^{p}(\mu^{n})$ the set of all measurable functions $f:\mathds{X}^{n}\rightarrow\mathds{R}$ such that $\int\lvert f\rvert^{p}{\mathrm{d}}\mu^{n}<\infty$ .

We use the symbol $\mathrm{N}_{\sigma}:=\mathrm{N}_{\sigma}(\mathds{X})$ to indicate the class of all $\sigma$ -finite measures $\chi$ on $\mathds{X}$ with $\chi(B)\in\mathds{N}\cup\{{\infty}\}$ for all $B\in\mathscr{X}$ and supply the space $\mathrm{N}_{\sigma}$ with the smallest $\sigma$ -field $\mathscr{N}_{\sigma}:=\mathscr{N}_{\sigma}(\mathds{X})$ such that all mappings of the form $\chi\mapsto\chi(B)$ with $\chi\in\mathrm{N}$ and $B\in\mathscr{X}$ are measurable.

It will be convenient for us to identify a counting measure $\chi\in\mathrm{N}_{\sigma}$ with its support and to write $x\in\chi$ if the point $x\in\mathds{X}$ is charged by $\chi$ . The Dirac measure concentrated at a point $x\in\mathds{X}$ is denoted by $\delta_{x}$ . This construction mostly follows [Pec12] and [LPS16]. We let $(\Omega,\mathscr{F},{\mathds{P}})$ be our underlying probability space and denote by $L^{p}({\mathds{P}})$ , $p>0$ , the space of all random variables $Y:\Omega\rightarrow\mathds{R}$ such that ${\mathds{E}}\lvert Y\rvert^{p}<\infty$ .

Consider a $\sigma$ -finite non-atomic measure $\mu$ on $\mathds{X}$ . A Poisson point process $\eta$ with intensity measure $\mu$ is a random counting measure on $\mathds{X}$ , that is a random element in $\mathrm{N}_{\sigma}$ , such that

a)

For all $B\in\mathscr{X}$ and all $k\in\mathds{N}$ it holds, that $\eta(B)\overset{d}{\sim}\text{Po}_{\mu(B)}$ , i.e.,

[TABLE]

and for $\mu(B)=\infty$ , we set $\frac{\infty^{k}}{k!}e^{-\infty}=0$ for all $k$ . 2. b)

For all $m\in\mathds{N}\setminus\{0\}$ and all pairwise disjoint measurable sets $B_{1},\ldots,B_{m}\in\mathscr{X}$ , the random variables $\eta(B_{1}),\ldots,\eta(B_{m})$ are independent.

By a Poisson functional $F$ we understand a random variable $F\in L^{2}({\mathds{P}})$ , that is almost surely of the form $F=f(\eta)$ , where $f:\mathrm{N}_{\sigma}\rightarrow\mathds{R}$ is some measurable function, the so-called representative of $F$ . For a Poisson functional $F$ with representative $f$ and $x\in\mathds{X}$ we define the first-order difference operator

[TABLE]

and for $m\geq 2$ points $x_{1},\ldots,x_{m}\in\mathds{X}$ the $m$ -th-order difference operator $D_{x_{1},\ldots,x_{m}}F$ is defined inductively by

[TABLE]

where $D_{x}^{1}F=D_{x}F$ . Note that this definition does not depend on the choice of the representative $f$ $\mu^{m}$ -a.e. and ${\mathds{P}}$ -a.s. and further that $D^{m}_{x_{1},\ldots,x_{m}}F$ is symmetric in the arguments $x_{1},\ldots,x_{m}$ .

In the following we will denote by $DF$ resp. $D^{m}F$ the mappings

[TABLE]

For a short introduction to the Malliavin-Calculus we recall some of the important tools in the development of the theory. For a deeper discussion of Fock Spaces and Chaos Expansion as well as Malliavin-Calculus and Malliavin-Stein Method we refer the reader to [Las16] and the books [LP18, PR16]. We introduce the notion of the Wiener-Itô chaos expansion, see [LPS16] and the references therein, especially [LP11] for more details and proofs.

Every Poisson functional $F$ admits a representation of the type

[TABLE]

where the series coverges in $L^{2}({\mathds{P}})$ . For each $n\geq 1$ , the kernel $f_{n}$ is given by the (scaled) expectation of the $n$ -order difference operator, i.e. $f_{n}:=\frac{1}{n!}{\mathds{E}}\left[D^{n}F\right]$ and $I_{n}(\cdot)$ denotes the $n$ -th order Wiener-Itô integral. This representation is known as Wiener-Itô chaos expansion of $F$ .

We say a Poisson functional lies in the domain of $D$ , $F\in\operatorname{dom}(D)$ , if

[TABLE]

In this case $D$ is called the Malliavin derivative operator associated with the Poisson process $\eta$ , and it holds ${\mathds{P}}$ -a.s. and $\mu$ -a.e., $x\in\mathds{X}$ , that

[TABLE]

where the right hand side is the definition of the Malliavin derivative operator and the left hand side is the path-wise defined first-order difference operator given by (4).

Note that the following Lemma can be used to easily check if a Poisson functional lies in the domain of $D$ .

Lemma 2.3 ([PT13, Lemma 3.1]).

Let $F\in L^{2}({\mathds{P}})$ denote a Poisson functional with representative $f$ such that

[TABLE]

Then $F\in\operatorname{dom}(D)$ .

The Wiener Itô chaos expansion gives rise to the Ornstein-Uhlenbeck generator $L$ , that is defined for all Poisson functionals $F\in\operatorname{dom}(L)$ , i.e.

[TABLE]

by

[TABLE]

and its (pseudo) inverse $L^{-1}$ is given by

[TABLE]

In [Pec+10, Section 3, Theorem 3.1] the Malliavin-Calculus was combined with Stein’s method to derive a bound on the Wasserstein distance between the law of a standardized Poisson Functional $F\in\operatorname{dom}(D)$ and the standard Gaussian distribution. This bound as well as the bound derived in [Pec12, Theorem 3.1], stated here as Theorem 3.1 for Poisson approximation in the total variation distance rely on the inverse $L^{-1}$ of the Ornstein-Uhlenbeck generator $L$ , which generally requires the calculation of the Wiener-Itô chaos expansion of $F$ . In [LPS16] this was solved for the normal approximation case by establishing and applying a general Mehler formula for Poisson processes which allows to represent the inverse Ornstein-Uhlenbeck generator in terms of thinned Poisson point processes to derive bounds that only rely on the moments of the first- and second-order difference operators $D_{x}F$ and $D^{2}_{x_{1},x_{2}}F$ .

2.2 Mehler’s formula

For the sake of brevity we only introduce Mehler’s formula and the derived results we will need in the proof of Theorem 1.2 and refer the reader for the full coverage to [LPS16].

Let $s\in[0,1]$ and denote by $\eta^{(s)}$ the $s$ -thinning of our Poisson point process $\eta$ and by $\Pi_{\nu}$ the distribution of a Poisson point process with intensity measure $\nu$ . We define the operator $P_{s}$ by

[TABLE]

where the conditional expectation is taken with respect to the random thinning and the Poisson point process $\chi$ , conditioned on $\eta$ . Using the operator $P_{s}$ , we derive Mehler’s formula:

Theorem 2.4 (Mehler’s formula, [LPS16, Theorem 3.2]).

Let $F$ be a Poisson functional and ${\mathds{E}}\left[F\right]=0$ , then we have ${\mathds{P}}$ -a.s. that

[TABLE]

We will need the following inequalities in the proof of our Poisson limit theorem, Theorem 1.2.

Lemma 2.5 ([LPS16, Lemma 3.4]).

Let $F$ be a Poisson functional and $p\geq 1$ , then

[TABLE]

and

[TABLE]

Since $L^{-1}F=L^{-1}(F-{\mathds{E}}\left[F\right])$ and $D_{x}F=D_{x}(F-{\mathds{E}}\left[F\right])$ , we can rephrase the result on the covariance, see [LPS16, Theorem 4.1], to obtain a result on the variance of our Poisson functional $F$ :

Theorem 2.6.

Let $F\in\operatorname{dom}(D)$ , then

[TABLE]

3 Proof of Theorem 1.2

Let us first recall the Malliavin bounds for Poisson approximation from [Pec12, Theorem 3.1]:

Theorem 3.1.

Let $\eta$ be a Poisson point process on $\mathds{X}$ with $\sigma$ -finite and non-atomic intensity measure $\mu$ and let $F$ be an $\mathds{N}$ -valued Poisson functional satisfying $F\in\operatorname{dom}(D)$ . Further let $\mathcal{P}(\theta)$ be a Poisson distributed random variable with parameter $\theta>0$ . Then

[TABLE]

The main idea of the proof is to take Mehler’s formula and its application from [LPS16, Sections 3 and 4] and adapt this technique for the bound given by Theorem 3.1.

Proof of Theorem 1.2:

Using the Cauchy-Schwarz inequality we can bound the first term by

[TABLE]

and apply Theorem 2.6 to derive

[TABLE]

which yields the first part of our bound

[TABLE]

The second term can be bounded by using Fubini’s theorem and Hölders-inequality with parameters $p=q=2$ . Thus

[TABLE]

which can be bounded using Lemma 2.5 by

[TABLE]

yielding the second part of our bound

[TABLE]

completing the proof of Theorem 1.2. ∎

4 A general bound for second-order $U$ -statistics

In this section, we adapt the general bound for the normal approximation of second-order $U$ -statistics, that was provided in [GT16, Section 3] to the Poisson case, showing that some of the previous results therein can be reused. Let $F_{d}$ denote a second-order $U$ -statistics in the sense of [RS13] based on a Poisson point process in $\mathds{R}^{d}$ having intensity measure $\mu$ . Formally we define

[TABLE]

and assume that $h:\mathds{R}^{d}\times\mathds{R}^{d}\rightarrow\left\{{0,1}\right\}$ is a symmetric measurable function, which we allow to depend on the space dimension $d$ . Furthermore, we assume that ${\mathds{E}}\left[F_{d}^{2}\right]<\infty$ . Finally we define the two parameter integrals

[TABLE]

cf. [GT16, Section 3], where we already omit the exponents of $h:\mathds{R}^{d}\times\mathds{R}^{d}\rightarrow\left\{{0,1}\right\}$ .

Following [GT16, Section 3], by Mecke’s formula we have that

[TABLE]

and

[TABLE]

Next, we compute the expectations occurring at the right-hand side of Theorem 1.2 to prepare the bounds for the three terms $\gamma_{1}(F_{d})$ , $\gamma_{2}(F_{d})$ , and $\gamma_{3,P}(F_{d})$ .

Lemma 4.1.

Let $x,x_{1},x_{2}\in\mathds{R}^{d}$ . Then

(a)

${\mathds{E}}[(D_{x}F_{d})^{2}]=A(x)^{2}+A(x)$ , 2. (b)

${\mathds{E}}[\left\lvert D_{x}F_{d}\right\rvert^{3}]=A(x)^{3}+3A(x)^{2}+A(x)$ , 3. (c)

${\mathds{E}}[(D_{x}F_{d})^{4}]=P(x)$ , with

[TABLE] 4. (d)

${\mathds{E}}[(D_{x}F_{d}(D_{x}F_{d}-1))^{2}]=Q(x)$ , with

[TABLE] 5. (e)

${\mathds{E}}[(D_{x_{1},x_{2}}F_{d})^{4}]=h(x_{1},x_{2})$ .

Proof:

Assertions (b), (c) and (e) are following directly from [GT16, Lemma 3] using $h^{2}=h$ . Additionally the proof of (a) is similar to the proof of (b) writing

[TABLE]

To prove d) we write

[TABLE]

and obtain $Q(x)$ using (a) and (b) combined with $D_{x}F\geq 0$ and (c). ∎

We shall now provide the announced expressions for the terms $\gamma_{1}(F_{d})$ , $\gamma_{2}(F_{d})$ and $\gamma_{3,P}(F_{d})$ .

Lemma 4.2.

We have that

[TABLE]

Proof:

The expression for $\gamma_{1}(F_{d})$ and $\gamma_{2}(F_{d})$ are following similar to [GT16, Lemma 4] by replacing the standardized $U$ -statistics $\widetilde{F_{d}}$ with the non-standardized $F_{d}$ . Using Lemma 4.1 (a) and (d) we have that

[TABLE]

and the proof is complete. ∎

Now we can combine these expressions established so far to reformulate Theorem 1.2 for our second-order $U$ -statistic $F_{d}$ .

Proposition 4.3.

Let $\eta$ be a Poisson point process on $\mathds{X}$ with $\sigma$ -finite non-atomic intensity measure $\mu$ and let $F_{d}:=\frac{1}{2}\sum_{(y_{1},y_{2})\in\eta^{2}_{\neq}}h(y_{1},y_{2})$ be a second-order $U$ -statistic with symmetric kernel $h:\mathds{R}^{d}\times\mathds{R}^{d}\rightarrow\left\{{0,1}\right\}$ . Suppose that ${\mathds{E}}\int_{\mathds{R}^{d}}(D_{x}F_{d})^{2}\mu({\mathrm{d}}x)<\infty$ . Defining

[TABLE]

one has that

[TABLE]

where $\mathcal{P}(\theta)$ is a Poisson distributed random variable with parameter $\theta>0$ .

5 Proof of Theorem 1.4

Let us recall that $\eta_{d}$ denotes a stationary Poisson point process on $\mathds{R}^{d}$ with intensity $\lambda_{d}$ given by (2). We denote by $\mu$ the intensity measure of $\eta_{d}$ , that is, $\mu$ is $\lambda_{d}$ times the Lebesgue measure on $\mathds{R}^{d}$ . Moreover, from now on we will assume without loss of generality that all the random variables $(\mathcal{E}_{d})_{d\geq 2}$ are defined on a common probability space $(\Omega,\mathscr{F},{\mathds{P}})$ .

It easy to see, that the edge counting statistic $\mathcal{E}_{d}$ is a second-order $U$ -statistic with measurable, symmetric and $d$ -dependent kernel $h:\mathds{R}^{d}\times\mathds{R}^{d}\rightarrow\left\{{0,1}\right\}$ , given by

[TABLE]

To derive Theorem 1.4 we apply the Poisson approximation bound derived in Proposition 4.3 using the bounds on the parameter integrals and the expectation and variance of $\mathcal{E}_{d}$ given by [GT16, eq. 15, Lemma 6, Lemma 7]. We have

[TABLE]

and

[TABLE]

Lemma 5.1.

Let $h:\mathds{R}^{d}\times\mathds{R}^{d}\rightarrow\left\{{0,1}\right\}$ be the function given by (7). Then for all $x\in\mathds{R}^{d}$ it holds that

[TABLE]

Remark 5.2 (cf. [GT16, Remark 8]).

Our particular choice $\delta_{d}=\frac{1}{d}$ ensures that we can find absolute constants $\mathbf{C}_{1},\mathbf{C}_{2}\in(0,\infty)$ and $\mathbf{D}\in\mathds{N}$ such that

[TABLE]

and

[TABLE]

for all $d\geq\mathbf{D}$ . The existence of such constants is important to derive the final bounds on the right hand side of our main result and implies restrictions to more general choices of $\delta_{d}$ , see the proof of Lemma 5.4. If one is only interested in the Poisson limit, the first condition (11) can be omitted, since it is only involved in the lower variance bound used in the Gaussian approximations, see [GT16, Lemma 11 and eq. 20].

In the next step, we check the integrability condition in Proposition 4.3. Note that this condition determines the limiting distribution, yielding the Gaussian limit if (1) holds resp. the Poisson limit if (2) holds:

Lemma 5.3.

If (1) holds, we have

[TABLE]

and [GT16, Theorem 1] yields the Gaussian limit for the standardized edge counting statistics $({\mathds{V}}\left[\mathcal{E}_{d}\right])^{-\frac{1}{2}}(\mathcal{E}_{d}-{\mathds{E}}\left[\mathcal{E}_{d}\right])$ .

If (2) holds, we have

[TABLE]

thus we can apply the Poisson approximation given by Proposition 4.3.

Proof:

The first claim was already shown by [GT16, Lemma 9]. For the second claim, note that

[TABLE]

Using (8) and (9) we obtain

[TABLE]

Assumption (2), ${\mathds{E}}\left[\mathcal{E}_{d}\right]=\frac{1}{2}\kappa_{d}^{2}\lambda_{d}^{2}\delta_{d}^{d}\rightarrow\theta$ , implies that $\kappa_{d}^{3}\lambda_{d}^{3}\delta_{d}^{2d}\rightarrow 0$ . The choice of $\delta_{d}$ according to Remark 5.2 ensures that $\left({1+\frac{\delta_{d}}{2}}\right)^{d}$ can be bounded. Thus ${\mathds{V}}\left[\mathcal{E}_{d}\right]\rightarrow\theta$ and further ${\mathds{V}}\left[\mathcal{E}_{d}\right]+{\mathds{E}}\left[\mathcal{E}_{d}\right]<\infty$ . ∎

Now, we will use the bounds for the parameter integral $A(x)$ to derive an upper bound for the three terms appearing in Proposition 4.3.

Lemma 5.4.

There are absolute constants $\mathbf{C}_{1},\mathbf{C}_{2},\mathbf{C}_{3}\in(0,\infty)$ and $\mathbf{D}\in\mathds{N}$ such that

[TABLE]

for all $d\geq\mathbf{D}$ .

Proof:

Applying (10) to the definition of $P(x)$ in Lemma 4.1 we see that

[TABLE]

Therefore, it follows that

[TABLE]

We now use Fubini’s theorem to re-write the double integral. Together with (10) this implies

[TABLE]

Note that (2), ${\mathds{E}}\left[\mathcal{E}_{d}\right]=\frac{1}{2}\kappa_{d}^{2}\lambda_{d}^{2}\delta_{d}^{d}\rightarrow\theta$ , implies $\kappa_{d}\lambda_{d}\delta_{d}^{d}\rightarrow 0$ , thus the speed of convergence is dominated by the term with the lowest exponent. Additionally $\kappa_{d}^{2}\lambda_{d}^{2}\delta_{d}^{d}\rightarrow\theta$ and Remark 5.2 imply, that we can bound $(1+\frac{\delta_{d}}{2})^{d}$ and $\kappa_{d}^{2}\lambda_{d}^{2}\delta_{d}^{d}$ by absolute constants for $d$ sufficiently large. Thus there are absolute constants $\tilde{\mathbf{C}}_{1},\hat{\mathbf{C}}_{1},\mathbf{C}_{1}\in(0,\infty)$ and $\mathbf{D}_{1}\in\mathds{N}$ such that

[TABLE]

for all $d\geq\mathbf{D}_{1}$ . Using (10) we obtain in a similar way that

[TABLE]

for all $d\geq\mathbf{D}_{2}$ , where $\mathbf{C}_{2}\in(0,\infty)$ and $\mathbf{D}_{2}\in\mathds{N}$ are absolute constants. Applying (10) to the definition of $Q(x)$ in Lemma 4.1 we see that

[TABLE]

and

[TABLE]

Therefore, it follows that

[TABLE]

for all $d\geq\mathbf{D}_{3}$ , where $\tilde{\mathbf{C}}_{3},\hat{\mathbf{C}}_{3},\mathbf{C}_{3}\in(0,\infty)$ and $\mathbf{D}_{3}\in\mathds{N}$ are absolute constants. Setting $D:=\max\left\{{\mathbf{D}_{1},\mathbf{D}_{2},\mathbf{D}_{3}}\right\}$ completes the proof. ∎

After these preparations, we can now present the proof of our second main result.

Proof of Theorem 1.4:

We use Proposition 4.3 and the results of the last lemma. Assuming (2) we find absolute constants $\mathbf{C}_{1},\mathbf{C}_{2},\mathbf{C}_{3},\mathbf{C}_{4},\mathbf{C}_{5}\in(0,\infty)$ and $\mathbf{D}\in\mathds{N}$ such that

[TABLE]

holds for all $d\geq\mathbf{D}$ . Since $\kappa_{d}\lambda_{d}\delta_{d}^{d}\rightarrow 0$ the first and the last term are converging faster to zero than the second term, thus we can find absolute constants $\widetilde{\mathbf{C}}_{1},\widetilde{\mathbf{C}}_{2}\in(0,\infty)$ and $\widetilde{\mathbf{D}}$ such that

[TABLE]

holds for all $d\geq\widetilde{\mathbf{D}}$ . Using our assumption (2) it follows that $\operatorname{d}_{TV}\left({\mathcal{E}_{d},\mathcal{P}(\theta)}\right)\rightarrow 0$ and hence $\mathcal{E}\overset{\mathclap{D}}{\longrightarrow}\mathcal{P}(\theta)$ as $d\rightarrow\infty$ . This completes the proof of Theorem 1.4. ∎

Bibliography13

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[BP 16] Solesne Bourguin and Giovanni Peccati “The Malliavin-Stein method on the Poisson space” In Stochastic analysis for Poisson point processes 7 , Bocconi Springer Ser. Bocconi Univ. Press, [place of publication not identified], 2016, pp. 185–228
2[Bub+16] Sébastien Bubeck, Jian Ding, Ronen Eldan and Miklós Z. Rácz “Testing for high-dimensional geometry in random graphs” In Random Structures Algorithms 49.3 , 2016, pp. 503–532 DOI: 10.1002/rsa.20633 · doi ↗
3[Dev+11] Luc Devroye, András György, Gábor Lugosi and Frederic Udina “High-dimensional random geometric graphs and their clique number” In Electron. J. Probab. 16 , 2011, pp. no. 90 \bibrangessep 2481–2508 DOI: 10.1214/EJP.v 16-967 · doi ↗
4[GT 16] Jens Grygierek and Christoph Thäle “Gaussian fluctuations for edge counts in high-dimensional random geometric graphs” In ar Xiv e-prints , 2016, pp. ar Xiv:1612.03286 ar Xiv: 1612.03286 [math.PR]
5[Las 16] Günter Last “Stochastic analysis for Poisson processes” In Stochastic analysis for Poisson point processes 7 , Bocconi Springer Ser. Bocconi Univ. Press, [place of publication not identified], 2016, pp. 1–36 DOI: 10.1007/978-3-319-05233-5˙1 · doi ↗
6[LP 11] Günter Last and Mathew D. Penrose “Poisson process Fock space representation, chaos expansion and covariance inequalities” In Probab. Theory Related Fields 150.3-4 , 2011, pp. 663–690 DOI: 10.1007/s 00440-010-0288-5 · doi ↗
7[LP 18] Günter Last and Mathew Penrose “Lectures on the Poisson process” 7 , Institute of Mathematical Statistics Textbooks Cambridge University Press, Cambridge, 2018, pp. xx+293
8[LPS 16] Günter Last, Giovanni Peccati and Matthias Schulte “Normal approximation on Poisson spaces: Mehler’s formula, second order Poincaré inequalities and stabilization” In Probab. Theory Related Fields 165.3-4 , 2016, pp. 667–723 DOI: 10.1007/s 00440-015-0643-7 · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Poisson fluctuations for edge counts in high-dimensional random geometric graphs

Abstract

1 Introduction and main results

1.1 Poisson approximation for \mathdsN\mathds{N}\mathdsN-valued Poisson functionals

Theorem 1.1**.**

Theorem 1.2** (Poisson Approximation).**

1.2 Poisson fluctuations for edge counts in high-dimensional random geometric graphs

Remark 1.3**.**

Theorem 1.4**.**

Remark 1.5**.**

2 Preliminaries

Definition 2.1**.**

Definition 2.2**.**

2.1 Poisson functionals and Malliavin-Stein Method

Lemma 2.3** ([PT13, Lemma 3.1]).**

2.2 Mehler’s formula

Theorem 2.4** (Mehler’s formula, [LPS16, Theorem 3.2]).**

Lemma 2.5** ([LPS16, Lemma 3.4]).**

Theorem 2.6**.**

3 Proof of Theorem 1.2

Theorem 3.1**.**

4 A general bound for second-order UUU-statistics

Lemma 4.1**.**

Lemma 4.2**.**

Proposition 4.3**.**

5 Proof of Theorem 1.4

Lemma 5.1**.**

Remark 5.2** (cf. [GT16, Remark 8]).**

Lemma 5.3**.**

Lemma 5.4**.**

1.1 Poisson approximation for $\mathds{N}$ -valued Poisson functionals

Theorem 1.1.

Theorem 1.2 (Poisson Approximation).

Remark 1.3.

Theorem 1.4.

Remark 1.5.

Definition 2.1.

Definition 2.2.

Lemma 2.3 ([PT13, Lemma 3.1]).

Theorem 2.4 (Mehler’s formula, [LPS16, Theorem 3.2]).

Lemma 2.5 ([LPS16, Lemma 3.4]).

Theorem 2.6.

Theorem 3.1.

4 A general bound for second-order $U$ -statistics

Lemma 4.1.

Lemma 4.2.

Proposition 4.3.

Lemma 5.1.

Remark 5.2 (cf. [GT16, Remark 8]).

Lemma 5.3.

Lemma 5.4.