Testing goodness of fit for point processes via topological data analysis
Christophe Ange Napol\'eon Biscio, Nicolas Chenavier, Christian, Hirsch, Anne Marie Svane

TL;DR
This paper develops new goodness-of-fit tests for point patterns using topological data analysis, specifically persistent Betti numbers, and demonstrates their effectiveness through simulations and a neuroscience application.
Contribution
It introduces a novel statistical framework based on persistent Betti numbers for assessing point process models, including theoretical conditions and practical testing procedures.
Findings
Persistent Betti numbers are asymptotically Gaussian for large observation windows.
The proposed tests outperform global envelope tests in power.
Application to neuroscience data demonstrates practical utility.
Abstract
We introduce tests for the goodness of fit of point patterns via methods from topological data analysis. More precisely, the persistent Betti numbers give rise to a bivariate functional summary statistic for observed point patterns that is asymptotically Gaussian in large observation windows. We analyze the power of tests derived from this statistic on simulated point patterns and compare its performance with global envelope tests. Finally, we apply the tests to a point pattern from an application context in neuroscience. As the main methodological contribution, we derive sufficient conditions for a functional central limit theorem on bounded persistent Betti numbers of point processes with exponential decay of correlations.
| 5.1% | 59.3% | 60.7% | |
| 4.8% | 94.7% | 71.4% |
| Ripley’s | 42.6% | 20.5% |
|---|---|---|
| 41.5% | 26.3% | |
| 27.0% | 32.2% |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Testing goodness of fit for point processes via topological data analysis
Christophe A.N. Biscio
,
Nicolas Chenavier
,
Christian Hirsch
and
Anne Marie Svane
Aalborg University, Department of Mathematical Sciences, Skjernvej 4, 9220 Aalborg Ø, Denmark
[email protected], [email protected]
Université Littoral Côte d’Opale, EA 2797, LMPA, 50 rue Ferdinand Buisson, 62228 Calais, France
University of Mannheim, Institute of Mathematics, 68161 Mannheim, Germany
Abstract.
We introduce tests for the goodness of fit of point patterns via methods from topological data analysis. More precisely, the persistent Betti numbers give rise to a bivariate functional summary statistic for observed point patterns that is asymptotically Gaussian in large observation windows. We analyze the power of tests derived from this statistic on simulated point patterns and compare its performance with global envelope tests. Finally, we apply the tests to a point pattern from an application context in neuroscience. As the main methodological contribution, we derive sufficient conditions for a functional central limit theorem on bounded persistent Betti numbers of point processes with exponential decay of correlations.
Key words and phrases:
Point processes, goodness-of-fit tests, central limit theorem, topological data analysis, persistent Betti number
2010 Mathematics Subject Classification:
60D05; 55N20; 60F17
1. Introduction
Topological data analysis (TDA) provides insights into a variety of datasets by capturing their most salient properties via refined topological features. Since the mathematical field of topology specializes in describing invariants of objects independently of the choice of a precise metric, these features are robust against small perturbations or different embeddings of the object [11, 12]. Among the most classical topological invariants are the Betti numbers. Loosely speaking, they capture the number of -dimensional holes of the investigated structure. TDA refines this idea substantially by constructing filtrations and tracing when topological features appear and disappear. In point pattern analysis, simplicial complexes are built so that they are topologically equivalent to a union of balls with the same radius and centered at the data points, see the first three panels of Figure 1. As the radius increases, a sequence of simplicial complexes is then defined. Examples of such complexes are the basic Čech complex or the more elaborate -complex, which is based on the Delaunay triangulation, see [18]. In that framework, 1-dimensional features correspond to loops in the simplicial complexes while 0-dimensional features correspond to connected components. When moving up in the filtration, additional edges appear and at some point create new loops. On the other hand, more and more triangles also appear, thereby causing completely filled loops to disappear. Usually, the filtration is indexed by time, and we refer to the appearance and disappearance of features as births and deaths. We refer the reader to [18] for a detailed presentation of these concepts. The persistence diagram visualizes the time points when the features are born and die, see the bottom-right panel in Figure 1. Persistent Betti numbers count the number of events in upper-left blocks of the persistence diagram and are also illustrated in the figure.
In this paper, we leverage persistent Betti numbers to derive goodness-of-fit tests for planar point processes. In this setting, the abstract general definition of persistent Betti numbers gives way to a clear geometric intuition induced by a picture of growing disks centered at the points of the pattern and all having radius , corresponding to the index of the filtration. Features of dimension 0 correspond to connected components in the union of balls, interpreted as point clusters, whereas boundaries of the complement set can be considered as the loops forming the 1-dimensional features. Since the notion of clusters in the sense of connected components lies at the heart of persistent Betti numbers in degree 0, they become highly attractive as a tool to detect clustering in point patterns. Our tests are based on a novel functional central limit theorem (CLT) for the persistent Betti numbers in large domains. The present work embeds into two active streams of current research.
First, now that TDA has become widely adopted, the community is vigorously working towards putting the approach on a firm statistical foundation paving the way for hypothesis testing. On the one hand, this encompasses large-sample Monte Carlo tests when working on a fixed domain [6, 9, 13]. Although these tests are highly flexible, the test statistics under the null hypothesis must be re-computed each time when testing observations in a different window. In large domains, this becomes time-consuming. On the other hand, there has been substantial progress towards establishing CLTs in large domains for functionals related to persistent Betti numbers [39, 40, 29, 35, 25]. However, these results are restricted to the null hypothesis of complete spatial randomness – i.e., the Poisson point process – and establish asymptotic Gaussianity on a multivariate, but not on a functional level. Our proof of a functional CLT is based on recently developed stabilization techniques for point processes with exponential decay of correlations [8]. As explained in the final section of [10], the main technical step towards a functional CLT are bounds on the cumulants.
Second, the introduction of global rank envelope tests has lead to a novel surge of research activity in goodness-of-fit tests for point processes [34]. One of the reasons for their popularity is that they rely on functional summary statistics rather than scalar quantities. Thus, they reveal a substantially more fine-grained picture of the underlying point pattern. In the overwhelming majority of cases, variants of the -function are used as a functional summary statistic, thereby essentially capturing the relative density of point pairs at different distances. Here, the persistent Betti numbers offer an opportunity to augment the basic second-order information by more refined characteristics of the data. Still, even for classical summary statistics, rigorous limit theorems in large domains remain scarce. For instance, a functional central limit theorem of the estimated -function is proven in detail only for the Poisson point process in [22] and an extension to -determinantal point processes is outlined in [23].
The rest of the manuscript is organized as follows. First, in Section 2, we introduce the concepts of -bounded persistence diagrams and -bounded persistent Betti numbers. Next, in Section 3, we state the two main results of the paper, a CLT for the -bounded persistence diagram and a functional CLT for the -bounded persistent Betti numbers. In Section 4, we provide specific examples of point processes satisfying the conditions of the main results. Sections 5 and 6 explore TDA-based tests for simulated and real datasets, respectively. Finally, Section 7 summarizes the findings and points to possible avenues of future research. The proofs of the main results are deferred to Sections 8 and 9 of the appendix.
2. -bounded persistent Betti numbers
For a locally finite point set , persistent Betti numbers provide refined measures for the amount of clusters and voids on varying length scales. More precisely, we let
[TABLE]
denote the union of closed disks of radius centered at points in . A 0-dimensional topological feature is a connected component of this union, corresponding to a cluster of points in , while a 1-dimensional feature can be thought of as a bounded connected component of the background space, often identified with its boundary loop, and describes a vacant area in the plane. As the disks grow, new features arise and vanish; we say that they are born and die again. The persistent Betti numbers quantify this evolution of clusters and loops. Henceforth, we consider the persistence diagram only until a fixed deterministic radius .
As approaches the critical radius for continuum percolation, long-range phenomena emerge [31]. Thus, determining whether two points are connected could require exploring large regions in space. While useful quantitative bounds on cluster sizes are known for Poisson point processes [1], for more general classes of point processes the picture remains opaque and research is currently at a very early stage [27, 7]. Recently, a central limit theorem for persistent Betti numbers has been established in the Poisson setting [29, 25], but for general point processes the long-range interactions pose a formidable obstacle towards proving a fully-fledged functional CLT.
From a more practical point of view, these long-range dependencies are of less concern. Although large features can carry interesting information, we expect that spatially bounded topological features already provide a versatile tool for the statistical analysis of both simulated point patterns and real datasets, even when focusing only on features of a bounded size. For that purpose, we concentrate on features whose spatial diameter does not exceed a large deterministic threshold .
To define these -bounded features, we introduce the Gilbert graph on the vertex set . The Gilbert graph has for vertices the points in and two points are connected by an edge if the distance between them is at most or, equivalently, if the two disks of radius centered at the points intersect.
2.1. -bounded clusters
The 0-dimensional -bounded features alive at time are the connected components of with diameter at most . Starting at , all points belong to separate connected components that merge into larger clusters when increases. We thus say that all components are born at time 0.
To define the death time of a component, let denote the connected component of in . The components of meet at time
[TABLE]
Then, the death time of is the smallest such that the spatial diameter of exceeds or such that is lexicographically larger than , where are the points of and whose associated disks meet at time . This ordering determines which component dies when two of them meet.
2.2. -bounded loops
Next, we introduce 1-dimensional features. At time , these correspond to holes, i.e., bounded connected components in the vacant phase . In contrast to the clusters, there are no holes at time [math], so that both birth and death times must be specified. Moreover, it needs to be defined how holes are related for different radii .
The death time of a hole in is the first time when the hole is completely covered by disks, i.e., . We identify a hole with the point that is covered last. Thus, holes in and in , are identified if .
New holes in can only appear when two balls merge, which corresponds to including a new edge in . When a new hole is formed, it can happen in two ways: either a finite component is separated from the infinite component, or an existing hole is split in two. In both cases, we define the size of the newly created hole(s) as follows: Let be the points in such that the disks of radius around the points intersect the boundary of the hole in . Then, the size of is the diameter of the set . The size remains unchanged until the next time the hole is split into smaller pieces. Then the size is recomputed for both new holes. This definition ensures that the size decreases when the balls grow and only changes when a new edge is added to .
The birth time of a hole is the minimal such that there is a hole in with and size less than . By an -bounded loop, we mean a loop with size lower than .
2.3. The persistence diagram
We now adapt the definition of the persistence diagram in [25] to only include -bounded features. That is, we define the th -bounded persistence diagram, , as the empirical measure
[TABLE]
where is an index set over all -bounded -dimensional features that die before time and are the birth and death times of the th feature. Then, the th -bounded persistent Betti numbers
[TABLE]
are the number of -bounded features born before time and dead after time . When , all features are born at time 0, so that only death times are relevant. Hence, we write instead of the more verbose .
3. Main results
Henceforth, denotes a simple stationary point process in . We think of as a random variable taking values in the space of locally finite subsets of endowed with the smallest -algebra such that the number of points in any given Borel set becomes measurable. Throughout the manuscript, we assume that the factorial moment measures exist and are absolutely continuous. In particular, writing , the th factorial moment density is determined via the identity
[TABLE]
for any pairwise disjoint bounded Borel sets , where denotes the number of points of in . Moreover, as we rely on the framework of [8], we also require that exhibits exponential decay of correlations. Loosely speaking this expresses an approximate factorization of the factorial moment densities and is made precise in Section 4 below. Many of the most prominent examples of point processes appearing in spatial statistics exhibit exponential decay of correlations [8, Section 2.2].
Our first main result is a CLT for the persistence diagram built on the restriction of the point process to a large observation window . With a slight abuse of notation, we write . To prove the CLT, we impose an additional condition concerning moments under the reduced -point Palm distribution . We recall that this distribution is determined via
[TABLE]
for any bounded measurable , where denotes -tuples of pairwise distinct points in . Then, we impose the following moment condition.
- (M)
For every
[TABLE]
To state the CLT for the persistence diagram precisely, we let
[TABLE]
denote the integral of a bounded measurable function with respect to the measure .
Theorem 3.1** (CLT for persistence diagrams).**
Let , and be a bounded measurable function. Assume that exhibits exponential decay of correlations and satisfies condition (M). Furthermore, assume that for some . Then,
[TABLE]
converges in distribution to a standard normal random variable as .
In order to derive a functional CLT for the persistent Betti numbers, we add a further constraint on , which is needed to establish a lower bound on the variance via a conditioning argument in the vein of [38, Lemma 4.3]. For this purpose, we consider a random measure , which is jointly stationary with and which we think of as capturing additional useful information on the dependence structure of . For instance, if is a Cox point process, we choose to be the random intensity measure. If is a Poisson cluster process, then would describe the cluster centers. If the dependence structure is exceptionally simple, it is also possible to take . The idea of using additional information is motivated from conditioning on the spatially refined information coming from the clan-of-ancestors construction in Gibbsian point processes [38].
The point process is conditionally -dependent if and are conditionally independent given for any bounded Borel sets such that the distance between and is larger than some . Here, denote the -algebra generated by and .
Finally, we impose an absolute continuity-type assumption on the Poisson point process in a fixed box with respect to when conditioned on and the outside points. More precisely, we demand that there exists with the following property, where denotes a homogeneous Poisson point process in the window .
- (AC)
Let be such that . Then,
[TABLE]
Although (AC) appears technical, Section 4 illustrates that it is tractable for many commonly used point processes.
Since the persistent Betti numbers exhibit jumps at the birth- and death times of features, we work in the Skorokhod topology [5, Section 14].
Theorem 3.2** (Functional CLT for persistent Betti numbers).**
Let and be a conditionally -dependent point process with exponential decay of correlations and satisfying conditions (M) and (AC). Then, the following convergence statements hold true.
- q=0.
The one-dimensional process
[TABLE]
converges weakly in Skorokhod topology to a centered Gaussian process. 2. q=1.
The two-dimensional process
[TABLE]
converges weakly in Skorokhod topology to a centered Gaussian process.
Additionally, [8, Theorem 1.12] implies convergence of the rescaled variances. While Theorem 3.1 is an adaptation of [8], Theorem 3.2 is much more delicate. As an application of Theorem 3.2, we obtain a functional CLT for the following two characteristics, which are modified variants of the accumulated persistence function from [6]:
[TABLE]
and
[TABLE]
Corollary 3.3** (Functional CLT for the APF).**
Let and be as in Theorem 3.2. Then, both \big{\{}n^{-1/2}(\mathsf{APF}^{M,0}_{r}(\mathcal{P}_{n})-\mathbb{E}[\mathsf{APF}^{M,0}_{r}(\mathcal{P}_{n})])\big{\}}_{r\leq r_{\mathsf{f}}} and \big{\{}n^{-1/2}(\mathsf{APF}^{M,1}_{r}(\mathcal{P}_{n})-\mathbb{E}[\mathsf{APF}^{M,1}_{r}(\mathcal{P}_{n})])\big{\}}_{r\leq r_{\mathsf{f}}} converge to centered Gaussian processes.
4. Examples of point processes
In this section, we give examples of point processes which satisfy the assumptions of our main theorems. More precisely, we show that log-Gaussian Cox processes with compactly supported covariance functions and Matérn cluster processes both satisfy the conditions of Theorems 3.1 and 3.2. We also show that the Ginibre point process satisfies the conditions of Theorem 3.1.
Conversely, we do not expect that hard-core point processes satisfy the functional central limit theorem in the generality of Theorem 3.2. Indeed, hard-core conditions put a strict lower bound on the death time of clusters and the birth time of loops. However, we believe that suitable repulsive point processes, where the hard-core conditions only need to be imposed with a certain probability can be embedded in the framework of Theorem 3.2.
We first recall the definition of exponential decay of correlations from [8]. To this end, we define the separation distance between and as in [8, Formula (1.3)] via
[TABLE]
Definition 4.1**.**
Let be a stationary point process in , such that the -point correlation function exists for all . Then, exhibits exponential decay of correlations if there exist , such that
- (1)
for all , 2. (2)
for some , 3. (3)
[TABLE]
for any .
4.1. Log-Gaussian Cox process
Let be a stationary Gaussian process with mean and covariance function . Then, the random measure on defined as , for any Borel subset has moments of any order. Let be a Cox process with random intensity measure , referred to as a Log-Gaussian Cox process. By [15, Equation (7)], the factorial moment densities of are given by
[TABLE]
To apply Theorems 3.1 and 3.2, we assume that is bounded and of compact support, which ensures that exhibits exponential decay of correlation.
We show below that condition is satisfied. Let . According to [16, Theorem 1], the Log-Gaussian Cox process under the reduced Palm version is also a Log-Gaussian Cox process with underlying Gaussian process . According to [17, Equation (5.4.5)],
[TABLE]
for suitable coefficients , where denotes the th factorial moment density with respect to . Therefore, it is enough to prove that
[TABLE]
for all . Now, Equation (8) in [15] gives that
[TABLE]
where the right-hand side is bounded as and are bounded independently of . This verifies condition (M).
Since conditionally on , the point process is a Poisson point process, the conditional -dependence property holds with .
It remains to verify condition . By [33, Equation (6.2)], conditionally on , the distribution of the point process admits the density with respect to a homogeneous Poisson point process with intensity 1 in given by
[TABLE]
where . In particular, is strictly positive for all . Therefore, if are two events such that , then . This verifies condition .
4.2. Matérn cluster process
Let be a homogeneous Poisson point process in with intensity . Given a realization of , we define a family of independent point processes , where , , is a homogeneous Poisson point process with intensity 1 in the disk of radius centered at . The point process is referred to as a Matérn cluster process. Since is -dependent, it exhibits exponential decay of correlations.
Next, we verify condition (M). For this purpose, we deduce from [16, Section 5.3.2] that a Matérn cluster process is a Cox process whose random intensity measure has as density the random field given by
[TABLE]
Now, let and be fixed. From [16, Equations (19) and (20)] we obtain that
[TABLE]
Since is increasing in for every in the sense of [30], the Harris-FKG inequality [30, Theorem 20.4] gives that
[TABLE]
where we used that is a Poisson random variable with parameter . In order to bound \mathbb{E}\Big{[}\mathcal{P}(W_{1})^{p}\prod_{i\leq l}\lambda(x_{i})\Big{]}, we first apply the Hölder inequality and stationarity, to arrive at
[TABLE]
First, is finite since is a Poisson point process. For the remaining part, we note that , where denotes the Minkowski sum. Hence,
[TABLE]
where is a homogeneous Poisson point process of intensity 1 in the disk [30, Theorem 3.9]. Again, since is a Poisson random variable with parameter , the latter expression is finite. Taking the supremum over all and all , this verifies condition (M). The point process is also conditionally -dependent, by taking and .
It remains to prove . By [33, Equation (6.2)], conditional on , the distribution of admits the density
[TABLE]
with respect to the distribution of a homogeneous Poisson point process. Now, consider the event on the event
[TABLE]
the density is positive. Therefore, if are such that , then almost surely
[TABLE]
Since occurs with positive probability, this proves condition (AC).
4.3. Ginibre point process
The Ginibre point process is a determinantal point process with kernel
[TABLE]
with . As mentioned in [8, p. 19], this point process exhibits exponential decay of correlation. According to [21, Theorem 2], for we have , where the right-hand side is finite by [26, Lemma 4.2.6]. Hence, we obtain an upper bound for , which is independent of , thereby verifying condition (M).
5. Simulation study
We elucidate in a simulation study, how cluster- and loop-based test statistics derived from Theorem 3.2 can detect deviations from complete spatial randomness. The simulations are carried out on top of the R-packages spatstat and TDA [20, 2].
For the entire simulation study, the null model is a Poisson point process with intensity in a observation window. Moreover, we fix so large that it encompasses the entire sampling window and therefore suppress its appearance in the notation. Although the proof of Theorem 3.2 relies on the -boundedness, the simulation study illustrates that it is not critical to impose this condition when testing hypotheses on common point patterns.
5.1. Deviation tests
As a first step, we derive scalar cluster- and loop-based test statistics.
5.1.1. Definition of test statistics
As a test statistic based on clusters, we use the integral over the number of cluster deaths in a time interval with , i.e.,
[TABLE]
After subtracting the mean, this test statistic becomes reminiscent of the classical Cramér-von-Mises statistic except that we do not consider squared deviations. Although squaring would make it easier to detect two-sided deviations, it would also require knowledge of quantiles of the square integral of a centered Gaussian process. Albeit possible, this incurs substantial computational expenses. Our simpler alternative has the appeal that as an integral of a Gaussian process, is asymptotically normal and therefore characterized by its mean and variance.
As a test statistic based on loops, we use the accumulated persistence function, which aggregates the life times of all loops with birth times in a time interval with , i.e.,
[TABLE]
By Corollary 3.3, after centering and rescaling, the statistic converges in the large-volume limit to a normal random variable.
The statistics and are specific possibilities to define scalar characteristics from the persistence diagram. Depending on the application context other choices, such as instead of could be useful. However, in the simulation study below we found the weighting by life times of clusters to be detrimental.
5.1.2. Exploratory Analysis
As alternatives to the Poisson null hypothesis, we consider the attractive Matérn cluster and the repulsive Strauss processes. More precisely, the Matérn cluster process features a Poisson parent process with intensity 2 and generates a number of offspring uniformly in a disk of radius around each parent. The Strauss process has interaction parameter and interaction radius . The intensity parameter was tuned so as to match approximately the intensity of the null model. Figure 2 shows realizations of the null model and the alternatives.
In a first step, in Figure 3, we plot the persistence diagrams of samples from the null model and of the alternatives.
From the cluster-based diagrams, it becomes apparent that in comparison to the null model, in the Matérn cluster process, features can die also at rather late times, whereas this happens very rarely in the Strauss process. When analyzing loops, we see that loops with long life times can appear earlier in the null model than in the Matérn cluster process. Conversely, while some loops with substantial life time emerge at later times in the null model, there are very few such cases in the Strauss model.
5.1.3. Mean and variance under the null model
Now, we determine the mean and variance of and under the null model with . For this purpose, we compute the number of cluster deaths and accumulated loop life times for 10,000 independent draws of the null model.
Comparing the mean curves for the number of cluster deaths in the null model with those of the alternatives matches up nicely with the intuition about attraction and repulsion. For late times, they all approach a common value, namely the expected number of points in the observation window. However, Figure 4 shows that for the Matérn model, the slope is far steeper for early times, caused by merging of components of points within a cluster. In contrast, for the Strauss process the increase is at first much less pronounced than in the Poisson model, thereby reflecting the repulsive nature of the Gibbs potential.
For the loops, a radically different picture emerges. Here, the curve for the Strauss process lies above the accumulated loop life times of the null model. The Strauss model spawns substantially more loops than the Poisson model, although most of them live for a shorter period. Still, taken together these competing effects lead to a net increase of the accumulated loop life times in the Strauss model.
5.1.4. Type I and II errors
By Theorem 3.2, the statistics and are asymptotically normal, so that knowing the mean and variance allows us to construct a deviation test whose nominal confidence level is asymptotically exact. For the loops, we can choose the entire relevant time range, so that . For the cluster features, this choice would be unreasonable, as for late times, we simply obtain the number of points in the observation window, which is not discriminative. Hence, we set . We stress that in situations with no a priori knowledge of a good choice of , the test power can degrade substantially.
To analyze the type I and II errors, we draw 1,000 realizations from the null model and from the alternatives, respectively. Table 1 shows the rejection rates of this test setup. Under the null model the rejection rates are close to the nominal 5%-level, thereby illustrating that already for moderately large point patterns the approximation by the Gaussian limit is accurate. Using the mean and variance from the null model, we now compute the test powers for the alternatives. Already leads to a test power of approximately for both alternatives. When considering , we obtain a type I error rate of 4.8%, so that the confidence level is kept. Moreover, the power analysis reveals that in the present simulation set-up, is better in detecting deviations from the null hypothesis than .
5.2. Envelope Tests
Leveraging Theorem 3.2 shows that the deviation statistics and are asymptotically normal. Using a simulation-based estimate for the asymptotic mean and variance under the null model allowed us to construct a deviation test whose confidence level is asymptotically precise. A caveat of the above analysis is that the magnitude of clustering and repulsion is strong and clearly visible in the samples.
Recently, global envelope tests have gained widespread popularity, because they are both powerful and provide graphical insights as to why a null hypothesis is rejected [34]. The global envelope tests are fundamentally Monte Carlo-based tests and therefore do not relate directly to the large-volume CLT. However, they also rely on a functional summary statistics as input. Most of the applications in spatial statistics use a distance-based second-order functional such as Ripley’s -function. In this section, we compare such classical choices with cluster- and loop-based statistics.
5.2.1. Alternatives
Since envelope tests excel at detecting subtle changes from the null model, we consider now a new parameter set-up to compare the -function with the cluster- and loop-based statistics. Here, both the Matérn cluster as well as the Strauss process are substantially more similar to the Poisson point process. Hence, for the alternatives, we use again Matérn cluster and Strauss processes, but choose different parameters.
We found that the cluster- and loop-based statistics were particularly powerful in situations involving small interaction radii. Hence, as alternatives we choose the process and the process, see Figure 5. The interaction parameter of the Strauss process was again tuned to match approximately the intensity of the null model.
5.2.2. Power analysis
To analyze the power of the envelope test, we generate realizations of the null model and 1,000 realizations of the alternatives. Then, we perform the global envelope test from [34] with three functional summary statistics. The first is Ripley’s -function [33, Definition 4.6]. Second, we consider the number of cluster deaths as illustrated in Figure 4. Third, for the loops, we use a two-dimensional functional statistics derived from the persistent Betti numbers associated with life times rather than death times in order to expand the support of the statistic to the entire first quadrant. More precisely, counts the number of loops born before time and with life time at least .
The rejection rates from Table 2 illustrate that for the alternatives described above, the cluster-based test gives a similar test power as the -function-based test for the Matérn cluster process and a substantially higher test power for the Strauss process. Moreover, the loop-based test works even better in the Strauss case, but performs substantially worse for the Matérn alternative.
6. Analysis of the minicolumn dataset
In this section, we explore to what extent the deviation tests from Section 5 provide insights when dealing with real data. For this purpose, we analyze the minicolumn dataset provided by scientists at the Centre for Stochastic Geometry and Advanced Bioimaging.
As it should serve only to illustrate the application of Theorem 3.2, the present analysis is very limited in scope, and we refer to [14] for a far more encompassing study. For instance, that work considers two datasets and investigates 3D data together with marks for the directions attached to the neurons.
6.1. Exploratory analysis
The minicolumn dataset consists of 634 points emerging as two-dimensional projections of a three-dimensional point pattern of neurons. As neurons are believed to arrange in vertical columns, the projections are expected to exhibit clustering, see [32, 37]. The projections are taken along -axis, since neuroscientists expect an arrangement in vertical columns. A visual inspection of the point pattern in Figure 6 supports this hypothesis.
As a first step, we explore whether the purported clustering already manifests in the persistence diagram. Comparing the loop-based persistence diagram of the minicolumn data with the persistence diagram of a homogeneous Poisson point process in Figure 7 shows that loops with substantial life times tend to be born later in the minicolumn model. This suggests clustering since loops formed by points within a cluster typically disappear rapidly.
Now, we explore whether the impressions from the persistence diagrams are reflected in the summary statistics from Section 5. When comparing in Figure 8 (left) the number of cluster death at different points in time, we note that until time 35, the curve for the observed data runs a bit above the curve for the null model. This provides already a first indication towards clustering. Next, we proceed to the loop-based features. As shown in Figure 8 (right), the curve for the observed pattern runs substantially below the one of the null model. This reflects a property that we have seen already in the persistence diagram: clusters with substantial life time tend to be born earlier in the null model, thereby leading to a steeper increase of the accumulated life times.
6.2. Test for complete spatial randomness
Under the impression of the previous visualizations, we now test the minicolumn pattern against the null model. As in Section 5, we deduce from Theorem 3.2 that the statistics are asymptotically normal under the null model, so that we only need to determine means and variances.
A subtle issue concerns the choice of the integration interval. The simplest option would be to take the whole intervals shown in Figure 8. For instance, for the loop-based features, this means . However, for the cluster-based features the choice of the interval is less clear, since taking the whole interval is not discriminatory. The experiences from the simulation study indicate that the test is most powerful for early death times. Therefore, we choose .
With these choices, both the cluster-based and the loop-based test reject the null-hypothesis at the 5% level, since the corresponding -values are and . However, the tests are sensitive to the choice of the integration bound. This is especially true for the cluster-based test, where going from to results in a -value of , so that the null hypothesis is no longer rejected. On the other hand, the loop-based test still yields a -value of . However, when reducing even further to , then the -value increases sharply to , so that the null-hypothesis is no longer rejected.
7. Discussion
In this paper, we elucidated how to apply tools from TDA to derive goodness-of-fit tests for planar point patterns. For this purpose, we derived sufficient conditions for a large-domain functional CLT for the -bounded persistent Betti numbers on point processes exhibiting exponential decay of correlations. Following the framework developed in [8], the main difficulty arose from a detailed analysis of geometric configurations when bounding higher-order cumulants.
A simulation study revealed that the asymptotic Gaussianity is already accurate for patterns consisting of a few hundred data points. Additionally, as functional summary statistics, the persistent Betti numbers can also be used in the context of global envelope tests. Here, our finding is that TDA-based statistics can provide helpful additional information for point patterns with small interaction radii.
Finally, we applied the TDA-based tests on a point pattern from a neuroscientific dataset. As conjectured from the application context, the functional summary statistics indicate a clustering of points and the tests reject the Poisson null-model. However, the analysis also reveals a sensitivity to the range of birth times considered in the statistics.
In future work, we plan to extend the present analysis to dimensions larger than 2. On a technical level, the definition of higher-dimensional features requires a deeper understanding of persistent homology groups. Additionally, when thinking of broader application scenarios, a further step is to extend the testing framework from mere point patterns to random closed sets involving a richer geometric structure.
Acknowledgments
We thank J. Møller and A. D. Christoffersen for valuable discussions on the minicolumn dataset and for providing references. We also thank J. Yukich for inspiring discussions and helpful remarks. Finally, we thank the Centre for Stochastic Geometry and Advanced Bioimaging for collecting and sharing the data. CB, CH and AS are supported by The Danish Council for Independent Research — Natural Sciences, grant DFF – 7014-00074 Statistics for point processes in space and beyond, and by the Centre for Stochastic Geometry and Advanced Bioimaging, funded by grant 8721 from the Villum Foundation. NS was partially supported by the French ANR grant ASPAG (ANR-17-CE40- 0017).
8. Proof of Theorem 3.1
The main tool to prove Theorem 3.1 is the general CLT of [8, Theorem 1.14]. To be in that framework, we need to express the quantity \langle f,\mathsf{PD}^{M,q}(\mathcal{P}_{n})\rangle=\sum_{i\in I^{M,q}(\mathcal{P}_{n})}f\big{(}B_{i}^{M},D_{i}^{M}\big{)} in the form for a suitable score function .
In other words, we need to transform the indexing over features into an indexing over the points of the point process . We achieve this goal by assigning to each feature a point that either kills or gives birth to this feature, depending on whether or .
First, the death of a cluster at time is always caused by the merging of two points at distance . Indeed, when the size of a component has a jump, this can only appear by attaching to another component. If dies by this merging, we say that kills . This ensures that if two components both die when they merge, their deaths are caused by different points.
Similarly, if , then the birth of a hole at time is caused by two points at distance whose connection creates a new hole. If only one feature is born at time , we choose the lexicographic minimum of and and say that it gives birth to this hole. However, if a large hole is split into two -bounded holes, it can happen that two holes are born at the same time. In this situation, we assign one hole to each of and . Hence, we define the score functions as
[TABLE]
Definition (8) translates the desired CLT for into the framework of [8, Theorem 1.14]. It remains to verify the conditions stated therein.
Proof of Theorem 3.1.
According to [8, Theorem 1.14] we have to verify that the pair belongs to class (A2) (see Definition [8, Definition 1.7]) and that the th moment condition [8, Equation (1.19)] holds for every .
Belonging to class (A2) involves itself three conditions. The first is exponential decay of correlations, one of our standing assumptions on the point process . The second asks for an exponentially decaying radius of stabilization. Since we work with -bounded features, this radius is finite. Finally, we need to verify the power-growth condition [8, Equation (1.18)] stating that for , the upper bound
[TABLE]
holds for every , locally finite and . To achieve this goal, we note that in the worst case can be responsible for the death of all other points of . Similarly, it can give birth to at most holes. Hence,
[TABLE]
Finally, we verify the th moment condition [8, Equation (1.19)]. That is, we prove that for every there exists such that
[TABLE]
We explain in detail how this is achieved if , noting that the case can be deduced after minor modifications. If is responsible for the death of a component at time , then there exists at distance from . Since each ball grows for time at most , we see that
[TABLE]
and an application of condition (M) concludes the proof. ∎
9. Proofs of Theorem 3.2 and Corollary 3.3
In the following, we assume , since the proofs for are similar but easier. Hence, to simplify notation, we write for .
Proof of Corollary 3.3.
Note that if is a Gaussian process, then the process is also Gaussian. As mentioned above, the plan is to start from Theorem 3.2 and then apply the continuous mapping theorem [28, Theorem 4.27]. To this end, we show that is a continuous functional of the persistent Betti numbers . We assert that
[TABLE]
The remainder of the proof proceeds in two steps. First, we verify identity (10). Second, we show that the right-hand side is continuous in with respect to the Skorokhod topology.
To prove identity (10), linearity allows us to reduce the claim to the case where the persistence diagram consists of a single -measure at a point for some . If , then both sides vanish. If , then and , so that the right-hand side of (10) gives the asserted
[TABLE]
Let , where is the Skorokhod space of càdlàg functions from to . For any put
[TABLE]
According to (10), it is sufficient to prove that the function , is continuous with respect to the Skorokhod topology. We prove this for the first integral. The arguments for the second are similar. Let be càdlàg and be an increasing continuous bijection. Then,
[TABLE]
If approaches in the Skorokhod metric, then by definition of this metric, we can choose such that the first two expressions become arbitrarily small. Moreover, since itself is càdlàg, it follows that also the third expression tends to 0. ∎
The proof of Theorem 3.2 decomposes into two steps: lower and upper variance bounds and an upper bound on fourth-order cumulants. In what follows, we write
[TABLE]
for the increment of in the block with and . Notice that this is minus the measure from (2) evaluated at the block . Moreover, is the number of holes with birth time before and death time between and minus the number of holes with birth time before and death time between and . Following [4], two blocks are neighboring if they share a common side.
Proposition 9.1** (Variance lower bound).**
Let be a conditionally -dependent point process with exponential decay of correlations. Moreover, let and be pairwise disjoint blocks such that each contains some with . Then,
[TABLE]
Proposition 9.2** (Variance upper bound).**
Let be a conditionally -dependent point process with exponential decay of correlations. Then, there exist and such that
[TABLE]
holds for all and blocks .
Now, the th cumulant of real random variables equals
[TABLE]
provided that all appearing moments are well-defined [36, Proposition 3.2.1]. Here, the sum ranges over all partitions of the set .
Proposition 9.3** (Cumulant bound).**
Let be a conditionally -dependent point process with exponential decay of correlations satisfying conditions (AC) and (M). Then, there exist and such that
[TABLE]
holds for all and neighboring blocks .
We postpone the proofs of Propositions 9.1–9.3 to Sections 9.1–9.3, respectively. To deduce Theorem 3.2 from these two central auxiliary results, we write
[TABLE]
for the centered persistent Betti numbers.
Proof of Theorem 3.2.
Let and be pairwise distinct, and put
[TABLE]
Then, after suitable regrouping of terms, we can express in the form
[TABLE]
as in Proposition 9.1. Now, combining Proposition 9.2 with Theorem 3.1 and the variance asymptotics [8, Theorem 1.12] shows that the centered and rescaled random variable converges in distribution to a Gaussian. Hence, the Cramér-Wold device yields convergence of the finite-dimensional distributions of .
Next, [36, Proposition 3.2.1] gives the general cumulant identity
[TABLE]
for centered random variables . Hence, by Propositions 9.2 and 9.3,
[TABLE]
for some . In particular, the process \big{\{}n^{-1/2}\overline{\beta}_{b,d}(\mathcal{P}_{n})\big{\}}_{b,d\leq r_{\mathsf{f}}} is tight in Skorokhod topology [24, Lemma 3]. In this context, we note that condition (8.4) of [24, Lemma 3] follows from the variance upper bound derived in Proposition 9.2 and that similar as in (2.18) [22], we have replaced the equality in (8.5) of [24, Lemma 3] by an inequality. Combining this property with the convergence of finite-dimensional distributions yields the asserted weak convergence. ∎
9.1. Proof of Proposition 9.1
To show the variance lower bound, we adapt a conditioning argument that has already been successfully applied in the setting of Gibbsian point processes [38]. More precisely, we subdivide the window into blocks of a fixed size and use the law of conditional variance to obtain a lower bound in the order of the number of blocks.
Associate with the th feature in a center point , for instance by taking the point as defined in Section 2.2. Then,
[TABLE]
defines a signed measure of total mass .
In the vein of [38], the key towards proving a lower bound on the variance is the following non-degeneracy property, where is introduced in Section 3.
Lemma 9.4** (Non-degeneracy).**
It holds that
[TABLE]
Before proving Lemma 9.4, we explain how it implies Proposition 9.1. In essence, the proof follows along the lines of [38, Lemma 4.3]. Nevertheless, since the details of the conditioning argument differ a bit from the corresponding picture for Gibbs processes, we explain how to adapt the main steps from [38, Lemma 4.3] in the present setting.
Proof of Proposition 9.1.
The idea of proof is to consider a family of well-separated blocks in . Then, we leverage the conditional -dependence of the point process and the -boundedness of the features to decompose the variance of their contributions as the sum of the variances. More precisely, we apply the assumption of conditional -dependence with the conditioning set
[TABLE]
chosen as the complement of the union of well-separated blocks of side length . Then, the law of total variance yields the lower bound
[TABLE]
Moreover, since the statistics in the smaller domain
[TABLE]
is measurable with respect to . We obtain that
[TABLE]
because is measurable. Thanks to the conditional -dependence, we have
[TABLE]
Now, the number of -blocks contained in is of order , and we conclude by noting that Lemma 9.4 and imply that each of the contributions is bounded away from 0. ∎
To verify non-degeneracy, we rely on the techniques introduced in [38]. In particular, we make use of [38, Lemma 2.3], which we restate below to render the presentation self-contained.
Lemma 9.5**.**
Let be a real random variable and be Borel sets of . Then,
[TABLE]
Proof of Lemma 9.4.
Write
[TABLE]
for the events that there are no points in and , respectively. Next, let
[TABLE]
denote the event that all but the first of the considered persistent Betti numbers vanish. Now, let denote the indices of all features that are entirely contained in and put
[TABLE]
Then, by Lemma 9.5 with and ,
[TABLE]
and it remains to show that the right-hand side is non-zero.
Since are pairwise disjoint and contain points above the diagonal, [25, Example 1.8] shows that under the homogeneous Poisson point process the event has positive probability. Also is of positive probability. Hence, an application of condition (AC) concludes the proof. ∎
9.2. Proof of Proposition 9.2
For a block , we let denote the score function associated with . That is,
[TABLE]
is the number of holes born by with birth and death times in . Note that if gives birth to the th hole, then it gets in contact with another point at time . In particular, contains a point in the annulus .
Moreover, if the th hole dies at time , then a previously vacant component is covered completely, which is caused by three disks centered at points in meeting at a single point in the plane. The three center points of the disks must form a triangle with no obtuse angle. Otherwise, two of the disks would meet for the first time in the interior of the third and hence no connected component in the background was covered by the merging. This could be interpreted as a feature that is born and dies at the same time, but we chose to exclude such features in our definition of 1-features.
Henceforth, let denote the two disks of radius whose boundary passes through . If , we let be empty. The points in are exactly the points such that the time when the boundaries of the three disks around , , and meet in one point is at most . For , we let
[TABLE]
where . This set consists of all points such that the boundaries of the three disks around , and meet at time with . Some may still form a triangle having an obtuse angle with and , that is, the disks around , , and already met earlier in an interior point of one of the disks. However, all that can cause the death of a hole in together with and must be contained in .
Now,
[TABLE]
where denotes the event that for some with the event occurs, where
[TABLE]
Here, we say that kill the hole if the disks around the points meet for the first time at . In particular, any three points can kill at most one hole.
Similarly, for a block ,
[TABLE]
where we let denote the event that for some with the event
[TABLE]
occurs.
Using this notation, the proof of the variance upper bound is now based on the following pivotal geometric moment bound. In the following, denotes the unreduced Palm measure characterized via for any non-negative measurable . We recall from (3) that denotes the th factorial moment density. In the following, we adhere to the convention .
Lemma 9.6** (Moment bound).**
Let be a stationary point process having fast decay of correlations and satisfying condition (M). Let and . Then, there exist and such that for all and any ball of radius ,
- (1)
[TABLE]
holds for all blocks . 2. (2)
[TABLE]
holds for all neighboring blocks , and 3. (3)
[TABLE]
holds for all neighboring blocks .
The proof of Lemma 9.7 relies on a delicate geometric analysis that we defer to Section 9.4. We now prove Proposition 9.2. As in [8, Equation (1.6)], for and , we introduce the mixed -moments
[TABLE]
In the rest of the manuscript, we freely use that exponential decay of correlations implies boundedness of the factorial moment densities [8, Inequality (1.11)].
Proof of Proposition 9.2.
To lighten notation, we write instead of . To give the paper more pleasant to read, we have not attempted to optimize the exponents occurring in the course of this proof. Proceeding as in [8, Equation (4.1)], the refined Campbell-Mecke formula [8, Equation (1.9)] gives that \mathsf{Var}\big{(}\beta(E,\mathcal{P}_{n})\big{)} equals
[TABLE]
We derive bounds for the two summands separately.
By stationarity, (11) and Hölder’s inequality, the first expression is at most
[TABLE]
Hence, Lemma 9.6(1) with yields the asserted upper bound.
To deal with the double integral in (13), we recall that is a local score function and that exhibits exponential decay of correlations. Hence, as in [8, Equation (3.26)], the factorial moment measure expansion shows that
[TABLE]
for some . In particular, choosing a cut-off , we see that
[TABLE]
holds for a suitable and it suffices to derive an upper bound for
[TABLE]
For the second summand, we can argue similarly as in (14), so that it remains to bound the integral involving . Here, we set , note that and combine (11) with Hölder’s inequality to arrive at
[TABLE]
We bound thanks to condition (M). Finally, by Jensen’s inequality applied to the uniform distribution on ,
[TABLE]
so that applying Lemma 9.6(1) with shows that the right-hand side is of order at most , thereby concluding the proof. ∎
9.3. Proof of Proposition 9.3
To prove Proposition 9.3, we take up the idea suggested in [24, Theorem 8] and [10, Theorem 8.1] and express in terms of cumulant measures induced by the functional of interest. A slight technical nuisance in the present setting comes from dealing with a product of two different functionals – one associated with the block and the other with – whereas the semi-cluster measure machinery from [8, Section 4.3] relies on a single score function. However, this artificial difficulty can be overcome by formally attaching -valued marks to . Taking up the notation from [19], we let and denote the correspondingly marked space and point process. Writing , we define an augmented score function , where points with mark 1 are evaluated with the first score function and points with mark 2 are evaluated with the second score function. In other words,
[TABLE]
We take the concise proof of Proposition 9.2 as a blueprint for the strategy of the more involved setting laid out in Proposition 9.3. In particular, we need to address two main steps: bounds for mixed moments and a reduction of the integral to the diagonal.
In order to reduce to the diagonal, we decompose the cumulant measure into semi-cluster measures as in [3, Section 5.1] and [19, Section 3.2]. For the convenience of the reader, we reproduce the basic definitions. First, the th moment measure is given as
[TABLE]
where is non-negative and measurable with each defined on , and
[TABLE]
denotes the empirical measure associated with and . In terms of mixed -moments, with the projection of to the coordinates in , we write
[TABLE]
where are the singular differentials determined via
[TABLE]
where is any non-negative measurable function [19, Section 3.1]. As in (12), for , the mixed -moments are given as
[TABLE]
for every .
Similarly, the th cumulant measure equals
[TABLE]
so that
[TABLE]
where
[TABLE]
denotes the moment measure with coordinates in .
Next, the space decomposes into a union of subsets according to which coordinate is most distant from the diagonal [19, Lemma 3.1]. More precisely, write
[TABLE]
for the maximal separation of and , where . Then, put
[TABLE]
Here, the marks are ignored for the diagonal . We also put .
Lemma 9.7** (Off-diagonal bounds).**
Let denote a non-trivial partition of . Then, there exist and such that
[TABLE]
holds for all and neighboring blocks .
Before proving Lemma 9.7, we elucidate how to deduce Proposition 9.3.
Proof of Proposition 9.3.
First, integration over the cumulant measure decomposes into a diagonal and an off-diagonal part [19, Equation (3.28)]. That is,
[TABLE]
where is the indicator function of the domain and the sum is over all non-trivial partitions . By Lemma 9.7, the off-diagonal contributions in this decomposition are bounded above by .
Next, when integrating over the diagonal, we leverage that in the decomposition (17), only contributes [19, Lemma 3.1]. Hence,
[TABLE]
so that applying Lemma 9.6(2) with and noting the convention preceding that result concludes the proof. ∎
To prove Lemma 9.7, we decompose the cumulant measures into semi-cluster measures [3, Lemma 5.1]. More precisely, as in [3, 19], any two disjoint non-empty subsets , induce a cluster measure
[TABLE]
Now, decomposes into semi-cluster measures
[TABLE]
where the sum runs over all partitions such that and are non-empty subsets of and , respectively [19, Lemma 3.2].
Equipped with these ingredients, we now prove Lemma 9.7. Since the basic structure of the proof parallels that of Proposition 9.2, we only provide details for the steps that are substantially different.
Proof of Lemma 9.7.
Putting for , we first derive an upper bound for
[TABLE]
For this purpose, we decompose the moment measures , and according to (16). Hence, we need bounds for the absolute value of differences of mixed -moments of the form
[TABLE]
where and are partitions of and , respectively. Since we are working on the set , as in the proof of Proposition 9.2, the fast decay of -correlations bounds (19) by for a suitable .
Next, as in [19, Section 3.1] the singular differentials occurring in the expansion (16) of the moment measure can be grouped into a single object. More precisely, we write for the measure that equals on the subset of consisting of all such that if for some and otherwise.
In the setting of the present proof, we note that the bounds on the mixed moments from (19) only involve coordinates with indices in the set . Hence, we need to consider also singular differentials only with respect to these coordinates, i.e., integrate with respect to . In particular, we arrive at the bound
[TABLE]
Now, setting , the exponential decay assumption on the function gives control on one integral over the window, while the integrals with respect to the remaining variables are controlled by the volume of balls. Then, a repeated application of Hölder’s inequality provides suitable bounds on the moment measures such that
[TABLE]
holds for some . Hence, it suffices to provide upper bounds for
[TABLE]
where is an arbitrary partition of . We explain how to proceed for , noting that for the arguments are similar but easier.
We claim that for some ,
[TABLE]
To prove this claim, decompose according to (16) and let be an arbitrary partition of . As in the proof of Proposition 9.2, a repeated use of Hölder’s inequality shows that on , the mixed moments of the form
[TABLE]
are bounded above by c^{\prime}\big{(}\mathbb{P}_{\boldsymbol{x}}(E_{x_{1},x_{i}}^{\prime\prime})\rho^{(p^{\prime\prime})}(\boldsymbol{x})\big{)}^{1-\varepsilon} for a suitable and some . At this point, we may proceed similarly as in (15) by invoking Lemmas 9.6(2) and 9.6(3). As an illustration consider the setting where and . Then, we set , and . We combine Jensen’s inequality with Lemma 9.6(3) to show that
[TABLE]
Hence, inserting the definition of concludes the proof. ∎
9.4. Proof of Lemma 9.6
We now turn to the proof of Lemma 9.6. The proof is based on the following four lemmas that are used to bound the probability with which certain point configurations occur. Throughout we use the notation
[TABLE]
The proofs make use of the inequalities
[TABLE]
where is some constant. Moreover, we repeatedly use that the volume of an annulus is given by
[TABLE]
Lemma 9.8**.**
Let and . There is a constant such that for all ,
[TABLE]
Proof.
Recall that
[TABLE]
The line through and cuts the disk into two parts. The area of the larger part is given by
[TABLE]
is the union of two such sets of radius from which we remove two sets of the same type with radius from the interior. This yields the formula for the area.
The inequality follows from
[TABLE]
[TABLE]
∎
Lemma 9.9**.**
Let and and let be a disk of radius . Then, there is a constant such that
[TABLE]
Proof.
Integration with respect to yields:
[TABLE]
When , the claim follows directly from Lemma 9.8. Otherwise, letting , we split the integral in two terms according to whether or . Applying Lemma 9.8 yields the bound
[TABLE]
To bound (24), we apply the mean value theorem and perform the integration to obtain the bound
[TABLE]
To bound (25), we bound the integrand using Lemma 9.8 and note that
[TABLE]
This proves the claim when . ∎
Lemma 9.10**.**
Let be a disk of radius . There is a constant such that for all with and either or and ,
[TABLE]
Proof.
We may assume . Indeed, if , we can show the claim by first integrating with respect to , then using that by Lemma 9.9,
[TABLE]
and finally integrating with respect to and to provide a factor . If , we first integrate with respect to , which yields the area of . This is bounded by , and by Lemma 9.9 the remaining integral is bounded by
[TABLE]
Let . We write the integral as a sum of three terms corresponding to whether I: , II: , or III: .
Term I: We first integrate with respect to . Since
[TABLE]
the mean value theorem applied to the formula in Lemma 9.8 implies that . We then integrate with respect to and to obtain the bound .
Term II: When , we first integrate with respect to to obtain the area of . To bound term II, we need to explicitly compute this area. For this, we first compute the area of the intersection where . By the assumption on ,
[TABLE]
This ensures that the line containing the two points where the boundaries of the disks and meet separates and . The area of is
[TABLE]
The area of is given by
[TABLE]
It is a straightforward computation to see that is uniformly bounded by on the set of satisfying (26) and . In particular, (26) guarantees that
[TABLE]
such that and have bounded derivatives for the relevant values of . It follows that (27) is bounded by . The remaining integral is of order by Lemma 9.9, which yields the appropriate bound.
Term III: In this case, we first integrate with respect to providing a factor . The remaining integral is bounded using Lemma 9.9. ∎
The fourth lemma allows us to analyze which point configurations can cause the birth and death of -bounded features. To state it, we recall the -complex associated with a locally finite point set , see e.g. [18, Sec. III.4] for details. It is built from the Delaunay triangulation, which is a triangulation of the plane with vertex set . For , is the union of all edges in the Delaunay triangulation with length at most and all triangles such that the three balls of radius centered at its vertices cover the triangle. Then and the inclusion is a homotopy equivalence, i.e. it preserves the topology.
Lemma 9.11**.**
Let be locally finite.
- (i)
Each connected component of contains at most one -bounded connected component of .
- (ii)
If an -bounded loop is born at time because two balls centered at meet, then there is an edge of length joining in the -complex.
- (iii)
If an -bounded feature dies at time because exactly three balls centered at points meet, then form a triangle with no obtuse angle in the -complex.
Proof.
The analogous statements hold for unbounded loops by the homotopy equivalence between the -complex and the union of balls. (i) follows because any -bounded loop is also an unbounded loop. An -bounded feature is either born the same way as the corresponding unbounded component or when two balls meet to split off a component. In both cases, some unbounded loop is born by the merging, and hence an edge is added to the -complex. This shows (ii). When an -bounded loop dies, so does the corresponding unbounded loop, hence (iii) is clear. ∎
We are now ready to prove Lemma 9.6.
Proof of Lemma 9.6.
Proof of (1). Stationarity and Equation (4) yield
[TABLE]
In the following, we let , and
[TABLE]
for simplicity. By definition of , (LABEL:disint) is bounded by
[TABLE]
Here, we have used that is symmetric in and and in , , and . Applying (4) again, we may bound the last term in (LABEL:disjointSum) by
[TABLE]
since . The remaining terms are treated similarly. Now choose a covering , where each is a translation of and such that for some independent of (for instance using that ). Then, by the moment condition (M) for ,
[TABLE]
We apply this in (30) together with Lemma 9.9. Since each is bounded according to the assumption of fast decay of correlations, we obtain the bound .
**Proof of (2). ** In the following, we use the notation
[TABLE]
Note that since the blocks and are neighboring, the features in and are different. Putting , we now expand as in (LABEL:disint)
[TABLE]
The condition comes from the fact that can give birth to at most one feature when connecting to another point, and since and are neighboring, and correspond to different features. Similarly, comes from the fact that a triangle can kill at most one feature.
The event excludes certain point configurations that are not possible. If the triangles formed by and share an edge, and the vertices of this edge coincide with and , then is not allowed. Indeed, it follows from Lemma 9.11 that the triangles correspond to the same feature in the -complex until and are joined. Thus, this must happen before both triangles are born, that is, at the latest at time . Moreover, if the two triangles share an edge, then the two points in not lying on this edge cannot be equal to and or to and , as this would lead to crossing edges in the -complex by Lemma 9.11 (since the triangles formed by cannot have any obtuse angles).
We now write the sum in (LABEL:9pointsum) as a sum where each term is a sum over , , as in (LABEL:disjointSum). Each such term comes from grouping into sets of equal points. Consider for illustration the term corresponding to the situation . The sum is handled as in the proof of Lemma 9.6(1) by applying (4) and bounding the involved Palm means. For this special point configuration, it is sufficient to bound by 1.
[TABLE]
Now, we apply the Hölder inequality with to obtain the bound
[TABLE]
In the first integral, we first integrate with respect to and then apply Lemma 9.9, while in the second integral we first integrate with respect to and use the bound in Lemma 9.8 and then apply Lemma 9.9 again. Next we use that and are neighboring blocks so that either or .
When , we get the bound
[TABLE]
so we take and .
When , we use Lemma 9.9 to get the bound
[TABLE]
so we take and .
For a general term, note that there are at least four different points among , so one of them, say , cannot be equal to any of . We consider two cases:
- I
is not among .
- II
, , and and .
Since we no longer keep track of which edge kills which triangle, all possible point configurations allowed by fall into one of the above cases after possibly renaming the variables.
In particular, if and the points are all different, one of them cannot be any of , and we could have taken this as and be in Case I. If , and, say, is not any of , , , we could have chosen as and be in Case I.
We further divide the Case I configurations allowed by into the following two sub-cases that have to be treated separately:
- Ia.
is not any of .
- Ib.
, , .
Again, after renaming the variables, we are always in one of the two sub-cases.
Case Ia: We apply the Hölder inequality to
[TABLE]
The first factor is integrated with respect to and the remaining integral is bounded using Lemma 9.9. The second factor is first integrated wrt. , the result is bounded using Lemma 9.8, and the remaining integral is bounded using Lemma 9.9. The rest of the argument proceeds as in the special case treated above.
Case Ib: The claim follows by applying the Hölder inequality to (35) and arguing as in Case Ia using Lemma 9.10 to bound the first integral.
Case II: We apply the Hölder inequality exactly as in (35) and argue as in Case Ia, except that the second integral is first integrated with respect to rather than .
Proof of (3). As in (LABEL:disint), we find
[TABLE]
The set consists of tuples of points and, similar to , it excludes certain configurations of the points that are not allowed by Lemma 9.11. If the triangles formed by and share an edge, then the length of this edge must be at most . Moreover, if the two triangles share an edge, then the two points in not lying on this edge cannot be equal to and or to and .
The contribution from the cases where two of the points are identical is bounded by
[TABLE]
which is handled exactly as in the proof of Lemma 9.6(2). Thus, it remains to treat the terms where are all different. Therefore, if we put , we must bound
[TABLE]
The rest of the proof proceeds as the proof of Lemma 9.6(2) by suitable applications of the Hölder inequality. We divide into two cases according to whether all points in , are one of or not. After renaming the variables, we may assume
- I
, , , and , or
- II
is not any of .
Notice that in Case I we exclude the case , , , and because it was excluded by definition of . After renaming variables, Case II is divided into
- IIa
is not any of or , and is not any of .
- IIb
and is not any of , , .
- IIc
, , , , .
- IId
, , , , .
In Case IIa, is not one of , while in Case IIb, IIc, and IId it is. Case IIb corresponds to the situation in which the triangles formed by share an edge, while in Case IIc and IId they share only one vertex. In Case IIc, each triangle contains one of the edges joining to and to , while in Case IId they do not.
Case I: When , we first write
[TABLE]
We then apply the Hölder inequality. Integrating first with respect to and then in (36) and integrating with respect to first in (37) yields a bound of order
[TABLE]
This is the same as (33) since . When , we replace by in (36), to obtain a bound of order
[TABLE]
which reduces to the same form as (34).
Case IIa: We apply the Hölder inequality to (36)–(37) and integrate first with respect to and then in (36) and with respect to first in (37). The remaining argument proceeds as in the proof of Lemma 9.6(2) Ia.
Case IIb: We apply the Hölder inequality to (36)–(37) and integrate first with respect to and then in (36) and with respect to first in (37) and argue as in the proof of Lemma 9.6(2) Ia.
Case IIc: In (36), we first integrate with respect to . In (37), we first integrate with respect to and to obtain a factor . Then we integrate with respect to and and apply Lemma 9.9 to obtain a factor . The resulting bounds are stricter than (33) and (34).
Case IId: Here we integrate (36) with respect to first and then while (37) is integrated first with respect to and then .
In all cases treated above, a minor difference to (LABEL:holderStep) is that the integration domains are slightly more complicated due to the indicator . However, it contributes at most a factor to the bound, and this cancels when we divide by .
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. Ahlberg, V. Tassion, and A. Teixeira. Sharpness of the phase transition for continuum percolation in ℝ 2 superscript ℝ 2 \mathbb{R}^{2} . Probab. Theory Related Fields , 172(1):525–581, 2018.
- 2[2] A. Baddeley and R. Turner. spatstat: An R package for analyzing spatial point patterns. Journal of Statistical Software, Articles , 12(6):1–42, 2005.
- 3[3] Y. Baryshnikov and J. E. Yukich. Gaussian limits for random measures in geometric probability. Ann. Appl. Probab. , 15(1A):213–253, 2005.
- 4[4] P. J. Bickel and M. J. Wichura. Convergence criteria for multiparameter stochastic processes and some applications. Ann. Math. Statist. , 42:1656–1670, 1971.
- 5[5] P. Billingsley. Convergence of Probability Measures . J. Wiley & Sons, New York, second edition, 1999.
- 6[6] C. A. N. Biscio and J. Møller. The accumulated persistence function, a new useful functional summary statistic for topological data analysis, with a view to brain artery trees and spatial point process applications. J. Comput. Graph. Statist. , 2019 (to appear).
- 7[7] B. Błaszczyszyn and D. Yogeshwaran. Clustering and percolation of point processes. Electron. J. Probab. , 18:1–20, 2013.
- 8[8] B. Błaszczyszyn, D. Yogeshwaran, and J. E. Yukich. Limit theory for geometric statistics of point processes having fast decay of correlations. Ann. Probab. , 47(2):835–895, 2019.
