Scattering Statistics of Generalized Spatial Poisson Point Processes
Michael Perlmutter, Jieqian He, Matthew Hirn

TL;DR
This paper introduces a novel machine learning approach using Gabor-type measurements for analyzing inhomogeneous Poisson point processes, providing invariance to transformations and distinguishing them from other processes.
Contribution
It proposes a new scattering transform based on Gabor measurements that decouples scale and frequency, enhancing analysis of Poisson point processes.
Findings
Effectively distinguishes Poisson processes from self-similar processes
Separates different types of Poisson point processes
Provides invariance to translations and reflections
Abstract
We present a machine learning model for the analysis of randomly generated discrete signals, modeled as the points of an inhomogeneous, compound Poisson point process. Like the wavelet scattering transform introduced by Mallat, our construction is naturally invariant to translations and reflections, but it decouples the roles of scale and frequency, replacing wavelets with Gabor-type measurements. We show that, with suitable nonlinearities, our measurements distinguish Poisson point processes from common self-similar processes, and separate different types of Poisson point processes.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsOptical Imaging and Spectroscopy Techniques · Morphological variations and asymmetry · Point processes and geometric inequalities
Scattering Statistics of Generalized Spatial Poisson Point Processes
Abstract
We present a machine learning model for the analysis of randomly generated discrete signals, modeled as the points of an inhomogeneous, compound Poisson point process. Like the wavelet scattering transform introduced by Mallat, our construction is naturally invariant to translations and reflections, but it decouples the roles of scale and frequency, replacing wavelets with Gabor-type measurements. We show that, with suitable nonlinearities, our measurements distinguish Poisson point processes from common self-similar processes, and separate different types of Poisson point processes.
**Index Terms— ** Scattering transform, Poisson point process, convolutional neural network
1 Introduction
Convolutional neural networks (CNNs) have obtained impressive results for a number of learning tasks in which the underlying signal data can be modelled as a stochastic process, including texture discrimination [1], texture synthesis [2, 3], time-series analysis [4], and wireless networks [5]. In many scenarios, it is natural to model the signal data as the points of a (potentially complex) spatial point process. Furthermore, there are numerous other fields, including stochastic geometry [6], forestry [7], geoscience [8] and genetics [9], in which spatial point processes are used to model the underlying generating process of certain phenomena (e.g., earthquakes). This motivates us to consider the capacity of CNNs to capture the statistical properties of such processes.
The Wavelet scattering transform [10] is a model for CNNs, which consists of an alternating cascade of linear wavelet transforms and complex modulus nonlinearities. It has provable stability and invariance properties and has been used to achieve near state of the art results in fields such as audio signal processing [11], computer vision [12], and quantum chemistry [13]. In this paper, we examine a generalized scattering transform that utilizes a broader class of filters (which includes wavelets). We primarily focus on filters with small support, which is similar to those used in most CNNs.
Expected wavelet scattering moments for stochastic processes with stationary increments were introduced in [14], where it is shown that such moments capture important statistical information of one-dimensional Poisson processes, fractional Brownian motion, -stable Lévy processes, and a number of other stochastic processes. In this paper, we extend the notion of scattering moments to our generalized architecture, and generalize many of the results from [14]. However, the main contributions contained here consist of new results for more general spatial point processes, including inhomogeneous Poisson point processes, which are not stationary and do not have stationary increments. The collection of expected scattering moments is a non-parametric model for these processes, which we show captures important summary statistics.
In Section 2 we will define our expected scattering moments. Then, in Sections 3 and 4 we will analyze these moments for certain generalized Poisson point processes and self-similar processes. We will present numerical examples in Section 5, and provide a short conclusion in section 6.
2 Expected Scattering Moments
Let be a compactly supported mother wavelet with dilations for , and let be a stochastic process with stationary increments. The first-order wavelet scattering moments are defined in [14] as , where the expectation does not depend on since has stationary increments and is a wavelet which implies is stationary. Much of the analysis of in [14] relies on the fact that these moments can be rewritten as , where . This motivates us to define scattering moments as the integration of a filter, against a random signed measure
To that end, let be a continuous window function with support contained in . Denote by the dilation of , and set to be the Gabor-type filter with scale and central frequency ,
[TABLE]
Note that with an appropriately chosen window function (1) includes dyadic wavelet families in the case that and . However, it also includes many other filters, such as Gabor filters used in the windowed Fourier transform.
Let be a random signed measure and assume that is -periodic for some in the sense that for any Borel set we have , for all (where is the standard orthonormal basis for ). For , set . We define the first-order and second-order expected scattering moments, at location as
[TABLE]
Note is not assumed to be stationary, which is why these moments depend on . Since is periodic, we may also define time-invariant scattering coefficients by
[TABLE]
In the following sections, we analyze these moments for arbitrary frequencies and small scales , thus allowing the filters to serve as a model for the learned filters in CNNs. In particular, we will analyze the asymptotic behavior of the scattering moments as decreases to zero.
3 Scattering Moments of Generalized Poisson Processes
In this section, we let be an inhomogeneous, compound spatial Poisson point process. Such processes generalize ordinary Poisson point processes by incorporating variable charges (heights) at the points of the process and a non-uniform intensity for the locations of the points. They thus provide a flexible family of point processes that can be used to model many different phenomena. In this section, we provide a review of such processes and analyze their first and second-order scattering moments.
Let be a continuous, periodic function on with
[TABLE]
and define its first and second order moments by
[TABLE]
A random measure is called an inhomogeneous Poisson point process with intensity function if for any Borel set ,
[TABLE]
and, in addition, is independent of for all that do not intersect . Now let be a sequence of i.i.d. random variables independent of . An inhomogeneous, compound Poisson point process is given by
[TABLE]
For a further overview of these processes, we refer the reader to Section 6.4 of [15].
3.1 First-order Scattering Asymptotics
Computing the convolution of with gives
[TABLE]
which can be interpreted as a waveform emitting from each location . Invariant scattering moments aggregate the random interference patterns in . The results below show that the expectation of these interference patterns encode important statistical information related to the point process.
For notational convenience, we let
[TABLE]
denote the expected number of points of in the support of . By conditioning on , the number of points in the support of , and using the fact that
[TABLE]
one may obtain the following theorem.111A proof of Theorem 1, as well as the proofs of other theorems stated in this paper, can be found in the appendix
Theorem 1**.**
Let , and be a periodic continuous intensity function satisfying (4). Then for every every such that and every
[TABLE]
where the error term satisfies
[TABLE]
*and is an i.i.d. sequence of random variables, independent of the , taking values in the unit cube and with density for *
If we set and let then one may use the fact that a small cube has at most one point of with overwhelming probability to obtain the following result.
Theorem 2**.**
Let satisfy the same assumptions as in Theorem 1. Let be a sequence of scale and frequency pairs such that . Then
[TABLE]
for all , and consequently
[TABLE]
This theorem shows that for small scales the scattering moments encode the intensity function , up to factors depending upon the summary statistics of the charges and the window . Thus even a one-layer location-dependent scattering network yields considerable information regarding the underlying data generation process.
In the case of ordinary (non-compound) homogeneous Poisson processes, Theorem 2 recovers the constant intensity. For general and invariant scattering moments, the role of higher-order moments of is highlighted by considering higher-order expansions (e.g., ) in (6). The next theorem considers second-order expansions and illustrates their dependence on the second moment of .
Theorem 3**.**
Let satisfy the same assumptions as in Theorem 1. If , is a sequence such that and , then
[TABLE]
where , are independent uniform random variables on ; and is a sequence of random variables independent of the taking values in the unit cube with respective densities, for .
We note that the scale normalization on the left hand side of (10) is , compared to a normalization of in Theorem 2. Thus, intuitively, (10) is capturing information at moderately small scales that are larger than the scales considered in Theorem 2. Unlike Theorem 2, which gives a way to compute , Theorem 3 does not allow one to compute since it would require knowledge of in addition to the distribution from which the charges are drawn. However, Theorem 3 does show that at moderately small scales the invariant scattering coefficients depend non-trivially on the second moment of . Therefore, they can be used to distinguish between, for example, an inhomogeneous Poisson point process with intensity function and a homogeneous Poisson point process with constant intensity.
3.2 Second-Order Scattering Moments of Generalized Poisson Processes
Our next result shows that second-order scattering moments encode higher-order moment information about the
Theorem 4**.**
Let satisfy the same assumptions as in Theorem 1. Let and be sequences of scale-frequency pairs with for some and . Let and Assume and let . Then,
[TABLE]
Theorem 2 shows first-order scattering moments with are not able to distinguish between different types of Poisson point processes at very small scales if the charges have the same first moment. However, Theorem 4 shows second-order scattering moments encode higher-moment information about the charges, and thus are better able to distinguish them (when used in combination with the first-order coefficients). In Sec. 4, we will see first-order invariant scattering moments can distinguish Poisson point processes from self-similar processes if but may fail to do so for larger values of
4 Comparison to Self-Similar Processes
We will show first-order invariant scattering moments can distinguish between Poisson point processes and certain self-similar processes, such as -stable processes, or fractional Brownian motion (fBM). These results generalize those in [14] both by considering more general filters and general scattering moments.
For a stochastic process , we consider the convolution of the filter with the noise defined by and define (in a slight abuse of notation) the first-order scattering moments at time by In the case where is a compound, inhomogeneous Poisson (counting) process, will be a compound Poisson random measure and these scattering moments will coincide with those defined in (2).
The following theorem analyzes the small-scale first-order scattering moments when is either an -stable process, or an fBM. It shows the small-scale asymptotics of the corresponding scattering moments are guaranteed to differ from those of a Poisson point process when We also note that both -stable processes and fBM have stationary increments and thus for all .
Theorem 5**.**
Let and let be a sequence of scale-frequency pairs with and . Then, if is a symmetric -stable process, we have
[TABLE]
Similarly, if is an fBM with Hurst parameter and has bounded variation on then
[TABLE]
This theorem shows that first-order invariant scattering moments distinguish inhomogeneous, compound Poisson processes from both -stable processes and fractional Brownian motion except in the cases where or . In particular, these measurements distinguish Brownian motion, from a Poisson point process except in the case where .
5 Numerical Illustrations
We carry out several experiments to numerically validate the previously stated results. In all of our experiments, we hold the frequency constant while letting decrease to zero.
Compound Poisson point processes with the same intensities: We generated three homogeneous compound Poisson point processes, all with intensity , where the charges , , and are chosen so that uniformly, , and are Rademacher random variables. The charges of the three signals have the same first moment and different second moment with and . As predicted by Theorem 2, Figure 1 shows first-order scattering moments will not be able to distinguish between the three processes with , but will distinguish the process with Gaussian charges from the other two when .
Inhomogeneous, non-compound Poisson point processes: We also consider an inhomogeneous, non-compound Poisson point processes with intensity function (where we estimate , by averaging over 1000 realizations). Figure 2 plots the scattering moments for the inhomogeneous process at different times, and shows they align with the true intensity function.
Poisson point process and self similar process: We consider a Brownian motion compared to a Poisson point process with intensity and charges . Figure 3 shows the convergence rate of the first-order scattering moments can distinguish these processes when but not when
6 Conclusion
We have constructed Gabor-filter scattering transforms for random measures on Our work is closely related to [14] but considers more general classes of filters and point processes (although we note that [14] provides a more detailed analysis of self-similar processes). In future work, it would be interesting to explore the use of these measurements for tasks such as, e.g., synthesizing new signals.
Appendix A Proof of Theorem 1
To prove Theorem 1 we will need the following lemma.
Lemma 1**.**
Let be a Poisson random variable with parameter . Then for all , , , we have
[TABLE]
Proof.
For and , . Therefore,
[TABLE]
∎
The proof of Theorem 1.
Recalling the definitions of and , and setting , we see
[TABLE]
where are the points in . Conditioned on the event that , the locations of the points on are distributed as i.i.d. random variables taking values in with density
[TABLE]
Therefore, the random variables
[TABLE]
take values in the unit cube and have density
[TABLE]
Note that in the special case that is homogeneous, i.e. is constant, the are uniform random variables on .
Therefore, computing the conditional expectation, we have for
[TABLE]
where (14) follows from (i) the independence of the random variables and ; (ii) the fact that for any sequence of i.i.d. random variables ,
[TABLE]
and (iii) the fact that
[TABLE]
Therefore, since ,
[TABLE]
where
[TABLE]
By (14) and Lemma 1, if is small enough so that , then:
[TABLE]
∎
Appendix B Proof of Theorem 2
Proof.
Let be a sequence of scale and frequency pairs such that . Applying Theorem 1 with , we obtain:
[TABLE]
where we write to emphasize the fact that the density of is:
[TABLE]
Using the error bound (7), we see that:
[TABLE]
Furthermore, since , we observe that:
[TABLE]
and by the continuity of ,
[TABLE]
Finally, by the continuity of , we see that
[TABLE]
Therefore, by the bounded convergence theorem,
[TABLE]
That completes the proof of (8).
To prove (9), we assume that is periodic with period along each coordinate and again use Theorem 1 with to observe,
[TABLE]
By (7), the second integral converges to zero as . Therefore,
[TABLE]
by the continuity of and the bounded convergence theorem. ∎
Appendix C Proof of Theorem 3
Proof.
We apply Theorem 1 with and obtain:
[TABLE]
where , , are random variables taking values on the unit cube with densities,
[TABLE]
Dividing both sides in (18) by and subtracting yields:
[TABLE]
Using the error bound (7),
[TABLE]
at a rate independent of . Recalling (16) from the proof of Theorem 2, we use the fact that and the bounded convergence theorem to conclude,
[TABLE]
where , , are uniform random variables on the unit cube and . Similarly,
[TABLE]
Lastly, recalling that as and using (15) from the proof of Theorem 2, we see
[TABLE]
Now we integrate both sides of (21) over and divide by . Taking the limit as , on the left hand side we get:
[TABLE]
where we used the definition of the invariant scattering moments and (25). On the right hand side of (21), we use (25), (27) and the dominated convergence theorem to see that the first term is:
[TABLE]
Using (15), (23), and the bounded convergence theorem, the second term of (21) is:
[TABLE]
where
[TABLE]
Finally, the third term of (21) goes to zero using the bounded convergence theorem and (22). Putting together the left and right hand sides of (21) with these calculations finishes the proof. ∎
Appendix D Proof of Theorem 4
Proof.
As in the proof of Theorem 1, let denote the number of points in the cube . Then since the support of is contained in ,
[TABLE]
where are the points of in . Therefore, in the event that ,
[TABLE]
and so, partitioning the space of possible outcomes based on , we obtain:
[TABLE]
where
[TABLE]
Using the above, we can write the second order convolution term as:
[TABLE]
The following lemma implies that decays rapidly in at a rate independent of .
Lemma 2**.**
There exists , independent of , such that if ,
[TABLE]
Once we have proved Lemma 2, equation (11) will follow once we show,
[TABLE]
Let us prove (28) first and postpone the proof of Lemma 2. We will use the fact that the support of is contained in . Let , , , and let be the points of in the cube . We have that , and conditioned on the event that , the locations of the points are distributed as i.i.d. random variables taking values in with density . Therefore the i.i.d. random variables defined by take values in and have density
[TABLE]
Now, we condition on to see that
[TABLE]
The following lemma will be used to estimate the scaling of the term in (31).
Lemma 3**.**
For all ,
[TABLE]
Furthermore, there exists , independent of , such that if then
[TABLE]
Proof.
Making a change of variables in both and , and recalling the assumption that , we observe that
[TABLE]
The continuity of implies that
[TABLE]
Furthermore, the assumption implies
[TABLE]
Therefore, (33) follows from the dominated convergence theorem and by the observation that the inner integral of (36) is zero unless . Equation (34) follows from inserting (37) into (36) and sending to infinity. ∎
Since
[TABLE]
the independence of and , the continuity of , and Lemma 3 imply that taking in (31) yields:
[TABLE]
The following lemma shows that (32) is (and converges at a rate independent of ), and therefore completes the proof of (11) subject to proving Lemma 2.
Lemma 4**.**
For all there exists , independent of , such that if , then
[TABLE]
Proof.
For any sequence of i.i.d. random variables, it holds that
[TABLE]
Therefore, by Lemma 1, Lemma 3, and the fact that the and are i.i.d. and independent of each other, we see that if where is as in (34),
[TABLE]
where the last inequality uses the fact that ∎
We will now complete the proof of the theorem by proving Lemma 2.
Proof.
[Lemma 2] Since
[TABLE]
we see that
[TABLE]
First turning our attention to the second term, we note that
[TABLE]
since for all Therefore, conditioning on if
[TABLE]
by Lemma 4. Now, turning our attention to the first term, note that
[TABLE]
Therefore, by the same logic as in (38)
[TABLE]
So again conditioning on and applying Lemma 4, we see that if
[TABLE]
∎
This completes the proof of (11). Line (12) follows from integrating with respect to observing that the error bounds in Lemmas 2 and 3 are independent of and applying the bounded convergence theorem.
∎
Appendix E The Proof of Theorem 5
In order to prove Theorems 5, we will need the following lemma which shows that the scaling relationship of a self-similar process induces a similar relationship on stochastic integrals against
Lemma 5**.**
Let be a stochastic process that satisfies the scaling relation
[TABLE]
for some (where denotes equality in distribution). Then for any measurable function ,
[TABLE]
Proof.
Let be a stochastic process satisfying (39), and let be a sequence of partitions of such that
[TABLE]
Then, by the scaling relation (39),
[TABLE]
∎
We will now use Lemma 5 to prove Theorem 5.
Proof.
We first consider the case where is an -stable process, Since has stationary increments, its scattering coefficients do not depend on and it suffices to analyze
[TABLE]
where the second equality uses the fact the distribution of does not change if it is run in reverse, i.e.
[TABLE]
It is well known that satisfies (39) for Therefore, by Lemma 5
[TABLE]
So,
[TABLE]
The proof will be complete as soon as we show that
[TABLE]
By the triangle inequality,
[TABLE]
Since we may choose strictly greater than such that and note that by Jensen’s inequality
[TABLE]
and since is a -integrable martingale, the boundedness of martingale transforms (see [16] and also [17]) implies
[TABLE]
which converges to zero by the continuity of on and the assumption that converges to
Similarly, in the case where is a fractional Brownian motion with Hurst parameter we again need to show
[TABLE]
However, fractional Brownian motion is not a semi-martingale so we cannot apply Burkholder’s theorem as we did in the proof of Theorem 5. Instead, we use the Young-Lóeve estimate [18] which states that if is any (deterministic) function with bounded variation, and is any function which is -Hölder continuous, then
[TABLE]
is well-defined as the limit of Riemann sums and
[TABLE]
where and are the bounded variation and -Hölder seminorms respectively. For all , the function satisfies, and
[TABLE]
One can check that the fact that converges to implies that converges to zero in both and in the bounded variation seminorm, and that therefore that converges to zero.
It is well-known that fractional Brownian motion with Hurst parameter admits a continuous modification which is -Hölder continuous for any Therefore,
[TABLE]
Lastly, one can use the Garsia-Rodemich-Rumsey inequality [19], to show that
[TABLE]
for all For details we refer the reader to the survey article [20]. Therefore,
[TABLE]
as desired.
∎
Remark 1**.**
The assumption that has bounded-variation was used to justify that the stochastic integral against fractional Brownian motion was well defined as the limit of Riemann sums because of its Hölder continuity and the above mentioned result of [18]. This allowed us to avoid the technical complexities of defining such an integral using either the Malliavin calculus or the Wick product.
Appendix F Details of Numerical Experiments
F.1 Definition of Filters
For all the numerical experiments, we take the window function to be the smooth bump function
[TABLE]
Therefore for , our filters are given by
[TABLE]
F.2 Frequencies
In all of our experiments, we hold the frequency, which we sample uniformly at random from constant while allowing the scale to decrease to zero.
F.3 Simulation of Poisson point process
We use the standard method to generate a realization of a Poisson point process. For Poisson point process with intensity , the time interval between two neighbor jumps follows exponential distribution:
[TABLE]
Therefore, taking the inverse cumulative distribution function, we sample the time interval between two neighbor jumps through:
[TABLE]
where are i.i.d. uniform random variables on and assign the charge to the jump at location .
For inhomogeneous Poisson process with intensity funciton , we simulate the time interval based on a well-known algorithm. We, first define the cumulated intensity:
[TABLE]
then generate the location of jumps by the Algorithm 1.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Laurent Sifre and Stéphane Mallat, “Rotation, scaling and deformation invariant scattering for texture discrimination,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , June 2013.
- 2[2] Leon Gatys, Alexander S Ecker, and Matthias Bethge, “Texture synthesis using convolutional neural networks,” in Advances in Neural Information Processing Systems 28 , 2015, pp. 262–270.
- 3[3] Joseph Antognini, Matt Hoffman, and Ron J. Weiss, “Synthesizing diverse, high-quality audio textures,” ar Xiv:1806.08002, 2018.
- 4[4] Mikolaj Binkowski, Gautier Marti, and Philippe Donnat, “Autoregressive convolutional neural networks for asynchronous time series,” in Proceedings of the 35th International Conference on Machine Learning , Jennifer Dy and Andreas Krause, Eds., Stockholmsmässan, Stockholm Sweden, 10–15 Jul 2018, vol. 80 of Proceedings of Machine Learning Research , pp. 580–589, PMLR.
- 5[5] Antoine Brochard, Bartłomiej Błaszczyszyn, Stéphane Mallat, and Sixin Zhang, “Statistical learning of geometric characteristics of wireless networks,” ar Xiv:1812.08265, 2018.
- 6[6] Martin Haenggi, Jeffrey G. Andrews, François Baccelli, Olivier Dousse, and Massimo Franceschetti, “Stochastic geometry and random graphs for the analysis and design of wireless networks,” IEEE Journal on Selected Areas in Communications , vol. 27, no. 7, pp. 1029–1046, 2009.
- 7[7] Astrid Genet, Pavel Grabarnik, Olga Sekretenko, and David Pothier, “Incorporating the mechanisms underlying inter-tree competition into a random point process model to improve spatial tree pattern analysis in forestry,” Ecological Modelling , vol. 288, pp. 143–154, 09 2014.
- 8[8] Frederic Paik Schoenberg, “A note on the consistent estimation of spatial-temporal point process parameters,” Statistica Sinica , 2016.
