Scattering Statistics of Generalized Spatial Poisson Point Processes

Michael Perlmutter; Jieqian He; Matthew Hirn

arXiv:1902.03537·math.ST·October 12, 2021·ICASSP

Scattering Statistics of Generalized Spatial Poisson Point Processes

Michael Perlmutter, Jieqian He, Matthew Hirn

PDF

Open Access

TL;DR

This paper introduces a novel machine learning approach using Gabor-type measurements for analyzing inhomogeneous Poisson point processes, providing invariance to transformations and distinguishing them from other processes.

Contribution

It proposes a new scattering transform based on Gabor measurements that decouples scale and frequency, enhancing analysis of Poisson point processes.

Findings

01

Effectively distinguishes Poisson processes from self-similar processes

02

Separates different types of Poisson point processes

03

Provides invariance to translations and reflections

Abstract

We present a machine learning model for the analysis of randomly generated discrete signals, modeled as the points of an inhomogeneous, compound Poisson point process. Like the wavelet scattering transform introduced by Mallat, our construction is naturally invariant to translations and reflections, but it decouples the roles of scale and frequency, replacing wavelets with Gabor-type measurements. We show that, with suitable nonlinearities, our measurements distinguish Poisson point processes from common self-similar processes, and separate different types of Poisson point processes.

Equations290

g_{γ} (t) = w_{s} (t) e^{i ξ \cdot t}, γ = (s, ξ), t \in R^{d} .

g_{γ} (t) = w_{s} (t) e^{i ξ \cdot t}, γ = (s, ξ), t \in R^{d} .

S_{γ, p} Y (t)

S_{γ, p} Y (t)

S_{γ, p, γ^{'}, p^{'}} Y (t)

S Y (γ, p)

S Y (γ, p)

S Y (γ, p, γ^{'}, p^{'})

0 < λ_{m i n} := t in f λ (t) \leq ∥ λ ∥_{\infty} < \infty,

0 < λ_{m i n} := t in f λ (t) \leq ∥ λ ∥_{\infty} < \infty,

m_{p} (λ) : = \frac{1}{T ^{d}} \int_{[0, T]^{d}} λ (t)^{2} d t, p = 1, 2 .

m_{p} (λ) : = \frac{1}{T ^{d}} \int_{[0, T]^{d}} λ (t)^{2} d t, p = 1, 2 .

P(N(B)=n)=e^{-\Lambda(B)}\frac{\bigl{(}\Lambda(B)\bigr{)}^{n}}{n!},\enspace\Lambda(B)=\int_{B}\lambda(t)\,dt,

P(N(B)=n)=e^{-\Lambda(B)}\frac{\bigl{(}\Lambda(B)\bigr{)}^{n}}{n!},\enspace\Lambda(B)=\int_{B}\lambda(t)\,dt,

Y (d t) = j = 1 \sum \infty A_{j} δ_{t_{j}} (d t) .

Y (d t) = j = 1 \sum \infty A_{j} δ_{t_{j}} (d t) .

(g_{γ} * Y) (t) = \int_{R^{d}} g_{γ} (t - u) Y (d u) = j = 1 \sum \infty A_{j} g_{γ} (t - t_{j}),

(g_{γ} * Y) (t) = \int_{R^{d}} g_{γ} (t - u) Y (d u) = j = 1 \sum \infty A_{j} g_{γ} (t - t_{j}),

Λ_{s} (t) := Λ ([t - s, t]^{d}) = \int_{[t - s, t]^{d}} λ (u) d u

Λ_{s} (t) := Λ ([t - s, t]^{d}) = \int_{[t - s, t]^{d}} λ (u) d u

P [N ([t - s, t]^{d}) > m] = O ((s^{d} ∥ λ ∥_{\infty})^{m + 1})

P [N ([t - s, t]^{d}) > m] = O ((s^{d} ∥ λ ∥_{\infty})^{m + 1})

S_{γ, p} Y (t) \approx k = 1 \sum m e^{- Λ_{s} (t)} \frac{( Λ _{s} ( t ) ) ^{k}}{k !} E [j = 1 \sum k A_{j} w (V_{j}) e^{i s ξ \cdot V_{j}}^{p}],

S_{γ, p} Y (t) \approx k = 1 \sum m e^{- Λ_{s} (t)} \frac{( Λ _{s} ( t ) ) ^{k}}{k !} E [j = 1 \sum k A_{j} w (V_{j}) e^{i s ξ \cdot V_{j}}^{p}],

∣ ε (m, s, ξ, t) ∣ \leq C_{m, p} \frac{∥ λ ∥ _{\infty}}{λ _{m i n}} ∥ w ∥_{p}^{p} E [∣ A_{1} ∣^{p}] ∥ λ ∥_{\infty}^{m + 1} s^{d (m + 1)}

∣ ε (m, s, ξ, t) ∣ \leq C_{m, p} \frac{∥ λ ∥ _{\infty}}{λ _{m i n}} ∥ w ∥_{p}^{p} E [∣ A_{1} ∣^{p}] ∥ λ ∥_{\infty}^{m + 1} s^{d (m + 1)}

k \to \infty lim \frac{S _{γ_{k}, p} Y ( t )}{s _{k}^{d}} = λ (t) E [∣ A_{1} ∣^{p}] ∥ w ∥_{p}^{p},

k \to \infty lim \frac{S _{γ_{k}, p} Y ( t )}{s _{k}^{d}} = λ (t) E [∣ A_{1} ∣^{p}] ∥ w ∥_{p}^{p},

k \to \infty lim \frac{S Y ( γ _{k} , p )}{s _{k}^{d}} = m_{1} (λ) E [∣ A_{1} ∣^{p}] ∥ w ∥_{p}^{p} .

k \to \infty lim \frac{S Y ( γ _{k} , p )}{s _{k}^{d}} = m_{1} (λ) E [∣ A_{1} ∣^{p}] ∥ w ∥_{p}^{p} .

k \to \infty lim

k \to \infty lim

=

k \to \infty lim \frac{S _{γ_{k}, p, γ_{k}^{'}, p^{'}} Y ( t )}{s _{k}^{d (p^{'} + 1)}}

k \to \infty lim \frac{S _{γ_{k}, p, γ_{k}^{'}, p^{'}} Y ( t )}{s _{k}^{d (p^{'} + 1)}}

k \to \infty lim \frac{S Y ( γ _{k} , p , γ _{k}^{'} , p ^{'} )}{s _{k}^{d (p^{'} + 1)}}

k \to \infty lim \frac{S X ( γ _{k} , p )}{s _{k}^{p / α}} = E [\int_{0}^{1} w (u) e^{i Lu} d X (u)^{p}] .

k \to \infty lim \frac{S X ( γ _{k} , p )}{s _{k}^{p / α}} = E [\int_{0}^{1} w (u) e^{i Lu} d X (u)^{p}] .

k \to \infty lim \frac{S X ( γ _{k} , p )}{s _{k}^{p H}} = E [\int_{0}^{1} w (u) e^{i Lu} d X (u)^{p}] .

k \to \infty lim \frac{S X ( γ _{k} , p )}{s _{k}^{p H}} = E [\int_{0}^{1} w (u) e^{i Lu} d X (u)^{p}] .

E [Z^{α} \mathbbm 1_{{Z > m}}] = k = m + 1 \sum \infty e^{- λ} \frac{λ ^{k}}{k !} k^{α} \leq C_{m, α} λ^{m + 1} .

E [Z^{α} \mathbbm 1_{{Z > m}}] = k = m + 1 \sum \infty e^{- λ} \frac{λ ^{k}}{k !} k^{α} \leq C_{m, α} λ^{m + 1} .

E [Z^{α} \mathbbm 1_{{Z > m}}]

E [Z^{α} \mathbbm 1_{{Z > m}}]

= λ^{m + 1} k = 0 \sum \infty e^{- λ} \frac{λ ^{k}}{( k + m + 1 )!} (k + m + 1)^{α}

\leq λ^{m + 1} k = 0 \sum \infty \frac{( k + m + 1 ) ^{α}}{( k + m + 1 )!}

= C_{α, m} λ^{m + 1} .

S_{γ, p} Y (t)

S_{γ, p} Y (t)

= E j = 1 \sum N_{s} (t) A_{j} w (\frac{t - t _{j}}{s}) e^{i ξ \cdot (t - t_{j})}^{p},

p_{Z} (z) = \frac{λ ( z )}{Λ _{s} ( t )}, z \in [t - s, t]^{d} .

p_{Z} (z) = \frac{λ ( z )}{Λ _{s} ( t )}, z \in [t - s, t]^{d} .

V_{i} := \frac{t - Z _{i}}{s}

V_{i} := \frac{t - Z _{i}}{s}

p_{V} (v) = \frac{s ^{d}}{Λ _{s} ( t )} λ (t - v s), v \in [0, 1]^{d} .

p_{V} (v) = \frac{s ^{d}}{Λ _{s} ( t )} λ (t - v s), v \in [0, 1]^{d} .

E j = 1 \sum N_{s} (t) A_{j} w (\frac{t - t _{j}}{s}) e^{i ξ \cdot (t - t_{j})}^{p} : N_{s} (t) = k

E j = 1 \sum N_{s} (t) A_{j} w (\frac{t - t _{j}}{s}) e^{i ξ \cdot (t - t_{j})}^{p} : N_{s} (t) = k

=

\leq

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptical Imaging and Spectroscopy Techniques · Morphological variations and asymmetry · Point processes and geometric inequalities

Full text

Scattering Statistics of Generalized Spatial Poisson Point Processes

Abstract

We present a machine learning model for the analysis of randomly generated discrete signals, modeled as the points of an inhomogeneous, compound Poisson point process. Like the wavelet scattering transform introduced by Mallat, our construction is naturally invariant to translations and reflections, but it decouples the roles of scale and frequency, replacing wavelets with Gabor-type measurements. We show that, with suitable nonlinearities, our measurements distinguish Poisson point processes from common self-similar processes, and separate different types of Poisson point processes.

**Index Terms— ** Scattering transform, Poisson point process, convolutional neural network

1 Introduction

Convolutional neural networks (CNNs) have obtained impressive results for a number of learning tasks in which the underlying signal data can be modelled as a stochastic process, including texture discrimination [1], texture synthesis [2, 3], time-series analysis [4], and wireless networks [5]. In many scenarios, it is natural to model the signal data as the points of a (potentially complex) spatial point process. Furthermore, there are numerous other fields, including stochastic geometry [6], forestry [7], geoscience [8] and genetics [9], in which spatial point processes are used to model the underlying generating process of certain phenomena (e.g., earthquakes). This motivates us to consider the capacity of CNNs to capture the statistical properties of such processes.

The Wavelet scattering transform [10] is a model for CNNs, which consists of an alternating cascade of linear wavelet transforms and complex modulus nonlinearities. It has provable stability and invariance properties and has been used to achieve near state of the art results in fields such as audio signal processing [11], computer vision [12], and quantum chemistry [13]. In this paper, we examine a generalized scattering transform that utilizes a broader class of filters (which includes wavelets). We primarily focus on filters with small support, which is similar to those used in most CNNs.

Expected wavelet scattering moments for stochastic processes with stationary increments were introduced in [14], where it is shown that such moments capture important statistical information of one-dimensional Poisson processes, fractional Brownian motion, $\alpha$ -stable Lévy processes, and a number of other stochastic processes. In this paper, we extend the notion of scattering moments to our generalized architecture, and generalize many of the results from [14]. However, the main contributions contained here consist of new results for more general spatial point processes, including inhomogeneous Poisson point processes, which are not stationary and do not have stationary increments. The collection of expected scattering moments is a non-parametric model for these processes, which we show captures important summary statistics.

In Section 2 we will define our expected scattering moments. Then, in Sections 3 and 4 we will analyze these moments for certain generalized Poisson point processes and self-similar processes. We will present numerical examples in Section 5, and provide a short conclusion in section 6.

2 Expected Scattering Moments

Let $\psi\in\mathbf{L}^{2}(\mathbb{R})$ be a compactly supported mother wavelet with dilations $\psi_{j}(t)=2^{-j}\psi(2^{-j}t)$ for $j\in\mathbb{Z}$ , and let $X(t),t\in\mathbb{R},$ be a stochastic process with stationary increments. The first-order wavelet scattering moments are defined in [14] as $SX(j)=\mathbb{E}[|\psi_{j}\ast X|]$ , where the expectation does not depend on $t$ since $X(t)$ has stationary increments and $\psi_{j}$ is a wavelet which implies $X\ast\psi_{j}(t)$ is stationary. Much of the analysis of in [14] relies on the fact that these moments can be rewritten as $SX(j)=\mathbb{E}[|\overline{\psi}_{j}\ast dX|]$ , where $d\overline{\psi}_{j}=\psi_{j}$ . This motivates us to define scattering moments as the integration of a filter, against a random signed measure $Y(dt).$

To that end, let $w\in\mathbf{L}^{2}(\mathbb{R}^{d})$ be a continuous window function with support contained in $[0,1]^{d}$ . Denote by $w_{s}(t)=w\left(\frac{t}{s}\right)$ the dilation of $w$ , and set $g_{\gamma}(t)$ to be the Gabor-type filter with scale $s>0$ and central frequency $\xi\in\mathbb{R}^{d}$ ,

[TABLE]

Note that with an appropriately chosen window function $w,$ (1) includes dyadic wavelet families in the case that $s=2^{j}$ and $|\xi|=C/s$ . However, it also includes many other filters, such as Gabor filters used in the windowed Fourier transform.

Let $Y(dt)$ be a random signed measure and assume that $Y$ is $T$ -periodic for some $T>0$ in the sense that for any Borel set $B$ we have $Y(B)=Y(B+Te_{i})$ , for all $1\leq i\leq d$ (where $\{e_{i}\}_{i\leq d}$ is the standard orthonormal basis for $\mathbb{R}^{d}$ ). For $f\in\mathbf{L}^{2}(\mathbb{R}^{d})$ , set $f\ast Y(t)\coloneqq\int_{\mathbb{R}^{d}}f(t-u)Y(du)$ . We define the first-order and second-order expected scattering moments, $1\leq p,p^{\prime}<\infty,$ at location $t$ as

[TABLE]

Note $Y(dt)$ is not assumed to be stationary, which is why these moments depend on $t$ . Since $Y(dt)$ is periodic, we may also define time-invariant scattering coefficients by

[TABLE]

In the following sections, we analyze these moments for arbitrary frequencies $\xi$ and small scales $s$ , thus allowing the filters $g_{\gamma}$ to serve as a model for the learned filters in CNNs. In particular, we will analyze the asymptotic behavior of the scattering moments as $s$ decreases to zero.

3 Scattering Moments of Generalized Poisson Processes

In this section, we let $Y(dt)$ be an inhomogeneous, compound spatial Poisson point process. Such processes generalize ordinary Poisson point processes by incorporating variable charges (heights) at the points of the process and a non-uniform intensity for the locations of the points. They thus provide a flexible family of point processes that can be used to model many different phenomena. In this section, we provide a review of such processes and analyze their first and second-order scattering moments.

Let $\lambda(t)$ be a continuous, periodic function on $\mathbb{R}^{d}$ with

[TABLE]

and define its first and second order moments by

[TABLE]

A random measure $N(dt)\coloneqq\sum_{j=1}^{\infty}\delta_{t_{j}}(dt)$ is called an inhomogeneous Poisson point process with intensity function $\lambda(t)$ if for any Borel set $B\subset\mathbb{R}^{d}$ ,

[TABLE]

and, in addition, $N(B)$ is independent of $N(B^{\prime})$ for all $B^{\prime}$ that do not intersect $B$ . Now let $(A_{j})_{j=1}^{\infty}$ be a sequence of i.i.d. random variables independent of $N$ . An inhomogeneous, compound Poisson point process $Y(dt)$ is given by

[TABLE]

For a further overview of these processes, we refer the reader to Section 6.4 of [15].

3.1 First-order Scattering Asymptotics

Computing the convolution of $g_{\gamma}$ with $Y(dt)$ gives

[TABLE]

which can be interpreted as a waveform $g_{\gamma}$ emitting from each location $t_{j}$ . Invariant scattering moments aggregate the random interference patterns in $|g_{\gamma}\ast Y|$ . The results below show that the expectation of these interference patterns encode important statistical information related to the point process.

For notational convenience, we let

[TABLE]

denote the expected number of points of $N$ in the support of $g_{\gamma}(t-\cdot)$ . By conditioning on $N\left([t-s,t]^{d}\right)$ , the number of points in the support of $g_{\gamma}$ , and using the fact that

[TABLE]

one may obtain the following theorem.111A proof of Theorem 1, as well as the proofs of other theorems stated in this paper, can be found in the appendix

Theorem 1.

Let $\mathbb{E}[|A_{1}|^{p}]<\infty$ , and $\lambda(t)$ be a periodic continuous intensity function satisfying (4). Then for every $t\in\mathbb{R}^{d},$ every $\gamma=(s,\xi)$ such that $s^{d}\|\lambda\|_{\infty}<1,$ and every $m\geq 1,$

[TABLE]

where the error term $\varepsilon(m,s,\xi,t)$ satisfies

[TABLE]

*and $V_{1},V_{2},\ldots$ is an i.i.d. sequence of random variables, independent of the $A_{j}$ , taking values in the unit cube $[0,1]^{d}$ and with density $p_{V}(v)=\frac{s^{d}}{\Lambda_{s}(t)}\lambda(t-vs)$ for $v\in[0,1]^{d}.$ *

If we set $m=1,$ and let $s\rightarrow 0,$ then one may use the fact that a small cube $[t-s,t]^{d}$ has at most one point of $N$ with overwhelming probability to obtain the following result.

Theorem 2.

Let $Y(dt)$ satisfy the same assumptions as in Theorem 1. Let $\gamma_{k}=(s_{k},\xi_{k})$ be a sequence of scale and frequency pairs such that $\lim_{k\rightarrow\infty}s_{k}=0$ . Then

[TABLE]

for all $t$ , and consequently

[TABLE]

This theorem shows that for small scales the scattering moments $S_{\gamma,p}Y(t)$ encode the intensity function $\lambda(t)$ , up to factors depending upon the summary statistics of the charges $(A_{j})_{j=1}^{\infty}$ and the window $w$ . Thus even a one-layer location-dependent scattering network yields considerable information regarding the underlying data generation process.

In the case of ordinary (non-compound) homogeneous Poisson processes, Theorem 2 recovers the constant intensity. For general $\lambda(t)$ and invariant scattering moments, the role of higher-order moments of $\lambda(t)$ is highlighted by considering higher-order expansions (e.g., $m>1$ ) in (6). The next theorem considers second-order expansions and illustrates their dependence on the second moment of $\lambda(t)$ .

Theorem 3.

Let $Y$ satisfy the same assumptions as in Theorem 1. If $(\gamma_{k})_{k\geq 1}=(s_{k},\xi_{k})_{k\geq 1}$ , is a sequence such that $\lim_{k\rightarrow\infty}s_{k}=0$ and $\lim_{k\rightarrow\infty}s_{k}\xi_{k}=L\in\mathbb{R}^{d}$ , then

[TABLE]

where $U_{1}$ , $U_{2}$ are independent uniform random variables on $[0,1]^{d}$ ; and $(V_{k})_{k\geq 1}$ is a sequence of random variables independent of the $A_{j}$ taking values in the unit cube with respective densities, $p_{V_{k}}(v)=\frac{s_{k}^{d}}{\Lambda_{s_{k}}(t)}\lambda(t-vs_{k})$ for $v\in[0,1]^{d}$ .

We note that the scale normalization on the left hand side of (10) is $s^{-2d}$ , compared to a normalization of $s^{-d}$ in Theorem 2. Thus, intuitively, (10) is capturing information at moderately small scales that are larger than the scales considered in Theorem 2. Unlike Theorem 2, which gives a way to compute $m_{1}(\lambda)$ , Theorem 3 does not allow one to compute $m_{2}(\lambda)$ since it would require knowledge of $\Lambda_{s_{k}}(t)$ in addition to the distribution from which the charges $(A_{j})_{j=1}^{\infty}$ are drawn. However, Theorem 3 does show that at moderately small scales the invariant scattering coefficients depend non-trivially on the second moment of $\lambda(t)$ . Therefore, they can be used to distinguish between, for example, an inhomogeneous Poisson point process with intensity function $\lambda(t)$ and a homogeneous Poisson point process with constant intensity.

3.2 Second-Order Scattering Moments of Generalized Poisson Processes

Our next result shows that second-order scattering moments encode higher-order moment information about the $(A_{j})_{j=1}^{\infty}.$

Theorem 4.

Let $Y(dt)$ satisfy the same assumptions as in Theorem 1. Let $\gamma_{k}=(s_{k},\xi_{k})$ and $\gamma_{k}^{\prime}=(s_{k}^{\prime},\xi_{k}^{\prime})$ be sequences of scale-frequency pairs with $s_{k}^{\prime}=cs_{k}$ for some $c>0$ and $\lim_{k\rightarrow\infty}s_{k}\xi_{k}=L\in\mathbb{R}^{d}$ . Let $1\leq p,p^{\prime}<\infty$ and $q=pp^{\prime}.$ Assume $\mathbb{E}|A_{1}|^{q}<\infty,$ and let $K\coloneqq\left\|g_{c,L/c}\ast|g_{1,0}|^{p}\right\|_{p^{\prime}}^{p^{\prime}}$ . Then,

[TABLE]

Theorem 2 shows first-order scattering moments with $p=1$ are not able to distinguish between different types of Poisson point processes at very small scales if the charges have the same first moment. However, Theorem 4 shows second-order scattering moments encode higher-moment information about the charges, and thus are better able to distinguish them (when used in combination with the first-order coefficients). In Sec. 4, we will see first-order invariant scattering moments can distinguish Poisson point processes from self-similar processes if $p=1,$ but may fail to do so for larger values of $p.$

4 Comparison to Self-Similar Processes

We will show first-order invariant scattering moments can distinguish between Poisson point processes and certain self-similar processes, such as $\alpha$ -stable processes, $1<\alpha\leq 2,$ or fractional Brownian motion (fBM). These results generalize those in [14] both by considering more general filters and general $p^{\text{th}}$ scattering moments.

For a stochastic process $X(t),$ $t\in\mathbb{R}$ , we consider the convolution of the filter $g_{\gamma}$ with the noise $dX$ defined by $g_{\gamma}\ast dX(t)\coloneqq\int_{\mathbb{R}}g_{\gamma}(t-u)\,dX(u),$ and define (in a slight abuse of notation) the first-order scattering moments at time $t$ by $S_{\gamma,p}X(t)\coloneqq\mathbb{E}[|g_{\gamma}\ast dX(t)|^{p}]\,.$ In the case where $X(t)$ is a compound, inhomogeneous Poisson (counting) process, $Y=dX$ will be a compound Poisson random measure and these scattering moments will coincide with those defined in (2).

The following theorem analyzes the small-scale first-order scattering moments when $X$ is either an $\alpha$ -stable process, or an fBM. It shows the small-scale asymptotics of the corresponding scattering moments are guaranteed to differ from those of a Poisson point process when $p=1.$ We also note that both $\alpha$ -stable processes and fBM have stationary increments and thus $S_{\gamma,p}X(t)=SX(\gamma,p)$ for all $t$ .

Theorem 5.

Let $1\leq p<\infty,$ and let $\gamma_{k}=(s_{k},\xi_{k})$ be a sequence of scale-frequency pairs with $\lim_{k\rightarrow\infty}s_{k}=0$ and $\lim_{k\rightarrow\infty}s_{k}\xi_{k}=L\in\mathbb{R}$ . Then, if $X(t)$ is a symmetric $\alpha$ -stable process, $p<\alpha\leq 2,$ we have

[TABLE]

Similarly, if $X(t)$ is an fBM with Hurst parameter $H\in(0,1)$ and $w$ has bounded variation on $[0,1],$ then

[TABLE]

This theorem shows that first-order invariant scattering moments distinguish inhomogeneous, compound Poisson processes from both $\alpha$ -stable processes and fractional Brownian motion except in the cases where $p=\alpha$ or $p=1/H$ . In particular, these measurements distinguish Brownian motion, from a Poisson point process except in the case where $p=2$ .

5 Numerical Illustrations

We carry out several experiments to numerically validate the previously stated results. In all of our experiments, we hold the frequency $\xi$ constant while letting $s$ decrease to zero.

Compound Poisson point processes with the same intensities: We generated three homogeneous compound Poisson point processes, all with intensity $\lambda(t)\equiv\lambda_{0}=0.01$ , where the charges $A_{1,j}$ , $A_{2,j}$ , and $A_{3,j}$ are chosen so that $A_{1,j}=1$ uniformly, $A_{2,j}\sim\mathcal{N}(0,\sqrt{\frac{\pi}{2}})$ , and $A_{3,j}$ are Rademacher random variables. The charges of the three signals have the same first moment $\mathbb{E}[|A_{i,j}|]=1$ and different second moment with $\mathbb{E}[|A_{1,j}|^{2}]=\mathbb{E}[|A_{3,j}|^{2}]=1$ and $\mathbb{E}[|A_{2,j}|^{2}]=\frac{\pi}{2}$ . As predicted by Theorem 2, Figure 1 shows first-order scattering moments will not be able to distinguish between the three processes with $p=1$ , but will distinguish the process with Gaussian charges from the other two when $p=2$ .

Inhomogeneous, non-compound Poisson point processes: We also consider an inhomogeneous, non-compound Poisson point processes with intensity function $\lambda(t)=0.01(1+0.5\sin(\frac{2\pi t}{N}))$ (where we estimate $S_{\gamma,p}Y(t)$ , by averaging over 1000 realizations). Figure 2 plots the scattering moments for the inhomogeneous process at different times, and shows they align with the true intensity function.

Poisson point process and self similar process: We consider a Brownian motion compared to a Poisson point process with intensity $\lambda=0.01$ and charges $(A)_{j=1}^{\infty}\equiv 10$ . Figure 3 shows the convergence rate of the first-order scattering moments can distinguish these processes when $p=1$ but not when $p=2.$

6 Conclusion

We have constructed Gabor-filter scattering transforms for random measures on $\mathbb{R}^{d}.$ Our work is closely related to [14] but considers more general classes of filters and point processes (although we note that [14] provides a more detailed analysis of self-similar processes). In future work, it would be interesting to explore the use of these measurements for tasks such as, e.g., synthesizing new signals.

Appendix A Proof of Theorem 1

To prove Theorem 1 we will need the following lemma.

Lemma 1.

Let $Z$ be a Poisson random variable with parameter $\lambda$ . Then for all $\alpha\in\mathbb{R}$ , $m\in\mathbb{N}$ , $0<\lambda<1$ , we have

[TABLE]

Proof.

For $0<\lambda<1$ and $k\in\mathbb{N}$ , $e^{-\lambda}\lambda^{k}\leq 1$ . Therefore,

[TABLE]

∎

The proof of Theorem 1.

Recalling the definitions of $Y(dt)$ and $S_{\gamma,p}Y(t)$ , and setting $N_{s}(t)=N\left([t-s,t]^{d}\right)$ , we see

[TABLE]

where $t_{1},t_{2},\ldots t_{N_{s}(t)}$ are the points $N(t)$ in $[t-s,t]^{d}$ . Conditioned on the event that $N_{s}(t)=k$ , the locations of the $k$ points on $[t-s,t]^{d}$ are distributed as i.i.d. random variables $Z_{1},\ldots,Z_{k}$ taking values in $[t-s,t]^{d}$ with density

[TABLE]

Therefore, the random variables

[TABLE]

take values in the unit cube $[0,1]^{d}$ and have density

[TABLE]

Note that in the special case that $N$ is homogeneous, i.e. $\lambda(t)\equiv\lambda_{0}$ is constant, the $V_{i}$ are uniform random variables on $[0,1]^{d}$ .

Therefore, computing the conditional expectation, we have for $k\geq 1$

[TABLE]

where (14) follows from (i) the independence of the random variables $A_{j}$ and $V_{j}$ ; (ii) the fact that for any sequence of i.i.d. random variables $Z_{1},Z_{2},\ldots$ ,

[TABLE]

and (iii) the fact that

[TABLE]

Therefore, since $\mathbb{P}[N_{s}(t)=k]=e^{-\Lambda_{s}(t)}\cdot\nicefrac{{(\Lambda_{s}(t))^{k}}}{{k!}}$ ,

[TABLE]

where

[TABLE]

By (14) and Lemma 1, if $s$ is small enough so that $\Lambda_{s}(t)\leq s^{d}\|\lambda\|_{\infty}<1$ , then:

[TABLE]

∎

Appendix B Proof of Theorem 2

Proof.

Let $(s_{k},\xi_{k})$ be a sequence of scale and frequency pairs such that $\lim_{k\rightarrow\infty}s_{k}=0$ . Applying Theorem 1 with $m=1$ , we obtain:

[TABLE]

where we write $V_{1,k}=V_{1}$ to emphasize the fact that the density of $V_{1,k}$ is:

[TABLE]

Using the error bound (7), we see that:

[TABLE]

Furthermore, since $0\leq\Lambda_{s_{k}}(t)\leq s_{k}^{d}\|\lambda\|_{\infty}$ , we observe that:

[TABLE]

and by the continuity of $\lambda(t)$ ,

[TABLE]

Finally, by the continuity of $\lambda(t)$ , we see that

[TABLE]

Therefore, by the bounded convergence theorem,

[TABLE]

That completes the proof of (8).

To prove (9), we assume that $\lambda(t)$ is periodic with period $T$ along each coordinate and again use Theorem 1 with $m=1$ to observe,

[TABLE]

By (7), the second integral converges to zero as $k\rightarrow\infty$ . Therefore,

[TABLE]

by the continuity of $\lambda(t)$ and the bounded convergence theorem. ∎

Appendix C Proof of Theorem 3

Proof.

We apply Theorem 1 with $m=2$ and obtain:

[TABLE]

where $V_{i,k}$ , $i=1,2$ , are random variables taking values on the unit cube $[0,1]^{d}$ with densities,

[TABLE]

Dividing both sides in (18) by $s_{k}^{2d}\|w\|_{p}^{p}\mathbb{E}[|A_{1}|^{p}]$ and subtracting $\frac{\Lambda_{s_{k}}(t)}{s_{k}^{2d}}\frac{\mathbb{E}[|w(V_{1,k})|^{p}]}{\|w\|_{p}^{p}}$ yields:

[TABLE]

Using the error bound (7),

[TABLE]

at a rate independent of $t$ . Recalling (16) from the proof of Theorem 2, we use the fact that $\lim_{k\rightarrow\infty}p_{V_{k}}\equiv 1$ and the bounded convergence theorem to conclude,

[TABLE]

where $U_{i}$ , $i=1,2$ , are uniform random variables on the unit cube and $L=\lim_{k\rightarrow\infty}s_{k}\xi_{k}$ . Similarly,

[TABLE]

Lastly, recalling that $s_{k}\rightarrow 0$ as $k\rightarrow\infty$ and using (15) from the proof of Theorem 2, we see

[TABLE]

Now we integrate both sides of (21) over $[0,T]^{d}$ and divide by $T^{d}$ . Taking the limit as $k\rightarrow\infty$ , on the left hand side we get:

[TABLE]

where we used the definition of the invariant scattering moments and (25). On the right hand side of (21), we use (25), (27) and the dominated convergence theorem to see that the first term is:

[TABLE]

Using (15), (23), and the bounded convergence theorem, the second term of (21) is:

[TABLE]

where

[TABLE]

Finally, the third term of (21) goes to zero using the bounded convergence theorem and (22). Putting together the left and right hand sides of (21) with these calculations finishes the proof. ∎

Appendix D Proof of Theorem 4

Proof.

As in the proof of Theorem 1, let $N_{s}(t)=N\left([t-s,t]^{d}\right)$ denote the number of points in the cube $[t-s,t]^{d}$ . Then since the support of $w$ is contained in $[0,1]^{d}$ ,

[TABLE]

where $t_{1},t_{2},\ldots,t_{N_{s_{k}}(t)}$ are the points of $N$ in $[t-s_{k},t]^{d}$ . Therefore, in the event that $N_{s_{k}}(t)=1$ ,

[TABLE]

and so, partitioning the space of possible outcomes based on $N_{s_{k}}(t)$ , we obtain:

[TABLE]

where

[TABLE]

Using the above, we can write the second order convolution term as:

[TABLE]

The following lemma implies that $\left(g_{\gamma_{k}^{\prime}}\ast e_{k}\right)(t)$ decays rapidly in $\mathbf{L}^{p^{\prime}}$ at a rate independent of $t$ .

Lemma 2.

There exists $\delta>0$ , independent of $t$ , such that if $s_{k}<\delta$ ,

[TABLE]

Once we have proved Lemma 2, equation (11) will follow once we show,

[TABLE]

Let us prove (28) first and postpone the proof of Lemma 2. We will use the fact that the support of $g_{\gamma_{k}^{\prime}}\ast|g_{\gamma_{k}}|^{p}$ is contained in $[0,s_{k}+s_{k}^{\prime}]^{d}$ . Let $\tilde{s}_{k}:=s_{k}+s_{k}^{\prime}$ , $N_{k}(t):=N_{\tilde{s}_{k}}(t)$ , $\Lambda_{k}(t):=\Lambda_{\tilde{s}_{k}}(t)$ , and let $t_{1},t_{2},\ldots,t_{N_{k}(t)}$ be the points of $N$ in the cube $[t-\tilde{s}_{k},t]^{d}$ . We have that $\mathbb{P}[N_{k}(t)=n]=e^{-\Lambda_{k}(t)}\frac{(\Lambda_{k}(t))^{n}}{n!}$ , and conditioned on the event that $N_{k}(t)=n$ , the locations of the points $t_{1},\ldots,t_{n}$ are distributed as i.i.d. random variables $Z_{1}(t),\ldots,Z_{n}(t)$ taking values in $[t-\tilde{s}_{k},t]^{d}$ with density $p_{Z(t)}(z)=\frac{\lambda(z)}{\Lambda_{k}(t)}$ . Therefore the i.i.d. random variables $\widetilde{V}_{1}(t),\ldots,\widetilde{V}_{n}(t)$ defined by $\widetilde{V}_{i}(t):=t-Z_{i}(t)$ take values in $[0,\tilde{s}_{k}]^{d}$ and have density

[TABLE]

Now, we condition on $N_{k}(t)$ to see that

[TABLE]

The following lemma will be used to estimate the scaling of the term in (31).

Lemma 3.

For all $t\in\mathbb{R}^{d}$ ,

[TABLE]

Furthermore, there exists $\delta>0$ , independent of $t$ , such that if $s_{k}<\delta$ then

[TABLE]

Proof.

Making a change of variables in both $u$ and $v$ , and recalling the assumption that $s_{k}^{\prime}=cs_{k}$ , we observe that

[TABLE]

The continuity of $\lambda(t)$ implies that

[TABLE]

Furthermore, the assumption $0<\lambda_{\min}\leq\|\lambda\|_{\infty}<\infty$ implies

[TABLE]

Therefore, (33) follows from the dominated convergence theorem and by the observation that the inner integral of (36) is zero unless $v\in[0,1+c]^{d}$ . Equation (34) follows from inserting (37) into (36) and sending $k$ to infinity. ∎

Since

[TABLE]

the independence of $\widetilde{V}_{1}(t)$ and $A_{1}$ , the continuity of $\lambda(t)$ , and Lemma 3 imply that taking $k\rightarrow\infty$ in (31) yields:

[TABLE]

The following lemma shows that (32) is $O\left(s_{k}^{d(p^{\prime}+2)}\right)$ (and converges at a rate independent of $t$ ), and therefore completes the proof of (11) subject to proving Lemma 2.

Lemma 4.

For all $\alpha\in\mathbb{R}$ there exists $\delta>0$ , independent of $t$ , such that if $s_{k}<\delta$ , then

[TABLE]

Proof.

For any sequence of i.i.d. random variables, $Z_{1},Z_{2},\ldots,$ it holds that

[TABLE]

Therefore, by Lemma 1, Lemma 3, and the fact that the $\widetilde{V}_{j}(t)$ and $A_{i}$ are i.i.d. and independent of each other, we see that if $s_{k}<\delta,$ where $\delta$ is as in (34),

[TABLE]

where the last inequality uses the fact that $\Lambda_{k}(t)\leq\tilde{s}_{k}^{d}\|\lambda\|_{\infty}=(1+c)^{d}s_{k}^{d}\|\lambda\|_{\infty}.$ ∎

We will now complete the proof of the theorem by proving Lemma 2.

Proof.

[Lemma 2] Since

[TABLE]

we see that

[TABLE]

First turning our attention to the second term, we note that

[TABLE]

since $N_{s_{k}}(u)\leq N_{s_{k}+s_{k}^{\prime}}(t)=N_{\tilde{s}_{k}}(t)=N_{k}(t)$ for all $u\in[t-s^{\prime}_{k},t]^{d}.$ Therefore, conditioning on $N_{k}(t),$ if $s_{k}<\delta,$

[TABLE]

by Lemma 4. Now, turning our attention to the first term, note that

[TABLE]

Therefore, by the same logic as in (38)

[TABLE]

So again conditioning on $N_{k}(t),$ and applying Lemma 4, we see that if $s_{k}<\delta$

[TABLE]

∎

This completes the proof of (11). Line (12) follows from integrating with respect to $t,$ observing that the error bounds in Lemmas 2 and 3 are independent of $t,$ and applying the bounded convergence theorem.

∎

Appendix E The Proof of Theorem 5

In order to prove Theorems 5, we will need the following lemma which shows that the scaling relationship of a self-similar process $X(t)$ induces a similar relationship on stochastic integrals against $dX(t).$

Lemma 5.

Let $X$ be a stochastic process that satisfies the scaling relation

[TABLE]

for some $\beta>0$ (where $=_{d}$ denotes equality in distribution). Then for any measurable function $f:\mathbb{R}\rightarrow\mathbb{R}$ ,

[TABLE]

Proof.

Let $X=(X(t))_{t\in\mathbb{R}}$ be a stochastic process satisfying (39), and let $\mathcal{P}_{n}=\{0=t_{0}^{n}<t_{1}^{n}<\ldots<t_{K_{n}}^{n}=1\}$ be a sequence of partitions of $[0,1]$ such that

[TABLE]

Then, by the scaling relation (39),

[TABLE]

∎

We will now use Lemma 5 to prove Theorem 5.

Proof.

We first consider the case where $X=(X(t))_{t\in\mathbb{R}}$ is an $\alpha$ -stable process, $p<\alpha\leq 2.$ Since $X$ has stationary increments, its scattering coefficients do not depend on $t$ and it suffices to analyze

[TABLE]

where the second equality uses the fact the distribution of $X$ does not change if it is run in reverse, i.e.

[TABLE]

It is well known that $X(t)$ satisfies (39) for $\beta=\nicefrac{{1}}{{\alpha}}.$ Therefore, by Lemma 5

[TABLE]

So,

[TABLE]

The proof will be complete as soon as we show that

[TABLE]

By the triangle inequality,

[TABLE]

Since $1\leq p<\alpha,$ we may choose $p^{\prime}$ strictly greater than $1$ such that $p\leq p^{\prime}<\alpha,$ and note that by Jensen’s inequality

[TABLE]

and since $X(t)$ is a $p^{\prime}$ -integrable martingale, the boundedness of martingale transforms (see [16] and also [17]) implies

[TABLE]

which converges to zero by the continuity of $w$ on $[0,1]$ and the assumption that $s_{k}\xi_{k}$ converges to $L.$

Similarly, in the case where $(X(t))_{t\in\mathbb{R}}$ is a fractional Brownian motion with Hurst parameter $H,$ we again need to show

[TABLE]

However, fractional Brownian motion is not a semi-martingale so we cannot apply Burkholder’s theorem as we did in the proof of Theorem 5. Instead, we use the Young-Lóeve estimate [18] which states that if $x(u)$ is any (deterministic) function with bounded variation, and $y(u)$ is any function which is $\alpha$ -Hölder continuous, $0<\alpha<1,$ then

[TABLE]

is well-defined as the limit of Riemann sums and

[TABLE]

where $\|\cdot\|_{BV}$ and $\|\cdot\|_{\alpha}$ are the bounded variation and $\alpha$ -Hölder seminorms respectively. For all $k$ , the function $h_{k}(u)\coloneqq w(u)\left(e^{i\xi_{k}s_{k}u}-e^{iLu}\right)\coloneqq w(u)f_{k}(u)$ satisfies, $h_{k}(0)=0$ and

[TABLE]

One can check that the fact that $s_{k}\xi_{k}$ converges to $L$ implies that $f_{k}$ converges to zero in both $\mathbf{L}^{\infty}$ and in the bounded variation seminorm, and that therefore that $\|h_{k}\|_{BV}$ converges to zero.

It is well-known that fractional Brownian motion with Hurst parameter $H$ admits a continuous modification which is $\alpha$ -Hölder continuous for any $\alpha<H.$ Therefore,

[TABLE]

Lastly, one can use the Garsia-Rodemich-Rumsey inequality [19], to show that

[TABLE]

for all $1<p<\infty.$ For details we refer the reader to the survey article [20]. Therefore,

[TABLE]

as desired.

∎

Remark 1.

The assumption that $w$ has bounded-variation was used to justify that the stochastic integral against fractional Brownian motion was well defined as the limit of Riemann sums because of its Hölder continuity and the above mentioned result of [18]. This allowed us to avoid the technical complexities of defining such an integral using either the Malliavin calculus or the Wick product.

Appendix F Details of Numerical Experiments

F.1 Definition of Filters

For all the numerical experiments, we take the window function $w$ to be the smooth bump function

[TABLE]

Therefore for $\gamma=(s,\xi)$ , our filters are given by

[TABLE]

F.2 Frequencies

In all of our experiments, we hold the frequency, $\xi,$ which we sample uniformly at random from $(0,2\pi),$ constant while allowing the scale to decrease to zero.

F.3 Simulation of Poisson point process

We use the standard method to generate a realization of a Poisson point process. For Poisson point process with intensity $\lambda$ , the time interval between two neighbor jumps follows exponential distribution:

[TABLE]

Therefore, taking the inverse cumulative distribution function, we sample the time interval between two neighbor jumps through:

[TABLE]

where $U_{j}$ are i.i.d. uniform random variables on $[0,1],$ and assign the charge $A_{j}$ to the jump at location $t_{j}$ .

For inhomogeneous Poisson process with intensity funciton $\lambda(t)$ , we simulate the time interval based on a well-known algorithm. We, first define the cumulated intensity:

[TABLE]

then generate the location of jumps $t_{j}$ by the Algorithm 1.

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Laurent Sifre and Stéphane Mallat, “Rotation, scaling and deformation invariant scattering for texture discrimination,” in The IEEE Conference on Computer Vision and Pattern Recognition (CVPR) , June 2013.
2[2] Leon Gatys, Alexander S Ecker, and Matthias Bethge, “Texture synthesis using convolutional neural networks,” in Advances in Neural Information Processing Systems 28 , 2015, pp. 262–270.
3[3] Joseph Antognini, Matt Hoffman, and Ron J. Weiss, “Synthesizing diverse, high-quality audio textures,” ar Xiv:1806.08002, 2018.
4[4] Mikolaj Binkowski, Gautier Marti, and Philippe Donnat, “Autoregressive convolutional neural networks for asynchronous time series,” in Proceedings of the 35th International Conference on Machine Learning , Jennifer Dy and Andreas Krause, Eds., Stockholmsmässan, Stockholm Sweden, 10–15 Jul 2018, vol. 80 of Proceedings of Machine Learning Research , pp. 580–589, PMLR.
5[5] Antoine Brochard, Bartłomiej Błaszczyszyn, Stéphane Mallat, and Sixin Zhang, “Statistical learning of geometric characteristics of wireless networks,” ar Xiv:1812.08265, 2018.
6[6] Martin Haenggi, Jeffrey G. Andrews, François Baccelli, Olivier Dousse, and Massimo Franceschetti, “Stochastic geometry and random graphs for the analysis and design of wireless networks,” IEEE Journal on Selected Areas in Communications , vol. 27, no. 7, pp. 1029–1046, 2009.
7[7] Astrid Genet, Pavel Grabarnik, Olga Sekretenko, and David Pothier, “Incorporating the mechanisms underlying inter-tree competition into a random point process model to improve spatial tree pattern analysis in forestry,” Ecological Modelling , vol. 288, pp. 143–154, 09 2014.
8[8] Frederic Paik Schoenberg, “A note on the consistent estimation of spatial-temporal point process parameters,” Statistica Sinica , 2016.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Scattering Statistics of Generalized Spatial Poisson Point Processes

Abstract

1 Introduction

2 Expected Scattering Moments

3 Scattering Moments of Generalized Poisson Processes

3.1 First-order Scattering Asymptotics

Theorem 1**.**

Theorem 2**.**

Theorem 3**.**

3.2 Second-Order Scattering Moments of Generalized Poisson Processes

Theorem 4**.**

4 Comparison to Self-Similar Processes

Theorem 5**.**

5 Numerical Illustrations

6 Conclusion

Appendix A Proof of Theorem 1

Lemma 1**.**

Proof.

The proof of Theorem 1.

Appendix B Proof of Theorem 2

Proof.

Appendix C Proof of Theorem 3

Proof.

Appendix D Proof of Theorem 4

Proof.

Lemma 2**.**

Lemma 3**.**

Proof.

Lemma 4**.**

Proof.

Proof.

Appendix E The Proof of Theorem 5

Lemma 5**.**

Proof.

Proof.

Remark 1**.**

Appendix F Details of Numerical Experiments

F.1 Definition of Filters

F.2 Frequencies

F.3 Simulation of Poisson point process

Theorem 1.

Theorem 2.

Theorem 3.

Theorem 4.

Theorem 5.

Lemma 1.

Lemma 2.

Lemma 3.

Lemma 4.

Lemma 5.

Remark 1.