Shrinking scale equidistribution for monochromatic random waves on   compact manifolds

Matthew de Courcy-Ireland

arXiv:1902.05271·math.PR·February 15, 2019

Shrinking scale equidistribution for monochromatic random waves on compact manifolds

Matthew de Courcy-Ireland

PDF

TL;DR

This paper proves that monochromatic random waves on compact manifolds become uniformly distributed at very small scales, nearly matching the optimal wave scale, with high probability, using spectral and probabilistic tools.

Contribution

It establishes shrinking scale equidistribution for monochromatic random waves on any compact manifold, extending previous results to more general settings.

Findings

01

Equidistribution occurs at near-optimal wave scales.

02

High probability of uniform distribution across the manifold.

03

Uses Weyl's law and Chernoff bounds for proof.

Abstract

We prove equidistribution at shrinking scales for the monochromatic ensemble on a compact Riemannian manifold of any dimension. This ensemble on an arbitrary manifold takes a slowly growing spectral window in order to synthesize a random function. With high probability, equidistribution takes place close to the optimal wave scale and simultaneously over the whole manifold. The proof uses Weyl's law to approximate the two-point correlation function of the ensemble, and a Chernoff bound to deduce concentration.

Equations212

Δ ϕ_{j} + t_{j}^{2} ϕ_{j} = 0.

Δ ϕ_{j} + t_{j}^{2} ϕ_{j} = 0.

ϕ (x) = T - η (T) \leq t_{j} < T \sum c_{j} ϕ_{j} (x)

ϕ (x) = T - η (T) \leq t_{j} < T \sum c_{j} ϕ_{j} (x)

P {z sup \frac{1}{vol ( B _{r} ( z ))} \int_{B_{r} (z)} ∣ ϕ ∣^{2} - E [\frac{1}{vol ( B _{r} ( z ))} \int_{B_{r} (z)} ∣ ϕ ∣^{2}] \geq ε} \to 0.

P {z sup \frac{1}{vol ( B _{r} ( z ))} \int_{B_{r} (z)} ∣ ϕ ∣^{2} - E [\frac{1}{vol ( B _{r} ( z ))} \int_{B_{r} (z)} ∣ ϕ ∣^{2}] \geq ε} \to 0.

C_{\varepsilon}T^{n}\exp\left(-c(\varepsilon)\big{(}(rT)^{-(n-1)/2}+\eta^{-1}\big{)}^{-1}\right).

C_{\varepsilon}T^{n}\exp\left(-c(\varepsilon)\big{(}(rT)^{-(n-1)/2}+\eta^{-1}\big{)}^{-1}\right).

\int_{A} ∣ ϕ_{λ} ∣^{2} d vol \to vol (A)

\int_{A} ∣ ϕ_{λ} ∣^{2} d vol \to vol (A)

K (x, x^{'}) = T - η < t_{j} \leq T \sum ϕ_{j} (x) \overline{ϕ_{j} (x^{'})} .

K (x, x^{'}) = T - η < t_{j} \leq T \sum ϕ_{j} (x) \overline{ϕ_{j} (x^{'})} .

E [ϕ (x) \overline{ϕ (x^{'})}] = j \sum k \sum ϕ_{j} (x) \overline{ϕ_{k} (x^{'})} E [c_{j} c_{k}] = σ^{2} K (x, x^{'}) .

E [ϕ (x) \overline{ϕ (x^{'})}] = j \sum k \sum ϕ_{j} (x) \overline{ϕ_{k} (x^{'})} E [c_{j} c_{k}] = σ^{2} K (x, x^{'}) .

E [\frac{1}{vol ( M )} \int_{M} ∣ ϕ ∣^{2}] = 1.

E [\frac{1}{vol ( M )} \int_{M} ∣ ϕ ∣^{2}] = 1.

σ^{2} = \frac{vol ( M )}{\int _{M} K ( x , x ) d x} = \frac{vol ( M )}{\sum \int _{M} ∣ ϕ _{j} ∣ ^{2}} .

σ^{2} = \frac{vol ( M )}{\int _{M} K ( x , x ) d x} = \frac{vol ( M )}{\sum \int _{M} ∣ ϕ _{j} ∣ ^{2}} .

j \sum \int_{M} ϕ_{j}^{2} = # {j; T - η (T) \leq t_{j} \leq T} = N .

j \sum \int_{M} ϕ_{j}^{2} = # {j; T - η (T) \leq t_{j} \leq T} = N .

σ^{2} = var [c] = \frac{vol ( M )}{N} ≍ N^{- 1} .

σ^{2} = var [c] = \frac{vol ( M )}{N} ≍ N^{- 1} .

E [\int_{B} ∣ ϕ ∣^{2}] = σ^{2} \int_{B} K (x, x) d x = vol (B) \frac{\int _{B} K ( x , x ) d x / vol ( B )}{\int _{M} K ( x , x ) d x / vol ( M )}

E [\int_{B} ∣ ϕ ∣^{2}] = σ^{2} \int_{B} K (x, x) d x = vol (B) \frac{\int _{B} K ( x , x ) d x / vol ( B )}{\int _{M} K ( x , x ) d x / vol ( M )}

\sigma^{2}\int_{B}K(x,x)dx=\operatorname{vol}(B)\sigma^{2}\left(\frac{N}{\operatorname{vol}(M)}+O(T^{n-1})\right)=\operatorname{vol}(B)\left(1+O\big{(}\eta^{-1}\big{)}\right)

\sigma^{2}\int_{B}K(x,x)dx=\operatorname{vol}(B)\sigma^{2}\left(\frac{N}{\operatorname{vol}(M)}+O(T^{n-1})\right)=\operatorname{vol}(B)\left(1+O\big{(}\eta^{-1}\big{)}\right)

X_{z} = \frac{1}{vol ( B _{r} ( z ))} \int_{B_{r} (z)} ∣ ϕ ∣^{2} .

X_{z} = \frac{1}{vol ( B _{r} ( z ))} \int_{B_{r} (z)} ∣ ϕ ∣^{2} .

P {∣ X_{z} - E X_{z} ∣ > ε for some z} \leq (number of points) z max P {∣ X_{z} - E X_{z} ∣ > ε} .

P {∣ X_{z} - E X_{z} ∣ > ε for some z} \leq (number of points) z max P {∣ X_{z} - E X_{z} ∣ > ε} .

∣ X_{z} - E [X_{z}] ∣ > ε .

∣ X_{z} - E [X_{z}] ∣ > ε .

ε < ∣ X_{z} - X_{z_{j}} ∣ + ∣ X_{z_{j}} - E [X_{z_{j}}] ∣ + ∣ E [X_{z_{j}}] - E [X_{z}] ∣

ε < ∣ X_{z} - X_{z_{j}} ∣ + ∣ X_{z_{j}} - E [X_{z_{j}}] ∣ + ∣ E [X_{z_{j}}] - E [X_{z}] ∣

∣ E [X_{z_{j}}] - E [X_{z}] ∣

∣ E [X_{z_{j}}] - E [X_{z}] ∣

≲ \frac{vol ( B _{r} ( z ) Δ B _{r} ( z ))}{vol ( B _{r} )}

vol (B_{r} (z) Δ B_{r} (z^{'})) ≲ r^{n - 1} d (z, z^{'}) .

vol (B_{r} (z) Δ B_{r} (z^{'})) ≲ r^{n - 1} d (z, z^{'}) .

vol (B Δ B^{'}) ≲ vol (B) + vol (B^{'}) ≲ r^{n} .

vol (B Δ B^{'}) ≲ vol (B) + vol (B^{'}) ≲ r^{n} .

∣ E [X_{z_{j}}] - E [X_{z}] ∣ ≲ \frac{r ^{n - 1} T ^{- 1}}{r ^{n}} = \frac{1}{r T} .

∣ E [X_{z_{j}}] - E [X_{z}] ∣ ≲ \frac{r ^{n - 1} T ^{- 1}}{r ^{n}} = \frac{1}{r T} .

\int_{B} ∣ ϕ ∣^{2} - \int_{B^{'}} ∣ ϕ ∣^{2} ≲ \int_{B Δ B^{'}} ∣ ϕ ∣^{2} ≲ vol (B Δ B^{'}) ∥ ϕ ∥_{\infty}^{2} .

\int_{B} ∣ ϕ ∣^{2} - \int_{B^{'}} ∣ ϕ ∣^{2} ≲ \int_{B Δ B^{'}} ∣ ϕ ∣^{2} ≲ vol (B Δ B^{'}) ∥ ϕ ∥_{\infty}^{2} .

\frac{ε}{3} ≲ r^{- n} (r^{n - 1} T^{- 1} ∥ ϕ ∥_{\infty}^{2}) .

\frac{ε}{3} ≲ r^{- n} (r^{n - 1} T^{- 1} ∥ ϕ ∥_{\infty}^{2}) .

∥ ϕ ∥_{\infty} ≳ ε r T .

∥ ϕ ∥_{\infty} ≳ ε r T .

P (∥ ϕ ∥_{\infty} \geq c ε r T) ≲ T^{n} exp (- c^{'} ε r T)

P (∥ ϕ ∥_{\infty} \geq c ε r T) ≲ T^{n} exp (- c^{'} ε r T)

X_{z} = \frac{1}{vol ( B )} \int_{B} ∣ ϕ ∣^{2} = j \sum k \sum c_{j} c_{k} \frac{1}{vol ( B )} \int_{B} ϕ_{j} \overline{ϕ_{k}} .

X_{z} = \frac{1}{vol ( B )} \int_{B} ∣ ϕ ∣^{2} = j \sum k \sum c_{j} c_{k} \frac{1}{vol ( B )} \int_{B} ϕ_{j} \overline{ϕ_{k}} .

X_{z} = z^{T} A z

X_{z} = z^{T} A z

A_{j k} = \frac{σ ^{2}}{vol ( B )} \int_{B} ϕ_{j} \overline{ϕ_{k}} .

A_{j k} = \frac{σ ^{2}}{vol ( B )} \int_{B} ϕ_{j} \overline{ϕ_{k}} .

X_{z} = z^{T} A z = (U z)^{T} D (U z) = j \sum λ_{j} y_{j}^{2}

X_{z} = z^{T} A z = (U z)^{T} D (U z) = j \sum λ_{j} y_{j}^{2}

g (s) = E [e^{s z^{T} A z}] = j = 1 \prod N (1 - 2 s λ_{j})^{- 1/2}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Shrinking scale equidistribution for monochromatic random waves on compact manifolds

Matthew de Courcy-Ireland

Department of Mathematics

Princeton University

Princeton NJ 08544

[email protected]

(Date: February 14, 2019)

Abstract.

We prove equidistribution at shrinking scales for the monochromatic ensemble on a compact Riemannian manifold of any dimension. This ensemble on an arbitrary manifold takes a slowly growing spectral window in order to synthesize a random function. With high probability, equidistribution takes place close to the optimal wave scale and simultaneously over the whole manifold. The proof uses Weyl’s law to approximate the two-point correlation function of the ensemble, and a Chernoff bound to deduce concentration.

1. Introduction

Consider a compact manifold $M$ together with a Riemannian metric $g$ . By compactness, the spectrum of the Laplacian is a discrete sequence of eigenvalues $0=t_{0}^{2}\leq t_{1}^{2}\leq t_{2}^{2}\leq\ldots\rightarrow\infty$ , possibly with multiplicity. The corresponding eigenfunctions $\phi_{j}:M\rightarrow\mathbb{R}$ satisfy

[TABLE]

These eigenfunctions form an orthonormal basis for $L^{2}(M)$ , the $L^{2}$ space with respect to integration against the volume form of $g$ . Thus one can expand functions in terms of the Laplace eigenfunctions, and a natural model for a random function on $M$ is to randomize the coefficients in such an expansion. The monochromatic ensemble takes the specific form

[TABLE]

where the coefficients $c_{j}$ are independent, identically distributed Gaussian random variables of mean 0. The parameter $T$ is large. If the window $\eta(T)$ is short compared to $T$ , then $\phi(x)$ is a stand-in for a “random eigenfunction” with eigenvalue $T^{2}$ . The problem with literally taking a random eigenfunction is that when an eigenvalue has multiplicity 1, the random function would simply be a deterministic function multiplied by a random scalar.

Consider a ball $B=B_{r}(z)$ with center $z\in M$ whose radius $r>0$ is allowed to vary with $T$ . We can normalize so that $\int_{B}\phi^{2}$ , in expectation, is close to $\operatorname{vol}(B)$ .

Theorem 1.

If $rT/\log(T)\rightarrow\infty$ (or in case $\dim{M}=2,\ rT/\log(T)^{2}\rightarrow\infty$ ) and the spectral window obeys $\eta(T)/\log(T)\rightarrow\infty$ and $\eta(T)\lesssim T^{1/2}$ , then for any $\varepsilon>0$ ,

[TABLE]

The wave scale $1/T$ is the natural wavelength of an eigenfunction with Laplace eigenvalue $T^{2}$ , also called the Planck scale or de Broglie wavelength. At such a fine scale, there could be a large discrepancy between $\int_{B}|\phi^{2}|$ and $\operatorname{vol}(B)$ . For instance, $\int_{B}\phi^{2}$ may be much larger than $\operatorname{vol}(B)$ if $\phi$ achieves its maximum inside $B$ . The hypothesis of Theorem 1 is that $r$ is large compared to the wave scale in the sense that $rT/\log(T)\rightarrow\infty$ . We then conclude there is only a small deviation even in the worst case over all centers $z$ . The assumption is a relatively mild one, as it allows $rT/\log(T)$ to grow arbitrarily slowly so that Theorem 1 takes place almost at the wave scale.

Theorem 1 follows from a more explicit bound: for any $\varepsilon>0$ , there are positive $C_{\varepsilon}$ and $c(\varepsilon)$ such that the probability of an $\varepsilon$ -deviation occurring somewhere on $M$ is at most

[TABLE]

The factor $T^{n}$ in (1.3) arises from taking a union bound over roughly $T^{n}$ points, separated pairwise by a distance $1/T$ . The exponential factor is an upper bound for the probability of a deviation at a single point. Under the assumption that $\eta$ and $rT$ grow faster than logarithmically, the factor $T^{n}$ can be absorbed into the exponential and Theorem 1 follows. We describe the union bound in more detail in Section 3. Section 4 uses a Chernoff bound to estimate the probability of a deviation at a single point. The result is expressed in terms of the variance of the local integrals $\int_{B}\phi^{2}$ , which we estimate in Lemma 4. The key input is the Local Weyl Law for Laplace eigenfunctions, in a form proved by Canzani and Hanin [6] and described in Section 5. This is used to estimate the two-point correlation function of $\phi$ , defined in Section 2. We complete the proof of (1.3) in Sections 6 and 7. Section 8 concludes with some further questions and a lemma that applies if the coefficients in (1.2) are not necessarily Gaussian.

To have a model for random eigenfunctions, the window $\eta$ should be as small as possible, so it is not a serious restriction to assume that $\eta\lesssim T^{1/2}$ in Theorem 1. This assumption is convenient for stating simplified estimates, but the arguments below could still be implemented as long as $\eta=o(T/\log{T})$ .

We mainly have in mind real-valued functions $\phi_{j}:M\rightarrow\mathbb{R}$ , but we write absolute values in Theorem 1 because a similar statement holds for complex-valued functions as well. However, the complex version is not as sharp since complex eigenfunctions may equidistribute at even smaller scales than their real counterparts. For instance, on the circle $M=S^{1}$ , $e^{iTx}$ is uniform at all scales because its modulus is identically 1, whereas $\cos(Tx)$ is limited by the wave scale $1/T$ . Nevertheless, the notation below will involve complex conjugates in order to include the complex case. It would also be appropriate to take Gaussians in the complex plane if one were interested in the complex case, instead of the real coefficients $c_{j}$ . This can be incorporated into the same proof as for the real case, since a single complex Gaussian is equivalent to two independent real Gaussians.

To provide some context for Theorem 1, consider the property of quantum unique ergodicity (QUE). By QUE for a Riemannian manifold $M$ , we mean that for any fixed measurable subset $A$ of $M$ ,

[TABLE]

for any sequence of Laplace eigenfunctions $\phi_{\lambda}$ with growing eigenvalue $\lambda\rightarrow\infty$ . There is a further question of the distribution of the microlocal lifts of $|\phi|^{2}d\operatorname{vol}$ to phase space $S^{*}M$ , but we confine our attention to the base space $M$ . If (1.4) holds along a full subsequence of eigenfunctions, the manifold enjoys quantum ergodicity but may lack uniqueness of quantum limits. The quantum ergodicity theorem proved by Shnirelman [25, 26], Colin de Verdière [8], and Zelditch [28] shows that negative curvature implies quantum ergodicity. Rudnick and Sarnak conjecture that the stronger property of QUE is true on any compact negatively curved surface [24]. This has been shown for examples of arithmetic origin in work of Lindenstrauss [22, 23], and Bourgain-Lindenstrauss [4], Jakobson [19], Holowinsky [17], and Holowinsky-Soundararajan [16]. For a general metric, work of Anantharaman [1], Anantharaman-Nonnenmacher [2], Anantharaman-Silberman [3], and Dyatlov-Jin [10] places constraints on the measures that arise as quantum limits but it remains unknown whether the uniform measure is the only possibility.

From this point of view, it is of interest to randomize and see whether one at least has uniform distribution with high probability. VanderKam [27] showed that one does have equidistribution for random spherical harmonics on the sphere, where QUE is known to fail. A more refined question is whether there is equidistribution even if the test set $A$ shrinks as the frequency grows. This scenario has been studied recently in papers of Han [12] (assuming high multiplicity), Han-Tacy [13] (with a spectral window instead of high multiplicity), Granville-Wigman [11] (on an arithmetic torus guaranteeing high multiplicity), Lester-Rudnick [21] (on higher-dimensional tori), Humphries [18] (for non-random functions on arithmetic surfaces, with the averaging being done over the sphere center instead). In particular, Theorem 4.4 from Han-Tacy [13] estimates the probability that there is some point with a given deviation, much like our Theorem 1 but in a different context. In [13], instead of fluctuating near 1, $\int_{M}\phi^{2}$ is conditioned to be exactly 1. This is more natural for the quantum interpretation, but the corresponding coefficients in (1.2) are no longer independent random variables, and Han-Tacy treat this with an elegant application of Lévy’s concentration of measure in high-dimensional spheres. The radius in [13] is $r=T^{-p}$ with $p$ close to $1/2$ , whereas we take $r$ equal to $T^{-1}$ up to a logarithmic power. Thus Theorem 1 is closer to the wave scale, but in the easier case of a fixed $\varepsilon>0$ instead of the shrinking deviation from [13].

2. Two-point function

A fundamental quantity governing the statistics of random functions of the form (1.2) is the two-point function of the ensemble, given by

[TABLE]

At each point, $\phi(x)$ is a Gaussian of mean zero, and it is $K(x,x^{\prime})$ that records the correlation of these random variables at different points on the manifold. Indeed, suppose the coefficients $c_{j}$ in (1.2) are independent with mean 0 and variance $\sigma^{2}=\mathbb{E}[c_{j}^{2}]$ . We then have

[TABLE]

A natural normalization is to require

[TABLE]

To arrange this, the variance of the coefficients must be

[TABLE]

The basis functions are orthonormal in $L^{2}(M)$ , so the denominator is just the number of eigenvalues in the interval, say $N$ :

[TABLE]

Thus we choose the variance of the coefficients to be

[TABLE]

For other sets $B\subseteq M$ , we then have

[TABLE]

In the homogeneous case, $K(x,x)$ is independent of $x$ and the expectation is simply $\operatorname{vol}(B)$ . In general, it is never very far from $\operatorname{vol}(B)$ , as we will see from Weyl’s law:

[TABLE]

3. Outline of the proof: Union bound

To prove Theorem 1, we follow the strategy of [9]. We write the random variable of interest as

[TABLE]

It has expectation $\mathbb{E}[X_{z}]=1+O(\eta^{-1})$ of order 1. The key point is that for a monochromatic wave $\phi$ of frequency $T$ , the modulus of continuity at scale $1/T$ is under control. This allows one to replace the supremum over all $z\in M$ by a maximum over roughly $T^{n}$ sample points, where $n=\dim(M)$ . The union bound is that for a finite number of points $z$

[TABLE]

For our application, the number of points is proportional to $T^{n}$ . By the union bound, there will be only a $o(1)$ probability of there being some point $z$ at which a deviation of $\varepsilon$ occurs, provided the probability of a deviation at any single point $z$ is $o(T^{-n})$ . Thus the union bound reduces the problem to a calculation at a single point. That calculation can be done by a Chernoff bound.

Passing to the grid brings with it another error: Conceivably the integrals around all the gridpoints are within $\varepsilon$ of their average, but nevertheless the integral around some point off the grid differs considerably. We must show that this “off-grid” error occurs with only a low probability.

To be more precise, suppose there is a point $z$ such that

[TABLE]

Take a grid of points $z_{j}$ such that every point of $M$ is within $1/T$ of a gridpoint. The number of gridpoints is thus of order $T^{n}$ . We have

[TABLE]

Thus one of the three terms must be greater than $\varepsilon/3$ . The difference of expected values is non-random and small: Both are $1+O(\eta^{-1})$ , so their difference is $O(\eta^{-1})$ . Eventually, this will not be greater than $\varepsilon/3$ since we assume $\eta(T)\rightarrow\infty$ . Alternatively, note that

[TABLE]

To bound the volume of the symmetric difference, we have the following claim.

Claim 2.

If $B_{r}(z)$ and $B_{r}(z^{\prime})$ are balls of radius $r\rightarrow 0$ centered at points $z,z^{\prime}$ separated by less than $r$ in a Riemannian manifold of dimension $n$ ,

[TABLE]

Proof.

Indeed, for small radii $r$ , we can compare to Euclidean balls or simply to a Euclidean box with $n-1$ sidelengths of order $r$ and a remaining side of order $s=d(z,z^{\prime})$ . The bound $r^{n-1}s$ holds for larger separations as well, but becomes worse than the easier bound

[TABLE]

∎

With a separation of less than $1/T$ between $z$ and $z_{j}$ , we therefore have

[TABLE]

Assuming $rT\rightarrow\infty$ , this term will be less than $\varepsilon/3$ . Thus the difference of expected values will eventually be less than $\varepsilon/3$ whether we assume $\eta\rightarrow\infty$ or $rT\rightarrow\infty$ (and later, we will assume that both of them diverge faster than logarithmically). In the case of an $\varepsilon$ -difference of $\int_{B}|\phi|^{2}$ from its mean, it is one of the other two terms $|X_{z}-X_{z_{j}}|$ or $|X_{z_{j}}-\mathbb{E}[X_{z_{j}}]|$ that must be greater than $\varepsilon/3$ (and in fact, almost greater than $\varepsilon/2$ once $rT$ and $\eta$ are large enough).

Suppose it is the integrals around $z$ versus $z^{\prime}=z_{j}$ that differ by more than $\varepsilon/3$ . We have

[TABLE]

Since $d(z,z_{j})<1/T$ , the same volume bound as above gives

[TABLE]

That is,

[TABLE]

To control the probability of $\phi$ having such a large maximum, we use another union bound. More precise estimates of $\|\phi\|_{\infty}$ have been given by Burq-Lebeau [5] and Canzani-Hanin [6], but we include the following sketch to keep the present argument self-contained. Again, take a grid of roughly $T^{n}$ points. Either there is a gridpoint $w_{j}$ at which $|\phi(w_{j})|\geq C\sqrt{\varepsilon rT}$ or else there are two points separated by only $1/T$ at which the values of $\phi$ differ by at least $C\sqrt{\varepsilon rT}$ . The latter is very unlikely because $1/T$ is the wave scale for $\phi$ . Whereas the values $\phi(w)$ are Gaussian with unit variance, the derivatives of $\phi$ are Gaussian with variance $T^{2}$ , so a difference of $C\sqrt{\varepsilon rT}$ between points separated by only $1/T$ would require $\phi$ to have some directional derivative more than $\sqrt{\varepsilon rT}$ standard deviations above its mean. This occurs with probability less than $\exp(-c\varepsilon rT)$ . Likewise, having $|\phi(w_{j})|\geq C\sqrt{\varepsilon rT}$ requires a Gaussian to be more than $\sqrt{\varepsilon rT}$ standard deviaions above its mean. From the union bound,

[TABLE]

which is negligible as long as $rT/\log(T)\rightarrow\infty$ . Thus we can move to the final case: The probability that an integral around any single point shows a deviation of more than $\varepsilon/3$ .

4. Chernoff bound

Each variable $X_{z}$ is a quadratic form in the coefficients $c_{j}$ . Writing $B=B_{r}(z)$ , we have

[TABLE]

We scale by the variance to write $c_{j}=\sigma\mathfrak{z}_{j}$ , where $\mathfrak{z}_{j}$ is a standard Gaussian of mean 0 and variance 1. Thus

[TABLE]

where the matrix $A$ has entries

[TABLE]

Note that this matrix depends on $z$ , as well as $r$ and $T$ , but we have suppressed this in the notation. Since $A$ is a symmetric matrix, or Hermitian if we prefer to start from complex-valued eigenfunctions $\phi_{j}$ , we may diagonalize to write $A=U^{T}DU$ where $U$ is orthogonal (or unitary, in the complex case) and $D$ is diagonal with entries, say, $\lambda_{j}$ . In eigencoordinates, the random variable $X_{z}$ becomes

[TABLE]

where $y=U\mathfrak{z}$ is again a standard Gaussian vector.

Evaluating a Gaussian integral, it follows that the moment generating function of a quadratic form $\mathfrak{z}^{T}A\mathfrak{z}$ in standard Gaussians $\mathfrak{z}=(\mathfrak{z}_{1},\ldots,\mathfrak{z}_{N})$ is

[TABLE]

where $\lambda_{j}$ are the eigenvalues of $A$ . In the complex case, each factor effectively occurs twice because of the real and imaginary parts of $y_{j}$ , leading to $(1-2s\lambda_{j})^{-1}$ instead of $(1-2s\lambda_{j})^{-1/2}$ . One has convergence in (4.5) as long as $1-2s\lambda_{j}>0$ for all $j$ , so $s$ must be small enough. Specifically, $g(s)$ is defined for $s<1/(2\lambda_{\max})$ , where $\lambda_{\max}$ is the largest eigenvalue of $A$ .

Estimates for $g(s)$ allow us to execute a Chernoff bound on the tail probability. For any $s>0$ , $X>\mathbb{E}[X]+\varepsilon$ if and only if $e^{sX}>e^{s\mathbb{E}[X]+s\varepsilon}$ , so by Markov’s inequality

[TABLE]

In the case at hand, where $X=\mathfrak{z}^{T}A\mathfrak{z}$ , we have

[TABLE]

Expanding the logarithm in a power series (provided $2s\lambda_{\max}<1$ ), we have

[TABLE]

The term $p=1$ contributes $s\sum_{j}\lambda_{j}=s\mathbb{E}[X]$ . This cancels the expected value above so that

[TABLE]

We would like to minimize the sum of the first two terms by choosing

[TABLE]

but it is not clear whether $2s_{?}\lambda_{\max}<1$ , that is, whether $g(s_{?})$ is defined. We would need to know that

[TABLE]

at least for sufficiently small $\varepsilon$ . In the case of the manifold $S^{2}$ with its usual round metric, we were able to show in [9] that $\lambda_{\max}$ and $\sum\lambda_{j}^{2}$ are of the same order of magnitude, so that this holds once $\varepsilon$ is small enough. Here, we choose a different $s$ to guarantee that $2s\lambda_{\max}<1$ , namely

[TABLE]

where $c<1/2$ . Note that $\lambda_{\max}\leq\sqrt{\sum\lambda_{j}^{2}}$ , so that this is a valid choice of $s$ .

Claim 3.

For this choice $s=c/\sqrt{\sum\lambda_{j}^{2}}$ , where $0<c<1/2$ , we have

[TABLE]

where $A$ can be taken as $2c^{2}/(1-2c)^{2}$ .

Proof.

Indeed, this follows from Taylor’s theorem. For a twice differentiable function $f$ , we have

[TABLE]

Applied to the function $f(x)=-\log(1-x)$ , this gives

[TABLE]

In particular, for $x\leq a$ we have

[TABLE]

so we may take $A=(1-a)^{-2}$ to have a bound valid for all $x$ up to $a$ . We take $x=2s\lambda_{j}$ where $s=c(\sum\lambda_{j}^{2})^{-1/2}$ with $0<c<1/2$ . These values of $x$ are at most

[TABLE]

Taylor’s theorem then gives

[TABLE]

Summing over $j$ and dividing by 2, we get

[TABLE]

Hence, noting again that $\sum_{j}\lambda_{j}=\mathbb{E}[X]$ , we have proved the claim. ∎

With this estimate in hand, we can bound the tail probability as follows:

[TABLE]

The lower tail, where $X<\mathbb{E}[X]-\varepsilon$ , is slightly different but can be treated by the same method. We have $X<\mathbb{E}[X]-\varepsilon$ if and only if $-X>\mathbb{E}[-X]+\varepsilon$ , so we can apply the argument above with $-X$ in place of $X$ . Instead of $g(s)$ , the relevant function for the Chernoff bound is

[TABLE]

This function $g_{-}(s)$ is defined for all $s\geq 0$ whereas $g(s)$ is defined only for sufficiently small $s$ . The Chernoff bound is

[TABLE]

We have $-\log(1+x)\leq-x+x^{2}/2$ for all $x\geq 0$ , so that

[TABLE]

where we choose $s=c\big{(}\sum\lambda_{j}^{2}\big{)}^{-1/2}$ as above. This shows that the lower tail probability obeys the same bound as the upper tail probability, namely

[TABLE]

In fact, since $g_{-}(s)$ is defined for all $s$ , we could simply choose $s=s_{?}$ to get an even better bound. This doesn’t help us though, since we control both upper and lower tail together by the sum of their respective bounds:

[TABLE]

for any $c<1/2$ .

In order to take advantage of this, we need an estimate on the second moment $\sum\lambda_{j}^{2}$ .

Lemma 4.

[TABLE]

We will prove the lemma using estimates for the two-point function $K(x,x^{\prime})$ . We have

[TABLE]

The trace $\operatorname{tr}(A^{2})$ , and also the trace of any power of $A$ , can be expressed in terms of $K(x,x^{\prime})$ as follows.

Recall that

[TABLE]

Since the $(j,k)$ -entry of $A$ is

[TABLE]

the entries of $A^{p}$ are

[TABLE]

When we sum the diagonal entries, we get

[TABLE]

We can equally well express this product of integrals as one multiple integral:

[TABLE]

The integrand factors:

[TABLE]

We summarize this as follows:

Lemma 5.

If $A$ is the matrix with entries

[TABLE]

and $K$ is the kernel given by

[TABLE]

then

[TABLE]

with the indices interpreted cyclically so that $x_{0}$ means $x_{p}$ .

In particular, with $p=2$ , we have

[TABLE]

5. Input from semiclassics

To prove the variance estimate in Lemma 4 , we need to know the size of $K(x,x^{\prime})$ . Here is the basic estimate:

Claim 6.

On a compact manifold of dimension $n$ , with spectral kernel

[TABLE]

defined over a window $\eta(T)\rightarrow\infty$ growing arbitrarily slowly and such that

[TABLE]

we have

[TABLE]

for all $x,x^{\prime}$ and an improved bound for well-separated pairs:

[TABLE]

improving on the trivial bound once $d(x,x^{\prime})>1/T$ .

For $d(x,y)\lesssim 1/T$ , the basis for claim 6 is Hörmander’s Theorem 4.4 from [17]. This in turn is based on Lax’s parametrix for the wave equation, constructed in [20]. Using the wave equation in this way may break down when $Td(x,y)$ is unbounded. For larger distances we instead appeal to the results of Canzani-Hanin [7]. Their Theorem 2 improves the $O(T^{n-1})$ error term in Hörmander’s estimate for $K(x,y)$ to $o(T^{n-1})$ , assuming $x,y$ are in a ball $B_{r}(z)$ of radius $r\rightarrow 0$ arbitrarily slowly around some non-self-focal point $z$ . Without the assumption on $z$ , one cannot conclude the remainder is $o(T^{n-1})$ since the sphere is a counterexample, but the method of [7] still gives

[TABLE]

where the error term is uniform over pairs $(x,y)$ with $d(x,y)<r$ . In this notation, $g_{y}$ and $|*|_{g_{y}}$ are the length and inner product on the tangent space at $y$ defined by the metric $g$ , $\sqrt{|g_{y}|}$ is the volume form, and $\exp_{y}$ is the exponential map. Note that $\exp_{y}^{-1}(x)$ is well defined for $d(x,y)$ sufficiently small (less than the injectivity radius of $M$ ).

Using polar coordinates at $y$ , with $\omega=\exp_{y}^{-1}(x)$ and $\xi=s\alpha$ , the difference between the main terms for $T$ and $T-\eta$ is

[TABLE]

The integral over $S^{n-1}$ gives the Bessel function

[TABLE]

up to a normalizing factor depending only on $n$ . This is a bounded function that begins to oscillate when $Td(x,y)$ reaches the first zeros of $J_{n/2-1}$ , and decays as a power $(Td(x,y)^{-n/2+1/2}$ as $Td(x,y)\rightarrow\infty$ . We have

[TABLE]

by the binomial expansion. This implies

[TABLE]

for some constant $c=c_{n}>0$ . Note that the $\eta^{-1}$ in the error corresponds to the remainder in Weyl’s law whereas $\eta T^{-1}$ is from truncating the binomial expansion in (5.4). They are equal when $\eta=T^{1/2}$ .

If $d(x,y)\lesssim 1/T$ , we simply use the fact that $J$ is bounded to obtain the trivial bound

[TABLE]

This is useful for nearby pairs $(x,y)$ , but for $d(x,y)\gtrsim 1/T$ it is better to input the fact that $J(u)\lesssim u^{-n/2+1/2}$ to obtain

[TABLE]

We have assumed $\eta\lesssim T^{1/2}$ so that $\eta T^{-1}$ can be absorbed into the error $\eta^{-1}$ . This gives (5.2). ∎

We have assumed that $\eta\lesssim T^{1/2}$ for convenience, and indeed what we have in mind is that $\eta$ is a power of $\log{T}$ . If one did want to allow larger $\eta$ , the error in (5.2) would become $\eta T^{-1}$ instead of $\eta^{-1}$ . For the arguments in Section 7 below to go through, one would then need to assume $\eta=o(T/\log{T})$ .

6. Upper bound on the variance

By the triangle inequality, $d(x,x^{\prime})\leq d(x,z)+d(z,x^{\prime})<2r$ . Since the integrand is nonnegative, we can bound the inner integral in (4.19) by

[TABLE]

Having moved the center to $x^{\prime}$ , we introduce polar coordinates $(\rho,\omega)$ where the radial coordinate $\rho=d(x,x^{\prime})$ ranges from 0 to $2r$ . The volume form is given approximately by its Euclidean counterpart:

[TABLE]

Indeed, the volume form is obtained from the metric $g$ by $\sqrt{\det(g)}$ and we have the expansion

[TABLE]

We integrate the estimate (5.2) from section 5, namely

[TABLE]

This diverges as $\rho\rightarrow 0$ , since we would be better off using the trivial bound for $\rho<1/T$ , but the singularity is integrable. We obtain

[TABLE]

Integrating over $x$ and noting that $\operatorname{vol}(B_{r})\asymp r^{n}$ , we obtain

[TABLE]

as claimed in Lemma 4. This improves on what one would get by replacing $K$ with its maximum, namely

[TABLE]

Recall that we have normalized to have Gaussian coefficients of variance proportional to $T^{n-1}\eta$ . Thus this factor $(T^{n-1}\eta)^{2}$ will cancel, leaving

[TABLE]

This vanishes as $rT\rightarrow\infty$ and $\eta\rightarrow\infty$ , whereas the trivial bound would only show the variance is bounded.

7. Collecting the bounds and proving Theorem 1

From the union bound, we had

[TABLE]

From the Chernoff bound,

[TABLE]

From the variance formula,

[TABLE]

Therefore

[TABLE]

We already assumed $rT/\log(T)\rightarrow\infty$ so that $T^{n}\exp(-c_{3}\varepsilon rT)\rightarrow 0$ no matter how small is the given $\varepsilon$ , which controls the probability of an “off-grid” deviation. To control the “on-grid” deviation, we must further assume that

[TABLE]

This guarantees that, again, the factor of $T^{n}$ can be absorbed. Equivalently, we need

[TABLE]

that is, both $(rT)^{-(n-1)/2}\log(T)\rightarrow 0$ and $\eta^{-1}\log(T)\rightarrow 0$ . For $n\geq 3$ , the first of these is already implied by the assumption $rT/\log(T)\rightarrow\infty$ . If $n=2$ , then we instead assume $(rT)/\log(T)^{2}\rightarrow\infty$ . Thus the requirements amount to both $rT$ and $\eta(T)$ being asymptotically larger than $\log(T)$ :

[TABLE]

These are the hypotheses of Theorem 1, and the proof is complete. Moreover, we have proved the rate of convergence for Theorem 1 claimed in (1.3): for any $\varepsilon>0$ , there are positive $C_{\varepsilon}$ and $c(\varepsilon)$ such that

[TABLE]

8. Conclusion

The proof we have given relies on a union bound, ignoring the interesting question of how integrals $\int_{B}|\phi|^{2}$ and $\int_{B^{\prime}}|\phi|^{2}$ over different sets are correlated. One might also wonder about other ensembles of random functions, for instance band-limited functions with a window $\eta(T)$ proportional to $T$ instead of $o(T)$ , or where the distribution of the coefficients is not Gaussian. One could study other sets $B$ , not necessarily balls, either with diameter shrinking like the $r$ in our setup, or volume shrinking like $r^{n}$ . The lifts of $|\phi|^{2}d\operatorname{vol}$ to $S^{*}M$ are another interesting class of random measures. Regarding more general coefficients, we note the article [14] of Hanson-Wright on concentration for quadratic forms in independent random variables.

As a first step addressing two of these further directions, here is an exact covariance formula. The covariance between two of our integrals takes a similar form to the variance of a single one. In [9], we did this calculation on the sphere. This was an algebraic calculation valid in more general circumstances, as we now indicate. This proof applies to non-Gaussian distributions of the coefficients, as long as the first four moments are the same as for a Gaussian, whereas the proof by differentiating the moment generating function is specific to Gaussians. Without the assumption on the fourth moment, there is a more complicated formula involving $\sum_{j}\phi_{j}(x)^{2}\phi_{j}(y)^{2}$ in addition to the kernel $\sum_{j}\phi_{j}(x)\phi_{j}(y)$ .

Lemma 7.

Suppose $c_{j}$ are independent random variables with first and third moments [math], variance $\sigma^{2}$ , and fourth moment $3\sigma^{4}$ . Suppose $\phi_{j}:M\rightarrow\mathbb{C}$ are functions on some measure space $M$ (assumed $\sigma$ -finite for purposes of Fubini’s theorem) and $\phi=\sum_{j}c_{j}\phi_{j}$ is the corresponding random function. Then for any measurable subsets $B\subseteq M$ , $B^{\prime}\subseteq M$ ,

[TABLE]

where $K(x,x^{\prime})=\sum_{j}\phi_{j}(x)\overline{\phi_{j}(x^{\prime})}$ . If the fourth moment $\mathbb{E}[c^{4}]$ does not necessarily equal $3\sigma^{4}$ , then the covariance is given by

[TABLE]

Proof.

We compute the covariance $\mathbb{E}[\int_{B}|\phi|^{2}\int_{B^{\prime}}|\phi|^{2}]-\mathbb{E}[\int_{B}|\phi|^{2}]\mathbb{E}[\int_{B^{\prime}}|\phi|^{2}]$ by expanding $|\phi|^{2}$ and using linearity of expectation to exchange $\mathbb{E}$ with the sums and integrals. For the expectation of the product, we have

[TABLE]

Since the coefficients are independent and have mean 0, the expectation $\mathbb{E}[c_{i}c_{j}c_{k}c_{l}]$ is $3\sigma^{4}$ if all indices $i,j,k,$ and $l$ are equal, $\sigma^{4}$ if they are equal in pairs, and [math] in all other cases. In light of the different cases $i=j\neq k=l$ , $i=k\neq j=l$ , or $i=l\neq j=k$ , it follows that

[TABLE]

The factor of 3 means that the first term exactly supplies the missing diagonal terms $i=k$ , $i=j$ , and $i=l$ (which we have merged with $i=k$ , the two cases giving the same contribution) in the three other sums. The completed sums then factor, so that

[TABLE]

For the product of the expectations, we have

[TABLE]

by independence of the coefficients. Thus subtraction gives

[TABLE]

which is (8.1).

If the fourth moment $\mathbb{E}[c^{4}]$ does not match that of a Gaussian, then the same method shows that the covariance is given by

[TABLE]

∎

Note that, whereas $\sum_{j}\phi_{j}(x)\phi_{j}(x^{\prime})$ is unaffected by an orthogonal change of basis $\phi_{j}\mapsto\sum_{k}a_{jk}\phi_{k}$ , the sum of squares $\sum_{j}\phi_{j}(x)^{2}\phi_{j}(x^{\prime})^{2}$ may depend on the choice of orthonormal basis. If $\mathbb{E}[c^{4}]=3\sigma^{4}$ , then this extra term disappears.

Acknowledgments

We thank Peter Sarnak for his advice, encouragement, and support over the course of this work. We thank Yaiza Canzani for helpful discussions about Weyl’s law. We thank the Natural Sciences and Engineering Research Council of Canada for its support through a PGS D grant.

Bibliography28

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] N. Anantharaman, Entropy and the localization of eigenfunctions , Annals of Math. (2), 168 (2008), 435–475.
2[2] N. Anantharaman and S. Nonnenmacher, Half-delocalization of eigenfunctions for the Laplacian on an Anosov manifold , Ann. Inst. Four. (Grenoble), 57, 6 (2007), 2465–2523.
3[3] N. Anantharaman and L. Silberman, A Haar component for quantum limits on locally symmetric spaces , Israel J. Math. v 195 no.1 493-447 (2013)
4[4] J. Bourgain and E. Lindenstrauss, Entropy of quantum limits , Comm. Math. Phys., 233 (2003), 153–171.
5[5] N. Burq and G. Lebeau, Injections de Sobolev probabilistes et applications. Ann. Sci. Éc. Norm. Supér. (4), 46 (2013), 917–962. ar Xiv:1111.7310. (2011)
6[6] Y. Canzani and B. Hanin. High Frequency Eigenfunction Immersions and Supremum Norms of Random Waves. Electronic Research Announcements in Mathematical Sciences, Volume 22, 2015, pp. 76-86. ar Xiv: 1406.2309.
7[7] Y Canzani and B. Hanin, Scaling limit for the kernel of the spectral projector and remainder estimates in the pointwise Weyl law , Analysis & PDE, Vol. 8, No. 7 (2015), 1707-1732
8[8] Y. Colin de Verdière, Ergodicité et les fonctions propres du laplacien Comm. Math. Phys., 102 (1985), 497–502.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Shrinking scale equidistribution for monochromatic random waves on compact manifolds

Abstract.

1. Introduction

Theorem 1**.**

2. Two-point function

3. Outline of the proof: Union bound

Claim 2**.**

Proof.

4. Chernoff bound

Claim 3**.**

Proof.

Lemma 4**.**

Lemma 5**.**

5. Input from semiclassics

Claim 6**.**

6. Upper bound on the variance

7. Collecting the bounds and proving Theorem 1

8. Conclusion

Lemma 7**.**

Proof.

Acknowledgments

Theorem 1.

Claim 2.

Claim 3.

Lemma 4.

Lemma 5.

Claim 6.

Lemma 7.