Shrinking scale equidistribution for monochromatic random waves on compact manifolds
Matthew de Courcy-Ireland

TL;DR
This paper proves that monochromatic random waves on compact manifolds become uniformly distributed at very small scales, nearly matching the optimal wave scale, with high probability, using spectral and probabilistic tools.
Contribution
It establishes shrinking scale equidistribution for monochromatic random waves on any compact manifold, extending previous results to more general settings.
Findings
Equidistribution occurs at near-optimal wave scales.
High probability of uniform distribution across the manifold.
Uses Weyl's law and Chernoff bounds for proof.
Abstract
We prove equidistribution at shrinking scales for the monochromatic ensemble on a compact Riemannian manifold of any dimension. This ensemble on an arbitrary manifold takes a slowly growing spectral window in order to synthesize a random function. With high probability, equidistribution takes place close to the optimal wave scale and simultaneously over the whole manifold. The proof uses Weyl's law to approximate the two-point correlation function of the ensemble, and a Chernoff bound to deduce concentration.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Shrinking scale equidistribution for monochromatic random waves on compact manifolds
Matthew de Courcy-Ireland
Department of Mathematics
Princeton University
Princeton NJ 08544
(Date: February 14, 2019)
Abstract.
We prove equidistribution at shrinking scales for the monochromatic ensemble on a compact Riemannian manifold of any dimension. This ensemble on an arbitrary manifold takes a slowly growing spectral window in order to synthesize a random function. With high probability, equidistribution takes place close to the optimal wave scale and simultaneously over the whole manifold. The proof uses Weyl’s law to approximate the two-point correlation function of the ensemble, and a Chernoff bound to deduce concentration.
1. Introduction
Consider a compact manifold together with a Riemannian metric . By compactness, the spectrum of the Laplacian is a discrete sequence of eigenvalues , possibly with multiplicity. The corresponding eigenfunctions satisfy
[TABLE]
These eigenfunctions form an orthonormal basis for , the space with respect to integration against the volume form of . Thus one can expand functions in terms of the Laplace eigenfunctions, and a natural model for a random function on is to randomize the coefficients in such an expansion. The monochromatic ensemble takes the specific form
[TABLE]
where the coefficients are independent, identically distributed Gaussian random variables of mean 0. The parameter is large. If the window is short compared to , then is a stand-in for a “random eigenfunction” with eigenvalue . The problem with literally taking a random eigenfunction is that when an eigenvalue has multiplicity 1, the random function would simply be a deterministic function multiplied by a random scalar.
Consider a ball with center whose radius is allowed to vary with . We can normalize so that , in expectation, is close to .
Theorem 1**.**
If (or in case ) and the spectral window obeys and , then for any ,
[TABLE]
The wave scale is the natural wavelength of an eigenfunction with Laplace eigenvalue , also called the Planck scale or de Broglie wavelength. At such a fine scale, there could be a large discrepancy between and . For instance, may be much larger than if achieves its maximum inside . The hypothesis of Theorem 1 is that is large compared to the wave scale in the sense that . We then conclude there is only a small deviation even in the worst case over all centers . The assumption is a relatively mild one, as it allows to grow arbitrarily slowly so that Theorem 1 takes place almost at the wave scale.
Theorem 1 follows from a more explicit bound: for any , there are positive and such that the probability of an -deviation occurring somewhere on is at most
[TABLE]
The factor in (1.3) arises from taking a union bound over roughly points, separated pairwise by a distance . The exponential factor is an upper bound for the probability of a deviation at a single point. Under the assumption that and grow faster than logarithmically, the factor can be absorbed into the exponential and Theorem 1 follows. We describe the union bound in more detail in Section 3. Section 4 uses a Chernoff bound to estimate the probability of a deviation at a single point. The result is expressed in terms of the variance of the local integrals , which we estimate in Lemma 4. The key input is the Local Weyl Law for Laplace eigenfunctions, in a form proved by Canzani and Hanin [6] and described in Section 5. This is used to estimate the two-point correlation function of , defined in Section 2. We complete the proof of (1.3) in Sections 6 and 7. Section 8 concludes with some further questions and a lemma that applies if the coefficients in (1.2) are not necessarily Gaussian.
To have a model for random eigenfunctions, the window should be as small as possible, so it is not a serious restriction to assume that in Theorem 1. This assumption is convenient for stating simplified estimates, but the arguments below could still be implemented as long as .
We mainly have in mind real-valued functions , but we write absolute values in Theorem 1 because a similar statement holds for complex-valued functions as well. However, the complex version is not as sharp since complex eigenfunctions may equidistribute at even smaller scales than their real counterparts. For instance, on the circle , is uniform at all scales because its modulus is identically 1, whereas is limited by the wave scale . Nevertheless, the notation below will involve complex conjugates in order to include the complex case. It would also be appropriate to take Gaussians in the complex plane if one were interested in the complex case, instead of the real coefficients . This can be incorporated into the same proof as for the real case, since a single complex Gaussian is equivalent to two independent real Gaussians.
To provide some context for Theorem 1, consider the property of quantum unique ergodicity (QUE). By QUE for a Riemannian manifold , we mean that for any fixed measurable subset of ,
[TABLE]
for any sequence of Laplace eigenfunctions with growing eigenvalue . There is a further question of the distribution of the microlocal lifts of to phase space , but we confine our attention to the base space . If (1.4) holds along a full subsequence of eigenfunctions, the manifold enjoys quantum ergodicity but may lack uniqueness of quantum limits. The quantum ergodicity theorem proved by Shnirelman [25, 26], Colin de Verdière [8], and Zelditch [28] shows that negative curvature implies quantum ergodicity. Rudnick and Sarnak conjecture that the stronger property of QUE is true on any compact negatively curved surface [24]. This has been shown for examples of arithmetic origin in work of Lindenstrauss [22, 23], and Bourgain-Lindenstrauss [4], Jakobson [19], Holowinsky [17], and Holowinsky-Soundararajan [16]. For a general metric, work of Anantharaman [1], Anantharaman-Nonnenmacher [2], Anantharaman-Silberman [3], and Dyatlov-Jin [10] places constraints on the measures that arise as quantum limits but it remains unknown whether the uniform measure is the only possibility.
From this point of view, it is of interest to randomize and see whether one at least has uniform distribution with high probability. VanderKam [27] showed that one does have equidistribution for random spherical harmonics on the sphere, where QUE is known to fail. A more refined question is whether there is equidistribution even if the test set shrinks as the frequency grows. This scenario has been studied recently in papers of Han [12] (assuming high multiplicity), Han-Tacy [13] (with a spectral window instead of high multiplicity), Granville-Wigman [11] (on an arithmetic torus guaranteeing high multiplicity), Lester-Rudnick [21] (on higher-dimensional tori), Humphries [18] (for non-random functions on arithmetic surfaces, with the averaging being done over the sphere center instead). In particular, Theorem 4.4 from Han-Tacy [13] estimates the probability that there is some point with a given deviation, much like our Theorem 1 but in a different context. In [13], instead of fluctuating near 1, is conditioned to be exactly 1. This is more natural for the quantum interpretation, but the corresponding coefficients in (1.2) are no longer independent random variables, and Han-Tacy treat this with an elegant application of Lévy’s concentration of measure in high-dimensional spheres. The radius in [13] is with close to , whereas we take equal to up to a logarithmic power. Thus Theorem 1 is closer to the wave scale, but in the easier case of a fixed instead of the shrinking deviation from [13].
2. Two-point function
A fundamental quantity governing the statistics of random functions of the form (1.2) is the two-point function of the ensemble, given by
[TABLE]
At each point, is a Gaussian of mean zero, and it is that records the correlation of these random variables at different points on the manifold. Indeed, suppose the coefficients in (1.2) are independent with mean 0 and variance . We then have
[TABLE]
A natural normalization is to require
[TABLE]
To arrange this, the variance of the coefficients must be
[TABLE]
The basis functions are orthonormal in , so the denominator is just the number of eigenvalues in the interval, say :
[TABLE]
Thus we choose the variance of the coefficients to be
[TABLE]
For other sets , we then have
[TABLE]
In the homogeneous case, is independent of and the expectation is simply . In general, it is never very far from , as we will see from Weyl’s law:
[TABLE]
3. Outline of the proof: Union bound
To prove Theorem 1, we follow the strategy of [9]. We write the random variable of interest as
[TABLE]
It has expectation of order 1. The key point is that for a monochromatic wave of frequency , the modulus of continuity at scale is under control. This allows one to replace the supremum over all by a maximum over roughly sample points, where . The union bound is that for a finite number of points
[TABLE]
For our application, the number of points is proportional to . By the union bound, there will be only a probability of there being some point at which a deviation of occurs, provided the probability of a deviation at any single point is . Thus the union bound reduces the problem to a calculation at a single point. That calculation can be done by a Chernoff bound.
Passing to the grid brings with it another error: Conceivably the integrals around all the gridpoints are within of their average, but nevertheless the integral around some point off the grid differs considerably. We must show that this “off-grid” error occurs with only a low probability.
To be more precise, suppose there is a point such that
[TABLE]
Take a grid of points such that every point of is within of a gridpoint. The number of gridpoints is thus of order . We have
[TABLE]
Thus one of the three terms must be greater than . The difference of expected values is non-random and small: Both are , so their difference is . Eventually, this will not be greater than since we assume . Alternatively, note that
[TABLE]
To bound the volume of the symmetric difference, we have the following claim.
Claim 2**.**
If and are balls of radius centered at points separated by less than in a Riemannian manifold of dimension ,
[TABLE]
Proof.
Indeed, for small radii , we can compare to Euclidean balls or simply to a Euclidean box with sidelengths of order and a remaining side of order . The bound holds for larger separations as well, but becomes worse than the easier bound
[TABLE]
∎
With a separation of less than between and , we therefore have
[TABLE]
Assuming , this term will be less than . Thus the difference of expected values will eventually be less than whether we assume or (and later, we will assume that both of them diverge faster than logarithmically). In the case of an -difference of from its mean, it is one of the other two terms or that must be greater than (and in fact, almost greater than once and are large enough).
Suppose it is the integrals around versus that differ by more than . We have
[TABLE]
Since , the same volume bound as above gives
[TABLE]
That is,
[TABLE]
To control the probability of having such a large maximum, we use another union bound. More precise estimates of have been given by Burq-Lebeau [5] and Canzani-Hanin [6], but we include the following sketch to keep the present argument self-contained. Again, take a grid of roughly points. Either there is a gridpoint at which or else there are two points separated by only at which the values of differ by at least . The latter is very unlikely because is the wave scale for . Whereas the values are Gaussian with unit variance, the derivatives of are Gaussian with variance , so a difference of between points separated by only would require to have some directional derivative more than standard deviations above its mean. This occurs with probability less than . Likewise, having requires a Gaussian to be more than standard deviaions above its mean. From the union bound,
[TABLE]
which is negligible as long as . Thus we can move to the final case: The probability that an integral around any single point shows a deviation of more than .
4. Chernoff bound
Each variable is a quadratic form in the coefficients . Writing , we have
[TABLE]
We scale by the variance to write , where is a standard Gaussian of mean 0 and variance 1. Thus
[TABLE]
where the matrix has entries
[TABLE]
Note that this matrix depends on , as well as and , but we have suppressed this in the notation. Since is a symmetric matrix, or Hermitian if we prefer to start from complex-valued eigenfunctions , we may diagonalize to write where is orthogonal (or unitary, in the complex case) and is diagonal with entries, say, . In eigencoordinates, the random variable becomes
[TABLE]
where is again a standard Gaussian vector.
Evaluating a Gaussian integral, it follows that the moment generating function of a quadratic form in standard Gaussians is
[TABLE]
where are the eigenvalues of . In the complex case, each factor effectively occurs twice because of the real and imaginary parts of , leading to instead of . One has convergence in (4.5) as long as for all , so must be small enough. Specifically, is defined for , where is the largest eigenvalue of .
Estimates for allow us to execute a Chernoff bound on the tail probability. For any , if and only if , so by Markov’s inequality
[TABLE]
In the case at hand, where , we have
[TABLE]
Expanding the logarithm in a power series (provided ), we have
[TABLE]
The term contributes . This cancels the expected value above so that
[TABLE]
We would like to minimize the sum of the first two terms by choosing
[TABLE]
but it is not clear whether , that is, whether is defined. We would need to know that
[TABLE]
at least for sufficiently small . In the case of the manifold with its usual round metric, we were able to show in [9] that and are of the same order of magnitude, so that this holds once is small enough. Here, we choose a different to guarantee that , namely
[TABLE]
where . Note that , so that this is a valid choice of .
Claim 3**.**
For this choice , where , we have
[TABLE]
where can be taken as .
Proof.
Indeed, this follows from Taylor’s theorem. For a twice differentiable function , we have
[TABLE]
Applied to the function , this gives
[TABLE]
In particular, for we have
[TABLE]
so we may take to have a bound valid for all up to . We take where with . These values of are at most
[TABLE]
Taylor’s theorem then gives
[TABLE]
Summing over and dividing by 2, we get
[TABLE]
Hence, noting again that , we have proved the claim. ∎
With this estimate in hand, we can bound the tail probability as follows:
[TABLE]
The lower tail, where , is slightly different but can be treated by the same method. We have if and only if , so we can apply the argument above with in place of . Instead of , the relevant function for the Chernoff bound is
[TABLE]
This function is defined for all whereas is defined only for sufficiently small . The Chernoff bound is
[TABLE]
We have for all , so that
[TABLE]
where we choose s=c\big{(}\sum\lambda_{j}^{2}\big{)}^{-1/2} as above. This shows that the lower tail probability obeys the same bound as the upper tail probability, namely
[TABLE]
In fact, since is defined for all , we could simply choose to get an even better bound. This doesn’t help us though, since we control both upper and lower tail together by the sum of their respective bounds:
[TABLE]
for any .
In order to take advantage of this, we need an estimate on the second moment .
Lemma 4**.**
[TABLE]
We will prove the lemma using estimates for the two-point function . We have
[TABLE]
The trace , and also the trace of any power of , can be expressed in terms of as follows.
Recall that
[TABLE]
Since the -entry of is
[TABLE]
the entries of are
[TABLE]
When we sum the diagonal entries, we get
[TABLE]
We can equally well express this product of integrals as one multiple integral:
[TABLE]
The integrand factors:
[TABLE]
We summarize this as follows:
Lemma 5**.**
If is the matrix with entries
[TABLE]
and is the kernel given by
[TABLE]
then
[TABLE]
with the indices interpreted cyclically so that means .
In particular, with , we have
[TABLE]
5. Input from semiclassics
To prove the variance estimate in Lemma 4 , we need to know the size of . Here is the basic estimate:
Claim 6**.**
On a compact manifold of dimension , with spectral kernel
[TABLE]
defined over a window growing arbitrarily slowly and such that
[TABLE]
we have
[TABLE]
for all and an improved bound for well-separated pairs:
[TABLE]
improving on the trivial bound once .
For , the basis for claim 6 is Hörmander’s Theorem 4.4 from [17]. This in turn is based on Lax’s parametrix for the wave equation, constructed in [20]. Using the wave equation in this way may break down when is unbounded. For larger distances we instead appeal to the results of Canzani-Hanin [7]. Their Theorem 2 improves the error term in Hörmander’s estimate for to , assuming are in a ball of radius arbitrarily slowly around some non-self-focal point . Without the assumption on , one cannot conclude the remainder is since the sphere is a counterexample, but the method of [7] still gives
[TABLE]
where the error term is uniform over pairs with . In this notation, and are the length and inner product on the tangent space at defined by the metric , is the volume form, and is the exponential map. Note that is well defined for sufficiently small (less than the injectivity radius of ).
Using polar coordinates at , with and , the difference between the main terms for and is
[TABLE]
The integral over gives the Bessel function
[TABLE]
up to a normalizing factor depending only on . This is a bounded function that begins to oscillate when reaches the first zeros of , and decays as a power as . We have
[TABLE]
by the binomial expansion. This implies
[TABLE]
for some constant . Note that the in the error corresponds to the remainder in Weyl’s law whereas is from truncating the binomial expansion in (5.4). They are equal when .
If , we simply use the fact that is bounded to obtain the trivial bound
[TABLE]
This is useful for nearby pairs , but for it is better to input the fact that to obtain
[TABLE]
We have assumed so that can be absorbed into the error . This gives (5.2). ∎
We have assumed that for convenience, and indeed what we have in mind is that is a power of . If one did want to allow larger , the error in (5.2) would become instead of . For the arguments in Section 7 below to go through, one would then need to assume .
6. Upper bound on the variance
By the triangle inequality, . Since the integrand is nonnegative, we can bound the inner integral in (4.19) by
[TABLE]
Having moved the center to , we introduce polar coordinates where the radial coordinate ranges from 0 to . The volume form is given approximately by its Euclidean counterpart:
[TABLE]
Indeed, the volume form is obtained from the metric by and we have the expansion
[TABLE]
We integrate the estimate (5.2) from section 5, namely
[TABLE]
This diverges as , since we would be better off using the trivial bound for , but the singularity is integrable. We obtain
[TABLE]
Integrating over and noting that , we obtain
[TABLE]
as claimed in Lemma 4. This improves on what one would get by replacing with its maximum, namely
[TABLE]
Recall that we have normalized to have Gaussian coefficients of variance proportional to . Thus this factor will cancel, leaving
[TABLE]
This vanishes as and , whereas the trivial bound would only show the variance is bounded.
7. Collecting the bounds and proving Theorem 1
From the union bound, we had
[TABLE]
From the Chernoff bound,
[TABLE]
From the variance formula,
[TABLE]
Therefore
[TABLE]
We already assumed so that no matter how small is the given , which controls the probability of an “off-grid” deviation. To control the “on-grid” deviation, we must further assume that
[TABLE]
This guarantees that, again, the factor of can be absorbed. Equivalently, we need
[TABLE]
that is, both and . For , the first of these is already implied by the assumption . If , then we instead assume . Thus the requirements amount to both and being asymptotically larger than :
[TABLE]
These are the hypotheses of Theorem 1, and the proof is complete. Moreover, we have proved the rate of convergence for Theorem 1 claimed in (1.3): for any , there are positive and such that
[TABLE]
8. Conclusion
The proof we have given relies on a union bound, ignoring the interesting question of how integrals and over different sets are correlated. One might also wonder about other ensembles of random functions, for instance band-limited functions with a window proportional to instead of , or where the distribution of the coefficients is not Gaussian. One could study other sets , not necessarily balls, either with diameter shrinking like the in our setup, or volume shrinking like . The lifts of to are another interesting class of random measures. Regarding more general coefficients, we note the article [14] of Hanson-Wright on concentration for quadratic forms in independent random variables.
As a first step addressing two of these further directions, here is an exact covariance formula. The covariance between two of our integrals takes a similar form to the variance of a single one. In [9], we did this calculation on the sphere. This was an algebraic calculation valid in more general circumstances, as we now indicate. This proof applies to non-Gaussian distributions of the coefficients, as long as the first four moments are the same as for a Gaussian, whereas the proof by differentiating the moment generating function is specific to Gaussians. Without the assumption on the fourth moment, there is a more complicated formula involving in addition to the kernel .
Lemma 7**.**
Suppose are independent random variables with first and third moments [math], variance , and fourth moment . Suppose are functions on some measure space (assumed -finite for purposes of Fubini’s theorem) and is the corresponding random function. Then for any measurable subsets , ,
[TABLE]
where . If the fourth moment does not necessarily equal , then the covariance is given by
[TABLE]
Proof.
We compute the covariance by expanding and using linearity of expectation to exchange with the sums and integrals. For the expectation of the product, we have
[TABLE]
Since the coefficients are independent and have mean 0, the expectation is if all indices and are equal, if they are equal in pairs, and [math] in all other cases. In light of the different cases , , or , it follows that
[TABLE]
The factor of 3 means that the first term exactly supplies the missing diagonal terms , , and (which we have merged with , the two cases giving the same contribution) in the three other sums. The completed sums then factor, so that
[TABLE]
For the product of the expectations, we have
[TABLE]
by independence of the coefficients. Thus subtraction gives
[TABLE]
which is (8.1).
If the fourth moment does not match that of a Gaussian, then the same method shows that the covariance is given by
[TABLE]
∎
Note that, whereas is unaffected by an orthogonal change of basis , the sum of squares may depend on the choice of orthonormal basis. If , then this extra term disappears.
Acknowledgments
We thank Peter Sarnak for his advice, encouragement, and support over the course of this work. We thank Yaiza Canzani for helpful discussions about Weyl’s law. We thank the Natural Sciences and Engineering Research Council of Canada for its support through a PGS D grant.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] N. Anantharaman, Entropy and the localization of eigenfunctions , Annals of Math. (2), 168 (2008), 435–475.
- 2[2] N. Anantharaman and S. Nonnenmacher, Half-delocalization of eigenfunctions for the Laplacian on an Anosov manifold , Ann. Inst. Four. (Grenoble), 57, 6 (2007), 2465–2523.
- 3[3] N. Anantharaman and L. Silberman, A Haar component for quantum limits on locally symmetric spaces , Israel J. Math. v 195 no.1 493-447 (2013)
- 4[4] J. Bourgain and E. Lindenstrauss, Entropy of quantum limits , Comm. Math. Phys., 233 (2003), 153–171.
- 5[5] N. Burq and G. Lebeau, Injections de Sobolev probabilistes et applications. Ann. Sci. Éc. Norm. Supér. (4), 46 (2013), 917–962. ar Xiv:1111.7310. (2011)
- 6[6] Y. Canzani and B. Hanin. High Frequency Eigenfunction Immersions and Supremum Norms of Random Waves. Electronic Research Announcements in Mathematical Sciences, Volume 22, 2015, pp. 76-86. ar Xiv: 1406.2309.
- 7[7] Y Canzani and B. Hanin, Scaling limit for the kernel of the spectral projector and remainder estimates in the pointwise Weyl law , Analysis & PDE, Vol. 8, No. 7 (2015), 1707-1732
- 8[8] Y. Colin de Verdière, Ergodicité et les fonctions propres du laplacien Comm. Math. Phys., 102 (1985), 497–502.
