Random characteristics for Wigner matrices
Per von Soosten, Simone Warzel

TL;DR
This paper extends the random characteristics method to Wigner matrices with non-normal entries and provides a dynamic proof of the weak local semicircle law in the bulk spectrum.
Contribution
It introduces a generalized approach for Wigner matrices and offers a new dynamic proof of the semicircle law without assuming normality.
Findings
Extended random characteristics to broader Wigner matrices
Provided a simple proof of the weak local semicircle law
Validated the approach for non-normal entries
Abstract
We extend the random characteristics approach to Wigner matrices whose entries are not required to have a normal distribution. As an application, we give a simple and fully dynamical proof of the weak local semicircle law in the bulk.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\addtokomafont
author
Random characteristics for Wigner matrices
Per von Soosten and Simone Warzel
\minisec
Abstract We extend the random characteristics approach to Wigner matrices whose entries are not required to have a normal distribution. As an application, we give a simple and fully dynamical proof of the weak local semicircle law in the bulk.
1 Introduction
Starting with the seminal works of Erdős, Schlein, and Yau [15, 16], a large portion of recent progress in random matrix theory rests on strong concentration of measure phenomena for the resolvent on almost microscopic scales. Such estimates are an important model-dependent step in proving Wigner-Dyson universality of the local eigenvalue statistics and uniform delocalization bounds for the eigenvectors. The mean-field setting is especially well-understood: the methods in [12, 18, 17] give strong results and the central ideas are robust enough to cover a wide range of models (see, for example, [2, 3, 5, 6, 7, 9, 14, 13] and references therein). The most basic example consists of the Wigner matrices, whose entries are drawn, independently up to symmetry, from some density with mean zero and variance . A typical result proves that the resolvent has essentially deterministic entries
[TABLE]
the approximation being valid on the smallest possible scale . The function
[TABLE]
is the Stieltjes transform of the semicircle law.
The approximations (1.1) are usually proved by deriving an approximate self-consistent equation
[TABLE]
where we used the notation that we retain throughout this paper. The stability of the self-consistent equation is then used to show that the approximate solution is close to the solution of the exact equation. Because it is usually not possible to prove the validity of the self-consistent equation on local scales directly, the analysis uses the stability of (1.2) again to show that rough estimates on the validity of (1.2) self-improve at finer scales. This idea enables a careful bootstrapping scheme, which allows one to successively improve the scale of the approximation (1.1).
It was noted by Pastur [22], that the self-consistent equation (1.2) can also be viewed as the terminal constraint at of the deterministic advection equation
[TABLE]
which can be easily solved by considering a suitable set of deterministic characteristic curves. In fact, if one generates a Gaussian Wigner matrix dynamically by evolving the entries with Brownian motion [11], one can derive a stochastic version of (1.3)
[TABLE]
with some explicit matrix-valued martingale (see, for example, [24]). This approach can be used to derive the validity of the semicircle law on global scales in the infinite-volume limit [4]. Deterministic characteristic curves have also featured in the analysis on local scales in more recent random matrix literature such as [1, 8, 19, 20].
The works [25, 26] showed that the SDE (1.4) can also be analyzed on local scales by considering the evolution along the random characteristic
[TABLE]
yielding a simple dynamical mechanism for directly proving concentration of measure for the resolvent. The method thus allows one to completely separate any stability arguments from local concentration of measure estimates, thereby circumventing the need for a bootstrap argument.
The purpose of this paper is to show that this approach to local resolvent estimates is not limited to Wigner matrices with Gaussian entries. The basis of this is the construction of a matrix martingale whose rescaled entries follow a given density at time . This will be possible for densities satisfying the following assumption.
Assumption 1.1**.**
The density is strictly positive, has zero mean and unit variance, and the function
[TABLE]
is bounded and Lipschitz continuous on .
The integral equation (1.6) is equivalent to
[TABLE]
so the condition that be bounded essentially amounts to being sub-Gaussian, whereas the Lipschitz continuity of is linked to the regularity of . Madan and Yor [21] used the Lipschitz continuity of to construct a scalar martingale satisfying
[TABLE]
where is a standard Brownian motion. Inspired by an idea of Dupire [10], they then showed that Kolmogorov’s forward equation implies
[TABLE]
Hence, to construct the matrix process , we take an array of standard Brownian motions that are independent up to the constraint and define as the solution of the SDE
[TABLE]
Then in distribution, where is a Wigner matrix whose rescaled entries have distribution . It will be important in the sequel that the quadratic variation of satisfies
[TABLE]
Combining Itô’s lemma with the Neumann expansion of the resolvent yields the matrix-valued SDE
[TABLE]
for the resolvent process with . Multiplying out the drift gives an expression of the form
[TABLE]
The matrix-valued operator , which acts on an matrix as
[TABLE]
is a dominant term whose presence in (1.8) provides the self-energy corrections in the resolvent. On the other hand, the operator
[TABLE]
should be thought of as a finite-volume error reflecting the lack of Hermitian symmetry in our model. Therefore, the self-energy correction coincides with the Itô correction. This is similar in spirit (and identical in the Gaussian case) to the cumulant expansion of He, Knowles, and Rosenthal [18]. However, the technical details are simpler here because the quadratic variation process naturally encodes the fluctuations around the Gaussian noise driving the dynamics.
We will illustrate our method by giving a new proof of the weak bulk local semicircle law for the normalized resolvent trace of Wigner matrices whose entries are drawn from densities satisfying Assumption 1.1. To state this result, we will make use of the stochastic domination language of [12]. Let and be two non-negative -dependent families of random variables. We will say that is stochastically dominated by uniformly in , if, given any , the inequality
[TABLE]
is satisfied for all sufficiently large . We express this relationship by writing uniformly in . Although most of the probabilities in this paper can be controlled with an explicit exponential tail, we have refrained from doing so for the sake of brevity. Our proof of the local semicircle law will be valid in a bulk spectral domain
[TABLE]
where for some . For simplicity, we will assume that the minimal spectral scale is given by
[TABLE]
where is fixed, but arbitrarily small.
Theorem 1.2**.**
We have
[TABLE]
We stress that this theorem first appeared in [15]. It has been extended both to Wigner matrices with minimal assumption on the distribution of the entries [2] and to random matrices with a more general spread-out variance profile [12].
Theorem 1.2 implies that is bounded both above and away from zero when . The Schur complement formula and classical concentration of measure results show that
[TABLE]
so Taylor expansion yields
[TABLE]
Similar considerations apply to the off-diagonal entries
[TABLE]
These calculations have become standard and are explained in great detail in [17]. However, given Theorem 1.2, no further bootstrapping is required to conclude (1.9) and (1.10).
The organization of this paper is as follows. In Section 2, we show in which sense the evolution (1.8) approximates (1.4). Then, in Section 3, we prove that (1.5) defines an approximate characteristic flow, which we use to derive the local semicircle law in Section 4.
2 The self-energy correction
In comparison the Gaussian case, the main complication in the random characteristic approach for more general Wigner matrices is that the self-energy operator remains random and time-dependent. Nevertheless, we will prove in Theorem 2.1 that
[TABLE]
with very high probability so that the characteristic curve (1.5) still counteracts a large part of the term
[TABLE]
in (1.8). The resulting error term is not small a-priori, but retains a specific structure that enables the Grönwall scheme of Lemma 3.3.
For technical reasons, we will prove Theorem 2.1 for spectral parameters in a very large domain
[TABLE]
and for times that are greater than a cutoff
[TABLE]
with some fixed to be specified later in the proof of Lemma 3.3. In the statement of Theorem 2.1, and throughout this paper, denotes the operator norm.
Theorem 2.1**.**
For any choice of in (2.2) we have
[TABLE]
Proof.
We first show that
[TABLE]
uniformly in , , and . Let denote the matrix obtained by replacing the -th row and column of by zeros. Denoting by the resolvent of , the resolvent identity implies
[TABLE]
when . Since by Assumption 1.1, we conclude the uniform deterministic estimate
[TABLE]
using (2.4) and the trivial resolvent bound for the summand with . Similarly, we have
[TABLE]
To prove (2.3) it thus suffices to show that
[TABLE]
uniformly in , , and . After conditioning on , the random variables are independent and bounded by . Using Hoeffding’s inequality with respect to the conditional probability we get
[TABLE]
with
[TABLE]
and
[TABLE]
which proves (2.5).
The extension of (2.3) to the maximum over is by the union bound, whereas the extension to the supremum over all and beyond the cutoff is by a stochastic continuity bound that we turn to in the next step. We fix and consider the neighborhood
[TABLE]
around some point . The theorem then follows from (2.3) and
[TABLE]
Indeed, for every and , there exists an and a finite grid of cardinality such that . Let be arbitrary. Applying the union bound to (2.3) and (2.6) shows that the events
[TABLE]
and
[TABLE]
hold simultaneously for all with probability , when is large enough. The claim of Theorem 2.1 follows since can be chosen arbitrarily large.
Finally, to prove (2.6), we note first that is a submartingale in . Doob’s inequality and classical norm bounds [23] for the rescaled Wigner matrix show that
[TABLE]
for all with some -dependent constant . Since the resolvent is Lipschitz continuous with constant in both and , this implies
[TABLE]
Doob’s inequality also implies that
[TABLE]
for any pair of indices . In combination with the Lipschitz continuity of , this implies
[TABLE]
for any . Since is a polynomial combination of the and , which are bounded by and respectively, the bound (2.6) follows. ∎
3 Fluctuations along characteristic curves
We will show that, for fixed realizations of the randomness, the unique solutions of
[TABLE]
serve as approximate characteristic curves of the resolvent SDE (1.8). We start these curves at spectral parameters in an initial spectral domain
[TABLE]
where is any constant satisfying
[TABLE]
Thus is uniformly bounded and Lipschitz continuous in . Given an initial point , we consider the process
[TABLE]
where
[TABLE]
is the characteristic flow stopped at
[TABLE]
The main observation is that is approximately constant.
Theorem 3.1**.**
The process satisfies
[TABLE]
uniformly in .
Proof.
Since defines a piecewise -process, an application of Itô’s lemma shows that satisfies the same SDE as but with an additional counter-term in the drift. More precisely, while , the evolution consists of two terms
[TABLE]
with
[TABLE]
When dealing with the integrated versions of these processes, we will choose the initial conditions so that . The proof of Theorem 3.1 is then a direct consequence of the estimates on and provided in Lemma 3.2 and Lemma 3.3, respectively. ∎
The proofs of the subsequent results will repeatedly use the crucial fact that any continuously differentiable function satisfies the identity
[TABLE]
provided that . The term is then usually estimated by a trivial bound in terms of . We refer to these two steps simply as the “integration trick”.
Lemma 3.2**.**
The process satisfies
[TABLE]
uniformly in .
Proof.
The martingale part of is
[TABLE]
and the quadratic variation of its unit trace is given by
[TABLE]
where denotes the Hilbert-Schmidt norm, denotes the entrywise product, and is the matrix with entries . If , we conclude that
[TABLE]
by the integration trick. Thus
[TABLE]
by the Burkholder-Davis-Gundy inequality and Markov’s inequality. The other term in ,
[TABLE]
is also stochastically dominated by uniformly in because of the integration trick. ∎
Lemma 3.3**.**
The process satisfies
[TABLE]
uniformly in .
Proof.
We will split the integral
[TABLE]
at the point from (2.2). By the trivial bound on the resolvent, the characteristic started at remains in the domain from (2.1) for all . Setting
[TABLE]
Theorem 2.1 shows that the second part of the integral in (3.2) is bounded by
[TABLE]
with probability for arbitrary and large enough . The first part of the integral in (3.2) can be bounded by using the trivial bound on the operator norm of the resolvent. So choosing the cutoff exponent in (2.2) large enough and using Lemma 3.2 yields
[TABLE]
on an event that also has probability for arbitrary and large enough . On this event, Grönwall’s inequality implies
[TABLE]
and the integral inside the exponential is bounded by because of the integration trick. Since for , we have shown that
[TABLE]
We now insert this bound on back into the integral in (3.3). Choosing the cutoff exponent in (2.2) large enough yields
[TABLE]
via the integration trick. ∎
Having established Theorem 3.1, we now argue that the stochastic domination holds simultaneously for a continuum of points using a discretization argument.
Theorem 3.4**.**
Let . Then
[TABLE]
Proof.
We begin by proving a Lipschitz bound for the characteristic flow . Fix and let
[TABLE]
As long as , we have
[TABLE]
so Grönwall’s inequality and the integration trick yield
[TABLE]
Rearranging, we get
[TABLE]
To prove the theorem, we choose a finite grid such that its cardinality is bounded by for some and
[TABLE]
From Theorem 3.1 and the union bound, we have
[TABLE]
so it suffices to show that the left side of (3.6) controls the left side of (3.4). Given , we pick such that . Then and since (3.5) guarantees that
[TABLE]
as long as and . The trivial -Lipschitz continuity of the resolvent in then shows that the process stays within an error of for all times . ∎
4 Proof of the local semicircle law
To prove Theorem 1.2 we choose an arbitrarily small and prove that the local semicircle law is valid on the event
[TABLE]
with an error that is also of order . By Theorem 3.4 the event has probability for arbitrary when is large enough. We begin by noting that the characteristic flow may be computed explicitly on .
Lemma 4.1**.**
Let and suppose occurs. Then for every , there exists such that and
[TABLE]
Proof.
Let be the time-reversal of defined by
[TABLE]
so that . Whenever and , we necessarily have , so
[TABLE]
by the definition of . The desired in the conclusion of the lemma will be
[TABLE]
which satisfies by the trivial resolvent bound. It thus remains to prove that . For this, we note that is simply connected since is a homeomorphism. If there were any point , there would also be some point with . Since , the relation (4.1) implies
[TABLE]
which leads to the contradiction by (3.1). ∎
Before proving Theorem 1.2, we mention that the relation
[TABLE]
is equivalent to . Since the semicircle law is analytic in the bulk interval , its Stieltjes transform is Lipschitz continuous in with a constant independent of .
Proof of Theorem 1.2.
It suffices to prove that
[TABLE]
on the event for all . Let , let be the point furnished by Lemma 4.1, and let be the solution of (4.2). By Lemma 4.1 we have for all sufficiently large , so the Lipschitz continuous dependence of on implies
[TABLE]
On the event the bound
[TABLE]
is valid with a uniform constant for all . ∎
\minisec
Acknowledgments We are grateful to two anonymous referees for their helpful suggestions. We would also like to thank Institut Mittag-Leffler for their hospitality during the early stages of this project. This work was supported by the DFG grants SO 1724/1-1 (P. S.) and EXC-2111 – 390814868 (S. W.).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Adhikari and J. Huang. Dyson Brownian motion for general β 𝛽 \beta and potential at the edge. Preprint at ar Xiv:1810.08308, 2018.
- 2[2] A. Aggarwal. Bulk universality for generalized Wigner matrices with few moments. Probability Theory and Related Fields , 173(1):375–432, Feb 2019.
- 3[3] J. Alt, L. Erdős, and T. Krüger. Local inhomogeneous circular law. Ann. Appl. Probab. , 28(1):148–203, 2018.
- 4[4] G. W. Anderson, A. Guionnet, and O. Zeitouni. An introduction to random matrices , volume 118 of Cambridge Studies in Advanced Mathematics . Cambridge University Press, Cambridge, 2010.
- 5[5] Z. Bao, L. Erdős, and K. Schnelli. Local law of addition of random matrices on optimal scale. Comm. Math. Phys. , 349(3):947–990, 2017.
- 6[6] R. Bauerschmidt, J. Huang, and H.-T. Yau. Local Kesten–Mc Kay law for random regular graphs. Comm. Math. Phys. , 369(2):523–636, 2019.
- 7[7] A. Bloemendal, L. Erdős, A. Knowles, H.-T. Yau, and J. Yin. Isotropic local laws for sample covariance and generalized Wigner matrices. Electron. J. Probab. , 19:no. 33, 53, 2014.
- 8[8] P. Bourgade. Extreme gaps between eigenvalues of Wigner matrices. Preprint at ar Xiv:1812.10376, 2018.
