Finite-rank perturbations of random band matrices via infinitesimal free   probability

Benson Au

arXiv:1906.10268·math.PR·April 26, 2022

Finite-rank perturbations of random band matrices via infinitesimal free probability

Benson Au

PDF

TL;DR

This paper investigates the infinitesimal spectral distribution of banded GUE matrices, establishing a phase transition at band width proportional to the square root of matrix size, and extends results on finite-rank perturbations and outlier detection.

Contribution

It proves a sharp phase transition for the infinitesimal distribution of banded GUE matrices and extends infinitesimal free probability results to this setting.

Findings

01

Sharp $\sqrt{N}$ transition for infinitesimal distribution

02

Model is infinitesimally free from matrix units and all-ones matrix for large band widths

03

Finite-rank perturbations produce outliers at classical positions

Abstract

We prove a sharp $N$ transition for the infinitesimal distribution of a periodically banded GUE matrix. For band widths $b_{N} = Ω (N)$ , we further prove that our model is infinitesimally free from the matrix units and the normalized all-ones matrix. Our results allow us to extend previous work of Shlyakhtenko on finite-rank perturbations of Wigner matrices in the infinitesimal framework. For finite-rank perturbations of our model, we find outliers at the classical positions from the deformed Wigner ensemble.

Equations343

λ_{1} (A_{N}) \geq \dots \geq λ_{N} (A_{N}), μ (A_{N}) = \frac{1}{N} k \in [N] \sum δ_{λ_{k}} .

λ_{1} (A_{N}) \geq \dots \geq λ_{N} (A_{N}), μ (A_{N}) = \frac{1}{N} k \in [N] \sum δ_{λ_{k}} .

G_{μ} (z) = \int_{R} \frac{1}{z - t} μ (d t) .

G_{μ} (z) = \int_{R} \frac{1}{z - t} μ (d t) .

G_{μ_{1} ⊞ μ_{2}} (z) = G_{μ_{ℓ}} (ω_{ℓ} (z)), \forall ℓ \in {1, 2} .

G_{μ_{1} ⊞ μ_{2}} (z) = G_{μ_{ℓ}} (ω_{ℓ} (z)), \forall ℓ \in {1, 2} .

\displaystyle\mu_{\mathbf{C}}=\mu_{\mathbf{A}}\boxplus\mu_{\mathbf{B}}=\bigg{(}\frac{1}{2\pi}\sqrt{4-t^{2}}\,dt\bigg{)}\boxplus\bigg{(}\frac{1}{2}\delta_{\pm 1}\bigg{)}

\displaystyle\mu_{\mathbf{C}}=\mu_{\mathbf{A}}\boxplus\mu_{\mathbf{B}}=\bigg{(}\frac{1}{2\pi}\sqrt{4-t^{2}}\,dt\bigg{)}\boxplus\bigg{(}\frac{1}{2}\delta_{\pm 1}\bigg{)}

\displaystyle=\frac{1}{2\pi\sqrt{3}}\Bigg{[}\frac{\sqrt[3]{27t-2t^{3}+3\sqrt{3}|t|\sqrt{27-4t^{2}}}}{\sqrt[3]{2}}-\frac{\sqrt[3]{2}t^{2}}{\sqrt[3]{27t-2t^{3}+3\sqrt{3}|t|\sqrt{27-4t^{2}}}}\Bigg{]}\,dt.

μ ⊞ δ_{0} = μ, \forall μ \in P (R) .

μ ⊞ δ_{0} = μ, \forall μ \in P (R) .

E [X_{j, j}^{(N)}] = 0, \forall j \in [N];

E [X_{j, j}^{(N)}] = 0, \forall j \in [N];

\displaystyle\sup_{N\in\mathbb{N}}\sup_{j\in[N]}\mathbb{E}\big{[}|X_{j,j}^{(N)}|^{2}\big{]}<\infty;

\displaystyle\lim_{N\to\infty}\frac{1}{N}\sum_{j\in[N]}\mathbb{E}\big{[}|X_{j,j}^{(N)}|^{2}\mathbbm{1}\{|X_{j,j}^{(N)}|\geq\varepsilon\sqrt{N}\}\big{]}=0,\qquad\forall\varepsilon>0.

\displaystyle\mathbb{E}[X_{j,k}^{(N)}]=0\quad\text{and}\quad\mathbb{E}\big{[}|X_{j,k}^{(N)}|^{2}\big{]}=\sigma^{2},\qquad\forall j<k\in[N];

\displaystyle\mathbb{E}[X_{j,k}^{(N)}]=0\quad\text{and}\quad\mathbb{E}\big{[}|X_{j,k}^{(N)}|^{2}\big{]}=\sigma^{2},\qquad\forall j<k\in[N];

\displaystyle\sup_{N\in\mathbb{N}}\sup_{j<k\in[N]}\mathbb{E}\big{[}|X_{j,k}^{(N)}|^{4}\big{]}<\infty;

\displaystyle\lim_{N\to\infty}\frac{1}{N^{2}}\sum_{j<k\in[N]}\mathbb{E}\big{[}|X_{j,k}^{(N)}|^{4}\mathbbm{1}\{|X_{j,k}^{(N)}|\geq\varepsilon N^{1/4}\}\big{]}=0,\qquad\forall\varepsilon>0.

\displaystyle\mathbb{E}\big{[}|\operatorname{Re}(X_{j,k}^{(N)})|^{2}\big{]}=\mathbb{E}\big{[}|\operatorname{Im}(X_{j,k}^{(N)})|^{2}\big{]}=\frac{\sigma^{2}}{2},\qquad\forall j<k\in[N].

\displaystyle\mathbb{E}\big{[}|\operatorname{Re}(X_{j,k}^{(N)})|^{2}\big{]}=\mathbb{E}\big{[}|\operatorname{Im}(X_{j,k}^{(N)})|^{2}\big{]}=\frac{\sigma^{2}}{2},\qquad\forall j<k\in[N].

ρ_{θ} = θ + \frac{σ ^{2}}{θ} .

ρ_{θ} = θ + \frac{σ ^{2}}{θ} .

λ_{m_{1} + \dots + m_{ℓ - 1} + i} (W_{N} + P_{N}) \to P ρ_{θ_{ℓ}};

λ_{m_{1} + \dots + m_{ℓ - 1} + i} (W_{N} + P_{N}) \to P ρ_{θ_{ℓ}};

λ_{N - m_{L} - \dots - m_{L - ℓ + 1} + i} (W_{N} + P_{N}) \to P ρ_{θ_{L - ℓ + 1}},

λ_{N - m_{L} - \dots - m_{L - ℓ + 1} + i} (W_{N} + P_{N}) \to P ρ_{θ_{L - ℓ + 1}},

μ_{a} : C ⟨ x ⟩ \to C, P \mapsto φ (P (a)),

μ_{a} : C ⟨ x ⟩ \to C, P \mapsto φ (P (a)),

φ (a_{1} a_{2} \dots a_{k}) = 0, \forall a_{j} \in \accentset \circ A_{i (j)},

φ (a_{1} a_{2} \dots a_{k}) = 0, \forall a_{j} \in \accentset \circ A_{i (j)},

ν_{a} : C ⟨ x ⟩ \to C, P \mapsto φ^{'} (P (a)) .

ν_{a} : C ⟨ x ⟩ \to C, P \mapsto φ^{'} (P (a)) .

φ^{'} (a_{1} a_{2} \dots a_{k}) = j = 1 \sum k φ (a_{1} a_{2} \dots a_{j - 1} φ^{'} (a_{j}) a_{j + 1} \dots a_{k}), \forall a_{j} \in \accentset \circ A_{i (j)} .

φ^{'} (a_{1} a_{2} \dots a_{k}) = j = 1 \sum k φ (a_{1} a_{2} \dots a_{j - 1} φ^{'} (a_{j}) a_{j + 1} \dots a_{k}), \forall a_{j} \in \accentset \circ A_{i (j)} .

φ_{t} ([a_{1} - φ_{t} (a_{1})] [a_{2} - φ_{t} (a_{2})] \dots [a_{k} - φ_{t} (a_{k})]) = O (t^{2}) as t \to 0,

φ_{t} ([a_{1} - φ_{t} (a_{1})] [a_{2} - φ_{t} (a_{2})] \dots [a_{k} - φ_{t} (a_{k})]) = O (t^{2}) as t \to 0,

ν_{x} = N \to \infty lim N (μ_{A_{N}} - μ_{x})

ν_{x} = N \to \infty lim N (μ_{A_{N}} - μ_{x})

ν_{A} (x^{ℓ})

ν_{A} (x^{ℓ})

\displaystyle=\lim_{N\to\infty}\mathbb{E}[{\operatorname{Tr}}(\mathbf{A}_{N}^{\ell})]-N\Big{(}\lim_{M\to\infty}\mu_{\mathbf{A}_{M}}(x^{\ell})\Big{)}

\displaystyle=\lim_{N\to\infty}\mathbb{E}\Big{[}\sum_{k=1}^{N}\lambda_{k}(\mathbf{A}_{N})^{\ell}\Big{]}-N\Big{(}\lim_{M\to\infty}\frac{1}{M}\mathbb{E}\Big{[}\sum_{j=1}^{M}\lambda_{j}(\mathbf{A}_{M})^{\ell}\Big{]}\Big{)},

\nu_{\mathbf{A}}=\frac{1}{2}\bigg{[}\frac{1}{2}\delta_{\pm 2\sigma}-\frac{1}{\pi\sqrt{4\sigma^{2}-t^{2}}}\,dt\bigg{]}.

\nu_{\mathbf{A}}=\frac{1}{2}\bigg{[}\frac{1}{2}\delta_{\pm 2\sigma}-\frac{1}{\pi\sqrt{4\sigma^{2}-t^{2}}}\,dt\bigg{]}.

(μ_{a}, ν_{a}) ⊞_{B} (μ_{b}, ν_{b}) = (μ_{a + b}, ν_{a + b}) .

(μ_{a}, ν_{a}) ⊞_{B} (μ_{b}, ν_{b}) = (μ_{a + b}, ν_{a + b}) .

N \to \infty lim μ_{P_{N}}

N \to \infty lim μ_{P_{N}}

N \to \infty lim N (μ_{P_{N}} - δ_{0})

\displaystyle\bigg{(}\frac{1}{2\pi\sigma^{2}}\sqrt{4\sigma^{2}-t^{2}}\,dt,0\bigg{)}\boxplus_{B}\bigg{(}\delta_{0},\sum_{j=1}^{N_{0}}\delta_{\theta_{j}}-N_{0}\delta_{0}\bigg{)}

\displaystyle\bigg{(}\frac{1}{2\pi\sigma^{2}}\sqrt{4\sigma^{2}-t^{2}}\,dt,0\bigg{)}\boxplus_{B}\bigg{(}\delta_{0},\sum_{j=1}^{N_{0}}\delta_{\theta_{j}}-N_{0}\delta_{0}\bigg{)}

=

ν_{j} = \frac{θ _{j} ( t - 2 θ _{j} )}{2 π ( θ _{j} ( t - θ _{j} ) - σ ^{2} ) 4 σ ^{2} - t ^{2}} d t

ν_{j} = \frac{θ _{j} ( t - 2 θ _{j} )}{2 π ( θ _{j} ( t - θ _{j} ) - σ ^{2} ) 4 σ ^{2} - t ^{2}} d t

\nu_{j}^{+}=\MT_{s}tart_{c}ases:nnnn{\quad}{\m@th\displaystyle#\hfil}{\m@th\displaystyle#\hfil}{\{}\mathbbm{1}\{t\in[-2\sigma,2\theta_{j}]\}\frac{d\nu_{j}}{dt}&\text{if $\theta_{j}>0$};\\ \mathbbm{1}\{t\in[2\theta_{j},2\sigma]\}\frac{d\nu_{j}}{dt}\text{if $\theta_{j}<0$}.{}

\nu_{j}^{+}=\MT_{s}tart_{c}ases:nnnn{\quad}{\m@th\displaystyle#\hfil}{\m@th\displaystyle#\hfil}{\{}\mathbbm{1}\{t\in[-2\sigma,2\theta_{j}]\}\frac{d\nu_{j}}{dt}&\text{if $\theta_{j}>0$};\\ \mathbbm{1}\{t\in[2\theta_{j},2\sigma]\}\frac{d\nu_{j}}{dt}\text{if $\theta_{j}<0$}.{}

\displaystyle\bigg{(}\frac{1}{2\pi\sigma^{2}}\sqrt{4\sigma^{2}-t^{2}}\,dt,\frac{1}{2}\bigg{[}\frac{1}{2}\delta_{\pm 2\sigma}-\frac{1}{\pi\sqrt{4\sigma^{2}-t^{2}}}\,dt\bigg{]}\bigg{)}\boxplus_{B}\bigg{(}\delta_{0},\sum_{j=1}^{N_{0}}\delta_{\theta_{j}}-N_{0}\delta_{0}\bigg{)}

\displaystyle\bigg{(}\frac{1}{2\pi\sigma^{2}}\sqrt{4\sigma^{2}-t^{2}}\,dt,\frac{1}{2}\bigg{[}\frac{1}{2}\delta_{\pm 2\sigma}-\frac{1}{\pi\sqrt{4\sigma^{2}-t^{2}}}\,dt\bigg{]}\bigg{)}\boxplus_{B}\bigg{(}\delta_{0},\sum_{j=1}^{N_{0}}\delta_{\theta_{j}}-N_{0}\delta_{0}\bigg{)}

=

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

\MHInternalSyntaxOn\MHInternalSyntaxOff

Finite-rank perturbations of random band matrices via infinitesimal free probability

Benson Au

University of California, San Diego

Department of Mathematics

9500 Gilman Drive # 0112

La Jolla, CA 92093-0112

USA

[email protected]

Abstract.

We prove a sharp $\sqrt{N}$ transition for the infinitesimal distribution of a periodically banded GUE matrix. For band widths $b_{N}=\Omega(\sqrt{N})$ , we further prove that our model is infinitesimally free from the matrix units and the normalized all-ones matrix. Our results allow us to extend previous work of Shlyakhtenko on finite-rank perturbations of Wigner matrices in the infinitesimal framework. For finite-rank perturbations of our model, we find outliers at the classical positions from the deformed Wigner ensemble.

Key words and phrases:

BBP transition; finite-rank perturbation; infinitesimal free probability; random band matrix; traffic probability; Wigner matrix

2010 Mathematics Subject Classification:

15B52; 46L53; 46L54; 60B20

1. Introduction

1.1. Motivation

The contact between random matrices and free probability first appeared in the seminal work of Voiculescu [Voi91]. By now, a well-developed theory exists to illustrate the depth of this connection: see, for example, the monographs [VDN92, NS06, AGZ10, MS17]. We summarize the basic paradigm as follows: in many generic situations, independent random matrices become freely independent in the large $N$ limit. The analytic machinery of free probability then allows us to understand various joint asymptotics associated to such multi-matrix models.

Despite the tremendous success of this approach, the standard free probability framework comes with inherent limitations. In particular, free independence only prescribes the zeroth order behavior of our random variables: for random matrices, this shortcoming already manifests itself at the level of outliers. To make this precise, we introduce some notation. In this article, we restrict our attention to self-adjoint matrices. For such a matrix $\mathbf{A}_{N}\in\operatorname{Mat}_{N}(\mathbb{C})$ , we write $(\lambda_{k}(\mathbf{A}_{N}))_{k\in[N]}$ for its eigenvalues, counting multiplicity, arranged in a non-increasing order. We further write $\mu(\mathbf{A}_{N})$ for the empirical spectral distribution (ESD) of $\mathbf{A}_{N}$ . Thus,

[TABLE]

Hereafter, when we refer to a matrix $\mathbf{A}_{N}$ , we implicitly refer to a sequence of matrices $(\mathbf{A}_{N})_{N\in\mathbb{N}}$ .

Now, suppose that we have random matrices $\mathbf{A}_{N}$ and $\mathbf{B}_{N}$ such that the ESDs converge weakly in expectation to some compactly supported probability measures $\mu_{\mathbf{A}}$ and $\mu_{\mathbf{B}}$ respectively. If we further assume that $\mathbf{A}_{N}$ and $\mathbf{B}_{N}$ are asymptotically free, then we can even compute the limiting spectral distribution (LSD) of rational functions in the pair $(\mathbf{A}_{N},\mathbf{B}_{N})$ [HMS18]. In particular, the freeness relationship completely determines the LSD of the sum $\mathbf{C}_{N}=\mathbf{A}_{N}+\mathbf{B}_{N}$ from the marginals $\mu_{\mathbf{A}}$ and $\mu_{\mathbf{B}}$ . By analogy with the classical case, this operation is known as the free (additive) convolution, for which we use the notation $\mu_{\mathbf{C}}=\mu_{\mathbf{A}}\boxplus\mu_{\mathbf{B}}$ . We recall the following characterization of the free convolution in terms of subordination functions (see, for example, [MS17, Chapter 3]). For a probability measure $\mu$ on $\mathbb{R}$ , we denote its Cauchy transform by $G_{\mu}:\mathbb{C}^{+}\to\mathbb{C}^{-}$ , where

[TABLE]

We use the notation $F_{\mu}=\frac{1}{G_{\mu}}:\mathbb{C}^{+}\to\mathbb{C}^{+}$ for the reciprocal Cauchy transform.

Theorem 1.1 ([Voi93, Bia98]).

For any pair of probability measures $\mu_{1},\mu_{2}$ on $\mathbb{R}$ , there exists a unique pair of analytic functions $\omega_{1},\omega_{2}:\mathbb{C}^{+}\to\mathbb{C}^{+}$ such that

(i)

$G_{\mu_{1}}(\omega_{1}(z))=G_{\mu_{2}}(\omega_{2}(z))$ ; 2. (ii)

$\omega_{1}(z)+\omega_{2}(z)=z+F_{\mu_{1}}(\omega_{1}(z))$ .

Moreover, the common function in property (i) corresponds to the Cauchy transform of a unique probability measure on $\mathbb{R}$ . We define the free convolution $\mu_{1}\boxplus\mu_{2}$ as this unique probability measure, namely

[TABLE]

The tools of free harmonic analysis enable a great deal of practical computations. For example, if one takes $\mathbf{A}_{N}$ to be a normalized matrix from the Gaussian unitary ensemble (GUE), then a classical result of Wigner shows that the LSD is the so-called semicircle distribution $\mu_{\mathbf{A}}=\frac{1}{2\pi}\sqrt{4-t^{2}}\,dt$ [Wig55]. At the same time, the unitary invariance of the GUE implies that $\mathbf{A}_{N}$ is asymptotically free from a large class of random matrices [Voi91]. In the setting above, one can take $\mathbf{B}_{N}$ to be an independent diagonal matrix with i.i.d. Rademacher entries, in which case $\mu_{\mathbf{B}}=\frac{1}{2}\delta_{\pm 1}$ . Implementing Theorem 1.1, we obtain the LSD of the sum $\mathbf{C}_{N}=\mathbf{A}_{N}+\mathbf{B}_{N}$ :

[TABLE]

Such additive perturbations appear naturally as models of interaction and noise. Under suitable conditions, we see that free probability allows us to understand the spectral distribution at the aggregate level; however, this approach fails to capture the behavior of the extremal eigenvalues. Indeed, consider the case of a rank one perturbation $\mathbf{B}_{N}=\theta\mathbf{E}_{N}^{(1,1)}$ , where $\mathbf{E}_{N}^{(j,k)}$ is the matrix unit in the $(j,k)$ -th coordinate and $\theta\in\mathbb{R}$ . For $\mathbf{A}_{N}$ GUE as before, the free convolution calculation $\mu_{\mathbf{C}}=\mu_{\mathbf{A}}\boxplus\mu_{\mathbf{B}}$ reduces to the trivial identity

[TABLE]

From this perspective, the effect of the perturbation $\mathbf{B}_{N}=\theta\mathbf{E}_{N}^{(1,1)}$ appears no different than the unperturbed model $\mathbf{B}_{N}=0$ .

In actuality, we know that the behavior of the extremal eigenvalue exhibits a phase transition depending on the magnitude $|\theta|$ of the perturbation (a so-called BBP transition in view of the original work [BBAP05] on complex sample covariance matrices). In the case of the deformed GUE, Péché showed that the fluctuations of the extremal eigenvalue deviate from the Tracy-Widom distribution [TW94] when $|\theta|\geq 1$ with the extremal eigenvalue even separating from the bulk when $|\theta|>1$ [Péc06]. The unitary invariance of the GUE implies that the same result holds for any rank one self-adjoint perturbation with nontrivial eigenvalue $\theta$ . Féral and Péché then extended the result to complex sub-Gaussian Wigner matrices under the perturbation $\mathbf{B}_{N}^{\prime}=\frac{\theta}{N}\mathbf{J}_{N}$ , where $\mathbf{J}_{N}$ is the all-ones matrix [FP07]. Notably, they proved the universality of the fluctuations of the extremal eigenvalue (cf. [FK81, Sos99]). Maïda established a large deviation principle for the extremal eigenvalue of the deformed Gaussian ensembles: as a corollary, this proves the same bulk separation phenomenon for the deformed Gaussian orthogonal ensemble (GOE) [Maï07]. Capitaine, Donati-Martin, and Féral generalized the bulk separation phenomenon to finite-rank perturbations: for example, $\mathbf{B}_{N}$ of the form $\sum_{j=1}^{N_{0}}\theta_{j}\mathbf{E}_{N}^{(j,j)}$ for some fixed $N_{0}$ . In this case, multiple eigenvalues exit the bulk, one for each value of $|\theta_{j}|>1$ . Their result holds for general Wigner matrices, real and complex, under the technical assumption that the entries satisfy a Poincaré inequality. At the same time, they extended the universality of the fluctuations of the extremal eigenvalue under perturbations of the form $\mathbf{B}_{N}^{\prime}=\frac{\theta}{N}\mathbf{J}_{N}$ to real Wigner matrices. In contrast, they also proved the non-universality of the fluctuations of the extremal eigenvalue under perturbations of the form $\mathbf{B}_{N}=\theta\mathbf{E}_{N}^{(1,1)}$ [CDMF09]. In a later work, the same authors also determined the joint fluctuations of the extremal eigenvalues [CDMF12]. Pizzo, Renfrew, and Soshnikov [PRS13] and later Renfrew and Soshnikov [RS13] removed the technical assumptions for these results: the version we state below is due to them. For additional reading and related results, see the surveys [Péc14, CDM17].

Theorem 1.2 (BBP transition).

For each $N\in\mathbb{N}$ , let $(X_{j,k}^{(N)})_{j\leq k\in[N]}$ be a family of independent random variables, the off-diagonal entries $j<k$ possibly being complex-valued. We assume that the diagonal entries $j=k$ are centered with uniformly bounded variance satisfying the Lindeberg condition:

[TABLE]

For $(X_{j,k}^{(N)})_{j<k\in[N]}$ real-valued, we assume that the off-diagonal entries $j<k$ are centered with identical variance and uniformly bounded fourth moments satisfying a Lindeberg type condition:

[TABLE]

For $(X_{j,k}^{(N)})_{j<k\in[N]}$ complex-valued, we assume that the real and imaginary parts of each off-diagonal entry are independent with identical variance in addition to the conditions above. As a consequence,

[TABLE]

Let $\mathbf{X}_{N}(j,k)=X_{j,k}^{(N)}$ denote the corresponding (unnormalized) Wigner matrix with the usual normalization $\mathbf{W}_{N}=\frac{1}{\sqrt{N}}\mathbf{X}_{N}$ . Assume that $\mathbf{P}_{N}$ is a deterministic self-adjoint matrix of the same symmetry class as $\mathbf{W}_{N}$ with fixed rank $r$ independent of the dimension. We further assume that the non-trivial eigenvalues of $\mathbf{P}_{N}$ are independent of $N$ , say $\theta_{1}>\cdots>\theta_{L}$ , where $\theta_{\ell}\neq 0$ occurs with multiplicity $m_{\ell}$ for $\ell\in[L]$ . Let $L_{+\sigma}=\#(\ell\in[L]:\theta_{\ell}>\sigma)$ and $L_{-\sigma}=\#(\ell\in[L]:\theta_{\ell}<-\sigma)$ , and define

[TABLE]

Then we have the following asymptotic behavior at the edge of the spectrum of the deformed Wigner ensemble $\mathbf{W}_{N}+\mathbf{P}_{N}$ :

(i)

For any $\ell\in[L_{+\sigma}]$ and $i\in[m_{\ell}]$ ,

[TABLE] 2. (ii)

$\lambda_{m_{1}+\cdots+m_{L_{+\sigma}}+1}(\mathbf{W}_{N}+\mathbf{P}_{N})\overset{\mathbb{P}}{\to}2\sigma$ ; 3. (iii)

$\lambda_{N-m_{L}-\cdots-m_{L-L_{-\sigma}+1}}(\mathbf{W}_{N}+\mathbf{P}_{N})\overset{\mathbb{P}}{\to}-2\sigma$ ; 4. (iv)

For any $\ell\in[L_{-\sigma}]$ and $i\in[m_{L-\ell+1}]$ ,

[TABLE]

where $\overset{\mathbb{P}}{\to}$ denotes convergence in probability.

Recall that our earlier free convolution calculation failed to identify such outliers. Nevertheless, it turns out that the behavior of the outlying eigenvalues (as well as their eigenvectors) can be understood in terms of the subordination functions $\omega_{\ell}$ from Theorem 1.1 [CDMFF11, Cap13, BBCF17] (see also [BGN11] for related results). This suggests that free probability may yet prove useful to this end. Shlyakhtenko explained this connection using the framework of infinitesimal free probability, an extension of free probability to the first order. In particular, by calculating a type B free convolution, one obtains the $\frac{1}{N}$ correction to the LSD of such deformed ensembles. The outlying eigenvalues then appear in this correction in the form of Dirac masses [Shl18]. We review this framework in the next section.

1.2. Background

We begin by recalling the usual free probability framework.

Definition 1.3 (Free probability).

By a non-commutative (NC) probability space $(\mathcal{A},\varphi)$ , we mean a unital algebra $\mathcal{A}$ over $\mathbb{C}$ paired with a unital linear functional $\varphi:\mathcal{A}\to\mathbb{C}$ . We say that $\varphi$ is tracial if $\varphi(ab)=\varphi(ba)$ for all $a,b\in\mathcal{A}$ . The distribution of a family of random variables $\mathbf{a}=(a_{i})_{i\in I}\subset\mathcal{A}$ is the linear functional

[TABLE]

where $\mathbf{x}=(x_{i})_{i\in I}$ is a set of non-commuting indeterminates and $P(\mathbf{a})\in\mathcal{A}$ is the usual evaluation of NC polynomials. A sequence of families $(\mathbf{a}_{N})_{N\in\mathbb{N}}$ , each living in a possibly different NC probability space $(\mathcal{A}_{N},\varphi_{N})$ , converges in distribution if the sequence $(\mu_{\mathbf{a}_{N}})_{N\in\mathbb{N}}$ converges pointwise. Note that the limit defines a new NC probability space $(\mathbb{C}\langle\mathbf{x}\rangle,\lim_{N\to\infty}\mu_{\mathbf{a}_{N}})$ .

Unital subalgebras $(\mathcal{A}_{i})_{i\in I}$ of $\mathcal{A}$ are said to be freely independent (or simply free) if for any $k\geq 2$ and consecutively distinct indices $i(1)\neq i(2)\neq\cdots\neq i(k)$ ,

[TABLE]

where $\accentset{\circ}{\mathcal{A}}_{i(j)}=\{a\in\mathcal{A}_{i(j)}:\varphi(a)=0\}$ denotes the subspace of centered elements. We say that collections of random variables $(\mathcal{S}_{i})_{i\in I}$ are free if the unital subalgebras that they generate are free. If a sequence of families $(\mathbf{a}_{N})_{N\in\mathbb{N}}$ converges in distribution, then we say that the random variables $\mathbf{a}_{N}=(a_{N}^{(i)})_{i\in I}$ are asymptotically free if the indeterminates $\mathbf{x}=(x_{i})_{i\in I}$ are free in $(\mathbb{C}\langle\mathbf{x}\rangle,\lim_{N\to\infty}\mu_{\mathbf{a}_{N}})$ .

*Remark 1.4**.*

The reader might wonder how the notion of a distribution above relates to the usual notion of a distribution for a real-valued random variable. If we assume both existence and uniqueness for the moment problem defined by $\mu_{a}$ , then the two notions coincide. The moment sequences we consider in this paper will satisfy this assumption, so we speak of the two notions interchangeably. In particular, if $a,b\in(\mathcal{A},\varphi)$ are free with determinate moment problems, then $\mu_{a+b}=\mu_{a}\boxplus\mu_{b}$ .

Example 1.5 (Random matrices).

Let $\operatorname{Mat}_{N}(L^{\infty-}(\Omega,\mathcal{F},\mathbb{P}))$ denote the algebra of random $N\times N$ matrices whose entries, possibly complex-valued, have finite absolute moments of all orders. Then $(\operatorname{Mat}_{N}(L^{\infty-}(\Omega,\mathcal{F},\mathbb{P})),\frac{1}{N}\mathbb{E}[{\operatorname{Tr}}(\cdot)])$ defines a tracial NC probability space.

Voiculescu showed that independent unitarily invariant random matrices are asymptotically free [Voi91], the GUE being a prototypical example. Dykema later extended this result to general Wigner matrices [Dyk93]. We now know freeness to be an ubiquitous phenomenon for invariant/mean-field multi-matrix models in the large $N$ limit [MS17] (see also [Spe17]).

Understanding the spectral behavior of non mean-field ensembles constitutes a major ongoing program of research, where random band matrices emerge as an attractive interpolative model (see [Bou18] and the references therein). Here, the primary questions concern the local eigenvalue statistics and localization versus delocalization for the eigenvectors. In a different direction, we showed that freeness governs random band matrices for band widths $1\ll b_{N}\ll N$ [Au18], motivating the investigations in this paper at the infinitesimal level. The results in [Au18] rely on an extension of free probability introduced by Male called traffic probability [Mal]: we make use of the traffic framework again, this time in conjunction with the infinitesimal framework. We refer the reader to [Mal17, MP, Gaba, Gabb, Gabc, CDM, ACD*+*] for additional reading on traffic probability and its applications.

Belinschi and Shlyakhtenko introduced infinitesimal free probability in [BS12] to provide an analytic interpretation of the type $B$ free probability of Biane, Goodman, and Nica [BGN03]. We content ourselves with the basic framework: for more on the interplay between these two notions, see [FN10]. For recent work on infinitesimal free probability and its applications to random matrices, we mention the contributions [Min, DF, Tse].

Definition 1.6 (Infinitesimal free probability).

By an infinitesimal NC probability space $(\mathcal{A},\varphi,\varphi^{\prime})$ , we mean a NC probability space $(\mathcal{A},\varphi)$ with an additional linear functional $\varphi^{\prime}:\mathcal{A}\to\mathbb{C}$ satisfying $\varphi^{\prime}(1)=0$ . The infinitesimal distribution of a family of random variables $\mathbf{a}=(a_{i})_{i\in I}\subset\mathcal{A}$ is the linear functional

[TABLE]

We refer to the pair $(\mu_{\mathbf{a}},\nu_{\mathbf{a}})$ as the type $B$ distribution of $\mathbf{a}$ .

Unital subalgebras $(\mathcal{A}_{i})_{i\in I}$ of $\mathcal{A}$ are said to be infinitesimally free if

(i)

the $(\mathcal{A}_{i})_{i\in I}$ are free in $(\mathcal{A},\varphi)$ ; 2. (ii)

for any $k\geq 2$ and consecutively distinct indices $i(1)\neq i(2)\neq\cdots\neq i(k)$ ,

[TABLE]

Conditions (i) and (ii) are equivalent to the following asymptotic:

[TABLE]

where $a_{j}\in\mathcal{A}_{i(j)}$ and $\varphi_{t}=\varphi+t\varphi^{\prime}$ for $t\in\mathbb{R}$ . Thus, heuristically, we think of infinitesimal freeness as “freeness to the first order”.

*Remark 1.7**.*

In view of Remark 1.4, the reader might wonder how the notion of an infinitesimal distribution relates to the usual notion of a signed measure on the real line. If we assume both existence and uniqueness for the signed moment problem defined by $\nu_{a}$ , then the two notions coincide. The signed moment sequences we consider in this paper will typically satisfy this assumption, so we speak of the two notions interchangeably when possible. Note that the condition $\nu_{a}(1)=\varphi^{\prime}(1)=0$ implies that the corresponding signed measure has total mass zero.

Example 1.8 (Random matrices, revisited).

Let $\mathcal{A}_{N}=(\mathbf{A}_{N}^{(i)})_{i\in I}$ be a family of random matrices in $(\operatorname{Mat}_{N}(L^{\infty-}(\Omega,\mathcal{F},\mathbb{P})),\frac{1}{N}\mathbb{E}[{\operatorname{Tr}}(\cdot)])$ . Assume that $\mathcal{A}_{N}$ converges in distribution with limit $\mu_{\mathbf{x}}=\lim_{N\to\infty}\mu_{\mathcal{A}_{N}}$ . If we further assume that the limit

[TABLE]

exists, then $(\mathbb{C}\langle\mathbf{x}\rangle,\mu_{\mathbf{x}},\nu_{\mathbf{x}})$ defines a tracial infinitesimal NC probability space (both $\mu_{\mathbf{x}}$ and $\nu_{\mathbf{x}}$ vanish on the commutators). By a slight abuse of terminology, we often refer to $\nu_{\mathbf{x}}$ (resp., $(\mu_{\mathbf{x}},\nu_{\mathbf{x}})$ ) as the infinitesimal distribution (resp., type $B$ distribution) of $\mathcal{A}_{N}$ .

In the single matrix case, say $\mathbf{A}_{N}$ , the infinitesimal distribution $\nu_{\mathbf{A}}$ corresponds to the $\frac{1}{N}$ correction to the LSD $\mu_{\mathbf{A}}$ . Indeed, by definition,

[TABLE]

where we recall that $\mathbf{A}_{N}$ is assumed to be self-adjoint. For example, in the case of $\mathbf{A}_{N}\overset{d}{=}\operatorname{GUE}(N,\frac{\sigma^{2}}{N})$ , the infinitesimal distribution is null $\nu_{\mathbf{A}}=0$ , a consequence of the genus expansion [HZ86]. On the other hand, a result of Johansson [Joh98] shows that the situation becomes much different for $\mathbf{A}_{N}\overset{d}{=}\operatorname{GOE}(N,\frac{\sigma^{2}}{N})$ , where

[TABLE]

We mention that such corrections also exist for complex Wishart matrices [MN04, Min] and $\beta$ -ensembles [DE06].

Note that the eigenvalues $(\lambda_{k}(\mathbf{A}_{N}))_{k\in[N]}$ appear in (1) via the unnormalized trace. This suggests that the infinitesimal distribution is sensitive to outliers. To see this, we will need the following subordination result for the type $B$ free (additive) convolution.

Theorem 1.9 ([BS12]).

Suppose that $a,b\in(\mathcal{A},\varphi,\varphi^{\prime})$ are infinitesimally free with compactly supported type $B$ distributions $(\mu_{a},\nu_{a}),(\mu_{b},\nu_{b})\in\mathcal{P}(\mathbb{R})\times\mathcal{M}_{0}(\mathbb{R})$ . By this, we mean that both coordinates of the type $B$ distribution have compact support. Then, in the notation of Theorem 1.1, the sum $a+b$ also has a compactly supported type $B$ distribution $(\mu_{a+b},\nu_{a+b})\in\mathcal{P}(\mathbb{R})\times\mathcal{M}_{0}(\mathbb{R})$ characterized by

(i)

$\mu_{a+b}=\mu_{a}\boxplus\mu_{b}$ ; 2. (ii)

$G_{\nu_{a+b}}(z)=G_{\nu_{a}}(\omega_{a}(z))\omega_{a}^{\prime}(z)+G_{\nu_{b}}(\omega_{b}(z))\omega_{b}^{\prime}(z)$ ,

where $\omega_{a}^{\prime}(z),\omega_{b}^{\prime}(z)$ denote the usual derivatives. We define the type $B$ convolution $(\mu_{a},\nu_{a})\boxplus_{B}(\mu_{b},\nu_{b})$ as this unique type $B$ distribution, namely

[TABLE]

Theorem 1.10 ([Shl18]).

Let $\mathbf{W}_{N}\overset{d}{=}\operatorname{GUE}/\operatorname{GOE}(N,\frac{\sigma^{2}}{N})$ . Then for any fixed $N_{0}$ , the matrices $\mathbf{W}_{N}$ and $(\mathbf{E}_{N}^{(j,k)})_{j,k\in[N_{0}]}$ are asymptotically infinitesimally free.

Of course, we can easily compute the type $B$ distribution of the matrix units. For $\mathbf{P}_{N}=\sum_{j=1}^{N_{0}}\theta_{j}\mathbf{E}_{N}^{(j,j)}$ , we see that

[TABLE]

Using Theorem 1.9, one obtains the $\frac{1}{N}$ correction to the LSD of the deformed Gaussian ensemble $\mathbf{W}_{N}+\mathbf{P}_{N}$ (cf. Theorem 1.2).

Corollary 1.11 ([Shl18]).

If $\mathbf{W}_{N}\overset{d}{=}\operatorname{GUE}(N,\frac{\sigma^{2}}{N})$ and $\mathbf{P}_{N}=\sum_{j=1}^{N_{0}}\theta_{j}\mathbf{E}_{N}^{(j,j)}$ , then the type $B$ distribution of $\mathbf{W}_{N}+\mathbf{P}_{N}$ is given by

[TABLE]

where

[TABLE]

is a probability measure if $|\theta_{j}|\geq\sigma$ ; otherwise, $\nu_{j}$ is a signed measure of total mass zero with Jordan decomposition $\nu_{j}=\nu_{j}^{+}-\nu_{j}^{-}$ , where

[TABLE]

If instead $\mathbf{W}_{N}\overset{d}{=}\operatorname{GOE}(N,\frac{\sigma^{2}}{N})$ , then the type $B$ distribution of $\mathbf{W}_{N}+\mathbf{P}_{N}$ is given by

[TABLE]

where $\nu_{j}$ is as before.

The proof of Theorem 1.10 relies on Wick’s formula for Gaussian integration. Naturally, one can ask if the result extends to general Wigner matrices. In this case, one needs to first prove the existence of an infinitesimal distribution for the single matrix model, a calculation carried out by Enriquez and Ménard (see also [KKP96]). We state a slight generalization of their result to allow for entries with possibly different distributions: the proof remains unchanged.

Theorem 1.12 ([EM16]).

For each $N\in\mathbb{N}$ , let $(X_{j,k}^{(N)})_{j\leq k\in[N]}$ be a family of independent random variables, the off-diagonal entries $j<k$ possibly being complex-valued. We assume that the diagonal entries $j=k$ are centered with identical variance:

[TABLE]

For $(X_{j,k}^{(N)})_{j<k\in[N]}$ real-valued ( $\beta=1$ ), we assume that the off-diagonal entries $j<k$ are centered with identical variance and fourth moments:

[TABLE]

For $(X_{j,k}^{(N)})_{j<k\in[N]}$ complex-valued ( $\beta=2$ ), we assume that the pseudo-variance of each off-diagonal entry vanishes in addition to the conditions above:

[TABLE]

Lastly, we assume a strong uniform control on the moments:

[TABLE]

Then the corresponding Wigner matrix $\mathbf{W}_{N}(j,k)=\frac{1}{\sqrt{N}}X_{j,k}^{(N)}$ has an infinitesimal distribution $\nu=\frac{1}{2}\Big{[}\frac{\mathbbm{1}\{\beta=1\}}{2}\delta_{\pm 2\sigma}+\nu_{\operatorname{ac}}\Big{]}$ , where

[TABLE]

1.3. Statement of results

Our first result extends Theorem 1.10 to general Wigner matrices. We also consider perturbations of the form $\frac{\theta}{N}\mathbf{J}_{N}$ , where we recall that $\mathbf{J}_{N}$ is the all-ones matrix.

Theorem 1.13.

Let $\mathbf{W}_{N}$ be a Wigner matrix of the form in Theorem 1.12. Then for any fixed $N_{0}$ , the matrices $\mathbf{W}_{N}$ , $(\mathbf{E}_{N}^{(j,k)})_{j,k\in[N_{0}]}$ , and $\frac{1}{N}\mathbf{J}_{N}$ are asymptotically infinitesimally free.

Note that the type $B$ distribution of $\frac{\theta}{N}\mathbf{J}_{N}$ is identical to that of $\theta\mathbf{E}_{N}^{(j,j)}$ , allowing us to essentially repeat the calculation of Corollary 1.11.

Corollary 1.14.

The type $B$ distribution of the deformed Wigner ensemble

[TABLE]

is given by

[TABLE]

where $\nu$ is as in Theorem 1.12 and $\nu_{j}$ is as in Corollary 1.11.

*Remark 1.15**.*

The result above shows that while the infinitesimal distribution is sensitive to outliers, it fails to distinguish their fluctuations. Indeed, recall that the fluctuations of the extremal eigenvalue under perturbations of the form $\theta\mathbf{E}_{N}^{(1,1)}$ (resp., $\frac{\theta}{N}\mathbf{J}_{N}$ ) are non-universal (resp., universal) for $|\theta|>\sigma$ , whereas the infinitesimal distribution of $\mathbf{W}_{N}+\theta\mathbf{E}_{N}^{(1,1)}$ and $\mathbf{W}_{N}+\frac{\theta}{N}\mathbf{J}_{N}$ are identical. In general, the fluctuations of the extremal eigenvalues depend on the geometry of the eigenvectors of the perturbation: localized (as in the case of $\sum_{j=1}^{N_{0}}\theta_{j}\mathbf{E}_{N}^{(j,j)}$ ) versus delocalized (as in the case of $\frac{\theta_{N_{0}+1}}{N}\mathbf{J}_{N}$ ) [CDMF12].

The usual strategy for studying outliers relies on a fine analysis of the resolvent, using delicate estimates currently unavailable for non mean-field ensembles. In contrast, the purview of the infinitesimal framework extends quite naturally to random band matrices. We restrict ourselves to the idealized situation of a periodically banded GUE matrix.

Definition 1.16 (Random band matrix).

Let $\mathbf{X}_{N}\overset{d}{=}\operatorname{GUE}(N,\sigma^{2})$ . For a band width $b_{N}\geq 0$ , we define $\mathbf{B}_{N}$ to be the corresponding periodic band matrix of ones:

[TABLE]

where

[TABLE]

We assume that the band width $b_{N}\to\infty$ , and we set

[TABLE]

We call the random matrix

[TABLE]

a (normalized) periodically banded GUE matrix (of band width $b_{N}$ ). Of course, if $b_{N}\geq\lfloor N/2\rfloor$ , then $\mathbf{\Xi}_{N}\overset{d}{=}\operatorname{GUE}(N,\frac{\sigma^{2}}{N})$ .

Bogachev, Molchanov, and Pastur proved that the ESD $\mu(\mathbf{\Xi}_{N})$ converges weakly almost surely to the semicircle distribution [BMP91]. In particular, this holds regardless of the rate $b_{N}\to\infty$ because of the periodic band width structure (2). We considered the multi-matrix case in [Au18], where it was shown that independent copies $(\mathbf{\Xi}_{N}^{(i)})_{i\in I}$ of $\mathbf{\Xi}_{N}$ are asymptotically free, regardless of the relative rates of growth of the band widths $(b_{N}^{(i)})_{i\in I}$ . So, for example, it could be that

[TABLE]

We highlight this homogeneity around $\sqrt{N}$ because of its conjectural role, confirmed at the level of physical rigor, as the critical value for the localization-delocalization transition for random band matrices (again, see [Bou18] for a recent survey).

While the rate $b_{N}\to\infty$ did not play a role in our calculations at the zeroth order, a $\sqrt{N}$ factor appears quite naturally at the first order. Our next result proves a sharp transition for the infinitesimal distribution around this rate.

Theorem 1.17.

Let $\mathbf{\Xi}_{N}$ be a periodically banded GUE matrix of band width $b_{N}$ . Then for any $\ell\in\mathbb{N}$ ,

[TABLE]

where ${\operatorname{Cat}}(\ell)=\frac{\binom{2\ell}{\ell}}{\ell+1}$ is the $\ell$ th Catalan number, $m_{2}(\sigma^{2},c)=0$ , and $m_{2\ell}(\sigma^{2},c)\in(0,\infty)$ for $\ell\geq 2$ . In particular, if $b_{N}\gg\sqrt{N}$ , then the type $B$ distribution of $\mathbf{\Xi}_{N}$ exists and agrees with that of a usual GUE matrix $\mathbf{W}_{N}$ .

The numbers $m_{2\ell}(\sigma^{2},c)$ correspond to sums of volumes of regions cut out of a hypercube and satisfy

[TABLE]

Thus, a solution to the signed moment problem defined by the sequence

[TABLE]

would necessarily be unique; however, we do not prove existence. Nevertheless, given a finite limit for the infinitesimal distribution, we can consider the question of finite-rank perturbations.

Theorem 1.18.

Let $\mathbf{\Xi}_{N}$ be a periodically banded GUE matrix of band width $b_{N}$ such that $b_{N}\gg\sqrt{N}$ or $\lim_{N\to\infty}\frac{b_{N}}{N}=c\in(0,\infty)$ . Then for any fixed $N_{0}$ , the matrices $\mathbf{\Xi}_{N}$ , $(\mathbf{E}_{N}^{(j,k)})_{j,k\in[N_{0}]}$ , and $\frac{1}{N}\mathbf{J}_{N}$ are asymptotically infinitesimally free.

For band widths $b_{N}\gg\sqrt{N}$ , this allows us to repeat the calculation of Corollary 1.11. In particular, we find outliers at the classical positions from the deformed Wigner ensemble.

Corollary 1.19.

For $b_{N}\gg\sqrt{N}$ , the type $B$ distribution of the deformed RBM

[TABLE]

is given by

[TABLE]

where $\nu_{j}$ is as in Corollary 1.11.

*Remark 1.20**.*

A solution to the signed moment problem at the rate $b_{N}\asymp\sqrt{N}$ would allow us to deduce the type $B$ distribution of the corresponding deformed model: one simply needs to add the hypothetical signed measure to the infinitesimal distribution in Corollary 1.19.

In this article, we consider the BBP transition for random band matrices exclusively within the infinitesimal framework. Naturally, one can ask if the usual form of these results hold, namely, convergence in probability of the extremal eigenvalues and convergence in distribution of the fluctuations. This will be the subject of future work. In the next section, we record the outcome of numerical simulations for various band widths. Notably, the data suggests that the position of the outliers and their fluctuations extend below the rate $b_{N}\asymp\sqrt{N}$ .

2. Numerical simulations

We consider the fluctuations of the largest eigenvalue under both localized and delocalized perturbations of our model separately. In particular, we record the data

[TABLE]

for 5000 realizations of the matrix $\mathbf{\Xi}_{N}$ , where $\sigma^{2}=1$ , $\theta=2$ , and $N=7776$ . The peculiar choice of dimension allows for the precise band widths $b_{N}=N^{3/5}=216$ and $b_{N}=N^{2/5}=36$ . For reference, we also consider the band width $b_{N}=\lfloor N/2\rfloor$ , in which case $\mathbf{\Xi}_{N}$ reduces to the usual GUE and $F_{N,1}(\lfloor N/2\rfloor),F_{N,2}(\lfloor N/2\rfloor)\overset{d}{\to}\mathcal{N}(0,1)$ by a result of Péché [Péc06]. We emphasize the difference in scaling between $F_{N,1}(b_{N})$ and $F_{N,2}(b_{N})$ . Indeed, the data strongly suggests that we still have the convergence $F_{N,1}(b_{N}),F_{N,2}(b_{N})\overset{d}{\to}\mathcal{N}(0,1)$ under the respective normalizations (even at the rate $b_{N}=N^{2/5}\ll\sqrt{N}$ ). The simulations were performed in Julia [BEKS17] and the data plotted using Gadfly [JAN*+*18].

The scaling in $F_{N,1}$ should come as no surprise. To see this, note that the periodic band width structure in some sense reduces the trace expansion at each entry locally to that of a $\xi_{N}\times\xi_{N}$ matrix. So, heuristically, we think of $\theta\mathbf{E}_{N}^{(1,1)}$ as a perturbation of $\mathbf{W}_{\xi_{N}}\overset{d}{=}\operatorname{GUE}(\xi_{N},\frac{\sigma^{2}}{\xi_{N}})$ . On the other hand, in the case of $F_{N,2}$ , adding $\frac{\theta}{N}\mathbf{J}_{N}$ forces us to consider the entire $N\times N$ matrix, removing any notion of homogeneity. Moreover, the entries of this perturbation come in at a different scale than our matrix entries $\mathbf{\Xi}_{N}=\frac{1}{\sqrt{\xi_{N}}}\mathbf{B}_{N}\circ\mathbf{X}_{N}$ . We can still make (non-rigorous) sense of the scaling in $F_{N,2}$ by considering the trace expansion as a choice at each entry between the original matrix $\mathbf{X}_{N}(i,j)$ and the perturbation $\frac{\theta}{N}$ , where the first option is available iff $|i-j|_{N}\leq b_{N}$ . But this precisely balances with the normalization of the entries in $\mathbf{\Xi}_{N}$ , and so the scaling should follow the usual case of $\mathbf{W}_{N}+\frac{\theta}{N}\mathbf{J}_{N}$ .

For (undeformed) random band matrices, Sodin proved that the extremal eigenvalues converge to the edge of the support for band widths $b_{N}\gg\log(N)$ with the fluctuations exhibiting a crossover at the rate $b_{N}\asymp N^{5/6}$ [Sod10]. The simulations do not support the idea of a similar crossover for the deformed model, suggesting that the perturbations regularize the fluctuations of the extremal eigenvalues.

3. The infinitesimal distribution of a random band matrix

For convenience, we fix the variance $\sigma^{2}=1$ in this section: the general result follows from a simple scaling. Section 3.1 proves the existence of an infinitesimal distribution for a periodically banded GUE matrix in the regime $b_{N}=\Omega(\sqrt{N})$ using a band variant of the genus expansion. Section 3.2 then proves the asymptotic infinitesimal freeness of our model from the matrix units and the normalized all-ones matrix, allowing us to carry out the advertised type $B$ free convolution calculation.

3.1. A band variant of the genus expansion

We consider traces in powers of our matrix $\mathbf{\Xi}_{N}$ . To begin, note that

[TABLE]

This follows from the usual symmetry argument, which still holds even in the presence of the band width condition. We turn our attention to the even powers, where we must now account for the band width explicitly:

[TABLE]

where $\eta(2\ell+1)=\eta(1)$ . Using Wick’s formula, we obtain the expansion

[TABLE]

where $\gamma=(1,2,\ldots,2\ell)\in\mathfrak{S}_{2\ell}$ . Here, we consider a pair partition $\pi$ as a $2\ell$ -permutation when computing the composition $\eta\circ\gamma\circ\pi$ . Interchanging the sums, we arrive at the expression

[TABLE]

where

[TABLE]

Note that we have the simple upper bound

[TABLE]

where $\#(\gamma\circ\pi)$ denotes the number of cycles of $\gamma\circ\pi\in\mathfrak{S}_{2\ell}$ . Indeed, starting with an arbitrary cycle of $\gamma\circ\pi$ , say the cycle that contains 1, we have $N$ choices for the common index $\eta(1)\in[N]$ of the elements in this cycle. After making this choice, we must then choose the indices of the remaining cycles to satisfy the band width condition $|\eta(j)-\eta(j+1)|_{N}\leq b_{N}$ , for which there are at most $\xi_{N}$ choices at each step. In general, this upper bound is strict: by the time you arrive to choose the index of a cycle of $\gamma\circ\pi$ , you might have fewer than $\xi_{N}$ choices if the cycle is neighboring two cycles whose indices have already been chosen. As an example, take $\ell=4$ and $\pi=(1,5)(2,8)(3,7)(4,6)$ . In this case, $\gamma\circ\pi=(1,6,5,2)(3,8)(4,7)$ . Suppose that we pick the indices $\eta(1)=1$ and $\eta(3)=1+b_{N}$ for the cycles $(1,6,5,2)$ and $(3,10)$ respectively. Then the index $\eta(4)$ of the cycle $(4,9)$ must satisfy both

[TABLE]

If we assume that $b_{N}\ll N$ , then we only have $1+b_{N}<\xi_{N}$ choices for $\eta(4)$ .

We quickly see the problem. By using up all of our leeway, we could potentially leave the indices too far apart to meet up again. For a simple parallel, consider placing three points $p_{1},p_{2},p_{3}$ in $\mathbb{R}^{2}$ such that any pair of points must be within unit distance of each other. Choosing the first point arbitrarily, say at the origin, and placing $p_{2}$ at $(1,0)$ , we can no longer place $p_{3}$ at an arbitrary point in the unit circle. This analogy gives us a lower bound for our original problem. If we instead divide our leeway by $\#(\gamma\circ\pi)-1$ and pick the indices of the successive cycles arbitrarily at periodic distance less than or equal to this quotient, then we will stay within the permitted region (essentially just the triangle inequality). Thus,

[TABLE]

We define a graph to keep track of the constraints on $\eta$ induced by cycles of $\gamma\circ\pi$ with adjacent elements $j,j+1$ . Let $C_{2\ell}$ be the directed cycle graph on the vertices $V_{2\ell}=(v_{j})_{j=1}^{2\ell}$ with edges $E_{2\ell}=(e_{j})_{j=1}^{2\ell}$ in the direction $v_{j}\xrightarrow{e_{j}}v_{j+1}$ . We equate the map $\eta:[2\ell]\to[N]$ with a labeling $\eta:V_{2\ell}\to[N]$ of the vertices in the obvious way. The edges then indicate the band width constraint by virtue of the equivalence

[TABLE]

At the moment, the direction of the edges do not play a role.

For a pair partition $\pi\in\mathcal{P}_{2}(2\ell)$ , we define $C_{2\ell}^{\pi}$ as the directed multigraph obtained from $C_{2\ell}$ by identifying the vertices $V_{2\ell}$ according to the blocks of $\pi$ as follows: if $(j<k)$ is a block of $\pi$ , then we identify the source of the edge $e_{j}$ with the target of the edge $e_{k}$ (so $v_{j}\overset{\pi}{\sim}v_{k+1}$ ) and the source of the edge $e_{k}$ with the target of the edge $e_{j}$ (so $v_{k}\overset{\pi}{\sim}v_{j+1}$ ). In other words, for each block $(j<k)\in\pi$ , we overlay the edges $e_{j}$ and $e_{k}$ head-to-tail. The vertices in the graph $C_{2\ell}^{\pi}$ correspond to the cycles of $\gamma\circ\pi$ with the edges indicating a constraint on the labels of the cycles induced by the constraint on the labels of the vertices. Note that the graph $C_{2\ell}^{\pi}$ might have loops. Of course, the constraint from a loop is vacuous, nor do the multiplicity/direction of the edges indicate any additional constraint at the level of the cycles of $\gamma\circ\pi$ . So, we define $\underline{C}_{2\ell}^{\pi}$ as the underlying simple graph.

At the same time, we know that

[TABLE]

where

[TABLE]

by a result of Biane [Bia97]. In particular, if $\pi\in\mathcal{NC}_{2}(2\ell)$ , then the graph $C_{2\ell}^{\pi}$ is a double tree in the sense of Male [Mal]. By this, we mean that $C_{2\ell}^{\pi}$ has no loops and $\underline{C}_{2\ell}^{\pi}$ is a tree such that the multiplicity of each edge in $C_{2\ell}^{\pi}$ is two (so-called twin edges). To see this, note that the graph $\underline{C}_{2\ell}^{\pi}=(\underline{V}_{2\ell}^{\pi},\underline{E}_{2\ell}^{\pi})$ is connected with $\#(\underline{V}_{2\ell}^{\pi})=\#(\gamma\circ\pi)=\ell+1$ , which implies that $\#(\underline{E}_{2\ell}^{\pi})\geq\ell$ . At the same time, $C_{2\ell}^{\pi}$ is obtained from $C_{2\ell}$ by overlaying pairs of edges, whence $\#(\underline{E}_{2\ell}^{\pi})\leq\ell$ . Thus,

[TABLE]

as was to be shown. In the case of a tree $\underline{C}_{2\ell}^{\pi}$ , we do not run into a problem when choosing the vertices greedily using the entire leeway at each step, and so the upper bound for $Q(\ell,N,b_{N},\pi)$ in (6) becomes an equality. In other words,

[TABLE]

Indeed, recall that our earlier counterexample

[TABLE]

Let $\mathcal{C}_{2}(2\ell)=\mathcal{P}_{2}(2\ell)\setminus\mathcal{NC}_{2}(2\ell)$ . Applying (7) to our earlier (4) and rearranging, we obtain

[TABLE]

Using our bounds (6) for $Q(\ell,N,b_{N},\pi)$ , we see that

[TABLE]

Since $\#(\gamma\circ\pi)\leq\ell+1$ , we also have the lower bound

[TABLE]

At this point, we use the genus expansion to count the number of pair partitions $\pi$ that contribute to a given exponent $\#(\gamma\circ\pi)-\ell-1$ appearing in the summands of our bounds. Altogether, this allows us to write

[TABLE]

where $\varepsilon_{g}(\ell)=\#(\pi\in\mathcal{P}_{2\ell}:\#(\gamma\circ\pi)-\ell-1=-2g)$ . Naturally, this calculation recovers the semicircle law. To see this, we simply take the normalized limit

[TABLE]

however, without this normalization, the ratio $\frac{N}{\xi_{N}^{2}}$ arises in (8) as the leading order term. In particular, if $b_{N}\gg\sqrt{N}$ , then $\frac{N}{\xi_{N}^{2}}=o(1)$ and

[TABLE]

In this case, the infinitesimal distribution of a periodically banded GUE matrix is null, which matches the calculation for the usual GUE. On the other hand, if $b_{N}\ll\sqrt{N}$ , then the lower bound in (8) implies that

[TABLE]

Finally, in the intermediate regime $\lim_{N\to\infty}\frac{b_{N}}{\sqrt{N}}=c\in(0,\infty)$ , we see that

[TABLE]

where $\varepsilon_{1}(0)=\varepsilon_{1}(1)=0$ and

[TABLE]

the equality (resp., asymptotic) following from the three-term recurrence of Harer and Zagier [HZ86] (resp., Stirling’s formula).

Of course, one expects to be able to say more than just (9), namely, that a limit exists in the intermediate regime (and hopefully with some nice formula or interpretation). This amounts to calculating

[TABLE]

for $\pi\in\mathcal{P}_{2}(2\ell)$ such that $g(\pi)=1$ ; however, the value of this limit crucially depends on the particular geometry of $\pi$ . For example, consider the pair partitions

[TABLE]

Then $\#(\gamma\circ\pi_{1})=\#(\gamma\circ\pi_{2})=4$ , and so $g(\pi_{1})=g(\pi_{2})=1$ . Going through the graph construction, we see that $\underline{C}_{10}^{\pi_{1}}$ is a tree, which means that $Q(5,N,b_{N},\pi_{1})=N\xi_{N}^{3}$ attains the upper bound. On the other hand, $\underline{C}_{10}^{\pi_{2}}$ is the undirected cycle graph $\underline{C}_{4}$ , which means that $Q(5,N,b_{N},\pi_{2})<tN\xi_{N}^{3}$ for some $t\in(0,1)$ .

In fact, the graph $\underline{C}_{2\ell}^{\pi}=(\underline{V}_{2\ell}^{\pi},\underline{E}_{2\ell}^{\pi})$ contains precisely the information we need to compute the limit. To see this, note that we can rewrite (5) as

[TABLE]

Indeed, the vertices of $\underline{C}_{2\ell}^{\pi}$ correspond to the cycles of $\gamma\circ\pi$ by construction and the edges function precisely to keep track of the band width constraint. This suggests computing the limit of $Q(\ell,N,b_{N},\pi)$ as an integral over the hypercube $[0,1]^{\#(V)}$ after scaling by $N^{\#(V)}$ (for example, as in [Au18]); however, in this case, the band width $b_{N}\asymp\sqrt{N}\ll N$ and $\#(V)=\ell-1$ (recall that $g(\pi)=1$ ). So, the integral interpretation must take into account the vanishing scale of the mesh size $b_{N}$ and the difference in the scaling exponent $\#(V)\neq\ell$ . To accomplish this, we use the fact that the periodic band width structure implies a certain homogeneity in our choice of admissible maps $\eta$ . In particular, fixing a vertex $v_{0}\in\underline{V}_{2\ell}^{\pi}$ , we see that

[TABLE]

So, we consider the equivalent expression

[TABLE]

where

[TABLE]

This reduces the problem to computing

[TABLE]

which we can interpret using (11). After fixing the label $\eta(v_{0})=1$ , we must choose the labels of the remaining $\ell-2$ vertices according to the band width constraint. The exponent of the normalization $b_{N}^{\ell-2}$ now matches the remaining degrees of freedom, and the base matches the maximum step size $|\eta(v)-\eta(w)|_{N}\leq b_{N}$ .

The remaining issue concerns the region of integration. After the normalization, the step size $|\eta(v)-\eta(w)|_{N}\leq b_{N}$ becomes a single unit length. To ensure that the integral captures the full range of possibilities, we must choose a hypercube of appropriate side length. If $\eta(v_{0})=1$ , then the image $\eta(V)$ will necessarily be disjoint from $[1+(\ell-2)b_{N},N-(\ell-2)b_{N}]$ . So, the hypercube $[0,2(\ell-2)]^{\ell-2}$ will ensure that the region of integration is sufficiently large.

We can now give the integral representation for our limit. For $\ell\geq 2$ , we define an integral $I_{\ell}^{\pi}$ associated to the graph $\underline{C}_{2\ell}^{\pi}=(\underline{V}_{2\ell}^{\pi},\underline{E}_{2\ell}^{\pi})$ as follows. Pick an arbitrary vertex $v_{0}\in\underline{V}_{2\ell}^{\pi}$ , and let $E_{0}\subset\underline{E}_{2\ell}^{\pi}$ be the set of edges adjacent to $v_{0}$ . We write $E_{1}=\underline{E}_{2\ell}^{\pi}\setminus E_{0}$ for the remaining edges. By construction, the integral

[TABLE]

then satisfies

[TABLE]

where

[TABLE]

For $\ell=2$ , we set $[0,0]^{0}=\{0\}$ by convention, in which case the integral reduces to $I_{2}^{\pi}=1$ for the only genus one partition $\pi=(1,3)(2,4)\in\mathcal{P}_{2}(4)$ . As an example, consider the case of $\ell=4$ , $\pi=(1,5)(2,8)(3,7)(4,6)$ , and $b_{N}=\sqrt{N}$ . Then

[TABLE]

One can easily verify that this agrees with the direct calculation

[TABLE]

Thus, for $\lim_{N\to\infty}\frac{b_{N}}{\sqrt{N}}=c\in(0,\infty)$ , we see that

[TABLE]

Altogether, this proves Theorem 1.17.

We use our earlier bound (9) and the asymptotic (10) for $\varepsilon_{1}(\ell)$ to see that

[TABLE]

At the same time,

[TABLE]

where $\pi_{2\ell}=(1,3)(2,4)(5,6)(7,8)\cdots(2\ell-1,2\ell)$ . Note that $\pi_{2\ell}$ is “one-crossing”. In particular, $g(\pi)=1$ since $\gamma\circ\pi=(1,4,3,2,5,7,9,\ldots,2\ell-1)(6)(8)\cdots(2\ell)$ . Moreover, the corresponding graph $\underline{C}_{2\ell}^{\pi_{2\ell}}$ is the star graph with $\ell-1$ vertices. Since $\underline{C}_{2\ell}^{\pi_{2\ell}}$ is a tree, we know that $I_{\pi_{2\ell}}=2^{\ell-2}$ , whence

[TABLE]

which proves (3). The hypothetical signed measure $\nu_{c}$ associated to the infinitesimal distribution in the intermediate regime $\lim_{N\to\infty}\frac{b_{N}}{\sqrt{N}}=c\in(0,\infty)$ then satisfies

[TABLE]

In fact, we expect that $\lim_{\ell\to\infty}[m_{2\ell}(1,c)]^{\frac{1}{2\ell}}=2$ , but we do not prove this here.

*Remark 3.1**.*

Naturally, one can ask about the joint infinitesimal distribution of independent periodically banded GUE matrices $(\mathbf{\Xi}_{N}^{(i)})_{i\in I}$ . To answer this question, we partition the index set $I=I_{1}\sqcup I_{2}$ according to the rates $b_{N}^{(i)}\to\infty$ , where

[TABLE]

Repeating our banded genus expansion for a mixed trace in $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{1}}$ shows that the joint infinitesimal distribution of $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{1}}$ is null, which implies that the $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{1}}$ are asymptotically infinitesimally free. Similarly, any mixed trace in the two families $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{1}}$ and $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{2}}$ vanishes in the limit, which implies that $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{1}}$ and $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{2}}$ are asymptotically infinitesimally free as well.

On the other hand, the same calculation shows that the $(\mathbf{\Xi}_{N}^{(i)})_{i\in I_{2}}$ are not asymptotically infinitesimally free. For example, if $b_{N}^{(1)},b_{N}^{(2)}=\sqrt{N}$ , then

[TABLE]

whereas asymptotic infinitesimal freeness would insist that this limit be zero. Note that our integral interpretation still holds and prescribes a rule for computing the infinitesimal distribution in this case. To account for the possibly different ratios $\lim_{N\to\infty}\frac{b_{N}^{(i)}}{\sqrt{N}}=c_{i}$ , we must adjust both the integrand and the region of integration via a straightforward combination of the ideas above and [Au18, $\S$ 4.3]. We leave the details to the interested reader.

Note that the infinitesimal calculation is very specific (GUE and periodically banded) and cannot be extended to regular band matrices

[TABLE]

In particular, let $\mathbf{\Xi}_{N}$ now denote the banded GUE matrix constructed with $\mathbf{B}_{N}$ as above. For $1\ll b_{N}\ll N$ , we know that $\mu(\mathbf{\Xi}_{N})$ still converges weakly almost surely to the semicircle distribution [BMP91]. A simple calculation shows that

[TABLE]

In this case,

[TABLE]

and so an infinitesimal distribution does not exist for any such band width.

3.2. Finite-rank perturbations

We now consider the multi-matrix model

[TABLE]

Most of our calculations in this section remain valid in a more general setting. In particular, we extend our definition of

[TABLE]

to independent unnormalized Wigner matrices $(\mathbf{X}_{N}^{(i)})_{i\in I}$ of the form

[TABLE]

where the band widths $(b_{N}^{(i)})_{i\in I}$ satisfy

[TABLE]

Dykema proved that the family $\mathcal{W}_{N}=(\frac{1}{\sqrt{N}}\mathbf{X}_{N}^{(i)})_{i\in I}$ converges in distribution to a semicircular system [Dyk93]. We generalized this result to the family $\mathcal{Z}_{N}$ in [Au18] (recall that if $b_{N}^{(i)}\geq\lfloor N/2\rfloor$ , then $\mathbf{\Xi}_{N}^{(i)}=\frac{1}{\sqrt{N}}\mathbf{X}_{N}^{(i)}$ ). For concreteness, we write $(\mathbb{C}\langle\mathbf{x}\rangle,\tau_{\mathcal{Z}})$ for this limiting distribution, where

[TABLE]

The results in [Au18] as well as the remainder of this section make use of the traffic probability framework [Mal], which we briefly review.

Definition 3.2 (Traffic probability).

By a multidigraph $G=(V,E,\operatorname{src},\operatorname{tar})$ , we mean a non-empty set of vertices $V$ , a set of edges $E$ , and a pair of functions $\operatorname{src},\operatorname{tar}:E\to V$ specifying the source and target of each edge. A test graph $T=(G,\gamma)$ is a finite multidigraph $G$ with edge labels $\gamma:E\to I$ . For a partition $\pi\in\mathcal{P}(V)$ , we define $T^{\pi}=(G^{\pi},\gamma^{\pi})$ as the test graph obtained from $T$ by identifying the vertices of $T$ according to blocks of $\pi$ . Formally, we construct $G^{\pi}=(V^{\pi},E^{\pi},\operatorname{src}^{\pi},\operatorname{tar}^{\pi})$ as

(i)

$V^{\pi}=V/\mathord{\sim}_{\pi}$ and $E^{\pi}=E$ ; 2. (ii)

$\operatorname{src}^{\pi}(e)=[\operatorname{src}(e)]_{\sim_{\pi}}$ and $\operatorname{tar}^{\pi}(e)=[\operatorname{tar}(e)]_{\sim_{\pi}}$ ; 3. (iii)

$\gamma^{\pi}=\gamma$ .

Since $E^{\pi}=E$ , we often omit the superscript and use the same notation for the edge set of the quotient $T^{\pi}$ . We write $\mathcal{T}\langle I\rangle$ for the set of all test graphs in $I$ and $\mathbb{C}\mathcal{T}\langle I\rangle$ for the complex vector space spanned by $\mathcal{T}\langle I\rangle$ .

We define the traffic state $\tau_{N}:\mathbb{C}\mathcal{T}\langle I\rangle\to\mathbb{C}$ as the unique linear functional

[TABLE]

where $c(T)$ denotes the number of connected components of $T$ . For convenience, we abbreviate $(\phi(\operatorname{tar}(e)),\phi(\operatorname{src}(e)))$ as $(\phi(e))$ . Similarly, we define the injective traffic state $\tau_{N}^{0}:\mathbb{C}\mathcal{T}\langle I\rangle\to\mathbb{C}$ as the unique linear functional

[TABLE]

Henceforth, we use the notation $\phi:V\hookrightarrow[N]$ to indicate an injective map. The functionals $\tau_{N}$ and $\tau_{N}^{0}$ satisfy the relations

[TABLE]

where $0_{V}$ denotes the singleton partition and Möb is the usual Möbius function on the poset of partitions.

Example 3.3.

Let $p\in\mathbb{C}\langle\mathbf{x}\rangle$ be a monomial $p=x_{i(1)}\cdots x_{i(d)}$ . Then

[TABLE]

where

[TABLE]

We can now prove the following generalization of Lemma 3.2 in [Shl18].

Lemma 3.4.

For any NC polynomials $p_{1},\ldots,p_{r}\in\mathbb{C}\langle\mathbf{x}\rangle$ ,

[TABLE]

where $j_{0}=j_{r}$ .

Proof.

Note that we can rewrite the desired trace as

[TABLE]

which reduces the problem to computing

[TABLE]

Furthermore, by linearity, it suffices to prove the result for monomials $p_{s}\in\mathbb{C}\langle\mathbf{x}\rangle$ . For concreteness, we write

[TABLE]

where $i_{s}:[d_{s}]\to I$ . To convert this to the traffic notation, let $T_{s}=(G_{s},\gamma_{s})$ be the test graph

[TABLE]

where $V_{s}=(v_{s,t-1})_{t\in[d_{s}+1]}$ , $E_{s}=(e_{s,t})_{t\in[d_{s}]}$ , $v_{s,t-1}\sim_{e_{s,t}}v_{s,t}$ , and $\gamma_{s}(e_{s,t})=i_{s}(t)$ . We define $T=(G,\gamma)=\sqcup_{s=1}^{r}T_{s}$ as the disjoint union of the $T_{s}$ , in which case

[TABLE]

where we recall that $E^{\pi}=E$ . Note that the inner sum on the previous line might be empty: for example, if $v_{s,0}\sim_{\pi}v_{s^{\prime},d_{s}^{\prime}}$ for some $k_{s}\neq j_{s^{\prime}}$ . Conversely, if $k_{s}=j_{s}^{\prime}$ for some $s,s^{\prime}\in[r]$ , then we must have $v_{s,0}\sim_{\pi}v_{s^{\prime},d_{s^{\prime}}}$ . Thus, taking into account the various indices, we can restrict the outer summation over $\mathcal{P}(V)$ to

[TABLE]

We now analyze the contribution from a quotient $T^{\pi}$ . First, we decompose $T^{\pi}$ into its connected components $T^{\pi}=T_{(1)}^{\pi}\sqcup\cdots\sqcup T_{(u)}^{\pi}$ , where the notation $T_{(n)}^{\pi}$ is meant to distinguish between the test graphs $T_{(n)}^{\pi}$ and $T_{s}$ . For an injective map $\phi:V^{\pi}\hookrightarrow[N]$ , the independence of our matrix entries implies that

[TABLE]

Let us then focus on a connected component $T_{(n)}^{\pi}=(G_{(n)}^{\pi},\gamma_{(n)}^{\pi})$ . We define $\mathcal{L}_{(n)}^{\pi}$ as the set of loop edges of $T_{(n)}^{\pi}$ , which divides $E_{(n)}^{\pi}=\mathcal{L}_{(n)}^{\pi}\sqcup\mathcal{N}_{(n)}^{\pi}$ . As before, we write $\underline{G}_{(n)}^{\pi}=(\underline{V}_{(n)}^{\pi},\underline{E}_{(n)}^{\pi})$ for the underlying simple graph. For an edge $e\in E_{(n)}^{\pi}$ , we define

[TABLE]

Naturally, we can think of $\underline{E}_{(n)}^{\pi}=\{[e]:e\in\mathcal{N}_{(n)}^{\pi}\}$ . By a slight abuse of notation, we also write $\underline{\mathcal{L}}_{(n)}^{\pi}=\{[l]:l\in\mathcal{L}_{(n)}^{\pi}\}$ . Separating the normalization

[TABLE]

we can again use the injectivity of $\phi$ to decompose the remaining expectation as

[TABLE]

The asymptotic follows from our strong moment assumption (12), which bounds the contribution from such a term uniformly in $\pi$ and $\phi$ , where $d=\sum_{s=1}^{r}d_{s}$ is the total degree of our monomials $p_{s}$ . Strictly speaking, the asymptotic depends on both $d$ and the finite set $I_{0}=\gamma(E)$ , but both are fixed independent of $N$ by our monomials $p_{s}$ . For convenience, we omit this last detail from the notation.

We would then like to bound the number of injective maps $\phi$ that actually contribute (i.e., the number of $\phi$ such that the term in (16) is non-zero). Note that since the off-diagonal entries of our matrices are centered, we can assume that

[TABLE]

otherwise, one of the factors in the product above vanishes. Of course, the graph $\underline{G}_{(n)}^{\pi}$ is still connected with the same vertex set as $G_{(n)}^{\pi}$ , whence

[TABLE]

We must also remember to include the band width constraint in our bound. In particular, a contributing map $\phi$ satisfies

[TABLE]

We introduce some notation for the set of admissible maps

[TABLE]

Similarly, we define

[TABLE]

Note that

[TABLE]

Consider a spanning tree $\underline{H}_{(n)}^{\pi}=(\underline{V}_{(n)}^{\pi},\underline{F}_{(n)}^{\pi})$ of $\underline{G}_{(n)}^{\pi}$ . We think of a spanning tree as recording a minimal working subset of the band width constraints. In particular, $\underline{H}_{(n)}^{\pi}$ bounds the number of contributing maps $\phi|_{V_{(n)}^{\pi}}\in A_{N,\pi}^{(n)}$ by

[TABLE]

To see this, pick an arbitrary initial vertex $[v_{0}]_{\sim\pi}$ of $\underline{H}_{(n)}^{\pi}$ . Clearly, we have $N$ options for $\phi([v_{0}]_{\sim_{\pi}})\in[N]$ at this stage. The bound then follow from walking through the rest of our graph while satisfying the band width constraints imposed by the edges $[e]\in\underline{F}_{(n)}^{\pi}$ . Note that this fails to account for the special vertices $[v_{s,0}]_{\sim\pi}$ and $[v_{s,d_{s}}]_{\sim_{\pi}}$ , which have fixed labels $\phi([v_{s,0}]_{\sim\pi})=k_{s}$ and $\phi([v_{s,d_{s}}]_{\sim\pi})=j_{s}$ respectively. In particular, each connected component has at least one such special vertex. Choosing this special vertex to be the initial vertex $[v_{0}]_{\sim\pi}$ removes the factor of $N$ in our earlier bound, and so

[TABLE]

where the asymptotic follows from (17). In view of (15)-(21), we conclude that

[TABLE]

Altogether, our analysis implies that the expectation survives the normalization, but only just barely. Indeed, in formulating the bound (21), we only considered one of the special vertices $[v_{s,0}]_{\sim_{\pi}},[v_{s,d_{s}}]_{\sim_{\pi}}$ despite the fact that each test graph $T_{s}$ has two such vertices $v_{s,0},v_{s,d_{s}}$ before the identifications by $\pi\in\mathcal{P}_{+}(V)$ . Assume then that $k_{s}\neq j_{s}$ for some $s\in[r]$ . In this case, $[v_{s,0}]_{\sim_{\pi}}\neq[v_{s,d_{s}}]_{\sim_{\pi}}$ are distinct vertices in some connected component $T_{(n)}^{\pi}$ . As a result, we lose an additional degree of freedom when choosing a contributing map $\phi|_{V_{(n)}^{\pi}}\in A_{N,\pi}^{(n)}$ . To see this, we return to our spanning tree $\underline{H}_{(n)}^{\pi}$ . We denote the last edge on the unique path from $[v_{s,0}]_{\sim_{\pi}}$ to $[v_{s,d_{s}}]_{\sim_{\pi}}$ in $\underline{H}_{(n)}^{\pi}$ by $[e_{*}]$ . Running through the same argument as before with $[v_{s,0}]_{\sim_{\pi}}$ as the initial vertex now gives the improved bound

[TABLE]

where the asymptotic follows from (13). Putting this back in to (22) proves that

[TABLE]

Let us now assume that $k_{s}=j_{s}$ for every $s\in[r]$ . A partition $\pi\in\mathcal{P}_{+}(V)$ then necessarily identifies $v_{s,0}\sim_{\pi}v_{s,d_{s}}$ for every $s\in[r]$ . Furthermore, if $k_{s}=k_{s^{\prime}}$ for some $s,s^{\prime}\in[r]$ , then $\pi$ must also identify $v_{s,0}\sim_{\pi}v_{s^{\prime},0}$ . We imagine making these identifications first before carrying out the rest of the identifications prescribed by $\pi$ . At the first step, this corresponds to identifying the ends of the test graph (14), creating a directed cycle $C_{s}$ with a special vertex $[v_{s,0}]_{\sim_{\pi}}=[v_{s,d_{s}}]_{\sim_{\pi}}$ that we can think of as a root. We then identify the roots of different cycles $C_{s},C_{s^{\prime}}$ if $k_{s}=k_{s^{\prime}}$ . It will be convenient to redefine (14) to account for this first step beforehand, namely

[TABLE]

Now, suppose that $\pi\in\mathcal{P}_{+}(V)$ identifies vertices across different cycles:

[TABLE]

If $k_{s}\neq k_{s^{\prime}}$ , then we claim that

[TABLE]

To see this, let $T_{(n_{*})}^{\pi}$ denote the connected component of $T^{\pi}$ that contains the vertex $[v_{s,d_{s}}]_{\sim\pi}$ . By assumption, $T_{(n_{*})}^{\pi}$ also contains the vertex $[v_{s^{\prime},d_{s^{\prime}}}]_{\sim\pi}\neq[v_{s,d_{s}}]_{\sim\pi}$ . Our earlier work shows that $\#(A_{N,\pi}^{(n_{*})})$ satisfies the asymptotic (23) since the component $T_{(n_{*})}^{\pi}$ has two special vertices to account for, which proves (25).

If $k_{s}=k_{s^{\prime}}$ , then we claim that (25) holds if $v_{s,t}\not\sim_{\pi}v_{s^{\prime},d_{s^{\prime}}}$ . To see this, let $T_{(n_{*})}^{\pi}$ be as before. If $v_{s,t}\not\sim_{\pi}v_{s^{\prime},d_{s^{\prime}}}$ , then $[v_{s,t}]_{\sim_{\pi}}\neq[v_{s^{\prime},d_{s^{\prime}}}]_{\sim_{\pi}}$ are distinct vertices in $T_{(n_{*})}^{\pi}$ . In particular, there are four edge-disjoint paths from $[v_{s,t}]_{\sim_{\pi}}$ to $[v_{s^{\prime},d_{s^{\prime}}}]_{\sim_{\pi}}$ . Indeed, there are two edge-disjoint paths from $[v_{s,t}]_{\sim_{\pi}}$ to $[v_{s,d_{s}}]_{\sim_{\pi}}$ using only the edges of $T_{s}$ , and there are two edge-disjoint paths from $[v_{s^{\prime},t^{\prime}}]_{\sim_{\pi}}$ to $[v_{s^{\prime},d_{s^{\prime}}}]_{\sim_{\pi}}$ using only the edges of $T_{s^{\prime}}$ . Thus, any spanning tree $\underline{H}_{(n_{*})}^{\pi}$ of $T_{(n_{*})}^{\pi}$ will necessarily omit (at least) one of the total edges from these paths. In view of (17) and (13), we conclude that

[TABLE]

which again proves (25).

Thus, we are left to consider partitions

[TABLE]

where

[TABLE]

Note that $\mathcal{P}_{++}(V)$ factorizes into partitions of the test graphs (24) via the bijection

[TABLE]

where $\pi=\coprod_{s=1}^{r}\pi_{s}$ is the partition obtained from $(\pi_{1},\ldots,\pi_{r})$ by first taking the disjoint union of the blocks of the $\pi_{s}$ and then identifying the vertices $v_{s,d_{s}}\sim_{\pi}v_{s^{\prime},d_{s^{\prime}}}$ that satisfy $k_{s}=k_{s^{\prime}}$ . Of course, the resulting quotient test graph $T^{\pi}$ might have fewer than $r$ connected components; however, the defining property (26) of $\mathcal{P}_{++}(V)$ implies that $T^{\pi}$ can be obtained as follows: first, let $(\pi_{1},\ldots,\pi_{r})$ be the factorization of $\pi$ as above. Next, apply the partitions $\pi_{s}$ to obtain the quotient test graphs $T_{s}^{\pi_{s}}=(V_{s}^{\pi_{s}},E_{s}^{\pi_{s}})$ . Finally, in the disjoint union of the $T_{s}^{\pi_{s}}$ , identify the vertices $[v_{s,d_{s}}]_{\sim_{\pi_{s}}}$ and $[v_{s^{\prime},d_{s^{\prime}}}]_{\sim_{\pi_{s^{\prime}}}}$ if $k_{s}=k_{s^{\prime}}$ . The injectivity of the maps $\phi\in A_{N,\pi}$ then implies that

[TABLE]

We would also like to factorize the set

[TABLE]

where

[TABLE]

however, in general, this map is not bijective since $\#(A_{N,\coprod_{s=1}^{r}\pi_{s}})<\prod_{s=1}^{r}\#(B_{N,\pi_{s}})$ for $r\geq 2$ . Nevertheless, we do have the asymptotic equality

[TABLE]

The contributions from the additional terms counted by the maps in $\times_{s=1}^{r}B_{N,\pi_{s}}$ can still be bounded uniformly via (16). In view of (27), this implies that such overcounting will not affect our calculations in the limit. In other words,

[TABLE]

So, we will be done if we can prove that

[TABLE]

The main result in [Au18] implies that

[TABLE]

where a colored double tree is a double tree whose twin edges $[e]=\{e,e^{\prime}\}$ each have the same color $\gamma(e)=\gamma(e^{\prime})$ and

[TABLE]

In short, this follows from (17), (18), and the spanning tree argument. Similarly, we can strict the outer sum in (28) to the same class of partitions $\pi_{s}$ . Note that for large $N$ , the periodicity of the band width condition implies that

[TABLE]

We use the fact that a quotient of a directed cycle is a double tree only if each of its twin edges $[e]=\{e,e^{\prime}\}$ go in opposite directions $\operatorname{src}(e)=\operatorname{tar}(e^{\prime})$ and $\operatorname{src}(e^{\prime})=\operatorname{tar}(e)$ [Au18, Figure 5]. In that case,

[TABLE]

since the calculations in the expectation only involve variances (as opposed to pseudo-variances). This homogeneity allows us to conclude that averaging over the labels $\phi_{s}([v_{s,d_{s}}]_{\sim_{\pi_{s}}})\in[N]$ does not affect the calculation. Consequently,

[TABLE]

as was to be shown. ∎

Assuming an infinitesimal distribution for the family $\mathcal{Z}_{N}$ , Lemma 3.4 proves that $\mathcal{Z}_{N}$ and $\mathcal{E}_{N}$ are asymptotically infinitesimally free. Indeed, this follows from a straightforward application of the following criteria for infinitesimal freeness.

Proposition 3.5 ([Shl18]).

Let $(\mathcal{A},\varphi,\varphi^{\prime})$ be a tracial infinitesimal NC probability space. Suppose that $\mathcal{Z}$ and $\mathcal{E}$ are subalgebras of $\mathcal{A}$ such that $\mathcal{E}\subset\ker(\varphi)$ (in particular, $\mathcal{E}$ is non-unital). Then $\mathcal{Z}$ and $\mathcal{E}$ are infinitesimally free iff for any $r$ -tuples $(E_{s})_{s=1}^{r}\subset\mathcal{E}$ and $(Z_{s})_{s=1}^{r}\subset\accentset{\circ}{\mathcal{Z}}$ , we have the identities

(i)

$\varphi(E_{1}Z_{1}E_{2}Z_{2}\cdots E_{r}Z_{r})=0$ ; 2. (ii)

$\varphi^{\prime}(E_{1}Z_{1}E_{2}Z_{2}\cdots E_{r}Z_{r})=0$ .

For example, this proves a preliminary version of Theorem 1.13 (resp., Theorem 1.18) restricted to the matrices $\mathbf{W}_{N}$ (resp., $\mathbf{\Xi}_{N}$ ) and $(\mathbf{E}_{N}^{(j,k)})_{1\leq j,k\leq N_{0}}$ . We now extend the calculation to include the matrix $\mathbf{K}_{N}=\frac{1}{N}\mathbf{J}_{N}$ . For this, we will need the following lemma concerning the formation of double trees as quotients of paths.

Lemma 3.6.

Let $G_{n}=(V_{n},E_{n})$ be a path graph of length $n$ , where

[TABLE]

If $\pi\in\mathcal{P}(V_{n})$ is such that $G_{n}^{\pi}$ is a double tree, then $v_{0}\sim_{\pi}v_{n}$ .

Proof.

Since a double tree has an even number of edges, we only need to prove the result for even values of $n$ . We proceed by induction on the length of the path. If $n\in\{0,2\}$ , then the statement follows. So, assume the result is true for paths of length $n\leq 2m$ , and consider $G_{2m+2}$ . If $G_{2m+2}^{\pi}$ is a double tree, then it must identify $v_{0}$ with another vertex $v_{i}\in V_{2m+2}$ for some $i\in[2m+2]$ . Indeed, this follows from the fact that the degree of every vertex in a double tree is even. The edges $e_{1},\ldots,e_{i}$ then form a trail in $G_{2m+2}^{\pi}$ starting and ending at the same vertex $v_{0}\sim_{\pi}v_{i}$ . This implies that the subgraph $H$ spanned by these edges is also a double tree. Since the remaining edges $e_{i+1},\ldots,e_{n}$ span a connected subgraph $K$ of $G_{2m+2}^{\pi}$ , the fact that $H$ is a double tree implies that $K$ is a double tree as well. We can then apply the induction hypothesis to conclude that $v_{i}\sim_{\pi}v_{2m+2}$ . ∎

We use this to prove the analogue of Lemma 3.4 for $\mathbf{K}_{N}$ .

Lemma 3.7.

For any NC polynomials $p_{1},\ldots,p_{r}\in\mathbb{C}\langle\mathbf{x}\rangle$ ,

[TABLE]

Proof.

We carry forward the notation from the proof of Lemma 3.4. In particular, restricting to monomials $p_{s}$ , we write $T_{s}$ for the test graphs (14); $T$ for their disjoint union; and $(T_{(n)}^{\pi})_{n=1}^{u}$ for the connected components of a quotient $T^{\pi}$ . We redefine the set of admissible maps since we no longer have special vertices with fixed labels to account for, namely

[TABLE]

We still have the bounds (19) and (20), which imply the following analogue of (22):

[TABLE]

Thus, we can restrict to partitions $\pi\in\mathcal{P}(V)$ such that $T^{\pi}$ has exactly $r$ connected components. Of course, since $T$ already has $r$ connected components, this means that we are simply considering the disjoint union of partitions $\pi_{s}\in\mathcal{P}(V_{s})$ for $s\in[r]$ . As before, even though $\#(A_{N,\pi})\leq\prod_{s=1}^{r}\#(\mathcal{B}_{N,\pi_{s}})$ , the fact that

[TABLE]

allows us to factor

[TABLE]

So, we will be done if we can prove that

[TABLE]

but this follows from Lemma 3.6 and Example 3.3 (recall that [Au18] allows us to restrict to $\pi_{s}\in\mathcal{P}(V_{s})$ such that $T_{s}^{\pi_{s}}$ is a colored double tree). ∎

As in the case of the matrix units, assuming an infinitesimal distribution for $\mathcal{Z}_{N}$ , Lemma 3.7 proves that $\mathcal{Z}_{N}$ and $\mathbf{K}_{N}$ are asymptotically infinitesimally free. To complete the proof of Theorems 1.13 and 1.18, we turn our attention to the non-unital algebra $\mathcal{F}_{N}$ generated by $\mathcal{E}_{N}$ and $\mathbf{K}_{N}$ .

Lemma 3.8.

The algebra $\mathcal{F}_{N}$ is spanned by elements of the form

(i)

$\prod_{s=1}^{t}(\mathbf{E}_{N}^{(j_{s-1},k_{s})}\mathbf{K}_{N})=\frac{1}{N^{t-1}}\mathbf{E}_{N}^{(j_{0},j_{0})}\mathbf{K}_{N}$ , where $t\geq 1$ ; 2. (ii)

$\prod_{s=1}^{t}(\mathbf{K}_{N}\mathbf{E}_{N}^{(j_{s-1},k_{s})})=\frac{1}{N^{t-1}}\mathbf{K}_{N}\mathbf{E}_{N}^{(k_{t},k_{t})}$ , where $t\geq 1$ ; 3. (iii)

$\mathopen{}\big{[}\prod_{s=1}^{t}(\mathbf{E}_{N}^{(j_{s-1},k_{s})}\mathbf{K}_{N})\big{]}\mathclose{}\mathbf{E}_{N}^{(j_{t},k_{t+1})}=\frac{1}{N^{t}}\mathbf{E}_{N}^{(j_{0},k_{t+1})}$ , where $t\geq 0$ ; 4. (iv)

$\mathopen{}\big{[}\prod_{s=1}^{t}(\mathbf{K}_{N}\mathbf{E}_{N}^{(j_{s-1},k_{s})})\big{]}\mathclose{}\mathbf{K}_{N}=\frac{1}{N^{t}}\mathbf{K}_{N}$ , where $t\geq 0$ .

Proof.

The result follows from a simple computation using the fact that $\mathbf{K}_{N}$ is idempotent and the identity $\mathbf{E}_{N}^{(j_{0},k_{1})}\mathbf{E}_{N}^{(j_{1},k_{2})}=\mathbbm{1}\{k_{1}=j_{1}\}\mathbf{E}_{N}^{(j_{0},k_{2})}$ . ∎

Corollary 3.9.

$\mathcal{E}_{N}$ * and $\mathbf{K}_{N}$ are asymptotically infinitesimally free.*

Proof.

The characterization of $\mathcal{F}_{N}$ in Lemma 3.8 implies that

[TABLE]

Once again, a straightforward application of Proposition 3.5 proves the result. ∎

We adopt the notation $\mathbf{E}_{N}^{(0,0)}=\mathbf{K}_{N}$ to characterize the type $B$ distribution of $\mathcal{Z}_{N}\cup\mathcal{F}_{N}$ . The following result implies that the only non-trivial values of $(\mu_{\mathcal{Z}\cup\mathcal{F}},\nu_{\mathcal{Z}\cup\mathcal{F}})$ have already been computed in Lemmas 3.4 and 3.7.

Lemma 3.10.

For any NC monomials $q_{1},\ldots,q_{r}\in\mathbb{C}\langle\mathbf{y}\rangle$ and $p_{1},\ldots,p_{r}\in\mathbb{C}\langle\mathbf{x}\rangle$ ,

[TABLE]

Otherwise, $\deg(q_{s})=1$ for each $s\in[r]$ , in which case $q_{s}(\mathcal{F}_{N})\in\mathcal{E}_{N}\cup\{\mathbf{E}_{N}^{(0,0)}\}$ and

[TABLE]

where $j_{0}=j_{r}$ . In particular, since the index [math] only comes in pairs $(j_{s-1},k_{s})=(0,0)$ , the limit vanishes if there exist $s,s^{\prime}\in[r]$ such that $\mathbf{E}_{N}^{(j_{s-1},k_{s})}=\mathbf{E}_{N}^{(0,0)}$ and $\mathbf{E}_{N}^{(j_{s^{\prime}-1},k_{s^{\prime}})}\in\mathcal{E}_{N}$ .

Proof.

The proof of (30) will follow from our analysis of (31), which we prove first. By our earlier work, we need only to consider the case of $s,s^{\prime}\in[r]$ such that $\mathbf{E}_{N}^{(j_{s-1},k_{s})}=\mathbf{E}_{N}^{(0,0)}$ and $\mathbf{E}_{N}^{(j_{s^{\prime}-1},k_{s^{\prime}})}\in\mathcal{E}_{N}$ . Moreover, the cyclic invariance of the trace allows us to assume that this occurs precisely at the values

[TABLE]

We think of each occurrence of $\mathbf{E}_{N}^{(j_{s-1},k_{s})}=\mathbf{E}_{N}^{(0,0)}$ as providing its $\frac{1}{N}$ normalization to the test graph $T_{s}$ associated to $p_{s}(\mathcal{Z}_{N})$ . Similarly, each occurrence of a matrix unit $\mathbf{E}_{N}^{(j_{s-1},k_{s})}\in\mathcal{E}_{N}$ creates a special vertex in each of the test graphs $T_{s-1}$ and $T_{s}$ .

To adapt our earlier work, we define

[TABLE]

where $r_{1}+r_{2}=r$ . Similarly, we redefine

[TABLE]

Note that each connected component $T_{(n)}^{\pi}$ of $T^{\pi}$ satisfies at least one of the following conditions:

(i)

$T_{(n)}^{\pi}$ has at least one special vertex with a fixed label, in which case we can apply (21); 2. (ii)

$T_{(n)}^{\pi}$ contains the edges of a test graph $T_{s}$ that has been assigned the normalization $\frac{1}{N}$ of its adjacent term $\mathbf{E}_{N}^{(j_{s-1},k_{s})}=\mathbf{E}_{N}^{(0,0)}$ , in which case we can apply (20),

where the number $u_{2}$ of connected components of type (ii) satisfies $u_{2}\leq r_{2}$ . In particular, the connected component $T_{(n_{*})}^{\pi}$ that contains the edges of $T_{1}$ will satisfy both of these conditions. Indeed, this follows from (32). Rearranging, we can count the connected components of type (ii) first, namely, $T_{(1)}^{\pi},\ldots,T_{(u_{2})}^{\pi}$ with $u_{2}=n_{*}$ . The analogue of (22) and (29) in this case then follows:

[TABLE]

which proves (31).

To prove (30), we use the characterization of a monomial $q_{s}(\mathcal{F}_{N})$ given in Lemma 3.8. In particular, we imagine replacing the terms in the trace

[TABLE]

according to the following scheme:

(a)

if $q_{s}(\mathcal{F}_{N})$ is of the form (i) or (ii), then we replace $q_{s}(\mathcal{F}_{N})$ with $\mathbf{E}_{N}^{(0,0)}$ ; 2. (b)

if $q_{s}(\mathcal{F}_{N})$ is of the form (iii), then we replace $q_{s}(\mathcal{F}_{N})$ with the corresponding matrix unit without the factor of $\frac{1}{N^{t}}$ ; 3. (c)

if $q_{s}(\mathcal{F}_{N})$ is of the form (iv), then we replace $q_{s}(\mathcal{F}_{N})$ with $\mathbf{E}_{N}^{(0,0)}$ without the factor of $\frac{1}{N^{t}}$ .

After this procedure, our work above shows that the resulting trace satisfies

[TABLE]

however, based on our analysis of $\#(A_{N,\pi})$ , the original trace then necessarily satisfies

[TABLE]

Indeed, consider the following interpretation of the replacement scheme. If the original term is of type (i) (resp., type (ii)), then it creates a special vertex in the test graph $T_{s-1}$ (resp., $T_{s}$ ) and contributes a factor of $\frac{1}{N^{t}}$ to the test graph $T_{s}$ , where $t=\frac{\deg(q_{s})}{2}$ . In contrast, its replacement $\mathbf{E}_{N}^{(0,0)}$ only contributes a factor of $\frac{1}{N}$ to $T_{s}$ . Similarly, if the original term is of type (iii) or type (iv), then its replacement simply drops the factor of $\frac{1}{N^{t}}$ , where $t=\frac{\deg(q_{s})-1}{2}$ . In any case, since $\sum_{s=1}^{r}\deg(q_{s})>r$ , we know that a replacement of type (a), (b), or (c) occurs with $t\geq 1$ . Our work in establishing (33) then proves (34). The result now follows. ∎

Corollary 3.11.

Assume that the family $\mathcal{Z}_{N}$ has an infinitesimal distribution. Then the matrices $\mathcal{Z}_{N}$ , $\mathcal{E}_{N}$ , and $\mathbf{K}_{N}$ are asymptotically infinitesimally free.

Proof.

Under the assumption for $\mathcal{Z}_{N}$ , we already know that each pair of the families $\mathcal{Z}_{N}$ , $\mathcal{E}_{N}$ , and $\mathbf{K}_{N}$ are asymptotically infinitesimally free by Lemmas 3.4 and 3.7 and Corollary 3.9. So, we will be done if we can prove that $\mathcal{Z}_{N}$ and $\mathcal{F}_{N}$ are asymptotically infinitesimally free. Again, this follows from applying the criteria in Proposition 3.5 to Lemma 3.10. ∎

This completes the proof of Theorems 1.13 and 1.18. The type $B$ free convolution calculations in Corollaries 1.14 and 1.19 essentially already appear in [Shl18, §4.1.1], so we do not repeat them.

Acknowledgements

The author thanks Paul Bourgade, James Mingo, and Dimitri Shlyakhtenko for helpful conversations. The figures in this article were produced in Inkscape.

Bibliography56

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[ACD + ] Benson Au, Guillaume Cébron, Antoine Dahlqvist, Franck Gabriel, and Camille Male, Large permutation invariant random matrices are asymptotically free over the diagonal , Preprint. https://arxiv.org/abs/1805.07045 v 1 .
2[AGZ 10] Greg W. Anderson, Alice Guionnet, and Ofer Zeitouni, An introduction to random matrices , Cambridge Studies in Advanced Mathematics, vol. 118, Cambridge University Press, Cambridge, 2010. MR 2760897 (2011 m:60016)
3[Au 18] Benson Au, Traffic distributions of random band matrices , Electron. J. Probab. 23 (2018), paper no. 77, 48 pp. MR 3858905 · doi ↗
4[BBAP 05] Jinho Baik, Gérard Ben Arous, and Sandrine Péché, Phase transition of the largest eigenvalue for nonnull complex sample covariance matrices , Ann. Probab. 33 (2005), no. 5, 1643–1697. MR 2165575 · doi ↗
5[BBCF 17] Serban T. Belinschi, Hari Bercovici, Mireille Capitaine, and Maxime Février, Outliers in the spectrum of large deformed unitarily invariant models , Ann. Probab. 45 (2017), no. 6A, 3571–3625. MR 3729610 · doi ↗
6[BEKS 17] Jeff Bezanson, Alan Edelman, Stefan Karpinski, and Viral B. Shah, Julia: a fresh approach to numerical computing , SIAM Rev. 59 (2017), no. 1, 65–98. MR 3605826 · doi ↗
7[BGN 03] Philippe Biane, Frederick Goodman, and Alexandru Nica, Non-crossing cumulants of type B , Trans. Amer. Math. Soc. 355 (2003), no. 6, 2263–2303. MR 1973990 · doi ↗
8[BGN 11] Florent Benaych-Georges and Raj Rao Nadakuditi, The eigenvalues and eigenvectors of finite, low rank perturbations of large random matrices , Adv. Math. 227 (2011), no. 1, 494–521. MR 2782201 · doi ↗

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Finite-rank perturbations of random band matrices via infinitesimal free probability

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

Contents

1. Introduction

1.1. Motivation

Theorem 1.1** ([Voi93, Bia98]).**

Theorem 1.2** (BBP transition).**

1.2. Background

Definition 1.3** (Free probability).**

Remark 1.4*.*

Example 1.5** (Random matrices).**

Definition 1.6** (Infinitesimal free probability).**

Remark 1.7*.*

Example 1.8** (Random matrices, revisited).**

Theorem 1.9** ([BS12]).**

Theorem 1.10** ([Shl18]).**

Corollary 1.11** ([Shl18]).**

Theorem 1.12** ([EM16]).**

1.3. Statement of results

Theorem 1.13**.**

Corollary 1.14**.**

Remark 1.15*.*

Definition 1.16** (Random band matrix).**

Theorem 1.17**.**

Theorem 1.18**.**

Corollary 1.19**.**

Remark 1.20*.*

2. Numerical simulations

3. The infinitesimal distribution of a random band matrix

3.1. A band variant of the genus expansion

Remark 3.1*.*

3.2. Finite-rank perturbations

Definition 3.2** (Traffic probability).**

Example 3.3**.**

Lemma 3.4**.**

Proof.

Proposition 3.5** ([Shl18]).**

Lemma 3.6**.**

Proof.

Lemma 3.7**.**

Proof.

Lemma 3.8**.**

Proof.

Corollary 3.9**.**

Proof.

Lemma 3.10**.**

Proof.

Corollary 3.11**.**

Proof.

Acknowledgements

Theorem 1.1 ([Voi93, Bia98]).

Theorem 1.2 (BBP transition).

Definition 1.3 (Free probability).

*Remark 1.4**.*

Example 1.5 (Random matrices).

Definition 1.6 (Infinitesimal free probability).

*Remark 1.7**.*

Example 1.8 (Random matrices, revisited).

Theorem 1.9 ([BS12]).

Theorem 1.10 ([Shl18]).

Corollary 1.11 ([Shl18]).

Theorem 1.12 ([EM16]).

Theorem 1.13.

Corollary 1.14.

*Remark 1.15**.*

Definition 1.16 (Random band matrix).

Theorem 1.17.

Theorem 1.18.

Corollary 1.19.

*Remark 1.20**.*

*Remark 3.1**.*

Definition 3.2 (Traffic probability).

Example 3.3.

Lemma 3.4.

Proposition 3.5 ([Shl18]).

Lemma 3.6.

Lemma 3.7.

Lemma 3.8.

Corollary 3.9.

Lemma 3.10.

Corollary 3.11.