Quantum Ergodicity on Graphs : from Spectral to Spatial Delocalization

Nalini Anantharaman (IRMA); Mostafa Sabri (IRMA)

arXiv:1704.02766·math.SP·March 6, 2019

Quantum Ergodicity on Graphs : from Spectral to Spatial Delocalization

Nalini Anantharaman (IRMA), Mostafa Sabri (IRMA)

PDF

TL;DR

This paper establishes a quantum-ergodicity theorem for eigenfunctions on large graphs with Schr"odinger operators, linking spectral properties of the infinite model to spatial delocalization in finite approximations.

Contribution

It proves quantum ergodicity for graphs with a local weak limit, connecting spectral properties of the infinite model to eigenfunction delocalization on finite graphs.

Findings

01

Eigenfunctions become equidistributed in phase space.

02

Absolutely continuous spectrum implies spatial delocalization.

03

Results apply to graphs converging to the Anderson model on a regular tree.

Abstract

We prove a quantum-ergodicity theorem on large graphs, for eigenfunctions of Schr\"odinger operators in a very general setting. We consider a sequence of finite graphs endowed with discrete Schr\"odinger operators, assumed to have a local weak limit. We assume that our graphs have few short loops, in other words that the limit model is a random rooted tree endowed with a random discrete Schr\"odinger operator. We show that absolutely continuous spectrum for the infinite model, reinforced by a good control of the moments of the Green function, imply "quantum ergodicity", a form of spatial delocalization for eigenfunctions of the finite graphs approximating the tree. This roughly says that the eigenfunctions become equidistributed in phase space. Our result applies in particular to graphs converging to the Anderson model on a regular tree, in the r\'egime of extended states studied by…

Equations652

(A_{N} f) (v) = w \sim v \sum f (w),

(A_{N} f) (v) = w \sim v \sum f (w),

(P_{N} f) (x) = \frac{1}{d _{N} ( x )} y \sim x \sum f (y),

(P_{N} f) (x) = \frac{1}{d _{N} ( x )} y \sim x \sum f (y),

N \to \infty lim \frac{∣ { x \in V _{N} : ρ _{G_{N}} ( x ) < r } ∣}{N} = 0,

N \to \infty lim \frac{∣ { x \in V _{N} : ρ _{G_{N}} ( x ) < r } ∣}{N} = 0,

\frac{1}{N} j = 1 \sum N χ (λ_{j}^{(N)}) N ⟶ + \infty ⟶ E (⟨ δ_{o}, χ (H) δ_{o} ⟩) =: ρ (χ),

\frac{1}{N} j = 1 \sum N χ (λ_{j}^{(N)}) N ⟶ + \infty ⟶ E (⟨ δ_{o}, χ (H) δ_{o} ⟩) =: ρ (χ),

E (f) = \int_{T_{*}^{D, A}} f ([T, o, W]) d P ([T, o, W]) .

E (f) = \int_{T_{*}^{D, A}} f ([T, o, W]) d P ([T, o, W]) .

G^{γ} (x, y) = ⟨ δ_{x}, (H - γ)^{- 1} δ_{y} ⟩_{ℓ^{2} (T)} .

G^{γ} (x, y) = ⟨ δ_{x}, (H - γ)^{- 1} δ_{y} ⟩_{ℓ^{2} (T)} .

λ \in I_{1}, η_{0} \in (0, 1) sup E (y : y \sim o \sum ∣ Im \hat{ζ}_{o}^{λ + i η_{0}} (y) ∣^{- s}) < \infty .

λ \in I_{1}, η_{0} \in (0, 1) sup E (y : y \sim o \sum ∣ Im \hat{ζ}_{o}^{λ + i η_{0}} (y) ∣^{- s}) < \infty .

μ_{o} (J) = ⟨ δ_{o}, χ_{J} (H) δ_{o} ⟩ for Borel J \subseteq R .

μ_{o} (J) = ⟨ δ_{o}, χ_{J} (H) δ_{o} ⟩ for Borel J \subseteq R .

\tilde{g}_{N}^{γ} (x, y) = ⟨ δ_{x}, (H_{N} - γ)^{- 1} δ_{y} ⟩_{ℓ^{2} (V_{N})} .

\tilde{g}_{N}^{γ} (x, y) = ⟨ δ_{x}, (H_{N} - γ)^{- 1} δ_{y} ⟩_{ℓ^{2} (V_{N})} .

η_{0} ↓ 0 lim N \to + \infty lim \frac{1}{N} λ_{j}^{(N)} \in I \sum x \in V_{N} \sum a (x) ∣ ψ_{j}^{(N)} (x) ∣^{2} - ⟨ a ⟩_{λ_{j}^{(N)} + i η_{0}} = 0 .

η_{0} ↓ 0 lim N \to + \infty lim \frac{1}{N} λ_{j}^{(N)} \in I \sum x \in V_{N} \sum a (x) ∣ ψ_{j}^{(N)} (x) ∣^{2} - ⟨ a ⟩_{λ_{j}^{(N)} + i η_{0}} = 0 .

\frac{1}{N} # {λ_{j}^{(N)} \in I : x \in V_{N} \sum a (x) ∣ ψ_{j}^{(N)} (x) ∣^{2} - ⟨ a ⟩_{λ_{j}^{(N)} + i η_{0}} > ϵ} N \to + \infty, η_{0} ↓ 0 ⟶ 0 .

\frac{1}{N} # {λ_{j}^{(N)} \in I : x \in V_{N} \sum a (x) ∣ ψ_{j}^{(N)} (x) ∣^{2} - ⟨ a ⟩_{λ_{j}^{(N)} + i η_{0}} > ϵ} N \to + \infty, η_{0} ↓ 0 ⟶ 0 .

⟨ K ⟩_{γ} = \tilde{x} \in D_{N}, \tilde{y} \in V_{N} \sum K (\tilde{x}, \tilde{y}) Φ_{γ}^{N} (\tilde{x}, \tilde{y}) where Φ_{γ}^{N} (\tilde{x}, \tilde{y}) = \frac{Im g ~ _{N}^{γ} ( x ~ , y ~ )}{\sum _{\tilde{x} \in D_{N}} Im g ~ _{N}^{γ} ( x ~ , x ~ )} .

⟨ K ⟩_{γ} = \tilde{x} \in D_{N}, \tilde{y} \in V_{N} \sum K (\tilde{x}, \tilde{y}) Φ_{γ}^{N} (\tilde{x}, \tilde{y}) where Φ_{γ}^{N} (\tilde{x}, \tilde{y}) = \frac{Im g ~ _{N}^{γ} ( x ~ , y ~ )}{\sum _{\tilde{x} \in D_{N}} Im g ~ _{N}^{γ} ( x ~ , x ~ )} .

η_{0} ↓ 0 lim N \to + \infty lim \frac{1}{N} λ_{j}^{(N)} \in I \sum ⟨ ψ_{j}^{(N)}, K ψ_{j}^{(N)} ⟩_{ℓ^{2} (V_{N})} - ⟨ K ⟩_{λ_{j}^{(N)} + i η_{0}} = 0 .

η_{0} ↓ 0 lim N \to + \infty lim \frac{1}{N} λ_{j}^{(N)} \in I \sum ⟨ ψ_{j}^{(N)}, K ψ_{j}^{(N)} ⟩_{ℓ^{2} (V_{N})} - ⟨ K ⟩_{λ_{j}^{(N)} + i η_{0}} = 0 .

K (\tilde{x}, \tilde{y}) = K (x, y) 1 l_{d i s t_{G_{N}} (\tilde{x}, \tilde{y}) \leq R}

K (\tilde{x}, \tilde{y}) = K (x, y) 1 l_{d i s t_{G_{N}} (\tilde{x}, \tilde{y}) \leq R}

η_{0} \in (0, 1) in f N ⟶ \infty lim inf λ \in I_{1} in f ⟨ 1 l_{Λ_{N}} ⟩_{λ + i η_{0}} \geq 2 c_{α} .

η_{0} \in (0, 1) in f N ⟶ \infty lim inf λ \in I_{1} in f ⟨ 1 l_{Λ_{N}} ⟩_{λ + i η_{0}} \geq 2 c_{α} .

\frac{1}{N} # {λ_{j}^{(N)} \in I : 1 l_{Λ_{N}} ψ_{j}^{(N)}^{2} < c_{α}} N ⟶ + \infty ⟶ 0 .

\frac{1}{N} # {λ_{j}^{(N)} \in I : 1 l_{Λ_{N}} ψ_{j}^{(N)}^{2} < c_{α}} N ⟶ + \infty ⟶ 0 .

\frac{1}{N} x \in V_{N} \sum Im \tilde{g}_{N}^{λ + i η_{0}} (x, x) N ⟶ + \infty ⟶ E (Im G^{λ + i η_{0}} (o, o))

\frac{1}{N} x \in V_{N} \sum Im \tilde{g}_{N}^{λ + i η_{0}} (x, x) N ⟶ + \infty ⟶ E (Im G^{λ + i η_{0}} (o, o))

\frac{1}{N} x \in V_{N} \sum y, d (y, x) = k \sum F (N Φ_{λ + i η_{0}}^{N} (\tilde{x}, \tilde{y})) N ⟶ + \infty ⟶ E v, d (v, o) = k \sum F (\frac{Im G ^{λ + i η_{0}} ( o , v )}{E ( Im G ^{λ + i η_{0}} ( o , o ) )}) .

\frac{1}{N} x \in V_{N} \sum y, d (y, x) = k \sum F (N Φ_{λ + i η_{0}}^{N} (\tilde{x}, \tilde{y})) N ⟶ + \infty ⟶ E v, d (v, o) = k \sum F (\frac{Im G ^{λ + i η_{0}} ( o , v )}{E ( Im G ^{λ + i η_{0}} ( o , o ) )}) .

\int_{G_{*}^{D, ϵ A}} f ([G, v, W]) d P_{ϵ} ([G, v, W]) = \int_{Ω} f ([T_{q}, o, W^{ω}]) d P_{ϵ} (ω) = E_{ϵ} [f ([T_{q}, o, W^{ω}])] .

\int_{G_{*}^{D, ϵ A}} f ([G, v, W]) d P_{ϵ} ([G, v, W]) = \int_{Ω} f ([T_{q}, o, W^{ω}]) d P_{ϵ} (ω) = E_{ϵ} [f ([T_{q}, o, W^{ω}])] .

\lim_{\eta_{0}\downarrow 0}\lim_{N\to\infty}\frac{1}{N}\sum_{\lambda_{i}^{\omega_{N}}\in(-\lambda_{0},\lambda_{0})}\big{|}\langle\psi_{i}^{\omega_{N}},K_{N}\psi_{i}^{\omega_{N}}\rangle-\langle K_{N}\rangle_{\lambda_{i}^{\omega_{N}}}^{\eta_{0}}\big{|}=0\,,

\lim_{\eta_{0}\downarrow 0}\lim_{N\to\infty}\frac{1}{N}\sum_{\lambda_{i}^{\omega_{N}}\in(-\lambda_{0},\lambda_{0})}\big{|}\langle\psi_{i}^{\omega_{N}},K_{N}\psi_{i}^{\omega_{N}}\rangle-\langle K_{N}\rangle_{\lambda_{i}^{\omega_{N}}}^{\eta_{0}}\big{|}=0\,,

⟨ K ⟩_{λ}^{η_{0}} = x, y \in V_{N} \sum K (\tilde{x}, \tilde{y}) Φ_{γ} (\tilde{x}, \tilde{y}) and Φ_{γ} (\tilde{x}, \tilde{y}) = \frac{1}{N} \cdot \frac{E _{ϵ} [ Im G ^{γ} ( x ~ , y ~ )]}{E _{ϵ} [ Im G ^{γ} ( o , o )]} .

⟨ K ⟩_{λ}^{η_{0}} = x, y \in V_{N} \sum K (\tilde{x}, \tilde{y}) Φ_{γ} (\tilde{x}, \tilde{y}) and Φ_{γ} (\tilde{x}, \tilde{y}) = \frac{1}{N} \cdot \frac{E _{ϵ} [ Im G ^{γ} ( x ~ , y ~ )]}{E _{ϵ} [ Im G ^{γ} ( o , o )]} .

K_{G} (x, y) = γ \in Γ \sum K (\tilde{x}; γ \cdot \tilde{y}),

K_{G} (x, y) = γ \in Γ \sum K (\tilde{x}; γ \cdot \tilde{y}),

(B f) (x_{0}, x_{1}) = x_{2} \in N_{x_{1}} ∖ {x_{0}} \sum f (x_{1}, x_{2})

(B f) (x_{0}, x_{1}) = x_{2} \in N_{x_{1}} ∖ {x_{0}} \sum f (x_{1}, x_{2})

⟨ δ_{b_{1}}, K_{B} δ_{b_{2}} ⟩_{ℓ^{2} (B)} = {K (o_{b_{1}}; t_{b_{2}}) 0 if B^{k - 1} (b_{1}, b_{2}) \neq = 0, otherwise.

⟨ δ_{b_{1}}, K_{B} δ_{b_{2}} ⟩_{ℓ^{2} (B)} = {K (o_{b_{1}}; t_{b_{2}}) 0 if B^{k - 1} (b_{1}, b_{2}) \neq = 0, otherwise.

K_{B} (b_{1}, b_{2}) = γ \in Γ \sum K (\tilde{b}_{1}; γ \cdot \tilde{b}_{2}),

K_{B} (b_{1}, b_{2}) = γ \in Γ \sum K (\tilde{b}_{1}; γ \cdot \tilde{b}_{2}),

⟨ f, K_{B} g ⟩_{ℓ^{2} (B)} = (x_{0}, \dots, x_{k}) \in B_{k} \sum \overline{f (x_{0}, x_{1})} K (x_{0}; x_{k}) g (x_{k - 1}, x_{k}),

⟨ f, K_{B} g ⟩_{ℓ^{2} (B)} = (x_{0}, \dots, x_{k}) \in B_{k} \sum \overline{f (x_{0}, x_{1})} K (x_{0}; x_{k}) g (x_{k - 1}, x_{k}),

\|K_{B}f\|^{2}_{\ell^{2}(B)}=\sum_{(x_{0},x_{1})\in B}\Big{|}\sum_{{}_{x_{0,1}}(x_{2};x_{k})}K(x_{0};x_{k})f(x_{k-1},x_{k})\Big{|}^{2}\,,

\|K_{B}f\|^{2}_{\ell^{2}(B)}=\sum_{(x_{0},x_{1})\in B}\Big{|}\sum_{{}_{x_{0,1}}(x_{2};x_{k})}K(x_{0};x_{k})f(x_{k-1},x_{k})\Big{|}^{2}\,,

G (v, w; γ) = ⟨ δ_{v}, (H - γ)^{- 1} δ_{w} ⟩_{ℓ^{2} (V (T))} .

G (v, w; γ) = ⟨ δ_{v}, (H - γ)^{- 1} δ_{w} ⟩_{ℓ^{2} (V (T))} .

\frac{- 1}{2 m _{v}^{γ}} = G (v, v; γ) and ζ_{w}^{γ} (v) = - \tilde{g}^{(v ∣ w)} (v, v; γ) .

\frac{- 1}{2 m _{v}^{γ}} = G (v, v; γ) and ζ_{w}^{γ} (v) = - \tilde{g}^{(v ∣ w)} (v, v; γ) .

γ

γ

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Quantum Ergodicity on Graphs : from Spectral to Spatial Delocalization

Nalini Anantharaman and Mostafa Sabri

Université de Strasbourg, CNRS, IRMA UMR 7501, F-67000 Strasbourg, France.

[email protected]

Department of Mathematics, Faculty of Science, Cairo University, Cairo 12613, Egypt.

Université de Strasbourg, CNRS, IRMA UMR 7501, F-67000 Strasbourg, France.

[email protected]

Abstract.

We prove a quantum-ergodicity theorem on large graphs, for eigenfunctions of Schrödinger operators in a very general setting. We consider a sequence of finite graphs endowed with discrete Schrödinger operators, assumed to have a local weak limit. We assume that our graphs have few short loops, in other words that the limit model is a random rooted tree endowed with a random discrete Schrödinger operator. We show that absolutely continuous spectrum for the infinite model, reinforced by a good control of the moments of the Green function, imply “quantum ergodicity”, a form of spatial delocalization for eigenfunctions of the finite graphs approximating the tree. This roughly says that the eigenfunctions become equidistributed in phase space. Our result applies in particular to graphs converging to the Anderson model on a regular tree, in the régime of extended states studied by Klein and Aizenman–Warzel.

Key words and phrases:

Quantum ergodicity, large graphs, delocalization

2010 Mathematics Subject Classification:

Primary 58J51. Secondary 60B20, 81Q10.

1. Introduction

1.1. The problem

Consider a very large, but finite, graph $G=(V,E)$ . Are the eigenfunctions of its adjacency matrix localized, or delocalized ? These words are used in a variety of contexts, with several different meanings.

For discrete Schrödinger operators on infinite graphs (e.g. for the celebrated Anderson model describing the metal-insulator transition), localization can be understood in a spectral, spatial or dynamical sense. Given an interval $I\subset\mathbb{R}$ , one can consider

•

spectral localization : pure point spectrum in $I$ ,

•

exponential localization : the corresponding eigenfunctions decay exponentially,

•

dynamical localization : an initial state with energy in $I$ which is localized in a bounded domain essentially stays in this domain as time goes on.

On the opposite, delocalization may be understood at different levels :

•

spectral delocalization : purely absolutely continuous spectrum in $I$ ,

•

ballistic transport : wave packets with energies in $I$ spread on the lattice at a specific (ideally, linear) rate as time goes on.

In this paper we want to discuss a notion of spatial delocalization. Since the wavefunctions corresponding to absolutely continuous spectrum are not square summable, a natural interpretation of spatial delocalization is to consider a sequence of growing “boxes” or finite graphs $(G_{N})$ approximating the infinite system in some sense, and ask if the eigenfunctions on $(G_{N})$ become delocalized as $N\to\infty$ . Can they concentrate on small regions, or, on the opposite, are they uniformly distributed over $(G_{N})$ ? Large, finite graphs are also a subject of interest on their own. Actually, an infinite system is often an idealized version of a large finite one.

Localization/delocalization of eigenfunctions is believed to bear some relation with spectral statistics : localization is supposedly associated with Poissonian spectral statistics, whereas delocalization should be associated with Random Matrix statistics (GOE/GUE). In the field of quantum chaos, the former notion is often associated with integrable dynamics and the latter with chaotic dynamics [18, 19, 20]. However, specific examples show that the relation is not so straightforward [40, 41, 35] Understanding how far one can push these ideas is one amongst many reasons for studying models of large graphs [32, 42, 43].

Recently, the question of delocalization of eigenfunctions of large matrices or large graphs has been a subject of intense activity. Let us mention several ways of testing delocalization that have been used. Let $M_{N}$ be a large symmetric matrix of size $N\times N$ , and let $(\psi_{j})_{j=1}^{N}$ be an orthonormal basis of eigenfunctions. The eigenfunction $\psi_{j}$ defines a probability measure $\sum_{x=1}^{N}|\psi_{j}(x)|^{2}\delta_{x}$ . The goal is to compare this probability measure with the uniform measure, which puts mass $1/N$ on each point.

•

$\ell^{\infty}$ norms : Can we have a pointwise upper bound on $|\psi_{j}(x)|$ , in other words, is $\|\psi_{j}\|_{\infty}$ small, and how small compared with $1/{\sqrt{N}}$ ?

•

$\ell^{p}$ norms: Can we compare $\|\psi_{j}\|_{p}$ with $N^{1/p-1/2}$ ? In [2], a state $\psi_{j}$ is called non-ergodic (and multi-fractal) if $\|\psi_{j}\|_{p}$ behaves like $N^{f(p)}$ with $f(p)\not=1/p-1/2$ . Related criteria appear in [5].

•

*Scarring : * Can we have full concentration ( $\sum_{x\in\Lambda}|\psi_{j}(x)|^{2}\geq 1-\epsilon$ ) or partial concentration ( $\sum_{x\in\Lambda}|\psi_{j}(x)|^{2}\geq\epsilon$ ) with $\Lambda$ a set of “small” cardinality ? We borrow the term “scarring” from the term used in the theory of quantum chaos [40].

•

Quantum ergodicity : Given a function $a:\{1,\ldots,N\}\longrightarrow\mathbb{C}$ , can we compare $\sum_{x}a(x)|\psi_{j}(x)|^{2}$ with $\frac{1}{N}\sum_{x}a(x)$ ? This criterion, borrowed again from quantum chaos, was applied to discrete regular graphs in [9, 7]. Quantum ergodicity means that the two averages are close for most $j$ . If they are close for all $j$ , one speaks of quantum unique ergodicity.

As was demonstrated in a recent series of papers, adding some randomness may allow to settle the problem completely. For instance almost sure optimal $\ell^{\infty}$ -bounds and quantum unique ergodicity for various models of random matrices and random graphs, such as Wigner matrices, sparse Erdös-Rényi graphs, random regular graphs of slowly increasing or bounded degrees were obtained in [29, 30, 22, 28, 13, 14, 15]. The invariance of the probability distribution under certain elementary transformations plays an important role. The completely different point of view that we adopt is to consider deterministic graphs and to prove delocalization as resulting directly from the geometry of the graphs. Up to now, in this deterministic setting, only eigenfunctions of the adjacency matrix of regular graphs have been treated, taking advantage of the completely explicit Fourier analysis on regular trees. The papers [9, 24, 7] give various proofs of quantum ergodicity; the paper [23] proves the absence of scarring on sets of cardinality $N^{1-\epsilon}$ and also contains (although not stated) a logarithmic upper bound on the $\ell^{\infty}$ norms.

The aim of this paper is to prove a quantum ergodicity theorem for eigenfunctions of discrete Schrödinger operators on quite general large graphs. As we will see, a particularly interesting point of our result is that it gives a direct relation between spectral delocalization of infinite systems and spatial delocalization of large finite system. Our result may be summarized as follows (with proper additional assumptions to be described later) :

“If a large finite system is close (in the Benjamini-Schramm topology) to an infinite system having purely absolutely continuous spectrum in an interval $I$ , then the eigenfunctions (with eigenvalues lying in $I$ ) of the finite system satisfy quantum ergodicity.”

1.2. The results

Consider a sequence of connected graphs without self-loops and multiple edges $(G_{N})_{N\in\mathbb{N}}$ . We assume each vertex has at least $3$ neighbours. It will be convenient to write $G_{N}$ as a quotient of a tree ${\widetilde{G_{N}}}$ by a group of automorphisms $\Gamma_{N}$ , that is, $G_{N}=\Gamma_{N}\backslash{\widetilde{G_{N}}}$ , where $\Gamma_{N}$ acts freely on the vertices of ${\widetilde{G_{N}}}$ , i.e. given $v\in{\widetilde{G_{N}}}$ , $\gamma_{1}v=\gamma_{2}v$ implies $\gamma_{1}=\gamma_{2}$ . In other words, ${\widetilde{G_{N}}}$ is the “universal cover” of $G_{N}$ . We will work under the assumption that the degree of ${\widetilde{G_{N}}}$ is everywhere smaller than some fixed $D$ .

We denote by $\widetilde{V_{N}}$ and $\widetilde{E_{N}}$ the set of vertices and edges of ${\widetilde{G_{N}}}$ , respectively. We denote by $V_{N}$ and $E_{N}$ the vertices and edges of $G_{N}$ , respectively. We assume $|V_{N}|=N$ and work in the limit $N\longrightarrow\infty$ .

Define the adjacency operator $\widetilde{\mathcal{A}}_{N}:\mathbb{C}^{{\widetilde{G_{N}}}}\to\mathbb{C}^{{\widetilde{G_{N}}}}$ by

[TABLE]

where $v\sim w$ means $v$ and $w$ are nearest neighbours. The operator $\widetilde{\mathcal{A}}_{N}$ is bounded on $\ell^{2}({\widetilde{G_{N}}})$ . It also preserves the space of $\Gamma_{N}$ -invariant functions on $\widetilde{V_{N}}$ , in other words it defines an operator on $\ell^{2}(V_{N})$ , that we denote by $\mathcal{A}_{N}$ (we will drop the index $N$ and write $\widetilde{\mathcal{A}},\mathcal{A}$ when no confusion may arise). Consider a bounded function $\widetilde{W_{N}}:\widetilde{V_{N}}\longrightarrow\mathbb{R}$ such that ${\widetilde{W_{N}}}(\gamma\cdot v)={\widetilde{W_{N}}}(v)$ for all $\gamma\in\Gamma_{N}$ . The operator of multiplication by ${\widetilde{W_{N}}}$ is bounded on $\ell^{2}({\widetilde{G_{N}}})$ ; it also preserves the space of $\Gamma_{N}$ -invariant functions on $\widetilde{V_{N}}$ , thus it defines an operator on $\ell^{2}(V_{N})$ , that we denote by $W_{N}$ . We define the discrete Schrödinger operators $\widetilde{H}_{N}=\widetilde{\mathcal{A}}_{N}+\widetilde{W_{N}}$ and $H_{N}=\mathcal{A}_{N}+W_{N}$ . The central object of our study are the eigenfunctions of $H_{N}$ , and their behaviour (localized/delocalized) as $N\longrightarrow+\infty$ . The fact that $\Gamma_{N}$ acts freely implies that $H_{N}$ is symmetric (self-adjoint) on $\ell^{2}(V_{N})$ .

For comfort, we will always work under the assumption that $W_{N}$ takes values in some fixed interval $[-A,A]$ . This implies that the spectrum of all operators we will encounter is contained in some fixed interval $I_{0}=[-A-D,A+D]$ .

We define the Laplacian $P_{N}:\mathbb{C}^{V_{N}}\to\mathbb{C}^{V_{N}}$ by

[TABLE]

where $d_{N}(x)$ stands for the number of neighbours of $x$ . If we introduce the positive measure on $V_{N}$ assigning to $x$ the weight $d_{N}(x)$ , then $P_{N}$ is self-adjoint on $\ell^{2}(V_{N},d_{N})$ .

We shall assume the following conditions on our sequence of graphs:

(EXP) The sequence $(G_{N})$ forms an expander family. By this we mean that the Laplacian $P_{N}$ has a uniform spectral gap in $\ell^{2}(V_{N},d_{N})$ . More precisely, the eigenvalue $1$ of $P_{N}$ is simple, and the spectrum of $P_{N}$ is contained in $[-1+\beta,1-\beta]\cup\{1\}$ , where $\beta>0$ is independent of $N$ .

Note that $1$ is always an eigenvalue, corresponding to constant functions. Our assumption implies in particular that each $G_{N}$ is connected and non-bipartite. It is well-known that a uniform spectral gap for $P_{N}$ is equivalent to a Cheeger constant bounded away from [math] (see for instance [26], §3).

Our second assumption is that $(G_{N})$ has few short loops:

(BST) For all $r>0$ ,

[TABLE]

where $\rho_{G_{N}}(x)$ is the injectivity radius at $x$ , i.e. the largest $\rho$ such that the ball $B_{G_{N}}(x,\rho)$ is a tree.

The general theory of Benjamini-Schramm convergence (or local weak convergence), briefly recalled in Appendix A, allows us to assign a limit object to the sequence $(G_{N},W_{N})$ , which is a probability distribution carried on trees. More precisely, up to passing to a subsequence, assumption (BST) above is equivalent to the following assumption.

(BSCT) The sequence $(G_{N},W_{N})$ has a local weak limit $\mathbb{P}$ which is concentrated on the set of (isomorphism classes of) coloured rooted trees, denoted $\mathscr{T}_{\ast}^{D,A}$ .

Assumption (BSCT) says that $(G_{N},W_{N})$ converges in a distributional sense to a random system of rooted trees $\{[\mathcal{T},o]\}$ , endowed with a map $\mathcal{W}:\mathcal{T}\longrightarrow\mathbb{R}$ . More precisely, the empirical measure of $(G_{N},W_{N})$ , defined by choosing a root $x\in V_{N}$ uniformly at random, converges weakly to a probability measure $\operatorname{\mathbb{P}}$ concentrated on trees.

If $[\mathcal{T},o,\mathcal{W}]\in\mathscr{T}_{\ast}^{D,A}$ and $\mathcal{A}$ is the adjacency matrix of $\mathcal{T}$ , we denote by $\mathcal{H}=\mathcal{A}+\mathcal{W}$ the limiting random Schrödinger operator, which is self-adjoint on $\ell^{2}(\mathcal{T})$ .

Call $(\lambda^{(N)}_{j})_{j=1}^{N}$ the eigenvalues of $H_{N}$ on $\ell^{2}(V_{N})$ . Assumption (BSCT) implies the convergence of the empirical law of eigenvalues : for any continuous $\chi:\mathbb{R}\longrightarrow\mathbb{R}$ , we have

[TABLE]

see Remark A.3. Here $\operatorname{\mathbb{E}}$ is the expectation with respect to $\operatorname{\mathbb{P}}$ , that is,

[TABLE]

The measure $\rho$ is called the integrated density of states in the theory of random Schrödinger operators.

We need some notation for our last assumption. Let $[\mathcal{T},o,\mathcal{W}]\in\mathscr{T}_{\ast}^{D,A}$ . Given $x,y\in\mathcal{T}$ , and $\gamma\in\mathbb{C}\setminus\mathbb{R}$ , we introduce the Green function

[TABLE]

Given $v,w\in\mathcal{T}$ with $v\sim w$ , we denote by ${\mathcal{T}}^{(v|w)}$ the tree obtained by removing from the tree ${\mathcal{T}}$ the branch emanating from $v$ that passes through $w$ . We define the restriction $\mathcal{H}^{(v|w)}(u,u^{\prime})=\mathcal{H}(u,u^{\prime})$ if $u,u^{\prime}\in\mathcal{T}^{(v|w)}$ and zero otherwise. The corresponding Green function is denoted by $\mathcal{G}^{(v|w)}(\cdot,\cdot;\gamma)$ . We then put $\hat{\zeta}_{w}^{\gamma}(v):=-\mathcal{G}^{(v|w)}(v,v;\gamma)$ .

(Green) There is a non-empty open set $I_{1}$ , such that for all $s>0$ we have

[TABLE]

To understand (Green), define the (rooted) spectral measure of $[\mathcal{T},o,\mathcal{W}]\in\mathscr{T}_{\ast}^{D,A}$ by

[TABLE]

Assumption (Green) implies that $\sup_{\lambda\in I_{1},\eta_{0}>0}\operatorname{\mathbb{E}}(|\mathcal{G}^{\gamma}(o,o)|^{2})<\infty$ ; see Remark A.4. As shown in [33], this implies that for $\operatorname{\mathbb{P}}$ -a.e. $[\mathcal{T},o,\mathcal{W}]\in\mathscr{T}_{\ast}^{D,A}$ , the spectral measure $\mu_{o}$ is absolutely continuous in $I_{1}$ , with density $\frac{1}{\pi}\operatorname{Im}\mathcal{G}^{\lambda+i0}(o,o)$ . Hence, (Green) implies that $\operatorname{\mathbb{P}}$ -a.e. operator $\mathcal{H}$ has purely absolutely continuous spectrum in $I_{1}$ . This is a natural assumption since our aim is to prove delocalization properties of eigenfunctions.

Now let $(\psi^{(N)}_{j})_{j=1}^{N}$ be an orthonormal basis of $\ell^{2}(V_{N})$ consisting of eigenfunctions of $H_{N}$ . Pick $j\in\{1,\dots,N\}$ . The problem of quantum ergodicity is to understand if the probability measure $\sum_{x\in V_{N}}|\psi^{(N)}_{j}(x)|^{2}\delta_{x}$ on $V_{N}$ is “localized” (essentially carried by $o(N)$ vertices) or “delocalized” (ideally, close to the uniform measure on $V_{N}$ , or maybe, to some other natural measure on $V_{N}$ , comparable to the uniform measure). More generally, we want to know if the correlations $\overline{\psi^{(N)}_{j}(x)}\psi^{(N)}_{j}(y)$ , for $x$ and $y\in V_{N}$ at some fixed distance, approach some limiting object. From a mathematical point of view, the question was addressed in [9, 24] for eigenfunctions of the adjacency matrix of large deterministic regular graphs, and for the adjacency matrix of random regular graphs or Erdös-Rényi graphs in the recent works [28, 13, 14, 15]. The main motivation of our paper is to extend the results of [9] to disordered systems, that is, to non-regular graphs, possibly with a potential on the vertices or weights on the edges. This necessarily requires a different method from that of [9], that was specific to regular graphs. New methods to prove quantum ergodicity were already explored in [7]. We insist on the fact that, contrary to [28, 13, 14, 15, 31], our sequence of graphs and potentials are deterministic. The results may in particular be applied to random graphs and/or random potentials, provided one knows that Assumptions (EXP), (BSCT) and (Green) hold true for some realizations. We discuss the relation with existing work more extensively in Section 1.5.

Let us state the main abstract result; its concrete meaning will be explored afterwards. For $x,y\in{\widetilde{V}}_{N}$ , and $\gamma\in\mathbb{C}\setminus\mathbb{R}$ , we introduce the lifted Green function

[TABLE]

Recall that we write $G_{N}$ as a quotient $\Gamma_{N}\backslash{\widetilde{G}}_{N}$ where ${\widetilde{G}}_{N}$ is a tree. We denote by $\mathcal{D}_{N}$ a fundamental domain of the action of $\Gamma_{N}$ on the vertices of ${\widetilde{G}}_{N}$ . Thus $\mathcal{D}_{N}$ contains $N$ vertices of ${\widetilde{G}}_{N}$ , each of them projecting to a distinct vertex of $G_{N}$ .

Let $I_{1}$ be the open set of Assumption (Green), and let us fix an interval $I$ (or finite union of intervals) such that $\bar{I}\subset I_{1}$ .

Theorem 1.1.

Assume that the graphs $G_{N}$ and the potentials $W_{N}$ satisfy (BSCT), (EXP) and (Green).

Call $(\lambda^{(N)}_{j})_{j=1}^{N}$ the eigenvalues of the Schrödinger operator $H_{N}$ on $\ell^{2}(V_{N})$ , and let $(\psi^{(N)}_{j})_{j=1}^{N}$ be a corresponding orthonormal eigenbasis.

For each $N$ , let $a=a_{N}$ be a function on $V_{N}$ with $\sup_{N}\sup_{x\in V_{N}}|a_{N}(x)|\leq 1$ . For $\gamma\in\mathbb{C}\setminus\mathbb{R}$ , define $\langle a\rangle_{\gamma}=\sum_{x\in V_{N}}a(x)\Phi_{\gamma}^{N}(\tilde{x},\tilde{x})$ , where $\Phi_{\gamma}^{N}(\tilde{x},\tilde{x})=\frac{\operatorname{Im}{\tilde{g}^{\gamma}_{N}}(\tilde{x},\tilde{x})}{\sum_{\tilde{x}\in\mathcal{D}_{N}}\operatorname{Im}{\tilde{g}^{\gamma}_{N}}(\tilde{x},\tilde{x})}$ . Then

[TABLE]

Here, $\tilde{x}$ is a lift of $x\in V_{N}$ in the universal cover $\widetilde{V}_{N}$ .

Corollary 1.2.

Under the same assumptions, for any $\epsilon>0$ , we have

[TABLE]

More generally, we have the following result on eigenfunction correlators, which says that $\overline{\psi_{j}(x)}\psi_{j}(y)$ “approaches” the function $\Phi_{\lambda_{j}+i0}^{N}(\tilde{x},\tilde{y})$ defined in (1.5). For technical reasons we have to assume the $(\psi_{j})$ are real-valued. More precisely, we need $\overline{\psi_{j}(x)}\psi_{j}(y)$ to be real for any $j=1,\dots,N$ and $x,y\in V_{N}$ with $x\sim y$ .

Theorem 1.3.

Assume that $(G_{N},W_{N})$ satisfies (BSCT), (EXP) and (Green).

Call $(\lambda^{(N)}_{j})_{j=1}^{N}$ the eigenvalues of $H_{N}$ on $\ell^{2}(V_{N})$ , and let $(\psi^{(N)}_{j})_{j=1}^{N}$ be a corresponding orthonormal eigenbasis. Assume the $(\psi_{j})_{j=1}^{N}$ are real-valued.

Fix $R\in\mathbb{N}$ . For each $N$ , let ${\mathbf{K}}={\mathbf{K}}_{N}$ be an operator on $\ell^{2}(V_{N})$ whose kernel $K=K_{N}:V_{N}\times V_{N}\longrightarrow\mathbb{C}$ is such that $K(x,y)=0$ for $d(x,y)>R$ (in other words $K$ is supported at distance $\leq R$ from the diagonal). Assume that $\sup_{N}\sup_{x,y\in V_{N}}|K_{N}(x,y)|\leq 1.$

For $\gamma\in\mathbb{C}\setminus\mathbb{R}$ , define

[TABLE]

Then

[TABLE]

The “kernel” above is the matrix of ${\mathbf{K}}$ in the basis $(\delta_{x})$ , i.e. $K(x,y)=\langle\delta_{x},{\mathbf{K}}\delta_{y}\rangle_{\ell^{2}(V_{N})}$ . To define (1.5) properly, we lift $K$ to ${\widetilde{V}}_{N}\times{\widetilde{V}}_{N}$ by letting

[TABLE]

if $x,y\in V_{N}=\Gamma_{N}\backslash{\widetilde{V}}_{N}$ are the projections of $\tilde{x},\tilde{y}\in{\widetilde{V}}_{N}$ .

If we know in addition that $\rho(\partial I_{1})=0$ , where $\rho$ is the integrated density of states measure (1.2), then our main theorems hold with $I$ replaced by $I_{1}$ ; see the end of Section 10. Note that if (Green) holds on $\overline{I_{1}}$ , then $\rho(\partial I_{1})=0$ .

Although we tend to skip it from the notation, the “observables” $\mathbf{K}$ and $a$ necessarily depend on $N$ . On the other hand, they do not depend on $j$ , the index of the eigenfunction (they are actually allowed to depend on $\lambda^{(N)}_{j}$ in the proof, but this dependence cannot be wild, it has to be at least continuous). We interpret Corollary 1.2 as follows : for a given observable $a$ , the average $\sum_{x\in V_{N}}a(x)|\psi^{(N)}_{j}(x)|^{2}$ is close to $\langle a\rangle_{\lambda^{(N)}_{j}+i\eta_{0}}$ for most indices $j$ . It follows similarly from Theorem 1.3 that $\sum_{x,y\in V_{N}}K(x,y)\overline{\psi^{(N)}_{j}(x)}\psi^{(N)}_{j}(y)$ is close to $\langle{\mathbf{K}}\rangle_{\lambda^{(N)}_{j}+i\eta_{0}}$ for most $j$ . One of the subtleties of the result is that the indices $j$ for which this holds may a priori depend on the observables $a$ , $\mathbf{K}$ . If we wanted to have a common set of indices $j$ that do the job for all observables (whose number is exponential in $N$ ), we would need to have an exponential rate of convergence in Theorems 1.1 and 1.3. As is seen in the case of regular graphs and $W=0$ [7], our proof gives a rate that is at best a negative power of the girth, which is itself typically of order $\log N$ . So, the result is far from showing that $|\psi^{(N)}_{j}(x)|^{2}$ is close to the uniform measure in total variation.

Note the presence of the extra parameter $\eta_{0}$ , in comparison with the case of regular graphs [9, 7]. This is due to the fact that, generally speaking, the quantities $\langle a\rangle_{\lambda^{(N)}_{j}+i\eta_{0}}$ and $\langle{\mathbf{K}}\rangle_{\lambda^{(N)}_{j}+i\eta_{0}}$ are not necessarily bounded as $\eta_{0}\downarrow 0$ for fixed $N$ . They will however stay bounded in the limits $N\to+\infty$ followed by $\eta_{0}\downarrow 0$ (as a result of (A.14) and (Green)).

1.3. Understanding the weighted averages.

In order to clarify the relevance of Theorems 1.1 and 1.3, we now investigate the meaning of the quantities $\langle a\rangle_{\lambda+i\eta_{0}}$ and $\langle\mathbf{K}\rangle_{\lambda_{j}+i\eta_{0}}$ . Let us start with Theorem 1.1. A good illustration is to choose $a_{N}={\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}$ , the characteristic function of a set $\Lambda_{N}\subset V_{N}$ of size $\approx\alpha N$ for some $\alpha\in(0,1)$ , say $\alpha=\frac{1}{2}$ .

In the special case where $(G_{N})$ is regular and $H_{N}=\mathcal{A}_{N}$ , and also for the anisotropic model treated in [7], the Green function $\tilde{g}^{\gamma}_{N}(\tilde{x},\tilde{y})$ does not depend on $N$ , as it coincides with the limiting Green function $\mathcal{G}^{\gamma}(\tilde{x},\tilde{y})$ . Moreover, $\mathcal{G}^{\gamma}(\tilde{x},\tilde{x})=\mathcal{G}^{\gamma}(o,o)$ for all $\tilde{x}\in\mathcal{D}_{N}$ . It follows that $\langle{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}\rangle_{\lambda_{j}+i\eta_{0}}=\sum_{x\in\Lambda_{N}}\frac{\mathcal{G}^{\lambda_{j}+i\eta_{0}}(o,o)}{N\mathcal{G}^{\lambda_{j}+i\eta_{0}}(o,o)}=\alpha$ . So Corollary 1.2 implies that $\|{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}\psi_{j}^{(N)}\|^{2}\approx\alpha$ for most $\psi_{j}^{(N)}$ . This shows that most $\psi_{j}^{(N)}$ are uniformly distributed, in the sense that if we consider any $\Lambda_{N}\subset V_{N}$ containing half the vertices, we find half the mass of $\|\psi_{j}^{(N)}\|^{2}$ . As we show in the next subsection, such interpretation is also valid for the Anderson model.

For general models, we cannot assert that $\langle{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}\rangle_{\lambda+i\eta_{0}}=\alpha$ . Still, we prove in Section A.3 that there exists $c_{\alpha}>0$ such that for any $\Lambda_{N}\subset V_{N}$ with $|\Lambda_{N}|\geq\alpha N$ , we have

[TABLE]

Combined with Corollary 1.2, this implies

Corollary 1.4.

For any $\alpha\in(0,1)$ , there exists $c_{\alpha}>0$ such that for any $\Lambda_{N}\subset V_{N}$ with $|\Lambda_{N}|\geq\alpha N$ , we have

[TABLE]

Hence, while in the simple case we had $\|{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}\psi_{j}^{(N)}\|^{2}\approx\alpha$ for most $\psi_{j}^{(N)}$ , in the general case, we can still assert that $\|{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}\psi_{j}^{(N)}\|^{2}\geq c_{\alpha}>0$ for most $\psi_{j}^{(N)}$ . This indicates that our theorem can truly be interpreted as a delocalization theorem. The bad indices $j$ (for which $\|{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}\psi_{j}^{(N)}\|^{2}<c_{\alpha}$ ) will a priori depend on $\Lambda_{N}$ .

We now turn to the general averages $\langle\mathbf{K}\rangle_{\gamma_{j}}$ . Recall that $\Phi_{\gamma}^{N}(\tilde{x},\tilde{y})=\frac{\operatorname{Im}{\tilde{g}^{\gamma}_{N}}(\tilde{x},\tilde{y})}{\sum_{\tilde{x}\in\mathcal{D}_{N}}\operatorname{Im}{\tilde{g}^{\gamma}_{N}}(\tilde{x},\tilde{x})}$ . We will show in Section A.3 that under assumption (BSCT), we have

[TABLE]

uniformly in $\lambda\in I_{0}$ . This already shows that $\Phi_{\gamma}^{N}(\tilde{x},\tilde{y})$ is of order $1/N$ , since the denominator in its expression is of order $N$ . We strengthen this observation by proving that for any continuous $F:\mathbb{R}\to\mathbb{R}$ , we have uniformly in $\lambda\in I_{0}$ ,

[TABLE]

This says that the empirical distribution of $\left(N\Phi_{\gamma}^{N}(\tilde{x},\tilde{y})\right)$ (when $x$ is chosen uniformly at random in $V_{N}$ and $y$ is then chosen uniformly among the points at distance $k$ from $x$ ) converges to the law of $\left(\frac{{\operatorname{Im}\mathcal{G}^{\gamma}}(o,v)}{\mathbb{E}\left({\operatorname{Im}\mathcal{G}^{\gamma}}(o,o)\right)}\right)$ ( $v$ being chosen uniformly among the points at distance $k$ from the root $o$ ). This is a second way of saying that $\Phi_{\gamma}^{N}(\tilde{x},\tilde{y})$ is of order $1/N$ : when multiplied by $N$ , it has a non-trivial limiting distribution.

1.4. Case of the Anderson model

It is important to check that the models covered by the assumptions of our main theorems are not reduced to the case of the laplacian on regular graphs, already treated in [9, 24, 7]. Here we consider the important case of the Anderson model on regular graphs, i.e. the laplacian with a random potential. We will show that, if the strength of the disorder is small enough, then the assumptions of Theorem 1.1 and 1.3 are satisfied for almost every realization of the potential.

Let $\mathbb{T}_{q}$ be the $(q+1)$ -regular tree. Let $\nu$ be a probability measure on $\mathbb{R}$ , supported on a compact interval $[-A,A]$ , and for every $\epsilon>0$ let $\nu_{\epsilon}$ be the image of $\nu$ under the homothety $x\mapsto\epsilon x$ ( $\nu_{\epsilon}$ is now supported on $[-\epsilon A,\epsilon A]$ ). Let $\Omega=\mathbb{R}^{\mathbb{T}_{q}}$ , and define $\mathbf{P}_{\epsilon}$ on $\Omega$ by $\mathbf{P}_{\epsilon}=\mathop{\otimes}_{v\in\mathbb{T}_{q}}\nu_{\epsilon}$ . We shall denote by $\mathbf{E}_{\epsilon}$ the expectation with respect to $\mathbf{P}_{\epsilon}$ . Given $\omega=(\omega_{v})\in\Omega$ , define $\mathcal{W}^{\,\omega}(v)=\omega_{v}$ for $v\in\mathbb{T}_{q}$ . Then the $\{\omega_{v}\}_{v\in\mathbb{T}_{q}}$ are i.i.d. random variables with common distribution $\nu_{\epsilon}$ . Here $\epsilon\in\mathbb{R}$ is fixed and parametrizes the strength of the disorder.

Let $G_{N}=(V_{N},E_{N})$ be a (deterministic) sequence of $(q+1)$ -regular graphs with $|V_{N}|=N$ . This means that ${\widetilde{G}}_{N}=\mathbb{T}_{q}$ for all $N$ . Let $\Omega_{N}=\mathbb{R}^{V_{N}}$ and $\mathcal{P}^{\epsilon}_{N}=\mathop{\otimes}_{x\in V_{N}}\nu_{\epsilon}$ on $\Omega_{N}$ . We denote $\widetilde{\Omega}=\prod_{N\in\mathbb{N}}\Omega_{N}$ and let $\mathcal{P}_{\epsilon}$ be any probability measure on $\widetilde{\Omega}$ having $\mathcal{P}^{\epsilon}_{N}$ as a marginal on the factor $\Omega_{N}$ . Given $(\omega_{N})_{N\in\mathbb{N}}\in\widetilde{\Omega}$ , so that $\omega_{N}=(\omega_{x})_{x\in V_{N}}\in\Omega_{N}$ , we define $W^{\omega_{N}}(x)=\omega_{x}$ for $x\in V_{N}$ .

The results of this section are proved in a companion paper [11].

Proposition 1.5.

Suppose $(G_{N})$ satisfies (BST). Then (BSCT) holds for $\mathcal{P}^{\epsilon}$ -almost every realization of the potential. More precisely, for $\mathcal{P}^{\epsilon}$ -a.e. $(\omega_{N})\in\widetilde{\Omega}$ , the sequence $(G_{N},W^{\omega_{N}})$ has a local weak limit $\operatorname{\mathbb{P}}_{\epsilon}$ which is concentrated on $\{[\mathbb{T}_{q},o,\mathcal{W}^{\,\omega}]:\omega\in\Omega\}$ , where $o\in\mathbb{T}_{q}$ is fixed and arbitrary. The measure $\operatorname{\mathbb{P}}_{\epsilon}$ acts by taking the expectation w.r.t. $\mathbf{P}_{\epsilon}$ , that is, if $D=q+1$ , then

[TABLE]

We make the following assumption on the random variables:

(POT) The measure $\nu$ is Hölder continuous, i.e. there exist $C_{\nu}>0$ and $b\in(0,1]$ such that $\nu(I)\leq C_{\nu}|I|^{b}$ for all bounded $I\subset\mathbb{R}$ .

The following proposition is by no means trivial, it comes from the results of [33, 4].

Proposition 1.6.

Fix $0<\lambda_{0}<2\sqrt{q}$ . There exists $\epsilon(\lambda_{0})$ such that if $|\epsilon|<\epsilon(\lambda_{0})$ , then assumption (Green) holds for the measure $\mathbb{P}_{\epsilon}$ of Proposition 1.5 on $I_{1}=(-\lambda_{0},\lambda_{0})$ .

Corollary 1.7.

If the graphs $G_{N}$ form an expander family and satisfy (BST) and if the disorder $\epsilon$ is small enough, the conclusions of Theorems 1.1 and 1.3 hold true for $\mathcal{P}_{\epsilon}$ -a.e. realization $(\omega_{N})\in\widetilde{\Omega}$ , with $I_{1}=(-\lambda_{0},\lambda_{0})$ .

This gives a rich enough family of examples where the assumptions of Theorems 1.1 and 1.3 hold true. Thus the conclusions of the theorems hold for any observables $a_{N},K_{N}$ . If in addition $a_{N}$ or $K_{N}$ are independent of the disorder, some extra averaging takes place, and we may replace $\langle\mathbf{K}\rangle_{\lambda+i\eta_{0}}$ by a simpler average as follows.

Theorem 1.8.

Assume that (POT), (EXP) and (BST) hold. Given $(\omega_{N})\in\widetilde{\Omega}$ , let $(\psi_{i}^{\omega_{N}})_{i=1}^{N}$ be an orthonormal basis of eigenfunctions of $H_{N}^{\omega}=\mathcal{A}_{N}+W^{\omega_{N}}$ in $\ell^{2}(V_{N})$ , with corresponding eigenvalues $(\lambda_{i}^{\omega_{N}})_{i=1}^{N}$ .

Let $K_{N}:V_{N}\times V_{N}\to\mathbb{C}$ , $\sup_{N}\sup_{x,y\in V_{N}}|K_{N}(x,y)|\leq 1$ , $K_{N}(x,y)=0$ if $d(x,y)>R$ , and assume $K_{N}$ is independent of $(\omega_{N})$ . Fix $0<\lambda_{0}<2\sqrt{q}$ . If $|\epsilon|<\epsilon(\lambda_{0})$ , we have for $\mathcal{P}_{\epsilon}$ -a.e. $(\omega_{N})$ ,

[TABLE]

where for $\gamma\in\mathbb{C}\setminus\mathbb{R}$

[TABLE]

As in the previous theorems, if $R=0$ , the $\psi_{j}$ are arbitrary, while if $R>0$ , we assume the $\psi_{j}$ are real-valued.

For the Anderson model, $\mathbf{E}_{\epsilon}\left(\operatorname{Im}\mathcal{G}^{\gamma}(v,w)\right)$ depends only on $d(v,w)$ : $\mathbf{E}_{\epsilon}\left(\operatorname{Im}\mathcal{G}^{\gamma}(v,w)\right)=\mathbf{E}_{\epsilon}\left(\operatorname{Im}\mathcal{G}^{\gamma}(o,u)\right)$ where $u$ is any vertex of $\mathbb{T}_{q}$ such that $d(o,u)=d(v,w)$ .

In the special case $R=0$ , we have $\langle a_{N}\rangle_{\lambda}^{\eta_{0}}=\frac{1}{N}\sum_{x\in V_{N}}a(x)$ . So choosing $a_{N}={\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}$ , Theorem 1.8 implies the strong form of delocalization given by the uniform distribution of $\psi_{j}^{(N)}$ on $V_{N}$ , as explained in Section 1.3.

1.5. Relation with previous work

Our main Theorem 1.3 holds for deterministic sequences of graphs and potentials. For any sequence $(G_{N},W_{N})$ satisfying the assumptions of the theorem, the conclusion holds for any observable $K$ ; in particular, $K$ may depend on the graphs. As already noted, the result only says something about the delocalization of “most” eigenfunctions, where the “good” eigenfunctions exhibiting delocalization may depend on the choice of the observable $K$ .

In the past years, there has been tremendous interest in spectral statistics and delocalization of eigenfunctions of random sequences of graphs and potentials. Many papers consider random regular graphs, with degree going slowly to infinity [46, 27, 13, 14] or fixed [31, 15], sometimes adding a random i.i.d potential [31]. In particular, the recent papers [13, 14, 15] show “quantum unique ergodicity” for the adjacency matrix of random regular graphs : given an observable $a_{N}:\{1,\ldots,N\}\longrightarrow\mathbb{R}$ , for most $(q+1)$ -regular graphs on the vertices $\{1,\ldots,N\}$ we have that $\sum_{x=1}^{N}a_{N}(x)|\psi_{j}^{(N)}(x)|^{2}$ is close to $\langle a_{N}\rangle$ for all indices $j$ . This is a considerable strengthening of Corollary 1.2 (or of the similar result in [9]), that only says something for most indices $j$ . This possibility to prove QUE is, of course, due to the fact that $a_{N}$ has to be independent of the choice of the graph and that results holds for almost all graphs.

When “ergodicity” of eigenfunctions is tested numerically as in the physics papers [2, 3], it is natural to first pick a realization of the graph and of the potential, and then test the eigenfunctions one by one to determine if they can be localized in small parts of the graph. It is then natural to allow the test-observables to depend on the graph and the potential (which our Theorem 1.3 does, but not the results of [13, 15]), but also on the index $j$ of the eigenfunction, which neither of the rigourous mathematical results achieves. The numerical results of [3] seem to indicate that, as soon as a random disorder is turned on, the eigenfunctions will be localized in small parts of the graph. This is not in contradiction with our results : the region of localization of $\psi_{j}^{(N)}$ might depend on $j$ , but our result does not allow to test this. Note also that the results of [2, 3] were recently questioned in [45], where the authors argue that $N$ has not been taken large enough to see the delocalization take place.

The paper [12] proves a very important result, saying that if $\psi_{j}$ is an “almost eigenvector” of the adjacency matrix on a random regular graph $G$ , then for almost all $G$ and all $j$ , the value distribution of $\psi_{j}(x)$ as $x$ runs over $\{1,\ldots,N\}$ is close to a Gaussian $\mathcal{N}(0,\sigma_{j}^{2})$ with $\sigma_{j}\leq 1$ . Proving that $\sigma_{j}=1$ is a challenge; it would amount to proving that eigenfunctions cannot be localized in small parts of the graph. Our result does not say this, again because we can only test one observable $a$ at a time. The indices $j$ for which Corollary 1.2 proves delocalization depend on $a$ . If we wanted to have a common set of indices $j$ that do the job for all observables (whose number is exponential in $N$ ), we would need to have an exponential rate of convergence in Theorems 1.1, 1.3. Our proof gives a rate that is at best a negative power of the girth (itself typically of order $\log N$ ).

Finally we would also like to mention the paper [21], where existence of absolutely continuous spectrum for percolation graphs on the $(q+1)$ -regular tree is proven, if the percolation parameter is close enough to $1$ . Since the absolutely continuous spectrum is mixed with purely discrete spectrum, one cannot expect a quantum ergodicity result that claims delocalization of most eigenfunctions, but only a “partial delocalization” result for a positive proportion of eigenfunctions. These are the contents of [21, Theorem 9]. It would be nice to investigate what the methods of our paper would give for that model.

1.6. Outline of the proof

We borrowed the name “Quantum Ergodicity” from a result about laplacian eigenfunctions on Riemannian manifolds [44, 47, 25, 48]. The proof in the setting of laplacian eigenfunctions on manifolds is made of 4 steps, of unequal difficulty . These 4 steps are also present in our proof :

Step 0. Define the quantum variance. The goal is to show that this goes to [math] as $N\to\infty$ . A novelty of our proof is that we replace the usual quantum variance (10.1) by a “non-backtracking” one (3.3), where we replace the eigenfunctions $\psi_{j}$ by eigenfunctions $f_{j},f_{j}^{*}$ of a non-backtracking random walk (Section 3). These new $f_{j},f_{j}^{*}$ are thus eigenfunctions of a non-selfadjoint problem. This causes new difficulties, that however will be compensated by the fact that the non-backtracking random walk has simpler trajectories than the “simple” random walk generated by the adjacency matrix $\mathcal{A}$ .

Step 1. Show that the quantum variance is controlled by the Hilbert-Schmidt norm of $K$ . Although this is obvious for the original quantum variance, this will be much harder for the “non-backtracking quantum variance” (Section 4). This uses (BSCT) and (Green).

Step 2. Due to the fact that $f_{j},f_{j}^{*}$ satisfy an eigenfunction problem, the quantum variance is invariant under certain transformations (Section 5).

Step 3. One should see behind these transformations the emergence of a “classical dynamical system”. In the setting of laplacian eigenfunctions on manifolds, this is the geodesic flow. Here, what we get is a family of stationary Markov chains on the set of infinite non-backtracking paths (Section 6, Remark 6.1). This step has been called “classicalization” by U. Smilansky in a private conversation; this is supposed to mean the opposite of “quantization”.

Step 4. Iterate the classical dynamical system, use its ergodicity to show that the quantum variance is small (Section 9). Here, the ergodicity of our Markov chains (more precisely, the fact that the mixing rate is independent on $N$ ) comes from the (EXP) condition. Assumption (Green) is also used to control the probability transitions.

There is an additional step that does not exist in the traditional setting :

Step 5. Translate the result for the “non-backtracking quantum variance” (involving $f_{j},f_{j}^{*}$ ) into a result for the original one, involving the $\psi_{j}$ (Section 10). Assumptions (EXP), (BSCT) and (Green) are used here again to show that the transformation sending $\psi_{j}$ to $f_{j},f_{j}^{*}$ is well-behaved in the limit $N\longrightarrow+\infty$ .

2. Basic identities

2.1. “Quantization procedure” on trees and their quotients

Let $G=G_{N}$ , $G=(V,E)$ . Most of the time we will drop the subscript $N$ in the notation. As in Section 1.2, we regard $G$ as a quotient: $G=\Gamma\backslash\widetilde{G}$ , and let $\pi:{\widetilde{V}}\to V$ denote the projection. Fix a fundamental domain $\mathcal{D}\subset{\widetilde{V}}$ for the action of $\Gamma$ on ${\widetilde{V}}$ . Then $|\mathcal{D}|=|V|$ .

Each edge $\{x_{0},x_{1}\}\in{\widetilde{E}}$ , gives rise to two oriented edges $e=(x_{0},x_{1})$ and $\hat{e}=(x_{1},x_{0})$ in the reverse direction. We let $o_{e}$ and $t_{e}$ be the origin and terminus of $e$ , respectively. We then let ${\widetilde{B}}_{1}$ , or simply ${\widetilde{B}}$ , be the set of all such oriented edges of ${\widetilde{G}}$ . More generally, let ${\widetilde{B}}_{k}$ be the set of non-backtracking paths of length $k$ in ${\widetilde{G}}$ . By convention, ${\widetilde{B}}_{0}:={\widetilde{V}}$ . If $\omega=(x_{0},\ldots,x_{k})$ and $\omega^{\prime}=(x_{0}^{\prime},\ldots,x_{k}^{\prime})\in{\widetilde{B}}_{k}$ , we write $\omega\rightsquigarrow\omega^{\prime}$ if $x_{0}^{\prime}=x_{1},\ldots,x_{k-1}^{\prime}=x_{k}$ and $(x_{0},\ldots,x_{k},x_{k}^{\prime})\in{\widetilde{B}}_{k+1}$ . We also denote $o_{\omega}=x_{0}$ , $t_{\omega}=x_{k}$ .

These notions descend to the quotient. We denote by $B_{k}:=\Gamma\backslash\widetilde{B}_{k}$ the set of non-backtracking paths of length $k$ in $G$ . By convention, $B_{0}:=V$ . For $k=1$ we let $B=B_{1}$ . The set $B_{k}$ is in bijection with the subset $\mathcal{D}^{(k)}\subset{\widetilde{B}}_{k}$ of elements having their origin in $\mathcal{D}$ .

Let $\mathscr{H}_{k}=\mathbb{C}^{B_{k}}$ (the complex-valued functions on $B_{k}$ ), $\mathscr{H}=\mathop{\oplus}_{k=0}^{\infty}\mathscr{H}_{k}$ and $\mathscr{H}_{\leq k}:=\mathop{\oplus}_{\ell=0}^{k}\mathscr{H}_{\ell}$ .

It will be convenient to identify $\mathbb{C}^{B_{k}}$ with the $\Gamma$ -invariant elements of $\mathbb{C}^{{\widetilde{B}}_{k}}$ or with $\mathbb{C}^{\mathcal{D}^{(k)}}$ . For $K\in\mathscr{H}_{k}$ and $(x_{0},\ldots,x_{k})\in{\widetilde{B}}_{k}$ , we will sometimes use the short-hand notation $K(x_{0};x_{k})$ for $K(x_{0},\ldots,x_{k})$ . This is justified by the fact than on ${\widetilde{G}}$ , the endpoints $(x_{0};x_{k})$ determine the path $(x_{0},\ldots,x_{k})$ uniquely. We will also use this short-hand notation on $B_{k}$ , although in that case one should keep in mind that $K(x_{0};x_{k})$ actually depends on the full path $(x_{0},\ldots,x_{k})$ .

Any $K\in\mathscr{H}_{k}$ (regarded as a $\Gamma$ -invariant element of $\mathbb{C}^{\widetilde{B}_{k}}$ ) may be used to define an operator $\widehat{K}$ on the space of finitely supported functions on ${\widetilde{V}}$ , with kernel $\langle\delta_{v},\widehat{K}\delta_{w}\rangle_{\ell^{2}({\widetilde{V}})}=K(v;w)$ . It also defines an operator $\widehat{K}_{G}$ on $\mathbb{C}^{V}$ , with kernel

[TABLE]

where $\tilde{x},\tilde{y}\in{\widetilde{V}}$ are representatives of $x,y\in V$ . The map $K\in\mathscr{H}_{k}\mapsto K_{G}$ is a priori not one-to-one. However, if $\rho_{G}(x)\geq k$ , then $K_{G}(x,\cdot)$ determines $K(\tilde{x},\cdot)$ uniquely. To see that $K\in\mathscr{H}_{k}\mapsto K_{G}$ is surjective, consider $\mathbf{k}:V\times V\longrightarrow\mathbb{R}$ supported at distance $k$ from the diagonal, and let $K(\tilde{x},\tilde{y})=\mathbf{k}(\pi(\tilde{x}),\pi(\tilde{y})){\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{dist(\tilde{x},\tilde{y})\leq k}(\sharp\{\gamma\in\Gamma,dist(\tilde{x},\gamma\cdot\tilde{y})\leq k\})^{-1}$ . Then $K_{G}=\mathbf{k}$ and this coincides with the lift (1.6) except at the few points where $\rho_{G}(x)\leq k$ .

Define the non-backtracking adjacency operator $\mathcal{B}:\mathbb{C}^{\widetilde{B}}\to\mathbb{C}^{\widetilde{B}}$ by

[TABLE]

where $\mathcal{N}_{x}$ means the set of neighbours of $x$ . Then an element $K\in\mathscr{H}_{k}$ may also be used to define an operator $\widehat{K}_{{\widetilde{B}}}$ on $\ell^{2}({\widetilde{B}})$ , with kernel

[TABLE]

Thus $\langle\delta_{b_{1}},\widehat{K}_{{\widetilde{B}}}\delta_{b_{2}}\rangle_{\ell^{2}({\widetilde{B}})}\not=0$ only if there is a non-backtracking path of length $k$ in ${\widetilde{G}}$ , starting with the oriented edge $b_{1}$ and ending with $b_{2}$ .

Finally, $K\in\mathscr{H}_{k}$ also defines an operator $\widehat{K}_{B}$ on $\mathbb{C}^{B}$ , with matrix $K_{B}:B\times B\to\mathbb{C}$ given by

[TABLE]

where $\tilde{b}_{1},\tilde{b}_{2}\in{\widetilde{B}}$ are lifts of $b_{1},b_{2}\in B$ . By linearity, this extends to $K\in\mathscr{H}_{\leq k}$ .

Note that if $K\in\mathscr{H}_{k}$ , then $\langle\psi,K_{G}\phi\rangle_{\ell^{2}(V)}=\sum_{(x_{0},\ldots,x_{k})\in B_{k}}\overline{\psi(x_{0})}K(x_{0};x_{k})\phi(x_{k})$ for any $\psi,\phi\in\ell^{2}(V)$ . Similarly, if $f,g\in\ell^{2}(B)$ , we have

[TABLE]

where $\sum_{{}_{x_{0,1}}(x_{2};x_{k})}$ sums over all $(x_{2};x_{k})\in B_{k-2}$ such that $x_{2}\in\mathcal{N}_{x_{1}}\setminus\{x_{0}\}$ . Alternatively, we may simply sum over $(x_{2};x_{k})\in B_{k-2}$ but decide that $K(x_{0};x_{k})=0$ if the path $(x_{0},\ldots,x_{k})$ back-tracks.

Remark 2.1.

The maps $K\mapsto\widehat{K}$ , $K\mapsto\widehat{K}_{G}$ , $K\mapsto\widehat{K}_{{\widetilde{B}}}$ and $K\mapsto\widehat{K}_{B}$ associate an operator to a function on the set of paths. It is tempting to view this as a form of “quantization procedure” as those used for quantum ergodicity on manifolds.

2.2. Green functions on trees

Assumption (BST) says that our graphs have few short loops, in other words, that most balls of a given radius look like trees. One of the ingredients of our proof is that the Green function on trees satisfies certain algebraic relations, that follow from the fact that removing a vertex (or cutting an edge) from a tree suffices to disconnect it.

Here we recall some standard facts that hold for an arbitrary tree $T=(V(T),E(T))$ , endowed with a discrete Schrödinger of the form $H=\mathcal{A}+W$ acting on $\ell^{2}(V(T))$ , where $\mathcal{A}$ is the adjacency matrix and $W:V(T)\longrightarrow\mathbb{R}$ is a bounded function. Given $\gamma\in\mathbb{C}\setminus\mathbb{R}$ and $v,w\in T$ , the Green function is denoted in this section by

[TABLE]

If $v\sim w$ , we denote by ${T}^{(v|w)}$ the tree obtained by removing from ${T}$ the branch emanating from $v$ that passes through $w$ . We define the restriction $H^{(v|w)}(u,u^{\prime})=H(u,u^{\prime})$ if $u,u^{\prime}\in{T}^{(v|w)}$ and zero otherwise. The corresponding Green function is denoted by ${\tilde{g}}^{(v|w)}(\cdot,\cdot;\gamma)$ . We finally denote

[TABLE]

Later on, we will apply these results for $(T,W)=({\widetilde{G}}_{N},{\widetilde{W}}_{N})$ . In this case the (full) Green function will be denoted by $\tilde{g}_{N}^{\gamma}(x,y)$ , and the restricted one by $\zeta_{x}^{\gamma}(y)$ . In the case $(T,W)=(\mathcal{T},\mathcal{W})$ (the random coloured rooted trees of assumption (BSCT)), the Green function will be denoted by $\mathcal{G}^{\gamma}(v,w)$ , and the restricted one by $\hat{\zeta}_{w}^{\gamma}(v)$ . As a general rule, the objects defined on the limit $(\mathcal{T},\mathcal{W})$ will wear a hat $\hat{\cdot}$ to distinguish them from similar objects defined on $({\widetilde{G}}_{N},{\widetilde{W}}_{N})$ (see also Remark A.3).

The Green functions on trees satisfy some classical recursive relations; the following lemma is proved for instance in [10]. Given $v\in V(T)$ , we denote by $\mathcal{N}_{v}$ its set of nearest neighbours.

Lemma 2.2.

For any $v\in{T}$ and $\gamma\in\mathbb{C}\setminus\mathbb{R}$ , we have

[TABLE]

For any non-backtracking path $(v_{0};v_{k})$ in $T$ ,

[TABLE]

Also, for any $w\sim v$ , we have

[TABLE]

For any $v,w\in T$ , we have

[TABLE]

Next, if $\gamma=\lambda\pm i\eta$ with $\lambda\in\mathbb{R}$ , $\eta>0$ , then

[TABLE]

Finally, if $\Psi_{\gamma,v}(w)=\operatorname{Im}G(v,w;\gamma)$ , then for any path $(v_{0},\dots,v_{k})$ in ${T}$ , $k\geq 1$ ,

[TABLE]

Note that $|\zeta_{v}^{\lambda+i\eta}(u)|\leq\eta^{-1}$ . It follows from (2.4b) that for any $\lambda\in[-(A+D),A+D]$ and $\eta\in(0,1)$ ,

[TABLE]

where $c_{D,A}=2(A+D)+1$ .

Corollary 2.3.

Given $\gamma\in\mathbb{C}\setminus\mathbb{R}$ , for any $v_{0},v_{1}\in T$ , $v_{0}\sim v_{1}$ , we have

[TABLE]

Also, for any non-backtracking path $(v_{0};v_{k})$ in $T$ , $k\geq 1$ , we have

[TABLE]

Proof.

By (2.10), $\Psi_{\gamma,v_{0}}(v_{1})-\zeta_{v_{0}}^{\gamma}(v_{1})\Psi_{\gamma,v_{0}}(v_{0})=\operatorname{Im}\zeta_{v_{0}}^{\gamma}(v_{1})\overline{G(v_{0},v_{0};\gamma)}$ . As $\Psi_{\gamma,v_{1}}(v_{0})=\Psi_{\gamma,v_{0}}(v_{1})$ , we thus get using (2.6),

[TABLE]

Next, since $G(v_{1},v_{1};\gamma)=\frac{G(v_{0},v_{1};\gamma)}{\zeta_{v_{1}}^{\gamma}(v_{0})}$ and $\frac{1}{\zeta_{v_{1}}^{\gamma}(v_{0})}=\zeta_{v_{0}}^{\gamma}(v_{1})+2m_{v_{0}}^{\gamma}$ , we have

[TABLE]

so

[TABLE]

and thus

[TABLE]

This completes the proof of the first claim, by (2.14). Next, we use again that $\Psi_{\gamma,v_{0}}(v_{1})-\zeta_{v_{0}}^{\gamma}(v_{1})\Psi_{\gamma,v_{0}}(v_{0})=\operatorname{Im}\zeta_{v_{0}}^{\gamma}(v_{1})\overline{G(v_{0},v_{0};\gamma)}$ . In addition, by (2.2),

[TABLE]

where the last equality is proved as in (2.15). This proves the second claim for $k=1$ .

Now let $k\geq 2$ . If we apply (2.10) with $v_{1}$ instead of $v_{0}$ and use (2.6), we get

[TABLE]

The second claim for $k\geq 2$ now follows by (2.10). ∎

We conclude by recalling the fact that for Lebesgue a.e. $\lambda\in\mathbb{R}$ , the Green function has a finite limit on the real axis almost surely. Remember that $\mathscr{T}_{\ast}^{D,A}$ us the set of coloured rooted trees, and that $\mathbb{P}$ is the probability measure on $\mathscr{T}_{\ast}^{D,A}$ appearing in (BSCT).

Proposition 2.4.

There exists a Lebesgue-null set $\mathfrak{A}\subset\mathbb{R}$ such that, to each $\lambda\in\mathfrak{S}:=\mathbb{R}\setminus\mathfrak{A}$ , there is $\Omega_{\lambda}\subseteq\mathscr{T}_{\ast}^{D,A}$ with $\operatorname{\mathbb{P}}(\Omega_{\lambda})=1$ , such that if $[\mathcal{T},o,\mathcal{W}]\in\Omega_{\lambda}$ , then the limit $G(v,w;\lambda+i0):=\lim_{\eta\downarrow 0}G(v,w;\lambda+i\eta)$ exists for any $v,w\in\mathcal{T}$ .

Proof.

Fix $[\mathcal{T},o,\mathcal{W}]$ . By [10, Lemma 3.3], there is a Lebesgue-null set $\mathfrak{A}_{[\mathcal{T},o,\mathcal{W}]}\subset\mathbb{R}$ such that for any $\lambda\in\mathfrak{S}_{[\mathcal{T},o,\mathcal{W}]}:=\mathbb{R}\setminus\mathfrak{A}_{[\mathcal{T},o,\mathcal{W}]}$ , $G(v,w;\lambda+i0)$ exists for all $v,w\in\mathcal{T}$ . Let $\mathfrak{D}=\{([\mathcal{T},o,\mathcal{W}],\lambda):\text{ the limit does not exist}\}$ . Then

[TABLE]

where $\mathfrak{D}_{[\mathcal{T},o,\mathcal{W}]}=\{\lambda\in\mathbb{R}:([\mathcal{T},o,\mathcal{W}],\lambda)\in\mathfrak{D}\}$ . Since $\mathfrak{D}_{[\mathcal{T},o,\mathcal{W}]}\subseteq\mathfrak{A}_{[\mathcal{T},o,\mathcal{W}]}$ , we have $Leb(\mathfrak{D}_{[\mathcal{T},o,\mathcal{W}]})=0$ for all $[\mathcal{T},o,\mathcal{W}]$ . Hence,

[TABLE]

where $\mathfrak{D}_{\lambda}=\{[\mathcal{T},o,\mathcal{W}]\in\mathscr{T}_{\ast}^{D,A}:([\mathcal{T},o,\mathcal{W}],\lambda)\in\mathfrak{D}\}$ . It follows that $\operatorname{\mathbb{P}}(\mathfrak{D}_{\lambda})=0$ on a Lebesgue-full set $\mathfrak{A}$ . Taking $\Omega_{\lambda}=\mathfrak{D}_{\lambda}^{c}$ completes the proof. ∎

3. The non-backtracking quantum variance

Our strategy follows the one discovered in [7]. We find a transformation turning the eigenfunctions of $\mathcal{A}+W$ on $G=\Gamma\backslash{\widetilde{G}}$ into eigenfunctions of a “non-backtracking” random walk. The new operator is not self-adjoint, but this difficulty is superseded by the fact that the trajectories of non-backtracking random walks (on a tree) are much simpler than those of usual random walks.

The notation is the same as in the introduction except that we drop the subscript $N$ . Suppose $(\psi_{j})$ is an orthonormal basis of eigenfunctions for $H=\mathcal{A}+W$ , say $H\psi_{j}=\lambda_{j}\psi_{j}$ .

Fix $\eta_{0}\in(0,1)$ , let $\gamma_{j}=\lambda_{j}+i\eta_{0}$ and let

[TABLE]

where $\zeta_{x}^{\gamma}(y)=-\tilde{g}_{N}^{(y|x)}(y,y;\gamma)$ (see notation in §2.2). If $\mathcal{B}$ is the non-backtracking operator (2.1), we have

[TABLE]

where we used (2.4b). Hence we get

[TABLE]

where $\tau_{\pm}:\ell^{2}(V)\to\ell^{2}(B)$ are defined by

[TABLE]

In [7] it was possible to set $\eta_{0}=0$ , and (3.1) said exactly that $f_{j}$ was an eigenfunction of the weighted non-backtracking operator $\mathcal{B}\zeta^{\gamma_{j}}$ for the eigenvalue $1$ . At our level of generality, we do not know if $\zeta^{\lambda_{j}+i0}$ is well-defined on $\widetilde{G}_{N}$ . We have to work with $\eta_{0}>0$ and let $\eta_{0}$ tend to [math] only at the end of the proof, after $N$ has gone to $\infty$ . Hence, $f_{j}$ is not exactly an eigenfunction, and our formulas will contain error terms of size $\eta_{0}$ that we will need to estimate precisely, to show that they disappear as $N\to+\infty$ , followed by $\eta_{0}\downarrow 0$ .

Similarly, if we put

[TABLE]

we note that $f_{j}^{\ast}=\iota f_{j}$ where $\iota$ is the edge reversal involution, and we get

[TABLE]

Let $I$ be an open interval such that $\overline{I}\subset I_{1}$ . We define for $K\in\mathscr{H}_{k}$ ,

[TABLE]

The dependence of this quantity on $\eta_{0}$ is hidden in the definition of $f_{j},f_{j}^{*}$ . The scalar product $\langle\cdot,\cdot\rangle$ is on $\ell^{2}(B)$ endowed with the uniform measure; cf. (2.2).

Remark 3.1.

We call (3.3) “quantum variance”, in analogy to the quantity bearing this name in quantum chaos. However, there are some significant differences :

•

we use the functions $f_{j}$ and $f_{j}^{*}$ instead of the original $\psi_{j}$ . They are (quasi)-eigenfunctions, respectively of the non-selfadjoint operators $\mathcal{B}\zeta^{\gamma_{j}}$ and $\mathcal{B}^{\ast}\iota\zeta^{\gamma_{j}}$ .

•

if $K$ is the identity operator $Id$ , we do not have the normalization ${\mathrm{Var_{nb,\eta_{0}}^{I}}}(Id)=1$ .

•

we did not take the square of $\left|\left\langle f_{j}^{\ast},K_{B}f_{j}\right\rangle\right|$ in the definition. This is purely for technical convenience, the square will appear later when we apply the Cauchy-Schwarz inequality.

We will need to extend (3.3) to operators $K$ that depend on the eigenvalue $\lambda_{j}$ in a holomorphic fashion, as spelled out in the following definition. Note that $K$ also depends on $N$ , also this tends to be implicit in our notation. We let $\mathbb{C}^{+}=\{\gamma\in\mathbb{C},\operatorname{Im}\gamma>0\}$ .

Definition 3.2.

Assumptions (Hol).

We assume that $\gamma\mapsto K^{\gamma}=K_{N}^{\gamma}$ is a map from $\gamma\in\mathbb{C}^{+}$ to $\mathscr{H}_{k}$ such that :

•

For $\eta_{0}>0$ , for each $N$ and $(x_{0};x_{k})$ , the function $\lambda\mapsto K^{\lambda+i\eta_{0}}(x_{0};x_{k})$ from $\mathbb{R}\to\mathbb{C}$ has an analytic extension $K_{\eta_{0}}$ to the strip $\{z:|\operatorname{Im}z|<\eta_{0}/2\}$ .

•

Given $\eta_{0}>0$ , we have $\sup_{N}\sup_{\operatorname{Re}z\in I_{1},|\operatorname{Im}z|<\eta_{0}/2}\sup_{(x_{0};x_{k})}|K_{N,\eta_{0}}^{z}(x_{0};x_{k})|<+\infty$ and $\sup_{N}\sup_{\operatorname{Re}z\in I_{1},|\operatorname{Im}z|<\eta_{0}/2}\sup_{(x_{0};x_{k})}|\partial_{z}K_{N,\eta_{0}}^{z}(x_{0};x_{k})|<+\infty$ . We write ${\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}$ for the maximum of these two quantities.

•

For all $s>0$ ,

[TABLE]

If $\gamma\mapsto K^{\gamma}$ is holomorphic on $\mathbb{C}^{+}$ , then it obviously satisfies the first point of the definition with $K_{\eta_{0}}(z)=K^{z+i\eta_{0}}$ . For instance, if $K^{\gamma}(x_{0};x_{k})$ has the form $\sum_{n\geq 0}a_{(x_{0};x_{k})}^{(n)}\gamma^{n}$ , then we see that $\lambda\mapsto K^{\lambda+i\eta_{0}}(x_{0};x_{k})$ extends to $K_{\eta_{0}}(z)=\sum_{n\geq 0}a_{(x_{0};x_{k})}^{(n)}(z+i\eta_{0})^{n}$ . Note that, although $\gamma\mapsto\overline{K^{\gamma}}$ is not holomorphic, its restriction to an horizontal line is still a real-analytic map $\mathbb{R}\ni\lambda\mapsto\overline{K^{\lambda+i\eta_{0}}(x_{0};x_{k})}$ , as it possesses an analytic extension given by $z\mapsto\sum_{n\geq 0}\overline{a_{(x_{0};x_{k})}^{(n)}}(z-i\eta_{0})^{n}$ . So $\overline{K^{\gamma}}$ will satisfy (Hol) if $K^{\gamma}$ does.

Conditions (Hol) are stable under the sum and composition of operators.

We extend (3.3) to this setting, by letting

[TABLE]

Most of the paper is devoted to showing :

Theorem 3.3.

Under assumptions (EXP), (BSCT), (Green), if $K^{\gamma}\in\mathscr{H}_{k}$ has the form $K^{\gamma}=\mathcal{F}_{\gamma}K$ for the operators $\mathcal{F}_{\gamma}$ in Corollary 10.3, then

[TABLE]

These $\gamma\mapsto\mathcal{F}_{\gamma}K$ satisfy (Hol). The fact that this implies Theorem 1.3 is proven in Section 10, that may be read independently of the proof of Theorem 3.3.

4. Step 1 : Bound on the non-backtracking quantum variance

Given $\gamma\in\mathbb{C}^{+}$ , we introduce a norm on each $\mathscr{H}_{k}$ , $k\geq 1$ , defined by

[TABLE]

We denote by $\langle\cdot,\cdot\rangle_{\gamma}$ the associated scalar product. The reason for introducing the weight $\frac{|\operatorname{Im}\zeta_{x}^{\gamma}(y)|}{|\zeta_{x}^{\gamma}(y)|^{2}}$ will be apparent in Section 6. The aim of this section is to prove Theorem 4.1. Here, we assume that $I=(a,b)$ , with $[a,b]\subset I_{1}$ . This implies that there is $\eta_{a,b}$ such that $(a-2\eta,b+2\eta)\subset I_{1}$ for all $\eta\leq\eta_{a,b}$ . We then assume that $\eta\leq\min(\eta_{0}/2,\eta_{a,b})$ .

Theorem 4.1.

Under assumptions (BSCT), (Green), if $K^{\gamma}\in\mathscr{H}_{k}$ satisfies the set of assumptions (Hol), then for any interval $I=(a,b)$ as above,

[TABLE]

In the scheme of §1.6, this corresponds to Step 1. This is more complicated than usual, due to the fact that we have replaced the orthonormal family $(\psi_{j})$ by non-orthogonal functions $(f_{j}),(f_{j}^{*})$ , and also because $K$ “depends on $\lambda_{j}$ ” in (3.5).

Recall that $D$ above is the maximal degree and we assumed $|W_{N}(x)|\leq A$ . In particular, any eigenvalue $\lambda_{j}\in I_{0}:=[-(A+D),A+D]$ . For $\lambda\in\mathbb{R}$ and $\eta_{0}\in(0,1)$ , let

[TABLE]

Denoting $\gamma_{j}={\lambda_{j}+i\eta_{0}}$ , we have (by a double application of the Cauchy-Schwarz inequality)

[TABLE]

We check at the end of the section that

[TABLE]

We now introduce an approximation $\chi$ of ${\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{I}$ by an entire function, by the standard convolution procedure :

Fix $0<\eta\leq\eta_{0}/2$ . Let $\phi(x)=\frac{1}{\pi^{1/2}}e^{-x^{2}}$ and denote $\phi_{\epsilon}(x)=\epsilon^{-1}\phi(x/\epsilon)$ . Let $\chi$ be the convolution $\chi=\phi_{\eta^{3/2}}\ast{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{I}$ on $\mathbb{R}$ . Then $\chi$ extends to an entire function on $\mathbb{C}$ given by

[TABLE]

Note that $0\leq\chi(x)\leq 1$ for $x\in\mathbb{R}$ , and $|\chi(z)|\leq e^{\eta^{5}}$ for $|\operatorname{Im}z|\leq\eta^{4}$ . We assume $\eta$ is small enough so that $\chi\geq\frac{1}{3}{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{I}$ and $|\chi(z)|\leq e^{-1/\eta}$ on $\{z\in\mathbb{C}:|\operatorname{Im}z|\leq\eta^{4},\ d(\operatorname{Re}z,I)\geq 2\eta\}$ . We finally note that $|\frac{\partial\chi}{\partial t_{2}}(t_{1}+it_{2})|\leq C\eta^{-3}e^{\eta^{5}}$ for any $z=t_{1}+it_{2}$ with $t_{1}\in I_{0}$ and $|t_{2}|\leq\eta^{4}$ .

By (4) and (4.3) we have

[TABLE]

Now by (2.3), we have

[TABLE]

where $(x_{0};x_{k})=(x_{0},x_{1},x_{2},\dots,x_{k})$ , $(x_{0};y_{k})=(x_{0},x_{1},y_{2},\dots,y_{k})$ and with the convention that $K^{\gamma_{j}}(x_{0};x_{k})=0$ if the path $(x_{0},x_{1},x_{2},\dots,x_{k})$ backtracks. The function $\lambda\mapsto|\alpha_{\lambda+i\eta_{0}}(x_{0},x_{1})|^{2}=\frac{-\operatorname{Im}\zeta_{x_{1}}^{\lambda+i\eta_{0}}(x_{0})}{|\zeta_{x_{1}}^{\lambda+i\eta_{0}}(x_{0})|^{2}}$ extends analytically to the rectangle $\mathscr{R}=\{z\in\mathbb{C}:\operatorname{Re}z\in[-(A+D+\eta),(A+D+\eta)],\operatorname{Im}z\in[-\eta^{4},\eta^{4}]\}$ through the formula $\frac{\zeta_{x_{1}}^{z-i\eta_{0}}(x_{0})-\zeta_{x_{1}}^{z+i\eta_{0}}(x_{0})}{2i\,\zeta_{x_{1}}^{z+i\eta_{0}}(x_{0})\zeta_{x_{1}}^{z-i\eta_{0}}(x_{0})}$ . We denote this by ${\alpha}_{\eta_{0}}^{z}(x_{0},x_{1})$ (which is not the same as $|\alpha_{z+i\eta_{0}}(x_{0},x_{1})|^{2}$ ). The same is true for the other $\zeta$ terms. We denote the extension of $\lambda\mapsto K^{\lambda+i\eta_{0}}(x_{0};x_{k})\overline{K^{\lambda+i\eta_{0}}(x_{0};y_{k})}$ by ${K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})$ . Again, if $(x_{0};y_{k})=(x_{0};x_{k})$ , this is not the same as $|K^{z+i\eta_{0}}(x_{0};x_{k})|^{2}$ . However, see Lemma 4.4 to compare both.

Given $x,y\in V$ and $z\in\mathbb{C}\setminus\mathbb{R}$ , let

[TABLE]

be the Green function of $H$ on the finite graph $G$ . Then by Cauchy’s integral formula,

[TABLE]

We now observe that the integral over the vertical segments of the contour do not contribute as $\eta,\eta_{0}\downarrow 0$ . More precisely,

Lemma 4.2.

The integral $\frac{-1}{2i\pi N}\int_{z\in\partial\mathscr{R}}F(z)\,\mathrm{d}z$ in (4) may be replaced by $\frac{1}{2i\pi N}(\int_{a-2\eta}^{b+2\eta}F(\lambda+i\eta^{4})\,\mathrm{d}\lambda-\int_{a-2\eta}^{b+2\eta}F(\lambda-i\eta^{4})\,\mathrm{d}\lambda$ , up to an error term at most $C_{k,D,A}\eta_{0}^{-3}\eta^{-4}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{-1/\eta}$ .

Proof.

The error is the integral of $F(z)$ on the two vertical paths $\{\operatorname{Re}z=-A-D-\eta,\operatorname{Im}z\in[-\eta^{4},\eta^{4}]\}$ , $\{\operatorname{Re}z=A+D+\eta,\operatorname{Im}z\in[-\eta^{4},\eta^{4}]\}$ , and the four connected components of the set $\{\operatorname{Im}z=\pm\eta^{4},\operatorname{Re}z\in[-A-D-\eta,A+D+\eta]\setminus(a-2\eta,b+2\eta)\}$ . On these pieces, we know that $|\chi(z)|\leq e^{-1/\eta}$ . Moreover, $|{K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})|\leq{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}$ . Next, $|{\alpha}_{\eta_{0}}^{z}|=\frac{1}{2}\big{|}\frac{1}{\zeta_{x_{1}}^{z+i\eta_{0}}(x_{0})}-\frac{1}{\zeta_{x_{1}}^{z-i\eta_{0}}(x_{0})}\big{|}\leq c_{D,A}\big{(}\frac{1}{\eta_{0}+\eta^{4}}+\frac{1}{\eta_{0}-\eta^{4}}\big{)}$ by (2.11). Since $\eta\leq\eta_{0}/2$ by assumtpion, this yields $|{\alpha}_{\eta_{0}}^{z}|\leq C_{D,A}\eta_{0}^{-1}$ . The Green functions and $\zeta$ terms may be bounded similarly by $4c_{D,A}\eta_{0}^{-2}\eta^{-4}$ . A factor $C_{k,D}$ comes from the number of paths, divided by $N$ . ∎

Our next aim is to lift this expression to the universal cover $\widetilde{G}$ . In other words, we wish to replace $g^{z}$ by $\tilde{g}^{z}$ everywhere, to be able to use the identities of §2.2.

Lemma 4.3.

Denote $z=\lambda+i\eta^{4}$ . Given $R\in\mathbb{N}^{\ast}$ , there is $d_{R,k,\eta}>0$ such that the integral $\frac{1}{2i\pi N}\int_{a-2\eta}^{b+2\eta}F(z)\,\mathrm{d}\lambda$ in Lemma 4.2 may be replaced by

[TABLE]

where $\zeta_{e_{k}}^{\gamma}=\zeta_{x_{k-1}}^{\gamma}(x_{k})$ and $\zeta_{e_{k}^{\prime}}^{\gamma}=\zeta_{y_{k-1}}^{\gamma}(y_{k})$ , up to an error term $(\frac{\#\{\rho_{G}(x_{0})<d_{R,k,\eta}\}}{N}\eta^{-4}+\frac{1}{R})C_{k,D,A}\eta_{0}^{-3}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}$ .

Similarly, $\frac{1}{2i\pi N}\int_{a-2\eta}^{b+2\eta}F(\bar{z})\,\mathrm{d}\lambda$ in Lemma 4.2 may be replaced by

[TABLE]

up to an error term $(\frac{\#\{\rho_{G}(x_{0})<d_{R,k,\eta}\}}{N}\eta^{-4}+\frac{1}{R})C_{k,D,A}\eta_{0}^{-3}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}$ .

Proof.

We first approximate $\lambda\mapsto g^{\lambda+i\eta^{4}}(x,y)$ by a polynomial on the compact interval $I_{0}$ . Let $h_{\eta}(t)=-(t-i\eta^{4})^{-1}$ and choose a polynomial $q_{\eta}$ with $\|h_{\eta}-q_{\eta}\|_{\infty}<\frac{1}{R}$ . Then $\|h_{\eta}(H-\lambda)-q_{\eta}(H-\lambda)\|<\frac{1}{R}$ , so $|g^{\lambda+i\eta}(x,y)-q_{\eta}(H-\lambda)(x,y)|<\frac{1}{R}$ for any $x,y$ and $\lambda$ . So replacing each $g^{\lambda+i\eta^{4}}(x,y)$ by $q_{\eta}(H-\lambda)(x,y)$ in the sums gives an error term $\frac{C_{k,D,A}\eta_{0}^{-3}{\left|\kern-0.75346pt\left|\kern-0.75346pt\left|K\right|\kern-0.75346pt\right|\kern-0.75346pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}}{R}$ as in Lemma 4.2.

Denote $C_{k,D,A,\eta_{0}}=C_{k,D,A}\eta_{0}^{-3}\|K\|_{\eta_{0}}^{2}$ .

Let $d_{R,\eta}$ be the degree of $q_{\eta}$ . Suppose $\rho_{G}(x_{0})\geq d_{R,\eta}+k=:d_{R,k,\eta}$ . Then it is easy to see that $q_{\eta}(H-\lambda)(x_{k},y_{k})=q_{\eta}(\widetilde{H}-\lambda)(\tilde{x}_{k},\tilde{y}_{k})$ , c.f. Lemma A.1. The same holds for the other edges $(x_{k},y_{k-1})$ and so on. The terms with $\rho_{G}(x_{0})<d_{R,k,\eta}$ bring an error term $\frac{\#\{\rho_{G}(x_{0})<d_{R,k,\eta}\}}{N}\eta^{-4}C_{k,D,A,\eta_{0}}$ . Finally, we replace the $q_{\eta}(\widetilde{H}-\lambda)(\tilde{x},\tilde{y})$ by ${\tilde{g}}^{\lambda+i\eta^{4}}(\tilde{x},\tilde{y})$ which yields again an error of the form $\frac{C_{k,D,A,\eta_{0}}}{R}$ .

This proves the first statement, and the second one is proven similarly. ∎

We continue to simplify the expression and record the following.

Lemma 4.4.

If we replace ${\alpha}_{\eta_{0}}^{z}(x_{0},x_{1}){K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})$ and ${\alpha}^{\bar{z}}_{\eta_{0}}(x_{0},x_{1})K^{\bar{z}}_{\eta_{0}}(x_{0};x_{k},y_{k})$ in Lemma 4.3 by $|\alpha_{z+i\eta_{0}}(x_{0},x_{1})|^{2}K^{z+i\eta_{0}}(x_{0};x_{k})\overline{K^{z+i\eta_{0}}(x_{0};y_{k})}$ , then as $N\to\infty$ , the error we get is at most $C_{k,D,A}\eta_{0}^{-6}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}\eta^{4}$ . We may also replace $\chi(\lambda\pm i\eta^{4})$ by $\chi(\lambda)$ , modulo the asymptotic error $C_{k,D,A}\eta_{0}^{-3}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}\eta$ . Finally, we may replace each $\zeta_{e_{k}}^{\bar{z}+i\eta_{0}}$ by $\zeta_{e_{k}}^{z+i\eta_{0}}$ and $\zeta_{e_{k}^{\prime}}^{z-i\eta_{0}}$ by $\zeta_{e_{k}^{\prime}}^{\bar{z}-i\eta_{0}}$ , modulo an asymptotic error $C_{k,D,A}\eta_{0}^{-6}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}\eta^{4}$ .

Proof.

We start with ${\alpha}_{\eta_{0}}^{z}(x_{0},x_{1}){K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})$ . Denote $e=(x_{0},x_{1})$ and $\zeta_{e}^{\gamma}=\zeta_{x_{1}}^{\gamma}(x_{0})$ . We note that

[TABLE]

where we used (2.11) in the first inequality and the resolvent identity in the second one. Similarly, $K^{z+i\eta_{0}}(x_{0};x_{k})\overline{K^{z+i\eta_{0}}(x_{0};y_{k})}$ is the same as ${K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})$ , but with each $z-i\eta_{0}$ replaced by $\bar{z}-i\eta_{0}$ . It follows that $|{K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})-K^{z+i\eta_{0}}(x_{0};x_{k})\overline{K^{z+i\eta_{0}}(x_{0};y_{k})}|\leq 2\sup|\partial_{z}K(v_{0};v_{k})|\sup|K(v_{0};v_{k})|\cdot|z-\bar{z}|\leq 4{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}\eta^{4}$ . Hence, ${\alpha}_{\eta_{0}}^{z}(x_{0},x_{1}){K}^{z}_{\eta_{0}}(x_{0};x_{k},y_{k})$ is the same as $|\alpha_{z+i\eta_{0}}(x_{0},x_{1})|^{2}K^{z+i\eta_{0}}(x_{0};x_{k})\overline{K^{z+i\eta_{0}}(x_{0};y_{k})}$ , modulo $C_{D,A}\eta_{0}^{-4}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}\eta^{4}$ . This error is further multiplied by the function $\chi$ . Bounding the $\zeta$ terms by some $c_{D,A}\eta_{0}^{-2}$ and $|\chi(z)|$ by $e^{\eta^{5}}$ , we end up with an error term at most

[TABLE]

and a similar upper bound for each term involving ${\tilde{g}}^{\lambda\pm i\eta^{4}}$ . Since $I_{\eta}=(a-2\eta,b+2\eta)\subset I_{1}$ , we may use Remark A.5 to deduce that the integrand is uniformly bounded over $\lambda\in I_{\eta}$ by $C_{k,D,A}\eta_{0}^{-6}{\left|\kern-1.07639pt\left|\kern-1.07639pt\left|K\right|\kern-1.07639pt\right|\kern-1.07639pt\right|}_{\eta_{0}}^{2}e^{\eta^{5}}\eta^{4}$ as $N\to\infty$ . Note that $|I_{\eta}|\leq|I_{0}|=2(D+A)$ .

This proves the first claim. The second claim is similar, for example $|{\alpha}^{\bar{z}}_{\eta_{0}}(x_{0},x_{1})-|\alpha^{z+i\eta_{0}}(x_{0},x_{1})|^{2}|\leq C_{D,A}\eta_{0}^{-2}|\zeta_{e}^{z+i\eta_{0}}-\zeta_{e}^{\bar{z}+i\eta_{0}}|\leq 2C_{D,A}\eta_{0}^{-4}\eta^{4}$ . Moreover, $K^{\bar{z}}_{\eta_{0}}(x_{0};x_{k},y_{k})$ is the same as $K^{z+i\eta_{0}}(x_{0};x_{k})\overline{K^{z+i\eta_{0}}(x_{0};y_{k})}$ with each $z+i\eta_{0}$ replaced by $\bar{z}+i\eta_{0}$ , so the proof carries on. For the third claim, note that $|\chi(\lambda\pm i\eta^{4})-\chi(\lambda)|\leq\sup_{z\in\mathscr{R}}|\frac{\partial\chi}{\partial x_{2}}(z)|\cdot\eta^{4}\leq Ce^{\eta^{5}}\eta$ . For the last claim, $|(\zeta_{e}^{z\pm i\eta_{0}})^{-1}-(\zeta_{e}^{\bar{z}\pm i\eta_{0}})^{-1}|\leq 2C_{D,A}\eta_{0}^{-4}\eta^{4}$ as we previously saw when analyzing ${\alpha}_{\eta_{0}}^{z}$ , so we get a similar error. ∎

By virtue of Lemma 4.3 and 4.4, denoting $z=\lambda+i\eta^{4}$ , we know at this stage that modulo some error terms, the expression (4) may be replaced by

[TABLE]

We now make the expression more homogeneous as follows:

Lemma 4.5.

Assume we have made all the replacements in Lemma 4.4. If we finally replace each of the four $\operatorname{Im}{\tilde{g}}^{z}(\tilde{x},\tilde{y})$ by $\operatorname{Im}{\tilde{g}}^{z+i\eta_{0}}(\tilde{x},\tilde{y})$ in (4.7), then the error term vanishes as $N\to\infty$ , followed by $\eta\downarrow 0$ , followed by $\eta_{0}\downarrow 0$ .

Proof.

We only analyze the first error term, the other three are similar.

Choose $p,q,r$ such that $\frac{1}{p}+\frac{1}{q}+\frac{1}{r}=1$ , and use the Hölder’s inequality,

[TABLE]

Here $\int=\int_{a-2\eta}^{b+2\eta}$ . The first sum is bounded by $D^{k-1}\sum_{(x_{0};x_{k})\in B_{k}}|K^{z+i\eta_{0}}(x_{0};x_{k})|^{2p}$ . Assumption (Hol) on $K$ implies that

[TABLE]

Next, by Remark A.3,

[TABLE]

and the RHS is uniformly bounded in $\eta,\eta_{0}\in(0,1)$ by Remark A.4. Remember the convention that objects wearing a hat $\hat{\cdot}$ are defined on the limit $(\mathcal{T},\mathcal{W})$ , by similar formulas to those on $G_{N}$ . We also refer to §2.2 for notation related to Green functions.

Finally, again by Remark A.3 we have

[TABLE]

We check that the RHS vanishes as $\eta,\eta_{0}\downarrow 0$ . Let $X_{\eta}^{\eta_{0}}=\operatorname{Im}\mathcal{G}^{\lambda+i(\eta^{4}+\eta_{0})}(v_{k},w_{k})-\operatorname{Im}\mathcal{G}^{\lambda+i\eta^{4}}(v_{k},w_{k})$ , $X^{\eta_{0}}=\operatorname{Im}\mathcal{G}^{\lambda+i\eta_{0}}(v_{k},w_{k})-\operatorname{Im}\mathcal{G}^{\lambda+i0}(v_{k},w_{k})$ and $Y_{\eta}^{\eta_{0}}=X_{\eta}^{\eta_{0}}-X^{\eta_{0}}$ . Denote $\sum_{v_{k},w_{k}}=\sum_{(v_{0};v_{k}),(w_{0};w_{k}),v_{0}=w_{0}=o}$ . For any $M>0$ , we have $\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{r}=\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{r}1_{|Y_{\eta}^{\eta_{0}}|\leq M}+\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{r}1_{|Y_{\eta}^{\eta_{0}}|>M}$ .

By Proposition 2.4, $\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{r}\to 0$ for Lebesgue-a.e. $\lambda\in\mathbb{R}$ and $\mathbb{P}$ -a.e. $[\mathcal{T},o,\mathcal{W}]\in\mathscr{T}_{\ast}^{D,A}$ as $\eta\downarrow 0$ . So the first term tends to [math] by dominated convergence. For the second, for any $s>r$ , $\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{r}1_{|Y_{\eta}^{\eta_{0}}|>M}\leq\frac{1}{M^{s-r}}\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{s}\leq\frac{C_{s}}{M^{s-r}}$ by (Green). This vanishes as $M\to\infty$ . Thus, $\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|Y_{\eta}^{\eta_{0}}|^{r}\to 0$ as $\eta\downarrow 0$ . Similarly, $\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|X^{\eta_{0}}|^{r}\to 0$ as $\eta_{0}\downarrow 0$ . Since $|X_{\eta}^{\eta_{0}}|^{r}\leq 2^{r-1}(|Y_{\eta}^{\eta_{0}}|^{r}+|X^{\eta_{0}}|^{r})$ , it follows that $\int\operatorname{\mathbb{E}}\sum_{v_{k},w_{k}}|X_{\eta}^{\eta_{0}}|^{r}\to 0$ as $\eta\downarrow 0$ followed by $\eta_{0}\downarrow 0$ . ∎

By virtue of Lemma 4.5, denoting $\Psi_{\gamma,v}(w)=\operatorname{Im}{\tilde{g}}^{\gamma}(v,w)$ , the term in parentheses (4.7) may be replaced by

[TABLE]

Recall that $e_{k}=(x_{k-1},x_{k})$ , $e_{k}^{\prime}=(y_{k-1},y_{k})$ and that there are non-backtracking paths $(x_{0},x_{1},\dots,x_{k-1},x_{k})$ and $(x_{0},x_{1},\dots,y_{k-1},y_{k})$ . Moreover, $\rho_{G}(x_{0})\geq d_{R,\eta,k}\geq k$ .

Suppose $e_{k}^{\prime}\neq e_{k}$ . Then there is a path $(v_{0},\dots,v_{s})$ with $v_{0}=\tilde{x}_{k}$ , $v_{1}=\tilde{x}_{k-1}$ , $v_{s-1}=\tilde{y}_{k-1}$ and $v_{s}=\tilde{y}_{k}$ . Taking the complex conjugate in identity (2.13), noting that $\Psi_{z+i\eta_{0},v}(w)$ is real, we see that (4.8) is zero. If $e_{k}=e^{\prime}_{k}$ , (2.12) tells us (4.8) equals $\frac{|\operatorname{Im}\zeta_{x_{k-1}}^{z+i\eta_{0}}(x_{k})|}{|\zeta_{x_{k-1}}^{z+i\eta_{0}}(x_{k})|^{2}}$ .

Since $\rho_{G}(x_{0})\geq k$ in Lemma 4.3, the paths $(x_{0},x_{1},x_{2},\cdots,x_{k})$ and $(x_{0},x_{1},y_{2},\cdots,y_{k})$ are determined by $e_{k}$ and $e^{\prime}_{k}$ , respectively. So the terms in the sum are only nonzero if $(x_{0},x_{1},x_{2},\cdots,x_{k})=(x_{0},x_{1},y_{2},\cdots,y_{k})$ . Hence, if we make all replacements in Lemmas 4.4 and 4.5, modulo the errors appearing in these lemmas, the expression (4) finally takes the form

[TABLE]

where we used that $\chi(\lambda)\leq 1$ on $\mathbb{R}$ . Collecting all estimates on the error terms, taking $N\to\infty$ , then $\eta\downarrow 0$ , then $\eta_{0}\downarrow 0$ , then $R\to\infty$ , we finally get $\frac{1}{N}\sum_{j=1}^{N}\chi(\lambda_{j})\|\alpha_{\gamma_{j}}K_{B}^{\gamma_{j}}f_{j}\|^{2}\lesssim\frac{1}{\pi}\int_{a-2\eta}^{b+\eta}\|K^{z+i\eta_{0}}\|_{z+i\eta_{0}}^{2}\,\mathrm{d}\lambda$ . Recalling (4.5), if we prove (4.3), then this will complete the proof of Theorem 4.1.

We have $\|\overline{\alpha_{\gamma_{j}}}^{-1}f_{j}^{\ast}\|^{2}=\sum_{(x_{0},x_{1})\in B}\frac{1}{|\operatorname{Im}\zeta_{x_{1}}^{\gamma_{j}}(x_{0})|}|\psi_{j}(x_{0})-\zeta_{x_{1}}^{\gamma_{j}}(x_{0})\psi_{j}(x_{1})|^{2}$ . Repeating the same arguments, we see that modulo asymptotically vanishing error terms, we have

[TABLE]

The term in square brackets is just $|\operatorname{Im}\zeta_{x_{1}}^{z+i\eta_{0}}(x_{0})|$ by (2.12). Hence, using $\chi(\lambda)\leq 1$ we get $\frac{1}{N}\sum_{\lambda_{j}\in I}\|\overline{\alpha_{\gamma_{j}}}^{-1}f_{j}^{\ast}\|^{2}\lesssim\frac{3(|I|+4\eta)D}{\pi}$ for any small $\eta>0$ , and (4.3) follows.

5. Step 2 : Invariance property of the quantum variance

In the scheme of §1.6, we are now in Step 2 : using the functional equations (3.1) and (3.2) satisfied by $f_{j},f_{j}^{*}$ , we show that there are certain transformations $\mathcal{R}_{n,r}^{\gamma}:\mathscr{H}_{k}=\mathbb{C}^{B_{k}}\to\mathscr{H}_{n+k}=\mathbb{C}^{B_{n+k}}$ that leave the quantum variance (3.3) unchanged.

Recall from Section 3 that $\mathcal{B}(\zeta^{\gamma_{j}}f_{j})=f_{j}-i\eta_{0}\,\tau_{+}\psi_{j}$ and $\mathcal{B}^{\ast}(\iota\zeta^{\gamma_{j}}f_{j}^{\ast})=f_{j}^{\ast}-i\eta_{0}\,\tau_{-}\psi_{j}$ if $\gamma_{j}=\lambda_{j}+i\eta_{0}$ . So

[TABLE]

Iterating $r$ times,

[TABLE]

Similarly

[TABLE]

If we define for $r\leq n$ and $\gamma\in\mathbb{C}\setminus\mathbb{R}$ the operator $\mathcal{R}_{n,r}^{\gamma}:\mathscr{H}_{k}\to\mathscr{H}_{n+k}$ by

[TABLE]

we thus get

[TABLE]

where the $\mathcal{E}$ stands for an “error term” that should vanish as $\eta_{0}\downarrow 0$ :

[TABLE]

Since this holds for each $1\leq r\leq n$ and $K=K^{\gamma}$ , we get by the triangle inequality

[TABLE]

We first show that the latter term may be neglected.

Lemma 5.1.

Suppose $K^{\gamma}\in\mathscr{H}_{k}$ satisfies assumptions (Hol) and let $\bar{I}\subseteq I_{1}$ . Then for all $n\in\mathbb{N}$ ,

[TABLE]

Proof.

We have $\big{(}\frac{1}{N}\sum_{\lambda_{j}\in I}|\frac{1}{n}\sum_{r=1}^{n}\mathcal{E}_{n,r,j}|\big{)}^{2}\leq\frac{1}{n}\sum_{r=1}^{n}\big{(}\frac{1}{N}\sum_{\lambda_{j}\in I}|\mathcal{E}_{n,r,j}|\big{)}^{2}$ . Now, letting as above $\gamma_{j}=\lambda_{j}+i\eta_{0}$ ,

[TABLE]

where $c_{n,r}=n+r(n-r)$ . So it suffices to show that $\limsup_{N}\big{(}\frac{1}{N}\sum_{\lambda_{j}\in I}|\langle\cdot,\cdot\rangle|\big{)}^{2}$ is uniformly bounded in $\eta_{0}$ for each $t,t^{\prime}$ . For the first term, we have

[TABLE]

The first sum is uniformly bounded as $\eta_{0}\downarrow 0$ , by (4.3). Next, by (2.3), we have

[TABLE]

Arguing as in Section 4, applying Lemmas 4.2 to 4.4, we get for $z=\lambda+i\eta^{4}$ ,

[TABLE]

Using Hölder’s inequality as in Lemma 4.5, we see that as $N\to\infty$ , this quantity is uniformly bounded in $\eta,\eta_{0}$ by (Hol) and (Green). One bounds $\frac{1}{N}\sum_{\lambda_{j}}\|K_{B}^{\gamma_{j}}f_{j}\|^{2}$ similarly. Finally,

[TABLE]

which is asymptotically bounded using Hölder’s inequality again as in Lemma 4.5. ∎

Using the invariance law (5.1), Theorem 4.1 with $\tilde{K}^{\gamma}=\frac{1}{n}\sum_{r=1}^{n}\mathcal{R}_{n,r}^{\gamma}K^{\gamma}$ , and Lemma 5.1, we deduce the following statement :

Proposition 5.2.

Under the assumptions of Theorem 4.1,

[TABLE]

6. Step 3 : A stationary Markov chain appears

Denoting $\gamma=\lambda+i(\eta^{4}+\eta_{0})$ in Proposition 5.2, we are now concerned with estimating

[TABLE]

Suppose $r\geq r^{\prime}$ , so that $n-r\leq n-r^{\prime}$ . Then

[TABLE]

Letting ${\eta}_{1}=\operatorname{Im}\gamma$ , (2.9) tells us that $\sum_{x_{0}\in\mathcal{N}_{x_{1}}\setminus\{x_{2}\}}|\operatorname{Im}\zeta_{x_{1}}^{\gamma}(x_{0})|=\frac{|\operatorname{Im}\zeta_{x_{2}}^{\gamma}(x_{1})|}{|\zeta_{x_{2}}^{\gamma}(x_{1})|^{2}}-\eta_{1}$ . Similarly, we have $\sum_{x_{n+k}\in\mathcal{N}_{x_{n+k-1}}\setminus\{x_{n+k-2}\}}|\operatorname{Im}\zeta_{x_{n+k-1}}^{\gamma}(x_{n+k})|=\frac{|\operatorname{Im}\zeta_{x_{n+k-2}}^{\gamma}(x_{n+k-1})|}{|\zeta_{x_{n+k-2}}^{\gamma}(x_{n+k-1})|^{2}}-\eta_{1}$ . By iteration, this induces some simplifications :

[TABLE]

with the error term

[TABLE]

The expression is slightly nicer if we replace $K$ by $Z_{\gamma}K$ defined by

[TABLE]

If $\gamma\mapsto K^{\gamma}$ satisfies (Hol) then so does $\gamma\mapsto Z_{\gamma}K^{\gamma}$ . Using (2.7), we get in that case

[TABLE]

where $u_{x}^{\gamma}(y)$ is the complex number of modulus $1$ given by

[TABLE]

Let us define a positive measure $\mu^{\gamma}_{k}$ on the set $B_{k}$ of non-backtracking paths of length $k$ , by putting

[TABLE]

Let us also introduce the operator

[TABLE]

Then, using (2.7) again, we see that (6.4) takes the nicer form

[TABLE]

where we let $(m^{\gamma}K)(x;y)=m^{\gamma}_{x}K(x;y)$ . Let us also define

[TABLE]

Such operators would be called “transfer operators” in ergodic theory, or “transition matrices” in the theory of Markov chains. Note that $\mathcal{S}_{\gamma}$ has non-negative coefficients and that $\mathcal{S}_{u^{\gamma}}$ just differs from $\mathcal{S}_{\gamma}$ by the “phases” $\overline{u^{\gamma}_{x_{0}}(x_{-1})}$ . The effect of adding a phase to a stochastic operator is a much studied subject in the theory of Markov chains, or more generally in ergodic theory (see Wielandt’s theorem [36, Chapter 8], or in the context of hyperbolic dynamical systems [37, Chapter 4]).

The matrix elements of $\mathcal{S}_{\gamma}$ are given by

[TABLE]

if $\omega=(x_{0};x_{k})$ , $\omega^{\prime}=(x_{-1};x_{k-1})$ and $\omega^{\prime}\rightsquigarrow\omega$ , and $\mathcal{S}_{\gamma}(\omega,\omega^{\prime})=0$ otherwise. Recall from §2.1 that if $\omega=(x_{0};x_{k})$ , we write $\omega^{\prime}\rightsquigarrow\omega$ if $\omega^{\prime}=(x_{-1},x_{0},\dots,x_{k-1})$ for some $x_{-1}\in\mathcal{N}_{x_{0}}\setminus\{x_{1}\}$ .

Note that $\mathcal{S}_{\gamma}$ is substochastic : $\sum_{\omega^{\prime}\in B_{k}}\mathcal{S}_{\gamma}(\omega,\omega^{\prime})\leq 1$ for any $\omega\in B_{k}$ , by (2.9). More precisely, if $\omega=(x_{0};x_{k})$ and $\eta_{1}=\operatorname{Im}\gamma>0$ , then

[TABLE]

Taking the adjoint in $\ell^{2}(\mu_{k}^{\gamma})$ , a direct calculation gives

[TABLE]

The adjoint $\mathcal{S}_{\gamma}^{\ast}$ is also substochastic, with

[TABLE]

Remark 6.1.

By (2.9), for any $(x_{0};x_{k-1})\in B_{k-1}$ , we have

[TABLE]

and for any $(x_{1};x_{k})\in B_{k-1}$ ,

[TABLE]

In (6.1) we take $\gamma=\lambda+i(\eta^{4}+\eta_{0})$ (c.f. Proposition 5.2), and thus $\eta_{1}=\operatorname{Im}\gamma=\eta^{4}+\eta_{0}$ . In the limiting case $\eta_{1}=0$ , (6.13) and (6.14) turn into equalities. Equation (6.13) is then the Kolmogorov compatibility condition : it tells us that the family of measures $(\mu_{k}^{\gamma})$ may be extended to a positive measure (actually, a Markov measure) on the set $B_{\infty}$ of infinite non-backtracking paths. Equality in condition (6.14) means that this Markov chain is stationary. This stationarity is the property that makes the measures $\mu_{k}^{\gamma}$ nice, and this is the reason for introducing (somewhat artificially) the weight $\frac{\operatorname{Im}\zeta_{x}^{\gamma}(y)}{|\zeta_{x}^{\gamma}(y)|^{2}}$ in (4.1).

This family of stationary Markov chains (indexed by $\gamma$ ) is in some sense the “classical dynamical system” that we were seeking in §1.6.

Since $\eta_{1}=\eta^{4}+\eta_{0}$ is non-zero (but small), we do not actually have exact equality in (6.13) and (6.14). This causes some error terms that we need to control as $\eta,\eta_{0}\longrightarrow 0$ .

7. Spectral gap and mixing

In this section, we convert the expanding assumption (EXP) into an estimate on the rate of mixing of the “Markov chains” $(\mu_{k}^{\gamma})$ defined in (6.6). Every transitive Markov chain is mixing, but here we need estimates that are uniform both as $N\longrightarrow+\infty$ and as $\gamma$ approaches the real axis.

A technical difficulty is that the measures $(\mu_{k}^{\gamma})$ are not a priori bounded from above, and the transition probabilities are not bounded from below as $\gamma$ approaches the real axis. Peaks of $(\mu_{k}^{\gamma})$ , as well as small transition probabilities, tend to “disconnect” the graph and are bad for mixing. So we will need to show that there are few peaks and few small transitions (Proposition 7.6).

Let

[TABLE]

be the normalized measure. We denote by $\ell^{2}(\nu_{k}^{\gamma})$ the set $\ell^{2}(B_{k})$ endowed with the scalar product $\langle f,g\rangle_{\nu_{k}^{\gamma}}=\sum_{\omega\in B_{k}}\nu_{k}^{\gamma}(\omega)\overline{f(\omega)}g(\omega)$ .

We anticipate the calculations of Section 10, where we will need to consider the non-backtracking quantum variance of operators $K_{\gamma}$ of the form $K_{\gamma}=\mathcal{F}_{\gamma}K$ where $K$ is independent of $\gamma$ , and $\mathcal{F}_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{k}$ is a $\gamma$ -dependent operator for some $1\leq k\leq m+1$ , having the form $\mathcal{F}_{\gamma}=\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}$ , $\mathcal{T}^{\gamma}$ , $\mathcal{O}_{1}^{\gamma}$ , $\mathcal{U}^{\gamma}_{j}$ , $\mathcal{O}_{j}^{\gamma}$ , $\mathcal{P}_{j}^{\gamma}$ , $j\geq 2$ , or a polynomial combination thereof. See (10.3, 10.4, 10.6, 10.8, 10.9, 10.10) for the definitions. In the case $\mathcal{F}_{\gamma}=\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}$ , the operator depends on an additional parameter $T\in\mathbb{N}^{\ast}$ , that has to be taken arbitrarily large in Corollary 10.3.

Comparing with (6.8), this means that we will need to deal with $\langle\mathcal{S}_{u^{\gamma}}^{{r-r^{\prime}}}K^{\gamma},K^{\gamma}\rangle_{\mu^{\gamma}_{k}}$ where now $K^{\gamma}=B_{\gamma}K$ , $K$ is $\gamma$ -independent, and $B_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{k}$ is defined by

[TABLE]

For simplicity, the calculations below are written for $k=1$ . This suffices for our purposes, as we shall see in Section 9. Like in the statement of Theorem 1.3, we will always assume that the $\gamma$ -independent operator $K$ satisfies $\lVert K\rVert_{\infty}:=\sup_{x,y\in V}|K(x,y)|\leq 1$ .

The main results of this section are the two following propositions, that estimate the norm of the transfer operator $\mathcal{S}_{\gamma}$ (6.10) on proper subspaces. We call $F$ the space of functions $f$ on $B$ such that $f(e)$ “depends only on the terminus”, that is, $f(e)=f(e^{\prime})$ if $t_{e}=t_{e^{\prime}}$ . The first proposition estimates the norm of $\mathcal{S}_{\gamma}$ on the orthogonal of $F$ , and the second one estimates the norm of $\mathcal{S}_{\gamma}^{2}$ on the orthogonal of constant functions.

We denote by $\ell^{2}(B_{1},U)$ the set $\ell^{2}(B_{1})$ endowed with the scalar product $\langle f,g\rangle_{U}=\frac{1}{N}\sum_{e\in B_{1}}\overline{f(e)}g(e)$ . Let $P_{F,U}$ be the orthogonal projector on $F$ in $\ell^{2}(B_{1},U)$ :

[TABLE]

We use as a “reference operator” the transfer operator $\mathcal{S}$ defined by

[TABLE]

where $q(x)=d(x)-1$ . Both $\mathcal{S}$ and $\mathcal{S}^{*}$ are stochastic, if the adjoint of $\mathcal{S}$ is taken in $\ell^{2}(B_{1},U)$ . The influence of the spectral gap assumption (EXP) on the spectrum of $\mathcal{S}$ is studied in [8] and we will use these results below.

We denote $\mathcal{Q}=\mathcal{S}^{\ast}\mathcal{S}$ and $\mathcal{Q}_{2}=\mathcal{S}^{2\,\ast}\mathcal{S}^{2}$ . Note that $\mathcal{Q}(e,e^{\prime})=0$ unless there exists $e^{\prime\prime}$ such that $e\rightsquigarrow e^{\prime\prime}$ and $e^{\prime}\rightsquigarrow e^{\prime\prime}$ . In this case, we say that $[e,e^{\prime}]$ is a pair; $[e,e^{\prime}]$ form a pair iff they share the same terminus. The set of pairs is denoted by $P(B_{1})$ .

Proposition 7.1.

Let $B_{\gamma}K\in\mathscr{H}_{1}$ . Let $w=P_{F^{\bot},\nu}B_{\gamma}K$ be the orthogonal projection of $B_{\gamma}K$ on $F^{\perp}$ in $\ell^{2}(\nu_{1}^{\gamma})$ . Then for any $M>0$ we have

[TABLE]

where

[TABLE]

The sets $Bad(M)$ of bad edges and $Badp(M)$ of bad pairs of edges will be defined in the course of the proof. They correspond to the aforementioned peaks of $\mu_{1}^{\gamma}$ and problems of small transition probabilities. If there were no bad edges and bad pairs, Proposition 7.1 would be a genuine spectral gap estimate.

Proposition 7.2.

Let $B_{\gamma}K\in\mathscr{H}_{1}$ . Let $f=P_{\mathbf{1}^{\bot},\nu}B_{\gamma}K$ be the orthogonal projection of $B_{\gamma}K$ on $\mathbf{1}^{\bot}$ in $\ell^{2}(\nu_{1}^{\gamma})$ . Then for any $M>0$ we have

[TABLE]

where $c(D,\beta)>0$ is explicit and depends only on $D$ (upper bound on the degree) and the spectral gap $\beta$ of (EXP), and

[TABLE]

where $P_{\mathbf{1},U}$ is the orthogonal projector on $\mathbf{1}$ in $\ell^{2}(B_{1},U)$ .

Here, $Badp(2,M)$ is another set of bad edge-couples defined in the proof.

The quantities $C_{N,M}(B_{\gamma}),C_{N,M,2}(B_{\gamma})$ are estimated in Proposition 7.7.

Proof of Proposition 7.1.

Let $\mathcal{Q}^{\gamma}=\mathcal{S}_{\gamma}^{\ast}\mathcal{S}_{\gamma}$ (where now the adjoint is considered in $\ell^{2}(\nu_{1}^{\gamma})$ ). The operator $\mathcal{Q}^{\gamma}$ being self-adjoint on $\ell^{2}(\nu_{1}^{\gamma})$ is equivalent to the relation

[TABLE]

for all $e,e^{\prime}\in B_{1}$ . Note that $\mathcal{Q}^{\gamma}(e,e^{\prime})=0$ unless $[e,e^{\prime}]$ is a pair.

Define $D^{\gamma}(e)=\sum_{e^{\prime}}\mathcal{Q}^{\gamma}(e,e^{\prime})\leq 1$ and $\mathcal{M}^{\gamma}(e,e^{\prime})=D^{\gamma}(e)\delta_{e=e^{\prime}}-\mathcal{Q}^{\gamma}(e,e^{\prime})$ .

Then using (7.4), we have the Dirichlet identity

[TABLE]

We observe that for any $K\in\ell^{2}(\nu_{1}^{\gamma})$ ,

[TABLE]

Indeed, denoting $\langle\cdot,\cdot\rangle_{\nu}:=\langle\cdot,\cdot\rangle_{\nu_{1}^{\gamma}}$ , we have $\|\mathcal{S}_{\gamma}K\|_{\nu}^{2}=\langle K,\mathcal{Q}^{\gamma}K\rangle_{\nu}$ and $\langle K,\mathcal{M}^{\gamma}K\rangle_{\nu}\geq 0$ by Dirichlet, so $\|K\|_{\nu}^{2}\geq\langle K,D^{\gamma}K\rangle_{\nu}\geq\langle K,\mathcal{Q}^{\gamma}K\rangle_{\nu}$ as claimed.

Remark 7.3.

The Dirichlet identity shows that

[TABLE]

Remark 7.4.

If $J\perp F$ in $\ell^{2}(B_{1},U)$ , then $\langle J,(I-\mathcal{Q})J\rangle_{U}\geq\frac{3}{4}\,\|J\|_{U}^{2}$ .

Indeed, $\langle\tau_{+}\delta_{y},J\rangle_{U}=0$ for all $y\in V$ , so $\sum_{x\sim y}J(x,y)=0$ for all $y\in V$ and thus $(\mathcal{Q}J)(x_{0},x_{1})=(\mathcal{S}^{\ast}\mathcal{S}J)(x_{0},x_{1})=\frac{J(x_{0},x_{1})}{q(x_{1})^{2}}$ (recall that $q(x)=d(x)-1$ where $d(x)$ is the degree of $x$ ). As $\min q(x)\geq 2$ , we get $\|\mathcal{Q}J\|_{U}\leq\frac{1}{4}\,\|J\|_{U}$ and the claim follows.

Fix a large $M>0$ . We call $e\in B_{1}$ bad if $\nu_{1}^{\gamma}(e)>\frac{M}{N}$ . We call a pair $[e,e^{\prime}]\in P(B_{1})$ bad if $\nu_{1}^{\gamma}(e)\mathcal{Q}^{\gamma}(e,e^{\prime})<\frac{M^{-1}}{N}$ . We call $Bad(M)$ and $Badp(M)$ the sets of bad $e$ and $[e,e^{\prime}]$ , respectively.

To prove Proposition 7.1, we first note that by (7.5), and letting $K_{\gamma}=B_{\gamma}K$ ,

[TABLE]

where we used $\mathcal{Q}(e,e^{\prime})\leq 1$ . By Remark 7.4,

[TABLE]

Now

[TABLE]

We used that $\|K_{\gamma}-P_{F,U}K_{\gamma}\|_{\nu}^{2}\geq\|w\|_{\nu}^{2}$ since $w=P_{F^{\bot},\nu}(K_{\gamma}-P_{F,U}K_{\gamma})$ . The result is obtained by putting together (7.7) and (7.8). ∎

Proof of Proposition 7.2.

We now let $\mathcal{Q}^{\gamma}_{2}=\mathcal{S}_{\gamma}^{2\,\ast}\mathcal{S}_{\gamma}^{2}$ (where the adjoint is taken in $\ell^{2}(\nu_{1}^{\gamma})$ ). Then $\mathcal{Q}^{\gamma}_{2}(e,e^{\prime})\neq 0$ iff there exists $e^{\prime\prime},e_{1},e^{\prime}_{1}$ such that $e\rightsquigarrow e_{1}\rightsquigarrow e^{\prime\prime}$ and $e^{\prime}\rightsquigarrow e_{1}^{\prime}\rightsquigarrow e^{\prime\prime}$ . We denote the set of such couples $[e,e^{\prime}]$ by $P_{2}(B_{1})$ and let $\mathcal{M}_{2}^{\gamma}(e,e^{\prime})=D_{2}\delta_{e=e^{\prime}}-\mathcal{Q}_{2}(e,e^{\prime})$ , where $D_{2}(e)=\sum_{e^{\prime}}\mathcal{Q}_{2}^{\gamma}(e,e^{\prime})\leq 1$ .

Fix $M>0$ . We say that $[e,e^{\prime}]\in P_{2}(B_{1})$ is bad if $\nu_{1}^{\gamma}(e)\mathcal{Q}_{2}(e,e^{\prime})<\frac{M^{-1}}{N}$ . We call $Badp(2,M)$ the set of bad couples in $P_{2}(B_{1})$ .

The proof is then similar to Proposition 7.1, replacing the space $F$ by the space of constant functions and using [8, Theorem 1.1] instead of Remark 7.4. In particular, the quantity $c(\beta,D)$ is the one appearing in [8, Theorem 1.1]. ∎

Later on, we will need to iterate the result of Proposition 7.2, considering $\mathcal{S}_{\gamma}^{2r}$ instead of $\mathcal{S}_{\gamma}^{2}$ . Since $\mathcal{S}_{\gamma}^{*}$ is not exactly stochastic, $\mathcal{S}_{\gamma}$ does not preserve the orthogonal of constants. Still, we can iterate (6.12) to get $\mathcal{S}_{\gamma}^{\ast\,l}\mathbf{1}=1-\eta_{1}\sum_{s=0}^{l-1}\mathcal{S}_{\gamma}^{\ast\,s}\xi^{\gamma}$ , where $\xi^{\gamma}(x_{0},x_{1})=\frac{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|\operatorname{Im}\zeta_{x_{0}}^{\gamma}(x_{1})|}$ . Hence, for any $K$ we have $\langle\mathbf{1},\mathcal{S}_{\gamma}^{l}K\rangle_{\nu}=\langle\mathbf{1},K\rangle_{\nu}-\eta_{1}\langle\sum_{s=0}^{l-1}\mathcal{S}_{\gamma}^{\ast\,s}\xi^{\gamma},K\rangle_{\nu}$ . Denoting

[TABLE]

we see that if $K\perp\mathbf{1}$ , then $(\mathcal{S}_{\gamma}^{2l}K+\eta_{1}\mathcal{Z}_{l}K)\perp\mathbf{1}$ .

Proposition 7.5.

Let $K\in\mathscr{H}_{m}$ . Let $f=P_{\mathbf{1}^{\perp},\nu}B_{\gamma}K$ be the orthogonal projection of $B_{\gamma}K$ on $\mathbf{1}^{\bot}$ in $\ell^{2}(\nu_{1}^{\gamma})$ . Then for any $M>0$ we have

[TABLE]

where $C_{N,M,l,2}(B_{\gamma})=C_{N,M,2}((\mathcal{S}_{\gamma}^{2l}+\eta_{1}\mathcal{Z}_{l})P_{\mathbf{1}^{\perp},\nu}B_{\gamma})$ .

Proof.

The proof is by induction on $r$ . This holds for $r=1$ by Proposition 7.2. Assume the result holds for $r$ . If $f\perp\mathbf{1}$ , we have just seen that $(\mathcal{S}_{\gamma}^{2r}+\eta_{1}\mathcal{Z}_{r})f\perp\mathbf{1}$ in $\ell^{2}(\nu_{1}^{\gamma})$ . So using Proposition 7.2 and (7.6),

[TABLE]

Since $\|(\mathcal{S}_{\gamma}^{2r}+\eta_{1}\mathcal{Z}_{r})f\|\leq\|\mathcal{S}_{\gamma}^{2r}f\|+\eta_{1}\|\mathcal{Z}_{r}f\|$ , the claim follows. ∎

The rest of this section is devoted to estimating the “bad” quantities.

Proposition 7.6.

Under assumptions (BSCT) and (Green), for any $s\geq 1$ , there exists $C_{s}$ such that for all $M>1$ we have

[TABLE]

Proof.

We have $\nu_{1}^{\gamma}(Bad)=\nu_{1}^{\gamma}\{e:\nu_{1}^{\gamma}(e)>\frac{M}{N}\}$ , so

[TABLE]

Recalling the definition of $\mu_{1}^{\gamma}$ (6.6), and using Remark A.3, we get

[TABLE]

uniformly in $\operatorname{Re}\gamma\in I_{1}$ , for any fixed $\operatorname{Im}\gamma=\eta_{1}$ . By Remark A.4, this is bounded by some constant $C_{s}$ . The second assertion is proved similarly. ∎

Proposition 7.7.

For all $t\in\mathbb{N}$ ,

[TABLE]

where $(P_{F,U}\nu_{1}^{\gamma})(e)=\frac{1}{d(t_{e})}\sum_{t_{e^{\prime}}=t_{e}}\nu_{1}^{\gamma}(e^{\prime})$ , and

[TABLE]

Similar estimates hold if $B_{\gamma}$ is replaced by $P_{{\mathbf{1}}^{\perp},\nu}B_{\gamma}$ , where $P_{{\mathbf{1}}^{\perp},\nu}$ is the projection on the orthogonal of constants in $\ell^{2}(\nu_{1}^{\gamma})$ .

We first deduce the following corollary. Recall that the operators $\mathcal{F}_{\gamma}$ from Corollary 10.3 depend on a parameter $T\in\mathbb{N}^{\ast}$ , and $B_{\gamma}=m^{\gamma}Z_{\gamma}^{-1}\mathcal{F}_{\gamma}$ . In this section, $T$ is fixed, but will be taken to $+\infty$ in Section 10.

Corollary 7.8.

For any $s>0$ , there exists $C_{s,T}$ such that, for all $M$ ,

[TABLE]

and

[TABLE]

Similar estimates hold if $B_{\gamma}$ is replaced by $P_{{\mathbf{1}}^{\perp},\nu}B_{\gamma}$ .

Proof of Corollary 7.8.

This will follow from Propositions 7.6 and 7.7 if we show that

[TABLE]

( $\alpha=4,6,8$ ) and

[TABLE]

For (7.10), we have by Remark A.3 that

[TABLE]

uniformly in $\operatorname{Re}\gamma\in I_{1}$ , for any fixed $\operatorname{Im}\gamma=\eta_{1}$ . So the claim follows Remark A.4.

For (7.12), we have $\frac{1}{d(t_{e})^{2}}(\sum_{t_{e^{\prime}}=t_{e}}\nu_{1}^{\gamma}(e^{\prime}))^{2}\leq\sum_{t_{e^{\prime}}=t_{e}}\nu_{1}^{\gamma}(e^{\prime})^{2}$ , so we deduce the upper bound $D(\sum_{e}\frac{1}{\nu_{1}^{\gamma}(e)^{2}})^{1/2}(\sum_{e}\nu_{1}^{\gamma}(e)^{4})^{1/2}$ , which is uniformly bounded by

[TABLE]

Finally, for (7.11), we write

[TABLE]

The first two terms are bounded by $\frac{1}{\operatorname{\mathbb{E}}(\sum_{o^{\prime}\sim o}\hat{\mu}_{1}^{\gamma}(o,o^{\prime}))}(\operatorname{\mathbb{E}}\sum_{o^{\prime}\sim o}\frac{\hat{\mu}_{1}^{\gamma}(o,o^{\prime})^{2}(\hat{m}_{o}^{\gamma})^{2\alpha}}{\hat{\zeta}^{\gamma}_{o}(o^{\prime})^{2\alpha}})^{1/2}$ and the last term is shown to be uniformly bounded in Remark 10.4. This completes the proof. ∎

Proof of Proposition 7.7.

An important point here is to obtain a bound that does not depend on $t$ . Recalling (7.3), we first estimate

[TABLE]

where $n(e)=\sum_{e^{\prime}:[e,e^{\prime}]\in Badp(M)}\mathcal{Q}(e,e^{\prime})$ . Using Hölder, this is less than

[TABLE]

But again by Hölder and the fact that $\mathcal{Q}$ is stochastic, we have

[TABLE]

Next, recalling (6.7), (6.9), we have $|\mathcal{S}_{u^{\gamma}}^{t}B_{\gamma}K(e)|\leq(\mathcal{S}_{\gamma}^{t}|B_{\gamma}K|)(e)$ . As $\mathcal{S}_{\gamma}^{t}$ and $\mathcal{S}_{\gamma}^{\ast\,t}$ are substochastic, and $\nu_{1}^{\gamma}(e)\mathcal{S}_{\gamma}^{t}(e,e^{\prime})=\nu_{1}^{\gamma}(e^{\prime})\mathcal{S}_{\gamma}^{\ast\,t}(e^{\prime},e)$ , we have

[TABLE]

Collecting the estimates, we showed that (7.13) is bounded by

[TABLE]

For the second term in (7.3), we have

[TABLE]

and again, as $\mathcal{S}_{\gamma}^{t}$ and $\mathcal{S}_{\gamma}^{\ast\,t}$ are substochastic,

[TABLE]

Also,

[TABLE]

Using that $P_{F,U}$ is stochastic and $\mathcal{S}_{\gamma}^{t}$ is substochastic, we have

[TABLE]

This yields the first inequality. The second one is proven similarly. ∎

Remark 7.9.

Note that if $\|K\|_{\infty}\leq 1$ , then

[TABLE]

so $\sup_{\eta_{1}>0}\limsup_{N\to\infty}\sup_{\operatorname{Re}\gamma\in I_{1},\operatorname{Im}\gamma=\eta_{1}}\|B_{\gamma}K\|_{\nu_{1}^{\gamma}}^{2}\leq C_{T}$ by the proof in Corollary 7.8, see also Remark 10.4.

For a quantity $A(N,\gamma,\kappa)$ depending on $N,\gamma$ (and possibly on an additional parameter $\kappa$ ), we will write $A(N,\gamma,\kappa)=O_{\kappa}(1)_{N\longrightarrow+\infty,\gamma}$ to mean that, for any given $\kappa$ ,

[TABLE]

For instance, if $\|K\|_{\infty}\leq 1$ , then $\|B_{\gamma}K\|_{\nu_{1}^{\gamma}}^{2}=O_{T}(1)_{N\longrightarrow+\infty,\gamma}$ . This is true more generally for $\|B_{\gamma}K\|_{\nu_{k}^{\gamma}}^{2}$ , with $B_{\gamma}=\frac{m^{\gamma}}{Z_{\gamma}}\mathcal{F}_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{k}$ , and $\mathcal{F}_{\gamma}$ as in Corollary 10.3.

Similarly, for the operator $\mathcal{Z}_{l}$ appearing in Proposition 7.5, the arguments in Corollary 7.8 and Remark 10.4 show that $\|\mathcal{Z}_{l}f\|_{\nu_{1}^{\gamma}}=O_{l,T}(1)_{N\longrightarrow+\infty,\gamma}$ .

Finally, by Corollary 7.8, $\sup_{t}C_{N,M,2}(\mathcal{S}_{u^{\gamma}}^{t}B_{\gamma})$ is uniformly bounded by $C_{s,T}M^{-s}$ for any $M$ and $s$ , as $N\to+\infty$ . We use the notation $O_{T}(M^{-\infty})_{N\longrightarrow+\infty,\gamma}$ to express this.

8. Transition matrices with phases

We now consider the operator $\mathcal{S}_{u^{\gamma}}$ given in (6.7). If we denote by $M_{u^{\gamma}}$ the multiplication operator $(M_{u^{\gamma}}K)(x_{0};x_{k})=\overline{u_{x_{1}}^{\gamma}(x_{0})}K(x_{0};x_{k})$ , where ${u_{x_{1}}^{\gamma}(x_{0})}$ is the function of modulus $1$ defined in (6.5), then $\mathcal{S}_{u^{\gamma}}=\mathcal{S}_{\gamma}M_{u^{\gamma}}$ .

It is well known that putting phases into a matrix with positive entries will strictly diminish its spectral radius, unless the phases satisfy very special relations : this is the contents of Wielandt’s theorem [36, Chapter 8]. This is reflected in Proposition 8.1 below. Without the error term, part (i) says that the norm of $\mathcal{S}_{u^{\gamma}}^{4}$ is strictly smaller than one, in contrast to $\mathcal{S}_{\gamma}^{4}$ (the latter only contracts the norm on proper subspaces, see Section 7). The contraction property of $\mathcal{S}_{u^{\gamma}}^{4}$ holds true except in special cases, described in part (ii) of Proposition 8.1.

Note that we are not using Wielandt’s theorem directly, as we want some information on the norm of the operator $\mathcal{S}_{u^{\gamma}}^{4}$ instead of its spectral radius. In addition, as in Section 7, we need estimates that are uniform both as $N\to\infty$ and as $\gamma$ approaches the real axis.

Recall from Section 7 that $B_{\gamma}$ is an operator $\mathscr{H}_{m}\to\mathscr{H}_{k}$ with $1\leq k\leq m$ . As in Section 7, the case $k=1$ suffices for our purposes, but we need more general operators $A_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{1}$ defined in terms of $B_{\gamma}$ . The quantities $C_{N,M}(A_{\gamma}),C_{N,M,2}(A_{\gamma})$ were introduced in Propositions 7.1 and 7.2. In particular, $C_{N,M,2}(I)$ corresponds to the case where $A_{\gamma}$ is the identity operator. The measure $\nu_{1}^{\gamma}$ is defined in (6.6) and (7.1).

Proposition 8.1.

Fix $\gamma\in\mathbb{C}^{+}$ , $A_{\gamma}K\in\mathscr{H}_{1}$ , $\varepsilon\in(0,1)$ , $M>0$ and a graph $G=G_{N}$ . Denote $\eta_{1}=\operatorname{Im}\gamma$ . Then

(i)

Either we have

[TABLE]

with

[TABLE] 2. (ii)

or there exist $\theta:V\to\mathbb{R}$ and constants $s_{j}$ with $|s_{j}|\leq 1$ , $j=1,2$ , such that

[TABLE]

and

[TABLE]

where $\xi^{\gamma}(x_{0},x_{1})=\frac{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|\operatorname{Im}\zeta_{x_{0}}^{\gamma}(x_{1})|}$ , $n_{x}^{\gamma}=(\overline{m_{x}^{\gamma}})(m_{x}^{\gamma})^{-1}$ and $C^{\prime}_{N,M}=\frac{8M^{2}C_{N,M,2}(I)}{c(D,\beta)}$ .

Moreover, there is an explicit $f(\beta,D)$ , depending only on the spectral gap $\beta$ and on the degree, such that $c_{M,\beta}\leq f(\beta,D)M^{3}$ as $M\to+\infty$ .

In particular, in case (ii),

[TABLE]

Proof.

(a) We start with some preliminary inequalities. Denote $\langle\cdot,\cdot\rangle_{\nu}=\langle\cdot,\cdot\rangle_{\nu_{1}^{\gamma}}$ .

Recall that we denote by $F$ the space of functions on $B$ which depend only on the terminus.

Let $\delta_{1}=\frac{3}{4}M^{-2}$ , $K_{\gamma}=A_{\gamma}K$ and let $w=P_{F^{\bot}}K_{\gamma}$ be the orthogonal projection of $K_{\gamma}$ on $F^{\bot}$ in $\ell^{2}(\nu_{1}^{\gamma})$ . By the proof of Proposition 7.1,

[TABLE]

By Remark 7.3 and the fact that $\mathcal{M}^{\gamma\ast}=\mathcal{M}^{\gamma}$ , we have

[TABLE]

So if $f=P_{F}K_{\gamma}=K_{\gamma}-w\in F$ is the projection of $K_{\gamma}$ on $F$ , we have

[TABLE]

Similarly, if $\delta_{2}=M^{-2}c(D,\beta)$ and $C\,\mathbf{1}=P_{\mathbf{1}}|K_{\gamma}|$ is the projection of $|K_{\gamma}|$ on $\mathbf{1}$ , then using Proposition 7.2, we get

[TABLE]

Now

[TABLE]

and

[TABLE]

(this is true even if $f$ vanishes, if we give an arbitrary value of modulus $1$ to $\frac{f}{|f|}$ in this case). Also,

[TABLE]

and

[TABLE]

Finally, $\left\|\,|K_{\gamma}|-\|K_{\gamma}\|_{\nu}\,\mathbf{1}\right\|_{\nu}\leq\left\|\,|K_{\gamma}|-C\,\mathbf{1}\right\|_{\nu}+\left|\,\|K_{\gamma}\|_{\nu}-C\,\right|\leq 2\,\left\|\,|K_{\gamma}|-C\,\mathbf{1}\right\|_{\nu}$ . Putting all these inequalities together, we obtain

[TABLE]

Comparing with (8.3) and (8.4), this says the following : if $\|\mathcal{S}_{\gamma}^{2}\,|K_{\gamma}|\,\|_{\nu}$ is close to $\|K_{\gamma}\|_{\nu}$ and if $\|\mathcal{S}_{\gamma}K_{\gamma}\|_{\nu}$ is close to $\|K_{\gamma}\|_{\nu}$ , then $K_{\gamma}$ must be close to $\|K_{\gamma}\|_{\nu}\frac{f}{|f|}$ , where $f$ is a function that depends only on the terminus.

Repeating the arguments of (8.3) with $M_{u^{\gamma}}\mathcal{S}_{u^{\gamma}}K_{\gamma}$ instead of $K_{\gamma}$ , then taking $\tilde{f}=P_{F}M_{u^{\gamma}}\mathcal{S}_{u^{\gamma}}K_{\gamma}\in F$ , we get

[TABLE]

Similarly to (8.4), if $\tilde{C}\,\mathbf{1}=P_{\mathbf{1}}|\mathcal{S}_{u^{\gamma}}K_{\gamma}|$ , we get

[TABLE]

Finally, arguing as in (8.5), we have

[TABLE]

(b) We can now start the proof itself. Suppose (i) is not true :

[TABLE]

Using $\|\mathcal{S}_{u^{\gamma}}^{4}K_{\gamma}\|_{\nu}\leq\|\mathcal{S}_{u^{\gamma}}K_{\gamma}\|_{\nu}=\|\mathcal{S}_{\gamma}M_{u^{\gamma}}K_{\gamma}\|_{\nu}$ , $\|\mathcal{S}_{u^{\gamma}}^{4}K_{\gamma}\|_{\nu}\leq\|\mathcal{S}_{\gamma}^{2}\,|K_{\gamma}|\|_{\nu}=\|\mathcal{S}_{\gamma}^{2}\,|M_{u^{\gamma}}K_{\gamma}|\|_{\nu}$ , $\|\mathcal{S}_{u^{\gamma}}^{4}K_{\gamma}\|_{\nu}\leq\|\mathcal{S}_{\gamma}^{2}\,|\mathcal{S}_{u^{\gamma}}K_{\gamma}|\|_{\nu}$ and $\|K_{\gamma}\|_{\nu}\geq\|\mathcal{S}_{u^{\gamma}}K_{\gamma}\|_{\nu}$ , we see that we must also have

[TABLE]

as well as

[TABLE]

Applying (8.3), (8.4) and (8.5) to $M_{u^{\gamma}}K_{\gamma}$ instead of $K_{\gamma}$ , and $f=P_{F}M_{u^{\gamma}}K_{\gamma}$ , it follows that

[TABLE]

Applying (8.6), (8.7) and (8.8) yields

[TABLE]

As $f,\tilde{f}\in F$ , we have $\frac{f}{|f|}(x_{0},x_{1})=e^{i\theta(x_{1})}$ and $\frac{\tilde{f}}{|\tilde{f}|}(x_{0},x_{1})=e^{i\theta^{\prime}(x_{1})}$ for some $\theta,\theta^{\prime}:V\to\mathbb{R}$ . Note that in this case, $(\mathcal{S}_{\gamma}\frac{f}{|f|})(x_{0},x_{1})=e^{i\theta(x_{0})}-\eta_{1}\xi^{\gamma}(x_{1},x_{0})e^{i\theta(x_{0})}$ , where $\xi^{\gamma}(x_{0},x_{1})=\frac{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|\operatorname{Im}\zeta_{x_{0}}^{\gamma}(x_{1})|}$ , using (6.11). Applying $\mathcal{S}_{\gamma}$ to (8.9), we thus get

[TABLE]

Applying $M_{u^{\gamma}}$ and comparing with (8.10), it follows that

[TABLE]

Repeating the procedure with $K_{\gamma}$ replaced by $\mathcal{S}_{u^{\gamma}}K_{\gamma}$ , and $f$ replaced by $\tilde{f}$ , the same arguments show that there exists $\theta^{\prime\prime}:V\to\mathbb{R}$ such that

[TABLE]

Hence we have proved that $u_{x_{1}}^{\gamma}(x_{0})$ is close to both $e^{i(\theta(x_{0})-\theta^{\prime}(x_{1}))}$ and $e^{i(\theta^{\prime}(x_{0})-\theta^{\prime\prime}(x_{1}))}$ .

(c) Because of relation (2.7), the function $u$ satisfies $u_{x_{1}}^{\gamma}(x_{0})=u_{x_{0}}^{\gamma}(x_{1})\frac{n_{x_{1}}^{\gamma}}{n_{x_{0}}^{\gamma}}$ , where $n_{x}^{\gamma}=(\overline{m_{x}^{\gamma}})(m_{x}^{\gamma})^{-1}$ .

To conclude the proof, we show : if $e^{i(\theta(x_{0})-\theta^{\prime}(x_{1}))}$ and $e^{i(\theta^{\prime}(x_{0})-\theta^{\prime\prime}(x_{1}))}$ are close to $u^{\gamma}$ , and if the function $u_{x_{1}}^{\gamma}(x_{0})$ satisfies the relation above, then this gives constraints on $\theta,\theta^{\prime},\theta^{\prime\prime}$ that imply part (ii) of the proposition.

Let $g(x_{0},x_{1})=e^{i(\theta(x_{0})-\theta^{\prime}(x_{1}))}$ and ${\mathbf{c}}=(112\delta_{1}^{-1}+112\delta_{2}^{-1}+6)$ . We have shown in (b) that $\|u_{x_{1}}^{\gamma}(x_{0})-g\|_{\nu}^{2}\leq{\mathbf{c}}\varepsilon+4\eta_{1}^{2}\|\xi^{\gamma}\|_{\nu}^{2}$ . Recall that we denote by $\iota$ the involution of edge reversal. Hence, if we define $\tilde{g}(x_{0},x_{1})=g(x_{1},x_{0})\frac{n_{x_{1}}^{\gamma}}{n_{x_{0}}^{\gamma}}$ , we get

[TABLE]

Thus, $\|\tilde{g}-g\|_{\nu}^{2}\leq 4{\mathbf{c}}\varepsilon+16\eta_{1}^{2}\,\|\xi^{\gamma}\|_{\nu}^{2}$ . Hence, defining

[TABLE]

we get

[TABLE]

Note that the functions $h_{1},h_{2}$ have modulus $1$ , and $\mathcal{S}_{\gamma}h_{1}=h_{2}-\eta_{1}\iota\xi^{\gamma}h_{2}$ , so

[TABLE]

Consider $P_{\mathbf{1},\nu}h_{1}=s\,\mathbf{1}$ , the projection of $h_{1}$ to the space of constant functions. Arguing as in (8.4), we can write $\|h_{1}-s\,\mathbf{1}\|_{\nu}^{2}\leq\delta_{2}^{-1}(\|h_{1}\|_{\nu}^{2}-\|\mathcal{S}_{\gamma}^{2}h_{1}\|_{\nu}^{2}+4C_{N,M,2}(I))$ . But $\|h_{1}\|^{2}-\|\mathcal{S}_{\gamma}^{2}h_{1}\|^{2}=(\|h_{1}\|+\|\mathcal{S}_{\gamma}^{2}h_{1}\|)(\|h_{1}\|-\|\mathcal{S}_{\gamma}^{2}h_{1}\|)\leq 2\,\|\mathcal{S}_{\gamma}^{2}h_{1}-h_{1}\|$ . Hence,

[TABLE]

We observe that $\|h_{1}-s\,\mathbf{1}\|=\|n_{x_{1}}^{\gamma}e^{i(\theta(x_{1})+\theta^{\prime}(x_{1}))}-s\,\mathbf{1}\|=\|\tilde{g}n_{x_{0}}^{\gamma}e^{i(\theta^{\prime}(x_{0})+\theta^{\prime}(x_{1}))}-s\,\mathbf{1}\|=\|\tilde{g}-\frac{e^{-i(\theta^{\prime}(x_{0})+\theta^{\prime}(x_{1}))}}{n_{x_{0}}^{\gamma}}s\|$ . Thus, comparing with (8.13),

[TABLE]

This is the first half of (ii) with

[TABLE]

Remembering that $\delta_{1}=\frac{3}{4}M^{-2}$ , $\delta_{2}=M^{-2}c(D,\beta)$ and ${\mathbf{c}}=(112\delta_{1}^{-1}+112\delta_{2}^{-1}+6)$ , we see that there is an explicit $f(\beta,D)$ such that $c_{M,\beta}\leq f(\beta,D)M^{3}$ as $M\to+\infty$ . Note that $|s|\leq 1$ since $\|h_{1}\|_{\nu}=1$ .

The second half of (ii) is proven similarly, using (8.12) instead of (8.11). Here we take $g^{\prime}(x_{0},x_{1})=e^{i(\theta^{\prime}(x_{0})-\theta^{\prime\prime}(x_{1}))}$ , $h_{1}^{\prime}(x_{0},x_{1})=\frac{1}{n_{x_{1}}^{\gamma}}e^{-i[\theta^{\prime}(x_{1})+\theta^{\prime\prime}(x_{1})]}$ , $s^{\prime}\mathbf{1}=P_{\mathbf{1}}h_{1}^{\prime}$ and $h_{2}^{\prime}(x_{0},x_{1})=\frac{1}{n_{x_{0}}^{\gamma}}e^{-i[\theta^{\prime}(x_{0})+\theta^{\prime\prime}(x_{0})]}$ .

To prove (8.2), we write $\big{\|}u_{x_{1}}^{\gamma}(x_{0})^{2}-ss^{\prime}\frac{n_{x_{1}}^{\gamma}}{n_{x_{0}}^{\gamma}}\big{\|}^{2}\leq 2\,\big{\|}u_{x_{1}}^{\gamma}(x_{0})[u_{x_{1}}^{\gamma}(x_{0})-s\frac{e^{-i\widetilde{\theta}(x_{0},x_{1})}}{n_{x_{0}}^{\gamma}}]\big{\|}^{2}+2\,\big{\|}s\frac{e^{-i\widetilde{\theta}(x_{0},x_{1})}}{n_{x_{0}}^{\gamma}}[u_{x_{1}}^{\gamma}(x_{0})-s^{\prime}e^{i\widetilde{\theta}(x_{0},x_{1})}n_{x_{1}}^{\gamma}]\big{\|}^{2}$ , where we put $\widetilde{\theta}(x_{0},x_{1})=\theta^{\prime}(x_{0})+\theta^{\prime}(x_{1})$ . Since $u_{x_{1}}^{\gamma}(x_{0})^{2}\frac{n_{x_{0}}^{\gamma}}{n_{x_{1}}^{\gamma}}=u_{x_{1}}^{\gamma}(x_{0})u_{x_{0}}^{\gamma}(x_{1})$ , the proof is complete. ∎

9. Step 4 : End of the proof of Theorem 3.3

Our aim is to show that $\lim_{\eta_{0}\downarrow 0}\lim_{N\to+\infty}{\mathrm{Var_{nb,\eta_{0}}^{I}}}(\mathcal{F}_{\gamma}K)=0$ , for the operators $\mathcal{F}_{\gamma}$ that appear in Corollary 10.3. A main step was carried out in Proposition 5.2, and the upper bound was put in a convenient form in (6.8). We now use the estimates of Sections 7 and 8 to complete the proof. We denote $B_{\gamma}=\frac{m^{\gamma}}{Z_{\gamma}}\mathcal{F}_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{k}$ as in Section 7, where $Z_{\gamma}$ is defined in (6.3). It should be kept in mind that $\mathcal{F}_{\gamma}$ may depend on a parameter $T$ that is fixed in this section, but will be taken arbitrarily large in the next one.

Recall that we take $\gamma=\lambda+i(\eta^{4}+\eta_{0})$ , where $\lambda,\eta,\eta_{0}$ come from Proposition 5.2. In other words, $\gamma=\lambda+i\eta_{1}\in\mathbb{C}^{+}$ with $\lambda\in I_{1}$ and $\eta_{1}=\eta^{4}+\eta_{0}$ . Let $K\in\mathscr{H}_{m}$ so that $B_{\gamma}K\in\mathscr{H}_{k}$ . Applying (6.8), recalling that $\nu_{k}^{\gamma}=\frac{1}{\mu_{k}^{\gamma}(B_{k})}\mu_{k}^{\gamma}$ , we obtain

[TABLE]

Fix $M$ very large and take $n=M^{9}$ . We apply Proposition 8.1 with $\varepsilon=M^{-8}$ to the family of operators $\{\mathcal{S}^{4j}_{u^{\gamma}}B_{\gamma}K\}_{j=1}^{M^{9}}$ . Call ${\tilde{\tilde{C}}}_{N,M}(B_{\gamma})=\max_{j=1}^{M^{9}}\tilde{C}_{N,M,2}(\mathcal{S}^{4j+k-1}_{u^{\gamma}}B_{\gamma})^{1/2}\cdot\sqrt{\frac{\mu_{1}^{\gamma}(B)}{\mu_{k}^{\gamma}(B_{k})}}$ . We use the notation in Remark 7.9 throughout the section. In particular, ${\tilde{\tilde{C}}}_{N,M}(B_{\gamma})=O_{T}(M^{-\infty})_{N\longrightarrow+\infty,\gamma}$ thanks to Corollary 7.8.

Remark 9.1.

It is useful to note that the norm $\|\mathcal{S}_{u^{\gamma}}^{j}\|_{\nu_{k}^{\gamma}\to\nu_{k}^{\gamma}}$ for $k>1$ is controlled by the same norm for $k=1$ . To see this, note that for $K\in\ell^{2}(\nu_{k}^{\gamma})$ , we have $(\mathcal{S}_{u^{\gamma}}^{k-1}K)(x_{0};x_{k})=\sum_{(x_{-k+1};x_{-1})_{x_{0,1}}}\Lambda(x_{-k+1};x_{1})K(x_{-k+1};x_{1})$ for some function $\Lambda(x_{-k+1};x_{1})$ . Here the sum is over those $(x_{-k+1};x_{-1})$ for which the path $(x_{-k+1},x_{-k+2},\ldots,x_{-1},x_{0},x_{1})$ does not backtrack, cf. (2.3). So $(\mathcal{S}_{u^{\gamma}}^{k-1}K)(x_{0};x_{k})$ only depends on $(x_{0},x_{1})$ : we may define $\phi_{K}\in\ell^{2}(\nu_{1}^{\gamma})$ by $\phi_{K}(x_{0},x_{1})=(\mathcal{S}_{u^{\gamma}}^{k-1}K)(x_{0};x_{k})$ . If $\mathscr{I}:\ell^{2}(\nu_{1}^{\gamma})\to\ell^{2}(\nu_{k}^{\gamma})$ is the map $(\mathscr{I}\phi)(x_{0};x_{k})=\phi(x_{0},x_{1})$ , we have for any $j\geq k$ , $[\mathcal{S}_{u^{\gamma}}^{j-k+1}\mathscr{I}\phi_{K}](x_{0};x_{k})=(\mathcal{S}_{u^{\gamma}}^{j}K)(x_{0};x_{k})$ . Moreover, $[\mathcal{S}_{u^{\gamma}}\mathscr{I}\phi](x_{0};x_{k})=[\mathscr{I}(\mathcal{S}_{u^{\gamma}}\phi)](x_{0};x_{k})$ . Thus,

[TABLE]

where we used that $\sum_{{}_{x_{0,1}}(x_{2};x_{k})}\mu_{k}(x_{0};x_{k})\leq\mu_{1}(x_{0},x_{1})$ by (6.13). Hence,

[TABLE]

But using (2.9) repeatedly we have

[TABLE]

and $\mu_{1}^{\gamma}(x_{0},x_{1})|\Lambda(x_{-k+1};x_{1})|=\mu_{k}^{\gamma}(x_{-k+1};x_{1})$ by (6.6) and (2.7). Hence,

[TABLE]

So $\|\phi_{K}\|_{\nu_{1}}^{2}\leq\frac{\mu_{k}^{\gamma}(B_{k})}{\mu_{1}^{\gamma}(B)}\|K\|_{\nu_{k}^{\gamma}}^{2}$ . Summarizing, we have shown that for any $j\geq k$ , we have

[TABLE]

First alternative : For $\gamma$ , $\varepsilon$ as above, assume that case (i) of Proposition 8.1 is satisfied for all the operators $\{\mathcal{S}^{4j}_{u^{\gamma}}B_{\gamma}K\}_{j=1}^{M^{9}}$ . Applying (8.1) for $\mathcal{S}^{4t}_{u^{\gamma}}B_{\gamma}K$ , $t\leq j$ , we obtain if $k=1$ ,

[TABLE]

For higher $k$ , we apply (9.2) to $\phi_{B_{\gamma}K}(x_{0},x_{1})=(\mathcal{S}_{u^{\gamma}}^{k-1}B_{\gamma}K)(x_{0};x_{k})=(A_{\gamma}K)(x_{0},x_{1})$ , where $A_{\gamma}=\mathcal{S}_{u^{\gamma}}^{k-1}B_{\gamma}$ , instead of $B_{\gamma}K$ . We get by Remark 9.1,

[TABLE]

Using the euclidean division $r^{\prime}-r-k+1=4m_{r,r^{\prime}}+n_{r,r^{\prime}}$ with $n_{r,r^{\prime}}<4$ , we see that for $r^{\prime}-r\geq 4+k-1$ ,

[TABLE]

where $c_{k}=\frac{1}{(1-\varepsilon)^{(k-1+n_{r,r^{\prime}})/4}}\leq 2^{\frac{k+2}{4}}$ if $\varepsilon\leq\frac{1}{2}$ . Note that $(1-\varepsilon)^{1/4}\leq(1-\frac{\varepsilon}{5})$ . Hence, since $4+k-1\leq 4k$ , we have

[TABLE]

Recall that $\varepsilon=M^{-8}$ and $n=M^{9}$ . Comparing with (9.1), we get

[TABLE]

Second alternative : Now assume case (ii) of Proposition 8.1 is satisfied; with some complex numbers $s_{j}=s_{j}(N)$ and some function $\theta$ . We denote $\|\,\|_{\nu}=\|\,\|_{\ell^{2}(\nu_{k}^{\gamma})}$ , $\theta_{0}(x_{0};x_{k})=\theta(x_{0})$ , $\theta_{1}(x_{0};x_{k})=\theta(x_{1})$ , $n_{0}^{\gamma}(x_{0};x_{k})=n^{\gamma}_{x_{0}}$ and $n_{1}^{\gamma}(x_{0};x_{k})=n^{\gamma}_{x_{1}}$ . Then we have

Proposition 9.2.

Let $\|K\|_{\infty}\leq 1$ . For $A_{\gamma}K=\mathcal{S}_{u^{\gamma}}^{\ell}B_{\gamma}K$ , we have for any $t\in\mathbb{N}^{\ast}$ ,

[TABLE]

Proof.

Recall that $\mathcal{S}_{u^{\gamma}}=\mathcal{S}_{\gamma}M_{u^{\gamma}}$ with $M_{u^{\gamma}}$ the multiplication by $\overline{u_{x_{1}}^{\gamma}(x_{0})}$ . We have

[TABLE]

Using (7.6) and Cauchy-Schwarz, the first term is bounded by

[TABLE]

But $u^{\gamma},s_{2},n_{0}^{\gamma}$ all have modulus bounded by $1$ , so $|\overline{u_{x_{1}}^{\gamma}(x_{0})}-\overline{s_{2}}n_{0}^{\gamma}e^{i[\theta_{0}+\theta_{1}]}|^{4}\leq 4\,|\overline{u_{x_{1}}^{\gamma}(x_{0})}-\overline{s_{2}}n_{0}^{\gamma}e^{i[\theta_{0}+\theta_{1}]}|^{2}$ . Hence, $\|\overline{u_{x_{1}}^{\gamma}(x_{0})}-\overline{s_{2}}n_{0}^{\gamma}e^{i[\theta_{0}+\theta_{1}]}\|_{\ell^{4}(\nu_{1}^{\gamma})}\leq(4c_{M,\beta}\left[\varepsilon^{1/2}+\eta_{1}O(1)_{N\longrightarrow+\infty,\gamma}\right]+4C^{\prime}_{N,M})^{1/4}$ by the first part of (ii). For higher $k$ , using $\sum_{{}_{x_{0,1}}(x_{2};x_{k})}\mu_{k}(x_{0};x_{k})\leq\mu_{1}(x_{0},x_{1})$ by (6.13), we get $\|\overline{u_{x_{1}}^{\gamma}(x_{0})}-\overline{s_{2}}n_{0}^{\gamma}e^{i[\theta_{0}+\theta_{1}]}\|_{\ell^{4}(\nu_{k}^{\gamma})}\leq(\frac{\mu_{1}^{\gamma}(B)}{\mu_{k}^{\gamma}(B_{k})})^{1/4}\|\overline{u_{x_{1}}^{\gamma}(x_{0})}-\overline{s_{2}}n_{0}^{\gamma}e^{i[\theta_{0}+\theta_{1}]}\|_{\ell^{4}(\nu_{1}^{\gamma})}$ .

Next, $\|\mathcal{S}_{\gamma}M_{u^{\gamma}}A_{\gamma}K\|_{\ell^{4}(\nu_{k}^{\gamma})}=\|\mathcal{S}_{u^{\gamma}}^{\ell+1}B_{\gamma}K\|_{\ell^{4}(\nu_{k}^{\gamma})}$ . Arguing as in Proposition 7.7 and Corollary 7.8, we see this is $O_{T}(1)_{N\longrightarrow+\infty,\gamma}$ . Bounding the second term similarly, we get

[TABLE]

Since $\|B_{\gamma}K\|_{\nu}=O_{T}(1)_{N\longrightarrow+\infty,\gamma}$ (see Remark 7.9), this proves the result for $t=1$ .

For higher $t$ , let $X=\overline{s_{1}s_{2}}e^{i\theta_{0}}\mathcal{S}_{\gamma}^{2}e^{-i\theta_{0}}$ and $Y=\mathcal{S}_{u^{\gamma}}^{2}$ . Then $\|(X^{t}-Y^{t})A_{\gamma}K\|=\|\sum_{i=1}^{t}X^{t-i}(X-Y)Y^{i-1}A_{\gamma}K\|\leq\sum_{i=1}^{t}\|(X-Y)Y^{i-1}A_{\gamma}K\|$ . Again, $\|Y^{i-1}A_{\gamma}K\|_{\ell^{4}(\nu_{k}^{\gamma})}=O_{T}(1)_{N\longrightarrow+\infty,\gamma}$ for each $i$ and the claim follows. ∎

In sums like (9.1), we can make packets of size $2t$ , and we have for all $m$ and for any $t$

[TABLE]

As we will see below, the size $2t$ of packets should be chosen so that $t(c_{M,\beta}\varepsilon^{1/2})^{1/4}$ is small as $M$ gets large. Remembering that $c_{M,\beta}\leq f(D,\beta)M^{3}$ and $\varepsilon=M^{-8}$ , we take $t=M^{\alpha}$ with $0<\alpha<1/4$ . We then group the sum (9.1) into packets and write

[TABLE]

where we estimated $|\sum_{r^{\prime}=1}^{n}\sum_{r=2t(\lfloor\frac{n-r^{\prime}}{2t}\rfloor-1)}^{n-r^{\prime}}\langle\mathcal{S}_{u^{\gamma}}^{r}B_{\gamma}K,B_{\gamma}K\rangle_{\nu}|\leq 4nt\lVert B_{\gamma}K\rVert^{2}_{\nu}$ . Note that $\sum_{r=2ta}^{2t(a+1)-1}\langle\mathcal{S}_{u^{\gamma}}^{r}\cdot,\cdot\rangle=\sum_{r=0}^{t-1}\langle\mathcal{S}_{u^{\gamma}}^{2r+2ta}\cdot,\cdot\rangle+\sum_{r=0}^{t-1}\langle\mathcal{S}_{u^{\gamma}}^{2r+1+2ta}\cdot,\cdot\rangle$ . So using (9.4),

[TABLE]

Lemma 9.3.

Let $\|K\|_{\infty}\leq 1$ . For $A_{\gamma}K=\mathcal{S}_{u^{\gamma}}^{2ta}B_{\gamma}K$ or $\mathcal{S}_{u^{\gamma}}^{2ta+1}B_{\gamma}K$ we have for any $L$

[TABLE]

Proof.

First assume $k=1$ . We decompose $e^{-i\theta_{0}}A_{\gamma}K=C\mathbf{1}+f$ where $f\perp\mathbf{1}$ in $\ell^{2}(\nu^{\gamma}_{1})$ . So $\mathcal{S}_{\gamma}^{2r}e^{-i\theta_{0}}A_{\gamma}K=C\mathcal{S}_{\gamma}^{2r}\mathbf{1}+\mathcal{S}_{\gamma}^{2r}f$ .

For the term $\mathcal{S}_{\gamma}^{2r}f$ we use Proposition 7.5, which yields, for any $L$ ,

[TABLE]

By Corollary 7.8 (recalling that $r\leq t\leq M^{\alpha}$ ), we have $\sum_{l=0}^{r-1}C_{N,L,l,2}(e^{-i\theta_{0}}A_{\gamma})^{1/2}=t\,O_{T}(L^{-\infty})_{N\longrightarrow+\infty,\gamma}$ . Indeed, the term $e^{-i\theta_{0}}$ has no impact, as it can be bounded by $1$ in the proof of Proposition 7.7. We also have $\|f\|_{\nu}\leq\|A_{\gamma}K\|_{\nu}\leq\|B_{\gamma}K\|_{\nu}=O_{T}(1)_{N\longrightarrow\infty,\gamma}$ , and $\|\mathcal{Z}_{l}f\|_{\nu}=O_{l,T}(1)_{N\longrightarrow\infty,\gamma}$ by Remark 7.9. Thus,

[TABLE]

For the term $C\mathcal{S}_{\gamma}^{2r}\mathbf{1}$ , we have $\mathcal{S}_{\gamma}^{l}\mathbf{1}=\mathbf{1}-\eta_{1}\sum_{s=0}^{l-1}\mathcal{S}_{\gamma}^{\,s}\iota\xi^{\gamma}=\mathbf{1}+\eta_{1}O_{l}(1)_{N\longrightarrow\infty,\gamma}$ by (6.11). Thus,

[TABLE]

Since $|C|\leq\|A_{\gamma}K\|_{\nu}\leq\|B_{\gamma}K\|_{\nu}$ , this completes the proof for $k=1$ .

For higher $k$ , as in Remark 9.1, we have $\|\mathcal{S}_{\gamma}^{2r}f\|_{\nu_{k}}\leq\sqrt{\frac{\mu_{1}^{\gamma}(B)}{\mu_{k}^{\gamma}(B_{k})}}\|\mathcal{S}_{\gamma}^{2r-k+1}\phi_{f}\|_{\nu_{1}}$ , where now $\phi_{f}(x_{0},x_{1})=(\mathcal{S}_{\gamma}^{k-1}f)(x_{0};x_{k})$ . We then note that $f\perp\mathbf{1}$ in $\ell^{2}(\nu_{k}^{\gamma})$ iff $\phi_{f}\perp\mathbf{1}$ in $\ell^{2}(\nu_{1}^{\gamma})$ . Indeed, $\langle\mathbf{1},\phi_{f}\rangle_{\nu_{1}}=\frac{\mu_{k}^{\gamma}(B_{k})}{\mu_{1}^{\gamma}(B)}\langle\mathbf{1},f\rangle_{\nu_{k}}$ , since $\langle\mathbf{1},\phi_{f}\rangle_{\nu_{1}}=\sum_{(x_{0},x_{1})}\nu_{1}(x_{0},x_{1})(\mathcal{S}_{\gamma}^{k-1}f)(x_{0};x_{k})$ , so applying (6.9), (6.6) and (2.7), the claim follows. Hence, $\|\mathcal{S}_{\gamma}^{2r-k+1}\phi_{f}\|_{\nu_{1}}\lesssim c(1-L^{-2}C)^{r/2}\|\phi_{f}\|_{\nu_{1}}$ , where $c=\frac{1}{(1-L^{-2})^{(k+3)/4}}\leq 2^{k+1}$ for large $L$ . The error terms are the same, this time with $\|\mathcal{Z}_{l}\phi_{f}\|_{\nu_{1}}=O_{l,T}(1)_{N\longrightarrow\infty,\gamma}$ . Finally, $\|\phi_{f}\|_{\nu_{1}}\leq\sqrt{\frac{\mu_{k}^{\gamma}(B_{k})}{\mu_{1}^{\gamma}(B)}}\|f\|_{\nu_{k}}$ . ∎

Starting from (9.5) and applying the lemma, we obtain for $\|K\|_{\infty}\leq 1$ ,

[TABLE]

Remember that $n=M^{9}$ and $t=M^{\alpha}$ with $0<\alpha<1/4$ . For the term $\frac{1}{t}\frac{2L^{2}}{c(D,\beta)}$ to be small, we choose $L=M^{\alpha^{\prime}}$ with $0<2\alpha^{\prime}<\alpha$ . For instance, take $\alpha=3/16$ and $\alpha^{\prime}=1/16$ . For the other terms, we have $t(c_{M,\beta}\varepsilon^{1/2})^{1/4}=O(M^{\alpha-1/4})$ and $n^{-1}t=M^{-9+\alpha}$ . The terms $\eta_{1}O_{M,T}(1)_{N\longrightarrow+\infty,\gamma}$ tend to [math] as $\eta_{1}=\eta_{0}+\eta\longrightarrow 0$ , $M$ and $T$ being fixed. Finally, $\|B_{\gamma}K\|_{\nu}^{2}=O_{T}(1)_{N\longrightarrow+\infty,\gamma}$ assuming $\|K\|_{\infty}\leq 1$ .

We can gather the first and second alternative into one statement :

Proposition 9.4.

Let $A>0$ .

For all $M$ , for all $\gamma$ that fall either into the first alternative or into the second one with $|s^{\gamma}_{1}(N)s^{\gamma}_{2}(N)-1|\geq A$ , we have for $\|K\|_{\infty}\leq 1$ and for $n=M^{9}$ ,

[TABLE]

Proof.

The arguments in the proof of (7.11) readily show that $\frac{1}{n^{2}}\sum_{r,r^{\prime}=1}^{n}\mathbf{E}_{n,r,r^{\prime}}(\eta_{1},F_{\gamma}K)=\eta_{1}\,O_{n,T}(1)_{N\longrightarrow\infty,\gamma}$ . The assertion follows from (9.1), (9.3) and (9.6). ∎

Proposition 9.5.

Let $I\subset I_{1}$ with $\bar{I}\subset I_{1}$ . There exists $a_{0}$ such that, if $a\leq a_{0}$ , $M$ is large enough, $\eta_{1}$ is small enough ( $M\geq M(a)$ , $\eta_{1}\leq\eta(a)$ ), and $N$ is large enough :

For any $\gamma$ falling into the second alternative on $G_{N}$ , the sequence $s^{\gamma}(N)=s^{\gamma}_{1}(N)s^{\gamma}_{2}(N)$ must satisfy $|s^{\gamma}(N)-1|>a^{13}$ , if $\gamma$ stays in a set of the form

[TABLE]

Before proving the proposition, let us finally give the

Proof of Theorem 3.3.

We apply Proposition 5.2 and use Proposition 9.5 to show that we are in the framework of Proposition 9.4.

Two cases may happen. Either $\mathcal{W}(o)$ is deterministic : there exists $E_{0}$ such that $\mathbb{P}(\mathcal{W}(o)=E_{0})=1$ . In that case, we fix a small $a>0$ , let $J_{1}=I\setminus[E_{0}-2a,E_{0}+2a]$ and $J_{2}=I\cap[E_{0}-2a,E_{0}+2a]$ . We then write $\operatorname{Var_{nb,\eta_{0}}^{I}}(\mathcal{F}_{\gamma}K)=\operatorname{Var_{nb,\eta_{0}}^{J_{1}}}(\mathcal{F}_{\gamma}K)+\operatorname{Var_{nb,\eta_{0}}^{J_{2}}}(\mathcal{F}_{\gamma}K)$ . For $\operatorname{Re}\gamma\in J_{1}$ , we have $|\gamma-E_{0}|>2a$ , so $\mathbb{P}(|\mathcal{W}(o)-\gamma|<a)=0$ and Proposition 9.5 applies with $a$ arbitrarily small. Proposition 9.4, applied with $A=a^{13}$ , thus allows to control $\operatorname{Var_{nb,\eta_{0}}^{J_{1}}}(\mathcal{F}_{\gamma}K)$ , while $\operatorname{Var_{nb,\eta_{0}}^{J_{2}}}(\mathcal{F}_{\gamma}K)=O_{T}(a)$ .

If $\mathcal{W}(o)$ is not deterministic, there exists $a$ such that for all $E\in\mathbb{R}$ , $\mathbb{P}(|\mathcal{W}(o)-E|<a)\leq 1-a$ . Thus, for any complex $\gamma$ , $\mathbb{P}(|\mathcal{W}(o)-\gamma|<a)\leq 1-a$ . In this case Proposition 9.5 may be applied with the fixed value $A=a^{13}$ and all $\gamma$ .

Either way, we showed that there exists $a_{0}$ such that, for all $a\leq a_{0}$ , $M\geq M(a)$ , we have for any $s$ and $T$ ,

[TABLE]

Taking $M\to\infty$ followed by $a\downarrow 0$ , this completes the proof of Theorem 3.3. ∎

We conclude the section with the

Proof of Proposition 9.5.

We will use the following consequences of (Green) :

•

There exists $0<c_{0}<\infty$ such that for all $\gamma\in\mathbb{C}^{+}$ , $\operatorname{Re}\gamma\in I_{1}$ ,

[TABLE]

In fact, $\hat{\mu}_{1}^{\gamma}(o,y)=\frac{|\operatorname{Im}\hat{\zeta}^{\gamma}_{y}(o)\operatorname{Im}\hat{\zeta}^{\gamma}_{o}(y)|}{|\hat{m}_{y}^{\gamma}\hat{\zeta}_{o}^{\gamma}(y)|^{2}}$ , so this follows from (Green) and its consequences (A.9) and (A.10).

•

There exists $0<c_{1}<\infty$ , such that for all $\gamma\in\mathbb{C}^{+}$ , $\operatorname{Re}\gamma\in I_{1}$ ,

[TABLE]

In fact, $\operatorname{\mathbb{E}}(|2\operatorname{Im}\hat{m}_{o}|^{-1})+\operatorname{\mathbb{E}}(|2\hat{m}_{o}^{\gamma}|)\leq c_{1}/2$ by (A.9), so the first claim follows by Markov’s inequality. The second one follows similarly from (A.10).

We may now begin the proof. If $\gamma$ falls into the second alternative, then

[TABLE]

Let $a_{0}=(2c_{0})^{-2}(6+3c_{1})^{-12}$ ; this choice will become clear later on. Take $a\leq a_{0}$ . There exist $M(a)$ , $\eta(a)$ and $N(a)$ such that if $M\geq M(a)$ , $\eta_{1}\leq\eta(a)$ and $N\geq N(a)$ , then the RHS side in (9.10) is $\leq a^{26}$ . We fix $\rho\geq a^{26}$ .

So take any $a\leq a_{0}$ , $M\geq M(a)$ , $\eta_{1}\leq\eta(a)$ , and assume towards a contradiction that we can find a subsequence $N_{k}=N_{k}(\eta_{1})\longrightarrow+\infty$ and a sequence $\gamma_{k}\in A_{a,\eta_{1}}$ , each falling into the second alternative on $G_{N_{k}}$ , such that $|s^{\gamma_{k}}(N_{k})-1|^{2}\leq\rho$ . After extracting further subsequences, let $\lim_{N_{k}\to+\infty}s^{\gamma_{k}}(N_{k})=s$ and $\gamma_{0}=\lim_{N_{k}\to+\infty}\gamma_{k}\in\mathbb{C}$ . Then $|s-1|^{2}\leq\rho$ , $\operatorname{Re}\gamma_{0}\in I_{1},\operatorname{Im}\gamma_{0}=\eta_{1}$ , and by (9.10) and Remark A.3

[TABLE]

which implies, using (9.8),

[TABLE]

By the Cauchy-Schwarz inequality,

[TABLE]

and thus, by (9.8),

[TABLE]

Since the value of $\gamma_{0}$ is now fixed, let us omit it from the notation.

Let us write $\hat{\zeta}^{\gamma_{0}}_{o}(y)=\hat{\zeta}_{o}(y)=r(o,y)e^{-i\theta(o,y)}$ with $r\in\mathbb{R}_{+}$ and $\theta\in\mathbb{R}$ . This implies $\hat{u}_{o}(y)=e^{2i\theta(o,y)}$ and $|\hat{u}_{o}(y)\hat{u}_{y}(o)-1|=|(e^{i\theta(y,o)}+e^{-i\theta(o,y)})(e^{i\theta(y,o)}-e^{-i\theta(o,y)})|$ .

Now (9.11) implies that

[TABLE]

Let us call $\epsilon(o,y)$ the value of $\epsilon$ achieving the min. By (2.7) we have

[TABLE]

for all $y\sim o$ . Thus, using (9.8),

[TABLE]

Denote $t_{o,y}=\epsilon(o,y)r(y,o)^{-1}-r(o,y)\in\mathbb{R}$ . It follows by Markov’s inequality that

[TABLE]

with probability $\geq 1-r$ .

The probability that $|2\operatorname{Im}\hat{m}_{o}|\geq 2r$ and $|2\hat{m}_{o}|\leq\frac{1}{2}r^{-1}$ is at least $1-c_{1}r$ by (9.9a). Thus, (9.14) implies that with probability $\geq 1-r-c_{1}r$ , we have for any $y\sim o$

[TABLE]

Combining (9.14) and (9.15), we see that for any $y,y^{\prime}\sim o$ ,

[TABLE]

Now (2.4a) says that

[TABLE]

Using (9.14) and (9.16), we get for any fixed $y^{\prime}\sim o$ ,

[TABLE]

with probability $\geq 1-r-2c_{1}r$ . Here we used that $\sum_{y\sim o}r(o,y)\leq\frac{1}{2}r^{-1}$ with probability $\geq 1-c_{1}r$ , see (9.9b). Since $|\gamma_{0}-\mathcal{W}(o)|\geq a$ with probability $\geq a$ , it follows that

[TABLE]

with probability $\geq 1-r-2c_{1}r-(1-a)$ . Recall that $r(o,y)$ and $t_{o,u}$ are real. Taking the imaginary part in (9.17), we thus get $|\operatorname{Im}e^{-i\theta(o,y^{\prime})}|\leq\frac{2r^{3}+\eta_{1}}{a-2r^{3}}$ . Assume $\eta_{1}\leq r^{3}$ . Then if $r<a/5$ , we get $|\operatorname{Im}e^{-i\theta(o,y^{\prime})}|<r^{2}$ . Hence, $\operatorname{\mathbb{P}}(|\operatorname{Im}e^{-i\theta(o,y^{\prime})}|\geq r^{2})\leq(2c_{1}+1)r+1-a$ . But we know that $|2\operatorname{Im}\hat{m}_{o}|\geq 2r$ , so taking the imaginary part in (9.14) and using (9.15), we also have that $|\operatorname{Im}e^{-i\theta(o,y^{\prime})}|\geq r^{2}$ with probability $\geq 1-r-c_{1}r$ . If $(2+3c_{1})r<a$ , this will give a contradiction.

To prove the proposition, we take $r=\frac{a}{6+3c_{1}}$ and choose $a_{0}\leq(2c_{0})^{-2}(6+3c_{1})^{-12}$ . Recalling that $2c_{0}\rho^{1/4}=r^{6}$ , we get $\rho^{1/2}=(2c_{0})^{-2}(\frac{a}{6+3c_{1}})^{12}\geq a^{13}$ for $a\leq a_{0}$ , as required. We also take $M>M(a)$ , and $\eta_{1}\leq\min(r^{3},\eta(a))$ . ∎

10. Step 5 : Back to the original eigenfunctions

In this section, we show that it suffices to consider the non-backtracking quantum variance in order to prove quantum ergodicity; in other words Theorem 3.3 can be retrieved from Theorem 1.3. This part may be read before or after the others.

Given $K\in\mathscr{H}_{k}$ , we define the quantum variance by

[TABLE]

where $K_{G}$ is as in Section 2.1.

More generally, fix $\eta_{0}>0$ and suppose $K^{\gamma}\in\mathscr{H}_{k}$ satisfies conditions (Hol). We denote

[TABLE]

where the subscript $\eta_{0}$ indicates that inside the variance, $\operatorname{Im}\gamma$ is fixed and equal to $\eta_{0}$ . Denote $\gamma_{j}=\lambda_{j}+i\eta_{0}$ , and define

[TABLE]

so $g_{j}^{\ast}$ and $g_{j}$ are defined like $f_{j}^{\ast}$ and $f_{j}$ (Section 3), respectively, with $\zeta$ replaced by $\overline{\zeta}$ . Put

[TABLE]

Next, given $\gamma\in\mathbb{C}^{+}$ , define the function $N_{\gamma}:V\longrightarrow\mathbb{R}_{+}$ by

[TABLE]

where $\tilde{x}$ is a point in ${\widetilde{G}}$ projecting down to $G=\Gamma\backslash{\widetilde{G}}$ . Recall the Laplacian $P$ defined in (1.1). We next introduce the operators $P_{\gamma},\mathcal{S}_{T,\gamma},\widetilde{\mathcal{S}}_{T,\gamma}:\mathbb{C}^{V}\to\mathbb{C}^{V}$ defined by

[TABLE]

for $T\in\mathbb{N}^{\ast}$ , and the operators $\mathcal{L}^{\gamma},\widetilde{\mathcal{L}}^{\gamma}:\mathbb{C}^{V}\to\mathbb{C}^{B}$ defined by

[TABLE]

Finally, denote $\operatorname{Var_{\eta_{0}}^{I}}(K-\langle K\rangle_{\gamma}):=\operatorname{Var_{\eta_{0}}^{I}}(K-\langle K\rangle_{\gamma}\mathbf{1})$ where $\mathbf{1}\in\mathscr{H}_{0}$ is the constant function equal to $1$ (so that, with the notation of Section 2.1, $\widehat{\mathbf{1}}$ is the identity operator).

Proposition 10.1.

Fix $\eta_{0}>0$ and $T\in\mathbb{N}^{\ast}$ . For any $J\in\mathscr{H}_{0}$ , we have

[TABLE]

Proof.

We have

[TABLE]

We calculate $\langle g_{j}^{\ast},(\widetilde{\mathcal{L}}^{\gamma_{j}}J)_{B}g_{j}\rangle$ similarly. We then note that

[TABLE]

using that $\frac{|\zeta_{x_{1}}^{\gamma}(x_{0})|^{2}}{|m_{x_{1}}^{\gamma}|^{2}}=\frac{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|m_{x_{0}}^{\gamma}|^{2}}$ , by (2.7). Hence,

[TABLE]

Let $\alpha_{x_{0}}^{x_{1}}=\frac{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|2m_{x_{0}}^{\gamma}|^{2}N_{\gamma}(x_{1})}$ , and note that $\alpha_{x_{1}}^{x_{0}}=\frac{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|2m_{x_{0}}^{\gamma}|^{2}N_{\gamma}(x_{0})}$ by (2.7). Then

[TABLE]

and

[TABLE]

Hence,

[TABLE]

Now $\operatorname{Im}\zeta_{x_{0}}^{\gamma}(x_{1})+\operatorname{Im}\zeta_{x_{1}}^{\gamma}(x_{0})\cdot|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}=|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}\big{[}\frac{\operatorname{Im}\zeta_{x_{0}}^{\gamma}(x_{1})}{|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}+\operatorname{Im}\zeta_{x_{1}}^{\gamma}(x_{0})\big{]}=-2\operatorname{Im}m_{x_{1}}^{\gamma}\cdot|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}$ by (2.7). Since $2\operatorname{Im}m_{x_{1}}^{\gamma}=N_{\gamma}(x_{1})|2m_{x_{1}}^{\gamma}|^{2}$ , we get $\frac{\operatorname{Im}\zeta_{x_{0}}^{\gamma}(x_{1})+\operatorname{Im}\zeta_{x_{1}}^{\gamma}(x_{0})|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2}}{|\zeta_{x_{0}}^{\gamma}(x_{1})\zeta_{x_{1}}^{\gamma}(x_{0})|^{2}}=\frac{-N_{\gamma}(x_{1})|2m_{x_{1}}^{\gamma}|^{2}}{|\zeta_{x_{1}}^{\gamma_{j}}(x_{0})|^{2}}$ . Since $\alpha_{x_{0}}^{x_{1}}=\frac{|\zeta_{x_{1}}^{\gamma}(x_{0})|^{2}}{N_{\gamma}(x_{1})|2m_{x_{1}}^{\gamma}|^{2}}$ and $\alpha_{x_{1}}^{x_{0}}=\frac{|\zeta_{x_{1}}^{\gamma}(x_{0})|^{2}}{N_{\gamma}(x_{0})|2m_{x_{1}}^{\gamma}|^{2}}$ by (2.7), we thus have

[TABLE]

Hence,

[TABLE]

Now note that $P_{\gamma}(\mathcal{S}_{T,\gamma}K)=\frac{1}{T}\sum_{s=1}^{T}(T-s+1)P_{\gamma}^{s}K=\mathcal{S}_{T,\gamma}K-K+\widetilde{\mathcal{S}}_{T,\gamma}K$ . Hence,

[TABLE]

for any $K\in\mathscr{H}_{0}$ . Taking $K_{\gamma}=J-\langle J\rangle_{\gamma}$ , we thus get

[TABLE]

We now consider $K\in\mathscr{H}_{m}$ for $m>0$ . Define $\mathcal{T}^{\gamma}:\mathscr{H}_{1}\to\mathscr{H}_{1}$ and $\mathcal{O}^{\gamma}_{1}:\mathscr{H}_{1}\to\mathscr{H}_{0}$ by

[TABLE]

For $m\geq 2$ , define $\mathcal{U}^{\gamma}_{m}:\mathscr{H}_{m}\to\mathscr{H}_{m}$ , $\mathcal{O}_{m}^{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{m-1}$ and $\mathcal{P}_{m}^{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{m-2}$ by

[TABLE]

Proposition 10.2.

Fix $\eta_{0}>0$ . Suppose $\overline{\psi_{j}(x_{0})}\psi_{j}(x_{1})\in\mathbb{R}$ for any $j=1,\dots,N$ and $(x_{0},x_{1})\in B$ . Then for any $K\in\mathscr{H}_{1}$ , we have

[TABLE]

and for any $K\in\mathscr{H}_{m}$ , $m\geq 2$ , we have

[TABLE]

Proof.

Let $K\in\mathscr{H}_{1}$ . Since $\overline{\psi_{j}(x_{0})}\psi_{j}(x_{1})\in\mathbb{R}$ for all $(x_{0},x_{1})$ , we have

[TABLE]

By definition of $\mathcal{T}^{\gamma}$ and $\mathcal{O}_{1}^{\gamma}$ , this implies

[TABLE]

and thus

[TABLE]

Recall the definition of $\langle K\rangle_{\gamma}$ in (1.5). We claim that

[TABLE]

Indeed, we have $\langle K\rangle_{\gamma}=\sum_{(x_{0},x_{1})\in B}K(x_{0},x_{1})\Phi_{\gamma}(x_{0},x_{1})$ . On the other hand,

[TABLE]

But $\frac{\Phi_{\gamma}(x_{1},x_{1})}{\zeta_{x_{0}}^{\gamma}(x_{1})}+\frac{\Phi_{\gamma}(x_{0},x_{0})}{\overline{\zeta_{x_{1}}^{\gamma}(x_{0})}}=\frac{1+\overline{\zeta_{x_{1}}^{\gamma}(x_{0})}\zeta_{x_{0}}^{\gamma}(x_{1})}{\zeta_{x_{0}}^{\gamma}(x_{1})\overline{\zeta_{x_{1}}^{\gamma}(x_{0})}}\Phi_{\gamma}(x_{0},x_{1})$ by (2.13) and the fact that $\Psi_{\gamma,x}(y)=\Psi_{\gamma,y}(x)$ by (2.8), so that $\Phi_{\gamma}(x,y)=\Phi_{\gamma}(y,x)$ . Hence,

[TABLE]

This proves the proposition for $m=1$ . Now let $m\geq 2$ . It is easily checked that

[TABLE]

and thus

[TABLE]

We now note that

[TABLE]

Indeed, we have

[TABLE]

so (10.13) follows from (2.13). Using (10.12), this completes the proof. ∎

We introduce one last operator $\mathcal{X}_{\gamma}:\mathscr{H}_{0}\to\mathscr{H}_{0}$ given by

[TABLE]

The following corollary then holds assuming all eigenfunctions $\psi_{j}$ are real. Note that this assumption is not needed in the special case $m=0$ , corresponding to Theorem 1.1.

Corollary 10.3.

Suppose we have shown that $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\operatorname{Var_{nb,\eta_{0}}^{I}}(\mathcal{F}_{\gamma}K)=0$ , $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\widetilde{\operatorname{Var_{nb,\eta_{0}}^{I}}}(\widetilde{\mathcal{F}}_{\gamma}K)=0$ for any $\mathcal{F}_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{k}$ that is a polynomial combination of $\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}$ , $\mathcal{X}_{\gamma}$ , $\mathcal{U}^{\gamma}_{j}$ , $\mathcal{T}^{\gamma}$ , $\mathcal{O}_{j}^{\gamma}$ and $\mathcal{P}_{j}^{\gamma}$ ( $T$ fixed), $\widetilde{\mathcal{F}}_{\gamma}$ the same combination with $\mathcal{L}^{\gamma}$ replaced by $\widetilde{\mathcal{L}}^{\gamma}$ , and that

[TABLE]

where $C_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{0}$ is any polynomial combination of $\mathcal{U}_{j}^{\gamma}$ , $\mathcal{T}^{\gamma}$ , $\mathcal{O}_{j}^{\gamma}$ and $\mathcal{P}_{j}^{\gamma}$ .

Then it will follow that $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\operatorname{Var_{\eta_{0}}^{I}}(K-\langle K\rangle_{\gamma})=0$ for any $K\in\mathscr{H}_{m}$ . In other words, Theorem 1.3 will follow.

Proof.

The case $m=0$ holds by Proposition 10.1 and the triangle inequality $\operatorname{Var_{nb,\eta_{0}}^{I}}(K-\langle K\rangle_{\gamma})\leq\operatorname{Var_{nb,\eta_{0}}^{I}}(K)+\operatorname{Var_{nb,\eta_{0}}^{I}}(\mathcal{X}_{\gamma}K)$ . Here, $\mathcal{F}_{\gamma}$ has the form $\mathcal{L}^{\gamma}d^{-1}S_{T,\gamma}$ , $\mathcal{L}^{\gamma}d^{-1}S_{T,\gamma}\mathcal{X}_{\gamma}$ and $C_{\gamma}=I$ .

The result for higher $m$ follows by induction using Proposition 10.2. For example, for $m=2$ , the conclusion is obtained by taking $\mathcal{F}_{\gamma}$ of the form $\mathcal{U}^{\gamma}_{2}$ , $\mathcal{T}^{\gamma}\mathcal{O}_{2}^{\gamma}$ , $\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}\mathcal{O}_{1}^{\gamma}\mathcal{O}_{2}^{\gamma}$ , $\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}\mathcal{X}_{\gamma}\mathcal{O}_{1}^{\gamma}\mathcal{O}_{2}^{\gamma}$ , $\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}\mathcal{P}_{2}^{\gamma}$ , $\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}\mathcal{X}_{\gamma}\mathcal{P}_{2}^{\gamma}$ , and $C_{\gamma}$ of the form $\mathcal{O}_{1}^{\gamma}\mathcal{O}_{2}^{\gamma}$ and $\mathcal{P}_{2}^{\gamma}$ . ∎

Remark 10.4.

All the operators in Corollary 10.3 satisfy the assumptions (Hol) from Definition 3.2. Indeed, the first two points of (Hol) are clear (the derivative of any Green function such as $\zeta^{z}$ or $G^{z}$ may be assessed for example using the resolvent equation, yielding $|\partial_{z}\zeta^{z}|\leq(\operatorname{Im}z)^{-2}$ ).

For the third point, we should estimate $\frac{1}{N}\sum_{\omega\in B_{k}}|\mathcal{F}_{\gamma}K(\omega)|^{s}$ . Assume first that $\mathcal{X}_{\gamma}$ is not contained in $\mathcal{F}_{\gamma}$ . Then assuming $\|K\|_{\infty}\leq 1$ , we write

[TABLE]

Now $\mathcal{F}_{\gamma}=A^{(1)}\cdots A^{(\ell)}$ is a composition of operators $A^{(r)}$ , each of which is either a multiplication or of nearest-neighbour type (with $\mathcal{S}_{T,\gamma}$ a composition of Laplacians). So the sum $\sum_{\omega^{\prime}}A^{(r)}(\omega,\omega^{\prime})$ reduces to $\sum_{\omega^{\prime}\approx\omega}A^{(r)}(\omega,\omega^{\prime})$ , where depending on the operator, $\omega^{\prime}\approx\omega$ means $\omega^{\prime}=\omega$ , $\omega^{\prime}\sim\omega$ , $\omega^{\prime}\in\{o_{\omega},t_{\omega}\}$ (origin and terminus of $\omega$ ), $\omega^{\prime}\in\{(x,\omega),(\omega,y):x\sim o_{\omega},y\sim t_{\omega}\}$ or $\omega^{\prime}\in\{(x,\omega,y):x\sim o_{\omega},y\sim t_{\omega}\}$ . In any case, $\#\{\omega^{\prime}\approx\omega\}\leq 2D$ . So $\mathcal{F}_{\gamma}(\omega,\omega^{\prime})=\sum_{\omega_{1}\approx\omega}\dots\sum_{\omega_{\ell-1}\approx\omega_{\ell-2}}A^{(1)}(\omega,\omega_{1})\dots A^{(\ell)}(\omega_{\ell-1},\omega^{\prime})$ and thus $\sum_{\omega^{\prime}\in B_{m}}|\mathcal{F}_{\gamma}(\omega,\omega^{\prime})|\leq\sum_{\omega_{1}\approx\omega}\dots\sum_{\omega_{\ell}\approx\omega_{\ell-1}}|A^{(1)}(\omega,\omega_{1})\dots A^{(\ell)}(\omega_{\ell-1},\omega_{\ell})|$ . It follows that $|\mathcal{F}_{\gamma}K(\omega)|^{s}\leq(2\ell D)^{s-1}\sum_{\omega_{1}\approx\omega}\dots\sum_{\omega_{\ell}\approx\omega_{\ell-1}}|A^{(1)}(\omega,\omega_{1})\dots A^{(\ell)}(\omega_{\ell-1},\omega_{\ell})|^{s}$ . Using Hölder’s inequality, if $\sum_{r=1}^{\ell}\frac{1}{p_{r}}=1$ , we get using Remark A.3 that

[TABLE]

uniformly in $\lambda$ . Here, $\ell$ may depend on $T$ . By definition, all $\hat{A}^{(r)}(\omega,\omega^{\prime})$ are well-behaved functions of $\hat{\zeta}$ and $\mathcal{G}^{z}$ , so the previous expression is finite using Remark A.4. For example, if $\mathcal{F}^{\gamma}=\mathcal{T}^{\gamma}$ , we are reduced to estimating $\operatorname{\mathbb{E}}\big{(}\sum_{o^{\prime}\sim o}|\frac{\overline{\zeta^{\gamma}_{o^{\prime}}(o)}\hat{\zeta}^{\gamma}_{o}(o^{\prime})}{\overline{\hat{\zeta}_{o^{\prime}}^{\gamma}(o)}\hat{\zeta}_{o}^{\gamma}(o^{\prime})+1}|^{s}\big{)}$ . Using (2.7), we observe that $\frac{|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|}{|\hat{\zeta}_{o}^{\gamma}(o^{\prime})+\overline{\hat{\zeta}_{o^{\prime}}^{\gamma}(o)}^{-1}|}=\frac{|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|}{|2\operatorname{Re}\hat{\zeta}_{o}^{\gamma}(o^{\prime})+\overline{2\hat{m}_{o}^{\gamma}}|}\leq\frac{|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|}{2\operatorname{Im}\hat{m}_{o}^{\gamma}}$ , and we know from Remark A.4 that $\sup_{\gamma}\operatorname{\mathbb{E}}\big{(}\sum_{o^{\prime}\sim o}\frac{|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|^{s}}{(2\operatorname{Im}\hat{m}_{o}^{\gamma})^{s}}\big{)}<\infty$ . Similarly, if $\mathcal{F}^{\gamma}=\mathcal{L}^{\gamma}d^{-1}\mathcal{S}_{T,\gamma}$ , then $|(\mathcal{F}^{\gamma}K)(e)|\leq\frac{|\zeta_{o_{e}}^{\gamma}(t_{e})|^{2}}{|m_{o_{e}}^{\gamma}|^{2}}\frac{1}{N_{\gamma}(o_{e})N_{\gamma}(t_{e})}\sum_{r=0}^{T-1}[|(P^{r}d^{-1}N_{\gamma}K)(o_{e})|+\frac{|(P^{r}d^{-1}N_{\gamma}K)(t_{e})|}{|\zeta^{\gamma}_{o_{e}}(t_{e})\zeta^{\gamma}_{t_{e}}(o_{e})|}]$ , so (10.15) reduces to

[TABLE]

for some $p_{1},p_{2}$ .

The previous discussion was under the assumption $A^{(r)}\neq\mathcal{X}_{\gamma}$ . If $\mathcal{F}_{\gamma}=F_{1}^{\gamma}\mathcal{X}_{\gamma}F_{2}^{\gamma}$ with $F_{1}^{\gamma}$ and $F_{2}^{\gamma}$ as in the previous paragraph, we write $\mathcal{F}_{\gamma}K(\omega)=\sum_{\omega^{\prime}}F_{1}^{\gamma}(\omega,\omega^{\prime})\langle F_{2}^{\gamma}K\rangle_{\gamma}$ , with $|\langle F_{2}^{\gamma}K\rangle_{\gamma}|=|\frac{\sum_{x}N_{\gamma}(x)(F_{2}^{\gamma}K)(x)}{\sum_{x}N_{\gamma}(x)}|=|\frac{\sum_{x}\sum_{w}N_{\gamma}(x)F_{2}^{\gamma}(x,w)K(w)}{\sum_{x}N_{\gamma}(x)}|\leq\frac{\sum_{x}\sum_{w}N_{\gamma}(x)|F_{2}^{\gamma}(x,w)|}{\sum_{x}N_{\gamma}(x)}$ . Hence, $|\mathcal{F}_{\gamma}K(\omega)|\leq\sum_{\omega^{\prime}}|F_{1}^{\gamma}(\omega,\omega^{\prime})|\cdot\frac{N}{\sum_{x}N_{\gamma}(x)}\cdot\frac{1}{N}\sum_{x}\sum_{w}N_{\gamma}(x)|F_{2}^{\gamma}(x,w)|$ . Applying Hölder’s inequality to $\frac{1}{N}\sum_{\omega\in B_{k}}(\sum_{\omega^{\prime}}|F_{1}^{\gamma}(\omega,\omega^{\prime})|)^{s}$ and $(\frac{1}{N}\sum_{x}N_{\gamma}(x)\sum_{w}|F_{2}^{\gamma}(x,w)|)^{s}$ and taking the limit, we obtain a uniform control as before. Thus, all points of (Hol) are satisfied.

In view of Remark 10.4, we may use Theorem 3.3 to conclude that for the $\mathcal{F}_{\gamma}$ in Corollary 10.3, we have $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\operatorname{Var_{nb,\eta_{0}}^{I}}(\mathcal{F}_{\gamma}K)=0$ .

Since $\widetilde{\operatorname{Var_{nb,\eta_{0}}^{I}}}(\widetilde{\mathcal{F}}_{\gamma}K)$ is defined exactly like $\operatorname{Var_{nb,\eta_{0}}^{I}}(\mathcal{F}_{\gamma}K)$ except that $\zeta$ is replaced by $\overline{\zeta}$ , it is clear that it can be shown to vanish asymptotically using the same arguments, simply replacing $\zeta$ by $\overline{\zeta}$ when necessary. By Corollary 10.3, to finish the proof of Theorem 1.3, it suffices to show (10.14). This is what we do now.

Recall that we introduced $\|K\|_{\gamma}$ for $K\in\mathscr{H}_{k}$ , $k\geq 1$ , in (4.1). For $K\in\mathscr{H}_{0}$ , we let

[TABLE]

We also define $(Y_{\gamma}K)(x)=\frac{d(x)}{N_{\gamma}(x)}\cdot\frac{\sum_{y\in V}N_{\gamma}(y)K(y)}{\sum_{y\in V}d(y)}$ . Denoting $\langle J\rangle_{U}:=\frac{1}{N}\sum_{x\in V}J(y)$ the uniform average of $J$ , we have $Y_{\gamma}K=\frac{\langle N_{\gamma}K\rangle_{U}}{\langle d\rangle_{U}}\cdot\frac{d}{N_{\gamma}}$ . Fix $I=(a,b)\subset I_{1}$ as in Section 4.

Proposition 10.5.

Under assumptions (BSCT), (Green), if $K^{\gamma}\in\mathscr{H}_{0}$ satisfies the set of assumptions (Hol), then for any interval $I=(a,b)$ as above,

[TABLE]

Proof.

We follow the steps in the proof of Theorem 4.1. Let $J^{\gamma}=(\widetilde{\mathcal{S}}_{T,\gamma}-Y_{\gamma})K^{\gamma}$ and $\alpha_{\gamma_{j}}(x)=N_{\gamma_{j}}^{1/2}(x)$ . Then $\operatorname{Var_{\eta_{0}}^{I}}(J^{\gamma})^{2}\leq(\frac{1}{N}\sum_{\lambda_{j}\in I}\|\alpha_{\gamma_{j}}^{-1}\psi_{j}\|^{2})(\frac{1}{N}\sum_{\lambda_{j}\in I}\|\alpha_{\gamma_{j}}J_{G}^{\gamma_{j}}\psi_{j}\|^{2})$ . As in the proof of (4.3), $\frac{1}{N}\sum_{\lambda_{j}\in I}\|\alpha_{\gamma_{j}}^{-1}\psi_{j}\|^{2}\lesssim\frac{3}{\pi N}\int_{a-2\eta}^{b+2\eta}\sum_{\rho_{G}(x)\geq d_{R,\eta}}\frac{\Psi_{z+i\eta_{0},\tilde{x}}(\tilde{x})}{N_{\lambda+i\eta_{0}}(x)}\,\mathrm{d}\lambda\leq\frac{3(|I|+4\eta)}{\pi}$ for any small $\eta>0$ , since $N_{\gamma}(x)=\Psi_{\gamma,\tilde{x}}(\tilde{x})$ .

Hence, $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\operatorname{Var_{\eta_{0}}^{I}}(J^{\gamma})^{2}\leq\frac{3|I|}{\pi}\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\frac{1}{N}\sum_{\lambda_{j}\in I}\|\alpha_{\gamma_{j}}J_{G}^{\gamma_{j}}\psi_{j}\|^{2}$ . Now $\|\alpha_{\gamma_{j}}J_{G}^{\gamma_{j}}\psi_{j}\|^{2}=\sum_{x\in V}N_{\gamma_{j}}(x)|J^{\gamma_{j}}(x)|^{2}|\psi_{j}(x)|^{2}$ . Arguing as in Section 4, we get

[TABLE]

where $z:=\lambda+i\eta^{4}$ . This is bounded by $\frac{3}{\pi}\int_{a-2\eta}^{b+2\eta}\|J^{z+i\eta_{0}}\|_{z+i\eta_{0}}^{2}\,\mathrm{d}\lambda$ , since $\Psi_{\gamma,\tilde{x}}(\tilde{x})=N_{\gamma}(x)$ and $\chi(\lambda)\leq 1$ on $\mathbb{R}$ .

Summarizing, we have $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\operatorname{Var_{\eta_{0}}^{I}}(J^{\gamma})^{2}\leq\frac{9|I|}{\pi^{2}}\int_{a-2\eta}^{b+2\eta}\|J^{z+i\eta_{0}}\|_{z+i\eta_{0}}^{2}\,\mathrm{d}\lambda$ .

Now recall that $\widetilde{\mathcal{S}}_{T,\gamma}=\frac{1}{T}\sum_{s=1}^{T}P_{\gamma}^{s}$ , and $P_{\gamma}=\frac{d}{N_{\gamma}}P\frac{N_{\gamma}}{d}$ , so that $P_{\gamma}^{s}=\frac{d}{N_{\gamma}}P^{s}\frac{N_{\gamma}}{d}$ . Moreover, $Y_{\gamma}K=\frac{d}{N_{\gamma}}\frac{\langle N_{\gamma}K\rangle_{U}}{\langle d\rangle_{U}}$ . So denoting $\gamma=z+i\eta_{0}$ , $\|K\|_{d}^{2}=\frac{1}{N}\sum_{x\in V}d(x)|K(x)|^{2}$ , we have

[TABLE]

Here we used (EXP) and the fact that $\frac{N_{\gamma}K^{\gamma}}{d}-\frac{\langle N_{\gamma}K^{\gamma}\rangle_{U}}{\langle d\rangle_{U}}\mathbf{1}$ is orthogonal to the constants in $\ell^{2}(V,d)$ . Indeed, the orthogonal projector onto $\mathbf{1}$ in $\ell^{2}(V,d)$ is $P_{\mathbf{1},d}J=\frac{\langle\mathbf{1},J\rangle_{d}}{\langle\mathbf{1},\mathbf{1}\rangle_{d}}\mathbf{1}=\frac{\langle dJ\rangle_{U}}{\langle d\rangle_{U}}\mathbf{1}$ . Since $\frac{\langle N_{\gamma}K^{\gamma}\rangle_{U}}{\langle d\rangle_{U}}\mathbf{1}=\frac{N_{\gamma}Y_{\gamma}K^{\gamma}}{d}$ and $\frac{1}{d}\leq 1$ , the proposition follows. ∎

Corollary 10.6.

For any $C_{\gamma}:\mathscr{H}_{m}\to\mathscr{H}_{0}$ as in Corollary 10.3 and $\bar{I}\subset I_{1}$ , $\|K\|_{\infty}\leq 1$ ,

[TABLE]

Proof.

Let $K_{\gamma}^{\prime}=C_{\gamma}K-\langle C_{\gamma}K\rangle_{\gamma}\mathbf{1}$ . Then $Y_{\gamma}K_{\gamma}^{\prime}=0$ , since $Y_{\gamma}C_{\gamma}K=\frac{d}{N_{\gamma}}\frac{\langle N_{\gamma}C_{\gamma}K\rangle_{U}}{\langle d\rangle_{U}}$ and $\langle C_{\gamma}K\rangle_{\gamma}Y_{\gamma}\mathbf{1}=\frac{\langle N_{\gamma}C_{\gamma}K\rangle_{U}}{\langle N_{\gamma}\rangle_{U}}\frac{d}{N_{\gamma}}\frac{\langle N_{\gamma}\rangle_{U}}{\langle d\rangle_{U}}$ . Hence, denoting $z=\lambda+i(\eta^{4}+\eta_{0})$ ,

[TABLE]

Now $\|C_{z}K\|_{z}^{2}=\frac{1}{N}\sum_{x\in V}N_{z}^{2}(x)|(C_{z}K)(x)|^{2}\leq\frac{1}{N}\sum_{x\in V}N_{z}^{2}(x)[\sum_{w\in B_{m}}|C_{z}(x,w)|]^{2}$ . Similarly, $|\langle C_{z}K\rangle_{\lambda}|\leq\frac{1}{\sum_{x}N_{z}(x)}\sum_{x}N_{z}(x)\sum_{w}|C_{z}(x,w)|$ . For our operators $C_{z}$ , we thus get $\|C_{z}K\|_{z}^{2}=O(1)_{N\longrightarrow+\infty,z}$ and $|\langle C_{z}K\rangle_{z}|=O(1)_{N\longrightarrow+\infty,z}$ , as in Corollary 7.8. ∎

This proves (10.14) and ends the proof of Theorem 1.3 on the interval $I$ .

Suppose further that $\rho(\partial I_{1})=0$ . As $I_{1}$ is open, we have $I_{1}=\cup_{j\in\mathbb{N}}J_{j}$ for open intervals $J_{j}=(a_{j},b_{j})$ . Let $J_{j}^{\varsigma}=(a_{j}+\varsigma,b_{j}-\varsigma)$ with $\varsigma>0$ small. Then $\overline{J_{j}^{\varsigma}}\subset I_{1}$ , so using (9.7) and Corollary 10.6, we get $\lim_{\eta_{0}\downarrow 0}\limsup_{N\to\infty}\operatorname{Var_{\eta_{0}}^{J_{j}^{\varsigma}}}(K-\langle K\rangle_{\gamma})=0$ . Now $\operatorname{Var_{\eta_{0}}^{I_{1}}}(K^{\prime})=\sum_{j=1}^{M}\operatorname{Var_{\eta_{0}}^{J_{j}^{\varsigma}}}(K^{\prime})+\operatorname{Var_{\eta_{0}}^{I_{1}\setminus\cup_{j=1}^{M}J^{\varsigma}_{j}}}(K^{\prime})$ for any given $M$ . By (A.14) and (Green), we have $\operatorname{Var_{\eta_{0}}^{I_{1}\setminus\cup_{j=1}^{M}J^{\varsigma}_{j}}}(K-\langle K\rangle_{\gamma})\leq\frac{\sharp\{\lambda_{j}\in I_{1}\setminus\cup_{k=1}^{M}J_{k}^{\varsigma}\}}{N}\,O(1)_{N\longrightarrow+\infty,\gamma}$ . By the convergence of empirical spectral measures (Remark A.3), and using the fact that $\rho(\partial I_{1})=0$ , we have $\frac{\sharp\{\lambda_{j}\in I_{1}\setminus\cup_{k=1}^{M}J_{k}^{\varsigma}\}}{N}\to\rho(I_{1}\setminus\cup_{k=1}^{M}J_{k}^{\varsigma})$ . Finally, $\rho(I_{1}\setminus\cup_{k=1}^{M}J_{k}^{\varsigma})\to 0$ as $\varsigma\downarrow 0$ and $M\longrightarrow+\infty$ . The conclusion of Theorem 1.3 thus holds with $I$ replaced by $I_{1}$ .

Finally, if (Green) holds on $\overline{I_{1}}$ , then $\rho(\{\lambda\})=\lim_{\eta\downarrow 0}\eta\operatorname{Im}\operatorname{\mathbb{E}}(\mathcal{G}^{\lambda+i\eta}(o,o))=0$ for any $\lambda\in\overline{I_{1}}$ , since $\sup_{\eta>0}\operatorname{Im}\operatorname{\mathbb{E}}(\mathcal{G}^{\lambda+i\eta}(o,o))<\infty$ . In particular, $\rho(\partial I_{1})=0$ .

Appendix A Benjamini–Schramm topology

A.1. Generalities

In this appendix we collect known facts on the Benjamini-Schramm convergence, we refer the reader to [1, 6, 16, 17, 38] for details.

A coloured rooted graph $(G,o,W)$ is a graph $G=(V,E)$ with a marked vertex $o\in V$ called the root, and a map $W:V\to\mathbb{R}$ which we see as a “colouring”; it can also be regarded as a potential on $\ell^{2}(V)$ . This is a special case of what is called a network in [6]. All graphs are assumed to be locally finite, i.e. each vertex has a finite degree.

If $G$ is connected, we denote by $B_{G}(x,r)$ the $r$ -ball $\{y\in V:d_{G}(x,y)\leq r\}$ , where $d_{G}$ is the length of the shortest path between $x$ and $y$ in $G$ .

As in [6], we define a distance between coloured connected graphs by

[TABLE]

Two coloured rooted graphs $(G,o,W)$ and $(G^{\prime},o^{\prime},W^{\prime})$ are equivalent if there is a graph isomorphism $\phi:G\to G^{\prime}$ such that $\phi(o)=o^{\prime}$ and $W^{\prime}\circ\phi=W$ . We denote the equivalence class of $(G,o,W)$ by $[G,o,W]$ .

Let $\mathscr{G}_{\ast}$ be the set of equivalence classes of connected coloured rooted graphs. Then $d_{loc}$ turns $\mathscr{G}_{\ast}$ into a separable complete metric space. We may thus consider the set of probability measures on $\mathscr{G}_{\ast}$ , denoted by $\mathcal{P}(\mathscr{G}_{\ast})$ .

Any finite connected coloured graph $(G,W)$ , $G=(V,E)$ , defines a probability measure $U_{(G,W)}\in\mathcal{P}(\mathscr{G}_{\ast})$ by choosing the root $x$ uniformly at random in $V$ :

[TABLE]

If $(G_{n},W_{n})$ is a sequence of finite coloured graphs, we say that $\operatorname{\mathbb{P}}\in\mathcal{P}(\mathscr{G}_{\ast})$ is the local weak limit of $(G_{n},W_{n})$ if $U_{(G_{n},W_{n})}$ converges weakly- $\ast$ to $\operatorname{\mathbb{P}}$ in $\mathcal{P}(\mathscr{G}_{\ast})$ . This notion of convergence was introduced in [16] and generalized in [6]. In this case, we also say that $(G_{n},W_{n})$ converges in the sense of Benjamini-Schramm.

The subset $\mathscr{G}_{\ast}^{D,A}\subset\mathscr{G}_{\ast}$ of equivalence classes $[G,o,W]$ such that $G$ is of degree bounded by $D$ , and $W$ takes values in $[-A,A]$ , is compact. It follows that $\mathcal{P}(\mathscr{G}_{\ast}^{D,A})$ is compact in the weak- $\ast$ topology. Hence, if $\mathcal{C}^{D,A}_{\text{fin}}$ denotes the set of finite coloured graphs $(G,W)$ , $G=(V,E)$ , of degree bounded by $D$ and colouring $W:V\to[-A,A]$ , then any sequence $(G_{n},W_{n})\subset\mathcal{C}^{D,A}_{\text{fin}}$ has a subsequence which converges in the sense of Benjamini-Schramm.

Let $C(\mathscr{G}_{\ast}^{D,A})$ be the set of continuous functions $f:\mathscr{G}_{\ast}^{D,A}\to\mathbb{R}$ .

Then a sequence $(G_{n},W_{n})\subset\mathcal{C}^{D,A}_{\text{fin}}$ has a local weak limit $\operatorname{\mathbb{P}}$ iff there is an algebra $\mathscr{A}\subset C(\mathscr{G}_{\ast}^{D,A})$ which separates points, such that for all $f\in\mathscr{A}$ ,

[TABLE]

This follows from the compactness of $\mathscr{G}_{\ast}^{D,A}$ , see [34, Chapter 13].

It may not be very clear how a continuous function on $\mathscr{G}_{\ast}^{D,A}$ looks like, so we give a basic example. If $B_{F}(o,r)$ is an $r$ -ball, the sets $\mathscr{C}_{F}=\{[G,x,W]:B_{G}(x,r)\cong B_{F}(o,r)\}$ turn out to be clopen in $\mathscr{G}_{\ast}^{D,A}$ , so the characteristic function $\chi_{\mathscr{C}_{F}}$ is continuous. Here $B_{G}(x,r)\cong B_{F}(o,r)$ means there exists a graph isomorphism $\phi:B_{G}(x,r)\to B_{F}(o,r)$ with $\phi(x)=o$ , Using (A.3), it can be shown that in the special case where there is no colouring, $(G_{n})\subset\mathcal{C}^{D,A}_{\text{fin}}$ has a local weak limit $\operatorname{\mathbb{P}}$ iff

[TABLE]

for any $B_{F}(o,r)$ . This was in fact the original criterion in [16]. Using it, one readily checks that a sequence of $(q+1)$ -regular graphs $(G_{n})$ satisfies (BST) iff it converges to the $(q+1)$ -regular tree $\mathbb{T}_{q}$ in the sense of Benjamini-Schramm, i.e. iff $(G_{n})$ has the local weak limit $\delta_{[\mathbb{T}_{q},o]}$ , with $o\in\mathbb{T}_{q}$ arbitrary. More generally, by considering the clopen sets $\mathscr{C}_{r}=\{[G,x,W]:B_{G}(x,r)\text{ is not a tree}\}$ , one sees that if $(G_{n},W_{n})\subset\mathcal{C}^{D,A}_{\text{fin}}$ has a local weak limit $\operatorname{\mathbb{P}}$ that is concentrated on the subset $\mathscr{T}_{\ast}^{D,A}\subset\mathscr{G}_{\ast}^{D,A}$ of coloured rooted trees, then $(G_{n})$ satisfies (BST). Conversely, if $(G_{n})$ satisfies (BST) and if a subsequence of $(G_{n},W_{n})$ has a local weak limit $\operatorname{\mathbb{P}}$ , then $\operatorname{\mathbb{P}}$ must be concentrated on $\mathscr{T}_{\ast}^{D,A}$ .

A.2. Convergence of empirical spectral measures.

We now show that Benjamini-Schramm convergence implies convergence of the empirical spectral measures. This is already known in some settings [1, 38, 39]. In this paper we need the variant stated as Corollary A.2.

Given $[G,o,W]\in\mathscr{G}_{\ast}^{D,A}$ , $\gamma\in\mathbb{C}^{+}=\{z,\operatorname{Im}z>0\}$ and $x\sim y\in G$ , we define $\zeta_{x}^{\gamma}(y)$ as in §2.2. Like in §2.1, $B_{k}$ is the set of non-backtracking paths of length $k$ on $G$ .

Fix $s\in\mathbb{N}$ . Let $F:(\mathbb{C}\setminus\{0\})^{2s}\to\mathbb{C}$ be a continuous function and $\gamma\in\mathbb{C}^{+}$ . Let

[TABLE]

For $s=1$ , the sum reduces to $\sum_{x_{1}:x_{1}\sim o}$ . One can remark that $F_{\gamma}([{G},{o},{W}])=F_{\gamma}([\widetilde{G},\widetilde{o},\widetilde{W}])$ where ${\widetilde{G}}$ is the universal cover of $G$ and $\widetilde{o},\widetilde{W}$ are lifts of ${o},{W}$ .

Next, given Borel $J\subseteq\mathbb{R}$ , we define the measure

[TABLE]

Fix a compact $I\subset\mathbb{R}$ and fix $\eta\in(0,1)$ .

Lemma A.1.

Suppose $(\lambda_{n},[G_{n},o_{n},W_{n}])\subset I\times\mathscr{G}_{\ast}^{D,A}$ converges to $(\lambda,[G,o,W])$ in $I\times\mathscr{G}_{\ast}^{D,A}$ . Then $\mu_{o_{n},F,\lambda_{n}+i\eta}^{(G_{n},W_{n})}$ converges weakly- $\ast$ to $\mu_{o,F,\lambda+i\eta}^{(G,W)}$ .

Proof.

Since all operators $H_{n}=H_{(G_{n},W_{n})}$ and $H=H_{(G,W)}$ are uniformly bounded by $D+A$ , the supports of the spectral measures is compact, so it suffices to show that for any $k\in\mathbb{N}$ , $\mu_{o_{n},F,\lambda_{n}+i\eta}^{(G_{n},W_{n})}(t^{k})\to\mu_{o,F,\lambda+i\eta}^{(G,W)}(t^{k})$ ; see [34, Chapter 13].

Let $k\in\mathbb{N}$ . Denote $\gamma_{n}=\lambda_{n}+i\eta$ , $\gamma=\lambda+i\eta$ . We have

[TABLE]

We first approximate $F$ by a polynomial.

We have $|\zeta_{x}^{\lambda+i\eta}(y)|\leq\eta^{-1}$ and $|\operatorname{Im}\zeta_{x}^{\lambda+i\eta}(y)|=\eta\,\|({\widetilde{H}}^{(\tilde{y}|\tilde{x})}-\lambda-i\eta)^{-1}\delta_{\tilde{y}}\|_{\ell^{2}({\widetilde{G}})}^{2}$ . Since $\|{\widetilde{H}}^{({x}|{y})}-\lambda-i\eta\|_{\ell^{2}\to\ell^{2}}\leq A+D+c_{I}+1=:c$ for all $\lambda\in I$ and $\eta\in(0,1)$ , we get $|\operatorname{Im}\zeta_{x}^{\lambda+i\eta}(y)|\geq\eta c^{-2}$ .

So let $\mathcal{O}\subset\mathbb{C}$ be the compact region $\{\eta c^{-2}\leq|z|\leq\eta^{-1}\}$ . If $F$ is continuous on $\mathcal{O}^{2s}\subset\mathbb{C}^{2s}$ , by Stone-Weierstrass, given $R\in\mathbb{N}^{\ast}$ , there is a polynomial $P_{R}$ of $4s$ variables such that $\sup_{(z_{1};z_{2s})\in\mathcal{O}^{2s}}|F(z_{1},\dots,z_{2s})-P_{R}(z_{1},\bar{z}_{1},\dots,z_{2s},\bar{z}_{2s})|\leq\frac{1}{2R}$ . Hence, for any $\lambda\in I$ and $(x_{0};x_{s})$ , if $\gamma=\lambda+i\eta$ , then

[TABLE]

Let $h_{\eta}(t)=-(t-i\eta)^{-1}$ . Given $\epsilon>0$ , we may choose a polynomial $Q_{\epsilon}=Q_{\epsilon}^{\eta}$ such that $\|h_{\eta}-Q_{\epsilon}\|_{\infty}<\epsilon$ . It follows that $\|h_{\eta}(H^{(\tilde{x}|\tilde{y})}_{\tilde{G}}-\lambda)-Q_{\epsilon}(H^{(\tilde{x}|\tilde{y})}_{\tilde{G}}-\lambda)\|<\epsilon$ . In particular, if $Z_{\epsilon}^{\gamma}(x,y):=Q_{\epsilon}(H^{(\tilde{y}|\tilde{x})}_{\tilde{G}}-\lambda)(\tilde{y},\tilde{y})$ , we have for any $\lambda\in I$ and $(x,y)\in B$ ,

[TABLE]

As $P_{R}$ is Lipschitz-continuous on $\mathcal{O}^{2s}$ , we may thus find $C_{R,\eta^{-1}}$ such that

[TABLE]

by choosing $\epsilon=\frac{1}{2R}\frac{1}{C_{R,\eta^{-1}}}$ . Using (A.5), we thus get uniformly in $\lambda\in I$ , $(x_{0};x_{s})$ ,

[TABLE]

where we now denote $Z_{R}$ because $\epsilon$ is a function of $R$ . Define

[TABLE]

Then up to an error $\frac{C_{D,s,A,k}}{R}$ , it suffices to consider

[TABLE]

Let $d_{R}$ be the degree of $Q_{R}$ and choose an arbitrary integer $r\geq d_{R}+s+k=:d_{R,s,k}$ . Then we may find $n_{r}$ such that for $n\geq n_{r}$ , there exists $\varphi_{r}:B_{G_{n}}(o_{n},r)\xrightarrow{\sim}B_{G}(o,r)$ with $\|W\circ\varphi_{r}-W_{n}\|_{B_{G_{n}}(o,r)}<1/r$ . Now $\langle\delta_{o_{n}},H_{n}^{k}\delta_{o_{n}}\rangle=\sum_{u_{0},\dots,u_{k-1}}H_{n}(o_{n},u_{0})H_{n}(u_{0},u_{1})\dots H_{n}(u_{k-1},o_{n})$ and $H_{n}(v,w)=\mathcal{A}_{n}(v,w)+W_{n}(v)\delta_{w}(v)$ . This only depends on $B_{G_{n}}(o_{n},k)$ and its colouring. Similarly, the quantity $Z_{R}^{\gamma}(x,y)$ corresponding to $(G_{n},o_{n},W_{n})$ only depends on $B_{G_{n}}(y,d_{R})$ and its colouring. Since $r\geq d_{R,s,k}$ and $\varphi_{r}:B_{G_{n}}(o_{n},r)\xrightarrow{\sim}B_{G}(o,r)$ , if we let $\mathcal{H}_{n}=\mathcal{A}_{G}+W_{n}\circ\varphi_{r}^{-1}$ on $G$ , we get $\langle\delta_{o_{n}},H_{n}^{k}\delta_{o_{n}}\rangle=\langle\delta_{o},\mathcal{H}_{n}^{k}\delta_{o}\rangle$ . Similarly, $P_{\gamma_{n}}([{G}_{n},{o}_{n},{W}_{n}])=P_{\gamma_{n}}([{G},{o},{W}_{n}\circ\varphi_{r}^{-1}])$ . Let ${W}_{n}^{\prime}={W}_{n}\circ\varphi_{r}^{-1}$ . Then for $n\geq n_{r}$ ,

[TABLE]

Writing $\mathcal{H}_{n}^{k}-H^{k}=\sum_{i=1}^{k}\mathcal{H}_{n}^{k-i}(\mathcal{H}_{n}-H)H^{i-1}$ , we have

[TABLE]

A similar argument yields $|P_{\gamma}([{G},{o},{W}_{n}^{\prime}])-P_{\gamma}([{G},{o},{W}])|\leq\frac{C_{R,D,s,A}}{r}$ and $|P_{\gamma_{n}}([{G},{o},{W}_{n}^{\prime}])-P_{\gamma}([{G},{o},{W}_{n}^{\prime}])|\leq C_{R,D,s,A,I}|\lambda_{n}-\lambda|\leq\frac{C_{R,D,s,A,I}}{r}$ for $n\geq n_{r}^{\prime}$ . We thus showed that for any $r\geq d_{R,s,k}$ , there exists $n_{r}^{\prime\prime}$ such that if $n\geq n_{r}^{\prime\prime}$ , then $|\mu_{o_{n},F,\gamma_{n}}^{(G_{n},W_{n})}(t^{k})-\mu_{o,F,\gamma}^{(G,W)}(t^{k})|\leq\frac{C_{D,s,A,k}}{R}+\frac{C^{\prime}_{k,D,A}+C_{R,D,s,A}+C_{R,D,s,A,I}}{r}$ . It follows that $\limsup_{n\to\infty}|\mu_{o_{n},F,\gamma_{n}}^{(G_{n},W_{n})}(t^{k})-\mu_{o,F,\gamma}^{(G,W)}(t^{k})|\leq\frac{C_{D,s,A,k}}{R}$ . Since $R$ is arbitrary, the proof is complete. ∎

If $(G,W)\in\mathcal{C}_{\textup{fin}}^{D,A}$ , we now define, for $\gamma\in\mathbb{C}^{+}$ ,

[TABLE]

Corollary A.2.

Suppose $(G_{n},W_{n})\subset\mathcal{C}_{\textup{fin}}^{D,A}$ has a local weak limit $\operatorname{\mathbb{P}}$ . Fix a compact $I\subset\mathbb{R}$ and $\eta\in(0,1)$ . Then $\mu^{(G_{n},W_{n})}_{F,\lambda+i\eta}$ converges weakly to $\int_{\mathscr{G}_{\ast}^{D,A}}\mu_{o,F,\lambda+i\eta}^{(G,W)}\,\mathrm{d}\operatorname{\mathbb{P}}([G,o,W])$ , uniformly in $\lambda\in I$ . In other words, for any continuous $\varphi:\mathbb{R}\to\mathbb{R}$ , we have uniformly in $\lambda\in I$ ,

[TABLE]

Proof.

Given continuous $\varphi:\mathbb{R}\to\mathbb{R}$ , define $\widehat{\varphi}:I\times\mathscr{G}_{\ast}^{D,A}\to\mathbb{R}$ by $\widehat{\varphi}(\lambda,[G,o,W])=\int\varphi(t)\,\mathrm{d}\mu_{o,F,\lambda+i\eta}^{(G,W)}(t)$ . Lemma A.1 states $\widehat{\varphi}$ is continuous on $I\times\mathscr{G}_{\ast}^{D,A}$ – hence, uniformly continuous. Let $\widehat{\varphi}_{\lambda}([G,o,W])=\widehat{\varphi}(\lambda,[G,o,W])$ . Local convergence means that the measures $U_{(G_{n},W_{n})}$ (defined in (A.2)) converge weakly to $\operatorname{\mathbb{P}}$ . Thus, for any $\lambda\in I$ , $\int\widehat{\varphi}_{\lambda}\,\mathrm{d}U_{(G_{n},W_{n})}\to\int\widehat{\varphi}_{\lambda}\,\mathrm{d}\rho$ , i.e. $\frac{1}{|V_{n}|}\sum_{x\in V_{n}}\widehat{\varphi}_{\lambda}([G_{n},x,W_{n}])\to\int\widehat{\varphi}_{\lambda}([G,o,W])\,\mathrm{d}\operatorname{\mathbb{P}}([G,o,W])$ , which is the statement of the lemma for fixed $\lambda\in I$ .

Uniformity in $\lambda$ comes from the uniform continuity of $\widehat{\varphi}$ , which implies that the maps $\lambda\mapsto\int\widehat{\varphi}_{\lambda}\,\mathrm{d}U_{(G_{n},W_{n})}$ form a uniformly equicontinuous family. ∎

Remark A.3.

Taking $F\equiv 1$ , we get in particular the convergence of empirical spectral measures. On the other hand, when $\varphi\equiv 1$ , we get in particular that under assumption (BSCT), if $I\subset\mathbb{R}$ is compact and $\eta\in(0,1)$ is fixed, then uniformly in $\lambda\in I$ ,

[TABLE]

In the paper, we often encounter expressions of the form $\vartheta_{\gamma}(x_{0},x_{1})=F(\zeta_{x_{0}}^{\gamma}(x_{1}),\zeta_{x_{1}}^{\gamma}(x_{0}))$ in the LHS of (A.8). In this case, we write $\hat{\vartheta}_{\gamma}(v_{0},v_{1}):=F(\hat{\zeta}_{v_{0}}^{\gamma}(v_{1}),\hat{\zeta}_{v_{1}}^{\gamma}(v_{0}))$ for the object defined similarly at the limit. For instance, $\hat{\mu}_{1}^{\gamma}$ is defined like ${\mu}_{1}^{\gamma}$ but on the limiting tree $(\mathcal{T},\mathcal{W})$ . In the particular case of $m^{\gamma}$ , we have $\hat{m}_{o}^{\gamma}=\frac{-1}{2\mathcal{G}^{\gamma}(o,o)}$ .

It is worth noting that $\operatorname{\mathbb{E}}[\sum_{o^{\prime}\sim o}{F}(\hat{\zeta}_{o}^{\gamma}(o^{\prime}))]=\operatorname{\mathbb{E}}[\sum_{o^{\prime}\sim o}{F}(\hat{\zeta}_{o^{\prime}}^{\gamma}(o))]$ . This holds because $\frac{1}{N}\sum_{(x_{0},x_{1})}F(\zeta_{x_{0}}^{\gamma}(x_{1}))=\frac{1}{N}\sum_{(x_{0},x_{1})}F(\zeta_{x_{1}}^{\gamma}(x_{0}))$ .

Remark A.4.

Using (2.4b), we have $|\hat{\zeta}_{o^{\prime}}^{\gamma}(o)|^{s}\leq|\operatorname{Im}\hat{\zeta}_{o}^{\gamma}(u)|^{-s}$ for any $u\in\mathcal{N}_{o}\setminus\{o^{\prime}\}$ . In particular, $|\hat{\zeta}_{o^{\prime}}^{\gamma}(o)|^{s}\leq\sum_{o^{\prime\prime}\sim o}|\operatorname{Im}\hat{\zeta}_{o}^{\gamma}(o^{\prime\prime})|^{-s}$ . We thus see by (Green) that for any $s>0$ ,

[TABLE]

We also have

[TABLE]

To see this, consider for simplicity $\operatorname{\mathbb{E}}[\sum_{(v_{0};v_{2}),v_{0}=o}|\hat{\zeta}_{v_{0}}^{\gamma}(v_{1})\hat{\zeta}_{v_{1}}^{\gamma}(v_{2})|^{s}]$ . This is the limit of $\frac{1}{N}\sum_{(x_{0};x_{2})\in B_{2}}|\zeta_{x_{0}}^{\gamma}(x_{1})\zeta_{x_{1}}^{\gamma}(x_{2})|^{s}$ . This sum is bounded by $(\frac{1}{N}\sum_{(x_{0};x_{2})\in B_{2}}|\zeta_{x_{0}}^{\gamma}(x_{1})|^{2s})^{1/2}\cdot(\frac{1}{N}\sum_{(x_{0};x_{2})\in B_{2}}|\zeta_{x_{1}}^{\gamma}(x_{2})|^{2s})^{1/2}$ for any $N$ . Using $|\mathcal{N}_{x_{1}}|-1\leq D$ and taking $N\to\infty$ , we see the limit is bounded by $D\operatorname{\mathbb{E}}(\sum_{o^{\prime}\sim o}|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|^{2s})^{1/2}\operatorname{\mathbb{E}}(\sum_{o^{\prime}\sim o}|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|^{2s})^{1/2}\leq DC_{s}$ by (A.10), for any $\lambda\in I_{1}$ and $\eta>0$ . Hence, $\sup_{\lambda\in I_{1},\eta>0}\operatorname{\mathbb{E}}[\sum_{(v_{0};v_{2}),v_{0}=o}|\hat{\zeta}_{v_{0}}^{\gamma}(v_{1})\hat{\zeta}_{v_{1}}^{\gamma}(v_{2})|^{s}]\leq DC_{s}$ .

Remark A.5.

Let us now look at the quantity $\frac{1}{N}\sum_{(x_{0},x_{1})}\sum_{(x_{2};x_{k}),(y_{2};y_{k})}|{\tilde{g}}^{\gamma}(\tilde{x}_{k},\tilde{y}_{k})|^{s}$ , which we had to control in Section 4.

Let $x_{k}\wedge y_{k}$ be the vertex of maximal length in $(x_{0};x_{k})\cap(x_{0};y_{k})$ , so $x_{k}\wedge y_{k}=x_{t}$ for some $1\leq t\leq k$ . Then ${\tilde{g}}^{\gamma}(\tilde{x}_{k},\tilde{y}_{k})=\frac{-\prod_{l=0}^{k-t-1}\zeta_{x_{k-l}}^{\gamma}(x_{k-l-1})\cdot\zeta_{x_{t}}^{\gamma}(y_{t+1})\prod_{l=t+1}^{k-1}\zeta_{y_{l}}^{\gamma}(y_{l+1})}{2m_{x_{k}}^{\gamma}}$ . We then write $\frac{1}{N}\sum_{(x_{0},x_{1})}\sum_{(x_{2};x_{k}),(y_{2};y_{k})}=\frac{1}{N}\sum_{(x_{0},x_{1})}\sum_{t=1}^{k}\sum_{(x_{2};x_{k}),(y_{2};y_{k}),x_{k}\wedge y_{k}=x_{t}}$ , use Hölder’s inequality, and take $N\to\infty$ to get a uniform bound involving $\operatorname{\mathbb{E}}[\sum_{o^{\prime}\sim o}|\hat{\zeta}_{o}^{\gamma}(o^{\prime})|^{s_{2}}]$ and $\operatorname{\mathbb{E}}[|2\hat{m}_{o}|^{-s_{1}}]$ , both of which are finite. Hence, $\frac{1}{N}\sum_{(x_{0},x_{1})}\sum_{(x_{2};x_{k}),(y_{2};y_{k})}|{\tilde{g}}^{\gamma}(\tilde{x}_{k},\tilde{y}_{k})|^{s}$ is uniformly bounded as $N\to\infty$ .

A.3. Proofs of auxiliary results

We now turn to the proofs of some claims in Section 1. In what follows, $\eta_{0}\in(0,1)$ is fixed.

Claim (1.8). Let $\chi:\mathscr{G}_{\ast}^{D,A}\to\mathbb{R}$ and $F:\mathbb{C}\to\mathbb{R}$ be continuous. Then under (BSCT),

[TABLE]

uniformly in $\lambda\in I_{0}$ . This is a variant of Corollary A.2 when one considers $F_{\gamma,\chi}:(\lambda,[G,x,W])\mapsto\chi([G,x])\sum_{y,d(y,x)=k}F(\tilde{g}^{\gamma}(x,y))$ instead of $F_{\gamma}$ . In particular, taking $k=0$ and $\chi=1$ , we obtain (1.8).

Claim (1.9). We may assume $F$ is compactly supported (cf. Lemma A.1), hence uniformly continuous. Let $h_{N}(t)=\frac{1}{N}\sum_{x\in V_{N}}\chi([G_{N},x])\sum_{y,d(y,x)=k}F(t\operatorname{Im}\tilde{g}^{\lambda+i\eta_{0}}_{N}(x,y))$ , $h(t)=\mathbb{E}\big{(}\chi((\mathcal{T},o))\sum_{v,d(v,o)=k}F(t\operatorname{Im}\mathcal{G}^{\lambda+i\eta_{0}}(o,v))\big{)}$ , let $c_{N}(\lambda)=\frac{N}{\sum_{\tilde{x}\in\mathcal{D}_{N}}\operatorname{Im}\tilde{g}^{\lambda+i\eta_{0}}_{N}(\tilde{x},\tilde{x})}$ and $c(\lambda)=\frac{1}{\operatorname{\mathbb{E}}(\operatorname{Im}\mathcal{G}^{\lambda+i\eta_{0}}(o,o))}$ . The family $h_{N}$ is uniformly equicontinuous, and as in (A.11) it converges uniformly to $h$ . By (1.8), $c_{N}(\lambda)\to c(\lambda)$ uniformly in $\lambda$ . So $|h_{N}(c_{N}(\lambda)-h(c(\lambda))|\to 0$ uniformly in $\lambda$ . This proves (1.9).

We now turn to the proof of Claim (1.7). Consider the set of (double)-coloured rooted graphs $(G,o,W,a)$ , where now $W:V\longrightarrow\mathbb{R}$ and $a:V\to\{0,1\}$ . We say $(G,o,W,a)$ and $(G^{\prime},o^{\prime},W^{\prime},a^{\prime})$ are equivalent if there is $\phi:G\to G^{\prime}$ with $\phi(o)=o^{\prime}$ , $W^{\prime}\circ\phi=W$ and $a^{\prime}\circ\phi=a$ . We let $\widehat{\mathscr{G}}_{\ast}^{D,A}$ be the corresponding set of equivalence classes and endow it with a metric $d_{loc}$ defined similarly to (A.1). This amounts to the same definition as before, except that the colourings now take values in $\mathbb{R}\times\{0,1\}$ instead of $\mathbb{R}$ . The notion of local weak limit may obviously be extended to this situation.

Assuming that (BSCT) holds, then up to passing to a subsequence, $(G_{N},W_{N},{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}})$ will have a local weak limit $\hat{\operatorname{\mathbb{P}}}$ concentrated on $\{[\mathcal{T},o,\mathcal{W},a]\}$ , whose marginals on $\mathscr{T}_{\ast}^{D,A}$ coincides with $\operatorname{\mathbb{P}}$ . The fact that $|\Lambda_{N}|\geq\alpha N$ implies $\hat{\mathbb{P}}(a(o)=1)\geq\alpha$ , since $\{a(o)=1\}$ is clopen in $\widehat{\mathscr{G}}_{\ast}^{D,A}$ . We claim that

[TABLE]

uniformly in $\lambda\in I_{0}$ . Indeed, as in Lemma A.1, if $F:I_{0}\times\widehat{\mathscr{G}}_{\ast}^{D,A}\to\mathbb{C}$ is given by $F(\lambda,[G,x,W,a])=a(x)\operatorname{Im}\tilde{g}^{\lambda+i\eta_{0}}(x,x)$ , then $F$ is continuous. So $\int F_{\lambda}\,\mathrm{d}U_{G_{N},W_{N},{\mathchoice{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.0mu{\rm{l}}}{1\mskip-4.5mu{\rm{l}}}{1\mskip-5.0mu{\rm{l}}}}_{\Lambda_{N}}}\to\int F_{\lambda}\,\mathrm{d}\hat{\operatorname{\mathbb{P}}}$ uniformly in $\lambda$ as in Corollary A.2. Combined with (1.8), this yields (A.12). We next note that for any $\alpha>0$ ,

[TABLE]

In fact, suppose on the opposite that for all $\epsilon>0$ , we can find $\lambda\in I_{1},\eta_{0}\in(0,1)$ and $a$ such that $\hat{\mathbb{P}}(a(o)=1)\geq\alpha$ and $\hat{\mathbb{E}}\left(a(o)\operatorname{Im}\mathcal{G}^{\lambda+i\eta_{0}}(o,o)\right)\leq\epsilon$ . The latter implies

[TABLE]

On the other hand, since $a$ takes only the values 0 and 1,

[TABLE]

Thus,

[TABLE]

Equation (A.9) with $s=2$ implies that $\hat{\mathbb{P}}(\operatorname{Im}\mathcal{G}^{\lambda+i\eta_{0}}(o,o)<\epsilon^{1/2})\leq C\epsilon$ , for some constant $C<\infty$ independent of $\lambda,\eta_{0}$ . So $\hat{\mathbb{P}}(\operatorname{Im}\mathcal{G}^{\lambda+i\eta_{0}}(o,o)\geq\epsilon^{1/2})\geq 1-C\epsilon$ . By assumption, $\hat{\mathbb{P}}(a(o)=0)\leq 1-\alpha$ . Taking $\epsilon\to 0$ we would obtain $\alpha\leq 0$ , a contradiction. We thus proved (A.13). Since (A.12) holds uniformly in $\lambda$ , we get (1.7).

Finally, as in the proof of (A.12), we may consider the set of double-coloured rooted graphs $(G,o,W,K)$ , where $K$ is a colouring of pairs of vertices $x,y\in G$ , $d_{G}(x,y)\leq R$ , with values in $\{|z|\leq 1\}\subset\mathbb{C}$ . Assuming (BSCT) holds, up to passing to a subsequence, $(G_{N},W_{N},K_{N})$ will have a local weak limit $\hat{\operatorname{\mathbb{P}}}$ concentrated on $\{[\mathcal{T},o,\mathcal{W},\mathcal{K}]\}$ whose marginals on $\mathscr{T}_{\ast}^{D,A}$ coincides with $\operatorname{\mathbb{P}}$ . We then deduce as before that uniformly in $\lambda\in I_{0}$ ,

[TABLE]

Acknowledgements : This material is based upon work supported by the Agence Nationale de la Recherche under grant No.ANR-13-BS01-0007-01, by the Labex IRMIA and the Institute of Advance Study of Université de Strasbourg, and by Institut Universitaire de France.

Bibliography48

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Abért, A. Thom and B. Virág, Benjamini-Schramm convergence and pointwise convergence of the spectral measure , author homepage.
2[2] A. De Luca, B. L. Altshuler, V. E. Kravtsov, and A. Scardicchio, Anderson Localization on the Bethe Lattice: Nonergodicity of Extended States , Phys. Rev. Lett. 113 (2014) 046806.
3[3] A. De Luca, B. L. Altshuler, V. E. Kravtsov, and A. Scardicchio, Support set of random wave-functions on the Bethe lattice , ar Xiv 2013.
4[4] M. Aizenman, S. Warzel, Absolutely continuous spectrum implies ballistic transport for quantum particles in a random potential on tree graphs , J. Math. Phys. 53 (2012) 095205, 15.
5[5] M. Aizenman, M. Shamis and S. Warzel, Resonances and partial delocalization on the complete graph , Ann. Henri Poincaré 16 (2015), 1969–2003.
6[6] D. Aldous, R. Lyons, Processes on unimodular random networks , Electron. J. Probab. 12 (2007) 1454–1508.
7[7] N. Anantharaman, Quantum ergodicity on regular graphs . To appear in Comm. Math. Phys.
8[8] N. Anantharaman, Some relations between the spectra of simple and non-backtracking random walks , preprint ar Xiv:1703.03852 (2017).

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Quantum Ergodicity on Graphs : from Spectral to Spatial Delocalization

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

1.1. The problem

1.2. The results

Theorem 1.1**.**

Corollary 1.2**.**

Theorem 1.3**.**

1.3. Understanding the weighted averages.

Corollary 1.4**.**

1.4. Case of the Anderson model

Proposition 1.5**.**

Proposition 1.6**.**

Corollary 1.7**.**

Theorem 1.8**.**

1.5. Relation with previous work

1.6. Outline of the proof

2. Basic identities

2.1. “Quantization procedure” on trees and their quotients

Remark 2.1**.**

2.2. Green functions on trees

Lemma 2.2**.**

Corollary 2.3**.**

Proof.

Proposition 2.4**.**

Proof.

3. The non-backtracking quantum variance

Remark 3.1**.**

Definition 3.2**.**

Theorem 3.3**.**

4. Step 1 : Bound on the non-backtracking quantum variance

Theorem 4.1**.**

Lemma 4.2**.**

Proof.

Lemma 4.3**.**

Proof.

Lemma 4.4**.**

Proof.

Lemma 4.5**.**

Proof.

5. Step 2 : Invariance property of the quantum variance

Lemma 5.1**.**

Proof.

Proposition 5.2**.**

6. Step 3 : A stationary Markov chain appears

Remark 6.1**.**

7. Spectral gap and mixing

Proposition 7.1**.**

Proposition 7.2**.**

Proof of Proposition 7.1.

Remark 7.3**.**

Remark 7.4**.**

Proof of Proposition 7.2.

Proposition 7.5**.**

Proof.

Proposition 7.6**.**

Proof.

Proposition 7.7**.**

Corollary 7.8**.**

Proof of Corollary 7.8.

Proof of Proposition 7.7.

Remark 7.9**.**

8. Transition matrices with phases

Proposition 8.1**.**

Proof.

9. Step 4 : End of the proof of Theorem 3.3

Remark 9.1**.**

Proposition 9.2**.**

Proof.

Lemma 9.3**.**

Proof.

Proposition 9.4**.**

Theorem 1.1.

Corollary 1.2.

Theorem 1.3.

Corollary 1.4.

Proposition 1.5.

Proposition 1.6.

Corollary 1.7.

Theorem 1.8.

Remark 2.1.

Lemma 2.2.

Corollary 2.3.

Proposition 2.4.

Remark 3.1.

Definition 3.2.

Theorem 3.3.

Theorem 4.1.

Lemma 4.2.

Lemma 4.3.

Lemma 4.4.

Lemma 4.5.

Lemma 5.1.

Proposition 5.2.

Remark 6.1.

Proposition 7.1.

Proposition 7.2.

Remark 7.3.

Remark 7.4.

Proposition 7.5.

Proposition 7.6.

Proposition 7.7.

Corollary 7.8.

Remark 7.9.

Proposition 8.1.

Remark 9.1.

Proposition 9.2.

Lemma 9.3.

Proposition 9.4.

Proposition 9.5.

Proposition 10.1.

Proposition 10.2.

Corollary 10.3.

Remark 10.4.

Proposition 10.5.

Corollary 10.6.

Lemma A.1.

Corollary A.2.

Remark A.3.

Remark A.4.

Remark A.5.