Tail bounds for gaps between eigenvalues of sparse random matrices

Patrick Lopatto; Kyle Luh

arXiv:1901.05948·math.PR·December 21, 2020

TL;DR

This paper establishes the first eigenvalue repulsion bounds for sparse random matrices, demonstrating simple spectra and applying these results to Erdős–Rényi graphs to unify weak and strong nodal domains.

Contribution

It introduces novel eigenvalue gap bounds for sparse matrices, extending previous work and improving sparsity and error probability ranges.

Findings

01

Sparse matrices have simple spectra due to eigenvalue repulsion.

02

Eigenvalue tail bounds are established for sparse random matrices.

03

Weak and strong nodal domains coincide in sparse Erdős–Rényi graphs.

Abstract

We prove the first eigenvalue repulsion bound for sparse random matrices. As a consequence, we show that these matrices have simple spectrum, improving the range of sparsity and error probability from the work of the second author and Vu. As an application of our tail bounds, we show that for sparse Erdős–Rényi graphs, weak and strong nodal domains are the same, answering a question of Dekel, Lee, and Linial.

Equations

$$\mathbb{P}\left(\delta_{\mathrm{min}}\leq\frac{\delta}{n^{1/2}}\right)=o(n\delta^{3})+\exp(-cn)$$

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\frac{\delta}{n^{1/2}}\right)=O\left(\frac{\delta}{\alpha^{1/2}}\right).$$

$$m_{ij}=\xi_{ij}\chi_{ij},$$

$$\frac{C_{2.2}\log^{7+\nu}n}{n}\leq p\leq 1$$

$$(np)^{-1/(7+\nu)}\leq\alpha\leq\frac{c^{\prime}_{2.2}}{\log n},$$

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\delta\exp\left(-c_{2.2}\frac{\log(1/p)}{\log np}\right)\sqrt{\frac{p}{n}}\right)\leq C_{2.2}\frac{\delta}{\sqrt{\alpha}}.$$

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\delta\exp\left(-c_{2.2}\frac{\log(1/p)}{\log np}\right)\sqrt{\frac{p}{n}}\right)\leq C_{2.3}\delta\sqrt{\log n}.$$

$$\mathbb{P}\left(\delta_{\mathrm{min}}\leq\frac{\sqrt{p}}{n^{3/2+o(1)}}\right)=o(1).$$

$$\mathbb{P}(M_{n}\text{ has eigenvalues with multiplicity})\leq\exp\left(-\frac{1}{2}(np)^{1/(7+\nu)}\right).$$

$$\frac{C_{2.2}\log^{7+\nu}n}{n}\leq p\leq 1-\frac{C_{2.2}\log^{7+\nu}n}{n}$$

$$(np)^{-1/(7+\nu)}\leq\alpha\leq\frac{c^{\prime}_{2.2}}{\log n},$$

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\delta\exp\left(-c_{2.2}\frac{\log(1/p)}{\log np}\right)\sqrt{\frac{p}{n}}\right)\leq\frac{\delta}{\sqrt{\alpha}}$$

$$\frac{C_{2.2}\log^{7+\nu}n}{n}\leq p\leq 1-\frac{C_{2.2}\log^{7+\nu}n}{n},$$

$$p\geq\frac{C_{2.2}\log^{7+\nu}n}{n}$$

$$M_{n}=\begin{pmatrix}M_{n-1}&X^{T}\\ X&m_{nn}\end{pmatrix},$$

$$\begin{pmatrix}M_{n-1}&X^{T}\\ X&m_{nn}\end{pmatrix}\begin{pmatrix}x\\ a\end{pmatrix}=\lambda_{i}(M_{n})\begin{pmatrix}x\\ a\end{pmatrix}.$$

$$(M_{n-1}-\lambda_{i}(M_{n}))x+aX=0.$$

$$|aw^{T}X|=|w^{T}(M_{n-1}-\lambda_{i}(M_{n}))x|=|\lambda_{i}(M_{n-1})-\lambda_{i}(M_{n})|\,|w^{T}x|.$$

$$\lambda_{i}\in[-K\sqrt{pn},K\sqrt{pn}]$$

$$\lambda_{i+1}-\lambda_{i}\leq\hat{\delta}\sqrt{\frac{p}{n}}.$$

$$|w^{T}X|\leq\hat{\delta}\sqrt{p}.$$

$$\operatorname{Sparse}(m)=\{x\in\mathbb{R}^{n}:|\operatorname{supp}(x)|\leq m\}.$$

$$\operatorname{Comp}(m,\delta)=\{x\in\mathbb{S}^{n-1}:\exists\,y\in\operatorname{Sparse}(m)\text{ such that }\|x-y\|_{2}\leq\delta\},$$

$$\operatorname{Incomp}(m,\delta)=\{x\in\mathbb{S}^{n-1}:x\notin\operatorname{Comp}(m,\delta)\}.$$

$$x_{[m:m^{\prime}]}(j)=x_{j}\cdot\mathbf{1}_{[m:m^{\prime}]}(\pi_{x}(j)).$$

$$\operatorname{Dom}(m,c)=\{x\in\mathbb{S}^{n-1}:\|x_{[m+1:n]}\|_{2}\leq c\sqrt{m}\,\|x_{[m+1:n]}\|_{\infty}\}.$$

$$\mathbb{P}(\|M_{n}\|\geq K\sqrt{pn})\leq\exp(-c_{4.3}pn).$$

$$\ell_{0}=\left\lceil\frac{\log(1/(8p))}{\log\sqrt{pn}}\right\rceil,\qquad\rho=(\bar{C}_{4.4})^{-\ell_{0}-6}.$$

$$p\geq\frac{C_{4.4}\log n}{n},\quad p^{-1}\leq m\leq c_{4.4}n,\quad\text{and}\quad\lambda\in[-K\sqrt{pn},K\sqrt{pn}],$$

$$\|(M_{n}-\lambda)x\|_{2}\geq c_{4.4}\rho\sqrt{pn}$$
Full text

Abstract.

We prove the first eigenvalue repulsion bound for sparse random matrices. As a consequence, we show that these matrices have simple spectrum, improving the range of sparsity and error probability from work of the second author and Vu. We also show that for sparse Erdős–Rényi graphs, weak and strong nodal domains are the same, answering a question of Dekel, Lee, and Linial.

P.L. is partially supported by the NSF Graduate Research Fellowship Program under grant DGE-1144152.

K. Luh was partially supported by NSF postdoctoral fellowship DMS-1702533.

1. Introduction

The gaps between eigenvalues of symmetric random matrices have been extensively studied by mathematicians and physicists. For the classical integrable ensembles, the Gaussian Orthogonal Ensemble and Gaussian Unitary Ensemble, the limiting spectral distribution follows the semicircle law. For an individual eigenvalue gap, however, the limiting distribution was only recently obtained [60]. Rapid progress in random matrix theory has permitted the extension of this result to a large class of random matrix models [61, 30, 57, 11, 25, 26, 27, 28, 69, 58, 59, 3, 2, 37, 17, 19, 18, 51, 16].

Much effort has been expended on understanding the extremal eigenvalue gaps, in particular the largest eigenvalue gap in the bulk of the spectrum, $\delta_{\mathrm{max}}$. Ben Arous and Bourgade [12] demonstrated that for the $n\times n$ GUE normalized so that its spectrum is supported on $[-2,2]$, which makes the typical inter-particle distance in the bulk about $n^{-1}$, the largest bulk gap is of order $n^{-1}\sqrt{\log n}$. Figalli and Guionnet extended this result to $\beta$-ensembles with $\beta=2$ [34]. In [32], Feng and Wei showed that the fluctuations of the largest gap are of order $n^{-1}\sqrt{\log n}$ and computed the limiting distribution. In work of the first author with Landon and Marcinek, the largest gap results of [12, 32] were extended to generalized Wigner matrices [38], including those with discrete entry distributions. We note that recent work of Bourgade [14], which presents a concise analysis of the convergence to equilibrium of Dyson Brownian motion, is able to recover the same result at the cost of imposing a weak smoothness assumption on the matrix entries.

While we now have a substantial understanding of the largest eigenvalue gap, the smallest gap, $\delta_{\mathrm{min}}$, is more difficult to investigate because it lies well below the typical inter-particle distance. Ben Arous and Bourgade [12] showed using the determinantal structure of the GUE that its smallest gap is of order $n^{-4/3}$. In [31], Feng, Tian, and Wei identified the normalized limit of the smallest eigenvalue gap of the GOE and found that the gap is of order $n^{-3/2}$; their argument builds on techniques previously developed by Feng and Wei to study circular $\beta$-ensembles [33]. Currently, the smallest gap lies outside of the purview of traditional universality results such as the Four Moment Theorem [62], and the techniques in the recent work [38] are not applicable. The strongest available result is in the recent work of Bourgade [14], which shows universality of the smallest gap, but requires that the matrix entries possess a weak form of smoothness. At present, no universality results exist for the smallest gap for matrices that are sparse or have discrete entry distributions, such as a matrix of Bernoulli random variables.

While tail bounds are known for the individual gaps when the matrix entries are more general random variables [61, 58], the error rates are not strong enough to take a union bound to conclude anything about the minimum gap. We now scale the matrices so that their spectrum lies on $[-2\sqrt{n},2\sqrt{n}]$ , which makes the average inter-particle distance $n^{-1/2}$ ; we take this convention to match the existing tail bound literature, and it remains in force throughout the rest of the paper. For Hermitian matrices, under stringent smoothness and decay assumptions on the random variables, a result of Erdős, Schlein, and Yau [29] implies that there exists a small constant $c>0$ such that

$$\mathbb{P}\left(\delta_{\mathrm{min}}\leq\frac{\delta}{n^{1/2}}\right)=o(n\delta^{3})+\exp(-cn)$$

for any $\delta>0$ . For discrete random variables, it was a milestone just to show that $\delta_{\mathrm{min}}>0$ [63]. In particular, Tao and Vu showed that for any $A>0$ , with probability at least $1-n^{-A}$ a random symmetric matrix has simple spectrum, meaning every eigenvalue appears with multiplicity one. In follow-up work with Nguyen [48], they showed the following tail bound for the eigenvalue gaps. Given eigenvalues $\lambda_{i}$ labeled in ascending order, we denote the gaps by $\delta_{i}=\lambda_{i+1}-\lambda_{i}$ .

Theorem 1.1 ([48, Theorem 2.1]).

There exists a constant $c>0$ such that the following holds for the eigenvalue gaps, $\delta_{i}$ , of a real symmetric Wigner matrix. For any $n^{-c}\leq\alpha\leq c$ and $\delta\geq n^{-c/\alpha}$ ,

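$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\frac{\delta}{n^{1/2}}\right)=O\left(\frac{\delta}{\alpha^{1/2}}\right).$$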

Setting $\alpha=n^{-c}$ , one can deduce that a real symmetric random matrix has simple spectrum with probability at least $1-O(\exp(-n^{c}))$ . A related problem, posed by Babai, is whether the adjacency matrix of an Erdős–Rényi random graph has simple spectrum. This was resolved affirmatively for all dense random graphs in [63, 48]. A consequence in complexity theory is that for such random graphs the graph isomorphism problem is in complexity class $\mathcal{P}$ [6].

In this work we study the eigenvalue gaps of sparse random matrices. The theory of sparse random matrices is of interest in its own right, but it also has many applications in computer science and statistics. In contexts where sparse random matrices have spectral guarantees similar to those of their dense counterparts, they offer significant advantages as they require less space to store, allow quicker multiplication, and need fewer random bits to generate [8, 7, 5, 22, 47, 21]. A popular model for such matrices is to consider the Hadamard (entrywise) product of a dense random matrix and a sparse matrix of independent (up to symmetry) indicator variables with expectation $p=p(n)$. Much work has been done to transfer the results known for dense random matrices to the sparse setting [9, 10, 15, 39, 37, 42, 68, 55]. Although the results resemble their dense analogues, the sparsity brings about a variety of complications in the proofs. Only recently, the second author and Vu showed that for a large class of random variables and for $p\geq n^{-1+{\varepsilon}}$ with ${\varepsilon}>0$, a sparse random matrix has simple spectrum with probability at least $1-O_{{\varepsilon}}(\exp(-(np)^{1/128}))$ [43], where this notation indicates that the implied constant depends on ${\varepsilon}$. This implies that the graph isomorphism problem restricted to this class of sparse random graphs is in complexity class $\mathcal{P}$.

Our main contribution is to go beyond verifying such matrices have simple spectrum and prove a tail bound for the minimal eigenvalue gap of sparse random matrices with $p\geq C\log^{7+{\varepsilon}}(n)/n$ . In comparison with [43], our results represent an improvement in both error probability and the range of sparsity considered. As an application of our tail bound, we show that for sparse Erdős–Rényi graphs, weak and strong nodal domains are the same, answering a question of Dekel, Lee, and Linial [24]. Our results also expand the range of sparse graphs for which the graph isomorphism problem is known to be in $\mathcal{P}$ . Related to this last application is the graph matching problem, for which various algorithms contingent on simple spectrum are known [65, 44, 1]; our results similarly extend their range of applicability.

Acknowledgments. The authors thank the anonymous referees for their detailed comments, which substantially improved the paper.

2. Main Results

We begin with a formal definition of our random matrix model.

Definition 2.1.

We let $M_{n}$ denote a symmetric random matrix with entries

$$m_{ij}=\xi_{ij}\chi_{ij},$$

where the $\xi_{ij}$ are independent (for $i\geq j$ ), mean zero, variance one, and subgaussian with subgaussian moment $B$ , and the $\chi_{ij}$ are independent (for $i\geq j$ ) Bernoulli random variables with $\mathbb{E}\chi_{ij}=p$ .
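As a concrete illustration of Definition 2.1, the following minimal sketch samples one matrix from the model. It assumes Rademacher (uniform $\pm 1$) variables for the $\xi_{ij}$, which are mean zero, variance one, and subgaussian; the function name `sample_sparse_symmetric` and the parameter values are hypothetical choices for the example only.

```python
import random


def sample_sparse_symmetric(n, p, seed=0):
    """Sample M_n with m_ij = xi_ij * chi_ij for i >= j, mirrored to i < j.

    Assumption for illustration: xi_ij is Rademacher (uniform on {-1, +1}).
    """
    rng = random.Random(seed)
    M = [[0.0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1):
            xi = rng.choice((-1.0, 1.0))            # mean 0, variance 1
            chi = 1.0 if rng.random() < p else 0.0  # Bernoulli(p) sparsity mask
            M[i][j] = M[j][i] = xi * chi            # symmetry: m_ij = m_ji
    return M


M = sample_sparse_symmetric(n=200, p=0.05)
density = sum(1 for row in M for entry in row if entry != 0.0) / 200 ** 2
print(f"fraction of nonzero entries: {density:.3f}")
```

The fraction of nonzero entries concentrates around $p$, so each row carries roughly $pn$ nonzero entries on average.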

Theorem 2.2.

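Let $M_{n}$ be as in Definition 2.1, and fix $\nu>0$. There exist constants $C_{2.2},c_{2.2},c^{\prime}_{2.2}>0$, depending only on the subgaussian moment $B$, such that for

$$\frac{C_{2.2}\log^{7+\nu}n}{n}\leq p\leq 1$$

and

$$(np)^{-1/(7+\nu)}\leq\alpha\leq\frac{c^{\prime}_{2.2}}{\log n},$$

the following holds for the gaps between the eigenvalues, $\delta_{i}=\lambda_{i+1}-\lambda_{i}$. For any $\delta\geq\exp(-\alpha^{-1})$,

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\delta\exp\left(-c_{2.2}\frac{\log(1/p)}{\log np}\right)\sqrt{\frac{p}{n}}\right)\leq C_{2.2}\frac{\delta}{\sqrt{\alpha}}.$$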

Observe that there is a trade-off between the strength of the error bound and the size of the eigenvalue gap, determined by the value of $\alpha$. For example, if we choose $\alpha=c^{\prime}_{2.2}/\log n$, the largest value permitted in Theorem 2.2, we obtain the following result.

Corollary 2.3.

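Let $M_{n}$ be as in Definition 2.1, and fix $\nu>0$. There exist $C_{2.3},C_{2.3}^{\prime}>1$ such that for $p\geq\frac{C_{2.2}\log^{7+\nu}n}{n}$,

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\delta\exp\left(-c_{2.2}\frac{\log(1/p)}{\log np}\right)\sqrt{\frac{p}{n}}\right)\leq C_{2.3}\delta\sqrt{\log n}$$

for $\delta\geq n^{-C_{2.3}^{\prime}}$. By a union bound,

$$\mathbb{P}\left(\delta_{\mathrm{min}}\leq\frac{\sqrt{p}}{n^{3/2+o(1)}}\right)=o(1).$$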

At the other extreme, setting $\alpha=(np)^{-1/(7+\nu)}$ and $\delta=\exp(-\alpha^{-1})$ , we have the following result.

Corollary 2.4.

Let $M_{n}$ be as in Definition 2.1, and fix $\nu>0$. For $p\geq\frac{C_{2.2}\log^{7+\nu}n}{n}$,

$$\mathbb{P}(M_{n}\text{ has eigenvalues with multiplicity})\leq\exp\left(-\frac{1}{2}(np)^{1/(7+\nu)}\right).$$

Observe that when $p=1$ , which is the dense case considered in [48], the above two corollaries recover [48, Corollary 2.2] and [48, Corollary 2.3], which are the analogous extreme cases of the bound in [48, Theorem 2.1].

Remark 2.5.

This result improves the range of sparsity in [43] from $p\geq n^{-1+{\varepsilon}}$ for some ${\varepsilon}>0$ to $p\geq C_{2.2}\log^{7+\nu}n/n$. Even in the regime $p\geq n^{-1+{\varepsilon}}$, our result improves on the bound in [43], where the probability of not having simple spectrum was bounded by $\exp(-(np)^{1/128})$. However, we suspect that the optimal bound should be $\exp(-cnp)$ for some constant $c>0$. The sparsity range of Theorem 2.2 is near optimal, since $p=o(\log n/n)$ yields multiple rows and columns consisting entirely of zeros, which generates a repeated eigenvalue at 0.
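The near-optimality claim can be checked with a short calculation. A given row of the sparsity mask is entirely zero with probability $(1-p)^{n}$, since it contains $n$ independent Bernoulli($p$) variables, so the expected number of zero rows is $n(1-p)^{n}\approx n e^{-pn}$. The sketch below evaluates this for $p=c\log n/n$; the helper name `expected_zero_rows` and the sample values of $c$ and $n$ are illustrative assumptions.

```python
import math


def expected_zero_rows(n, p):
    """Expected number of all-zero rows of the Bernoulli(p) mask (chi_ij)."""
    # Row i involves n distinct independent mask variables, so
    # P(row i is zero) = (1 - p)^n, and linearity of expectation gives n (1 - p)^n.
    return n * (1.0 - p) ** n


n = 10_000
for c in (0.5, 1.0, 3.0):
    p = c * math.log(n) / n
    print(f"p = {c} * log(n)/n: expected zero rows = {expected_zero_rows(n, p):.3g}")
```

For $c<1$ the expectation grows like $n^{1-c}$, while for $c>1$ it decays polynomially, which is why $\log n/n$ is the natural threshold for simple spectrum.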

We also have the same result for adjacency matrices of random Erdős–Rényi graphs. Let $G(n,p)$ denote the random graph on $n$ vertices with edges appearing independently and with probability $p$ .

Theorem 2.6.

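Let $A_{n}$ be the adjacency matrix of the random Erdős–Rényi graph $G(n,p)$, and fix $\nu>0$. There exist constants $C_{2.2},c_{2.2},c^{\prime}_{2.2}>0$, depending only on the subgaussian moment $B$, such that for

$$\frac{C_{2.2}\log^{7+\nu}n}{n}\leq p\leq 1-\frac{C_{2.2}\log^{7+\nu}n}{n}$$

and

$$(np)^{-1/(7+\nu)}\leq\alpha\leq\frac{c^{\prime}_{2.2}}{\log n},$$

the following holds for the gaps between the eigenvalues, $\delta_{i}=\lambda_{i+1}-\lambda_{i}$. For any $\delta\geq\exp(-\alpha^{-1})$,

$$\sup_{1\leq i\leq n-1}\mathbb{P}\left(\delta_{i}\leq\delta\exp\left(-c_{2.2}\frac{\log(1/p)}{\log np}\right)\sqrt{\frac{p}{n}}\right)\leq\frac{\delta}{\sqrt{\alpha}}$$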

Remark 2.7.

Note that an upper bound on $p$ is necessary in this case as $p=1$ generates a deterministic matrix with repeated eigenvalues. Additionally, our argument can be easily applied to random perturbations of a finite rank matrix; see Remark 6.2. However, for perturbations of an arbitrary matrix, new ideas are needed as many of the delicate net arguments cannot be adapted when the operator norm of the perturbed matrix is large. For dense random graphs, this was done in [48, Theorem 2.6].

2.1. Non-degeneration of Eigenvectors and Nodal Domains of a Random Graph

Consider the eigenfunctions of the Laplacian on a Riemannian manifold. The zero sets of these eigenfunctions partition the space into so-called nodal domains. These domains are of great interest to geometers and have been intensively studied (see [20, 46, 40] and the references therein). Here we consider a discrete analogue, the nodal domains of eigenvectors for adjacency matrices of random graphs, which has its roots in graph theory and has recently found uses in data science [35, 23, 24]. Given an eigenvector $u$ of an adjacency matrix $A$, we call a subset $D$ of the vertices a weak nodal domain if it is connected, $u(x)u(y)\geq 0$ for $x,y\in D$, and $D$ is a maximal subset under these two conditions. A strong nodal domain is defined similarly using the strict inequality $u(x)u(y)>0$. Dekel, Lee, and Linial conjectured that the notions of strong and weak domains are equivalent for random graphs [24], and this was shown for $G(n,p)$ with constant $p$ in [48]. A consequence of the following non-degeneration result is that we are able to resolve this conjecture for $p\geq C_{2.2}\log^{7+\nu}(n)/n$.
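The two notions of nodal domain differ only when $u$ has zero coordinates, which is what the non-degeneration result rules out. The sketch below computes both kinds of domain for a vector on a small graph given as an adjacency list. The function names are hypothetical, and the implementation relies on the observation that a set with $u(x)u(y)\geq 0$ for all pairs must lie entirely in $\{u\geq 0\}$ or entirely in $\{u\leq 0\}$.

```python
from collections import deque


def components(adj, keep):
    """Connected components of the subgraph induced on vertices v with keep[v] True."""
    seen, comps = set(), []
    for start in range(len(adj)):
        if not keep[start] or start in seen:
            continue
        comp, queue = {start}, deque([start])
        seen.add(start)
        while queue:
            v = queue.popleft()
            for w in adj[v]:
                if keep[w] and w not in seen:
                    seen.add(w)
                    comp.add(w)
                    queue.append(w)
        comps.append(frozenset(comp))
    return comps


def strong_domains(adj, u):
    """Maximal connected sets on which u is strictly positive or strictly negative."""
    return set(components(adj, [t > 0 for t in u]) +
               components(adj, [t < 0 for t in u]))


def weak_domains(adj, u):
    """Maximal connected sets on which u is nonnegative or nonpositive."""
    cands = set(components(adj, [t >= 0 for t in u]) +
                components(adj, [t <= 0 for t in u]))
    # Discard candidates strictly contained in another (e.g. an all-zero block).
    return {d for d in cands if not any(d < e for e in cands)}


path = [[1], [0, 2], [1]]            # path graph 0 - 1 - 2
print(strong_domains(path, [1.0, 0.0, -1.0]))
print(weak_domains(path, [1.0, 0.0, -1.0]))
```

On the path graph with $u=(1,0,-1)$ the strong domains are $\{0\}$ and $\{2\}$ while the weak domains are $\{0,1\}$ and $\{1,2\}$; once no coordinate vanishes, the two collections coincide.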

Theorem 2.8.

Let $A_{n}$ be the adjacency matrix of the random graph $G(n,p)$ , and fix $\nu>0$ . For any $D>0$ , there exists a $C=C(D)>0$ such that for

$$\frac{C_{2.2}\log^{7+\nu}n}{n}\leq p\leq 1-\frac{C_{2.2}\log^{7+\nu}n}{n},$$

the probability that there exists an eigenvector $v=(v_{1},\dots,v_{n})$ of $A_{n}$ with $|v_{i}|\leq n^{-C}$ for some $i$ is at most $Cn^{-D}$ .

Theorem 2.8 provides a quantitative lower bound on the mass of the eigenvector components, complementing the vast literature on eigenvector delocalization, which provides upper bounds (see [50, Section 4] and [13]).

Corollary 2.9.

For any $D>0$ , there exists $C=C(D)>0$ such that with probability at least $1-Cn^{-D}$ , the strong and weak nodal domains of $G(n,p)$ are the same.

Arora and Bhaskara [4] showed that for random graphs $G(n,p)$ with $p\geq n^{-c}$, where $c$ is a constant that may be determined explicitly (see Footnote 1), all non-first eigenvectors of the adjacency matrix $A_{n}$ of $G(n,p)$ have exactly two weak nodal domains with high probability. Recall that since the adjacency matrix is not centered, the eigenvector corresponding to the largest eigenvalue behaves differently, tending to align itself with the all ones vector [45]. Combining this result with our previous corollary yields the following simple statement.

Footnote 1. The authors give an exact value. However, the published version of an eigenvector delocalization estimate used to prove the result differs slightly from the version given in [4], where it is cited by the authors in pre-publication form. The value of the constant should be adjusted in light of this.

Corollary 2.10.

There exists $c>0$ such that the following holds. For any $D>0$ and $p\geq n^{-c}$ , there exists $C=C(D)>0$ such that with probability at least $1-Cn^{-D}$ , each eigenvector of $G(n,p)$ (except the first) has exactly two strong nodal domains which partition the vertices.

An identical non-degeneration result applies to matrices $M_{n}$ defined in Definition 2.1.

Theorem 2.11.

Fix $\nu>0$ . For any $D>0$ , there exists a $C=C(D)>0$ such that for

$$p\geq\frac{C_{2.2}\log^{7+\nu}n}{n}$$

the probability that there exists an eigenvector $v=(v_{1},\dots,v_{n})$ of $M_{n}$ with $|v_{i}|\leq n^{-C}$ for some $i$ is at most $Cn^{-D}$ .

Remark 2.12.

Theorems 2.8 and 2.11 represent specific examples of a range of possible results. Specifically, varying $\alpha$ in Theorem 2.2 can lead to trade-offs in the size of the entries and the strength of the probability bound. We have chosen to give a simple polynomial bound on the size and probability for the sake of simplifying the presentation.

We also remark that nodal domains were studied in the recent work [36], which showed that there exists a constant $c>0$ such that for $p\geq n^{-c}$ the two nodal domains identified in [4] are balanced, meaning they each contain close to $n/2$ vertices with high probability. Further, [54] shows that, with high probability, any vertex is connected to some vertex in the other domain.

The remainder of the paper is organized as follows. In Section 3, we outline the key steps and intuition for the proof of Theorem 2.2. In Sections 4 and 5, we prove several preliminary results about eigenvectors of sparse random matrices. In Section 6.1, we provide the proof of Theorem 2.2. In Section 6.2 we provide the necessary modifications to extend Theorem 2.2 to non-centered random matrices, such as the adjacency matrices of Erdős–Rényi graphs, proving Theorem 2.6. Finally, in Section 6.3, we prove Theorem 2.8.

3. Proof Strategy

The proof follows the same broad outline as [43]. For $M_{n}$ as in Definition 2.1, we decompose the matrix as

$$M_{n}=\begin{pmatrix}M_{n-1}&X^{T}\\ X&m_{nn}\end{pmatrix},$$

where $X=[x_{1},\dots,x_{n-1}]\in\mathbb{R}^{1\times(n-1)}$ . For a matrix $W$ , let ${\lambda_{n}(W)\geq\dots\geq\lambda_{1}(W)}$ be the eigenvalues of $W$ . Fix an integer $i$ such that $1\leq i\leq n$ and let $v=(x,a)$ (where $x\in\mathbb{R}^{n-1}$ and $a\in\mathbb{R}$ ) be the unit eigenvector associated to $\lambda_{i}(M_{n})$ . By definition we have

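$$\begin{pmatrix}M_{n-1}&X^{T}\\ X&m_{nn}\end{pmatrix}\begin{pmatrix}x\\ a\end{pmatrix}=\lambda_{i}(M_{n})\begin{pmatrix}x\\ a\end{pmatrix}.$$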

For the top $n-1$ coordinates this gives (writing $\lambda_{i}(M_{n})$ for $\lambda_{i}(M_{n})\operatorname{Id}$ )

$$(M_{n-1}-\lambda_{i}(M_{n}))x+aX=0.$$

Let $w$ be the eigenvector of $M_{n-1}$ corresponding to $\lambda_{i}(M_{n-1})$ . Multiplying on the left by $w^{T}$ , we obtain

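$$|aw^{T}X|=|w^{T}(M_{n-1}-\lambda_{i}(M_{n}))x|=|\lambda_{i}(M_{n-1})-\lambda_{i}(M_{n})|\,|w^{T}x|.\tag{3.2}$$

By the Cauchy interlacing theorem, we have $\lambda_{i}(M_{n})\leq\lambda_{i}(M_{n-1})\leq\lambda_{i+1}(M_{n})$, so that $|\lambda_{i}(M_{n-1})-\lambda_{i}(M_{n})|\leq\lambda_{i+1}(M_{n})-\lambda_{i}(M_{n})$.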
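The identity relating $a\,w^{T}X$ to the eigenvalue difference, together with the interlacing inequalities, can be sanity-checked numerically on a small sample. The sketch below is only a check under stated assumptions: it uses Rademacher entries, zero-based indices, and a textbook cyclic Jacobi rotation routine (`jacobi_eigh`, a hypothetical helper written here to avoid external dependencies, not a method from the paper).

```python
import math
import random


def jacobi_eigh(A, sweeps=100):
    """Eigenvalues (ascending) and eigenvectors of a symmetric matrix, cyclic Jacobi."""
    n = len(A)
    A = [row[:] for row in A]
    V = [[float(i == j) for j in range(n)] for i in range(n)]
    for _ in range(sweeps):
        off = sum(A[i][j] ** 2 for i in range(n) for j in range(n) if i != j)
        if off < 1e-24:
            break
        for p in range(n - 1):
            for q in range(p + 1, n):
                if A[p][q] == 0.0:
                    continue
                diff = A[q][q] - A[p][p]
                # Rotation angle with tan(2 theta) = 2 a_pq / (a_qq - a_pp).
                theta = math.pi / 4 if diff == 0.0 else 0.5 * math.atan(2 * A[p][q] / diff)
                c, s = math.cos(theta), math.sin(theta)
                for k in range(n):  # A <- A J (columns p, q)
                    akp, akq = A[k][p], A[k][q]
                    A[k][p], A[k][q] = c * akp - s * akq, s * akp + c * akq
                for k in range(n):  # A <- J^T A (rows p, q)
                    apk, aqk = A[p][k], A[q][k]
                    A[p][k], A[q][k] = c * apk - s * aqk, s * apk + c * aqk
                for k in range(n):  # V <- V J accumulates the eigenvectors
                    vkp, vkq = V[k][p], V[k][q]
                    V[k][p], V[k][q] = c * vkp - s * vkq, s * vkp + c * vkq
    order = sorted(range(n), key=lambda j: A[j][j])
    return [A[j][j] for j in order], [[V[k][j] for k in range(n)] for j in order]


def dot(u, v):
    return sum(s * t for s, t in zip(u, v))


rng = random.Random(1)
n, p = 6, 0.6
M = [[0.0] * n for _ in range(n)]
for i in range(n):
    for j in range(i + 1):
        M[i][j] = M[j][i] = rng.choice((-1.0, 1.0)) * (rng.random() < p)

minor = [row[:-1] for row in M[:-1]]   # M_{n-1}
X = M[-1][:-1]                         # last row without the corner entry m_nn
vals_n, vecs_n = jacobi_eigh(M)
vals_m, vecs_m = jacobi_eigh(minor)

i = 2                                  # zero-based eigenvalue index
v, w = vecs_n[i], vecs_m[i]
x, a = v[:-1], v[-1]
lhs = abs(a * dot(w, X))
rhs = abs(vals_m[i] - vals_n[i]) * abs(dot(w, x))
interlaced = all(vals_n[k] - 1e-9 <= vals_m[k] <= vals_n[k + 1] + 1e-9
                 for k in range(n - 1))
print(abs(lhs - rhs), interlaced)
```

The two sides agree up to rounding error, and the eigenvalues of the minor sit between consecutive eigenvalues of the full matrix, which is what bounds $|a\,w^{T}X|$ by the gap $\lambda_{i+1}(M_{n})-\lambda_{i}(M_{n})$.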

Since the entries of $M_{n}$ are subgaussian, we have with high probability that

$$\lambda_{i}\in[-K\sqrt{pn},K\sqrt{pn}]$$

for some constant $K$ that depends only on the subgaussian moment $B$ of the entries. Therefore, the average size of an eigenvalue gap is roughly $O\left(\frac{\sqrt{pn}}{n}\right)=O\left(\sqrt{\frac{p}{n}}\right).$ For any $\hat{\delta}>0$ , let $\mathcal{E}_{i}=\mathcal{E}_{i}\left(\hat{\delta}\right)$ denote the event that

$$\lambda_{i+1}-\lambda_{i}\leq\hat{\delta}\sqrt{\frac{p}{n}}.$$

We also let $\mathcal{G}_{i}$ be the intersection of the event $\mathcal{E}_{i}$ with the event that the eigenvector $v=(x,a)$ with eigenvalue $\lambda_{i}$ has $|a|\geq n^{-1/2}$ . Therefore, by (3.2) and using $|w^{T}x|\leq 1$ , on the event $\mathcal{G}_{i}$ , we have

$$|w^{T}X|\leq\hat{\delta}\sqrt{p}.$$

We wish to show this is unlikely.

Recall that the theory of small ball probability (e.g. [49]) examines the probability that a random variable takes values in a small interval. Therefore, we have reduced the problem to understanding the small ball probability of the inner product of a random vector with the eigenvector $w$. It is known that this small ball probability is related to the amount of “disorder” in the coordinates of the eigenvector. Broadly speaking, a large amount of disorder implies the small ball probability is small. When $w$ has high disorder, we apply these results directly. To exclude all eigenvectors with low disorder, we employ a covering argument, varying our approach according to the structure of the eigenvector.
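The link between disorder and small ball probability can be seen in a toy computation. The sketch below computes $\mathbb{P}(|w\cdot X|\leq\varepsilon)$ exactly by enumerating all sign patterns, under the simplifying assumption that $X$ has dense i.i.d. Rademacher entries (no sparsity mask). The two test vectors, the value of $\varepsilon$, and the function names are illustrative choices only.

```python
import itertools
import math


def small_ball_probability(w, eps):
    """Exact P(|w . X| <= eps) for X with i.i.d. uniform +-1 entries (assumption)."""
    hits = sum(
        1
        for signs in itertools.product((-1.0, 1.0), repeat=len(w))
        if abs(sum(s * t for s, t in zip(signs, w))) <= eps
    )
    return hits / 2 ** len(w)


def normalize(w):
    norm = math.sqrt(sum(t * t for t in w))
    return [t / norm for t in w]


n = 10
structured = normalize([1.0] * n)  # all coordinates equal: highly structured
disordered = normalize([math.sqrt(q) for q in (2, 3, 5, 7, 11, 13, 17, 19, 23, 29)])

p_structured = small_ball_probability(structured, eps=0.01)
p_disordered = small_ball_probability(disordered, eps=0.01)
print(p_structured, p_disordered)
```

For the constant vector the sum vanishes exactly when five signs are positive, so the probability is $\binom{10}{5}/2^{10}=252/1024$, whereas the vector with incommensurable coordinates admits no exact cancellations and its small ball probability is far smaller.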

The covering argument is completed in multiple stages. For a fixed $\lambda$ , we consider $M_{n}-\lambda\operatorname{Id}$ acting on the unit sphere, where $\operatorname{Id}$ is the identity operator. Following the prescription initiated in a series of works [41, 64, 56, 53, 9, 10], we decompose the sphere into several sets that each offer their own advantages. Compressible vectors are those vectors that are close to $m$ -sparse vectors for some parameter $m$ . In [9], it was shown that the product of the matrix with a compressible vector has many large coordinates and therefore large $\ell_{2}$ norm. We adapt this argument to our symmetric matrix case to exclude compressible vectors. We next consider dominated vectors, which are those vectors whose coordinates outside the $m$ largest coordinates have a small ratio of $\ell_{2}$ norm to $\ell_{\infty}$ norm. This type of vector was introduced in [9]. As these vectors are also nearly sparse, they can be excluded similarly to the compressible vectors.

Finally, for vectors that are neither compressible nor dominated, we use a stratification according to a measure of structure, the LCD. The LCD was introduced in [56] and is defined later. As our random matrix is symmetric, there is dependence between the rows which prevents us from applying small ball probability estimates to each coordinate independently (see Footnote 2). To address this problem, for a fixed $v$ we partition the coordinates of $v$ into small subsets; this is similar to the method used in [66]. For a fixed subset, after conditioning on the columns of $M_{n}-\lambda$ outside of the subset, we can extract more independent coordinates to use in small ball estimates. There is some flexibility in the size of these subsets, and this ultimately results in the trade-off between the error probability and gap size in Theorem 2.2.

Footnote 2. This obstacle is what prevents us from reaching the optimal threshold for $p$ by simply following the argument in [9], which considered non-symmetric matrices for $p\geq(C\log n)/n$.

The previous steps are done for a fixed $\lambda$ and hold with exponentially high probability. Taking a union bound over a fine enough net of the interval $[-K\sqrt{pn},K\sqrt{pn}]$ completes the argument.

A similar approach was applied in [43], under the assumption that $p\geq n^{-1+{\varepsilon}}$ for some ${\varepsilon}>0$ and therefore small polynomial terms could often be neglected. In our current setting, where $p$ is on the order of $\log^{C}n/n$ , it turns out that the above decomposition is insufficient primarily because the vectors that are not dominated or compressible can have a wide range of $\ell_{2}$ mass in their coordinates outside of the $m$ largest. Therefore, we further decompose the vectors by their $\ell_{2}$ mass in the relevant coordinates. Working in each of these classes allows some key technical estimates that bypass the small polynomial losses from [43]. These technical improvements generate the improvement in the range of sparsity and the error probability. Furthermore, in [43], the result was only concerned with a non-zero separation of the eigenvalues. A more careful accounting of the small ball probability greatly improves the (implicit) small ball estimate in [43].

4. Compressible and Dominated Vectors

The goal of this section is to prove Proposition 4.6, which shows that, with high probability, no eigenvector of $M_{n}$ is close to a sparse vector in a certain quantitative sense. Before proceeding to its proof, we introduce a few necessary definitions and lemmas.

4.1. Decomposition of the sphere

We now formally define the decomposition of the unit sphere used in the proof sketch of Section 3.

Definition 4.1.

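Fix $m<n$. The set of $m$-sparse vectors is given by

$$\operatorname{Sparse}(m)=\{x\in\mathbb{R}^{n}:|\operatorname{supp}(x)|\leq m\}.$$

Furthermore, for $\delta>0$, we define the compressible and incompressible vectors by

$$\operatorname{Comp}(m,\delta)=\{x\in\mathbb{S}^{n-1}:\exists\,y\in\operatorname{Sparse}(m)\text{ such that }\|x-y\|_{2}\leq\delta\},$$

and

$$\operatorname{Incomp}(m,\delta)=\{x\in\mathbb{S}^{n-1}:x\notin\operatorname{Comp}(m,\delta)\}.$$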

For any $1\leq n\leq n^{\prime}$ , we let $[n]$ denote the set $\{1,2,\dots,n\}$ and $[n:n^{\prime}]$ denote the set $\{n,n+1,\dots,n^{\prime}\}$ .

Definition 4.2.

For any $x\in\mathbb{S}^{n-1}$ , let $\pi_{x}:[n]\rightarrow[n]$ be a permutation which arranges the absolute values of the coordinates of $x$ in non-increasing order. For $1\leq m\leq m^{\prime}\leq n$ denote by $x_{[m:m^{\prime}]}\in\mathbb{R}^{n}$ the vector with coordinates

$$x_{[m:m^{\prime}]}(j)=x_{j}\cdot\mathbf{1}_{[m:m^{\prime}]}(\pi_{x}(j)).$$

For any $c<1$ and $m\leq n$ , define the set of vectors with dominated tail by

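$$\operatorname{Dom}(m,c)=\{x\in\mathbb{S}^{n-1}:\|x_{[m+1:n]}\|_{2}\leq c\sqrt{m}\,\|x_{[m+1:n]}\|_{\infty}\}.$$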

This definition was first given in [9]. Like compressible vectors, vectors with dominated tail are close to being sparse, though in a different way. This approximate sparsity facilitates the proof of the following key bound, Proposition 4.4.
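The definitions above translate directly into membership tests, sketched below. The nearest $m$-sparse vector to $x$ keeps its $m$ largest coordinates, so the distance from $x$ to $\operatorname{Sparse}(m)$ equals $\|x_{[m+1:n]}\|_{2}$. The function names are hypothetical, and the dominated-tail test assumes the condition $\|x_{[m+1:n]}\|_{2}\leq c\sqrt{m}\,\|x_{[m+1:n]}\|_{\infty}$; treat the $\sqrt{m}$ normalization as an assumption of this sketch.

```python
import math


def tail(x, m):
    """Absolute values of the coordinates of x outside its m largest ones."""
    return sorted((abs(t) for t in x), reverse=True)[m:]


def dist_to_sparse(x, m):
    """Euclidean distance from x to the set of m-sparse vectors."""
    return math.sqrt(sum(t * t for t in tail(x, m)))


def is_compressible(x, m, delta):
    return dist_to_sparse(x, m) <= delta


def is_dominated(x, m, c):
    t = tail(x, m)
    l2 = math.sqrt(sum(s * s for s in t))
    linf = max(t, default=0.0)
    return l2 <= c * math.sqrt(m) * linf   # assumed sqrt(m) normalization


def normalize(x):
    norm = math.sqrt(sum(t * t for t in x))
    return [t / norm for t in x]


flat = [0.25] * 16                                   # fully delocalized unit vector
spiky = normalize([1.0, 1.0, 1.0, 1.0, 0.5] + [0.0] * 11)

print(is_compressible(flat, 4, 0.5), is_dominated(flat, 4, 0.9))
print(is_compressible(spiky, 4, 0.1), is_dominated(spiky, 4, 0.9))
```

The flat vector belongs to neither class, while the second vector is not within distance $0.1$ of a $4$-sparse vector yet has a dominated tail, illustrating that the two notions of approximate sparsity are different.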

4.2. Bounds for compressible and dominated vectors

We first state a high probability bound on the operator norm of $M_{n}$ , which was defined in Definition 2.1.

Lemma 4.3 ([43, Proposition 5.2] and [67, Proposition 1.10]).

For $M_{n}$ defined in Definition 2.1, there exist constants $C_{4.3},K,c_{4.3}>0$, depending only on the subgaussian moment $B$, such that for $p\geq\frac{C_{4.3}\log n}{n}$ and $n\geq(c_{4.3})^{-1}$,

$$\mathbb{P}(\|M_{n}\|\geq K\sqrt{pn})\leq\exp(-c_{4.3}pn).$$

For the remainder of this work, all references to the constant $K$ refer to the $K$ provided by Lemma 4.3.

The compressible and dominated vectors were previously resolved in [43] down to the optimal scale $p\geq C\log n/n$. Given some $\bar{C}_{4.4}>0$, we define the parameters

$$\ell_{0}=\left\lceil\frac{\log(1/(8p))}{\log\sqrt{pn}}\right\rceil,\qquad\rho=(\bar{C}_{4.4})^{-\ell_{0}-6}.$$

Proposition 4.4 ([43, Proposition 5.3]).

There exist constants $C_{4.4},\bar{C}_{4.4},c_{4.4},c^{\prime}_{4.4}>0$, depending only on the subgaussian moment $B$ of Definition 2.1, such that the following holds. If $p,m,\lambda$ satisfy

$$p\geq\frac{C_{4.4}\log n}{n},\quad p^{-1}\leq m\leq c_{4.4}n,\quad\text{and}\quad\lambda\in[-K\sqrt{pn},K\sqrt{pn}],$$

then with probability at least $1-\exp(-c^{\prime}_{4.4}pn)$,

$$\|(M_{n}-\lambda)x\|_{2}\geq c_{4.4}\rho\sqrt{pn}$$

for all $x\in\operatorname{Comp}(m,\rho)\cup\operatorname{Dom}(m,c^{\prime}_{4.4})$ and $n>(c^{\prime}_{4.4})^{-1}$.

Remark 4.5.

Note that if $p\geq n^{-1+c}$ for some constant $c>0$, then $\rho$ is bounded below by a constant. At the optimal scale $p=C\log n/n$, there exist constants $C_{1},C_{2},c_{1},c_{2}>0$ such that

[TABLE]

We now come to the main result of this section, which combines the previous two results, Lemma 4.3 and Proposition 4.4, to exclude the possibility of compressible or dominated eigenvectors.

Proposition 4.6.

Let $M_{n}$ be as in Definition 2.1 with $p\geq C_{4.4}\frac{\log n}{n}$. For $p^{-1}\leq m\leq c_{4.4}n$ and $n\geq(c_{4.6})^{-1}$,

[TABLE]

for some constant $c_{4.6}>0$.

Proof.

Let $\mathcal{N}$ denote a $c_{4.4}\rho\sqrt{pn}$-net of the interval $[-K\sqrt{pn},K\sqrt{pn}]$ with

[TABLE]

If there exists a compressible or dominated eigenvector $v$ with eigenvalue $\lambda\in[-K\sqrt{pn},K\sqrt{pn}]$ , then there exists a $\lambda_{0}\in\mathcal{N}$ such that

[TABLE]

By a union bound and Proposition 4.4, the probability of this event is bounded by

[TABLE]

for large enough $C_{\ref{prop:compressible}}$ and small enough $c_{\ref{prop:eigvecnotcomp}}$; to bound $|\mathcal{N}|$, we used Remark 4.5. Finally, the probability that there exists an eigenvalue outside of the interval $[-K\sqrt{pn},K\sqrt{pn}]$ is bounded by $\exp(-c_{\ref{l:opnorm}}pn)$, by Lemma 4.3. Shrinking $c_{\ref{prop:eigvecnotcomp}}$ allows us to take a union bound to include this event, and concludes the proof. ∎
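The net over the spectral window used in this proof can be illustrated with a minimal sketch. It assumes nothing beyond the definition of an $\varepsilon$-net of an interval; the function name `interval_net` and the numerical values are hypothetical and not taken from the paper. With radius $K\sqrt{pn}$ and mesh $c_{\ref{prop:compressible}}\rho\sqrt{pn}$, the ratio of radius to mesh is $K/(c_{\ref{prop:compressible}}\rho)$, so in this construction the number of net points depends on $\rho$ but not on $pn$.

```python
import math


def interval_net(radius, eps):
    """Return points of [-radius, radius] such that every point of the
    interval lies within eps of one of them (a one-dimensional eps-net)."""
    if radius <= 0 or eps <= 0:
        raise ValueError("radius and eps must be positive")
    count = math.ceil(radius / eps)
    # Centers of consecutive windows of length 2*eps, starting at -radius.
    # The last center may overshoot the interval, so it is clamped to radius.
    return [min(radius, -radius + eps * (2 * k + 1)) for k in range(count)]
```

For instance, `interval_net(1.0, 0.3)` returns four points, in line with the count $\lceil \text{radius}/\varepsilon\rceil$.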

5. Incompressible Vectors

In this section, we show that $M_{n}$ does not have structured eigenvectors. We begin with Section 5.1, where we elucidate the connection between small ball probability and our measure of structure, the Least Common Denominator (LCD). Section 5.2 and Section 5.3 are devoted to the proof of Proposition 5.17, which shows it is unlikely an eigenvector of $M_{n}$ has an LCD lying in a given level set. This proposition is the main technical achievement of this section. Finally, we derive Proposition 5.18 as a straightforward consequence of Proposition 5.17 and a union bound, which excludes the possibility of structured eigenvectors altogether. Together with Proposition 4.6, Proposition 5.18 will allow us to complete the outline of Section 3 and prove our main theorems in the next section.

5.1. Small Ball Probability

Recall from the proof sketch in Section 3 that we wish to bound the probability that the inner product of an eigenvector and a random vector is small. This motivates the definition of Lévy concentration, which bounds the small ball probabilities of a random vector $Z$ .

Definition 5.1.

The Lévy concentration of a random vector $Z\in\mathbb{R}^{n}$ is defined to be

[TABLE]

When $X$ is a random vector and $v$ is a fixed vector, the structure of $v$ will greatly influence the Lévy concentration of the random variable $v\cdot X$ . To formalize this concept, we begin with a measure of arithmetic structure for a unit vector.
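To make the influence of structure concrete, here is a small sketch that computes the Lévy concentration of $v\cdot X$ exactly in low dimension. It assumes the standard definition $\mathcal{L}(Z,\varepsilon)=\sup_{u}\mathbb{P}(\|Z-u\|_{2}\leq\varepsilon)$, and it takes the $\xi_{j}$ to be Rademacher signs, which is only one admissible choice of subgaussian variable. The function name `levy_concentration` is hypothetical.

```python
from itertools import product
import math


def levy_concentration(v, p, eps):
    """Exact sup over u of P(|<v, X> - u| <= eps) for small len(v)."""
    # Each coordinate X_j = xi_j * chi_j takes the values -1, 0, 1 with
    # probabilities p/2, 1 - p, p/2 (xi_j Rademacher, chi_j Bernoulli(p)).
    law = {-1: p / 2, 0: 1 - p, 1: p / 2}
    atoms = []
    for xs in product(law, repeat=len(v)):
        weight = math.prod(law[x] for x in xs)
        atoms.append((sum(vi * xi for vi, xi in zip(v, xs)), weight))
    atoms.sort()
    best = 0.0
    # A closed window of length 2*eps with maximal mass can be slid right
    # until its left endpoint is an atom, so only those windows are tried.
    for i, (left, _) in enumerate(atoms):
        mass = 0.0
        for value, weight in atoms[i:]:
            if value > left + 2 * eps:
                break
            mass += weight
        best = max(best, mass)
    return best
```

With $p=1/2$ and a tiny $\varepsilon$, the structured unit vector $(1/2,1/2,1/2,1/2)$ gives concentration $70/256\approx 0.273$, while the unit vector proportional to $(1,\sqrt{2},\sqrt{3},\sqrt{5})$ gives only $1/16$, the mass of the single outcome $X=0$.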

Definition 5.2 ([66, Definition 6.1]).

Let $p$ be as in Theorem 2.2. We define the least common denominator (LCD) of $x\in\mathbb{S}^{n-1}$ as

[TABLE]

where $\gamma$ is an appropriate constant that is defined in Remark 5.3 below.

Remark 5.3.

There exist constants $\gamma,\bar{{\varepsilon}}_{0}\in(0,1)$ such that for any ${{\varepsilon}\leq\bar{{\varepsilon}}_{0}}$ ,

[TABLE]

where $\chi$ is a Bernoulli random variable such that ${\mathbb{P}}(\chi=1)=p$ and $\xi$ is a subgaussian random variable with unit variance. We fix such a $\gamma$ in Definition 5.2.

Proposition 5.4 ([9, Proposition 4.2]).

Let $X\in\mathbb{R}^{n}$ be a random vector with i.i.d. coordinates of the form $\xi_{j}\chi_{j}$ , where the $\chi_{j}$ ’s are Bernoulli random variables with ${\mathbb{P}}(\chi_{j}=1)=p$ and the $\xi_{j}$ ’s are random variables with unit variance and finite fourth moment. Then for any $v\in\mathbb{S}^{n-1}$ ,

[TABLE]

where $C_{\ref{prop:smallballprobability}}$ depends only on the fourth moment of $\xi$ .

We may tensorize Proposition 5.4 to obtain a bound on the Lévy concentration of $M_{n}x$ . The argument is almost identical to the proof of [9, Proposition 4.3], and we note only the necessary modifications here. Recall the notation $x_{[m:m^{\prime}]}$ from Definition 4.2. For any index set $J\subset[n]$ , we extend this notation to $x_{J}$ in the canonical way.

Proposition 5.5 (Small ball probabilities of $M_{n}x$ via regularized LCD).

There exists a constant $C_{\ref{prop:smallballprob}}$ such that for any $\alpha,{\varepsilon}>0$ and index set $I$ of size $\lceil\alpha n\rceil$ ,

[TABLE]

Proof Sketch.

We first observe that conditioning on elements of $M_{n}$ never decreases (and may increase) $\mathcal{L}(M_{n}x,{\varepsilon}\|v_{I}\|_{2}\sqrt{pn})$. We therefore condition on all elements not in columns indexed by elements of $I$, and also condition on the elements whose indices $(i,j)$ satisfy $i,j\in I$. The remaining elements are i.i.d. and consist of $n-\lceil\alpha n\rceil$ rows. The remainder of the argument is nearly identical to the one leading to [9, Proposition 4.3], where an analogous statement was shown for non-symmetric matrices. ∎

The following lemma provides a lower bound for the LCD in terms of the $\ell^{\infty}$ norm.

Proposition 5.6 (Lemma 6.2, [66]).

For all $x\in\mathbb{S}^{n-1}$ ,

[TABLE]

As in [66], we define a regularized version of the LCD. However, our definition is slightly different than the one in [66]. Recall the notation $\operatorname{Incomp}(m,\delta)$ given after Definition 4.1, and observe that the set $I_{0}$ in the following definition takes a distinguished role and is not included in the maximum. Here, $k_{0}$ represents a parameter that will be fixed later, in the material preceding (5.2).

Definition 5.7 (Regularized LCD).

Let $\{I_{j}\}_{j=0}^{k_{0}}$ be any partition of $[n]$ with $k_{0}$ elements. We define the regularized LCD of a vector $v\in\operatorname{Incomp}(m,\delta)$ as

[TABLE]

In our use of Definition 5.7 below, $I_{0}$ will be (approximately) the $m$ largest coordinates of $v$ . Hence $\widehat{D}(v)$ gives a measure of the structure of the elements of $v$ left over after approximating $v$ by an $m$ -sparse vector.

5.2. Decomposition of Incompressible Vectors

In this section, we define a way to decompose incompressible vectors, which is used in the proof of Proposition 5.15 below. In order to give this decomposition, we first introduce a classification of the incompressible vectors, which allows us to control the amount of mass that is not in the $m$ largest coordinates.

Definition 5.8.

For $\rho\leq\rho_{1}\leq\rho_{2}\leq 1$ and $c<1$ , define

[TABLE]

Remark 5.9.

By definition, $\|v\|_{2}\leq\rho$ for any $v\in\operatorname{Comp}(m,\rho)$ , which gives rise to the condition $\rho\leq\rho_{1}$ in the preceding definition.

We will consider the sets of incompressible vectors $\operatorname{Incomp}_{\rho,c^{\prime}_{\ref{prop:compressible}}}(m,2^{j-1}\rho,2^{j}\rho)$ for $j\in\mathbb{N}$ , where $m$ is a parameter that will be chosen later. For brevity, we introduce the shorthand

[TABLE]

For the remainder of this section we primarily use the fact that the vectors in $\operatorname{Incomp}(m,2^{j-1}\rho,2^{j}\rho)$ are not dominated. That they are not compressible is used only in the proof of Proposition 5.17.

We begin with a straightforward upper bound. Recall $\rho$ was defined in Proposition 4.4. Fix $j\in\mathbb{Z}$ and consider a vector ${v\in\operatorname{Incomp}(m,2^{j-1}\rho,2^{j}\rho)}$ . Since $v\notin\operatorname{Dom}(m,c^{\prime}_{\ref{prop:compressible}})$ ,

[TABLE]

Furthermore, since $\|v_{[m+1:n]}\|_{2}<2^{j}\rho$ by definition,

[TABLE]

On the other hand, we can also find a large set of coordinates that are uniformly lower-bounded.

Lemma 5.10.

For $v\in\operatorname{Incomp}(m,2^{j-1}\rho,2^{j}\rho)$ , the set

[TABLE]

satisfies $|\sigma(v)|\geq(c^{\prime}_{\ref{prop:compressible}})^{2}m/8$ .

Proof.

For the sake of contradiction, assume that $|\sigma(v)|<(c^{\prime}_{\ref{prop:compressible}})^{2}m/8$ . Then by (5.1),

[TABLE]

contradicting the definition of $\operatorname{Incomp}(m,2^{j-1}\rho,2^{j}\rho)$ . ∎

We now define a partitioning procedure. For this, we introduce some new notation.

Definition 5.11.

For a set $I\subset[n]$ with $|I|\geq k_{2}>k_{1}$, we use $I_{\langle k_{1}:k_{2}\rangle}$ to denote all the elements from the $k_{1}$-th to the $k_{2}$-th in $I$ (inclusive), where we order the elements from least to greatest. For example, if $I=\{2,4,5,6,9\}$ then $I_{\langle 2:4\rangle}=\{4,5,6\}$.
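The notation of Definition 5.11 amounts to slicing the sorted index set with 1-based, inclusive endpoints. The following sketch spells this out; the function name `ordered_slice` is hypothetical, and the requirement $k_{1}\geq 1$ is our reading of the definition.

```python
def ordered_slice(index_set, k1, k2):
    """Return I<k1:k2>: the k1-th through k2-th smallest elements of the
    index set (1-based and inclusive), as in Definition 5.11."""
    ordered = sorted(index_set)
    if not (1 <= k1 < k2 <= len(ordered)):
        raise ValueError("need 1 <= k1 < k2 <= |I|")
    # Python slices are 0-based and exclude the right endpoint.
    return set(ordered[k1 - 1:k2])
```

Applied to the example in the definition, `ordered_slice({2, 4, 5, 6, 9}, 2, 4)` returns `{4, 5, 6}`.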

Let $v\in\mathbb{S}^{n-1}$ be a vector, let $\omega=\omega(n)$ be a parameter satisfying

[TABLE]

and set $m=\omega n$. We define $k_{0}$ as the largest number of disjoint subsets of $[n]$, each with $\lceil\omega n\rceil$ elements, whose union contains none of the indices of the $m$ largest elements of $v$. We consider disjoint index sets $I_{1},\dots,I_{k_{0}}$, each of size $\lceil\omega n\rceil$, each not containing any indices of the $m$ largest elements of $v$. Therefore,

[TABLE]

In our definition, the index sets $I_{j}$ depend on $v$ , but we suppress this dependence in the notation. For a vector $v\in\mathbb{S}^{n-1}$ , let $\tau(v)$ denote the set of indices of the $m$ largest coordinates. By Lemma 5.10, we can choose a subset $\widehat{\sigma}(v)\subset\sigma(v)$ of size exactly $\lceil(c^{\prime}_{\ref{prop:compressible}})^{2}m/8\rceil$ , where $\sigma(v)$ was defined in the statement of that lemma. We observe that $\widehat{\sigma}(v)$ and $\tau(v)$ are disjoint.

Let $\overline{\sigma}(v)=[n]\setminus(\tau(v)\cup\widehat{\sigma}(v)).$ For $1\leq k<k_{0}$ , we define

[TABLE]

For the rest of this work, we drop floor and ceiling functions because they do not influence the argument in a substantial way.

Finally, we define $I_{0}=[n]\setminus\cup_{k=1}^{k_{0}}I_{k}$ . In words, $I_{0}$ contains the $m$ largest coordinates and the smaller coordinates left over from divisibility issues. In particular, $|I_{0}|\leq m+\lceil\omega n\rceil$ . Since the sets $I_{k}$ were chosen to be disjoint for $k\geq 1$ , it follows that $\{I_{k}\}_{k=0}^{k_{0}}$ is a partition of $[n]$ .

The primary objective of this partition is recorded in the following lemma, where we also define the constants $\rho^{\prime}_{j}$ .

Lemma 5.12.

For $v\in\operatorname{Incomp}(m,2^{j-1}\rho,2^{j}\rho)$ and $1\leq k\leq k_{0}$ ,

[TABLE]

Also,

[TABLE]

Proof.

The bounds on $\|v_{I_{k}}\|_{2}$ follow from the coordinate-wise bounds of our construction. For the lower bound, we ignore all elements not in $\widehat{\sigma}(v)$ . We obtain

[TABLE]

The claim (5.4) then follows from Lemma 5.10, (5.1), and (5.2).

For the second claim, applying Proposition 5.6 and recalling Definition 5.7 yields

[TABLE]

Then the claim follows from the lower bound on $\|v_{I_{k}}\|_{2}$ in the previous paragraph and (5.1). ∎

5.3. Vectors with Small LCD

We now exclude vectors with small regularized LCD as potential eigenvectors of $M_{n}$ . This is the content of the next proposition, Proposition 5.15, which shows that any vector in $\operatorname{Incomp}(m,2^{j-1}\rho,2^{j}\rho)$ with small regularized LCD is unlikely to be near an eigenvector. We first define level sets of vectors according to their regularized LCD.

Definition 5.13.

For any $L>0$ , we define the level sets

[TABLE]

We also require a preliminary lemma. Recall $\gamma$ was defined in Remark 5.3.

Lemma 5.14 (Lemma 6.13, [43]).

Let $\omega>0$ , and let $f(n)$ be a function such that

[TABLE]

Then for $L>f(n)$ , the set of unit vectors

[TABLE]

admits a $\beta$ -net of size at most

[TABLE]

where $\bar{c}>0$ is a universal constant and

[TABLE]

We now state and prove the main technical result of this section. Recall $S_{L}$ was defined in Definition 5.13, $\rho^{\prime}_{j}$ was defined in Lemma 5.12, and $K$ is the constant given by Lemma 4.3.

Proposition 5.15.

Fix $\nu>0$ . There exist constants $C_{\ref{prop:smallLCD}},c_{\ref{prop:smallLCD}},c^{\prime}_{\ref{prop:smallLCD}},\tilde{c}_{\ref{prop:smallLCD}}>0$ such that for $p\geq C_{\ref{prop:smallLCD}}\frac{\log^{7+\nu}n}{n}$ , $\lambda\in[-K\sqrt{pn},K\sqrt{pn}]$ , $j\in\mathbb{N}$ , and for any

[TABLE]

and

[TABLE]

the following holds for $n\geq(c^{\prime}_{\ref{prop:smallLCD}})^{-1}$ :

[TABLE]

where

[TABLE]

Proof.

We set $m=\alpha n$ , and define

[TABLE]

In outline, this proof implements the following steps:

(1) Construct a suitable net $\mathcal{M}$ for $\mathcal{K}$.

(2) Upper bound the size of $\mathcal{M}$.

(3) Show the claim holds for all $v\in\mathcal{M}$.

(4) Extend the result from all $v\in\mathcal{M}$ to all $v\in\mathcal{K}$.

For Step 1, let $v\in\mathcal{K}$ be a vector and consider the partition $\{I_{k}\}_{k=0}^{k_{0}}$ of the coordinates of $v$ constructed in (5.3) with the parameter $\omega=\alpha$. For the coordinates $I_{0}$, by a standard volume estimate (see for example [52, (5.7)]), there exists a $c^{\prime}_{\ref{prop:smallLCD}}\rho^{\prime}_{j}{\varepsilon}_{0}/10K$-net, $\mathcal{N}_{0}$, of the values $[0,1]$ such that

[TABLE]

where we recall $|I_{0}|\leq m+\alpha n$ .

For the coordinates in $I_{k}$ with $k\geq 1$ , we use a construction that exploits the LCD structure. Observe that the hypothesis of Lemma 5.14 holds for $v_{I_{k}}/\|v_{I_{k}}\|_{2}$ because

[TABLE]

as shown in the proof of Lemma 5.12 (see (5.5)), and the lower bound tends to infinity as $n\rightarrow\infty$. For $I_{k}$ with $k\geq 1$, let $\mathcal{N}_{k}$ denote the $\beta$-net guaranteed by Lemma 5.14 applied to $v_{I_{k}}/\|v_{I_{k}}\|_{2}$. (Observe that we are applying this lemma when the upper limit is $2L$, according to the definition of $S_{L}$, not $L$. The definition of $\beta$ is adjusted accordingly below.)

We next implement a net of scaling factors. Let $\mathcal{J}$ be a $c^{\prime}_{\ref{prop:smallLCD}}{\varepsilon}_{0}\rho^{\prime}_{j}/10Kk_{0}$ -net of $[0,1]$ such that

[TABLE]

As observed earlier, the partition $\{I_{k}\}_{k\geq 0}$ of the coordinates of $v$ is entirely determined by the sets of indices $\tau$ and $\sigma$ . To approximate all $v\in\mathcal{K}$ , we define the preliminary set

[TABLE]

We currently have no guarantee that

[TABLE]

However, this is easily fixed. If there exists $x\in S_{L}$ such that

[TABLE]

we replace $m$ by any such $x$ . Otherwise, we discard $m$ . This creates a new net $\mathcal{M}$ such that $|\mathcal{M}|\leq|\mathcal{M}^{\prime}|$ . This completes Step 1.

We now enter Step 2 of the proof and upper bound the size of $\mathcal{M}$ . We may combinatorially determine the size of $\mathcal{M}$ using the sizes of the $\mathcal{N}_{k}$ and $\mathcal{J}$ . This leads to the following bound on the cardinality of our net:

[TABLE]

The combinatorial factors come from the choices of $\tau$ and $\sigma$ in (5.7).

We now proceed to simplify this bound. From the elementary bound

[TABLE]

we have the following exponential bound for $|\mathcal{M}|$ :

[TABLE]

For the second factor, we recall that $|I_{0}|\geq m$, so that the product from $1$ to $k_{0}$ in (5.8) has at most $n-m$ individual terms. Using ${L\leq\exp(2\alpha^{-1})}$, ${m=\alpha n}$, $k_{0}\leq\alpha n$, and $k_{0}\leq\alpha^{-1}$ (from (5.2)), we find

[TABLE]

Recall that $\rho^{\prime}_{j}$ was defined in terms of $\rho$ in Lemma 5.12, and $\log(1/\rho)=O(\log n/\log\log n)$ by Remark 4.5. Note also that $\log(1/\alpha)=O(\log n)$ . Then there exists $C>0$ such that

[TABLE]

From this, we find

[TABLE]

This completes Step 2.

We now begin Step 3 of the outline and prove the result for all the points in our net $\mathcal{M}$ . Set

[TABLE]

By Proposition 5.5 applied with ${\varepsilon}=c_{\ref{prop:smallLCD}}{\varepsilon}_{0}$ , for any $v\in\mathcal{M}$ and $k$ such that $1\leq k\leq k_{0}$ ,

[TABLE]

where we recall from Lemma 5.12 that $\rho^{\prime}_{j}\leq\|v_{I_{k}}\|_{2}$ . Since $v\in S_{L}$ , by the definition of $S_{L}$ we find there exists $1\leq k\leq k_{0}$ such that $D(v_{I_{k}}/\|v_{I_{k}}\|_{2})>L$ . We use this $k$ in the above expression to find

[TABLE]

Straightforward computations show

[TABLE]

Recall that ${\varepsilon}_{0}$ was defined as the minimum of the two upper bounds in (5.10), so

[TABLE]

Then

[TABLE]

Setting $c_{\ref{prop:smallLCD}}=(2C_{\ref{prop:smallballprob}})^{-1}$ and applying a union bound over all elements $x\in\mathcal{M}$ , we obtain

[TABLE]

To bound $|\mathcal{M}|{\varepsilon}_{0}^{n-\alpha n}$ from (5.12), we use (5.9) and divide into two cases. First, suppose $\frac{2\bar{c}L}{\sqrt{\alpha n}}\leq 1$ . By (5.9), we have

[TABLE]

Combining this with (5.12) and absorbing the $13^{n}$ into the exponential yields

[TABLE]

so

[TABLE]

In the last line we used $\alpha=o(1)$ and ${\varepsilon}_{0}\rightarrow 0$ (the latter is by direct calculation), so $\log(1/{\varepsilon}_{0})\rightarrow\infty$ and the term inside the brackets tends to $-\infty$ .

For the case $\frac{2\bar{c}L}{\sqrt{\alpha n}}>1$ , recalling the definition of ${\varepsilon}_{0}$ and that $m=\alpha n$ gives

[TABLE]

Now (5.11) shows that

[TABLE]

This, along with the stipulated range of $\alpha$ , implies that

[TABLE]

Therefore, taking $c^{\prime}_{\ref{prop:smallLCD}}$ small enough in (5.15), we have

[TABLE]

This completes Step 3.

We now proceed to Step 4. Having shown the result for all the points in the net, we now extend to the entire level set $\mathcal{K}$ . Again, we divide into cases.

We assume first that

[TABLE]

For any $w\in\mathcal{K}$ , let $m\in\mathcal{M}$ be the closest element of the net $\mathcal{M}$ . Then, by the definition of $\mathcal{M}$ ,

[TABLE]

In the third inequality, we used that there are $k_{0}$ terms in the sum, that the $y_{k}$ form a $\beta$ -net, and the upper bound on $\|w_{k}\|_{2}$ from (5.4). In the fourth inequality, we used $k_{0}\leq\alpha^{-1}$ from (5.2) and the inequality

[TABLE]

where $C_{\gamma}$ is a constant that depends only on $\gamma$ . The inequality (5.17) follows from the definition of $\beta$ and the hypothesized upper bound $\log(\sqrt{p}L)\leq\alpha^{-1}$ on $L$ . The last inequality follows by direct calculation using the value of ${\varepsilon}_{0}$ given in (5.16) and the assumed lower bound on $\alpha$ .

For the other case, suppose

[TABLE]

For any $w\in\mathcal{K}$ , let $m\in\mathcal{M}$ be the closest element of the net $\mathcal{M}$ . Then, by the definition of $\mathcal{M}$ ,

[TABLE]

In the third line, we used that there are $k_{0}$ terms in the sum, that the $y_{k}$'s form a $\beta$-net, and the upper bound on $\|w_{k}\|_{2}$ from (5.4). The fourth line follows from the definition of $\beta$. The fifth line is a result of the observation that $\sqrt{\log x}/x$ is a decreasing function for large $x$, $r\rightarrow\infty$, and $r<L\sqrt{p}$. We also used the bound $k_{0}\leq\alpha^{-1}$ from (5.2). In the sixth line, we used the definition of ${\varepsilon}_{0}$ in (5.18). For the last line, we used $(\log\log n)^{-1}=o(1)$ and took $n$ large enough.

Therefore, if $\|(M_{n}-\lambda)w\|_{2}\geq 2{c_{\ref{prop:smallLCD}}}{\varepsilon}_{0}\sqrt{pn}$ , then using Lemma 4.3,

[TABLE]

with exponentially small error probability, which contradicts the conclusion of Step 3 above. After adjusting $c_{\ref{prop:smallLCD}}$ by a factor of $2$ , this completes the proof. ∎

Remark 5.16.

As noted in Remark 2.5, the optimal result should permit $p$ as small as $C\log(n)/n$ . The restriction that $p\geq C\log^{7+\nu}n/n$ in the above proof comes from the requirement that ${\varepsilon}_{0}\rightarrow 0$ .

We now extend the previous result to all vectors with small LCD.

Proposition 5.17.

Fix $\nu>0$ . There exists a constant $c_{\ref{p:smallLCDallLevels}}>0$ such that for $p\geq C_{\ref{prop:smallLCD}}\frac{\log^{7+\nu}n}{n}$ , $\lambda\in[-K\sqrt{pn},K\sqrt{pn}]$ , $j\in\mathbb{N}$ and for any

[TABLE]

the following holds. The probability that there exists $v\in\operatorname{Incomp}(\alpha n,\rho)$ such that

[TABLE]

is at most $\exp(-c_{\ref{p:smallLCDallLevels}}n)$ for $n\geq(c_{\ref{p:smallLCDallLevels}})^{-1}$ , where

[TABLE]

Proof.

We set $D_{0}=c^{\prime}_{\ref{prop:compressible}}2^{-5}\alpha^{3/2}n^{1/2}$ and recall that $\widehat{D}(v)\geq D_{0}$ by $\eqref{e:Llower}$ . We can decompose the relevant vectors as

[TABLE]

where we used $D_{0}\geq 1$. Recall $\log(1/\rho)=O(\log n/\log\log n)$ by Remark 4.5. Similarly, the number of $j^{\prime}$ indices in the union is $O(\log n)$ because each of $\log_{2}p^{-1/2}$ and $\log_{2}\exp(\alpha^{-1})$ is $O(\log n)$. Therefore, taking a union bound, applying Proposition 5.15, and observing $\rho^{\prime}_{j}\geq\rho^{\prime}_{1}$ and ${\varepsilon}_{0}(L)\geq{\varepsilon}_{1}$ for the ${\varepsilon}_{0}(L)$ defined in Proposition 5.15 yields the result. ∎

5.4. Eigenvector Bound

We now come to a key proposition used in the proof of the main theorem.

Proposition 5.18.

For $M_{n}$ as in Definition 2.1, there exists a constant $c_{\ref{p:eigvectors}}>0$ such that for

[TABLE]

the probability that $M_{n}$ has an eigenvector $v$ such that

[TABLE]

is at most $\exp(-c_{\ref{p:eigvectors}}n)$ , for $n\geq(c_{\ref{p:eigvectors}})^{-1}$ .

Proof.

Consider a $c_{\ref{prop:smallLCD}}{\varepsilon}_{1}\rho^{\prime}_{1}\sqrt{pn}$-net of $[-K\sqrt{pn},K\sqrt{pn}]$, where ${\varepsilon}_{1}$ was defined in Proposition 5.17. For an eigenvalue $\lambda\in[-K\sqrt{pn},K\sqrt{pn}]$, there exists a point $\lambda_{0}$ of the net such that for the corresponding eigenvector $v$ we have

[TABLE]

However, by a union bound and Proposition 5.17, the probability of this event is bounded by $\exp(-c_{\ref{p:eigvectors}}n)$ for some $c_{\ref{p:eigvectors}}>0$ . By Lemma 4.3, decreasing the value of $c_{\ref{p:eigvectors}}$ can account for the event that there exists an eigenvalue of $M_{n}$ outside the interval $[-K\sqrt{pn},K\sqrt{pn}]$ . This concludes the proof. ∎

6. Proofs of Main Results

6.1. Proof of Theorem 2.2

In preparation for the main proof, we record the following lemma from [43].

Lemma 6.1 ([43, Lemma 6.1]).

For any $v\in\operatorname{Incomp}(m,\rho)$ ,

[TABLE]

Proof of Theorem 2.2.

We repeat the decomposition described in Section 3. Let

[TABLE]

where $X=(x_{1},\dots,x_{n-1})\in\mathbb{R}^{n-1}$ . Let $v=(x,a)$ (where $x\in\mathbb{R}^{n-1}$ and ${a\in\mathbb{R}}$ ) be the unit eigenvector associated to $\lambda_{i}(M_{n})$ . Because $v$ is an eigenvector with eigenvalue $\lambda_{i}$ ,

[TABLE]

Considering the top $n-1$ coordinates gives

[TABLE]

Let $w$ be the eigenvector of $M_{n-1}$ corresponding to $\lambda_{i}(M_{n-1})$ . After multiplying on the left by $w^{T}$ , we arrive at

[TABLE]

Since $|w^{T}x|\leq 1$ by the Cauchy–Schwarz inequality, this implies

[TABLE]
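For readers following the algebra, the chain of identities above can be sketched as follows. This is a reconstruction under an assumption: we take the usual block form in which $M_{n-1}$ is the top-left minor of $M_{n}$, $X$ is the last column with its final entry removed, and the corner entry is written $m_{nn}$ (this symbol is ours). The third line uses $w^{T}M_{n-1}=\lambda_{i}(M_{n-1})w^{T}$, which holds because $M_{n-1}$ is symmetric.

```latex
% Assumed block form: M_n = [[M_{n-1}, X], [X^T, m_{nn}]], with v = (x, a).
\begin{align*}
(M_{n-1}-\lambda_{i}(M_{n}))\,x + aX &= 0, \\
w^{T}(M_{n-1}-\lambda_{i}(M_{n}))\,x + a\,w^{T}X &= 0, \\
(\lambda_{i}(M_{n-1})-\lambda_{i}(M_{n}))\,w^{T}x &= -a\,w^{T}X, \\
|a\,w^{T}X| = |\lambda_{i}(M_{n-1})-\lambda_{i}(M_{n})|\,|w^{T}x|
  &\leq |\lambda_{i}(M_{n-1})-\lambda_{i}(M_{n})|.
\end{align*}
```

The last step uses only $|w^{T}x|\leq\|w\|_{2}\|x\|_{2}\leq 1$.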

By the Cauchy interlacing law, we must have $\lambda_{i}(M_{n})\leq\lambda_{i}(M_{n-1})\leq\lambda_{i-1}(M_{n})$ . For any $\hat{\delta}>0$ , let $\mathcal{E}_{i}=\mathcal{E}_{i}\left(\hat{\delta}\right)$ denote the event that

[TABLE]

On $\mathcal{E}_{i}$ , (6.3) implies

[TABLE]

Now note that the decomposition (6.1) can be done along any coordinate, not just the last. For any $A>0$ , let $n_{A}$ be the number of coordinates with absolute value at least $A$ , and let $N$ be a parameter. Therefore, repeating the argument leading to (6.5) with the coordinate $a$ chosen uniformly at random, and considering the probability that we choose a coordinate with absolute value at least $A$ , and $\mathcal{E}_{i}$ obtains, we find

[TABLE]

Setting $m=c_{\ref{prop:compressible}}n$ in Proposition 4.6 shows that any eigenvector $v$ will not be in $\operatorname{Comp}(c_{\ref{prop:compressible}}n,\rho)$ with exponentially high probability. When $v\notin\operatorname{Comp}(c_{\ref{prop:compressible}}n,\rho)$ , by Lemma 6.1, there are greater than $c_{\ref{prop:compressible}}n\rho^{2}/2$ coordinates whose absolute values are larger than $\rho/\sqrt{2n}$ . We set $N=c_{\ref{prop:compressible}}n\rho^{2}/2$ and $A=\rho/\sqrt{2n}$ in (6.7) to find

[TABLE]

With probability at least $1-\exp(-c_{\ref{p:eigvectors}}pn)$ ,

[TABLE]

by Proposition 4.6 (applied with $m=\alpha n$ ) and Proposition 5.18. At this point, we would like to apply Proposition 5.4 to control the probability ${\mathbb{P}}\left(|w^{T}X|\leq\hat{\delta}\rho^{-1}\sqrt{2p}\right)$ in (6.8). However, this proposition applies to the LCD $D(w)$ , not the regularized LCD $\widehat{D}(w)$ , so a slightly more delicate argument is required.

By the definition of regularized LCD, there exists some subset $J$ of coordinate indices such that

[TABLE]

To adjust for the regularized LCD, we observe that conditioning on a subset of $X$ can only increase the Lévy function $\mathcal{L}(w^{T}X,{\varepsilon})$ for any ${\varepsilon}>0$ . We condition on all the random variables in $X$ whose indices do not lie in the subset $J$ . Also, to apply Proposition 5.4, we need to normalize this subset to be on the unit sphere. Therefore, by Proposition 5.4,

[TABLE]

for all $\hat{\delta}\geq\rho e^{-\alpha^{-1}}/\sqrt{2}$ . By Lemma 5.12, $\|w_{J}\|_{2}\geq c^{\prime}_{\ref{prop:compressible}}2^{-3}\rho\alpha$ . Therefore, putting (6.9) into (6.8), we find

[TABLE]

We set $\delta=\hat{\delta}\rho^{-4}$ . Then the above holds for $\delta\geq\rho^{-3}e^{-\alpha^{-1}}/\sqrt{2}$ . Recall that $\rho^{-3}=\exp(O(\log n/\log\log n))$ . Thus, we obtain the theorem after lowering $c^{\prime}_{\ref{thm:main}}$ , which constrains the range of $\alpha$ . ∎

6.2. Proof of Theorem 2.6

Let $G(n,p)$ denote the Erdős–Rényi random graph on $n$ vertices with edge probability $p$, and let $A_{n}$ denote the adjacency matrix of $G(n,p)$. In other words, $A_{n}$ is a symmetric matrix of Bernoulli variables with parameter $p$, with all entries on the diagonal equal to $0$. We have $\mathbb{E}A_{n}=p(J_{n}-I_{n})$ where $J_{n}$ is the matrix of all ones, so our main theorem does not apply. However, only small modifications are necessary to handle this case, which we detail in this section, following closely the analogous argument in [43, Section 8].

First, we observe that Proposition 4.4 can be adapted so that the proposition holds for $A_{n}$ in place of $M_{n}$ . This was proved in [43, Appendix B]. It follows that Proposition 4.6 also holds for $A_{n}$ (by repeating the proof of Proposition 4.6 using the analogue of Proposition 4.4 for $A_{n}$ ).

Next, we claim that Proposition 5.17 can be adapted to hold for the matrix $A_{n}-p(J_{n}-I_{n})$ in place of $M_{n}$, with the additional restriction that we must suppose $p\leq 1/2$. The restriction is due to the fact that we will write the off-diagonal entries of this matrix as $a_{ij}=\delta_{ij}\xi_{ij}$, where $\delta_{ij}$ is Bernoulli with parameter $2p$ and $\xi_{ij}$ is Bernoulli with parameter $1/2$ (as in the definition of $M_{n}$). Our arguments for Proposition 5.17 revolved around Lévy concentration and nets. The use of Lévy concentration in Proposition 5.5 does not need to be modified for the random graph case, since it is invariant under changes in the mean of the matrix. (However, it does require the aforementioned decomposition $a_{ij}=\delta_{ij}\xi_{ij}$, giving rise to the $p\leq 1/2$ restriction.) For the nets, we required the operator norm bound of Lemma 4.3; we claim the analogue of this statement for $A_{n}-p(J_{n}-I_{n})$ also holds. A straightforward modification of the proof of [9, Theorem 1.7] shows

[TABLE]

for some $K^{\prime},c^{\prime}>0$ . We obtain that Proposition 5.17 holds for $A_{n}-p(J_{n}-I_{n})$ , if $p\leq 1/2$ .
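The origin of the restriction $p\leq 1/2$ can be seen from a one-line computation. The sketch below assumes that the two Bernoulli factors in the decomposition are independent, which the text does not state explicitly.

```latex
% Assumption: \delta_{ij} and \xi_{ij} are independent.
\begin{align*}
\mathbb{P}(\delta_{ij}\xi_{ij}=1)
  = \mathbb{P}(\delta_{ij}=1)\,\mathbb{P}(\xi_{ij}=1)
  = 2p\cdot\tfrac{1}{2}
  = p .
\end{align*}
```

The product is therefore Bernoulli with parameter $p$, but a Bernoulli variable with parameter $2p$ exists only when $2p\leq 1$, that is, when $p\leq 1/2$.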

Additionally, we need a slight generalization of Proposition 5.17, which lower bounds not just $\|(A_{n}-p(J_{n}-I_{n})-\lambda)v\|_{2}$ , but

[TABLE]

for any fixed vector $x$ . This generalization holds because the high probability lower bounds used to prove Proposition 5.17 come from Proposition 5.5, and the latter proposition concerns Lévy concentration, which is by definition translation invariant.

We now turn to the proof of Theorem 2.6.

Proof of Theorem 2.6.

Above, we established that the analogue of Proposition 5.17 holds for $A_{n}-p(J_{n}-I_{n})$ , if $p\leq 1/2$ . This restriction motivates the following division into cases.

Case I: $p\leq 1/2$. Our preliminary goal is to establish that Proposition 5.18 holds for $A_{n}$. We have

[TABLE]

where ${\bf 1}$ is the vector $(1,\dots,1)$ of all ones. Set $\mathcal{X}_{n}=\{\kappa\cdot{\bf 1}\colon\kappa\in[-pn,pn]\}$ . Let $\mathcal{B}$ be a $c_{\ref{p:smallLCDallLevels}}\varepsilon_{0}\rho^{\prime}\sqrt{pn}$ -net of $\mathcal{X}_{n}$ such that

[TABLE]

For $x,x^{\prime}\in\mathcal{X}_{n}$ , the reverse triangle inequality yields

[TABLE]

so any $(A_{n}-p(J_{n}-I_{n})-\lambda)v-y$ with $y\in\mathcal{X}_{n}$ can be well approximated by $(A_{n}-p(J_{n}-I_{n})-\lambda)v-x$ for some $x\in\mathcal{B}$ .

Define

[TABLE]

By (6.15), a union bound over the net $\mathcal{B}$ , and the analogue of Proposition 5.17 for (6.12) stated above, we obtain

[TABLE]

for any single $\lambda\in[-K^{\prime}\sqrt{pn},K^{\prime}\sqrt{pn}]$ . After observing that

[TABLE]

we find

[TABLE]

Using (6.18) in place of Proposition 5.17 in the proof of Proposition 5.18, we find that Proposition 5.18 holds for $A_{n}$ in place of $M_{n}$ .

We can now repeat the proof of Theorem 2.2 to prove the theorem in this case, with the appropriate analogues for $A_{n}$ substituting for Proposition 5.18 and Proposition 4.6. (The latter was noted at the beginning of Section 6.2.)

Case II: $p>1/2$ . Observe that the adjacency matrix $A_{n}(p)$ of $G(n,p)$ is equal in distribution to $J_{n}-I_{n}-A_{n}(1-p)$ . Hence controlling

[TABLE]

is equivalent to controlling

[TABLE]

This reduces the problem to Case I and completes the proof. ∎
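The distributional identity behind Case II is graph complementation: each off-diagonal entry $1-a_{ij}$ of $J_{n}-I_{n}-A_{n}$ is Bernoulli with parameter $1-p$, and independence is preserved. The sketch below checks the deterministic part of this statement on a sampled graph using only the standard library. The helper names `adjacency` and `complement` are hypothetical.

```python
import random


def adjacency(n, p, rng):
    """Sample the adjacency matrix of G(n, p) as a list of lists."""
    a = [[0] * n for _ in range(n)]
    for i in range(n):
        for j in range(i + 1, n):
            if rng.random() < p:
                a[i][j] = a[j][i] = 1
    return a


def complement(a):
    """Return J - I - A, the adjacency matrix of the complement graph."""
    n = len(a)
    # Off-diagonal entries become 1 - a[i][j]; the diagonal stays 0.
    return [[(0 if i == j else 1) - a[i][j] for j in range(n)]
            for i in range(n)]
```

Complementing twice returns the original matrix, which is why controlling the spectrum of $A_{n}(p)$ for $p>1/2$ can be traded for the same question about $A_{n}(1-p)$.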

Remark 6.2.

The size of the one-dimensional net $\mathcal{B}$ in (6.14) is compensated by the $\exp(-cn)$ error probability used for the union bound in (6.16). For general finite-rank perturbations by a finite linear combination of matrices of the form $n\cdot vv^{T}$ for $v\in\mathbb{S}^{n-1}$, one simply adds more one-dimensional nets and completes the argument in the same way. However, for perturbations whose rank grows even moderately quickly, the combined size of the necessary supplemental nets becomes too large.

6.3. Proof of Theorem 2.8

The following is essentially Lemma 9.1 of [48]. We provide the proof for completeness.

Lemma 6.3.

For any $A>0$ there exists $B=B(A)>0$ such that the following holds with probability at least $1-O(n^{-A})$. If there exist $\lambda\in\mathbb{R}$ and $v\in\mathbb{S}^{n-1}$ such that $\|(A_{n}-\lambda)v\|\leq n^{-B}$, then $A_{n}$ has an eigenvector $u_{i_{0}}\in\mathbb{S}^{n-1}$ and corresponding eigenvalue $\lambda_{i_{0}}$ such that

[TABLE]

Proof.

From our main result, Theorem 2.6, we may suppose that all eigenvalue gaps satisfy $|\lambda_{j}-\lambda_{i}|\geq n^{-B/2}$. Let $v=\sum c_{i}u_{i}$ express $v$ as a linear combination of unit eigenvectors of $A_{n}$. There must exist $i_{0}$ such that $|c_{i_{0}}|\geq n^{-1/2}$. So

[TABLE]

implies, assuming $\|(A_{n}-\lambda)v\|\leq n^{-B}$ , that $|\lambda-\lambda_{i_{0}}|\leq n^{-B+1/2}$ . This implies the first conclusion. Then because all gaps satisfy $|\lambda_{j}-\lambda_{i}|\geq n^{-B/2}$ we have that $|\lambda-\lambda_{i}|\geq n^{-B/2}/2$ for all $i\neq i_{0}$ . But then we must have $|c_{i}|=O(n^{-B/2})$ for $i\neq i_{0}$ , implying the second conclusion. ∎
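The two conclusions of this proof can be traced through the eigenbasis expansion. The following is a sketch of the computation under the hypotheses stated in the proof, with $v=\sum_{i}c_{i}u_{i}$, $\sum_{i}c_{i}^{2}=1$, and $|c_{i_{0}}|\geq n^{-1/2}$.

```latex
\begin{align*}
\|(A_{n}-\lambda)v\|_{2}^{2}
  &= \sum_{i} c_{i}^{2}(\lambda_{i}-\lambda)^{2}
   \geq c_{i_{0}}^{2}(\lambda_{i_{0}}-\lambda)^{2}
   \geq n^{-1}(\lambda_{i_{0}}-\lambda)^{2}, \\
|\lambda-\lambda_{i_{0}}|
  &\leq n^{1/2}\,\|(A_{n}-\lambda)v\|_{2}
   \leq n^{-B+1/2}, \\
|c_{i}|
  &\leq \frac{\|(A_{n}-\lambda)v\|_{2}}{|\lambda_{i}-\lambda|}
   \leq \frac{n^{-B}}{n^{-B/2}/2}
   = 2n^{-B/2}
   \qquad (i\neq i_{0}).
\end{align*}
```

The last line uses $|\lambda-\lambda_{i}|\geq n^{-B/2}/2$ for $i\neq i_{0}$, which follows from the gap assumption once $n^{-B+1/2}\leq n^{-B/2}/2$.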

Proof of Theorem 2.8.

We follow the proof of Theorem 3.3 in [48]. After adjusting $C$ by adding $1$ , it suffices to prove the claim for a single coordinate and use a union bound. Write $A=A_{n}$ and let its first column be $(a_{11},X)$ where $X$ is a vector of $n-1$ coordinates. Let $v=(v_{1},v^{\prime})$ be an eigenvector with eigenvalue $\lambda$ so that

[TABLE]

Suppose that $|v_{1}|\leq n^{-D}$ where $D$ will be chosen later. By taking $D$ large enough, using that the entries of $A$ are bounded, and adding $O(n^{-D})$ mass to the first component of $v^{\prime}$ to make it unit norm, it suffices to show that

[TABLE]

occur jointly with low probability. By Lemma 6.3, if the first condition holds then there exists an eigenvector $u^{\prime}$ of $A_{n-1}$ with $\|u^{\prime}-v^{\prime}\|_{2}\leq n^{-D/8}$. Then $|(v^{\prime})^{T}X|\leq n^{-D/2}$ implies $|(u^{\prime})^{T}X|\leq n^{-D/16}$. We claim this contradicts a statement established in the proof of Theorem 2.2.

In (6.9) and the following lines, we showed

[TABLE]

where $\delta$ was defined below (6.10) (in terms of $\hat{\delta}$ ). Now we take $\delta=n^{-D/16}/\rho^{3}\sqrt{p}$ , $\alpha=(np)^{-1/(7+\nu)}$ and $p>C\log^{7+\nu}(n)/n$ , which proves the theorem after taking $D$ large enough. ∎

Bibliography

[1] Yonathan Aflalo, Alex Bronstein, and Ron Kimmel. Graph matching: relax or not? arXiv preprint arXiv:1401.7623, 2014.
[2] Amol Aggarwal. Bulk universality for generalized Wigner matrices with few moments. Probability Theory and Related Fields, 173(1-2):375–432, 2019.
[3] Amol Aggarwal, Patrick Lopatto, and Horng-Tzer Yau. GOE statistics for Lévy matrices. arXiv preprint arXiv:1806.07363, 2018.
[4] Sanjeev Arora and Aditya Bhaskara. Eigenvectors of random graphs: delocalization and nodal domains. https://theory.epfl.ch/bhaskara/files/deloc.pdf, 2011.
[5] Enrico Au-Yeung. Sparse signal recovery using a new class of random matrices. Adv. Pure Appl. Math., 8(2):79–89, 2017.
[6] László Babai, D. Yu. Grigoryev, and David Mount. Isomorphism of graphs with bounded eigenvalue multiplicity. In Proceedings of the fourteenth annual ACM symposium on Theory of computing, pages 310–324. ACM, 1982.
[7] Bubacarr Bah and Jared Tanner. On construction and analysis of sparse random matrices and expander graphs with applications to compressed sensing. arXiv preprint arXiv:1307.6477, 2013.
[8] Grey Ballard, Aydin Buluc, James Demmel, Laura Grigori, Benjamin Lipshitz, Oded Schwartz, and Sivan Toledo. Communication optimal parallel multiplication of sparse random matrices. In Proceedings of the twenty-fifth annual ACM symposium on Parallelism in algorithms and architectures, pages 222–231. ACM, 2013.
