The strong circular law: a combinatorial view

Vishesh Jain

arXiv:1904.11108·math.PR·June 2, 2020

The strong circular law: a combinatorial view

Vishesh Jain

PDF

TL;DR

This paper introduces a simple, novel approach to analyze the least singular value of shifted random matrices, improving error bounds and extending inverse Littlewood-Offord theory to complex variables, thereby advancing understanding of the circular law.

Contribution

It provides a new method for bounding the least singular value of complex random matrices without additive combinatorics, extending inverse Littlewood-Offord results to complex variables.

Findings

01

Bound on the probability that the least singular value is small

02

Extension of inverse Littlewood-Offord theory to complex variables

03

Improved error rates over previous results

Abstract

Let $N_{n}$ be an $n \times n$ complex random matrix, each of whose entries is an independent copy of a centered complex random variable $z$ with finite non-zero variance $σ^{2}$ . The strong circular law, proved by Tao and Vu, states that almost surely, as $n \to \infty$ , the empirical spectral distribution of $N_{n} / (σ n)$ converges to the uniform distribution on the unit disc in $C$ . A crucial ingredient in the proof of Tao and Vu, which uses deep ideas from additive combinatorics, is controlling the lower tail of the least singular value of the random matrix $x I - N_{n} / (σ n)$ (where $x \in C$ is fixed) with failure probability that is inverse polynomial. In this paper, using a simple and novel approach (in particular, not using tools from additive combinatorics or any net arguments), we show that for any fixed matrix $M$ with operator norm at…

Equations222

Pr (s_{n} (M + N_{n}) \leq η) ≲ n^{C} η + exp (- n^{c}),

Pr (s_{n} (M + N_{n}) \leq η) ≲ n^{C} η + exp (- n^{c}),

μ_{n} (s, t) := \frac{1}{n} \cdot ∣ {k \in [n] ∣ ℜ (λ_{k}) \leq s; ℑ (λ_{k}) \leq t} ∣,

μ_{n} (s, t) := \frac{1}{n} \cdot ∣ {k \in [n] ∣ ℜ (λ_{k}) \leq s; ℑ (λ_{k}) \leq t} ∣,

μ_{\infty} (s, t) := \frac{1}{π} area {x \in C ∣ ∣ x ∣ \leq 1, ℜ (x) \leq s, ℑ (x) \leq t} .

μ_{\infty} (s, t) := \frac{1}{π} area {x \in C ∣ ∣ x ∣ \leq 1, ℜ (x) \leq s, ℑ (x) \leq t} .

Pr (s_{n} (M + N_{n}) \leq n^{- B}) ≲ n^{- A} .

Pr (s_{n} (M + N_{n}) \leq n^{- B}) ≲ n^{- A} .

Pr (s_{n} (M + N_{n}) \leq η) \leq C (n^{5/2} η + exp (- c n^{1/50})),

Pr (s_{n} (M + N_{n}) \leq η) \leq C (n^{5/2} η + exp (- c n^{1/50})),

C n η + C exp (- c n),

C n η + C exp (- c n),

ρ_{r, z} (v) := x \in C sup Pr (v_{1} z_{1} + \dots + v_{n} z_{n} \in B (x, r)),

ρ_{r, z} (v) := x \in C sup Pr (v_{1} z_{1} + \dots + v_{n} z_{n} \in B (x, r)),

v \in S^{n - 1} sup ρ_{c_{\ref l e mma : an t i co n ce n t r a t i o n}, z} (v) \leq 1 - c_{\ref l e mma : an t i co n ce n t r a t i o n} .

v \in S^{n - 1} sup ρ_{c_{\ref l e mma : an t i co n ce n t r a t i o n}, z} (v) \leq 1 - c_{\ref l e mma : an t i co n ce n t r a t i o n} .

Pr (∥ (M + N_{n}) v ∥_{2} \leq c_{\ref l e mma : in v er t ibi l i t y - s in g l e - v ec t or} n) \leq (1 - c_{\ref l e mma : in v er t ibi l i t y - s in g l e - v ec t or})^{n},

Pr (∥ (M + N_{n}) v ∥_{2} \leq c_{\ref l e mma : in v er t ibi l i t y - s in g l e - v ec t or} n) \leq (1 - c_{\ref l e mma : in v er t ibi l i t y - s in g l e - v ec t or})^{n},

ρ_{t, \sum_{j = 1}^{n} z_{j}} (1) \leq C_{\ref t hm : r o g oz in} t^{2} (j = 1 \sum n t_{j}^{4} (1 - ρ_{t_{j}, z_{j}} (1)))^{- 1/2},

ρ_{t, \sum_{j = 1}^{n} z_{j}} (1) \leq C_{\ref t hm : r o g oz in} t^{2} (j = 1 \sum n t_{j}^{4} (1 - ρ_{t_{j}, z_{j}} (1)))^{- 1/2},

Pr (C^{- 1} \leq ∣ z_{1} - z_{2} ∣ \leq C) \geq C^{- 1},

Pr (C^{- 1} \leq ∣ z_{1} - z_{2} ∣ \leq C) \geq C^{- 1},

ρ_{1, z} (w) \leq C_{\ref c l aim : an t i co n c - W} n^{- 0.495} .

ρ_{1, z} (w) \leq C_{\ref c l aim : an t i co n c - W} n^{- 0.495} .

ρ_{v_{z}, w_{j} z_{j}} (1) \leq ρ_{∣ w_{j} ∣ v_{z}, w_{j} z_{j}} (1) \leq ρ_{v_{z}, z_{j}} (1) \leq u_{z} .

ρ_{v_{z}, w_{j} z_{j}} (1) \leq ρ_{∣ w_{j} ∣ v_{z}, w_{j} z_{j}} (1) \leq ρ_{v_{z}, z_{j}} (1) \leq u_{z} .

ρ_{v_{z}, \sum_{j = 1}^{n} w_{j} z_{j}} (1) \leq \frac{C _{\ref t hm : r o g oz in}}{∣ supp ( w ) ∣ ( 1 - u _{z} )} .

ρ_{v_{z}, \sum_{j = 1}^{n} w_{j} z_{j}} (1) \leq \frac{C _{\ref t hm : r o g oz in}}{∣ supp ( w ) ∣ ( 1 - u _{z} )} .

LCD_{γ, α} (a) := θ \in C in f {∣ θ ∣ > 0 : dist (θ a, (Z + i Z)^{n}) < min {γ ∣ θ ∣∥ a ∥_{2}, α}} .

LCD_{γ, α} (a) := θ \in C in f {∣ θ ∣ > 0 : dist (θ a, (Z + i Z)^{n}) < min {γ ∣ θ ∣∥ a ∥_{2}, α}} .

δ \geq \frac{n ^{0.1} α}{LCD _{α, γ} ( a )},

δ \geq \frac{n ^{0.1} α}{LCD _{α, γ} ( a )},

ρ_{δ, z} (a) \leq C_{\ref t hm : L C D - co n t r o l s - s b p} (\frac{n δ}{γ} + exp (- C_{\ref t hm : L C D - co n t r o l s - s b p}^{- 1} α^{2})),

ρ_{δ, z} (a) \leq C_{\ref t hm : L C D - co n t r o l s - s b p} (\frac{n δ}{γ} + exp (- C_{\ref t hm : L C D - co n t r o l s - s b p}^{- 1} α^{2})),

∥ w ∥_{z}^{2} := E ∥ℜ {w (z_{1} - z_{2})} ∥_{R / Z}^{2},

∥ w ∥_{z}^{2} := E ∥ℜ {w (z_{1} - z_{2})} ∥_{R / Z}^{2},

ρ_{r, z} (v) \leq e^{π r^{2}} P_{z} (v) \leq e^{π r^{2}} \int_{C} exp (- i = 1 \sum n ∥ v_{i} ξ ∥_{z}^{2} /2 - π ∣ ξ ∣^{2}) d ξ .

ρ_{r, z} (v) \leq e^{π r^{2}} P_{z} (v) \leq e^{π r^{2}} \int_{C} exp (- i = 1 \sum n ∥ v_{i} ξ ∥_{z}^{2} /2 - π ∣ ξ ∣^{2}) d ξ .

P_{z} (v) := E_{x_{1}, \dots, x_{n}} exp (- π ∣ v_{1} x_{1} + \dots + v_{n} x_{n} ∣^{2}),

P_{z} (v) := E_{x_{1}, \dots, x_{n}} exp (- π ∣ v_{1} x_{1} + \dots + v_{n} x_{n} ∣^{2}),

ρ_{1, z} (v)^{2}

ρ_{1, z} (v)^{2}

\leq exp (2 π) P_{z} (v) P_{z} (i v)

\leq 2 exp (2 π) P_{z} (w)

\leq 2 exp (2 π) \int_{C} exp (- j = 1 \sum n (∥ v_{j} ξ ∥_{z}^{2} + ∥ i v_{j} ξ ∥_{z}^{2}) /2 - π ∣ ξ ∣^{2}) d ξ,

j = 1 \sum n (∥ v_{j} ξ ∥_{z}^{2} + ∥ i v_{j} ξ ∥_{z}^{2})

j = 1 \sum n (∥ v_{j} ξ ∥_{z}^{2} + ∥ i v_{j} ξ ∥_{z}^{2})

= E j = 1 \sum n (∥ℜ {v_{j} ξ (z_{1} - z_{2})} ∥_{R / Z}^{2} + ∥ℑ {v_{j} ξ (z_{1} - z_{2})} ∥_{R / Z}^{2})

= E [dist^{2} (v ξ (z_{1} - z_{2}), (Z + i Z)^{n})]

\displaystyle\geq\mathbb{E}\left[\text{dist}^{2}\left(\boldsymbol{v}\xi(z_{1}-z_{2}),(\mathbb{Z}+i\mathbb{Z})^{n}\right)\bigg{|}|z_{1}-z_{2}|\in[C_{z}^{-1},C_{z}]\right]C_{z}^{-1},

ρ_{1, z} (v)^{2}

ρ_{1, z} (v)^{2}

\leq 2 exp (2 π) ∣ y ∣ \in [C_{z}^{- 1}, C_{z}] sup \int_{C} exp (- C_{z}^{- 1} dist^{2} (v ξ y, (Z + i Z)^{n}) /2 - π ∣ ξ ∣^{2}) d ξ .

A := {ξ \in C ∣ dist (v ξ y_{0}, (Z + i Z)^{n}) \geq α /2} \cup {ξ \in C ∣ ∣ ξ ∣ \geq α},

A := {ξ \in C ∣ dist (v ξ y_{0}, (Z + i Z)^{n}) \geq α /2} \cup {ξ \in C ∣ ∣ ξ ∣ \geq α},

\int_{C} = \int_{A} + \int_{B} .

\int_{C} = \int_{A} + \int_{B} .

\int_{A} ≲ exp (- Ω_{C_{z}} (α^{2})),

\int_{A} ≲ exp (- Ω_{C_{z}} (α^{2})),

dist (a δ^{- 1} (ξ^{'} - ξ^{''}) y_{0}, (Z + i Z)^{n}) = dist (v (ξ^{'} - ξ^{''}) y_{0}, (Z + i Z)^{n}) < α .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

The strong circular law: a combinatorial view

Vishesh Jain Massachusetts Institute of Technology. Department of Mathematics. Email: [email protected].

Abstract

Let $N_{n}$ be an $n\times n$ complex random matrix, each of whose entries is an independent copy of a centered complex random variable $z$ with finite non-zero variance $\sigma^{2}$ . The strong circular law, proved by Tao and Vu, states that almost surely, as $n\to\infty$ , the empirical spectral distribution of $N_{n}/(\sigma\sqrt{n})$ converges to the uniform distribution on the unit disc in $\mathbb{C}$ .

A crucial ingredient in the proof of Tao and Vu, which uses deep ideas from additive combinatorics, is controlling the lower tail of the least singular value of the random matrix $xI-N_{n}/(\sigma\sqrt{n})$ (where $x\in\mathbb{C}$ is fixed) with failure probability that is inverse polynomial. In this paper, using a simple and novel approach (in particular, not using tools from additive combinatorics or any net arguments), we show that for any fixed matrix $M$ with operator norm at most $n^{3/4-\epsilon}$ and for all $\eta\geq 0$ ,

[TABLE]

where $s_{n}(M+N_{n})$ is the least singular value of $M+N_{n}$ and $C,c$ are absolute constants. Our result is optimal up to the constants $C,c$ and the inverse exponential-type error rate improves upon the inverse polynomial error rate due to Tao and Vu.

Our proof relies on an extension of the solution to the so-called counting problem in inverse Littlewood–Offord theory, developed by Ferber, Luh, Samotij, and the author, along with a novel ‘rounding trick’ based on controlling the $\infty\to 2$ operator norm of heavy-tailed random matrices.

1 Introduction

Let $N_{n}$ be an $n\times n$ complex random matrix, each of whose entries is an independent copy of a complex random variable $z$ with mean [math] and finite non-zero variance $\sigma^{2}$ . The empirical spectral distribution (ESD) $\mu_{n}$ of $N_{n}$ is defined on $\mathbb{R}^{2}$ by the expression

[TABLE]

where $\lambda_{1},\dots,\lambda_{n}$ denote the eigenvalues of $N_{n}/\sigma\sqrt{n}$ . The celebrated strong circular law of Tao and Vu [27] asserts that almost surely, as $n$ tends to infinity, $\mu_{n}$ converges uniformly to

[TABLE]

The circular law has a long history dating back to the 1950s when it was conjectured as a natural non-Hermitian counterpart to Wigner’s famous semi-circle law, and prior to Tao and Vu’s definitive work, many researchers obtained partial results requiring extra distributional assumptions on the random variable $z$ , and very often weakening the notion of convergence from almost sure convergence to convergence in probability (this is not just a technical point, and genuinely new ideas are required to obtain almost sure convergence; see the discussion in Section 2 of [24]). We refer the reader to the survey [4] and the references therein for a much more detailed discussion of the history of this problem.

In the case when we further assume that $z$ has $2+\eta$ moments for some $\eta>0$ , the approach of Bai [1], Bai and Silverstein [2], and Girko [10] reduces the problem to controlling the lower tail of the least singular value of the random matrix $xI-N_{n}/(\sigma\sqrt{n})$ , where $x\in\mathbb{C}$ is fixed; even when we assume that $z$ has only non-zero finite variance, controlling the lower tail of the least singular value of this random matrix is a fundamental step in Tao and Vu’s proof (recall that the least singular value of a complex matrix $M_{n}$ , denoted by $s_{n}(M_{n})$ , is the smallest eigenvalue of the positive semidefinite matrix $\sqrt{M_{n}^{\dagger}M_{n}}$ ). To this end, Tao and Vu [24] showed using sophisticated techniques from additive combinatorics that for any constants $A,C>0$ , there exists a constant $B>0$ such that for any $n\times n$ fixed (complex) matrix $M$ of operator norm at most $n^{C}$ ,

[TABLE]

The dependence of $B$ on $A$ and $C$ can be made explicit and was subsequently sharpened in [26]. Note that for the proof of the circular law, fixing $C=O(\sqrt{n})$ is sufficient.

Our goal in the present work is to provide a simple and elementary proof of a quantitative strengthening of Equation 1 in the setting of the strong circular law. More precisely, we show:

Theorem 1.1.

Let $z$ be a complex random variable with mean [math] and variance $1$ and let $N_{n}$ be an $n\times n$ random matrix, each of whose entries is an independent copy of $z$ . Let $M$ be a fixed complex matrix with operator norm at most $n^{0.51}$ . Then, for all $\eta\geq 0$ ,

[TABLE]

where $C,c$ are constants depending only on $z$ .

Remark 1.2.

(1) In the above theorem, the choice of the power $n^{0.51}$ is arbitrarily made for convenience and could be replaced by $n^{0.75-\epsilon}$ for any $\epsilon>0$ ; in follow-up work of the author [12] which builds on some of the ideas in this paper, we will show (using a more complicated proof) how to obtain a bound on the lower tail of $M+N_{n}$ even if $\|M\|=O(\exp(n^{c}))$ .

(2) We have not tried to optimize any of the constants $C,c,5/2,1/50$ , but note here that with additional work, the constant $5/2$ can be replaced by the nearly optimal value $1/2+\epsilon$ for any $\epsilon>0$ in the case when $\|M\|=O(\sqrt{n})$ . Compared to Equation 1, our bound is closer to (optimal) bounds of the form

[TABLE]

which have been obtained under stronger assumptions: for the case when $z$ is a real subgaussian random variable and $\|M\|=O(\sqrt{n})$ in the landmark work of Rudelson and Vershynin [20], and for the case when $z$ is a real random variable and (much more restrictively) $M=0$ by Rebrova and Tikhomirov [18].

(3) We impose no constraints on the relationship between the real and imaginary parts of $z$ (for instance, in [14], the real and imaginary parts are required to be i.i.d. subgaussian). In this generality, our result (along with a follow-up result of the author [12]) is the only one showing that a random matrix, each of whose entries is an independent copy of a complex random variable of mean [math] and variance $1$ , is singular with probability at most $\exp(-n^{c})$ for some $c>0$ . Previously, this was not known, even if we further assume that the complex random variables are subgaussian – the geometric machinery, pioneered by Rudelson and Vershynin [20], runs into the obstacle that the real dimension of the complex unit sphere is $2n-1$ (see the discussion in Section 10 of [23]).

Apart from the quantitative strengthening of Equation 1, we believe that our result is also interesting for the simplicity of the proof techniques, making use only of some standard Fourier analytic techniques along with elementary combinatorial ideas. In particular, in contrast to previous works in this area, we make no use of tools from additive combinatorics or net arguments. Parts of our proof which we believe may be of independent interest are the complex anti-concentration inequality Theorem 2.11 and Proposition 3.3, which shows how to bypass the lack of control over the $2\to 2$ operator norm in our setting by means of the $\infty\to 2$ -norm. We hope that some of the ideas introduced in this work can aid in proving strong circular laws in other contexts such as [3, 5] where only weak circular laws are known so far.

**Organization: **The rest of this paper is organized as follows. In Section 2, we collect some auxiliary results needed for the proof of our main theorem – the key results here are Theorem 2.11, Theorem 2.14, and Proposition 2.16 (proved in Appendix A). In Section 3, we prove Theorem 1.1 by combining these results. The key ingredient there is Proposition 3.3.

**Notation: ** Throughout the paper, we will omit floors and ceilings when they make no essential difference. For convenience, we will also say ‘let $p=x$ be a prime’, to mean that $p$ is a prime between $x$ and $2x$ ; again, this makes no difference to our arguments. We will use $\mathbb{S}^{2n-1}$ to denote the set of unit vectors in $\mathbb{C}^{n}$ , $B(x,r)$ to denote the ball of radius $r$ centered at $x$ , and $\Re(\boldsymbol{v}),\Im(\boldsymbol{v})$ to denote the real and imaginary parts of a complex vector $\boldsymbol{v}\in\mathbb{C}^{n}$ . As is standard, we will use $[n]$ to denote the discrete interval $\{1,\dots,n\}$ . We will also use the asymptotic notation $\lesssim,\gtrsim,\ll,\gg$ to denote $O(\cdot),\Omega(\cdot),o(\cdot),\omega(\cdot)$ respectively. For a matrix $M$ , we will use $\|M\|$ to denote its standard $\ell^{2}\to\ell^{2}$ operator norm. All logarithms are natural unless noted otherwise.

**Acknowledgements: **I am indebted to Nick Cook for the suggestion to consider the complex setting, as well as very helpful discussions around the circular law. I would also like to thank Kyle Luh for pointing out that a statement similar to Proposition 2.16 also appears in [19], Hoi Nguyen for discussing his work [16], and Elizaveta Rebrova for discussing her work [18].

2 Tools and auxiliary results

In this section, we collect some preliminary results which will be used in the proof of Theorem 1.1.

2.1 Anti-concentration

The goal of the theory of anti-concentration is to obtain upper bounds on the Lévy concentration function, defined as follows.

Definition 2.1 (Lévy concentration function).

Let $z$ be an arbitrary complex random variable and let $\boldsymbol{v}:=(v_{1},\dots,v_{n})\in\mathbb{C}^{n}$ . We define the Lévy concentration function of $\boldsymbol{v}$ at radius $r$ with respect to $z$ by

[TABLE]

where $z_{1},\dots,z_{n}$ are independent copies of $z$ .

Remark 2.2.

In particular, note that $\rho_{r,z}(1)=\sup_{x\in\mathbb{C}}\Pr(z\in B(x,r))$ . We will use this notation repeatedly.

The next lemma shows that weighted sums of random variables with finite non-zero variance are not too close to being a constant.

Lemma 2.3.

*(see, e.g., Lemma 6.3 in [26])

Let $z$ be a complex random variable with finite non-zero variance. Then, there exists a constant $c_{\ref{lemma:anticoncentration}}\in(0,1)$ depending only on $z$ such that for any $\boldsymbol{v}\in\mathbb{S}^{2n-1}$ ,*

[TABLE]

Combining this with the so-called tensorization lemma (see Lemma 2.2 in [20]), we get the following standard estimate for ‘invertibility with respect to a single vector’.

Lemma 2.4.

Let $z$ be a complex random variable with finite non-zero variance. Let $M$ be an arbitrary $n\times n$ complex matrix and let $N_{n}$ be an $n\times n$ complex random matrix each of whose entries is an independent copy of $z$ . Then, for any fixed $\boldsymbol{v}\in\mathbb{S}^{2n-1}$ ,

[TABLE]

where $c_{\ref{lemma:invertibility-single-vector}}\in(0,1)$ is a constant depending only on $z$ .

The next classical lemma, due to Esseen, is a generalization (up to constants) of the Erdős-Littlewood-Offord anti-concentration inequality.

Lemma 2.5 (Theorem 2 in [6]).

Let $z_{1},\dots,z_{n}$ be jointly independent complex random variables and let $t_{1},\dots,t_{n}$ be some positive real numbers. Then, for any $t\geq\max_{j}t_{j}$ , we have

[TABLE]

where $C_{\ref{thm:rogozin}}\geq 1$ is an absolute constant.

The next definition isolates a convenient property of the random variables we consider in this paper.

Definition 2.6.

We say that a complex random variable $z$ is $C$ -good if

[TABLE]

where $z_{1}$ and $z_{2}$ denote independent copies of $z$ . The smallest $C\geq 1$ with respect to which $z$ is $C$ -good will be denoted by $C_{z}$ .

Indeed, it is straightforward to see that complex random variables with finite non-zero variance are $C$ -good for some finite $C$ , so that there is no loss of generality for us in imposing this additional restriction.

Observation 2.7.

Let $z$ be a complex random variable with variance $1$ . Then, $z$ is $C_{z}$ -good for some $C_{z}\geq 1$ .

We conclude this subsection with the following consequence of Lemma 2.5.

Lemma 2.8.

Let $z$ be a complex random variable with variance $1$ . There exists a constant $C_{\ref{claim:anticonc-W}}\geq 1$ depending only on $z$ such that for all $\boldsymbol{w}:=(w_{1},\dots,w_{n})\in(\mathbb{Z}+i\mathbb{Z})^{n}$ with support of size at least $n^{0.99}$ ,

[TABLE]

Proof.

As above, we know that $\rho_{v_{z},z}(1)\leq u_{z}$ for some $u_{z},v_{z}\in(0,1)$ . Therefore, for all $j\in{\bf supp}(\boldsymbol{w})$ ,

[TABLE]

Hence, by Lemma 2.5,

[TABLE]

Since $|{\bf supp}(\boldsymbol{w})|\geq n^{0.99}$ , and since $\rho_{1,z}(\boldsymbol{w})=O(v_{z}^{-2}\rho_{v_{z},\sum_{j=1}^{n}w_{j}z_{j}}(1))$ (since any ball in the complex plane of radius $1$ can be covered by $O(v_{z}^{-2})$ balls of radius $v_{z}$ ) , the desired conclusion follows. ∎

2.2 The Least Common Denominator

The proof of Theorem 1.1 will be based on a ‘rounding argument’ which extracts a ‘not-too-large’ Gaussian integer vector certifying that the least singular value of a complex matrix is small (see [11] for the most basic version of this argument). For this, we will use (albeit in a quite different manner from Rudelson and Vershynin) the notion of the Least Common Denominator (LCD) of a vector, and its connection to the Lévy concentration function, as developed in [20].

Remark 2.9.

Our definition of the LCD is different from the ones appearing in the literature for the complex case, and has been made keeping in mind our application to rounding vectors.

Definition 2.10 (Least Common Denominator (LCD)).

Let $\boldsymbol{a}\in\mathbb{C}^{n}\setminus\{\boldsymbol{0}\}$ . For $\gamma\in(0,1)$ and $\alpha>0$ , define

[TABLE]

Note that the requirement that the distance is smaller than $\gamma|\theta|\|\boldsymbol{a}\|_{2}$ forces us to consider only non-trivial Gaussian integer points as approximations of $\theta\boldsymbol{a}$ .

The following theorem shows that vectors with large LCD have small Lévy concentration function on scales which are larger (up to some small polynomial losses) than $\Omega(1/\text{LCD})$ .

Theorem 2.11.

Let $z$ denote a $C_{z}$ -good complex random variable. Then, for every $\boldsymbol{a}\in\mathbb{S}^{2n-1}$ , for every $\alpha\in(0,\sqrt{n}),\gamma\in(0,1)$ , and for

[TABLE]

we have

[TABLE]

where $C_{\ref{thm:LCD-controls-sbp}}\geq 1$ is a constant depending only on $C_{z}$ .

A more precise version of this theorem appears for real random variables in [21]. Actually, a version for complex random variables is also stated there although, as noted above, their definition of $\operatorname{LCD}$ is different from ours. The proof of Theorem 2.11 follows from standard Fourier analytic arguments for the real case (in particular, we will use a crude version of the argument of Friedland and Sodin in [9]) once we use a novel ‘doubling trick’.

We will need a preliminary result from Section 4 of [24].

Definition 2.12.

Let $z$ be an arbitrary complex random variable. For any $w\in\mathbb{C}$ , we define

[TABLE]

where $z_{1},z_{2}$ denote i.i.d. copies of $z$ and $\|\cdot\|_{\mathbb{R}/\mathbb{Z}}$ denotes the distance to the nearest integer.

Lemma 2.13 (Lemma 5.2 in [24]).

Let $\boldsymbol{v}:=(v_{1},\dots,v_{n})\in\mathbb{C}^{n}$ and let $z$ be an arbitrary complex random variable. Then,

[TABLE]

Here,

[TABLE]

where $x_{1},\dots,x_{n}$ are i.i.d. copies of $(z_{1}-z_{2})\cdot\text{Ber}(1/2)$ , with $z_{1},z_{2}$ distributed as $z$ , and $\text{Ber}(1/2),z_{1},z_{2}$ mutually independent.

Proof of Theorem 2.11.

Since $\rho_{\delta,z}(\boldsymbol{a})=\rho_{1,z}(\delta^{-1}\boldsymbol{a})$ , it suffices to bound $\rho_{1,z}(\boldsymbol{v})$ for $\boldsymbol{v}:=\delta^{-1}\boldsymbol{a}$ . Let $\boldsymbol{w}\in\mathbb{C}^{2n}$ denote the vector whose first $n$ components are $\boldsymbol{v}$ and last $n$ components are $i\boldsymbol{v}$ . Then, we have

[TABLE]

where the first line uses $\rho_{1,z}(\boldsymbol{v})=\rho_{1,z}(i\boldsymbol{v})$ , the second line is due to Lemma 2.13, the third line follows from Lemma 4.5(iii) in [24], and the last line is again due to Lemma 2.13.

Next, note that

[TABLE]

where the final inequality follows from the $C_{z}$ -goodness of $z$ .

Therefore, from Jensen’s inequality, we get that

[TABLE]

Now, fix $y_{0}\in\mathbb{C}$ with $|y_{0}|\in[C_{z}^{-1},C_{z}]$ ; we will obtain a uniform (in $y_{0}$ ) upper bound on the integral appearing in Equation 3. Let

[TABLE]

let $B:=\mathbb{C}\setminus A=B(0,\alpha)\setminus A$ , and split the integral above as

[TABLE]

Since

[TABLE]

it only remains to bound $\int_{B}$ .

For this, we begin by noting that if $\xi^{\prime},\xi^{\prime\prime}\in B$ , then by the triangle inequality and the lattice structure of the Gaussian integers,

[TABLE]

Hence, by the definition of $\operatorname{LCD}_{\gamma,\alpha}(\boldsymbol{a})$ , we have one of two possibilities: either

[TABLE]

or

[TABLE]

It follows that $B$ is contained in a union of balls of radius $C_{z}\sqrt{n}\delta/\gamma$ whose centers are separated by at least $\delta\operatorname{LCD}_{\gamma,\alpha}(\boldsymbol{a})/C_{z}$ . Each such ball can contribute at most $\pi C_{z}^{2}n\delta^{2}/\gamma^{2}$ to the integral, and since $\delta\operatorname{LCD}_{\gamma,\alpha}(\boldsymbol{a})\gg\alpha$ , there is at most one such ball in $B$ . It follows that

[TABLE]

Finally, combining the estimates on $\int_{A}$ and $\int_{B}$ and using Equation 3 completes the proof. ∎

2.3 The counting problem in inverse Littlewood-Offord theory

The inverse Littlewood-Offord problem, posed by Tao and Vu [29], asks for the underlying reason that the Lévy concentration function of a vector $\boldsymbol{v}\in\mathbb{C}^{n}$ can be large. Using deep Frieman-type results from additive combinatorics, they showed that, roughly speaking, the only reason for this to happen is that most of the coordinates of the vector $\boldsymbol{v}$ belong to a generalized arithmetic progression (GAP) of ‘small rank’ and ‘small volume’. Their results [29, 25] were subsequently sharpened by Nguyen and Vu [16], who proved an ‘optimal inverse Littlewood–Offord theorem’. We refer the reader to the survey [17] and the textbook [28] for complete definitions and statements, and much more on both forward and inverse Littlewood-Offord theory.

Recently, motivated by applications, especially those in random matrix theory, the following counting variant of the inverse Littlewood–Offord problem was isolated in work [8] of the author along with Ferber, Luh, and Samotij: for how many vectors $\boldsymbol{a}$ in a given collection $\mathcal{A}\subseteq\mathbb{Z}^{n}$ is $\rho_{1,z}(\boldsymbol{a})$ greater than some prescribed value, where $z$ is a symmetric Bernoulli random variable? Indeed, the inverse Littlewood-Offord theorems are typically used precisely through such counting corollaries [17], and one of the main contributions of [8] (see Theorem 1.7 there) was to show that one may obtain useful bounds for the counting variant of the inverse Littlewood-Offord problem directly, without providing a precise structural characterization like Tao-Vu. In fact, since this approach is not hampered by losses coming from the black-box application of various theorems from additive combinatorics, it provides quantitatively better bounds, and this was used in [8, 7, 11] to provide quantitative improvements for several problems in combinatorial random matrix theory.

A crucial ingredient in our proof will be the following extension of Theorem 1.7 of [8] due to the author

Theorem 2.14 (Theorem 1.3 in [12]).

Let $z$ be a $C_{z}$ -good random variable. For $\rho\in(0,1)$ (possibly depending on $n$ ), let

[TABLE]

There exists a constant $C_{\ref{thm:counting-continuous}}\geq 1$ , depending only on $C_{z}$ , for which the following holds. Let $n,s,k\in\mathbb{N}$ with $1000C_{z}\leq k\leq\sqrt{s}\leq s\leq n/\log{n}$ . If $\rho\geq C_{\ref{thm:counting-continuous}}\max\left\{e^{-s/k},s^{-k/4}\right\}$ and $p$ is an odd prime such that $2^{n/s}\geq p\geq C_{\ref{thm:counting-continuous}}\rho^{-1}$ , then

[TABLE]

where $\varphi_{p}$ denotes the natural map from $(\mathbb{Z}+i\mathbb{Z})^{n}\to(\mathbb{F}_{p}+i\mathbb{F}_{p})^{n}$ .

Remark 2.15.

The inverse Littlewood-Offord theorems may be used to deduce similar statements, provided we further assume that $\rho\geq n^{-C}$ for some constant $C>0$ . It is the freedom of taking $\rho$ to be much smaller which allows us to obtain the exponential-type rate in Theorem 1.1.

2.4 Norms of large projections of random matrices

The key difficulty with extending the geometric techniques of Rudelson and Vershynin [20, 22]) to the setting when the random variables have heavy tails is the lack of control on the operator norm of the random matrix. For our techniques, the following proposition will turn out to be an appropriate substitute for controlling the operator norm.

For a subset $I\subseteq[n]$ , let $P_{I}:\mathbb{C}^{n}\to\mathbb{C}^{n}$ denote the orthogonal projection onto the subspace spanned by the vectors $\{e_{i}:i\in I\}$ . We have:

Proposition 2.16.

Let $N_{n}:=(m_{ij})$ be an $n\times n$ complex random matrix with i.i.d. entries, each with mean [math] and variance $1$ . For $\epsilon,\delta\in(0,1/2)$ with $\delta\geq 4\epsilon$ , there exists $C_{\ref{prop:operator-norm-control}}(\epsilon)\geq 1$ such that, except with probability at most $C_{\ref{prop:operator-norm-control}}(\epsilon)\exp\left(-{n^{1-\epsilon}}/{8}\right)$ , the following hold.

There exists $I\subseteq[n]$ with $|I|\geq n-2n^{1-\epsilon}$ such that

[TABLE] 2. 2.

For every $J\subseteq[n]$ with $|J|=n^{1-\delta}$ , there exists some $I(J)\subseteq[n]$ such that $|I(J)|\geq n-2n^{1-\epsilon}$ , and

[TABLE]

Remark 2.17.

A statement similar to the one above, and with some common proof ideas, already appears in the work of Rebrova and Vershynin [19]. In that work, the primary interest is in obtaining optimal bounds on the restricted operator norms and consequently, the proofs are much more involved. In contrast, we do not require such optimal bounds for our application, and are therefore able to give a much shorter proof of the above proposition.

The complete proof of this proposition is deferred to Appendix A.

3 Proof of Theorem 1.1

In this section, we will take $\alpha:=n^{1/100}$ and $\gamma:=n^{-1/2}$ . Moreover, since Theorem 1.1 is trivially true for $\eta\geq n^{-2}$ , we will henceforth assume that $2^{-n^{0.0001}}\leq\eta<n^{-2}$ . Recall that $M$ is a fixed $n\times n$ matrix with operator norm at most $n^{0.51}$ ; we set $M_{n}:=M+N_{n}$ .

We decompose $\mathbb{S}^{2n-1}$ into $\Gamma^{1}(\eta)\cup\Gamma^{2}(\eta)$ , where

[TABLE]

and $\Gamma^{2}(\eta):=\mathbb{S}^{2n-1}\setminus\Gamma^{1}(\eta)$ . Accordingly, we have

[TABLE]

Therefore, Theorem 1.1 follows from the following two propositions and the union bound.

Proposition 3.1.

$\Pr\left(\exists\boldsymbol{a}\in\Gamma^{1}(\eta):\|M_{n}\boldsymbol{a}\|_{2}\leq\eta\right)\leq 2nC_{\ref{thm:LCD-controls-sbp}}\left(n^{3/2}\eta+\exp(-C_{\ref{thm:LCD-controls-sbp}}^{-1}n^{1/50})\right)$ .

Proposition 3.2.

$\Pr\left(\exists\boldsymbol{a}\in\Gamma^{2}(\eta):\|M_{n}\boldsymbol{a}\|_{2}\leq\eta\right)\leq C_{\ref{prop:eliminate-small-LCD}}\left(e^{-n^{0.98}}+\exp(-c_{\ref{prop:eliminate-small-LCD}}n)\right),$ * where $C_{\ref{prop:eliminate-small-LCD}}\geq 1$ and $c_{\ref{prop:eliminate-small-LCD}}>0$ are constants depending only on $z$ .*

The proof of Proposition 3.1 is relatively simple, and follows from a conditioning argument developed in [13], once we observe the crucial fact (Theorem 2.11) that for any $\boldsymbol{a}\in\Gamma^{1}(\eta)$ ,

[TABLE]

for all $\delta\geq\eta$ . We defer the (by now standard) details to Appendix B.

The remainder of this section is devoted to the proof of Proposition 3.2.

3.1 Reduction to Gaussian integer vectors

Let $\mathcal{K}:=\{K\subseteq[n]:|K|\geq n-6n^{0.99}\}$ and

[TABLE]

As a first and crucial step towards the proof of Proposition 3.2, we will prove the following:

Proposition 3.3.

With notation as above,

[TABLE]

where $C_{\ref{prop:reduction-to-integer-heavy-tailed}}\geq 1$ is an absolute constant.

Remark 3.4.

As we will see shortly, the crucial point in the above proposition is that $n^{0.21}\ll n^{1/2-\epsilon}$ and $n^{0.711}\ll n^{0.75-\epsilon}$ .

Proof.

Let $\epsilon=0.01$ , $\delta_{1}=0.2$ , and $\delta_{2}=0.6$ . Let $\mathcal{G}$ denote the event appearing in the conclusion of Proposition 2.16 for $(\epsilon,\delta_{1})$ and $(\epsilon,\delta_{2})$ simultaneously. Since $\Pr(\mathcal{G}^{c})\leq 2C_{\ref{prop:operator-norm-control}}(0.01)\exp(-n^{0.99}/8)$ , we may restrict ourselves to the event $\mathcal{G}$ .

Let $\boldsymbol{a}\in\Gamma^{2}(\eta)$ . Then, by definition, there exists some $\theta\in\mathbb{C}$ with $0<|\theta|\leq\operatorname{LCD}_{\alpha,\gamma}(\boldsymbol{a})\leq n^{3/4}\eta^{-1}$ and some $\boldsymbol{w}\in(\mathbb{Z}+i\mathbb{Z})^{n}\setminus\{\boldsymbol{0}\}$ such that $\|\theta\boldsymbol{a}-\boldsymbol{w}\|_{2}\leq\min\{\gamma|\theta|,\alpha\}$ . Note also that $\|\theta\boldsymbol{a}-\boldsymbol{w}\|_{\infty}\leq\min\{\gamma|\theta|,1\}$ . To leverage the control we have over various norms associated to the matrix $M_{n}$ , we decompose the ‘error’ vector $\theta\boldsymbol{a}-\boldsymbol{w}$ into a ‘small’ part (with respect to the $\ell^{\infty}$ -norm), a ‘sparse and small’ part, and a ‘very sparse’ part.

Accordingly, let $\boldsymbol{v}_{\text{sp}}\in\mathbb{C}^{n}$ denote the vector obtained by keeping the largest (in absolute value) $n^{0.4}$ coordinates of $\theta\boldsymbol{a}-\boldsymbol{w}$ , let $\boldsymbol{v}_{\text{ss}}$ denote the vector obtained by keeping the next $n^{0.8}-n^{0.4}$ largest coordinates of $\theta\boldsymbol{a}-\boldsymbol{w}$ , and let $\boldsymbol{v}_{\text{sm}}=\theta\boldsymbol{a}-\boldsymbol{w}-\boldsymbol{v}_{\text{sp}}-\boldsymbol{v}_{\text{ss}}$ . Then, we have that

[TABLE]

and

[TABLE]

Indeed, the first inequality is immediate from $\|\theta\boldsymbol{a}-\boldsymbol{w}\|_{\infty}\leq\min\{\gamma|\theta|,1\}$ , whereas the second inequality follows from

[TABLE]

Let $J_{1}\subseteq[n]$ denote the support of $\boldsymbol{v}_{\text{sp}}$ and let $J_{2}\subseteq[n]$ denote the support of $\boldsymbol{v}_{\text{sp}}+\boldsymbol{v}_{\text{ss}}$ . By extending these sets if need be, we may assume that $|J_{1}|=n^{0.4}$ and $|J_{2}|=n^{0.8}$ . Moreover, since we have restricted to $N_{n}\in\mathcal{G}$ , let $I\subseteq[n]$ denote a subset of size at least $n-2n^{1-\epsilon}$ with respect to which conclusion 1. of Proposition 2.16 holds.

Note that since $\|M\|\leq n^{0.51}$ , we have $\|MP_{J}\|_{\infty\to 2}\leq n^{0.51}\sqrt{|J|}$ for every $J\subseteq[n]$ . Therefore,

[TABLE]

and similarly,

[TABLE]

Then, from the triangle inequality, we have

[TABLE]

where the second line uses that $P_{J_{2}}\boldsymbol{v}_{\text{ss}}=\boldsymbol{v}_{\text{ss}}$ and $P_{J_{1}}\boldsymbol{v}_{\text{sp}}=\boldsymbol{v}_{\text{sp}}$ ; the fourth line uses the above estimates on the $\infty$ -to- $2$ norms and Equation 4, and the fifth line uses the parameter value $\gamma=n^{-1/2}$ .

Thus, if $\|M_{n}\boldsymbol{a}\|_{2}\leq\eta$ , it follows from the triangle inequality that

[TABLE]

where the fourth line follows since $\eta\ll n^{0.21}$ and $|\theta|\eta\leq n^{0.7}\ll n^{0.711}$ , and the last line follows since $\|\boldsymbol{w}\|_{2}\geq|\theta|(1-\gamma)\geq|\theta|/2$ . Since $|I(J_{1})^{c}\cup I(J_{2})^{c}\cup I^{c}|\leq|I(J_{1})^{c}|+|I(J_{2})^{c}|+|I^{c}|\leq 6n^{0.99}$ , we get the desired conclusion. ∎

In view of Proposition 3.3, it suffices to show the following in order to prove Proposition 3.2, and hence, complete the proof of Theorem 1.1.

Proposition 3.5.

$\Pr(\exists\boldsymbol{w}\in\boldsymbol{V}\text{ and }K\in\mathcal{K}:\|P_{K}M_{n}\boldsymbol{w}\|_{2}\leq C_{\ref{prop:reduction-to-integer-heavy-tailed}}\min\{n^{0.21}\|\boldsymbol{w}\|_{2},n^{0.711}\})\leq C_{\ref{prop:integers-heavy-tailed}}e^{-c_{\ref{prop:integers-heavy-tailed}}n},$ * where $C_{\ref{prop:integers-heavy-tailed}}\geq 1$ and $c_{\ref{prop:integers-heavy-tailed}}>0$ are constants depending only on $z$ .*

The proof of this proposition is the content of the next two subsections.

3.2 Dealing with sparse Gaussian integer vectors

Throughout this subsection and the next one, $p=2^{n^{0.001}}$ is a prime. Note, in particular, that $p\gg\eta^{-1}n^{3/4}$ . The proof of Proposition 3.5 proceeds in two steps. The first step is to show that the probability of the event appearing in Proposition 3.5 is small, provided we restrict ourselves only to sufficiently sparse Gaussian integer vectors. Let

[TABLE]

Lemma 3.6.

$\Pr\left(\exists\boldsymbol{w}\in\boldsymbol{S}\text{ and }K\in\mathcal{K}:\|P_{K}M_{n}\boldsymbol{w}\|_{2}\leq C_{\ref{prop:reduction-to-integer-heavy-tailed}}n^{0.21}\|\boldsymbol{w}\|_{2}\right)\leq C_{\ref{lemma:sparse-vectors-heavy-tails}}\exp(-c_{\ref{lemma:invertibility-single-vector}}n/4),$ * where $C_{\ref{lemma:sparse-vectors-heavy-tails}}\geq 1$ is an absolute constant.*

Proof.

By taking the union bound over all the at most $n\binom{n}{6n^{0.99}}\ll\exp(n^{0.991})$ choices of $K\in\mathcal{K}$ , it suffices to show that for a fixed $K_{0}\in\mathcal{K}$ ,

[TABLE]

for some absolute constant $C\geq 1$ . The number of vectors $\boldsymbol{w}\in\boldsymbol{S}$ is at most

[TABLE]

By Lemma 2.4 applied to the matrix $P_{K_{0}}M_{n}$ , for any such vector,

[TABLE]

Therefore, the union bound gives the desired conclusion. ∎

3.3 Dealing with non-sparse Gaussian integer vectors

It remains to deal with Gaussian integer vectors with support of size at least $n^{0.99}$ . Let

[TABLE]

Note that for our choice of parameters, the natural map

[TABLE]

is injective.

In view of Lemma 3.6, since $\eta\leq n^{-2}$ , and taking the union bound over all the at most $n\binom{n}{6n^{0.99}}\ll\exp(n^{0.991})$ choices of $K\in\mathcal{K}$ , the following proposition suffices to prove Proposition 3.5.

Proposition 3.7.

For all $K_{0}\in\mathcal{K}$ ,

[TABLE]

where $C_{\ref{prop:non-sparse-vectors-heavy-tails}}\geq 1$ and $c_{\ref{prop:non-sparse-vectors-heavy-tails}}>0$ are constants depending only on $z$ .

The proof of Proposition 3.7 is accomplished by a simple union bound. To execute this, we need the following preliminary claims.

Claim 3.8.

For all $\boldsymbol{w}\in\boldsymbol{W}$ , $\rho_{1,z}(\boldsymbol{w})\geq n^{-1/2}\eta^{4}/10$ .

Proof.

The random variable $\sum_{j=1}^{n}w_{j}\xi_{j}$ has mean [math] and variance at most $n\eta^{-8}$ . Therefore, by Markov’s inequality,

[TABLE]

Hence, by the pigeonhole principle, it follows that

[TABLE]

as desired. ∎

For the next claim, let

[TABLE]

Note that the previous claim along with Lemma 2.8 shows that $\boldsymbol{W}_{t}$ is nonempty only if $n^{-1/2}\eta^{4}/10\leq t\leq C_{\ref{claim:anticonc-W}}n^{-0.495}$ .

Claim 3.9.

For all $n^{-1/2}\eta^{4}/10\leq t\leq C_{\ref{claim:anticonc-W}}n^{-0.495}$ ,

[TABLE]

where $C_{\ref{claim:size-of-W_t}}\geq 1$ is a constant depending only on $C_{\ref{thm:counting-continuous}},C_{\ref{claim:anticonc-W}}$ .

Proof.

Fix $s=n^{0.997}$ and $k=n^{0.097}$ . Then, $1\ll k\leq\sqrt{s}\leq s\leq n/\log{n}$ , $n^{-1/2}\eta^{4}\gg\max\{e^{-s/k},s^{-k/4}\}$ , and $2^{n/s}\geq p\gg n^{1/2}\eta^{-4}$ . Hence, for large enough $n$ , the hypotheses of Theorem 2.14 are satisfied, so that

[TABLE]

where the first line follows from the injectivity of $\varphi_{p}$ on $\boldsymbol{W}$ , the third line follows from Theorem 2.14, and the last line follows since $t^{-1}\gg n^{0.49}$ . ∎

We now have all the ingredients to prove Proposition 3.7.

Proof of Proposition 3.7.

Let $D_{x}$ denote the unit polydisc in $\mathbb{C}^{n}$ centered at $x$ . For all $n$ sufficiently large, we have

[TABLE]

where the third line follows since the number of points of $(\mathbb{Z}+i\mathbb{Z})^{n}$ in $B(0,n^{0.712})$ is at most $(400n^{0.212})^{2n}$ , the fifth line follows from 3.9, and the seventh and eighth lines follow from the assumed bounds on $\eta$ . ∎

Appendix A Proof of Proposition 2.16

The proof will make use of the subgaussian concentration inequality, which we now recall.

Definition A.1.

A random variable $X$ is said to be $C$ -subgaussian if, for all $t>0$ ,

[TABLE]

Lemma A.2 (see, e.g., Corollary 5.17 in [30]).

There exists an absolute constant $C_{\ref{lemma:subgaussian-concentration}}>0$ with the following property. Let $X_{1},\dots,X_{n}$ be independent centered $\tilde{C}_{\xi}$ -subgaussian random variables. Then,

[TABLE]

We begin with a simple lemma showing that, with high probability, most rows of a random matrix with i.i.d. centered entries of finite variance have small $\ell_{1}$ and $\ell_{2}$ norms.

Lemma A.3.

Let $A:=(a_{ij})$ be an $n\times m$ complex random matrix with i.i.d. entries, each with mean [math] and variance $1$ . For $\epsilon\in(0,1/2)$ , let $I\subseteq[n]$ denote the (random) subset of coordinates such that for each $i\in I$ ,

[TABLE]

Then,

[TABLE]

Proof.

Since for each $i\in[n]$ ,

[TABLE]

it follows from Markov’s inequality that

[TABLE]

and

[TABLE]

Let $I_{1}\subseteq[n]$ denote the subset of coordinates such that for each $i\in I_{1}$ ,

[TABLE]

and let $I_{2}\subseteq[n]$ denote the subset of coordinates such that for each $i\in I_{2}$ ,

[TABLE]

Since the rows of the matrix are independent, it follows from the standard Chernoff bound that for $k\in\{1,2\}$

[TABLE]

Hence, by the union bound,

[TABLE]

except with probability at most $2\exp\left(-\frac{n^{1-\epsilon}}{4}\right).$ ∎

The next proposition controls the $\infty\to 2$ operator norm of a random matrix with i.i.d. entries, conditioned on no row having $\ell_{1}$ or $\ell_{2}$ norm which is ‘too large’, and essentially appears as Proposition 3.10 in [18]. Since our statement uses somewhat different parameters than in [18], we provide a complete proof below for the reader’s convenience.

Proposition A.4.

Fix $\epsilon\in(0,1/2)$ . Let $B:=(b_{ij})$ be a fixed $n\times m$ complex matrix, with $0.9n\leq m\leq 1.1n$ , such that the $\ell_{2}$ norm of every row is at most $n^{\epsilon}\sqrt{m}$ and such that for all $i\in[n]$ ,

[TABLE]

Let $\pi_{1},\dots,\pi_{n}$ be independent random permutations uniformly distributed on the symmetric group $S_{m}$ , and let $\tilde{B}:=(\tilde{b}_{ij})$ denote the random $n\times m$ complex matrix whose entries are given by

[TABLE]

Then,

[TABLE]

where $C_{\ref{prop:control-infty-to-2-norm}}\geq 1$ is an absolute constant.

The following concentration inequality will be used to establish the subgaussianity of certain random variables appearing in the proof of Proposition A.4. It appears as Lemma 3.9 in [18], and is a direct application of Theorem 7.8 in [15].

Lemma A.5 (Lemma 3.9 in [18]).

Let $\boldsymbol{y}:=(y_{1},\dots,y_{m})$ be a non-zero complex vector and let $\boldsymbol{v}\in\{\pm 1\}^{m}$ . Consider the function $f:S_{m}\to\mathbb{C}$ defined by

[TABLE]

Then, for all $t>0$ ,

[TABLE]

Remark A.6.

In [18], the above lemma is stated and proved (with better constants) for real vectors $\boldsymbol{y}$ . However, the version above for complex vectors immediately follows from this by separately considering the real and imaginary parts of $f$ and using the union bound.

Proof of Proposition A.4.

If $\|\tilde{B}\|_{\infty\to 2}\geq C_{\ref{prop:control-infty-to-2-norm}}\sqrt{mn}n^{\epsilon}$ , then there exists a complex vector $\boldsymbol{w}=\boldsymbol{w_{1}}+i\boldsymbol{w_{2}}$ , where $\boldsymbol{w_{1}},\boldsymbol{w_{2}}\in\mathbb{R}^{m}$ and $\|\boldsymbol{w_{1}}\|_{\infty},\|\boldsymbol{w_{2}}\|_{\infty}\leq 1$ , such that

[TABLE]

Therefore, it suffices to control the $\infty$ -to- $2$ norm of $\tilde{B}$ restricted to vectors in $\mathbb{R}^{m}$ . For this, it suffices by convexity and the union bound to show that for any fixed $\boldsymbol{v}\in\{\pm 1\}^{m}$ ,

[TABLE]

To see this, we begin by noting that the random variables $X_{i}:=\langle\tilde{B}\boldsymbol{v},e_{i}\rangle$ are independent and

[TABLE]

In particular, if $\ell$ denotes the number of ones in $(v_{1},\dots,v_{m})$ , then

[TABLE]

By Lemma A.5, for all $t>0$ , we have

[TABLE]

In particular, the random variables $n^{-\epsilon}m^{-1/2}|X_{i}-\mathbb{E}[X_{i}]|$ are $16$ -subgaussian so that by Lemma A.2

[TABLE]

Finally, since

[TABLE]

it follows that

[TABLE]

which completes the proof. ∎

Given the above results, Proposition 2.16 is almost immediate.

Proof of Proposition 2.16.

1. Let $N_{n}$ be the $n\times n$ complex random matrix appearing in the statement of the proposition, and let $\mathcal{E}$ denote the ‘good’ event appearing in Lemma A.3 i.e. $\mathcal{E}$ is the event that there exists some $I\subseteq[n]$ with $|I|\geq n-2n^{1-\epsilon}$ such that for all $i\in I$ ,

[TABLE]

Since $\Pr(\mathcal{E}^{c})\leq 2\exp(-n^{1-\epsilon}/4)$ by Lemma A.3, it suffices to show that

[TABLE]

where $\mathcal{I}$ denotes the collection of subsets of $[n]$ of size at least $n-2n^{1-\epsilon}$ . For this, note that since both the event $\mathcal{E}$ as well as our distribution on $n\times n$ matrices are invariant under permuting each row of $N_{n}$ separately, it suffices to show the following: for each (fixed) $n\times n$ complex matrix $A_{n}$ for which there exists a subset $I\subseteq[n]$ as above,

[TABLE]

where $\tilde{A}_{n}$ is the random complex matrix obtained by permuting each row of $A_{n}$ independently and uniformly. But this follows immediately from Proposition A.4 applied to the $n\times n$ matrix $P_{I}A_{n}$ .

2. The proof of this part is very similar to the previous one. Let $\mathcal{J}$ denote the collection of all subsets of $[n]$ of size $n^{1-\delta}$ and let $\mathcal{I}$ denote the collection of all subsets of $[n]$ of size at least $n-2n^{1-\epsilon}$ . We show that the desired conclusion in 2. holds with sufficiently high probability for fixed $J\in\mathcal{J}$ ; the proof is completed by taking the union bound over the at most

[TABLE]

choices for $J\in\mathcal{J}$ , where $C(\epsilon)\geq 1$ depends only on $\epsilon$ , and the last inequality uses that $\delta\geq 4\epsilon$ .

For such a fixed $J\in\mathcal{J}$ , let $\mathcal{E}_{\epsilon,\delta}$ denote the event that there exists some $I\in\mathcal{I}$ such that for all $i\in I$ ,

[TABLE]

As before, by Lemma A.3 applied to the operator $N_{n}P_{J}$ viewed as an $n\times|J|$ matrix, we see that $\Pr(\mathcal{E}_{\epsilon,\delta}^{c})\leq 2\exp(-n^{1-\epsilon}/4)$ . Therefore, it suffices to show that

[TABLE]

But this follows by exactly the same argument (using Proposition A.4) as above. ∎

Appendix B Proof of Proposition 3.1

Proof of Proposition 3.1 following [13, 29].

Since $M_{n}^{\dagger}$ and $M_{n}$ have the same singular values, it follows that a necessary condition for a matrix $M_{n}$ to satisfy the event in Proposition 3.1 is that there exists a unit row vector $\boldsymbol{a^{\prime}}=(a^{\prime}_{1},\dots,a^{\prime}_{n})$ such that $\|\boldsymbol{a^{\prime}}^{T}M_{n}\|_{2}\leq\eta$ . To every matrix $M_{n}$ , associate such a vector $\boldsymbol{a^{\prime}}$ arbitrarily (if one exists) and denote it by $\boldsymbol{a^{\prime}}_{M_{n}}$ ; this leads to a partition of the space of all matrices with least singular value at most $\eta$ . Then, by taking a union bound, it suffices to show the following.

[TABLE]

To this end, we expose the first $n-1$ rows $X_{1},\dots,X_{n-1}$ of $M_{n}$ . Note that if there is some $\boldsymbol{a}\in\Gamma^{1}(\eta)$ satisfying $\|M_{n}\boldsymbol{a}\|_{2}\leq\eta$ , then there must exist a vector $\boldsymbol{y}\in\Gamma^{1}(\eta)$ , depending only on the first $n-1$ rows $X_{1},\dots,X_{n-1}$ , such that

[TABLE]

In other words, once we expose the first $n-1$ rows of the matrix, either the matrix cannot be extended to one satisfying the event in Proposition 3.1, or there is some unit vector $\boldsymbol{y}\in\Gamma^{1}(\eta)$ , which can be chosen after looking only at the first $n-1$ rows, and which satisfies the equation above. For the rest of the proof, we condition on the first $n-1$ rows $X_{1},\dots,X_{n-1}$ (and hence, a choice of $\boldsymbol{y}$ ).

For any vector $\boldsymbol{w^{\prime}}\in\mathbb{S}^{2n-1}$ with $w^{\prime}_{n}\neq 0$ , we can write

[TABLE]

where $\boldsymbol{u}:=\boldsymbol{w^{\prime}}^{T}M_{n}$ . Thus, restricted to the event $\{s_{n}(M_{n})\leq\eta\}\bigwedge\{\|\boldsymbol{a^{\prime}}_{M_{n}}\|_{\infty}=|a^{\prime}_{n}|\}$ , we have

[TABLE]

where the second line is due to the Cauchy-Schwarz inequality and the particular choice $\boldsymbol{w^{\prime}}=\boldsymbol{a^{\prime}}_{M_{n}}$ . It follows that the probability in Equation 5 is bounded by

[TABLE]

which completes the proof. ∎

Bibliography30

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Z. Bai. Circular law. The Annals of Probability , 25(1):494–529, 1997.
2[2] Z. Bai and J. W. Silverstein. Spectral analysis of large dimensional random matrices , volume 20. Springer, 2010.
3[3] A. Basak, N. Cook, and O. Zeitouni. Circular law for the sum of random permutation matrices. Electronic Journal of Probability , 23, 2018.
4[4] C. Bordenave and D. Chafaï. Around the circular law. Probability surveys , 9, 2012.
5[5] N. A. Cook. The circular law for random regular digraphs. ar Xiv preprint ar Xiv:1703.05839 , 2017.
6[6] C. Esseen. On the Kolmogorov-Rogozin inequality for the concentration function. Probability Theory and Related Fields , 5(3):210–216, 1966.
7[7] A. Ferber and V. Jain. Singularity of random symmetric matrices–a combinatorial approach to improved bounds. ar Xiv:1809.04718 , 2018.
8[8] A. Ferber, V. Jain, K. Luh, and W. Samotij. On the counting problem in inverse Littlewood–Offord theory. ar Xiv:1904.10425 , 2019.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

The strong circular law: a combinatorial view

Abstract

1 Introduction

Theorem 1.1**.**

Remark 1.2**.**

2 Tools and auxiliary results

2.1 Anti-concentration

Definition 2.1** (Lévy concentration function).**

Remark 2.2**.**

Lemma 2.3**.**

Lemma 2.4**.**

Lemma 2.5** (Theorem 2 in [6]).**

Definition 2.6**.**

Observation 2.7**.**

Lemma 2.8**.**

Proof.

2.2 The Least Common Denominator

Remark 2.9**.**

Definition 2.10** (Least Common Denominator (LCD)).**

Theorem 2.11**.**

Definition 2.12**.**

Lemma 2.13** (Lemma 5.2 in [24]).**

Proof of Theorem 2.11.

2.3 The counting problem in inverse Littlewood-Offord theory

Theorem 2.14** (Theorem 1.3 in [12]).**

Remark 2.15**.**

2.4 Norms of large projections of random matrices

Proposition 2.16**.**

Remark 2.17**.**

3 Proof of Theorem 1.1

Proposition 3.1**.**

Proposition 3.2**.**

3.1 Reduction to Gaussian integer vectors

Proposition 3.3**.**

Remark 3.4**.**

Proof.

Proposition 3.5**.**

3.2 Dealing with sparse Gaussian integer vectors

Lemma 3.6**.**

Proof.

3.3 Dealing with non-sparse Gaussian integer vectors

Proposition 3.7**.**

Claim 3.8**.**

Proof.

Claim 3.9**.**

Proof.

Proof of Proposition 3.7.

Appendix A Proof of Proposition 2.16

Definition A.1**.**

Lemma A.2** (see, e.g., Corollary 5.17 in [30]).**

Lemma A.3**.**

Proof.

Proposition A.4**.**

Lemma A.5** (Lemma 3.9 in [18]).**

Remark A.6**.**

Proof of Proposition A.4.

Proof of Proposition 2.16.

Appendix B Proof of Proposition 3.1

Proof of Proposition 3.1 following [13, 29].

Theorem 1.1.

Remark 1.2.

Definition 2.1 (Lévy concentration function).

Remark 2.2.

Lemma 2.3.

Lemma 2.4.

Lemma 2.5 (Theorem 2 in [6]).

Definition 2.6.

Observation 2.7.

Lemma 2.8.

Remark 2.9.

Definition 2.10 (Least Common Denominator (LCD)).

Theorem 2.11.

Definition 2.12.

Lemma 2.13 (Lemma 5.2 in [24]).

Theorem 2.14 (Theorem 1.3 in [12]).

Remark 2.15.

Proposition 2.16.

Remark 2.17.

Proposition 3.1.

Proposition 3.2.

Proposition 3.3.

Remark 3.4.

Proposition 3.5.

Lemma 3.6.

Proposition 3.7.

Claim 3.8.

Claim 3.9.

Definition A.1.

Lemma A.2 (see, e.g., Corollary 5.17 in [30]).

Lemma A.3.

Proposition A.4.

Lemma A.5 (Lemma 3.9 in [18]).

Remark A.6.