
TL;DR
This paper introduces a simple, novel approach to analyze the least singular value of shifted random matrices, improving error bounds and extending inverse Littlewood-Offord theory to complex variables, thereby advancing understanding of the circular law.
Contribution
It provides a new method for bounding the least singular value of complex random matrices without additive combinatorics, extending inverse Littlewood-Offord results to complex variables.
Findings
Bound on the probability that the least singular value is small
Extension of inverse Littlewood-Offord theory to complex variables
Improved error rates over previous results
Abstract
Let be an complex random matrix, each of whose entries is an independent copy of a centered complex random variable with finite non-zero variance . The strong circular law, proved by Tao and Vu, states that almost surely, as , the empirical spectral distribution of converges to the uniform distribution on the unit disc in . A crucial ingredient in the proof of Tao and Vu, which uses deep ideas from additive combinatorics, is controlling the lower tail of the least singular value of the random matrix (where is fixed) with failure probability that is inverse polynomial. In this paper, using a simple and novel approach (in particular, not using tools from additive combinatorics or any net arguments), we show that for any fixed matrix with operator norm at…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The strong circular law: a combinatorial view
Vishesh Jain Massachusetts Institute of Technology. Department of Mathematics. Email: [email protected].
Abstract
Let be an complex random matrix, each of whose entries is an independent copy of a centered complex random variable with finite non-zero variance . The strong circular law, proved by Tao and Vu, states that almost surely, as , the empirical spectral distribution of converges to the uniform distribution on the unit disc in .
A crucial ingredient in the proof of Tao and Vu, which uses deep ideas from additive combinatorics, is controlling the lower tail of the least singular value of the random matrix (where is fixed) with failure probability that is inverse polynomial. In this paper, using a simple and novel approach (in particular, not using tools from additive combinatorics or any net arguments), we show that for any fixed matrix with operator norm at most and for all ,
[TABLE]
where is the least singular value of and are absolute constants. Our result is optimal up to the constants and the inverse exponential-type error rate improves upon the inverse polynomial error rate due to Tao and Vu.
Our proof relies on an extension of the solution to the so-called counting problem in inverse Littlewood–Offord theory, developed by Ferber, Luh, Samotij, and the author, along with a novel ‘rounding trick’ based on controlling the operator norm of heavy-tailed random matrices.
1 Introduction
Let be an complex random matrix, each of whose entries is an independent copy of a complex random variable with mean [math] and finite non-zero variance . The empirical spectral distribution (ESD) of is defined on by the expression
[TABLE]
where denote the eigenvalues of . The celebrated strong circular law of Tao and Vu [27] asserts that almost surely, as tends to infinity, converges uniformly to
[TABLE]
The circular law has a long history dating back to the 1950s when it was conjectured as a natural non-Hermitian counterpart to Wigner’s famous semi-circle law, and prior to Tao and Vu’s definitive work, many researchers obtained partial results requiring extra distributional assumptions on the random variable , and very often weakening the notion of convergence from almost sure convergence to convergence in probability (this is not just a technical point, and genuinely new ideas are required to obtain almost sure convergence; see the discussion in Section 2 of [24]). We refer the reader to the survey [4] and the references therein for a much more detailed discussion of the history of this problem.
In the case when we further assume that has moments for some , the approach of Bai [1], Bai and Silverstein [2], and Girko [10] reduces the problem to controlling the lower tail of the least singular value of the random matrix , where is fixed; even when we assume that has only non-zero finite variance, controlling the lower tail of the least singular value of this random matrix is a fundamental step in Tao and Vu’s proof (recall that the least singular value of a complex matrix , denoted by , is the smallest eigenvalue of the positive semidefinite matrix ). To this end, Tao and Vu [24] showed using sophisticated techniques from additive combinatorics that for any constants , there exists a constant such that for any fixed (complex) matrix of operator norm at most ,
[TABLE]
The dependence of on and can be made explicit and was subsequently sharpened in [26]. Note that for the proof of the circular law, fixing is sufficient.
Our goal in the present work is to provide a simple and elementary proof of a quantitative strengthening of Equation 1 in the setting of the strong circular law. More precisely, we show:
Theorem 1.1**.**
Let be a complex random variable with mean [math] and variance and let be an random matrix, each of whose entries is an independent copy of . Let be a fixed complex matrix with operator norm at most . Then, for all ,
[TABLE]
where are constants depending only on .
Remark 1.2**.**
(1) In the above theorem, the choice of the power is arbitrarily made for convenience and could be replaced by for any ; in follow-up work of the author [12] which builds on some of the ideas in this paper, we will show (using a more complicated proof) how to obtain a bound on the lower tail of even if .
(2) We have not tried to optimize any of the constants , but note here that with additional work, the constant can be replaced by the nearly optimal value for any in the case when . Compared to Equation 1, our bound is closer to (optimal) bounds of the form
[TABLE]
which have been obtained under stronger assumptions: for the case when is a real subgaussian random variable and in the landmark work of Rudelson and Vershynin [20], and for the case when is a real random variable and (much more restrictively) by Rebrova and Tikhomirov [18].
(3) We impose no constraints on the relationship between the real and imaginary parts of (for instance, in [14], the real and imaginary parts are required to be i.i.d. subgaussian). In this generality, our result (along with a follow-up result of the author [12]) is the only one showing that a random matrix, each of whose entries is an independent copy of a complex random variable of mean [math] and variance , is singular with probability at most for some . Previously, this was not known, even if we further assume that the complex random variables are subgaussian – the geometric machinery, pioneered by Rudelson and Vershynin [20], runs into the obstacle that the real dimension of the complex unit sphere is (see the discussion in Section 10 of [23]).
Apart from the quantitative strengthening of Equation 1, we believe that our result is also interesting for the simplicity of the proof techniques, making use only of some standard Fourier analytic techniques along with elementary combinatorial ideas. In particular, in contrast to previous works in this area, we make no use of tools from additive combinatorics or net arguments. Parts of our proof which we believe may be of independent interest are the complex anti-concentration inequality Theorem 2.11 and Proposition 3.3, which shows how to bypass the lack of control over the operator norm in our setting by means of the -norm. We hope that some of the ideas introduced in this work can aid in proving strong circular laws in other contexts such as [3, 5] where only weak circular laws are known so far.
**Organization: **The rest of this paper is organized as follows. In Section 2, we collect some auxiliary results needed for the proof of our main theorem – the key results here are Theorem 2.11, Theorem 2.14, and Proposition 2.16 (proved in Appendix A). In Section 3, we prove Theorem 1.1 by combining these results. The key ingredient there is Proposition 3.3.
**Notation: ** Throughout the paper, we will omit floors and ceilings when they make no essential difference. For convenience, we will also say ‘let be a prime’, to mean that is a prime between and ; again, this makes no difference to our arguments. We will use to denote the set of unit vectors in , to denote the ball of radius centered at , and to denote the real and imaginary parts of a complex vector . As is standard, we will use to denote the discrete interval . We will also use the asymptotic notation to denote respectively. For a matrix , we will use to denote its standard operator norm. All logarithms are natural unless noted otherwise.
**Acknowledgements: **I am indebted to Nick Cook for the suggestion to consider the complex setting, as well as very helpful discussions around the circular law. I would also like to thank Kyle Luh for pointing out that a statement similar to Proposition 2.16 also appears in [19], Hoi Nguyen for discussing his work [16], and Elizaveta Rebrova for discussing her work [18].
2 Tools and auxiliary results
In this section, we collect some preliminary results which will be used in the proof of Theorem 1.1.
2.1 Anti-concentration
The goal of the theory of anti-concentration is to obtain upper bounds on the Lévy concentration function, defined as follows.
Definition 2.1** (Lévy concentration function).**
Let be an arbitrary complex random variable and let . We define the Lévy concentration function of at radius with respect to by
[TABLE]
where are independent copies of .
Remark 2.2**.**
In particular, note that . We will use this notation repeatedly.
The next lemma shows that weighted sums of random variables with finite non-zero variance are not too close to being a constant.
Lemma 2.3**.**
*(see, e.g., Lemma 6.3 in [26])
Let be a complex random variable with finite non-zero variance. Then, there exists a constant depending only on such that for any ,*
[TABLE]
Combining this with the so-called tensorization lemma (see Lemma 2.2 in [20]), we get the following standard estimate for ‘invertibility with respect to a single vector’.
Lemma 2.4**.**
Let be a complex random variable with finite non-zero variance. Let be an arbitrary complex matrix and let be an complex random matrix each of whose entries is an independent copy of . Then, for any fixed ,
[TABLE]
where is a constant depending only on .
The next classical lemma, due to Esseen, is a generalization (up to constants) of the Erdős-Littlewood-Offord anti-concentration inequality.
Lemma 2.5** (Theorem 2 in [6]).**
Let be jointly independent complex random variables and let be some positive real numbers. Then, for any , we have
[TABLE]
where is an absolute constant.
The next definition isolates a convenient property of the random variables we consider in this paper.
Definition 2.6**.**
We say that a complex random variable is -good if
[TABLE]
where and denote independent copies of . The smallest with respect to which is -good will be denoted by .
Indeed, it is straightforward to see that complex random variables with finite non-zero variance are -good for some finite , so that there is no loss of generality for us in imposing this additional restriction.
Observation 2.7**.**
Let be a complex random variable with variance . Then, is -good for some .
We conclude this subsection with the following consequence of Lemma 2.5.
Lemma 2.8**.**
Let be a complex random variable with variance . There exists a constant depending only on such that for all with support of size at least ,
[TABLE]
Proof.
As above, we know that for some . Therefore, for all ,
[TABLE]
Hence, by Lemma 2.5,
[TABLE]
Since , and since (since any ball in the complex plane of radius can be covered by balls of radius ) , the desired conclusion follows. ∎
2.2 The Least Common Denominator
The proof of Theorem 1.1 will be based on a ‘rounding argument’ which extracts a ‘not-too-large’ Gaussian integer vector certifying that the least singular value of a complex matrix is small (see [11] for the most basic version of this argument). For this, we will use (albeit in a quite different manner from Rudelson and Vershynin) the notion of the Least Common Denominator (LCD) of a vector, and its connection to the Lévy concentration function, as developed in [20].
Remark 2.9**.**
Our definition of the LCD is different from the ones appearing in the literature for the complex case, and has been made keeping in mind our application to rounding vectors.
Definition 2.10** (Least Common Denominator (LCD)).**
Let . For and , define
[TABLE]
Note that the requirement that the distance is smaller than forces us to consider only non-trivial Gaussian integer points as approximations of .
The following theorem shows that vectors with large LCD have small Lévy concentration function on scales which are larger (up to some small polynomial losses) than .
Theorem 2.11**.**
Let denote a -good complex random variable. Then, for every , for every , and for
[TABLE]
we have
[TABLE]
where is a constant depending only on .
A more precise version of this theorem appears for real random variables in [21]. Actually, a version for complex random variables is also stated there although, as noted above, their definition of is different from ours. The proof of Theorem 2.11 follows from standard Fourier analytic arguments for the real case (in particular, we will use a crude version of the argument of Friedland and Sodin in [9]) once we use a novel ‘doubling trick’.
We will need a preliminary result from Section 4 of [24].
Definition 2.12**.**
Let be an arbitrary complex random variable. For any , we define
[TABLE]
where denote i.i.d. copies of and denotes the distance to the nearest integer.
Lemma 2.13** (Lemma 5.2 in [24]).**
Let and let be an arbitrary complex random variable. Then,
[TABLE]
Here,
[TABLE]
where are i.i.d. copies of , with distributed as , and mutually independent.
Proof of Theorem 2.11.
Since , it suffices to bound for . Let denote the vector whose first components are and last components are . Then, we have
[TABLE]
where the first line uses , the second line is due to Lemma 2.13, the third line follows from Lemma 4.5(iii) in [24], and the last line is again due to Lemma 2.13.
Next, note that
[TABLE]
where the final inequality follows from the -goodness of .
Therefore, from Jensen’s inequality, we get that
[TABLE]
Now, fix with ; we will obtain a uniform (in ) upper bound on the integral appearing in Equation 3. Let
[TABLE]
let , and split the integral above as
[TABLE]
Since
[TABLE]
it only remains to bound .
For this, we begin by noting that if , then by the triangle inequality and the lattice structure of the Gaussian integers,
[TABLE]
Hence, by the definition of , we have one of two possibilities: either
[TABLE]
or
[TABLE]
It follows that is contained in a union of balls of radius whose centers are separated by at least . Each such ball can contribute at most to the integral, and since , there is at most one such ball in . It follows that
[TABLE]
Finally, combining the estimates on and and using Equation 3 completes the proof. ∎
2.3 The counting problem in inverse Littlewood-Offord theory
The inverse Littlewood-Offord problem, posed by Tao and Vu [29], asks for the underlying reason that the Lévy concentration function of a vector can be large. Using deep Frieman-type results from additive combinatorics, they showed that, roughly speaking, the only reason for this to happen is that most of the coordinates of the vector belong to a generalized arithmetic progression (GAP) of ‘small rank’ and ‘small volume’. Their results [29, 25] were subsequently sharpened by Nguyen and Vu [16], who proved an ‘optimal inverse Littlewood–Offord theorem’. We refer the reader to the survey [17] and the textbook [28] for complete definitions and statements, and much more on both forward and inverse Littlewood-Offord theory.
Recently, motivated by applications, especially those in random matrix theory, the following counting variant of the inverse Littlewood–Offord problem was isolated in work [8] of the author along with Ferber, Luh, and Samotij: for how many vectors in a given collection is greater than some prescribed value, where is a symmetric Bernoulli random variable? Indeed, the inverse Littlewood-Offord theorems are typically used precisely through such counting corollaries [17], and one of the main contributions of [8] (see Theorem 1.7 there) was to show that one may obtain useful bounds for the counting variant of the inverse Littlewood-Offord problem directly, without providing a precise structural characterization like Tao-Vu. In fact, since this approach is not hampered by losses coming from the black-box application of various theorems from additive combinatorics, it provides quantitatively better bounds, and this was used in [8, 7, 11] to provide quantitative improvements for several problems in combinatorial random matrix theory.
A crucial ingredient in our proof will be the following extension of Theorem 1.7 of [8] due to the author
Theorem 2.14** (Theorem 1.3 in [12]).**
Let be a -good random variable. For (possibly depending on ), let
[TABLE]
There exists a constant , depending only on , for which the following holds. Let with . If and is an odd prime such that , then
[TABLE]
where denotes the natural map from .
Remark 2.15**.**
The inverse Littlewood-Offord theorems may be used to deduce similar statements, provided we further assume that for some constant . It is the freedom of taking to be much smaller which allows us to obtain the exponential-type rate in Theorem 1.1.
2.4 Norms of large projections of random matrices
The key difficulty with extending the geometric techniques of Rudelson and Vershynin [20, 22]) to the setting when the random variables have heavy tails is the lack of control on the operator norm of the random matrix. For our techniques, the following proposition will turn out to be an appropriate substitute for controlling the operator norm.
For a subset , let denote the orthogonal projection onto the subspace spanned by the vectors . We have:
Proposition 2.16**.**
Let be an complex random matrix with i.i.d. entries, each with mean [math] and variance . For with , there exists such that, except with probability at most , the following hold.
There exists with such that
[TABLE] 2. 2.
For every with , there exists some such that , and
[TABLE]
Remark 2.17**.**
A statement similar to the one above, and with some common proof ideas, already appears in the work of Rebrova and Vershynin [19]. In that work, the primary interest is in obtaining optimal bounds on the restricted operator norms and consequently, the proofs are much more involved. In contrast, we do not require such optimal bounds for our application, and are therefore able to give a much shorter proof of the above proposition.
The complete proof of this proposition is deferred to Appendix A.
3 Proof of Theorem 1.1
In this section, we will take and . Moreover, since Theorem 1.1 is trivially true for , we will henceforth assume that . Recall that is a fixed matrix with operator norm at most ; we set .
We decompose into , where
[TABLE]
and . Accordingly, we have
[TABLE]
Therefore, Theorem 1.1 follows from the following two propositions and the union bound.
Proposition 3.1**.**
.
Proposition 3.2**.**
* where and are constants depending only on .*
The proof of Proposition 3.1 is relatively simple, and follows from a conditioning argument developed in [13], once we observe the crucial fact (Theorem 2.11) that for any ,
[TABLE]
for all . We defer the (by now standard) details to Appendix B.
The remainder of this section is devoted to the proof of Proposition 3.2.
3.1 Reduction to Gaussian integer vectors
Let and
[TABLE]
As a first and crucial step towards the proof of Proposition 3.2, we will prove the following:
Proposition 3.3**.**
With notation as above,
[TABLE]
where is an absolute constant.
Remark 3.4**.**
As we will see shortly, the crucial point in the above proposition is that and .
Proof.
Let , , and . Let denote the event appearing in the conclusion of Proposition 2.16 for and simultaneously. Since , we may restrict ourselves to the event .
Let . Then, by definition, there exists some with and some such that . Note also that . To leverage the control we have over various norms associated to the matrix , we decompose the ‘error’ vector into a ‘small’ part (with respect to the -norm), a ‘sparse and small’ part, and a ‘very sparse’ part.
Accordingly, let denote the vector obtained by keeping the largest (in absolute value) coordinates of , let denote the vector obtained by keeping the next largest coordinates of , and let . Then, we have that
[TABLE]
and
[TABLE]
Indeed, the first inequality is immediate from , whereas the second inequality follows from
[TABLE]
Let denote the support of and let denote the support of . By extending these sets if need be, we may assume that and . Moreover, since we have restricted to , let denote a subset of size at least with respect to which conclusion 1. of Proposition 2.16 holds.
Note that since , we have for every . Therefore,
[TABLE]
and similarly,
[TABLE]
Then, from the triangle inequality, we have
[TABLE]
where the second line uses that and ; the fourth line uses the above estimates on the -to- norms and Equation 4, and the fifth line uses the parameter value .
Thus, if , it follows from the triangle inequality that
[TABLE]
where the fourth line follows since and , and the last line follows since . Since , we get the desired conclusion. ∎
In view of Proposition 3.3, it suffices to show the following in order to prove Proposition 3.2, and hence, complete the proof of Theorem 1.1.
Proposition 3.5**.**
* where and are constants depending only on .*
The proof of this proposition is the content of the next two subsections.
3.2 Dealing with sparse Gaussian integer vectors
Throughout this subsection and the next one, is a prime. Note, in particular, that . The proof of Proposition 3.5 proceeds in two steps. The first step is to show that the probability of the event appearing in Proposition 3.5 is small, provided we restrict ourselves only to sufficiently sparse Gaussian integer vectors. Let
[TABLE]
Lemma 3.6**.**
* where is an absolute constant.*
Proof.
By taking the union bound over all the at most choices of , it suffices to show that for a fixed ,
[TABLE]
for some absolute constant . The number of vectors is at most
[TABLE]
By Lemma 2.4 applied to the matrix , for any such vector,
[TABLE]
Therefore, the union bound gives the desired conclusion. ∎
3.3 Dealing with non-sparse Gaussian integer vectors
It remains to deal with Gaussian integer vectors with support of size at least . Let
[TABLE]
Note that for our choice of parameters, the natural map
[TABLE]
is injective.
In view of Lemma 3.6, since , and taking the union bound over all the at most choices of , the following proposition suffices to prove Proposition 3.5.
Proposition 3.7**.**
For all ,
[TABLE]
where and are constants depending only on .
The proof of Proposition 3.7 is accomplished by a simple union bound. To execute this, we need the following preliminary claims.
Claim 3.8**.**
For all , .
Proof.
The random variable has mean [math] and variance at most . Therefore, by Markov’s inequality,
[TABLE]
Hence, by the pigeonhole principle, it follows that
[TABLE]
as desired. ∎
For the next claim, let
[TABLE]
Note that the previous claim along with Lemma 2.8 shows that is nonempty only if .
Claim 3.9**.**
For all ,
[TABLE]
where is a constant depending only on .
Proof.
Fix and . Then, , , and . Hence, for large enough , the hypotheses of Theorem 2.14 are satisfied, so that
[TABLE]
where the first line follows from the injectivity of on , the third line follows from Theorem 2.14, and the last line follows since . ∎
We now have all the ingredients to prove Proposition 3.7.
Proof of Proposition 3.7.
Let denote the unit polydisc in centered at . For all sufficiently large, we have
[TABLE]
where the third line follows since the number of points of in is at most , the fifth line follows from 3.9, and the seventh and eighth lines follow from the assumed bounds on . ∎
Appendix A Proof of Proposition 2.16
The proof will make use of the subgaussian concentration inequality, which we now recall.
Definition A.1**.**
A random variable is said to be -subgaussian if, for all ,
[TABLE]
Lemma A.2** (see, e.g., Corollary 5.17 in [30]).**
There exists an absolute constant with the following property. Let be independent centered -subgaussian random variables. Then,
[TABLE]
We begin with a simple lemma showing that, with high probability, most rows of a random matrix with i.i.d. centered entries of finite variance have small and norms.
Lemma A.3**.**
Let be an complex random matrix with i.i.d. entries, each with mean [math] and variance . For , let denote the (random) subset of coordinates such that for each ,
[TABLE]
Then,
[TABLE]
Proof.
Since for each ,
[TABLE]
it follows from Markov’s inequality that
[TABLE]
and
[TABLE]
Let denote the subset of coordinates such that for each ,
[TABLE]
and let denote the subset of coordinates such that for each ,
[TABLE]
Since the rows of the matrix are independent, it follows from the standard Chernoff bound that for
[TABLE]
Hence, by the union bound,
[TABLE]
except with probability at most ∎
The next proposition controls the operator norm of a random matrix with i.i.d. entries, conditioned on no row having or norm which is ‘too large’, and essentially appears as Proposition 3.10 in [18]. Since our statement uses somewhat different parameters than in [18], we provide a complete proof below for the reader’s convenience.
Proposition A.4**.**
Fix . Let be a fixed complex matrix, with , such that the norm of every row is at most and such that for all ,
[TABLE]
Let be independent random permutations uniformly distributed on the symmetric group , and let denote the random complex matrix whose entries are given by
[TABLE]
Then,
[TABLE]
where is an absolute constant.
The following concentration inequality will be used to establish the subgaussianity of certain random variables appearing in the proof of Proposition A.4. It appears as Lemma 3.9 in [18], and is a direct application of Theorem 7.8 in [15].
Lemma A.5** (Lemma 3.9 in [18]).**
Let be a non-zero complex vector and let . Consider the function defined by
[TABLE]
Then, for all ,
[TABLE]
Remark A.6**.**
In [18], the above lemma is stated and proved (with better constants) for real vectors . However, the version above for complex vectors immediately follows from this by separately considering the real and imaginary parts of and using the union bound.
Proof of Proposition A.4.
If , then there exists a complex vector , where and , such that
[TABLE]
Therefore, it suffices to control the -to- norm of restricted to vectors in . For this, it suffices by convexity and the union bound to show that for any fixed ,
[TABLE]
To see this, we begin by noting that the random variables are independent and
[TABLE]
In particular, if denotes the number of ones in , then
[TABLE]
By Lemma A.5, for all , we have
[TABLE]
In particular, the random variables are -subgaussian so that by Lemma A.2
[TABLE]
Finally, since
[TABLE]
it follows that
[TABLE]
which completes the proof. ∎
Given the above results, Proposition 2.16 is almost immediate.
Proof of Proposition 2.16.
1. Let be the complex random matrix appearing in the statement of the proposition, and let denote the ‘good’ event appearing in Lemma A.3 i.e. is the event that there exists some with such that for all ,
[TABLE]
Since by Lemma A.3, it suffices to show that
[TABLE]
where denotes the collection of subsets of of size at least . For this, note that since both the event as well as our distribution on matrices are invariant under permuting each row of separately, it suffices to show the following: for each (fixed) complex matrix for which there exists a subset as above,
[TABLE]
where is the random complex matrix obtained by permuting each row of independently and uniformly. But this follows immediately from Proposition A.4 applied to the matrix .
2. The proof of this part is very similar to the previous one. Let denote the collection of all subsets of of size and let denote the collection of all subsets of of size at least . We show that the desired conclusion in 2. holds with sufficiently high probability for fixed ; the proof is completed by taking the union bound over the at most
[TABLE]
choices for , where depends only on , and the last inequality uses that .
For such a fixed , let denote the event that there exists some such that for all ,
[TABLE]
As before, by Lemma A.3 applied to the operator viewed as an matrix, we see that . Therefore, it suffices to show that
[TABLE]
But this follows by exactly the same argument (using Proposition A.4) as above. ∎
Appendix B Proof of Proposition 3.1
Proof of Proposition 3.1 following [13, 29].
Since and have the same singular values, it follows that a necessary condition for a matrix to satisfy the event in Proposition 3.1 is that there exists a unit row vector such that . To every matrix , associate such a vector arbitrarily (if one exists) and denote it by ; this leads to a partition of the space of all matrices with least singular value at most . Then, by taking a union bound, it suffices to show the following.
[TABLE]
To this end, we expose the first rows of . Note that if there is some satisfying , then there must exist a vector , depending only on the first rows , such that
[TABLE]
In other words, once we expose the first rows of the matrix, either the matrix cannot be extended to one satisfying the event in Proposition 3.1, or there is some unit vector , which can be chosen after looking only at the first rows, and which satisfies the equation above. For the rest of the proof, we condition on the first rows (and hence, a choice of ).
For any vector with , we can write
[TABLE]
where . Thus, restricted to the event , we have
[TABLE]
where the second line is due to the Cauchy-Schwarz inequality and the particular choice . It follows that the probability in Equation 5 is bounded by
[TABLE]
which completes the proof. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Z. Bai. Circular law. The Annals of Probability , 25(1):494–529, 1997.
- 2[2] Z. Bai and J. W. Silverstein. Spectral analysis of large dimensional random matrices , volume 20. Springer, 2010.
- 3[3] A. Basak, N. Cook, and O. Zeitouni. Circular law for the sum of random permutation matrices. Electronic Journal of Probability , 23, 2018.
- 4[4] C. Bordenave and D. Chafaï. Around the circular law. Probability surveys , 9, 2012.
- 5[5] N. A. Cook. The circular law for random regular digraphs. ar Xiv preprint ar Xiv:1703.05839 , 2017.
- 6[6] C. Esseen. On the Kolmogorov-Rogozin inequality for the concentration function. Probability Theory and Related Fields , 5(3):210–216, 1966.
- 7[7] A. Ferber and V. Jain. Singularity of random symmetric matrices–a combinatorial approach to improved bounds. ar Xiv:1809.04718 , 2018.
- 8[8] A. Ferber, V. Jain, K. Luh, and W. Samotij. On the counting problem in inverse Littlewood–Offord theory. ar Xiv:1904.10425 , 2019.
