A Refined Non-asymptotic Tail Bound of Sub-Gaussian Matrix

Xianjie Gao; Hongwei Zhang

arXiv:1906.10432·math.PR·June 26, 2019

A Refined Non-asymptotic Tail Bound of Sub-Gaussian Matrix

Xianjie Gao, Hongwei Zhang

PDF

Open Access

TL;DR

This paper derives a refined non-asymptotic tail bound for the largest singular value of sub-Gaussian matrices and applies it to Gaussian Toeplitz matrices, enhancing understanding of their spectral properties.

Contribution

It provides a new, sharper tail bound for sub-Gaussian matrices and demonstrates its application to Gaussian Toeplitz matrices.

Findings

01

Refined tail bound for sub-Gaussian matrices

02

Application to Gaussian Toeplitz matrices

03

Improved understanding of spectral edge behavior

Abstract

In this paper, we obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. As an application, we use the obtained theorem to compute the tail bound of the Gaussian Toeplitz matrix.

Equations66

\frac{1}{n} i = 1 \sum n x_{i} \to g, (n \to \infty)

\frac{1}{n} i = 1 \sum n x_{i} \to g, (n \to \infty)

f_{sc} (x) = \frac{1}{2 π} 4 - x^{2}, x \in [- 2, 2] .

f_{sc} (x) = \frac{1}{2 π} 4 - x^{2}, x \in [- 2, 2] .

f_{m p} (x) = {\frac{1}{2 π x y} (b - x) (x - a) 0 a \leq x \leq b; o t h er w i se,

f_{m p} (x) = {\frac{1}{2 π x y} (b - x) (x - a) 0 a \leq x \leq b; o t h er w i se,

s_{m i n} (A) = m - n + o (n),

s_{m i n} (A) = m - n + o (n),

s_{m a x} (A) = m + n + o (n), \mbox almostsurely .

{\mathbb{P}}\bigg{(}\frac{1}{\sqrt{n}}\sum_{i=1}^{n}x_{i}>t\bigg{)}\leq 2{\rm e}^{-t^{2}/2}.

{\mathbb{P}}\bigg{(}\frac{1}{\sqrt{n}}\sum_{i=1}^{n}x_{i}>t\bigg{)}\leq 2{\rm e}^{-t^{2}/2}.

s_{m a x} (B) = ∥ B ∥ = x \in R^{n} \ {0} sup \frac{∥ B x ∥ _{2}}{∥ x ∥ _{2}} = x \in S^{n - 1} sup ∥ B x ∥_{2} .

s_{m a x} (B) = ∥ B ∥ = x \in R^{n} \ {0} sup \frac{∥ B x ∥ _{2}}{∥ x ∥ _{2}} = x \in S^{n - 1} sup ∥ B x ∥_{2} .

H (B) = [0 B * B 0] .

H (B) = [0 B * B 0] .

P (∣ x ∣ > t) \leq 2 e^{- c t^{2}} .

P (∣ x ∣ > t) \leq 2 e^{- c t^{2}} .

E e^{θ x} \leq e^{b^{2} θ^{2} /2} .

E e^{θ x} \leq e^{b^{2} θ^{2} /2} .

P {∥ B ∥ > t} \leq 2 \cdot 5^{(m + n)} \cdot exp (- c t^{2}) .

P {∥ B ∥ > t} \leq 2 \cdot 5^{(m + n)} \cdot exp (- c t^{2}) .

P (∥ B ∥ > t)

P (∥ B ∥ > t)

\leq

\leq

\leq

\leq

E e^{x θ H} ⪯ e^{θ^{2} b^{2} H^{2} /2} .

E e^{x θ H} ⪯ e^{θ^{2} b^{2} H^{2} /2} .

ρ := ∥ k \sum H_{k}^{2} ∥.

ρ := ∥ k \sum H_{k}^{2} ∥.

{\mathbb{P}}\Big{\{}\lambda_{\max}\Big{(}\sum_{k}x_{k}{\bf H}_{k}\Big{)}\geq t\Big{\}}\leq d\cdot\exp\bigg{(}-\frac{t^{2}}{2b^{2}\rho}\bigg{)}.

{\mathbb{P}}\Big{\{}\lambda_{\max}\Big{(}\sum_{k}x_{k}{\bf H}_{k}\Big{)}\geq t\Big{\}}\leq d\cdot\exp\bigg{(}-\frac{t^{2}}{2b^{2}\rho}\bigg{)}.

\displaystyle{\mathbb{P}}\Big{\{}\lambda_{\max}\Big{(}\sum_{k}x_{k}{\bf H}_{k}\Big{)}\geq t\Big{\}}

\displaystyle{\mathbb{P}}\Big{\{}\lambda_{\max}\Big{(}\sum_{k}x_{k}{\bf H}_{k}\Big{)}\geq t\Big{\}}

\leq

\leq

\leq

=

=

\rho:=\max\Big{\{}\Big{\|}\sum_{k}{\bf D}_{k}{\bf D}_{k}^{*}\Big{\|}\,\Big{\|}\sum_{k}{\bf D}_{k}^{*}{\bf D}_{k}\Big{\|}\Big{\}}.

\rho:=\max\Big{\{}\Big{\|}\sum_{k}{\bf D}_{k}{\bf D}_{k}^{*}\Big{\|}\,\Big{\|}\sum_{k}{\bf D}_{k}^{*}{\bf D}_{k}\Big{\|}\Big{\}}.

{\mathbb{P}}\Big{\{}\Big{\|}\sum_{k}x_{k}{\bf D}_{k}\Big{\|}\geq t\Big{\}}\leq(m+n)\cdot\exp\bigg{(}-\frac{t^{2}}{2b^{2}\rho}\bigg{)}.

{\mathbb{P}}\Big{\{}\Big{\|}\sum_{k}x_{k}{\bf D}_{k}\Big{\|}\geq t\Big{\}}\leq(m+n)\cdot\exp\bigg{(}-\frac{t^{2}}{2b^{2}\rho}\bigg{)}.

\Big{\|}\sum_{k}x_{k}{\bf D}_{k}\Big{\|}=\lambda_{\max}\Big{(}{\mathcal{H}}{\Big{(}\sum_{k}x_{k}{\bf D}_{k}\Big{)}}\Big{)}=\lambda_{\max}\Big{(}\sum_{k}x_{k}{\mathcal{H}}({{\bf D}_{k}})\Big{)}.

\Big{\|}\sum_{k}x_{k}{\bf D}_{k}\Big{\|}=\lambda_{\max}\Big{(}{\mathcal{H}}{\Big{(}\sum_{k}x_{k}{\bf D}_{k}\Big{)}}\Big{)}=\lambda_{\max}\Big{(}\sum_{k}x_{k}{\mathcal{H}}({{\bf D}_{k}})\Big{)}.

\rho=\Big{\|}\sum_{k}{\mathcal{H}}({{\bf D}_{k})^{2}}\Big{\|}=\left\|\begin{matrix}\sum_{k}{\bf D}_{k}{\bf D}_{k}^{*}&0\\ 0&\sum_{k}{\bf D}_{k}^{*}{\bf D}_{k}\end{matrix}\right\|=\max\Big{\{}\Big{\|}\sum_{k}{\bf D}_{k}{\bf D}_{k}^{*}\Big{\|}\,\Big{\|}\sum_{k}{\bf D}_{k}^{*}{\bf D}_{k}\Big{\|}\Big{\}}

\rho=\Big{\|}\sum_{k}{\mathcal{H}}({{\bf D}_{k})^{2}}\Big{\|}=\left\|\begin{matrix}\sum_{k}{\bf D}_{k}{\bf D}_{k}^{*}&0\\ 0&\sum_{k}{\bf D}_{k}^{*}{\bf D}_{k}\end{matrix}\right\|=\max\Big{\{}\Big{\|}\sum_{k}{\bf D}_{k}{\bf D}_{k}^{*}\Big{\|}\,\Big{\|}\sum_{k}{\bf D}_{k}^{*}{\bf D}_{k}\Big{\|}\Big{\}}

{\mathbb{P}}\{\|{\bf B}\|>t\}\leq(m+n)\cdot\exp\Big{(}-\frac{t^{2}}{2b^{2}m}\Big{)}.

{\mathbb{P}}\{\|{\bf B}\|>t\}\leq(m+n)\cdot\exp\Big{(}-\frac{t^{2}}{2b^{2}m}\Big{)}.

B = ij \sum x_{ij} E_{ij}, i = 1, \dots, m, j = 1, \dots, n .

B = ij \sum x_{ij} E_{ij}, i = 1, \dots, m, j = 1, \dots, n .

\displaystyle{\mathbb{P}}\{\|{\bf B}\|>t\}\leq\begin{cases}(m+n)\cdot\exp\Big{(}-\frac{t^{2}}{2b^{2}m}\Big{)}&\text{$0<t\leq\sqrt{\frac{2b^{2}m}{1-2b^{2}mc}\log\frac{m+n}{2\cdot 5^{m+n}}}$};\\ 2\cdot 5^{(m+n)}\cdot\exp(-ct^{2})&\text{$t>\sqrt{\frac{2b^{2}m}{1-2b^{2}mc}\log\frac{m+n}{2\cdot 5^{m+n}}}$}.\end{cases}

\displaystyle{\mathbb{P}}\{\|{\bf B}\|>t\}\leq\begin{cases}(m+n)\cdot\exp\Big{(}-\frac{t^{2}}{2b^{2}m}\Big{)}&\text{$0<t\leq\sqrt{\frac{2b^{2}m}{1-2b^{2}mc}\log\frac{m+n}{2\cdot 5^{m+n}}}$};\\ 2\cdot 5^{(m+n)}\cdot\exp(-ct^{2})&\text{$t>\sqrt{\frac{2b^{2}m}{1-2b^{2}mc}\log\frac{m+n}{2\cdot 5^{m+n}}}$}.\end{cases}

T = γ_{0} γ_{- 1} γ_{- 2} ⋮ γ_{- (d - 1)} γ_{1} γ_{0} γ_{- 1} ⋮ γ_{- (d - 2)} γ_{2} γ_{1} γ_{0} ⋮ γ_{- (d - 3)} \dots \dots \dots ⋱ \dots γ_{d - 1} γ_{d - 2} γ_{d - 3} ⋮ γ_{0},

T = γ_{0} γ_{- 1} γ_{- 2} ⋮ γ_{- (d - 1)} γ_{1} γ_{0} γ_{- 1} ⋮ γ_{- (d - 2)} γ_{2} γ_{1} γ_{0} ⋮ γ_{- (d - 3)} \dots \dots \dots ⋱ \dots γ_{d - 1} γ_{d - 2} γ_{d - 3} ⋮ γ_{0},

T = γ_{0} I + j = 1 \sum d - 1 γ_{j} C^{j} + j = 1 \sum d - 1 γ_{- j} (C^{j})^{T},

T = γ_{0} I + j = 1 \sum d - 1 γ_{j} C^{j} + j = 1 \sum d - 1 γ_{- j} (C^{j})^{T},

C = 010 1 ⋱ ⋱ 0 10 .

C = 010 1 ⋱ ⋱ 0 10 .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRandom Matrices and Applications · Advanced Combinatorial Mathematics · Matrix Theory and Algorithms

Full text

A Refined Non-asymptotic Tail Bound of Sub-Gaussian Matrix

Xianjie Gao, Hongwei Zhang

School of Mathematical Sciences, Dalian University of Technology

Dalian, Liaoning, 116024, P.R. China

[email protected];[email protected]

Abstract

In this paper, we obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. As an application, we use the obtained theorem to compute the tail bound of the Gaussian Toeplitz matrix.

Keywords: Non-asymptotic theory; largest singular value; tail bound; sub-Gaussian matrix.

1 Introduction

Random matrix theory (RMT) has been widely applied in many fields, e.g., multivariate statistics [1], high-dimensional data analysis [2], the matrix approximation [3], the combinatorial optimization [4] and the compressed sensing [5]. One main research concern on RMT is to study the tail behavior of the extreme eigenvalues (or singular values) of random matrices.

In general, there are two types of probabilistic statements on the study of probability theory: asymptotic and non-asymptotic. The former aims to analyze the limit behavior of some probability terms, e.g., the central limit theorem

[TABLE]

for Bernoulli random variables $x_{1},x_{2},\cdots,x_{n},\cdots$ , where $g$ is Gaussian variable. There have been many well-known asymptotic results on RMT:

Wigner’s semicircle law

[6]: Let ${\bf A}_{n}$ be a $n\times n$ symmetric matrices whose entries are independent Gaussian variables. As dimension $n\rightarrow\infty$ , the spectrum of the Wigner matrices ${\bf W}_{n}=n^{-1/2}{\bf A}_{n}$ is distributed according to the semicircle law with density:

[TABLE]

Marchenko-Pastur law

[7]: Let ${\bf A}_{m,n}$ $(m\geq n)$ be a $m\times n$ random Gaussian matrix. As the dimensions $m,n\rightarrow\infty$ while the aspect ratio $n/m$ converges to a fix number $y\in(0,1]$ , the spectrum of the matrices $\frac{1}{m}{\bf A}^{*}{\bf A}$ is distributed according to the Marchenko-Pastur law with density:

[TABLE]

where $a=(1-\sqrt{y})^{2}$ and $b=(1+\sqrt{y})^{2}$ .

Bai-Yin’s law

[8]: Let ${\bf A}_{m,n}$ $(m\geq n)$ be a $m\times n$ random matrix whose entries are independent copies of a random variable with zero mean, unit variance, and finite fourth moment. As the dimensions $m,n\rightarrow\infty$ with $n/m$ converging to a fix number $y\in(0,1]$ , the $s_{\min}(\bf A)$ and $s_{\max}(\bf A)$ are subjected to Bai-Yin’s law:

[TABLE]

Although these asymptotic statements can provide a precise limit result when the matrix dimension or sample number goes to the infinity, they cannot describe in what rate these probability terms converge to their limits. To handle this issue, there arise the non-asymptotic viewpoint to study these probability terms. For example, one of the non-asymptotic statement of the central limit theorem is Hoeffding’s inequality:

[TABLE]

There have been many research works on RMT from the non-asymptotic viewpoint. Vershynin [9] gave non-asymptotic methods about the properties of sub-Gaussian and sub-exponential matrix. Tropp [10] proposed a user-friendly framework to study the tail behavior of sums of random matrices. Moreover, there are also other methods for developing the matrix concentration inequalities, e.g., exchangeable pairs [11] and Markov chain couplings [12]. To eliminate the dimension dependence of these tail results for random matrices, the intrinsic dimension (or effective dimension) was employed to improve them (see [13],[14]). Recently, Zhang et al. [15] applied a diagonalization method to obtain the dimension-free tail inequalities of largest singular value for sums of random matrices.

In this paper, we obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. We first give a tail bound for the norm of a sub-Gaussian matrix by transforming a sub-Gaussian matrix into a sub-Gaussian variable. We also obtain a tail bound for the norm of a sub-Gaussian matrix by decomposing a sub-Gaussian matrix into a series of sub-Gaussian matrices. By combining the two resulted tail bounds, we obtain the final tail results. As an application, we use the resulted tail inequalities to study the tail behavior of Gaussian Toeplitz matrix.

The rest of this paper is organized as follows. In the next section, we give some preliminary knowledge on random matrices and sub-Gaussian distributions. In Section 3, we present the main results. Section 4 present the application of our results in the study of Gaussian Toeplitz matrix, and the last section concludes paper.

2 Notations and Preliminaries

In this section, we give some preliminary knowledge on random matrices and sub-Gaussian distributions.

A random matrix is a matrix whose entries are random variables. Its distribution is characterized by the joint distribution of the entries. The expected value of an $m\times n$ random matrix ${\bf B}$ is the $m\times n$ matrix ${\mathbb{E}}({\bf B})$ whose entries are the expected values of the corresponding entries of ${\bf B}$ , assuming that they all exist.

Let ${\bf B}_{m\times n}$ be a random matrix. Let $S^{n-1}=\{x\in{\mathbb{R}}^{n}:\|x\|_{2}=1\}$ denote the Euclidean sphere in ${\mathbb{R}}^{n}$ . The largest singular value of ${\bf B}$ is by definition

[TABLE]

Given an arbitrary matrix ${\bf B}$ , the Hermitian dilation of ${\bf B}$ is defined by

[TABLE]

It is ture that $\lambda_{\max}({\mathcal{H}}{(\bf B)})=\|{\mathcal{H}}{(\bf B)}\|=\|{\bf B}\|$ , where $\lambda_{\max}$ denotes the largest eigenvalue. The relationship for real function $f$ is the transfer rule. If $f(a)\leq g(a)$ for $a\in I$ , then $f({\bf H})\preceq g({\bf H})$ for the eigenvalues of ${\bf H}$ lie in $I$ .

Sub-gaussian distributions are referring to a large class of probability distributions, e.g., normal random variables, Bernoulli and all bounded random variables.

Definition 2.1 A real-valued random variable $x$ is said to be sub-Gaussian if there exits $c>0$ such that for every $t>0$

[TABLE]

Assuming the sub-Gaussian random variable’s mean is zero, the following lemma presents equivalent conditions.

Lemma 2.2 Let $x$ be a mean zero (centered) random variable, the following statements are equivalent: 1) $x$ is sub-Gaussian; and 2) $\exists b>0$ , $\forall\theta\in{\mathbb{R}}$ , there holds that

[TABLE]

There are more and more research interests lying in the sub-Gaussian distributions, including spectral properties of random matrices [16] and tail inequalities of sub-Gaussian random vectors [17].

3 Main Results

In this section, we obtain a refined upper bound for the largest singular value (the norm) of sub-Gaussian matrix. We first give a upper bound for the norm of sub-Gaussian matrix by converting into a random sub-Gaussian variable.

Theorem 3.1 Let ${\bf B}$ be an $m\times n$ random sub-Gaussian matrix. That is, its entries $x_{ij}$ are i.i.d. centered random variables, obeys the sub-Gaussian distribution. Then there holds that for all $t\geq 0$ ,

[TABLE]

The proof of Theorem 3.1 is similar to the Proposition 2.4 of [9], where $m=n$ . Here we give the proof of the general case.

Proof The main idea of the proof of Theorem 3.1 is to convert the random matrix into a random variable, i.e., $\langle{\bf B}x,y\rangle$ is a sub-Gaussian random variable. We then use the covering number to complete the proof.

[TABLE]

where $\mathcal{N}$ , $\mathcal{M}$ are $\frac{1}{2}$ -nets of $S^{n-1}$ , $S^{m-1}$ respectively, and the bounds on cardinality of the net are $|{\mathcal{N}}|\leq(1+\epsilon/2)^{n}$ and $|{\mathcal{M}}|\leq(1+\epsilon/2)^{m}$ . $\blacksquare$

A minor shortcoming of above result is that when the matrix dimension increases, the result becomes very loose. Another method is to obtain the tail bound for matrix sub-Gaussian series. We first introduce the matrix sub-Gaussian moment generating function (mgf) bound.

Proposition 3.2 Assume that ${\bf H}$ is a fixed Hermitian matrix and the random variable $x$ obeys the centered sub-Gaussian distribution. Then, there holds that,

[TABLE]

According to the transfer rule, it is easy to get the proposition. Based on the mgf result (3.2), we develop a tail bound for the matrix sub-Gaussian series.

Theorem 3.3 Consider a finite sequence $\{{\bf H}_{k}:k=1,\ldots,K\}$ of fixed Hermitian matrices with dimension $d$ , and $\{x_{k}:k=1,\ldots,K\}$ be a finite sequence of independent centered sub-Gaussian random variables. Compute the variance parameter

[TABLE]

Then, for all $t\geq 0$ ,

[TABLE]

Proof It follows from Proposition 3.2 that, for any $\theta>0$ ,

[TABLE]

where $\rho:=\|\sum_{k}{\bf H}_{k}^{2}\|$ , the first inequality follows from Theorem 3.6 of [10]. This inequality holds for any positive $\theta$ , so we may take an infimum to complete the proof. The infimum is attained when $\theta=\frac{t}{b^{2}\rho}$ . $\blacksquare$

We apply above result to study the sum of rectangular matrix series by using matrices Hermitian dilation. The following is the general version of Theorem 3.3.

Corollary 3.4 Consider a finite sequence $\{{\bf D}_{k}:k=1,\ldots,K\}$ of fixed matrices with dimension $m\times n$ , and $\{x_{k}:k=1,\ldots,K\}$ be a finite sequence of independent centered sub-Gaussian random variables. Compute the variance parameter

[TABLE]

Then, for all $t\geq 0$ ,

[TABLE]

Proof According to Hermitian dilation we know that

[TABLE]

We invoke Theorem 3.3 to obtain the tail bound for the sum of rectangular matrix series. The matrix variance parameter $\rho$ satisfies the relation:

[TABLE]

This completes the proof. $\blacksquare$

Based on the general version of tail bound for matrix sub-Gaussian series, we obtain another tail bound for the norm of the sub-Gaussian matrix.

Theorem 3.5 Under the notations and conditions in Theorem 3.1. Then there holds that for all $t\geq 0$ ,

[TABLE]

Proof In order to use Corollary 3.4, we decompose matrix as a matrix sub-Gaussian series:

[TABLE]

The matrix ${\bf E}_{ij}$ has a element one in the $(i,j)$ position and zeros elsewhere. By calculating $\rho=m$ , the conclusion is established by using Corollary 3.4. $\blacksquare$

The combination of Theorem 3.1 and Theorem 3.5 leads to the following refined upper bound for the largest singular value (the soft edge) of sub-Gaussian matrix.

Theorem 3.6 Follow the notations and conditions in Theorem 3.1. Then there holds that for all $t\geq 0$ ,

[TABLE]

4 Application: Gaussian Toeplitz Matrix

In this section, we use our theoretical findings to compute the tail bound of the Gaussian Toeplitz matrix. The Gaussian Toeplitz matrix is an example of Gaussian random matrix which has been widely used in various fields, *e.g.,*differential equations, spline functions, and signal processing [18]. We consider a unsymmetric Gaussian Toeplitz matrix ${\bf T}\in\mathbb{C}^{d\times d}$ in the following form:

[TABLE]

where $\gamma_{-(d-1)},\ldots,\gamma_{d-1}$ are independent standard normal variables. The Gaussian Toeplitz matrix ${\bf T}$ can be represented as a matrix Gaussian series:

[TABLE]

where ${\bf C}^{j}$ is the $j$ -th power of ${\bf C}$ with

[TABLE]

Using the Theorem 3.6, we can compute the tail bound of the Gaussian Toeplitz matrix. First, we calculate

[TABLE]

The matrix variance parameter $\rho=d$ can be calculated:

[TABLE]

For Gaussian matrix, $b=1$ , $c=\frac{1}{2}$ . Through the application of Theorem 3.6, tail bound of the Gaussian Toeplitz matrix is presented, for all $t\geq 0$ ,

[TABLE]

5 Conclusion

In this paper, we first present the tail bounds for the largest singular value of sub-Gaussian matrix and matrix sub-Gaussian series. We then obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. As an application, we finally compute the tail bound of Gaussian Toeplitz matrix.

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1(1) [1] R. Muirhead. Aspects of Multivariate Statistical Theory . John Wiley & Sons, Inc., New York, 1982.
2(2) [2] P. Bühlmann and S. Van De Geer. Statistics for High-dimensional Data: Methods, Theory and Applications . Springer Science & Business Media, 2011.
3(3) [3] N. Halko, P. G. Martinsson, and J. A. Tropp. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions . SIAM Review, 2011, 53 (2): 217–288.
4(4) [4] A. Naor, O. Regev, and T. Vidick. Efficient rounding for the noncommutative grothendieck inequality. in: Proceedings of the Forty-fifth Annual ACM Symposium on Theory of Computing . ACM, 2013, 71–80.
5(5) [5] V. Chandrasekaran, B. Recht, P. A. Parrilo, and A. S. Willsky. The convex geometry of linear inverse problems . Foundations of Computational Mathematics, 2012, 12 (6): 805–849.
6(6) [6] E. P. Wigner. On the distribution of the roots of certain symmetric matrices . Annals of Mathematics, 1958, 325–327.
7(7) [7] V. A. Marchenko and L. A. Pastur. Distribution of eigenvalues for some sets of random matrices . Matematicheskii Sbornik, 1967, 114 (4): 507–536.
8(8) [8] Z. Bai and Y. Yin. Limit of the smallest eigenvalue of a large dimensional sample covariance matrix . The Annals of Probability, 1993, 21 (3): 1275–1294.