A Refined Non-asymptotic Tail Bound of Sub-Gaussian Matrix
Xianjie Gao, Hongwei Zhang

TL;DR
This paper derives a refined non-asymptotic tail bound for the largest singular value of sub-Gaussian matrices and applies it to Gaussian Toeplitz matrices, enhancing understanding of their spectral properties.
Contribution
It provides a new, sharper tail bound for sub-Gaussian matrices and demonstrates its application to Gaussian Toeplitz matrices.
Findings
Refined tail bound for sub-Gaussian matrices
Application to Gaussian Toeplitz matrices
Improved understanding of spectral edge behavior
Abstract
In this paper, we obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. As an application, we use the obtained theorem to compute the tail bound of the Gaussian Toeplitz matrix.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRandom Matrices and Applications · Advanced Combinatorial Mathematics · Matrix Theory and Algorithms
A Refined Non-asymptotic Tail Bound of Sub-Gaussian Matrix
Xianjie Gao, Hongwei Zhang
School of Mathematical Sciences, Dalian University of Technology
Dalian, Liaoning, 116024, P.R. China
[email protected];[email protected]
Abstract
In this paper, we obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. As an application, we use the obtained theorem to compute the tail bound of the Gaussian Toeplitz matrix.
Keywords: Non-asymptotic theory; largest singular value; tail bound; sub-Gaussian matrix.
1 Introduction
Random matrix theory (RMT) has been widely applied in many fields, e.g., multivariate statistics [1], high-dimensional data analysis [2], the matrix approximation [3], the combinatorial optimization [4] and the compressed sensing [5]. One main research concern on RMT is to study the tail behavior of the extreme eigenvalues (or singular values) of random matrices.
In general, there are two types of probabilistic statements on the study of probability theory: asymptotic and non-asymptotic. The former aims to analyze the limit behavior of some probability terms, e.g., the central limit theorem
[TABLE]
for Bernoulli random variables , where is Gaussian variable. There have been many well-known asymptotic results on RMT:
Wigner’s semicircle law
[6]: Let be a symmetric matrices whose entries are independent Gaussian variables. As dimension , the spectrum of the Wigner matrices is distributed according to the semicircle law with density:
[TABLE]
Marchenko-Pastur law
[7]: Let be a random Gaussian matrix. As the dimensions while the aspect ratio converges to a fix number , the spectrum of the matrices is distributed according to the Marchenko-Pastur law with density:
[TABLE]
where and .
Bai-Yin’s law
[8]: Let be a random matrix whose entries are independent copies of a random variable with zero mean, unit variance, and finite fourth moment. As the dimensions with converging to a fix number , the and are subjected to Bai-Yin’s law:
[TABLE]
Although these asymptotic statements can provide a precise limit result when the matrix dimension or sample number goes to the infinity, they cannot describe in what rate these probability terms converge to their limits. To handle this issue, there arise the non-asymptotic viewpoint to study these probability terms. For example, one of the non-asymptotic statement of the central limit theorem is Hoeffding’s inequality:
[TABLE]
There have been many research works on RMT from the non-asymptotic viewpoint. Vershynin [9] gave non-asymptotic methods about the properties of sub-Gaussian and sub-exponential matrix. Tropp [10] proposed a user-friendly framework to study the tail behavior of sums of random matrices. Moreover, there are also other methods for developing the matrix concentration inequalities, e.g., exchangeable pairs [11] and Markov chain couplings [12]. To eliminate the dimension dependence of these tail results for random matrices, the intrinsic dimension (or effective dimension) was employed to improve them (see [13],[14]). Recently, Zhang et al. [15] applied a diagonalization method to obtain the dimension-free tail inequalities of largest singular value for sums of random matrices.
In this paper, we obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. We first give a tail bound for the norm of a sub-Gaussian matrix by transforming a sub-Gaussian matrix into a sub-Gaussian variable. We also obtain a tail bound for the norm of a sub-Gaussian matrix by decomposing a sub-Gaussian matrix into a series of sub-Gaussian matrices. By combining the two resulted tail bounds, we obtain the final tail results. As an application, we use the resulted tail inequalities to study the tail behavior of Gaussian Toeplitz matrix.
The rest of this paper is organized as follows. In the next section, we give some preliminary knowledge on random matrices and sub-Gaussian distributions. In Section 3, we present the main results. Section 4 present the application of our results in the study of Gaussian Toeplitz matrix, and the last section concludes paper.
2 Notations and Preliminaries
In this section, we give some preliminary knowledge on random matrices and sub-Gaussian distributions.
A random matrix is a matrix whose entries are random variables. Its distribution is characterized by the joint distribution of the entries. The expected value of an random matrix is the matrix whose entries are the expected values of the corresponding entries of , assuming that they all exist.
Let be a random matrix. Let denote the Euclidean sphere in . The largest singular value of is by definition
[TABLE]
Given an arbitrary matrix , the Hermitian dilation of is defined by
[TABLE]
It is ture that , where denotes the largest eigenvalue. The relationship for real function is the transfer rule. If for , then for the eigenvalues of lie in .
Sub-gaussian distributions are referring to a large class of probability distributions, e.g., normal random variables, Bernoulli and all bounded random variables.
Definition 2.1 A real-valued random variable is said to be sub-Gaussian if there exits such that for every
[TABLE]
Assuming the sub-Gaussian random variable’s mean is zero, the following lemma presents equivalent conditions.
Lemma 2.2 Let be a mean zero (centered) random variable, the following statements are equivalent: 1) is sub-Gaussian; and 2) , , there holds that
[TABLE]
There are more and more research interests lying in the sub-Gaussian distributions, including spectral properties of random matrices [16] and tail inequalities of sub-Gaussian random vectors [17].
3 Main Results
In this section, we obtain a refined upper bound for the largest singular value (the norm) of sub-Gaussian matrix. We first give a upper bound for the norm of sub-Gaussian matrix by converting into a random sub-Gaussian variable.
Theorem 3.1 Let be an random sub-Gaussian matrix. That is, its entries are i.i.d. centered random variables, obeys the sub-Gaussian distribution. Then there holds that for all ,
[TABLE]
The proof of Theorem 3.1 is similar to the Proposition 2.4 of [9], where . Here we give the proof of the general case.
Proof The main idea of the proof of Theorem 3.1 is to convert the random matrix into a random variable, i.e., is a sub-Gaussian random variable. We then use the covering number to complete the proof.
[TABLE]
where , are -nets of , respectively, and the bounds on cardinality of the net are and .
A minor shortcoming of above result is that when the matrix dimension increases, the result becomes very loose. Another method is to obtain the tail bound for matrix sub-Gaussian series. We first introduce the matrix sub-Gaussian moment generating function (mgf) bound.
Proposition 3.2 Assume that is a fixed Hermitian matrix and the random variable obeys the centered sub-Gaussian distribution. Then, there holds that,
[TABLE]
According to the transfer rule, it is easy to get the proposition. Based on the mgf result (3.2), we develop a tail bound for the matrix sub-Gaussian series.
Theorem 3.3 Consider a finite sequence of fixed Hermitian matrices with dimension , and be a finite sequence of independent centered sub-Gaussian random variables. Compute the variance parameter
[TABLE]
Then, for all ,
[TABLE]
Proof It follows from Proposition 3.2 that, for any ,
[TABLE]
where , the first inequality follows from Theorem 3.6 of [10]. This inequality holds for any positive , so we may take an infimum to complete the proof. The infimum is attained when .
We apply above result to study the sum of rectangular matrix series by using matrices Hermitian dilation. The following is the general version of Theorem 3.3.
Corollary 3.4 Consider a finite sequence of fixed matrices with dimension , and be a finite sequence of independent centered sub-Gaussian random variables. Compute the variance parameter
[TABLE]
Then, for all ,
[TABLE]
Proof According to Hermitian dilation we know that
[TABLE]
We invoke Theorem 3.3 to obtain the tail bound for the sum of rectangular matrix series. The matrix variance parameter satisfies the relation:
[TABLE]
This completes the proof.
Based on the general version of tail bound for matrix sub-Gaussian series, we obtain another tail bound for the norm of the sub-Gaussian matrix.
Theorem 3.5 Under the notations and conditions in Theorem 3.1. Then there holds that for all ,
[TABLE]
Proof In order to use Corollary 3.4, we decompose matrix as a matrix sub-Gaussian series:
[TABLE]
The matrix has a element one in the position and zeros elsewhere. By calculating , the conclusion is established by using Corollary 3.4.
The combination of Theorem 3.1 and Theorem 3.5 leads to the following refined upper bound for the largest singular value (the soft edge) of sub-Gaussian matrix.
Theorem 3.6 Follow the notations and conditions in Theorem 3.1. Then there holds that for all ,
[TABLE]
4 Application: Gaussian Toeplitz Matrix
In this section, we use our theoretical findings to compute the tail bound of the Gaussian Toeplitz matrix. The Gaussian Toeplitz matrix is an example of Gaussian random matrix which has been widely used in various fields, *e.g.,*differential equations, spline functions, and signal processing [18]. We consider a unsymmetric Gaussian Toeplitz matrix in the following form:
[TABLE]
where are independent standard normal variables. The Gaussian Toeplitz matrix can be represented as a matrix Gaussian series:
[TABLE]
where is the -th power of with
[TABLE]
Using the Theorem 3.6, we can compute the tail bound of the Gaussian Toeplitz matrix. First, we calculate
[TABLE]
The matrix variance parameter can be calculated:
[TABLE]
For Gaussian matrix, , . Through the application of Theorem 3.6, tail bound of the Gaussian Toeplitz matrix is presented, for all ,
[TABLE]
5 Conclusion
In this paper, we first present the tail bounds for the largest singular value of sub-Gaussian matrix and matrix sub-Gaussian series. We then obtain a refined non-asymptotic tail bound for the largest singular value (the soft edge) of sub-Gaussian matrix. As an application, we finally compute the tail bound of Gaussian Toeplitz matrix.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) [1] R. Muirhead. Aspects of Multivariate Statistical Theory . John Wiley & Sons, Inc., New York, 1982.
- 2(2) [2] P. Bühlmann and S. Van De Geer. Statistics for High-dimensional Data: Methods, Theory and Applications . Springer Science & Business Media, 2011.
- 3(3) [3] N. Halko, P. G. Martinsson, and J. A. Tropp. Finding structure with randomness: Probabilistic algorithms for constructing approximate matrix decompositions . SIAM Review, 2011, 53 (2): 217–288.
- 4(4) [4] A. Naor, O. Regev, and T. Vidick. Efficient rounding for the noncommutative grothendieck inequality. in: Proceedings of the Forty-fifth Annual ACM Symposium on Theory of Computing . ACM, 2013, 71–80.
- 5(5) [5] V. Chandrasekaran, B. Recht, P. A. Parrilo, and A. S. Willsky. The convex geometry of linear inverse problems . Foundations of Computational Mathematics, 2012, 12 (6): 805–849.
- 6(6) [6] E. P. Wigner. On the distribution of the roots of certain symmetric matrices . Annals of Mathematics, 1958, 325–327.
- 7(7) [7] V. A. Marchenko and L. A. Pastur. Distribution of eigenvalues for some sets of random matrices . Matematicheskii Sbornik, 1967, 114 (4): 507–536.
- 8(8) [8] Z. Bai and Y. Yin. Limit of the smallest eigenvalue of a large dimensional sample covariance matrix . The Annals of Probability, 1993, 21 (3): 1275–1294.
