New upper bounds for the spectral variation of a general matrix
Xuefeng Xu

TL;DR
This paper derives new upper bounds for the spectral variation of general matrices under perturbations, extending classical results like Hoffman–Wielandt to non-normal matrices and improving existing bounds.
Contribution
It introduces novel upper bounds for spectral variation applicable to non-normal matrices, generalizing and improving upon classical spectral stability results.
Findings
New bounds improve existing estimates for spectral variation.
Results extend spectral stability analysis to non-normal matrices.
Some bounds are tighter or more general than previous ones.
Abstract
Let be a normal matrix with spectrum , and let be a perturbed matrix with spectrum . If is still normal, the celebrated Hoffman--Wielandt theorem states that there exists a permutation of such that , where denotes the Frobenius norm of a matrix. This theorem reveals the strong stability of the spectrum of a normal matrix. However, if or is non-normal, the Hoffman--Wielandt theorem does not hold in general. In this paper, we present new upper bounds for , provided that both and are general…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Spectral Theory in Mathematical Physics · Random Matrices and Applications
New upper bounds for the spectral variation of a general matrix
Xuefeng Xu
Department of Mathematics, Purdue University, West Lafayette, IN 47907, USA
[email protected]; [email protected]
Abstract.
Let be a normal matrix with spectrum , and let be a perturbed matrix with spectrum . If is still normal, the celebrated Hoffman–Wielandt theorem states that there exists a permutation of such that \big{(}\sum_{i=1}^{n}|\widetilde{\lambda}_{\pi(i)}-\lambda_{i}|^{2}\big{)}^{1/2}\leq\|E\|_{F}, where denotes the Frobenius norm of a matrix. This theorem reveals the strong stability of the spectrum of a normal matrix. However, if or is non-normal, the Hoffman–Wielandt theorem does not hold in general. In this paper, we present new upper bounds for \big{(}\sum_{i=1}^{n}|\widetilde{\lambda}_{\pi(i)}-\lambda_{i}|^{2}\big{)}^{1/2}, provided that both and are general matrices. Some of our estimates improve or generalize the existing ones.
Key words and phrases:
Hoffman–Wielandt theorem, spectral variation, perturbation, upper bound
2010 Mathematics Subject Classification:
15A18, 47A55, 65F15
1. Introduction
Let be the set of all complex matrices, and let be the identity matrix. For any , the symbols , , and denote the conjugate transpose, the spectral norm, and the Frobenius norm of , respectively. For any , , , , and stand for its trace, diagonal part, strictly lower triangular part, and strictly upper triangular part, respectively. Furthermore, we define
[TABLE]
Obviously, , and if and only if .
Let and have the spectra and , respectively. For any permutation of , we define
[TABLE]
If and are normal matrices, Hoffman and Wielandt [5] proved that there exists a permutation of such that
[TABLE]
This is the well-known Hoffman–Wielandt theorem, which reveals the strong stability of the spectrum of a normal matrix. However, the inequality (1.3) may fail when or is non-normal. Over the past decades, various extensions or analogues of the Hoffman–Wielandt theorem have been developed by many researchers; see, e.g., [4, 12, 2, 3, 6, 7, 11, 9, 8, 1, 10, 13].
If is normal and is non-normal, Sun [12, Theorem 1.1] showed that
[TABLE]
Recently, Xu and Zhang [13, Theorem 3.6] derived that
[TABLE]
which improved the estimate (1.4) due to . Nevertheless, the estimates (1.3)–(1.5) may be invalid for a general matrix . As is well known, for any , there is a nonsingular matrix such that
[TABLE]
where each () is a Jordan block. Let
[TABLE]
It was proved by Song [11, Theorem 2.1] that
[TABLE]
In this paper, we establish some new upper bounds for the spectral variation of a general matrix. One of our main results is
[TABLE]
In view of (1.1), involved in (1.7) is \delta(E_{Q})=\big{(}\|E_{Q}\|_{F}^{2}-\frac{1}{n}|\operatorname*{tr}(E)|^{2}\big{)}^{\frac{1}{2}}. Theoretical analysis shows that the new estimate (1.7) is sharper than (1.6) (see Remark 3.2 for details). Moreover, it is easy to check that (1.7) will reduce to (1.5) if is a normal matrix. That is, the new estimate (1.7) also generalizes the existing one (1.5).
The rest of this paper is organized as follows. In Section 2, we introduce several auxiliary estimates, which play an important role in our analysis. In Section 3, we present new upper bounds for the spectral variation of a general matrix.
2. Preliminaries
For any square matrix , the first lemma provides an upper bound for [13, Lemma 3.1].
Lemma 2.1**.**
Let be a square matrix. Then
[TABLE]
where is defined by (1.1).
The following lemma gives an upper bound for the spectral variation of a normal matrix [13, Theorem 3.6], which plays a key role in the subsequent analysis.
Lemma 2.2**.**
Let be a normal matrix with spectrum , and let be a perturbed matrix with spectrum . Then there exists a permutation of such that
[TABLE]
where is defined by (1.1).
For any , it can be factorized as
[TABLE]
where is nonsingular, and each is a Jordan block with the form
[TABLE]
Let be a parameter, and let
[TABLE]
where T_{i}=\operatorname*{diag}\big{(}1,\varepsilon,\ldots,\varepsilon^{m_{i}-1}\big{)} for all . Then
[TABLE]
where \Lambda=\operatorname*{diag}\big{(}\lambda_{1}I_{m_{1}},\ldots,\lambda_{p}I_{m_{p}}\big{)}, and \Omega=\operatorname*{diag}\big{(}\Omega_{1},\dots,\Omega_{p}\big{)} with
[TABLE]
We are now in a position to present the fundamental estimate of this paper.
Lemma 2.3**.**
Let be factorized as in (2.3), and let be a perturbed matrix. Let
[TABLE]
where T_{i}=\operatorname*{diag}\big{(}1,\varepsilon,\ldots,\varepsilon^{m_{i}-1}\big{)} with . Then, it holds that
[TABLE]
where
[TABLE]
with and .
Proof.
From (2.4), we have
[TABLE]
which yields
[TABLE]
In what follows, we establish the upper bounds for , , and .
(i) Partitioning into the block form with , we have
[TABLE]
Hence,
[TABLE]
Note that
[TABLE]
Thus,
[TABLE]
(ii) It is easy to see that
[TABLE]
Due to , it follows that
[TABLE]
Since (see (2.1)), we obtain
[TABLE]
(iii) In addition, we have
[TABLE]
Combining (2.6)–(2.9), we can arrive at the estimate (2.5). ∎
3. Main results
In light of (2.2) and (2.5), we can derive the following estimate.
Theorem 3.1**.**
Let have the factorization (2.3), and let , where is a perturbation. Let and be the spectra of and , respectively. Then there exists a permutation of such that
[TABLE]
where and .
Proof.
Observe that is a normal matrix with spectrum , and the spectrum of is . Applying Lemma 2.2 to and yields
[TABLE]
where we have used the estimate (2.5). Take
[TABLE]
Direct calculations yield
[TABLE]
Thus, the estimate (3.1) is valid. ∎
Remark 3.2*.*
If , then (3.1) reads
[TABLE]
Due to
[TABLE]
it follows that
[TABLE]
On the other hand, if , then (3.1) reads
[TABLE]
Then
[TABLE]
Hence, the estimate (3.1) is sharper than (1.6).
The next two estimates are based on the different constraints for .
Theorem 3.3**.**
Under the assumptions of Theorem 3.1, it holds that
[TABLE]
Proof.
Take
[TABLE]
Direct computation yields
[TABLE]
Similarly to Theorem 3.1, one can show that the estimate (3.2) holds. ∎
Theorem 3.4**.**
Under the assumptions of Theorem 3.1, it holds that
[TABLE]
where
[TABLE]
Proof.
We first note that is diagonalizable if and only if (or ).
(i) If is diagonalizable, then , , and . In this case, the estimate (2.5) reduces to
[TABLE]
An application of Lemma 2.2 yields
[TABLE]
(ii) If cannot be diagonalized, then and . Direct calculation yields
[TABLE]
Here, denotes the derivative of with respect to . It is easy to check that
[TABLE]
Take
[TABLE]
Direct computation yields
[TABLE]
The rest of this proof is similar to Theorem 3.1. ∎
Remark 3.5*.*
If is diagonalizable, the condition will be satisfied. From (3.3), we have
[TABLE]
which coincides with (3.4). That is, (3.3) has contained the diagonalizable case.
Remark 3.6*.*
In particular, if is normal, then can be chosen as a unitary matrix. In this case, the estimates (3.1)–(3.3) all reduce to
[TABLE]
which is exactly (2.2).
Example 3.7**.**
Let
[TABLE]
where , , and . In this case,
[TABLE]
The upper bounds in (1.6), (3.1), (3.2), and (3.3) are listed below.
Table 1 displays that the new upper bounds in (3.1)–(3.3) are smaller than that in (1.6).
Under the assumptions of Lemma 2.2, if the original matrix is Hermitian, then the following estimate (see [13, Theorem 4.2]) holds:
[TABLE]
In what follows, we consider a special case that the eigenvalues of are all real. In such a case, we can derive more accurate estimates for based on (3.5), which are presented in the following three theorems.
Theorem 3.8**.**
Let be factorized as in (2.3), and let be a perturbed matrix with spectrum . If the eigenvalues of are all real, then there exists a permutation of such that
[TABLE]
Theorem 3.9**.**
Under the assumptions of Theorem 3.8, it holds that
[TABLE]
Theorem 3.10**.**
Under the assumptions of Theorem 3.8, it holds that
[TABLE]
where and are given in Theorem 3.4.
Example 3.11**.**
Let
[TABLE]
where . In this example, it holds that
[TABLE]
The upper bounds in (1.6), (3.6), (3.7), and (3.8) are listed below.
From Table 2, one can see that the new estimates (3.6)–(3.8) are sharper than (1.6).
Remark 3.12*.*
Define
[TABLE]
Using
[TABLE]
one can derive some deductive estimates for . Furthermore, using the relation , one can readily obtain the corresponding estimates for .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] R. Bhatia, Perturbation Bounds for Matrix Eigenvalues , SIAM, Philadelphia, 2007.
- 2[2] R. Bhatia, F. Kittaneh, and R.-C. Li, Some inequalities for commutators and an application to spectral variation. II , Linear Multilinear Algebra 43 (1997), 207–219.
- 3[3] S. C. Eisenstat and I. C. F. Ipsen, Three absolute perturbation bounds for matrix eigenvalues imply relative bounds , SIAM J. Matrix Anal. Appl. 20 (1998), 149–158.
- 4[4] L. Elsner and S. Friedland, Singular values, doubly stochastic matrices, and applications , Linear Algebra Appl. 220 (1995), 161–169.
- 5[5] A. J. Hoffman and H. W. Wielandt, The variation of the spectrum of a normal matrix , Duke Math. J. 20 (1953), 37–39.
- 6[6] I. C. F. Ipsen, Relative perturbation results for matrix eigenvalues and singular values , Acta Numer. 7 (1998), 151–201.
- 7[7] R.-C. Li, Relative perturbation theory: I. eigenvalue and singular value variations , SIAM J. Matrix Anal. Appl. 19 (1998), 956–982.
- 8[8] W. Li and J.-X. Chen, The eigenvalue perturbation bound for arbitrary matrices , J. Comput. Math. 24 (2006), 141–148.
