A note on variance bounds and location of eigenvalues
R. Sharma, A. Sharma, R. Saini

TL;DR
This paper explores advanced variance bounds for real and complex numbers and extends these to eigenvalues and matrix spread, providing refined mathematical tools for spectral analysis.
Contribution
It introduces new extensions and refinements of variance bounds and applies them to eigenvalues and matrix spread analysis.
Findings
Derived tighter variance bounds for real and complex numbers
Extended bounds to eigenvalues and matrix spread
Provided mathematical tools for spectral analysis
Abstract
We discuss some extensions and refinements of the variance bounds for both real and complex numbers. The related bounds for the eigenvalues and spread of a matrix are also derived here.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Mathematical functions and polynomials · Mathematical Inequalities and Applications
**A note on variance bounds and location of eigenvalues **
R. Sharma, A. Sharma and R. Saini
Department of Mathematics & Statistics
Himachal Pradesh University
Shimla - 171005,
India
email: [email protected]
**Abstract. **We discuss some extensions and refinements of the variance bounds for both real and complex numbers. The related bounds for the eigenvalues and spread of a matrix are also derived here.
**AMS classification. ** 15A42, 26C10, 60E15
Key words and phrases. Variance, complex numbers, leptokurtic distributions, eigenvalues, trace, spread, polynomial, span, Jung’s theorem in the plane.
1 Introduction
Let denote complex numbers. Their arithmetic mean is the number
[TABLE]
In literature, the number
[TABLE]
or its equivalent expressions have been studied in several different contexts and notations and is termed as the variance of complex numbers at many places. For example, see Audenaert , Bhatia and Sharma Merikoski and Kumar , and Park .
The number
[TABLE]
is also important in this context. If ’s are all real we denote them by ’s with and The arithmetic mean by and variance by the lower case letter In this case but in general rather than is more consistent with For instance, if and only if all the ’s are equal. This is not the case with for example, for three distinct complex numbers we have It however turns out that for some purposes is more consistent with
[TABLE]
than Note that the analogue of the Popoviciu inequality [18]
[TABLE]
for the complex numbers says that
[TABLE]
But it is not always true that For example, for and and
The corresponding inequality for is
[TABLE]
where is the radius of the smallest disk containing all the numbers ’s, see
A classical theorem of Jung [9] says that the complex numbers ’s in a plane can be contained in a closed disk of radius We thus have
[TABLE]
In this context it is in interesting to note a case when the given complex numbers lie on the boundary of the smallest disk containing them. We here show that if the complex numbers lie on a circle with centre at their arithmetic mean then this circle is the smallest circle enclosing these points, (see Theorem 2.1 & 3.1 below). Further, if the complex numbers are all collinear then , and conversely, ( Theorem 2.2). A necessary and sufficient condition is given for which the numbers , and are all equal, ( Theorem 2.2 ). We obtain a complex analogue of the inequality, Mallows and Richter [11],
[TABLE]
where is the arithmetic mean of any subset of numbers chosen from the real numbers (Theorem 2.3).
On the other hand we find in literature that the inequality (1.5) and its complementary Nagy’s inequality [13],
[TABLE]
also provide bounds for the spread of a complex matrix when the eigenvalues of are all real. The spread of a matrix is the maximum distance between two eigenvalues of a matrix, Spd We have,
[TABLE]
where and tr denotes the trace of see [6, 23].
We show that the inequalities, [3, 21],
[TABLE]
provide some further refinements of the inequalities (1.5) and (1.9) and consequently we get better bounds for the spread of a matrix for some special cases, ( Theorem 2.4, 2.5, 3.2). A refinement of the inequality (1.5) is obtained for Leptokurtic and Mesokurtic distributions, (Theorem 2.6). We obtain refinements of the eigenvalue bounds in some special cases, (Theorem 3.3, 3.4). Likewise, the bounds for the span of a polynomial are given, (Theorem 3.5).
2 Main Results
**Theorem 2.1. **If the complex numbers ’s lie on a circle in the complex plane with centre and radius then is the radius of the smallest disk containing all the points ’s.
**Proof. **For any complex number we can write (1.2) in the form
[TABLE]
Under the condition of the theorem, for all and therefore
[TABLE]
Combining (2.1) and (2.2), we get that
[TABLE]
From the first inequality (1.7), So the minimum value of the is This implies that if then is the radius of the smallest disk containing the points ’s. For (2.3) gives This proves the theorem.
**Theorem 2.2. **Let be the points in the finite complex plane and let and be defined as in (1.2), (1.3) and (1.4), respectively. Then, if and only if all the points lie on a straight line.
**Proof. **In the complex plane the convex combination of complex numbers lie in the convex hull of these numbers. It follows that if the points ’s are collinear then also lies on the straight line passing through ’s.
From (1.2) - (1.4), we see that if and only if
[TABLE]
The equality occurs in triangle inequality
[TABLE]
if and only if the ratio of any two non-zero terms is positive that is see Ahlfors [1]. This means (2.4) holds true if and only if the ratio of any two non zero terms in (2.4) is positive, that is
[TABLE]
The square of a complex number is positive if and only if is real and therefore (2.5) implies that is real. Also, is real if and only if lies on the straight line passing through and If for some then and so lies on the straight line passing through and
We need following lemma to extend the inequality (1.8) for complex numbers.
Lemma 2.1 Let and be two sets of complex numbers. Denote by and the arithmetic mean and variance of ’s, respectively. Then the combined variance of the set is given by
[TABLE]
Proof. The combined variance of the set of numbers can be written as
[TABLE]
where
[TABLE]
We note that
[TABLE]
[TABLE]
Therefore,
[TABLE]
On using similar arguments, we have
[TABLE]
The assertions of the theorem now follow on using (2.8) and (2.9) in (2.7).
**Theorem 2.3. **Let be the arithmetic mean of any subset of numbers chosen from the set of complex numbers and let be defined as in (1.4). Then the inequality
[TABLE]
holds true for
**Proof. **Let and be the disjoint sets of and numbers chosen from the numbers respectively. Denote by and the variance of and respectively. We now apply Lemma 2.1 and find that
[TABLE]
Further,
[TABLE]
and therefore (2.1) can be written as
[TABLE]
On using similar arguments, we have
[TABLE]
On applying triangle inequality we find from (2.13) that
[TABLE]
From (2.12) and (2.14), we get that
[TABLE]
Again by triangle inequality, and therefore
[TABLE]
The inequality (2.10) now follows from (2.15) and (2.16).
The inequality (2.10) is an extension of Mallows and Richter inequality [11]. For we obtain the generalisation of the well known Samuelson’s inequality [20],
[TABLE]
Likewise, we can prove the following extension of Nagy’s inequality [13],
[TABLE]
Note that** **for , and therefore from (2.13) on using triangle inequality we get that
[TABLE]
Similarly, from (2.12), we have
[TABLE]
and by addition we obtain the inequality
[TABLE]
It then follows inductively that the inequality
[TABLE]
holds true for and therefore for we have
[TABLE]
for all The inequality (2.18) implies (2.17). Also, see [24].
Theorem 2.4. For we have
[TABLE]
and
with
[TABLE]
Proof. The second inequality (1.11) implies that
[TABLE]
and therefore for we can write
[TABLE]
It is clear that and since increases in the interval we find that
[TABLE]
Combining (2.22) and the second inequality (1.11); we immediately get (2.19).
Further, it follows from (2.21) that for
[TABLE]
Combining (2.23) with the first inequality (1.11); a little computation leads to (2.20).
It may be noted here that the inequality (2.20) can equivalently be written as
[TABLE]
where
We mention an alternative proof of (2.24). From the second inequality (1.11),
[TABLE]
Also, for from the inequality (1.5), we have and for
[TABLE]
The inequality (2.24) follows from (2.25) and (2.26).
**Theorem 2.5. **For and we have
[TABLE]
and with
[TABLE]
Proof. We write (1.8) in the form
[TABLE]
Using arithmetic geometric mean inequality,
[TABLE]
Thus, from (2.29) and (2.30),
[TABLE]
It follows from (2.31) that
[TABLE]
and consequently, for and we have
[TABLE]
It is clear that for and since decreases in the interval we find that
[TABLE]
Combining (2.33) with the second inequality (1.11); we immediately get (2.27).
From (2.33), we also have
[TABLE]
The inequality (2.28) follows from (2.34) and the first inequality (1.11).
Sharma et al. [22] have proved that
[TABLE]
where and
If the distribution is Leptokurtic or Mesokurtic, we have, see [10],
[TABLE]
We prove a refinement of the inequality (1.5) in the following theorem.
**Theorem 2.6. **For a Leptokurtic or Mesokurtic distribution, we have
[TABLE]
**Proof. **Under the assumptions of the theorem the inequalities (2.35) and (2.36) hold true. By (2.36), and we obtain from (2.18) that
[TABLE]
This gives the first inequality (2.37). The second inequality (2.37) follows from (2.38) on using arithmetic - geometric mean inequality,
We remark that the inequality (2.37) also holds true for both discrete and continuous distributions.
3 Bounds for eigenvalues
Let denote the algebra of all complex matrices. We assume that the eigenvalues of are all real, and may define respectively their arithmetic mean and variance to be
[TABLE]
and
[TABLE]
where
The spread of a matrix is the greatest distance between its eigenvalues. The notion of the spread was introduced by Mirsky [14,15] and several authors have studied bounds for the spread of a matrix, see [6,8,13,24].
**Theorem 3.1. **If trace of a unitary matrix is zero then the unit circle is the smallest circle enclosing the eigenvalues of and greatest lower bound on the Spd is
**Proof. **The eigenvalues of a unitary matrix all lie on the unit circle and by assumption of the theorem tr So, the eigenvalues ’s satisfy the conditions of the Theorem 2.1 and hence theunit circle is the smallest circle containing ’s. It also follows from the second inequality (1.7) that Spd
**Example 1. **The basic circulant matrix with first row is a unitary matrix and its trace is zero. By Theorem 3.1 the unit disk is the smallest disk containing eigenvalues of and Spd Also, for we have Spd
The following theorem is a consequence of Theorem 2.4 and provides refinements of the inequalities (1.10).
**Theorem 3.2. **Let the eigenvalues of an element be all non negative and let tr Then
[TABLE]
and
[TABLE]
**Proof. **Under the condition tr , we have
[TABLE]
Further, the eigenvalues of are all non-negative, therefore and Spd So we can apply Theorem 2.1, the inequalities (3.3) and (3.4) follow on using (3.1) and (3.2) in (2.10) and (2.20), respectively.
Example 2. Let
[TABLE]
From (1.9), SpdThe matrix is positive definite and tr So, from our bounds (3.3) and (3.4) we have better estimate Spd
Likewise, we can obtain another refinement of the inequality (1.10) on applying Theorem 2.5. if and tr then
[TABLE]
and
[TABLE]
Further, Wolkowicz and Styan [23] have shown that if the eigenvalues of are all real and then
[TABLE]
and
[TABLE]
The inequalities (3.8) and (3.9) follow respectively from the inequalities, [7,20],
[TABLE]
and
[TABLE]
We now discuss extensions of these inequalities for the case when any one eigenvalue of is known as in case of stochastic and singular matrices.
It is clear from Lemma 2.1 that if is the variance of numbers obtained by excluding a number from the real numbers then
[TABLE]
**Theorem 3.3. **Let the eigenvalues of be all real. Let be an eigenvalue of and denote the remaining eigenvalues by Then, for
[TABLE]
and
[TABLE]
**Proof. **The arithmetic mean of eigenvalues can be written as
[TABLE]
By the use of (3.13) the variance of these eigenvalues is
[TABLE]
On applying (3.10) to numbers and using (3.16) and (3.17); we immediately get (3.12). Likewise, (3.15) follows from (3.11).
**Theorem 3.4. **Under the conditions of the Theorem 3.3, we have
[TABLE]
and
[TABLE]
**Proof. **On using the inequalities (1.5) and (1.9), for numbers we have
[TABLE]
Inserting (3.17) in (3.20), we immediately get (3.18) and (3.19) on simplifications.
Example 3. Let
[TABLE]
From the inequalities (3.8) , we have The largest eigenvalue of is as all row sum of the positive definite matrix is . From (3.14) we have better estimate for the smallest root,
We now consider polynomials with real zeros. Let be a monic polynomial
[TABLE]
with only real zeros. Then the length of the smallest interval containing all the zeros of is called Span of see Denote by the span of then
[TABLE]
See Corollary 6.1.4 and Theorem 6.1.6 in [19].
We prove a refinement of (3.22) in the following theorem.
**Theorem 3.5. **Let the zeros of the polynomial (3.21) be all non-negative and let Then
[TABLE]
and with
[TABLE]
**Proof. **Let be the roots of the polynomial (3.21). Then, on using relation between roots and coefficient of polynomial, we have
[TABLE]
and
[TABLE]
The assertions of the theorem now follow on applying Theorem 2.4.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ahlfors, L. V., Complex Analysis , Mc Graw-Hill (3rd ed) (1979).
- 2[2] Audenaert K.M.R., Variance bounds with an application to norm bounds for commutators , Linear Algebra Appl. 432 (2010), 1126-1143.
- 3[3] Bhatia, R. and Davis, C., A better bound on the Variance , Amer. Math. Month., 107 (2000), 353-357.
- 4[4] Bhatia, R., Sharma, R., Some inequalities for positive linear maps , Linear Algebra Appl. 436 (2012), 1562-1571.
- 5[5] Bhatia, R., Sharma, R., Positive linear maps and Spread of matrices-II , Linear Algebra Appl. 491 (2016), 30-40.
- 6[6] Brauer, A., Mewborn, A.C., The greatest distance between two characteristic roots of a matrix , Duke Math. J., 26 (1959), 653-661.
- 7[7] Brunk, H.D., Note on two papers of K.R. Nair , J. Indian Soc. Agricultural Statist. 11 (1959), 186-189.
- 8[8] Johnson , C.R., Kumar, R. and Wolkowicz, H., Lower bounds for the spread of a matrix , Linear Algebra Appl. 71 (1985), 161-173.
