Orthogonality to matrix subspaces, and a distance formula
Priyanka Grover

TL;DR
This paper provides a precise condition for matrix orthogonality to subspaces and derives a formula for the distance from matrices to certain subalgebras, advancing understanding of matrix geometry.
Contribution
It introduces a necessary and sufficient condition for matrix orthogonality to subspaces and a new distance formula to unital C*-subalgebras.
Findings
Characterization of Birkhoff-James orthogonality to subspaces
Explicit distance formula to unital C*-subalgebras
Enhanced understanding of matrix subspace geometry
Abstract
We obtain a necessary and sufficient condition for a matrix to be Birkhoff-James orthogonal to any subspace of . Using this we obtain an expression for the distance of from any unital subalgebra of .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Advanced Topics in Algebra · Holomorphic and Operator Theory
Orthogonality to matrix subspaces, and a distance formula
Priyanka Grover
*Theoretical Statistics and Mathematics Unit, Indian Statistical Institute, Delhi Centre, 7, S.J.S. Sansanwal Marg, New Delhi-110016, India
Email: [email protected]*
Abstract
We obtain a necessary and sufficient condition for a matrix to be Birkhoff-James orthogonal to any subspace of . Using this we obtain an expression for the distance of from any unital subalgebra of .
*AMS classification: * 15A60, 15A09, 47A12
*Keywords: * Birkhoff-James orthogonality, Subdifferential, Singular value decomposition, Moore-Penrose inverse, Pinching, Variance.
1 Introduction
Let be the space of complex matrices and let be any subspace of . For any , let
[TABLE]
be the operator norm of . Then is said to be (Birkhoff-James) orthogonal to if
[TABLE]
The space is a complex Hilbert space under the inner product and a real Hilbert space under the inner product . Let be the orthogonal complement of , where the orthogonal complement is with respect to the usual Hilbert space orthogonality in with the inner product or , depending upon whether is a real or complex subspace. Note that if such that , then is orthogonal to .
Bhatia and emrl [6] obtained an interesting characterisation of orthogonality when , where is any matrix in . They showed that is orthogonal to if and only if there exists a unit vector such that and . In other words, is orthogonal to if and only if there exists a positive semidefinite matrix of rank one such that and Such positive semidefinite matrices with trace 1 are called density matrices. We use the notation to mean is positive semidefinite.
Let the subspace of all diagonal matrices with real entries, and let be any Hermitian matrix. Then is called minimal if for all . Andruchow, Larotonda, Recht, and Varela [1, Theorem 1] showed that a Hermitian matrix is minimal if and only if there exists a density matrix such that and all diagonal entries of are zero. In our notation, is minimal is same as saying that is orthogonal to the subspace . If is Hermitian, then note that is orthogonal to if and only if is orthogonal to . Now is the subspace of all matries such that their diagonal entries are zero. The condition is same as and diagonal entries of are same as diagonal entries of . Therefore Theorem 1 in [1] can be interpreted as follows. A Hermitian matrix is orthogonal to if any only if and . The following theorem is a generalization of this result as well as Bhatia-emrl theorem.
Theorem 1**.**
Let and let be the multiplicity of the maximum singular value of . Let be any (real or complex) subspace of Then is orthogonal to if and only if there exists a density matrix of complex rank at most such that and . (If rank , then has the form where are unit vectors such that and are such that and .)
Here, is the best possible upper bound on rank . This has been illustrated later in Remark 4 in Section 4. When , the above theorem says that is orthogonal to if and only if there exists a of the form such that , and . By the Hausdorff-Toeplitz theorem, we get a unit vector such that and . The first condition is stronger than that in [6, Theorem 1.1].
Let denote the distance of a matrix from the subspace , defined as
[TABLE]
Audenaert [2] showed that when , then
[TABLE]
Further the maximisation over on the right hand side of (2) can be restricted to density matrices of rank 1. The quantity is called the variance of with respect to the density matrix . Bhatia and Sharma [7] showed that if is any positive unital linear map, then
[TABLE]
By choosing for different density matrices , they obtained various interesting bounds on
It would be interesting to have a generalisation of (2) with replaced by any unital subalgebra of . (This problem has also been raised by M. Rieffel in [13].) Let be any unital subalgebra of . Let denote the projection of onto . We note that is a bimodule map:
[TABLE]
In particular, when is the subalgebra of block diagonal matrices, the matrix is called a pinching of and is denoted by . It is defined as follows. If X=\left[\begin{array}[]{cccc}X_{11}&\cdots&X_{1k}\\ X_{21}&\cdots&X_{2k}\\ \vdots&\vdots&\vdots\\ X_{k1}&\cdots&X_{kk}\end{array}\right] then
[TABLE]
Properties of pinchings are studied in detail in [3] and [4].
Our next result provides a generalisation of (2) for distance of to any unital subalgebra of .
Theorem 2**.**
Let be any unital subalgebra of . Let denote the projection of onto .
Then
[TABLE]
where denotes the Moore-Penrose inverse of . The maximum on the right hand side of (5) can be restricted to rank .
We prove Theorem 1 using ideas of subdifferential calculus. A brief summary of these is given in Section 2. The proofs are given in Section 3.
2 Preliminaries
Let be a complex Hilbert space. Let be a convex function. Then the subdifferential of at any point , denoted by , is the set of such that
[TABLE]
It follows from (6) that is minimized at if and only if .
We use an idea similar to the one in [8, Theorem 2.1]. Let . This is the composition of two functions namely from into and from into . Thus we need to find subdifferentials of composition maps. For that we need a chain rule.
Proposition 1**.**
Let be any two Hilbert spaces. Let be a convex function. Let be a linear map and let be the affine map defined by , for some . Then
[TABLE]
where is the adjoint of defined as
[TABLE]
In our setting, is the map . The subdifferential of this map has been calculated by Watson [14].
Proposition 2**.**
Let . Then
[TABLE]
where denotes the convex hull of a set .
These elementary facts can be found in [11]. In this book the author deals with convex functions . The same proofs can be extended to functions , where is any Hilbert space.
3 Proofs
Proof of Theorem 1 Suppose there exists a positive semidefinite with such that and . Then for any
[TABLE]
Now for any ,
[TABLE]
where denotes the trace norm. So,
[TABLE]
Since , we have . The matrices and are positive semidefinite, therefore and by our assumption, . Using these in (10) we get that .
Conversely, suppose
[TABLE]
Let be the inclusion map. Then is the projection onto the subspace . Let be the map defined as
[TABLE]
Let be the map taking an matrix to . Then (11) can be rewritten as
[TABLE]
that is, is minimized at [math]. Therefore . Using Proposition 1, we get
[TABLE]
By Proposition 2,
[TABLE]
From (12) and (13) it follows that there exist unit vectors such that and numbers such that , and
[TABLE]
Let . Then and . Note that
[TABLE]
So, from (14) we get , that is, . Since each is a right singular vector for , we have . Using this we obtain
[TABLE]
Now let . We now show that if satisfies (15), then rank . First note that and commute and therefore can be diagonalised simultaneously. So we can assume and in (15) to be diagonal matrices. By hypothesis of the diagonal entries of are equal to . Let A^{*}A=\left[\begin{array}[]{ccccccc}\|A\|^{2}&&&&&&\\ &\ddots&&&&&\\ &&\|A\|^{2}&&&\\ &&&s_{k+1}^{2}&&\\ &&&&\ddots&\\ &&&&&s_{n}^{2}\end{array}\right], where for all . If P=\left[\begin{array}[]{ccc}p_{1}&&\\ &\ddots&\\ &&p_{n}\end{array}\right], then from (15) we obtain
[TABLE]
So for all . Hence rank . ∎
Proof of Theorem 2 We first show that it is sufficient to prove the result when is a subalgebra of block diagonal matrices in . If is any subalgebra of then there exist with such that is -isomorphic to , the -isomorphism being for some unitary matrix (see [9, p. 249], [10, p. 74]). By definition
[TABLE]
Let denote the matrix . Since is unitarily invariant, we get
[TABLE]
Next we show that for any density matrix ,
[TABLE]
where is the pinching map as defined in (4). Since
[TABLE]
we have
[TABLE]
Now note that for any . . Therefore the above expression is same as
[TABLE]
This gives (17). So it is enough to prove (5) when is a subalgebra of block diagonal matrices. We first show that
[TABLE]
Let be any density matrix. Then Therefore
[TABLE]
Let . Applying the translation in (21) we get
[TABLE]
We show that the expression on the left hand side is invariant under this translation. By expanding the expression on the left hand side of (22), we get
[TABLE]
We show that except for the first term, , the rest of the terms in (23) are zero. We shall prove that the second term
[TABLE]
in (23) is zero. The proof for the other two terms is similar.
By using (3), the expression in (24) is equal to
[TABLE]
By (18) this is equal to
[TABLE]
If is invertible then this is clearly zero. So let be not invertible. This means that if \mathcal{C}(P)=\left[\begin{array}[]{ccc}P_{1}&&\\ &\ddots&\\ &&P_{k}\end{array}\right], then there exists such that is not invertible. Let denote the block diagonal unitary matrix
[TABLE]
where if is invertible and U_{i}^{*}P_{i}U_{i}=\left[\begin{array}[]{ccc}\Lambda_{i}&\\ &O\\ \end{array}\right], if is not invertible. (Here is the diagonal matrix with eigenvalues of as its diagonal entries.) Let denote the matrix . Then from (3) and (18), we get that the expression in (25) is same as
[TABLE]
Now \mathcal{C}(P^{\prime})=\left[\begin{array}[]{ccccc}\Lambda_{1}&&&&\\ &O&&&\\ &&\Lambda_{2}&&\\ &&&O&\\ &&&&\ddots\end{array}\right]. Write and as -block matrices, respectively such that whenever is not invertible, we have and .
The -entry of is . Suppose . Since , we have for all . Hence the -entry of is zero. So let . Then the -entry of is zero. Therefore the expression in (27) is zero, and hence the expression in (25) is zero. Therefore from (22), we obtain
[TABLE]
for all and for all density matrices . Equation (20) now follows from here.
To show equality in (20), let where for some . Then is orthogonal to . By Theorem 1 there exists a density matrix such that
[TABLE]
and
[TABLE]
From (28) we get that
[TABLE]
By using (3), we obtain
[TABLE]
Substituting (29) in (30) we get
[TABLE]
Now consider . From (29) we see that this is same as . If is invertible, then this is equal to . If is not invertible, then we define as done in (26). From (3) and (18), we obtain
[TABLE]
By definition of , this is equal to , which again by (3) and (18), is same as . Therefore from (31) we have
[TABLE]
4 Remarks
It is clear from the proof of Theorem 1 that the condition can be replaced by the weaker condition in the statement of Theorem 1. 2. 2.
As one would expect, the set need not be a subspace. As an example consider the subspace of . Let A_{1}=\left[\begin{array}[]{ccc}0&1&0\\ 1&0&1\\ 0&1&0\end{array}\right] and A_{2}=\left[\begin{array}[]{ccc}0&0&1\\ 0&0&0\\ 1&0&0\end{array}\right]. It can be checked from Theorem 1 that are orthogonal to . (Take P=\left[\begin{array}[]{ccc}0&0&0\\ 0&1&0\\ 0&0&0\end{array}\right] for and P=\left[\begin{array}[]{ccc}1&0&0\\ 0&0&0\\ 0&0&0\end{array}\right] for , respectively.) Then A_{1}+A_{2}=\left[\begin{array}[]{ccc}0&1&1\\ 1&0&1\\ 1&1&0\end{array}\right], and . But . Hence is not orthogonal to . 3. 3.
Let . Then . In Section 1, we stated that if such that then is orthogonal to . Therefore all the scalar matrices are orthogonal to . We show that if then there exists a matrix with such that . Let and denote the diagonal and off-diagonal parts of , respectively. Then , and . So it is enough to find such that . Let where each occurs on the diagonal times and . Assume . Take Then has trace zero and It is easy to check that . Hence for this particular we have that . 4. 4.
In Theorem 1, is the best possible upper bound on rank . Consider . From Remark 2, we get that if a matrix is orthogonal to then it has to be of the form , for some . When then . Let be any density matrix satisfying . Then , for some . If also satisfies , then we get . Hence rank . 5. 5.
For and any subalgebra of , we can restrict maximum on the right hand side of (5) over rank one density matrices. By the same argument as in the proof of Theorem 2 it is sufficient to prove this for , the subalgebra of diagonal matrices with complex entries. We show
[TABLE]
where is the projection onto . From Theorem 2 we have
[TABLE]
Note that
[TABLE]
Let A=\left[\begin{array}[]{ccc}a&b\\ c&d\end{array}\right] and without loss of generality assume that . Then . For x=\left[\begin{array}[]{ccc}0\\ 1\end{array}\right]
[TABLE]
Combining this with (33), we obtain
[TABLE] 6. 6.
For and any subalgebra of , we note that
[TABLE]
Again it is enough to show that
[TABLE]
If is an off-diagonal matrix, that is, A=\left[\begin{array}[]{ccc}0&b\\ c&0\end{array}\right] then by Theorem 2.1 in [5] we obtain . Conversely let be such that . Then by taking , we have . Again by using Theorem 2.1 in [5] we obtain that . So is of the form \left[\begin{array}[]{ccc}a&b\\ c&d\end{array}\right], where Since norm of each row and each colum is less than or equal to , we get that . Hence .
Acknowledgement. I would like to thank Professor Rajendra Bhatia for several useful discussions and Professor Ajit Iqbal Singh for helpful comments in this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. Andruchow, G. Larotonda, L. Recht, A. Varela, A characterization of minimal Hermitian matrices, Linear Algebra Appl. 436 (2012) 2366–2374.
- 2[2] K.M.R. Audenaert, Variance bounds, with an application to norm bounds for commutators, Linear Algebra Appl. 432 (2010) 1126–1143.
- 3[3] R. Bhatia, Matrix Analysis , Springer, 1997.
- 4[4] R. Bhatia, Positive Definite Matrices , Princeton University Press, 2007.
- 5[5] R. Bhatia, M-D. Choi, C. Davis, Comparing a matrix to its off-diagonal part, Oper. Theory Adv. Appl. 40/41 (1989) 151–164.
- 6[6] R. Bhatia, P. S ˇ ˇ S \check{\text{S}} emrl, Orthogonality of matrices and some distance problems, Linear Algebra Appl. 287 (1999) 77–86.
- 7[7] R. Bhatia, R. Sharma, Some inequalities for positive linear maps, Linear Algebra Appl. 436 (2012) 1562–1571.
- 8[8] T. Bhattacharyya, P. Grover, Characterization of Birkhoff-James orthogonality, J. Math. Anal. Appl. 407 (2013) 350–358.
