Eigenvector of a matrix in $SO_3(\mathbb{R})$
Amol Sasane, Victor Ufnarovski

TL;DR
This paper provides multiple proofs that a specific vector, constructed from the entries of a matrix in $O_3( eal)$, is an eigenvector associated with eigenvalue 1, assuming it exists.
Contribution
It introduces several different proofs for the eigenvector property of a particular vector in $SO_3( eal)$ matrices, enhancing understanding of their structure.
Findings
The vector $V$ is an eigenvector of $A$ with eigenvalue 1.
Multiple proofs confirm the eigenvector property.
Conditions for the existence of $V$ are discussed.
Abstract
Let . We give several different proofs of the fact that the vector if it exists, is an eigenvector of corresponding to the eigenvalue .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMatrix Theory and Algorithms · Advanced Topics in Algebra · Graph theory and applications
Eigenvector of a matrix in
Amol Sasane
Department of Mathematics
London School of Economics
Houghton Street
London WC2A 2AE
United Kingdom
and
Victor Ufnarovski
Department of Mathematics
Lund University
Sölvegatan 18, 223 62 Lund
Sweden
Abstract.
Let . We give several different proofs of the fact that the vector
[TABLE]
if it exists, is an eigenvector of corresponding to the eigenvalue .
Key words and phrases:
Orthogonal matrices, rotations in , eigenvectors
2010 Mathematics Subject Classification:
Primary 15A18 ; Secondary 15-01, 97Axx
1. Introduction
Let be a real matrix and suppose that we want to find an eigenvector for Every student learns an algorithm for this, but is it possible to skip the toil, and write down explicitly in terms of ? For example, we can easily do this for a matrix of rank If is a nonzero column, then we can simply take Indeed, we know that for some vector and
[TABLE]
where we have used that the matrix can be identified with the inner product . Another interesting example is when we consider skew-symmetric matrices:
Theorem 1.1**.**
For any skew-symmetrical matrix
[TABLE]
the vector
[TABLE]
belongs to its kernel, thus
This can be checked directly, but in fact we can generalise this to any matrix of rank
Theorem 1.2**.**
Let where is a minor obtained by deleting the row and column from the matrix If A has rank , then all three vectors belong to its kernel and at least one of them is non-zero eigenvector.
Proof.
It is well-known that (see for example [4, Theorem 3.15,p.69])
[TABLE]
where is if and [math] otherwise. In the case of rank , we get that , and at least one of the vectors is non-zero. ∎
What can be said about non-singular matrices? If we know an eigenvalue we can simply apply the same arguments to the matrix to find the eigenvector (the case will be special, but here we can take any non-zero vector). We always know an eigenvalue for an orthogonal matrices. For example it is well-known that describes a rotation in about some axis described by a vector (see e.g. [1, Thm. 5.5, p.124]), and this is an eigenvector of corresponding to the eigenvalue . So we want to express axis of rotation in terms of the matrix entries of . But unexpectedly, we can get the vector quite easily.
Theorem 1.3**.**
Let . Let
[TABLE]
Then , so any of these vectors if it exists and is non-zero, is an eigenvector with eigenvalue . If then at least one of them exists and is non-zero.
The most unexpected one is the vector so we concentrate on it.
Theorem 1.4**.**
Let . If the vector
[TABLE]
exists that is, the denominators are non-zeros, then
In fact, this result appears as an exercise in M. Artin’s classic textbook Algebra [1, Ex.14, §5, Chap.4, p.149]. Our plan is to give several different proofs of Theorem 1.4 obtaining simultaneously the proof of Theorem 1.3.
Acknowledgement: The authors thank their colleagues Mikael Sundqvist and Jörg Schmeling for useful discussions.
2. Two algebraic proofs
We start from some useful statements.
Theorem 2.1**.**
For arbitrary and any , one has , where and is a minor obtained by deleting the row and column from the matrix
Proof.
It is well-known that for any invertible matrix, In our case and , which proves the claim. ∎
Lemma 2.2**.**
Let . Let be three different indices between and . Then
[TABLE]
Proof.
By symmetry, it is sufficient to consider the case only. Using the previous theorem we have:
[TABLE]
Consequently,
The second equality follows from the orthogonality:
[TABLE]
and we are done.
For the last equality we write:
[TABLE]
where we used the orthogonality conditions. ∎
Now we are ready for the first proof of Theorem 1.4.
Proof.
We have
[TABLE]
We want to prove that
[TABLE]
(the proofs for other coordinates are similar). Suppose first that Then this is equivalent to
[TABLE]
By Lemma 2.2 this transforms to
[TABLE]
and we can apply Lemma 2.2 again.
It remains to consider the case But then
[TABLE]
Similarly we get But this contradicts ∎
So straightforward calculations was not so obvious as expected. We can slightly improve them in our second proof.
Proof.
If we apply Theorem 1.2 to the matrix which has rank we get the eigenvector directly. Suppose that this is for example
[TABLE]
obtaining the vector from Theorem 1.3, so we get part of this theorem as well. Vectors lead us naturally to To finish the proof of Theorem 1.4, we divide the obtained vector by (which is non-zero), and it remains to show that
[TABLE]
By Lemma 2.2 we have
[TABLE]
which finishes the proof. ∎
3. Origin of the non-trivial eigenvector
Now we want to understand the origin of this non-trivial eigenvector. We find one possible source in skew-symmetric matrices.
Theorem 3.1**.**
Let be an orthogonal matrix of any size. If , then Moreover, if has only one real eigenvalue , then .
Proof.
We have
[TABLE]
which proves the first statement.
Let be a (complex) basis of eigenvectors (which exists because is a normal matrix). If , then
[TABLE]
which means that all corresponding to complex eigenvalues should be equal to zero and is proportional to the only eigenvector with real eigenvalue. ∎
Now we are ready for the third proof of Theorem 1.4.
Proof.
Suppose first that , that is, Then has some complex eigenvalue It follows that is another eigenvalue, and the third one is (because and ). Since
[TABLE]
by Theorem 1.1, and is a non-zero vector, we can apply Theorem 3.1 to get . We need only to show that for some non-zero We put , and note that , as well, for example
[TABLE]
Then
[TABLE]
It remains to consider the case that is, , and we need to prove that for
[TABLE]
we have This can be done explicitly, for example for the first coordinate we have
[TABLE]
So we need only to prove
[TABLE]
which follows from Theorem 2.1. Note also that we completed the proof of Theorem 1.3 regarding the vector ∎
4. A geometric interpretation of the eigenvector
Now we want to find some geometrical interpretation of our eigenvector and consider fourth proof of Theorem 1.4.
Proof.
The starting point is that any matrix can be written as a product of two reflections. (This is easy to see in the plane, and as every rotation in has an axis of rotation, the result for rotations in follows from the planar case.) So let be two unit vectors such that The case when and are proportional is not interesting for us (in this case ). So we suppose that they are linear independent and let be their (nonzero) vector product. First we note that is the eigenvector we are looking for. Indeed, and similarly giving As we know that
[TABLE]
we need only to prove that our vector is proportional to this one, that is,
[TABLE]
By symmetry, it is sufficient to consider the case only. We have
[TABLE]
Let Then , and for ,
[TABLE]
Our aim is
[TABLE]
Now we use the fact that we have unit vectors.
[TABLE]
and we are done because ∎
5. A proof using the Lie algebra of the rotation group
Define the Lie algebra
[TABLE]
of the Lie group . We recall the following well-known result; see for example [6, Lemma 1B,p.31].
Proposition 5.1**.**
Let . Then there exists a and a matrix such that . Moreover, defining by
[TABLE]
* is a rotation about through the angle using the right-hand rule.*
We will also need the fact that for ,
[TABLE]
where denotes the (entrywise) inverse one-sided Laplace transform. The following fact is well-known (see for example, [2, §27,p.218]):
Proposition 5.2**.**
For large enough , .
In the above, the integral of a matrix whose elements are functions of is defined entrywise. If is not an eigenvalue of , then is invertible, and by Cramer’s rule,
[TABLE]
So we see that each entry of is a polynomial in whose degree is at most , where denotes the size of , that is, is an matrix. Consequently, each entry of is a rational function in , whose inverse Laplace transform gives the matrix exponential . We now give the fifth proof of Theorem 1.4.
Proof.
Let be as in Proposition 5.1. By Cramer’s rule,
[TABLE]
Hence
[TABLE]
This yields
[TABLE]
which is a multiple of . ∎
6. A quaternionic proof
Let be the ring of all quaternions, with and , , . We define the norm of by
[TABLE]
and the conjugate of by
[TABLE]
It can be checked that for , and . We identify as a subset of via
[TABLE]
If then for any , , for example
[TABLE]
So the map maps vectors in to vectors in and clearly is linear. In fact, this collection of maps , is precisely the set of rotations in !
To see this note first that if , then its Euclidean norm coincides with its quaternionic norm. Therefore is also a rigid motion, since
[TABLE]
so our map corresponds to an orthogonal matrix. But because
[TABLE]
we have an invariant vector as well (when we can take any vector), so our matrix belongs to and is a rotation. We can describe it explicitly.
Since , we can find a unique such that to get
[TABLE]
We leave to the reader to prove that the angle of rotation around is exactly . It is clear that every rotation then arises in this manner.
Now we are ready to give the sixth proof of Theorem 1.4.
Proof.
We need to consider the case only. By feeding in into , we can now compute the matrix of in terms of the entries of , where . We already know the first column and the rest we get by cyclic symmetry:
[TABLE]
Now it is easy to check that
[TABLE]
which is a multiple of . ∎
7. A proof using the Cayley transform
We only consider the case when is not eigenvalue of , since the case when is an eigenvalue of (implying that ) has been covered before in our third proof.
Theorem 7.1**.**
If such that is not an eigenvalue of , then there exists a skew-symmetric such that .
Proof.
As is not an eigenvalue of , is invertible. Define
[TABLE]
Then
[TABLE]
where we use the commutativity to get the last equality. So is skew-symmetric. But then is invertible. From the definition of , it follows that , and solving for , we obtain . ∎
Now we are ready to give the seventh proof of Theorem 1.4.
Proof.
Given , we can write as for some skew-symmetric
[TABLE]
Then
[TABLE]
and
[TABLE]
which is an eigenvector of corresponding to eigenvalue , by Theorem 1.1. ∎
8. A proof using contour integral of the resolvent
We recall the following; see for example [3, §8.2, p.127]:
Proposition 8.1**.**
For an isolated eigenvalue of a square matrix , enclosed inside a simple closed curve running in the anti-clockwise direction, the projection onto the eigenspace is given by
[TABLE]
We are now ready to give the eighth proof of Theorem 1.4.
Proof.
Let . Again we restrict ourselves to the case that . Then we have that is an isolated simple eigenvalue. Let the other two eigenvalues be denoted by , and let be the minor obtained by deleting the row and column from the matrix . If encloses , but not the other two eigenvalues , then we have
[TABLE]
where we have used the Cauchy Integral Formula [5, Cor.3.5, p.94] to obtain the last equality. In particular
[TABLE]
for some constant . ∎
Note that we recover the vector from Theorem 1.3. can be found similarly.
9. What about zeros?
Now it is time to think about the conditions What if some of them failed e.g. The eigenvector still exists, but how does it look now? Note first that
[TABLE]
Similarly So our matrix looks now as
[TABLE]
where Suppose first that . The orthogonality conditions for the first two rows gives:
[TABLE]
For the first two columns we get instead
[TABLE]
thus
Now for we simply put We have
[TABLE]
where the last equality follows from Theorem 2.1.
If we take instead with similar argument:
[TABLE]
where again the last equality follows from Theorem 2.1.
Thus the rule is easy: for exactly one pair of indices we have If is the remaining index put
In fact we can describe matrices above almost explicitly. To make calculations more homogeneous we put as well. Consider the remaining orthogonal conditions for different rows:
[TABLE]
[TABLE]
Pairwise multiplications of the obtained equations and cancelling gives:
[TABLE]
Now the last orthogonality condition is
[TABLE]
or (other rows and columns gives the same). Now we can choose as parameters (with natural restrictions, e.g. ) and reconstruct the rest choosing signs. As example we get
[TABLE]
It remains to consider the case If for example then by the orthogonality of two first rows as well and similarly for other cases we get that at least two of are zero. Then the corresponding column containing them is an eigenvector directly.
10. Possible generalisations
So far we concentrated on real matrices, especially on the case . But we now ask: what can be generalised? Theorem 1.4 is obviously valid for any orthogonal matrix (that is why we have in the abstract), and moreover, it is valid for any matrix with Theorem 1.3 is valid as well if we replace the constant in the vectors by
For larger sizes, we still have the analogues of Theorem 1.2 and Theorem 2.1 and can imitate the second proof to obtain the analogues of the vectors But already for the size (where the vector with exists), the expressions involve determinants of size , and its is hardly attractive to write them here. The vector obtained in the third proof is also in principle available, but we have no easy analogue of Theorem 1.1, while an analogue of Theorem 1.2 produces the determinants of high order. And the idea to generalise Theorem 1.4 to higher dimensions looks hopeless.
What if we change the field? Because the conditions and are purely algebraic, all purely algebraic proofs survive, and we have the same Theorem 1.3 but we need some modifications.
First of all, we should understand why is still an eigenvalue. This is easy. If are our eigenvalues, then is the same set of numbers, but they may be in a different order. If for example, then and the condition gives The only remaining case is , and then , and similarly for and , but because their product is at least one of them is equal to as well. So the second proof survives completely, and the third need only an adjustment in the place where we used Theorem 3.1.
The first proof has another weak point: for arbitrary field does not imply which we have used in the special case The case when can really happen. Here is a a nice example in :
[TABLE]
But still have a correct eigenvector. The proof therefore should be modified (e.g. consider in our field such that write and and continue in the same style as we have done in the previous section to describe all possible exceptional matrices), but we prefer to skip this and restrict ourselves by only one algebraic proof).
So the conditions and are sufficient to our main theorems. The interesting question is therefore: what is the class of the matrices that satisfy those conditions? It is obviously a group. We study matrices of size first.
[TABLE]
therefore , , and For the complex numbers, we put for some complex number and get all the solutions. So matrices such as
[TABLE]
and their products belongs to our group, so it is large enough. For finite fields we can have difficulties to find ”cosines” (for example, in , we have or ), but already in we have which produces some matrices. But we prefer to skip this intriguing topic for now.
Any time one gets a result about the orthogonal matrices, it is natural to wonder about their complex relatives - unitary matrices. What can be said about them? Most parts of the proofs fail, which is not surprising, because now , and skew-Hermitian matrix can be invertible, and can have non-zero elements on the main diagonal. So we have no direct analogue of Theorem 1.4. We can get some results if we know the eigenvalue, but is nothing else than the direct application of Theorem 1.2 (as in the second proof).
Theorem 10.1**.**
Let be an unitary matrix with simple eigenvalue equal to Then for all the vectors
[TABLE]
we have , and at least one of them is non-zero, and therefore is the eigenvector.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Artin. Algebra. Prentice-Hall, 1991.
- 2[2] R. Bellman. Introduction to Matrix Analysis. Classics in Applied Mathematics, 19. Society for Industrial and Applied Mathematics (SIAM), 1997.
- 3[3] S. Godunov. Modern Aspects of Linear Algebra. Translations of Mathematical Monographs, 175. American Mathematical Society, 1998.
- 4[4] A. Holst and V. Ufnarovski. Matrix Theory . Studentlitteratur, 2014.
- 5[5] S. Maad-Sasane and A. Sasane. A Friendly Approach to Complex Analysis . World Scientific, 2014.
- 6[6] W. Rossmann. Lie Groups. An Introduction through Linear Groups. Oxford Graduate Texts in Mathematics, 5. Oxford University Press, 2002.
