Eigenvector of a matrix in $SO_3(\mathbb{R})$

Amol Sasane; Victor Ufnarovski

arXiv:1905.07404·math.GM·May 21, 2019

Eigenvector of a matrix in $SO_3(\mathbb{R})$

Amol Sasane, Victor Ufnarovski

PDF

Open Access

TL;DR

This paper provides multiple proofs that a specific vector, constructed from the entries of a matrix in $O_3( eal)$, is an eigenvector associated with eigenvalue 1, assuming it exists.

Contribution

It introduces several different proofs for the eigenvector property of a particular vector in $SO_3( eal)$ matrices, enhancing understanding of their structure.

Findings

01

The vector $V$ is an eigenvector of $A$ with eigenvalue 1.

02

Multiple proofs confirm the eigenvector property.

03

Conditions for the existence of $V$ are discussed.

Abstract

Let $A = [a_{ij}] \in O_{3} (R)$ . We give several different proofs of the fact that the vector $V := [\frac{1}{a _{23} + a _{32}} \frac{1}{a _{13} + a _{31}} \frac{1}{a _{12} + a _{21}}]^{T},$ if it exists, is an eigenvector of $A$ corresponding to the eigenvalue $1$ .

Equations155

V:=\left[\begin{array}[]{ccc}\displaystyle\frac{1}{a_{23}+a_{32}}&\displaystyle\frac{1}{a_{13}+a_{31}}&\displaystyle\frac{1}{a_{12}+a_{21}}\end{array}\right]^{T},

V:=\left[\begin{array}[]{ccc}\displaystyle\frac{1}{a_{23}+a_{32}}&\displaystyle\frac{1}{a_{13}+a_{31}}&\displaystyle\frac{1}{a_{12}+a_{21}}\end{array}\right]^{T},

A X = X Y^{T} X = X ⟨ Y, X ⟩ = ⟨ Y, X ⟩ X,

A X = X Y^{T} X = X ⟨ Y, X ⟩ = ⟨ Y, X ⟩ X,

Q=\left[\begin{array}[]{rrr}0&-r&q\\ r&0&-p\\ -q&p&0\end{array}\right]

Q=\left[\begin{array}[]{rrr}0&-r&q\\ r&0&-p\\ -q&p&0\end{array}\right]

V=\left[\begin{array}[]{r}p\\ q\\ r\end{array}\right]

V=\left[\begin{array}[]{r}p\\ q\\ r\end{array}\right]

k = 1 \sum 3 a_{ik} A_{j k} = δ_{ij} det A,

k = 1 \sum 3 a_{ik} A_{j k} = δ_{ij} det A,

V

V

U

W_{1}

W_{2}

W_{3}

V=\left[\begin{array}[]{ccc}\displaystyle\frac{1}{a_{23}+a_{32}}&\displaystyle\frac{1}{a_{13}+a_{31}}&\displaystyle\frac{1}{a_{12}+a_{21}}\end{array}\right]^{T}

V=\left[\begin{array}[]{ccc}\displaystyle\frac{1}{a_{23}+a_{32}}&\displaystyle\frac{1}{a_{13}+a_{31}}&\displaystyle\frac{1}{a_{12}+a_{21}}\end{array}\right]^{T}

(1 + a_{ii}) (a_{j k} + a_{k j})

(1 + a_{ii}) (a_{j k} + a_{k j})

(a_{j j} + a_{k k}) (a_{j k} + a_{k j})

(a_{ij}^{2} + a_{ik}^{2}) (a_{ij} a_{ik} + a_{j i} a_{k i})

a_{23} + a_{32}

a_{23} + a_{32}

(a_{22} + a_{33}) (a_{23} + a_{32}) = (a_{22} a_{23} + a_{33} a_{33}) + (a_{22} a_{32} + a_{23} a_{33}) = - a_{21} a_{31} - a_{12} a_{13}

(a_{22} + a_{33}) (a_{23} + a_{32}) = (a_{22} a_{23} + a_{33} a_{33}) + (a_{22} a_{32} + a_{23} a_{33}) = - a_{21} a_{31} - a_{12} a_{13}

(a_{12}^{2} + a_{13}^{2}) (a_{13} a_{12} + a_{31} a_{21}) = (a_{12} a_{31} + a_{21} a_{13}) (a_{12} a_{21} + a_{13} a_{31})

(a_{12}^{2} + a_{13}^{2}) (a_{13} a_{12} + a_{31} a_{21}) = (a_{12} a_{31} + a_{21} a_{13}) (a_{12} a_{21} + a_{13} a_{31})

AV=\left[\begin{array}[]{ccc}\displaystyle\frac{a_{11}}{a_{23}+a_{32}}+\frac{a_{12}}{a_{13}+a_{31}}+\frac{a_{13}}{a_{12}+a_{21}}\\[8.5359pt] \displaystyle\frac{a_{21}}{a_{23}+a_{32}}+\frac{a_{22}}{a_{13}+a_{31}}+\frac{a_{23}}{a_{12}+a_{21}}\\[8.5359pt] \displaystyle\frac{a_{31}}{a_{23}+a_{32}}+\frac{a_{32}}{a_{13}+a_{31}}+\frac{a_{33}}{a_{12}+a_{21}}\\[8.5359pt] \end{array}\right].

AV=\left[\begin{array}[]{ccc}\displaystyle\frac{a_{11}}{a_{23}+a_{32}}+\frac{a_{12}}{a_{13}+a_{31}}+\frac{a_{13}}{a_{12}+a_{21}}\\[8.5359pt] \displaystyle\frac{a_{21}}{a_{23}+a_{32}}+\frac{a_{22}}{a_{13}+a_{31}}+\frac{a_{23}}{a_{12}+a_{21}}\\[8.5359pt] \displaystyle\frac{a_{31}}{a_{23}+a_{32}}+\frac{a_{32}}{a_{13}+a_{31}}+\frac{a_{33}}{a_{12}+a_{21}}\\[8.5359pt] \end{array}\right].

\frac{a _{11}}{a _{23} + a _{32}} + \frac{a _{12}}{a _{13} + a _{31}} + \frac{a _{13}}{a _{12} + a _{21}} = \frac{1}{a _{23} + a _{32}}

\frac{a _{11}}{a _{23} + a _{32}} + \frac{a _{12}}{a _{13} + a _{31}} + \frac{a _{13}}{a _{12} + a _{21}} = \frac{1}{a _{23} + a _{32}}

\frac{( 1 - a _{11} ) ( 1 + a _{11} )}{( 1 + a _{11} ) ( a _{23} + a _{32} )} = \frac{a _{12}}{a _{13} + a _{31}} + \frac{a _{13}}{a _{12} + a _{21}} .

\frac{( 1 - a _{11} ) ( 1 + a _{11} )}{( 1 + a _{11} ) ( a _{23} + a _{32} )} = \frac{a _{12}}{a _{13} + a _{31}} + \frac{a _{13}}{a _{12} + a _{21}} .

\frac{1 - a _{11}^{2}}{a _{12} a _{31} + a _{21} a _{13}} = \frac{a _{12}^{2} + a _{13}^{2} + a _{12} a _{21} + a _{13} a _{31}}{( a _{13} + a _{31} ) ( a _{12} + a _{21} )}

\frac{1 - a _{11}^{2}}{a _{12} a _{31} + a _{21} a _{13}} = \frac{a _{12}^{2} + a _{13}^{2} + a _{12} a _{21} + a _{13} a _{31}}{( a _{13} + a _{31} ) ( a _{12} + a _{21} )}

a_{12}^{2} + a_{13}^{2} = 1 - a_{11}^{2} = 0 \Rightarrow a_{12} = a_{13} = 0.

a_{12}^{2} + a_{13}^{2} = 1 - a_{11}^{2} = 0 \Rightarrow a_{12} = a_{13} = 0.

V_{1}

V_{1}

\frac{1 + a _{11} - a _{22} - a _{33}}{( a _{12} + a _{21} ) ( a _{13} + a _{31} )} = \frac{1}{a _{23} + a _{32}} .

\frac{1 + a _{11} - a _{22} - a _{33}}{( a _{12} + a _{21} ) ( a _{13} + a _{31} )} = \frac{1}{a _{23} + a _{32}} .

(1 + a_{11} - a_{22} - a_{33}) (a_{23} + a_{32})

(1 + a_{11} - a_{22} - a_{33}) (a_{23} + a_{32})

w u mba w a y w a y = (1 + a_{11}) (a_{23} + a_{32}) - (a_{22} + a_{33}) (a_{23} + a_{32})

w u mba w a y w a y = a_{12} a_{31} + a_{21} a_{13} + a_{21} a_{31} + a_{12} a_{13}

w u mba w a y w a y = (a_{12} + a_{21}) (a_{13} + a_{31}),

(A - A^{T}) U = 0 \Leftrightarrow A U = A^{T} U \Leftrightarrow A^{2} U = U,

(A - A^{T}) U = 0 \Leftrightarrow A U = A^{T} U \Leftrightarrow A^{2} U = U,

A^{2} U - U = \sum x_{i} (λ_{i}^{2} - 1) e_{i} = 0,

A^{2} U - U = \sum x_{i} (λ_{i}^{2} - 1) e_{i} = 0,

U=\left[\begin{array}[]{c}a_{23}-a_{32}\\ a_{31}-a_{13}\\ a_{12}-a_{21}\end{array}\right]\in\ker(A-A^{T}),

U=\left[\begin{array}[]{c}a_{23}-a_{32}\\ a_{31}-a_{13}\\ a_{12}-a_{21}\end{array}\right]\in\ker(A-A^{T}),

a_{23}^{2} - a_{32}^{2} = a_{31}^{2} - a_{13}^{2} \Leftrightarrow a_{13}^{2} + a_{23}^{2} = a_{31}^{2} + a_{32}^{2} \Leftrightarrow 1 - a_{33}^{2} = 1 - a_{33}^{2} .

a_{23}^{2} - a_{32}^{2} = a_{31}^{2} - a_{13}^{2} \Leftrightarrow a_{13}^{2} + a_{23}^{2} = a_{31}^{2} + a_{32}^{2} \Leftrightarrow 1 - a_{33}^{2} = 1 - a_{33}^{2} .

c V

c V

V^{\prime}=\left[\begin{array}[]{ccc}\displaystyle\frac{1}{a_{23}}&\displaystyle\frac{1}{a_{13}}&\displaystyle\frac{1}{a_{12}}\end{array}\right]^{T},

V^{\prime}=\left[\begin{array}[]{ccc}\displaystyle\frac{1}{a_{23}}&\displaystyle\frac{1}{a_{13}}&\displaystyle\frac{1}{a_{12}}\end{array}\right]^{T},

\frac{a _{11}}{a _{23}} + \frac{a _{12}}{a _{13}} + \frac{a _{13}}{a _{12}} = \frac{1}{a _{23}} \Leftrightarrow \frac{a _{12}^{2} + a _{13}^{2}}{a _{12} a _{13}} = \frac{1 - a _{11}}{a _{23}} \Leftrightarrow \frac{1 - a _{11}^{2}}{a _{12} a _{13}} = \frac{1 - a _{11}}{a _{23}}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMatrix Theory and Algorithms · Advanced Topics in Algebra · Graph theory and applications

Full text

Eigenvector of a matrix in $SO_{3}(\mathbb{R})$

Amol Sasane

Department of Mathematics

London School of Economics

Houghton Street

London WC2A 2AE

United Kingdom

[email protected]

and

Victor Ufnarovski

Department of Mathematics

Lund University

Sölvegatan 18, 223 62 Lund

Sweden

[email protected]

Abstract.

Let $A=[a_{ij}]\in O_{3}(\mathbb{R})$ . We give several different proofs of the fact that the vector

[TABLE]

if it exists, is an eigenvector of $A$ corresponding to the eigenvalue $1$ .

Key words and phrases:

Orthogonal matrices, rotations in $\mathbb{R}^{3}$ , eigenvectors

2010 Mathematics Subject Classification:

Primary 15A18 ; Secondary 15-01, 97Axx

1. Introduction

Let $A$ be a $3\times 3$ real matrix and suppose that we want to find an eigenvector $V$ for $A.$ Every student learns an algorithm for this, but is it possible to skip the toil, and write down $V$ explicitly in terms of $a_{ij}$ ? For example, we can easily do this for a matrix of rank $1.$ If $X$ is a nonzero column, then we can simply take $V=X.$ Indeed, we know that $A=XY^{T}$ for some vector $Y$ and

[TABLE]

where we have used that the $1\times 1$ matrix $Y^{T}X$ can be identified with the inner product $\langle Y,X\rangle$ . Another interesting example is when we consider skew-symmetric matrices:

Theorem 1.1.

For any $3\times 3$ skew-symmetrical matrix

[TABLE]

the vector

[TABLE]

belongs to its kernel, thus $QV=0.$

This can be checked directly, but in fact we can generalise this to any matrix of rank $2.$

Theorem 1.2.

Let $A_{ij}=(-1)^{i+j}D_{ij}$ where $D_{ij}$ is a minor obtained by deleting the row $i$ and column $j$ from the matrix $A.$ If A has rank $2$ , then all three vectors $V_{j}=[A_{j1}\;\;A_{j2}\;\;A_{j3}]^{T}$ belong to its kernel and at least one of them is non-zero eigenvector.

Proof.

It is well-known that (see for example [4, Theorem 3.15,p.69])

[TABLE]

where $\delta_{ij}$ is $1$ if $i=j$ and [math] otherwise. In the case of rank $2$ , we get that $\det A=0\Rightarrow AV_{j}=0$ , and at least one of the vectors $V_{j}$ is non-zero. ∎

What can be said about non-singular matrices? If we know an eigenvalue $\lambda$ we can simply apply the same arguments to the matrix $A-\lambda I$ to find the eigenvector (the case $A=\lambda I$ will be special, but here we can take any non-zero vector). We always know an eigenvalue $\pm 1$ for an orthogonal matrices. For example it is well-known that $A\in SO_{3}(\mathbb{R})$ describes a rotation in $\mathbb{R}^{3}$ about some axis described by a vector $V$ (see e.g. [1, Thm. 5.5, p.124]), and this $V$ is an eigenvector of $A$ corresponding to the eigenvalue $1$ . So we want to express axis of rotation in terms of the matrix entries of $A$ . But unexpectedly, we can get the vector $V$ quite easily.

Theorem 1.3.

Let $A=[a_{ij}]\in SO_{3}(\mathbb{R})$ . Let

[TABLE]

Then $AV=V,AU=U,AW_{i}=W_{i}$ , so any of these vectors $($ if it exists and is non-zero $)$ , is an eigenvector with eigenvalue $1$ . If $A\neq I$ then at least one of them exists and is non-zero.

The most unexpected one is the vector $V$ so we concentrate on it.

Theorem 1.4.

Let $A=[a_{ij}]\in SO_{3}(\mathbb{R})$ . If the vector

[TABLE]

exists $($ that is, the denominators are non-zeros $)$ , then $AV=V.$

In fact, this result appears as an exercise in M. Artin’s classic textbook Algebra [1, Ex.14, §5, Chap.4, p.149]. Our plan is to give several different proofs of Theorem 1.4 obtaining simultaneously the proof of Theorem 1.3.

Acknowledgement: The authors thank their colleagues Mikael Sundqvist and Jörg Schmeling for useful discussions.

2. Two algebraic proofs

We start from some useful statements.

Theorem 2.1.

For arbitrary $n$ and any $A\in SO_{n}(\mathbb{R})$ , one has $A_{ij}=a_{ij}$ , where $A_{ij}=(-1)^{i+j}D_{ij}$ and $D_{ij}$ is a minor obtained by deleting the row $i$ and column $j$ from the matrix $A.$

Proof.

It is well-known that for any invertible matrix, $A^{-1}=\frac{1}{\det A}[A_{ij}]^{T}.$ In our case $\det A=1$ and $A^{-1}=A^{T}$ , which proves the claim. ∎

Lemma 2.2.

Let $A=[a_{ij}]\in SO_{3}(\mathbb{R})$ . Let $i,j,k$ be three different indices between $1$ and $3$ . Then

[TABLE]

Proof.

By symmetry, it is sufficient to consider the case $i=1,j=2,k=3$ only. Using the previous theorem we have:

[TABLE]

Consequently, $(1+a_{11})(a_{23}+a_{32})=a_{12}a_{31}+a_{21}a_{13}.$

The second equality follows from the orthogonality:

[TABLE]

and we are done.

For the last equality we write:

[TABLE]

where we used the orthogonality conditions. ∎

Now we are ready for the first proof of Theorem 1.4.

Proof.

We have

[TABLE]

We want to prove that

[TABLE]

(the proofs for other coordinates are similar). Suppose first that $a_{11}+1\neq 0.$ Then this is equivalent to

[TABLE]

By Lemma 2.2 this transforms to

[TABLE]

and we can apply Lemma 2.2 again.

It remains to consider the case $a_{11}=-1.$ But then

[TABLE]

Similarly we get $a_{21}=a_{31}=0.$ But this contradicts $a_{12}+a_{21}\neq 0.$ ∎

So straightforward calculations was not so obvious as expected. We can slightly improve them in our second proof.

Proof.

If we apply Theorem 1.2 to the matrix $A-I$ which has rank $2$ we get the eigenvector directly. Suppose that this is for example

[TABLE]

obtaining the vector $W_{1}$ from Theorem 1.3, so we get part of this theorem as well. Vectors $V_{2},V_{3}$ lead us naturally to $W_{2},W_{3}.$ To finish the proof of Theorem 1.4, we divide the obtained vector by $(a_{12}+a_{21})(a_{13}+a_{31})$ (which is non-zero), and it remains to show that

[TABLE]

By Lemma 2.2 we have

[TABLE]

which finishes the proof. ∎

3. Origin of the non-trivial eigenvector

Now we want to understand the origin of this non-trivial eigenvector. We find one possible source in skew-symmetric matrices.

Theorem 3.1.

Let $A$ be an orthogonal matrix $($ of any size $)$ . If $U\in\ker(A-A^{T})$ , then $A^{2}U=U.$ Moreover, if $A$ has only one real eigenvalue $\lambda$ , then $AU=\lambda U$ .

Proof.

We have

[TABLE]

which proves the first statement.

Let $\{e_{i}\}$ be a (complex) basis of eigenvectors (which exists because $A$ is a normal matrix). If $U=\sum x_{i}e_{i}$ , then

[TABLE]

which means that all $x_{i}$ corresponding to complex eigenvalues $\lambda_{i}$ should be equal to zero and $U$ is proportional to the only eigenvector with real eigenvalue. ∎

Now we are ready for the third proof of Theorem 1.4.

Proof.

Suppose first that $A\neq A^{T}$ , that is, $A^{2}\neq I.$ Then $A$ has some complex eigenvalue $\lambda.$ It follows that $\overline{\lambda}$ is another eigenvalue, and the third one is $1$ (because $|\lambda|=1$ and $\det A=1$ ). Since

[TABLE]

by Theorem 1.1, and is a non-zero vector, we can apply Theorem 3.1 to get $AU=U$ . We need only to show that $cV=U$ for some non-zero $c.$ We put $c=a_{23}^{2}-a_{32}^{2}$ , and note that $c=a_{31}^{2}-a_{13}^{2}$ , $c=a_{12}^{2}-a_{21}^{2}$ as well, for example

[TABLE]

Then

[TABLE]

It remains to consider the case $A=A^{T},$ that is, $a_{ij}=a_{ji}$ , and we need to prove that for

[TABLE]

we have $AV^{\prime}=V^{\prime}.$ This can be done explicitly, for example for the first coordinate we have

[TABLE]

So we need only to prove

[TABLE]

which follows from Theorem 2.1. Note also that we completed the proof of Theorem 1.3 regarding the vector $U.$ ∎

4. A geometric interpretation of the eigenvector

Now we want to find some geometrical interpretation of our eigenvector and consider fourth proof of Theorem 1.4.

Proof.

The starting point is that any matrix $A\in SO_{3}(\mathbb{R})$ can be written as a product of two reflections. (This is easy to see in the plane, and as every rotation in $\mathbb{R}^{3}$ has an axis of rotation, the result for rotations in $\mathbb{R}^{3}$ follows from the planar case.) So let $X,Y$ be two unit vectors such that $A=(I-2XX^{T})(I-2YY^{T}).$ The case when $X$ and $Y$ are proportional is not interesting for us (in this case $A=I$ ). So we suppose that they are linear independent and let $Z=X\times Y$ be their (nonzero) vector product. First we note that $Z$ is the eigenvector we are looking for. Indeed, $X^{T}Z=\langle X,Z\rangle=0$ and similarly $Y^{T}Z=0,$ giving $AZ=(I+BX^{T}+CY^{T})Z=IZ=Z.$ As we know that

[TABLE]

we need only to prove that our vector $v$ is proportional to this one, that is,

[TABLE]

By symmetry, it is sufficient to consider the case $i=1,j=2$ only. We have

[TABLE]

Let $c=\langle X,Y\rangle.$ Then $A=I-2XX^{T}-2YY^{T}+4cXY^{T}$ , and for $i\neq j$ ,

[TABLE]

Our aim is

[TABLE]

Now we use the fact that we have unit vectors.

[TABLE]

and we are done because $c=x_{1}y_{1}+x_{2}y_{2}+x_{3}y_{3}.$ ∎

5. A proof using the Lie algebra of the rotation group

Define the Lie algebra

[TABLE]

of the Lie group $SO_{3}(\mathbb{R})$ . We recall the following well-known result; see for example [6, Lemma 1B,p.31].

Proposition 5.1.

Let $A\in SO_{3}(\mathbb{R})$ . Then there exists a $t\in[0,2\pi)$ and a matrix $Q\in\mathfrak{so}_{3}(\mathbb{R})$ such that $A=e^{tQ}$ . Moreover, defining $U=[p\;\;q\;\;r]^{T}\in\mathbb{R}^{3}$ by

[TABLE]

$A$ * is a rotation about $U$ through the angle $t$ using the right-hand rule.*

We will also need the fact that for $t\geq 0$ ,

[TABLE]

where $\mathcal{L}^{-1}$ denotes the (entrywise) inverse one-sided Laplace transform. The following fact is well-known (see for example, [2, §27,p.218]):

Proposition 5.2.

For large enough $s$ , $\displaystyle\int_{0}^{\infty}e^{-st}e^{tQ}dt=(sI-Q)^{-1}$ .

In the above, the integral of a matrix whose elements are functions of $t$ is defined entrywise. If $s$ is not an eigenvalue of $Q$ , then $sI-Q$ is invertible, and by Cramer’s rule,

[TABLE]

So we see that each entry of $\textrm{adj}(sI-Q)$ is a polynomial in $s$ whose degree is at most $n-1$ , where $n$ denotes the size of $Q$ , that is, $Q$ is an $n\times n$ matrix. Consequently, each entry $m_{ij}$ of $(sI-Q)^{-1}$ is a rational function in $s$ , whose inverse Laplace transform gives the matrix exponential $e^{tQ}$ . We now give the fifth proof of Theorem 1.4.

Proof.

Let $Q,U$ be as in Proposition 5.1. By Cramer’s rule,

[TABLE]

Hence

[TABLE]

This yields

[TABLE]

which is a multiple of $U$ . ∎

6. A quaternionic proof

Let $\mathbf{D}:=\{\mathbf{q}=a+b\mathbf{i}+c\mathbf{j}+d\mathbf{k}:a,b,c,d\in\mathbb{R}\}$ be the ring of all quaternions, with $\mathbf{i}^{2}=\mathbf{j}^{2}=\mathbf{k}^{2}=-1$ and $\mathbf{i}\cdot\mathbf{j}=-\mathbf{j}\cdot\mathbf{i}=\mathbf{k}$ , $\mathbf{j}\cdot\mathbf{k}=-\mathbf{k}\cdot\mathbf{j}=\mathbf{i}$ , $\mathbf{k}\cdot\mathbf{i}=-\mathbf{i}\cdot\mathbf{k}=\mathbf{j}$ . We define the norm of $\mathbf{q}=a+b\mathbf{i}+c\mathbf{j}+d\mathbf{k}$ by

[TABLE]

and the conjugate $\overline{\mathbf{q}}$ of $\mathbf{q}$ by

[TABLE]

It can be checked that for $\mathbf{q}_{1},\mathbf{q}_{2}\in\mathbf{D}$ , $|\mathbf{q}_{1}\mathbf{q}_{2}|=|\mathbf{q}_{1}||\mathbf{q}_{2}|$ and $|\mathbf{q}|^{2}=\mathbf{q}\overline{\mathbf{q}}$ . We identify $\mathbb{R}^{3}$ as a subset of $\mathbf{D}$ via

[TABLE]

If $|\mathbf{q}|=1$ then for any $\mathbf{w}\in\mathbb{R}^{3}$ , $\mathbf{q}\mathbf{w}\mathbf{q}^{-1}\in\mathbb{R}^{3}$ , for example

[TABLE]

So the map $T_{\mathbf{q}}:\mathbf{w}\mapsto\mathbf{q}\mathbf{w}\mathbf{q}^{-1}$ maps vectors in $\mathbb{R}^{3}$ to vectors in $\mathbb{R}^{3}$ and clearly is linear. In fact, this collection of maps $T_{\mathbf{q}}$ , $|\mathbf{q}|=1,$ is precisely the set $SO(3)$ of rotations in $\mathbb{R}^{3}$ !

To see this note first that if $\mathbf{w}\in\mathbb{R}^{3}$ , then its Euclidean norm $\|\mathbf{w}\|_{2}$ coincides with its quaternionic norm. Therefore $T_{\mathbf{q}}$ is also a rigid motion, since

[TABLE]

so our map corresponds to an orthogonal matrix. But because

[TABLE]

we have an invariant vector as well (when $\mathbf{q}=a$ we can take any vector), so our matrix belongs to $SO(3)$ and is a rotation. We can describe it explicitly.

Since $|a|\leq 1$ , we can find a unique $t\in[0,2\pi)$ such that $\displaystyle\cos\frac{t}{2}=a$ to get

[TABLE]

We leave to the reader to prove that the angle of rotation around $\mathbf{v}$ is exactly $t$ . It is clear that every rotation then arises in this manner.

Now we are ready to give the sixth proof of Theorem 1.4.

Proof.

We need to consider the case $\mathbf{v}\neq 0$ only. By feeding in $\mathbf{i},\mathbf{j},\mathbf{k}$ into $T_{\mathbf{q}}$ , we can now compute the matrix $A$ of $T_{\mathbf{q}}$ in terms of the entries of $[b\;\;c\;\;d]^{T}$ , where $\mathbf{v}=b\mathbf{i}+c\mathbf{j}+d\mathbf{k}$ . We already know the first column and the rest we get by cyclic symmetry:

[TABLE]

Now it is easy to check that

[TABLE]

which is a multiple of $\mathbf{v}$ . ∎

7. A proof using the Cayley transform

We only consider the case when $-1$ is not eigenvalue of $A$ , since the case when $-1$ is an eigenvalue of $A$ (implying that $A^{2}=I$ ) has been covered before in our third proof.

Theorem 7.1.

If $A\in SO_{3}(\mathbb{R})$ such that $-1$ is not an eigenvalue of $A$ , then there exists a skew-symmetric $Q$ such that $A=(I+Q)(I-Q)^{-1}$ .

Proof.

As $-1$ is not an eigenvalue of $A$ , $A+I$ is invertible. Define

[TABLE]

Then

[TABLE]

where we use the commutativity to get the last equality. So $Q$ is skew-symmetric. But then $I-Q$ is invertible. From the definition of $Q$ , it follows that $Q(A+I)=A-I$ , and solving for $A$ , we obtain $A=(I+Q)(I-Q)^{-1}$ . ∎

Now we are ready to give the seventh proof of Theorem 1.4.

Proof.

Given $A$ , we can write $A$ as $A=(I+Q)(I-Q)^{-1}$ for some skew-symmetric $Q$

[TABLE]

Then

[TABLE]

and

[TABLE]

which is an eigenvector of $A$ corresponding to eigenvalue $1$ , by Theorem 1.1. ∎

8. A proof using contour integral of the resolvent

We recall the following; see for example [3, §8.2, p.127]:

Proposition 8.1.

For an isolated eigenvalue of a square matrix $A$ , enclosed inside a simple closed curve $\gamma$ running in the anti-clockwise direction, the projection $P$ onto the eigenspace $\ker(\lambda I-A)$ is given by

[TABLE]

We are now ready to give the eighth proof of Theorem 1.4.

Proof.

Let $A\in SO_{3}(\mathbb{R})$ . Again we restrict ourselves to the case that $A\neq I$ . Then we have that $1$ is an isolated simple eigenvalue. Let the other two eigenvalues be denoted by $\lambda,\overline{\lambda}$ , and let $p_{ij}(z)$ be the minor obtained by deleting the row $i$ and column $j$ from the matrix $zI-A$ . If $\gamma$ encloses $1$ , but not the other two eigenvalues $\lambda,\overline{\lambda}$ , then we have

[TABLE]

where we have used the Cauchy Integral Formula [5, Cor.3.5, p.94] to obtain the last equality. In particular

[TABLE]

for some constant $c$ . ∎

Note that we recover the vector $W_{1}$ from Theorem 1.3. $W_{2},W_{3}$ can be found similarly.

9. What about zeros?

Now it is time to think about the conditions $a_{ij}+a_{ji}\neq 0.$ What if some of them failed e.g. $a_{12}+a_{21}=0?$ The eigenvector still exists, but how does it look now? Note first that

[TABLE]

Similarly $a_{23}=\pm a_{32}.$ So our matrix looks now as

[TABLE]

where $\varepsilon^{2}=\zeta^{2}=1.$ Suppose first that $pqr\neq 0$ . The orthogonality conditions for the first two rows gives:

[TABLE]

For the first two columns we get instead

[TABLE]

thus $\varepsilon\zeta=-1\Leftrightarrow\zeta=-\varepsilon.$

Now for $\varepsilon=-1$ we simply put $V=[0\;\;q\;\;-r]^{T}.$ We have

[TABLE]

where the last equality follows from Theorem 2.1.

If $\varepsilon=1$ we take instead $V=[p\;\;0\;\;r]^{T}$ with similar argument:

[TABLE]

where again the last equality follows from Theorem 2.1.

Thus the rule is easy: for exactly one pair of indices $i,j$ we have $a_{ij}=a_{ji}.$ If $k$ is the remaining index put $v_{k}=0,v_{i}=a_{k,j},v_{j}=-a_{k,i}.$

In fact we can describe matrices above almost explicitly. To make calculations more homogeneous we put $c=\varepsilon d$ as well. Consider the remaining orthogonal conditions for different rows:

[TABLE]

Pairwise multiplications of the obtained equations and cancelling gives:

[TABLE]

Now the last orthogonality condition is

[TABLE]

or $a-b+d=\pm 1$ (other rows and columns gives the same). Now we can choose $a,b$ as parameters (with natural restrictions, e.g. $|a|<1$ ) and reconstruct the rest choosing signs. As example we get

[TABLE]

It remains to consider the case $pqr=0.$ If for example $p=0$ then by the orthogonality of two first rows $qr=0$ as well and similarly for other cases we get that at least two of $p,q,r$ are zero. Then the corresponding column containing them is an eigenvector directly.

10. Possible generalisations

So far we concentrated on $3\times 3$ real matrices, especially on the case $A\in SO_{3}(\mathbb{R})$ . But we now ask: what can be generalised? Theorem 1.4 is obviously valid for any orthogonal matrix (that is why we have $A\in O_{3}(\mathbb{R})$ in the abstract), and moreover, it is valid for any matrix $A=cA^{\prime}$ with $A^{\prime}\in SO_{3}(\mathbb{R}).$ Theorem 1.3 is valid as well if we replace the constant $1$ in the vectors $W_{i}$ by $c\neq 0.$

For larger sizes, we still have the analogues of Theorem 1.2 and Theorem 2.1 and can imitate the second proof to obtain the analogues of the vectors $W_{i}.$ But already for the size $5$ (where the vector $V$ with $AV=V$ exists), the expressions involve determinants of size $3$ , and its is hardly attractive to write them here. The vector $U$ obtained in the third proof is also in principle available, but we have no easy analogue of Theorem 1.1, while an analogue of Theorem 1.2 produces the determinants of high order. And the idea to generalise Theorem 1.4 to higher dimensions looks hopeless.

What if we change the field? Because the conditions $A^{-1}=A^{T}$ and $\det A=1$ are purely algebraic, all purely algebraic proofs survive, and we have the same Theorem 1.3 but we need some modifications.

First of all, we should understand why $1$ is still an eigenvalue. This is easy. If $\alpha,\beta,\gamma$ are our eigenvalues, then $\frac{1}{\alpha},\frac{1}{\beta},\frac{1}{\gamma}$ is the same set of numbers, but they may be in a different order. If for example, $\frac{1}{\alpha}=\beta$ then $\alpha\beta=1$ and the condition $\det A=1$ gives $\gamma=1.$ The only remaining case is $\frac{1}{\alpha}=\alpha$ , and then $\alpha=\pm 1$ , and similarly for $\beta$ and $\gamma$ , but because their product is $1$ at least one of them is equal to $1$ as well. So the second proof survives completely, and the third need only an adjustment in the place where we used Theorem 3.1.

The first proof has another weak point: for arbitrary field $x^{2}+y^{2}=0$ does not imply $x=y=0$ which we have used in the special case $a_{11}=-1.$ The case when $a_{12}\neq 0$ can really happen. Here is a a nice example in $\mathbb{Z}_{5}$ :

[TABLE]

But $A$ still have a correct eigenvector. The proof therefore should be modified (e.g. consider $i$ in our field such that $i^{2}=-1,$ write $a_{13}=ia_{12}$ and $a_{31}=\pm ia_{21}$ and continue in the same style as we have done in the previous section to describe all possible exceptional matrices), but we prefer to skip this and restrict ourselves by only one algebraic proof).

So the conditions $A^{-1}=A^{T}$ and $\det A=1$ are sufficient to our main theorems. The interesting question is therefore: what is the class of the matrices that satisfy those conditions? It is obviously a group. We study matrices of size $2$ first.

[TABLE]

therefore $a=d$ , $c=-b$ , and $a^{2}+b^{2}=1.$ For the complex numbers, we put $a=\cos z,b=\sin z$ for some complex number $z$ and get all the solutions. So matrices such as

[TABLE]

and their products belongs to our group, so it is large enough. For finite fields we can have difficulties to find ”cosines” (for example, in $\mathbb{Z}_{5}$ , we have $a^{2}+b^{2}=1\Rightarrow a=0,b=1$ or $a=1,b=0$ ), but already in $\mathbb{Z}_{7}$ we have $2^{2}+2^{2}=1$ which produces some matrices. But we prefer to skip this intriguing topic for now.

Any time one gets a result about the orthogonal matrices, it is natural to wonder about their complex relatives - unitary matrices. What can be said about them? Most parts of the proofs fail, which is not surprising, because now $A_{ij}=\overline{a_{ij}}$ , and skew-Hermitian matrix can be invertible, and can have non-zero elements on the main diagonal. So we have no direct analogue of Theorem 1.4. We can get some results if we know the eigenvalue, but is nothing else than the direct application of Theorem 1.2 (as in the second proof).

Theorem 10.1.

Let $A\in SU(3)$ be an unitary matrix with $($ simple $)$ eigenvalue equal to $\lambda.$ Then for all the vectors

[TABLE]

we have $AW_{i}=\lambda W_{i}$ , and at least one of them is non-zero, and therefore is the eigenvector.

Bibliography6

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] M. Artin. Algebra. Prentice-Hall, 1991.
2[2] R. Bellman. Introduction to Matrix Analysis. Classics in Applied Mathematics, 19. Society for Industrial and Applied Mathematics (SIAM), 1997.
3[3] S. Godunov. Modern Aspects of Linear Algebra. Translations of Mathematical Monographs, 175. American Mathematical Society, 1998.
4[4] A. Holst and V. Ufnarovski. Matrix Theory . Studentlitteratur, 2014.
5[5] S. Maad-Sasane and A. Sasane. A Friendly Approach to Complex Analysis . World Scientific, 2014.
6[6] W. Rossmann. Lie Groups. An Introduction through Linear Groups. Oxford Graduate Texts in Mathematics, 5. Oxford University Press, 2002.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Eigenvector of a matrix in SO3(R)SO_{3}(\mathbb{R})SO3​(R)

Abstract.

Key words and phrases:

2010 Mathematics Subject Classification:

1. Introduction

Theorem 1.1**.**

Theorem 1.2**.**

Proof.

Theorem 1.3**.**

Theorem 1.4**.**

2. Two algebraic proofs

Theorem 2.1**.**

Proof.

Lemma 2.2**.**

Proof.

Proof.

Proof.

3. Origin of the non-trivial eigenvector

Theorem 3.1**.**

Proof.

Proof.

4. A geometric interpretation of the eigenvector

Proof.

5. A proof using the Lie algebra of the rotation group

Proposition 5.1**.**

Proposition 5.2**.**

Proof.

6. A quaternionic proof

Proof.

7. A proof using the Cayley transform

Theorem 7.1**.**

Proof.

Proof.

8. A proof using contour integral of the resolvent

Proposition 8.1**.**

Proof.

9. What about zeros?

10. Possible generalisations

Theorem 10.1**.**

Eigenvector of a matrix in $SO_{3}(\mathbb{R})$

Theorem 1.1.

Theorem 1.2.

Theorem 1.3.

Theorem 1.4.

Theorem 2.1.

Lemma 2.2.

Theorem 3.1.

Proposition 5.1.

Proposition 5.2.

Theorem 7.1.

Proposition 8.1.

Theorem 10.1.