On the Equivariance Properties of Self-adjoint Matrices
Michael Dellnitz, Bennet Gebken, Raphael Gerlach, Stefan Klus

TL;DR
This paper characterizes the equivariance properties of self-adjoint matrices, linking their symmetry groups to eigenvalue multiplicities, with implications for bifurcation theory, graph symmetries, and matrix approximation problems.
Contribution
It provides a complete characterization of the symmetry groups of self-adjoint matrices based on their eigenvalues and multiplicities, extending understanding of their equivariance properties.
Findings
Self-adjoint matrices are equivariant under a specific group isomorphic to a product of Z2 groups.
Multiple eigenvalues induce larger symmetry groups isomorphic to products of orthogonal groups.
Results have applications in bifurcation analysis, graph symmetry, and matrix approximation problems.
Abstract
We investigate self-adjoint matrices with respect to their equivariance properties. We show in particular that a matrix is self-adjoint if and only if it is equivariant with respect to the action of a group which is isomorphic to . If the self-adjoint matrix possesses multiple eigenvalues -- this may, for instance, be induced by symmetry properties of an underlying dynamical system -- then is even equivariant with respect to the action of a group where are the multiplicities of the eigenvalues of . We discuss implications of this result for equivariant bifurcation problems, and we briefly address further applications for the Procrustes problem, graph symmetries and Taylor expansions.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
On the Equivariance Properties
of Self-adjoint Matrices
Michael Dellnitz
Department of Mathematics, Paderborn University, D-33095 Paderborn, Germany
Bennet Gebken
Department of Mathematics, Paderborn University, D-33095 Paderborn, Germany
Raphael Gerlach
Department of Mathematics, Paderborn University, D-33095 Paderborn, Germany
Stefan Klus
Department of Mathematics and Computer Science, Freie Universität Berlin, D-14195 Berlin, Germany
Abstract
We investigate self-adjoint matrices with respect to their equivariance properties. We show in particular that a matrix is self-adjoint if and only if it is equivariant with respect to the action of a group which is isomorphic to . If the self-adjoint matrix possesses multiple eigenvalues – this may, for instance, be induced by symmetry properties of an underlying dynamical system – then is even equivariant with respect to the action of a group where are the multiplicities of the eigenvalues of . We discuss implications of this result for equivariant bifurcation problems, and we briefly address further applications for the Procrustes problem, graph symmetries and Taylor expansions.
Key words: self-adjoint matrix; equivariance; bifurcation theory; Procrustes problem; Taylor expansion
AMS subject classifications. 15B57, 15A24, 37G40, 41A58
1 Introduction
For more than 30 years the influence of symmetry properties of a dynamical system on its qualitative temporal behavior has been intensively studied. Such symmetry properties are typically induced by network structures or geometric properties of the underlying mathematical model. The related research focuses on a variety of topics, for instance, the classification of symmetry breaking bifurcations ([1]) or the explanation of the occurrence of stable heteroclinic cycles. For an overview of this area and their relevance in the sciences we refer to [2].
Formally, symmetry properties of a dynamical system manifest themselves by an equivariance property of the right-hand side. That is, satisfies
[TABLE]
where is a compact Lie group. It is well known that equivariance properties are inherited by the linearization of from the symmetry properties of the steady-state solutions . In fact, this is the reason why generically may possess multiple eigenvalues, which implies the occurrence of complex symmetry breaking bifurcations in dynamical systems. This happens, for instance, if () or (), where is the dihedral group of order , that is, the symmetry group of the -sided regular polygon.
The investigations in this article are motivated by the analysis of equivariant dynamical systems where the linearization is additionally self-adjoint, that is, the matrix satisfies . Recently, it has been observed that a matrix is self-adjoint if and only if it is equivariant with respect to the action of a group which is isomorphic to (see [3]). This underlying equivariance property is implicitly present in articles concerning the development of dynamical systems for the solution of certain optimization problems (e.g. [4, 5, 6]). But to the best of our knowledge it has not explicitly been stated elsewhere before – and definitely not in the dynamical systems context.
In this article, we extend this result from [3] significantly in the sense that we completely characterize the equivariance properties of self-adjoint matrices by their spectra. In fact, we will show in our main result on the equivariance properties of self-adjoint matrices (Corollary 4.5) that is isomorphic to where are the multiplicities of the eigenvalues of . In particular, if has only simple eigenvalues, then .
One important consequence of this result is the following observation: Suppose that the underlying dynamical system is -equivariant for an . Then – as already mentioned above – the linearization at a -symmetric steady-state solution generically possesses double eigenvalues. Our result implies that in this case the linearization will not just be -equivariant but (at least) even equivariant with respect to an action of .
Moreover, if in addition the entire function is -equivariant, then symmetry-related bifurcations of the system will be governed by rather than , and this leads to phenomena which would generically be unexpected if only is taken into account. This would apply, for instance, to numerical discretizations of the cubic or quintic Ginzburg–Landau equation on a -symmetric spatial domain ([7, 8]). Thus, from an abstract point of view our results are strongly related to the notion of hidden symmetries which has been introduced in connection with the occurrence of unexpected bifurcations in partial differential equations with Neumann boundary conditions ([9, 2]). We will illustrate this fact by several examples in the following sections.
A detailed outline of the structure of this article is as follows. In Section 2, we introduce a specific -equivariant dynamical system as a guiding example. This system exhibits unexpected dynamical phenomena driven by the underlying -equivariance: A symmetry-preserving pitchfork bifurcation and the existence of an entire orbit of steady-state solutions. Then, in Section 3, we review briefly the main result from [3]. This will allow us to reveal the structure which leads to the symmetric pitchfork bifurcation in the guiding example. Our main results concerning the equivariance properties of self-adjoint matrices are stated in Section 4. In Section 5, we discuss the consequences for bifurcations in equivariant dynamical systems. Finally, in Section 6, we discuss a couple of further applications: First we characterize all solutions of the two-sided orthogonal Procrustes problem (Section 6.1). Then we briefly discuss consequences for the graph isomorphism problem for undirected graphs (Section 6.2). We conclude with the construction of simple approximations of derivatives of higher order for real valued functions (Section 6.3). Here we make use of the fact that each Hessian is symmetric and therefore also -equivariant.
2 Motivation – the Guiding Example
As a guiding example we consider the differential equation
[TABLE]
where and
[TABLE]
Observe that this problem has an obvious -symmetry: first the matrix commutes for all with the permutation matrix
[TABLE]
That is,
[TABLE]
Moreover, is invariant under orthogonal transformations. Therefore, the right-hand side in (1) is -equivariant and satisfies
[TABLE]
Thus, by genericity results from classical bifurcation or singularity theory (see [1]) we would particularly expect that
the only steady-state bifurcations that occur in (1) are turning points or (symmetry-breaking) pitchfork bifurcations corresponding to the underlying symmetry given by (e.g. [1, 10]);
equilibria of (1) are isolated.
In contrast to this expectation we observe the following two phenomena for (1):
Apparently the system undergoes a pitchfork bifurcation at . The corresponding local bifurcation diagram is shown in Figure 1 (a). However, at the bifurcation point a normalized kernel vector of is given by
[TABLE]
In particular, this eigenvector is -symmetric () rather than antisymmetric () as expected. Accordingly, also the equilibria on the bifurcating branches are -symmetric, see Figure 1 (a).
For we find not just , but in addition an entire continuous orbit of equilibria for (1), see Figure 1 (b). In fact, we will see in Section 5 that such orbits exist for an entire range of parameter values.
It is the purpose of this work to explain such phenomena, and we will see that this is strongly related to the fact that is self-adjoint. In fact, our results will imply that the dynamical system (1) is -equivariant with , and this will explain the phenomena described in and .
3 Self-adjoint Matrices are Equivariant – a Warm-up
In this section, we briefly summarize the main result from [3]. With this we illustrate the underlying structure, namely that equivariance properties of self-adjoint matrices are induced by the symmetry properties of diagonal matrices.
Let be the abelian group consisting of the matrices
[TABLE]
For any diagonal matrix
[TABLE]
we obviously have
[TABLE]
In fact, it is easy to verify for an arbitrary matrix that
[TABLE]
Proposition 3.1** ([3]).**
A matrix is self-adjoint (i.e. ) if and only if there is an orthogonal matrix such that
[TABLE]
where the group is defined by
[TABLE]
We state the proof for the sake of completeness.
Proof.
Suppose that . Then there is such that
[TABLE]
is a diagonal matrix. By (4) we have for all
[TABLE]
Therefore, satisfies the equivariance condition (5).
Now suppose that (5) is satisfied for some . Then the matrix commutes with every , and by (4) it follows that is a diagonal matrix. Therefore,
[TABLE]
as desired. ∎
Remark 3.2**.**
- (a)
Observe that the implication ‘’ could also be proved by using the well-known fact that two matrices and commute if there is an orthogonal transformation such that both and are diagonal.
- (b)
By construction all the eigenvalues of every are or . In particular for all , and . Moreover, by (a) the matrix and all possess the same set of eigenvectors.
- (c)
Obviously, analogous results can be obtained for Hermitian or normal matrices: Using essentially the same proof as in Proposition 3.1 one can show that a matrix is normal (i.e. ) if and only if there is a unitary matrix such that
[TABLE]
where the group is defined by
[TABLE]
Example 3.3**.**
Let us consider the matrix from our guiding example in Section 2, i.e.
[TABLE]
The matrix
[TABLE]
transforms into a diagonal matrix with the eigenvalues of on the diagonal. With Proposition 3.1 we can compute eight matrices which commute with – the elements of –, and these matrices are given by
[TABLE]
and for . (The entries in the matrices are exact up to four decimal places.) For the eigenvector in (3) we compute
[TABLE]
Thus, a symmetry breaking pitchfork bifurcation at is induced by the group rather than (see (2)), and this explains the phenomenon discussed in Section 2.
Observe that is not among the matrices , i.e. , so that there is still another structure to be revealed. We will see in the following section that this is related to the fact that is a double eigenvalue of .
4 Self-adjoint Matrices are Equivariant – the General Case
In this section, we generalize Proposition 3.1 significantly. In fact, we will show that in general the group may be much more complex – even in the case where is not equivariant in the classical sense where e.g. underlying geometric symmetries lead to equvariance properties of a dynamical system.
4.1 Orthogonal Isotropy Subgroups for Matrices
The following observation forms the theoretical basis for our analytical investigations. With this result we state a useful characterization of the group containing all which commute with a given matrix .
Proposition 4.1**.**
Let and . Define the compact group
[TABLE]
and let
[TABLE]
Then for every
[TABLE]
In particular, does not depend on , and we refer to as the orthogonal isotropy subgroup of .
Proof.
Let such that . Then we have for each
[TABLE]
Remark 4.2**.**
- (a)
Recall that the isotropy subgroup for a point in some space characterizes the symmetry of with respect to a certain group action. More precisely consider a group action of a group on a linear space . Then the isotropy subgroup of is given by .
If we let act on matrices by
[TABLE]
then in Proposition 4.1 is the isotropy subgroup of with respect to this action.
- (b)
If we replace by (unitary matrices) or (invertible matrices) and the matrix by or , respectively, then we obtain an analogous result for the unitary and invertible isotropy subgroup. It is also possible to formulate Proposition 4.1 for general orthogonal, unitary or invertible operators.
We now show that in (7) is unique up to orthogonal transformations.
Corollary 4.3**.**
Let .
- (a)
For each there exists such that
[TABLE]
- (b)
For each there exists such that
[TABLE]
Thus,
[TABLE]
Proof.
Let be given. Then, by Proposition 4.1, we have for
[TABLE]
and therefore
[TABLE]
With we obtain (a) and (b) follows by setting . ∎
4.2 Equivariance Properties of Self-adjoint Matrices
We now return to the case where is self-adjoint. Our aim is to extend significantly Proposition 3.1. This leads to the surprising fact that self-adjoint matrices may possess hidden symmetries due to repeated eigenvalues. Denote by the (real) sorted eigenvalues of with multiplicities and let so that , where is a diagonal matrix containing the sorted eigenvalues of on its diagonal.
Definition 4.4**.**
Let and so that . Define to be the set of block-diagonal matrices where the -th block is in , i.e.
[TABLE]
With this useful definition we are able to completely characterize the symmetries of a self-adjoint matrix .
Corollary 4.5**.**
Let be self-adjoint and so that diagonalizes (and the eigenvalues on the diagonal are sorted). Then
[TABLE]
where is the vector that contains the multiplicities of the eigenvalues of . In particular, by Proposition 4.1 we have
[TABLE]
Proof.
Using the fact that diagonalizes we can write as
[TABLE]
Thus, we only need to show that is equivalent to the fact that is a block-diagonal matrix. Let and write with rectangular blocks . Then we have and . Therefore, translates into
[TABLE]
which is equivalent to for and for . ∎
Remark 4.6**.**
- (a)
The order of the eigenvalues is not relevant as long as is chosen in such a way that all instances of the same eigenvalue on the diagonal of are next to each other. Otherwise, the elements in are not block-diagonal. 2. (b)
Analogous results for normal matrices in the unitary case and diagonalizable matrices in the invertible case (cf. Remark 4.2 (b)) follow in the same way. 3. (c)
Since and for every , we have for each self-adjoint matrix (independently of the multiplicities of the eigenvalues of ). That is, we always have (cf. Proposition 3.1 and Remark 3.2 (b)), and equality holds if and only if has only simple eigenvalues. In particular, is finite if and only if has only simple eigenvalues.
Example 4.7**.**
- (a)
Let us return to our guiding example from Section 2 and consider the matrix
[TABLE]
The eigenvalues of are a simple eigenvalue and a double eigenvalue . Thus, Corollary 4.5 yields and it turns out that is in fact -equivariant. Here (cf. Example 3.3), and a reflection and a rotation by (exact up to four decimal places) within are given by
[TABLE]
It follows that if we have an equilibrium which is not -symmetric, then we obtain an entire nontrivial -orbit of equilibria. This explains the phenomenon described in for the guiding example in Section 2, see also Figure 1 (b).
Finally, observe that the bifurcating equilibria in Figure 1 (a) are -symmetric and therefore isolated for each fixed value of . Note that generically, it is not expected for a one-parameter family of self-adjoint matrices to possess multiple eigenvalues (see Appendix 10 in **[11]**; or also **[12]** where results of **[13]** on general matrices have been extended to self-adjoint matrices in the equivariant context). Rather we have constructed this family for the purpose of illustration.
- (b)
Consider the parameter-dependent family of matrices (see **[12]**)
[TABLE]
with
[TABLE]
and
[TABLE]
for . Then it is easy to verify that is -equivariant. Here the action of the dihedral group is generated by a rotation and a reflection , where . Written in a -block structure these are given by
[TABLE]
However, is self-adjoint and therefore this matrix family is not just - but even -equivariant where is given in (9). Now has simple and double eigenvalues, where the multiple eigenvalues are induced by the two-dimensional irreducible representations of (see **[1]**). Hence, for each parameter value is indeed -equivariant by Corollary 4.5 and (9).
Finally, we explicitly list for a couple of elements of and for illustration purposes. For instance, the matrices and above are given by
[TABLE]
where and
[TABLE]
and
[TABLE]
A couple of ‘hidden symmetries’ – i.e. elements of which are not contained in – are given by
[TABLE]
[TABLE]
or
[TABLE]
[TABLE]
5 Implications for Equivariant Dynamical Systems
In this section, we discuss by an example the implications for dynamical systems of the form
[TABLE]
where is self-adjoint for all and is -equivariant, i.e.
[TABLE]
It follows that the right-hand side in (10) is -equivariant. In particular, the symmetry group varies with the parameter , and a detailed bifurcation analysis in this context should be developed elsewhere. Here, we rather focus on the description of qualitative dynamical phenomena induced by the hidden symmetries.
Remark 5.1**.**
- (a)
Observe that the requirement on is satisfied if, for instance,
[TABLE]
where is -invariant, that is for all . In particular, the equivariance condition (11) would hold for or where is -invariant.
- (b)
If we consider e.g. the nonlinear Schrödinger/Gross–Pitaevskii equation **[14]** or the cubic Ginzburg–Landau equation **[8]** in two dimensions then a numerical discretization by the method of lines yields a dynamical system of the form (10), where does not explicitly depend on . Moreover, if the underlying spatial domain is -symmetric () then symmetry related bifurcations of the system will be governed by rather than just . In fact, in this case we expect to observe phenomena driven by the hidden symmetries as already described in Example 4.7 (b).
As a concrete example, we consider our guiding example introduced in Section 2 and let
[TABLE]
and (see (12)). The eigenvalues of are with multiplicity and with multiplicity .
In the following analytic considerations, we will use the fact that a point is an equilibrium of (10) if and only if is an eigenvalue of and is a corresponding eigenvector. This follows immediately from the structure of (10). For this means that every appropriately scaled eigenvector of a positive eigenvalue of is an equilibrium and vice versa (i.e. every equilibrium is an appropriately scaled eigenvector of ).
Since [math] is always an equilibrium it will be omitted in the following considerations. By the results in this work, we immediately know how the set of equilibria of (10) changes with respect to . Observe that if is an equilibrium, then the -equivariance implies that is an equilibrium for all .
- •
: There is a circle of equilibria induced by the equivariance of (see Example 4.7 (a)).
- •
: There still is a circle of equilibria and a pitchfork bifurcation occurs (cf. Section 2).
- •
: There is a circle of equilibria and two isolated equilibrium points.
- •
: Since possesses the threefold eigenvalue the set of equilibria becomes a sphere.
- •
: The sphere breaks up and there is again a circle of equilibria and two isolated equilibrium points.
- •
: A subcritical pitchfork bifurcation occurs, and the circle of equilibria disappears.
- •
: There are only two equilibria left.
Figure 2 shows the set of equilibria for different values of .
6 Other Applications
In addition to the implications for symmetry breaking bifurcation phenomena as illustrated in Section 5, our results have further applications. We illustrate this briefly by the following three examples.
6.1 Two-sided orthogonal Procrustes problem
Given two symmetric matrices , the two-sided orthogonal Procrustes problem can be defined as follows: Find an orthogonal matrix such that the cost function is minimized. It is well known – see, for instance, [4, 15, 16] – that an optimal solution is given by , where and are the eigendecompositions of and , respectively. Note that the eigenvalues in and both have to be sorted in nonincreasing (or, alternatively, nondecreasing) order. If the cost function is to be maximized, the eigenvalues need to be ordered in opposite order. With the aid of the results from Section 4, we can now characterize all solutions of this form, i.e.
[TABLE]
since for such we obtain
[TABLE]
which is indeed the optimal solution. Here, we used the invariance of the Frobenius norm under unitary transformations and the equivariance properties.
Remark 6.1**.**
- (a)
If , then and it suffices to consider matrices of the form for (or, equivalently, ). 2. (b)
If, furthermore, all eigenvalues are distinct, then the eigenvectors are determined up to the sign and we obtain the group and thus the special case derived in **[4]**. 3. (c)
Since minimizing the Procrustes cost function corresponds to maximizing the cost function and vice versa, the results can be extended to the orthogonal relaxation of the quadratic assignment problem (QAP) **[15, 16]**. 4. (d)
Given two undirected graphs and with adjacency matrices and , the graphs are isomorphic if they are isospectral and contains a permutation matrix, see also **[17]**.
6.2 Graph Symmetries
An isomorphism from a graph to itself is called an automorphism. Let be the adjacency matrix of an undirected graph , then the automorphism group (or symmetry group) of is defined as
[TABLE]
A graph is called asymmetric if is trivial, i.e. . Since is self-adjoint, we can use Corollary 4.5 to identify the orthogonal commutator of . Permutation matrices are orthogonal, hence .
Our results show that even in the case where the graph is asymmetric it typically possesses additional symmetries – namely the elements of the group . We illustrate this with the following example (cf. [18], Figure 5):
Example 6.2**.**
Consider the graph with adjacency matrix shown in Figure 3. The graph is asymmetric (due to the edge between the vertices and ). The eigenvalues are with corresponding multiplicities . (Here we use the notation from Section 4.2.) By Corollary 4.5, we know that if is the orthogonal matrix that diagonalizes , then all are of the form
[TABLE]
with an arbitrary . For instance, we have for
[TABLE]
or for
[TABLE]
6.3 Taylor Expansions
Finally, let us briefly mention one implication involving Taylor expansions. In fact, in this context our main result Corollary 4.5 can be used to develop a novel general technique for the construction of higher order stencils for real valued functions of several variables.
Suppose that is smooth in a neighborhood of . In what follows, we use Corollary 4.5 to construct a four-point stencil which provides a second order approximation of evaluations of the fourth-order derivative in . For convenience, we write the Taylor expansion of in as
[TABLE]
where , , and is the Hessian matrix of at .
Corollary 6.3**.**
Denote by the group in (9) corresponding to the Hessian matrix . Then for all we have
[TABLE]
and therefore for all
[TABLE]
In particular, .
Proof.
For and we compute
[TABLE]
Therefore, using the fact that
[TABLE]
and (13), (14) immediately follow. ∎
Obviously, if then this result is not useful. However, for all other choices of this leads to interesting approximations of the fourth-order derivative as long as is not an eigenvector of ().
Example 6.4**.**
Let be defined by
[TABLE]
We choose and compute
[TABLE]
The choice of
[TABLE]
where leads to
[TABLE]
For we obtain
[TABLE]
and for one computes
[TABLE]
as expected.
Acknowledgements
This work is supported by the Priority Programme SPP 1881 Turbulent Superstructures of the Deutsche Forschungsgemeinschaft. We also thank an anonymous reviewer for important comments on the contents of this article.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] M. Golubitsky, I. Stewart, and D. Schaeffer. Singularities and Groups in Bifurcation Theory . Springer, 1988.
- 2[2] M. Golubitsky and I. Stewart. The Symmetry Perspective: from equilibrium to chaos in phase space and physical space . Birkhäuser, 2003.
- 3[3] M. Dellnitz. Self-adjoint matrices are equivariant. ar Xiv preprint 1701.07020 , 2017.
- 4[4] P. Schönemann. On two-sided orthogonal Procrustes problems. Psychometrika , 33(1):19–33, 1968.
- 5[5] R.W. Brockett. Least squares matching problems. Linear Algebra and its Applications , 122:761–777, 1989.
- 6[6] R.W. Brockett. Dynamical systems that sort lists, diagonalize matrices and solve linear programming problems. Linear Algebra and its Applications , 146:79–91, 1991.
- 7[7] H. Chaté and P. Manneville. Phase diagram of the two-dimensional complex Ginzburg–Landau equation. Physica A: Statistical Mechanics and its Applications , 224:348–368, 1996.
- 8[8] I.S. Aranson and L. Kramer. The world of the complex Ginzburg–Landau equation , volume 74 of Reviews of Modern Physics . The American Physical Society, 2002.
