A Gentle Introduction to a Beautiful Theorem of Molien
Holger Schellwat

TL;DR
This paper provides an accessible proof of Molien's Theorem in Invariant Theory using modern linear algebra and group theory to ensure its understanding and preservation.
Contribution
It offers a simplified, modern proof of Molien's Theorem, making this important result more accessible to contemporary mathematicians.
Findings
Clear proof of Molien's Theorem presented
Bridges classical invariant theory with modern linear algebra
Aims to prevent the theorem from being forgotten
Abstract
The purpose of this note is to give an accessible proof of Moliens Theorem in Invariant Theory, in the language of today's Linear Algebra and Group Theory, in order to prevent this beautiful theorem from being forgotten.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Mathematical Identities · Commutative Algebra and Its Applications · Algebraic Geometry and Number Theory
A Gentle Introduction to a Beautiful Theorem of Molien
Holger Schellwat
[email protected], Örebro universitet, Sweden
Universidade Eduardo Mondlane, Moçambique
(12 January, 2017)
Abstract
The purpose of this note is to give an accessible proof of Moliens Theorem in Invariant Theory, in the language of today’s Linear Algebra and Group Theory, in order to prevent this beautiful theorem from being forgotten.
Contents
- 1 Preliminaries
- 2 The Magic Square
- 3 Averaging over the Group
- 4 Eigenvectors and eigenvalues
- 5 Moliens Theorem
- 6 Symbol table
- 7 Lost and found
- References
Introduction
We present some memories of a visit to the ring zoo in 2004. This time we met an animal looking like a unicorn, known by the name of invariant theory. It is rare, old, and very beautiful. The purpose of this note is to give an almost self contained introduction to and clarify the proof of the amazing theorem of Molien, as presented in [Slo77]. An introduction into this area, and much more, is contained in [Stu93]. There are many very short proofs of this theorem, for instance in [Sta79], [Hu90], and [Tam91].
Informally, Moliens Theorem is a power series generating function formula for counting the dimensions of subrings of homogeneous polynomials of certain degree which are invariant under the action of a finite group acting on the variables. As an apetizer, we display this stunning formula:
[TABLE]
We can immediately see elements of linear algebra, representation theory, and enumerative combinatorics in it, all linked together. The paper [Slo77] nicely shows how this method can be applied in Coding theory. For Coding Theory in general, see [Bie04].
Before we can formulate the Theorem, we need to set the stage by looking at some Linear Algebra (see [Rom 08]), Group Theory (see [Hu96]), and Representation Theory (see [Sag 91] and [Tam91]).
1 Preliminaries
Let be a finite dimensional complex inner product space with orthonormal basis and let be the orthonormal basis of the algebraic dual space satisfying . Let be a finite group acting unitarily linear on from the left, that is, for every the mapping is a unitary bijective linear transformation. Using coordinates, this can be expressed as , where is unitary. Thus, the action is a unitary representation of , or in other words, a –module. Note that we are using left composition and column vectors, i.e. , c. f. [Ant73].
The elements of are linear forms(linear functionals), and the elements , looking like variables, are also linear forms, this will be important later.
Thinking of as variables, we may view (see [Tam91]) , the symmetric algebra on as the algebra of polynomial functions or polynomials in these variables (linear forms). It is naturally graded by degree as , where is the vector space spanned by the polynomials of (total) degree , in particular, , and .
The action of on can be lifted to an action on .
1.1 Proposition**.**
Let , , as above. Then the mapping defined by for is a left action.
Proof.
For , , and we check
2. 2.
[TABLE]
∎
In fact, we know more.
1.2 Proposition**.**
Let , , as above. For every , the mapping is an algebra automorphism preserving the grading, i.e. (here we do not bother about surjectivity).
Proof.
For , , , and we check
[TABLE] 2. 2.
[TABLE] 3. 3.
4. 4.
By part it is clear that the grading is preserved. 5. 5.
To show that is bijective it is enough to show that this mapping is injective on the finite dimensional homogeneous components . Let us introduce a name for this mappig, say . Now implies that , i.e. is a polynomial mapping from to of degree vanishing identically, . By definition of the extended action we have . Since acts on this implies that , so is the zero mapping. Since our ground field has characteristic [math], this implies that is the zero polynomial, which we may view as an element of every . See for instance [Cox91], proposition 5 in section 1.1. 6. 6.
Note that every is also surjective, since all group elements have their inverse in .
∎
Both propositions together give us a homomorphism from into . They also clarify the rôle of the induced matrices, which are classical in this area, as mentionend in [Slo77]. Since the monomials of degree one form a basis for , it follows from the proposition that their products form a basis for , and, in general, the monomials of degree in the linear forms (!) form a basis of . Clearly, they certainly span , and by the last observation in the last proof they are linearly independent.
1.3 Definition**.**
In the context from above, that is , , and , we define
[TABLE]
1.4 Remark*.*
In particular, we have see proposition 1.6 below.
Keep in mind that a function maps to . Setting , then is the –th induced matrix in [Slo77], because . Also, if are eigenvectors of corresponding to the eigenvalues , then is an eigenvector of with eigenvalue , because . All this generalizes to , we will get back to that later.
We end this section by verifying two little facts needed in the next section.
1.5 Proposition**.**
The first induced operator of the inverse of a group element is given by .
Proof.
Since , it is sufficient to prove that . Keep in mind that . For arbitrary we see that
[TABLE]
∎
We will be mixing group action notation and composition freely, depending on the context. The following observation is a translation device.
1.6 Proposition**.**
For nd the following holds:
[TABLE]
Proof.
For we see ∎
2 The Magic Square
Remember that we require a unitary representation of , that is the operators need to be unitary, i.e. . The first goal of this sections is to show that this implies that the induced operators are also unitary. We saw that , the algebraic dual of . In order to understand the operator duals of and we need to look on their inner products first. We may assume that the operators are unitary with respect to the standard inner product , where denotes the dot product.
Before we can speak of unitarity of the induced operators we have to make clear which inner product applies on . Quite naively, for we are tempted to define .
We will motivate this in a while, but first we take a look at the diagram in [Rom 08], chapter10, with our objects:
[TABLE]
Here (“Rho” ) denotes the Riesz map, see [Rom 08], Theorem 9.18, where it is called , but denotes already our big ring. We started by looking at the operator , which is unitary, so its inverse is the Hilbert space adjoint . Omiting the names of the bases we have . We also see the operator adjoint with matrix , the transpose. However, the arrow for is not in the original diagram, but soon we will see it there, too.
Fortunately, the Riesz map turns a linear form into a vector and its inverse maps a vector to a linear form, both are conjugate isomorphisms. This is mostly all we need in order to show that is unitary. In the following three propositions we use that has the orthonormal basis and that has the orthonormal basis .
2.1 Proposition**.**
For every the coordinates of its Riesz vector are given by
[TABLE]
Proof.
Writing for the inverse of , we need to show that
[TABLE]
which is equivalent to
[TABLE]
It is sufficient to show the latter for values of on the basis vectors , . We obtain
[TABLE]
∎
In particular, this implies that .
2.2 Proposition**.**
Our makeshift inner product on satisfies
[TABLE]
where .
Proof.
By our vague definition we have . It is enough to show that . From the comment after the proof of Proposition 2.1 we obtain
[TABLE]
∎
Hence, our guess for the inner product on was correct. We will now relate the Riesz vector of to the Riesz vector of . Recall that the Riesz vector of is the unique vector such that for all . If it can be found by scaling any nonzero vector in the cokernel of , which is one–dimensional, see [Rom 08], in particular Theorem 9.18.
2.3 Proposition**.**
Let be unitary, , the vector of . Then is the Riesz vector of , i.e. the Riesz vector of .
Proof.
We may assume that . Using the notation for the one–dimensional subspace spanned by , we start with a little diagram:
[TABLE]
wheere denotes the orthogonal direct sum.
We need to show that , i.e. that for all . Since the vector of , we have for all . We obtain
[TABLE]
From remark 1.4 we conclude that . ∎
Observe that proposition 2.3 implies the commutativity of the following two diagrams.
[TABLE]
Indeed, 2.3 implies
[TABLE]
2.4 Proposition**.**
The first induced operator is unitary.
Proof.
We may use that is unitary, that is,
[TABLE]
Let arbitrary, , and . We need to check that . We see that
[TABLE]
∎
After having looked at eigenvalues we will see that this generalizes to higher degree, that is diagonalizable for all . But first let us look at the matrix version of proposition 2.4.
2.5 Proposition**.**
[TABLE]
Proof.
Let and . We will use the commutativity of the diagram, i.e. , which we will mark as . No, the proof is not finished here. We get and
[TABLE]
On the other hand, implies . Together we obtain , and the proposition follows. ∎
3 Averaging over the Group
Now we apply averaging to obtain self-adjoint operators.
3.1 Definition**.**
We define the following operators:
2. 2.
These are sometimes called the Reynolds operator of .
3.2 Proposition**.**
The operators and are self-adjoint (Hermitian).
Proof.
The idea of the averaging trick is that if runs through all group element and is fixed, then the products run also through all group elements. We will make use of the facts that every and every is unitary.
We need to show that for arbitrary . We obtain
[TABLE] 2. 2.
The same proof, mutitis mutandis, replacing , , , and shows that
∎
Consequently, and are unitarily diagonalizable with real spectrum.
3.3 Proposition**.**
The operators and are idempotent, i.e.
** 2. 2.
* .*
In particular, the eigenvalues of both operators are either [math] or .
Proof.
Again, we show only one part, the other part is analog. To begin with, let be fixed. Then
[TABLE]
From this it follows that
[TABLE]
From we conclude that . Thus the minimal polynomial of divides the polynomial , so all eigenvalues are contained in . ∎
We will now look at the eigenvalues of and and their interrelation. Since both operators are unitary, their eigenvalues have absolute value .
3.4 Proposition**.**
If is an eigenvector of for the eigenvalue , then is an eigenvector of for the eigenvalue . 2. 2.
If is an eigenvector of for the eigenvalue , then is an eigenvector of for the eigenvalue . 3. 3.
If is an eigenvector of for the eigenvalue , then is an eigenvector of for the eigenvalue . 4. 4.
If is an eigenvector of for the eigenvalue , then is an eigenvector of for the eigenvalue .
Proof.
We will make use of the commutativity of Proposition 2.3. Observe that and .
[TABLE] 2. 2.
[TABLE] 3. 3.
[TABLE] 4. 4.
[TABLE]
∎
This implies that if we consider the union of the spectra over all , then we obtain the same (multi)set, no matter if we take or .
4 Eigenvectors and eigenvalues
Now we continue from where we left at the end of section 1, fixing one group element and compare with for . By a method called stars and bars it is easy to see that
[TABLE]
Remember that every is unitarily diagonalizable with eigenvalues of absolute value . If , then has an orthonormal basis , such that for all , and . Moreover,
[TABLE]
where is unitary.
For put
[TABLE]
all monomials in the of total degree , numbered from to .
These are certainly linear independent, since we have no relations amongst the variables, and span , since every monomial of total degree can be written as a linear combination of these. So the form a basis for . We will not require that this can be made into an orthonormal basis, we do not even consider any inner product on for .
We rather want to establish that
[TABLE]
is a basis of eigenvectors of diagonalizing , using the same numbering.
Arranging the eigenvalues of in the sam way we put
[TABLE]
Now we establish that the , are the eigenvectors for the eigenvalues of .
4.1 Proposition**.**
In the context above,
[TABLE]
for all .
Proof.
The key is proposition 1.2, as in the preliminary observations at the end of section 1. Let
[TABLE]
and
[TABLE]
where and the sum of these exponents is . Then
[TABLE]
∎
As a consequence, has a basis of eigenvectors of and is similar to the diagonal matrix .
5 Moliens Theorem
We will now make some final preparations and then present the proof of Moliens Theorem.
For and we say that is an invariant of if and that is a (simple) invariant of if . The method of averaging from section 3 can also be applied to create invariants:
5.1 Proposition**.**
For put . Then is an invariant of .
Proof.
Let be arbitrary. We will show that . Clearly, from proposition 1.6 we get that
[TABLE]
∎
Now, we call
[TABLE]
the algebra of invariants of .
5.2 Proposition**.**
* is a subalgebra of .*
Proof.
Since the mapping is linear for every , is the intersection of subspaces, and hence a subspace. Let us check the subring conditions in more detail. For arbritrary , , and we have ,
For the zero we obtain , so . 2. 2.
We see
[TABLE] 3. 3.
Likewise,
[TABLE]
∎
Our subalgebra is graded in the same way as .
5.3 Proposition**.**
The algebra of invariants of is naturally graded as
[TABLE]
where , called the –th homogeneous component of .
Proof.
This follows directly from proposition 1.1 and proposition 1.2. ∎
5.4 Definition** (Molien series).**
Viewing as a vector space, we define
[TABLE]
the number of linearly independent homogeneous invariants of degree , and
[TABLE]
the Molien series of .
Thus, the Molien series of is an ordinary power series generating function whose coefficients are the numbers of linearly independent homogeneous invariants of degree . The following beautiful formula gives these numbers, its proof is the aim of this paper.
5.5 Theorem** (Molien, 1897).**
[TABLE]
Following [Slo77] we first look the number of linearly independent homogeneous invariants of degree .
5.6 Theorem** (Theorem 13 in [Slo77]).**
[TABLE]
Proof.
First, we note that the equation follows from the remark at the end of section 3, since the sum for the trace runs over all group elements. Remember that the trace is independent of the choice of basis. From proposition 3.3 we know that both operators are idempotent hermitian and has a an orthornormal basis of eigenvectors of , corresponding to the eigenvalues , so
[TABLE]
Let us say that this matrix has entries and the remaining entries [math]. By rearranging the eigenvalues and eigenvectors we may assume that the first entries are and the remaining are [math], i.e.
[TABLE]
Hence for and for . Any linear invariant of is certainly fixed by , so . On the other hand, by proposition 5.1, is an invariant of for every , so . Together, . ∎
Before the final proof, let us introduce a handy notation.
5.7 Definition**.**
Let or . Then denotes the coefficient of in .
So, for example and .
Proof.
(Moliens Theorem) We just established the case , so the reader is probably expecting a proof by induction over . But this is not the case. Rather, the case applies to all . Note that is equal to the number of linearly independent invariants of all of the . So Theorem 5.6 gives us
[TABLE]
where the latter includes the first. From definition 3.1 we also have
[TABLE]
so we already know that
[TABLE]
So all we need to show is
[TABLE]
We will show that for every summand (group element) the equation
[TABLE]
holds. From proposition 4.1 we get for every that
[TABLE]
sum of the products of the , taken of them at a time. On the other hand, for the same we obtain from section 4 that so that
[TABLE]
so
[TABLE]
and here the coefficient of is also sum of the products of , taken of them at a time.
Again, the last claim
[TABLE]
follows from the remark at the end of section 3.2, since the sum runs over all group elements. ∎
6 Symbol table
number of linearly independent homogeneous invariants of degree
Dimension of
ON basis for
Finite group
eigenvalue of ([Slo77] )
“Rho” Riesz vector of .
Unitary representation
Big algebra, direct sum of
Direct summand of degree
Ring of invariants of
Degree summand
representation of on , ([Slo77] )
Complex inner product space
Algebraic dual of
7 Lost and found
Some things to explore from here:
- •
If we know the conjugacy classes of , we may be able to say more, since every unitary representation splits into irreducible components.
- •
There seems to be a link to Pólya enumeration.
- •
We have GAP code, see [GAP].
- •
An example would be nice.
- •
Relations on the generators in of the Cayley graph should lead to conditions of the minimal polynomial of its adjacency operator .
- •
Also, Cayley graphs of some finite reflection groups [Hu90] should become accessible.
- •
Check some more applications, as mentioned in [Slo77].
- •
For finding invariants, check also [Cox91], Gröbner bases.
Index
- algebra of invariants §5
- coefficient Definition 5.7
- diagonal matrix §4
- first induced operator Proposition 1.5
- homogeneous component Proposition 5.3
- idempotent Proposition 3.3
- invariant §5
- left composition §1
- linear forms §1
- Molien series Definition 5.4
- Reynolds §3
- Riesz map §2
- stars and bars §4
- symmetric algebra §1
1701.04692.tex Typeset:
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[Ant 73] Howard Anton, Elementary Linear Algebra , 6 t h superscript 6 𝑡 ℎ 6^{th} ed., John Wiley and Sons, New York, 1973.
- 2[Bie 04] Jürgen Bierbrauer, Introduction to Coding Theory , Discrete Mathematics and Its Applications, Volume: 28, CRC Press Inc, Boca Raton, 2004.
- 3[Cox 91] D. Cox, J. Little, D. O’Shea, Ideals, Varieties, and Algorithms , Springer-Verlag, New York, 1991.
- 4[GAP] The GAP Group, GAP – Groups, Algorithms, and Programming, Version 4.4 ; 2004, (http://www.gap-system.org) .
- 5[Hu 96] John F. Humphreys, A Course in Group Theory , Oxford University Press, Oxford, 1994.
- 6[Hu 90] James E. Humphreys, Reflection Groups and Coxeter Groups , Cambridge University Press, Cambridge, 1990.
- 7[Rom 08] Steven Roman, Advanced linear algebra, 3 r d superscript 3 𝑟 𝑑 3^{rd} Edition , Springer-Verlag, New York, 2008.
- 8[Sag 91] Bruce E. Sagan, The Symmetric Group , Wadsworth & Brooks, Pacific Grove, 1991.
