A Gentle Introduction to a Beautiful Theorem of Molien

Holger Schellwat

arXiv:1701.04692·math.GM·January 18, 2017

TL;DR

This paper provides an accessible proof of Molien's Theorem in Invariant Theory using modern linear algebra and group theory to ensure its understanding and preservation.

Contribution

It offers a simplified, modern proof of Molien's Theorem, making this important result more accessible to contemporary mathematicians.

Findings

01 Clear proof of Molien's Theorem presented

02 Bridges classical invariant theory with modern linear algebra

03 Aims to prevent the theorem from being forgotten

Abstract

The purpose of this note is to give an accessible proof of Molien's Theorem in Invariant Theory, in the language of today's Linear Algebra and Group Theory, in order to prevent this beautiful theorem from being forgotten.

Equations

$\Phi_{G}(\lambda):=\frac{1}{|G|}\sum_{g\in G}\frac{1}{\det(\mathrm{id}-\lambda T_{g})}$

$((hg).f)(\mathbf{v})=f((hg)^{-1}.\mathbf{v})=f((g^{-1}h^{-1}).\mathbf{v})=f(g^{-1}.(h^{-1}.\mathbf{v}))=(g.f)(h^{-1}.\mathbf{v})=(h.(g.f))(\mathbf{v})$

$(g.(f+f^{\prime}))(\mathbf{v})=(f+f^{\prime})(g^{-1}.\mathbf{v})=f(g^{-1}.\mathbf{v})+f^{\prime}(g^{-1}.\mathbf{v})=(g.f)(\mathbf{v})+(g.f^{\prime})(\mathbf{v})=(g.f+g.f^{\prime})(\mathbf{v})$, thus $g.(f+f^{\prime})=g.f+g.f^{\prime}$

$(g.(f\cdot f^{\prime}))(\mathbf{v})=(f\cdot f^{\prime})(g^{-1}.\mathbf{v})=f(g^{-1}.\mathbf{v})\cdot f^{\prime}(g^{-1}.\mathbf{v})=(g.f)(\mathbf{v})\cdot(g.f^{\prime})(\mathbf{v})=(g.f\cdot g.f^{\prime})(\mathbf{v})$, thus $g.(f\cdot f^{\prime})=g.f\cdot g.f^{\prime}$

$T_{g}^{d}:R_{d}\to R_{d},\ f\mapsto g.f:V\to\mathbf{C},\ \mathbf{v}\mapsto f(g^{-1}.\mathbf{v})=f(T_{g^{-1}}(\mathbf{v}))$

$(T_{g^{-1}}^{1}\circ T_{g}^{1})(f)=T_{g^{-1}}^{1}(T_{g}^{1}(f))=T_{g^{-1}}^{1}(g.f)=g^{-1}.(g.f)=(g^{-1}g).f=f$

$T_{g}^{1}(f)=g.f=f\circ T_{g^{-1}}$

[Diagram: a square relating the operators $T_{g}^{1}$ and $T_{g}^{\times}$ on $V^{\ast}$ to the operators $T_{g}$ and $T_{g}^{\ast}$ on $V$, with the Riesz map $P$ as vertical arrows]

$[P(f)]_{\mathbf{e}}=(\overline{f(\mathbf{e}_{1})},\dots,\overline{f(\mathbf{e}_{n})})$

$P(f)=\sum_{i=1}^{n}\overline{f(\mathbf{e}_{i})}\,\mathbf{e}_{i}$

$f=\tau\left(\sum_{i=1}^{n}\overline{f(\mathbf{e}_{i})}\,\mathbf{e}_{i}\right)$

$\left(\tau\left(\sum_{i=1}^{n}\overline{f(\mathbf{e}_{i})}\,\mathbf{e}_{i}\right)\right)(\mathbf{e}_{j})=\sum_{i=1}^{n}\overline{\overline{f(\mathbf{e}_{i})}}\left\langle\mathbf{e}_{j}\,,\mathbf{e}_{i}\right\rangle=f(\mathbf{e}_{j})\cdot 1$

$\left\langle f\,,g\right\rangle=\left\langle P(f)\,,P(g)\right\rangle$

$\left\langle P(x_{i})\,,P(x_{j})\right\rangle=\left\langle\mathbf{e}_{i}\,,\mathbf{e}_{j}\right\rangle=\delta_{ij}=\mathbf{e}_{i}\bullet\mathbf{e}_{j}=[x_{i}]_{\mathbf{x},\mathbf{x}}\bullet\overline{[x_{j}]_{\mathbf{x},\mathbf{x}}}$

$\left\langle\mathbf{w}\right\rangle\odot\ker(f)\overset{T_{g}}{\longrightarrow}\left\langle T_{g}(\mathbf{w})\right\rangle\odot\ker(f\circ T_{g}^{-1})$

$(f\circ T_{g}^{-1})(\mathbf{v})$

$\begin{array}{ccc}V^{\ast}&\xrightarrow{\;T_{g}^{1}\;}&V^{\ast}\\ \downarrow P&&\downarrow P\\ V&\xrightarrow{\;T_{g}\;}&V\end{array}$ and $\begin{array}{ccc}V^{\ast}&\xrightarrow{\;(T_{g}^{1})^{-1}\;}&V^{\ast}\\ \downarrow P&&\downarrow P\\ V&\xrightarrow{\;(T_{g})^{-1}\;}&V\end{array}$

$P\circ T_{g}^{1}$

$P\circ(T_{g}^{1})^{-1}$

$\left\langle T_{g}(\mathbf{v})\,,\mathbf{w}\right\rangle=\left\langle\mathbf{v}\,,(T_{g})^{-1}(\mathbf{w})\right\rangle=\left\langle\mathbf{v}\,,T_{g^{-1}}(\mathbf{w})\right\rangle\quad(\ast)$

$\left\langle T_{g}^{1}(f)\,,h\right\rangle=\left\langle(T_{g}\circ P)(f)\,,P(h)\right\rangle=\left\langle T_{g}(\mathbf{w})\,,\mathbf{u}\right\rangle\overset{\ast}{=}\left\langle\mathbf{w}\,,T_{g}^{-1}(\mathbf{u})\right\rangle=\left\langle P(f)\,,T_{g}^{-1}(P(h))\right\rangle=\left\langle P(f)\,,(T_{g}^{-1}\circ P)(h)\right\rangle\overset{(2)}{=}\left\langle P(f)\,,(P\circ(T_{g}^{1})^{-1})(h)\right\rangle=\left\langle P(f)\,,P((T_{g}^{1})^{-1}(h))\right\rangle=\left\langle f\,,(T_{g}^{1})^{-1}(h)\right\rangle$

$[T_{g}^{1}]_{\mathbf{x},\mathbf{x}}=\overline{[T_{g}]_{\mathbf{e},\mathbf{e}}}$

$T_{g}^{1}(x_{i})=P^{-1}(T_{g}(\mathbf{e}_{i}))=P^{-1}\left(\sum_{k=1}^{n}a_{k,i}\mathbf{e}_{k}\right)\overset{\text{conj.}}{=}\sum_{k=1}^{n}\overline{a_{k,i}}\,P^{-1}(\mathbf{e}_{k})=\sum_{k=1}^{n}\overline{a_{k,i}}\,x_{k}$

$\left\langle\hat{T}(\mathbf{v})\,,\mathbf{w}\right\rangle\overset{\text{unit.}}{=}\frac{1}{|G|}\sum_{g\in G}\left\langle\mathbf{v}\,,(T_{g})^{-1}(\mathbf{w})\right\rangle=\frac{1}{|G|}\sum_{g\in G}\left\langle\mathbf{v}\,,T_{g^{-1}}(\mathbf{w})\right\rangle=\frac{1}{|G|}\sum_{g^{\prime}\in G}\left\langle\mathbf{v}\,,T_{g^{\prime}}(\mathbf{w})\right\rangle=\left\langle\mathbf{v}\,,\hat{T}(\mathbf{w})\right\rangle$

$T_{s}\circ\hat{T}=\frac{1}{|G|}\sum_{g\in G}T_{sg}=\frac{1}{|G|}\sum_{g^{\prime}\in G}T_{g^{\prime}}=\hat{T}$

$\hat{T}\circ\hat{T}$

Taxonomy

Topics: Advanced Mathematical Identities · Commutative Algebra and Its Applications · Algebraic Geometry and Number Theory

Full text

A Gentle Introduction to a Beautiful Theorem of Molien

Holger Schellwat

[email protected], Örebro universitet, Sweden

Universidade Eduardo Mondlane, Moçambique

(12 January, 2017)

Introduction

We present some memories of a visit to the ring zoo in 2004. This time we met an animal looking like a unicorn, known by the name of invariant theory. It is rare, old, and very beautiful. The purpose of this note is to give an almost self-contained introduction to, and to clarify the proof of, the amazing theorem of Molien, as presented in [Slo77]. An introduction into this area, and much more, is contained in [Stu93]. There are many very short proofs of this theorem, for instance in [Sta79], [Hu90], and [Tam91].

Informally, Molien's Theorem is a power series generating function formula for counting the dimensions of the spaces of homogeneous polynomials of a given degree which are invariant under the action of a finite group acting on the variables. As an appetizer, we display this stunning formula:

$$\Phi_{G}(\lambda):=\frac{1}{|G|}\sum_{g\in G}\frac{1}{\det(\mathrm{id}-\lambda T_{g})}$$

We can immediately see elements of linear algebra, representation theory, and enumerative combinatorics in it, all linked together. The paper [Slo77] nicely shows how this method can be applied in Coding theory. For Coding Theory in general, see [Bie04].

Before we can formulate the Theorem, we need to set the stage by looking at some Linear Algebra (see [Rom 08]), Group Theory (see [Hu96]), and Representation Theory (see [Sag 91] and [Tam91]).

1 Preliminaries

Let $V\cong{\mathbf{C}}^{n}$ be a finite-dimensional complex inner product space with orthonormal basis $\mathcal{B}=(\mathbf{e}_{1},\dots,\mathbf{e}_{n})$ and let $\mathbf{x}=(x_{1},\dots,x_{n})$ be the orthonormal basis of the algebraic dual space $V^{\ast}$ satisfying $\forall 1\leq i,j\leq n:x_{i}(\mathbf{e}_{j})=\delta_{ij}$. Let $G$ be a finite group acting linearly and unitarily on $V$ from the left, that is, for every $g\in G$ the mapping $V\to V,\mathbf{v}\mapsto g.\mathbf{v}$ is a unitary bijective linear transformation. Using coordinates, this can be expressed as $[g.\mathbf{v}]_{\mathcal{B}}=[g]_{\mathcal{B},\mathcal{B}}[\mathbf{v}]_{\mathcal{B}}$, where $[g]_{\mathcal{B},\mathcal{B}}$ is unitary. Thus, the action is a unitary representation of $G$, or in other words, a $G$–module. Note that we are using left composition and column vectors, i.e. $\mathbf{v}=(v_{1},\dots,v_{n})\overset{\text{convention}}{=}[v_{1}\,v_{2}\,\dots\,v_{n}]^{\top}$, cf. [Ant73].

The elements of $V^{\ast}$ are linear forms (linear functionals). The elements $x_{1},\dots,x_{n}$, although they look like variables, are also linear forms; this will be important later.

Thinking of $x_{1},\dots,x_{n}$ as variables, we may view (see [Tam91]) $S(V^{\ast})$ , the symmetric algebra on $V^{\ast}$ as the algebra $R:={\mathbf{C}}[\mathbf{x}]:={\mathbf{C}}[x_{1},\dots,x_{n}]$ of polynomial functions $V\to{\mathbf{C}}$ or polynomials in these variables (linear forms). It is naturally graded by degree as $R=\bigoplus_{d\in{\mathbf{N}}}R_{d}$ , where $R_{d}$ is the vector space spanned by the polynomials of (total) degree $d$ , in particular, $R_{0}={\mathbf{C}}$ , and $R_{1}=V^{\ast}$ .

The action of $G$ on $V$ can be lifted to an action on $R$ .

1.1 Proposition.

Let $V$ , $G$ , $R$ as above. Then the mapping $.:G\times R\to R,(g,f)\mapsto g.f$ defined by $(g.f)(\mathbf{v}):=f(g^{-1}.\mathbf{v})$ for $\mathbf{v}\in V$ is a left action.

Proof.

For $\mathbf{v}\in V$ , $g,h\in G$ , and $f\in R$ we check

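1. $(1.f)(\mathbf{v})=f(1^{-1}.\mathbf{v})=f(1.\mathbf{v})=f(\mathbf{v})$

2. $((hg).f)(\mathbf{v})=f((hg)^{-1}.\mathbf{v})=f((g^{-1}h^{-1}).\mathbf{v})=f(g^{-1}.(h^{-1}.\mathbf{v}))=(g.f)(h^{-1}.\mathbf{v})=(h.(g.f))(\mathbf{v})$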

∎
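The two checks in this proof can also be tried out numerically. The following is only a minimal sketch under assumptions that are not part of the text: the group is taken to be $S_{3}$ acting on ${\mathbf{C}}^{3}$ by permuting coordinates, and the names `apply`, `act`, `act_naive` and the sample polynomial `f` are hypothetical. It also illustrates why the inverse appears in the definition: without it, the rule fails to be a left action for a non-abelian group.

```python
from itertools import permutations

def apply(g, v):
    """Left action of a permutation g on a vector v: entry v[i] moves to position g[i]."""
    w = [None] * len(v)
    for i, target in enumerate(g):
        w[target] = v[i]
    return tuple(w)

def inverse(g):
    """Inverse permutation of g."""
    inv = [None] * len(g)
    for i, target in enumerate(g):
        inv[target] = i
    return tuple(inv)

def compose(h, g):
    """The product hg: first apply g, then h."""
    return tuple(h[g[i]] for i in range(len(g)))

def act(g, f):
    """Lifted action on functions: (g.f)(v) = f(g^{-1}.v)."""
    g_inv = inverse(g)
    return lambda v: f(apply(g_inv, v))

def act_naive(g, f):
    """Rule without the inverse, (g.f)(v) = f(g.v); used only for contrast."""
    return lambda v: f(apply(g, v))

def f(v):
    # Sample polynomial function V -> C (hypothetical), deliberately not symmetric.
    x1, x2, x3 = v
    return x1 + 2 * x2 ** 2 + 3 * x1 * x3 ** 3

GROUP = list(permutations(range(3)))
IDENTITY = (0, 1, 2)
SAMPLES = [(1, 2, 3), (2, -1, 5), (0, 7, -4)]

def is_left_action(action):
    """Check 1.f = f and (hg).f = h.(g.f) on the sample points."""
    for v in SAMPLES:
        if action(IDENTITY, f)(v) != f(v):
            return False
        for g in GROUP:
            for h in GROUP:
                if action(compose(h, g), f)(v) != action(h, action(g, f))(v):
                    return False
    return True

print(is_left_action(act), is_left_action(act_naive))
```

Checking finitely many sample points is of course no proof; it only confirms the two identities on those points.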

In fact, we know more.

1.2 Proposition.

Let $V$ , $G$ , $R$ as above. For every $g\in G$ , the mapping $T_{g}:R\to R,f\mapsto g.f$ is an algebra automorphism preserving the grading, i.e. $g.R_{d}\subset R_{d}$ (here we do not bother about surjectivity).

Proof.

For $\mathbf{v}\in V$ , $g\in G$ , $c\in{\mathbf{C}}$ , and $f,f^{\prime}\in R$ we check

1. $(g.(f+f^{\prime}))(\mathbf{v})=(f+f^{\prime})(g^{-1}.\mathbf{v})=f(g^{-1}.\mathbf{v})+f^{\prime}(g^{-1}.\mathbf{v})=(g.f)(\mathbf{v})+(g.f^{\prime})(\mathbf{v})=(g.f+g.f^{\prime})(\mathbf{v})$, thus $g.(f+f^{\prime})=g.f+g.f^{\prime}$.

2. $(g.(f\cdot f^{\prime}))(\mathbf{v})=(f\cdot f^{\prime})(g^{-1}.\mathbf{v})=f(g^{-1}.\mathbf{v})\cdot f^{\prime}(g^{-1}.\mathbf{v})=(g.f)(\mathbf{v})\cdot(g.f^{\prime})(\mathbf{v})=(g.f\cdot g.f^{\prime})(\mathbf{v})$, thus $g.(f\cdot f^{\prime})=g.f\cdot g.f^{\prime}$.

3. $(g.(cf))(\mathbf{v})=(cf)(g^{-1}.\mathbf{v})=c(f(g^{-1}.\mathbf{v}))=c((g.f)(\mathbf{v}))=(c(g.f))(\mathbf{v})$

4. By part 2 it is clear that the grading is preserved.

5. To show that $f\mapsto g.f$ is bijective it is enough to show that this mapping is injective on the finite dimensional homogeneous components $R_{d}$. Let us introduce a name for this mapping, say $T_{g}^{d}:R_{d}\to R_{d},f\mapsto g.f$. Now $f\in\ker(T_{g}^{d})$ implies that $g.f=0\in R_{d}$, i.e. $g.f$ is a polynomial mapping from $V$ to ${\mathbf{C}}$ of degree $d$ vanishing identically, $\forall\mathbf{v}\in V:(g.f)(\mathbf{v})=0$. By definition of the extended action we have $\forall\mathbf{v}\in V:f(g^{-1}.\mathbf{v})=0$. Since $G$ acts on $V$, the vectors $g^{-1}.\mathbf{v}$ run through all of $V$, so this implies that $\forall\mathbf{v}\in V:f(\mathbf{v})=0$, so $f$ is the zero mapping. Since our ground field has characteristic $0$, this implies that $f$ is the zero polynomial, which we may view as an element of every $R_{d}$. See for instance [Cox91], proposition 5 in section 1.1.

6. Note that every $T_{g}^{d}$ is also surjective, since all group elements have their inverse in $G$.

∎

Both propositions together give us a homomorphism from $G$ into $\mathrm{Aut}(R)$. They also clarify the rôle of the induced matrices, which are classical in this area, as mentioned in [Slo77]. Since the monomials $x_{1},\dots,x_{n}$ of degree one form a basis for $R_{1}$, it follows from the proposition that their products $\mathbf{x}_{2}:=(x_{1}^{2},x_{1}x_{2},x_{1}x_{3},\dots,x_{1}x_{n},x_{2}^{2},x_{2}x_{3},\dots)$ form a basis for $R_{2}$, and, in general, the monomials of degree $d$ in the linear forms (!) $x_{1},\dots,x_{n}$ form a basis $\mathbf{x}_{d}$ of $R_{d}$. Clearly, they span $R_{d}$, and by the last observation in the last proof they are linearly independent.

1.3 Definition.

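In the context from above, that is $g\in G$, $f\in R_{d}$, and $\mathbf{v}\in V$, we define

$$T_{g}^{d}:R_{d}\to R_{d},\quad f\mapsto g.f:V\to{\mathbf{C}},\quad\mathbf{v}\mapsto f(g^{-1}.\mathbf{v})=f(T_{g^{-1}}(\mathbf{v})).$$

1.4 Remark.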

In particular, we have $(T_{g}^{1}(f))(\mathbf{v})=f(T_{g^{-1}}(\mathbf{v})),$ see proposition 1.6 below.

Keep in mind that a function $f\in R_{d}$ maps to $T_{g}^{d}(f)=g.f$ . Setting $A_{g}:=[T_{g}^{1}]_{\mathbf{x},\mathbf{x}}$ , then $A_{g}^{[d]}:=[T_{g}^{d}]_{\mathbf{x}_{d},\mathbf{x}_{d}}$ is the $d$ –th induced matrix in [Slo77], because $T_{g}^{1}(f\cdot f^{\prime})=T_{g}^{1}(f)\cdot T_{g}^{1}(f^{\prime})$ . Also, if $f,f^{\prime}$ are eigenvectors of $T_{g}^{1}$ corresponding to the eigenvalues $\lambda,\lambda^{\prime}$ , then $f\cdot f^{\prime}$ is an eigenvector of $T_{g}^{2}$ with eigenvalue $\lambda\cdot\lambda^{\prime}$ , because $T_{g}(f\cdot f^{\prime})=T_{g}(f)\cdot T_{g}(f^{\prime})=(\lambda f)\cdot(\lambda^{\prime}f^{\prime})=(\lambda\lambda^{\prime})(f\cdot f^{\prime})$ . All this generalizes to $d>2$ , we will get back to that later.

We end this section by verifying two little facts needed in the next section.

1.5 Proposition.

The first induced operator of the inverse of a group element $g\in G$ is given by $T_{g^{-1}}^{1}=(T_{g}^{1})^{-1}$ .

Proof.

Since $\dim(V^{\ast})<\infty$ , it is sufficient to prove that $T_{g^{-1}}^{1}\circ T_{g}^{1}=\mathrm{id}_{V^{\ast}}$ . Keep in mind that $(T_{g}^{1}(f))(\mathbf{v})=f(T_{g^{-1}}(\mathbf{v}))$ . For arbitrary $f\in V^{\ast}$ we see that

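$$(T_{g^{-1}}^{1}\circ T_{g}^{1})(f)=T_{g^{-1}}^{1}(T_{g}^{1}(f))=T_{g^{-1}}^{1}(g.f)=g^{-1}.(g.f)=(g^{-1}g).f=f.$$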

∎

We will be mixing group action notation and composition freely, depending on the context. The following observation is a translation device.

1.6 Proposition.

For $g\in G$ and $f\in V^{\ast}$ the following holds:

$$T_{g}^{1}(f)=g.f=f\circ T_{g^{-1}}.$$

Proof.

For $\mathbf{v}\in V$ we see $(T_{g}^{1}(f))(\mathbf{v})=(g.f)(\mathbf{v})\overset{\text{def}}{=}f(g^{-1}.\mathbf{v})=f(T_{g^{-1}}(\mathbf{v})).$ ∎

2 The Magic Square

Remember that we require a unitary representation of $G$, that is, the operators $T_{g}:V\to V$ need to be unitary, i.e. $\forall g\in G:(T_{g})^{-1}=(T_{g})^{\ast}$. The first goal of this section is to show that this implies that the induced operators $T_{g}^{d}:R_{d}\to R_{d},f\mapsto g.f$ are also unitary. We saw that $R_{1}=V^{\ast}$, the algebraic dual of $V$. In order to understand the operator duals of $V$ and $V^{\ast}$ we need to look at their inner products first. We may assume that the operators $T_{g}$ are unitary with respect to the standard inner product $\left\langle\mathbf{u}\,,\mathbf{v}\right\rangle=[\mathbf{u}]_{\mathcal{B},\mathcal{B}}\bullet\overline{[\mathbf{v}]_{\mathcal{B},\mathcal{B}}}$, where $\bullet$ denotes the dot product.

Before we can speak of unitarity of the induced operators $T_{g}^{d}$ we have to make clear which inner product applies on $R_{1}=V^{\ast}$. Quite naively, for $f,g\in V^{\ast}$ we are tempted to define $\left\langle f\,,g\right\rangle=[f]_{\mathbf{x},\mathbf{x}}\bullet\overline{[g]_{\mathbf{x},\mathbf{x}}}$.

We will motivate this in a while, but first we take a look at the diagram in [Rom 08], chapter 10, with our objects:

[Diagram: a square relating the operators $T_{g}^{1}$ and $T_{g}^{\times}$ on $V^{\ast}$ to the operators $T_{g}$ and $T_{g}^{\ast}$ on $V$, with the Riesz map $P$ as vertical arrows]

Here $P$ (“Rho”) denotes the Riesz map, see [Rom 08], Theorem 9.18, where it is called $R$, but $R$ already denotes our big ring. We started by looking at the operator $T_{g}$, which is unitary, so its inverse is the Hilbert space adjoint $T_{g}^{\ast}$. Omitting the names of the bases we have $[T_{g}^{\ast}]=[T_{g}]^{\ast}$. We also see the operator adjoint $T_{g}^{\times}$ with matrix $[T_{g}^{\times}]=[T_{g}]^{\top}$, the transpose. However, the arrow for $T_{g}^{1}$ is not in the original diagram, but soon we will see it there, too.

Fortunately, the Riesz map $P$ turns a linear form into a vector and its inverse $\tau:V\to V^{\ast}$ maps a vector to a linear form, both are conjugate isomorphisms. This is mostly all we need in order to show that $T_{g}^{1}$ is unitary. In the following three propositions we use that $V$ has the orthonormal basis $\mathcal{B}$ and that $V^{\ast}$ has the orthonormal basis $\mathbf{x}$ .

2.1 Proposition.

For every $f\in V^{\ast}$ the coordinates of its Riesz vector are given by

$$[P(f)]_{\mathbf{e}}=(\overline{f(\mathbf{e}_{1})},\dots,\overline{f(\mathbf{e}_{n})}).$$

Proof.

Writing $\tau$ for the inverse of $P$ , we need to show that

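$$P(f)=\sum_{i=1}^{n}\overline{f(\mathbf{e}_{i})}\,\mathbf{e}_{i}$$

which is equivalent to

$$f=\tau\left(\sum_{i=1}^{n}\overline{f(\mathbf{e}_{i})}\,\mathbf{e}_{i}\right).$$

It is sufficient to show the latter for values of $f$ on the basis vectors $\mathbf{e}_{j}$, $1\leq j\leq n$. We obtain

$$\left(\tau\left(\sum_{i=1}^{n}\overline{f(\mathbf{e}_{i})}\,\mathbf{e}_{i}\right)\right)(\mathbf{e}_{j})=\sum_{i=1}^{n}\overline{\overline{f(\mathbf{e}_{i})}}\left\langle\mathbf{e}_{j}\,,\mathbf{e}_{i}\right\rangle=f(\mathbf{e}_{j})\cdot 1.$$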

∎

In particular, this implies that $P(x_{i})=\mathbf{e}_{i}$ .

2.2 Proposition.

Our makeshift inner product on $V^{\ast}$ satisfies

$$\left\langle f\,,g\right\rangle=\left\langle P(f)\,,P(g)\right\rangle,$$

where $f,g\in V^{\ast}$ .

Proof.

By our vague definition we have $\left\langle f\,,g\right\rangle=[f]_{\mathbf{x},\mathbf{x}}\bullet\overline{[g]_{\mathbf{x},\mathbf{x}}}$ . It is enough to show that $\left\langle x_{i}\,,x_{j}\right\rangle=\left\langle P(x_{i})\,,P(x_{j})\right\rangle$ . From the comment after the proof of Proposition 2.1 we obtain

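$$\left\langle P(x_{i})\,,P(x_{j})\right\rangle=\left\langle\mathbf{e}_{i}\,,\mathbf{e}_{j}\right\rangle=\delta_{ij}=\mathbf{e}_{i}\bullet\mathbf{e}_{j}=[x_{i}]_{\mathbf{x},\mathbf{x}}\bullet\overline{[x_{j}]_{\mathbf{x},\mathbf{x}}}.$$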

∎

Hence, our guess for the inner product on $V^{\ast}$ was correct. We will now relate the Riesz vector of $f\in V^{\ast}$ to the Riesz vector of $f\circ T_{g}^{-1}$ . Recall that the Riesz vector of $f\in V^{\ast}$ is the unique vector $\mathbf{w}=P(f)$ such that $f(\mathbf{v})=\left\langle\mathbf{v}\,,\mathbf{w}\right\rangle$ for all $\mathbf{v}\in V$ . If $f\neq 0$ it can be found by scaling any nonzero vector in the cokernel of $f$ , which is one–dimensional, see [Rom 08], in particular Theorem 9.18.

2.3 Proposition.

Let $T_{g}:V\to V$ be unitary, $f\in V^{\ast}$, and $\mathbf{w}=P(f)$ the Riesz vector of $f$. Then $T_{g}(\mathbf{w})$ is the Riesz vector of $f\circ T_{g}^{-1}$, i.e. the Riesz vector of $T^{1}_{g}(f)$.

Proof.

We may assume that $f\neq 0$ . Using the notation $\left\langle\mathbf{w}\right\rangle$ for the one–dimensional subspace spanned by $\mathbf{w}$ , we start with a little diagram:

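$$\left\langle\mathbf{w}\right\rangle\odot\ker(f)\overset{T_{g}}{\longrightarrow}\left\langle T_{g}(\mathbf{w})\right\rangle\odot\ker(f\circ T_{g}^{-1}),$$

where $\odot$ denotes the orthogonal direct sum.

We need to show that $f\circ T_{g}^{-1}=\left\langle\cdot\,,T_{g}(\mathbf{w})\right\rangle$, i.e. that $(f\circ T_{g}^{-1})(\mathbf{v})=\left\langle\mathbf{v}\,,T_{g}(\mathbf{w})\right\rangle$ for all $\mathbf{v}\in V$. Since $\mathbf{w}=P(f)$ is the Riesz vector of $f$, we have $f(\mathbf{v})=\left\langle\mathbf{v}\,,\mathbf{w}\right\rangle$ for all $\mathbf{v}\in V$. We obtain

$$(f\circ T_{g}^{-1})(\mathbf{v})=f(T_{g}^{-1}(\mathbf{v}))=\left\langle T_{g}^{-1}(\mathbf{v})\,,\mathbf{w}\right\rangle=\left\langle\mathbf{v}\,,T_{g}(\mathbf{w})\right\rangle,$$

where the last step uses that $T_{g}$ is unitary.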

From remark 1.4 we conclude that $f\circ T_{g}^{-1}=T^{1}_{g}(f)$ . ∎

Observe that proposition 2.3 implies the commutativity of the following two diagrams.

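$$\begin{array}{ccc}V^{\ast}&\xrightarrow{\;T_{g}^{1}\;}&V^{\ast}\\ \downarrow P&&\downarrow P\\ V&\xrightarrow{\;T_{g}\;}&V\end{array}\qquad\text{and}\qquad\begin{array}{ccc}V^{\ast}&\xrightarrow{\;(T_{g}^{1})^{-1}\;}&V^{\ast}\\ \downarrow P&&\downarrow P\\ V&\xrightarrow{\;(T_{g})^{-1}\;}&V\end{array}$$

Indeed, 2.3 implies

$$P\circ T_{g}^{1}=T_{g}\circ P\quad(1)\qquad\text{and}\qquad P\circ(T_{g}^{1})^{-1}=(T_{g})^{-1}\circ P\quad(2).$$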

2.4 Proposition.

The first induced operator $T_{g}^{1}$ is unitary.

Proof.

We may use that $T_{g}$ is unitary, that is,

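$$\left\langle T_{g}(\mathbf{v})\,,\mathbf{w}\right\rangle=\left\langle\mathbf{v}\,,(T_{g})^{-1}(\mathbf{w})\right\rangle=\left\langle\mathbf{v}\,,T_{g^{-1}}(\mathbf{w})\right\rangle\quad(\ast).$$

Let $f,h\in V^{\ast}$ be arbitrary, $\mathbf{w}:=P(f)$, and $\mathbf{u}:=P(h)$. We need to check that $\left\langle(T_{g}^{1})(f)\,,h\right\rangle=\left\langle f\,,(T_{g}^{1})^{-1}(h)\right\rangle$. We see that

$$\begin{aligned}\left\langle T_{g}^{1}(f)\,,h\right\rangle&=\left\langle(T_{g}\circ P)(f)\,,P(h)\right\rangle=\left\langle T_{g}(\mathbf{w})\,,\mathbf{u}\right\rangle\overset{\ast}{=}\left\langle\mathbf{w}\,,T_{g}^{-1}(\mathbf{u})\right\rangle\\&=\left\langle P(f)\,,T_{g}^{-1}(P(h))\right\rangle=\left\langle P(f)\,,(T_{g}^{-1}\circ P)(h)\right\rangle\\&\overset{(2)}{=}\left\langle P(f)\,,(P\circ(T_{g}^{1})^{-1})(h)\right\rangle=\left\langle P(f)\,,P((T_{g}^{1})^{-1}(h))\right\rangle\\&=\left\langle f\,,(T_{g}^{1})^{-1}(h)\right\rangle.\end{aligned}$$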

∎

After having looked at eigenvalues we will see that this generalizes to higher degree, that $T_{g}^{d}$ is diagonalizable for all $d\in{\mathbf{Z}}^{+}$ . But first let us look at the matrix version of proposition 2.4.

2.5 Proposition.

$$[T_{g}^{1}]_{\mathbf{x},\mathbf{x}}=\overline{[T_{g}]_{\mathbf{e},\mathbf{e}}}$$

Proof.

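Let $A:=[T_{g}]_{\mathcal{B},\mathcal{B}}=[A_{1}|\cdots|A_{i}|\cdots|A_{n}]=[a_{i,j}]$ and $B:=[T_{g}^{1}]_{\mathbf{x},\mathbf{x}}=[B_{1}|\cdots|B_{i}|\cdots|B_{n}]=[b_{i,j}]$. We will use the commutativity of the diagram, i.e. $P^{-1}\circ T_{g}\circ P=T_{g}^{1}$, which we will mark as $\square$. No, the proof is not finished here. We get $T_{g}(\mathbf{e}_{i})=A_{i}=\sum_{k=1}^{n}\,a_{k,i}\mathbf{e}_{k}$ and

$$T_{g}^{1}(x_{i})\overset{\square}{=}P^{-1}(T_{g}(\mathbf{e}_{i}))=P^{-1}\left(\sum_{k=1}^{n}a_{k,i}\mathbf{e}_{k}\right)\overset{\text{conj.}}{=}\sum_{k=1}^{n}\overline{a_{k,i}}\,P^{-1}(\mathbf{e}_{k})=\sum_{k=1}^{n}\overline{a_{k,i}}\,x_{k},$$

where the first step uses $\square$ together with $P(x_{i})=\mathbf{e}_{i}$, the step marked conj. uses that $P^{-1}$ is conjugate linear, and the last step uses $P^{-1}(\mathbf{e}_{k})=x_{k}$.

On the other hand, $[T^{1}_{g}(x_{i})]_{\mathbf{x}}=[T^{1}_{g}]_{\mathbf{x},\mathbf{x}}\mathbf{e}_{i}=B_{i}$ implies $T^{1}_{g}(x_{i})=\sum_{k=1}^{n}\,b_{k,i}x_{k}$. Together we obtain $b_{k,i}=\overline{a_{k,i}}$, and the proposition follows. ∎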
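The matrix statement of Proposition 2.5 can be checked on a small example. The sketch below is an illustration under assumptions, not part of the original argument: the $2\times 2$ unitary matrix `A` is an arbitrary hypothetical choice, and the function names are made up. The matrix of $T_{g}^{1}$ is computed directly from $(T_{g}^{1}(x_{i}))(\mathbf{e}_{k})=x_{i}(T_{g}^{-1}(\mathbf{e}_{k}))$ and then compared with the entrywise conjugate of `A`.

```python
def conj_transpose(a):
    """Conjugate transpose of a square matrix given as a list of rows."""
    n = len(a)
    return [[a[j][i].conjugate() for j in range(n)] for i in range(n)]

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def mat_vec(a, v):
    """Matrix times column vector."""
    return [sum(a[i][k] * v[k] for k in range(len(v))) for i in range(len(a))]

def entrywise_conj(a):
    """Complex conjugate of every entry."""
    return [[z.conjugate() for z in row] for row in a]

def induced_matrix(a):
    """Matrix B of f -> f o A^{-1} in the dual basis x, assuming a is unitary.

    Entry B[k][i] is the e_k-coordinate of x_i o A^{-1}, i.e. the i-th
    coordinate of A^{-1} e_k.
    """
    n = len(a)
    a_inv = conj_transpose(a)  # valid only because a is assumed unitary
    basis = [[1 if r == k else 0 for r in range(n)] for k in range(n)]
    return [[mat_vec(a_inv, basis[k])[i] for i in range(n)] for k in range(n)]

# Hypothetical example of a unitary matrix with a non-real entry.
A = [[0j, 1 + 0j],
     [1j, 0j]]

IDENTITY = [[1, 0], [0, 1]]
print(mat_mul(A, conj_transpose(A)) == IDENTITY)   # A is unitary
print(induced_matrix(A) == entrywise_conj(A))      # statement of Proposition 2.5
```

The entries of `A` were chosen so that all floating point products are exact; for a general unitary matrix the comparison would need a tolerance.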

3 Averaging over the Group

Now we apply averaging to obtain self-adjoint operators.

3.1 Definition.

We define the following operators:

1. $\displaystyle\hat{T}:V\to V,\mathbf{v}\mapsto\hat{T}(\mathbf{v}):=\frac{1}{|G|}\sum_{g\in G}\,T_{g}(\mathbf{v})$

2. $\displaystyle\hat{T^{1}}:V^{\ast}\to V^{\ast},f\mapsto\hat{T^{1}}(f):=\frac{1}{|G|}\sum_{g\in G}\,T^{1}_{g}(f)$

These are sometimes called the Reynolds operator of $G$ .

3.2 Proposition.

The operators $\hat{T}$ and $\hat{T^{1}}$ are self-adjoint (Hermitian).

Proof.

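The idea of the averaging trick is that if $g\in G$ runs through all group elements and $g^{\prime}\in G$ is fixed, then the products $g^{\prime}g$ also run through all group elements. We will make use of the facts that every $T_{g}$ and every $T^{1}_{g}$ is unitary.

1. We need to show that $\left\langle\hat{T}(\mathbf{v})\,,\mathbf{w}\right\rangle=\left\langle\mathbf{v}\,,\hat{T}(\mathbf{w})\right\rangle$ for arbitrary $\mathbf{v},\mathbf{w}\in V$. We obtain

$$\begin{aligned}\left\langle\hat{T}(\mathbf{v})\,,\mathbf{w}\right\rangle&\overset{\text{unit.}}{=}\frac{1}{|G|}\sum_{g\in G}\left\langle\mathbf{v}\,,(T_{g})^{-1}(\mathbf{w})\right\rangle=\frac{1}{|G|}\sum_{g\in G}\left\langle\mathbf{v}\,,T_{g^{-1}}(\mathbf{w})\right\rangle\\&=\frac{1}{|G|}\sum_{g^{\prime}\in G}\left\langle\mathbf{v}\,,T_{g^{\prime}}(\mathbf{w})\right\rangle=\left\langle\mathbf{v}\,,\hat{T}(\mathbf{w})\right\rangle.\end{aligned}$$

2. The same proof, mutatis mutandis, replacing $\hat{T}\leftrightarrow\hat{T^{1}}$, $T_{g}\leftrightarrow T_{g}^{1}$, $\mathbf{v}\leftrightarrow f$, and $\mathbf{w}\leftrightarrow h$ shows that $\left\langle\hat{T^{1}}(f)\,,h\right\rangle=\left\langle f\,,\hat{T^{1}}(h)\right\rangle.$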

∎

Consequently, $\hat{T}$ and $\hat{T^{1}}$ are unitarily diagonalizable with real spectrum.

3.3 Proposition.

The operators $\hat{T}$ and $\hat{T^{1}}$ are idempotent, i.e.

1. $\hat{T}\circ\hat{T}=\hat{T}$

2. $\hat{T^{1}}\circ\hat{T^{1}}=\hat{T^{1}}$.

In particular, the eigenvalues of both operators are either $0$ or $1$.

Proof.

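Again, we show only one part, the other part is analogous. To begin with, let $s\in G$ be fixed. Then

$$T_{s}\circ\hat{T}=\frac{1}{|G|}\sum_{g\in G}T_{sg}=\frac{1}{|G|}\sum_{g^{\prime}\in G}T_{g^{\prime}}=\hat{T}.$$

From this it follows that

$$\hat{T}\circ\hat{T}=\left(\frac{1}{|G|}\sum_{s\in G}T_{s}\right)\circ\hat{T}=\frac{1}{|G|}\sum_{s\in G}(T_{s}\circ\hat{T})=\frac{1}{|G|}\sum_{s\in G}\hat{T}=\hat{T}.$$

From $\hat{T}\circ\hat{T}=\hat{T}$ we conclude that $\hat{T}\circ(\hat{T}-\mathrm{id})=0$. Thus the minimal polynomial of $\hat{T}$ divides the polynomial $\lambda(\lambda-1)$, so all eigenvalues are contained in $\left\{0,1\right\}$. ∎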
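Propositions 3.2 and 3.3 are easy to observe on a concrete group. The following sketch makes assumptions beyond the text: $G=S_{3}$ acts on ${\mathbf{C}}^{3}$ by permutation matrices (which are real orthogonal, hence unitary), and the helper names are hypothetical. Exact rational arithmetic is used so that the comparisons are not affected by rounding.

```python
from fractions import Fraction
from itertools import permutations

def perm_matrix(g):
    """Matrix of the linear map e_j -> e_{g[j]} (column j has its 1 in row g[j])."""
    n = len(g)
    return [[Fraction(1 if g[j] == i else 0) for j in range(n)] for i in range(n)]

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def transpose(a):
    """Transpose; for real matrices this is the adjoint."""
    return [list(row) for row in zip(*a)]

def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))

def reynolds(matrices):
    """Average of the given matrices, the matrix of T-hat."""
    n = len(matrices[0])
    size = len(matrices)
    return [[sum(m[i][j] for m in matrices) / size for j in range(n)] for i in range(n)]

group = [perm_matrix(g) for g in permutations(range(3))]
t_hat = reynolds(group)

print(t_hat == transpose(t_hat))          # self-adjoint (real symmetric here)
print(mat_mul(t_hat, t_hat) == t_hat)     # idempotent
print(trace(t_hat))                       # number of eigenvalues equal to 1
```

In this example every entry of `t_hat` equals $\frac{1}{3}$ and the trace is $1$, which matches the single linear invariant $x_{1}+x_{2}+x_{3}$ of the symmetric group.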

We will now look at the eigenvalues of $T_{g}$ and $T^{1}_{g}$ and their interrelation. Since both operators are unitary, their eigenvalues have absolute value $1$ .

3.4 Proposition.

1. If $\mathbf{v}\in V$ is an eigenvector of $T_{g}$ for the eigenvalue $\lambda$, then $\mathbf{v}$ is an eigenvector of $T_{g^{-1}}$ for the eigenvalue $\overline{\lambda}=\frac{1}{\lambda}$.

2. If $f\in V^{\ast}$ is an eigenvector of $T^{1}_{g}$ for the eigenvalue $\lambda$, then $f$ is an eigenvector of $T^{1}_{g^{-1}}$ for the eigenvalue $\frac{1}{\lambda}$.

3. If $f\in V^{\ast}$ is an eigenvector of $T^{1}_{g}$ for the eigenvalue $\lambda$, then $P(f)\in V$ is an eigenvector of $T_{g}$ for the eigenvalue $\overline{\lambda}=\frac{1}{\lambda}$.

4. If $\mathbf{v}\in V$ is an eigenvector of $T_{g}$ for the eigenvalue $\lambda$, then $P^{-1}(\mathbf{v})\in V^{\ast}$ is an eigenvector of $T^{1}_{g}$ for the eigenvalue $\overline{\lambda}=\frac{1}{\lambda}$.

Proof.

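We will make use of the commutativity of the diagrams following Proposition 2.3. Observe that $g.\mathbf{v}=T_{g}(\mathbf{v})$ and $g.f=f\circ T_{g^{-1}}$. Recall that $|\lambda|=1$, so $\overline{\lambda}=\frac{1}{\lambda}$.

1. $T_{g^{-1}}(\mathbf{v})=T_{g^{-1}}\left(\frac{1}{\lambda}T_{g}(\mathbf{v})\right)=\frac{1}{\lambda}(T_{g^{-1}}\circ T_{g})(\mathbf{v})=\frac{1}{\lambda}\mathbf{v}$

2. $T^{1}_{g^{-1}}(f)=T^{1}_{g^{-1}}\left(\frac{1}{\lambda}T^{1}_{g}(f)\right)=\frac{1}{\lambda}(T^{1}_{g^{-1}}\circ T^{1}_{g})(f)=\frac{1}{\lambda}f$

3. $T_{g}(P(f))=P(T^{1}_{g}(f))=P(\lambda f)=\overline{\lambda}P(f)$, since $P$ is conjugate linear.

4. $T^{1}_{g}(P^{-1}(\mathbf{v}))=P^{-1}(T_{g}(\mathbf{v}))=P^{-1}(\lambda\mathbf{v})=\overline{\lambda}P^{-1}(\mathbf{v})$, since $P^{-1}$ is conjugate linear.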

∎

This implies that if we consider the union of the spectra over all $g\in G$ , then we obtain the same (multi)set, no matter if we take $T_{g}$ or $T^{1}_{g}$ .

4 Eigenvectors and eigenvalues

Now we continue from where we left off at the end of section 1, fixing one group element $g\in G$ and comparing $T_{g}^{1}$ with $T_{g}^{d}$ for $d>1$. By a method called stars and bars it is easy to see that

$$\tilde{d}:=\dim(R_{d})=\binom{n+d-1}{d}.$$
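The stars and bars count says that the number of monomials of total degree $d$ in $n$ variables is $\binom{n+d-1}{d}$. As a quick sanity check, and only as an illustrative sketch with hypothetical function names, the monomials can be enumerated and counted:

```python
from itertools import combinations_with_replacement
from math import comb

def monomials(n, d):
    """Monomials of total degree d in x_1..x_n, each as a sorted tuple of variable indices.

    For example (1, 1, 3) stands for x_1^2 * x_3.
    """
    return list(combinations_with_replacement(range(1, n + 1), d))

def dim_r_d(n, d):
    """Stars and bars count of the monomials of degree d in n variables."""
    return comb(n + d - 1, d)

# The enumeration order matches the listing x_1^2, x_1 x_2, x_1 x_3, x_2^2, ...
print(monomials(3, 2))
print(all(len(monomials(n, d)) == dim_r_d(n, d) for n in range(1, 5) for d in range(0, 6)))
```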

Remember that every $T_{g}^{1}$ is unitarily diagonalizable with eigenvalues of absolute value $1$ . If $\mathrm{spec}(T_{g}^{1})=(\omega_{1},\dots,\omega_{n})\in U(1)^{n}$ , then $V^{\ast}$ has an orthonormal basis $\mathbf{y}_{g}^{1}:=(y_{1},\dots,y_{n})$ , such that $T_{g}^{1}(y_{i})=\omega_{i}\cdot y_{i}$ for all $1\leq i\leq n$ , and $[T_{g}^{1}]_{\mathbf{y}_{g}^{1},\mathbf{y}_{g}^{1}}=\mathrm{diag}(\omega_{1},\dots,\omega_{n})$ . Moreover,

[TABLE]

where $[\mathrm{id}]_{\mathbf{y}_{g}^{1},\mathbf{x}}=[\mathrm{id}]_{\mathbf{x},\mathbf{y}_{g}^{1}}^{\ast}$ is unitary.

For $d>1$ put

[TABLE]

all monomials in the $x_{i}$ of total degree $d$ , numbered from $1$ to $\tilde{d}$ .

These are certainly linearly independent, since we have no relations amongst the variables, and span $R_{d}$, since every polynomial of total degree $d$ can be written as a linear combination of these. So they form a basis for $R_{d}$. We will not require that this can be made into an orthonormal basis; we do not even consider any inner product on $R_{d}$ for $d>1$.

We rather want to establish that

[TABLE]

is a basis of eigenvectors of $T_{g}^{d}$ diagonalizing $T_{g}^{d}$ , using the same numbering.

Arranging the eigenvalues of $T_{g}^{1}$ in the same way we put

[TABLE]

Now we establish that the $\tilde{y_{i}}$, $1\leq i\leq\tilde{d}$, are eigenvectors of $T_{g}^{d}$ for the eigenvalues $\tilde{\omega_{i}}$.

4.1 Proposition.

In the context above,

$$T_{g}^{d}(\tilde{y_{i}})=\tilde{\omega_{i}}\cdot\tilde{y_{i}}$$

for all $1\leq i\leq\tilde{d}$ .

Proof.

The key is proposition 1.2, as in the preliminary observations at the end of section 1. Let

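$$\tilde{y_{i}}=y_{1}^{\epsilon_{1}}y_{2}^{\epsilon_{2}}\cdots y_{n}^{\epsilon_{n}}$$

and

$$\tilde{\omega_{i}}=\omega_{1}^{\epsilon_{1}}\omega_{2}^{\epsilon_{2}}\cdots\omega_{n}^{\epsilon_{n}},$$

where $\epsilon_{j}\in{\mathbf{N}}$ and the sum of these exponents is $d$. Then

$$T_{g}^{d}(\tilde{y_{i}})=(T_{g}^{1}(y_{1}))^{\epsilon_{1}}\cdots(T_{g}^{1}(y_{n}))^{\epsilon_{n}}=(\omega_{1}y_{1})^{\epsilon_{1}}\cdots(\omega_{n}y_{n})^{\epsilon_{n}}=\tilde{\omega_{i}}\cdot\tilde{y_{i}}.$$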

∎

As a consequence, $R_{d}$ has a basis of eigenvectors of $T_{g}^{d}$ and $T_{g}^{d}$ is similar to the diagonal matrix $\mathrm{diag}(\tilde{\omega_{1}},\dots,\tilde{\omega}_{\tilde{d}})$ .
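Since the eigenvalues of $T_{g}^{d}$ are the products of $d$ eigenvalues of $T_{g}^{1}$, one product per monomial, the trace of $T_{g}^{d}$ should agree with the coefficient of $\lambda^{d}$ in $\prod_{i}\frac{1}{1-\lambda\omega_{i}}$. The sketch below compares the two numbers; it is an illustration with hypothetical names, and it assumes the sample spectrum $(1,-1,-1)$, chosen because these eigenvalues have absolute value $1$ and allow exact integer arithmetic.

```python
from itertools import combinations_with_replacement
from math import prod

def trace_induced(omegas, d):
    """Sum of the eigenvalues of T_g^d: one product of d eigenvalues per monomial."""
    return sum(prod(choice) for choice in combinations_with_replacement(omegas, d))

def inverse_det_series(omegas, max_degree):
    """Coefficients of prod_i 1/(1 - lambda*omega_i) up to lambda^max_degree."""
    coeffs = [1] + [0] * max_degree
    for w in omegas:
        # Multiply by the geometric series 1 + w*lambda + w^2*lambda^2 + ...
        # in place, using new[k] = old[k] + w * new[k-1].
        for k in range(1, max_degree + 1):
            coeffs[k] += w * coeffs[k - 1]
    return coeffs

OMEGAS = (1, -1, -1)   # assumed sample spectrum of T_g^1
MAX_DEGREE = 6

series = inverse_det_series(OMEGAS, MAX_DEGREE)
traces = [trace_induced(OMEGAS, d) for d in range(MAX_DEGREE + 1)]
print(series == traces)
```

For complex roots of unity the same comparison works with `complex` values, but then equality should be tested up to a small tolerance.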

5 Molien's Theorem

We will now make some final preparations and then present the proof of Molien's Theorem.

For $f\in R$ and $g\in G$ we say that $f$ is an invariant of $g$ if $g.f=f$ and that $f$ is a (simple) invariant of $G$ if $\forall g\in G:g.f=f$ . The method of averaging from section 3 can also be applied to create invariants:

5.1 Proposition.

For $f\in V^{\ast}$ put $\hat{f}:=\hat{T^{1}}(f)$ . Then $\hat{f}$ is an invariant of $G$ .

Proof.

Let $g\in G$ be arbitrary. We will show that $g.\hat{f}=\hat{f}$ . Clearly, from proposition 1.6 we get that

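$$g.\hat{f}=T_{g}^{1}\left(\frac{1}{|G|}\sum_{s\in G}T_{s}^{1}(f)\right)=\frac{1}{|G|}\sum_{s\in G}T_{gs}^{1}(f)=\frac{1}{|G|}\sum_{s^{\prime}\in G}T_{s^{\prime}}^{1}(f)=\hat{f}.$$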

∎

Now, we call

$$R^{G}:=\left\{\,f\in R{\,\,:\,\,}\forall g\in G:g.f=f\,\right\}$$

the algebra of invariants of $G$ .

5.2 Proposition.

$R^{G}$ is a subalgebra of $R$.

Proof.

Since the mapping $f\mapsto g.f$ is linear for every $g\in G$, $R^{G}$ is the intersection of subspaces, and hence a subspace. Let us check the subring conditions in more detail. For arbitrary $g\in G$, $f,h\in R^{G}$, and $\mathbf{v}\in V$ we have $g.f=f$, $g.h=h$.

1. For the zero $0\in R$ we obtain $(g.0)(\mathbf{v})=0(g^{-1}.\mathbf{v})=0(\mathbf{v})$, so $0\in R^{G}$.

2. We see $g.(f+h)=g.f+g.h=f+h$, so $f+h\in R^{G}$.

3. Likewise, $g.(f\cdot h)=g.f\cdot g.h=f\cdot h$, so $f\cdot h\in R^{G}$.

∎

Our subalgebra $R^{G}$ is graded in the same way as $R$ .

5.3 Proposition.

The algebra of invariants of $G$ is naturally graded as

$$R^{G}=\bigoplus_{d\in{\mathbf{N}}}R^{G}_{d},$$

where $R^{G}_{d}=\left\{\,f\in R_{d}{\,\,:\,\,}\forall g\in G:g.f=f\,\right\}$ , called the $d$ –th homogeneous component of $R^{G}$ .

Proof.

This follows directly from proposition 1.1 and proposition 1.2. ∎

5.4 Definition (Molien series).

Viewing $R^{G}_{d}$ as a vector space, we define

$$a_{d}:=\dim(R^{G}_{d}),$$

the number of linearly independent homogeneous invariants of degree $d\in{\mathbf{N}}$, and

$$\Phi_{G}(\lambda):=\sum_{d\in{\mathbf{N}}}a_{d}\lambda^{d},$$

the Molien series of $G$ .

Thus, the Molien series of $G$ is an ordinary power series generating function whose coefficients are the numbers of linearly independent homogeneous invariants of degree $d$ . The following beautiful formula gives these numbers, its proof is the aim of this paper.

5.5 Theorem (Molien, 1897).

$$\Phi_{G}(\lambda)=\frac{1}{|G|}\sum_{g\in G}\frac{1}{\det(\mathrm{id}-\lambda T_{g})}$$

Following [Slo77] we first look at the number $a_{1}$ of linearly independent homogeneous invariants of degree $1$.

5.6 Theorem (Theorem 13 in [Slo77]).

$$a_{1}=\mathrm{Tr}(\hat{T})=\mathrm{Tr}(\hat{T^{1}})$$

Proof.

First, we note that the equation $\mathrm{Tr}(\hat{T})=\mathrm{Tr}(\hat{T^{1}})$ follows from the remark at the end of section 3, since the sum for the trace runs over all group elements. Remember that the trace is independent of the choice of basis. From proposition 3.3 we know that both operators are idempotent and Hermitian, and $V^{\ast}$ has an orthonormal basis $\mathbf{f}=(f_{1},\dots,f_{n})$ of eigenvectors of $\hat{T^{1}}$, corresponding to the eigenvalues $\lambda_{1},\dots,\lambda_{n}\in\left\{0,1\right\}$, so

$$[\hat{T^{1}}]_{\mathbf{f},\mathbf{f}}=\mathrm{diag}(\lambda_{1},\dots,\lambda_{n}).$$

Let us say that this matrix has $r$ entries $1$ and the remaining $n-r$ entries $0$. By rearranging the eigenvalues and eigenvectors we may assume that the first $r$ entries are $1$ and the remaining $n-r$ are $0$, i.e.

$$[\hat{T^{1}}]_{\mathbf{f},\mathbf{f}}=\mathrm{diag}(\underbrace{1,\dots,1}_{r},\underbrace{0,\dots,0}_{n-r}).$$

Hence $\hat{T^{1}}(f_{i})=f_{i}$ for $1\leq i\leq r$ and $\hat{T^{1}}(f_{i})=0$ for $r+1\leq i\leq n$ . Any linear invariant of $G$ is certainly fixed by $\hat{T^{1}}$ , so $a_{1}\leq r$ . On the other hand, by proposition 5.1, $\hat{f_{i}}:=\hat{T^{1}}(f_{i})=\lambda_{i}f_{i}$ is an invariant of $G$ for every $1\leq i\leq r$ , so $a_{1}\geq r$ . Together, $a_{1}=r$ . ∎

Before the final proof, let us introduce a handy notation.

5.7 Definition.

Let $p(\lambda)\in{\mathbf{C}}[\lambda]$ or $p(\lambda)\in{\mathbf{C}}[[\lambda]]$ . Then $[\lambda^{i}]:p(\lambda)$ denotes the coefficient of $\lambda^{i}$ in $p(\lambda)$ .

So, for example $[x^{2}]:2x^{3}+42x^{2}-6=42$ and $[\lambda^{d}]:\Phi_{G}(\lambda)=a_{d}$ .

Proof.

(Molien's Theorem) We just established the case $d=1$, so the reader is probably expecting a proof by induction over $d$. But this is not the case. Rather, the case $d=1$ applies to all $d>1$. Note that $a_{d}$ is equal to the number of linearly independent invariants of all of the $T_{g}^{d}$. So Theorem 5.6 gives us

[TABLE]

where the latter includes the first. From definition 3.1 we also have

[TABLE]

so we already know that

[TABLE]

So all we need to show is

[TABLE]

We will show that for every summand (group element) the equation

[TABLE]

holds. From proposition 4.1 we get for every $g\in G$ that

[TABLE]

sum of the products of the $\omega_{1},\omega_{2},\dots,\omega_{n}$ , taken $d$ of them at a time. On the other hand, for the same $g\in G$ we obtain from section 4 that $[T_{g}^{1}]_{\mathbf{y}_{g}^{1},\mathbf{y}_{g}^{1}}=\mathrm{diag}(\omega_{1},\dots,\omega_{n})$ so that

[TABLE]

so

[TABLE]

and here the coefficient of $\lambda^{d}$ is also sum of the products of $\omega_{1},\omega_{2},\dots,\omega_{n}$ , taken $d$ of them at a time.

Again, the last claim

[TABLE]

follows from the remark at the end of section 3.2, since the sum runs over all group elements. ∎
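To see the theorem at work, the coefficients $a_{d}$ can be computed from the right-hand side of Molien's formula for small matrix groups. The following is a minimal sketch, not the method of the paper: instead of expanding determinants, it uses the standard recurrence $d\,h_{d}=\sum_{k=1}^{d}\mathrm{Tr}(A^{k})\,h_{d-k}$ for the coefficients $h_{d}$ of $\frac{1}{\det(\mathrm{id}-\lambda A)}$. The function names and the two example groups ($S_{3}$ as permutation matrices and $\{\mathrm{id},-\mathrm{id}\}$ on ${\mathbf{C}}^{2}$) are assumptions made for illustration.

```python
from fractions import Fraction
from itertools import permutations

def mat_mul(a, b):
    """Product of two square matrices given as lists of rows."""
    n = len(a)
    return [[sum(a[i][k] * b[k][j] for k in range(n)) for j in range(n)] for i in range(n)]

def trace(a):
    """Sum of the diagonal entries."""
    return sum(a[i][i] for i in range(len(a)))

def inverse_det_series(a, max_degree):
    """Coefficients h_0..h_max of 1/det(I - lambda*A), via d*h_d = sum_k tr(A^k)*h_{d-k}."""
    power_traces = []          # power_traces[k-1] = tr(A^k)
    power = a
    for _ in range(max_degree):
        power_traces.append(Fraction(trace(power)))
        power = mat_mul(power, a)
    h = [Fraction(1)]
    for d in range(1, max_degree + 1):
        h.append(sum(power_traces[k - 1] * h[d - k] for k in range(1, d + 1)) / d)
    return h

def molien_coefficients(group, max_degree):
    """Average the series of 1/det(I - lambda*A) over all matrices A of the group."""
    series = [inverse_det_series(a, max_degree) for a in group]
    return [sum(s[d] for s in series) / len(group) for d in range(max_degree + 1)]

def perm_matrix(g):
    """Matrix of the linear map e_j -> e_{g[j]}."""
    n = len(g)
    return [[1 if g[j] == i else 0 for j in range(n)] for i in range(n)]

s3 = [perm_matrix(g) for g in permutations(range(3))]
z2 = [[[1, 0], [0, 1]], [[-1, 0], [0, -1]]]

print([int(c) for c in molien_coefficients(s3, 6)])
print([int(c) for c in molien_coefficients(z2, 4)])
```

For $S_{3}$ the coefficients come out as $1,1,2,3,4,5,7$, the numbers of linearly independent symmetric polynomials in three variables of degree $0$ to $6$. For $\{\mathrm{id},-\mathrm{id}\}$ they are $1,0,3,0,5$, reflecting that exactly the polynomials of even degree are invariant.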

6 Symbol table

$a_{d}$: number of linearly independent homogeneous invariants of degree $d$

$\tilde{d}$: dimension of $R_{d}$

$\mathcal{B}$: ON basis for $V$

$G$: finite group

$\omega_{i}$: eigenvalue of $T_{g}^{1}$ ([Slo77] $=w_{i}$)

$P(f)$: “Rho”, Riesz vector of $f$

$\rho$: unitary representation $\rho:G\to U(V),g\mapsto T_{g}$

$R$: big algebra, direct sum of the $R_{d}$

$R_{d}$: direct summand of degree $d$

$R^{G}$: ring of invariants of $G$

$R^{G}_{d}$: degree $d$ summand

$T_{g}$: representation of $g$ on $V$ ([Slo77] $A_{\alpha}=[T_{g_{\alpha}}]_{\mathcal{B},\mathcal{B}}$)

$V$: complex inner product space

$V^{\ast}$: algebraic dual of $V$

7 Lost and found

Some things to explore from here:

• If we know the conjugacy classes of $G$, we may be able to say more, since every unitary representation splits into irreducible components.

• There seems to be a link to Pólya enumeration.

• We have GAP code, see [GAP].

• An example would be nice.

• Relations on the generators in $S$ of the Cayley graph $\Gamma(G,S)$ should lead to conditions of the minimal polynomial of its adjacency operator $Q(\Gamma(G,S))$.

• Also, Cayley graphs of some finite reflection groups [Hu90] should become accessible.

• Check some more applications, as mentioned in [Slo77].

• For finding invariants, check also [Cox91], Gröbner bases.

Index

algebra of invariants §5
coefficient Definition 5.7
diagonal matrix §4
first induced operator Proposition 1.5
homogeneous component Proposition 5.3
idempotent Proposition 3.3
invariant §5
left composition §1
linear forms §1
Molien series Definition 5.4
Reynolds §3
Riesz map §2
stars and bars §4
symmetric algebra §1

Bibliography

[Ant 73] Howard Anton, Elementary Linear Algebra, $6^{th}$ ed., John Wiley and Sons, New York, 1973.
[Bie 04] Jürgen Bierbrauer, Introduction to Coding Theory, Discrete Mathematics and Its Applications, Volume 28, CRC Press Inc, Boca Raton, 2004.
[Cox 91] D. Cox, J. Little, D. O’Shea, Ideals, Varieties, and Algorithms, Springer-Verlag, New York, 1991.
[GAP] The GAP Group, GAP – Groups, Algorithms, and Programming, Version 4.4; 2004, (http://www.gap-system.org).
[Hu 96] John F. Humphreys, A Course in Group Theory, Oxford University Press, Oxford, 1994.
[Hu 90] James E. Humphreys, Reflection Groups and Coxeter Groups, Cambridge University Press, Cambridge, 1990.
[Rom 08] Steven Roman, Advanced Linear Algebra, $3^{rd}$ Edition, Springer-Verlag, New York, 2008.
[Sag 91] Bruce E. Sagan, The Symmetric Group, Wadsworth & Brooks, Pacific Grove, 1991.
