Elementary Proof of a Theorem of Hawkes, Isaacs and \"Ozaydin
Matth\'e van der Lee

TL;DR
This paper provides an elementary proof of a theorem relating subgroup lattice M"obius functions in finite groups and applies it to count solutions of certain subgroup-generated equations, also including a new result on Brown's quantity.
Contribution
It offers a simplified proof of Hawkes, Isaacs, and "Ozaydin's theorem and extends its application to counting subgroup solutions and a result on Brown's quantity.
Findings
Elementary proof of the subgroup lattice theorem
Application to counting solutions of subgroup-generated equations
New result on K.S. Brown's quantity
Abstract
We present an elementary proof of the theorem of Hawkes, Isaacs and \"Ozaydin, which states that mod , where denotes the M\"obius function for the subgroup lattice of a finite group , ranges over the conjugates of a given subgroup of with divisible by , and over the supergroups of the for which divides . We apply the theorem to obtain a result on the number of solutions of , for said and a natural number . The present version of the article includes an additional result on a quantity studied by K.S. Brown.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoint processes and geometric inequalities
Elementary Proof of a Theorem of Hawkes, Isaacs and Özaydin
Matthé van der Lee
Abstract
We present an elementary proof of the theorem of Hawkes, Isaacs and Özaydin, which states that mod , where denotes the Möbius function for the subgroup lattice of a finite group , ranges over the conjugates of a given subgroup of with divisible by , and over the supergroups of the for which divides . We apply the theorem to obtain a result on the number of solutions of , for said and a natural number . The present version of the article includes an additional result on a quantity studied by K.S. Brown.
Keywords: Möbius function, arithmetic functions, subgroup lattice
**2010 MSC: 05E15, 11A25, 20D30 **
1 Introduction
The purpose of this note is to present a simple proof for the theorem of Hawkes, Isaacs and Özaydin, an important tool in the area of counting problems in finite groups.
We refer to [3], where the result has appeared as Theorem 5.1, for background information regarding the subject.
The exposition uses minimal group theory. The ingredients for the proof are Burnside’s Lemma, the incidence algebra of a finite partially ordered set (of which we use only the basic properties), and a feature of arithmetic functions: Corollary 3.4.
Next, we derive a result on the number of solutions of , for a finite group , a subgroup , and (Theorem 6.1), and some results on a quantity studied by K.S. Brown (Propositions 6.2 and 6.3).
2 A group action
For positive integers and , we define the number as the binomial coefficient
[TABLE]
This function will play a pivotal role.
Let be positive integers, and a finite group of order . acts on the set of its subsets of cardinality in the natural way
[TABLE]
This is in fact the group action used in Wielandt’s proof of Sylow’s theorem, see [4], §1.7.
If fixes an element , then where is the subgroup generated by . So must be a union of right cosets of in . As equals the order of in , this means divides . Writing , we find that is the union of of the right cosets of . Conversely, any such union is an element of and is fixed by . It follows that the number of fixed points in of an element of order equals
[TABLE]
And, clearly, if does not divide , cannot have any fixed points in .
By Burnside’s Lemma, the number of orbits of in is equal to the average number of fixed points of the elements of . Denoting the number of elements of order in by , the number of orbits is therefore equal to
[TABLE]
Because this must be an integer, we have the following result.
Proposition 2.1**.**
If and are positive integers and is a group of order , one has
[TABLE]
3 Arithmetic functions modulo
Let be a fixed positive integer, and consider the set of functions from to . We can think of the standard arithmetic functions such as the Euler and Möbius functions as elements of this set, taking their values modulo .
The set is a commutative ring under the operations of point-wise addition and convolution product multiplication
[TABLE]
Unity is the function , given by and for . The group of units of is:
[TABLE]
Indeed, if , then mod and mod . So must be the inverse of in , and the value of for can recursively be determined in from the second congruence, in which it has coefficient .
By the well-known formula , the inverse \mu^{\scalebox{0.6}[0.75]{-}1} of the ordinary Möbius function in the ring is the function given by for all . Thus,
[TABLE]
We now discuss the set mod of the arithmetic functions mod that are, to give them a name, special.
Proposition 3.1**.**
* is a unitary subring of , and one has . That is, if is a unit in , it is a unit in .*
Proof.
is closed under addition, and . If and are special mod and is a divisor of , each term in the sum is divisible by , and is again special.
Finally, if , let with . Then is relatively prime to , hence to . Furthermore, f(1)\,f^{\scalebox{0.6}[0.75]{-}1}(t)\equiv\scalebox{0.6}[0.75]{-}\Sigma_{1<d\mid t}\,f(d)\,f^{\scalebox{0.6}[0.75]{-}1}(t/d) mod , so certainly modulo . By induction, all terms on the right-hand side can be assumed to be multiples of . Hence f(1)\,f^{\scalebox{0.6}[0.75]{-}1}(t) is a multiple of , and therefore so is f^{\scalebox{0.6}[0.75]{-}1}(t). ∎
Corollary 3.2**.**
Let , and assume . Then
- (a)
**
If, moreover, , the following equivalences hold
** 2. 3.
h*E\in R_{n}\Leftrightarrow h*f^{\scalebox{0.6}[0.75]{-}1}\in R_{n}**
Proof.
If , then since is also in , so is their product . By equation (1), that is just , giving (a).
Now if is a unit, so is . As this function is in , by Proposition 3.1 its inverse f^{\scalebox{0.6}[0.75]{-}1}*\mu is also in . So if , the product h*f*f^{\scalebox{0.6}[0.75]{-}1}*\mu=h*\mu is in too, establishing (b).
For (c), note that h*E\in R_{n}\Leftrightarrow h*(f^{\scalebox{0.6}[0.75]{-}1}*f)*E\in R_{n}\Leftrightarrow h*f^{\scalebox{0.6}[0.75]{-}1}\in R_{n}, the latter equivalence because of . ∎
Note that the Corollary applies in particular to . Indeed, as , one has , where denotes the identity function, (taken modulo ). And since .
Viewing the binomial coefficient function b_{m}:d\mapsto{dm\scalebox{0.6}[0.75]{-}1\choose d\scalebox{0.6}[0.75]{-}1} introduced in Section 2 as an element of , we now have
Proposition 3.3**.**
For and , the following are equivalent
- (a)
** 2. (b)
**
Proof.
Let be a divisor of . We apply Proposition 2.1 to the cyclic group . It has for every divisor of , and hence we obtain . Writing for , it follows that . By (b) of Corollary 3.2, applied to and , we find .
As , the function is a unit in . Taking inverses, we get b^{\scalebox{0.6}[0.75]{-}1}*E\in R_{n}. Part (c) of the Corollary, for f:=b^{\scalebox{0.6}[0.75]{-}1}, then gives the desired equivalence for . ∎
A slightly stronger statement is:
Corollary 3.4**.**
For , the following are equivalent
- (i)
** 2. (ii)
**
Proof.
(i) just paraphrases (a) of the Proposition, and (ii) is a trivial consequence of (b). To see that (ii) implies (i), take a divisor of , and suppose one already knows that . By the Proposition, for all and all with . Using (ii), pick an such that . Then clearly for this particular . Applying Proposition 3.3 again, one finds that , and so (i) holds for as well. ∎
4 The incidence algebra of a finite poset
Let be a finite poset (partially ordered set), and a commutative ring. The incidence algebra of over is the set of functions in two variables on with values in , which can only assume non-zero values when the arguments are comparable:
[TABLE]
The set is a, generally non-commutative unitary ring under point-wise addition and convolution product multiplication
[TABLE]
Unity is Kronecker’s delta function, if and otherwise.
is an -module, with acting by . And is a ring homomorphism making it an -algebra. Its group of units is given by:
[TABLE]
For if for all , we may compute the values of the right inverse of (the with ) for a given inductively for the by means of
[TABLE]
(This works because is finite.) Similarly, the values of the left inverse of are found, given , using
[TABLE]
By associativity, is the unique two-sided inverse of .
Key elements of are the following functions , and :
[TABLE]
The latter has the following properties, resulting from these definitions
[TABLE]
Many combinatorial aspects of the poset are reflected in the ring . For instance, the number of elements of the interval , and the number of chains of length from to are given by, respectively
[TABLE]
As -chains of length cannot exist, we have . And since and , it follows that
[TABLE]
Remark 4.1**.**
By this formula, is equal to (the image under of) the number of even-length chains from to in minus the number of odd-length ones. (Here, is considered to form a -chain of length zero.) This result is due to P. Hall.
5 Proof of the Theorem
Let be a finite group of order , and let be its subgroup lattice. It is partially ordered by the inclusion relation , which we shall denote as . Using as base ring, we will write for the associated incidence algebra .
We denote , the Möbius function of the poset , simply by , and similarly write just for the zeta function.
We revisit the action of on discussed in Section 2. For , define:
[TABLE]
where is the stabilizer of . If , then is a union of right cosets of in , so divides . Putting , the size of the orbit equals .
A point in this orbit has stabilizer G_{gT}=gHg^{\scalebox{0.6}[0.75]{-}1}, a conjugate of . So if denotes the conjugacy class of in , it follows that is a union of orbits of in , each of length . As the are disjoint, we obtain .
Hence if and is any set of subgroups of order of closed under conjugation, we have:
[TABLE]
Now let fix . Then , so again divides . With , must be a union of of the right cosets of . As we saw earlier, the number of these unions equals , which is therefore the number of fixed points of in . It follows that for any with
[TABLE]
Indeed, is a fixed point of iff , and divides for every .
Let be a subgroup of , of order dividing , and consider the collection
[TABLE]
The remaining computations will take place in the incidence algebra of the poset . As is convex in , that is, for in with one has , it is clear that, for and in , has the same value whether taken in or in . (See also Remark 4.1.) The same goes for the zeta function, and so we can use the notations and unambiguously.
In order to apply Möbius inversion to (4), we introduce two auxiliary functions and from the ring
[TABLE]
In terms of these, we have for :
[TABLE]
For in case and this follows by (4), and the formula is trivial otherwise.
Reformulating it as , the equation shows that in . Inverting now gives . Hence, for any with :
[TABLE]
By the definition of and formula (3) applied to , one obtains, dividing out
[TABLE]
Collecting the that are of equal index over the corresponding and eliminating produces
[TABLE]
This last formula is valid for any for which . So, given any divisor of , one can employ the formula with . In this situation, the for which are just the elements of the conjugacy class of , and we arrive at:
[TABLE]
We are now in a position to apply Corollary 3.4, yielding
[TABLE]
And this is precisely the content of the theorem of Hawkes, Isaacs and Özaydin.
Theorem 5.1**.**
([3, Theorem 5.1])* If is a finite group, a subgroup, the conjugacy class of in , and a divisor of , the following congruence applies*
[TABLE]
Taking the sum over all conjugacy classes of subgroups of order , and letting , one has, as a consequence
[TABLE]
Loosely speaking, the sum of the , taken over all ”at level ” and all ”at level at most ”, is a multiple of the ”height of the resulting slice” of .
6 Further observations
Write for the number of subgroups of order in .
Taking , a prime, with , with , and with , we either have and by properties (2), or and covers (in the sense that no intermediate subgroups exist), so that \mu(H,K)=\scalebox{0.6}[0.75]{-}1, again by (2). Hence, according to (5):
[TABLE]
or, putting ,
[TABLE]
Sylow’s theorem follows by an easy induction: if is a power of a prime , and one assumes that for all , the left-hand side of (6) and all terms on the right-hand side are congruent to 1 mod , so that , being the number of terms on the right, must be mod as well.
Note that Frobenius’ theorem, by which for a finite group of order divisible by , follows from Proposition 2.1, with the aid of a little arithmetic. Indeed, by Proposition 2.1, and Corollary 3.4 applies.
Next, we note that the proof of Proposition 3.3 goes through for any function (notation from that proof) which is a unit in and satisfies , i.e., for any in the coset \varphi^{\scalebox{0.6}[0.75]{-}1}*R_{n}^{*} of in . This coset equals \mu^{\scalebox{0.6}[0.75]{-}1}*R_{n}^{*}=E*R_{n}^{*}, by (b) of Corollary 3.2 applied to . Thus, for any function in the coset and any function , there is an equivalence similar to the one in Proposition 3.3:
[TABLE]
Such include the inverse of , given by \varphi^{\scalebox{0.6}[0.75]{-}1}(a)=\prod_{\,p\mid a,\,p\,\text{prime}}\,(1\scalebox{0.6}[0.75]{-}p) for , and the function . For and . Generally, putting and , so that is the sum of the -th powers of the divisors of , the equivalence holds for . Also, for it is easily seen that , so that the function is in the coset when .
We conclude by taking a look at the following elements and of the incidence algebra :
[TABLE]
Then , so for subgroups we have , and hence
[TABLE]
It follows that for any :
[TABLE]
For both sides vanish when . We use induction on for the case . If , both sides are equal to . And assuming the equation holds for all with , by (7) we find:
[TABLE]
But , and therefore
The theorem of Frobenius mentioned above is the case of:
Theorem 6.1**.**
If is a finite group, , the conjugacy class of in , and divides , one has
[TABLE]
That is, divides times the number of for which .
Proof.
Using (8) and the definition of ,
[TABLE]
For a fixed value of , the appearing in the final expression form a family closed under conjugation, so by Theorem 5.1 the corresponding terms add up to a multiple of . ∎
The number of -tupels for which is traditionally denoted ([2]). In our notation it is . We use the symbol for a related but different notion. Writing for , equation (7), applied to and , gives , generalizing the household formula (which is the special case ). When is cyclic, one has , the ordinary Euler -function applied to the group order , and otherwise.
The smallest number of generators of a group over a subgroup is the least such that , as follows by induction from (8), using the fact that, again by (8), is always non-negative.
The inverse of the index function in is given by
[TABLE]
For we have by (2), and that simply equals .
From (9) and the definition of one obtains:
[TABLE]
It is an open problem, raised by Kenneth Brown ([1]), whether \varphi^{\scalebox{0.6}[0.75]{-}1}(1,G) can ever be zero for a finite group . Gaschütz ([2]) has shown that for solvable it cannot, and Brown has derived some interesting divisibility properties of \varphi^{\scalebox{0.6}[0.75]{-}1}(1,G) for general . Whether or not \varphi^{\scalebox{0.6}[0.75]{-}1}(H,G) can be zero for in general is also unknown.
Note that, for , one has 0=\delta(1,G)=\Sigma_{1\leq H\leq G}\,\varphi(1,H)\,\varphi^{\scalebox{0.6}[0.75]{-}1}(H,G)=\Sigma_{H\leq G,\,H\,\text{cyclic}}\,\varphi(|H|)\,\varphi^{\scalebox{0.6}[0.75]{-}1}(H,G) =\Sigma_{g\in G}\,\varphi^{\scalebox{0.6}[0.75]{-}1}(\langle g\rangle,G). So, writing \boldsymbol{\varphi_{\scalebox{0.6}[0.75]{-}1}} for the function G\to\mathbb{Z},\,g\mapsto\varphi^{\scalebox{0.6}[0.75]{-}1}(\langle g\rangle,G), it follows that:
[TABLE]
Proposition 6.2**.**
The class function \varphi_{\scalebox{0.6}[0.75]{-}1} is an integral linear combination of irreducible characters of G.
Proof.
Let be the coset poset of , ordered by inclusion. (Usually, itself is not considered to be an element of the coset poset, but we include it here.) Every right coset Hx=x(x^{\scalebox{0.6}[0.75]{-}1}Hx) is equally a left coset, and hence becomes a -set by putting for . The action of is compatible with the ordering of . The fixed points of are the with , that is, the right cosets of the subgroups of that contain the element . We claim that:
[TABLE]
where denotes the set of fixed points of in and is the Möbius function of the poset . For if is the set of right cosets of in , one has , as noted by S. Bouc (cf. [1, Section 3]), and iff all elements of are in , and for all because is a convex subset of .
Lemma 2.8 of [3], applied to the dual poset of (which has the ”same” Möbius function), now shows that the function \varphi_{\scalebox{0.6}[0.75]{-}1} is a difference of permutation characters of . ∎
By (11), the coefficient of the trivial character in the decomposition of \varphi_{\scalebox{0.6}[0.75]{-}1} into irreducible characters is zero for non-trivial .
Proposition 6.3**.**
For , the following formula holds:
[TABLE]
Proof.
First, we note that for one has
[TABLE]
We now prove the formula by induction on . If , its left-hand side is \varphi^{\scalebox{0.6}[0.75]{-}1}(G,G)=1, and the right-hand side is the sum \Sigma_{F\leq H\leq K}\,\varphi^{\scalebox{0.6}[0.75]{-}1}(F,H)\,[K:H], which in view of (12) also equals 1. Thus we may assume that .
Using (12) plus the fact that for every sandwich , the join of and in the lattice is either itself or a subgroup of , with either or , we obtain
[TABLE]
As to the second equality, the second terms on the last two lines agree by (12). By the induction hypothesis, the third terms agree as well, seeing as, for , the restrictions to of the index and Möbius functions on are just the corresponding elements of , so that the same goes for and \varphi^{\scalebox{0.6}[0.75]{-}1}.
Now \Sigma_{K<L<G}\,\varphi^{\scalebox{0.6}[0.75]{-}1}(K,L)\,[G:L]=(\varphi^{\scalebox{0.6}[0.75]{-}1}*i)(K,G)-\varphi^{\scalebox{0.6}[0.75]{-}1}(K,K)\,[G:K]-\varphi^{\scalebox{0.6}[0.75]{-}1}(K,G)\,[G:G], which is equal to \zeta(K,G)-[G:K]-\varphi^{\scalebox{0.6}[0.75]{-}1}(K,G)=1-[G:K]-\varphi^{\scalebox{0.6}[0.75]{-}1}(K,G), and this establishes the result. ∎
The proposition provides an elegant liaison between the subgroups of that contain and the ones that together with generate , when one applies it with :
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Brown, K. S., The Coset Poset and Probabilistic Zeta Function of a Finite Group, J. Algebra 225 (2) (2000), 989–1012.
- 2[2] Gaschütz, W., Die Eulersche Funktion endlicher auflösbarer Gruppen, Illinois J. Math. 3 (1959), 469–476.
- 3[3] Hawkes, T., I. M. Isaacs, and M. Özaydin, On the Möbius Function of a Finite Group, Rocky Mountain J. Math. 19 (4) (1989), 1003–1034.
- 4[4] Huppert, B., Endliche Gruppen I, (Grundlehren; 134), Springer Verlag, Berlin-Heidelberg-New York, 1967.
