Words, permutations, and the nonsolvable length of a finite group
Alexander Bors, Aner Shalev

TL;DR
This paper investigates how certain identities influence the structure of finite groups, showing that specific word map properties imply bounds on the group's nonsolvable length, supporting a conjecture by Larsen.
Contribution
It establishes new connections between word identities and the structural bounds of finite groups, particularly relating to nonsolvable length and solvable subgroups.
Findings
Groups with large fiber size in word maps have bounded nonsolvable length.
Certain words imply the existence of a normal solvable subgroup of bounded index.
Most elements in large permutation groups have large support.
Abstract
We study the impact of certain identities and probabilistic identities on the structure of finite groups. More specifically, let be a nontrivial word in distinct variables and let be a finite group for which the word map has a fiber of size at least for some fixed . We show that, for certain words , this implies that has a normal solvable subgroup of index bounded above in terms of and . We also show that, for a larger family of words , this implies that the nonsolvable length of is bounded above in terms of and , thus providing evidence in favor of a conjecture of Larsen. Along the way we obtain results of some independent interest, showing roughly that most elements of large finite permutation groups have large support.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Words, permutations, and the nonsolvable length of a finite group
Alexander Bors The University of Western Australia, Centre for the Mathematics of Symmetry and Computation, 35 Stirling Highway, Crawley 6009, WA, Australia.
E-mail: [email protected]
The first author is supported by the Austrian Science Fund (FWF), project J4072-N32 “Affine maps on finite groups”.
Aner Shalev Einstein Institute of Mathematics, Hebrew University, Jerusalem 91904, Israel.
E-mail: [email protected]
The second author acknowledges the support of ISF grant 686/17, BSF grant 2016072 and the Vinik chair of mathematics which he holds.
2010 Mathematics Subject Classification: Primary: 20E10, 20P05. Secondary: 20B05, 20D06, 20F22.
Key words and phrases: Finite groups, Nonsolvable length, Probabilistic identities, Identities, Word maps
Abstract
We study the impact of certain identities and probabilistic identities on the structure of finite groups. More specifically, let be a nontrivial word in distinct variables and let be a finite group for which the word map has a fiber of size at least for some fixed . We show that, for certain words , this implies that has a normal solvable subgroup of index bounded above in terms of and . We also show that, for a larger family of words , this implies that the nonsolvable length of is bounded above in terms of and , thus providing evidence in favor of a conjecture of Larsen. Along the way we obtain results of some independent interest, showing roughly that most elements of large finite permutation groups have large support.
1 Introduction
The impact of identities on the structure of groups has been a central research topic for over a century. Major examples include the Burnside problems and their solutions, the theory of group varieties, as well as parts of combinatorial and geometric group theory.
In the realm of finite groups, Zelmanov’s solution to the Restricted Burnside Problem bounds the order of a -generator finite group satisfying the power identity in terms of and [16, 17]. The Hall-Higman reduction of this problem to -groups involves bounding the -length of solvable groups satisfying this identity for all primes [4]. A recent related result of Segal bounds the generalized Fitting height of finite groups satisfying in terms of [12, Theorem 10].
More generally, in recent years there has been extensive interest in probabilistic identities (defined below) of finite and residually finite groups. Finitely generated linear groups which satisfy a probabilistic identity were shown in [7] to be virtually solvable. Arbitrary residually finite groups satisfying a probabilistic identity were shown in [8] (using results from [2]) to have nonabelian upper composition factors of bounded size. Probabilistically nilpotent finite and infinite groups were recently studied in [13] and in [9].
It is easy to see that every finite group has a normal series each of whose factors is either solvable or a direct product of nonabelian finite simple groups. The smallest number of nonsolvable factors in a shortest such series is defined by Khukhro and Shumyatsky in [6] to be the nonsolvable length of , and is denoted by (see also Section 2 below for an alternative definition, which was also already given in [6, first paragraph of the Introduction]); while this concept was explicitly introduced and studied in [6], the idea of writing a finite group as an extension of two finite groups with smaller nonsolvable lengths for inductive purposes is already implicit in the Hall-Higman paper, see [4, proof of Theorem 4.4.1].
The main purpose of this paper is to present some ideas relating identities and probabilistic identities in finite groups with the nonsolvable length, and sometimes with the index of the solvable radical. We combine some machinery already developed by the first author in [3] (building on earlier work of Nikolov from [10]) with some new methods. Let us now explain this in some more detail.
For a positive integer , denote by the free group freely generated by . Elements of these groups are called words. For the definition of probabilistic identity, let be a nontrivial word. Then for every (not necessarily finite) group , one has the word map , induced by substitution into . If is finite and , it makes sense to define
[TABLE]
the proportion in of the fiber of under . For profinite groups , denotes the (normalized) Haar measure (in ) of the fiber . We say that satisfies a probabilistic identity with respect to and if and only if there is an element such that . A residually finite group is said to satisfy a probabilistic identity if its profinite completion satisfies a probabilistic identity.
In this paper, we will be interested in the following property of nontrivial words:
Definition 1.1**.**
A nontrivial word is called nonsolvable-length-bounding (or NLB for short) if and only if there is a function such that for every and every finite group , if satisfies a probabilistic identity with respect to and , then .
We can now state the following.
Conjecture 1.2**.**
All nontrivial words are NLB.
This conjecture, due to Michael Larsen (private communication), seems very challenging, in view of the fact that it is even unknown for , namely when is an identity of .
Conjecture 1.3**.**
The nonsolvable length of a finite group which satisfies a nontrivial identity is bounded above in terms of .
See also the last paragraph of [12] for a related problem, where the nonsolvable length is replaced by the generalized Fitting height.
Conjecture 1.3 is reduced to bounding the Fitting height of finite solvable groups satisfying a nontrivial identity in terms of alone; indeed this reduction follows from [6, Corollary 1.2].
In [3], the first author studied another property of nontrivial words , that of being multiplicity-bounding (or MB for short), see [3, Definition 1.1.1]. This just means that if a finite group satisfies a probabilistic identity with respect to and , then for each nonabelian finite simple group , the multiplicity of as a composition factor of is bounded from above in terms of , and . Several stronger and weaker properties than that of being MB were also studied in [3], such as the ones in the last two enumeration points of the following definition:
Definition 1.4**.**
Let be a nontrivial word.
A variation of is a word obtained from by “splitting variables”, i.e., by adding, for each , to each occurrence of in some second index. 2. 2.
For a nonabelian finite simple group , we say that is a coset identity over if and only if there are such that is constant on the Cartesian product of cosets of in . 3. 3.
* is called weakly multiplicity-bounding (or WMB for short) if and only if is not a coset identity over any nonabelian finite simple group.* 4. 4.
* is called very strongly multiplicity-bounding (or VSMB for short) if and only if every variation of is WMB.*
Note that our definition of a variation slightly differs from the one in [3, Definition 2.4(2)], which included a technical restriction on the second indices which one can assume w.l.o.g. anyway, but we will not need it here.
By [8, Theorem 5.2], if a finite group satisfies a probabilistic identity with respect to and , then the orders of the nonabelian composition factors of are bounded from above in terms of and . Letting denote the solvable radical of a finite group (namely the largest solvable normal subgroup of ), this implies the following.
Corollary 1.5**.**
A nontrivial word is MB if and only if the assumption that a finite group satisfies a probabilistic identity with respect to and implies that the radical index is bounded from above in terms of and . In particular if is MB then it is NLB.
The proof of this result will be given in Subsection 4.2 for the reader’s convenience.
Hence [3, Theorem 1.1.2] provides us with some examples of NLB words. Also by [3, Theorem 1.1.2(1)], the shortest nontrivial words which are not MB are of the form where is a variable. We will, however, be able to show that such words are NLB, and the crucial observation is that while these words are not MB, in particular not VSMB, they are “almost” VSMB, in the following exact sense:
Definition 1.6**.**
Let be a nontrivial word. is called almost very strongly multiplicity-bounding (or almost VSMB for short) if and only if every proper variation of (i.e., such that the number of variables occurring in is strictly larger than the number of variables in ) is WMB.
Our first main result relates the concepts of almost VSMB and NLB words:
Theorem 1.7**.**
Almost VSMB words are NLB.
Thus almost VSMB words satisfy Conjecture 1.2. This theorem is proved using a result of independent interest, showing that if is a finite permutation group, and the proportion of elements whose support has size at most is at least , then is bounded above in terms of and . See Theorem 3.2 below, as well as Theorem 3.1 and Proposition 3.3 for related results on permutation groups and the support of their elements.
Using the above result, Corollary 1.5 and [3, Theorem 1.1.2(3)], the following is immediate:
Corollary 1.8**.**
Let be a nontrivial word of length at most . Then is NLB.
Theorem 1.7 and Corollary 1.8 provide evidence in favor of Larsen’s conjecture mentioned above. We note that while is also not MB, the authors cannot exclude the possibility that all words of lengths , and are VSMB, in particular NLB, thus possibly allowing to replace the constant in Corollary 1.8 by . However, compared to studying words of lengths up to as done by the first author in [3, Section 6], the computational cost of doing so even just for words of length is considerable and would most likely require a medium- to large-scale parallel computation. Still, with some more theoretical machinery, we will at least be able to show the following:
Corollary 1.9**.**
Let be a nontrivial word of length at most . Then there is a constant such that if a finite group satisfies the identity , then .
Thus words of length at most satisfy Conjecture 1.3. The proof of Corollary 1.9 is based on a result allowing one to infer, under certain assumptions on a nontrivial word , that if a finite group without nontrivial solvable normal subgroups satisfies the identity , then the so-called permutation part of (see Definition 4.1.1(1) below) satisfies a shorter identity. This result is formulated in detail in Subsection 5.1 as Theorem 5.1.4.
Apart from new techniques for relating (probabilistic) identities with the nonsolvable length, we will also give infinitely many new (i.e., not already implicit in [3, Theorem 1.1.2(1)]) examples of both MB and non MB words. Recall that the power words are MB for all odd (as shown in the above reference). Answering [3, Question 7.1] we show the following.
Theorem 1.10**.**
Let be a variable. Then the following hold:
Let be a positive integer such that every prime divisor of satisfies . Then is MB. 2. 2.
Let be a positive integer with and . Then is not MB.
Obtaining a better understanding for which positive integers the word is (or is not) MB is of intrinsic interest, but it also relates to bounding in terms of the group exponent , see Subsection 6.1. We note that Theorem 1.10(2) partially contradicts the first author’s result [3, Theorem 1.1.2(1)]; more precisely, [3, Theorem 1.1.2(1)] wrongly states that is MB, but it is not. However, as clarified in an erratum on [3] prepared by the first author, [3, Theorem 1.1.2(1)] does become true if one replaces the set in its statement by (so is the only exponent for which the original version of [3, Theorem 1.1.2(1)] makes a wrong statement on the MB property status of ). Except for the paragraph at hand, whenever we cite [3, Theorem 1.1.2(1)] in our paper (as we already did above), we are actually always referring to the above mentioned corrected version of it.
This paper is organized as follows. In Section 2 we introduce some notation. Section 3 is devoted to permutation groups and the supports of its elements. We obtain there results of independent interest, some of which are applied in subsequent sections. In Section 4 we study probabilistic identities and prove Theorem 1.7 and Corollary 1.8. Section 5 is devoted to identities and the proof of Corollary 1.9. Finally, in Section 6, we prove Theorem 1.10 as well as a few other results on the impact of power word identities on the group structure. In particular we show there that the nonsolvable length of a finite group is bounded above by the exponent of its Sylow -subgroups.
2 Some notation and prerequisites
We first discuss an equivalent, but more explicit (though also more technical) definition of .
Definition 2.1**.**
Let be a finite group.
We denote by the solvable radical of , the largest solvable normal subgroup of . 2. 2.
We denote by the socle of , the subgroup of generated by all the minimal normal subgroups of . 3. 3.
We define sequences , , and of characteristic sections of recursively as follows:
- (a)
. 2. (b)
For , . 3. (c)
For , . 4. (d)
For , . 5. (e)
For , .
We call a finite group semisimple if and only if is trivial, i.e., if and only if has no nontrivial solvable normal subgroups. For the basic structure theory of finite semisimple groups (from which several of the subsequently listed facts follow), see [11, pp. 89ff.].
For every finite group , the groups are by definition all solvable, the groups are semisimple, and the groups are direct products of nonabelian finite simple groups. Moreover, since embeds into the automorphism group of , we have that is trivial if and only if is trivial, so there is a unique non-negative integer such that are all nontrivial and for . Actually, , by [6, first paragraph in the Introduction].
We now introduce some more notation and terminology that will be used throughout the paper. We denote by the set of natural numbers (including [math]) and by the set of positive integers. When is a function and , then denotes the restriction of to , and denotes the element-wise image of under . Euler’s constant will be denoted by (which is to be distinguished from the variable ). For , we denote by the base logarithm, and denotes . For a set , the symmetric group on is denoted by , and for , denotes the symmetric group on . The group of units of a field is denoted by , and the algebraic closure of by . For a prime power , the finite field with elements is denoted by . For a subset of a finite group , we denote by the least common multiple of the orders of the elements of . Finally, for a nonabelian finite simple group and a word , a coset word equation with respect to over is an equation of the form where are fixed automorphisms of , and are variables ranging over (so that the solution set of such an equation is always a subset of ).
3 Permutation groups
Some of our proofs require the study of the support of permutations and its distributions in finite permutation groups. In this section we obtain results in this direction, which may be of some independent interest.
For a permutation group and we let denote the number of points moved by and the number of points moved by some element of . We also let denote the number of fixed points of , and .
Theorem 3.1**.**
Let be a permutation group (where and are not assumed to be finite). Let be a positive integer, and suppose for all . Then
; 2. 2.
* if .*
We note that the bound in part (1) is best possible for all (take acting on .
The bound in part (2) is also best possible at least when for some , by the following example: let be the -Hadamard code (see e.g. [15, p. 248]). By its definition, it is clear that there is a unique coordinate where all elements of are [math]; we project onto the other coordinates, resulting in a subspace (which we regard as an additive group) with the following properties:
- •
every nonzero element of has exactly nonzero entries (equal to );
- •
for each , there is an element of having entry in the -th coordinate.
Set . Consider the function where is the product of the transpositions for those where . Then is an injective group homomorphism, so the image is actually a subgroup of , and it satisfies and that all nontrivial elements of have support size exactly .
We now prove Theorem 3.1.
Proof.
We first assume is finite, and then deduce the result without this assumption.
Set . We may assume has no orbits of size in its action on , since we may delete these orbits from , thereby obtaining a subset , and regard as a permutation group on .
Suppose has orbits on , of sizes . Then
[TABLE]
Since for all , we have for all . Consider the random variable , where is assumed to be chosen uniformly at random. Then, by the Cauchy-Frobenius Lemma (“The Lemma that is not Burnside’s”), . This yields . In fact, since we have , hence
[TABLE]
Since we have , and so
[TABLE]
This proves part (2).
To prove part (1) we claim that
[TABLE]
We prove the claim by induction on , the case being trivial.
Assuming the claim for we have
[TABLE]
Set . Then, since we have
[TABLE]
Hence
[TABLE]
proving the claim. We conclude that
[TABLE]
proving part (1).
Suppose now is infinite. Let be the support of , as above. We claim that is finite, hence, regarding as a permutation group on , we reduce to the finite case.
To prove the claim, choose and denote its support by . If then has size at most and we are done. Otherwise there exists with support which is not contained in . If we are done. Otherwise we proceed so that in step we choose with support which is not contained in . Let be the subgroup generated by and let . Then is finite (of size at most ) and . By the finite case we have . Since the sequence is increasing the process must stop, which means that, for some , is finite. This completes the proof.
∎
We now prove a result on permutation groups that will be used later.
Let , and let be a permutation group. We denote by the set of all whose support on is of size at most .
Theorem 3.2**.**
There is a function such that the following holds: Let , , and assume that is a permutation group of finite degree such that . Then
[TABLE]
Indeed, one may choose to be the following function:
[TABLE]
Proof.
This is clear if , since then , whence is equivalent to , and
[TABLE]
The assertion is also clear if . So we may henceforth assume that . We first show the following claim: “If is transitive, then .”
To see that this claim holds true, consider the random variable , where is assumed to be chosen uniformly at random. Then as noted in the proof of Theorem 3.1, by the Cauchy-Frobenius Lemma, .
Moreover, the Markov inequality (see for instance [1, p. 265]) shows that, for each positive integer ,
[TABLE]
Applied with , this yields
[TABLE]
so that
[TABLE]
or equivalently, . This concludes the proof of the above claim.
The claim yields in particular that the asserted upper bound on holds when is transitive. Let us now give an argument for general . Let be the partition of into the orbits of . For , denote by the (transitive) image of under the restriction homomorphism , . Observe that , and so as well. Hence if, for any , one has , one gets a contradiction to the above claim. So we may assume that for each ; in particular, .
Aiming for a contradiction, assume now additionally that
[TABLE]
Then
[TABLE]
for , allowing us to choose, for , a length sequence of pairwise distinct indices from such that for each ,
[TABLE]
What this means is that among all the elements of , there occur distinct values in the -th coordinate, and after fixing any of the many values in the -th coordinate and considering only such elements of , there still occur distinct values in the -th coordinate, and after fixing both the -th and -th coordinate, there still occur distinct values in the -th coordinate, and so on.
Now consider , the projection of to the coordinates number . The image still satisfies that , but on the other hand,
[TABLE]
Letting be such that the multisets and are equal, this yields the following upper bound on the proportion of elements in with support size at most :
[TABLE]
We thus get the desired contradiction if we can argue that
[TABLE]
Recall that , and set , so that . Then
[TABLE]
and that last expression is strictly smaller than if and only if
[TABLE]
Now by definition,
[TABLE]
and so
[TABLE]
Hence Formula (1) is implied by
[TABLE]
which is clear by definition of . ∎
In various cases we can obtain better bounds on also for intransitive groups. Let denote the number of orbits of , and let denote the rank of (namely the number of orbits on ordered pairs of points). Clearly .
Proposition 3.3**.**
With the above notation we have:
The probability that a random element satisfies is at least for any . Thus this probability tends to as . 2. 2.
The probability that a random element satisfies is at least for any . Thus this probability tends to as .
Proof.
The Markov inequality applied in the proof of the above theorem shows that, for any fixed we obtain (substituting ),
[TABLE]
which tends to provided . Part (1) follows.
For part (2) we use the second moment method for the random variable (). Then , and as is well-known, by applying the Cauchy-Frobenius Lemma to the action of on , one also gets . Therefore
[TABLE]
By the Chebyshev inequality (see for instance [1, p. 267]) we have
[TABLE]
Writing we obtain
[TABLE]
Clearly implies , which yields
[TABLE]
The result follows. ∎
Note that statement (1) of Proposition 3.3 implies that , which, adopting the notation from Theorem 3.2 yields that , and so
[TABLE]
Similarly, statement (2) of Proposition 3.3 implies that, with the above notation, we have , which yields
[TABLE]
We conclude this section with the following example, which shows that (in the notation used in Proposition 3.3(2)) is not always in :
Example 3.4*.*
Let in its regular action on itself (hence on points). Then is sharply -transitive. For , , let be the set of length sequences such that for and . Note that each such sequence is a generating sequence for . Set , denote by , for , the projection to the -th coordinate, and let
[TABLE]
Then is a -fold subdirect power of ; in particular, all orbits , for , of are of length . Note also that the listed generators of are pairwise distinct, so that . For each and each , the point stabilizer consists only of even length products of the listed generators of ; in particular, for each , the restriction of each element of to is contained in the unique index subgroup of the corresponding (sharply -transitive) action of on . Hence is intransitive on each orbit of , whence each Cartesian product of orbits of splits into at least two distinct orbits under the component-wise action of . In particular, , and so
[TABLE]
4 Probabilistic identities
4.1 Permutation-part-bounding words
We now introduce another word property that will be relevant for the proof of Theorem 1.7:
Definition 4.1.1**.**
Consider the following notations and concepts:
Let be a nontrivial finite semisimple group, say
[TABLE]
where are pairwise nonisomorphic nonabelian finite simple groups and . For , denote by the projection to the -th coordinate, and let be the image of under , which is again semisimple, with socle . We introduce the following notations for isomorphism invariants of :
- (a)
* for the so-called permutation part of , which we can view naturally as a subgroup of .* 2. (b)
* for the multiset , and* 3. (c)
* for the number .* 2. 2.
Let be a nontrivial word. We say that is permutation-part-bounding (or PPB for short) if and only if there is a function such that for every nonsolvable finite group satisfying a probabilistic identity with respect to and , .
Clearly, MB words are PPB. Moreover, we have the following:
Lemma 4.1.2**.**
The following hold:
Let be a finite group. Then is a section of . 2. 2.
PPB words are NLB.
Proof.
For (1): By definition,
[TABLE]
It is thus sufficient to show that has a solvable normal subgroup such that is isomorphic to a subgroup of . Letting where are pairwise nonisomorphic nonabelian finite simple groups and , we may view, up to natural isomorphism,
[TABLE]
We then find that
[TABLE]
is a suitable choice.
For (2): Let be PPB, and assume that is a finite group that satisfies a probabilistic identity with respect to that word and some given . We want to bound in terms of and . If is solvable, then , so assume that is nonsolvable. Then , where is as in the definition of PPB words. In other words, for each . Moreover, by [8, Theorem 5.2], there is an such that all nonabelian composition factors of have order at most . In particular, the number of nonisomorphic simple direct factors in is bounded from above by (because for each , the number of isomorphism types of nonabelian finite simple groups up to order is at most , as the orders of nonabelian finite simple groups are even and for each given order, there are at most two nonisomorphic nonabelian finite simple groups of that order). Using statement (1), it follows that
[TABLE]
and thus
[TABLE]
∎
In particular, the proof of Theorem 1.7 is now reduced to the following, which we will show next:
Lemma 4.1.3**.**
Almost VSMB words are PPB.
Proof.
Let be an almost VSMB word, let , and assume that a finite nonsolvable group satisfies a probabilistic identity with respect to and . Then every quotient of also satisfies a probabilistic identity with respect to and ; in particular, writing where are pairwise nonisomorphic nonabelian finite simple groups and , for , the group , defined as the projection of to the -th coordinate, satisfies a probabilistic identity with respect to and . Note that up to isomorphism, , and that when setting
[TABLE]
one has by definition that . So our goal is to find an upper bound in terms of and on .
To that end, fix , and for notational simplicity, write instead of , instead of , instead of , and instead of . For , denote by the set of points moved by (so that, using the notation from Section 3, ). Recall from above that satisfies a probabilistic identity with respect to and , so we can fix an element such that . Note: If is a repetition-free word, i.e., if the maximum multiplicity of a variable in is (no variable occurs more than once in ), then the probabilistic identity implies that ; in particular, then, and we are done. So we may assume that is not repetition-free. Writing where is the length of , and , we can find indices with such that , for all , and the (possibly empty) word segment is repetition-free. Moreover, for , define the word
[TABLE]
see also [3, Lemma 2.7] and our Notation 5.1.1(1), and set , see also Notation 5.1.1(2). Note that by choice of and , is a nonempty reduced word in which some variable occurs with multiplicity .
We bound the number of solutions to the equation , where are variables ranging over , in a -coset-wise counting argument. More precisely, fix first a -tuple . There are two fundamentally different cases in the counting argument, according to whether or not , where and is chosen such that all nonabelian composition factors of a finite group that satisfies a probabilistic identity with respect to and have order at most .
Assume first that , i.e., that . For each , fix one of the many cosets of in that have permutation part , say with coset representative , and consider the equation
[TABLE]
where the , for and , are variables ranging over . As described in [3, Lemma 2.7], this equation can be rewritten into the conjunction of the single word equation and the system of coset word equations over with respect to some variations of whose -th equation, for , looks like this:
[TABLE]
where is the unique function such that for , and for .
Hence for each , the underlying word of the -th coset word equation in the above equation system is a proper variation of , as follows by considering the -th and -th factors in the product on the left-hand side: (i.e., has the same variable, possibly with different exponents , in those positions), but
[TABLE]
(so the second indices of the variables at those positions in the -th coset word equation are different). As is assumed to be almost VSMB, this implies that each coset word equation labeled by an index from is not universally solvable; in particular, since , its proportion of solutions (among the variables that occur in it) is at most .
But as in [3, proof of Lemma 2.12], since , we can find at least pairwise distinct indices in such that the corresponding equations in the above system have pairwise disjoint occurring variable sets (i.e., they are “pairwise independent”), and this implies that the proportion of solutions (in ) of the entire system of equations is at most
[TABLE]
where the equality is by definition of . 2. 2.
Assume now that . Then we do not give a nontrivial upper bound on the number of solutions per -tuple of socle cosets with permutation parts , but we note that since contains some variable with multiplicity , the proportion of such -tuples in is exactly .
In combination, this yields the following:
[TABLE]
and thus
[TABLE]
so that an application of Theorem 3.2 shows that can indeed be bounded from above in terms of and , as required. ∎
4.2 Proof of Corollary 1.5
Let be a finite group. Assume first that for some constant . Then since is solvable (i.e., it only has abelian composition factors), for each nonabelian finite simple group , the multiplicities of in and are the same. It follows that , and hence , is an upper bound on the product of the orders of the nonabelian composition factors of , counted with multiplicities. In particular, the maximum multiplicity of a nonabelian composition factor of is at most . This shows the implication “” in the first sentence of Corollary 1.5.
Now assume that for each nonabelian finite simple group , the multiplicity of in is at most for some constant that may depend on . Assume also that the maximum order of a nonabelian composition factor of is bounded from above by another constant . Then let be the maximum value of where ranges over the (finitely many) nonabelian finite simple groups of order at most , so that any nonabelian composition factor of occurs with multiplicity at most . It follows that the socle of , which is of the form where are pairwise nonisomorphic nonabelian finite simple groups and , satisfies
[TABLE]
where the latter inequality uses that there are at most distinct isomorphism types of nonabelian finite simple groups of order at most (as was already observed in the proof of Lemma 4.1.2(2) above). This concludes the proof of the implication “” in the first sentence of Corollary 1.5.
For the second sentence (the “In particular”), just observe that . This concludes the proof of Corollary 1.5.
We thus have the following implication diagram between the various word properties considered in this paper:
VSMBalmost VSMBMBPPBNLBWMB
4.3 Proofs of Theorem 1.7 and Corollary 1.8
The proof of Theorem 1.7 is immediate by combining Lemmas 4.1.2 and 4.1.3. For Corollary 1.8, note that by [3, Theorem 1.1.2(3)], all nontrivial words of length at most are almost VSMB, so that we can conclude by an application of Theorem 1.7.
5 Identities
5.1 Segment identities
As noted in the Introduction, we will prove a result (Theorem 5.1.4 below) which will allow us to show that under certain assumptions, if a finite semisimple group satisfies some identity , then the permutation part satisfies a shorter identity , where is some proper segment of . Let us first introduce some notations and terminology and then formulate and prove Theorem 5.1.4.
Notation 5.1.1**.**
Let , say where is the length of , and for , and .
For , set
[TABLE] 2. 2.
For , set
[TABLE]
Note the following two simple facts:
Remark 5.1.2*.*
Using the notation from Notation 5.1.1, we note the following:
The words are segments of . 2. 2.
is empty if and only if , and . In particular, since is reduced, is always nonempty if and are such that . 3. 3.
if and only if , , and .
Definition 5.1.3**.**
Let , with notation as in Notation 5.1.1. Moreover, let be a variation of , and let . We say that is an -split variation of if and only if and .
Theorem 5.1.4**.**
Let , with notation as in Notation 5.1.1. Also, assume that for some given with and , all -split variations of are WMB. Then, if a finite semisimple group satisfies the identity , then the permutation part satisfies the identity . In particular, there is a nontrivial word of length strictly smaller than such that satisfies the identity .
Proof.
The “In particular” follows from the main statement, as by Remark 5.1.2(1,2), is a nonempty segment of , and so usually, one will simply choose , unless , which by Remark 5.1.2(3) can only happen if with is not cyclically reduced, in which case and thus satisfies the identity . We thus focus on the proof of the main statement now.
Say where are pairwise nonisomorphic nonabelian finite simple groups and . Then is a subdirect product of semisimple groups , , such that for each , and such that is a subdirect product of the permutation parts , for . Hence it suffices to show that each satisfies the identity . This shows that we may assume w.l.o.g. that for some nonabelian finite simple group and some .
Aiming for a contradiction, we will also assume that does not satisfy . Then we can fix with . Moreover, we fix with , and set . Finally, we fix automorphism tuples , for , such that .
By assumption, we have that . In particular, letting , for and , be variables ranging over , then by [3, Lemma 2.7], we have that a certain system of coset word equations over in the variables is universally solvable, and setting for and denoting by the unique function such that for , , one of the equations from the system is
[TABLE]
Note that by assumption, , but also
[TABLE]
Hence Equation (2) is a universally solvable coset word equation over with respect to some -split variation of . But by assumption, does not satisfy any coset identity with respect to , which is the desired contradiction. ∎
5.2 A consequence of Theorem 5.1.4
Using Theorem 5.1.4, we can show the following, which will be used in the proof of Corollary 1.9:
Proposition 5.2.1**.**
Let , with notation as in Notation 5.1.1. Also, assume that for some given , . Finally, let with such that . Then if a finite semisimple group satisfies the identity , satisfies ; in particular, satisfies a nontrivial identity of length strictly shorter than then.
Proof.
The proof of the “In particular” is as for Theorem 5.1.4. For the main statement: Since , in each -split variation of , there will be a variable that occurs with multiplicity exactly . Hence is VSMB, in particular WMB, by [3, Proposition 3.1(1)]. ∎
5.3 Proof of Corollary 1.9
By [3, Theorem 1.1.2(3)] and Corollary 1.8, it suffices to consider words of lengths , or . We start with the length case. Then the existence of (actually, with ) is clear if is a power of single variable. So we may also assume that contains at least two distinct variables. But if the total number of variables occurring in is at least , then since , there is a variable occurring with multiplicity at most in . Hence by Proposition 5.2.1, satisfies an identity for some word of length at most . By Corollary 1.8, is NLB, and so satisfying entails that (and thus ) is bounded from above by some constant, as required.
So we may henceforth assume that is a two-variable word, and moreover (by an argument as in the previous paragraph, using Proposition 5.2.1), we may assume that each variable that occurs in does so with multiplicity at least . Since , one of the two variables, say w.l.o.g. , occurs with multiplicity exactly in . Using the notation of Notation 5.1.1 for (with , of course), fix a pair with and .
We will now argue that each -split variation of is WMB. Since , at least one of the variables in derived from , say , must occur with multiplicity at most . If , is VSMB, in particular WMB, by [3, Proposition 3.1(1)]. So assume that . The segment between the two occurrences of in is of length at most , and thus it is VSMB by [3, Theorem 1.1.2(3)]. In view of this and [3, Proposition 3.1(2,3)], is VSMB, in particular WMB.
An application of Theorem 5.1.4 now yields that satisfies an identity of the form where is a word of length at most . Again, by Corollary 1.8, is NLB, and so is bounded from above by some constant.
The arguments for words of length are largely similar, so we only sketch them. The first paragraph of the above argument can almost literally be carried over, replacing by , of course, and not only referring to Corollary 1.8 at the end, but also to the cases of length resp. lengths and already done by then. In the two-variable case with , sine , we get that one of the two variables, say w.l.o.g. , occurs with multiplicity or in . When choosing the pair with with , one must also choose it such that the difference is maximal among all such pairs. This way, in the third paragraph of the argument, it is ensured that the segment between the two occurrences of in is of length at most (not just , as in the argument for length words). For , one can then conclude as in the length case, and for , one needs the additional observation that cannot be an -th or -th power of a single variable, for then some variable (necessarily ) occurs in with multiplicity at least , so that , a contradiction.
6 Power words
6.1 Identities
It is clear by a result of Segal [12, Theorem 10] that for each positive integer , if a finite group satisfies the identity (in other words, if ), then is bounded from above in terms of (actually, Segal’s result says that the same holds true if is replaced by the generalized Fitting height of , which is an upper bound on ). Now Segal’s proof uses the following, which is based on [4, proof of Theorem 4.4.1] and the Feit-Thompson theorem:
Lemma 6.1.1**.**
Let be a variable, let , and let be a nontrivial finite semisimple group satisfying the identity (in particular, is even). Then satisfies the identity .∎
The aim of this subsection is two-fold: Firstly, to show a slightly stronger variant of Lemma 6.1.1 (see Lemma 6.1.3 below), and secondly, to use a Segal-like argument for gaining a simple explicit upper bound on the nonsolvable length in terms of (see Proposition 6.1.4 below).
Let us start with the stronger version of Lemma 6.1.1, for which we introduce the following:
Definition 6.1.2**.**
Let be any fixed variable. Call a positive integer good if and only if the word is MB, and otherwise, call bad. Moreover, for fixed , denote by the set of all positive divisors of that are bad.
Lemma 6.1.3**.**
Let be a variable, let , and let be a nontrivial finite semisimple group satisfying the identity (in particular, is bad). Then satisfies the identity .
Proof.
We may w.l.o.g. assume that for some nonabelian finite simple group and some (as is, in general, a subdirect product of such groups, and likewise, is a subdirect product of the permutation parts of those groups). Fix . We will show that can only have cycles of lengths of the form where , and once we will have shown that, we will be done, as this implies that .
So let be a length cycle of . Note firstly that , since , being a quotient of , also satisfies the identity . Now fix such that . It follows that for all ,
[TABLE]
and the expression on the left-hand side can be written as an element of whose -th entry is
[TABLE]
which must in particular also be for all choices of . This shows that satisfies a coset identity with respect to , and so is bad by [3, Proposition 2.9(3)], i.e., for some , as required. ∎
Note that by [3, Corollary 5.2], all bad positive integers are even, and so in Lemma 6.1.3, , whence Lemma 6.1.3 does imply Lemma 6.1.1, as asserted above. While it is true that the greatest common divisor of all bad positive integers is (since, for example, and are bad by [3, Theorem 1.1.2(1)]), and thus that Lemma 6.1.3 does not always provide strictly stronger information than Lemma 6.1.1, in some cases, it is better. As a somewhat extreme example, note that by [3, Theorem 1.1.2(1)], and so by Lemma 6.1.3, satisfying the identity implies that is trivial (as opposed to it just satisfying the identity , which is what Lemma 6.1.1 gives).
Using the bound from Lemma 6.1.1, we will now show, similarly to Segal’s proof of [12, Theorem 10]:
Proposition 6.1.4**.**
For every finite group , .
Proof.
By induction on . If , then is solvable by the Feit-Thompson Theorem, so , and the bound is clear in that case. Now assume that , and also assume that is nonsolvable (otherwise, again, and the bound is clear). Then since satisfies the identity , so does . By Lemma 6.1.1, it follows that satisfies the identity , and thus, by the induction hypothesis,
[TABLE]
which yields the desired bound . ∎
6.2 Probabilistic identities
In this subsection, we are concerned with the proof of Theorem 1.10. It relies on the following two lemmas of some independent interest:
Lemma 6.2.1**.**
Let , let be a nonabelian finite simple group, and let . The following are equivalent:
. 2. 2.
.
We note that by Lemma 6.2.1, [3, Corollary 5.2] may be viewed as a direct consequence of Nikolov’s earlier result [10, Proposition 10].
Lemma 6.2.2**.**
Let be odd, let , and let be an automorphism of with nontrivial diagonal part and whose field part is of order . Then .
Computer calculations show that the statement of Lemma 6.2.2 is also true for , so it might actually hold for all . Let us now prove these two lemmas before proceeding with the proof of Theorem 1.10.
Proof of Lemma 6.2.1.
For “(1) (2)”: We will show the contraposition: Assume that . Then, since , it follows that , and so , as required.
For “(2) (1)”: Let . Then there is an with . But
[TABLE]
It follows that . In particular, cannot be equal to the singleton , as required. ∎
Proof of Lemma 6.2.2.
We view as the subgroup of consisting of the images under the canonical projection of all matrices in whose determinant is a square in . Note that the order of every element of is divisible by and that by [5, Proposition 4.1], lies in some copy of inside the simple Chevalley group containing . In particular, the order of is an element of , so we are done if we can show the following two statements:
- •
For all , .
- •
There is an with .
Let us start with the first statement. Write where , is any fixed element of , and is a field automorphism of order (not necessarily the entry-wise Frobenius automorphism ). Then for each ,
[TABLE]
and so, since and is odd, it follows that the order of is even. This concludes the proof of the first statement.
For the second statement, denote again by the common field part of the elements of . Since , we have that contains an element of order . Observe that the lift of must have determinant , for its determinant must be a non-square in . But since is odd, is also a non-square in , so lies in and also has order . Since is centralized by and , it follows that has order , as required. ∎
We are now ready for the
Proof of Theorem 1.10.
Let us start with the proof of statement (2), because it is shorter and easier. Firstly, note that since is not MB by [3, Theorem 1.1.2(1)], we also have that is not MB for any (if a finite group satisfies a probabilistic identity with respect to and , it also satisfies one with respect to and ). We may thus assume that ; in other words, for some odd with . But by Lemma 6.2.2, if is as in the formulation of Lemma 6.2.2, then , whence is bad by [3, Proposition 2.9(3)].
We now give the proof of statement (1). First, note that the assumption implies that each prime divisor of is larger than . We need to show that for every nonabelian finite simple group and all , . By [3, Theorem 5.1], it suffices to show this for of one of the two forms or . In what follows, we denote by the field automorphism group of , which is cyclic, generated by , the entry-wise Frobenius automorphism (for this to make sense in the Suzuki case, view as a subgroup of as in [14]). As in the proof of Lemma 6.2.2 above, we view as a subgroup of , and we denote the image of a matrix under the canonical projection by .
Case: . Then , so we may choose for some . Then centralizes . In particular, there is an of order centralized by . It follows that . 2. 2.
Case: . We make a subcase distinction:
- (a)
Subcase: . Then , so we may choose for some . Then centralizes . In particular, there is an of order centralized by . It follows that . 2. (b)
Subcase: . Then
[TABLE]
where is some fixed generator of . We may thus choose for some and some . If , then we can conclude as in Subcase 1, using that contains an element of order . So assume that . We make a subsubcase distinction:
- i.
Subsubcase: or . Note that the centralizer of in contains the element
[TABLE]
whose order is . In particular, since , there is an of order centralized by . We will now argue that does not divide , then we can conclude as in Subcase 1. To that end, note that by the subsubcase assumption, , so it suffices to show that is not of the form or for some that is odd and satisfies the congruence .
- •
If : Then . By assumption, , so that and thus . But , so one would need to have , which is impossible since by assumption.
- •
If : Then . By assumption, , so that and thus . But , so one would need to have , which is impossible since by assumption. 2. ii.
Subsubcase: and (i.e., is a generator of ). By Lemma 6.2.1, it suffices to show that . Since for all , this is clear if , so assume that . Note that by the argument from the previous subsubcase, we always have that . In particular, if , or if and is even, then , so that and we are done. So we may assume that and is odd. But then Lemma 6.2.2 yields that some element in has order divisible by ; in particular, the -th power of that element is nontrivial, as required.
∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D. P. Bertsekas and J. N. Tsitsiklis, Introduction to Probability , Athena Scientific, Belmont, MA, 2008.
- 2[2] A. Bors, Fibers of automorphic word maps and an application to composition factors, J. Group Theory 20 (6):1103-1134, 2017.
- 3[3] A. Bors, Fibers of word maps and the multiplicities of nonabelian composition factors, Internat. J. Algebra Comput. 27 (8):1121–1148, 2017.
- 4[4] P. Hall and G. Higman, On the p 𝑝 p -length of p 𝑝 p -soluble groups and reduction theorems for Burnside’s problem, Proc. London Math. Soc. (3) 6 :1–42, 1956.
- 5[5] B. Hartley, A general Brauer-Fowler theorem and centralizers in locally finite simple groups, Pacific J. Math. 152 (1):101–117, 1992.
- 6[6] E.I. Khukhro and P. Shumyatsky, Nonsoluble and non- p 𝑝 p -soluble length of finite groups, Israel J. Math. 207 :507–525, 2015.
- 7[7] M. Larsen and A. Shalev, A probabilistic Tits alternative and probabilistic identities, Algebra and Number Theory 10 : 1359–1371, 2016.
- 8[8] M. Larsen and A. Shalev, Words, Hausdorff dimension and randomly free groups, Math. Ann. 371 (3–4):1409–1427, 2018.
