Short Laws for Finite Groups and Residual Finiteness Growth
Henry Bradford, Andreas Thom

TL;DR
This paper constructs short laws for finite groups of bounded order and uses these to establish new lower bounds on the residual finiteness growth of non-abelian free groups, advancing understanding of finite group laws.
Contribution
It introduces shorter laws for finite groups of bounded size and applies these to improve residual finiteness growth bounds for free groups.
Findings
Existence of laws of length n^{2/3} log(n)^{3+δ} for all finite groups of order at most n
Improved lower bounds on residual finiteness growth of non-abelian free groups
Enhanced understanding of the relationship between finite group laws and residual finiteness
Abstract
We prove that for every and there exists a word of length which is a law for every finite group of order at most . This improves upon the main result of [A. Thom, About the length of laws for finite groups, Isr. J. Math.]. As an application we prove a new lower bound on the residual finiteness growth of non-abelian free groups.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Short Laws for Finite Groups
and Residual Finiteness Growth
Henry Bradford and Andreas Thom
Abstract
We prove that for every and there exists a word of length which is a law for every finite group of order at most . This improves upon the main result of [24] by the second named author. As an application we prove a new lower bound on the residual finiteness growth of non-abelian free groups.
1 Introduction
A law for a group is an equation which holds identically in . The qualitative study of laws in groups is a classical subject, often phrased in terms of varieties of groups, [21, 22]. Moreover certain specific laws have been the subject of intense study over the years, particularly power laws, which delineate the territory of the bounded and restricted Burnside problems. A more recently opened avenue of research has been the asymptotic behaviour of lengths of laws for sequences of finite groups, motivated by connections with other invariants of interest in asymptotic group theory.
1.1 Statement of Results
Our focus in this paper will be on the length of laws which are satisfied simultaneously by all sufficiently small finite groups. Our main result is as follows.
Theorem 1.1**.**
For all there exists a word of length
**
such that for every finite group satisfying , is a law for .
In the statement of the previous theorem, denotes the iterated logarithm, i.e. the smallest natural number , such that the -fold application of to yields a real number less than . Note that grows slower than any iteration of logarithms.
The precise definition of a law for a group is given in Subsection 2.1. Theorem 1.1 improves asymptotically on the result of [24], in which the second named author obtained an upper bound of
[TABLE]
Our second result, which is a direct application of Theorem 1.1, is a new lower bound on the residual finiteness growth of nonabelian free groups. Recall that a group is residually finite if, for every , there exists a finite group and a homomorphism such that . Bou-Rabee [2] introduced a quantitative version of this property, as follows. Given a residually finite group and , we define:
.
Now, given a generating set for , there is a naturally associated length function on . For , let be the set of elements of length at most . We define:
.
Informally, we may think of groups for which grows slowly as being those in which non-trivial elements are “easy” to detect in finite quotients. Now, , being defined in terms of the generating set , is not an invariant of alone. However, if is finitely generated, then turns out to be independent of up to the equivalence relation induced by a natural partial order on functions, which we make precise below. Thus we can speak without ambiguity about the residual finiteness growth of . Since the introduction of this notion, particular attention has been paid to the task of estimating for the nonabelian free groups (), starting with Bou-Rabee’s original paper [2], and continuing through a series of papers by various authors [3, 17, 24]. Since has a very rich family of finite quotients, one expects its residual finiteness growth to be very slow. Consequently, any significant lower bounds on represent a surprising group-theoretic phenomenon. There is a clear connection between such lower bounds and laws for finite groups: a non-trivial element of length which is a law for all finite groups of order at most witnesses that . From this observation, Theorem 1.1 immediately implies the following new lower bound on the residual finiteness growth of .
Theorem 1.2**.**
.
It is likely that the conclusion of Theorem 1.2 is best possible up to logarithmic factors: following on from work of Hadad [12], Kassabov and Matucci ([17], Remark 9) propose that the shortest non-trivial law satisfied simultaneously by the groups , as varies over all finite commutative rings with , should be of length . They further note that, if this conjecture is correct, then .
1.2 Background
The first result on residual finiteness growth of free groups appeared in Bou-Rabee’s original paper introducing the notion (see also Rivin [23]).
Theorem 1.3** ([2]).**
.
This result is an immediate consequence of a corresponding theorem on the residual finiteness growth of linear groups, via the embedding of into . Theorem 1.3 implies that a law holding simultaneously in all finite groups of order at most must have length . To date these are the best complementary bounds to Theorem 1.1 and 1.2.
The first attempt to investigate the problem addressed by Theorem 1.1 was made by Bou-Rabee and McReynolds, though their result is again phrased in terms of residual finiteness growth.
Theorem 1.4** ([3]).**
.
Theorem 1.4 implies that there exists a word of length satisfying the conclusion of Theorem 1.1. Note that this already improves dramatically over the obvious laws and say that merely show . Bou-Rabee and McReynolds’ construction was refined by Kassabov and Matucci, who obtained the following.
Theorem 1.5** ([17]).**
For all , there exists a word of length such that for every finite group satisfying , is a law for . Consequently .
As discussed above, prior to the present paper the best upper bound for the lengths of the was the following result of the second author.
Theorem 1.6** ([24]).**
For all , there exists a word of length
**
such that for every finite group satisfying , is a law for .
The proof of this last result differs significantly from those which had preceded it, in that it makes extensive use of deep results from finite group theory, including the Classification of Finite Simple Groups.
It was a great surprise to the authors to discover that the main term appearing in the bound from Theorem 1.6 was not best possible. Indeed it was conjectured in [24] that it should be. In a related vein, Kassabov and Matucci asked the following.
Question 1.7** ([17]).**
For , is it the case that ?
Although Theorem 1.6 already implies a negative answer to Question 1.7 (by the observations from our Subsection 2.3, it follows from Theorem 1.6 that ), until our work it remained eminently plausible that linear bounds for and the length of the shortest words satisfying the conclusion of Theorems 1.1 and 1.6 could not be beaten by more than a polylogarithmic factor. The fact that polynomial improvements could be achieved was quite unexpected.
As regards individual groups, much recent attention has been devoted to the length of laws for simple groups. For the symmetric (and therefore also alternating) groups, the best known upper bound is provided by a result of Kozma and the second author.
Theorem 1.8** ([20]).**
There exists a law for of length at most:
.
If is a finite group of Lie type, then the best available bound appears in another paper of the authors.
Theorem 1.9** ([4]).**
Let be a finite group of Lie type over a field of order , such that the natural module for has dimension . Then there is a word of length:
**
which is a law for .
Furthermore, the exponent of in Theorem 1.9 is known to be sharp: Hadad [12] gives a lower bound of for the length of the shortest law for , using an embedding of . Thus, the worst case is – a reoccuring theme in finite group theory. The same phenomenon will appear in our study of laws for all groups up to size .
We will employ Theorem 1.9 in the proof of Theorem 1.1. As such a few words on the proof of Theorem 1.9 are in order. The proof strategy was inspired by that of Theorem 1.8: in both cases the problem is first divided into a search for two words vanishing, respectively, on generating and non-generating pairs of elements in our group . The constructions of these two words are quite different. For generating pairs, we identify a large subset satisfying a short identity. For , the set is the set of -cycles; for of Lie type it is usually the union of split maximal tori. We then show that many short words in our generating pair of elements will lie in , using results on mixing times and random walks in (see subsection 2.2) and compose these words with the identity holding in .
A non-generating pair of elements lie in a common maximal subgroup of . By classifying the possibilities for we can produce laws by induction. For , this is facilitated by the O’Nan-Scott Theorem, and consequences thereof due to Liebeck (see [20] for details). To carry out the induction needed to prove Theorem 1.9 in full generality is a deep matter, requiring Aschbacher’s Theorem on maximal subgroups of finite classical groups, the Classification of Finite Simple Groups, results on the classification of maximal subgroups in exceptional groups of Lie type, and dimension bounds for permutational and linear representations of finite simple groups. That said, to prove Theorem 1.1 we will only apply Theorem 1.9 in the case or , for which the determination of maximal subgroups is classical (see [19]).
One common feature of the proofs of Theorems 1.1, 1.8 and 1.9 is the use of probabalistic arguments to demonstrate the existence of words with certain properties. That is, it is shown that, according to a certain model of random words, a word has the desired property with positive probability, and it is deduced that such a word must exist. As such, the proofs are non-constructive: they do not facilitate the description of explicit laws for the given groups. By contrast, Theorem 1.6 is constructive, and words satisfying the conclusions of that theorem could in principle be explicitly written down. It remains a beguiling question what the form of the words arising in Theorem 1.1 might be.
1.3 Outline of the Paper
In Section 2 we assemble some basic tools for constructing laws in groups (Subsection 2.1), material on mixing times and random walks in finite groups (Subsection 2.2), and notions relating to residual finiteness growth, including the deduction of Theorem 1.2 from Theorem 1.1 (Subsection 2.3).
The proof of Theorem 1.1, to which Section 3 is devoted, has much in common with that of Theorem 1.6 found in [24]. There, the problem of constructing a short law valid in all sufficiently small finite groups was first reduced to that of constructing a short law valid only in all sufficiently small simple groups. This reduction will also be the first step in the proof of Theorem 1.1. It is summarised in Subsection 3.1, and appears in full detail in [24]. Further standard reductions (discussed in Subsection 2.1) allow us to consider separately each of the eighteen infinite families of finite simple groups (since the bound in Theorem 1.1 is asymptotic, the sporadic groups are easily eliminated).
The key novelty of our work lies in our treatment of groups of the form , and (for all other finite simple groups, the laws constructed in [24] suffice; this is explained in Subsection 3.2). Once again, we may consider these three classes separately from the other finite simple groups, and from each other. and are then easily dealt with using Theorem 1.9: we have a short law for each individual group and these can be combined over all sufficiently small . This is carried out in Subsection 3.3.
By contrast, combining laws for individual groups of the form into one law valid for all of them is too expensive (essentially, there are too many such groups of small order). This issue provided the bottleneck for the bound in Theorem 1.6. Instead, we take an approach closer in spirit to that used in the proof of Theorem 1.9, as it was sketched in Subsection 1.2. Namely, given a generating pair in some , we use random walks to locate many short words potentially satisfying a short relation. Those words are then combined using an iterated commutator to yield one word that works with high probability.
In contrast to previous work, however, we run random walks in pairs, and locate pairs of elements lying in a common Borel subgroup. Since there is a common law satisfied by the Borel subgroups of every (a double commutator), there is no need to combine laws for the individual groups: the words we produce using the random walk method will already provide laws holding simultaneously in for all sufficiently small . This approach works as stated for a set of good primes arising from the work of Breuillard and Gamburd [5] on uniform expansion for . This argument, the subject of Subsection 3.4, is the technical heart of our work. The groups for other primes, prime powers and other finite simple groups are dealt with by a more direct argument. The paper concludes with a short survey of open problems.
2 Preliminaries
2.1 Laws in Finite Groups
We start out with some basic definitions.
Definition 2.1**.**
Fix an ordered basis for the free group and let . For any group define the evaluation map (also denoted ) by , where is the (unique) homomorphism extending , . We call a law for if .
Of course we could equally define word maps associated to elements of for any , and thereby seek laws for within , however it turns out that very little is lost by restricting to . For if , standard embeddings of into associate to every law for a law for , of length depending linearly on the length of , while conversely, an inclusion of a basis for into a basis for turns every law for in into a law in . Meanwhile, a nontrivial element is a law for iff the exponent of divides (viewed as an integer).
We note two basic facts about the structure of laws in finite groups, which will enable us to construct new laws from old. The first allows us to combine words vanishing on subsets of a group to a new word vanishing on the union of those subsets, and is proved as Lemma 2.2 in [20]. To this end, recall for a group and a word the definition of the vanishing set of on from [24]:
.
Lemma 2.2**.**
Let be non-trivial words. Then there exists a non-trivial word of length at most such that for all groups ,
.
Note that, as well as allowing us to increase the vanishing set of words within a single group, Lemma 2.2 allows us to take a family of groups and, given a law for each group in the family, produce a new law which holds in every group in the family simultaneously.
The second fact allows us to construct laws for group extensions. It is proved as Lemma 2.1 in [24].
Lemma 2.3**.**
Let be an extension of groups. Suppose satisfy non-trivial laws in of length , respectively. Then satisfies a non-trivial law of length at most .
The previous lemma is stated in [24] with in place of , but the additional summand is easily removed.
Example 2.4**.**
A group is abelian iff the word is a law for . By Lemma 2.3 and induction, it follows that if is soluble of derived length at most , then satisfies a law of length at most . Since every nilpotent group of class at most is soluble of derived length at most , these groups also satisfy such a law.
The work of Elkasapy and the second author [9] on the derived and lower central series of allows one to construct even shorter laws for soluble and nilpotent groups, of which we will avail ourselves in the sequel.
2.2 Mixing times and Random Walks
Let be an finite group, and let be a symmetric generating set. We will consider a lazy random walk associated with the set , as follows: let be independent random variables, each with distribution function:
where is the indicator function of and is the Dirac mass at the identity. Let be the random variable on given by . We are interested in the mixing time of the random walk.
Seminal results about the mixing times of random walks on were derived from diameter bounds proved by Helfgott [13] (for the case prime) and generalizations (to arbitrary ) due to Dinai [8] and Varjú [25]. We are going to use the following result due to Breuillard and Gamburd [5] that holds for for a sufficiently large set of primes.
Theorem 2.5** (Breuillard-Gamburd).**
There is a constant such that for all all the number of rational primes less than for which has uniform spectral gap less than is at most .
For sake of convenience, we will refer to the set of primes arising from the previous theorem as the set of good primes. Note that a spectral gap of just means that the spectral radius of the random walk acting on mean-zero functionals is bounded from above by and thus, the random walk approaches equidistribution exponentially fast. We therefore have the following corollary.
Corollary 2.6**.**
Let and a good prime and let be a generating set and let . Then:
**
for all .
For more details on the relevant concepts and results consult [6].
2.3 Residual Finiteness Growth
We recall the definition of the residual finiteness growth function from the Introduction. Let be a finitely generated residually finite group and let . Define:
.
Fix a finite generating set for . For :
.
Here denotes the ball of radius with respect to the word metric associated with . The function , known as the residual finiteness growth function for (with respect to ) was introduced and studied by Bou-Rabee [2], up to a natural notion of equivalence of functions.
Definition 2.7**.**
For any , write if there exists such that for all , and write when both and .
It is clear from this definition that is an equivalence relation. Of course any can be extended to via . Such functions may thereby also be compared under .
Lemma 2.8** ([2]).**
Let be a subgroup of generated by a finite set . Then .
We conclude the following corollary.
Corollary 2.9**.**
Let be finite generating sets for . Then . Thus we may speak without ambiguity about the (-class of the) residual finiteness growth function of itself.
Since any non-abelian finite-rank free group embeds into any other, we also obtain the following consequence.
Corollary 2.10**.**
Let be finite-rank free groups of rank . Then .
There is a close relationship between residual finiteness growth of free groups and laws for finite groups.
Proposition 2.11**.**
Let be a strictly increasing function. Suppose that for all there is a law of length at most simultaneously valid in all groups of order at most . Let be a finite-rank free group. Then:
.
Proof.
By Corollary 2.10 we may assume . Fix a basis for . Let be a non-trivial word of length at most , which is simultaneously a law for all groups of order at most . Then , so , for all . The claim now follows from Corollary 2.9. ∎
We may now complete:
Proof of Theorem 1.2.
By Theorem 1.1 there exists such that we may take as in Proposition 2.11 with:
.
∎
3 Proof of Theorem 1.1
3.1 Reduction to Simple Groups
As discussed above, the proof of Theorem 1.6 from [24] begins by reducing the problem of constructing laws valid in all finite groups up to the order bound to that of constructing laws valid only in all finite simple groups up to the order bound. Via the same reductions, our Theorem 1.1 will follow from the following result, which improves asymptotically on Proposition 4.1 from [24].
Proposition 3.1**.**
For all , there exists a word of length:
**
such that for every finite simple group satisfying , is a law for .
Theorem 1.1 follows from substituting Proposition 3.1 in place of Proposition 4.1 from [24], and proceeding mutatis mutandis with the argument as in [24]. We refer the reader there for the details (and in particular for references for many of the assertions made in the proof), and here restrict ourselves to an outline.
Proof of Theorem 1.1 (sketch).
First suppose that is nilpotent. The bound implies the nilpotency class of is at most (the maximal length of a subgroup chain in ). By Example 2.4, satisfies a law of length:
A more precise analysis using results from [9] yields a bound of , see Proposition 3.1 in [24].
Now more generally assume is soluble. Let be the Fitting subgroup of . Fix a prime and let be the Sylow -subgroup of . The action of on the Frattini quotient of (whose rank we denote by ) induces a map , whose kernel is a -group. Moreover the natural map is an embedding modulo the center of . Since is nilpotent and by Lemma 2.3, it suffices to find a law for .
Since is soluble, so is its image in . The derived length of the latter is . Note that, since , . We apply the bound for laws in soluble groups from Example 2.4 to Putting all this together, satisfies a law of length:
.
Again, a more precise analysis using the results of Elkasapy and the second author on the length of the shortest non-trivial element of in the th step of the derived series [9] yields a bound of , see Proposition 3.2 in [24]. Let’s fix a constant , such that there exists a word of length , which is satisfied by each solvable group of size at most .
Finally consider a general group . Recall that every finite group is soluble-by-semisimple, so by Lemma 2.3 and the previous paragraph, we may assume is semisimple (in the sense of Fitting).
There exist finite simple groups and such that may be identified with a subgroup of , for finite groups satisfying:
,
and in such a way that each . The bound implies upper bounds on and . By Theorem 1.8 and Lemma 2.3, the problem is reduced to the construction of short laws for the .
For a finite simple group, the solution to Schreier’s Conjecture implies that is soluble of derived length at most . The result now follows from Proposition 3.1 and final applications of Lemma 2.3 and Lemma 2.2.
In total, having obtained a law of length , valid for all simple groups up to size , following the arguments in [24], we obtain a constant and laws of the length bounded by valid for all semisimple groups of size up to .
Consider now an increasing sequence of real numbers , where
[TABLE]
and is the last index, where . It is easy to see that , where denotes the iterated logarithm.
Now, let be a finite group of size at most and let be its solvable radical with the associated semisimple quotient . It is clear that there must exist some such that and Indeed, just take to be the last index with .
Now, in each of the cases, Lemma 2.3 yields a law of length bounded by
[TABLE]
that is valid for those that fall into this particular case. Combining all the cases using Lemma 2.2, we obtain a law of length
[TABLE]
that is valid for all groups of size at most . Again, for details we refer to [24]. ∎
3.2 Reduction to Low-Rank Simple Groups of Lie Type
In fact, the only finite simple groups of Lie type for which the laws constructed in the proof of Proposition 4.1 from [24] do not already satisfy the requirements of Proposition 3.1 are those of the form , and . That is, we reduce the proof of Proposition 3.1 to the following statement.
Proposition 3.2**.**
For all , there exists a word of length:
**
such that if is a finite simple group not of the form , or for some prime power , and , then is a law for .
Roughly speaking, the reason these families provided the bottleneck for the length of laws in [24] was that they are the only families in which a finite simple group may contain an element of large order compared to the order of .
Proof of Proposition 3.1.
First assume to be a finite simple group of Lie type. We refer to the tables from [24] (p.5), which in turn are based on [16]. The tables record such that, if is defined over a field of order ,
(with the implied constants absolute). Moreover by inspection of the tables, we may take in all cases, except for when is of the form , or . Excluding the latter possibility, we have:
[TABLE]
Meanwhile, a classical result of Landau shows that the maximal order of an element of is at most \exp\big{(}O((k\log(k))^{1/2})\big{)}, so also satisfies (1). Thus there exists an absolute constant such that, if is a finite simple group other than , or , then:
Applying Lemma 2.2 to the words for , we obtain a law of length valid simultaneously in all such . Combining (by Lemma 2.2 again) this last law with the law obtained in Proposition 3.2, we obtain the required result. ∎
We will conclude by constructing, for each of the three families , and , a law of the length valid in all groups in the family of order at most , this is Proposition 3.4 and Proposition 3.11. The proof of Proposition 3.2, and hence of Theorem 1.1, is then completed by combining these three laws using Lemma 2.2.
3.3 Short Laws for &
The required result for groups of the form and will be obtained by combining the laws produced by Theorem 1.9 as varies over all sufficiently small prime powers. To this end, we record the following standard consequence of the Prime Number Theorem (see [10]).
Lemma 3.3**.**
For any , the number of prime powers at most is , and the number of these which are proper powers of primes is .
Proposition 3.4**.**
For all , there exists a word of length:
**
such that if is equal to or for some prime power , and , then is a law for .
Proof.
First note that and satisfy the conditions of Theorem 1.9 with , so satisfy laws of length .
Alternatively and without using Theorem 1.9 from [4], we could have studied the orders of elements in and more carefully and noted that the order of each element divides or . Using Lemma 2.2, this implies immediately that there is a law of length satisfied by these groups – enough for us to proceed.
Recall that and have order proportional to , so if is isomorphic to some or and , then . By Lemma 3.3, there are such prime powers .
We may thus apply Lemma 2.2 with and of length (or merely using the alternative argument) to obtain a law of length (or just on the alternative route) valid in all groups of the required form. ∎
3.4 Short Laws for
Observe first that the argument of the previous subsection necessarily fails for . For let be a law for . Then has length (see [12]). Combining the laws by Lemma 2.2 over all values of such that (that is, for ) would yield a law of length , which is unacceptable for our purposes. However, this general strategy will be the way to treat prime powers and primes in the set of bad primes arising from Theorem 2.5 (recall that good and bad primes were defined in Subsection 2.2).
Lemma 3.5**.**
For all there exists a word of length bounded by such that for with is either a proper prime power or a bad prime and satisfying , is a law for .
Proof.
The size of is about so that we have to consider prime powers up to size . The size of the set of proper prime powers and bad primes below is bounded by , see Lemma 3.3. It is well-known that laws of length for exist (see the proof of Proposition 4.1. in [24]). Combining all these laws yields the desired law using Lemma 2.2 of length . ∎
We are now left to deal with groups where is a good prime. We divide the problem and seek separately short words whose associated vanishing sets contain, respectively, generating and non-generating pairs of elements in groups of the form . The strategy in both cases closely parallels that employed in the proofs of Theorems 1.8 and 1.9.
For generating pairs we will use upper bounds on the mixing time of arising from Theorem 2.5. This allows us to give a probabalistic construction of pairs of words, whose evaluation maps have images contained in a common soluble subgroup of (and hence satisfy a short relation). For non-generating pairs we use the classification of subgroups of .
We start by recording an elementary observation from linear algebra.
Lemma 3.6**.**
The number of elements of which are diagonalisable over is .
Since the order of is proportional to , Lemma 3.6 says precisely that an absolutely positive proportion of elements are diagonalisable over . Now consider the subgroup of upper-triangular elements. contains the diagonal subgroup of , so by Lemma 3.6, an absolutely positive proportion of the elements of are conjugate into .
By a Borel subgroup of we shall mean the image, under the natural projection , of a conjugate in of . This is not really the “right” way to define these subgroups, but it is the most convenient for our purposes. The only facts we require about Borel subgroups in the sequel are:
Every Borel subgroup is metabelian;
Every Borel subgroup has index in ;
A positive proportion of the elements of lie in a Borel subgroup
Indeed, and are well-known and is just the summary of the preceding discussion.
We shall also require some facts about free groups. The first of these is standard.
Lemma 3.7** (see for instance [15] Chapter 1).**
Let . Then either
* and commute, and are powers of a common element of , or*
the subgroup of generated by and is isomorphic to , and is a free generating set.
We further recall Kesten’s result on the exponential decay of simple random walks on .
Theorem 3.8** ([18]).**
There exists a constant such that, for any , if is the result of a simple random walk of length on a free generating set for ,
.
The key consequence of these two facts that we shall use is the following, which tells us that with high probability, the outcomes of a pair of random walks on fall into case (ii) of Lemma 3.7, see also [5, Lemma 2.1].
Corollary 3.9**.**
Let be as in Theorem 3.8. Let be the results of two independent simple random walks of length on a free generating set for . Then:
\mathbb{P}\big{[}[u_{l},v_{l}]=1\big{]}\ll l\exp(-\alpha l).
Proof.
It is a basic fact that in for some non-trivial implies that the word length of is at least . Moreover, it follows from Lemma 3.7 that each non-trivial lies in a unique maximal abelian subgroup, which is isomorphic to – generated by some maximal root of . Thus, for any fixed of length at most , there is a maximal root and with of length at most implies for some satisfying by Lemma 3.7. Hence, as long as is non-trivial, the probability that satisfies is bounded by . However, the probability that is trivial is bounded by . This proves the claim. ∎
Finally, we record a taxonomy of the subgroups of , which follows from the classical results of Dickson [7].
Theorem 3.10** (Dickson).**
Let be a prime and let be a proper subgroup of . Then one of the following holds.
* is metabelian;*
.
We have arrived at the heart of the proof – our new approach to treat the crucial case .
Proposition 3.11**.**
For all , there exists a word of length:
**
such that if is equal to for some prime power , and , then is a law for .
Proof.
Let be the results of independent lazy random walks of length on a free generating set for , where is a sufficiently large absolute constant. Fix (for the time being) a good prime such that and a generating pair .
For each , the probability that lies in a Borel subgroup is at least a positive absolute constant , by Lemma 3.6 and Corollary 2.6 (since we are assuming is sufficiently large, Corollary 2.6 is indeed applicable).
Suppose does indeed lie in a Borel subgroup . By independence of and , and applying Corollary 2.6 again, the probability that lies in is at least , for an absolute constant . Thus the probability that and do not lie in a common Borel subgroup is at most
.
By independence of the , the probability that for every the pair fail to lie in a common Borel subgroup is at most . Setting , for a sufficiently large constant, we have
for a constant, which we may take to be arbitrarily large.
The number of possible generating pairs for is no more than , while the number of possibilities for is bounded by using Lemma 3.3. Thus, taking a union bound over all possible good primes and , the probability of the event: “for every there exists a generating pair for some of order at most such that the pair fails to lie in a common Borel subgroup” is at most:
[TABLE]
We will choose .
Meanwhile, by Corollary 3.9, for fixed the probability that and commute (in ) is , so taking a union bound, the probability that there exists such that and commute is:
[TABLE]
(of course, we could get a better bound by using the independence of the pairs , but (3) will prove more than adequate for our current needs).
Combining (2) and (3), the probability of the event “either there exists such that and commute or for every there exists a generating pair for some of order at most such that the pair fails to lie in a common Borel subgroup” is at most:
for , and larger than an absolute constant.
Therefore there exist deterministically words with of length at most such that no pair commutes in , and such that for every of order at most , where is a good prime, and every generating pair , there exists for which lie in a common Borel subgroup.
Consider the word . It is easy to see that is a non-trivial element of of length . Thus, on the one hand is non-trivial for every , since by Lemma 3.7 freely generate a group isomorphic to . On the other hand, since every Borel subgroup of is metabelian, for every of order at most , where is a good prime, and every generating pair , there exists for which
[TABLE]
Applying Lemma 2.2 to the words , we obtain a word of length bounded by
[TABLE]
which vanishes at all such generating pairs .
Finally we produce a word of bounded length whose vanishing set contains all non-generating pairs. This will be achieved using Theorem 3.10. Indeed, if generate a metabelian subgroup or a group of bounded order (conclusions (i) or (ii) of Theorem 3.10), then lies in the vanishing set of a word of bounded length . Applying Lemma 2.2 one more time, to and the law obtained from Lemma 3.5 to cover also the case when is a proper prime power or a bad prime, we have the required result. ∎
4 Open Problems
We end this article with various possibly approachable open problems. At the moment, our constructions (or rather proof of existence) of laws are inherently random and say only very little about the shape of these elements in the free group.
Problem 4.1**.**
Give an explicit construction of short laws holding in:
symmetric groups;
finite simple groups of Lie type;
all groups of order at most (improving on Theorem 1.6).
It remains plausible that there could exist laws of polynomial length for . Note that assuming Babai’s Conjecture saying that should be (see [1]), Kozma and the second named author showed in [20] that laws of of length exist. Maybe a refined argument could also prove a polynomial bound assuming Babai’s Conjecture. Based on existing polynomial diameter estimates for random generators of [14], words in of length can be constructed that are laws for almost all of , see [26]. Following [26], there exist similar improved bounds for almost laws of finite simple groups of Lie type. Indeed, for example the group satisfies an almost law of length . This will be explained in more detail in [4, 26].
Problem 4.2**.**
Give new upper bounds on the length of the shortest law for . Can there be laws of length bounded by ?
On the other side, lower bounds are equally interesting.
Problem 4.3**.**
Give new lower bounds on the length of the shortest law for , say of the form or .
Note that for , so that a sharp complementary bound in Theorem 1.1 following the strategy by Kassabov-Matucci (see ([17, Remark 9] and the remarks after Theorem 1.2), would potentially also prove with respect to the previous problem. As regards a possible result complementary to Theorem 1.1, even a bound of would be an improvement to the state of the art.
Problem 4.4**.**
Give an improved upper bound on , smaller than , for .
A strengthening of Problem 4.3 is the following, which already seems very achievable.
Problem 4.5**.**
Find a generating set of such that the associated Cayley graph has girth .
Again, the best construction of generating sets of with respect to Problem 4.5 is random in nature and yields a bound on the girth of , see [11, Thm. 3]. We are not aware of an explicit family of sets of generators satisfying this lower bound on the girth. The authors of [11] conjecture a positive answer to Problem 4.5 for random generators.
Needless to say, the exact determination the of the length and shape of shortest laws for symmetric groups or all finite group up to a certain size remains an outstanding open problem in finite group theory.
Acknowledgements
The first named author wishes to thank the Institut für Geometrie of the Technische Universität Dresden for providing him with a hospitable welcome on the occasion of his visit in June 2016, during which the essence of the proof of Theorem 1.1 was first distilled. This research was supported by ERC Starting Grant No. 277728 and ERC Consolidator Grant No. 681207.
We thank the unknown referee for interesting comments that led us to include a refined version of our main result.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Babai and Á. Seress. On the diameter of Cayley graphs of the symmetric group. J. Combinatorial Theory-A 49 (1988), 175-179.
- 2[2] K. Bou-Rabee. Quantifying residual finiteness. J. Algebra 323 (2010), 729-737
- 3[3] K. Bou-Rabee and D.B. Mc Reynolds. Asymptotic growth and least common multiples in groups. Bull. Lond. Math. Soc. 43 (2011) no. 6, 1059-1068
- 4[4] H. Bradford and A. Thom. Short laws for finite groups of Lie type. In preparation.
- 5[5] E. Breuillard and A. Gamburd. Strong uniform expansion in SL(2,p). Geom. Funct. Anal. 20 (2010), no. 5, 1201–1209.
- 6[6] P. Diaconis and L. Saloff-Coste. Comparison techniques for random walk on finite groups. Ann. Probab. 21, Issue 4 (1993), 2131-2156
- 7[7] L.E. Dickson. Linear groups, with an Exposition of the Galois Field Theory. Teubner, Leipzig, 1901.
- 8[8] O. Dinai. Growth in SL 2 subscript SL 2 \operatorname{SL}_{2} over finite fields. J. Group Theory 14, Issue 2 (2011), 273–297
