Surface Words are Determined by Word Measures on Groups
Michael Magee, Doron Puder

TL;DR
This paper characterizes words in free groups by the probability measures they induce on compact groups, showing that only surface words and commutators produce identical measures across all such groups.
Contribution
It proves a converse theorem identifying surface words and commutators uniquely by their induced measures on all compact groups.
Findings
Unique characterization of surface words and commutators by their measures
Analysis of word measures on unitary, orthogonal, and symmetric groups
Development of new methods for studying word measures on generalized symmetric groups
Abstract
Every word in a free group naturally induces a probability measure on every compact group . For example, if is the commutator word, a random element sampled by the -measure is given by the commutator of two independent, Haar-random elements of . Back in 1896, Frobenius showed that if is a finite group and an irreducible character, then the expected value of is . This is true for any compact group, and completely determines the -measure on these groups. An analogous result holds with the commutator word replaced by any surface word. We prove a converse to this theorem: if induces the same measure as on every compact group, then, up to an automorphism of the free group, is equal to . The same…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\sidecaptionvpos
figurec
Surface Words are Determined by Word Measures on Groups
Michael Magee and Doron Puder
Abstract
Every word in a free group naturally induces a probability measure on every compact group . For example, if is the commutator word, a random element sampled by the -measure is given by the commutator of two independent, Haar-random elements of . Back in 1896, Frobenius showed that if is a finite group and an irreducible character, then the expected value of is . This is true for any compact group, and completely determines the -measure on these groups. An analogous result holds with the commutator word replaced by any surface word.
We prove a converse to this theorem: if induces the same measure as on every compact group, then, up to an automorphism of the free group, is equal to . The same holds when is replaced by any surface word.
The proof relies on the analysis of word measures on unitary groups and on orthogonal groups, which appears in separate papers, and on new analysis of word measures on generalized symmetric groups that we develop here.
Contents
1 Introduction
Let be the free group on generators , and let be any finite, or more generally, compact group. Every word induces a map, called a word map,
[TABLE]
defined by substitutions. For example, if , then . The push-forward via this word map of the Haar probability measure (uniform measure in the finite case) on is called the -measure on . Put differently, for each , substitute with an independent, Haar-distributed random element of , and evaluate the product defined by to obtain a random element in sampled by the -measure. We say the resulting element is a -random element of .
Measures induced by surface words
The study of word measures in groups has its seeds in the 1896 work of Frobenius [Fro96]. Let be111Throughout this paper, the letters , , and also and denote different generators in the same basis of . the commutator word. Frobenius shows that the -measure on a finite group is given by
[TABLE]
where marks the set of irreducible characters of and is the identity element of . As word measures on finite groups are class functions, this is equivalent to the fact that for every , the expected value of under the -measure is . In 1906, Frobenius and Schur [FS06] showed that the -measure on a finite group is given by
[TABLE]
where is the Frobenius-Schur indicator of :
[TABLE]
The statement is equivalent to that the expected value of an irreducible character under the -measure is . In fact, these two results hold for any compact group and can be generalized to any surface word:
Theorem 1.1** (Frobenius, Frobenius-Schur).**
Let be a compact group and an irreducible character of . Then,
For , the expected value of under the -measure is . 2. 2.
For , the expected value of under the -measure is .
Of course, here are distinct letters. See Section 2 for some details about the proof.
Do word measures determine the word?
For write if there is \theta\in\mathrm{Aut}\big{(}\mathrm{\mathbf{F}}_{r}\big{)} with . It is easy to see that applying elementary Nielsen transformations on a word does not change the measures it induces on groups (e.g., see [MP15, Fact 2.5]), and thus
Fact 1.2**.**
If then and induce the same measure on every compact group.
For example, and so the expected value of under the -measure is . More generally, every non-trivial word in which every letter appears exactly twice is mapped by \mathrm{Aut}\big{(}\mathrm{\mathbf{F}}_{r}\big{)} to one of the words in Theorem 1.1. By the classification of surfaces, the suitable word is determined by the homeomorphism type of the surface obtained from gluing the sides of a -gon according to the letters of (see, for instance, [Sti12, Chapter 1.3]). The expected value of irreducible characters is then described by the suitable case in the theorem.
Several mathematicians, including A. Amit, T. Gelander, A. Lubotzky, A. Shalev and U. Vishne, conjecture that the converse is also true and that every other pair of words is “measurably separable”:
Conjecture 1.3**.**
Let . If and induce the same measure on every compact group, then .
This conjecture appears in the literature in a stronger form, where and are only assumed to induce the same measure on every *finite *group – see [AV11, Question 2.2], [Sha13, Conjecture 4.2] and [PP15, Section 8].
Conjecture 1.3 seems to be extremely challenging. Our focus here, instead, is on special cases, where is some fixed word. A case which attracted considerable attention was that of primitive words, namely the \mathrm{Aut}\big{(}\mathrm{\mathbf{F}}_{r}\big{)}-orbit containing the free generators of . This special case was settled by the second author and Parzanchevski [PP15], who showed that induces the uniform measure on the symmetric group for all if and only if is primitive. To the best of our knowledge, the only \mathrm{Aut}\big{(}\mathrm{\mathbf{F}}_{r}\big{)}-orbits for which the expected value of irreducible characters have a simple explicit formula for every compact group, are the primitive case (where all characters but the trivial one have expectation zero) and surface words as in Theorem 1.1 – see also [PS14] and the references therein. In this sense, surface words are a natural next case to consider. And, indeed, we settle Conjecture 1.3 when is a surface word:
Theorem 1.4**.**
Let .
If induces the same measure as on every compact group, then (, and) . 2. 2.
If induces the same measure as on every compact group, then (, and) .
Remark 1.5*.*
Let us mention another new result in the same spirit as Theorem 1.4. Let be either a primitive power, say , or any power of the simple commutator . In a forthcoming paper [HMP19], Hanani, Meiri and the second author show that if a word induces the same measure as on every finite group, then . In the case of the simple commutator , this strengthens Theorem 1.4, as it only relies on measures on finite groups. On the other hand, unlike the current paper where we use specific families of groups (\mathcal{U}\big{(}N\big{)} and generalized permutation groups), the finite groups relied upon in [HMP19] are not explicit.
Word measures on , on ,
and on generalized symmetric groups
Our proof of Theorem 1.4 relies on the measures induced by words on unitary groups, on orthogonal groups, and on generalized symmetric groups. While we study the former two in separate papers, we study the latter one here. In fact, it is enough to consider the expected value of the standard character, namely, the expected value of the trace, in all three families of groups, of the “defining representation”222Similarly, the standard character of the symmetric group was sufficient for the result in [PP15] considering the primitive orbit.. We denote the expected value of the trace of a -random element in a matrix group by ††margin:
.
To present our results, we introduce some notation. Recall that given a free group , the commutator subgroup is the kernel of the homomorphism mapping every generator to a different element of the standard generating set of . Similarly, for , let be the cyclic group of order , and denote by††margin:
[TABLE]
the kernel of the homomorphism mapping every generator to a different element of a standard generating set of . Note that even though the homomorphism depends on the choice of basis of , its kernel does not, and it consists of all words where divides the total exponent of every generator. For efficiency of presenting our results, we also denote ††margin:
.
Is it a standard fact that is a product of squares if and only if . Likewise, is a product of commutators if and only if .333These facts about and do not generalize to for . We can now extract from [MP19a, MP19b] the results we need here.
Definition 1.6**.**
Let . The commutator length of is defined as††margin:
[TABLE]
In particular, if , then . Similarly, the square length of is defined as††margin:
[TABLE]
In particular, if , then .
Theorem 1.7**.**
[MP19a, Corollary 1.8]** Fix and consider the measure it induces on the unitary groups . The expected trace of a -random unitary matrix in satisfies
[TABLE]
Theorem 1.8**.**
[MP19b, Corollary 1.10]** Fix and consider the measure it induces on the orthogonal groups . The expected trace of a -random orthogonal matrix in satisfies
[TABLE]
In the current paper we obtain similar results for generalized symmetric groups. Specifically, let ††margin:
denote the wreath product of with , namely, this is the subgroup of consisting of matrices with exactly one non-zero entry in every row and column and all non-zero entries having absolute value . Likewise, for , let ††margin:
be the wreath product444The group is sometimes denoted – see, for example,
https://en.wikipedia.org/wiki/Generalized_symmetric_group. of , the cyclic group of order , with . This is the subgroup of where all non-zero entries are -th roots of unity. Note that when , the group is the signed symmetric group, known also as the hyper-octahedral group or the Coxeter group of type .
The first observation we make is that the expected value under the -measure of the trace of any of these groups is given by a rational function in :
Lemma 1.9**.**
Fix and let for some fixed or . Then there is some rational function , such that for every large enough , .
For example, if and , then {\cal T}r_{w}\left(\text{C_{2}\wr S_{N}}\right)=\frac{3N-4}{N\left(N-1\right)} for all . The proof of this lemma appears in Section 3.1. We stress that a statement of this sort is not surprising: a similar statement is known to hold for various families of characters of the groups when [Nic94, LP10], when [MP19a], when or [MP19b], or when is the general linear group over the finite field [PW19].
Our main result with respect to word measures on these generalized symmetric groups revolves around the leading term of the rational expression from Lemma 1.9. The exponent of the leading term is described by the number in the following definition.
Definition 1.10**.**
Let and or . Denote††margin:
* *
[TABLE]
If the set in the right hand side of (1.1) is empty, we set .
In words, we look for subgroups of smallest rank such that contains , and take their Euler characteristic which equals . It is easy to see that whenever , and so if is not , it is in fact at least . Thus, in , the function takes values in , where and if and only if for some .
To illustrate, is not a proper power, so . For , for (as well as for and for ) and so . For or , . As another example, consider the orientable surface word . Then for every , and one can show that .
Theorem 1.11**.**
Let and or . If , consider a -random matrix in the group , and if consider a -random matrix in . Then
[TABLE]
where is a natural number counting the number of subgroups with and . In particular, vanishes if .
Namely, the coefficient in (1.2) counts the number of the subgroups demonstrating the value of determined in (1.1). This number is always finite – see Section 3.1. In fact, we have a more detailed result which is required for the proof of Theorem 1.4 – see Theorem 3.6 below. Theorem 1.11 is similar in spirit to [PP15, Theorem 1.8], where is analyzed. The group can be regarded as the case in the current terminology. The analog there of is the “primitivity rank” of . Moreover, the more detailed version of Theorem 1.11, Theorem 3.6 below, relies on much of the analysis from [PP15]. A crucial difference between the current groups and is that the standard defining -dimensional representation is reducible for but irreducible for the groups considered in Theorem 1.11. We further explain these connections in Sections 2 and 3.1.
Overview of the proof
The proof of Theorem 1.4 uses both the measures on the classical groups and , and the measures on generalized symmetric groups. The roles they play are somewhat complement. Let us illustrate these complementing roles by considering the commutator length of a word. Let , and consider the measure it induces on . If , Theorem 1.7 yields an upper bound on the commutator length: .
In contrast, if then , where which has rank at most and thus . Hence if , we deduce the lower bound .
If induces the same measure as on every compact group, then, in particular, . The preceding two paragraphs then show that . Moreover, they show the group from the preceding paragraph has rank exactly , and so are free words, namely, there is no non-trivial relation on them. Together with Theorem 3.6 below (a strengthening of Theorem 1.11), it is possible to deduce that are, in fact, part of a basis of , and therefore .
The paper is organized as follows. Section 2 contains some background regarding measures induced by surface words, as well as background regarding word measures on and some results from [PP15] we use here. It also introduces the notions of algebraic extensions and of core graphs. In Section 3 we analyze word measures on generalized symmetric groups and prove Lemma 1.9, Theorem 1.11, and the stronger Theorem 3.6. We prove Theorem 1.4 in Section 4 and conclude with some open questions in Section 5.
Notation
We use the following asymptotic notation. Let be two functions defined on the positive integers. We write
- •
if there is a constant such that for every large enough ,
- •
if there is a constant such that for every large enough , and
- •
if both and .
2 Preliminaries
Measures induced by surface words
We begin this section with some remarks regarding the proof of Theorem 1.1. We have already mentioned a reference [Fro96] for the case where is finite and . In fact, this case is at the level of an exercise for an arbitrary compact group , as long as one is aware of the following classical facts: matrix coefficients of unitary realizations of all irreducible representations of a compact group form an orthogonal basis for the space of complex functions on , and the -norm of a matrix coefficient of a -dimensional irreducible representation is .
The case of and finite was first proved in [FS06]. For an English proof see [Isa76, Chapter 4]. Although the book [Isa76] deals with finite groups, this proof applies just as well to general compact groups.
Finally, for , note that when the letters appearing in are distinct from those in , then the -measure on is the convolution of the -measure and the -measure, and using the fact that a -measure is always invariant under conjugation, we get that for every irreducible character of . This explains the complete statement of Theorem 1.1. See also [PS14] and the references therein.
Expected traces in
Next, we extract some terminology and results from [PP15] that are needed here. That paper analyzes , the expected trace of a -random permutation in , where the permutation is thought of as an matrix. In other words, it studies the expected number of fixed points in a -random permutation. We remark that word measures on alone do not suffice to establish Theorem 1.4: all irreducible characters of are afforded by real representations, and so the words and induce the exact same measure on for all .
Let denote the length of the reduced form of . A first observation in the study of , going back to Nica [Nic94], is that for , is a rational expression in . Unlike the other groups mentioned above, this -dimensional representation of is reducible: it is the sum of the trivial representation and an -dimensional irreducible representation. It is thus not surprising that the rational expression for has a contribution of coming from the trivial representation, and the interesting part is the deviation from . This deviation is measured by the “primitivity rank” of a word , denoted , which was first introduced in [Pud14]. Recall that an element of a free group is called primitive if it belongs to some basis (free generating set). The primitivity rank of is the following number:
[TABLE]
The functions defined above are closely related to . In fact, one can give a single definition which applies to all these functions simultaneously. Indeed, for define
[TABLE]
Now as , and for , because all elements of are automatically non-primitive in . These different functions of words also share some properties. For instance, for all , takes values in – this555To be precise, is never zero: a cyclic group has a trivial commutator subgroup. was explained above for , and for , this is [Pud14, Corollary 4.2]. The role of in the study of is also analogous to the role of in Theorem 1.11:
Theorem 2.1**.**
[PP15, Theorem 1.8]** Let . Then
[TABLE]
*where is the number of subgroups of rank containing as a non-primitive element.
In particular, for all if and only if , which holds if and only if is primitive.*
Random subgroups in
The results in [PP15] apply not only to random elements of with measures induced by words , but more generally, to random subgroups of with measures induced by subgroups . Given , sample a random subgroup of by choosing a homomorphism uniformly at random and considering . When , the resulting random subgroup is the one generated by a -random permutation.
If are free groups, we say that is a free extension of , or that is a *free factor *of , and denote ††margin:
, if some (and therefore every) basis of can be extended to a basis of . Clearly, for , is primitive in if and only if . Hence, the following notion of primitivity rank for subgroups generalizes (2.1). For , the primitivity rank of is defined to be
[TABLE]
We can now state the more general form of Theorem 2.1.
Theorem 2.2**.**
[PP15, Theorem 1.8]** Let be a finitely generated subgroup, and let be a uniformly random homomorphism. The expected number of points in fixed by all elements of the subgroup is
[TABLE]
*where is the number of subgroups satisfying and containing but not as a free factor.
In particular, this value is for all if and only if , which holds if and only if .*
Algebraic extensions
We now describe the notion of *algebraic extensions *in free groups which is used in Section 3.2 below. Let be a free group and two subgroups. We call an algebraic extension of , denoted ††margin:
, if and only if and there is no intermediate proper free factor of , namely, if whenever , we must have . To give a sense of this notion, we mention some of its properties: algebraic extensions form a partial order on the set of subgroups of ; for every extension of free groups there is a unique intermediate subgroup satisfying ; and every finitely generated subgroup of has finitely many algebraic extensions. See the survey [MVW07] or Section 4 of [PP15] for more details.
In the language of algebraic extensions, is the smallest rank of a proper algebraic extension of , and is the smallest rank of a proper algebraic extension of .
Core graphs
Recall that we have a fixed basis for . Call it . Associated with every (finitely generated) subgroup of is a rooted, directed and edge-labeled (finite) graph, where the edges are labeled by . This graph, denoted ††margin:
, is called the *(Stallings) core graph *of and was introduced in [Sta83]. It can be obtained from the Schreier graph depicting the right action of on , the right cosets of in , by trimming all “hanging trees”. For more details we refer the reader to [PP15, Section 3]. We illustrate the concept in Figure 2.1.
Let us mention here a few basic facts about core graphs and some further notations that we will need below. The elements of correspond exactly to the non-backtracking closed paths at the root of . The labels and directions of the edges give rise to a graph-morphism to the bouquet of directed loops, labeled by , and this morphism is always an immersion. In other words, every vertex of has at most one outgoing edge with a given label, and at most one incoming edge with a given label.
A morphism of rooted, directed and edge labeled graphs from to exists if and only if . When this morphism is surjective, we say that “-covers” , and denote ††margin:
. This relation constitutes a partial order on the set of finitely generated subgroups of , a partial order which depends on the choice of basis . The easiest way to explain why there is a rational expression for and, moreover, to compute this formula explicitly, is by considering the finite set††margin:
[TABLE]
of subgroups which are -covered by the subgroup (see [Pud14, Section 5]). We shall use these graphs below to prove Lemma 1.9 and Theorem 1.11.
3 Expected
trace in generalized symmetric groups
3.1 Rational expressions
and their leading term
Fix and let be one of the groups (), , or merely , realized as complex matrices. If (here and ), we analyze the following expression:
[TABLE]
where are independent Haar-uniform elements of . In all cases considered but this is the uniform measure on . The Haar measure on is given by a uniform distribution on to determine the non-zero entries and independent Lebesgue measure on the unit circle for every non-zero entry of the matrix.
Consider an assignment of values in to the indices . Every assignment induces a partition on , where two indices and belong to the same block if and only if . Such a partition can be described by a rooted, directed, edge-labeled graph as follows: the vertices correspond to the blocks in the partition on , the root is the block containing , and for every there is a directed edge labeled connecting the block of with the block of , and directed towards the block if or towards the block in case . There is at most one -edge directed from a vertex to a vertex . For example, if and the assignment is , the graph is the following:
[TABLE]
However, this assignment contributes zero to the summation in (3.1): in all groups considered here, there is exactly one non-zero entry in every column and every line, yet the assignment leads to the integral over the monomial , which involves two entries from the same row of and is thus identically zero. This happens exactly when the graph associated with the assignment has a vertex with two out-going edges with the same label, or a vertex with two incoming edges with the same label. This shows that we can restrict our attention to assignments associated with graphs which are core graphs. Moreover, these graphs are precisely the graphs which are -covered by , namely the graphs for .
We can now group together all assignments leading to the same core graph , and see that the contribution of all these assignments is given by a rational function in (which depends on the family of groups we consider). Because the set is finite, this leads to a rational expression for for families of generalized symmetric groups. The number of assignments associated with a given is , where denotes the number of vertices in . The probability that the uniformly random has non-zero entries which correspond to a given assignment associated with is precisely , where is the number of -edges in . Overall, for , if for all , the contribution of to (3.1) is666We use the notation which is used for this expression in [PP15].††margin:
[TABLE]
times the expected value of the product of non-zero entries of involved in the monomial in (3.1). In the case of , these non-zero entries are identically , and so, as depicted in [Pud14, Section 5], for all large enough,
[TABLE]
For example, in the case of , there are precisely subgroups in , and the total contribution is , holding for . The detailed computation for is depicted in [Pud14, Page 53].
For the other groups considered here, the non-zero entries are not identically one and actually have zero expectation. So often, the contribution of an assignment to (3.1) vanishes even when the assignment does correspond to some core graph. For example, in the case of the group , an assignment gives a non-zero contribution if and only if it corresponds to a core graph, *and *every entry is repeated in the monomial some multiple of times, when we count with signs. E.g., if the entry appears in the monomial in (3.1), it must appear a total number of 0 times as in , a total number of times as in , and so on. For uniformly random and every , conditioning on that is non-zero, the expected value of is . Fortunately, this property is a feature of the core graphs and not only of the particular assignment: by definition, if and only if and every edge of is covered by some edge of in the graph morphism . In other words, the closed path at the root of which corresponds to must go through every edge of the graph. The restriction that every non-zero entry repeats some multiple of times in the monomial (counted with signs), is equivalent to that the path of goes through every edge a total signed number of times which is a multiple of . This generalizes to the following explicit form of Lemma 1.9:
Lemma 3.1**.**
Let . For every denote
[TABLE]
Then for every large enough ,
[TABLE]
Likewise, denote
[TABLE]
Then for every large enough ,
[TABLE]
Although this is not explicit from the notation, note that because the set depends on the choice of basis , so do the sets and .
As an example, if , six of the seven subgroups in belong to none of (). The remaining subgroup is itself, with core graph \vphantom{\Big{|}}\Gamma=\lx@xy@svg{\hbox{\raise 0.0pt\hbox{\kern 31.09259pt\hbox{\ignorespaces\ignorespaces\ignorespaces\hbox{\vtop{\kern 0.0pt\offinterlineskip\halign{\entry@#!@&&\entry@@#!@\cr\crcr}}}\ignorespaces{\hbox{\kern-4.8889pt\raise 0.0pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 1.0pt\raise 0.0pt\hbox{\textstyle{\otimes\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces}}}}}}}}\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}{}{}{}{{}{{}{{}{{}{{}{{}{{}}{}{{}}{}{{}{{}}{}{{}{{}}}}}}}}}}{}\ignorespaces\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{{}{}{{}}{{}{}{}{}}{}}}}\ignorespaces{}\ignorespaces{}{}{}{}{{}{{}}}\ignorespaces\ignorespaces{\hbox{\kern-31.09259pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise-0.8625pt\hbox{\scriptstyle{x_{1}}}}}\kern 3.0pt}}}}}}\ignorespaces{}{}{}{}{{}{{}}{}{{}}{}{{}}{}{{}{{}}{}{{}}{}{{}}{}{{}{{}{{}}{}{{}}{}{{}{{}}}}}}}{\hbox{\kern-4.88779pt\raise 4.21596pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{{}{}{}{{}}{{}{}{}{}\lx@xy@spline@}{}}}}\ignorespaces{}\ignorespaces\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{{}{}{{}}{{}{}{}{}}{}}}}\ignorespaces{}\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces\ignorespaces{}{}{}{}{{}{{}{{}{{}{{}{{}{{}}{}{{}}{}{{}{{}}{}{{}{{}}}}}}}}}}{}\ignorespaces\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{{}{}{{}}{{}{}{}{}}{}}}}\ignorespaces{}\ignorespaces{}{}{}{}{{}{{}}}\ignorespaces\ignorespaces{\hbox{\kern 19.0919pt\raise 0.0pt\hbox{{}\hbox{\kern 0.0pt\raise 0.0pt\hbox{\hbox{\kern 3.0pt\hbox{\hbox{\kern 0.0pt\raise-0.8625pt\hbox{\scriptstyle{x_{2}}}}}\kern 3.0pt}}}}}}\ignorespaces{}{}{}{}{{}{{}}{}{{}}{}{{}}{}{{}{{}}{}{{}}{}{{}}{}{{}{{}{{}}{}{{}}{}{{}{{}}}}}}}{\hbox{\kern 4.88779pt\raise 4.21596pt\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{\lx@xy@tip{1}\lx@xy@tip{-1}}}}}}\ignorespaces\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{{}{}{}{{}}{{}{}{}{}\lx@xy@spline@}{}}}}\ignorespaces{}\ignorespaces\hbox{\hbox{\kern 0.0pt\raise 0.0pt\hbox{{}{}{{}}{{}{}{}{}}{}}}}\ignorespaces{}\ignorespaces}}}}\ignorespaces, which belongs to but does not belong to for . Thus, for every , whereas for every and . (Note how this agrees with Theorem 1.1.)
To say how large should be, one needs to go over the elements of . However, as every edge in is covered at least twice (in the same direction or in different directions), the formulas in Lemma 3.1 holds at least for .
Let be a subgroup containing . Our next observation is that the conditions above regarding how many times traverses every edge of are, in fact, algebraic:
Lemma 3.2**.**
Let and be a subgroup containing , and let . The number of times, counted with signs, that traverses every edge in is a multiple of (or [math] if ), if and only if .
Proof.
Recall that if and only if when is written as a word in a fixed but arbitrary basis, the total exponent of every generator, counted with signs, is zero modulo (or zero if ). Let be any spanning tree in the core graph . There are edges outside the tree, and after an arbitrary orientation of these edges, we obtain a basis for : the element associated with the oriented edge is the one corresponding to the closed path which goes from the root of to the origin of through , traverses , and returns to the root through . Recall that every element of corresponds to a closed, non-backtracking path at the base-point of , and to write this element in the basis we have just constructed, we simply keep track of each time the corresponding closed path traverses one of the edges outside the spanning tree. Now, if traverses every edge of a multiple of times, then any choice of spanning tree shows that .
Conversely, assume that , and let be an edge of . If is not a bridge (namely, not a separating edge the removal of which disconnects the graph), then there is a spanning tree not containing and thus traverses a multiple-of- times. If is a bridge, then every closed path traverses it in a “balanced” fashion, namely, the same number of times in each of the two directions. ∎
Corollary 3.3**.**
For ,
[TABLE]
The subgroups of minimal rank in coincide with the subgroups of minimal rank among those containing in their “-kernel”:
Lemma 3.4**.**
For and , the subgroups of minimal rank in are precisely
[TABLE]
In particular, the number of subgroup in the set (3.5) is finite.
Proof.
First, by Corollary 3.3, every group satisfies , and so every subgroup in has rank at least . if , then both sets considered in the lemma are empty.
Assume , and let satisfy and . We claim that – this would show that the minimal rank of subgroups in is precisely and that the two sets considered are identical. Indeed, assume by contradiction that . Then the morphism is not surjective, and the image of in is a subgraph which is the core graph of some , and in particular . Moreover, is then a proper free factor of and thus has smaller rank than . This is impossible as it contradicts the definition of . Thus .
Finally, as is finite, so is and therefore so is the set in (3.5). ∎
We can now complete the proof of Theorem 1.11 and show that for (when ) or G\left(N\right)=$${\cal T}r_{w}\left(C_{m}\wr S_{N}\right) (for ), with the size of the set in (3.5).
Proof of Theorem 1.11.
Let and or accordingly. The summand corresponding to in (3.3) or in (3.4) has leading term , and so the summand is . By Lemma 3.4, there are precisely elements in of the minimal rank and all others have larger rank. Therefore
[TABLE]
∎
3.2 The second term of the rational expressions
Lemma 1.9 shows that for generalized symmetric groups , the expected trace is given by a rational expression in , and Theorem 1.11 gives an algebraic interpretation for the leading term of this expression. We now want to strengthen Theorem 1.11 and show that the rational expression does not only tell us about the subgroup of minimal rank with the property that , but also about the “second” minimal group. If there is more than one group of minimal rank, this is already captured by Theorem 1.11. But we want to deal also with the case that there is a unique subgroup as above of minimal rank.
To define the second minimal group, we do not rely on the set which depends on the given basis . Instead, we consider only algebraic extensions of which also contain in their “-kernel”. Namely, for and , denote††margin:
[TABLE]
If is an algebraic extension of then -covers for every basis . Indeed, if and does not -cover then the image of in constitutes an intermediate subgroup which is a proper free factor of . In particular, . Moreover, all subgroups of minimal rank with are algebraic extensions of , because if and , then clearly , so cannot be of minimal rank unless it is an algebraic extension. Thus
[TABLE]
Definition 3.5**.**
Let and . If , define and . Otherwise, let be an arbitrary subgroup of minimal rank, and define
[TABLE]
Also, define to be the number of subgroups in of minimal rank, namely, with .
Note that the numbers and do not depend on the arbitrary subgroup . If the constant from Theorem 1.11 is at least two, then and . If , then .
Theorem 3.6**.**
Fix and let in which case , or in which case . Then
[TABLE]
The point of Theorem 3.6 is that one can always read off from the expression for the ranks of the two subgroups of minimal rank in . In particular, we get the following corollary which we use below in the proof of Theorem 1.4:
Corollary 3.7**.**
Fix and let in which case , or in which case . Then is of the form (for some ) if and only if .
Proof of Theorem 3.6.
We claim that
[TABLE]
Indeed, as mentioned in Section 2 above, for every extension of free groups , there is a unique intermediate subgroup such that (see [PP15, Claim 4.5] for the proof in the finitely generated case, which is the case we need here). If and is the unique intermediate subgroup then and so . Moreover, in this case because the surjective core-graph morphism factors as , so must too be surjective. On the other hand, if and then and as “” is transitive, . In addition, . Hence .
Thus, we get from Lemma 3.1 that if for an arbitrary finitely generated subgroup we denote
[TABLE]
then
[TABLE]
Now let be an arbitrary finitely generated subgroup. Because , and all the subgroups in (3.6) satisfy with equality if and only if , it is clear that
[TABLE]
Recall Theorem 2.2 above (originally [PP15, Theorem 1.8]), by which the expected number of points in fixed by all elements of in a uniformly random is
[TABLE]
where is the number of subgroups of rank and which contain but not as a free factor. Parallel to (3.2), which gives a formula for the expected number of fixed points of a single word, the expected number of common fixed points of can be computed by
[TABLE]
(recall that ) – see [Pud14, Section 5]. As in the case of single words, is precisely the smallest rank of a proper algebraic extension of . The primitivity rank of is sometimes smaller than or equal to , but in the cases where , Theorem 2.2 and (3.9) can be interpreted as follows: in the formula (3.10) there is a leading term of coming from , but then all contributions coming from free extensions of in , together with , cancel out in all terms of order . (The positive coefficient of comes from algebraic extensions of , not from free extensions.) Hence,
[TABLE]
We may assume that , for otherwise Theorem 3.6 follows immediately from Theorem 1.11. Let be the unique algebraic extension with of rank , and let be those of rank . Because algebraic extensions is a transitive relation and , every proper algebraic extension of is also in and so its rank is at least . In particular, for every . From (3.8) and (3.11) it now follows that
[TABLE]
Plugging these expressions in (3.7) completes the proof of Theorem 3.6. ∎
A nice corollary of Theorem 1.11 is the following. Recall from Definition 1.6 that denotes the commutator length of and denotes the square length of .
Corollary 3.8**.**
We have
[TABLE]
In particular,
[TABLE]
Proof.
If then there are words such that . Let , and note that and and that . Hence and likewise .
Similarly, if then there are words such that . Let , and note that and that . Thus . ∎
4 Surface words and the proof of Theorem
4.1 Orientable surface words
First, we prove the first part of Theorem 1.4, which deals with the orientable surface word .
Proof of Theorem 1.4, orientable
case.
Assume that some word induces the same measure as on every compact group . In particular, the expected value of any irreducible character of under the -measure is . In the case of the unitary groups , the trace is an irreducible -dimensional character and thus . From Theorem 1.7 it now follows that . In particular, .
On the other hand, the trace of is also an -dimensional irreducible character, and so . From Theorem 1.11 it follows that . If then, as in Corollary 3.8 and its proof,
[TABLE]
where , and we obtain that .
Thus . Moreover, all the weak inequalities in (4.1) are equalities, and . This shows that has minimal rank among the subgroups with , and so . In addition, is a free factor of : otherwise, it would have a non-trivial algebraic extension , and then and , in contradiction to Corollary 3.7 which applies in this case.
As , the words are free and constitute a basis for , and as , they are part of a basis for . Therefore and . ∎
Remark 4.1*.*
We mentioned above that Conjecture 1.3 sometimes appears in the literature in a stronger version, where only finite groups are involved rather than all compact groups. In our proof of the conjecture for the case of orientable surface words, we used two compact infinite groups: and . However, the latter can be easily replaced by finite groups: let denote the length of the word . If , then cannot traverse any edge of a core graph times, times, or times (when counted with signs). So in this case, if then also , and for all . So for every , one may replace the group in the proof above with the group for any . This means that the only infinite groups one actually needs for the proof are . See also Question 1 in Section 5.
4.2 Non-orientable surface words
Proof of Theorem 1.4, non-orientable
case.
Assume that some word induces the same measure as on every compact group. In particular, the expected value of any irreducible character of under the -measure in . In the case of the group , the trace is an irreducible -dimensional character with and thus . From Corollary 3.8 we deduce that , namely, .
In the case of the orthogonal groups , the trace is an irreducible -dimensional real character with and thus . As , Theorem 1.8 says in this case that . It follows that .
On the other hand, the trace of is also -dimensional irreducible with , and so . From Theorem 1.11 it follows that . If then where . Hence
[TABLE]
and we obtain that .
Thus . Moreover, all the weak inequalities in (4.2) are equalities, and . This shows that has minimal rank among the subgroups with , and so . In addition, is a free factor of : otherwise, it would have a non-trivial algebraic extension , and then and , in contradiction to Corollary 3.7 which applies in this case.
As , the words are free and constitute a basis for , and as , they are part of a basis for . Therefore and . ∎
Remark 4.2*.*
As in the orientable case, the small role of the infinite group in the last proof can be also played by the groups for large enough .
5 Open Questions
We conclude with some open questions naturally arising from the results in this paper.
Can Theorem 1.4 be proven also based on word measures on finite groups only? Namely, can the role played in the proof by and be also played by some finite groups? (And see Remarks 4.1 and 4.2.) 2. 2.
Corollary 3.7 has the potential of yielding a solution of more special cases of Conjecture 1.3. Namely, if there is a relatively small set of \mathrm{Aut}\big{(}\mathrm{\mathbf{F}}_{r}\big{)}-orbits, along surface words, with the property that for some , then one can hope to prove Conjecture 1.3 for these orbits. Let us mention two examples: the words and both satisfy that .
Acknowledgments
We thank Henry Wilton and Liviu Pãunescu for beneficial comments. D.P. was supported by ISF grant 1071/16.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AV 11] Alon Amit and Uzi Vishne. Characters and solutions to equations in finite groups. J. Algebra Appl. , 10(4):675–686, 2011.
- 2[Fro 96] Georg Frobenius. Über gruppencharaktere. Sitzungsberichte Akademie der Wissenschaften zu Berlin , pages 985–1021, 1896.
- 3[FS 06] Georg Frobenius and Issai Schur. Über die reellen darstellungen der endlichen gruppen. Sitzungsberichte Akademie der Wissenschaften zu Berlin , pages 186–208, 1906.
- 4[HMP 19] Liam Hanani, Chen Meiri, and Doron Puder. Some orbits of free words that are determined by measures on finite groups. In preparation, 2019.
- 5[Isa 76] I. Martin Isaacs. Character theory of finite groups , volume 69. Academic Press, 1976.
- 6[LP 10] Nati Linial and Doron Puder. Word maps and spectra of random graph lifts. Random Structures and Algorithms , 37(1):100–135, 2010.
- 7[MP 15] Michael Magee and Doron Puder. Word measures on unitary groups. ar Xiv preprint 1509.07374, 2015.
- 8[MP 19a] Michael Magee and Doron Puder. Matrix group integrals, surfaces, and mapping class groups I: U ( n ) 𝑈 𝑛 {U}(n) . Inventiones Mathematicae , 2019. to appear, available at ar Xiv:1802.04862 v 2.
