This paper extends previous results on the Whitehead algorithm's efficiency, showing that for a broad class of random elements in free groups, the algorithm performs in quadratic or linear time under certain conditions.
Contribution
It generalizes the understanding of Whitehead's algorithm's generic-case complexity to wider random processes and introduces the notion of $(M,
u, heta)$-minimal conjugacy classes.
Findings
01
Whitehead's algorithm is quadratic on generic random elements in free groups.
02
For elements close to a filling current, the algorithm runs in linear time.
03
A wide class of random processes produce elements with quadratic generic-case complexity.
Abstract
In \cite{KSS06} it was shown that with respect to the simple non-backtracking random walk on the free group FN=F(a1,…,aN) the Whitehead algorithm has strongly linear time generic-case complexity and that "generic" elements of FN are "strictly minimal" in their Out(FN)-orbits. Here we generalize these results, with appropriate modifications, to a much wider class of random processes generating elements of FN. We introduce the notion of a ''(M,λ,ϵ)-minimal" conjugacy class [w] in FN, where M≥1,λ>1 and 0<ϵ<1. Roughly, being (M,λ,ϵ)-minimal means that every ϕ∈Out(FN) either increases the length ∣∣w∣∣A by a factor of at least λ, or distorts the length ∣∣w∣∣A multiplicatively by a factor ϵ-close to 1, and that the number of automorphically minimal [u] in the orbit Out(FN)[w] is…
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGeometric and Algebraic Topology · Topological and Geometric Data Analysis · Mathematical Dynamics and Fractals
Full text
Generic-case complexity of Whitehead’s algorithm, revisited
Ilya Kapovich
Department of Mathematics and Statistics, Hunter College of CUNY
In [29] it was shown that with respect to the simple non-backtracking random walk on the free group FN=F(a1,…,aN) the Whitehead algorithm has strongly linear time generic-case complexity and that ”generic” elements of FN are ”strictly minimal” in their Out(FN)-orbits.
Here we generalize these results, with appropriate modifications, to a much wider class of random processes generating elements of FN. We introduce the notion of a (M,λ,ε)-minimal conjugacy class [w] in FN, where M≥1,λ>1 and 0<ε<1. Roughly, [w] being (M,λ,ε)-minimal means that every φ∈Out(FN) either increases the length ∣∣w∣∣A by a factor of at least λ, or distorts the length ∣∣w∣∣A multiplicatively by a factor ε-close to 1, and that the number of automorphically minimal [u] in the orbit Out(FN)[w] is bounded by M. We then show that if a conjugacy class [w] in FN is sufficiently close to a “filling” projective geodesic current [ν]∈P\mboxCurr(FN), then, after applying a single “reducing” automorphism ψ=ψ(ν)∈Out(FN) depending on ν only, the element ψ([w]) is (M,λ,ε)-minimal for some uniform constants M,λ,ε. Consequently, for such [w], Whitehead’s algorithm for the automorphic equivalence problem in FN works in quadratic time on the input ([w],[w′]) where [w′] is arbitrary, and in linear time if [w′] is also projectively close to [ν]. We then show that a wide class of random processes produce ”random” conjugacy classes [wn] that projectively converge to some filling current in P\mboxCurr(FN). For such [wn] Whitehead’s algorithm has at most quadratic generic-case complexity.
Let FN=F(A) be a free group of finite rank N≥2, with a fixed free basis A={a1,…,aN}. The automorphism problem for FN asks, given two freely reduced words w,w′∈FN=F(A), whether there there exists φ∈Aut(FN) such that w′=φ(w), that is, whether Aut(FN)w=Aut(FN)w′. A complete algorithmic solution to this problem was provided in 1936 classic paper of Whitehead [46], via the procedure that came to be called Whitehead’s algorithm. We briefly recall how this algorithm works, and refer the reader to Section 2 below.
For an element g∈FN, we denote by ∣g∣A and by ∣∣g∣∣A the freely reduced length and the cyclically reduced length of g with respect to A accordingly. We also denote by [g] the conjugacy class of g in FN. For w,w′∈FN we have Aut(FN)w=Aut(FN)w′ if and only if Out(FN)[w]=Out(FN)[w′]. For that reason we usually think of the automorphism problem in FN in this latter form, as the question about [w],[w′] being in the same Out(FN)-orbit. We denote CN={[g]∣g∈FN}.
The group Aut(FN) has a particularly nice finite generating set WN of so-called Whitehead automorphisms or Whitehead moves (we use the same terminology for the images of Whitehead automorphisms in Out(FN)); see Definition 2.1 below. Whitehead moves are divided into two types: Whitehead moves τ of the first kind have the form ai↦aσ(i)±1 for some permutation σ∈Sn. They have the property that for each w∈FN∣∣τ(w)∣∣A=∣∣w∣∣A. Whitehead moves of the second kind can change the cyclically reduced length of an element of FN.
An element [g]∈CN is called Out(FN)-minimal if for every φ∈Out(FN) we have ∣∣g∣∣A≤∣∣φ(g)∣∣A. For [g]∈CN we denote by M([g]) the set of all Out(FN)-minimal elements in the orbit Out(FN)[g].
An element [g]∈CN is called Whitehead-minimal if for every Whitehead move τ∈WN we have ∣∣g∣∣A≤∣∣τ(g)∣∣A. Whitehead’s “peak reduction lemma” implies the following two key facts: If [w] is not Out(FN)-minimal, then there exists τ∈WN such that ∣∣τ(w)∣∣A<∣∣w∣∣A. This fact already has the following important implication: an element [w]∈CN is Out(FN)-minimal if and only if [w] is Whitehead minimal.
The second fact says that for two Out(FN)-minimal [u],[u′]∈CN we have Out(FN)[u]=Out(FN)[u′] if and only if ∣∣w∣∣A=∣∣w′∣∣A and there exists a finite length-stable chain of Whitehead moves τ1…,τk∈WN (with k≥0) such that τk…τ1[u]=[u′] and that ∣∣τi…τ1[u]∣∣A=∣∣u∣∣A for all i≤k. Whitehead’s algorithm on the input ([w],[w′]) consists of two parts. The first one, the Whitehead minimization algorithm, starting from [w]∈CN consists of iteratively looking for a Whitehead move that decreases the cyclically reduced length of an element. Once we have arrived at [u] where no such moves are available, we know that [u] is a Whitehead-minimal and hence Out(FN)-minimal element of the orbit Out(FN)[w]. Since the set WN is finite and fixed, this process runs in at most quadratic time in terms of ∣∣w∣∣A. Also, do the same thing to [w′] to produce an Out(FN)-minimal element [u′] of the orbit Out(FN)[w]. If ∣∣u∣∣A=∣∣u′∣∣A, then Out(FN)[w]=Out(FN)[w′] and we are done. The second, hard, part of Whitehead’s algorithm, that we call Whitehead’s stabilization algorithm, deals with the case where ∣∣u∣∣A=∣∣u′∣∣A=n≥1. In this case one looks for a length-stable chain τ1…,τk∈WN of Whitehead’s move which satisfies τk…τ1[u]=[u′]. Since the ball of radius n in FN(A) has exponential size in n, this second process has a priori exponential in n time complexity. Although a few incremental improvements have been obtained over the years (e.g. see [10, 31, 35, 36, 40, 41, 44]), the questions about the computational complexity of the automorphism problem in FN and about the actual worst-case complexity of Whitehead’s algorithm remain wide open and the exponential time bound is the best one known in general. The only exception is the case of rank N=2 where it is known that Whitehead’s algorithm works in polynomial (in fact, quadratic) time [41, 31]. For the general case N≥2, the best known partial results are due to Donghi Lee [35, 36], who proved that Whitehead’s algorithm terminates on w∈FN in polynomial time (with degree of the polynomial depending on N), if some Out(FN)-minimal element [u]∈M([w]) satisfies a certain technical condition.
In [29] Kapovich, Schupp and Shpilrain initiated a probabilistic study of Whitehead’s algorithm, that is, its behavior on “random” or “generic” inputs in FN. In that paper “generic” meant for a large n≥1, either choosing a uniformly at random freely reduced word of length n in F(A), or taking a a uniformly at random cyclically reduced word of length n in F(A). It turned out that on such “generic” input both parts of Whitehead’s algorithm work very fast. As defined in [29], an element [w]∈CN is called strictly minimal if for every non-inner τ∈WN of the second kind we have ∣∣w∣∣A<∣∣τ(w)∣∣A. Thus, in particular, a strictly minimal element is Whitehead-minimal and therefore Out(FN)-minimal. Thus the Whitehead minimization algorithm on [w] terminates in a single step with [u]=[w], and takes linear time in ∣∣w∣∣A. Also, in this case (for [w] strictly minimal), if [u′] is another Out(FN)-minimal element with ∣∣u′∣∣A=∣∣u∣∣A=n and with Out(FN)[u]=Out(FN)[u′] any length-stable chain τ1,…,τk connecting [w]=[u] to [u′] consists only of inner automorphisms and Whitehead moves of the first kind. By composing them we see that Out(FN)[u]=Out(FN)[u′] if and only if there exists τ∈WN of the second kind such that τ[u]=[u′]. Thus in this case the Whitehead stabilization algorithm also terminates in linear time in n=∣∣w∣∣A. The overall complexity
of Whitehead’s algorithm on the input [w],[w′], where [w] is strictly minimal, is O(max{∣∣w∣∣A,∣∣w′∣∣A2}). A key probabilistic result of [29] says that a “generic” (in the above basic sense of taking a
uniformly random freely reduced or cyclically reduced word of length n) element [w]∈CN is strictly minimal. Therefore, if both [w],[w′] are “generic” in this sense, Whitehead’s algorithm on input [w],[w′] runs in O(max{∣∣w∣∣A,∣∣w′∣∣A}) time; and if [w] is generic and [w′] is arbitrary, it runs in O(max{∣∣w∣∣A,∣∣w′∣∣A2}) time. The results of [29] were generalized in [44] for the version of Whitehead’s algorithm for Out(FN)-orbits of conjugacy classes of finitely generated subgroups of FN.
The proof in [29] that “generic” [w] in FN is strictly minimal crucially relied on the fact that for such [w] the weights (normalized by ∣∣w∣∣A) on edges in the Whitehead graph of w are close to being uniform. Roughly, that means that frequencies of 1-letter and 2-letter subwords in [w] are close to being uniform (e.g. that for i=1,…,N the frequency of each ai±1 in [w] is close to N1). This close-to-uniform property of frequencies no longer holds if [w] generated by other random processes.
Example 1.1*.*
For example, consider the case N=2 and F2=F(a,b). Let wn be a positive word in {a,b} of length n, where every letter is chosen independently, with probability p(a)=1/10 and p(b)=9/10. Then the frequency of a in a ”random” wn will tend to 1/10 as n→∞. Moreover, it is not hard to see that wn will not be strictly minimal. Here is an informal argument. In this case wn will contain 100081n+o(n) occurrences of ab2, as well as 10001n+o(n) occurrences of a3 and 10009n+o(n) occurrences of aba. Consider the Whitehead move τ(a)=ab−1,τ(b)=b. Note that τ(ab2)=ab. The portion of wn covered by the 100081n+o(n) occurrences of ab2 has total length 1000243n+o(n) but its image under τ has total length 1000162n+o(n). Since τ(aba)=aab−1, the image of the portion of wn covered by occurrences of aba in wn does not change in length in τ(wn). The portion of wn covered by the 10001n+o(n) occurrences of a3 has total length 10003n+o(n), and its image in τ(wn) has total length 10006n+o(n) there. One can conclude from here that ∣∣wn∣∣A−∣∣τ(wn)∣∣A≥100078n+o(n). Thus ∣∣τ(wn)∣∣A<∣∣wn∣∣A and, moreover (since ∣∣wn∣∣A=n), ∣∣wn∣∣A∣∣τ(wn)∣∣A≤1000922+o(1). Hence wn badly fails to be strictly minimal.
In the present paper we consider the generic-case behavior of Whitehead’s algorithm on random inputs for much more general types of random processes than in the [29] setting (in particular, including Example 1.1). Our results have some similarities to the results from [29] but, of course, with important differences that are inherently necessary, as demonstrated by Example 1.1.
The main notions replacing strict minimality are the notions of an (M,λ,ε)-minimal element[u]∈CN and of (M,λ,ε,WN)-minimal element[u]∈CN (where M≥1, λ>1 and 0<ε<1); see Definition 3.1 and Definition 3.3 below. Roughly, [u] being (M,λ,ε)-minimal means that [u] belongs to a subset S⊆Out(FN)[u] of cardinality at most M such that for each element [u′] of S an arbitrary φ∈Out(FN) either increases the length ∣∣u′∣∣A by a factor of at least λ, or distorts the length multiplicatively by a factor ε-close to 1. For [u] being (M,λ,ε,WN)-minimal the definition is similar, but instead of arbitrary φ∈Out(FN) we only require these conditions to hold for arbitrary τ∈WN. From the definitions we see that being (M,λ,ε)-minimal directly implies being (M,λ,ε,WN)-minimal. The converse is almost true: given λ,ε, for all ”sufficiently stringent” λ′>λ and 0<ε′<ε being (M,λ′,ε′,WN)-minimal implies being (M,λ,ε)-minimal. See Proposition 3.5 below for a precise statement. For fixed M,λ,ε, deciding if an element [u]∈CN is (M,λ′,ε′,WN)-minimal can be done in linear time in ∣∣u∣∣A, while the algorithm for deciding if [u] is (M,λ,ε)-minimal has a priori exponential time complexity. See Section 3.3 below for details.
We summarize the main results of the present paper:
∙ We show in Theorem 3.11 that Whitehead’s algorithm on an input [w],[w′] works fast if at least one of the inputs is (M,λ,ε)-minimal: Whitehead minimization algorithm works in linear time on any (M,λ,ε)-minimal element [w]; if both [w],[w′] are (M,λ,ε)-minimal, the full Whitehead algorithm the input ([w],[w′]) works in linear time; if [w] is (M,λ,ε)-minimal and [w′] is arbitrary, the full Whitehead algorithm the input ([w],[w′]) works in time O(∣∣w∣∣A,∣∣w′∣∣A2). Also, for an (M,λ,ε)-minimal [w] the stabilizer StabOut(FN)([w]) has uniformly bounded rank rank (the smallest cardinality of a generating set).
∙ We exhibit a rich source of (M,λ,ε)-minimal elements. We prove that if ν∈\mboxCurr(FN) is a “filling” geodesic current then there exist M≥1,λ>1,ε>0, a neighborhood U of [ν] in P\mboxCurr(FN), and a finite set W⊆Out(FN) of ≤M “shortening” automorphisms with the following property: For every [w] which belongs in U (when [w] is viewed as a projective current), and every ψ∈W, the element [ψ(w)] is (M,λ,ε)-minimal, and, moreover M([w])⊆W([w]). We also produce several sources of “filling” currents.
∙ We define the notion of an FN-valued random process W=W1,W2,… being adapted to a current 0=ν∈\mboxCurr(FN). We prove that if W is adapted to a filling current ν then Whitehead’s algorithm has low complexity when one or both of [w],[w′] are “randomly” generated by W. These conclusions, obtained in Theorem 6.11 and Theorem 6.12, are the main genericity results of this paper.
∙ We show, in Theorem 7.6, that for a large class of “group random walks” W=W1,W2,… on FN, the walk is adapted to some filling current [ν] and hence Theorem 6.11 and Theorem 6.12 apply.
∙ We also show that for a large class of “graph non-backtracking random walks” W=W1,W2,… on FN, the walk is adapted to some filling current [ν] (see Theorem 9.11, Theorem 9.12 and Proposition 9.14) and hence again Theorem 6.11 and Theorem 6.12 apply.
In [29] the probabilistic results about Whitehead’s algorithm are stated in terms of generic-case complexity. This notion, introduced in [28], is designed to capture practically observable (as distinct from worst-case and average-case) behavior of various algorithms. In [28] generic-case complexity in the FN=F(A) context is defined via asymptotic density, that is, essentially, the uniform probability measure on large spheres or balls in F(A); the same definition is still used in [29]. Since then the notion of generic-case complexity has been significantly expanded and generalized, to allow for more general and more natural models of random generation of inputs; see [42] for some background and further details. All the probabilistic complexity results about Whitehead’s algorithm obtained in this paper are, in fact, generic-case complexity results. However, to be precise, we state all these results exactly, precisely and explicitly (including quantification of various constants) in terms of the random processes involved, rather than using the language of generic-case complexity. Note that the case of a simple non-backtracking random walk on F(A), which was the context of the results in [29], is a very special case of the random process W˘ considered Theorem 9.12.
We expect the results of Section 3 about (M,λ,ε)-minimal elements to be of independent interest, apart from any probabilistic applications.
Geodesic currents provide a measure-theoretic generalization of the notion of a conjugacy class. Geodesic currents, originally introduced by Bonahon [5] in the context of hyperbolic surfaces, proved particularly useful in recent years in the study of Out(FN) and of the Culler-Vogtmann Outer space, see e.g. [11, 17, 4, 20]. A key tool in the theory is the notion of a geometric intersection form between currents and points of the Thurston-like closure of the Outer space. The intersection form was developed by Kapovich and Lustig [26, 27]. The connection between currents and generic-case complexity was first pointed out in our article [24], but this connection is explored in detail for the first time in the present paper. In particular, the intersection form defines, for every ν∈\mboxCurr(FN), the “length” ∣∣ν∣∣A≥0 of ν with respect to A. For 1=w∈FN, we have ∣∣ηw∣∣A=∣∣w∣∣A, where ηw∈\mboxCurr(FN) is the “counting” current associated with [w].
Example 1.2* (Simple non-backtracking random walk on FN).*
Consider the simple nonbacktracking random walk W=W1,W2,…,Wn,… of FN with respect to A, as is done in [29]. This means that the Wn=X1…Xn is a freely reduced word of length n in F(A), where the first letter X1 is chosen uniformly at random from A±1 with probability 2N1 each; and if the i-th letter Xi=a∈A±1 is already chosen, the letter Xi+1 is chosen uniformly at random from A±1−{a}, with probability 2N−11 for each element there. Thus Wn induces the uniform probability distribution on the n-sphere in F(A), where every element of the sphere has probability 2N(2N−1)n−11. We will explain here the properties of the walk W in the terminology of this paper, omitting the detailed justification of these properties.
For a.e. trajectory w1,w2,… of the walk W we have limn→∞n1ηwn=νA in \mboxCurr(FN), where νA is the uniform current on FN corresponding to A (see Definition 4.7). Thus, in the language of the present paper, the walk W is adapted to νA. Moreover, νA has full support in ∂2FN and therefore νA is filling, by Proposition 5.2. Also, we have ∣∣νA∣∣A=1. Let ℑ⊆Out(FN) be the set of all Whitehead automorphisms of the first kind. Put W=ℑ⊆Out(FN). The results of [24, 22] imply that for any φ∈Out(FN), we have φ(νA)=νA if φ∈ℑ, and
[TABLE]
if φ∈ℑ. This fact implies that if we choose and fix any 1<λ<λ0, then any [w] that is sufficiently close to [νA] in P\mboxCurr(FN) is strictly minimal (in particular ∣∣φ(w)∣∣A=∣∣w∣∣A for every φ∈ℑ), and, moreover, for any φ∈ℑ we have ∣∣φ([w])∣∣A/∣∣w∣∣A≥λ. Then for any sufficiently small ε>0, for n→∞ our “random” wn is (M,λ,ε)-minimal with M=#ℑ, and the set Sn=ℑ([wn])=W([wn]) is (M,λ,ε)-minimazing (in the sense of Definition 3.1). This example is the simplest case illustrating how our definitions and results work.
Remark 1.3* (A note on the speed of convergence).*
In [28, 29] the main results are stated in terms of “strong genericity”, meaning that various probabilities converging to 1 do so exponentially fast as n→∞. Parts (b) of Theorem 6.11 and Theorem 6.12 are also stated in terms of probabilities of various events at step n converging to 1 as n→∞. We do not include the speed of convergence estimates there because for the moment our main new ”group random walks” application, Theorem 7.6, does not come with a speed of convergence estimate. The reason is that the proof of this theorem relies on the use of a recent result of Gekhtman [19, Theorem 1.5] about approximating harmonic measure by counting currents along a random walk on a word-hyperbolic group acting on a CAT(−1) space does not have any speed of convergence estimates. We expect that Gekhtman’s result actually holds in much greater generality (e.g. for an arbitrary geometric action of a nonelementary word-hyperbolic group G, and with much milder assumptions on the measure μ defining the walk), with exponentially fast convergence. Once that is proved, the applications of Theorem 6.11 and Theorem 6.12 to the group random walk context can be supplied with the speed of convergence estimates. (Definition 6.9 of a random process adapted to a current would have to be refined to include quantificantion by the speed of convergence.) On the other hand, in the context of our results about graph-based non-backtracking random walks, namely Theorem 9.11, Theorem 9.12, one can already show that the convergence is either exponentially or slightly subexponentially fast.
We are extremely grateful to Vadim Kaimanovich and Joseph Maher for many helpful discussions about random walks, for help with the references and for clarifying some random walks arguments. In particular the proof of Proposition 7.5 was explained to us by Kaimanovich. We are also grateful to the organizers of the March 2019 Dagstuhl conference ”Algorithmic Problems in Group Theory” for providing impetus and motivation for completing this paper.
2. Whitehead’s algorithm
Our main background reference for Whitehead’s algorithm Lyndon and Schupp, Chapter I.4 [37], and we refer the reader there for additional details.
Some other useful details and complexity results are available in [29, 44].
We recall the basic definitions and results here.
In this section we fix a free group FN=F(A) of rank N≥2, with a fixed free basis A={a1,…,aN}. Put ΣA=A⊔A−1.
We will also denote by CN the set of all FN-conjugacy classes [g] where g∈FN.
Definition 2.1** (Whitehead automorphisms).**
A Whitehead automorphism of FN with respect to A is an automorphism τ∈Aut(FN) of
FN of one of the following two types:
(1) There is a permutation t of ΣA such that
τ∣ΣA=t. In this case τ is called a relabeling
automorphism or a Whitehead automorphism of the first
kind.
(2) There is an element a∈ΣA, the multiplier, such
that for any x∈ΣA
[TABLE]
In this case we say that τ is a Whitehead automorphism of
the second kind. (Note that since τ is an automorphism of FN,
we always have τ(a)=a in this case).
We also refer to the images of Whitehead automorphisms in Out(FN) as Whitehead moves and sometimes again as Whitehead automorphisms. We denote by WN the set of all Whitehead moves τ∈Out(FN) such that τ=1 in Out(FN).
Note that for any a∈ΣA the inner automorphism ad(a)∈Aut(FN) is a
Whitehead automorphism of the second kind. Note also that if τ∈WN then τ−1∈WN.
To simplify the exposition, we formulate all the definitions and results related to Whitehead’s algorithm in terms of conjugacy classes of elements of FN.
In this context we usually think of an input [w]∈CN as given by a cyclically reduced word w∈F(A) and the complexity of various algorithms is estimated in terms of ∣∣w∣∣A. Since for w∈FN we have ∣∣w∣∣A≤∣w∣A, and since it takes linear time in ∣w∣A to find a cyclically reduced form of w∈F(A) (see [29] for additional discussion on this topic), the same complexity estimates hold in terms of ∣w∣A.
Definition 2.2** (Minimal and Whitehead-minimal elements).**
A conjugacy class [w]∈CN is Out(FN)-minimal with respect to A if for every φ∈Out(FN) we have ∣∣w∣∣A≤∣∣φ(w)∣∣A.
A conjugacy class [w]∈CN is Whitehead-minimal with respect to A if for every Whitehead move τ∈WN we have ∣∣w∣∣A≤∣∣τ(w)∣∣A.
For [w]∈CN, denote \mathcal{M}([w])=\{[u]\in{\rm Out}(F_{N})[w]|[u]\text{ is Out(F_{N})-minimal}\}.
Note that, by definition, an Out(FN)-minimal [w] is necessarily Whitehead-minimal.
Definition 2.3** (Automorphism graph).**
The automorphism graph of FN is an oriented labelled graph T defined as follows.
The vertex set VT is CN, the set of all conjugacy classes [w] where w∈FN.
The edges of T are defined as follows. Suppose that [w]=[w′]∈VT are such that ∣∣w∣∣A=∣∣w′∣∣A=n≥0. If there exists a Whitehead move τ∈WN such that τ([w])=[w′] (and hence τ−1[w′]=[w], with τ−1∈WN) there is a topological edge e connecting [w] and [w′]. There are two possible orientations on e resulting in mutually inverse oriented edges: the edge with the orientation from [w] to [w′] is labelled by τ, and the edge e with the orientation from [w′] to [w] is labelled by τ−1.
Also, for n≥0 denote by Tn the subgraph of T spanned by all vertices [w]∈VT with ∣∣w∣∣A=n. For a vertex [w] of Tn denote by Tn[w] the connected component of Tn containing [w].
We first state the following simplified version of Whitehead’s “peak reduction” lemma (see [29, Proposition 1.2]):
Proposition 2.4**.**
The following hold:
(1)
An element [w]∈CN is Out(FN)-minimal if and only if [w] is Whitehead-minimal. (Thus if [w] is not Out(FN)-minimal then there exists τ∈WN such that ∣∣τ(w)∣∣A<∣∣w∣∣A).
2. (2)
Suppose that [w]=[w′] are both Out(FN)-minimal. Then Out(FN)[w]=Out(FN)[w′] if and only if ∣∣w∣∣A=∣∣w′∣∣A=n≥0, and there exists a finite sequence τ1,…τk∈WN such that τk…τ1[w]=[w′] and that for i=1,…,k we have
[TABLE]
Proposition 2.4 implies that if [w]∈CN is Out(FN)-minimal with ∣∣w∣∣A=n then M([w])=Tn[w].
We also record the following more general version of ”peak reduction”:
Proposition 2.5**.**
[37, Proposition 4.17]**
Let [w],[w′]∈CN and φ∈OutN be such that [w′]=φ([w]) and that ∣∣w′∣∣A≤∣∣w∣∣A.
Then there exists a factorization φ=τk…τ1 in Out(FN), where τi∈W and where ∣∣τi…τ1w∣∣A≤∣∣w∣∣A for i=1,…,k.
Definition 2.6** (Whitehead algorithm).**
Let FN=F(A) be free of rank N≥2, with a fixed free basis A.
∙ The Whitehead minimization algorithm is the following process. Given [w]∈CN put [w1]=[w]. If [wi] is already constructed, check if there exists τ∈WN such that ∣∣τ(wi)∣∣A<∣∣wi∣∣A. If not, declare that [wi]∈M([w]) (that is [wi] is an Out(FN)-minimal element in Out(FN)[w] and terminate the algorithm. Put [wi+1]=[τ(wi)].
∙ The Whitehead stabillization algorithm is the following process. Suppose that [w]∈CN is Whitehead-minimal (and therefore Out(FN)-minimal) with ∣∣w∣∣A=n≥0. Construct the component Tn([w]) of Tn using the “breadth-first” stabilization process. Start with S1={[w]}. Now if a finite collection Si of conjugacy classes with ∣∣.∣∣A=n is already constructed, for each element [u]∈Si and each τ∈WN, put
[TABLE]
Terminate the process with the output Si for the smallest i≥1 such that Si+1=Si. Declare that Si=VTn([w])=M([w]).
∙ The Whitehead algorithm is the following process. Given [w],[w′]∈CN, first apply the Whitehead minimization process to each of [w],[w′] to output elements [u],[u′] accordingly. Declare that [u]∈M([w]) and [u′]∈M([w′]). If ∣∣u∣∣A=∣∣u′∣∣A, declare that Out(FN)[w]=Out(FN)[w′] and terminate the process. Suppose that ∣∣u∣∣A=∣∣u′∣∣A=n≥1. Apply the Whitehead stabilization algorithm to [u] to produce the set S. Declare that S=Tn([u])=M([w]). Then check wither [u′]∈S. If [u′]∈S, declare that Out(FN)[w]=Out(FN)[w′], and if [u′]∈S, declare that Out(FN)[w]=Out(FN)[w′], and terminate the process.
Remark 2.7*.*
Part (1) of Proposition 2.4 implies that the Whitehead minimization algorithm on an input [w]∈CN always terminates in O(∣∣w∣∣A2) time (where we assume that [w] is given to us as a cyclically reduced word in F(A)) and indeed outputs an element of M([w]). The quadratic time bound arises since going from [wi] to [wi+1] takes a priori linear time in ∣∣wi∣∣A, and since ∣∣w1∣∣A>∣∣w2∣∣A>…, the process terminates with some [wi] with i≤∣∣w∣∣A.
Part (2) of Proposition 2.4 implies that the Whitehead minimization algorithm on an Out(FN)-minimal input [w] in FN=F(A) with ∣∣w∣∣A=n, always terminates in O(#VTn([w])) time, and indeed outputs the set M([w])=Tn([w]). Taken together, Proposition 2.4 implies that the Whitehead algorithm on the input [w],[w′]∈CN does correctly decide whether or not Out(FN)[w]=Out(FN)[w′].
Overall, the a priori worst-case complexity of Whitehead’s algorithm on the input [w],[w′] is exponential in max{∣∣w∣∣A,∣∣w′∣∣A} because for [u]∈VTn the cardinality #VTn([u]) is at most exponential in n.
Definition 2.8**.**
Suppose that W⊆Out(FN) is a fixed finite set of “auxiliary” automorphisms.
•
The W-speed-up of the Whitehead minimization algorithm consists in taking the input [w]=1, computing W[w]={ψ[w]∣ψ∈W} first and then applying the Whitehead minimization algorithm, in parallel to [w] and each of the elements of W([w]). The result is again an element of M([w]).
•
The W-speed-up of the Whitehead’s algorithm consists in doing the following. Given [w],[w′]=1, first apply the W-speed-up of the Whitehead minimization algorithm to both [w] and [w′] to find [u]∈M([w]) and [u′]∈M([w′]). Then proceed exactly as in Whitehead’s algorithm to decide whether or not Out(FN)[w]=Out(FN)[w′].
Since W is finite an fixed, the a priori complexity estimates for these speed-up versions are the same as in Remark 2.7, although with worse multiplicative constants.
3. (M,λ,ε)-minimality and Whitehead’s algorithm
Let FN=F(A) be free of rank N≥2 where A={a1,…,aN} is a fixed free basis of FN.
3.1. Main definitions
Definition 3.1**.**
Let M≥1 be an integer, let λ>1 and let 0≤ε<λ−1.
A finite set S of conjugacy classes of nontrivial elements of FN is called (M,λ,ε)-minimizing if it satisfies the following properties:
(1)
We have #(S)≤M.
2. (2)
For any [u],[u′]∈S we have Out(FN)[u]=Out(FN)[u′].
3. (3)
For any [u],[u′]∈S we have 1−ε≤∣∣u∣∣A∣∣u′∣∣A≤1+ε.
4. (4)
For every [u]∈S and every φ∈Out(FN) such that φ([u])∈S we have ∣∣u∣∣A∣∣φ(u)∣∣A≥λ>1+ε.
In this case for any [u]∈S we also say that S is a (M,λ,ε)-minimizing set for [u].
We say that a nontrivial conjugacy class [u] in FN is (M,λ,ε)-minimal if there exists an (M,λ,ε)-minimizing set S for [u] (and thus [u]∈S).
Note that if S is a (M,λ,ε)-minimizing set and if [u]∈S then for φ∈Out(FN) either φ(u)∈S or ∣∣u∣∣A∣∣φ(u)∣∣A≥λ, and these outcomes are mutually exclusive.
We record the following useful immediate corollary of the above definition:
Lemma 3.2**.**
Let M≥1 be an integer, let λ>1, let 0≤ε<λ−1 and let S be an (M,λ,ε)-minimizing set of conjugacy classes in FN.
Then for any [u]∈S and φ∈Out(FN) such that ∣∣φ(u)∣∣A≤(1+ε)∣∣u∣∣A we have φ([u])∈S.
∎
Definition 3.3**.**
Let M≥1 be an integer, let λ>1 and let 0≤ε<λ−1.
A finite set S⊆CN of conjugacy classes of nontrivial elements of FN is called (M,λ,ε,WN)-minimizing if it satisfies the following properties:
(1)
We have #(S)≤M.
2. (2)
For any [u],[u′]∈S we have Out(FN)[u]=Out(FN)[u′].
3. (3)
For any [u],[u′]∈S we have 1−ε≤∣∣u∣∣A∣∣u′∣∣A≤1+ε.
4. (4)
For any [u]∈S and τ∈WN exactly one of the following occurs:
(i)
We have τ([u])∈S.
(ii)
We have τ([u])∈S and ∣∣u∣∣A∣∣τ(u)∣∣A≥λ>1+ε.
In this case for any [u]∈S we also say that S is a (M,λ,ε,W)-minimizing set for [u].
We say that a nontrivial conjugacy class [u] in FN is (M,λ,ε,WN)-minimal if there exists an (M,λ,ε,WN)-minimizing set S for [u] (and thus [u]∈S).
Lemma 3.4**.**
Let M≥1 be an integer, let λ>1, 0<ε<1 be such that ε<λ−1 and λ(1−ε)>1. Let S⊆CN be a finite set of conjugacy classes of nontrivial elements of FN such that S is (M,λ,ε,WN)-minimizing.
(1)
For any [u]∈S and τ∈WN such that ∣∣τ(u)∣∣A≤(1+ε)∣∣u∣∣A we have τ([u])∈S.
2. (2)
For any [u]∈S and φ∈Out(FN) such that ∣∣φ(u)∣∣A≤∣∣u∣∣A we have φ([u])∈S.
Proof.
Part (1) follows from conditions (3), (4) of Definition 3.3.
For (2), suppose that [u]∈S and φ∈Out(FN) are such that ∣∣φ(u)∣∣A≤∣∣u∣∣A. By Proposition 2.5, there exist τ1,…,τk∈WN such that φ=τk…τ1 and that for [u0]=[u], [ui]=τi…τ1([u]) for i=1,…,k we have ∣∣ui∣∣A≤∣∣u∣∣A. Note that [uk]=φ([u]).
We argue by induction on i that [ui]∈S for i=1,…,k.
We have [u]=[u0]∈S. Suppose now 0≤i<k and [ui]∈S. We need to show that [ui+1]=τi+1[ui]∈S. Suppose, on the contrary, that [ui+1]∈S. Then ∣∣ui+1∣∣A/∣∣ui∣∣A≥λ. Since [u],[ui]∈S, also have ∣∣ui∣∣A/∣∣u∣∣A≥1−ε. Therefore ∣∣ui+1∣∣A/∣∣u∣∣A≥λ(1−ε), so that ∣∣ui+1∣∣A≥λ(1−ε)∣∣u∣∣A>∣∣u∣∣A since λ(1−ε)>1. This contradicts the choice of τ1,…,τk. Thus [ui+1]∈S, as required.
Hence [uk]=φ([u])∈S, and part (2) of the lemma is verified.
∎
The definitions directly imply that an (M,λ,ε,WN)-minimizing set S is (M,λ,ε)-minimizing. It turns out that the converse also holds, but with slightly smaller ε and slightly bigger λ.
Proposition 3.5**.**
Let M≥1 be an integer, let λ>1 and let 0≤ε<λ−1. Let 0<ε′<ε and λ′>λ>1 be such that be such that λ′(1−ε′)>λ.
Let S⊆CN be a finite set of conjugacy classes of nontrivial elements of FN be such that S is (M,λ′,ε′,WN)-minimizing.
Then S is (M,λ,ε)-minimizing.
Proof.
We need to verify that conditions (1)-(4) of Definition 3.1 of an (M,λ,ε)-minimizing set hold for S.
Since S is (M,λ′,ε′,WN)-minimizing, it follows that #(S)≤M, any two elements of S are in the same Out(FN)-orbit, and for any [u],[u′]∈S we have ∣∣u∣∣A∣∣u′∣∣A∈[1−ε′,1+ε′]⊆[1−ε,1+ε]. Thus we only need to verify condition (4) of Definition 3.1 for S.
Let [u]∈S and let ψ∈Out(FN) be such that φ([u])∈S. Part (2) of Lemma 3.4 implies that ∣∣u∣∣A<∣∣φ(u)∣∣A.
Therefore by
Proposition 2.5, there exist τ1,…,τk∈WN such that φ=τk…τ1 and that for [v0]=[u], [vi]=τi…τ1([u]) for i=1,…,k we have ∣∣vi∣∣A≤∣∣φ(u)∣∣A. Note that [vk]=φ([u]). Since [vk]∈S, the set {i≥0∣[vi]∈S} is nonempty. Put j=min{i≥0∣[vi]∈S}. Since [v0]=[u]∈S, we have j≥1, and [vi]∈S for all 0≤i<j.
Since [vj−1],[u]∈S, we have ∣∣vj−1∣∣A≥(1−ε′)∣∣u∣∣A. Since [vj−1]∈S and [vj]=τj([vj−1])∈S, it follows that ∣∣vj∣∣A≥λ∣∣vj−1∣∣A and therefore ∣∣vj∣∣A≥λ(1−ε′)∣∣u∣∣A. We also have ∣∣φ(u)∣∣A≥∣∣vj∣∣A and hence
[TABLE]
Thus the set S is (M,λ,ε)-minimizing, as required.
∎
Definition 3.6**.**
Let M≥1 be an integer, let λ>1 and let 0≤ε<λ−1.
Let 1=w∈FN. We say that [w] is (M,λ,ε)-minimizable in FN=F(A) if there exists a subset W⊆Out(FN) such that #(W)≤M and that the set S=W[w] is (M,λ,ε)-minimizing (or, equivalently, if the orbit Out(FN)[w] contains a (M,λ,ε)-minimal element). In this case we say that W is (M,λ,ε)-reducing for [w].
Note that for 1=w∈FN the conjugacy class [w] is (M,λ,ε)-minimizable if and only if the orbit Out(FN)[w] contains a (M,λ,ε)-minimal element.
3.2. Behavior of Whitehead’s algorithm
We now have:
Proposition 3.7**.**
Let λ>1, let 0≤ε<λ−1, let [u]∈CN be a (M,λ,ε)-minimal element and let S⊆CN be an (M,λ,ε)-minimizing set for [u] (so that [u]∈S). Then the following hold:
(1)
We have M([u])⊆S, and, in particular, #M([u])≤M.
2. (2)
For every [u′]∈M([u]) we have 1≤∣∣u′∣∣A∣∣u∣∣A≤1+ε.
3. (3)
For every [u′]∈M([u]) we have M([u])=VTn([u′]) where n=∣∣u′∣∣A.
4. (4)
If τ∈WN is a Whitehead automorphism such that ∣∣τ(u)∣∣A<∣∣u∣∣A then τ([u])∈S.
5. (5)
If τ1,τ2,…τk∈WN are such that
[TABLE]
then k≤M−1 and we have [ui]:=τi…τ1([u])∈S for i=1,…,k.
6. (6)
If such a sequence τ1,τ2,…τk∈WN as in (4) is such that is [uk] is WN-minimal then [uk]∈M([u]).
Proof.
Parts (1) and (2) follow directly from Definition 3.1. Part (3) holds by the general peak reduction properties of Whitehead’s algorithm. Part (4) follows from property (4) in Definition 3.1.
Suppose now that τ1,τ2,…τk∈WN are as in part (5) of the proposition. Since the cyclically reduced lengths ∣∣u∣∣A>∣∣u1∣∣A>⋯>∣∣uk∣∣A are strictly decreasing, the conjugacy classes [u],[u1],…[uk] are distinct. Since by assumption [u]∈S, part (4) of the proposition implies that [u],[u1],…,[uk]∈S. Since #S≤M, it follows that k≤M−1.
Part (6) follows from part (5) since every WN-minimal conjugacy class is Out(FN)-minimal.
∎
Let λ>1, let 0≤ε<λ−1 and let [w]∈CN be (M,λ,ε)-minimizable, with a (M,λ,ε)-reducing for [w] set W⊆Out(FN). Let S=W[w] (so that S is (M,λ,ε)-minimizing). Then:
(1)
We have M([w])⊆S, and, in particular, #M([w])≤M.
2. (2)
For every [u′]∈M([w]) we have M([w])=VTn([u′]) where n=∣∣u′∣∣A.
∎
Definition 3.9**.**
Let M≥1, λ>1, and 0≤ε<λ−1.
(1)
We denote by UN(M,λ,ε) the set of all 1=u∈FN such that [u] is (M,λ,ε)-minimal.
2. (2)
We denote by YN(M,λ,ε) the set of all 1=w∈FN such that there exists [u]∈Out(FN)[w] such that [u] is (M,λ,ε)-minimal. [That is, YN(M,λ,ε) is the set of all 1=w∈FN such that [w] is (M,λ,ε)-minimizable.]
3. (3)
Let ψ∈Out(FN). Denote by UN(M,λ,ε;ψ) the set of all 1=w∈FN such that ψ([w]) is (M,λ,ε)-minimal.
Lemma 3.10**.**
Let u∈YN(M,λ,ε) and let [u′]∈M([u]) (that is, [u′] is an Out(FN)-minimal element in the orbit Out(FN)[u]). Then u′∈UN(M,λ,ε) (that is, [u′] is (M,λ,ε)-minimal).
Proof.
Since u∈YN(M,λ,ε), there exists φ∈Out(FN) such that φ([u]) is (M,λ,ε)-minimal, so that φ([u]) belongs to some (M,λ,ε)-minimizing set S.
Part (1) of Proposition 3.7 implies that M(φ([u]))⊆S, so that every element of M(φ([u])) is (M,λ,ε)-minimal. Since M(φ([u]))=M([u]), the statement of the lemma follows.
∎
We now summarize algorithmic properties of (M,λ,ε)-minimal in relation to Whitehead’s algorithm.
Theorem 3.11**.**
Let M≥1, λ>1, and 0≤ε<λ−1. Then there exists a constant K≥1 such that the following hold:
(a)
For any u∈UN(M,λ,ε) the Whitehead minimization algorithm on the input u terminates time ≤K∣u∣A and produces an element of M([u]).
2. (b)
For any u1,u2∈UN(M,λ,ε), the Whitehead algorithm for the automorphic equivalence problem in FN terminates in time at most Kmax{∣u1∣A,∣u2∣A}, on the input (u1,u2).
3. (c)
For any u1∈YN(M,λ,ε) and any 1=u2∈FN, the Whitehead algorithm for the automorphic equivalence problem in FN terminates in time at most Kmax{∣u1∣A2,∣u2∣A2}, on the input (u1,u2).
4. (d)
For any u1∈UN(M,λ,ε) and any 1=u2∈FN, the Whitehead algorithm for the automorphic equivalence problem in FN terminates in time Kmax{∣u1∣A,∣u2∣A2}, on the input (u1,u2).
5. (e)
Let ψ∈Out(FN) be a fixed element. Then there is K′=K′≥1 such that for any u1,u2∈UN(M,λ,ε;ψ), the ψ-speed-up of Whitehead’s algorithm decides in time at most K′max{∣u1∣A,∣u2∣A}, whether or not Out(FN)[u1]=Out(FN)[u2].
6. (f)
Let ψ∈Out(FN) be a fixed element. Then there is K′=K′≥1 such that for any u1∈UN(M,λ,ε;ψ) and any 1=u2∈FN, the ψ-speed-up of Whitehead’s algorithm decides
in time at most K′max{∣u1∣A,∣u2∣A2}, whether or not Out(FN)[u1]=Out(FN)[u2].
Proof.
(a) Let u∈UN(M,λ,ε) be arbitrary. Let S be an (M,λ,ε)-minimizing set containing [u]. Thus #S≤M. By Lemma 3.2, if [v]∈S and τ∈WN is a Whitehead move such that ∣∣τ(v)∣∣A<∣∣v∣∣A then τ([v])∈S. Therefore starting with u and iteratively looking for Whitehead moves that decrease the ∣∣.∣∣A-length terminates after a chain of ≤M such moves with a conjugacy class that is Whitehead-minimal and therefore is Out(FN)-minimal, that is, an element of M([u]). This process takes at most time C1∣u∣A for some constant C1>0 depending only on N,M,λ,ε.
(b) Let u1,u2∈UN(M,λ,ε) so that [u1],[u2] are (M,λ,ε)-minimal. By part (a) above, applying the Whitehead minimization algorithm to [ui] terminates in at most M steps with an Out(FN)-minimal element [ui′] such that ni=∣∣ui′∣∣A≤∣∣ui∣∣A≤∣ui∣A. Each of these ≤M takes at most linear times in ∣ui∣A since the number of whitehead automorphisms in WN is finite and fixed. Thus it takes linear time in ∣ui∣A to produce [ui′]∈M([ui]). If n1=n2 then Out(FN)[u1]=Out(FN)[u2] and we are done. Suppose that n=n1=n2. By part (3) of Proposition 3.7 we have M([ui])=VTn([ui′]) for i=1,2. Moreover, by part (1) of Proposition 3.7 we have #VTn([ui′])≤M here. Since M is fixed, it takes linear time in ∣ui∣A to construct the graph Tn([ui′])≤M from ui′. Then Out(FN)[u1]=Out(FN)[u2] if and only if #VTn([u1′])∩#VTn([u2′])=∅, and this condition can be checked in linear time in max{∣u1∣A,∣u2∣A}. Summing up we get that the total running time of the Whitehead algorithm for the automorphic equivalence problem in FN is time at most C2max{∣u1∣A,∣u2∣A}, for some constant C2>0 depending only on N,M,λ,ε.
(c) Now suppose that u1∈YN(M,λ,ε) and 1=u2∈FN. We first apply the Whitehead minimization algorithm to each of u1,u2 to find Out(FN)-minimal elements [ui′]∈Out(FN)[ui] for i=1,2. Producing ui′ from ui takes quadratic time in terms of ∣ui∣A. Note that by Lemma 3.10 the element [u1′] is (M,λ,ε)-minimal, that is u1′∈UN(M,λ,ε).
Again put ni=∣∣ui∣A. If n1=n2 then Out(FN)[u1]=Out(FN)[u2] and we are done. Suppose that n=n1=n2. Since u1′ is Out(FN)-minimal and (M,λ,ε)-minimal, by parts (3) and (1) of Proposition 3.7 we have M([u1])=M([u1′])=VTn([u1′]) and #VTn([u1′])≤M. Then, since M is fixed, it takes at most linear time in n=∣∣u1′∣∣A≤∣u1∣A to construct the graph Tn([u1′]). Recall also that [u2′]∈M([u2]) and ∣∣u2∣∣A=n. Then we have Out(FN)[u1]=Out(FN)[u2] if and only if [u2′]∈Tn([u1′]). This last condition can be checked in linear time in n. Again, summing up we see that the total running time of the Whitehead algorithm on (u1,u2) is at most C3max{∣u1∣A2,∣u2∣A2}, for some constant C3>0 depending only on N,M,λ,ε.
(d) Now let u1∈UN(M,λ,ε) and 1=u2∈FN. We first apply the Whitehead minimization algorithm to each of u1,u2 to find Out(FN)-minimal elements [ui′]∈Out(FN)[ui] for i=1,2. As in (b), producing u1′ from u1 takes linear time in ∣u1∣A, because u1 is (M,λ,ε)-minimal. Producing u2′ from u2 takes at most quadratic time in ∣u2∣A, by the general Whitehead’s minimization algorithm properties. After that we proceed exactly in (2) above to decide if [u1′] and [u2′] are Out(FN)-equivalent. Summing up we see that the total running time of the Whitehead algorithm on (u1,u2) is at most C4max{∣u1∣A,∣u2∣A2} in this case, for some constant C4>0 depending only on N,M,λ,ε.
(e) Choose an automorphism Ψ∈Aut(FN) in the outer automorphism class ψ and put C=maxi=1N∣Ψ(ai)∣A. Since φ and Ψ are fixed, given u1,u2∈UN(M,λ,ε;ψ), for i=1,2 it takes linear time in ∣ui∣A to compute the element Ψ(ui), and ∣Ψ(ui)∣A≤C∣ui∣A. Moreover, the assumption that u1,u2∈UN(M,λ,ε;ψ) implies that Ψ(u1),Ψ(u2) are (M,λ,ε)-minimal. Then by part (b) above, the Whitehead algorithm on the input (Ψ(u1),Ψ(u2)) terminates in linear time in Cmax{∣u1∣A,∣u2∣A} and decides whether or not Aut(FN)Ψ(u1)=Aut(FN)Ψ(u2), that is, whether or not Out(FN)[u1]=Out(FN)[u2]. The total running time required for this process is at most C5max{∣u1∣A,∣u2∣A}, for some constant C5>0 depending only on N,M,λ,ε and Ψ.
(f) We chose Ψ∈Aut(FN) and C>0 as in the proof of part (e) above. Given any u1∈UN(M,λ,ε;ψ) and any 1=u2∈FN we first compute, in linear time in ∣u1∣A, the element Ψ(u1) and again observe that Ψ(u1) is (M,λ,ε)-minimal and that ∣Ψ(u1)∣A≤C∣u1∣A. We then apply to the pair (Ψ(u1),u2) the algorithm from part (d) of this proposition to decide whether or not Out(FN)[u1]=Out(FN)[u2]. The overall running time is of this process is at most C6max{∣u1∣A,∣u2∣A2}, for some constant C6>0 depending only on N,M,λ,ε and Ψ.
∎
We recall another basic fact related to Whitehead’s algorithm which describes Out(FN)-stabilizers of conjugacy classes in FN:
Proposition 3.12**.**
Let 1=u∈FN be such that [u] is Out(FN)-minimal, and let n=∣∣u∣∣A. Then for φ∈Out(FN) we have φ([u])=[u] if and only if there exists a sequence of Whitehead automorphisms τ1,…,τk∈WN such that for ui=τi…τ1([u]), we have ∣∣ui∣∣A=n for i=1,…,k and [uk]=[u] and such that φ=τk…τ1 in Out(FN).
Recall that for n≥1 the oriented edges of the graph Tn are labelled by Whitehead moves τ∈WN. Thus oriented edge-paths in Tn are labelled by products of Whitehead moves. Recall also that for a vertex [u] of Tn, the graph Tn[u] is the connected component of Tn containing [u]. Therefore we get a natural labelling homomorphism ρ[u]:π1(Tn[u],[u])→Out(FN) where a closed loop at [u] in Tn is mapped to the element of Out(FN) given by the label of this loop in Tn.
Note also that, since the set of possible edge labels WN is finite, the rank of the free group π1(Tn,[u]) is bounded above by some constant K=K(N,#VTn[u]).
Let 1=u∈FN be such that [u] is Out(FN)-minimal, and let n=∣∣u∣∣A. Then:
(1)
We have StabOut(FN)([u])=ρ[u](π1(Tn[u],[u])).
2. (2)
We have rankStabOut(FN)([u])≤K(N,#VTn[u]).
Proposition 3.14**.**
Let M≥1, λ>1, and 0≤ε<λ−1. Let u∈YN(M,λ,ε) (that is 1=u∈FN and the orbit Out(FN)[u] contains an (M,λ,ε)-minimal element).
Then
[TABLE]
Proof.
Let [u′]∈Out(FN)[u] be an (M,λ,ε)-minimal element. Let [u′′] be an Out(FN)-minimal element in Out(FN)[u]=Out(FN)[u′] and put n=∣∣u′′∣∣A. Part (1) of Proposition 3.7 implies that [u′′] is also (M,λ,ε)-minimal and that VTn[u′′]=M[u]=M[u′]=M[u′′] has cardinality ≤M. Then by Corollary 3.13 we have rankStabOut(FN)([u])≤K(N,M), as claimed.
∎
3.3. Algorithmic detectability
For a finite nonempty subset S⊆CN denote ∣∣S∣∣A=max{∣∣u∣∣A∣[u]∈S}.
Remark 3.15*.*
Let an integer M≥1 and rational numbers 0<ε<1 and λ>1+ε be fixed.
(1) Since the set WN⊆Out(FN) of Whitehead moves is finite and fixed, given a subset S⊆CN−{1} of cardinality ≤M we can check in linear time in ∣∣S∣∣A whether or not S is (M,λ,ε,WN)-minimizing.
(2) In this setting, given such S it is also possible to algorithmically check whether or not (M,λ,ε)-minimizing, but only with the exponential time, in terms of ∣∣S∣∣A, complexity estimate. Indeed, we first check (in linear time), if conditions (1) and (3) of Definition 3.1 of an (M,λ,ε)-minimizing set hold for S. We then use Whitehead’s algorithm to check if condition (2) of Definition 3.1 also holds. Suppose they do (otherwise S is not (M,λ,ε)-minimizing). Then for every [u]∈S compute, using Whitehead’s algorithm, the (finite) set F[u]={[u′]∈Out(FN)[u]∣∣∣u′∣∣A≤∣∣u∣∣A}. For each [u]∈S check whether F[u]⊆S. If not then S is not (M,λ,ε)-minimizing. So suppose that for all [u]∈SF[u]⊆S.
Since balls in the Cayley graphs of FN=F(A) are finite and since ∣∣w∣∣A∈Z≥0 for all w∈FN, we can then use Whitehead’s algorithm to compute, for each [u]∈S the number \rho([u]):=\min\{\frac{||u^{\prime}||_{A}}{||u||_{A}}\big{|}[u^{\prime}]\in{\rm Out}(F_{N})[u]\text{ and }||u^{\prime}||_{A}>||u||_{A}\}. Then compute ρ=min[u]∈Sρ([u]). Then S is (M,λ,ε)-minimizing if and only if ρ≥λ.
The complexity of this procedure for deciding if a subset S⊆CN with #S≤M is (M,λ,ε)-minimizing is exponential time in ∣∣S∣∣A (when M,λ,ε are fixed).
(3) We can then also decide, given [u]∈CN, whether or not [u] is (M,λ,ε)-minimal, that is, whether or not [u] belongs to some (M,λ,ε)-minimizing subset. Namely, list all subsets S⊆CN of cardinality ≤M containing [u] and with ∣∣S∣∣A≤(1+ε)∣∣u∣∣A and for each of them run the algorithm from (2) to decide if S is (M,λ,ε)-minimizing. Again, since M,λ,ε are fixed, this check can be done in exponential time in ∣∣S∣∣A.
It turns out that deciding whether an element [u] is (M,λ,ε,WN)-minimal can be done in linear time in ∣∣u∣∣A (under slightly more stringent assumptions in λ,ε).
Lemma 3.16**.**
Let M≥1 be an integer, let λ>1, 0<ε<1 be such that ε<λ−1 and λ1+ε1−ε>1. Let S⊆CN be a finite set of conjugacy classes of nontrivial elements of FN such that S is (M,λ,ε,WN)-minimizing. Let [u]∈S.
Then for [u′]∈CN,[u′]=[u] the following conditions are equivalent:
(1)
We have [u′]∈S.
2. (2)
There exists a chain τ1,…,τk∈WN such that k≤M, that τk…τ1[u]=[u′] and that with [u0]=[u][ui]=τi…τ1[u] we have ∣∣ui+1∣∣A≤(1+ε)∣∣ui∣∣A for all i≤k.
We now need to show that (1) implies (2).
Suppose that [u′]∈S. Then ∣∣u′∣∣A≤(1+ε)∣∣u∣∣A.
Since Out(FN)[u]=Out(FN)[u′], there is φ∈Out(FN) such that φ[u]=[u′]. Proposition 2.5 implies that there exist τ1,…,τk∈WN such that for [u0]=[u], [ui]=τi…τ1([u]) for i=1,…,k we have
[TABLE]
and that [uk]=[u′]. We can assume that we have eliminated repetitions among [ui], so that [u]=[u0],[u1],…,[uk]=[u′] are distinct.
Case 1.
Suppose first that for every i≥1 we have ∣∣ui∣∣A≤(1+ε)∣∣ui−1∣∣A. Since [u0]=[u]∈S, Lemma 3.4(1) then implies that [ui]∈S for all i=1,…,k. Since #S≤M and all [ui] are distinct, it follows that k≤M. Thus the conclusion of part (2) of the lemma holds in this case.
Case 2. Suppose there is some i≥1 we have ∣∣ui∣∣A>(1+ε)∣∣ui−1∣∣A. Let i0 be the smallest among such i. Then for all j<i0 we have ∣∣uj∣∣A≤(1+ε)∣∣uj−1∣∣A, and we also have ∣∣ui0∣∣A>(1+ε)∣∣ui0−1∣∣A. Again by Lemma 3.4(1) we conclude that [uj]∈S for all j<i0. In particular [ui0−1]∈S. Since for [ui0]=τi0[ui0−1] we have ∣∣ui0∣∣A>(1+ε)∣∣ui0−1∣∣A, condition (3) of Definition 3.3 implies that [ui0]∈S. Therefore by part (4)(ii) of Definition 3.3 we have ∣∣ui0∣∣A≥λ∣∣ui0−1∣∣A. Since [u],[ui0−1]∈S, we have ∣∣ui0∣∣A≥(1−ε)∣∣u∣∣A. Therefore ∣∣ui0∣∣A≥λ(1−ε)∣∣u∣∣A>(1+ε)∣∣u∣∣A, yielding a contradiction. Thus Case 2 is impossible.
Therefore the conclusion of part (2) of the lemma holds, as required.
∎
Corollary 3.17**.**
Let M≥1 be an integer, let λ>1, 0<ε<1 be rational numbers such that ε<λ−1 and λ1+ε1−ε>1.
Then there is an algorithm that, given 1=[u]∈CN decides in linear time in ∣∣u∣∣A whether or not [u] is (M,λ,ε,WN)-minimal (that is, whether [u] belongs to some (M,λ,ε,WN)-minimizing set S).
Proof.
Suppose we are given an input 1=[u]∈CN. We need to decide if there exists an (M,λ,ε,WN)-minimizing set S containing [u].
We first enumerate all chains of k≤M Whitehead moves as in part (2) of Lemma 3.16 and collect all [u′] reachable from [u] by applying such chains. Denote the resulting subset of CN by S′. Computing S′ from [u] takes at most linear time in ∣∣u∣∣A since M is fixed and the set WN is also finite and fixed.
Lemma 3.16 implies that if [u] belongs to some (M,λ,ε,WN)-minimizing set S then S=S′.
We then check if conditions (1)-(4) of Definition 3.3 hold for S′. Again this can be done in linear time in ∣∣u∣∣A since M is fixed.
We conclude that [u] is (M,λ,ε,WN)-minimal if and only if conditions (1)-(4) of Definition 3.3 do hold for S′.
∎
4. Geodesic currents on free groups
We provide some basic background on geodesic currents on FN here and refer the reader to [23, 26, 27] for further details. For the remainder of this section let FN be a free group of finite rank N≥2. We denote by ∂FN the hyperbolic boundary of FN and denote ∂2FN:={(x,y)∣x,y∈∂FN,x=y}.
We give ∂2FN the subspace topology from ∂FN×∂FN and endow ∂2FN with the natural diagonal translation action of FN by homeomorphisms. The space ∂2FN also comes with a natural “flip” involution ϖ:∂2FN→∂2FN, ϖ:(x,y)↦(y,x). The boundary ∂FN is homeomorphic to the Cantor set, and ∂2FN is a locally compact totally disconnected but non-compact metrizable topological space.
4.1. Basic notions
Definition 4.1**.**
A geodesic current on FN is a locally finite (i.e. finite on compact subsets) positive Borel measure ν on ∂2FN such that ν is FN-invariant and flip-invariant. The set of all geodesic currents on FN is denoted \mboxCurr(FN).
The set \mboxCurr(FN) is equipped with the weak-* topology, which makes \mboxCurr(FN) locally compact. Any automorphism Φ∈Out(FN) is a quasi-isometry of FN and hence extends to a homeomorphism, which we still denote by Φ:∂FN→∂FN. Diagonally extending this homeomorphism we also get a homeomorphism Φ:∂2FN→∂2FN. There is a natural left action of Aut(FN) by homeomorphisms on \mboxCurr(FN), where for Φ∈Aut(FN) and ν∈\mboxCurr(FN) we have (Φν)(S)=ν(Φ−1(S)) for S⊆∂2FN. The subgroup Inn(FN)≤Aut(FN) is contained in the kernel of this action, and therefore the action descends to the action of Out(FN) on \mboxCurr(FN). There is also a multiplication by a scalar action of R>0 in \mboxCurr(FN)−{0}, with the quotient space P\mboxCurr(FN)=(\mboxCurr(FN)−{0})/R>0, equipped with the quotient topology. The space P\mboxCurr(FN) is compact, although infinite dimensional. For 0=ν∈\mboxCurr(FN) we denote the R>0-equivalence class of ν by [ν]. Thus [ν]={cν∣c∈R>0} and [ν]∈P\mboxCurr(FN). We call elements of P\mboxCurr(FN)projectivized geodesic currents on FN.
Let 1=g∈FN. Then g determines a pair of distinct “poles” g−∞,g∞∈∂FN, where g∞=limn→∞gn and g−∞=limn→∞g−n in FN∪∂FN. Thus (g−∞,g∞)∈∂2FN. For h∈FN we have hg∞=(hgh−1)∞, and we also have g−∞=(g−1)∞.
Definition 4.2** (Counting and rational currents).**
Let 1=g∈FN.
Then
[TABLE]
is a geodesic current on FN called the counting current for g. We call currents of the form cηg∈\mboxCurr(FN), where c>0 and 1=g∈FN, rational currents.
It is known that the set of all rational currents is a dense subset of \mboxCurr(FN), and that for any 1=g∈FN and any u∈FN we have ηg=ηugu−1=ηg−1. Therefore we also denote η[g]:=ηg where [g] is the conjugacy class of g in FN. Moreover, for φ∈Aut(FN) and 1=g∈FN, one has φηg=ηφ(g).
4.2. Simplicial charts and weights
We adopt the conventions of [14] regarding graphs. All graphs are 1-cell complexes, where 0-cells are called vertices and 1-cells are called topological edges. Every topological edge is homeomorphic to an interval (0,1) and thus admits exactly two orientations. An oriented edge of a e graph is a topological edge with a choice of an orientation. The same topological edge with the opposite orientation is denoted e−1. The set of all oriented edges of a graph Δ is denoted EΔ. We also denote by VΔ the set of all vertices of Δ. Unless specified otherwise, by an edge of a graph we always mean an oriented edge. Every oriented edge e∈EΔ has an initial vertex denoted o(e)∈VΔ and a terminal vertext(e)∈EΔ. We also have o(e−1)=t(e) and t(e−1)=o(e). An edge-pathγ of length n≥1 in Δ is a sequence of edges e1,…,en such that t(ei)=o(ei+1). We also consider a vertex v of Δ to be a path of length [math]. An edge-path γ in Δ is reduced if it does not contain subpaths of the form e,e−1 where e∈EΔ. We denote by ∣γ∣ the length of an edge-path γ.
Definition 4.3** (Simplicial chart).**
Let FN be free of rank N≥2. A simplicial chart on FN is a pair (Γ,κ) where Γ is a finite connected oriented graph with all vertices of degree ≥3 and with the first betti number b(Γ)=N, and that where κ:FN→π1(Γ,x0) is a group isomorphism (with x0∈VΓ some base-vertex), called a marking.
When talking about simplicial charts, we usually suppress explicit mention of κ.
We equip Γ and T0=(Γ,x0) with simplicial metrics, where every edge has length 1.
In this setting we denote by Ω(Γ) the set of all semi-infinite reduced edge-paths e1,e2,…, in Γ. For n≥1 denote by Ωn(Γ) the set of all reduced edge-paths e1,e2,…,en of length n in Γ. Also denote Ω∗=∪n=1∞Ωn(Γ).
If A={a1,…,aN} is a free basis of FN, then the graph RA, with a single vertex x0 and with N petal-edges marked a1,…,aN, is a simplicial chart on FN. In this case the corresponding covering tree TA:=RA is exactly the Cayley tree of FN with respect to A. We refer to such simplicial chart RA as an N-rose.
For a simplicial chart Γ, the marking κ induces an FN-equivariant quasi-isometry FN→T0, which we use to identify ∂FN with ∂T0. For (x,y)∈∂2FN denote by γx,y the bi-infinite geodesic in T0 from x to y. The group FN=π1(Γ,x0) acts on T0=Γ by covering transformations, which is a free and isometric discrete action with T0/FN=Γ.
Definition 4.4** (Cylinders and weights).**
Let Γ be a simplicial chart on FN, with T0=Γ.
(1) For two distinct vertices p,q∈T0 denote by CylΓ([p,q]) the set of all (x,y)∈∂2FN such that the bi-infinite geodesic γx,y contains [p,q] as a subsegment. The set CylΓ([p,q])⊆∂2FN is called the cylinder set corresponding to [p,q].
For any g∈FN and any p,q∈VT0,p=q we have gCylΓ([p,q])=CylΓ([gp,gq]). The cylinder sert CylΓ([p,q])⊆∂2FN are compact and open, and the collection of all such cylinder sets forms a basis for the subspace topology on ∂2FN defined above.
(2) For a geodesic current η∈\mboxCurr(FN) denote by ⟨v,η⟩Γ:=η(CylΓ([p,q])) where [p,q] is any lift of v to T0. The number 0≤⟨v,η⟩Γ<∞ is called the weight of v in η with respect to Γ.
If Γ=RA is an N-rose, we use the subscript A rather than RA for chart-related notations. E.g. ⟨v,η⟩A:=⟨v,η⟩RA, etc.
Proposition 4.5**.**
[23]**
Let FN be free of rank N≥2 and let Γ be a simplicial chart on FN. Then:
(1)
For η,ηn∈\mboxCurr(FN), where n=1,2,…, we have limn→∞ηn=η in \mboxCurr(FN) if and only if for every v∈Ω∗(Γ) we have
[TABLE]
2. (2)
Let η∈\mboxCurr(FN). Then for every k≥1 and every v∈Ωk(Γ) we have
[TABLE]
Moreover, any system of finite nonnegative weights on Ω∗(Γ) satisfying uniquely determines a current η∈\mboxCurr(FN) realizing these weights.
Condition (‡) is often called the switch condition for Γ.
For v∈Ω∗(Γ) and a nondegenerate closed reduced and cyclically reduced edge-path w in Γ, denote by ⟨v,w⟩Γ the number of ways in which v can be read, reading forwards or backwards, in a circle of length ∣w∣ labelled by w. The number ⟨v,w⟩Γ≥0 is called the number of occurrences of v in w.
A key useful fact that follows from the definitions is:
Lemma 4.6**.**
Let FN be free of rank N≥2 and let Γ be a simplicial chart on FN. Let v∈Ω∗(Γ) and let w be a nondegenerate closed reduced and cyclically reduced edge-path in Γ. Then ⟨v,w⟩Γ=⟨v,ηw⟩Γ.
∎
Definition 4.7** (Uniform current).**
Let FN=F(A) be free of rank N≥2 with a free basis A. The uniform currentνA∈\mboxCurr(FN) corresponding to A is the current given by the weights ⟨v,νA⟩A=N(2N−1)k−11 for every 1=v∈FN with ∣v∣A=k≥1.
For a current η∈\mboxCurr(FN) the support\mboxSupp(η)⊆∂2FN is
[TABLE]
Thus \mboxSupp(η) is a closed FN-invariant subset of ∂2FN.
Remark 4.8*.*
Let Γ be a simplicial chart on FN. If η∈\mboxCurr(FN) and (x,y)∈∂2FN then (x,y)∈\mboxSupp(μ) if and only if every finite nondegenerate edge subpath of γx,y projects to a reduced edge-path v in Γ with ⟨v,η⟩Γ>0.
4.3. Geometric intersection form
We refer the reader to [3, 16, 27, 45] for the background and basic info regarding the Outer space, and only recall a few facts and definitions here.
Denote by \mboxcvN the (unprojectivized) Culler-Vogtmann Outer space for FN. Elements of \mboxcvN are equivariant FN-isometry classes of free and discrete minimal isometric actions of FN on R-trees. In particular, if Γ is a simplicial chart on FN then T0=Γ defines a point of \mboxcvN.
There is a natural “axes” topology on \mboxcvN and a (right) action of Out(FN) on \mboxcvN by homeomorphisms. Moreover, the closure \mboxcvN of \mboxcvN in the axes topology is known to consist of all minimal nontrivial ”very small” isometric actions on FN on R-trees (again considered up to FN-equivariant isometry), and the action of Out(FN) extends to \mboxcvN. For T∈\mboxcvN and g∈FN denote by ∣∣g∣∣T the translation length of g in T, that is ∣∣g∣∣T=infx∈TdT(x,gx).
Let FN be free of finite rank N≥2. Then there exists a continuous geometric intersection form
[TABLE]
satisfying the following properties:
(1)
The map ⟨−,−⟩ is R≥0-homogeneous with respect to the first argument and R≥0-linear with respect to the second argument.
2. (2)
For every φ∈Out(FN), every T∈\mboxcvN and every η∈\mboxCurr(FN) we have
[TABLE]
3. (3)
For every 1=g∈FN and every T∈\mboxcvN we have ⟨T,ηg⟩=∣∣g∣∣T.
In view of the above proposition, for T∈\mboxcvN and η∈\mboxCurr(FN) we denote ∣∣η∣∣T=⟨T,η⟩.
For every T∈\mboxcvN there is an associated dual laminationL(T)⊆∂2FN, which is a certain closed FN-invariant and flip-invariant subset of ∂2FN recording the information about sequences of elements of FN with translation length in T converging to [math]. We refer the reader to [27] for the precise definition of L(T) and additional details.
Then η∈\mboxCurr(FN) has full support if and only if for every nondegenerate edge-path v in Γ we have ⟨v,η⟩Γ>0.
Lemma 5.4**.**
Let 0=ν∈\mboxCurr(FN).
Let w be a nondegenerated closed reduced and cyclically reduced edge-path in Γ such that for every n≥1 we have ⟨wn,ν⟩Γ>0. Then \mboxSupp(ηw)⊆\mboxSupp(ν).
Proof.
Since geodesic currents are flip-invariant, the assumptions of the lemma imply that the points p+=(w−∞,w∞),p−=(w∞,w−∞)∈∂2FN belong to \mboxSupp(ν). Since \mboxSupp(ηw)=∪h∈FNh{p+,p−}, we then have \mboxSupp(ηw)⊆\mboxSupp(ν).
∎
Lemma 5.5**.**
Let 1=g∈FN be a filling element and let 0=ν∈\mboxCurr(FN) be a current such that \mboxSupp(ηg)⊆\mboxSupp(ν). Then ν is a filling current.
Proof.
Suppose, on the contrary, that ν is not filling. Then there exists T∈\mboxcvN such that ⟨T,ν⟩=0.
By [27, Theorem 1.1] this implies that \mboxSupp(ν)⊆L(T). Hence \mboxSupp(ηg)⊆L(T) as well. Therefore, again by [27, Theorem 1.1], we have 0=⟨T,ηg⟩=∣∣g∣∣T, which contradicts that g is filling.
∎
Corollary 5.6**.**
Let z be a nondegenerated closed reduced and cyclically reduced edge-path in Γ representing the conjugacy class of a filling element g∈FN.
Let 0=ν∈\mboxCurr(FN) be such that for every n≥1 we have ⟨zn,ν⟩Γ>0. Then ν is a filling current.
Proof.
Lemma 5.4 implies that \mboxSupp(ηg)⊆\mboxSupp(ν).
Therefore, by Lemma 5.5, the current ν is filling.
∎
Proposition 5.7**.**
Let 0=ν∈\mboxCurr(FN) be such that for some free basis A={a1,…,aN} the following holds. For i=1,…,N let wi be a closed reduced and cyclically reduced edge-path in Γ representing the conjugacy class of ai in FN. For 1≤i<j≤N let wi,j be a closed reduced and cyclically reduced edge-path in Γ representing the conjugacy class of ai in FN. Suppose that we have ⟨win,ν⟩Γ>0 for i=1,…,N and that we have ⟨wijn,ν⟩Γ>0 for all 1≤i<j≤N and all n≥1. Then the current νX∈\mboxCurr(FN) is filling.
Proof.
Indeed, suppose ν is not filling. Then there exists T∈\mboxcvN such that ⟨T,ν⟩=0. By [27, Theorem 1.1] this implies that \mboxSupp(ν)⊆L(T).
Lemma 5.4 implies that for all i=1,…,N we have \mboxSupp(ηai)⊆\mboxSupp(ν), and for all 1≤i<j≤N we have \mboxSupp(ηaiaj)⊆\mboxSupp(ν).
Since \mboxSupp(ν)⊆L(T), [27, Theorem 1.1] implies that for i=1,…,N
[TABLE]
and for all 1≤i<j≤N we have
[TABLE]
Thus all ai and aiaj act elliptically on T and so have nonempty fixed sets in T.
For 1≤i<j≤N, the elements ai,aj,aij act elliptically on T, and therefore, by [43, Proposition 1.8], FixT(ai)∩FixT(aj)=∅. Thus FixT(a1),…FixT(aN) are nonempty subtrees of T with pairwise nonempty intersections. Therefore ∩i=1NFixT(ai)=∅. Hence FN has a global fixed point in T, which contradicts the fact that T∈\mboxcvN is a nontrivial FN-tree.
∎
Proposition 5.8**.**
Let FN=F(A) (where N≥2) and let w∈F(A) be a freely and cyclically reduced word such that for every v∈F(A) with ∣v∣A=3, the word v occurs as a subword of some cyclic permutation of w or of w−1. Then:
(1)
The element w∈FN is filling.
2. (2)
If 0=ν∈\mboxCurr(FN) is such that for all n≥1⟨wn,ν⟩A>0, then the current ν is filling in FN.
Also, as before, we denote by TA the Cayley graph of FN with respect to the free basis A. Thus TA is a simplicial tree with all edges of length 1.
Definition 6.1**.**
Let 0=ν∈Curr(FN). The automorphic distortion spectrum of ν with respect to the free basis A of FN is the set
[TABLE]
Also denote JA(ν):=infDA(ν).
Remark 6.2*.*
Thus DA(ν)⊆R>0. Since ⟨TA,φν,⟩=⟨TAφ,ν⟩=⟨Tφ(A),ν⟩, it is easy to see that the set DA(ν) is independent of the choice of a free basis A of FN and depends only on ν. Nevertheless, we will keep the subscript A in the notation DA(ν) since for our purposes the fixed choice of A is important.
Note also that for 1=w∈FN and φ∈Out(FN) we have ⟨TA,φηw⟩=⟨TA,ηφ(w)⟩=∣∣φ(w)∣∣A. Therefore in this case
DA(ηw)={∣∣φ(w)∣∣A∣φ∈Out(FN)}⊆Z>0, and JA(ηw) is the smallest ∣∣.∣∣A-length of elements in the orbit Out(FN)[w].
We need the following useful result essentially proved in [27, Theorem 1.2]:
Proposition 6.3**.**
Let ν∈\mboxCurr(FN) (where N≥2) be a filling current and let A be a free basis of FN. Then:
(1)
The set DA(ν) is a discrete unbounded subset of [0,∞).
2. (2)
For every C>0 the set {φ∈Out(FN)∣⟨TA,φν⟩≤C} is finite.
Proof.
The proof is a verbatim copy of the proof of [27, Theorem 11.2] where the same result was established under the assumption that ν∈\mboxCurr(FN) is filling. The only place in the proof of Theorem 11.2 in [27] where the filling assumption on ν was used is at the bottom of page 1461 in [27], to show that ⟨T∞,ν⟩=0 for a certain tree T∞∈\mboxcvN constructed earlier in the proof. However, in our case ⟨T∞,ν⟩=0 since ν is assumed to be filling in the present proposition.
∎
For FN and A as in Proposition 6.3 let ν∈Curr(FN) be a filling current. Then:
(1)
We have JA(ν)∈DA(ν), so that JA(ν)=minDA(ν).
2. (2)
The set ΔA(ν)={φ∈Out(FN)∣⟨TA,φν⟩=JA(ν)} is finite and nonempty.
Definition 6.5**.**
Let FN be free of finite rank N≥2, let A be a free basis of A and let ν∈Curr(FN) be a filling current.
We call the set ΔA(ν):={φ∈Out(FN)∣⟨TA,φν⟩=JA(ν)} the A-minimizing set for ν and we call the integer MA(ν):=#ΔA(ν)≥1 the minimizing multiplicity for ν with respect to A. Also put
Also let JA′(ν)=min(DA(ν)∖{JA(ν)}) and let λA(ν)=JA(ν)JA′(ν), so that λA(ν)>1. We call λA(ν) the distortion threshold for ν with respect to A. Finally denote ℑA(ν)=ΔA(ν)ν={φν∣φ∈ΔA(ν)}⊆Curr(FN) and call ℑA the orbit floor for ν.
The following statement is a key technical result of this paper:
Theorem 6.6**.**
Let FN be free of finite rank N≥2, let A be a free basis of A and let ν∈\mboxCurr(FN) be a filling current. Let λ be such that 1<λ<λA(ν) and let 0<ε<1 be such that λA(ν)>λ>1+ε.
Let W=ΔA(ν) and let M=MA(ν)=#W.
Then there exists a neighborhood U=U([ν],λ,ε) of [ν] in P\mboxCurr(FN) such that for every 1=w∈FN with [ηw]∈U the set S=W[w]⊆CN is (M,λ,ε,WN)-minimizing.
Proof.
Denote ℑ=ℑA(ν)=Wν⊆\mboxCurr(FN). Since ℑ=Wν and S=W[w], it follows that #ℑ≤M and #S≤M.
Also, by construction, S⊆Out(FN)[w]. Therefore for every [u],[u′]∈S we have Out(FN)[u]=Out(FN)[u′]. Thus conditions (1) and (2) of Definition 3.3 hold for S.
We also have ∣∣ν′∣∣A=JA(ν) for all ν′∈ℑ. Moreover, if ν′∈ℑ and ψ∈Out(FN) is such that ψν′∈ℑ then ∣∣ψν′∣∣A/JA(ν)≥λA(ν)>1. In particular, the latter statement holds whenever τ∈WN is a Whitehead move such that τν′∈ℑ.
For each ν′∈ℑ denote
[TABLE]
Proposition 6.3 also implies that for each ν′∈ℑ the set Rℑ(ν′) is finite.
Moreover, for every ν′,ν′′∈ℑ there are φ′,φ′′∈W such that φ′ν=ν′ and φ′′ν=ν′′ so that φ′′(φ′)−1∈Rℑ(ν′) and φ′(φ′′)−1∈Rℑ(ν′′). Therefore for every ν′∈ℑ we have Rℑ(ν′)ν′=ℑ.
For exactly the same reason, if [u]=φ′[w], where φ′∈W and ν′=φ′ν∈ℑ then
[TABLE]
Since λA(ν)(1−2ε)>λ>1+ε, we can choose λ<λ1<λA(ν) so that
[TABLE]
By continuity of the intersection form ⟨−,−⟩ and of the action of Out(FN) on PCurr(FN), there exist neighborhoods U([ν′]) of [ν′] in PCurr(FN), where ν′∈S, and there exists a neighborhood U([ν]) of [ν], such that the following hold:
(a)
If ν′∈ℑ and [η]∈U([ν′]) then for every ψ∈Rℑ(ν′) with ν′′=ψν∈ℑ we have ψ[η]∈U([ν′′]) and
[TABLE]
(b)
If ν′∈ℑ, [η]∈U([ν′]) and τ∈WN is a Whitehead move such that τν′∈ℑ then
[TABLE]
(c)
For every φ∈W (so that ψν∈ℑ) we have φU([ν])⊆U([φν]).
Recall that S=W[w]={φ([w])∣φ∈W}.
We will show that for every 1=w∈FN with [ηw]∈U([ν]) the set S is (M,λ,ε,WN)-minimizing, that is that U([ν]) satisfies the conclusion of this proposition.
Thus suppose that 1=w∈FN is such that [ηw]∈U([ν]).
We have already seen above that conditions (1) and (2) of Definition 3.3 hold for S.
Let [u]∈S be arbitrary. Thus [u]=φ′[w] for some φ∈W, with ν′=φ′ν∈ℑ.
By property (c) φ′[ηw]∈U([ν′]). We also have ηu=φ′ηw. Thus φ′[ηw]=[ηu]∈U([ν′]).
Claim 0. For any [x]∈S we have ∣∣x∣∣A/∣∣u∣∣A∈[1−ε,1+ε].
Indeed, let [x]∈S, so that [x]=φ′[w] for some ψ′∈W, so that ν′′=φ′′ν∈ℑ. Then, as for [u], we have φ′′[ηw]=[ηx]∈U([ν′′]). Then ψ=φ′′φ−1ν′=ν′′, so that ψ∈Rℑ(ν′). We also have [x]=ψ[u]. Then property (a) implies that ∣∣x∣∣A/∣∣u∣∣A∈[1−ε,1+ε], as required.
Claim 0 shows that conditions (3) of Definition 3.3 hold for S.
Claim 1. For any Whitehead move τ∈WN exactly one of the following occurs:
(i) We have ∣∣τ(u)∣∣A/∣∣u∣∣A∈[1−ε,1+ε], τ∈Rℑ(ν′), τν′∈ℑ and τ[u]∈S.
(ii) We have ∣∣τ(u)∣∣A/∣∣u∣∣A≥λ1>λ>1+ε and τ∈Rℑ(ν′) and τ[u]∈S.
Indeed, suppose first that τ∈Rℑ(ν′). Thus ν′′=τν′∈ℑ. Since [ηu]∈U([ν′]), property (a) implies that τ[ηu]∈U([ν′′]) and that ∣∣τ(u)∣∣A/∣∣u∣∣A∈[1−ε,1+ε].
Suppose now that τ∈Rℑ(ν′), so that τν′∈ℑ. Since [ηu]∈U([ν′]) property (b) implies that ∣∣τ(u)∣∣A/∣∣u∣∣A≥λ1>λ>1+ε. Since ∣∣τ(u)∣∣A/∣∣u∣∣A>1+ε, Claim 0 now implies that τ[u]∈S. Thus Claim 1 is verified.
Claim 1 now implies that condition (4) of Definition 3.1 hold for S.
Thus S is (M,λ,ε,WN)-minimizing, as required.
∎
Corollary 6.7**.**
Let FN be free of finite rank N≥2, let A be a free basis of A and let ν∈\mboxCurr(FN) be a filling current. Let λ be such that 1<λ<λA(ν) and let 0<ε<1 be such that λA(ν)>λ>1+ε.
Let W=ΔA(ν) and let M=MA(ν)=#W.
Then there exists a neighborhood U1=U1([ν],λ,ε) of [ν] in P\mboxCurr(FN) such that for every 1=w∈FN with [ηw]∈U the set S=W[w]⊆CN is (M,λ,ε)-minimizing.
Proof.
First choose λ′ such that λA(ν)>λ′>λ>1. Then choose ε′ such that 0<ε′<ε and
that λ′(1−ε′)>λ. By Theorem 6.6, there exists a neighborhood U=U([ν],λ′,ε′) of [ν] in P\mboxCurr(FN) such that for every 1=w∈FN with [ηw]∈U the set S=W[w] is (M,λ′,ε′,WN)-minimizing. Therefore, by Proposition 3.5, the set S is (M,λ,ε)-minimizing. Therefore U1:=U satisfies the requirements of the corollary.
∎
Remark 6.8*.*
Suppose we are in the context of Theorem 6.6 and that U⊆P\mboxCurr(FN) is a neighborhood of [ν] in P\mboxCurr(FN) provided by the conclusion of Theorem 6.6.
Then U contains a “basic” neighborhood U0⊆U of [ν] defined as follows. There exist a finite collection V⊆F(A)−{1} and ε0>0 such that for [η]∈P\mboxCurr(FN) we have [η]∈U0 if and only if for every v∈V
[TABLE]
Therefore, if 1=w∈FN is such that for all v∈V
[TABLE]
then [ηw]∈U0⊆U and the conclusion of Theorem 6.6 applies to w.
Definition 6.9**.**
Let W=W1,W2,…,Wn,… be a sequence of FN-valued random variables.
(1)
We say that W is tame if for some (equivalently, any) free basis A of FN there exists C>0 such that we always have ∣Wn∣A≤Cn where n≥1.
2. (2)
Let 0=ν∈\mboxCurr(FN). We say that the sequence W is ν-adapted if a.e. trajectory w1,w2,…,wn,… of W we have:
[TABLE]
in PCurr(FN).
In Definition 6.9 above, a random trajectory of W is implicitly required to satisfy wn=1 for all sufficiently large n (which is needed in order for ηwn to be defined), but we do not require ∣∣wn∣∣A→∞ as n→∞.
In particular, if ν=ηw for some 1=w∈FN, and the random process W always outputs Wn=ηw for all n≥1, then W is ν-adapted.
The following statement is key for our paper:
Proposition 6.10**.**
Let FN be free of finite rank N≥2, let A be a free basis of A and let ν∈\mboxCurr(FN) be a filling current. Let λ be such that 1<λ<λA(ν) and let ε>0 be such that λA(ν)>λ>1+ε.
Let W=ΔA(ν) and let M=MA(ν)=#W.
Let W=W1,W2,…,Wn,… be a ν-adapted sequence of FN-valued random variables. Then the following hold:
(1)
For a.e. trajectory ξ=(w1,w2,…,wn,…) of W there exists n0=n0(ξ)≥1 such that for all n≥n0 the set Sn=W[wn] is (M,λ,ε)-minimizing.
2. (2)
We have
[TABLE]
Proof.
Let U1=U1([ν],λ,ε)⊆P\mboxCurr(FN) be a neighborhood of [ν] in P\mboxCurr(FN) whose existence is provided by Corollary 6.7. Choose a basic neighborhood U0⊆U1 of [ν] in P\mboxCurr(FN) defined by some ε>0 and some finite collection V⊆F(A)−{1}, as in Remark 6.8
Since W is adapted to ν, for a.e. trajectory ξ=(w1,w2,…,wn,…) of W we have [ηwn]∈U0 and, since a.e. convergence implies convergence in probability, we also have
[TABLE]
Now Corollary 6.7 implies that statements (1) and (2) of Proposition 6.10 hold.
∎
Note in the context of Proposition 6.10, if 1=w∈FN is such that S=W[w] is (M,λ,ε)-minimizing then #S≤M and every element of S is (M,λ,ε)-minimal.
Theorem 6.11**.**
Let FN=F(A) be a free group of finite rank N≥2 with a free basis A.
Let W=W1,W2,… be a sequence of F(A)-valued random variables. Let 0=ν∈\mboxCurr(FN) be a filling geodesic current such that W is adapted to ν.
Then there exist M≥1, λ>1 and a subset W⊆Out(FN) with #W≤M such that for every 0<ε<1 with λ>1+ε the following hold:
(a)
For a.e. trajectory ξ=(w1,w2,…,wn,…) of W there exists n0=n0(ξ)≥1 the following holds for all n≥n0:
(1)
The set Sn=W[wn] is (M,λ,ε)-minimizing.
2. (2)
For for every φ∈W the conjugacy class φ[wn]∈Sn is (M,λ,ε)-minimal.
3. (3)
We have M([wn])⊆W[wn], and in particular, #M([wn])≤M.
4. (4)
We have rankStabOut(FN)([wn])≤K(N,M), where K(N,M)≥1 is some constant depending only on N and M.
(b)
The probability of each of the following events tends to 1 as n→∞:
(1)
The set W[Wn] is (M,λ,ε)-minimizing.
2. (2)
For for every φ∈W the conjugacy class φ[Wn] is (M,λ,ε)-minimal.
3. (3)
We have M([Wn])⊆W[Wn], and #M([Wn])≤M.
4. (4)
We have rankStabOut(FN)([Wn])≤K(N,M), where K(N,M)≥1 is some constant depending only on N and M.
Proof.
Put W=ΔA(ν) and let M=MA(ν)=#W.
Choose λ be such that 1<λ<λA(ν). L 0<ε<1 be such that λ>1+ε.
Now Proposition 6.10 implies that statemenst (a)(1) and (b)(1) of Theorem 6.11 holds, which, by definition of (M,λ,ε)-minimality, implies that (a)(2) and (b)(2) hold as well.
Now Proposition 3.7(1) implies that statements (a)(3) and (b)(3) of Theorem 6.11 holds. Finally, Proposition 3.14 implies that statements (a)(4) and (b)(4) of Theorem 6.11 hold.
∎
Theorem 6.12**.**
Let FN=F(A), ν, W⊆Out(FN) and W=W1,W2,… and be as in Theorem 6.11. Assume also that W is tame.
Then there exists K0≥1 such that the following hold:
(a)
For a.e. independently chosen trajectories ξ=w1,w2,… and ξ′=w1′,w2′,… of W there exist n0,m0≥1 such that the following hold:
(1)
For all n≥n0, the W-speed-up of Whitehead’s minimization algorithm on the input wn terminates in time at most K0n and produces an element of M([wn]).
2. (2)
If n≥n0, then for any u∈FN the W-speed-up of Whitehead’s algorithm decides in time at most K0max{n,∣u∣A2}, whether or not Aut(FN)wn=Aut(FN)u.
3. (3)
For all n≥n0,m≥m0, the W-speed-up of Whitehead’s algorithm decides in time at most K0max{n,m}, whether or not Aut(FN)wn=Aut(FN)wm′.
(b)
The probability of each of the following events tends to 1 as n→∞:
(1)
The W-speed-up of Whitehead’s minimization algorithm on the input Wn terminates in time at most K0n and produces an element of M([Wn]).
2. (2)
The element Wn has the property that for any u∈FN the W-speed-up of Whitehead’s algorithm decides in time at most K0max{n,∣u∣A2}, whether or not Aut(FN)Wn=Aut(FN)u.
3. (3)
Let W′=W1′,W2′,… be an independent copy of W. Let ni,mi≥1 be such that limi→∞min{ni,mi}=∞. Then the probability of the following event tends to 1 as i→∞:
The W-speed-up of Whitehead’s algorithm decides in time at most K0max{ni,mi}, whether or not Aut(FN)Wni=Aut(FN)Wmi′.
Proof.
Since W is tame, there exists C>0 such that for all n≥1 we always have ∣Wn∣A≤Cn.
Let K≥1 be the constant provided by Theorem 3.11.
We will show that part (a) of Theorem 6.12 holds as the proof of part (b) is essentially the same.
By part (1) of Theorem 6.11 we know that for all big enough n≥n0 (where n0 depends on the random trajectory ξ) the set Sn=W[wn] is (M,λ,ε)-minimizing and every element of Sn is (M,λ,ε)-minimal. The same is true for all Sm′=W[wm′] for all m≥m0=m0(ξ′).
(a)(1) Pick an element ψ∈W. Thus there is C′≥1 such that for every g∈FN we have ∣∣ψ(g)∣∣A≤C′∣∣g∣∣A.
We first compute [un]=ψ([wn])∈Sn. Thus ∣∣un∣∣A≤C′∣∣wn∣∣A≤CC′n, and [un] is (M,λ,ε)-minimal. Therefore, by part (a) of Theorem 3.11, the Whitehead minimization algorithm on [un] terminates in time ≤K∣∣un∣∣A≤KCC′n and and produces an element of M([un])=M([wn]).
(a)(2) Let n≥n0. Again choose any ψ∈W. Then we have wn∈UN(M,λ,ε;ψ). Therefore, by part (f) of Theorem 3.11, the ψ-speed-up of Whitehead’s algorithm decides in time at most Kmax{∣wn∣A,∣u∣A2}≤KCmax{n,∣u∣A2}, whether or not Aut(FN)wn=Aut(FN)u.
(a)(3) Suppose n≥n0,m≥m0. Thus Sn,Sm′ are (M,λ,ε)-minimizing and their elements are (M,λ,ε)-minimal. Choose any ψ∈W.
Since ψ[wn]∈Sn and ψ[wm′]∈Sm′ and since Sn,Sm′ are (M,λ,ε)-minimizing, it follows that wn,wm′∈UN(M,λ,ε;ψ). Thus, by part (e) of Theorem 3.11, the ψ-speed-up of Whitehead’s algorithm decides in time at most Kmax{∣wn∣A,∣wm′∣A}≤KCmax{m,n}, whether or not Aut(FN)wn=Aut(FN)wm′.
∎
7. Group random walks as a source of (M,λ,ε)minimality
Convention 7.1* (Terminology regarding random processes).*
Let B be a set with the discrete topology (such as a discrete group, the set of vertices of a graph, words in a finite alphabet, etc).
For any infinite sequence of B-valued random variables W=W1,W2,…,Wn,… we assume that the sample space Ω=Bω (as usual given the product topology for the discrete topologies on the factors B) is a probability space equipped with a Borel probability measure Pr. We will usually suppress the explicit mention of this probability measure Pr.
Thus a trajectory of W is a sequence ζ=(w1,w2,…,wn,…)∈Ω, where all wi∈B. We say that some property Eholds for a.e. trajectory of W if
[TABLE]
Convention 7.2*.*
For a discrete probability measure μ:G→[0,1] on a group G, we denote by ⟨\mboxSupp(μ)⟩+ the subsemigroup of G generated by the support \mboxSupp(μ) of μ. Note that we have ⟨\mboxSupp(μ)⟩+=∪n=1∞\mboxSupp(μ(n)) where μ(n) is the n-fold convolution of μ.
Thus for g∈G we have g∈⟨\mboxSupp(μ)⟩+ if and only if there exist n≥1 and g1,…,gn∈G such that g=g1…gn and μ(gi)>0 for i=1,…,n.
Definition 7.3** (Group random walk).**
Let G be a group and let μ:G→[0,1] be a discrete probability measure on G. Let X1,X2,…,Xn,… be a sequence of G-valued i.i.d. random variables, where each Xi has distribution μ. Put Wn=X1…Xn∈G, where n=1,2…. The random process
[TABLE]
is called the random walk on G defined by μ.
Recall that if G is a group acting on a set X, and μ is a discrete probability measure on G, then a measure λ on X is called μ-stationary if λ=∑g∈Gμ(g)gλ.
If G is a non-elementary word-hyperbolic group, a discrete probability measure μ on G is called non-elementary if ⟨\mboxSupp(μ)⟩+ contains some two independent loxodromic elements of G (which, for a word-hyperbolic G means some two elements g1,g2∈G of infinite order such that ⟨g1⟩∩⟨g2⟩={1}).
We need the following well-known fact (see, e.g. [39, Theorem 1.1] for the most general version of this statement for random walks on groups acting on Gromov-hyperbolic spaces; see [21, Theorem 7.6] specifically for the case of a word-hyperbolic G):
Proposition 7.4**.**
Let G be a non-elementary word-hyperbolic group and let μ be a non-elementary discrete probability measure on G. Let W=W1,W2,…,Wn,… be the random walk on G defined by μ. Then:
(1)
For a.e. trajectory w1,w2,… of W there exists x∈∂G such that limn→∞wn=p in G∪∂G.
2. (2)
Putting, for S⊆∂G, λ(S) to be the probability that a trajectory of W converges to a point of S, defines a μ-stationary Borel probability measure λ on ∂G.
This measure λ is called the exit measure or the hitting measure for W.
Recall also that if G is a word-hyperbolic group and H≤G is a non-elementary subgroup, then ∂G contains a unique nonempty minimal closed H-invariant subset Λ(H)⊆∂G called the limit set of H (see [30, 25] for details).
We need the following fact which seems be general folklore knowledge, although it does not seem to appear in the literature. We include a proof, explained to us by Vadim Kaimanovich, for completeness.
Proposition 7.5**.**
Let G be a non-elementary word-hyperbolic group, let μ be a non-elementary discrete probability measure on G, and let λ be the exit measure on ∂G for the random walk on G defined by μ.
Suppose H≤G is a non-elementary subgroup such that H⊆⟨\mboxSupp(μ)⟩+. Then Λ(H)⊆\mboxSupp(λ).
In particular if Λ(H)=∂G then \mboxSupp(λ)=∂G.
Proof.
Let λ be the exit measure on ∂G for the random walk determined by μ.
For any k≥1, the measure λ is also an exit measure for the random walk based on μ(k), and therefore λ is μ(k)-stationary.
Thus for every n≥1 we have λ=∑g∈Gμ(n)(g)⋅gλ. Hence λ dominates gλ whenever n≥1 and μ(n)(g)>0, that is, whenever g∈⟨\mboxSupp(μ)⟩+. Since H⊆⟨\mboxSupp(μ)⟩+, it follows that λ dominates hλ for every h∈H. Since H is a subgroup of G, this implies that for all h∈H the measures λ and hλ are in the same measure class. Hence for every h∈H\mboxSupp(λ)=h\mboxSupp(λ). Thus \mboxSupp(λ) is a nonempty closed H-invariant subset of ∂G, and therefore Λ(H)⊆\mboxSupp(λ), as claimed.
∎
Note that if ⟨\mboxSupp(μ)⟩+ contains a subgroup H of G such that H has finite index in G, or such that H is an infinite normal subgroup of G, then Λ(H)=∂G (see [30]) and therefore we get \mboxSupp(λ)=∂G in the conclusion of Proposition 7.5.
Theorem 7.6**.**
Let FN=F(A) be a free group of finite rank N≥2 with a free basis A. Let μ:FN→[0,1] be a finitely supported probability measure such that ⟨\mboxSupp(μ)⟩+=FN.
Let W=W1,W2,… be the random walk on FN defined by μ. Then W is tame, and there exists a filling current 0=ν∈\mboxCurr(FN) such that W is adapted to ν.
Proof.
Let TA be the Cayley graph of FN with respect to A. Thus TA is a 2N-regular simplicial tree.
Since μ is finitely, supported, we have C:=\max\{|g|_{A}\big{|}g\in F_{N},\mu(g)>0\}<\infty. Then for all n we have ∣Wn∣A≤Cn. Thus W is tame.
As usual, define by μˇ:FN→[0,1] the probability measure on FN given by the formula μˇ(g)=μ(g−1) for g∈FN.
Note that \mboxSupp(μˇ)=(\mboxSupp(μ))−1={g−1∣g∈FN,μ(g)>0}. Enumerate \mboxSupp(g) as \mboxSupp(g)={g1,…,gr} for some r≥1. Since ⟨\mboxSupp(μ)⟩+=FN, for each basis element ai∈A (where i=1,…,N) and for each ε∈{±1} there exists a positive word Ui,ε(x1,…,xr) such that aiε=Ui,ε(g1,…,gr) in FN. Then ai−ε=Ui,εR(g1−1,…,gr−1), where Ui,εR(x1,…,xr) is the word Ui,ε read in the reverse (but without inverting the letters). Since g1−1,…,gr−1∈\mboxSupp(μˇ), and 1≤i≤N, ε=±1 were arbitrary, it follows that a1±1,…,ar±1 belong to ⟨\mboxSupp(μˇ)⟩+. Hence this ⟨\mboxSupp(μˇ)⟩+=FN.
Let λ and λˇ be the exit measures on ∂FN for the random walks defined by μ and μˇ accordingly. Then, by Proposition 7.5, we have \mboxSupp(λ)=\mboxSupp(λˇ)=∂FN.
The Cayley tree TA of FN is a proper CAT(−1) geodesic metric space equipped with a properly discontinuous cocompact isometric action of FN. Therefore by a result of Gekhtman [19, Theorem 1.5] there exists a geodesic current 0=ν∈\mboxCurr(FN) such that W is adapted to ν, and, moreover, ν belongs to the measure class λˇ×λ on ∂2FN. Since both λ and λˇ have full support on ∂FN, it follows that ν has full support on ∂2FN. Therefore by [27, Corollary 1.6] the current ν is filling in FN.
We can now conclude that algebraic and algorithmic conclusions of Theorem 6.11 and Theorem 6.12 apply in the case of μ-random walks on FN, where μ has finite support with ⟨\mboxSupp(μ)⟩+=FN:
Corollary 7.7**.**
Let FN=F(A) be free of rank N≥2, with a free basis A. Let μ:FN→[0,1] be a finitely supported discrete probability measure such that ⟨\mboxSupp(μ)⟩+=FN.
Let W=W1,…,Wn,… be the random walk on FN defined by μ.
Then there exist M≥1, 0<ε<1 and λ>1+ε and a subset W⊆Out(FN) with #W≤M such that the conclusions of Theorem 6.11 and Theorem 6.12 hold for W with these choices of M,λ,ε,W.
8. Finite-state Markov chains and the frequency measures
We recall some basic notions and facts regarding finite state Markov chains here and refer the reader to [15, 18, 32, 33] for proofs and additional details.
8.1. Finite-state Markov chains.
Recall that a finite-state Markov chain, or FSMCX is defined by a finite nonempty set S of states and by a family of transition probabilitiespX(s,s′)≥0, where s,s′∈S such that for every s∈S∑s′∈SpX(s,s′)=1. Then for every integer n≥1 we also get the n-step transition probabilitiespX(n)(s,s′) where pX(1)(s,s′)=pX(s,s′) and where for n≥2 and s,s′∈S we have
[TABLE]
The sample space associated with X is the product space SN={ξ=(s1,s2,s3,…,sn,…)∣si∈S for i≥1}. The set S is given the discrete topology and SN is given the corresponding product topology, which makes SN a compact metrizable totally disconnected topological space. For i≥1 we denote by Xi:SN→S the function picking out the i-th coordinate of an element of SN.
The transition matrixM=M(X) is an S×S matrix where for s,s′∈S the entry M(s,s′) of M is defined as M(s,s′)=pX(n)(s,s′). Thus M(X) is a nonnegative matrix, where the sum of the entries in each row is equal to 1. Also, for all n≥1 and s,s′∈S we have pX(n)(s,s′)=(Mn)(s,s′). A FSMC X as above is called irreducible if for all s,s′∈S there exists n≥1 such that pX(n)(s,s′)>0. Thus X is irreducible if and only if the nonnegative matrix M(X) is irreducible in the sense of Perron-Frobenius theory.
For an FSMC X, given a initial probability distributionμ on S, we obtain the corresponding Markov ProcessXμ=X1,…,Xn,… where each Xi is an S-valued random variable with probability distribution μi on S, where μ1=μ and where for i≥2 and s′∈S we have μi(s′)=∑s∈Sμi−1(s)pX(s,s′). An initial distribution μ on S is called stationary for X if μi=μ for all i≥1 (equivalently, if μ2=μ).
It is well-known, by the basic result of Perron-Frobenius theory, that if X is an irreducible finite-state Markov chain with state set S, then there is a unique stationary probability distribution μ on S for X, and that it satisfies μ(s)>0 for all s∈S. In this case the vector (μ(s))s∈S is the Perron-Frobenius eigenvector of ∣∣.∣∣1-norm 1 for the matrix M(X) with eigenvalue λ=1, and, moreover, λ=1 is the Perron-Frobenius eigenvalue for M(X). In particular, the eigenvalue λ=1 is simple and is equal to the spectral radius of M(X).
For an FSMC X with state set S, a word w=s1…sn∈Sn of length n≥2 is called feasible if pX(s1,s2)…pX(sn−1,sn)>0. Also, we consider all words w=s∈S of length n=1 to be feasible. (Hence every nonempty subword of a feasible word is also feasible). An element ξ=(s1,s2,…)∈SN is feasible for X if for every n≥1 the word s1…sn is feasible. Denote by (SN)+ the set of all feasible ξ∈SN. Also, for every n≥1 denote by (Sn)+ the set of all feasible s1…sn∈Sn.
For a word w=s1…sn∈Sn (where n≥2) put
[TABLE]
Any initial probability distribution μ on S defines a Borel probability μ∞ via the standard convolution formulas. Namely, if n≥1,s1,…sn∈S then
[TABLE]
where Cyl(s1…sn)={ξ∈SN∣Xi(ξ)=si for i=1,…,n}.
If μ is strictly positive on S, then the support supp(μ∞) of μ∞ is equal to (SN)+. In particular, that is the case if X is an irreducible FSMC and μ is the unique stationary probability distribution on S.
Definition 8.1** (Occurrences and frequencies).**
Let X be an irreducible finite-state Markov chain with state set S.
(1) For a word w=s1…sn∈Sn (where n≥1) and an element s∈S we denote by ⟨s,w⟩ the number of those i∈{1,…,n} such that si=s. We call ⟨s,w⟩ the number of occurrences of s in w. We also put θs(w)=∣w∣⟨s,w⟩, where ∣w∣=n is the length of w. We call θs(w) the frequency of s in w.
(2) The above notions can be extended from s to arbitrary nonempty words v∈S∗ as follows. Let v=y1…ym∈Sm where yj∈S for j=1,…,m. Also denote by w∞ the semi-infinite word w∞=wwww…. For an arbitrary integer i≥1 we still denote by si∈S the i-th letter of w∞. Now define ⟨v,w⟩ to be the number of i∈{1,…,n} such that in w∞ we have si=y1,si+1=y2,…,si+m−1=ym. We call ⟨v,w⟩ the number of occurrences of v in w, and we call θv(w)=∣w∣⟨v,w⟩ the frequency of v in w.
We record the following immediate corollary of the above definition (which holds since we defined the numbers of occurrences in w cyclically).
Lemma 8.2**.**
Let w∈Sn where n≥1. Then the following hold:
(1)
We have n=∣w∣=∑s∈S⟨s,w⟩ and 1=∑s∈Sθs(w).
2. (2)
For every m≥1 we have n=∣w∣=∑v∈Sm⟨v,w⟩ and 1=∑v∈Smθv(w).
3. (3)
For every m≥1 and every v∈Sm we have
[TABLE]
and
[TABLE]
For a finite-state Markov chain X with state set S and an element ξ=(s1,s2,s3,…,sn,…) of SN, we denote wn=s1…sn∈Sn, where n≥1.
The Strong Law of Large numbers applies to a finite-state Markov chain implies:
Proposition 8.3**.**
Let X be an irreducible finite-state Markov chain with state set S and let μ0 be the unique stationary probability distribution on S. Let μ be an arbitrary initial distribution on S defining the corresponding Markov process Xμ=X1,…,Xn,…. Then the following hold:
(1)
For every s∈S and for μ∞-a. e. trajectory ξ=(s1,s2,s3,…,sn,…)∈SN of Xμ, we have
[TABLE]
2. (2)
For every 0<ε≤1 and every s∈S
[TABLE]
and the convergence in this limit is exponentially fast as n→∞.
8.2. Iterated Markov Chains
Let X be a finite-state Markov chain with state set S. Let k≥1 be an integer. Consider a finite-state Markov chain X[k] with the state set (Sk)+ and with transition probabilities defined as follows. Suppose s1…sk∈(Sk)+ and s∈S are such that pX(sk,s)>0 (so that s1…sks∈Sk+1 is feasible for X, and s2…sks∈(Sk)+). Then put pX[k](s1…sk,s2…sks)=pX(sk,s). Set all other transition probabilities in X[k] to be [math]. Note that we have X[1]=X.
It is not hard to see that if X as above is irreducible then for every k≥1 the FSMC X[k] is also irreducible. Moreover, in this case there is a natural canonical homeomorphism between the set of infinite feasible trajectories (SN)+ of X and the set (((Sk)+)N)+ of infinite feasible trajectories for X[k]. Under this homeomorphism a sequence ξ=(s1,…,sn…)∈(SN)+ goes to (v1,v2…,vn,…)∈(((Sk)+)N)+ where vi=sisi+1…si+k−1.
Moreover, if μ0 is the unique stationary distribution for X on S then
[TABLE]
where s1…sk∈(Sk)+, is the unique stationary probability distribution for X[k]. Using these facts and the application of Proposition 8.3, standard results about Markov chains imply the following statement; see [6, Proposition 3.13] for a more detailed version of this statement, with explicit speed of convergence estimates:
Proposition 8.4**.**
Let X be a finite-state Markov chain with state set S and let μ0 be the unique stationary probability distribution on S. Let μ be an arbitrary strictly positive initial distribution on S defining the corresponding Markov process Xμ=X1,…,Xn,…. Let k≥1 be an integer and let μ0[k] be the distribution on (Sk)+ defined by () above. We extend μ0[k] to Sk by setting μ0[k](v)=0 for all v∈Sk∖(Sk)+.*
Then the following hold:
(1)
For every v∈Sk and for μ∞-a. e. trajectory ξ=(s1,s2,s3,…,sn,…)∈SN of Xμ, we have
[TABLE]
2. (2)
For every 0<ε≤1 and every v∈Sk
[TABLE]
and the convergence in this limit is exponentially fast as n→∞.
∎
Corollary 8.5**.**
Let X, S, μ0 and μ be as in Proposition 8.4 above. Then:
(1)
For every m≥1 we have 1=∑v∈Smμ0[k](v).
2. (2)
For every m≥1 and every v∈Sm we have
[TABLE]
Proof.
Take a random trajectory ξ=(s1,s2,s3,…,sn,…)∈SN of Xμ to which the conclusion of Proposition 8.4 applies and put wn=s1…sn for all n≥1.
The conclusion of part (1) of the corollary now follows directly from part (1) of Proposition 8.4 and from part (1) of Lemma 8.2.
Now let m≥1 and let v∈Sm. Then by part (3) of Lemma 8.2 we have
[TABLE]
By passing to the limit as n→∞ and applying part (1) of Proposition 8.4 , we obtain part (2) of the corollary.
∎
8.3. Quasi-inversions
We also need the following, somewhat technical to state but mathematically fairly straightforward, statement to later rule out the situation where a random reduced path in a finite graph is closed but far from being cyclically reduced.
We say that a FSMC X with state set S is tight if pX(s,s′)<1 for all s,s′∈S.
Let X be a FSMC with state set S where #S≥2. Let ι:S′→S be an injective function where S′⊆S. For a word w∈(S′)∗, w=s1…sn with si∈S′ put ι(w)=ι(s1)…ι(sn). For w∈S∗∖(S′)∗ put ι(w)=ε, the empty word.
Also, for a word w∈S∗ denote by wR the reverse word. That is, if w=s1…sn with si∈S then wR=sn…s1.
Proposition 8.6**.**
Let X be an irreducible tight finite-state Markov chain with state set S where #S≥2, and let μ0 be the unique stationary probability distribution on S.
Put σ=maxs,s′pX(s,s′) (so that 0<σ<1).
Let μ be an arbitrary initial distribution on S defining the corresponding Markov process Xμ=X1,…,Xn,…. Let ι:S′→S be an injective function where S′⊆S (with ι:S∗→S∗ extended as above as well). Then the following hold for a trajectory ξ=(s1,s2,s3,…,sn,…)∈SN of Xμ:
(1)
We have
[TABLE]
and, in particular,
[TABLE]
2. (2)
Let a,b∈S be two states. Then for the conditional probability, conditioned on s1=a,sn=b, we have:
[TABLE]
and, in particular,
[TABLE]
Proof.
(1) For a trajectory ξ=(s1,s2,s3,…,sn,…)∈SN of Xμ denote wn=wn(ξ)=s1…sn. For m≤n denote by αm(wn) the initial segment of wn of length m.
Let n≥1 and let En be the event that s1s2…s⌊n⌋=(ι(sn−⌊n⌋+1...sn))R. For u=y1…yn−⌊n⌋∈Sn−⌊n⌋ let t(u)=yn−⌊n⌋∈S be the last letter of u.
For any fixed u=y1…yn−⌊n⌋∈Sn−⌊n⌋ the conditional probability Pr(wn∈En∣wn−⌊n⌋=u) is equal to
[TABLE]
Then
[TABLE]
Thus part (1) is verified. The proof of part (2) is similar and we leave the details to the reader.
∎
Remark 8.7*.*
In fact, the assumption that X be tight is not crucial in Proposition 8.6 and a similar result holds if we assume that X is an irreducible FSMC with #S≥2. We make the tightness assumption to simplify the argument.
9. Graph-based non-backtracking random walks
Convention 9.1*.*
In this section we will assume that FN=F(A) is a free group of finite rank N≥2, that Γ is a finite connected oriented graph with all vertices of degree ≥3 and with the first betti number b(Γ)=N, and that α:FN→π1(Γ,x0) is a fixed isomorphism, where x0∈VΓ is some base-vertex. We equip Γ and T0=Γ with simplicial metrics, where every edge has length 1.
Note that for Γ as above we always have #EΓ≤6N.
Definition 9.2**.**
Under the above convention, a FSMC X with state set S is Γ-based if the following hold:
(1)
We have S⊆EΓ, with #S≥2.
2. (2)
Whenever e,e′∈S are such that pX(e,e′)>0 then t(e)=o(e′) in Γ and e′=e−1.
Thus for a Γ-based FSMC X as above, the space of feasible trajectories (SN)+ can be thought of as a subset of the set Ω(Γ) of all reduced semi-infinite edge-paths γ=e1,e2,… in Γ. Similarly, (Sn)+ can be thought of as a subset of the set Ωn(Γ) of all reduced length n edge-paths e1,e2,…,en in Γ.
Proposition 9.3**.**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let μ0 be the unique stationary probability distribution on S. For every k≥1 we extend μ0[k] to Ωk(Γ) by setting μ0[k](v)=0 for every v∈Ωk(Γ)−(Sk)+.
There exists a unique geodesic current ν on FN with the following properties:
(1)
For every k≥1 and every v∈Ωk(Γ) we have ⟨v,ν⟩Γ=μ0[k](v)+μ0[k](v−1).
2. (2)
We have ⟨T0,ν⟩=1.
Proof.
We use the formulas in part (1) of the proposition to define a system of weights ν on ∪n≥1Ωn(Γ).
Note that these weights are already symmetrized since the defining equations for the weights in (1) give the same answers for v and v−1.
Now Corollary 8.5 implies that these ν weights satisfy the switch conditions. Therefore they do define a geodesic current ν∈\mboxCurr(FN).
Also, part (1) of Corollary 8.5 implies that ∑e∈EΓμ0[1](e)=1=∑e∈EΓμ0[1](e−1) and therefore
[TABLE]
Thus part (1) of the proposition holds and, in particular ν=0 in \mboxCurr(FN).
∎
Definition 9.4** (Characteristic current).**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let 0=ν∈\mboxCurr(FN) be the geodesic current constructed in Proposition 9.3 above.
We call ν the characteristic current of X and denote it ν=νX
Definition 9.5** (X-directed random walk on Γ).**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let μ be any initial probability distribution on S
defining the corresponding Markov process Xμ=X1,…,Xn,…. For every n=1,2,… put Wn=X1…Xn so that Wn takes values in Sn.
The random process Wμ=W1,…,Wn,… is called the X-directed non-backtracking random walk on Γ corresponding to μ.
Note that for Wμ and any n≥1 the only feasible values of Xn are contained in Sn∩Ωn(Γ).
Since in general Γ may have more than one vertex, a reduced edge-path in Γ (such as, for example, the length-n path given by Wn in the above setting) is not necessarily closed and thus may not define a conjugacy class in π1(Γ,x0). To get around this issue, we modify Wμ slightly, in two different ways to output closed paths in Γ.
Definition 9.6** (Closing path system).**
Let Γ be as in Convention 9.1. A closing path system for Γ is a family B=(βe,e′)e,e′∈EΓ of reduced edge-paths in Γ such that for every e,e′∈EΓeβe,e′e′ is a reduced edge-path in Γ.
For a non-degenerate reduced edge-path γ in Γ define the B-closingγ of γ as γ=γβe,e′ where e is the last edge of γ and e′ is the first edge of γ. Note also that for any nondegenerate reduced edge-path γ in Γ the B-closing γ is a reduced and cyclically reduced closed edge-path in Γ.
Note that B is above, if e,e′∈EΓ then t(e)=o(βe,e′) and o(e′)=t(βe,e′). It is also easy to see that for every Γ some closing path system B=(βe,e′)e,e′∈EΓ exists, and we can always choose B so that ∣βe,e′∣≤∣EΓ∣≤6N for all e,e′∈EΓ.
Definition 9.7** (B-closing of a non-backtracking walk on Γ).**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let B=(βe,e′)e,e′∈EΓ be a closing path system for Γ.
Let μ be any initial probability distribution on S and let Wμ=W1,…,Wn,… be the X-directed non-backtracking random walk on Γ corresponding to μ.
Define the random process Wμ=W1,…,Wn,…, where Wn the B-closing of Wn.
We call Wμ the B-closing of Wμ.
An advantage of using Wμ is that it always outputs reduced and cyclically reduced closed paths Wn of length n≤∣Wn∣≤n+C, where C=maxe,e′∣βe,e′∣. However, a weakness of this approach is that the path Wn is already a closed edge-path in Γ, with asymptotically positive probability as n→∞ (if X is irreducible and Γ-based). Therefore we offer a variation of the Wμ approach which takes this fact into account.
For a reduced nondegenerate closed edge-path γ in Γ denote by cyc(γ) the subpath of γ obtained from γ by a maximal cyclic reduction. Thus cyc(γ) is a nondegenerate closed reduced and cyclically reduced edge-path in Γ.
Notation 9.8*.*
Let B be a closing path system for Γ. For a nondegenerate reduced edge-path γ in Γ let γ˘:=cyc(γ) is γ is a closed path, and let γ˘:=γ otherwise. Thus in both cases γ˘ is a closed reduced and cyclically reduced edge-path in Γ (but it may now have length <n). We call γ˘ the modified B-closing of γ.
Definition 9.9**.**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let B=(βe,e′)e,e′∈EΓ be a closing path system for Γ.
Let μ be any initial probability distribution on S. Let Wμ=W1,…,Wn,… be the X-directed non-backtracking random walk on Γ corresponding to μ.
Define the random process Wμ˘=W˘1,…,W˘n,…, where W˘n the modified B-closing of Wn.
We call Wμ˘ the modified B-closing of Wμ.
Remark 9.10*.*
It is easy to see that the random processes Wμ, Wμ˘ considered above always satisfy condition (1) of Definition 6.9.
Theorem 9.11**.**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let μ be any initial probability distribution on S. Let B=(βe,e′)e,e′∈EΓ be a closing path system for Γ. Let νX∈\mboxCurr(FN) be the characteristic current for X.
Let Wμ=W1,…,Wn,… be the B-closing of Wμ.
Then Wμ is tame and adapted to νX.
Proof.
Put C=maxe,e′∣βe,e′∣. We have ∣Wn∣≤n+C for all n≥1, which implies that Wμ is tame.
Let ξ=e1,…,en,… be a random trajectory of Xμ, where ei∈S⊆EΓ for all i≥1. Denote wn=e1…en, so that the B-closing of wn is wn=e1…enβen,e1. Thus wn is a closed reduced and cyclically reduced edge-path in Γ with n≤∣wn∣≤n+C. We can then also think, via the marking isomorphism, of wn as defining a nontrivial conjugacy class in FN. Recall that for a nontrivial reduced edge-path v in Γ of length ∣v∣=k≥1 we have ⟨v,ηwn⟩Γ=⟨v,wn⟩+⟨v−1,wn⟩, where the latter two terms are the numbers of occurrences of v±1 in wn in the sense of Definition 8.1.
Now Proposition 8.4 implies that
[TABLE]
Therefore limn→∞n1ηwn=νX in \mboxCurr(FN), and hence limn→∞[ηwn]=[νX] in P\mboxCurr(FN). This means that Wμ is adapted to νX, as required.
∎
Theorem 9.12**.**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let μ be any initial probability distribution on S. Let B=(βe,e′)e,e′∈EΓ be a closing path system for Γ. Let νX∈\mboxCurr(FN) be the characteristic current for X.
Let Wμ˘=W˘1,…,W˘n,… be the modified B-closing of Wμ.
Then Wμ˘ is tame and adapted to νX.
Proof.
Again put C=maxe,e′∣βe,e′∣. We have ∣W˘n∣≤n+C for all n≥1, which implies that Wμ˘ is tame.
Let ξ=e1,…,en,… be a random trajectory of Xμ, where ei∈S⊆EΓ for all i≥1. Denote wn=e1…en. Thus the corresponding trajectory of Wμ˘ is w˘1,w˘2,…,w˘n,…. We need to show that limn→∞[ηw˘n]=[νX] in P\mboxCurr(FN).
For every n≥1 such that wn is a non-closed path, we have w˘n=wn, and the conclusion of Proposition LABEL:p:cl applies. Thus it remains to show that for any infinite increasing sequence ni of indices such that wni is a closed path we have limi→∞[ηw˘ni]=[νX] in P\mboxCurr(FN).
Let ni be such a sequence. Then for all i≥1 we have w˘ni=cyc(wni).
Let S1={e∈S∣e∈S} and define ι:S1→S as ι(e)=e−1 for e∈S1.
Since X is tight, Proposition 8.6 implies that we have
[TABLE]
for i→∞. Recall also that cyc(wni) is a subpath of wni.
Let v be an arbitrary nondegenerate reduced edge-path in Γ of length k≥1. Then we have
Therefore limi→∞ni1ηwni=νX in \mboxCurr(FN), and hence limi→∞[ηwni]=[νX] in P\mboxCurr(FN). As noted above, this implies that Wμ˘ is adapted to νX, as required.
∎
In summary, we get:
Corollary 9.13**.**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let μ be any initial probability distribution on S. Let B=(βe,e′)e,e′∈EΓ be a closing path system for Γ. Let νX∈\mboxCurr(FN) be the characteristic current for X. Suppose that νX is filling in FN.
Then Wμ and W˘μ are adapted to the characteristic current νX. Therefore Theorem 6.11 and Theorem 6.12 apply to Wμ and W˘μ.
We next explain several situations where one can guarantee that the current νX∈\mboxCurr(FN) is filling.
Proposition 9.14**.**
Let X be an irreducible Γ-based FSMC with state set S⊆EΓ. Let νX∈\mboxCurr(FN) be the characteristic current for X.
(1)
Suppose that X has the property that S=EΓ and that for every e,e′∈EΓ such that ee′ is a reduced edge-path in Γ we have pX(e,e′)>0. Then the current νX∈\mboxCurr(FN) is filling.
2. (2)
Suppose that Γ=RA, the N-rose corresponding to a free basis A={a1,…,aN} of FN (so that we can identify E(RA)=A±1). Suppose that X is such A⊆S and that for all 1≤i,j≤N we have pX(ai,aj)>0. Then the current νX∈\mboxCurr(FN) is filling.
3. (3)
Suppose there exists a nondegenerate reduced cyclically reduced closed edge-path w in Γ such that w represents a filling element of FN and that for every n≥2 we have pX(wn)>0. Then the current νX∈\mboxCurr(FN) is filling.
4. (4)
Suppose there exists a free basis A={a1,…,ak} such that the following hold. For i=1,…,N let wi be a closed reduced and cyclically reduced edge-path in Γ representing the conjugacy class of ai in FN. For 1≤i<j≤N let wi,j be a closed reduced and cyclically reduced edge-path in Γ representing the conjugacy class of ai in FN. Suppose that we have pX(wi2)>0 for i=1,…,N and that we have pX(wij2)>0 for all 1≤i<j≤N. Then the current νX∈\mboxCurr(FN) is filling.
Proof.
Let μ0 be the unique stationary probability distribution on S for X.
(1) The assumption on X implies that for every reduced edge-path v in Γ of length k≥1 we have μ0[k](v)>0, and therefore, by definition of νX, we also have ⟨v,νX⟩Γ>0. Thus νX∈\mboxCurr(FN) has full support and therefore νX is filling in FN.
(2) The assumptions on X (with Γ=RA) imply that for all n≥1 and all 1≤i,j≤N we have μ0[n](ain),μ0[n](ajn),μ0[2n]((aiaj)n)>0. Therefore, by definition of νX, we also have ⟨ai,νX⟩A>0 and ⟨(aiaj)n,νX⟩A>0. Therefore, by Proposition 5.7, the current νX∈\mboxCurr(FN) is filling.
(3) Again, similarly to (1) and (2) we see that for every n≥1⟨wn,νX⟩Γ>0. Therefore by Corollary 5.6 the current νX∈\mboxCurr(FN) is filling.
(4) Recall that for a reduced edge-path v in Γ of length k≥2 and starting with e1∈EΓ we have μ0[k](v)=μ0(e1)pX(v). Thus μ0[k](v)>0 if and only if e1∈S and the transition probabilities pX(e′,e′′) are >0 for all length-2 subpaths e′e′′ of v. Note also that if for the 2-nd edge e2 of v we have pX(e1,e2)>0 then e1,e2∈S.
Hence the assumptions in part (4) imply that for i=1,…,N and all n≥1 we have μ0[n](win)>0, and, similarly, for all 1≤i<j≤N and all n≥1 we have μ0[n](wi,jn)>0. Therefore, by definition of νX, we have ⟨win,νX⟩Γ>0, and, also, for all 1≤i<j≤N and all n≥1 we have ⟨wijn,νX⟩Γ>0. Therefore, by Proposition 5.7, the current νX∈\mboxCurr(FN) is filling.
∎
Example 9.15*.*
Let A={a1,…,aN} be a free basis of FN=F(A) and let Γ=RA be the corresponding N-rose.
(1) Consider an RA-based FSMC X with state set S=A±1 and transition probabilities pX(aiε,ajδ)=2N−11 if aiε=aj−δ and pX(aiε,ai−ε)=0, where ε,δ=±1. Then X is irreducible and tight. The stationary distribution μ0 is the uniform probability distribution on A±1. Then X, with an initial distrubution μ on A±1, defines the standard non-backtracking simple random walk Wμ=W1,W2… on FN=F(A). In this case the characteristic current νX is the uniform current νA corresponding to A. The current νX=νA has full support and therefore is filling. Since RA has only one vertex, the edge-path Wn is always closed, and we have W˘n=Wn. Using a closing path system B produces cyclically reduced words Wn=Wnβ, where β∈B is an appropriate closing path. The fact that Wn is adapted to νA is explained in more detail in [24] and exploited in the context of Whitehead’s algorithm there. In this case νA already has the ”strict minimality” properties similar to those of strictly minimal elements of FN. Again see [24] for details.
(2) Let Γ be a simplicial chart on FN. Consider a Γ-based FSMC X with state set S=EΓ and transition probabilities satisfying pX(e,e′)>0 if and only if ee′ is a reduced length-2 edge-path in Γ. Then X is irreducible, tight. The characteristic current νX has full support, and therefore is filling.
(3) Let X be an RA-based FSMC with state set S=A and transition probabilities satisfying pX(ai,aj)>0 for all 1≤i,j≤N. Then X is irreducible and tight.The characteristic current νX has the property that for 1=v∈F(A) we have ⟨v,νX⟩>0 if and only if v or v−1 is a positive word over A. The current νX is filling in FN by Proposition 5.7. We again have Wn=W˘n in this case, and moreover, Wn is already cyclically reduced because it is a positive word.
(4) Let Γ be a ”fan of lollipops”. Namely, Γ is a graph with a central vertex x0 with N non-closed oriented edges e1,…eN emanating from x0 with N distinct end-vertices yi=t(ei), i=1,…,N. For each of these ei at the vertex yi there is a closed oriented loop edge fi attached, with label ai∈A indicating the marking (so that fi−1 is marked ai−1). Consider a Γ-based FSMC X with state set S=EΓ−{f1−1,…,fN−1}. The transition probabilities satisfy pX(e,e′)>0 whenever e,e′∈S and ee′ is a reduced length-2 edge-path in Γ. Then again X is irreducible and tight. Moreover, the characteristic current νX∈\mboxCurr(FN) is filling by Proposition 5.7.
In (1), (2), (3) and (4) above, the processes Wμ and W˘μ (where μ is any initial distribution on the state set S of X) are adapted to the characteristic current νX of the defining tight irreducible FSMC, and νX is filling in FN. Therefore Theorem 6.11 and Theorem 6.12 apply to Wn and W˘n in these cases.
Bibliography46
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] A. Ancona, Positive harmonic functions and hyperbolicity. Potential theory – surveys and problems (Prague, 1987), 1–23, Lecture Notes in Math., 1344, Springer, Berlin, 1988
2[2] F. Bassino, C. Nicaud, and P. Weil, On the genericity of Whitehead minimality. J. Group Theory 19 (2016), no. 1, 137–159
3[3] M. Bestvina, Geometry of outer space. Geometric group theory, 173–206, IAS/Park City Math. Ser., 21, Amer. Math. Soc., Providence, RI, 2014
4[4] M. Bestvina, P. Reynolds, The boundary of the complex of free factors. Duke Math. J. 164 (2015), no. 11, 2213–2251
5[5] F. Bonahon, The geometry of Teichmüller space via geodesic currents , Invent. Math. 92 (1988), no. 1, 139–162
6[6] D. Calegari and J. Maher, Statistics and compression of scl. Ergodic Theory Dynam. Systems 35 (2015), no. 1, 64–110
7[7] C. Cashen and J. Manning. Virtual geometricity is rare. LMS J. Comput. Math. 18 (2015), no. 1, 444–455
8[8] D. Calegari, The ergodic theory of hyperbolic groups. Geometry and topology down under, 15–52, Contemp. Math., 597, Amer. Math. Soc., Providence, RI, 2013