The language of self-avoiding walks
Christian Lindorfer, Wolfgang Woess

TL;DR
This paper characterizes when the language of self-avoiding walks on an infinite, labeled graph is regular or context-free, based on the graph's end structure and symmetry properties.
Contribution
It provides a complete characterization of the formal language class of self-avoiding walk labels in terms of graph topology and automorphism group actions.
Findings
Language is regular if and only if the graph has more than one end with all ends of size 1.
Language is context-free if and only if the graph has at most two ends.
The characterization links graph topology with formal language classification.
Abstract
Let be an infinite, locally finite, connected graph without loops or multiple edges. We consider the edges to be oriented, and is equipped with an involution which inverts the orientation. Each oriented edge is labelled by an element of a finite alphabet . The labelling is assumed to be deterministic: edges with the same initial (resp. terminal) vertex have distinct labels. Furthermore it is assumed that the group of label-preserving automorphisms of acts quasi-transitively. For any vertex of , consider the language of all words over which can be read along self-avoiding walks starting at . We characterize under which conditions on the graph structure this language is regular or context-free. This is the case if and only if the graph has more than one end, and the size of all ends is , or at most ,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Cellular Automata and Applications · DNA and Biological Computing
The language of self-avoiding walks
Christian LINDORFER and Wolfgang WOESS
Institut für Diskrete Mathematik,
Technische Universität Graz,
Steyrergasse 30, A-8010 Graz, Austria
[email protected], [email protected]
Abstract.
Let be an infinite, locally finite, connected graph without loops or multiple edges. We consider the edges to be oriented, and is equipped with an involution which inverts the orientation. Each oriented edge is labelled by an element of a finite alphabet . The labelling is assumed to be deterministic: edges with the same initial (resp. terminal) vertex have distinct labels. Furthermore it is assumed that the group of label-preserving automorphisms of acts quasi-transitively. For any vertex of , consider the language of all words over which can be read along self-avoiding walks starting at . We characterize under which conditions on the graph structure this language is regular or context-free. This is the case if and only if the graph has more than one end, and the size of all ends is , or at most , respectively.
Key words and phrases:
Self-avoiding walks, labelled graph, Cayley graph, regular and context-free languages
2010 Mathematics Subject Classification:
20F10, 68Q45, 05C25
1. Introduction
Let be a locally finite, connected, infinite graph without loops or multiple edges. We think of the edges as being directed, so that each has an initial vertex and a terminal vertex ; there is an involution which exchanges those endpoints. In drawing a graph, every pair will usually be represented (and thought of) as one undirected edge. We shall always consider quasi-transitive graphs, i.e., the automorphism group of is supposed to act with finitely many orbits on .
A walk in is a subgraph spanned by a sequence with Its length is , and its initial and terminal vertices are and , respectively. This comprises the empty walk starting (and ending) at a vertex, which has length [math]. A self-avoiding walk (SAW) is one which visits no vertex twice.
We choose a root vertex and consider the number of all SAWs of length starting at . Hammersley [18] proved that in a quasi-transitive graph, the limit
[TABLE]
exists and is independent of the choice of the root . It is the reciprocal of the radius of convergence of the SAW-generating function
[TABLE]
The number is called the connective constant of . The study of self-avoiding walks, primarily in lattice graphs, was initiated by the chemist and Nobel laureate Paul J. Flory [13] as a tool for studying polymer growth. It is a combinatorial issue whose mathematical study has been advanced to a large extent within Statistical Physics and Probability. Good references are the monograph by Madras and Slade [24] and the lecture notes by Bauerschmidt et al. [3]. The explicit computation of or just is a difficult task. A highlight is the rigorous result of Duminil-Copin and Smirnov [11] for the hexagonal lattice, while even for the standard square lattice, is not known rigorously.
Here, we propose an apparently new approach, by connecting self-avoiding walks with the theory of formal languages. Given a finite alphabet , a language over is a subset of , the free monoid over consisting of all words with finite length whose letters come from . Our setting is as follows. We have a pair , where is a graph as above, and is a labelling which assigns to every oriented edge a label . Our assumptions are that the labelling is deterministic, that is, different edges with the same initial or terminal vertex have distinct labels, and that the group of all graph automorphisms of which preserve acts quasi-transitively (with finitely many orbits). The most significant class of labelled graphs are the Cayley graphs of finitely generated groups.
The labelling extends to all walks: for , we set
[TABLE]
For any set of walks in the labelled graph , the associated language is
[TABLE]
Now let be the set of all self-avoiding walks starting at . The associated language of self-avoiding walks is L_{\textrm{SAW}}=L_{\textrm{SAW},o}(X)=L\bigl{(}\Pi_{\textrm{SAW},o}\bigr{)}.
In the Chomsky-hierarchy of formal languages, the first basic class consists of the regular languages, which are those accepted by a finite state automaton, or equivalently, generated by a right-linear grammar. The second class consists of the context-free languages, which are those accepted by a pushdown automaton, resp., generated by a context free grammar. (Here, we shall use grammars.) Further details will be given below, and an excellent reference is the monograph by Harrison [19]. Our main result is the following.
Theorem 1.1**.**
*Let be a deterministically labelled, quasi-transitive graph, connected and locally finite. For any choice of the root vertex, the following holds.
(a) is regular if and only if has more than one end and all ends have size .
(b) is context-free if and only if has more than one end and all ends have size at most . In this case, is unambiguous context-free.*
See below for the meaning of “unambiguous”. A precise definition of an end of a graph and its size will also be given below; a reference close to the spirit of the present paper is Thomassen and Woess [30].
Relating walks in labelled graphs, in particular Cayley graphs, with formal language theory has an important history. Let be the set of all walks in starting and ending at , possibly with several self-intersections. Let W=W(o)=L\bigl{(}\Pi(o,o)\bigr{)}. For a Cayley graph of a group, this language is called the word problem.
Anisimov [2] showed that the word problem is regular if and only if the group is finite, and this extends to quasi-transitive labelled graphs. In ground-breaking work, Muller and Schupp [25] showed that the word problem is context-free if and only if the group is virtually free. In particular, regularity, resp. context-freeness of the word problem are group invariants which do not depend on the specific generating set. In subsequent work [26], context-free labelled graphs were defined via structural properties (not necessarily quasi-transitive), and these are precisely the (deterministically) labelled graphs for which is context-free; see Ceccherini and Woess [6]. For further work on language-theoretic issues related with groups , see e.g. Pélecq [27], as well as [5], [33], and for a new proof of the main result of [25] and related material, Dieckert and Weiss [9].
Applied to a Cayley graph of a group, Theorem 1.1 says that the group is virtually free if is context-free, but the latter property is not a group invariant.
The inspiration for the present work came from a note by Gilch and Müller [15], who determined for free products of finite graphs – an instance of the case where is regular. Furthermore, in the computation by Zeilberger [35] of for the bi-infinite ladder graph, a context-free grammar is inherent although not mentioned or used directly.
This paper is organised as follows. In §2, we provide the necessary background on the end space of graphs, as well as on regular and context-free languages. In §3, we discuss strips in locally finite graphs, that is, two-ended quasi-transitive subgraphs. Their ends have the same finite size – the size of the strip. In the quasi-transitive case, if there is an end of finite size , there must be a strip of the same size. Furthermore, if there is a thick end which is fixed by some non-elliptic automorphism (i.e., one which does not fix a finite subset of ), then there are strips of arbitrary size. In §4, the Pumping Lemmas for regular, resp. context-free languages are then used to show the following. If is quasi-transitive and contains a strip of size 2, then cannot be regular, and if it contains a strip of size 3, then cannot be context-free.
Thus, we are left with considering graphs whose ends have size at most 2. In §5 we first consider the case when all ends have size 1. Then the cut-vertex tree decomposition of has finite blocks, and we derive that is regular. If all ends have size , then we use the 3-block tree decomposition of of Droms, Servatius and Servatius [10] to construct an unambiguous context-free grammar for . (Alternatively, one might use the vertex cuts of Dunwoody and Krön [12].) To conclude, if contains ends of both sizes and , one can combine -connectedness of the (possibly infinite) blocks of the cut-vertex tree decomposition with the method of the preceding case (ends of size 1) to get context-freeness.
In the final §6, we start with a discussion of implications and future work, recall some context-free examples, including one of Lindorfer [23], and provide an additional more detailed example, where the SAW-generating function is algebraic over , but not rational.
2. Basic background
** A**. Ends and automorphisms of graphs
Recall that for our language-theoretic approach, it is convenient to consider the edge set of our (locally finite, connected, infinite) graph as being directed and with an involution which inverts the orientation. When speaking about ends, it is sufficient to identify each pair of oppositely oriented edges with the same endpoints with one non-oriented edge. We shall frequently switch back and forward between these two viewpoints. The standard graph distance in a connected graph is denoted . Walks will also be written in terms of vertices instead of edges, so that with is the same as , where . “Path” is a synonym for “self-avoiding walk”.
The space of ends of a connected graph was introduced in Graph Theory by Halin [16], and – without graph terminology – earlier by Freudenthal [14].
If , then is the subgraph obtained from by removing and all edges incident to vertices in . If removing disconnects then is called a separating set. In this case, if then is called a cut-vertex. A ray is a subgraph of spanned by a sequence of distinct vertices such that for all . A double ray is defined analogously. Two rays are called equivalent, if for any finite , both end up in the same component of , i.e., all but finitely many of their vertices are contained in that component. An end is an equivalence class of rays. If is an end and is finite, then we write for the unique component of in which all representing rays of end up, and we say that belongs to that component.
A defining sequence for an end consists of a sequence of finite subsets of such that for each . If there is a finite number such that for all , then the end is called thin, and the minimal such is called the size of . Otherwise, the end is called thick with size . It is a consequence of Menger’s theorem that an end of size contains (as an equivalence class) disjoint rays, and is maximal with respect to this property.
The space of all ends is the boundary of a compactification of : the topology on is discrete on . In the above notation, let be the union of with the set of all ends belonging to that component. Then for any defining sequence of , the family () is a neighbourhood base of . The vertex set is open and dense in .
The automorphism group consists of all permutations of which preserve neighbourhood. A subgroup is said to act quasi-transitively if there are only finitely many orbits (), and transitively, if for some (every) . In this case, the graph itself is called (quasi-)transitive. It is well-known that an infinite, locally finite, connected graph which is quasi-transitive has one, two or infinitely many ends, see [14]. If it has one end, then this end is thick. If it has two ends, then both are thin and have the same size (this situation will be important in §3). If it has infinitely many ends, then there are thin ends. See e.g. [17].
Now we return to the setting where the oriented edges of are labelled by a finite alphabet . We always assume that the labelling is deterministic. The most typical example arises when is a finitely generated group and is a finite, symmetric set of generators. The Cayley graph of with respect to has vertex set . We choose , and for each and there is an oriented edge from to with label . The group acts transitively by multiplication from the left. More generally, we have the following.
Lemma 2.1**.**
The group of all graph automorphisms of the connected graph which preserve the edge labels acts fixed-point freely: if and for some then , the unit element of .
In particular, if acts quasi-transitively then it is finitely generated.
Proof.
Suppose . Since the labelling is deterministic, for all neighbours of . By connectedness of , we must have . If acts quasi-transitively then let be the diameter of the (finite, connected) factor graph . Given , we have . Now consider the new graph with the same vertex set , where two vertices are connected by a non-oriented edge whenever . In , each orbit induces a connected, locally finite subgraph on which acts transitively and fixed-point freely. Therefore that subgraph is a Cayley graph of , and is finitely generated, see e.g. the old note by Sabidussi [28]. ∎
Even if our main interest is in Cayley graphs, we need the quasi-transitive case throughout our proofs.
At this point, we also recall the notion of a quasi-isometry between two metric spaces and . This is a mapping such that there are constants , such that for all and ,
[TABLE]
Every quasi-isometry has a quasi-inverse , a quasi-isometry such that and are at bounded distance from the respective identity mappings.
By a quasi-isometry between two connected graphs, we mean a quasi-isometry between the vertex sets, equipped with the respective standard graph metrics.
Remark 2.2**.**
For any two Cayley graphs of the same finitely generated group with respect to two different finite, symmetric sets of generators, the identity mapping is a quasi-isometry with in (1), i.e., the mapping is bi-Lipschitz. In the situation of Lemma 2.1 and its proof, the identity mapping on any orbit is a quasi-isometry from the Cayley graph of (given as the subgraph of induced by that orbit) to the original graph . Indeed, this is a quasi-surjective bi-Lipschitz embedding, i.e., in (1).
In the following Lemma, the subscripts and refer to the respective graphs, their metrics, and so on.
Lemma 2.3**.**
Let and be two connected graphs with bounded vertex degrees. If is a quasi-isometry, then it extends to a continuous mapping which restricts to a homeomorphism between the spaces of ends of and .
There is an increasing function with the following property: if and with then there is with
[TABLE]
In particular, if has a defining sequence with then has a defining sequence with , and if is a thick end, then so is .
This follows with some small additional effort from [34, Lem. 21.3 and 21.4] and their proofs. Using [30, Thm. 4.4] and its proof for the case where both and are quasi-transitive, we obtain the following.
Corollary 2.4**.**
Let and be connected, quasi-transitive graphs and be a quasi-isometry. Then there is an increasing function such that any end of size maps to an end of size at most .
** B**. Regular and context-free languages
As mentioned above, [19] is an optimal source on context-free languages. We recall from the introduction that for any alphabet (set) , we denote by
[TABLE]
the set of all words over . Here, is the length of , and when , this is the empty word .
A context-free grammar is a quadruple , where is the finite set of variables (with ), the variable is the start symbol, and is a finite set of production rules. We write if . For , a rightmost derivation step has the form , where and with , and . A rightmost derivation is a finite sequence such that ; we then write . Each generates the language . The language generated by is . The grammar is called unambiguous, if every has a unique rightmost derivation.
A grammar and the language which it generates are called linear, if each production is of the form
[TABLE]
and if in that situation one always has , then the grammar and the language are called right-linear or regular. In this case, the language is accepted by a (deterministic) finite state automaton, see [19].
Typical tools to show that a language is not regular, resp. context-free, are the well known Pumping Lemmas.
Lemma 2.5** (Pumping Lemma for regular languages).**
Let be a regular language. Then there is a pumping length such that every with can be written as , where , and for all .
Lemma 2.6** (Pumping Lemma for context-free languages).**
Let be a context-free language. Then there is a pumping length such that every with can be written as , where , and for all .
3. Strips in locally finite graphs
The action of the automorphism group of a locally finite, connected graph extends in an obvious way to the space of ends. The automorphisms can be classified into 3 types:
- •
is elliptic, if it fixes a finite subset of ,
- •
a non-elliptic is parabolic, if it fixes a unique end, and
- •
it is hyperbolic, if it fixes each of a unique pair of ends.
While this terminology was not used by Halin [17], he showed that for non-elliptic automorphism and every the sequence uniquely defines an end of which is called the direction of , denoted by and fixed by . The following theorem serves as one of the main pillars for our results.
Theorem 3.1** (Halin [17, Thm. 9]).**
Let be a non-elliptic automorphism acting on the locally finite connected graph . Then the following holds:
- (a)
* and have the same size .* 2. (b)
* (i.e. is hyperbolic) if and only if .* 3. (c)
There are disjoint double rays which are invariant under . 4. (d)
If is hyperbolic then there are a set with and an integer such that and are defining sequences for and respectively. Each meets every () in precisely one vertex.
Definition 3.2**.**
A locally finite, connected graph is called a strip if it is quasi-transitive and has precisely two ends.
The structure of strips is well understood. We collect without proof those basic facts which will be needed below. More about strips can also be found in Jung and Watkins [22] and Imrich and Seifter [20], [21].
For a strip there is some hyperbolic automorphism fixing each of the two ends and of . Thus by the previous theorem and have the same finite size . Moreover it provides
- •
a finite set with , together with
- •
an automorphism , such that and are defining sequences for and , respectively and
- •
disjoint, -invariant double rays, each of them intersecting every in precisely one vertex.
By replacing with a suitable power , we can assume that the subgraph of spanned by is finite and connected.
In this situation we call a -strip of size . We use the same terminology if is a subgraph of a bigger graph , and is an automorphism whose restriction to has the above properties.
The following lemma refines the well-known argument that in a quasi-transitive graph with more than one end, the directions of hyperbolic automorphisms are dense in the space of ends.
Lemma 3.3**.**
Let be locally finite and connected and act quasi-transitively. If has a thin end of size then it contains a -strip of size for some .
Proof.
Let the end of have size , take disjoint rays in and let be a defining sequence for such that for all and each meets each of the rays. Fix a finite set of representatives of the orbits given by the action of on . For write and let such that contains an element of . Then every is a tight -vertex cut, i.e., a set of cardinality such that has at least two components and every vertex of has at least one neighbour in each of them. Also is a component of .
By [30, Prop. 4.2] there are only finitely many tight -vertex cuts containing some given vertex and clearly every cut splits the graph in finitely many components, hence is a finite set. Pick some such that and let . Then
[TABLE]
We show that is hyperbolic and the direction has size . Indeed, is a defining sequence for the end belonging to all components , where . In particular has size at most . By Theorem 3.1, is hyperbolic. On the other hand there are disjoint paths from to . Their images under build disjoint -invariant double rays , implying that has size , as required. Add to those double rays a finite collection of finite paths connecting those double rays with each other and all their images under , . After possibly replacing by a suitable power , , we obtain a subgraph of which is a -strip of size . ∎
Lemma 3.4**.**
Let be locally finite and connected and act quasi-transitively on . If contains a parabolic element then for every , contains a -strip of size at least for some suitable .
Proof.
Suppose that is parabolic and is the unique end fixed by . By Theorem 3.1, there are countably many disjoint double rays , , which are invariant under and represent . For , we can find some connected, finite subgraph of which meets each of . Then the subgraph spanned by together with all the , , is a strip and its two ends have size at least . Application of Lemma 3.3 concludes the proof. ∎
Remark 3.5**.**
Whenever a graph contains a -strip of size , it also contains a -strip of any size in . This can be seen by deleting vertices from the set from the definition of -strips and their images under , .
4. Context-freeness and ends
In this section we prove the first half of Theorem 1.1, namely
Theorem 4.1**.**
Let be a connected, locally finite, deterministically edge-labelled graph on which acts quasi-transitively.
- (1)
If is context-free, every end of has size at most 2. 2. (2)
If is regular, every end of has size 1.
The proof is based on the following two lemmas and two propositions.
Lemma 4.2**.**
Let be a subgraph of which is invariant under a subgroup of acting quasi-transitively on . Suppose that is regular, resp. context-free. Then there is such that is also regular, resp. context-free.
Proof.
is also a deterministically labelled graph. If we choose any and is the set of all walks in starting at , then is clearly a regular language. Namely, the factor graph is a finite state automaton for , easily converted into a right-linear grammar; see [19].
Now there is some with , and there is a path in of that length from to . Let , and let be the set of all concatenated paths , where . Thus, is again regular.
If is regular (resp. context-free), then by [19] also is regular (resp. context-free). Since is the only vertex of which is contained in ,
[TABLE]
If we delete from the latter language the common prefix , we also get a regular (resp. context-free) language. ∎
Lemma 4.3**.**
Let be a connected, infinite, locally finite and deterministically labelled graph and let act quasi-transitively on . Assume that is context-free for a choice of . Then contains a non-elliptic element.
Proof.
Let be the pumping length of given by Lemma 2.6. Let with . Then it can be written as where , and for all . Now either or and . Set , in the first case and , in the second case (so that and ). Let be the end-vertex of the path starting at and labelled by the word . Then, for every , we have the unique self-avoiding walk of length which starts at and has label . Thus, is a self-avoiding extension of , and in the limit we obtain a ray . Using that acts quasi-transitively on there must be some and some such that . Without loss of generality (up to truncation of an initial piece of ), we assume . Then for every , and [17, Prop. 12] yields that is non-elliptic. ∎
Remark 4.4**.**
In group theoretical terms the last lemma says that if the finitely generated group acts quasi-transitively on and is an infinite torsion group then is not context-free.
For the proofs of the next two propositions, we adopt the following notation: if is the label of an arbitrary walk in , then denotes the label of the reversed walk.
Proposition 4.5**.**
Let be a connected, infinite, deterministically edge-labelled graph on which acts quasi-transitively.
If contains a -strip of size , where , then is not regular.
Proof.
We suppose that is regular for and will reach a contradiction.
The strip has two -invariant doubly infinite rays , and there must be a shortest path connecting the two rays, which is contained in the connected subgraph mentioned in the definition of -strips. Without loss of generality, we assume that goes from to . Then the subgraph of spanned by , and all () is a -invariant subdivision of the bi-infinite ladder, see Figure 1.
We can apply Lemma 4.2 to , and there is such that is regular. Without loss of generality, we assume that either lies on between and and is distinct from the latter, or that lies on and is distinct from . (Otherwise we can exchange the two rays.) In Figure 1, we indicate the possible positions of .
Let be the label of the path from via to . Write for the common label of all the paths () and for the label of the reversed paths. Next, let denote the common label of each of the paths from any to within . And finally, let be the common label of the paths from any to within (). Each of the words is non-empty, but they are in general not just elements of .
The language consisting of all words
[TABLE]
is regular, so that by the closure properties of regular languages, also its intersection with is regular. Now, consists of the labels of all self-avoiding walks starting at which are of the form (2). Looking at Figure 1, such a walk goes from to , then upwards to and to the right to along , then downwards to , and finally to the left to . Thus, in order to be self-avoiding, one must have .
Let be the pumping length of Lemma 2.5 for , and let . In the decomposition of the lemma, implies that is a postfix of . That is, for some word . Now also must be in , so that there must be such that
[TABLE]
Since the labelling is deterministic and the first symbol of and the first symbol of are both labels of different edges starting at , these symbols must be different. We can conclude that and . This is longer than , so that . But then the walk with label starting at is not self-avoiding, a contradiction. ∎
Proposition 4.6**.**
Let be as in Proposition 4.5.
If contains a -strip of size , where , then is not context-free.
Proof.
We suppose that is context-free and will again arrive at a contradiction.
The strip contains three -invariant rays , and . Up to renumbering the rays, exchanging them, and possibly replacing with a power of itself, we may assume to have the following situation: there is a path from to which meets and only at its endpoints and does not meet ; there is a path from to () which meets and only at its endpoints and does not meet , and furthermore, lies strictly between and Then the subgraph of spanned by the three rays and all images and is a subdivision of the bi-infinite -ladder, see Figure 2.
Again, Lemma 4.2 applies to , and there is such that is context-free.
Up to possibly renumbering the rays, inverting their direction or exchanging with , we can assume without loss of generality that lies on the “rectangle” with corners and , but not on the path . In Figure 2, we indicate the possible positions of .
We let be the label of the self-avoiding walk starting at and running around that rectangle in clockwise order up to . Write for the label of and for the label of . Next, let and denote the labels of the subpaths of from to and from to , respectively. Thereafter, denotes the label of the path from to on and finally is the label of the path from to on . The automorphism is label preserving, any translates of the previous paths are also labelled by the same words.
Each of the words is non-empty, but again, they are in general not just elements of .
Similarly to the previous proposition, the language consisting of all words
[TABLE]
is regular, so that by the closure properties of context-free languages, its intersection with is context-free. Following the arrows in Figure 2, one can see a self-avoiding walk with such a label, with . In general, for a walk with label as in (3) to be self-avoiding, one must have
[TABLE]
Let be the pumping length of Lemma 2.6 for , and let . In the decomposition of the lemma, implies that is a subword of [Case 1], or of [Case 2] (or both, meaning that it is contained in .
In both cases, for any , there must be such that
[TABLE]
In Case 1, , and thus , must end with Since the last symbol of and the last symbol of are labels of different edges ending at , they have to be different. It follows that . Using that we get that either or , contradicting (4).
In Case 2, we get in the same way that for all , and either or . Again this contradicts (4).∎
We are now almost ready for the proof of Theorem 4.1. We will need Bass-Serre theory. As the topic cannot be briefly introduced, we will not give all definitions here. The reader is referred to Serre [29] and Dicks and Dunwoody [8].
The ends of a finitely generated group are the ends of any of its Cayley graphs with respect to a finite, symmetric set of generators. In fact, they do not depend on the choice of the generating set; see Remark 2.2.
A group is called accessible if it is the fundamental group of a finite graph of groups having finite edge groups and vertex groups which are finite or have one end. The following lemma is [8, Corollary IV.1.9].
Lemma 4.7**.**
A group is the fundamental group of a finite graph of finite groups if and only if is finitely generated and virtually free.
A locally finite graph is called accessible, if there is an integer such that any two ends of can be separated by a set containing or fewer vertices. Thomassen and Woess [30] showed that a connected, locally finite, transitive graph is accessible if and only if there is an integer such that each thin end of has size at most . Moreover, they also proved that a finitely generated group is accessible if and only if some (and therefore all) of its Cayley-graphs are accessible.
Proof of Theorem 4.1.
First, cannot be one-ended. Indeed, in that case, that end has to be thick. If has only elliptic elements, then is not context-free by Lemma 4.3. Otherwise, has parabolic elements, and Lemma 3.4 combined with Proposition 4.6 implies as well that is not context-free.
Thus, has more than one end, whence there are thin ends. If is context-free then by Lemma 3.3 and Proposition 4.6 all thin ends have size at most . We need to prove that there are no thick ends.
Recall from Lemma 2.1 and its proof that in the graph , the orbit of induces a Cayley graph of the finitely generated group . The identity mapping induces a quasi-isometric embedding of into . Indeed, it is bi-Lipschitz and quasi-surjective, i.e., in (1). Consequently, by Corollary 2.4, all thin ends of have size at most . By [30], the group is accessible. Thus, it is the fundamental group of a finite graph of finitely generated (sub)groups which are finite or one-ended. If all of them are finite, then by Lemma 4.7, is virtually free, so that it only has thin ends since is quasi-isometric with a tree.
Thus, if has a thick end, then it must have a finitely generated subgroup which has one (thick) end. Above, we have identified with the (vertex set of) in , and is contained in that orbit. Under this identification, the group unit corresponds to the “root” vertex . Let be a finite, symmetric set of generators of . Then for each there is a (shortest) path in from to the image . We can choose these paths such that , where is the reversal of . Let and be the sets of vertices and edges of , respectively. Then we consider the subgraph of with
[TABLE]
Clearly, is a connected subgraph of which inherits the labels from the edges of . Also, is quasi-isometric with the Cayley graph . Indeed, the embedding , is bi-Lipschitz and quasi-surjective, i.e., in (1).
Therefore, has one end, which has to be thick, and it is quasi-transitive under . But then we are back to the situation of the beginning of this proof, that is, cannot be context-free. But this contradicts Lemma 4.2.
We conclude that and thus also have no thick ends. ∎
5. Graphs with context-free language of SAWs
The goal of this section is to prove the second half of Theorem 1.1, namely
Theorem 5.1**.**
Let be a connected, locally finite, deterministically edge-labelled graph on which acts quasi-transitively. Then for every vertex of the following holds:
- (1)
If all ends of have size 1, then is regular. 2. (2)
If all ends of have size at most 2, then is unambiguous context-free.
For an integer a graph is called -connected if it has more than vertices and no set of less than vertices is a separating set in . A (2-)block in a graph is a maximal connected subgraph of containing no cutvertex. If is connected and has at least 2 vertices, every block of is either a pair of vertices connected by an edge or a 2-connected graph. The intersection of 2 different blocks of is either empty or a cutvertex in . The block-cutvertex tree corresponding to is the graph having as vertices the blocks and the cutvertices of , where a block is adjacent to every cutvertex it contains. Denote for an edge of by the block and by the cutvertex incident to . More about blocks and the block-cutvertex tree can be found in [32].
If is locally finite, every cutvertex of belongs to a finite number of blocks and therefore has finite degree in . On the other hand an infinite block of can contain infinitely many cutvertices, so need not be locally finite.
For any automorphism , the image of a block of is again a block of and the same holds for cutvertices. Therefore, whenever a subgroup acts quasi-transitively on , it also acts quasi-transitively on the graph .
The following lemma shows that blocks of quasi-transitive graphs are always quasi-transitive graphs.
Lemma 5.2**.**
Let be a connected, locally finite graph and suppose acts quasi-transitively on . Then for any block of , the set-wise stabilizer of in acts quasi-transitively on .
Proof.
acts quasi-transitively on , so it acts with finitely many orbits on , i.e., the set is finite. But every mapping some edge of onto another edge of clearly fixes the block and is therefore also contained in . This implies that also acts with finitely many orbits on and thus quasi-transitively on . ∎
The following lemma is a simple consequence of the fact that a block cannot contain ends of size 1, because defining sequences of such ends consist of cutvertices.
Lemma 5.3**.**
Let be a locally finite graph such that all ends of are of size 1. Then every block of is finite.
In the case where the graph has infinite blocks, we want to further decompose them. There are different ways to do this. One natural way is to use 3-block-decompositions, first introduced by Tutte (see [32]) for finite graphs and then generalized to infinite graphs by Droms, B. Servatius and H. Servatius in [10]. In this theory, sometimes graphs may have multiple edges between a single pair of vertices, we will call them multi-graphs.
Let and be two multi-graphs and let and be (directed) edges. The edge amalgam of and along and is the graph obtained from the disjoint union of and by identifying the vertices with , with and erasing the edges and . A convenient way to represent a sequence of edge amalgamations of a (not necessarily finite) set of multi-graphs is the edge-amalgam tree . Vertices of are the multi-graphs used in the amalgamation. For clarity we will denote by the multi-graph corresponding to the vertex of . Two vertices and are connected by an undirected edge in , if and are amalgamated along some edges and . We additionally introduce a label function assigning to every directed edge the edge of used in the amalgamation of and . For an edge of is called virtual, if it is the label of some edge of , otherwise it is called non-virtual. Denote the resulting multi-graph obtained from a sequence of edge amalgamations given by an amalgamation tree by . Then virtual edges disappear during the progress, while non-virtual edges are still present in .
A multilink is a multi-graph consisting of 2 vertices and a (finite) positive number of undirected edges between these vertices. A multi-graph is said to be a 3-block if it contains at least 3 edges and is either a cycle (closed path), a multilink or a locally finite 3-connected graph. An edge-amalgam tree is called a 3-block tree if for every , is a 3-block and additionally for every edge the corresponding graphs , are neither both multilinks nor both cycles. Note that in general, 3-block trees need not be locally finite. This is due to the fact that there can be infinite 3-blocks and therefore also infinitely many amalgamations of a single 3-block.
Theorem 5.4** (Droms, B. Servatius, H. Servatius [10, Thm. 1]).**
For any locally finite, 2-connected graph there is a unique 3-block tree such that .
For a given 2-connected graph the unique 3-block tree given by the above theorem will henceforth be denoted by . The proof of the theorem is constructive and allows us to “decompose” in a unique way into (possibly infinitely many) 3-blocks, such that is obtained from amalgamating these 3-blocks as given by . The set of vertices and the set of non-virtual edges of each 3-block will be considered as a subsets of and , respectively. A single vertex of may appear in different blocks.
has the following 2 properties:
- (a)
For every virtual edge , , there is a finite sub-tree of containing but not and a path in connecting the endpoints of and consisting of edges of . 2. (b)
Let and . Then is contained in a finite sub-tree of such that all edges of incident to are edges of .
Due to the uniqueness of the decomposition, symmetries on the graph carry over to in a canonical way. Moreover, as in the case of 2-blocks, any acting quasi-transitively on also acts quasi-transitively on .
Lemma 5.5**.**
Let be a simple, locally finite, 2-connected graph such that all ends of are of size at most 2. Then every 3-block of is finite.
Proof.
Clearly, multilinks and cycles are finite. Every end of a 3-connected graph must be of size at least 3, because every defining sequence of an end consists of separating sets.
Assume that there is an infinite 3-block of . Then it contains an end of size at least 3 and this end contains 3 disjoint rays. By property (a) of , we can replace all virtual edges contained in the rays by finite paths consisting of non-virtual edges of which are not in . We obtain 3 disjoint rays in , which belong to the same end of . This is a contradiction to the assumption that all ends of have size at most 2. ∎
Theorem 5.6**.**
Let be a 2-connected, locally finite, deterministically edge labelled graph and let act quasi-transitively on . If every end of is of size at most 2, then for every , the language is unambiguous context-free.
Proof.
Let be a set of representatives of the finite set of orbits of directed edges of . For and a vertex write for the representative of in and for the vertex in representing . Define for and the set of self-avoiding walks in starting at and not containing the virtual edge . Let be the subset of consisting of all walks not containing the second vertex of . Note that both sets are finite because is finite. Fix some vertex of such that the 3-block contains the vertex of . We denote by the set of self-avoiding walks in starting at .
We extend the label function on to 3-blocks of . Labels of non-virtual edges are inherited from and for any label by and by . The extended label function will be again denoted by and maps into , where is the label alphabet of and .
We will now present a grammar generating the language of self-avoiding walks in starting at . The finite set of variables is given by
[TABLE]
Let and . Then for any directed edge in starting at the productions given below are contained in .
[TABLE]
The set of productions is completed by adding for every edge starting at the following rules.
[TABLE]
We will now briefly discuss why the given grammar unambiguously generates the language of self-avoiding walks in starting at .
Let be an edge of and be the component of containing . A self-avoiding walk of length at least 1 in is called a V-walk with direction if it starts at a vertex and all edges of the walk are contained in . A U-walk with direction is a V-walk with direction also ending in .
Then the following statements hold.
- (a)
For and , the variable unambiguously generates the language of all U-walks with direction starting at . 2. (b)
For and , the variable unambiguously generates the language of all V-walks with direction starting at , and the variable unambiguously generates the language of all V-walks with direction starting at and not containing the second vertex of . 3. (c)
The start symbol unambiguously generates the language .
Rigorous proofs for these statements are long and technical, so we only sketch them here.
Let be a vertex of and be its corresponding 3-block. Define the projection of a self-avoiding walk onto in the following way: Let be the sequence of vertices of which are contained in ordered by their occurrence in and be the subwalk of connecting and . For every there are 2 cases: If is a single edge of , add this edge to . Otherwise there is an edge of such that is a U-walk with direction and we add the virtual edge connecting and . This edge can be seen as a shortcut for the U-walk . For reasons of ambiguity, if is a U-walk and ends at , we do not add the corresponding virtual edge and consider our projection to end at . The resulting is a self-avoiding walk in .
For every U-walk with direction starting at we can obtain the word corresponding to the projection of onto the 3-block from the variable in a single derivation step. A simple induction shows that is generated by . Moreover for any word generated by , the walk starting at and having label is a U-walk with direction starting at . The word can only be obtained from a unique sequence of rightmost derivations because in every step we have to generate the string corresponding to the projection of onto some 3-block. Statement (a) follows.
For any V-walk with direction starting at we can derive in a single step of derivation from and if does not contain the second vertex of , also from . Using (a) and induction, this implies that is generated by the corresponding variables. Furthermore note that only appears in the right hand side of productions, if the second vertex of is not contained in the projection of on any block previously visited by . Using this observation it is not hard to show that walks corresponding to words generated by and are indeed V-walks with direction a starting at and that every such walk is generated unambiguously. We obtain (b).
Finally, for every SAW starting at we can derive in a single step from and (c) follows from (a) and (b) as before. ∎
We are now able to prove the main result in this section. This result together with Theorem 4.1 then implies Theorem 1.1.
Proof of Theorem 5.1.
The group acts quasi-transitively on the block-cutvertex tree . Let denote the finite, connected factor graph. Every vertex of corresponds to either a class of blocks or a class of cutvertices of under the action of , so for we can again write (block) and (cutvertex) for the two vertices of incident to . Note that for any cutvertex of , all edges of incident to lie in different orbits with respect to because by Lemma 2.1, acts fixed-point-freely on . For let . Fix some block containing the vertex .
Let be the regular language generated by the grammar , where
[TABLE]
and the set of productions consists of
[TABLE]
For let be the language of self-avoiding walks of length at least 1 in the block starting at a fixed representative of the vertex , seen as a class of vertices of . Clearly, does not depend on the choice of . By Lemma 5.2 the stabilizer acts quasi-transitively on the graph and by assumption on all ends of have size at most 2. Hence Lemma 5.6 applies and is context-free. Denote for by the subset of corresponding to walks ending at vertices of , which are in the vertex class . Note that may be an empty language if or if and is the only representative of in . As the intersection of the unambiguous context-free language and the regular language of all walks starting at and ending at a representative of , is unambiguous context-free. In a similar way let be the language of all walks in starting at and be the subset of corresponding to walks ending at a representative of .
Let be the substitution of languages given for by
[TABLE]
Then by [19, Thm. 3.4.1] the result of the substitution is context-free.
If every end of is of size at most 1, by Lemma 5.3 every block of is finite and thus also the language of self-avoiding walks in the block is finite. We conclude that in this case is regular.
For , a self-avoiding walk of length at least 1 in is called a W-walk with direction if it starts at and its first edge lies in the block . Then the following statements hold:
- (a)
The variable generates a regular language such that is the language of W-walks with direction . 2. (b)
is the unambiguous context-free language of self-avoiding walks in starting at .
As before we will only sketch the proofs for these statements. For we denote by the orbit of under , which is an edge of .
Let be a W-walk with direction . Let be the part of contained in the block . If , then is contained in and therefore obtained in a single step of derivation. Otherwise there is some such that leaves via and is contained in . In this case enters one of the other blocks containing , which are blocks , , such that . The rest of is a W-walk with direction . A simple induction shows that is contained in .
On the other hand, every word corresponds to a unique walk starting at labelled by . This walk is self-avoiding, because the parts of contained in blocks are self-avoiding, and whenever leaving a block , , can never enter again because is a tree and does not contain . This implies that is a W-walk with direction and proves (a).
In the same way it can be seen that for any , the label of the part contained in is contained in if and in , if leaves via a vertex in the class . Therefore is contained in . In the same way as in (a) we obtain that every walk starting in and corresponding to a word in is self-avoiding.
Let and be the SAW starting at and having label . There is a unique word such that , which is given by the sequence of blocks visited by . Furthermore, for every the language is unambiguous context-free, hence is also unambiguous context-free. This shows statement (b) and finishes the proof. ∎
6. Discussion and examples
The proof of Theorem 5.1 is constructive and the obtained grammar can be used to calculate the generating function of self-avoiding walks and the connective constant of graphs satisfying the conditions of the theorem.
Given some language , the ordinary generating function is the power series
[TABLE]
Using the algebraic theory of context-free languages Chomsky and Schützenberger developed in [7], the productions of an unambiguous context-free grammar generating the language can be translated into a system of polynomial equations having as one of its solutions the language generating function .
Recall that a power series is called algebraic over a field , if it satisfies a polynomial equation of the form P\bigl{(}t,F(t)\bigr{)}=0, where is a bivariate polynomial in . From classical elimination theory it follows that any component of a solution of a system of polynomial equations having coefficients in is algebraic over , in particular is algebraic over .
Let be a connected, locally finite, deterministically edge labelled graph and be a vertex of such that the language is unambiguous context-free. Then the label function acts as a bijection between the set of self-avoiding walks starting at and its language , whence the SAW-generating function satisfies . All singularities of algebraic functions are algebraic numbers, so in particular the radius of convergence of and thus also the connective constant of the graph are algebraic numbers.
Note that there are quasi-transitive graphs which do not admit any deterministic labelling such that acts quasi-transitively on . A simple example is the Grandparent Graph , given by Trofimov in [31] as a graph with a non-unimodular automorphism group . As a consequence, every subgroup of acting quasi-transitively on cannot act fixed point freely and by Lemma 2.1, admits no labelling as above. Nevertheless, the following statement can be shown using the previous discussions and the ideas and proofs of Section 5, but generating functions counting walks instead of grammars and language theory.
Corollary 6.1**.**
Whenever all ends of a connected, locally finite, quasi-transitive graph are of size at most 2, the SAW-generating function is algebraic over . In particular the connective constant is an algebraic number.
Alm and Janson showed in [1] that the generating functions of self-avoiding walks on one-dimensional lattices are algebraic over , independently of the size of the ends. In future work, we shall examine how our results can be extended to quasi-transitive graphs having only thin ends. A further issue for future work is to investigate under which structural conditions on the graph, is accepted by a multipass automaton as in [4].
Some concrete examples, where we used Theorem 5.1 and its constructive proof to obtain the SAW-generating functions and the connective constants, include the “sandwich” of two -regular trees, which was already treated in a slightly different way in [23], the above mentioned Grandparent Graph and the Cayley graph of the group with respect to the generators , which has non-transitive blocks.
Finally we provide the following example, where we end up with an algebraic SAW-generating function, which is not rational.
Example 6.2**.**
Consider the group , which is a free product with amalgamation of the dihedral groups and and let be the Cayley graph . Edges are labelled by the generators in the usual way. The resulting graph can be seen in Figure 3. All ends of have size 2, so the language of self-avoiding walks in is context-free. The 3-block-decomposition of can be seen in Figure 3, where virtual edges are dashed. It yields 3 types of 3-blocks, which will be denoted by , and in correspondence to the labels they contain. contains 4 types of edges, they will be denoted by , , , , depending on the pair of 3-blocks they connect (e.g. starts at and ends at ).
A grammar generating the languages of self-avoiding walks constructed as in the proof of Lemma 5.6 is given by the following productions:
[TABLE]
Translating this set of productions into the corresponding system of equations and solving this system yields the SAW-generating function
[TABLE]
where and are two polynomials of degree 23 and 17, respectively. The connective constant of is the reciprocal of the smallest positive root of the denominator of :
[TABLE]
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Alm, S. E., and Janson, S.: Random self-avoiding walks on one-dimensional lattices, Comm. Statist. Stochastic Models 6 (1990) 169–212.
- 2[2] Anisimov, A. V.: Group languages, Kibernetika 4 (1971) 18–24.
- 3[3] Bauerschmidt, R., Duminil-Copin, H., Goodman, J., and Slade, G.: Lectures on self-avoiding walks. In Probability and Statistical Physics in Two and more Dimensions, 395–467, Clay Math. Proc. 15 , Amer. Math. Soc., Providence, RI, 2012.
- 4[4] Ceccherini-Silberstein, T., Coornaert, M., Fiorenzi, F., Schupp, P. E., and Touikan, N. W. M.: Multipass automata and group word problems, Theoret. Comput. Sci. 600 (2015) 19–33.
- 5[5] Ceccherini-Silberstein, T., and Woess, W.: Growth and ergodicity of context-free languages , Trans. Amer. Math. Soc. 354 (2002) 4597–4625.
- 6[6] Ceccherini-Silberstein, T., and Woess, W.: Context-free pairs of groups I: Context-free pairs and graphs, European J. Combin. 33 (2012) 1449–1466.
- 7[7] Chomsky, N. and Schützenberger, M.-P.: The algebraic theory of context-free languages, in Computer Programming and Formal Systems 26 (P. Braffort abd D. Hirschberg, eds.) North-Holland, Amsterdam, 1963, pp. 118–161.
- 8[8] Dicks, W., and Dunwoody, M. J.: Groups acting on graphs, Cambridge Studies in Advanced Mathematics 17 . Cambridge Univ. Press, Cambridge, 1989.
