The word problem of the Brin-Thompson group is coNP-complete

J.C. Birget

arXiv:1902.03852·math.GR·February 12, 2020

The word problem of the Brin-Thompson group is coNP-complete

J.C. Birget

PDF

Open Access

TL;DR

This paper establishes that the word problem for the Brin-Thompson groups nV and the Thompson group V is coNP-complete, highlighting the computational complexity of these algebraic problems.

Contribution

It proves coNP-completeness of the word problem for nV groups for all n ≥ 2 and for Thompson group V over a specific generator set, advancing understanding of their computational complexity.

Findings

01

Word problem of nV is coNP-complete for all n ≥ 2

02

Word problem of Thompson group V over certain generators is coNP-complete

03

Highlights computational complexity of these algebraic problems

Abstract

We prove that the word problem of the Brin-Thompson group nV over a finite generating set is coNP-complete for every n \ge 2. It is known that the groups nV are an infinite family of infinite, finitely presented, simple groups. We also prove that the word problem of the Thompson group V over a certain infinite set of generators, related to boolean circuits, is coNP-complete.

Equations2

g\ =\ \left[\begin{array}[]{ccc}x_{1}&\ldots&x_{n}\\ y_{1}&\ldots&y_{n}\end{array}\right]\ \ \longmapsto\ \ \theta(g)\ =\ \left[\begin{array}[]{c ccc}1&0x_{1}&\ldots&0x_{n}\\ 1&0y_{1}&\ldots&0y_{n}\end{array}\right].

g\ =\ \left[\begin{array}[]{ccc}x_{1}&\ldots&x_{n}\\ y_{1}&\ldots&y_{n}\end{array}\right]\ \ \longmapsto\ \ \theta(g)\ =\ \left[\begin{array}[]{c ccc}1&0x_{1}&\ldots&0x_{n}\\ 1&0y_{1}&\ldots&0y_{n}\end{array}\right].

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

Topicssemigroups and automata theory · Geometric and Algebraic Topology · Computability, Logic, AI Algorithms

Full text

The word problem of the Brin-Thompson group is coNP-complete

J.C. Birget

( 10.ii.2020)

Abstract

We prove that the word problem of the Brin-Thompson group $nV$ over a finite generating set is coNP-complete for every $n\geq 2$ . It is known that $\{nV:n\geq 1\}$ is an infinite family of infinite, finitely presented, simple groups. We also prove that the word problem of the Thompson group $V$ over a certain infinite set of generators, related to boolean circuits, is coNP-complete.

1 Introduction

The group $nV$ was introduced by Brin [14] as an $n$ -dimensional generalization of Richard Thompson’s group $V$ , for any positive integer $n$ (with $\,1V=V$ ).

Brin proved that $2V$ is finitely generated and simple, that $V$ is not isomorphic to $2V$ [14], that $2V$ is finitely presented [15], and that all $nV$ are simple [16]. Hennig and Mattucci [24] show that all $nV$ are finitely presented. Bleak and Lanoue [11] show that all $nV$ are non-isomorphic. In short, the groups $nV$ are an infinite family of infinite, finitely presented, simple groups.

The word problem of $nV$ is decidable, as is easy to see from the definition of $nV$ . The main result of the present paper is the following.

Theorem 1.1

The word problem of $nV$ over any finite generating set is coNP-complete, for all $n\geq 2$ .

Remarks on the theorem:

This is only the second example of a finitely presented group with coNP-complete word problem; the first example appeared in [6]. This is also the first “naturally occurring” example of a finitely presented group with either NP-complete or coNP-complete word problem. The proof of Theorem 1.1 strengthens the connection between acyclic circuits and finite group presentations; such a connection already played a crucial role in [6].

The Theorem implies that if ${\sf NP}\neq{\sf coNP}$ then the Dehn function of $nV$ (for $n\geq 2$ ) has no polynomial upper bound; more strongly, $nV$ cannot be embedded into a finitely presented group with polynomially bounded Dehn function (by [39, 4]).

The Theorem implies that if ${\sf P}\neq{\sf NP}$ then $2V$ is not embeddable into $V$ . It was proved recently [33, Coroll. 11.20] that $(n+1)V$ does not embed into $nV$ for any $n\geq 1$ .

The groups $nV$ for $n\geq 2$ are the first examples of finitely presented simple groups whose word problem is harder than P (if ${\sf P}\neq{\sf NP}$ ).111 The Higman-Thompson groups $G_{k,s}$ have their word problem in P (in fact in coCFL, by Lehnert and Schweitzer [30]). For other currently known finitely presented infinite simple groups (Meier [35, 36], Röver [38], Burger and Mozes [17], Lodha [31]), the complexity of the word problem has not been studied, but appears to be in P. Finitely presented infinite simple groups are related to the Boone-Higman theorem [13]. In [13] the authors ask whether their theorem can be strengthened as follows: Does a finitely generated group $G$ have a decidable word problem iff $G$ is embeddable into a finitely presented simple group? In contrast, it was observed in [6, Section 1] that all known finitely presented simple groups have a word problem of very low complexity; even coNP is a low complexity class on the scale of all decidable problems. The enormous gap between what is asked, and what has been observed so far motivates the following.

Question: Are the computational complexities of the word problems of all finitely presented simple groups unbounded?

More precisely, the negation of the question is: Is there a time-constructible total function $t$ such that the word problems of all finitely presented simple groups belong to ${\sf DTime}(t)$ ? (See e.g. [26] for the definitions of “time-constructible” and “ ${\sf DTime}(t)$ ”.) In case of a negative answer, the Boone-Higman question also has a negative answer. If the answer is positive then there is a chance that the Boone-Higman question has a positive answer; in that case, the proof of the answer to the Question above might be easier than the proof of a strengthened Boone-Higman theorem, and could be a useful step along the way.

Overview: In section 2 we define the Higman-Thompson groups $G_{k,1}$ and the Brin-Thompson groups $nV$ and $nG_{k,1}$ by (partial) actions on finite strings, or $n$ -tuples of strings. For this, the concept of prefix code of strings is generalized to the concept of joinless code of $n$ -tuples of strings. For the study of the computational complexity of the word problem, the string-based formalism is more convenient than the geometric approach. It follows fairly directly that the word problem of $nV$ over a finite generating set belongs to coNP (section 3).

The proof of coNP-hardness is given in section 4. It goes through several steps, following the same strategy as the first half of [6] (where it was proved that a certain subgroup of $G_{3,1}$ , over a certain infinite generating set, has a coNP-complete word problem. Based on this we show that the Thompson group $V$ , over a certain infinite generating set, has a coNP-complete word problem. This infinite generating set of $V$ consists of a finite generating set, together with all the bit-position transpositions $\tau_{i,i+1}$ (where $\,\tau_{i,i+1}:$ $\,x_{1}\,\ldots\,x_{i-1}\,x_{i}\,x_{i+1}\,x_{i+2}\,\ldots\,$ $\longmapsto$ $\,x_{1}\,\ldots\,x_{i-1}\,x_{i+1}\,x_{i}\,x_{i+2}\,\ldots\$ ). An alternative approach, based on bijective circuits and the work of Jordan [27], is described in subsection 4.5. Finally, we show that $\tau_{i,i+1}$ can be expressed by $\tau_{1,2}$ and the shift $\sigma$ . This reduces the word problem of $V$ , over an infinite generating set that includes position transpositions, to the word problem of $2V$ over a finite generating set (subsection 4.6).

Summary of abbreviations and notations:

– The word function in this paper means partial function. The domain of a function $f:X\to Y$ is denoted by ${\rm Dom}(f)\subseteq X$ , and the image by ${\rm Im}(f)\subseteq Y$ . Most often, the sets $X$ and $Y$ will be free monoids $A^{*}$ , or Cantor spaces $A^{\omega}$ , or their direct powers $nA^{*}$ or $nA^{\omega}$ .

– $A^{*}$ , the free monoid freely generated by $A$ , a.k.a. the set of all strings over $A$ ;

– $\varepsilon$ , the empty string;

– $A^{+}$ , the free semigroup; $A^{+}=A^{*}\smallsetminus\{\varepsilon\}$ ;

– $|x|$ , the length of the string $x\in A^{*}$ ;

– $x\leq_{\rm pref}y$ , $x$ ( $\in A^{*}$ ) is a prefix of $y$ ( $\in A^{*}\cup A^{\omega}$ );

– $x\parallel_{\rm pref}y$ , $x$ is prefix-comparable with $y$ ;

– $nA^{*}$ , $nA^{\omega}$ , the $n$ -fold cartesian product X ${}_{{}_{i=1}}^{{}^{n}}A^{*}$ , respectively X ${}_{{}_{i=1}}^{{}^{n}}A^{\omega}$ ;

– $(\varepsilon)^{n}$ , the $n$ -tuple of empty strings;

– $A_{\varepsilon,n}=\,\bigcup_{1\leq i\leq n}$ $\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i}$ , the unique minimum generating set of the monoid $nA^{*}$ ;

– $\ell(x)$ , $\max\{|x_{1}|,\,\ldots,|x_{n}|\}\$ if $\,x=(x_{1},\,\ldots,x_{n})$ $\in nA^{*}$ ;

– $x\leq_{\rm init}y$ , $x$ ( $\in nA^{*}$ ) is an initial factor of $y$ ( $\in nA^{*}\cup nA^{\omega}$ );

– dag, directed acyclic graph;

– $f|_{M}$ , the restriction of a function $f$ to a set $M$ .

2 Definition of $nV$ based on strings

The standard definitions in computational complexity require strings as inputs. Brin’s original definition of $nV$ uses geometric actions, but for the proof of coNP-completeness of the word problem of $nV$ we also need a (partial) action of $nV$ on $n$ -tuples of strings. The groups $nV$ are generalizations of $V$ . We first look at $V.$

2.1 Definition of $V$ based on strings

The group $V$ can be defined in many ways; see e.g. [44, 34, 45, 25, 41, 19]. We will mostly use two definitions of $V$ from [5] (which are is similar to [41], except that we use the terminology of prefix codes, right ideals, and right-ideal morphisms).

We recall some standard notions. An alphabet is any finite set, although we mostly use $\{0,1\}\,$ (the bits), and $\{0,1,\,\ldots,k-1\}\,$ for any integer $k\geq 2$ . For an alphabet $A$ and $m\in{\mathbb{N}}$ , $A^{m}$ denotes the set of sequences of length $m$ over $A$ (called set of strings of length $m$ ), and for $x\in A^{m}$ we say that $|x|=m$ (i.e., the length of $x$ is $m$ ); $A^{\leq m}$ is the set of strings of length $\leq m$ . The empty string is denoted by $\varepsilon$ , and $|\varepsilon|=0$ . The set of all strings over $A$ is denoted by $A^{*}$ , and the set of all infinite strings indexed by the ordinal $\omega$ is denoted by $A^{\omega}$ . By default a “string” is finite; for infinite strings we explicitly say “infinite”. For $x_{1},x_{2}\in A^{*}$ the concatenation is denoted by $x_{1}x_{2}$ or $x_{1}\cdot x_{2}$ ; it has length $|x_{1}|+|x_{2}|$ . For two subsets $S_{1},S_{2}\subseteq A^{*}$ , we define the concatenation by $\,S_{1}\cdot S_{2}=\{x_{1}\cdot x_{2}:$ $x_{1}\in S_{1}$ and $x_{2}\in S_{2}\}$ .

For $x,p\in A^{*}$ we say that $p$ is a prefix of $x$ iff $(\exists u\in A^{*})\,x=pu$ ; this is denoted by $p\leq_{\rm pref}x$ . Two strings $x,y\in A^{*}$ are called prefix-comparable (denoted by $x\parallel_{\rm pref}y\,$ ) iff $\,x\leq_{\rm pref}y\,$ or $\,y\leq_{\rm pref}x$ . A prefix code (a.k.a. a prefix-free set) is any subset $P\subset A^{*}$ such that for all $p_{1},p_{2}\in P$ : $\,p_{1}\parallel_{\rm pref}p_{2}\,$ implies $p_{1}=p_{2}$ . A right ideal of $A^{*}$ is, by definition, any subset $R\subseteq A^{*}$ such that $R=R\cdot A^{*}$ . A subset $C\subseteq R$ is said to generate $R$ as a right ideal iff $R=C\cdot A^{*}$ . It is easy to prove that every finitely generated right ideal is generated by a unique finite prefix code, and this prefix code is the minimum generating set of the right ideal (with respect to $\subseteq$ ). By definition, a maximal prefix code is a prefix code $P\subset A^{*}$ that is not a strict subset of any other prefix code of $A^{*}$ . An essential right ideal is, by definition, a right ideal $R\subseteq A^{*}$ such that all right ideals of $A^{*}$ intersect $R$ (i.e., have a non- $\varnothing$ intersection with $R$ ). It is well known (see e.g. [5, Lemma 8.1]) that a right ideal $R\subseteq A^{*}$ is essential iff the unique prefix code that generates $R$ is maximal.

A right ideal morphism of $A^{*}$ is, by definition, a function $f:A^{*}\to A^{*}$ such that for all $x\in{\rm Dom}(f)$ and all $w\in A^{*}$ : $f(xw)=f(x)\ w$ . In that case, ${\rm Dom}(f)$ is a right ideal; one easily proves that ${\rm Im}(f)$ is also a right ideal. The prefix code that generates ${\rm Dom}(f)$ is denoted by ${\rm domC}(f)$ , and is called the domain code of $f$ ; the prefix code that generates ${\rm Im}(f)$ is denoted by ${\rm imC}(f)$ , and is called the image code. We are interested in the following monoid:

${\cal RI}_{A}^{\sf fin}$ $\,=\,$ $\{f:f$ is a right ideal morphism of $A^{*}$ such that $f$ is injective, and

${\rm domC}(f)$ and ${\rm imC}(f)$ are finite maximal prefix codes}.

We usually write ${\cal RI}^{\sf fin}$ for ${\cal RI}_{A}^{\sf fin}$ since we usually just deal with one alphabet $A$ at a time. It is proved in [5, Prop. 2.1] that every $f\in{\cal RI}^{\sf fin}$ is contained in a unique $\subseteq$ -maximum right ideal morphism in ${\cal RI}^{\sf fin}$ ; this is called the maximum extension of $f$ . The Higman-Thompson group $G_{k,1}$ (where $k=|A|$ ) is a homomorphic image of ${\cal RI}^{\sf fin}$ :

Definition 2.1

(Thompson group $V$ and Higman-Thompson groups $G_{k,1}$ ).* The Thompson group $V$ , as a set, consists of the right ideal morphisms $f\in{\cal RI}_{\{0,1\}}^{\sf fin}$ that are maximum extensions in ${\cal RI}_{\{0,1\}}^{\sf fin}$ . The multiplication in $V$ consists of composition, followed by maximum extension.*

The same definition for ${\cal RI}_{A}^{\sf fin}$ with $A=\{0,1,\,\ldots,k-1\}$ yields the Higman-Thompson group $G_{k,1}\,$ for every $k\geq 2$ ; $\,V=G_{2,1}$ .

Every element $f\in{\cal RI}^{\sf fin}$ (and in particular, every $f\in G_{k,1}$ ) is determined by the restriction of $f$ to ${\rm domC}(f)$ . This restriction $\,f_{{\rm domC}(f)}:{\rm domC}(f)\to{\rm imC}(f)\,$ is a finite bijection, called the table of $f$ [25]. Obviously, $f$ ( $\in{\cal RI}^{\sf fin}$ ) determines ${\rm domC}(f)$ and hence a unique table. When we use tables we do not always assume that $f$ is a maximum extension. The well known tree representation of $G_{k,1}$ is obtained by using the prefix trees of ${\rm domC}(f)$ and ${\rm imC}(f)$ .

Lemma 2.2

Let $P,Q\subset A^{*}$ be finite maximal prefix codes. The right ideal morphism $f\in{\cal RI}^{\sf fin}$ determined by a table $F$ : $P\to Q$ can be extended iff there exist $s,t\in A^{*}$ such that for every $\alpha\in A$ : $s\alpha\in P$ , $t\alpha\in Q$ , and $\,F(s\alpha)=t\alpha$ .

In that case, $f$ can be extended by defining $\,f(s)=t$ . The table for this extension is obtained be replacing $\,P\,$ by $\,(P\smallsetminus sA)\cup\{s\}$ , $\,Q\,$ by $\,(Q\smallsetminus qA)\cup\{q\}$ , and $\,\{(s\alpha,t\alpha):\alpha\in A\}\,$ by $\,\{(s,t)\}$ .

This is called an extension step of the table $F$ .

Proof. See [5, Lemma 2.2] and [25]. $\Box$

Since in an extension step the cardinality of ${\rm domC}(f)$ decreases, only finitely steps are needed to reach the maximum extension of $f$ ; the number of steps is $\,<|{\rm domC}(f)|$ .

Based on the representation of the elements of $V$ (and of $G_{k,1}$ ) by tables, one can show easily that the word problem of these groups is in P. A much stronger result is that the word problem is in coCFL (the set of languages whose complement is context-free) [30]; coCFL is a strict subclass of the parallel complexity class ${\sf AC}^{1}$ , which is a subclass of P (see e.g., [23]).

The $A^{\omega}$ definition of $G_{k,1}$ : Maximality of finite prefix codes has the following characterization in terms of $A^{\omega}$ . A finite prefix code $P\subset A^{*}$ is maximal iff $\,PA^{\omega}=A^{\omega}$ . (This is not true for infinite prefix codes; a counter example is $\,0^{*}1$ .)

It follows that every element $f\in G_{k,1}$ determines a permutation of $A^{\omega}$ . Conversely, let $P\subset A^{*}$ be a finite maximal prefix code. Then for every $w\in A^{\omega}$ there exists a unique $p\in P$ and $v\in A^{\omega}$ such that $w=pv$ . Let $f$ be a permutation of $A^{\omega}$ for which there exists a table $F$ : $P\to Q$ such $f$ is defined by $\,f(pv)=F(p)\ v$ (for every $p\in P$ and $v\in A^{\omega}$ ). Then $f\in G_{k,1}$ .

Thus, $G_{k,1}$ can be defined as a certain group of permutations of $A^{\omega}$ .

Lemma 2.3

Let $F_{1}$ : $P_{1}\to Q_{1}$ and $F_{2}$ : $P_{2}\to Q_{2}$ be two tables that determine, respectively, the right ideal morphisms $f_{1},f_{2}\in$ ${\cal RI}^{\sf fin}$ . Then the following are equivalent:

(1)* $F_{1}$ and $F_{2}$ determine the same element of $G_{k,1}$ (by maximum extension); *

(2)* $f_{1}$ and $f_{2}$ have the same maximum extension in ${\cal RI}^{\sf fin}$ ; *

(3)* $f_{1}$ and $f_{2}$ have a common restriction in ${\cal RI}^{\sf fin}$ ; *

(4)* $f_{1}$ and $f_{2}$ have a common restriction to an essential right ideal of $A^{*}$ ; *

(5)* $F_{1}$ and $F_{2}$ determine the same function on $A^{\omega}$ ; *

(6)* $f_{1}$ and $f_{2}$ determine the same function on $A^{\omega}$ .*

Proof. (1) and (2) are equivalent by the definition of $G_{k,1}$ . (2) implies (3) (which implies (4)): The intersection $f_{1}\cap f_{2}$ is a common restriction; by [5, Lemma 8.3], ${\rm Dom}(f_{1})\cap{\rm Dom}(f_{2})$ is an essential right ideal. Moreover, ${\rm domC}(f_{1}\cap f_{2})\subset{\rm domC}(f_{1})\cup{\rm domC}(f_{2})$ ; hence ${\rm domC}(f_{1}\cap f_{2})$ is finite. (4) implies (2) by uniqueness of maximum extensions in ${\cal RI}^{\sf fin}$ (see [5, Lemma 2.1], which does not require finiteness of prefix codes). (3) implies (5) in an obvious way. And (5) implies (1), based on finiteness and uniqueness of maximum extension. (5) and (6) are obviously equivalent. $\Box$

The piecewise linear definition of $V$ : Brin’s definition of $nV$ extends the definition of $V$ as given in [19]; the latter is based on piecewise linear actions on the interval $[0,1]$ $\subset$ ${\mathbb{R}}$ . We use half-open intervals, so neighboring intervals do not intersect; however, when the right boundary is 1, we use “ $1]$ ”. The boundary-points of the subintervals that appear are binary rational numbers (i.e., the denominator is a power of 2). A string $s=s_{1}\ldots s_{m}\in\{0,1\}^{*}$ with $m=|s|$ determines the half-open subinterval $\,[0.s,\ 0.s+2^{-|s|}[\,$ ; but if $s+2^{-|s|}=1$ then we take $\,[0.s,\,1]$ , i.e., in that case we close the interval. Here, $0.s$ is a rational number written in fractional binary representation; i.e., $0.s=\sum_{i=1}^{m}s_{i}\,2^{-i}$ . E.g., 01100 (of length 5) determines the subinterval $\,[0.011,\ 0.011+2^{-5}[$ $\,=\,$ $[0.011,\ 0.01101[\,$ .

2.2 Right ideals of $nA^{*}$

Here we completely develop the string description of $nV$ , which is briefly alluded to in [14, subsection 4.3]. A hybrid string-geometric description was used in [11] (where some crucial concepts appear only in geometric form). Our description is entirely based on strings, but the correspondence with geometric concepts is often pointed out. The present subsection focuses on finitely generated right ideals of $nA^{*}$ ; in the next subsection, $nV$ will be defined based on right-ideal morphisms of $nA^{*}$ .

As before, let $A$ be a finite alphabet of cardinality $k\geq 1$ , usually denoted by $\{0,\,\ldots,k-1\}$ or $\{a_{0},\,\ldots,a_{k-1}\}$ . The $n$ -fold cartesian product X ${}_{{}_{i=1}}^{{}^{n}}A^{*}\,$ will be denoted by $nA^{*}$ ; we choose this notation in analogy with the notation $nV$ , and also in order to avoid confusion with $n$ -fold concatenation (of the form $\,S^{n}=\{s_{1}\cdot\ldots\cdot s_{n}:\,s_{1},\ldots,s_{n}\in S\}$ $\subseteq A^{*}$ ). Similarly, $nA^{\omega}$ denotes the $n$ -fold cartesian product X ${}_{{}_{i=1}}^{{}^{n}}A^{\omega}$ . Multiplication in $\,nA^{*}$ is done coordinatewise, i.e., $nA^{*}$ is the direct product of $n$ copies of the free monoid $A^{*}$ . For $u\in nA^{*}$ we denote the coordinates of $u$ by $u_{i}\in A^{*}$ , for $1\leq i\leq n$ ; i.e., $u=(u_{1},\,\ldots,u_{n})$ .

Geometrically: $x=(x_{1},\ldots,x_{n})\in n\{0,1\}^{*}$ represents the hyperrectangle X ${}_{{}_{i=1}}^{{}^{n}}[0.x_{i},\ 0.x_{i}+2^{-|x_{i}|}[\,$ (except that “ $0.x_{i}+2^{-|x_{i}|}[$ ” is replaced by “ $1]$ ” if $\,0.x_{i}+2^{-|x_{i}|}=1$ ). The measure of this hyperrectangle is $2^{-(|x_{1}|\,+\ \ldots\ +\,|x_{n}|)}$ . In particular, $(\varepsilon)^{n}$ represents $[0,1]^{n}$ and has measure 1.

The concept of prefix is similar to the one in $A^{*}$ , but in order to avoid confusion we will use the phrase “initial factor”. So the initial factor order on $nA^{*}$ is defined for $u,v\in nA^{*}$ by $u\leq_{\rm init}v$ iff there exists $x\in nA^{*}$ such that $ux=v$ . In a similar way we have the concepts of comparability (denoted by $\|_{\rm init}$ ), right ideal, generating set of a right ideal, and essential right ideal. It is easy to prove that $\,u\leq_{\rm init}v$ in $nA^{*}$ iff $u_{i}\leq_{\rm pref}v_{i}$ for all $i=1,\,\ldots,n$ . For any $u,v\in nA^{*}$ there exists a unique $\leq_{\rm init}$ -maximum common initial factor, denoted by $u\wedge v$ . In terms of coordinates, $(u\wedge v)_{i}=u_{i}\wedge_{\rm pref}v_{i}$ , where $u_{i}\wedge_{\rm pref}v_{i}$ is the longest common prefix of the strings $u_{i}$ and $v_{i}$ .

An initial factor code is a set $S\subset nA^{*}$ such that no two different elements of $S$ are $\leq_{\rm init}$ -comparable.

As we shall see, a crucial way in which $nA^{*}$ with $n\geq 2$ differs from $A^{*}$ concerns the join operation with respect to $\,\leq_{\rm init}$ . For all $n$ , the join of $u,v\in nA^{*}$ is defined by $\ u\vee v$ $\,=\,$ $\min_{\leq_{\rm init}}\{z\in nA^{*}:$ $\,u\leq_{\rm init}z$ and $v\leq_{\rm init}z\}$ . Of course, $u\vee v$ does not always exist.

Definition 2.4

A set $S\subset nA^{*}$ is joinless iff no two elements of $S$ have a join with respect to $\,\leq_{\rm init}$ in $nA^{*}$ . Joinless sets will be called joinless codes, since they are necessarily initial factor codes.

A set $S\subset nA^{*}$ is a maximal joinless code iff $\,S$ is $\subseteq$ -maximal among the joinless codes of $nA^{*}$ . (In other words, adding a new element to a maximal joinless code $S$ results in a set, some of whose elements have joins.)

A right ideal $R\subseteq nA^{*}$ is called joinless generated iff $R$ is generated, as a right ideal, by a joinless code.

(About the grammar: “Joinlessly generated” would not make sense since it is not the generating process that is joinless.)

Examples: Not every initial factor code is joinless; e.g., $\{(\varepsilon,0),\ (0,\varepsilon)\}$ is an initial factor code where $\,(\varepsilon,0)\vee(0,\varepsilon)=(0,0)$ . An example of a maximal joinless code is $\,\{(\varepsilon,0),\ (0,1),\ (1,1)\}$ . A maximal joinless code is usually not maximal as an initial factor code; for example, in $\,\{(\varepsilon,0),\ (0,1),\ (1,1)\}\,$ one could add $(00,\varepsilon)$ ; the result would be a initial factor code (that is not joinless). The only maximal joinless code that is also maximal as an initial factor code is $\{(\varepsilon,\varepsilon)\}$ .

From here on, a joinless code will be called maximal if it is maximal as a joinless code.

Connection with the geometric description: For $u,v\in nA^{*}$ we have $v\leq_{\rm init}u$ iff the hyperrectangle $u$ is contained in the hyperrectangle $v\,$ (i.e., $\leq_{\rm init}$ corresponds to $\supseteq$ ); note that “shorter” $n$ -tuples correspond to “larger” hyperrectangles. The join $u\vee v$ represents the hyperrectangle obtained by intersecting the hyperrectangles $u$ and $v$ (so $\vee$ corresponds to $\cap$ ). Note that $u\vee v$ does not exist iff the intersection is the empty set (since the empty set is not a hyperrectangle). The meet $u\wedge v$ (which always exists) does not represent the union, nor the smallest hyperrectangle that contains $u$ and $v$ , but the smallest hyperrectangle representable by an $n$ -tuple in $nA^{*}$ that contains $u$ and $v$ . Joinlessness of a code means that any two hyperrectangles in the chosen subdivision of $[0,1]^{n}$ are disjoint as sets. A joinless code is maximal iff its hyperrectangles form a tiling of $[0,1]^{n}$ . In an initial factor code, $\leq_{\rm init}$ -incomparability means that no hyperrectangle in the code is contained in another one.

Examples (for the correspondence between strings and geometry): Fig. 1 shows a few elements of $\,2\,\{0,1\}^{*}$ . The large square $[0,1]\times[0,1]$ is represented by $(\varepsilon,\varepsilon)$ . The numbers use fractional binary representation; e.g., $0.1101=\frac{1}{2}+\frac{1}{4}+\frac{1}{16}$ .

$(0,00)\in 2\,\{0,1\}^{*}$ represents the rectangle $\,[0,\,0.1[\,\times\,[0,\,0.01[$ (horizontally hashed); and $(010,0)$ represents $\,[0.01,\,0.011[\,\times\,[0,\,0.1[$ (vertically hashed). The join $(010,00)=(0,\,00)\vee(010,\,0)\,$ represents $\,[0.01,\,0.011[\,\times\,[0,\,0.01[$ (doubly hashed). $(0,0)=(0,00)\,\wedge\,(010,0)\,$ represents $\,[0,0.1[\,\times\,[0,0.1[$ .

The rectangle $[0.1,\,0.11[\,\times\,[0.1101,\,0.111[$ is represented by $(10,1101)$ (horizontally hashed). And $[0.1,\,1]\times[0.111,\,0.1111]$ is represented by $(1,1110)$ (vertically hashed). Here, $(10,1101)\vee(1,1110)$ does not exist, and the meet $(10,1101)\wedge(1,1110)=(1,11)$ represents $[0.1,\,1]\times[0.11,\,1]$ .

\begin{picture}(110.0,60.0)\par\put(5.0,10.0){\framebox(40.0,40.0)[]{}} \put(5.0,10.0){\framebox(20.0,20.0)[]{}} \par\par\put(-2.0,0.0){\makebox(0.0,0.0)[cc]{\sf Fig. 1}} \par\put(5.0,10.0){\framebox(20.0,10.0)[]{}} \put(15.0,10.0){\framebox(5.0,20.0)[]{}} \par\put(25.0,42.5){\framebox(10.0,2.5)[]{}} \put(25.0,45.0){\framebox(20.0,2.5)[]{}} \par\put(25.0,40.0){\framebox(20.0,10.0)[]{}} \par \par\put(5.0,10.0){\line(1,0){20.0}}\put(5.0,10.5){\line(1,0){20.0}}\put(5.0,11.0){\line(1,0){20.0}}\put(5.0,11.5){\line(1,0){20.0}}\put(5.0,12.0){\line(1,0){20.0}}\put(5.0,12.5){\line(1,0){20.0}}\put(5.0,13.0){\line(1,0){20.0}}\put(5.0,13.5){\line(1,0){20.0}}\put(5.0,14.0){\line(1,0){20.0}}\put(5.0,14.5){\line(1,0){20.0}}\put(5.0,15.0){\line(1,0){20.0}}\put(5.0,15.5){\line(1,0){20.0}}\put(5.0,16.0){\line(1,0){20.0}}\put(5.0,16.5){\line(1,0){20.0}}\put(5.0,17.0){\line(1,0){20.0}}\put(5.0,17.5){\line(1,0){20.0}}\put(5.0,18.0){\line(1,0){20.0}}\put(5.0,18.5){\line(1,0){20.0}}\put(5.0,19.0){\line(1,0){20.0}}\put(5.0,19.5){\line(1,0){20.0}} \put(15.0,10.0){\line(0,1){20.0}}\put(15.5,10.0){\line(0,1){20.0}}\put(16.0,10.0){\line(0,1){20.0}}\put(16.5,10.0){\line(0,1){20.0}}\put(17.0,10.0){\line(0,1){20.0}}\put(17.5,10.0){\line(0,1){20.0}}\put(18.0,10.0){\line(0,1){20.0}}\put(18.5,10.0){\line(0,1){20.0}}\put(19.0,10.0){\line(0,1){20.0}}\put(19.5,10.0){\line(0,1){20.0}} \par\put(25.0,42.5){\line(1,0){10.0}}\put(25.0,43.0){\line(1,0){10.0}}\put(25.0,43.5){\line(1,0){10.0}}\put(25.0,44.0){\line(1,0){10.0}}\put(25.0,44.5){\line(1,0){10.0}} \put(25.0,45.0){\line(0,1){2.5}}\put(25.5,45.0){\line(0,1){2.5}}\put(26.0,45.0){\line(0,1){2.5}}\put(26.5,45.0){\line(0,1){2.5}}\put(27.0,45.0){\line(0,1){2.5}}\put(27.5,45.0){\line(0,1){2.5}}\put(28.0,45.0){\line(0,1){2.5}}\put(28.5,45.0){\line(0,1){2.5}}\put(29.0,45.0){\line(0,1){2.5}}\put(29.5,45.0){\line(0,1){2.5}}\put(30.0,45.0){\line(0,1){2.5}}\put(30.5,45.0){\line(0,1){2.5}}\put(31.0,45.0){\line(0,1){2.5}}\put(31.5,45.0){\line(0,1){2.5}}\put(32.0,45.0){\line(0,1){2.5}}\put(32.5,45.0){\line(0,1){2.5}}\put(33.0,45.0){\line(0,1){2.5}}\put(33.5,45.0){\line(0,1){2.5}}\put(34.0,45.0){\line(0,1){2.5}}\put(34.5,45.0){\line(0,1){2.5}}\put(35.0,45.0){\line(0,1){2.5}}\put(35.5,45.0){\line(0,1){2.5}}\put(36.0,45.0){\line(0,1){2.5}}\put(36.5,45.0){\line(0,1){2.5}}\put(37.0,45.0){\line(0,1){2.5}}\put(37.5,45.0){\line(0,1){2.5}}\put(38.0,45.0){\line(0,1){2.5}}\put(38.5,45.0){\line(0,1){2.5}}\put(39.0,45.0){\line(0,1){2.5}}\put(39.5,45.0){\line(0,1){2.5}}\put(40.0,45.0){\line(0,1){2.5}}\put(40.5,45.0){\line(0,1){2.5}}\put(41.0,45.0){\line(0,1){2.5}}\put(41.5,45.0){\line(0,1){2.5}}\put(42.0,45.0){\line(0,1){2.5}}\put(42.5,45.0){\line(0,1){2.5}}\put(43.0,45.0){\line(0,1){2.5}}\put(43.5,45.0){\line(0,1){2.5}}\put(44.0,45.0){\line(0,1){2.5}}\put(44.5,45.0){\line(0,1){2.5}} \par\end{picture}

Fig. 1

For $u,v\in A^{*}$ , $\,u\vee_{\rm pref}v$ exists in $A^{*}$ iff $u$ and $v$ have a common upper bound for $\leq_{\rm pref}$ . This holds iff $u\parallel_{\rm pref}v$ ; in that case, $u\vee_{\rm pref}v=u$ if $v\leq_{\rm pref}u$ , and $u\vee_{\rm pref}v=v$ if $u\leq_{\rm pref}v$ . Hence in $A^{*}$ , prefix codes are the same thing as joinless codes. This is not the case for $nA^{*}$ with $n\geq 2$ ; here, joinless codes are a special case of initial factor codes, and the join is characterized as follows:

Lemma 2.5

(join for $\leq_{\rm init}$ in $nA^{*}$ ).* For all $\,u=(u_{1},\,\ldots,u_{n}),\ v=(v_{1},\,\ldots,v_{n})$ $\in nA^{*}$ , the following are equivalent:*

(1)* the join $\,u\vee v\,$ (with respect to $\,\leq_{\rm init}$ ) exists; *

(2)* $u$ and $v$ have a common upper bound for $\,\leq_{\rm init}$ , i.e., $\,(\exists z)\,[\,u\leq_{\rm init}z$ $\,{\rm and}\,$ $v\leq_{\rm init}z\,]$ ; *

(3)* for all $i=1,\,\ldots,n$ : $u_{i}\parallel_{\rm pref}v_{i}\$ in $A^{*}$ .*

Moreover, if $\,u\vee v=((u\vee v)_{i}:i=1,\,\ldots,n)\,$ exists, then

$(u\vee v)_{i}\ =\ \left\{\begin{array}[]{ll}u_{i}&\ \ \ \mbox{if$ ,v_{i}\leq_{\rm pref}u_{i}, $(in$ A^{} $),}\\ v_{i}&\ \ \ \mbox{if$ ,u_{i}\leq_{\rm pref}v_{i}, $(in$ A^{} $).}\end{array}\right.$ **

In other words, if $u\vee v$ exists then $\,(u\vee v)_{i}\,=\,\max_{\leq_{\rm pref}}\{u_{i},v_{i}\}$ , and $\,|(u\vee v)_{i}|\,=\,\max\{|u_{i}|,|v_{i}|\}$ .

So, in $nA^{*}$ the relation $\,\|_{\rm init}\,$ is not equivalent to coordinatewise $\parallel_{\rm pref}$ ; the latter is equivalent to the existence of a join; $\,\|_{\rm init}\,$ implies (but is not equivalent to) existence of a join.

Proof. [(1) $\Rightarrow$ (2)] is obvious. [(2) $\Rightarrow$ (3)] is straightforward: If $u\leq_{\rm init}z$ and $v\leq_{\rm init}z$ for some $z\in nA^{*}$ then $ur=vt=z$ for some $s,t,z\in nA^{*}$ . Hence, $u_{i}s_{i}=v_{i}t_{i}=z_{i}$ , so $\,u_{i}\parallel_{\rm pref}v_{i}$ in $A^{*}$ .

[(3) $\Rightarrow$ (1)] Suppose $u_{i}\parallel_{\rm pref}v_{i}$ for all $i$ . Then $u_{i}\leq_{\rm pref}v_{i}$ for some $i$ , and $v_{i}\leq_{\rm pref}u_{i}$ for the other $i$ . Hence, $(u\vee v)_{i}=u_{i}$ if $v_{i}\leq_{\rm pref}u_{i}$ (in $A^{*}$ ), and $(u\vee v)_{i}=v_{i}$ otherwise; so $u\vee v$ exists. $\Box$

Notation 2.6

Let $A_{\varepsilon,n}\ =\$ $\bigcup_{1\leq i\leq n}$ $\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i}$ .

Note that $A_{\varepsilon,n}$ is the unique minimum generating set of $nA^{*}$ as a monoid; the cardinality is $\,|A_{\varepsilon,n}|=n\,|A|$ .

Lemma 2.7

.

(1)* Every right ideal $R\subseteq nA^{*}$ is generated, as a right ideal, by a unique initial factor code. (Finiteness of generating sets is not assumed here.)*

(2)* If a right ideal $R\subseteq nA^{*}$ is generated by a joinless code then the unique initial factor code that generates $R$ is joinless.*

Proof. Let $P\ =\ R\ \smallsetminus\ R\cdot A_{\varepsilon,n}$ . We claim that $P$ is an initial factor code that generates $R$ , and that $P$ is the unique such initial factor code. (We closely follow the proof of [8, Lemma 8.1(1)].)

Let us show that $P$ generates $R$ . Obviously, since $P\subset R$ , we have $P\,(nA^{*})\subseteq R\,(nA^{*})=R$ . Conversely, to show that $R\subseteq P\,(nA^{*})$ , consider any $r\in R$ . In $nA^{*}$ , $\,r$ has only finitely many initial factors, hence there exists a (not necessarily unique) $p\in R$ which is an initial factor of $r$ and is $\leq_{\rm init}$ -minimal in $R$ . So $r=px$ for some $x\in nA^{*}$ . And $p\not\in R\,A_{\varepsilon,n}$ , otherwise there would exist $p=r^{\prime}a$ for some $r^{\prime}\in R,a\in A_{\varepsilon,n}$ , which would contradict that $p$ is $\leq_{\rm init}$ -minimal in $R$ . Hence $p\in R\smallsetminus R\,A_{\varepsilon,n}$ .

To show that $P$ is an initial factor code, let $p,p^{\prime}\in P$ and suppose $p=p^{\prime}x$ for some $x\in nA^{*}$ . If $x\neq(\varepsilon)^{n}$ then $p\in RA_{\varepsilon,n}$ , contradicting the assumption that $p\in P$ ( $=R\smallsetminus RA_{\varepsilon,n}$ ). So, $p=p^{\prime}$ .

To prove uniqueness of the initial factor code that generates $R$ , we generalize the proof of [8, Lemma 8.1(1’)]. If $P_{1}\,(nA^{*})=P_{2}\,(nA^{*})$ for two initial factor codes $P_{1},P_{2}$ , then for every $p_{1}\in P_{1}$ there exists $p_{2}\in P_{2}$ such that $p_{1}=p_{2}x$ (for some $x\in nA^{*}$ ). Also, there is $p_{1}^{\prime}\in P_{1}$ such that $p_{2}=p_{1}^{\prime}y$ (for some $y\in nA^{*}$ ). Hence $p_{1}=p_{1}^{\prime}xy$ , which implies $x=y=(\varepsilon)^{n}$ , since $P_{1}$ is an initial factor code. Thus, $p_{1}=p_{2}\in P_{2}$ . Therefore, $P_{1}\subseteq P_{2}$ . Similarly we have $P_{2}\subseteq P_{1}$ , so $P_{1}=P_{2}$ .

Part (2) follows immediately from the uniqueness of the initial factor code that generates $R$ . $\Box$

Lemma 2.8

Let $P\subset nA^{*}$ be a finite maximal joinless code. Then every $w\in nA^{\omega}$ has a unique initial factor in $P$ ; i.e., $\,(\forall w\in nA^{\omega})(\exists!\,p\in P,$ $u\in nA^{\omega})\,[\,w=pu\,]$ .

Proof. If there were two different initial factors $p,q$ of $w$ in $P$ then $p$ and $q$ would be initial factors of a finite initial factor of $w$ ; hence $p$ and $q$ would have a join, contradicting that $P$ is joinless. This shows uniqueness.

Let us show existence. Since $P$ is a maximal joinless code, every initial factor $v$ of $w$ has a join with some element of $P$ . Let us pick $v$ so that its coordinates (in $A^{*}$ ) are longer than all the coordinates of the elements of $P$ . Then the element of $P$ that has a join with $v$ is an initial factor of $v$ . $\Box$

Lemma 2.9

Let $P\subset nA^{*}$ be any finite joinless code, and let $\,R=P\cdot(nA^{*})\,$ be the right ideal generated. (Recall that by Lemma 2.7, $\,P$ is uniquely determined by $R$ .) Then the following are equivalent:

(1)* $R$ is an essential right ideal;*

(2)* $P$ is maximal as a joinless code;*

(3)* $P\cdot(nA^{\omega})\,=\,nA^{\omega}$ ;*

(4)* $R\cdot(nA^{\omega})\,=\,nA^{\omega}$ .*

Proof. $[(1)\Leftrightarrow(2)]$ Suppose $P$ is a finite joinless code. Then $P$ is maximal joinless iff every $v\in nA^{*}$ has a join with some element of $P$ (as follows directly from the definition of maximality). This is equivalent to the property that every monogenic right-ideal of $nA^{*}$ intersects $P\,(nA^{*})$ ; i.e., $P\,(nA^{*})$ is essential.

$[(3)\Rightarrow(1)]$ If $\,P\,(nA^{\omega})=nA^{\omega}$ , then every $w\in nA^{\omega}$ has an initial factor in $P$ . It follows that for every right ideal $R\subset nA^{*}$ , $R\,(nA^{\omega})\subseteq P\,(nA^{\omega})$ . Hence $R$ intersects $P\,(nA^{*})$ . So, $P\,(nA^{*})$ is essential.

$[(2)\Rightarrow(3)]$ Suppose $P$ is a finite maximal joinless code. Let $w\in nA^{\omega}$ , and for any $(i_{1},\,\ldots,i_{n})\in{\mathbb{N}}^{n}$ , let $w^{(i_{1},\,\ldots,i_{n})}$ be the initial factor of $w$ in $A^{i_{1}}\times\,\,\ldots\,\times A^{i_{n}}$ . Then $w^{(i_{1},\,\ldots,i_{n})}$ has a join with some $p\in P$ . Since $P$ is finite, $p$ is an initial factor of $w^{(i_{1},\,\ldots,i_{n})}$ if each of $i_{1},\,\ldots,i_{n}$ is larger than $\max\{|p_{i}|:p\in P,\ i\in\{1,\,\ldots,n\}\}$ . Hence, $p$ is an initial factor of $w$ , so $w\in P\,(nA^{\omega})$ . Since for every $w\in nA^{\omega}$ such a $p\in P$ exists (by Lemma 2.8), we conclude that $nA^{\omega}\subseteq P\ (nA^{\omega})$ .

The equivalence of (3) and (4) is obvious since $\,nA^{*}\cdot nA^{\omega}=nA^{\omega}$ , so $\,R\cdot(nA^{\omega})$ $=$ $P\cdot(nA^{*})\cdot(nA^{\omega})=P\cdot(nA^{\omega})$ . $\Box$

Remark. Lemma 2.9 only talks about joinless generated right ideals. Indeed, an essential finitely generated right ideal in $nA^{*}$ is not necessarily joinless generated. An example for $\,A=\{0,1\}\,$ is

$R$ $=$ $\{(\varepsilon,0),(0,\varepsilon),(1,1)\}\cdot(2A^{*})$ .

It is easy to prove that $R$ is essential, and that $\{(\varepsilon,0),$ $(0,\varepsilon),$ $(1,1)\}$ is an initial-factor code that is not joinless (since $(\varepsilon,0)\vee(0,\varepsilon)=(0,0)$ exists). By Lemma 2.7, this initial factor code is unique, i.e., $R$ is not generated by any other initial-factor code; hence $R$ is not joinless generated.

Section 5 of version 1 of [10] gives a detailed proof (independently of Lemma 2.7) that $R$ is essential in $2A^{*}$ , and that $R$ is not generated (as a right ideal) by any finite joinless code.

DAGs and $nA^{*}$ : The following generalizes the well known concepts of the tree of $A^{*}$ and the tree of a prefix code. We abbreviate directed acyclic graph by dag. A few definitions: The leaves of a dag are the vertices of out-degree 0; all the other vertices are interior vertices. For a dag $D$ , the sub-dag spanned by the interior vertices of $D$ is called the interior dag of $D$ . The sources of a dag are the vertices of in-degree 0; if there is only one source, and all vertices are reachable from this source, this source is called the root, and the dag is then called rooted. The depth of a vertex $v$ in a rooted dag is defined to be the length of the shortest path from the root to $v$ ; by “path” we will always mean a directed path.

$\bullet$ The dag of $nA^{*}$ is the infinite rooted dag with vertex set $nA^{*}$ and root $(\varepsilon)^{n}$ ; the edges are the ordered pairs $(s,t)\in(nA^{*})\times(nA^{*})$ such that there exists $i\in\{1,\,\ldots,n\}$ and $a\in A$ with $t=(s_{1},\,\ldots,s_{i-1},\,s_{i}a,\,s_{i+1},\,\ldots,s_{n})\,$ (where $\,s=(s_{1},\,\ldots,s_{i-1},\,s_{i},\,s_{i+1},\,\ldots,s_{n})$ ). Hence every vertex has $|A_{\varepsilon,n}|$ ( $=n\cdot|A|$ ) children; see Notation 2.6. And $u\leq_{\rm init}v$ iff there exists a directed path from $u$ to $v$ in the dag. It is easy to show that the depth of a vertex $\,v=(v_{1},\,\ldots\,,v_{n})$ in the dag of $nA^{*}$ is $\ \sum_{i=1}^{n}|v_{i}|$ .

The dag of $nA^{*}$ is the right Cayley graph of the monoid $nA^{*}$ over the generating set $A_{\varepsilon,n}$ .

$\bullet$ For any finite subset $P\subset nA^{*}$ we define the initial factor dag of $P$ (also called the $P$ -dag): This is a finite rooted subdag of the dag of $nA^{*}$ ; the root of the $P$ -dag is the root of the dag of $nA^{*}$ ; the vertices and edges are those vertices, respectively edges, of the dag of $nA^{*}$ that appear on any path from the root to any vertex in $P$ . Hence the vertices of the $P$ -dag are all the initial factors of the elements of $P$ (so the $P$ -dag is uniquely determined by $P$ ). The set of leaves of the $P$ -dag is $P$ iff $P$ is an initial factor code.

Note that the trees and dags considered here are not ordered trees or dags; i.e., the children of a vertex are defined as a set, not a sequence; similarly, the leaves form a set, not a sequence.

Lemma 2.10

Let $P\subset nA^{*}$ be a finite maximal joinless code such that $P\neq\{\varepsilon\}^{n}$ . Let $v=(v_{1},\,\ldots,v_{n})$ be any leaf of the interior dag of the dag of $P$ , and let $v_{+}$ be the set of children of $v$ in the $P$ -dag; so $v_{+}=v\cdot A_{\varepsilon,n}\,\cap\,P$ , and $v_{+}$ is non-empty (since $v$ is an interior vertex).

(0)* Then $v_{+}$ satisfies*

$v_{+}\ \subseteq\$ * $\{(v_{1},\,\ldots,v_{i-1},\,v_{i}a,\,v_{i+1},\,\ldots,v_{n})$ $\,:\,$ $a\in A\}$ $\ =\$ $v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})\,$ ,*

for some $\,i\in\{1,\,\ldots,n\}$ ; and $i$ is unique (for a given $v$ and $P$ ).

(1)* For $n=1$ , part (0) holds with equality for every leaf $v$ of the interior dag: $v_{+}\ =\ \{va:a\in A\}$ .*

(2) (Lawson and Vdovina [29, Thm. 12.11], but with a different formalism.)* For $n=2$ and $|A|=2$ , part (0) holds with equality for some maximum-depth leaf $v$ of the interior dag of $P$ :*

$v_{+}\ =\ \{(v_{1}a,\,v_{2}):a\in A\}$ * or $v_{+}\ =\ \{(v_{1},\,v_{2}a):a\in A\}$ .*

However, equality does not necessarily hold for every interior leaf, not even for every interior leaf of maximum depth.

(3) (Lawson and Vdovina [29, Ex. 12.8])* For $n\geq 3$ there exist finite maximal joinless codes $P\subset n\,\{0,1\}^{*}$ for which the inclusion in part (0) is strict. I.e., for every leaf $v$ of the interior dag and for every $i\in\{1,\,\ldots,n\}$ :*

$v_{+}\ \neq\$ * $\{(v_{1},\,\ldots,v_{i-1},\,v_{i}a,\,v_{i+1},\,\ldots,v_{n})$ $\,:\,$ $a\in A\}$ .*

Proof. (0) Since $v$ is interior without having interior children, it contains a least one child in $P$ , of the form $(v_{1},\,\ldots,v_{i-1},\,v_{i}a,\,v_{i+1},\,\ldots,v_{n})$ , for some $a\in A$ , $i\in\{1,\,\ldots,n\}$ .

Any possible child of $v$ belongs to $v\cdot A_{\varepsilon,n}$ . If, in addition to $(v_{1},\,\ldots,v_{i-1},\,v_{i}a,\,v_{i+1},\,\ldots,v_{n})$ , $v$ had an additional child of the form $(v_{1},$ $\ldots,$ $v_{j-1},$ $\,v_{j}b,$ $\,v_{j+1},$ $\ldots,$ $v_{n})$ with $i\neq j$ (for any $b\in A$ ), then $P$ would not be joinless. Indeed, these two children have the join $(v_{1},\,\ldots,v_{j}b,\,\ldots,v_{i}a,\,\ldots,v_{n})$ (if $j<i$ ), or $(v_{1},\,\ldots,v_{i}a,\,\ldots,v_{j}b,\,\ldots,v_{n})$ (if $i<j$ ). This shows that all children of $v$ belong to $\,\{(v_{1},\,\ldots,v_{i-1},\,v_{i}a,\,v_{i+1},\,\ldots,v_{n}):$ $a\in A\}\,$ for one particular $i$ (depending on $v$ ).

(1) For $n=1$ the Lemma is folklore knowledge.

(2) (This result is equivalent to [29, Thm. 12.11], but the proof given here is rather different.)

Here $A=\{0,1\}$ . Let $v=(v_{1},v_{2})$ be a maximum-depth leaf of the interior dag of $P$ . Since $v$ is an interior leaf, at least one of its children is in $P$ . Hence either $(v_{1}a,v_{2})\in P$ or $(v_{1},v_{2}a)\in P$ , for some $a\in A$ .

Let us assume that $a=0$ and that $(v_{1}0,v_{2})\in P$ ; the other cases are very similar. Since $(v_{1},v_{2})$ has maximum depth in the interior dag, $(v_{1}0,v_{2})$ has maximum depth in $P$ .

If it is also the case that $(v_{1}1,v_{2})\in P$ , then $\{(v_{1}0,v_{2}),(v_{1}1,v_{2})\}\subseteq P$ , and the Lemma holds. Therefore, from here on we only consider the situation where $(v_{1}1,v_{2})\not\in P$ (but $(v_{1}0,v_{2})\in P$ ). Then there exists $(u_{1},u_{2})\in P\smallsetminus\{(v_{1}0,v_{2})\}$ with $(u_{1},u_{2})\neq(v_{1}1,v_{2})$ , such that $(u_{1},u_{2})$ has a join with $(v_{1}1,v_{2})$ . By Prop. 2.5, this is equivalent to $\,u_{1}\,\|_{\rm pref}\,v_{1}1\,$ and $\,u_{2}\,\|_{\rm pref}\,v_{2}$ .

This leads to four cases.

Case 1: $v_{1}1\leq_{\rm pref}u_{1}$ and $v_{2}\leq_{\rm pref}u_{2}$ .

Then $v_{1}<_{\rm pref}u_{1}$ and $v_{2}\leq_{\rm pref}u_{2}$ . Since $(v_{1},v_{2})$ is a leaf of the interior dag of $P$ , and $(u_{1},u_{2})\in P$ , it follows that $(u_{1},u_{2})$ is a child of $(v_{1},v_{2})$ . Since $(u_{1},u_{2})$ $\not\in$ $\{(v_{1}0,v_{2}),(v_{1}1,v_{2})\}$ , it follows that $(u_{1},u_{2})$ is of the form $(v_{1},v_{2}c)$ for some $c\in A$ . But then $(u_{1},u_{2})$ ( $=(v_{1},v_{2}c)$ ) has a join with $(v_{1}0,v_{2})\in P$ , contradicting the fact that $P$ is joinless. So, case 1 is ruled out.

Case 2: $v_{1}1\geq_{\rm pref}u_{1}$ and $v_{2}\geq_{\rm pref}u_{2}$ ; since $(u_{1},u_{2})\neq(v_{1}1,v_{2})$ , at least one of these $\geq_{\rm pref}$ is strict.

Case 2.1: $v_{1}1>_{\rm pref}u_{1}$ and $v_{2}\geq_{\rm pref}u_{2}$ :

Then $(u_{1},u_{2})$ is interior, since $(v_{1},v_{2})$ is interior. But $(u_{1},u_{2})$ being an interior vertex contradicts the assumption that $(u_{1},u_{2})\in P$ . So case 2.1 is ruled out.

Case 2.2: $v_{1}1=u_{1}$ and $v_{2}>_{\rm pref}u_{2}$ :

Then $u_{2}=v_{2}cz$ , for some $c\in A$ and $z\in A^{*}$ . But now $(u_{1},u_{2})=(v_{1}1,v_{2}cz)$ has greater depth than $(v_{1}0,v_{2})$ , which has maximum depth in $P$ . So case 2.2 is ruled out.

Case 3: $v_{1}1\geq_{\rm pref}u_{1}$ and $v_{2}\leq_{\rm pref}u_{2}$ ; since $(u_{1},u_{2})\neq(v_{1}1,v_{2})$ , at least one of $\geq_{\rm pref}$ or $\leq_{\rm pref}$ is strict.

Case 3.1: $v_{1}1>_{\rm pref}u_{1}$ and $v_{2}\leq_{\rm pref}u_{2}$ .

Then $v_{1}1=u_{1}x1$ , and $u_{2}=v_{2}y$ for some $x,y\in A^{*}$ ; so $v_{1}=u_{1}x$ . But then $\,(u_{1},u_{2})\vee(v_{1}0,v_{2})$ $=$ $(u_{1},v_{2}y)\vee(u_{1}x0,v_{2})$ $=$ $(u_{1}x0,v_{2}y)\,$ exists, contradicting the fact that $\{(u_{1},u_{2}),(v_{1}0,v_{2})\}$ $\subseteq P$ . So case 3.1 is ruled out.

Case 3.2: $v_{1}1=u_{1}$ and $v_{2}<_{\rm pref}u_{2}$ .

Then $(u_{1},u_{2})$ has greater depth than $(v_{1}0,v_{2})$ , contradicting the fact that $(v_{1}0,v_{2})$ has maximum depth in $P$ . So case 3.2 is ruled out.

Case 4: $v_{1}1\leq_{\rm pref}u_{1}$ and $v_{2}\geq_{\rm pref}u_{2}$ .

Then $u_{1}=v_{1}1x$ and $v_{2}=u_{2}y$ for some $x,y\in A^{*}$ . Since $(v_{1}0,v_{2})$ has maximum depth in $P$ we have $|u_{1}|+|u_{2}|\leq|v_{1}0|+|v_{2}|$ , hence $|v_{1}|+1+|x|+|u_{2}|\leq|v_{1}|+1+|u_{2}|+|y|$ , hence $|x|\leq|y|$ . Moreover, $y\neq\varepsilon$ , otherwise $|x|=0$ , hence $x=\varepsilon$ , hence $(u_{1},u_{2})=(v_{1}1,v_{2})$ , which would imply $(v_{1}1,v_{2})\in P$ . In summary this proves:

$|x|\leq|y|\neq 0$ and $v_{2}>_{\rm pref}u_{2}$ .

Notation (used in the remainder of the proof): For any $z\in\{0,1\}^{+}$ , let $z^{-}$ denote the bitstring obtained by complementing the right-most bit of $z$ . And $z\{0,1\}^{-1}$ denotes the bitstring obtained by removing the right-most bit of $z$ .

Note that since $(v_{1}0,v_{2})\in P$ , if we prove that $(v_{1}0,v_{2}^{-})\in P\,$ then the Lemma holds for the interior vertex $(v_{1}0,\,v_{2}\{0,1\}^{-1})$ .

Claim: $(v_{1}0,v_{2}^{-})\in P$ .

Proof of the Claim: Assume by contradiction that there exists $(w_{1},w_{2})\in P$ such that $(w_{1},w_{2})\neq(v_{1}0,v_{2}^{-})$ , and $(w_{1},w_{2})$ has a join with $(v_{1}0,v_{2}^{-})$ . The existence of this join is equivalent to $w_{1}\,\|_{\rm pref}\,v_{1}0$ and $w_{2}\,\|_{\rm pref}\,v_{2}^{-}$ .

This leads to four cases.

Case 4.1: $w_{1}\leq_{\rm pref}v_{1}0$ and $w_{2}\leq_{\rm pref}v_{2}^{-}$ . At least one of the $\leq_{\rm pref}$ is strict.

Case 4.1.1: $w_{1}\leq_{\rm pref}v_{1}0$ and $w_{2}\leq_{\rm pref}v_{2}^{-}A^{-1}=v_{2}A^{-1}$ .

Then the join $(w_{1},w_{2})\vee(v_{1}0,v_{2})=(v_{1}0,v_{2})$ exists, contradicting the fact that $(w_{1},w_{2})$ and $(v_{1}0,v_{2})$ belong to $P$ . So case 4.1.1 is ruled out.

Case 4.1.2: $w_{1}\leq_{\rm pref}v_{1}$ and $w_{2}\leq_{\rm pref}v_{2}^{-}$ .

Then $v_{1}=w_{1}\alpha$ and $v_{2}^{-}=u_{2}y^{-}=w_{2}\beta$ for some $\alpha,\beta\in A^{*}$ . The latter equality implies that $w_{2}\,\|_{\rm pref}\,u_{2}$ . Recall that in case 4, $u_{1}=v_{1}1x$ ; this and $v_{1}=w_{1}\alpha$ imply that $u_{1}=w_{1}\alpha 1x$ , hence $u_{1}\,\|_{\rm pref}\,w_{1}$ . Now, since $u_{1}\,\|_{\rm pref}\,w_{1}$ and $w_{2}\,\|_{\rm pref}\,u_{2}$ , the join $(w_{1},w_{2})\vee(u_{1},u_{2})$ exists, which contradicts the fact that $(w_{1},w_{2})$ and $(u_{1},u_{2})$ belong to $P$ . So case 4.1.2 is ruled out.

Case 4.2: $w_{1}\geq_{\rm pref}v_{1}0$ and $w_{2}\geq_{\rm pref}v_{2}^{-}$ .

Since $(v_{1}0,v_{2})$ has maximum depth in $P$ , and $(v_{1}0,v_{2}^{-})$ has the same depth, it follows that $(w_{1},w_{2})=(v_{1}0,v_{2}^{-})$ . This contradicts the assumption $(w_{1},w_{2})\neq(v_{1}0,v_{2}^{-})$ . So case 4.2 is ruled out.

Case 4.3: $w_{1}\leq_{\rm pref}v_{1}0$ and $w_{2}\geq_{\rm pref}v_{2}^{-}$ .

Case 4.3.1: $w_{1}=v_{1}0$ and $w_{2}>_{\rm pref}v_{2}^{-}$ (since $(w_{1},w_{2})\neq(v_{1}0,v_{2}^{-})$ , equality in the first coordinate implies strictness in the second).

Then $|w_{1}|+|w_{2}|>|v_{1}0|+|v_{2}^{-}|=|v_{1}0|+|v_{2}|$ , i.e., $(w_{1},w_{2})$ has greater depth than $(v_{1}0,v_{2})$ , which contradicts the fact that $(v_{1}0,v_{2})$ has maximum depth in $P$ . So case 4.3.1 is ruled out.

Case 4.3.2: $w_{1}<_{\rm pref}v_{1}0$ and $w_{2}\geq_{\rm pref}v_{2}^{-}$ .

Then $w_{1}\leq_{\rm pref}v_{1}=w_{1}s$ , and $w_{2}=v_{2}^{-}t=u_{2}y^{-}t$ , for some $s,t\in A^{*}$ . Recall that $y\neq\varepsilon$ in case 4. Then $(w_{1},w_{2})\vee(u_{1},u_{2})=(w_{1},u_{2}y^{-}t)\vee(w_{1}s,u_{2})=$ $(w_{1}s,u_{2}y^{-}t)$ exists. This contradicts the fact that $(w_{1},w_{2})$ and $(u_{1},u_{2})$ belong to $P$ . So case 4.3.2 is ruled out.

Case 4.4: $w_{1}\geq_{\rm pref}v_{1}0$ and $w_{2}\leq_{\rm pref}v_{2}^{-}$ ; since $(w_{1},w_{2})\neq(v_{1}0,v_{2}^{-})$ , $\leq_{\rm pref}$ or $\geq_{\rm pref}$ is strict.

Case 4.4.1: $w_{1}>_{\rm pref}v_{1}0$ and $w_{2}=v_{2}^{-}$ .

Then $|w_{1}|+|w_{2}|>|v_{1}0|+|v_{2}^{-}|=|v_{1}0|+|v_{2}|$ , hence $(w_{1},w_{2})$ has greater depth than $(v_{1}0,v_{2})$ , which contradicts the fact that $(v_{1}0,v_{2})$ has maximum depth in $P$ . So case 4.4.1 is ruled out.

Case 4.4.2: $w_{1}\geq_{\rm pref}v_{1}0$ and $w_{2}<_{\rm pref}v_{2}^{-}$ .

Then $w_{1}=v_{1}0s$ ; also, $w_{2}<_{\rm pref}v_{2}$ (since $v_{2}$ and $v_{2}^{-}$ only differ in the right-most bit), so $v_{2}=w_{2}t$ , for some $s,t\in A^{*}$ . Now, $(w_{1},w_{2})\vee(v_{1}0,v_{2})=(v_{1}0s,w_{2})\vee(v_{1}0,w_{2}t)=$ $(v_{1}0s,w_{2}t)$ exists. This contradicts the fact that $(w_{1},w_{2})$ and $(v_{1}0,v_{2})$ belong to $P$ . So case 4.4.2 is ruled out.

Since we now ruled out all sub-cases of case 4, this completes the proof (by contradiction) of the Claim.

Summary of the proof so far: We have $(v_{1}0,v_{2})\in P$ for some maximum-depth vertex $(v_{1},v_{2})$ in the interior of the $P$ -dag. (The cases where, instead, we have $(v_{1}1,v_{2})$ or $(v_{1},v_{2}0)$ , or $(v_{1},v_{2}1)$ in $P$ , are similar.)

If we also have $(v_{1}1,v_{2})\in P$ then the Lemma holds.

If $(v_{1}1,v_{2})\not\in P$ then there exists $(u_{1},u_{2})\in P$ that has a join with $(v_{1}1,v_{2})$ . Four cases are possible, of which cases 1, 2, and 3 were ruled out. In case 4 we showed that $(v_{1}0,v_{2}^{-})\in P$ ; hence in case 4, $(v_{1}0,v_{2})$ and $(v_{1}0,v_{2}^{-})$ belong to $P$ , i.e., the Lemma holds for the interior vertex $(v_{1}0,v_{2}\{0,1\}^{-1})$ .

The following is an example where not every maximum-depth interior leaf has two children in $P$ . Consider the maximal joinless code $P=$ $\{(0,0),(0,1),(1,\varepsilon)\}$ . Here the interior leaf $v=(\varepsilon,0)$ has maximum depth, and has only one child in $P$ (namely $(0,0)$ ). Nevertheless, there is another maximum-depth interior leaf, namely $(0,\varepsilon)$ , that has two children in $P$ (namely $(0,0)$ and $(0,1)$ ).

(3) Example (from [29, Ex. 12.8]): Let $P=$ $\{(0,0,\varepsilon),\,(1,\varepsilon,0),\,(\varepsilon,1,1),$ $(0,1,0),\,(1,0,1)\}\,\subset\,3\,\{0,1\}^{*}$ . It is easy to verify that $P$ is a finite maximal joinless code, and that no leaf of the interior dag has two children in $P$ . $\Box$

Remark about Lemma 2.10: Version 1 of this paper (see [10]) stated incorrectly that “for every $n\geq 1$ and every leaf $v$ of the interior dag of $P$ : $\ v_{+}=$ $v\cdot(\{\varepsilon\}^{i}\times A\times\{\varepsilon\}^{n-i-1})\,$ (for some $i$ , $0\leq i<n$ )”. This statement had to be modified for $n=2$ (from “for every leaf” to “there exists a leaf”), and dropped for $n\geq 3$ . The above counter-example for $n\geq 3$ was given in [28] and [29, Ex. 12.8].

Lemma 2.11

Let $P\subset nA^{*}$ be a finite set. For any $\,p=(p_{1},\,\ldots,p_{n})\in P$ and $i\in\{1,\,\ldots,n\}$ , let

$P_{p,i}^{\prime}\ =\ (P\smallsetminus\{p\})$ * $\ \cup\$ $\{(p_{1},\,\ldots,p_{i-1},\,p_{i}a,\,p_{i+1},\,\ldots,p_{n})\,:\,$ $a\in A\}$ .*

Then we have:

(1)* $P$ is joinless iff $\,P_{p,i}^{\prime}\,$ is joinless.*

(2)* $P$ is a maximal joinless code iff $\,P_{p,i}^{\prime}\,$ is a maximal joinless code.*

The set $P_{p,i}^{\prime}$ is called a one-step restriction of $P$ (“restriction” because $\,P_{p,i}^{\prime}\cdot(nA^{*})\,\subsetneqq\,P\cdot(nA^{*})$ ); and $P$ is called a one-step extension of $P_{p,i}^{\prime}$ . Clearly, $\ |P_{p,i}^{\prime}|=|P|-(|A|-1)$ .

Proof. (1) $[\Rightarrow]$ Let us assume that $P$ is joinless. For any $a,a^{\prime}\in A$ with $a\neq a^{\prime}$ , the join of $\,(p_{1},$ $\ldots,$ $p_{i-1},$ $p_{i}a,$ $p_{i+1},$ $\ldots,$ $p_{n})\,$ and $\,(p_{1},$ $\ldots,$ $p_{i-1},$ $p_{i}a^{\prime},$ $p_{i+1},$ $\ldots$ , $p_{n})\,$ does not exist, since $p_{i}a$ and $p_{i}a^{\prime}$ are not prefix-comparable.

If $q\in P\smallsetminus\{p\}$ and $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ were both initial factors of some $z\in nA^{*}$ , then $q$ and $p$ would also both be initial factors of $z$ , contradicting the assumption that $P$ is joinless.

Finally, all pairs $q_{1},q_{2}\in P\smallsetminus\{p\}$ ( $\subset P_{p,i}^{\prime}$ ) are joinless since $P$ is joinless. Thus $P_{p,i}^{\prime}$ is joinless.

$[\Leftarrow]$ Let us assume that $P_{p,i}^{\prime}$ is joinless. Then every pair $q_{1},q_{2}\in P\smallsetminus\{p\}$ ( $\subset P_{p,i}^{\prime}$ ) is joinless.

If $q\in P\smallsetminus\{p\}$ and $p$ had a join $z$ , then both $p$ and $q$ would be initial factors of $z$ . By Lemma 2.5, $z_{j}=\max\{q_{j},p_{j}\}$ for all $j\in\{1,\,\ldots,n\}$ . We have two cases.

Case 1: $z_{i}=p_{i}$ (for the $i$ used in $P_{p,i}^{\prime}$ ).

This is equivalent to $q_{i}$ being a prefix of $p_{i}$ . Then $q_{i}$ is a prefix of $p_{i}a$ as well (for every $a\in A$ ), hence $q\in P\smallsetminus\{p\}$ and $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ have a join. But this contradicts the assumption that $P_{p,i}^{\prime}$ is joinless.

Case 2: $z_{i}\neq p_{i}$ , and $z_{i}=q_{i}$ (for the $i$ used in $P_{p,i}^{\prime}$ ).

Then $p_{i}$ is a strict prefix of $q_{i}$ ( $=z_{i}$ ), hence $p_{i}a$ is a prefix of $q_{i}$ for some $a\in A$ . It follows that $z$ has $q$ and $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ as initial factors; this contradicts the assumption that $P_{p,i}^{\prime}$ is joinless.

(2) $[\Rightarrow]$ Suppose $P$ is a maximal joinless code. Hence, every $x\in nA^{*}$ has a join with some $q\in P$ (otherwise $x$ could be added to $P$ , which would contradict that $P$ is maximal joinless). We want to show that $x$ also has a join with some element of $P_{p,i}^{\prime}$ .

If $q\neq p$ then $q\in P_{p,i}^{\prime}$ , hence $x$ also has a join with some $q\in P_{p,i}^{\prime}$ .

If $q=p$ , i.e., $z=x\vee p$ , then $z_{j}=\max\{x_{j},\,p_{j}\}$ for all $j\in\{1,\,\ldots,n\}$ . We have two cases.

Case 1: $z_{i}=p_{i}$ (for the $i$ used in $P_{p,i}^{\prime}$ ).

This is equivalent to $x_{i}$ being a prefix of $p_{i}$ . Then $x_{i}$ is a prefix of $p_{i}a$ too (for every $a\in A$ ), hence $x$ and $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ have a join. So, $x$ has a join with some element of $P_{p,i}^{\prime}$ .

Case 2: $z_{i}\neq p_{i}$ , and $z_{i}=x_{i}$ (for the $i$ used in $P_{p,i}^{\prime}$ ).

Then $p_{i}$ is a strict prefix of $x_{i}$ ( $=z_{i}$ ), hence $p_{i}a$ is a prefix of $x_{i}$ for some $a\in A$ . It follows that $z$ has $x$ and $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ as initial factors; this implies that $x$ has a join with $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ $\in P_{p,i}^{\prime}$ (for this particular $a\in A$ ).

$[\Leftarrow]$ Suppose that $P_{p,i}^{\prime}$ is maximal joinless. Then every $x\in nA^{*}$ has a join with some $q\in P_{p,i}^{\prime}$ . We want to show that $x$ also has a join with some element of $P$ .

If $q\neq(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ for all $a\in A$ , then $q\in P$ so $x$ also has a join with $q\in P$ .

If $q=(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ for some $a\in A$ , then let $z$ be the join of $x$ and $(p_{1},$ $\ldots,$ $p_{i-1},$ $p_{i}a,$ $p_{i+1},$ $\ldots,p_{n})$ . Then $z$ has $x$ and $(p_{1},\,\ldots,p_{i-1},p_{i}a,p_{i+1},\,\ldots,p_{n})$ as initial factors, hence $p$ is an initial factor of $z$ . Hence $x\vee p$ exists, so $x$ has a join with an element of $P$ . $\Box$

The properties of joinless codes given in Lemma 2.11 do not hold for initial factor codes in general. For example, for $A=\{0,1\}$ consider the initial factor code $\,P=\{(\varepsilon,0),\,(0,\varepsilon)\}$ . Then for $p=(0,\varepsilon)$ and $i=2$ we obtain $\,P_{p,i}^{\prime}=\{(\varepsilon,0),\,(0,0),\,(0,1)\}$ , which is not an initial factor code.

The process of one-step restriction or extension can be iterated, which inspires the following definition and the algorithm.

Definition 2.12

(parse trees).* Let $P\subset nA^{*}$ be a finite joinless code. A parse tree of $P$ is any subtree $T$ of the dag of $P$ with the following properties:*

(1)* The root of $T$ is $(\varepsilon)^{n}$ (i.e., the root of the dag of $P$ ); and the set of leaves of $T$ is $P$ (i.e., the leaves of the dag of $P$ ).*

(2)* For every interior vertex $v$ of $T$ the set of children in $T$ is $\ v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})$ , for a unique $i\in\{1,\,\ldots,n\}$ . So $v$ has exactly $|A|$ children in $T$ .*

Given the dag of $P$ and a subtree $T$ , it is easy to check whether $T$ is a parse tree of $P$ ; one just needs to check that $(\varepsilon)^{n}$ occurs in $T$ , and that every vertex in $T$ is reachable from $(\varepsilon)^{n}$ ; moreover, for each vertex $v$ of $T$ one checks whether it is in $P$ , or whether its set of children is of the form $\,v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})$ . Recall the dags and trees are not oriented (children and leaves are not ordered).

A maximal joinless code $P$ can have more than one parse tree. E.g., the joinless set $\ \{(0,0),$ $(0,1),$ $(1,0),$ $(1,1)\}\$ has the following two parse trees:

$(\varepsilon,\varepsilon)$ $(\varepsilon,\varepsilon)$

$/$ $\setminus$ $/$ $\setminus$

$(0,\varepsilon)$ $(1,\varepsilon)$ $(\varepsilon,0)$ $(\varepsilon,1)$

$/$ $\setminus$ $/$ $\setminus$ $/$ $\setminus$ $/$ $\setminus$

$(0,0)$ $(0,1)$ $(1,0)$ $(1,1)$ $(0,0)$ $(1,0)$ $(0,1)$ $(1,1)$

Burillo and Cleary [18] give a similar tree description of tilings of $[0,1]^{2}$ , and point out that the tree is not unique.

If $P$ is not maximal (as a joinless code) then it has no parse tree (according to our definition of parse tree).

By Lemma 2.10(2), every maximal joinless code in $\,2\,\{0,1\}^{*}$ has at least one parse tree. But in $nA^{*}$ with $n\geq 3$ there are maximal joinless codes that have no parse tree, by Lemma 2.10(3); geometrically, codes in $3\,\{0,1\}^{*}$ without parse tree correspond to tilings of the cube that cannot be obtained by successive bipartitions of cuboids (perpendicularly to an axis). This motivates the following.

Questions: Is there a simple geometric or combinatorial characterization of the finite maximal joinless codes in $nA^{*}$ (for $n\geq 3$ ) that have no parse tree? Is the non-existence of a parse tree equivalent to the presence of one of certain joinless subsets (“forbidden patterns”)? An example of such a forbidden pattern is the subset $\,\{(0,0,\varepsilon),\,(1,\varepsilon,0),\,(\varepsilon,1,1)\}\,$ of Lawson and Vdovina [29], used in 2.10(3).

The following algorithm nondeterministically constructs any parse tree of $P$ , if a parse tree exists. If $P$ has no parse tree the algorithm will discover this for some (but not all) nondeterministic choices. For a finite joinless code $P\subset\,2\,\{0,1\}^{*}$ , the deterministic version of the algorithm decides whether $P$ is maximal (as a joinless code).

*Outline of the algorithm: * Initially, the algorithm puts $P$ into $T$ (as its leaf set), and makes a working copy $P_{0}$ of $P$ . The algorithm keeps looking for an initial factor $v$ of an element of $P_{0}$ such that $\,v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})\,$ $\subseteq$ $P_{0}$ (for some $i\in\{1,\,\ldots,n\}$ ). When such a $v$ is found, it is added to $T$ and to $P_{0}$ ; and $\,v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})\,$ is removed from the working copy $P_{0}$ . If $(\varepsilon)^{n}$ is reached, and put into $T$ , the construction of $T$ is complete and the algorithm concludes that $P$ is maximal (as a joinless code), and that it has a parse tree.

The algorithm can be made deterministic by picking a total order for $nA^{*}$ (e.g., the lexicographic dictionary order), and always picking the first $v$ that works.

Notation: $\,{\sf init}(P_{0})\,$ denotes the set of strict initial factors of the elements of $P_{0}$ ; because of strictness (and since $P_{0}$ is joinless), $P_{0}\,\cap\,{\sf init}(P_{0})=\varnothing$ .

Algorithm

Input: A finite set $P\subset nA^{*}$ , given by a list of $n$ -tuples of strings in $A^{*}$ .

Precondition: $P\neq\{(\varepsilon)^{n}\}$ , and $P$ is joinless. (This can easily be checked, by Lemma 2.5.)

Output: A set of vertices $V(T)$ and edges $E(T)$ of a parse tree of $P$ , if $P$ has a parse tree;

$P_{0}:=P$ ;

$P_{0}$ is a a working copy of $P$

$V(T):=P$ ; $E(T):=\varnothing$ ;

while $\,(\exists v\in{\sf init}(P_{0}))$ $(\exists i\in\{1,\,\ldots,n\})$ $[\,v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})$ $\,\subseteq\,$ $P_{0}\,]$ :

choose any $v$ that satisfies the while-condition;

for a deterministic algorithm, pick the first $v$

that works (in a fixed total order)

$V(T):=V(T)\cup\{v\}$ ;

$E(T):=E(T)$ $\cup$ set of all edges from $v$ to the elements of $\,v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i})$ ;

$P_{0}:=(P_{0}$ $\,\smallsetminus\,$ $\,v\cdot(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i}))$ $\ \cup\$ $\{v\}$ ; # Hence $P_{0}$ remains joinless.

if $\,(\varepsilon)^{n}\in V(T)$ :

then output $(V(T),E(T))$ and conclude that $P$ is maximal;

else (in case $n=2$ and $A=\{0,1\}$ ) conclude that $P$ is not maximal

(and hence has no parse tree).

$\Box$

Proposition 2.13

Let $P$ be any finite joinless code in $\,2\,\{0,1\}^{*}$ .

Then $P$ has a parse tree iff $P$ is maximal as a joinless code.

The Algorithm (deterministic version) decides maximality of $P$ and finds a parse tree in polynomial time, when $P$ is given as a list of pairs of bitstrings.

Proof. The Algorithm uses one-step extensions of maximal joinless codes; by Lemma 2.11, each one-step extension or restriction preserves joinlessness and maximality. Since $\{\varepsilon\}^{n}$ is a maximal joinless code, it follows that $P$ is maximal if the root $(\varepsilon)^{n}$ is reached. It follows also that if the root is reached, a parse tree of $P$ exists (and the Algorithm returns such a tree).

Conversely (for $n=2$ and $A=\{0,1\}$ ), if $P$ (or, at any later stage, $P_{0}$ ) is maximal, then by Lemma 2.10(2) there exists $v$ in the interior dag such that $\,v\cdot(\{\varepsilon\}\times\{0,1\}$ $\,\cup\,$ $\{0,1\}\times\{\varepsilon\})$ $\subseteq$ $P$ (or $\subseteq$ $P_{0}$ ). And this process does not stop until $P_{0}=\{\varepsilon\}^{n}$ . $\Box$

Corollary 2.14

(cardinality of joinless codes).**

Let $n$ be any positive integer and $A$ any finite alphabet.

(0.1)* For every $k_{1},\ldots,k_{n}\in{\mathbb{N}}$ : X ${}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}\$ is a maximal joinless code that has a parse tree.*

(0.2)* For any finite joinless code $P\subset nA^{*}$ : $P$ is maximal iff $P$ can be transformed into X ${}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}\,$ by a finite sequence of restriction steps, where $k_{i}=$ $\max\{|v_{i}|:(v_{1},\ldots,v_{n})\in P\}\,$ for $1\leq i\leq n$ .*

(1)* For every finite maximal joinless code $P$ $\subseteq$ $nA^{*}$ there exists $N\in{\mathbb{N}}$ such that*

$|P|\,=\,1+(|A|-1)\cdot N$ .

(1.1)* If $P$ has a parse tree then $P$ can be obtained from $\{\varepsilon\}^{n}$ by a finite sequence of one-step restrictions. The number of one-step restrictions used is equal to the number of interior vertices of every parse tree of $P$ , and is equal to $\,N=(|P|-1)/(|A|-1)$ .*

(1.2)* If $P$ has no parse tree, then $P$ can be obtained from $\{\varepsilon\}^{n}$ by a finite sequence of one-step restrictions, followed by a finite sequence of one-step extensions.*

Even when $P$ has no parse tree, $N=(|P|-1)/(|A|-1)$ is still the number of interior vertices in any parse tree of any maximal joinless code that has a parse tree and that has the same cardinality as $P\,$ (e.g., of the form $P_{1}\times\{\varepsilon\}^{n-1}$ where $P_{1}$ is a prefix code in $A^{*}$ ).

(2)* Conversely, for all $N\in{\mathbb{N}}$ there are maximal joinless codes in $nA^{*}$ of cardinality $\,1+(|A|-1)\cdot N$ . In particular, when $|A|=2$ every positive integer is the cardinality of some maximal joinless code.*

Proof. (0.1) Let $\,C(k_{1},\ldots,k_{i},\ldots,k_{n})$ $\,=\,$ X ${}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}$ . Let us prove by induction on $\,\sum_{i=1}^{n}k_{i}$ that $C(k_{1},\ldots,k_{i},\ldots,k_{n})$ has a parse tree. For $C(0\ldots,0,\ldots,0)=\{\varepsilon\}^{n}$ , the parse tree consists of one vertex. Inductively,

$C(k_{1},\ldots,k_{i-1},k_{i}+1,k_{i+1},\ldots,k_{n})$ $=$ X ${}_{{}_{j=1}}^{{}^{i-1}}A^{k_{j}}$ $\,\times\,A^{k_{i}+1}\times\,$ X ${}_{{}_{j=i+1}}^{{}^{n}}A^{k_{j}}$

$=\$ (X ${}_{{}_{j=1}}^{{}^{n}}A^{k_{j}}$ ) $\cdot$ $(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i-1})$

$=\$ $C(k_{1},\ldots,k_{i},\ldots,k_{n})$ $\cdot$ $(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i-1})$

$=\$ $\bigcup_{v\in A^{k_{i}}}$ $\big{(}$ (X ${}_{{}_{j=1}}^{{}^{i-1}}A^{k_{j}}$ $\times\{v\}\times$ X ${}_{{}_{j=i+1}}^{{}^{n}}A^{k_{j}}$ ) $\cdot$ $(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i-1})\big{)}$ .

So, $C(k_{1},\ldots,k_{i-1},k_{i}+1,k_{i+1},\ldots,k_{n})$ is obtained form $C(k_{1},\ldots,k_{i},\ldots,k_{n})$ by $|A|^{k_{i}}$ one-step restrictions (one one-step restriction for every $v\in A^{k_{i}}$ ). It follows that if $C(k_{1},\ldots,k_{i},\ldots,k_{n})$ has a parse tree then $C(k_{1},\ldots,k_{i-1},k_{i}+1,k_{i+1},\ldots,k_{n})$ has a parse tree. Moreover, any joinless code that has a parse tree is maximal.

(0.2) Let

$\ell_{i}(P)$ $\,=\,$ $\max\{|v_{i}|:(v_{1},\ldots,v_{n})\in P\}$ , for $1\leq i\leq n$ ; and

$\nu(P)$ $\,=\,$ $\,\prod_{i=1}^{n}|A|^{\ell_{i}(P)}\ -\ \sum_{u\in P}\sum_{i=1}^{n}|u_{i}|\,$ .

The fact that $P$ can be restricted to $C(\ell_{1}(P),\,\ldots\,,\ell_{n}(P))$ follows by induction on $\nu(P)$ :

If $\nu(P)=0$ then $P=C(\ell_{1}(P),\,\ldots\,,\ell_{n}(P))$ .

If $\nu(P)>0$ , and $(u_{1},\ldots,u_{n})\in P$ is such that $|u_{i}|<\ell_{i}(P)$ for some $i$ , then a one-step restriction decreases $\nu(P)$ , as $(u_{1},\ldots,u_{n})$ is replaced by $\,(u_{1},\ldots,u_{n})\cdot$ $(\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i-1})$ .

(1.1) We prove the equivalent statement that from $P$ one can reach $\{\varepsilon\}^{n}$ by $N=(|P|-1)/(|A|-1)$ ones-step extensions. We use induction on $|P|$ . When $|P|=1$ then $P=\{\varepsilon\}^{n}$ , and the formula holds. For $|P|>1$ , an extension step can be applied to some leaf of the interior of a parse tree of $P$ , by Lemmas 2.10 and 2.11. In this extension step, a new maximal joinless code $Q$ is obtained; one leaf of the interior the parse tree of $P$ becomes a leaf of the parse tree of $Q$ , so this parse tree of $Q$ has $N-1$ interior vertices; and $|Q|=|P|-(|A|-1)$ . By induction, $N-1=(|Q|-1)/(|A|-1)$ ; and the latter is equal to $(|P|-(|A|-1)-1)/(|A|-1)$ $=$ $(|P|-1)/(|A|-1)-1$ . Hence $N=(|P|-1)/(|A|-1)$ .

(1.2) By applying one-step restrictions as in part (0.2), from any maximal joinless code $P$ one can reach X ${}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}$ , where $k_{i}$ is as in part (0.2). And from X ${}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}$ one can reach $\{\varepsilon\}^{n}$ by one-step extensions by (0.1). In any one-step restriction or extension the cardinality of the maximal joinless code increases or decreases by $|A|-1$ . So, to reach $P$ from $\{\varepsilon\}^{n}$ we can apply restrictions to reach X ${}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}$ , then apply extensions to obtain $P$ .

(1) The formula follows from (1.1) and (1.2).

(2) For the existence of codes of the given cardinality, take for example $\,Q\times\{\varepsilon\}^{n-1}$ , where $Q$ is any maximal prefix code in $A^{*}$ , and apply the corresponding result for maximal prefix codes (which is folklore; see e.g. [6, Lemma 9.9(0)]). $\Box$

Proposition 2.15

*There exist polynomial-time algorithms that on input $P\subset nA^{*}$ (a finite set, given by an explicit list of $n$ -tuples of strings) decide whether $P$ has the following properties:

(1) $P$ is joinless;

(2) $P$ is maximal as a joinless code.*

Proof. The input to the algorithms is $P$ , given as a list of $n$ -tuples of strings, so the input size is $\,\sum_{p\in P}\sum_{i=1}^{n}|p_{i}|$ .

(1) Lemma 2.5, applied to every two elements $u,v\in P$ with $u\neq v$ , will decide in quadratic time whether $P$ is joinless.

(2) For finite joinless codes in $\,2\,\{0,1\}^{*}$ , the Algorithm given after Def. 2.12 has polynomial time complexity, in view of Coroll. 2.14 which proves that every parse tree of $P$ has size that is linearly bounded in terms of $|P|$ .

For $nA^{*}$ in general, the Algorithm that follows Prop. 2.17, based on the generalized Kraft equality, decides in polynomial time whether a joinless code is maximal. $\Box$

The corresponding questions about initial factor codes are also decidable in polynomial time. Suppose $P\subset nA^{*}$ is finite and given by an explicit list of $n$ -tuples of strings. It is easy to decide whether $P$ is an initial factor code; it is sufficient to check for every two elements $u,v\in P$ with $u\neq v$ , whether $u\,\|_{\rm init}\,v$ . A code $P$ is maximal as an initial factor code iff for every interior vertex $v$ of the $P$ -dag, the $n\,|A|$ children of $v$ in the $nA^{*}$ -dag are also children of $v$ in the $P$ -dag.

An algorithm for testing maximality of a joinless code can be derived from the following generalization of the Kraft (in)equality to higher dimensions. We mentioned earlier that in the geometric description of the Brin-Thompson groups, a word $x=(x_{1},\ldots,x_{n})\in n\,\{0,1\}^{*}$ represents the hyperrectangle X ${}_{{}_{i=1}}^{{}^{n}}[0.x_{i}\,,\ 0.x_{i}+2^{-|x_{i}|}[\$ (where we close the intervals whose right-bound is 1). The measure of this hyperrectangle is $2^{-(|x_{1}|\,+\ \ldots\ +\,|x_{n}|)}$ . More generally, we have the following.

Definition 2.16

Let $A$ be an alphabet of cardinality $|A|=k\geq 2$ . For every $\,x=(x_{1},\ldots,x_{n})\in nA^{*}\,$ we define the measure

$\mu(x)\,=\,k^{-(|x_{1}|+\,\ldots\,+|x_{n}|)}$ .

For every joinless code $P\subset nA^{*}$ (not necessarily finite) we define the measure

$\mu(P)\,=\,\sum_{x\in P}\,\mu(x)$ .

Proposition 2.17

( $n$ -dimensional Kraft (in)equality).* Let $P\subset nA^{*}$ be a finite joinless code, where $|A|\geq 2$ and $n\geq 1$ . Then we have:*

(1)* $\mu(P)\leq 1$ .*

(2)* $P$ is maximal (as a joinless code) iff $\mu(P)=1$ .*

Proof. This follows from the geometric picture. For a joinless code $P$ , all the words in $P$ represent non-overlapping hyperrectangles in $[0,1]^{n}$ , so their total measure is at most the measure of $[0,1]^{n}$ , which is 1.

And $P$ is maximal iff the corresponding hyperrectangles tile $[0,1]^{n}$ , which is iff the sum of the measures of the hyperrectangles is 1. $\Box$

Prop. 2.17 probably holds for infinite joinless codes too; but since we don’t need it in that case, we’ll that question open.

Prop. 2.17 leads to the following algorithm.

Algorithm (maximality of a finite joinless code)

Input: A finite set $P\subset nA^{*}$ , given as an explicit list of words.

Precondition: $P$ is joinless. (This is easily checked, by Prop. 2.15(1).)

Question: Is $P$ maximal?

Compute $\ \mu(P)=\sum_{x\in P}k^{-\sum_{i=1}^{n}|x_{i}|}$ in fractional base- $k$ representation;

if $\,\mu(P)=1$ , output “yes”;

else, output “no”. $\Box$

This algorithm runs in polynomial time, in terms of the total input length $\,\sum_{x\in P}\sum_{i=1}^{n}|x_{i}|$ . In fractional base- $k$ representation the sum $\mu(P)$ is easy to compute.

We will need the intersection of joinless generated right ideals, and the elementwise join of joinless codes.

Proposition 2.18

Let $P,Q\subset nA^{*}$ be joinless codes.

(1)* The elementwise join $P\vee Q$ , defined by*

$P\vee Q\,=\,\{p\vee q\,:\,p\in P,\ q\in Q\}$ ,

is a joinless code. (Here, $p\vee q$ ranges over the joins that exist.)

Hence, $|P\vee Q|\,\leq\,|P|\cdot|Q|$ .

(2)* $P$ and $Q$ are both maximal (as joinless codes) iff $P\vee Q$ is maximal.*

(3)* $(P\vee Q)\cdot(nA^{*})\ =\$ $P\cdot(nA^{*})\ \cap\ Q\cdot(nA^{*})$ .*

Hence, if $P\,(nA^{*})$ and $Q\,(nA^{*})$ are joinless generated then so is $\,P\,(nA^{*})\,\cap\,Q\,(nA^{*})$ .

Proof. (1) Suppose $p,p^{\prime}\in P$ , $q,q^{\prime}\in Q$ , and $p\neq p^{\prime}$ or $q\neq q^{\prime}$ . Then $(p\vee q)\vee(p^{\prime}\vee q^{\prime})\,$ does not exist, because $(p\vee q)\vee(p^{\prime}\vee q^{\prime})$ would have $p$ , $p^{\prime}$ , $q$ , and $q^{\prime}$ as prefixes. But either $p$ and $p^{\prime}$ (if different) or $q$ and $q^{\prime}$ (if different) do not have a join.

(2) $[\Leftarrow]$ If $P\vee Q$ is maximal then every $x\in nA^{*}$ has a join with some $p\vee q\in P\vee Q$ , i.e., $x$ and $p\vee q$ are initial factors of some $z\in nA^{*}$ . Then $p$ and $q$ are also initial factors of $z$ , so $x\vee p$ and $x\vee q$ exist. Hence, every $x\in nA^{*}$ has a join with some $p\in P$ and some $q\in Q$ , thus $P$ and $Q$ are maximal.

$[\Rightarrow]$ If $P$ is maximal then every $x\in nA^{*}$ has a join with some $p\in P$ ; and if $Q$ is maximal, $x\vee p$ has a join with some $q\in Q$ . Hence, $x$ , $p$ , and $q$ , are all initial factors of some word $z$ , hence $z\vee p\vee q$ exists. So, every $x\in nA^{*}$ has a join with some $p\vee q$ , so $P\vee Q$ is maximal.

(3) $[\supseteq]$ Every $w\in P\,(nA^{*})\,\cap\,Q\,(nA^{*})$ satisfies $w=pu=qv$ for some $p\in P$ , $q\in Q$ , and $u,v\in nA^{*}$ . This implies that $p$ and $q$ are initial factors of $w$ , so $p\vee q$ exists, and is an initial factor of $w$ . Hence, $w\in(P\vee Q)\cdot(nA^{*})$ .

$[\subseteq]$ If $p\vee q$ exists then it has $p$ and $q$ as initial factors, hence $p\vee q\in P\,(nA^{*})\,\cap\,Q\,(nA^{*})$ . $\Box$

2.3 Right ideal morphisms of $nA^{*}$ , and string-based

definition of $nG_{k,1}$ and $nV$

Just as for $A^{*},$ one defines the concepts of right ideal morphism, domain code, and image code in $nA^{*}$ . We only consider domain and image codes that are joinless. Indeed, if $P\subset nA^{*}$ is not joinless, some definitions of right ideal morphisms on $P$ will be inconsistent. E.g., let $P=\{(0,\varepsilon),\,(\varepsilon,0)\}$ , so $(0,\varepsilon)\vee(\varepsilon,0)=(0,0)$ ; and let $f(0,\varepsilon)=(0,0)$ and $f(\varepsilon,0)=(1,1)$ ; then $\,f(0,0)=f((0,\varepsilon)\cdot(\varepsilon,0))=$ $(0,0)\cdot(\varepsilon,0)=(0,00)\neq$ $(10,1)=(1,1)\cdot(0,\varepsilon)=$ $f((\varepsilon,0)\cdot(0,\varepsilon))=f(0,0)$ ; so $f(0,0)$ receives two different values.

Before we get to $nG_{k,1}$ we define the following monoid:

Definition 2.19

.

$n{\cal RI}_{A}^{\sf fin}$ * $\ =\$ $\{f:\,f$ is a right ideal morphism of $nA^{*}$ such that $f$ is injective,*

and ${\rm domC}(f)$ and ${\rm imC}(f)$ are finite, maximal, joinless codes}* .*

“Maximal” means maximal as a joinless code. Usually we just write $n{\cal RI}^{\sf fin}$ when a fixed alphabet $A$ is used.

Lemma 2.20

For every $f\in n{\cal RI}^{\sf fin}$ : $f({\rm domC}(f))={\rm imC}(f)$ .

Hence, if $f\in n{\cal RI}^{\sf fin}$ then $f^{-1}\in n{\cal RI}^{\sf fin}$ , and ${\rm domC}(f^{-1})={\rm imC}(f)$ , ${\rm imC}(f^{-1})={\rm domC}(f)$ .

Proof. For every $p_{1}\in{\rm domC}(f)$ : $\,f(p_{1})=q_{1}u\in{\rm Im}(f)$ , for some $q_{1}\in{\rm imC}(f)$ and $u\in nA^{*}$ . Since $q_{1}\in{\rm Im}(f)$ , $q_{1}=f(p_{2}v)$ for some $p_{2}\in{\rm domC}(f)$ and $v\in nA^{*}$ . Hence, $q_{1}u=f(p_{2}v)\ u=f(p_{2}vu)$ . Thus, $f(p_{1})=q_{1}u=f(p_{2}vu)$ . Since $f$ is injective, this implies that $p_{1}=p_{2}vu$ . Since $p_{1},p_{2}\in{\rm domC}(f)$ , which is an initial factor code, $p_{1}=p_{2}$ and $u=v=(\varepsilon)^{n}$ . Hence, $f(p_{1})=q_{1}u=q_{1}\in{\rm imC}(f)$ . So $f({\rm domC}(f))\subseteq{\rm imC}(f)$ .

Conversely, if $q\in{\rm imC}(f)$ , then $q=f(p)\,v$ for some $p\in{\rm domC}(f)$ and $v\in nA^{*}$ . Since $f(p)\in{\rm Im}(f)$ and $q\in{\rm imC}(f)$ (which is the initial factor code that generates ${\rm Im}(f)$ ), we conclude that $q=f(p)$ and $v=(\varepsilon)^{n}$ . Hence, $q\in f({\rm domC}(f)$ . So, ${\rm imC}(f)\subseteq f({\rm domC}(f))$ .

Now $f^{-1}$ satisfies the following: For $q\in{\rm imC}(f)$ , $f^{-1}(q)=p$ iff $p\in{\rm domC}(f)$ and $f(p)=q$ . Hence $f^{-1}\in n{\cal RI}^{\sf fin}$ , and ${\rm domC}(f^{-1})={\rm imC}(f)$ , and ${\rm imC}(f^{-1})={\rm domC}(f)$ . $\Box$

Lemma 2.21

Let $f\in n{\cal RI}^{\sf fin}$ and let $P\subset nA^{*}$ be a finite set.

(1.1)* If $P\subset{\rm Dom}(f)$ we have: $f(P)$ is joinless iff $P$ is joinless.*

(1.2)* If $P\subset{\rm Dom}(f)$ and $P$ is joinless, we have: $P$ is maximal iff $f(P)$ is maximal.*

(2.1)* In general (not assuming $P\subset{\rm Dom}(f)$ ), we have:*

$f(P\,\vee\,{\rm domC}(f))$ * is joinless iff $P$ is joinless.*

(2.2)* In general, if $P$ is joinless then the following are equivalent:*

$P$ * is maximal,*

$P\vee{\rm domC}(f)\$ * is maximal,*

$f(P\vee{\rm domC}(f))\$ * is maximal.*

Proof. (1.1) $[\Leftarrow]$ Let $p,q\in P$ , and assume by contradiction that there exists $z\in nA^{*}$ such that $f(p)$ and $f(q)$ are initial factors of $z$ . Then $z=f(p)\,u=f(q)\,v$ for some $u,v\in nA^{*}$ . Hence, $f^{-1}(z)=f^{-1}(f(p)\,u)$ $=$ $f^{-1}(f(p))\ u$ ; the latter holds since $f^{-1}\in n{\cal RI}^{\sf fin}$ , and $f(p)\in{\rm Dom}(f^{-1})$ $=$ ${\rm Im}(f)$ (by Lemma 2.20). Hence, $f^{-1}(z)=pu$ . Similarly, $f^{-1}(z)=qv$ . So, $pu=qv$ , but that contradicts the assumption that $P$ is joinless.

(1.1) $[\Rightarrow]$ Conversely, if some $p,q\in P$ have a join $z$ then $z=pu=qv$ for some $u,v\in\in nA^{*}$ . Then $\,f(z)=f(p)\,u=f(q)\,v$ , so $f(p)\vee f(q)$ exists, hence $f(P)$ is not joinless.

(1.2) $[\Rightarrow]$ Suppose $P$ is maximal, and assume by contradiction that $f(P)$ is not maximal. Then there exists $x\in nA^{*}$ such that $\{x\}\cup f(P)$ is a joinless code. Since $f^{-1}\in n{\cal RI}^{\sf fin}$ , $f^{-1}(\{x\}\cup f(P))$ is joinless (by what was proved in the previous paragraph). So, $f^{-1}(\{x\}\cup f(P))=P\cup\{f^{-1}(x)\}$ is joinless, which contradicts $P$ the assumption that $P$ is maximal. Thus, if $P$ is maximal then $f(P)$ is maximal.

(1.2) $[\Leftarrow]$ Similarly, if $f(P)$ is maximal then $f^{-1}f(P)$ is maximal (since $f^{-1}\in n{\cal RI}^{\sf fin}$ ). Hence if $f(P)$ is maximal, $P$ is maximal.

(2.1) If $P$ is joinless iff $P\vee{\rm domC}(f)$ is joinless, by Lemma 2.18(1), since ${\rm domC}(f)$ is joinless for all $f\in n{\cal RI}^{\sf fin}$ . And $P\vee{\rm domC}(f)$ is joinless iff $f(P\vee{\rm domC}(f))$ is joinless, by (1.1).

(2.2) If $P$ is maximal then $P\vee{\rm domC}(f)$ is maximal by Lemma 2.18(2), since ${\rm domC}(f)$ is maximal for $f\in n{\cal RI}^{\sf fin}$ . This implies that $f(P\vee{\rm domC}(f))$ is maximal, by (1.2). And if $f(P\vee{\rm domC}(f))$ is maximal then $P\vee{\rm domC}(f)$ is maximal, again by (1.2). Moreover, maximality of $P\vee{\rm domC}(f)$ implies maximality of $P$ (and of ${\rm domC}(f)$ ), by Lemma 2.18(2). $\Box$

Every right ideal morphism $f\in n{\cal RI}^{\sf fin}$ is uniquely determined by its restriction to ${\rm domC}(f)$ ; this is an obvious consequence of the fact that $f$ is a right-ideal morphism and ${\rm domC}(f)$ is a joinless code. So $f$ is determined by the finite function $\,f$ : ${\rm domC}(f)\to{\rm imC}(f)$ .

Conversely, let $P,Q\subset nA^{*}$ be two finite maximal joinless codes with the same cardinality, and let $F$ : $P\to Q$ by any bijection from $P$ onto $Q$ . Then $F$ determines a right ideal morphism $f$ of $nA^{*}$ , such that $F$ is the restriction of $f$ to its domain code; $f$ is defined in a unique way by $f(pv)=F(p)\ v$ for all $p\in P$ , $\,v\in nA^{*}$ . Since $P$ is joinless, $f$ is well defined.

Definition 2.22

(table).* A bijection $F$ : $P\to Q$ between finite maximal joinless codes $P,Q\subset nA^{*}\,$ is called a table.*

Tables and right ideal morphisms in $n{\cal RI}^{\sf fin}$ determine each other bijectively, and can the treated as “the same thing”.

Every function $f\in n{\cal RI}^{\sf fin}$ determines a permutation $f^{(\omega)}$ of $nA^{\omega}$ , as follows. For any $w\in nA^{\omega}$ there exists a unique $p\in{\rm domC}(f)$ such that $w=pu$ for some $u\in nA^{\omega}$ , by Lemma 2.8. Then we define $f^{(\omega)}$ by

$\,f^{(\omega)}(w)=f(p)\ u$ .

The converse does not hold; i.e., $f\in n{\cal RI}^{\sf fin}$ is not determined by $f^{(\omega)}$ , as will be seen in Lemma 2.24.

Definition 2.23

(end-equivalence).* Two right ideal morphisms $f,g\in n{\cal RI}^{\sf fin}$ are end-equivalent iff $f$ and $g$ agree on $\,{\rm Dom}(f)\,\cap\,{\rm Dom}(g)$ . This will be denoted by $f\equiv_{\rm end}g$ .*

By Prop. 2.18, $\,{\rm Dom}(f)\cap{\rm Dom}(g)$ is generated by a joinless code, namely $\,{\rm domC}(f)\vee{\rm domC}(g)$ .

In [9] the congruence $\equiv_{\rm end}$ is defined in much greater generality, and other congruences are introduced.

Lemma 2.24

For all $f,g\in n{\cal RI}^{\sf fin}$ : $f\equiv_{\rm end}g$ iff $f^{(\omega)}=g^{(\omega)}$ .

Proof. For every $f\in n{\cal RI}^{\sf fin}$ , ${\rm domC}(f)$ and ${\rm imC}(f)$ are maximal joinless codes. Therefore (by Lemma 2.9): ${\rm domC}(f)\cdot(nA^{\omega})$ $=$ $nA^{\omega}$ $=$ ${\rm imC}(f)\cdot(nA^{\omega})$ . And by Lemma 2.18, $\,{\rm domC}(f)\vee{\rm domC}(g)$ is also a maximal joinless code.

Let $R={\rm Dom}(f)\cap{\rm Dom}(g)$ , and let $f|_{R}$ and $g|_{R}$ be the restrictions of $f$ or $g$ to $R$ . Then $f\equiv_{\rm end}g$ is equivalent to $f|_{R}=g|_{R}$ .

$[\Rightarrow]$ Suppose $f\equiv_{\rm end}g$ , i.e., $f|_{R}=g|_{R}$ , where $R={\rm Dom}(f)\cap{\rm Dom}(g)$ . For every $w\in nA^{\omega}$ , let $z\in nA^{*}$ be an initial factor of $w$ such that in all coordinates, $z$ is longer than the longest coordinate of any element of $P={\rm domC}(f)\vee{\rm domC}(g)$ . And $w=zu$ for some $u\in nA^{\omega}$ . Since $P$ is a maximal joinless code, $z$ has a join with an element of $P$ ; by the chosen length of $z$ , $z$ has an initial factor in $P$ , hence $z\in R$ . Now $f^{(\omega)}(zu)=f(z)\ u$ , since $z\in R\subseteq{\rm Dom}(f)$ ; and $g^{(\omega)}(zu)=g(z)\ u$ , since $z\in R\subseteq{\rm Dom}(g)$ . Since $f(z)=g(z)$ (because $f|_{R}=g|_{R}$ ), if follows that $f^{(\omega)}(zu)=g^{(\omega)}(zu)$ .

$[\Leftarrow]$ Suppose $f^{(\omega)}=g^{(\omega)}$ . For every $r\in R$ and every $u\in nA^{\omega}$ , $f^{(\omega)}(ru)=g^{(\omega)}(ru)$ . And since $r\in R=$ ${\rm Dom}(f)\cap{\rm Dom}(g)$ , $f^{(\omega)}(ru)=f(r)\ u$ , and $g^{(\omega)}(ru)=g(r)\ u$ . From $f(r)\ u=g(r)\ u$ it follows that $f(r)=g(r)$ . Hence, $f|_{R}=g|_{R}$ , i.e., $f\equiv_{\rm end}g$ . $\Box$

Lemma 2.25

For all $f_{1},f_{2}\in n{\cal RI}^{\sf fin}$ : $(f_{2}\circ f_{1})^{(\omega)}=f_{2}^{(\omega)}\circ f_{1}^{(\omega)}$ .

The relation $\equiv_{\rm end}$ is a congruence on $n{\cal RI}^{\sf fin}$ .

Proof. For every $w\in nA^{\omega}$ there exist $r\in{\rm Dom}(f_{2}\circ f_{1})$ and $u\in nA^{\omega}$ such that $w=ru$ ; this follows from Lemma 2.8. Then $r\in{\rm Dom}(f_{1})$ and $f_{1}(r)\in{\rm Dom}(f_{2})$ . Now by the definition of $f^{(\omega)}(w)$ , ( $(f_{2}\circ f_{1})^{(\omega)}(w)=(f_{2}\circ f_{1})(r)\ u$ $=f_{2}(f_{1}(r))\ u$ . And $f_{2}^{(\omega)}(f_{1}^{(\omega)}(ru))=f_{2}^{(\omega)}(f_{1}(r)\,u)$ $=f_{2}(f_{1}(r))\ u$ ; the latter holds since $f_{1}(r)\in{\rm Dom}(f_{2})$ . This proves that $(f_{2}\circ f_{1})^{(\omega)}(w)=$ $f_{2}^{(\omega)}(f_{1}^{(\omega)}(w)$ .

It follows immediately that $\equiv_{\rm end}$ is a congruence on $n{\cal RI}^{\sf fin}$ (by Lemma 2.24). $\Box$

Next we develop criteria about extensions and restrictions of functions in $n{\cal RI}^{\sf fin}$ that enable us to decide efficiently whether two tables determine end-equivalent functions. The Remark below applies to finite maximal joinless codes in $\,2\,\{0,1\}^{*}$ and is similar to a criterion for end-equivalence of finite maximal prefix codes in $A^{*}$ . But because of Lemma 2.10(3) it does not apply for $n\geq 3$ . For $nA^{*}$ in general, Prop. 2.26 gives an efficient algorithm for deciding whether two tables determine end-equivalent functions.

Remark (extension-restriction criterion): Let $P,Q$ be finite maximal joinless codes in $\,2\,\{0,1\}^{*}$ , let $F$ : $P\to Q$ be a table, and let $f\in 2\,{\cal RI}^{\sf fin}$ be the corresponding right ideal morphism of $\,2\,\{0,1\}^{*}$ . Then: $f$ is extendable in $2\,{\cal RI}^{\sf fin}$ iff there exist $\ p=(p_{1},p_{2}),\ q=(q_{1},q_{2})\in 2\,\{0,1\}^{*}$ such that for every $a\in\{0,1\}$ :

(1) $\{(p_{1},p_{2}a):a\in\{0,1\}\}\,\subseteq\,P$ (or $\{(p_{1}a,p_{2}):a\in\{0,1\}\}\,\subseteq\,P$ ), and

(2) $\{(q_{1},q_{2}a):a\in\{0,1\}\}\,\subseteq\,Q$ (or $\{(q_{1}a,q_{2}):a\in\{0,1\}\}\,\subseteq\,Q$ ), and

(3) $F(p_{1},p_{2}a)\,=\,(q_{1},q_{2}a)$ (or $F(p_{1}a,p_{2})\,=\,(q_{1}a,q_{2})$ ).

In that case, let

$P^{\prime}\,=\,$ $(P$ $\smallsetminus$ $\{(p_{1},p_{2}a):a\in\{0,1\}\})\ \cup\ \{p\}$ , (or $(P\smallsetminus\{(p_{1}a,p_{2}):a\in\{0,1\}\})\ \cup\ \{p\}$ ), and

$Q^{\prime}\,=\,$ $(Q$ $\smallsetminus$ $\{(q_{1},q_{2}a):a\in\{0,1\}\})\ \cup\ \{q\}$ , (or $(Q\smallsetminus\{(q_{1}a,q_{2}):a\in\{0,1\}\})\ \cup\ \{q\}$ ).

Then $P^{\prime}$ and $Q^{\prime}$ are finite maximal joinless codes in $\,2\,\{0,1\}^{*}$ , and $f$ can be extended to a function $f^{\prime}\in 2\,{\cal RI}^{\sf fin}$ with table $F^{\prime}$ : $P^{\prime}\to Q^{\prime}\,$ defined by

$F^{\prime}(p)=q$ , and

$F^{\prime}(p^{\prime})=F(p^{\prime})\,$ for all $\,p^{\prime}\in P$ $\smallsetminus$ $\{(p_{1},p_{2}a):a\in\{0,1\}\}$ (or $P$ $\smallsetminus$ $\{(p_{1}a,p_{2}):a\in\{0,1\}\}$ ).

The passage from $f$ to $f^{\prime}$ is called a one-step extension, and $f$ is called a one-step restriction of $f^{\prime}$ .

The Remark follows from Lemma 2.10(2), in the same way as for prefix codes in $A^{*}$ (see [5, Lemma 2.2] and [25]). The Remark is not always applicable when $n\geq 3$ , by Lemma 2.10(3).

Proposition 2.26

(restrictions, and deciding $\,\equiv_{\rm end}$ ).**

(1)* Let $F:P\to Q$ be a table, and let $f\in n{\cal RI}_{A}^{\sf fin}$ be the right-ideal morphism given by this table. Suppose $P^{\prime}\cdot(nA^{*})\subseteq P\cdot(nA^{*})$ , where $P^{\prime}\subset nA^{*}$ is a finite maximal joinless code. Then the restriction $f^{\prime}=f|_{P^{\prime}\cdot(nA^{*})}\,$ of $f$ to $\,P^{\prime}\cdot(nA^{*})\,$ is an element of $\,n{\cal RI}_{A}^{\sf fin}$ with table $\,F^{\prime}:P^{\prime}\to f(P^{\prime})$ .*

Moreover, $\ f\equiv_{\rm end}f|_{P^{\prime}\cdot(nA^{*})}$ .

(2)* Let $F^{(j)}:P^{(j)}\to Q^{(j)}$ be tables (for $j=1,2$ ), and let $\,f^{(j)}\in n{\cal RI}_{A}^{\sf fin}$ be the right-ideal morphisms given by these tables. Then:*

$f^{(1)}\equiv_{\rm end}f^{(2)}$ * iff $F^{(1)}|_{P^{(1)}\vee P^{(2)}}$ $\,=\,$ $F^{(2)}|_{P^{(1)}\vee P^{(2)}}$ ,*

where $F^{(j)}|_{P^{(1)}\vee P^{(2)}}$ is the table of the restriction of $f^{(j)}$ to the finite maximal joinless code $P^{(1)}\vee P^{(2)}$ .

Hence there is a polynomial-time algorithm that decides whether the tables $F^{(1)}$ and $F^{(2)}$ represent $\equiv_{\rm end}$ -equivalent elements of $n{\cal RI}_{A}^{\sf fin}$ .

Proof. (1) The restricted table $F^{\prime}:P^{\prime}\to f(P^{\prime})$ is defined as follows. For every $p^{\prime}\in P^{\prime}$ there exists $p=(p_{1},\ldots,p_{n})\in P$ and $w=(w_{1},\ldots,w_{n})\in nA^{*}$ such that $p^{\prime}=pw$ . We define $F^{\prime}(p^{\prime})=F(p)\ w$ .

In order to verify that $F^{\prime}$ is a well defined function, suppose that $p^{\prime}=p^{(1)}\,u=p^{(2)}\,v$ for some $p^{(1)},p^{(2)}\in P$ and $u,v\in nA^{*}$ . Since $P$ is joinless, it follows that $p^{(1)}=p^{(2)}$ ; let $p^{(1)}=p^{(2)}=p$ . Since multiplication in $nA^{*}$ is cancelative, $pu=pv$ implies $u=v$ . So, $p^{\prime}\in P^{\prime}$ determines a unique $p\in P$ and $w=u=v\in nA^{*}$ such that $p^{\prime}=pw$ . Hence $F^{\prime}(p^{\prime})=F(p)\ w$ defines $F^{\prime}(p^{\prime})$ in a unique way.

(2) The largest common restriction of $f^{(1)}$ and $f^{(2)}$ is $f^{(1)}\cap f^{(2)}$ , which has domain ${\rm Dom}(f^{(1)})\cap{\rm Dom}(f^{(2)}$ , and domain code ${\rm domC}(f^{(1)}\cap f^{(2)})=P^{(1)}\vee P^{(2)}$ (by Lemma 2.18). So, $f^{(1)}\equiv_{\rm end}f^{(2)}\,$ iff the tables of $f^{(1)}$ and $f^{(2)}$ , restricted to $P^{(1)}\vee P^{(2)}$ , are the same.

We have $\,|P^{(1)}\vee P^{(2)}|\leq|P^{(1)}|\cdot|P^{(2)}|$ . And for all $p^{(1)}\in P^{(1)}$ and $p^{(2)}\in P^{(2)}$ , $|p^{(1)}\vee p^{(2)}|_{\rm max}\leq$ $\max\{|p^{(1)}|_{\rm max},\,|p^{(1)}|_{\rm max}\}$ . (Recall the notation $|x|_{\rm max}=\max\{|x_{i}|:1\leq i\leq n\}$ .) Hence one can check in polynomial time whether $f^{(1)}\equiv_{\rm end}f^{(2)}$ . $\Box$

Lemma 2.27

(non-uniqueness of maximal extensions).* There exists a right ideal morphism $f\in 2\,{\cal RI}^{\sf fin}_{2}$ such that $f$ has two maximal extensions in $2\,{\cal RI}^{\sf fin}_{2}$ .*

As a consequence, $nV$ with $n\geq 2$ cannot be defined by maximum extended morphisms (unlike $V$ ).

Proof. Let $f$ be defined by $\,{\rm domC}(f)={\rm imC}(f)=$ $\{(0,0),(0,1),(1,0),(1,10),(1,11)\}$ , and the table

[TABLE]

.

The geometric representation of $f$ is given in Fig. 2 (with mapping-by-number as in [14]):

\begin{picture}(110.0,60.0)\par\put(5.0,10.0){\framebox(40.0,20.0)[]{}} \put(5.0,10.0){\framebox(20.0,40.0)[]{}} \put(25.0,30.0){\framebox(20.0,10.0)[]{}} \put(25.0,40.0){\framebox(20.0,10.0)[]{}} \put(15.0,20.0){\makebox(0.0,0.0)[cc]{\sf 1}} \put(35.0,20.0){\makebox(0.0,0.0)[cc]{\sf 2}} \put(15.0,40.0){\makebox(0.0,0.0)[cc]{\sf 3}} \put(35.0,35.0){\makebox(0.0,0.0)[cc]{\sf 4}} \put(35.0,45.0){\makebox(0.0,0.0)[cc]{\sf 5}} \par\put(60.0,27.0){\vector(1,0){20.0}} \put(70.0,30.0){\makebox(0.0,0.0)[cc]{ $f$ }} \par\put(95.0,10.0){\framebox(40.0,20.0)[]{}} \put(95.0,10.0){\framebox(20.0,40.0)[]{}} \put(115.0,30.0){\framebox(20.0,10.0)[]{}} \put(115.0,40.0){\framebox(20.0,10.0)[]{}} \put(105.0,20.0){\makebox(0.0,0.0)[cc]{\sf 1}} \put(125.0,20.0){\makebox(0.0,0.0)[cc]{\sf 2}} \put(105.0,40.0){\makebox(0.0,0.0)[cc]{\sf 3}} \put(125.0,35.0){\makebox(0.0,0.0)[cc]{\sf 5}} \put(125.0,45.0){\makebox(0.0,0.0)[cc]{\sf 4}} \par\put(-2.0,0.0){\makebox(0.0,0.0)[cc]{\sf Fig. 2}} \end{picture}

12345

$f$

12354

Fig. 2

In Fig. 2, the squares labeled “1” and “2” could be merged into one binary rectangle; alternatively, the squares labeled “1” and “3” could be merged into one binary rectangle. After either step, no further extension is possible. Thus, $f$ has the following two maximal extensions $F_{1}$ and $F_{2}$ :

(1) ${\rm domC}(F_{1})={\rm imC}(F_{1})=$ $\{(\varepsilon,0),(0,1),(1,10),(1,11)\}$ , and

$F_{1}\,=\,\{((\varepsilon,0),(\varepsilon,0)),\ ((0,1),(0,1)),$ $\ ((1,10),(1,11)),\ ((1,11),(1,10))\}$ .

(2) ${\rm domC}(F_{2})={\rm imC}(F_{2})=$ $\{(0,\varepsilon),(1,0),(1,10),(1,11)\}$ , and

$F_{2}\,=\,\{((0,\varepsilon),(0,\varepsilon)),\ ((1,0),(1,0)),$ $\ ((1,10),(1,11)),\ ((1,11),(1,10))\}$ . $\Box$

We now give the definition of $nG_{k,1}$ and $nV$ based on strings.

Definition 2.28

(Brin-Thompson groups $nV$ and $nG_{k,1}$ ).* Let $A=\{0,\,\ldots,k-1\}$ and $n\geq 2$ . The Brin-Thompson group $nG_{k,1}$ is $\ n\,{\cal RI}^{\sf fin}_{A}/\!\!\equiv_{\rm end}$ . Equivalently, $nG_{k,1}$ is the group determined by the action of $n\,{\cal RI}^{\sf fin}_{A}$ on $nA^{\omega}$ . When $k=2$ we obtain $nV$ .*

Every element of $nG_{k,1}$ can be represented (in infinitely many ways) by a table of the form $F$ : $P\to Q$ , which is a bijection between two finite maximal joinless codes.

Lemma 2.29

(composition in $nG_{k,1}$ based on tables).* Let $F_{j}:P_{j}\to Q_{j}$ be a table representing $f_{j}\in n\,{\cal RI}^{\sf fin}$ , which in turn determines $f_{j}^{(\omega)}\in nG_{k,1}$ (for $j=1,2$ ). Then the composite $\,f_{2}\circ f_{1}$ , and hence also $f_{2}^{(\omega)}\circ f_{1}^{(\omega)}$ , is represented by the table*

$(f_{2}\circ f_{1})|_{P}:\,P\to Q$ , where

$P=f_{1}^{-1}(P_{2}\vee Q_{1})$ ,

$Q=f_{2}(P_{2}\vee Q_{1})$ .

Proof. It is a general fact about partial functions $f_{2},f_{1}$ , that $\,{\rm Dom}(f_{2}\circ f_{1})=f_{1}^{-1}({\rm Dom}(f_{2})\cap{\rm Im}(f_{1}))$ , and $\,{\rm Im}(f_{2}\circ f_{1})=f_{2}({\rm Dom}(f_{2})\cap{\rm Im}(f_{1}))$ . Obviously, $f_{2}\circ f_{1}=(f_{2}\circ f_{1})|_{{\rm Dom}(f_{2}\circ f_{1})}$ .

For $f_{2},f_{1}\in n\,{\cal RI}^{\sf fin}$ , given by tables, ${\rm Dom}(f_{2})\cap{\rm Im}(f_{1})=(P_{2}\vee Q_{1})\cdot(nA^{*})\,$ (by Lemma 2.18). And $f_{1}^{-1}(P_{2}\vee Q_{1})$ and $f_{2}(P_{2}\vee Q_{1})$ are maximal joinless codes (by Lemmas 2.20 and 2.21). Moreover, $f_{1}^{-1}({\rm Dom}(f_{2})\cap{\rm Im}(f_{1}))$ $=$ $f_{1}^{-1}(P_{2}\vee Q_{1})\cdot(nA^{*})$ , and $\,f_{2}({\rm Dom}(f_{2})\cap{\rm Im}(f_{1}))$ $=$ $f_{2}(P_{2}\vee Q_{1})\cdot(nA^{*})$ . Hence, $(f_{2}\circ f_{1})|_{{\rm domC}(f_{2}\circ f_{1})}$ is given by the table described in the Lemma. $\Box$

3 The word problem of $nV$ is in coNP

For a fixed group $G$ with a fixed finite generating set $\Gamma$ the word problem is the following decision problem.

Input: A string $\,w\in(\Gamma^{\pm 1})^{*}$ .

Question: Does $w$ represent the identity element of $G$ ?

We mentioned in the Introduction that $nV$ is finitely generated for all $n\geq 1$ . The groups $nG_{k,1}$ , for $k>2$ , are presumably finitely generated too, but this has not been proved, so we will only address the word problem of $nV$ here.

Notation. We mostly use the alphabet $A=\{0,1,\,\ldots\,,k-1\}$ , usually with $k=2$ .

For any integer $j\geq 0$ , let $nA^{\leq j}\,=\,\{(x_{1},\ldots,x_{n})\in nA^{*}:$ $\,|x_{i}|\leq j$ for $i=1,\ldots,n\}$ ; for a string $w\in A^{*}$ , $\,|w|$ denotes the length of $w$ .

Definition of coNP and NP: We use the following logic-based definitions of coNP and NP (see e.g., [23]). Let $\Gamma$ be a finite alphabet. A set $S\subseteq\Gamma^{*}$ is in coNP iff there exists $m\geq 1$ , a two-variable predicate $\,R(.,.)\subseteq mA^{*}\times\Gamma^{*}$ , and a polynomial $p(.)$ , such that

(1) $R\in{\sf P}$ (i.e., the membership problem of $R$ is in P);

(2) $S\,=\,\{w\in\Gamma^{*}\,:\ \$ $(\forall x\in mA^{\leq p(|w|)})\,R(x,w)\,\}$ .

Similarly, $S$ is in NP iff for some $m\geq 1$ , some $\,R(.,.)\subseteq mA^{*}\times\Gamma^{*}$ in P, and some polynomial $p(.)$ ,

$S\,=\,\{w\in\Gamma^{*}\,:\ \$ $(\exists x\in mA^{\leq p(|w|)})\,R(x,w)\,\}$ .

Definition 3.1

(max length).**

For every $z=(z_{1},\,\ldots,z_{n})\in nA^{*}$ : $\ell(z)\,=\,\max\{|z_{1}|,\,\ldots,|z_{n}|\}$ .

For every finite set $P\subset nA^{*}$ : $\ell(P)$ $\,=\,$ $\max\{\ell(z)\,:\,z\in P\}$ .

*For every $f\in n\,{\cal RI}^{\sf fin}$ : $\ell(f)$ $\,=\,$ $\max\{\ell(z)\,:\,z\in{\rm domC}(f)\,\cup\,{\rm imC}(f)\}$ . *

Proposition 3.2

(length formula).* For all $f_{2},f_{1}\in n{\cal RI}^{\sf fin}$ : $\ell(f_{2}\circ f_{1})\,\leq\,\ell(f_{2})+\ell(f_{1})$ .*

Proof. Let $F_{i}:P_{i}\to Q_{i}$ be a table for $f_{i}$ ( $i=1,2$ ). Recall the table for $f_{2}\circ f_{1}$ , given in Lemma 2.29. We have:

(L1) $\ell(P_{2}\vee Q_{1})=\max\{\ell(P_{2}),\ \ell(Q_{1})\}$ .

Indeed, for every $p=(p_{1},\,\ldots,p_{n})\in P_{2}$ , $q=(q_{1},\,\ldots,q_{n})\in Q_{1}$ , and $i\in\{1,\,\ldots,n\}$ we have: $|(p\vee q)_{i}|=\max\{|p_{i}|,\,|q_{i}|\}\,$ (by Lemma 2.5).

We have:

(L2) $\ell(f_{2}(P_{2}\vee Q_{1}))$ $\,\leq\,$ $\ell(Q_{2})+\ell(Q_{1})$ $\,\leq\,$ $\ell(f_{2})+\ell(f_{1})$ .

Indeed, $(p\vee q)_{i}=\max_{\leq_{\rm pref}}\{p_{i},q_{i}\}$ , for every $p\in P_{2}$ , $q\in Q_{1}$ , and $i\in\{1,\,\ldots,n\}$ . By Prop. 2.18(3), $p\vee q\in{\rm Dom}(f_{2})$ . Since $p$ is an initial factor of $p\vee q$ there exists $u\in nA^{*}$ such that $pu=p\vee q$ . Since $(p\vee q)_{i}=\max_{\leq_{\rm pref}}\{p_{i},q_{i}\}$ , the following holds: $\,u_{i}=\varepsilon$ when $(p\vee q)_{i}=p_{i}$ , and $u_{i}$ is a suffix of $q_{i}$ when $(p\vee q)_{i}=q_{i}$ . Hence, $\ell(u)\leq\ell(q)$ . Now, $f_{2}(p\vee q)=f_{2}(p)\,u$ , where $f_{2}(p)\in Q_{2}$ (since $p\in P_{2}$ ). And $q\in Q_{1}$ . Hence $\ell(f_{2}(p\vee q))\leq\ell(f_{2}(p))+\ell(u)$ $\leq$ $\ell(Q_{2})+\ell(Q_{1})$ .

We also have:

(L3) $\ell(f_{1}^{-1}(P_{2}\vee Q_{1}))$ $\,\leq\,$ $\ell(P_{1})+\ell(P_{2})$ $\,\leq\,$ $\ell(f_{2})+\ell(f_{1})$ .

Indeed, $f_{1}^{-1}$ is given by the table $f_{1}^{-1}|_{Q_{1}}$ : $Q_{1}\to P_{1}$ . Consider any $p\vee q$ for $p\in P_{2}$ , $q\in Q_{1}$ . Since $q$ is an initial factor of $p\vee q$ there exists $v\in nA^{*}$ such that $qv=p\vee q$ . Since $(p\vee q)_{i}=\max_{\leq_{\rm pref}}\{p_{i},q_{i}\}$ , the following holds: $\,v_{i}=\varepsilon$ when $(p\vee q)_{i}=q_{i}$ , and $v_{i}$ is a suffix of $p_{i}$ when $(p\vee q)_{i}=p_{i}$ . Hence, $\ell(v)\leq\ell(p)$ . Now, $f_{1}^{-1}(p\vee q)=f_{1}^{-1}(q)\ v$ , where $f_{1}^{-1}(q)\in P_{1}$ (since $q\in Q_{1}$ ). And $p\in P_{2}$ . Hence $\ell(f_{1}^{-1}(p\vee q))$ $\leq$ $\ell(f_{1}^{-1}(q))+\ell(v)$ $\leq$ $\ell(P_{1})+\ell(P_{2})$ .

Finally, since $\,\ell(f_{2}\circ f_{1})$ $\,=\,$ $\max\{\ell(f_{1}^{-1}(P_{2}\vee Q_{1}))$ , $\ell(f_{2}(P_{2}\vee Q_{1}))\}$ , we obtain:

(L4) $\,\ell(f_{2}\circ f_{1})\,\leq\,\max\{\ell(Q_{2})+\ell(Q_{1}),\$ $\ell(P_{1})+\ell(P_{2})\}$ $\,\leq\,$ $\ell(f_{2})+\ell(f_{1})$ .

$\Box$

Corollary 3.3

Let $f_{t},\,\ldots,f_{1}\in n{\cal RI}^{\sf fin}$ , and let $\lambda\in{\mathbb{N}}$ be such that $\ell(f_{j})\leq\lambda$ for $j=1,\,\ldots,t$ . Then $\ell(f_{t}\circ\,\ldots\,\circ f_{1})\leq\lambda\,t$ . $\Box$

Lemma 3.4

For any $f\in n{\cal RI}^{\sf fin}$ and $\lambda=\ell({\rm domC}(f))$ we have: $nA^{\lambda}\subset{\rm Dom}(f)$ .

Hence, $f|_{nA^{\lambda}}$ determines $f^{(\omega)}$ . In particular, $f|_{nA^{\lambda}}={\sf id}|_{nA^{\lambda}}\,$ iff $\,f^{(\omega)}={\bf 1}\,$ in $\,nG_{k,1}$ .

Proof. By Coroll. 2.14(0), $nA^{\lambda}$ is a maximal joinless code, hence every element $p\in{\rm domC}(f)$ has a join with some element $u\in nA^{\lambda}$ . Since $\lambda=\ell({\rm domC}(f))$ , $p$ is actually an initial factor of $u$ . Hence, $u\in P\cdot(nA^{*})$ ( $={\rm Dom}(f)$ ). This proves that $nA^{\lambda}\subset{\rm Dom}(f)$ .

For any finite maximal joinless code $P\subset{\rm Dom}(f)$ , the restriction $f|_{P}$ : $P\to f(P)$ is a table for $f^{(\omega)}$ , hence it determines $f^{(\omega)}$ . Since $nA^{\lambda}$ is a finite maximal joinless code contained in ${\rm Dom}(f)$ , the result follows. $\Box$

Lemma 3.5

The word problem of $nV$ over any finite generating set belongs to coNP.

Proof. Let $\Gamma$ be any finite generating set of $nV$ . To simplify the notation we assume that $\Gamma$ is closed under inverse, i.e., $\Gamma=\Gamma^{\pm 1}$ . Every $\gamma\in\Gamma$ is represented by a table $F_{\gamma}$ : $P_{\gamma}\to Q_{\gamma}$ . For any $w\in\Gamma^{*}$ , let $f_{w}\in n{\cal RI}^{\sf fin}$ be the function obtained by composing the generators in $w$ (given by tables). Let $F_{w}$ : $P\to Q$ be the table of $f_{w}$ . By Prop. 3.2 and Coroll. 3.3: $\,\ell(f_{w})\leq c_{{}_{\Gamma}}\,|w|$ , where $\,c_{{}_{\Gamma}}=\max\{\ell(\gamma):\gamma\in\Gamma\}$ , and $|w|$ denotes the length of $w$ as a word over $\Gamma$ . So $c_{{}_{\Gamma}}$ is a known constant, determined by the finite generating set $\Gamma$ .

For the word problem we have: $w={\bf 1}$ in $nV$ iff $f_{w}^{(\omega)}={\sf id}\,$ (the identity function on $A^{\omega}$ ) iff $\,f_{w}={\sf id}|_{{\rm Dom}(f_{w})}$ in $n{\cal RI}^{\sf fin}$ . Since ${\rm domC}(f_{w})=P\,$ (in the table $F_{w}$ : $P\to Q$ ), we have: $\,f_{w}={\sf id}|_{{\rm Dom}(f_{w})}\,$ iff $\,P=Q\,$ and $\,F_{w}={\sf id}|_{P}$ . By Prop. 3.2, $\,P\cup Q\subset$ $nA^{\leq\,c_{{}_{\Gamma}}|w|}$ . We can further restrict $f_{w}$ to $nA^{c_{{}_{\Gamma}}|w|}\cdot(nA^{*})$ ; then by Lemma 3.4, we obtain the following coNP-formula for the word problem:

$w={\bf 1}$ in $nV$ iff $(\forall x\in nA^{c_{{}_{\Gamma}}|w|})$ $[\,f_{w}(x)=x\,]$ .

We still need to show that the predicate $R(x,w)$ , defined by

$R(x,w)$ $\Leftrightarrow$ $[\,(\forall i\in\{1,\ldots,n\})[|x_{i}|=c_{{}_{\Gamma}}|w|]$ $\ \Rightarrow\$ $f_{w}(x)=x\,]$ ,

belongs to P. I.e., we want a deterministic polynomial-time algorithm that on input $w\in\Gamma^{*}$ and $x\in nA^{c_{{}_{\Gamma}}|w|}$ , checks whether $f_{w}(x)=x$ . To do this we apply, to $x\in nA^{c_{{}_{\Gamma}}|w|}$ , the tables of the generators $\gamma_{j}\in\Gamma$ that appear in $w=\gamma_{t}\,\,\ldots\,\gamma_{1}$ . We compute $\,x\longmapsto\gamma_{1}(x)=y^{(1)}$ $\longmapsto$ $\gamma_{2}(y^{(1)})=y^{(2)}$ $\longmapsto$ $\ \ \ldots\ \$ $\longmapsto$ $\gamma_{t}(y^{(t-1)})=y^{(t)}=f_{w}(x)$ . Since $x\in nA^{c_{{}_{\Gamma}}|w|}$ $\subseteq$ ${\rm Dom}(f_{w})$ , every $y^{(j)}$ is defined. Moreover, $x=pu$ for some $p\in P$ and $u\in nA^{\leq c_{{}_{\Gamma}}|w|}$ . By Prop. 3.2, $\,|y^{(j)}|\leq\,n\,c_{{}_{\Gamma}}\,j+|u|\,$ $\leq$ $\,2n\,c_{{}_{\Gamma}}\,|w|$ . After computing $y^{(t)}$ we check whether $y^{(t)}=x$ .

The application of the table of $\gamma_{j}$ to $y^{(j-1)}$ takes time proportional to $\,|y^{(j-1)}|\,$ (for $j=1,\,\ldots,t$ ). So, the time complexity of verifying whether $x$ and $w$ satisfy the predicate is (up to a constant multiple) $\ \leq\$ $|x|\,+\,\sum_{j=1}^{t}|y^{(j)}|$ $\,\leq\,$ $n\,c_{{}_{\Gamma}}\,|w|+|w|\cdot 2\,n\,c_{{}_{\Gamma}}\,|w|$ . Hence the time-complexity of the predicate is quadratic in $|w|$ . $\Box$

4 coNP-completeness of the word problem of $nV$

In this section we prove that the word problem of $nV$ with $n\geq 2$ , over any finite generating set, is coNP-hard with respect to polynomial-time many-one reduction. The result for all $nV,n\geq 2$ , follows quickly from the result for $2V$ . We proved already in Lemma 3.5 that the word problem of $nV$ belongs to coNP.

For $2V$ , coNP-hardness of the word problem follows fairly directly from the coNP-hardness of the word problem of $V$ over the infinite generating set $\Gamma_{\!V}\cup\tau$ , by making use of the shift $\sigma\,$ (subsection 4.6). Here, $\Gamma_{\!V}$ is any finite generating set of $V$ and $\tau$ is the set of position transpositions $\{\tau_{i,i+1}:i\geq 1\}$ .

At the end of subsection 4.1 we show that the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ belongs to coNP. The main difficulty is to prove that the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ is coNP-hard; this is proved in subsections 4.2 - 4.4, by constructing a binary conjunctive polynomial-time reduction of the circuit equivalence problem to this word problem. An alternative proof, that gives a polynomial-time many-one reduction, appears in subsection 4.5.

4.1 Preliminaries on the word problem and complexity

We give some definitions and facts about complexity and the word problem of a group, especially when an infinite generating set is used. Here we use finite and infinite alphabets (but we always point out when an alphabet is infinite).

Definition 4.1

Let $\Sigma_{1},\Sigma_{2}$ be two finite alphabets, and let $m$ be a positive integer. A polynomial-time conjunctive reduction of arity $m$ from $L_{1}\subseteq\Sigma_{1}^{*}$ to $L_{2}\subseteq\Sigma_{2}^{*}$ is a polynomial-time computable total function $\,\rho:\Sigma_{1}^{*}\,\to\,m\Sigma_{2}^{*}$ such that for all $x\in\Sigma_{1}^{*}$ :

$x\in L_{1}$ * iff $\rho(x)\,\in\ {\large\sf X}_{{}_{j=1}}^{{}^{m}}L_{2}$ ( $=mL_{2}$ ).*

Equivalently, $L_{1}=\rho^{-1}({\large\sf X}_{{}_{j=1}}^{{}^{m}}L_{2})$ . In other words, $\rho$ reduces the problem $L_{1}$ to $m$ instances of the problem $L_{2}$ , and the $m$ answers are combined by “and”.

A polynomial-time conjunctive reduction of arity 1 is called a many-one reduction.

The reductions in Def. 4.1 are a very special case of polynomial-time truth-table reductions; see e.g. [20, Def. 7.18]. It is straightforward to show that each of P, NP, and coNP, is closed under downward polynomial-time conjunctive reduction of bounded arity.

In this paper we use the following definition of coNP-hardness and coNP-completeness.

Definition 4.2

Let $\Sigma_{0}$ be a finite alphabet. A problem $L_{0}\subseteq\Sigma_{0}^{*}$ is coNP-hard iff for every finite alphabet $\Sigma$ and every problem $L\subseteq\Sigma^{*}$ there exists a polynomial-time conjunctive reduction $\rho$ of bounded arity that reduces $L$ to $L_{0}$ .

Moreover, $L_{0}\subseteq\Sigma_{0}^{*}$ is coNP-complete iff $L_{0}$ is coNP-hard and $L_{0}$ belongs to coNP.

There are many well-known coNP-complete problems, e.g., the tautology problem for boolean formulas, the integer linear programming equivalence problem, the 4-coloring problem, the connectivity lower-bound problem (see e.g. [22], [6, Introduction]). We will use the equivalence problem for acyclic boolean circuits (defined in Section 4.2).

Definition 4.3

Let $G$ be a group, and let $\Gamma$ ( $\,\subseteq G$ ) be a (possibly infinite) generating set for $G$ . For words $w_{1},w_{2}\in(\Gamma^{\pm 1})^{*}$ we say that “ $w_{1}=w_{2}$ in $G$ ” iff the generator sequences $w_{1}$ and $w_{2}$ have the same value when their elements are multiplied in $G$ . In a similar way, for $w\in(\Gamma^{\pm 1})^{*}$ and $g\in G$ , we say “ $w=g$ in $G$ ” iff $g$ is the value obtained when the elements of $w$ are multiplied in $G$ . We also use the notation $w_{1}=_{G}w_{2}$ or $w=_{G}g$ for this.

To simplify the notation, we will from now on take group generating sets $\Gamma$ that are closed under inverse, i.e., $\Gamma=\Gamma^{\pm 1}$ .

Lemma 4.4

(folklore).* Let $G_{2}$ be a finitely generated subgroup of a finitely generated group $G_{1}$ , and let $\Gamma_{i}$ be a finite generating set of $G_{i}$ for $i=1,2$ .*

(1)* If the word problem of $G_{1}$ over $\Gamma_{1}$ is decidable in deterministic (or nondeterministic, or co-nondeterministic) time $\leq t_{1}(.)$ , then the word problem of $G_{2}$ over $\Gamma_{2}$ is decidable in deterministic (respectively in nondeterministic, or co-nondeterministic) time $\leq t_{1}(O(.))$ .*

(2)* If a problem $L\subseteq\Sigma^{*}$ (where $\Sigma$ is finite) is reducible to the word problem of $G_{2}$ over $\Gamma_{2}$ by a polynomial-time conjunctive reduction of arity $m$ , then $L$ is also reducible to the word problem of $G_{1}$ over $\Gamma_{1}$ by a polynomial-time conjunctive reduction of arity $m$ .*

Proof. To simplify the notation, let us assume that $\Gamma_{1}$ and $\Gamma_{2}$ are closed under inverse.

(1) Since $G_{2}\subseteq G_{1}$ , for every generator $\gamma\in\Gamma_{2}$ there exists a word $w_{\gamma}\in\Gamma_{1}^{\ *}$ such that $\,\gamma=_{G_{1}}w_{\gamma}$ . Then the total function

$\rho_{2,1}:\ x_{1}\,\ldots x_{n}\in\Gamma_{2}^{\ *}$ $\ \longmapsto\$ $w_{x_{1}}\cdot\,\ldots\,\cdot w_{x_{n}}\in\Gamma_{1}^{\ *}$

is a one-to-one linear-time reduction of the word problem of $G_{2}$ over $\Gamma_{2}$ to the word problem of $G_{1}$ over $\Gamma_{1}$ ; here, “ $\cdot$ ” denotes concatenation. The length of $w_{x_{1}}\cdot\ldots\cdot w_{x_{n}}$ is $\,\leq c\,|x_{1}\ldots x_{n}|$ , where $\,c=\max\{|w_{\gamma}|:\gamma\in\Gamma_{2}\}$ . Hence, if the word problem of $G_{1}$ over $\Gamma_{1}$ has time-complexity $\leq t_{1}(.)$ , then the word problem of $G_{2}$ over $\Gamma_{2}$ has time-complexity $\leq t_{1}(cn)$ for inputs of length $n$ .

(2) For every $\gamma\in\Gamma_{2}$ let $w_{\gamma}\in\Gamma_{1}^{\,*}$ be such that $\gamma=w_{\gamma}$ in $G_{1}$ , and let $W(.)$ be the free monoid homomorphism from $\Gamma_{2}^{\,*}$ into $\Gamma_{1}^{\,*}$ determined by $W(\gamma)=w_{\gamma}$ for all $\gamma\in\Gamma_{2}$ . Let $\,\rho:\Sigma^{*}\to m\,\Gamma_{2}^{\,*}$ be a polynomial-time conjunctive reduction of arity $m$ , such that for all $v\in\Sigma^{*}$ : $v\in L$ iff $\rho(v)=(\varepsilon)^{m}\,$ . Then

$\,x\in\Sigma^{*}\ \longmapsto\ \rho(x)=(y_{1},\ldots,y_{m})\in$ $m\,\Gamma_{2}^{\,*}$ $\longmapsto$ $(W(y_{1}),\ldots,W(y_{m}))\in$ $m\,\Gamma_{1}^{\,*}\,$

is a polynomial-time conjunctive reduction of arity $m$ , from $L$ to the he word problem of $G_{1}$ over $\Gamma_{1}$ . $\Box$

For the word problem of groups, infinite generating sets cannot always be avoided, because some groups are not finitely generated, and because some finitely generated groups have interesting infinite generating sets. In order to apply the concepts of decidability or computational complexity to groups with infinite generating sets, we encode countable generating sets over a finite alphabet. We will use the following.

Definition 4.5

(encoding).**

An encoding of a countable set $\Gamma$ is an injective total function $\,{\sf code}:\Gamma\to\{0,1\}^{*}$ such that $\,{\sf code}(\Gamma)\,$ is a prefix code that is accepted by a finite-state automaton.

For a word of generators $\,w=w_{1}\,\ldots\,w_{m}\in\Gamma^{*}$ we define ${\sf code}(w)$ by the concatenation ${\sf code}(w)\$ $=$ $\ {\sf code}(w_{1})\cdot\,\ldots\,\cdot{\sf code}(w_{m})$ . Hence, ${\rm Im}({\sf code(.)})={\sf code}(\Gamma^{*})=({\sf code}(\Gamma))^{*}$ , which is a finite-state language.

Since the function code is injective it has an inverse function, ${\sf code}^{-1}$ , whose domain is ${\sf code}(\Gamma^{*})$ . Every countable set admits an encoding of the above type (e.g., with image set $\,0^{*}1=\{0^{n}1:n\in\omega\}\,$ ).

The word problem for a group $G$ with an infinite generating set $\Gamma$ and encoding ${\sf code}:\Gamma\to\{0,1\}^{*}$ is specified as follows.

Input: $x\in\{0,1\}^{*}$ .

Precondition: $x\in{\sf code}(\Gamma)^{*}$ . (Since ${\sf code}(\Gamma)^{*}$ is finite-state, the precondition is easy to check.)

Question: ${\sf code}^{-1}(x)={\bf 1}$ in $G\,$ ? (Here, 1 denotes the identity element of $G$ .)

Equivalently, the word problem is the membership problem of the language

$\,{\rm WP}_{G,\Gamma,{\sf code}}\,=\,$ $\{x\in\{0,1\}^{*}:\ {\sf code}^{-1}(x)={\bf 1}\ {\rm in}\ G\}$ .

From now on, by complexity of the word problem of $G$ over $\Gamma$ we mean the complexity of ${\rm WP}_{G,\Gamma,{\sf code}}$ ; note that the problem depends on $G$ , $\Gamma$ , and code.

Lemma 4.6

Let $G_{2}$ be a subgroup of a countable group $G_{1}$ , let $\Gamma_{i}\,$ $(\subseteq G_{i}$ ) be a countable generating set of $G_{i}$ , and let ${\sf code}_{i}(.)$ be an encoding of $\Gamma_{i}$ , for $i=1,2$ . We also assume that there is a total function $h:\Gamma_{2}\to\Gamma_{1}^{\,*}$ with the following properties (that connect the encodings ${\sf code}_{1}$ and ${\sf code}_{2}$ ):

$\bullet$ * For all $\gamma\in\Gamma_{2}$ : $\gamma=h(\gamma)\,$ in $G_{1}$ ;*

$\bullet$ * the function $h_{0}:\,{\sf code}_{2}(\gamma)\in{\sf code}(\Gamma_{2})$ $\ \longmapsto\$ ${\sf code}_{1}(h(\gamma))\in{\sf code}(\Gamma_{1})^{*}$ is computable in*

linear time.

The function $h$ is extended to a free-monoid homomorphism $\Gamma_{2}^{\,*}\to\Gamma_{1}^{\,*}\,$ that will also be called $h$ ; so for every $w\in\Gamma_{2}^{\,*}$ we have: $w=h(w)$ in $G_{1}$ .

Then the following hold:

(1)* If $\,{\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ is in ${\sf DTime}(t)$ (or in ${\sf NTime}(t)$ , or in ${\sf coNTime}(t)$ ), then $\,{\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ is in ${\sf DTime}(t(O(.)))$ (respectively in ${\sf NTime}(t(O(.)))$ , or ${\sf coNTime}(t(O(.)))$ ).*

(2)* If $L\subseteq\Sigma^{*}$ (where $\Sigma$ is finite) is reducible to $\,{\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ by a polynomial-time conjunctive reduction of arity $m$ , then $L$ is also reducible to $\,{\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ by a polynomial-time conjunctive reduction of arity $m$ . Hence, if $\,{\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ is hard for a complexity class (e.g., coNP), then $\,{\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ is also hard for that complexity class.*

The functions $h$ and $h_{0}$ in the Lemma have the commuting diagram

$h_{0}\circ{\sf code}_{2}(.)={\sf code}_{1}\circ h(.)\,$ ;

equivalently, $\,h_{0}={\sf code}_{1}\circ h(.)\circ{\sf code}_{2}^{-1}(.)$ , and $\,h(.)={\sf code}_{1}^{-1}\circ h_{0}\circ{\sf code}_{2}(.)$ . Note that in general $h$ cannot be viewed as a computable function (as opposed to $h_{0}$ ), since its domain and image are arbitrary countable sets.

Proof. (1) For any $w\in\Gamma_{2}^{\ *}$ we have: $w={\bf 1}$ in $G_{2}$ over $\Gamma_{2}$ iff $h(w)={\bf 1}$ in $G_{1}$ over $\Gamma_{1}$ . Therefore, $x={\sf code}_{2}(w)\in{\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ iff $h_{0}(x)={\sf code}_{1}(h(w))$ $\in$ ${\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ .

Thus we have the following algorithm for the membership problem of ${\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ on input $x\in\{0,1\}^{*}$ : First, check whether $x\in{\sf code}_{2}(\Gamma_{2})^{*}$ ; this can be checked in linear time, since ${\sf code}_{2}(\Gamma_{2})^{*}$ is finite-state. Second, compute $h_{0}(x)$ (in linear time). Finally, check whether $h_{0}(x)$ is in ${\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ , in time $\,\leq t(|h_{0}(x)|)\leq t(O(|x|))$ .

(2) Let $\rho:x\in\{0,1\}^{*}\longmapsto$ $(y_{1},\,\ldots,y_{m})\in m\,\{0,1\}^{*}$ be a polynomial-time conjunctive reduction of arity $m$ from $L$ to ${\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ . Hence, $x\in L$ iff $\{y_{1},\,\ldots,y_{m}\}\subset$ ${\rm WP}_{G_{2},\Gamma_{2},{\sf code}_{2}}$ . By the definition of $h$ and $h_{0}$ , the latter holds iff $\{h_{0}(y_{1}),\,\ldots\,,h_{0}(y_{m})\}\subset$ ${\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ . Thus the function $x\longmapsto(h_{0}(y_{1}),\,\ldots\,,h_{0}(y_{m}))$ , where $(y_{1},\,\ldots,y_{m})=\rho(x)$ , is a polynomial-time conjunctive reduction of arity $m$ from $L$ to ${\rm WP}_{G_{1},\Gamma_{1},{\sf code}_{1}}$ . $\Box$

Some conventions and a fact about the Thompson group $V$ over $\Gamma_{\!V}\cup\tau$ :

We pick a finite generating set $\Gamma_{\!V}$ for $V$ , and for notational convenience we will assume that $\,\Gamma_{\!V}=\Gamma_{\!V}^{\pm 1}$ . We also use the set of bit position transpositions $\tau=$ $\{\tau_{j,j+1}:j\geq 2\}$ . We assume $\,\Gamma_{\!V}\,\cap\,\tau\,$ $=$ $\varnothing$ .

Definition 4.7

(size of a generator).* For any generator $\delta\in\Gamma_{\!V}\,\cup\,\tau\,$ we define the size $\|\delta\|$ as follows: For $\delta=\gamma\in\Gamma_{\!V}$ we let $\,\|\gamma\|=1$ , and for $\delta=\tau_{j,j+1}\in\tau$ we let $\,\|\tau_{j,j+1}\|=j+1$ . For a string of generators $w=w_{m}\,\ldots\,w_{1}\,$ with $w_{i}\in\Gamma_{\!V}\,\cup\,\tau\,$ for $i=1,\,\ldots,m$ , the size of $w$ is defined by $\|w\|=\sum_{i=1}^{m}\|w_{i}\|$ .*

For the word $w$ as above, the length of $w$ is $|w|=m$ .

Lemma 4.8

The word problem of $V$ over $\Gamma_{\!V}\cup\tau$ belongs to coNP.

Proof. We have $\ell(\tau_{j,j+1})=j+1=\|\tau_{j,j+1}\|\,$ (where $\ell(.)$ was defined in Def. 3.1 based on tables of elements of $n\,{\cal RI}^{\sf fin}$ ). And there is a positive integer constant $c$ such that for all $\gamma\in\Gamma_{\!V}$ : $\,\ell(\gamma)\leq c$ . Hence, for any $w\in$ $(\Gamma_{\!V}\cup\tau)^{*}$ we have (by Prop. 3.2): $\ell(w)\leq c\,\|w\|$ .

Now the proof of Lemma 3.5 can be applied. For any $v\in$ $(\Gamma_{\!V}\cup\tau)^{*}$ , let $f_{v}\in{\cal RI}^{\sf fin}$ be the right ideal morphism of $\{0,1\}^{*}$ generated by $v$ . Then for every $\,w\in(\Gamma_{\!V}\cup\tau)^{*}$ we have:

$w={\bf 1}\,$ in $V$ iff $(\forall x\in\{0,1\}^{c\,\|w\|})\,[\,f_{w}(x)=x\,]$ .

The predicate $R(x,w)$ , defined by $\,[\,x\in\{0,1\}^{c\,\|w\|}\,\Rightarrow\,f_{w}(x)=x\,]$ , is in P. This uses the same proof as Lemma 3.5, and the fact that $w$ is encoded over $\{0,1\}^{*}$ in such a way that $\ell(w)$ , $\|w\|$ , and the length of the encoding, are linearly related. Hence the above $\forall$ -formula is a coNP-formula for the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ . $\Box$

Outline of the proof of coNP-hardness of the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ :

In subsections 4.2 - 4.4 we follow (a part of) the strategy of [6], where another finitely presented group with coNP-complete word problem was constructed.

Every acyclic boolean circuit $C$ is “simulated” by a element of $V$ , represented by a word $w_{C}$ over $\Gamma_{\!V}\cup\tau$ , such that the size of $w_{C}$ is polynomially bounded by the size of $C$ (subsection 4.2, Def. 4.10 and Theorem 4.12).
The equivalence problem for acyclic boolean circuits is reduced (by a polynomial-time one-one reduction) to the generalized word problem of the subgroup ${\rm pFix}_{V}(0)$ in $V$ (subsection 4.3, Coroll. 4.16).
Thanks to the “commutation test”, the generalized word problem of ${\rm pFix}_{V}(0)$ in $V$ is reduced to two instances of the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ (subsection 4.4, Lemma 4.19). This reduction is a 2-ary conjunctive linear-time reduction (“2” comes from the fact that $V$ is 2-generated).

4.2 Circuits and the Thompson group $V$

Our first step in the proof of coNP-hardness is to represent acyclic boolean circuits by words over the generating set $\Gamma_{\!V}\cup\tau$ of $V$ .

An acyclic boolean circuit is specified by a directed acyclic graph (dag) without isolated vertices, together with a vertex labeling. This labeling associates (1) an input variable with each source vertex, (2) an output variable with each sink vertex, and (3) a gate (of type not, fork, and, or or) with each interior vertex. By definition, a source vertex is a vertex of in-degree 0; a sink vertex of out-degree 0; an interior vertex is a vertex whose in-degree and out-degree are both non-zero. A source vertex is also called input port, and a sink vertex is also called output port.

A gate is, by definition, a total function $\{0,1\}^{m}\to\{0,1\}^{n}$ (for some $m,n\geq 1$ ). We consider the following four types of gates, where $u\in\{0,1\}^{j-1}$ , $\,x_{j},x_{j+1}\in\{0,1\}$ , and $v\in\{0,1\}^{n-j}\,\cup\,\{0,1\}^{n-j-1}$ .

not ${}_{j}:\ u\,x_{j}\,v\,\mapsto\,u\,\overline{x}_{j}v$ ; here, $m\geq 1$ and $j\leq n=m$ ;

and ${}_{j,j+1}:\ u\,x_{j}x_{j+1}\,v\,\mapsto\,$ $u\,(x_{j}\,\&\,x_{j+1})\,v$ ; here, $m\geq 2$ and $j\leq m-1=n$ ;

or ${}_{j,j+1}:\ u\,x_{j}x_{j+1}\,v\,\mapsto$ $\,u\,(x_{j}\,{\sf or}\,x_{j+1})\,v$ ; here, $m\geq 2$ and $j\leq m-1=n$ ;

fork ${}_{j}:\ u\,x_{j}\,v\,\mapsto\,u\,x_{j}\,x_{j}\,v$ ; here, $1\leq j\leq m$ , and $n=m+1$ .

The operation forkj makes an extra copy of $x_{j}$ . In traditional circuit theory, forks are not used separately; instead, not, and, and or are allowed to produce several copies of the output bit. However, using fork as a separate gate simplifies the conversion of a circuit into a sequence of functions. We also use the wire-crossing operation, which swaps the “wires” $i$ and $j$ (where $1\leq i<j\leq m$ ); this is the function

$\tau_{i,j}:\ u\,x_{i}\,v\,x_{j}\,w\,\longmapsto\,$ $u\,x_{j}\,v\,x_{i}\,w$ ,

where, $u\in\{0,1\}^{i-1}$ , $v\in\{0,1\}^{j-i}$ , $w\in\{0,1\}^{m-j-1}$ , $m\geq 2$ , and $n=m$ . This operation is not a gate; it is not associated with a vertex, but follows from the incidence relation of the graph.

Note that all the gates notj are different functions for different values of $j$ ; the same applies to all andj,j+1, and all orj,j+1. However in the presence of the operations $\tau_{i,j}$ it is sufficient to use just one set of gates {not, and, or, fork}, applied to bit positions 1, or 1 and 2. E.g., noti $=$ $\tau_{i,1}$ $\circ$ not1 $\circ$ $\tau_{i,1}$ . Thus, here we view acyclic circuits as expressions over the generating set {not, and, or, fork} $\cup$ $\{\tau_{i,j}:j>i\geq 1\}$ . Note that $\tau_{i,j}\in V$ , with ${\rm domC}(\tau_{i,j})$ $=$ ${\rm imC}(\tau_{i,j})$ $=$ $\{0,1\}^{j}$ .

An acyclic circuit $C$ with sequence of input variables $(x_{1},\,\ldots\,,x_{m})\,$ (with values ranging over $\{0,1\}^{m}$ ), and sequence of output variables $(y_{1},,\ldots\,,y_{n})\,$ (with values in $\{0,1\}^{n}$ ), determines an input-output function $\,f_{C}:\{0,1\}^{m}\to\{0,1\}^{n}$ ; this is a total function. Any total function of the form $\,F:\{0,1\}^{m}\to\{0,1\}^{n}$ is called a boolean function. In circuit theory it is proved that for every boolean function $F$ there exists an acyclic circuit whose input-output function is $F$ ; see e.g. [23, 46, 40, 21].

Two circuits $C_{1}$ and $C_{2}$ are called equivalent iff $\,f_{C_{1}}=f_{C_{2}}$ .

The equivalence problem for acyclic boolean circuits (in short, the circuit equivalence problem) is specified as follows:

Input: $C_{1}$ , $C_{2}\,$ (two circuits, described by dags with gate labels on the vertices);

Question: $f_{C_{1}}=f_{C_{2}}$ ?

In order to consider the complexity of problems about circuits we need to define the size of an acyclic boolean circuit $C$ , denoted by $|C|$ , and simply called circuit size; it is defined as follows: If $C$ has $k_{1}$ gates of type not or fork, $k_{2}$ gates of type and or or, and $n$ output variables, then the size of $C$ is defined to be $\,|C|=k_{1}+2\cdot k_{2}+n$ . Equivalently, $|C|$ is the number of edges (or wires) between gates, or from an input to a gate, or from a gate to an output (for that reason, gates with two input variables are counted twice).

Remarks concerning circuit definitions: Acyclic circuits and their sizes are defined in a variety of ways in the literature [23, 37, 40, 46, 22, 26, 20, 27]; however, all these definitions lead to sizes that are polynomially equivalent (i.e., each one is polynomially bounded in terms of every other one). In the theory of NP- or coNP-completeness, polynomial differences are not significant.

(1) In the literature, the circuit size is usually defined as the number of vertices. Since we do not use isolated vertices in a circuit, we have $n_{V}\leq n_{E}\leq n_{V}^{\ 2}\,$ (where $n_{V}$ and $n_{E}$ denote the number of vertices and edges). So $n_{V}$ and $n_{E}$ are polynomially equivalent.

(2) In the literature the input and output variables are usually not called vertices, but in that case they are nevertheless counted among the vertices in the definition of circuit size.

(3) When a circuit is described by a bitstring $s_{C}$ , the length satisfies $n_{E}\leq|s_{C}|\leq c\,n_{E}\,\log_{2}n_{V}$ , for some constant $c\geq 1$ . Typically, such a description of $C$ lists all the edges, where each edge is given as a pair of strings (the names of two vertices, each vertex name having length $\leq 1+\log_{2}n_{V}$ ). An additional list is given that associates a gate or an input variable or an output variable with each vertex. An input variable $x_{i}$ is described by a code word (for $x$ ) and the binary representation of $i$ ; the output variables $y_{j}$ are described similarly. In any case, $|s_{C}|$ and $n_{E}$ are polynomially equivalent.

(4) In the literature, the fork-gate is usually not used explicitly; instead, the and-, or-, and not- gates, as well as the input variables, are allowed to have a fan-out. However, even in that case, every wire goes to a gate or an output variable, so the total of all be fan-outs is $\leq n_{V}^{\ 2}$ . A gate with fan-out $k$ can be replaced by a gate with fan-out 1 and $k-1$ fork-gates. This leads to a circuit with gates that have fan-out 1, except for fork-gates with fan-out 2; the size increase is polynomially bounded.

(5) In the literature, and and or-gates are allowed to have a fan-in $\geq 2$ . But every fan-in wire comes from a gate of an input variable, so the total of all be fan-ins is $\leq n_{V}^{\ 2}$ . An or-gate with fan-in $k$ can be replaced by $k-1$ or-gates with fan-in 2 (and similarly for and). This leads to a circuit with gates that have fan-in $\leq 2$ ; the size increase is polynomially bounded.

Remarks on complexity: The circuit equivalence problem is a well-known problem that is coNP-complete. It is fairly straightforward to prove that the problem is in coNP. Moreover, the tautology problem for boolean formulas (which is a classical coNP-complete problem) is a special case of the circuit equivalence problem (and is reduced to the circuit equivalence problem by converting a boolean formula into a circuit and asking whether a given circuit is equivalent to a circuit for the constant-1 function). See [27, Introduction] for comments on the circuit equivalence problem, see [37] for a circuit-based proof of NP-completeness of the satisfiability problem for boolean formulas, and see [23, 22] for general information.

The following well-known fact implies that every $\tau_{i,j}$ can be expressed as a composition of elements of $\tau=\{\tau_{k,k+1}:k\geq 1\}$ ; the expression has linear length in terms of $j$ .

Lemma 4.9

As elements of $V$ the transpositions satisfy

$\tau_{i,j}\ =\$ * $\tau_{i,i+1}\ \tau_{i+1,i+2}\ \ldots\ \tau_{j-2,j-1}\ \tau_{j-1,j}$ $\ \tau_{j-2,j-1}\ \ldots\ \tau_{i+1,i+2}\ \tau_{i,i+1}$ , if $1\leq i<j$ .*

The word length of $\tau_{i,j}$ over $\tau$ is therefore $\ \leq 2(j-i)-1$ . $\Box$

We want to represent the circuit gates not, or, and, and fork, by elements of $V$ . For this, the main problem is that the input-output function of a circuit is not necessarily a permutation. Therefore we introduce the following notion of “simulation” of a circuit $C$ by a Thompson group element $\Phi_{C}$ and by a word $w_{C}$ over $\Gamma_{\!V}\cup\tau$ (Def. 4.10 and Theorem 4.12 below). See the discussion in [6] for additional motivation of our definition of simulation.

Definition 4.10

(simulation).* Let $\,f:\{0,1\}^{m}\to\{0,1\}^{n}\,$ be a total function. An element $\Phi_{f}\in V$ simulates $f$ iff for all $\,x\in\{0,1\}^{m}$ : $\Phi_{f}(0\,x)\ =\ 0\ f(x)\ x$ .*

When $\Phi_{f}$ is represented by a word $w_{f}\in$ $(\Gamma_{\!V}\cup\tau)^{*}$ we say that $w_{f}$ simulates $f$ .

According to this definition, $f$ is faithfully described by the action of $\Phi_{f}$ on $0\,\{0,1\}^{*}$ ; but there are no constraints on the values of $\Phi_{f}$ for input strings in $1\,\{0,1\}^{*}$ . Since $\Phi_{f}$ is an element of $V$ it is a bijection between finite maximal prefix codes, whereas $f$ need not be injective nor surjective. So there has to be a big difference between $\Phi_{f}$ and $f$ somewhere. In subsections 4.3 and 4.4 we show that, nevertheless, the equivalence problem of circuits can be reduced to the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ . In the rest of this subsection we construct $\Phi_{f}$ .

The next Lemma follows immediately from the definition of simulation.

Lemma 4.11

Let $f$ and $g$ be any boolean functions with the same number of input variables and the same number of output variables. If $f$ and $g$ are simulated by $\Phi_{f}$ , respectively $\Phi_{g}$ , then we have

$f=g$ * iff $(\Phi_{f})|_{0\{0,1\}^{*}}\ =\ (\Phi_{g})|_{0\{0,1\}^{*}}$ . $\Box$ *

We choose the following elements of $V$ to describe the gates not, or, and, and fork:

$\varphi_{\neg}\ =\ \left[\begin{array}[]{ll}0&1\\ 1&0\end{array}\right],$

$\varphi_{\vee}\ =\ \left[\begin{array}[]{ll}0x_{1}x_{2}&1x_{1}x_{2}\\ (x_{1}\vee x_{2})\,x_{1}x_{2}&({\overline{x_{1}\vee x_{2}}})\ x_{1}x_{2}\end{array}\right],\hskip 21.68121pt\varphi_{\wedge}\ =\ \left[\begin{array}[]{ll}0x_{1}x_{2}&1x_{1}x_{2}\\ (x_{1}\wedge x_{2})\,x_{1}x_{2}&({\overline{x_{1}\wedge x_{2}}})\ x_{1}x_{2}\end{array}\right]$ ,

where $x_{1}$ and $x_{2}$ range over $\{0,1\}$ . Hence, ${\rm domC}(\varphi_{\neg})={\rm imC}(\varphi_{\neg})=\{0,1\}$ , and ${\rm domC}(\varphi_{\vee})={\rm imC}(\varphi_{\vee})$ $=$ ${\rm domC}(\varphi_{\wedge})={\rm imC}(\varphi_{\wedge})=\{0,1\}^{3}$ . In order to represent the fork function we first define

$\varphi_{\rm 0f}\ =\ \left[\begin{array}[]{ccc}0&\ 10&\ 11\\ 00&\ 01&\ 1\end{array}\right]$ ;

so, ${\rm domC}(\varphi_{\rm 0f})=\{0,10,11\}$ , and ${\rm imC}(\varphi_{\rm 0f})=\{00,01,1\}$ . Then fork is simulated by

$\varphi_{\rm f}$ $=$ $\tau_{1,2}\circ\varphi_{\vee}\circ\varphi_{\rm 0f}$ .

Indeed, for all $x_{1}\in\{0,1\}$ : $\tau_{1,2}\circ\varphi_{\vee}\circ\varphi_{\rm 0f}(0x_{1})$ $\,=\,0x_{1}x_{1}$ .

For every acyclic boolean circuit $C$ we want to find a word $w_{C}\in(\Gamma_{\!V}\cup\tau)^{*}$ that simulates $C$ ; and we want the map $\,C\mapsto w_{C}$ to be polynomial-time computable (in terms of $|C|$ ).

A standard property of dags is that every vertex has a level (or “layer”) corresponding to its “depth” in the dag. The source vertices have level 0. A gate or an output variable has level 1 iff only input variables of the circuit feed into it. A gate or an output variable has level $\ell$ iff it receives input from levels $<\ell$ only, and at least one of its inputs comes from level $\ell-1$ . Equivalently, the level of a vertex $v$ is the length of a longest path from a source to $v$ . The maximum level of any sink vertex is called the depth of the dag.

The following theorem is a simplification of [6, Thm. 3.5]. For a word $w\in(\Gamma_{\!V}\cup\tau)^{*}$ we use the size, denoted by $\|w\|$ , as defined in Def. 4.7.

Theorem 4.12

(existence of simulation).* There is an injective function $C\mapsto w_{C}$ from the set of acyclic boolean circuits to the set of words over $\Gamma_{\!V}\cup\tau$ with the following properties:*

(1)* $w_{C}$ simulates the input-output function $f_{C}$ of $C$ ;*

(2)* the size of $w_{C}$ satisfies $\|w_{C}\|\,<\,c\ |C|^{6}$ (for some constant $c>0$ );*

(3)* $w_{C}$ is computable from $C$ in polynomial time, in terms of $|C|$ .*

Proof. Item (1) refers to simulation as in Def. 4.10. In the proof we assume that $\varphi_{\neg}$ , $\varphi_{\vee}$ , $\varphi_{\wedge}$ , $\varphi_{\rm f}$ , $\varphi_{\rm 0f}$ , and $\tau_{1,2}$ , belong to $\Gamma_{\!V}$ . (If this were not the case, we could express them by fixed words over $\Gamma_{\!V}$ .)

We can assume that our acyclic circuits are strictly layered, i.e., a gate or an output variable at level $\ell$ only receives inputs from level $\ell-1$ . Hence, all the output variables of the circuit are at the same level $L$ , where $L$ is the depth of the circuit. If the layering of a circuit $C$ is not strict, we can insert identity gates to obtain strictness. An identity gate has one input variable and one output variable, connected by a wire; the two variables carry the same boolean value. We will count these identity gates as gates in the evaluation of circuit size. In order to make a circuit $C$ strictly layered, fewer than $|C|^{2}$ identity gates need to be introduced. (Indeed, for each gate we add fewer than $|C|$ identity gates above it; so, in total we add fewer than $|C|^{2}$ identity gates.)

An acyclic circuit $C$ has input variables $x_{1},\ldots,x_{m}$ , output variables $y_{1},\ldots,y_{n}$ , and internal variables which correspond to the boolean values carried by internal wires (between gates or between a gate and an input or an output port). The internal variables at level $\ell$ (for $0\leq\ell\leq L$ ) are denoted by $y_{1}^{\ell}$ , $y_{2}^{\ell}$ , $\ldots$ , $y_{n_{\ell}}^{\ell}$ . When $\ell=L$ (output level) we have $n_{L}=n$ and $y_{i}^{L}=y_{i}$ ; when $\ell=0$ (input level) we have $n_{0}=m$ and $y_{i}^{0}=x_{i}$ .

For every level $\ell$ (with $1\leq\ell\leq L$ ) there is a circuit $C_{\ell}$ , called the slice of $C$ at level $\ell$ : The input variables of the slice $C_{\ell}$ are $y_{1}^{\ell-1}$ , $\ldots$ , $y_{n_{\ell-1}}^{\ell-1}$ ; the output variables are $y_{1}^{\ell}$ , $\ldots$ , $y_{n_{\ell}}^{\ell}$ ; the gates of $C_{\ell}$ are the gates of $C$ at level $\ell$ ; we use the fact that $C$ is strictly layered. In addition to gates, a slice $C_{\ell}$ also contains wire-swappings of its inputs, i.e., a bit-position permutation is applied to the $n_{\ell-1}$ input variables. Every permutation of $n_{\ell-1}$ wires can be written as the composite of $\,\leq n_{\ell-1}$ ( $<|C_{\ell}|$ ) transpositions. And each $\tau_{i,j}$ has word length $\,\leq 2(j-i)-1\,$ over $\tau$ (by Lemma 4.9), hence it has size $\,\|\tau_{i,j}\|<|C_{\ell}|^{2}$ . Thus the input-wire permutation of a slice $C_{\ell}$ has size $\,<|C_{\ell}|^{3}$ . Moreover, every $\tau_{i,j}$ belongs to $V$ , so it does not need any simulation.

We use the notation $Y^{\ell}$ $=$ $y_{1}^{\ell}y_{2}^{\ell}$ $\ \ldots\$ $y_{n_{\ell}}^{\ell}$ (i.e., the concatenation of the variables $y_{i}^{\ell}$ , for $i=1,\ldots,n_{\ell}$ , and $\ell=0,\ldots,L$ ).

Simulation of one slice

In order to construct $w_{C}$ we first consider the special case where the circuit $C$ consists of just one slice, hence $C$ has depth 2 (the gates of the slice have depth 1, the output variables have depth 2). Identity gates are allowed. We number the gates of $C$ from left to right.

For $k\geq 0$ , let $K$ consist of the first $k$ gates of a slice; so, $K$ is a one-slice circuit that has $k$ gates. When $k=0$ , $K$ is empty, $w_{K}$ is the empty string, and its input-output function is the identity function ( $\in V$ ). Inductively, let $C$ be a slice obtained from $K$ by adding one gate (and, or, not, identity, or fork) on the right of $K$ (with number $k+1$ ). Inductively we assume that $K$ satisfies the Theorem and that $w_{K}$ has been constructed. Let $x_{1},\ldots,x_{m}$ be the input variables and let $y_{1},\ldots,y_{n}$ be the output variables of $K$ . We now construct $w_{C}$ from $w_{K}$ and the gate being added.

Case 1: Suppose the slice $C$ is obtained from $K$ by adding, on the right of $K$ , an identity gate or a not gate, with new input variable $x_{m+1}$ and new output variable $y_{n+1}$ . If a not gate is added, the input-output function of $C$ is $\,f_{C}(x_{1},\ldots,x_{m},x_{m+1})=$ $(y_{1},\ldots,y_{n},\overline{x_{m+1}})$ , where $\,f_{K}(x_{1},\ldots,x_{m})=(y_{1},\ldots,y_{n})$ . The boolean function $f_{C}$ is to be simulated by a Thompson group element $\Phi_{f_{C}}$ such that

$\Phi_{f_{C}}(0\,x_{1}\ldots x_{m},x_{m+1})\ =\$ $0\,y_{1}\ldots y_{n}\ \overline{x_{m+1}}\ x_{1}\ldots x_{m}x_{m+1}$ .

We have $w_{K}\in(\Gamma_{\!V}\cup\tau)^{*}$ , where $\Phi_{f_{K}}\in V$ is the simulation of $f_{K}$ , which exists by induction. We find $w_{C}$ as follows:

$0\,x_{1}\ldots x_{m}\ x_{m+1}\$ $\stackrel{{\scriptstyle\Phi_{f_{K}}}}{{\longmapsto}}\$ $0\ y_{1}y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ x_{m+1}\$

$\stackrel{{\scriptstyle\tau_{2,n+m+2}}}{{\longmapsto}}\$ $0\ x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}$ $\stackrel{{\scriptstyle\varphi_{\rm f}}}{{\longmapsto}}\$ $0\ x_{m+1}\ x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}$

$\stackrel{{\scriptstyle\tau_{1,2}}}{{\longmapsto}}\$ $x_{m+1}\ 0\ x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}$

$\stackrel{{\scriptstyle\varphi_{\neg}}}{{\longmapsto}}\$ $\overline{x_{m+1}}\ 0\ x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}$ $\ \stackrel{{\scriptstyle\tau_{3,n+m+3}}}{{\longmapsto}}$ $\stackrel{{\scriptstyle\tau_{1,2}}}{{\longmapsto}}\$ $0\ \overline{x_{m+1}}\ y_{1}y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}x_{m+1}$ ;

applying $\tau_{n+1,n+2}\ \tau_{n,n+1}\ \dots\ \tau_{3,4}\ \tau_{2,3}(.)$ then yields

$0\ y_{1}\ldots y_{n}\ \overline{x_{m+1}}\ x_{1}\ldots x_{m}x_{m+1}$ .

So, $w_{C}$ $\,=\,$ $\tau_{n+1,n+2}\ \tau_{n,n+1}\ \dots\ \tau_{3,4}\ \tau_{2,3}$ $\tau_{1,2}$ $\tau_{3,n+m+3}$ $\varphi_{\neg}$ $\tau_{1,2}$ $\varphi_{\rm f}$ $\tau_{2,n+m+2}$ $w_{K}$ .

The case where, instead of a not gate, an identity gate is added is similar (except that we simply omit $\varphi_{\neg}$ ). By Lemma 4.9, we can express $\tau_{2,n+m+2}$ and $\tau_{3,n+m+3}$ over $\tau=\{\tau_{k,k+1}:k\geq 1\}$ . Then the size of $w_{C}$ is

$\|w_{C}\|$ $\ \leq\$ $\|w_{K}\|$ $+$ $\|\tau_{2,n+m+2}\|$ $+$ $\|\tau_{3,n+m+3}\|+4$ $+$ $\,\sum_{k=2}^{n+1}\|\tau_{k,k+1}\|$

$\leq\,\|w_{K}\|\,+\,c\,(n+m)^{2}+c$ , for some constant $c>1$ .

Case 2: Suppose our slice $C$ is obtained by adding an and gate or an or gate to $K$ on the right, with new output variable $y_{n+1}$ and new input variables $x_{m+1},\,x_{m+2}$ . We only analyze the or case, the and case being almost the same. The input-output function of $C$ is

$f_{C}(x_{1},\ldots,x_{m},x_{m+1},x_{m+2})\ =\$ $(y_{1},\ldots,y_{n},\ x_{m+1}\vee x_{m+2})$ ,

where $f_{K}(x_{1},\ldots,x_{m})=(y_{1},\ldots,y_{n})$ . The function $f_{C}$ is to be simulated by a Thompson group element $\Phi_{f_{C}}$ such that

$\Phi_{f_{C}}(0\,x_{1}\ldots x_{m}\ x_{m+1}x_{m+2})\ =\$ $0\ y_{1}\ldots y_{n}\ (x_{m+1}\vee x_{m+2})\ x_{1}\ldots x_{m}x_{m+1}x_{m+2}$

Let $w_{K}\in(\Gamma_{2}\cup\tau)^{*}$ be such that $\Phi_{f_{K}}\in V$ simulates $f_{K}$ . Then we construct $w_{C}$ as follows:

$0\,x_{1}\ldots x_{m}\ x_{m+1}x_{m+2}\$ $\stackrel{{\scriptstyle\Phi_{f_{K}}}}{{\longmapsto}}\$ $0\,y_{1}\ldots y_{n}\ x_{1}\ldots x_{m}\,x_{m+1}\,x_{m+2}$

$\stackrel{{\scriptstyle\varphi_{\rm 0f}}}{{\longmapsto}}$ $\ 00\ y_{1}\ldots y_{n}\ x_{1}\ldots x_{m}\,x_{m+1}\,x_{m+2}$

$\stackrel{{\scriptstyle\tau_{2,n+m+3}}}{{\longmapsto}}$ $\stackrel{{\scriptstyle\tau_{3,n+m+4}}}{{\longmapsto}}\$ $0\,x_{m+1}x_{m+2}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ 0y_{1}$

$\stackrel{{\scriptstyle\varphi_{\vee}}}{{\longmapsto}}$ $(x_{m+1}\vee x_{m+2})\ x_{m+1}x_{m+2}\ y_{2}\ldots y_{n}\$ $x_{1}\ldots x_{m}\ 0y_{1}$

$\stackrel{{\scriptstyle\tau_{2,n+m+3}}}{{\longmapsto}}$ $\stackrel{{\scriptstyle\tau_{3,n+m+4}}}{{\longmapsto}}\$ $(x_{m+1}\vee x_{m+2})\ 0\ y_{1}y_{2}\ldots y_{n}\$ $x_{1}\ldots x_{m}\,x_{m+1}\,x_{m+2}$ ;

applying $\tau_{n+1,n+2}\ \ldots\ \tau_{2,3}\,\tau_{1,2}$ then yields

$0\ y_{1}y_{2}\ldots y_{n}\ (x_{m+1}\vee x_{m+2})\ x_{1}\ldots x_{m}$ $x_{m+1}x_{m+2}$ .

Thus $C$ is simulated by the word

$w_{C}=$ $\tau_{n+1,n+2}\ \ldots\ \tau_{2,3}\,\tau_{1,2}$ $\tau_{3,n+m+4}\,\tau_{2,n+m+3}$ $\varphi_{\vee}$ $\tau_{3,n+m+4}$ $\tau_{2,n+m+3}$ $\varphi_{\rm 0f}$ $w_{K}$

of size

$\|w_{C}\|$ $\ \leq\$ $\|w_{K}\|$ $+$ $2\,\|\tau_{2,n+m+3}\|$ $+$ $2\,\|\tau_{2,n+m+4}\|+2$ $+$ $\,\sum_{k=1}^{n+1}\|\tau_{k,k+1}\|$

$\leq\,\|w_{K}\|\,+\,c\,(n+m)^{2}+c$ , for some constant $c>1$ .

Case 3: Suppose our slice $C$ is obtained by adding a fork gate on the right of $K$ , with a new input variable $x_{m+1}$ and two new output variables $y_{n+1}$ and $y_{n+2}$ . The input-output function of $C$ is

$f_{C}(x_{1},\ldots,x_{m},x_{m+1})\ =\ (y_{1},\ldots,y_{n},x_{m+1},x_{m+1})$ ,

where $f_{K}(x_{1},\ldots,x_{m})=(y_{1},\ldots,y_{n})$ . The boolean function $f_{C}$ is to be simulated by a Thompson group element $\Phi_{f}$ such that

$\Phi_{f}(0\,x_{1}\ldots x_{m}x_{m+1})\ =\$ $0\ y_{1}\ldots y_{n}\ x_{m+1}x_{m+1}\ x_{1}\ldots x_{m}x_{m+1}$ .

Let $w_{K}\in(\Gamma_{\!V}\cup\tau)^{*}$ and $\Phi_{f_{K}}\in V$ be the simulation of $f_{K}$ , which exists by induction. Then

$0\,x_{1}\ldots x_{m}\ x_{m+1}\$ $\stackrel{{\scriptstyle\Phi_{f_{K}}}}{{\longmapsto}}\$ $0\ y_{1}y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ x_{m+1}\$

$\stackrel{{\scriptstyle\tau_{2,n+m+2}}}{{\longmapsto}}\$ $0\ x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}\$ $\stackrel{{\scriptstyle\varphi_{\rm f}}}{{\longmapsto}}$ $\stackrel{{\scriptstyle\varphi_{\rm f}}}{{\longmapsto}}\$ $0\,x_{m+1}x_{m+1}x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}\$

$\stackrel{{\scriptstyle\tau_{4,n+m+4}}}{{\longmapsto}}\$ $0\,x_{m+1}\,x_{m+1}\ y_{1}\ldots y_{n}\ x_{1}\ldots x_{m}\ x_{m+1}$ ;

applying $\tau_{n+2,n+3}\ \ldots\ \tau_{3,4}$ and then $\tau_{n+1,n+2}\ \ldots\ \tau_{2,3}$ yields

$0\ y_{1}y_{2}\ldots y_{n}\ x_{m+1}x_{m+1}\ x_{1}\ldots x_{m}x_{m+1}$ .

This simulates $f_{C}$ by a word

$w_{C}=$ $\tau_{n+1,n+2}\ \ldots\ \tau_{2,3}$ $\tau_{n+2,n+3}\ \ldots\ \tau_{3,4}$ $\tau_{4,n+m+4}$ $\varphi_{\rm f}\ \varphi_{\rm f}\ \tau_{2,n+m+2}$ $w_{K}$

of size

$\|w_{C}\|\ \leq\ \|w_{K}\|+2+\|\tau_{2,n+m+2}\|+\|\tau_{4,n+m+4}\|$ $+$ $\ \sum_{k=3}^{n+2}\|\tau_{k,k+1}\|$ $\ +\$ $\sum_{k=2}^{n+1}\|\tau_{k,k+1}\|$

$\leq\,$ $\|w_{K}\|+c\,(m+n)^{2}+c$ , for some constant $c>1$ .

In all three cases the slice $C$ is simulated by a word $w_{C}\in$ $(\Gamma_{\!V}\cup\tau)^{*}$ of size

$\|w_{C}\|\leq\|w_{K}\|+c\,(m+n)^{2}+c$ .

Let $S$ now be any slice, and let $n_{i}$ be the number of interior vertices of $S$ (i.e., the vertices labeled by gates). Then if $w_{S}$ is constructed by adding $n_{i}$ ( $<|S|)$ gates to slices (starting with $K$ being the empty slice, and ending with $K$ being the desired slice $S$ ), the size of $w_{S}$ is

$\|w_{S}\|\ \leq\ n_{i}\,(c\,(m+n)^{2}+c)\ \leq\ c_{0}\,|S|^{3}$ ,

for some constant $c_{0}>1$ (that does not depend on $S$ ).

Moreover, as we saw when we introduced the notion of slice, in all three cases a bit-position permutation of the input wires of the slice $S$ is attached at the beginning of $w_{S}$ . This permutation belongs to $V$ and has size $<|S|^{3}$ .

The above construction of each word $w_{S}$ from $S$ is a polynomial-time algorithm (in terms of $|S|$ ).

Simulation of a multi-slice circuit

Assume that $C$ is a circuit of depth $L>2$ ; the depth is the number of slices. In order to define $w_{C}$ we use the fact that we have already defined the word $w_{C_{\ell}}$ that simulates the slice $C_{\ell}$ of $C$ (for every $\ell$ , $1\leq\ell\leq L$ ). Each word $w_{C_{\ell}}$ has all the properties claimed in Theorem 4.12; in particular, $w_{C_{\ell}}$ represents the function

$\Phi_{C_{\ell}}:\ \ 0\ Y^{\ell-1}\ \longmapsto\$ $0\ Y^{\ell}\ Y^{\ell-1}$ .

Hence, since $\Phi_{C_{\ell}}$ is a right ideal isomorphism, we also have

$0\ Y^{\ell-1}\ Y^{\ell-2}\ \ldots\ Y^{1}\ x_{1}\ldots x_{m}$ $\ \ \stackrel{{\scriptstyle\Phi_{C_{\ell}}}}{{\longmapsto}}\ \$ $0\ Y^{\ell}\ Y^{\ell-1}\ Y^{\ell-2}\ \ldots\ Y^{1}\ x_{1}\ldots x_{m}$ .

Therefore, $w_{C_{L}}\,w_{C_{L-1}}\ \ldots\ w_{C_{\ell}}\ \ldots\ w_{C_{1}}$ represents the function

$\Phi_{C_{L}C_{L-1}\ldots C_{1}}:$ $\ \ 0\,x_{1}\ldots x_{m}\ \ \longmapsto\ \$ $0\ y_{1}\ldots y_{n}\,Y^{L-1}\,\ldots\,Y^{\ell}\,\ldots\,$ $Y^{2}\,Y^{1}\,x_{1}\ldots x_{m}\ \ (=_{\rm def}\ Z)$ ,

where $\,y_{1}\ldots y_{n}=Y^{L}\,$ is the output of $C$ , and $x_{1}\ldots x_{m}$ is the input of $C$ .

The length of the word $Z$ ( $\in\{0,1\}^{*}$ ) is $|Z|\,\leq\,1+|C|$ . Indeed, the total number of variables in the circuit (i.e., $n_{L}+\,\ldots\,+n_{1}+m$ ) is equal to the total number of wires (i.e., $|C|$ ); the “ $+1$ ” comes from the leading bit [math].

Let $\,\sigma_{i,j}=\tau_{j-1,j}\,\tau_{j-2,j-1}\ \ldots\$ $\tau_{i+1,i+2}\,\tau_{i,i+1}(.)$ (for $1\leq i<j$ ). Then $\ \pi_{1}\ =\ (\sigma_{1,|Z|})^{n}\$ transforms the word $Z$ into

$0\ Y^{L-1}\ \ldots\ Y^{\ell}\ \ldots\ Y^{2}\ Y^{1}\ x_{1}\ldots x_{m}$ $y_{1}\ldots y_{n}$ .

Next (and this is a fundamental and crucial idea from reversible computing, see e.g., [3, 2, 21]), to the latter string we apply

$(w_{C_{L-1}}\ \ldots\ w_{C_{\ell}}\ \ldots\ w_{C_{2}}\,w_{C_{1}})^{-1}$

in order to clear away intermediate outputs of all the internal slices. This yields

$0\ x_{1}\ldots x_{m}\ y_{1}\ldots y_{n}$ .

Finally, applying the permutation $\,\pi_{2}\ =\ (\sigma_{1,n+m})^{m}\,$ produces the desired final output

$0\ y_{1}\ldots y_{n}\ x_{1}\ldots x_{m}$ .

Therefore we define $w_{C}\in(\Gamma_{\!V}\cup\tau)^{*}$ ) by

$w_{C}\ =\ \pi_{2}\ (w_{C_{L-1}}\ \ldots\ w_{C_{1}})^{-1}\ \pi_{1}\$ $w_{C_{L}}\ w_{C_{L-1}}\ \ldots\ w_{C_{1}}\,$ .

The word length of $\pi_{1}$ over $\tau$ is less than $\,n\ |Z|$ . Since all subscripts in $\sigma_{1,|Z|}$ are $\leq|Z|$ , the size of $\pi_{1}$ is $\,\|\pi_{1}\|\,<\,|Z|\ n\ |Z|\,\leq\,(|C|+1)^{3}$ . Since $m+n\leq|C|$ , the size of $\pi_{2}$ is also less than $\,(|C|+1)^{3}$ .

For the size of $w_{C}$ we have

$\|w_{C}\|\leq\|\pi_{2}\|+\|\pi_{1}\|+\|w_{C_{L}}\|$ $+\ 2\,\sum_{\ell=1}^{L-1}\|w_{C_{\ell}}\|$ .

We saw that $\ \|w_{C_{\ell}}\|\leq c_{0}\,|C_{\ell}|^{3}\$ (for $1\leq\ell\leq L$ ); and $\,\sum_{\ell=1}^{L}|C_{\ell}|=|C|\,$ implies $\,\sum_{\ell=1}^{L}|C_{\ell}|^{3}\leq|C|^{3}$ . Thus $\,\|w_{C}\|\leq c\cdot|C|^{3}$ , for some positive constant $c$ .

Since $|C|$ was at most squared in order to obtain strict layering, the above bound becomes

$\|w_{C}\|\,\leq\,c\ |C|^{6}$ ,

in terms ot the original (not necessarily strictly layered) circuit $C$ .

The word $w_{C}$ can be written down in linear time, based on the words $w_{C_{\ell}}$ ( $1\leq\ell\leq L$ ), and we saw that each $w_{C_{\ell}}$ can be computed in polynomial time from $C_{\ell}$ . $\Box$

4.3 Reduction to a generalized word problem of $V$

(over an infinite generating set)

We first extend the classical concepts of stabilizer and fixator to the case of partial injections.

Definition 4.13

A function $g$ partially stabilizes a set $S\subseteq\{0,1\}^{*}$ iff $g(S)\cup g^{-1}(S)\subseteq S$ . For a subgroup $G\subseteq V$ , the partial stabilizer of $S$ (in $G$ ) is

${\rm pStab}_{G}(S)\ =\$ * $\{g\in G:\ g(S)\,\cup\,g^{-1}(S)\,\subseteq\,S\}$ .*

A function $g$ partially fixes a set $S$ iff $g(x)=x\,$ for every $\,x\in$ $S\,\cap\,{\rm Dom}(g)\,\cap\,{\rm Im}(g)$ . This is also called partial pointwise stabilization. For a subgroup $G\subseteq V$ , the partial fixator of $S$ (in $G$ ) is

${\rm pFix}_{G}(S)\ =\ \{g\in G:\$ * $(\forall x\in S\,\cap\,{\rm Dom}(g)\,\cap\,{\rm Im}(g))$ $[\,g(x)=x\,]\,\}$ .*

We will only use partial stabilizers and fixators for sets $S$ that are right ideals; then ${\rm pStab}_{G}(S)$ and ${\rm pFix}_{G}(S)$ are groups [6, Lemma 4.1]. When $S=P\{0,1\}^{*}$ is a right ideal, where $P$ is a prefix code, we will abbreviate ${\rm pFix}_{G}(P\,\{0,1\}^{*})$ and ${\rm pStab}_{G}(P\,\{0,1\}^{*})$ by ${\rm pFix}_{G}(P)$ , respectively ${\rm pStab}_{G}(P)$ . In particular, we abbreviate ${\rm pFix}_{V}(0\,\{0,1\}^{*})$ to ${\rm pFix}_{V}(0)$ .

Lemma 4.14

We have: ${\rm pFix}_{V}(0)\ \subset\$ ${\rm pStab}_{V}(0\,\{0,1\}^{*})\ \cap\ {\rm pStab}_{V}(1\,\{0,1\}^{*})$ .

Proof. Obviously, ${\rm pFix}_{V}(0)\subset$ ${\rm pStab}_{V}(0\,\{0,1\}^{*})$ . Moreover, if we had $g(1x)=0y$ for any $g\in{\rm pFix}_{V}(0)$ and $x,y\in\{0,1\}^{*}$ , then $0y=g^{-1}(0y)=g^{-1}g(1x)=1x$ ; the first equality holds since $g^{-1}\in{\rm pFix}_{V}(0)$ . But $0y=1x$ is false since a string does not start with both 0 and 1. $\Box$

The following is little more than a reformulation of the definition of simulation and Lemma 4.11.

Lemma 4.15

Let $f$ and $g$ be any boolean functions such that $f$ and $g$ have the same number of input variables, and $f$ and $g$ have the same number of output variables. Suppose $f$ and $g$ are simulated by $\Phi_{f}$ , respectively $\Phi_{g}$ ( $\Phi_{f},\Phi_{g}\in V$ ). Then,

$f=g$ * iff $\Phi_{f}^{-1}\,\Phi_{g}\,\in\,{\rm pFix}_{V}(0)$ .*

Proof. Let $\{0,1\}^{m}$ be the common domain of $f$ and $g$ . Then by Lemma 4.11, $f=g$ iff for all $x\in\{0,1\}^{m}$ : $\,\Phi_{f}(0x)=\Phi_{g}(0x)$ . Then for all $x\in\{0,1\}^{m}$ : $\,0x=\Phi_{f}^{-1}\,\Phi_{g}(0x)=\Phi_{g}^{-1}\,\Phi_{f}(0x)\,$ (and $\Phi_{g}^{-1}\,\Phi_{f}=(\Phi_{f}^{-1}\,\Phi_{g})^{-1}$ ). Hence, $f=g\,$ iff $\,\Phi_{f}^{-1}\,\Phi_{g}\in{\rm pFix}_{V}(0)$ . $\Box$

Theorem 4.12 and Lemma 4.15 give a polynomial-time one-one reduction from the circuit equivalence problem to the generalized word problem of $\,{\rm pFix}_{V}(0)$ in $V$ , where the elements of $V$ written over $\Gamma_{\!V}\cup\tau$ . Since the circuit equivalence problem is coNP-complete, it follows that this generalized word problem is coNP-hard. Hence we have:

Corollary 4.16

(coNP-hard generalized word problem).* The generalized word problem of $\,{\rm pFix}_{V}(0)$ in $V$ over $\Gamma_{\!V}\cup\tau$ is coNP-hard. $\Box$ *

4.4 Reduction to the word problem of $V$

We will give a linear-time $2$ -ary conjunctive reduction from the generalized word problem of ${\rm pFix}_{V}(0)$ to the word problem of $V$ over the infinite generating set $\Gamma_{\!V}\cup\tau$ . This reduction is based on a “commutation test”, that was studied in greater generality in [6, Section 5]; here we just use $V$ , based on an alphabet of size 2, which makes everything simpler.

We first need a few lemmas. Recall the notation $u\parallel_{\rm pref}v\,$ ( $u$ and $v$ are prefix-comparable) and its negation $\nparallel_{\rm pref}$ . For $x\in\{0,1\}^{*}$ and $L\subseteq\{0,1\}^{*}$ , we define $\,x^{-1}L\,=\,\{v\in\{0,1\}^{*}:xv\in L\}$ .

Lemma 4.17

If $g\not\in{\rm pFix}_{V}(0)$ but $g\in{\rm pStab}_{V}(0)$ , then there exists $0x\in{\rm domC}(g)$ such that

$0x\nparallel_{\rm pref}g(0x)$ .

Hence, $\,0xu\nparallel_{\rm pref}g(0xu)\,$ ( $=g(0x)\,u$ ), for all $u\in\{0,1\}^{*}$ .

Proof. Lemma 4.17 is a special case of [6, Lemma 9.6], with a simpler proof. (Note that in [6] the notation $\leq_{\rm pref}$ for the prefix order was reversed; here, “ $p\leq_{\rm pref}w$ ” always means $p$ is a prefix of $w$ .)

We prove the contrapositive, i.e., if for all $0x\in{\rm domC}(g)$ we have $0x\parallel_{\rm pref}g(0x)$ , then $g\in{\rm pFix}_{V}(0)$ .

Case 1: $0x<_{\rm pref}g(0x)$ .

Then $g(0x)=0x\,v$ , for some $v\in\{0,1\}^{+}$ , so $v\in(0x)^{-1}{\rm imC}(g)$ . Now, $(0x)^{-1}({\rm imC}(g))$ is a maximal finite prefix code (by [6, Lemma 9.4]), which contains the non-empty string $v$ . Hence $(0x)^{-1}{\rm imC}(g)$ contains at least one other non-empty string (by [6, Lemma 9.5]), i.e., ${\rm imC}(g)$ contains $0xw$ ( $\neq 0xv$ ), for some $w\in\{0,1\}^{+}$ . Hence (since $g^{-1}$ stabilizes $0\{0,1\}^{*}$ ), there exists $0x^{\prime}\in{\rm domC}(g)$ such that $0x^{\prime}\neq 0x$ , and $g(0x^{\prime})\in{\rm imC}(g)$ and $g(0x^{\prime})=0x\,w>_{\rm pref}0x$ . Since ${\rm imC}(g)$ is a prefix code, $g(0x)\nparallel_{\rm pref}g(0x^{\prime})$ .

By the (contrapositive) assumption, $0x^{\prime}\parallel_{\rm pref}g(0x^{\prime})$ . Hence there are two possibilities:

(1) $0x^{\prime}\geq_{\rm pref}g(0x^{\prime})$ : Then $0x^{\prime}\geq_{\rm pref}g(0x^{\prime})>_{\rm pref}0x$ . So $0x^{\prime}>_{\rm pref}0x$ , which contradicts the fact that ${\rm domC}(g)$ is a prefix code.

(2) $0x^{\prime}<_{\rm pref}g(0x^{\prime})$ : Then $0x^{\prime}<_{\rm pref}g(0x^{\prime})=0x^{\prime}\,z$ , for some $z\in\{0,1\}^{+}$ ; and we saw that also $g(0x^{\prime})=0x\,w$ . This implies that $0x\|_{\rm pref}0x^{\prime}$ . Again, this contradicts that ${\rm domC}(g)$ is a prefix code.

Thus, case 1 is impossible.

Case 2: $0x>_{\rm pref}g(0x)$ .

Then $0x=g(0x)\,u$ , for some $u\in\{0,1\}^{+}$ , hence $u\in(g(0x))^{-1}{\rm domC}(g)$ . Now $(g(0x))^{-1}{\rm domC}(g)$ is a finite maximal prefix code, containing the non-empty string $u$ , hence it contains some other non-empty string. So there exists $0x^{\prime}$ ( $\neq 0x$ ) with $0x^{\prime}\in{\rm domC}(g)\,\cap\,g(0x)\,\{0,1\}^{+}$ .

By the (contrapositive) assumption, $0x^{\prime}\parallel_{\rm pref}g(0x^{\prime})$ . Again, we have two possibilities:

(1) $0x^{\prime}\leq_{\rm pref}g(0x^{\prime})$ : Then $g(0x^{\prime})\geq_{\rm pref}0x^{\prime}$ , and $0x^{\prime}>_{\rm pref}g(0x)$ (since $0x^{\prime}\in g(0x)\,\{0,1\}^{+}$ ). Thus, $g(0x^{\prime})>_{\rm pref}g(0x)$ , which contradicts the fact that ${\rm imC}(g)$ is a prefix code.

(2) $0x^{\prime}>_{\rm pref}g(0x^{\prime})$ : Then $0x^{\prime}=g(0x^{\prime})\,z$ , for some $z\in\{0,1\}^{+}$ ; and $0x^{\prime}=g(0x)\,w$ , for some $w\in\{0,1\}^{+}$ (since $0x^{\prime}\in g(0x)\,\{0,1\}^{+}$ ). Thus, $0x^{\prime}=g(0x^{\prime})\,z=g(0x)\,w$ , which implies $g(0x^{\prime})\parallel_{\rm pref}g(0x)$ . Again, this contradicts the fact that ${\rm imC}(g)$ is a prefix code.

We conclude that case 2 is impossible.

Now, having ruled out cases 1 and 2, the only remaining possibility is that $0x=g(0x)$ , for all $0x\in{\rm domC}(g)$ . This means that $g\in{\rm pFix}_{V}(0)$ . $\Box$

Lemma 4.18

For every $\,0x,0y\in 0\,\{0,1\}^{*}$ such that $0x\nparallel_{\rm pref}0y$ , there exists $f_{0}\in{\rm pFix}_{V}(1)$ and $u\in\{0,1\}^{*}$ such that

$f_{0}(0xu)=0xu$ * and $f_{0}(0yu)\neq 0yu$ .*

Proof. This Lemma is a simplification of [6, Prop. 9.14(1)], and we adapt that proof.

Let $0x,0y\in 0\,\{0,1\}^{*}$ be two prefix-incomparable strings, and let $a,b\in\{0,1\}$ with $a\neq b$ . Then $0x,0ya,0yb$ are prefix-incomparable two-by-two (as is easy to check). We now use [6, Lemma 9.7] to construct a finite maximal prefix code $\,Q\,\cup\,\{0x,0ya,0yb,1\}$ , with $Q\subset 0\,\{0,1\}^{*}$ . We define $f_{0}\in V$ by

$f_{0}(0ya)=0yb,\ \ f_{0}(0yb)=0ya,\ \ f_{0}(0x)=0x$ , and

$f_{0}$ is the identity on $Q\,\cup\,\{1\}$ .

So, $Q\,\cup\,\{0x,0ya,0yb,1\}$ is the domain code and image code of $f_{0}$ . Then $f_{0}\in{\rm pFix}_{V}(1)$ , $f_{0}(0ya)\neq 0ya$ , and $f_{0}(0xa)=0xa$ (since $f_{0}(0x)=0x$ ). So here, $a$ plays the role of $u$ . $\Box$

Lemma 4.19

(commutation test).* For all $g\in V$ we have:*

$g\in{\rm pFix}_{V}(0)$ * iff $\big{(}\forall f\in{\rm pFix}_{V}(1)\big{)}\,[\,fg=gf\,]$ .*

In words: An element $g\in V$ belongs to the subgroup ${\rm pFix}_{V}(0)$ iff $g$ commutes with all the elements of the subgroup ${\rm pFix}_{V}(1)$ .

Proof. ${\boldmath[\Leftarrow]}$ Suppose $fg=gf$ , for all $f\in{\rm pFix}_{V}(1)$ , and hence also $\,g^{-1}f=fg^{-1}$ .

(1) We first prove that $g\in{\rm pStab}_{V}(0)$ .

If $g(0x)=1y$ for some $x,y\in\{0,1\}^{*}$ , then $fg(0x)=f(1y)=1y$ for all $f\in{\rm pFix}_{V}(1)$ . And $1y=fg(0x)=gf(0x)$ . Hence, $g(0x)=1y=g(f(0x))$ , hence by injectiveness, $0x=f(0x)$ for all $f\in{\rm pFix}_{V}(1)$ . So, $f(0x0)=0x0$ and $f(0x1)=0x1$ , and $0x0\nparallel_{\rm pref}0x1$ . Hence by Lemma 4.18, there exists $f_{o}\in{\rm pFix}_{V}(1)$ such that $f_{o}(0x0u)=0x0u$ , and $f_{o}(0x1u)\neq 0x1u$ (for some $u\in\{0,1\}^{*}$ ). The latter inequality contradicts the fact that $f(0x)=0x$ for all $f\in{\rm pFix}_{V}(1)$ .

In a similar way one obtains a contradiction if $g^{-1}(0x)=1y$ for some $x,y\in\{0,1\}^{*}$ .

(2) We prove next that $g\in{\rm pFix}_{V}(0)$ .

Suppose $fg=gf$ for all $f\in{\rm pFix}_{V}(1)$ ; we saw that then $g\in{\rm pStab}_{V}(0)$ . If, by contradiction, $g\not\in{\rm pFix}_{V}(0)$ , then by Lemma 4.17, there exists $0x\in{\rm domC}(g)$ such that $\,0x\nparallel_{\rm pref}g(0x)=0y$ .

Then, $fg(0x)=f(0y)=gf(0x)$ . By Lemma 4.18 there exists $f_{o}\in{\rm pFix}_{V}(1)$ such that $f_{o}(0xu)=0xu$ , and $f_{o}(0yu)\neq 0yu$ (for some $u\in\{0,1\}^{*}$ ). Then $f_{o}(0yu)=f_{o}g(0xu)=gf_{o}(0xu)=g(0xu)=0yu$ . So, $f_{o}(0yu)=0yu$ , which contradicts $f_{o}(0yu)\neq 0yu$ .

$[\Rightarrow]$ Let $g\in{\rm pFix}_{V}(0)$ and $f\in{\rm pFix}_{V}(1))$ . Then ${\rm domC}(f)=\{1\}\,\cup\,0P$ , and ${\rm domC}(g)=\{0\}\,\cup\,1Q$ , where $P,Q\subset\{0,1\}^{*}\,$ are finite maximal prefix codes. So, $0P\,\cup\,1Q$ is a finite maximal prefix code.

Then for every $0x\in 0P$ : $\,fg(0x)=f(0x)$ , since $g\in{\rm pFix}_{V}(0)$ ; and $\,gf(0x)=f(0x)$ , since $f(0x)\in 0\{0,1\}^{*}$ and $g\in{\rm pFix}_{V}(0)$ . So, $fg(0x)=gf(0x)$ .

Similarly, for all $1x\in 1Q$ : $\,gf(1x)=g(1x)$ , since $f\in{\rm pFix}_{V}(1))$ ; and $\,fg(1x)=g(1x)$ , since $g(1x)\in 1\{0,1\}^{*}$ and $f\in{\rm pFix}_{V}(1))$ . So, $fg(1x)=gf(1x)$ .

Hence, $fg=gf$ on the finite maximal prefix code $\,0P\,\cup\,1Q$ . Hence $fg=gf$ in $V$ . $\Box$

Lemma 4.20

The subgroups ${\rm pFix}_{V}(1)$ and ${\rm pFix}_{V}(0)$ are isomorphic to $V$ .

Proof. Every element of $V$ has a table of the form $\,\{(x_{i},y_{i}):1\leq i\leq n\}$ , where $\{x_{1},\ldots,x_{n}\}$ and $\{y_{1},\ldots,y_{n}\}$ are finite maximal prefix codes over $\{0,1\}$ . An isomorphism $\,V\to{\rm pFix}_{V}(1)\,$ is given by

[TABLE]

The map $\theta$ is obviously a bijection from $V$ onto ${\rm pFix}_{V}(1)$ , and it is easy to check that it is a homomorphism. $\Box$

coNP**-hardness of the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ :**

The commutation test of Lemma 4.19 reduces the generalized word problem of ${\rm pFix}_{V}(0)$ in $V$ (over $\Gamma_{\!V}\cup\tau$ ) to an infinite set of word problems of $V$ , namely $\,\{fg=gf:f\in{\rm pFix}_{V}(1)\}$ .

However, ${\rm pFix}_{V}(1)$ is 2-generated; this follows from Lemma 4.20 and the fact that $V$ is 2-generated [45, 32, 12]. Obviously, $g$ commutes with all of ${\rm pFix}_{V}(1)$ iff $g$ commutes with the two generators of ${\rm pFix}_{V}(1)$ . This reduces the generalized word problem of ${\rm pFix}_{V}(0)$ in $V$ (over $\Gamma_{\!V}\cup\tau$ ) to the conjunction of two instances of the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ . Hence, the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ is coNP-hard with respect to 2-ary conjunctive polynomial-time reduction.

Theorem 4.21

(coNP-complete word problem).* The word problem of $V$ over the generating set $\Gamma_{\!V}\cup\tau$ is coNP-complete.*

Proof. By Lemma 4.8, this word problem belongs to coNP. By the reasoning in the above few lines, the word problem is coNP-hard. $\Box$

4.5 Alternative proof of coNP-completeness

of the word problem of $V$ over $\Gamma_{\!V}\cup\tau$

The above proof of coNP-completeness of the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ was derived from a similar proof for $G_{3,1}$ [6] (in 2003). Since then, Stephen Jordan [27] (in 2013) proved that the equivalence problem for bijective circuits built from copies of the Fredkin gate is coNP-complete. A bijective circuit is an acyclic circuit in which every gate has a permutation of $\{0,1\}^{j}$ as its input-output function (for some $j>0$ , depending on the gate). The input-output function of such a circuit is a permutation of $\{0,1\}^{n}$ for some $n>0$ (see e.g. [43]). The Fredkin gate, on an input $x_{1}x_{2}x_{3}\in\{0,1\}^{3}$ , is defined by

${\small\sf F}(0\,x_{2}x_{3})=0\,x_{2}x_{3}\,$ ,

${\small\sf F}(1\,x_{2}x_{3})=1\,x_{3}x_{2}\,$ ;

see e.g. [21]. This gate is also called the “controlled transposition” (of $x_{2}$ and $x_{3}$ ). Clearly, ${\small\sf F}$ is the table of an element of $V$ ; moreover, with $\{{\small\sf F}\}\cup\tau$ we can compute ${\small\sf F}(x_{i}x_{j}x_{k})$ for any three different variables in an input $x_{1}\ldots x_{n}$ with $i,j,k\in\{1,\ldots,n\}$ . Hence Jordan’s result can be recast as follows:

Theorem 4.22

(Thompson group form of Jordan’s theorem).* The subgroup of $V$ generated by $\,\{{\small\sf F}\}\cup\tau$ has a coNP-complete word problem, with respect to many-one polynomial-time reduction. $\Box$ *

See [7] (and [43]) for further connections between bijective (“reversible”) circuits and the Thompson groups.

Theorem 4.22 immediately implies Theorem 4.21, as the word problem of the subgroup $\langle\{{\small\sf F}\}\cup\tau\rangle_{{}_{V}}$ reduces to the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ by the inclusion map. (Here we assume that ${\small\sf F}\in\Gamma_{\!V}$ ; if that is not the case we can represent ${\small\sf F}$ by a fixed word over $\Gamma_{\!V}$ for the reduction; see Lemma 4.6(2).)

An advantage of our method of subsections 4.2 - 4.4 is that it is direct, whereas Jordan’s theorem is based on Barrington’s theorem [1], which is itself a deep result. However, using Jordan’s theorem has the advantage that it yields the following: The word problem of $V$ over $\Gamma_{\!V}\cup\tau$ is coNP-complete with respect to polynomial-time many-one reduction. The earlier proof only yields polynomial-time binary conjunctive reduction.

4.6 The shift, and the word problem of $nV$

For all $\tau_{j,j+1}\in\tau\subset V$ with $j\geq 1$ , we define $\tau_{j,j+1}\times{\bf 1}$ : $\{0,1\}^{*}\times\{0,1\}^{*}$ $\,\longrightarrow\,$ $\{0,1\}^{*}\times\{0,1\}^{*}$ by

$\tau_{j,j+1}\times{\bf 1}:$ $\ \ (x,y)\ \longmapsto\ (\tau_{j,j+1}(x),\ y)$ .

So, ${\rm domC}(\tau_{j,j+1}\times{\bf 1})=$ $\{0,1\}^{j+1}\times\{\varepsilon\}$ .

The shift $\,\sigma\in 2V$ is defined by $\,{\rm domC}(\sigma)=\{\varepsilon\}\times\{0,1\}$ , $\,{\rm imC}(\sigma)=\{0,1\}\times\{\varepsilon\}$ , and

$\sigma(\varepsilon,a)=(a,\varepsilon)$ ,

for all $a\in\{0,1\}$ . Hence, $\sigma(x,\,ay)=(ax,\,y)$ , for all $a\in\{0,1\}$ , and $\,(x,y)\in\{0,1\}^{*}\times\{0,1\}^{*}$ .

Lemma 4.23

For all $j\geq 1$ : $\tau_{j,j+1}\times{\bf 1}(.)\ =\$ $\sigma^{j-1}\circ(\tau_{1,2}\times{\bf 1})\circ\sigma^{-j+1}(.)$ .

Proof. For any $(x,y)\in\{0,1\}^{*}\times\{0,1\}^{*}$ , where $\,x=u\,x_{j}x_{j+1}\,v$ with $|u|=j-1\geq 0$ , and $v\in\{0,1\}^{*}$ , we have:

$(u\,x_{j}x_{j+1}v,\,y)\ \stackrel{{\scriptstyle\sigma^{-j+1}}}{{\longmapsto}}\$ $(x_{j}x_{j+1}\,v,\ u^{\rm rev}\,y)\$ $\stackrel{{\scriptstyle\tau_{1,2}\times{\bf 1}}}{{\longmapsto}}\$ $(x_{j+1}x_{j}\,v,\ u^{\rm rev}\,y)\$ $\stackrel{{\scriptstyle\sigma^{j-1}}}{{\longmapsto}}\$ $(u\,x_{j+1}x_{j}\,v,\ y)$ .

Here, $u^{\rm rev}$ denotes the reverse of $u$ . $\Box$

Proof of Theorem 1.1:

By Lemma 3.5 the word problem of $nV$ belongs to coNP.

By Theorem 4.21 the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ is coNP-hard. By Lemma 4.23, the word problem of $V$ over $\Gamma_{\!V}\cup\tau$ , reduces to the word problem of $2V$ over a finite generating set; this reduction is the one-one reduction that replaces every generator $\gamma\in\Gamma_{\!V}$ by $\gamma\times{\bf 1}$ , and replaces $\tau_{j,j+1}$ by $\,\sigma^{j-1}\circ(\tau_{1,2}\times{\bf 1})\circ\sigma^{-j+1}$ , as in Lemma 4.23. We can include the set $\{\gamma\times{\bf 1}:\gamma\in\Gamma_{\!V}\}\cup\{\sigma\}$ into the finite generating set of $2V$ , or we can express all the elements of this set by a finite set of strings over some other finite generating set of $2V$ . Thus the word problem of $2V$ over a finite generating set is coNP-hard.

To show that word problem of $nV$ (for $n>2$ ) over a finite generating set is coNP-hard, we use the fact that $2V$ is a finitely generated subgroup of $nV$ , and apply Lemma 4.4(2). $\Box$

Remark on the distortion of $V$ in $2V$ : Burillo and Cleary [18] show that $V$ is exponentially distorted in $2V$ (when both $V$ and $2V$ are over finite generating sets). In Lemma 4.23 we proved that $\tau_{j-1,j}$ has linear word length in $2V$ (as a function of $j$ ); but $\tau_{j-1,j}$ has exponential word length in $V$ over any finite generating set. This, again, shows that the distortion of $V$ in $2V$ is at least exponential.

Indeed, for all $j\geq 2$ , $\,\tau_{j-1,j}$ has a table $ux_{j-1}x_{j}\in\{0,1\}^{j}\longmapsto ux_{j}x_{j-1}\in\{0,1\}^{j}$ (see the beginning of subsection 4.4). It follows from Lemma 2.2 that the table of $\tau_{j-1,j}$ is maximally extended. So, $\tau_{j-1,j}$ has table-size $2^{j}$ . Therefore, by [5, Thm. 3.8], the word length $|\tau_{j-1,j}|_{{}_{V}}$ of $\tau_{j-1,j}$ in $V$ (over any finite generating set) satisfies $\alpha\,2^{j}\,\leq\,|\tau_{j-1,j}|_{{}_{V}}\,\leq\,\beta\,j\,2^{j}$ (for some constants $\alpha,\beta>0$ ). On the other hand, the embedding of $V$ into $2V$ , used in Lemma 4.23, represents $\tau_{j-1,j}$ by a word of length $2j-3$ .

“Why” is the word problem of $2V$ coNP-complete? The table-size of an element $f\in 2V$ can be exponentially larger than the word length of $f$ (over a finite generating set); hence, the polynomial-time algorithm for the word problem of $V$ (consisting of simply composing the tables of the generators) turns into an exponential-time algorithm in $2V$ . In $V$ we have the table-size formula $\,|{\rm domC}(f_{2}\circ f_{1})|\leq|{\rm domC}(f_{2})|+|{\rm domC}(f_{1})|$ ; in $2V$ there is no such formula. However, the length-formula of Lemma 3.2 implies rather directly that the word problem of $2V$ belongs to coNP.

The coNP-hardness is less intuitive. The proof that $V$ (over $\Gamma_{\!V}\cup\tau$ ) can simulate circuits is intuitive (if tedious). The commutation test, reducing a generalized word problem to a word problem, is less intuitive, and it is a priori not related to computing. The alternative proof of coNP-hardness of the word problem of $V$ over $\Gamma\cup\tau$ is derived from Jordan’s theorem, which is itself based on Barrington’s theorem; the latter has always been considered a surprising result.

Using the shift to represent the infinite set $\tau$ by a finite set is easy. But the shift is not a circuit element (although it has a computational meaning, namely, as an operation in multi-stack machines).

Acknowledgement: I would like to thank the referee for a thorough reading of the paper.

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] D.A. Barrington, “Bounded-width polynomial-size branching programs recognize exactly those languages in NC 1 superscript NC 1 {\rm NC}^{1} ”, J. of Computer and System Sciences 38.1 (1989) 150-164.
2[2] C. Bennett, “Logical reversibility of computation”, IBM J. Research and Development 17 (1973) 525-532.
3[3] C. Bennett, “Time/Space tradeoffs for reversible computation”, SIAM J. of Computing 18 (1989) 766-776.
4[4] J.C. Birget, A. Ol’shanskii, E. Rips, M.V. Sapir, “Isoperimetric functions of groups and computational complexity of the word problem”, Annals of Mathematics 156.2 (Sept. 2002) 467-518.
5[5] J.C. Birget, “The groups of Richard Thompson and complexity”, International J. of Algebra and Computation 14(5,6) (Dec. 2004) 569-626. Preprint: https://arxiv.org/abs/math/0204292
6[6] J.C. Birget, “Circuits, co NP-completeness, and the groups of Richard Thompson”, International J. of Algebra and Computation 16(1) (Feb. 2006) 35-90. Preprint: https://arxiv.org/abs/math/0310335
7[7] J.C. Birget, “Factorizations of the Thompson-Higman groups, and circuit complexity”, International J. of Algebra and Computation 18.2 (2008) 285-320.
8[8] J.C. Birget, “Monoid generalizations of the Richard Thompson groups”, J. of Pure and Applied Algebra 213(2) (2009) 264-278.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The word problem of the Brin-Thompson group is coNP-complete

Abstract

1 Introduction

Theorem 1.1

2 Definition of nVnVnV based on strings

2.1 Definition of VVV based on strings

Definition 2.1

Lemma 2.2

Lemma 2.3

2.2 Right ideals of nA∗nA^{*}nA∗

Definition 2.4

Lemma 2.5

Notation 2.6

Lemma 2.7

Lemma 2.8

Lemma 2.9

Lemma 2.10

Lemma 2.11

Definition 2.12

P0P_{0}P0​ is a a working copy of PPP

for a deterministic algorithm, pick the first vvv

Proposition 2.13

Corollary 2.14

Proposition 2.15

Definition 2.16

Proposition 2.17

Proposition 2.18

2.3 Right ideal morphisms of nA∗nA^{*}nA∗, and string-based

Definition 2.19

Lemma 2.20

Lemma 2.21

Definition 2.22

Definition 2.23

Lemma 2.24

Lemma 2.25

Proposition 2.26

Lemma 2.27

Definition 2.28

Lemma 2.29

3 The word problem of nVnVnV is in coNP

Definition 3.1

Proposition 3.2

Corollary 3.3

Lemma 3.4

Lemma 3.5

4 coNP-completeness of the word problem of nVnVnV

4.1 Preliminaries on the word problem and complexity

Definition 4.1

Definition 4.2

Definition 4.3

Lemma 4.4

Definition 4.5

Lemma 4.6

Definition 4.7

Lemma 4.8

4.2 Circuits and the Thompson group VVV

Lemma 4.9

Definition 4.10

Lemma 4.11

Theorem 4.12

4.3 Reduction to a generalized word problem of VVV

Definition 4.13

Lemma 4.14

Lemma 4.15

Corollary 4.16

4.4 Reduction to the word problem of VVV

Lemma 4.17

Lemma 4.18

Lemma 4.19

Lemma 4.20

Theorem 4.21

4.5 Alternative proof of coNP-completeness

Theorem 4.22

2 Definition of $nV$ based on strings

2.1 Definition of $V$ based on strings

2.2 Right ideals of $nA^{*}$

$P_{0}$ is a a working copy of $P$

for a deterministic algorithm, pick the first $v$

2.3 Right ideal morphisms of $nA^{*}$ , and string-based

3 The word problem of $nV$ is in coNP

4 coNP-completeness of the word problem of $nV$

4.2 Circuits and the Thompson group $V$

4.3 Reduction to a generalized word problem of $V$

4.4 Reduction to the word problem of $V$

4.6 The shift, and the word problem of $nV$