The word problem of the Brin-Thompson group is coNP-complete
J.C. Birget

TL;DR
This paper establishes that the word problem for the Brin-Thompson groups nV and the Thompson group V is coNP-complete, highlighting the computational complexity of these algebraic problems.
Contribution
It proves coNP-completeness of the word problem for nV groups for all n ≥ 2 and for Thompson group V over a specific generator set, advancing understanding of their computational complexity.
Findings
Word problem of nV is coNP-complete for all n ≥ 2
Word problem of Thompson group V over certain generators is coNP-complete
Highlights computational complexity of these algebraic problems
Abstract
We prove that the word problem of the Brin-Thompson group nV over a finite generating set is coNP-complete for every n \ge 2. It is known that the groups nV are an infinite family of infinite, finitely presented, simple groups. We also prove that the word problem of the Thompson group V over a certain infinite set of generators, related to boolean circuits, is coNP-complete.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Geometric and Algebraic Topology · Computability, Logic, AI Algorithms
The word problem of the Brin-Thompson group is coNP-complete
J.C. Birget
( 10.ii.2020)
Abstract
We prove that the word problem of the Brin-Thompson group over a finite generating set is coNP-complete for every . It is known that is an infinite family of infinite, finitely presented, simple groups. We also prove that the word problem of the Thompson group over a certain infinite set of generators, related to boolean circuits, is coNP-complete.
1 Introduction
The group was introduced by Brin [14] as an -dimensional generalization of Richard Thompson’s group , for any positive integer (with ).
Brin proved that is finitely generated and simple, that is not isomorphic to [14], that is finitely presented [15], and that all are simple [16]. Hennig and Mattucci [24] show that all are finitely presented. Bleak and Lanoue [11] show that all are non-isomorphic. In short, the groups are an infinite family of infinite, finitely presented, simple groups.
The word problem of is decidable, as is easy to see from the definition of . The main result of the present paper is the following.
Theorem 1.1
The word problem of over any finite generating set is coNP-complete, for all .
Remarks on the theorem:
This is only the second example of a finitely presented group with coNP-complete word problem; the first example appeared in [6]. This is also the first “naturally occurring” example of a finitely presented group with either NP-complete or coNP-complete word problem. The proof of Theorem 1.1 strengthens the connection between acyclic circuits and finite group presentations; such a connection already played a crucial role in [6].
The Theorem implies that if then the Dehn function of (for ) has no polynomial upper bound; more strongly, cannot be embedded into a finitely presented group with polynomially bounded Dehn function (by [39, 4]).
The Theorem implies that if then is not embeddable into . It was proved recently [33, Coroll. 11.20] that does not embed into for any .
The groups for are the first examples of finitely presented simple groups whose word problem is harder than P (if ).111 The Higman-Thompson groups have their word problem in P (in fact in coCFL, by Lehnert and Schweitzer [30]). For other currently known finitely presented infinite simple groups (Meier [35, 36], Röver [38], Burger and Mozes [17], Lodha [31]), the complexity of the word problem has not been studied, but appears to be in P. Finitely presented infinite simple groups are related to the Boone-Higman theorem [13]. In [13] the authors ask whether their theorem can be strengthened as follows: Does a finitely generated group have a decidable word problem iff is embeddable into a finitely presented simple group? In contrast, it was observed in [6, Section 1] that all known finitely presented simple groups have a word problem of very low complexity; even coNP is a low complexity class on the scale of all decidable problems. The enormous gap between what is asked, and what has been observed so far motivates the following.
Question: Are the computational complexities of the word problems of all finitely presented simple groups unbounded?
More precisely, the negation of the question is: Is there a time-constructible total function such that the word problems of all finitely presented simple groups belong to ? (See e.g. [26] for the definitions of “time-constructible” and “”.) In case of a negative answer, the Boone-Higman question also has a negative answer. If the answer is positive then there is a chance that the Boone-Higman question has a positive answer; in that case, the proof of the answer to the Question above might be easier than the proof of a strengthened Boone-Higman theorem, and could be a useful step along the way.
Overview: In section 2 we define the Higman-Thompson groups and the Brin-Thompson groups and by (partial) actions on finite strings, or -tuples of strings. For this, the concept of prefix code of strings is generalized to the concept of joinless code of -tuples of strings. For the study of the computational complexity of the word problem, the string-based formalism is more convenient than the geometric approach. It follows fairly directly that the word problem of over a finite generating set belongs to coNP (section 3).
The proof of coNP-hardness is given in section 4. It goes through several steps, following the same strategy as the first half of [6] (where it was proved that a certain subgroup of , over a certain infinite generating set, has a coNP-complete word problem. Based on this we show that the Thompson group , over a certain infinite generating set, has a coNP-complete word problem. This infinite generating set of consists of a finite generating set, together with all the bit-position transpositions (where \,x_{1}\,\ldots\,x_{i-1}\,x_{i+1}\,x_{i}\,x_{i+2}\,\ldots\). An alternative approach, based on bijective circuits and the work of Jordan [27], is described in subsection 4.5. Finally, we show that can be expressed by and the shift . This reduces the word problem of , over an infinite generating set that includes position transpositions, to the word problem of over a finite generating set (subsection 4.6).
Summary of abbreviations and notations:
– The word function in this paper means partial function. The domain of a function is denoted by , and the image by . Most often, the sets and will be free monoids , or Cantor spaces , or their direct powers or .
– , the free monoid freely generated by , a.k.a. the set of all strings over ;
– , the empty string;
– , the free semigroup; ;
– , the length of the string ;
– , () is a prefix of ();
– , is prefix-comparable with ;
– , , the -fold cartesian product X, respectively X;
– , the -tuple of empty strings;
– , the unique minimum generating set of the monoid ;
– , \max\{|x_{1}|,\,\ldots,|x_{n}|\}\ if ;
– , () is an initial factor of ();
– dag, directed acyclic graph;
– , the restriction of a function to a set .
2 Definition of based on strings
The standard definitions in computational complexity require strings as inputs. Brin’s original definition of uses geometric actions, but for the proof of coNP-completeness of the word problem of we also need a (partial) action of on -tuples of strings. The groups are generalizations of . We first look at
2.1 Definition of based on strings
The group can be defined in many ways; see e.g. [44, 34, 45, 25, 41, 19]. We will mostly use two definitions of from [5] (which are is similar to [41], except that we use the terminology of prefix codes, right ideals, and right-ideal morphisms).
We recall some standard notions. An alphabet is any finite set, although we mostly use (the bits), and for any integer . For an alphabet and , denotes the set of sequences of length over (called set of strings of length ), and for we say that (i.e., the length of is ); is the set of strings of length . The empty string is denoted by , and . The set of all strings over is denoted by , and the set of all infinite strings indexed by the ordinal is denoted by . By default a “string” is finite; for infinite strings we explicitly say “infinite”. For the concatenation is denoted by or ; it has length . For two subsets , we define the concatenation by and .
For we say that is a prefix of iff ; this is denoted by . Two strings are called prefix-comparable (denoted by ) iff or . A prefix code (a.k.a. a prefix-free set) is any subset such that for all : implies . A right ideal of is, by definition, any subset such that . A subset is said to generate as a right ideal iff . It is easy to prove that every finitely generated right ideal is generated by a unique finite prefix code, and this prefix code is the minimum generating set of the right ideal (with respect to ). By definition, a maximal prefix code is a prefix code that is not a strict subset of any other prefix code of . An essential right ideal is, by definition, a right ideal such that all right ideals of intersect (i.e., have a non- intersection with ). It is well known (see e.g. [5, Lemma 8.1]) that a right ideal is essential iff the unique prefix code that generates is maximal.
A right ideal morphism of is, by definition, a function such that for all and all : . In that case, is a right ideal; one easily proves that is also a right ideal. The prefix code that generates is denoted by , and is called the domain code of ; the prefix code that generates is denoted by , and is called the image code. We are interested in the following monoid:
is a right ideal morphism of such that is injective, and
and are finite maximal prefix codes}.
We usually write for since we usually just deal with one alphabet at a time. It is proved in [5, Prop. 2.1] that every is contained in a unique -maximum right ideal morphism in ; this is called the maximum extension of . The Higman-Thompson group (where ) is a homomorphic image of :
Definition 2.1
(Thompson group and Higman-Thompson groups ).* The Thompson group , as a set, consists of the right ideal morphisms that are maximum extensions in . The multiplication in consists of composition, followed by maximum extension.*
The same definition for with yields the Higman-Thompson group for every ; .
Every element (and in particular, every ) is determined by the restriction of to . This restriction is a finite bijection, called the table of [25]. Obviously, () determines and hence a unique table. When we use tables we do not always assume that is a maximum extension. The well known tree representation of is obtained by using the prefix trees of and .
Lemma 2.2
Let be finite maximal prefix codes. The right ideal morphism determined by a table : can be extended iff there exist such that for every : , , and .
In that case, can be extended by defining . The table for this extension is obtained be replacing by , by , and by .
This is called an extension step of the table .
Proof. See [5, Lemma 2.2] and [25].
Since in an extension step the cardinality of decreases, only finitely steps are needed to reach the maximum extension of ; the number of steps is .
Based on the representation of the elements of (and of ) by tables, one can show easily that the word problem of these groups is in P. A much stronger result is that the word problem is in coCFL (the set of languages whose complement is context-free) [30]; coCFL is a strict subclass of the parallel complexity class , which is a subclass of P (see e.g., [23]).
The definition of : Maximality of finite prefix codes has the following characterization in terms of . A finite prefix code is maximal iff . (This is not true for infinite prefix codes; a counter example is .)
It follows that every element determines a permutation of . Conversely, let be a finite maximal prefix code. Then for every there exists a unique and such that . Let be a permutation of for which there exists a table : such is defined by (for every and ). Then .
Thus, can be defined as a certain group of permutations of .
Lemma 2.3
Let : and : be two tables that determine, respectively, the right ideal morphisms . Then the following are equivalent:
(1)* and determine the same element of (by maximum extension); *
(2)* and have the same maximum extension in ; *
(3)* and have a common restriction in ; *
(4)* and have a common restriction to an essential right ideal of ; *
(5)* and determine the same function on ; *
(6)* and determine the same function on .*
Proof. (1) and (2) are equivalent by the definition of . (2) implies (3) (which implies (4)): The intersection is a common restriction; by [5, Lemma 8.3], is an essential right ideal. Moreover, ; hence is finite. (4) implies (2) by uniqueness of maximum extensions in (see [5, Lemma 2.1], which does not require finiteness of prefix codes). (3) implies (5) in an obvious way. And (5) implies (1), based on finiteness and uniqueness of maximum extension. (5) and (6) are obviously equivalent.
The piecewise linear definition of : Brin’s definition of extends the definition of as given in [19]; the latter is based on piecewise linear actions on the interval . We use half-open intervals, so neighboring intervals do not intersect; however, when the right boundary is 1, we use “”. The boundary-points of the subintervals that appear are binary rational numbers (i.e., the denominator is a power of 2). A string with determines the half-open subinterval ; but if then we take , i.e., in that case we close the interval. Here, is a rational number written in fractional binary representation; i.e., . E.g., 01100 (of length 5) determines the subinterval .
2.2 Right ideals of
Here we completely develop the string description of , which is briefly alluded to in [14, subsection 4.3]. A hybrid string-geometric description was used in [11] (where some crucial concepts appear only in geometric form). Our description is entirely based on strings, but the correspondence with geometric concepts is often pointed out. The present subsection focuses on finitely generated right ideals of ; in the next subsection, will be defined based on right-ideal morphisms of .
As before, let be a finite alphabet of cardinality , usually denoted by or . The -fold cartesian product X will be denoted by ; we choose this notation in analogy with the notation , and also in order to avoid confusion with -fold concatenation (of the form ). Similarly, denotes the -fold cartesian product X. Multiplication in is done coordinatewise, i.e., is the direct product of copies of the free monoid . For we denote the coordinates of by , for ; i.e., .
Geometrically: represents the hyperrectangle X (except that “” is replaced by “” if ). The measure of this hyperrectangle is . In particular, represents and has measure 1.
The concept of prefix is similar to the one in , but in order to avoid confusion we will use the phrase “initial factor”. So the initial factor order on is defined for by iff there exists such that . In a similar way we have the concepts of comparability (denoted by ), right ideal, generating set of a right ideal, and essential right ideal. It is easy to prove that in iff for all . For any there exists a unique -maximum common initial factor, denoted by . In terms of coordinates, , where is the longest common prefix of the strings and .
An initial factor code is a set such that no two different elements of are -comparable.
As we shall see, a crucial way in which with differs from concerns the join operation with respect to . For all , the join of is defined by and . Of course, does not always exist.
Definition 2.4
A set is joinless iff no two elements of have a join with respect to in . Joinless sets will be called joinless codes, since they are necessarily initial factor codes.
A set is a maximal joinless code iff is -maximal among the joinless codes of . (In other words, adding a new element to a maximal joinless code results in a set, some of whose elements have joins.)
A right ideal is called joinless generated iff is generated, as a right ideal, by a joinless code.
(About the grammar: “Joinlessly generated” would not make sense since it is not the generating process that is joinless.)
Examples: Not every initial factor code is joinless; e.g., is an initial factor code where . An example of a maximal joinless code is . A maximal joinless code is usually not maximal as an initial factor code; for example, in one could add ; the result would be a initial factor code (that is not joinless). The only maximal joinless code that is also maximal as an initial factor code is .
From here on, a joinless code will be called maximal if it is maximal as a joinless code.
Connection with the geometric description: For we have iff the hyperrectangle is contained in the hyperrectangle (i.e., corresponds to ); note that “shorter” -tuples correspond to “larger” hyperrectangles. The join represents the hyperrectangle obtained by intersecting the hyperrectangles and (so corresponds to ). Note that does not exist iff the intersection is the empty set (since the empty set is not a hyperrectangle). The meet (which always exists) does not represent the union, nor the smallest hyperrectangle that contains and , but the smallest hyperrectangle representable by an -tuple in that contains and . Joinlessness of a code means that any two hyperrectangles in the chosen subdivision of are disjoint as sets. A joinless code is maximal iff its hyperrectangles form a tiling of . In an initial factor code, -incomparability means that no hyperrectangle in the code is contained in another one.
Examples (for the correspondence between strings and geometry): Fig. 1 shows a few elements of . The large square is represented by . The numbers use fractional binary representation; e.g., .
represents the rectangle (horizontally hashed); and represents (vertically hashed). The join represents (doubly hashed). represents .
The rectangle is represented by (horizontally hashed). And is represented by (vertically hashed). Here, does not exist, and the meet represents .
\begin{picture}(110.0,60.0)\par\put(5.0,10.0){\framebox(40.0,40.0)[]{}} \put(5.0,10.0){\framebox(20.0,20.0)[]{}} \par\par\put(-2.0,0.0){\makebox(0.0,0.0)[cc]{\sf Fig. 1}} \par\put(5.0,10.0){\framebox(20.0,10.0)[]{}} \put(15.0,10.0){\framebox(5.0,20.0)[]{}} \par\put(25.0,42.5){\framebox(10.0,2.5)[]{}} \put(25.0,45.0){\framebox(20.0,2.5)[]{}} \par\put(25.0,40.0){\framebox(20.0,10.0)[]{}} \par \par\put(5.0,10.0){\line(1,0){20.0}}\put(5.0,10.5){\line(1,0){20.0}}\put(5.0,11.0){\line(1,0){20.0}}\put(5.0,11.5){\line(1,0){20.0}}\put(5.0,12.0){\line(1,0){20.0}}\put(5.0,12.5){\line(1,0){20.0}}\put(5.0,13.0){\line(1,0){20.0}}\put(5.0,13.5){\line(1,0){20.0}}\put(5.0,14.0){\line(1,0){20.0}}\put(5.0,14.5){\line(1,0){20.0}}\put(5.0,15.0){\line(1,0){20.0}}\put(5.0,15.5){\line(1,0){20.0}}\put(5.0,16.0){\line(1,0){20.0}}\put(5.0,16.5){\line(1,0){20.0}}\put(5.0,17.0){\line(1,0){20.0}}\put(5.0,17.5){\line(1,0){20.0}}\put(5.0,18.0){\line(1,0){20.0}}\put(5.0,18.5){\line(1,0){20.0}}\put(5.0,19.0){\line(1,0){20.0}}\put(5.0,19.5){\line(1,0){20.0}} \put(15.0,10.0){\line(0,1){20.0}}\put(15.5,10.0){\line(0,1){20.0}}\put(16.0,10.0){\line(0,1){20.0}}\put(16.5,10.0){\line(0,1){20.0}}\put(17.0,10.0){\line(0,1){20.0}}\put(17.5,10.0){\line(0,1){20.0}}\put(18.0,10.0){\line(0,1){20.0}}\put(18.5,10.0){\line(0,1){20.0}}\put(19.0,10.0){\line(0,1){20.0}}\put(19.5,10.0){\line(0,1){20.0}} \par\put(25.0,42.5){\line(1,0){10.0}}\put(25.0,43.0){\line(1,0){10.0}}\put(25.0,43.5){\line(1,0){10.0}}\put(25.0,44.0){\line(1,0){10.0}}\put(25.0,44.5){\line(1,0){10.0}} \put(25.0,45.0){\line(0,1){2.5}}\put(25.5,45.0){\line(0,1){2.5}}\put(26.0,45.0){\line(0,1){2.5}}\put(26.5,45.0){\line(0,1){2.5}}\put(27.0,45.0){\line(0,1){2.5}}\put(27.5,45.0){\line(0,1){2.5}}\put(28.0,45.0){\line(0,1){2.5}}\put(28.5,45.0){\line(0,1){2.5}}\put(29.0,45.0){\line(0,1){2.5}}\put(29.5,45.0){\line(0,1){2.5}}\put(30.0,45.0){\line(0,1){2.5}}\put(30.5,45.0){\line(0,1){2.5}}\put(31.0,45.0){\line(0,1){2.5}}\put(31.5,45.0){\line(0,1){2.5}}\put(32.0,45.0){\line(0,1){2.5}}\put(32.5,45.0){\line(0,1){2.5}}\put(33.0,45.0){\line(0,1){2.5}}\put(33.5,45.0){\line(0,1){2.5}}\put(34.0,45.0){\line(0,1){2.5}}\put(34.5,45.0){\line(0,1){2.5}}\put(35.0,45.0){\line(0,1){2.5}}\put(35.5,45.0){\line(0,1){2.5}}\put(36.0,45.0){\line(0,1){2.5}}\put(36.5,45.0){\line(0,1){2.5}}\put(37.0,45.0){\line(0,1){2.5}}\put(37.5,45.0){\line(0,1){2.5}}\put(38.0,45.0){\line(0,1){2.5}}\put(38.5,45.0){\line(0,1){2.5}}\put(39.0,45.0){\line(0,1){2.5}}\put(39.5,45.0){\line(0,1){2.5}}\put(40.0,45.0){\line(0,1){2.5}}\put(40.5,45.0){\line(0,1){2.5}}\put(41.0,45.0){\line(0,1){2.5}}\put(41.5,45.0){\line(0,1){2.5}}\put(42.0,45.0){\line(0,1){2.5}}\put(42.5,45.0){\line(0,1){2.5}}\put(43.0,45.0){\line(0,1){2.5}}\put(43.5,45.0){\line(0,1){2.5}}\put(44.0,45.0){\line(0,1){2.5}}\put(44.5,45.0){\line(0,1){2.5}} \par\end{picture}
Fig. 1
For , exists in iff and have a common upper bound for . This holds iff ; in that case, if , and if . Hence in , prefix codes are the same thing as joinless codes. This is not the case for with ; here, joinless codes are a special case of initial factor codes, and the join is characterized as follows:
Lemma 2.5
(join for in ).* For all , the following are equivalent:*
(1)* the join (with respect to ) exists; *
(2)* and have a common upper bound for , i.e., ; *
(3)* for all : u_{i}\parallel_{\rm pref}v_{i}\ in .*
Moreover, if exists, then
(u\vee v)_{i}\ =\ \left\{\begin{array}[]{ll}u_{i}&\ \ \ \mbox{if ,v_{i}\leq_{\rm pref}u_{i},A^{}),}\\ v_{i}&\ \ \ \mbox{if ,u_{i}\leq_{\rm pref}v_{i},A^{}).}\end{array}\right. **
In other words, if exists then , and .
So, in the relation is not equivalent to coordinatewise ; the latter is equivalent to the existence of a join; implies (but is not equivalent to) existence of a join.
Proof. [(1) (2)] is obvious. [(2) (3)] is straightforward: If and for some then for some . Hence, , so in .
[(3) (1)] Suppose for all . Then for some , and for the other . Hence, if (in ), and otherwise; so exists.
Notation 2.6
Let A_{\varepsilon,n}\ =\ .
Note that is the unique minimum generating set of as a monoid; the cardinality is .
Lemma 2.7
.
(1)* Every right ideal is generated, as a right ideal, by a unique initial factor code. (Finiteness of generating sets is not assumed here.)*
(2)* If a right ideal is generated by a joinless code then the unique initial factor code that generates is joinless.*
Proof. Let . We claim that is an initial factor code that generates , and that is the unique such initial factor code. (We closely follow the proof of [8, Lemma 8.1(1)].)
Let us show that generates . Obviously, since , we have . Conversely, to show that , consider any . In , has only finitely many initial factors, hence there exists a (not necessarily unique) which is an initial factor of and is -minimal in . So for some . And , otherwise there would exist for some , which would contradict that is -minimal in . Hence .
To show that is an initial factor code, let and suppose for some . If then , contradicting the assumption that (). So, .
To prove uniqueness of the initial factor code that generates , we generalize the proof of [8, Lemma 8.1(1’)]. If for two initial factor codes , then for every there exists such that (for some ). Also, there is such that (for some ). Hence , which implies , since is an initial factor code. Thus, . Therefore, . Similarly we have , so .
Part (2) follows immediately from the uniqueness of the initial factor code that generates .
Lemma 2.8
Let be a finite maximal joinless code. Then every has a unique initial factor in ; i.e., .
Proof. If there were two different initial factors of in then and would be initial factors of a finite initial factor of ; hence and would have a join, contradicting that is joinless. This shows uniqueness.
Let us show existence. Since is a maximal joinless code, every initial factor of has a join with some element of . Let us pick so that its coordinates (in ) are longer than all the coordinates of the elements of . Then the element of that has a join with is an initial factor of .
Lemma 2.9
Let be any finite joinless code, and let be the right ideal generated. (Recall that by Lemma 2.7, is uniquely determined by .) Then the following are equivalent:
(1)* is an essential right ideal;*
(2)* is maximal as a joinless code;*
(3)* ;*
(4)* .*
Proof. Suppose is a finite joinless code. Then is maximal joinless iff every has a join with some element of (as follows directly from the definition of maximality). This is equivalent to the property that every monogenic right-ideal of intersects ; i.e., is essential.
If , then every has an initial factor in . It follows that for every right ideal , . Hence intersects . So, is essential.
Suppose is a finite maximal joinless code. Let , and for any , let be the initial factor of in . Then has a join with some . Since is finite, is an initial factor of if each of is larger than . Hence, is an initial factor of , so . Since for every such a exists (by Lemma 2.8), we conclude that .
The equivalence of (3) and (4) is obvious since , so .
Remark. Lemma 2.9 only talks about joinless generated right ideals. Indeed, an essential finitely generated right ideal in is not necessarily joinless generated. An example for is
.
It is easy to prove that is essential, and that is an initial-factor code that is not joinless (since exists). By Lemma 2.7, this initial factor code is unique, i.e., is not generated by any other initial-factor code; hence is not joinless generated.
Section 5 of version 1 of [10] gives a detailed proof (independently of Lemma 2.7) that is essential in , and that is not generated (as a right ideal) by any finite joinless code.
DAGs and : The following generalizes the well known concepts of the tree of and the tree of a prefix code. We abbreviate directed acyclic graph by dag. A few definitions: The leaves of a dag are the vertices of out-degree 0; all the other vertices are interior vertices. For a dag , the sub-dag spanned by the interior vertices of is called the interior dag of . The sources of a dag are the vertices of in-degree 0; if there is only one source, and all vertices are reachable from this source, this source is called the root, and the dag is then called rooted. The depth of a vertex in a rooted dag is defined to be the length of the shortest path from the root to ; by “path” we will always mean a directed path.
The dag of is the infinite rooted dag with vertex set and root ; the edges are the ordered pairs such that there exists and with (where ). Hence every vertex has () children; see Notation 2.6. And iff there exists a directed path from to in the dag. It is easy to show that the depth of a vertex in the dag of is .
The dag of is the right Cayley graph of the monoid over the generating set .
For any finite subset we define the initial factor dag of (also called the -dag): This is a finite rooted subdag of the dag of ; the root of the -dag is the root of the dag of ; the vertices and edges are those vertices, respectively edges, of the dag of that appear on any path from the root to any vertex in . Hence the vertices of the -dag are all the initial factors of the elements of (so the -dag is uniquely determined by ). The set of leaves of the -dag is iff is an initial factor code.
Note that the trees and dags considered here are not ordered trees or dags; i.e., the children of a vertex are defined as a set, not a sequence; similarly, the leaves form a set, not a sequence.
Lemma 2.10
Let be a finite maximal joinless code such that . Let be any leaf of the interior dag of the dag of , and let be the set of children of in the -dag; so , and is non-empty (since is an interior vertex).
(0)* Then satisfies*
v_{+}\ \subseteq\* \ =\ ,*
for some ; and is unique (for a given and ).
(1)* For , part (0) holds with equality for every leaf of the interior dag: .*
(2) (Lawson and Vdovina [29, Thm. 12.11], but with a different formalism.)* For and , part (0) holds with equality for some maximum-depth leaf of the interior dag of :*
* or .*
However, equality does not necessarily hold for every interior leaf, not even for every interior leaf of maximum depth.
(3) (Lawson and Vdovina [29, Ex. 12.8])* For there exist finite maximal joinless codes for which the inclusion in part (0) is strict. I.e., for every leaf of the interior dag and for every :*
v_{+}\ \neq\* .*
Proof. (0) Since is interior without having interior children, it contains a least one child in , of the form , for some , .
Any possible child of belongs to . If, in addition to , had an additional child of the form with (for any ), then would not be joinless. Indeed, these two children have the join (if ), or (if ). This shows that all children of belong to for one particular (depending on ).
(1) For the Lemma is folklore knowledge.
(2) (This result is equivalent to [29, Thm. 12.11], but the proof given here is rather different.)
Here . Let be a maximum-depth leaf of the interior dag of . Since is an interior leaf, at least one of its children is in . Hence either or , for some .
Let us assume that and that ; the other cases are very similar. Since has maximum depth in the interior dag, has maximum depth in .
If it is also the case that , then , and the Lemma holds. Therefore, from here on we only consider the situation where (but ). Then there exists with , such that has a join with . By Prop. 2.5, this is equivalent to and .
This leads to four cases.
Case 1: and .
Then and . Since is a leaf of the interior dag of , and , it follows that is a child of . Since , it follows that is of the form for some . But then () has a join with , contradicting the fact that is joinless. So, case 1 is ruled out.
Case 2: and ; since , at least one of these is strict.
Case 2.1: and :
Then is interior, since is interior. But being an interior vertex contradicts the assumption that . So case 2.1 is ruled out.
Case 2.2: and :
Then , for some and . But now has greater depth than , which has maximum depth in . So case 2.2 is ruled out.
Case 3: and ; since , at least one of or is strict.
Case 3.1: and .
Then , and for some ; so . But then exists, contradicting the fact that . So case 3.1 is ruled out.
Case 3.2: and .
Then has greater depth than , contradicting the fact that has maximum depth in . So case 3.2 is ruled out.
Case 4: and .
Then and for some . Since has maximum depth in we have , hence , hence . Moreover, , otherwise , hence , hence , which would imply . In summary this proves:
and .
Notation (used in the remainder of the proof): For any , let denote the bitstring obtained by complementing the right-most bit of . And denotes the bitstring obtained by removing the right-most bit of .
Note that since , if we prove that then the Lemma holds for the interior vertex .
Claim: .
Proof of the Claim: Assume by contradiction that there exists such that , and has a join with . The existence of this join is equivalent to and .
This leads to four cases.
Case 4.1: and . At least one of the is strict.
Case 4.1.1: and .
Then the join exists, contradicting the fact that and belong to . So case 4.1.1 is ruled out.
Case 4.1.2: and .
Then and for some . The latter equality implies that . Recall that in case 4, ; this and imply that , hence . Now, since and , the join exists, which contradicts the fact that and belong to . So case 4.1.2 is ruled out.
Case 4.2: and .
Since has maximum depth in , and has the same depth, it follows that . This contradicts the assumption . So case 4.2 is ruled out.
Case 4.3: and .
Case 4.3.1: and (since , equality in the first coordinate implies strictness in the second).
Then , i.e., has greater depth than , which contradicts the fact that has maximum depth in . So case 4.3.1 is ruled out.
Case 4.3.2: and .
Then , and , for some . Recall that in case 4. Then exists. This contradicts the fact that and belong to . So case 4.3.2 is ruled out.
Case 4.4: and ; since , or is strict.
Case 4.4.1: and .
Then , hence has greater depth than , which contradicts the fact that has maximum depth in . So case 4.4.1 is ruled out.
Case 4.4.2: and .
Then ; also, (since and only differ in the right-most bit), so , for some . Now, exists. This contradicts the fact that and belong to . So case 4.4.2 is ruled out.
Since we now ruled out all sub-cases of case 4, this completes the proof (by contradiction) of the Claim.
Summary of the proof so far: We have for some maximum-depth vertex in the interior of the -dag. (The cases where, instead, we have or , or in , are similar.)
If we also have then the Lemma holds.
If then there exists that has a join with . Four cases are possible, of which cases 1, 2, and 3 were ruled out. In case 4 we showed that ; hence in case 4, and belong to , i.e., the Lemma holds for the interior vertex .
The following is an example where not every maximum-depth interior leaf has two children in . Consider the maximal joinless code . Here the interior leaf has maximum depth, and has only one child in (namely ). Nevertheless, there is another maximum-depth interior leaf, namely , that has two children in (namely and ).
(3) Example (from [29, Ex. 12.8]): Let . It is easy to verify that is a finite maximal joinless code, and that no leaf of the interior dag has two children in .
Remark about Lemma 2.10: Version 1 of this paper (see [10]) stated incorrectly that “for every and every leaf of the interior dag of : (for some , )”. This statement had to be modified for (from “for every leaf” to “there exists a leaf”), and dropped for . The above counter-example for was given in [28] and [29, Ex. 12.8].
Lemma 2.11
Let be a finite set. For any and , let
* \ \cup\ .*
Then we have:
(1)* is joinless iff is joinless.*
(2)* is a maximal joinless code iff is a maximal joinless code.*
The set is called a one-step restriction of (“restriction” because ); and is called a one-step extension of . Clearly, .
Proof. (1) Let us assume that is joinless. For any with , the join of and , does not exist, since and are not prefix-comparable.
If and were both initial factors of some , then and would also both be initial factors of , contradicting the assumption that is joinless.
Finally, all pairs () are joinless since is joinless. Thus is joinless.
Let us assume that is joinless. Then every pair () is joinless.
If and had a join , then both and would be initial factors of . By Lemma 2.5, for all . We have two cases.
Case 1: (for the used in ).
This is equivalent to being a prefix of . Then is a prefix of as well (for every ), hence and have a join. But this contradicts the assumption that is joinless.
Case 2: , and (for the used in ).
Then is a strict prefix of (), hence is a prefix of for some . It follows that has and as initial factors; this contradicts the assumption that is joinless.
(2) Suppose is a maximal joinless code. Hence, every has a join with some (otherwise could be added to , which would contradict that is maximal joinless). We want to show that also has a join with some element of .
If then , hence also has a join with some .
If , i.e., , then for all . We have two cases.
Case 1: (for the used in ).
This is equivalent to being a prefix of . Then is a prefix of too (for every ), hence and have a join. So, has a join with some element of .
Case 2: , and (for the used in ).
Then is a strict prefix of (), hence is a prefix of for some . It follows that has and as initial factors; this implies that has a join with (for this particular ).
Suppose that is maximal joinless. Then every has a join with some . We want to show that also has a join with some element of .
If for all , then so also has a join with .
If for some , then let be the join of and . Then has and as initial factors, hence is an initial factor of . Hence exists, so has a join with an element of .
The properties of joinless codes given in Lemma 2.11 do not hold for initial factor codes in general. For example, for consider the initial factor code . Then for and we obtain , which is not an initial factor code.
The process of one-step restriction or extension can be iterated, which inspires the following definition and the algorithm.
Definition 2.12
(parse trees).* Let be a finite joinless code. A parse tree of is any subtree of the dag of with the following properties:*
(1)* The root of is (i.e., the root of the dag of ); and the set of leaves of is (i.e., the leaves of the dag of ).*
(2)* For every interior vertex of the set of children in is , for a unique . So has exactly children in .*
Given the dag of and a subtree , it is easy to check whether is a parse tree of ; one just needs to check that occurs in , and that every vertex in is reachable from ; moreover, for each vertex of one checks whether it is in , or whether its set of children is of the form . Recall the dags and trees are not oriented (children and leaves are not ordered).
A maximal joinless code can have more than one parse tree. E.g., the joinless set (1,1)\}\ has the following two parse trees:
Burillo and Cleary [18] give a similar tree description of tilings of , and point out that the tree is not unique.
If is not maximal (as a joinless code) then it has no parse tree (according to our definition of parse tree).
By Lemma 2.10(2), every maximal joinless code in has at least one parse tree. But in with there are maximal joinless codes that have no parse tree, by Lemma 2.10(3); geometrically, codes in without parse tree correspond to tilings of the cube that cannot be obtained by successive bipartitions of cuboids (perpendicularly to an axis). This motivates the following.
Questions: Is there a simple geometric or combinatorial characterization of the finite maximal joinless codes in (for ) that have no parse tree? Is the non-existence of a parse tree equivalent to the presence of one of certain joinless subsets (“forbidden patterns”)? An example of such a forbidden pattern is the subset of Lawson and Vdovina [29], used in 2.10(3).
The following algorithm nondeterministically constructs any parse tree of , if a parse tree exists. If has no parse tree the algorithm will discover this for some (but not all) nondeterministic choices. For a finite joinless code , the deterministic version of the algorithm decides whether is maximal (as a joinless code).
*Outline of the algorithm: * Initially, the algorithm puts into (as its leaf set), and makes a working copy of . The algorithm keeps looking for an initial factor of an element of such that (for some ). When such a is found, it is added to and to ; and is removed from the working copy . If is reached, and put into , the construction of is complete and the algorithm concludes that is maximal (as a joinless code), and that it has a parse tree.
The algorithm can be made deterministic by picking a total order for (e.g., the lexicographic dictionary order), and always picking the first that works.
Notation: denotes the set of strict initial factors of the elements of ; because of strictness (and since is joinless), .
Algorithm
Input: A finite set , given by a list of -tuples of strings in .
Precondition: , and is joinless. (This can easily be checked, by Lemma 2.5.)
Output: A set of vertices and edges of a parse tree of , if has a parse tree;
;
is a a working copy of
; ;
while :
choose any that satisfies the while-condition;
for a deterministic algorithm, pick the first
that works (in a fixed total order)
;
set of all edges from to the elements of ;
\ \cup\ ; # Hence remains joinless.
if :
then output and conclude that is maximal;
else (in case and ) conclude that is not maximal
(and hence has no parse tree).
Proposition 2.13
Let be any finite joinless code in .
Then has a parse tree iff is maximal as a joinless code.
The Algorithm (deterministic version) decides maximality of and finds a parse tree in polynomial time, when is given as a list of pairs of bitstrings.
Proof. The Algorithm uses one-step extensions of maximal joinless codes; by Lemma 2.11, each one-step extension or restriction preserves joinlessness and maximality. Since is a maximal joinless code, it follows that is maximal if the root is reached. It follows also that if the root is reached, a parse tree of exists (and the Algorithm returns such a tree).
Conversely (for and ), if (or, at any later stage, ) is maximal, then by Lemma 2.10(2) there exists in the interior dag such that (or ). And this process does not stop until .
Corollary 2.14
(cardinality of joinless codes).**
Let be any positive integer and any finite alphabet.
(0.1)* For every : X{}_{{}_{i=1}}^{{}^{n}}A^{k_{i}}\ is a maximal joinless code that has a parse tree.*
(0.2)* For any finite joinless code : is maximal iff can be transformed into X by a finite sequence of restriction steps, where for .*
(1)* For every finite maximal joinless code there exists such that*
.
(1.1)* If has a parse tree then can be obtained from by a finite sequence of one-step restrictions. The number of one-step restrictions used is equal to the number of interior vertices of every parse tree of , and is equal to .*
(1.2)* If has no parse tree, then can be obtained from by a finite sequence of one-step restrictions, followed by a finite sequence of one-step extensions.*
Even when has no parse tree, is still the number of interior vertices in any parse tree of any maximal joinless code that has a parse tree and that has the same cardinality as (e.g., of the form where is a prefix code in ).
(2)* Conversely, for all there are maximal joinless codes in of cardinality . In particular, when every positive integer is the cardinality of some maximal joinless code.*
Proof. (0.1) Let X. Let us prove by induction on that has a parse tree. For , the parse tree consists of one vertex. Inductively,
X X
=\ (X)
=\
=\ \big{(}(X X) (\{\varepsilon\}^{i-1}\times A\times\{\varepsilon\}^{n-i-1})\big{)}.
So, is obtained form by one-step restrictions (one one-step restriction for every ). It follows that if has a parse tree then has a parse tree. Moreover, any joinless code that has a parse tree is maximal.
(0.2) Let
, for ; and
.
The fact that can be restricted to follows by induction on :
If then .
If , and is such that for some , then a one-step restriction decreases , as is replaced by .
(1.1) We prove the equivalent statement that from one can reach by ones-step extensions. We use induction on . When then , and the formula holds. For , an extension step can be applied to some leaf of the interior of a parse tree of , by Lemmas 2.10 and 2.11. In this extension step, a new maximal joinless code is obtained; one leaf of the interior the parse tree of becomes a leaf of the parse tree of , so this parse tree of has interior vertices; and . By induction, ; and the latter is equal to . Hence .
(1.2) By applying one-step restrictions as in part (0.2), from any maximal joinless code one can reach X, where is as in part (0.2). And from X one can reach by one-step extensions by (0.1). In any one-step restriction or extension the cardinality of the maximal joinless code increases or decreases by . So, to reach from we can apply restrictions to reach X, then apply extensions to obtain .
(1) The formula follows from (1.1) and (1.2).
(2) For the existence of codes of the given cardinality, take for example , where is any maximal prefix code in , and apply the corresponding result for maximal prefix codes (which is folklore; see e.g. [6, Lemma 9.9(0)]).
Proposition 2.15
*There exist polynomial-time algorithms that on input (a finite set, given by an explicit list of -tuples of strings) decide whether has the following properties:
(1) is joinless;
(2) is maximal as a joinless code.*
Proof. The input to the algorithms is , given as a list of -tuples of strings, so the input size is .
(1) Lemma 2.5, applied to every two elements with , will decide in quadratic time whether is joinless.
(2) For finite joinless codes in , the Algorithm given after Def. 2.12 has polynomial time complexity, in view of Coroll. 2.14 which proves that every parse tree of has size that is linearly bounded in terms of .
For in general, the Algorithm that follows Prop. 2.17, based on the generalized Kraft equality, decides in polynomial time whether a joinless code is maximal.
The corresponding questions about initial factor codes are also decidable in polynomial time. Suppose is finite and given by an explicit list of -tuples of strings. It is easy to decide whether is an initial factor code; it is sufficient to check for every two elements with , whether . A code is maximal as an initial factor code iff for every interior vertex of the -dag, the children of in the -dag are also children of in the -dag.
An algorithm for testing maximality of a joinless code can be derived from the following generalization of the Kraft (in)equality to higher dimensions. We mentioned earlier that in the geometric description of the Brin-Thompson groups, a word represents the hyperrectangle X{}_{{}_{i=1}}^{{}^{n}}[0.x_{i}\,,\ 0.x_{i}+2^{-|x_{i}|}[\ (where we close the intervals whose right-bound is 1). The measure of this hyperrectangle is . More generally, we have the following.
Definition 2.16
Let be an alphabet of cardinality . For every we define the measure
.
For every joinless code (not necessarily finite) we define the measure
.
Proposition 2.17
(-dimensional Kraft (in)equality).* Let be a finite joinless code, where and . Then we have:*
(1)* .*
(2)* is maximal (as a joinless code) iff .*
Proof. This follows from the geometric picture. For a joinless code , all the words in represent non-overlapping hyperrectangles in , so their total measure is at most the measure of , which is 1.
And is maximal iff the corresponding hyperrectangles tile , which is iff the sum of the measures of the hyperrectangles is 1.
Prop. 2.17 probably holds for infinite joinless codes too; but since we don’t need it in that case, we’ll that question open.
Prop. 2.17 leads to the following algorithm.
Algorithm (maximality of a finite joinless code)
Input: A finite set , given as an explicit list of words.
Precondition: is joinless. (This is easily checked, by Prop. 2.15(1).)
Question: Is maximal?
Compute in fractional base- representation;
if , output “yes”;
else, output “no”.
This algorithm runs in polynomial time, in terms of the total input length . In fractional base- representation the sum is easy to compute.
We will need the intersection of joinless generated right ideals, and the elementwise join of joinless codes.
Proposition 2.18
Let be joinless codes.
(1)* The elementwise join , defined by*
,
is a joinless code. (Here, ranges over the joins that exist.)
Hence, .
(2)* and are both maximal (as joinless codes) iff is maximal.*
(3)* (P\vee Q)\cdot(nA^{*})\ =\ .*
Hence, if and are joinless generated then so is .
Proof. (1) Suppose , , and or . Then does not exist, because would have , , , and as prefixes. But either and (if different) or and (if different) do not have a join.
(2) If is maximal then every has a join with some , i.e., and are initial factors of some . Then and are also initial factors of , so and exist. Hence, every has a join with some and some , thus and are maximal.
If is maximal then every has a join with some ; and if is maximal, has a join with some . Hence, , , and , are all initial factors of some word , hence exists. So, every has a join with some , so is maximal.
(3) Every satisfies for some , , and . This implies that and are initial factors of , so exists, and is an initial factor of . Hence, .
If exists then it has and as initial factors, hence .
2.3 Right ideal morphisms of , and string-based
definition of and
Just as for one defines the concepts of right ideal morphism, domain code, and image code in . We only consider domain and image codes that are joinless. Indeed, if is not joinless, some definitions of right ideal morphisms on will be inconsistent. E.g., let , so ; and let and ; then ; so receives two different values.
Before we get to we define the following monoid:
Definition 2.19
.
* \ =\ is a right ideal morphism of such that is injective,*
and and are finite, maximal, joinless codes}* .*
“Maximal” means maximal as a joinless code. Usually we just write when a fixed alphabet is used.
Lemma 2.20
For every : .
Hence, if then , and , .
Proof. For every : , for some and . Since , for some and . Hence, . Thus, . Since is injective, this implies that . Since , which is an initial factor code, and . Hence, . So .
Conversely, if , then for some and . Since and (which is the initial factor code that generates ), we conclude that and . Hence, . So, .
Now satisfies the following: For , iff and . Hence , and , and .
Lemma 2.21
Let and let be a finite set.
(1.1)* If we have: is joinless iff is joinless.*
(1.2)* If and is joinless, we have: is maximal iff is maximal.*
(2.1)* In general (not assuming ), we have:*
* is joinless iff is joinless.*
(2.2)* In general, if is joinless then the following are equivalent:*
* is maximal,*
P\vee{\rm domC}(f)\* is maximal,*
f(P\vee{\rm domC}(f))\* is maximal.*
Proof. (1.1) Let , and assume by contradiction that there exists such that and are initial factors of . Then for some . Hence, ; the latter holds since , and (by Lemma 2.20). Hence, . Similarly, . So, , but that contradicts the assumption that is joinless.
(1.1) Conversely, if some have a join then for some . Then , so exists, hence is not joinless.
(1.2) Suppose is maximal, and assume by contradiction that is not maximal. Then there exists such that is a joinless code. Since , is joinless (by what was proved in the previous paragraph). So, is joinless, which contradicts the assumption that is maximal. Thus, if is maximal then is maximal.
(1.2) Similarly, if is maximal then is maximal (since ). Hence if is maximal, is maximal.
(2.1) If is joinless iff is joinless, by Lemma 2.18(1), since is joinless for all . And is joinless iff is joinless, by (1.1).
(2.2) If is maximal then is maximal by Lemma 2.18(2), since is maximal for . This implies that is maximal, by (1.2). And if is maximal then is maximal, again by (1.2). Moreover, maximality of implies maximality of (and of ), by Lemma 2.18(2).
Every right ideal morphism is uniquely determined by its restriction to ; this is an obvious consequence of the fact that is a right-ideal morphism and is a joinless code. So is determined by the finite function : .
Conversely, let be two finite maximal joinless codes with the same cardinality, and let : by any bijection from onto . Then determines a right ideal morphism of , such that is the restriction of to its domain code; is defined in a unique way by for all , . Since is joinless, is well defined.
Definition 2.22
(table).* A bijection : between finite maximal joinless codes is called a table.*
Tables and right ideal morphisms in determine each other bijectively, and can the treated as “the same thing”.
Every function determines a permutation of , as follows. For any there exists a unique such that for some , by Lemma 2.8. Then we define by
.
The converse does not hold; i.e., is not determined by , as will be seen in Lemma 2.24.
Definition 2.23
(end-equivalence).* Two right ideal morphisms are end-equivalent iff and agree on . This will be denoted by .*
By Prop. 2.18, is generated by a joinless code, namely .
In [9] the congruence is defined in much greater generality, and other congruences are introduced.
Lemma 2.24
For all : iff .
Proof. For every , and are maximal joinless codes. Therefore (by Lemma 2.9): . And by Lemma 2.18, is also a maximal joinless code.
Let , and let and be the restrictions of or to . Then is equivalent to .
Suppose , i.e., , where . For every , let be an initial factor of such that in all coordinates, is longer than the longest coordinate of any element of . And for some . Since is a maximal joinless code, has a join with an element of ; by the chosen length of , has an initial factor in , hence . Now , since ; and , since . Since (because ), if follows that .
Suppose . For every and every , . And since , , and . From it follows that . Hence, , i.e., .
Lemma 2.25
For all : .
The relation is a congruence on .
Proof. For every there exist and such that ; this follows from Lemma 2.8. Then and . Now by the definition of , ( . And ; the latter holds since . This proves that .
It follows immediately that is a congruence on (by Lemma 2.24).
Next we develop criteria about extensions and restrictions of functions in that enable us to decide efficiently whether two tables determine end-equivalent functions. The Remark below applies to finite maximal joinless codes in and is similar to a criterion for end-equivalence of finite maximal prefix codes in . But because of Lemma 2.10(3) it does not apply for . For in general, Prop. 2.26 gives an efficient algorithm for deciding whether two tables determine end-equivalent functions.
Remark (extension-restriction criterion): Let be finite maximal joinless codes in , let : be a table, and let be the corresponding right ideal morphism of . Then: is extendable in iff there exist such that for every :
(1) (or ), and
(2) (or ), and
(3) (or ).
In that case, let
, (or ), and
, (or ).
Then and are finite maximal joinless codes in , and can be extended to a function with table : defined by
, and
for all (or ).
The passage from to is called a one-step extension, and is called a one-step restriction of .
The Remark follows from Lemma 2.10(2), in the same way as for prefix codes in (see [5, Lemma 2.2] and [25]). The Remark is not always applicable when , by Lemma 2.10(3).
Proposition 2.26
(restrictions, and deciding ).**
(1)* Let be a table, and let be the right-ideal morphism given by this table. Suppose , where is a finite maximal joinless code. Then the restriction of to is an element of with table .*
Moreover, .
(2)* Let be tables (for ), and let be the right-ideal morphisms given by these tables. Then:*
* iff ,*
where is the table of the restriction of to the finite maximal joinless code .
Hence there is a polynomial-time algorithm that decides whether the tables and represent -equivalent elements of .
Proof. (1) The restricted table is defined as follows. For every there exists and such that . We define .
In order to verify that is a well defined function, suppose that for some and . Since is joinless, it follows that ; let . Since multiplication in is cancelative, implies . So, determines a unique and such that . Hence defines in a unique way.
(2) The largest common restriction of and is , which has domain , and domain code (by Lemma 2.18). So, iff the tables of and , restricted to , are the same.
We have . And for all and , . (Recall the notation .) Hence one can check in polynomial time whether .
Lemma 2.27
(non-uniqueness of maximal extensions).* There exists a right ideal morphism such that has two maximal extensions in .*
As a consequence, with cannot be defined by maximum extended morphisms (unlike ).
Proof. Let be defined by , and the table
[TABLE]
.
The geometric representation of is given in Fig. 2 (with mapping-by-number as in [14]):
\begin{picture}(110.0,60.0)\par\put(5.0,10.0){\framebox(40.0,20.0)[]{}} \put(5.0,10.0){\framebox(20.0,40.0)[]{}} \put(25.0,30.0){\framebox(20.0,10.0)[]{}} \put(25.0,40.0){\framebox(20.0,10.0)[]{}} \put(15.0,20.0){\makebox(0.0,0.0)[cc]{\sf 1}} \put(35.0,20.0){\makebox(0.0,0.0)[cc]{\sf 2}} \put(15.0,40.0){\makebox(0.0,0.0)[cc]{\sf 3}} \put(35.0,35.0){\makebox(0.0,0.0)[cc]{\sf 4}} \put(35.0,45.0){\makebox(0.0,0.0)[cc]{\sf 5}} \par\put(60.0,27.0){\vector(1,0){20.0}} \put(70.0,30.0){\makebox(0.0,0.0)[cc]{}} \par\put(95.0,10.0){\framebox(40.0,20.0)[]{}} \put(95.0,10.0){\framebox(20.0,40.0)[]{}} \put(115.0,30.0){\framebox(20.0,10.0)[]{}} \put(115.0,40.0){\framebox(20.0,10.0)[]{}} \put(105.0,20.0){\makebox(0.0,0.0)[cc]{\sf 1}} \put(125.0,20.0){\makebox(0.0,0.0)[cc]{\sf 2}} \put(105.0,40.0){\makebox(0.0,0.0)[cc]{\sf 3}} \put(125.0,35.0){\makebox(0.0,0.0)[cc]{\sf 5}} \put(125.0,45.0){\makebox(0.0,0.0)[cc]{\sf 4}} \par\put(-2.0,0.0){\makebox(0.0,0.0)[cc]{\sf Fig. 2}} \end{picture}
12345
12354
Fig. 2
In Fig. 2, the squares labeled “1” and “2” could be merged into one binary rectangle; alternatively, the squares labeled “1” and “3” could be merged into one binary rectangle. After either step, no further extension is possible. Thus, has the following two maximal extensions and :
(1) , and
.
(2) , and
.
We now give the definition of and based on strings.
Definition 2.28
(Brin-Thompson groups and ).* Let and . The Brin-Thompson group is . Equivalently, is the group determined by the action of on . When we obtain .*
Every element of can be represented (in infinitely many ways) by a table of the form : , which is a bijection between two finite maximal joinless codes.
Lemma 2.29
(composition in based on tables).* Let be a table representing , which in turn determines (for ). Then the composite , and hence also , is represented by the table*
, where
,
.
Proof. It is a general fact about partial functions , that , and . Obviously, .
For , given by tables, (by Lemma 2.18). And and are maximal joinless codes (by Lemmas 2.20 and 2.21). Moreover, , and . Hence, is given by the table described in the Lemma.
3 The word problem of is in coNP
For a fixed group with a fixed finite generating set the word problem is the following decision problem.
Input: A string .
Question: Does represent the identity element of ?
We mentioned in the Introduction that is finitely generated for all . The groups , for , are presumably finitely generated too, but this has not been proved, so we will only address the word problem of here.
Notation. We mostly use the alphabet , usually with .
For any integer , let for ; for a string , denotes the length of .
Definition of coNP and NP: We use the following logic-based definitions of coNP and NP (see e.g., [23]). Let be a finite alphabet. A set is in coNP iff there exists , a two-variable predicate , and a polynomial , such that
(1) (i.e., the membership problem of is in P);
(2) S\,=\,\{w\in\Gamma^{*}\,:\ \ .
Similarly, is in NP iff for some , some in P, and some polynomial ,
S\,=\,\{w\in\Gamma^{*}\,:\ \ .
Definition 3.1
(max length).**
For every : .
For every finite set : .
*For every : . *
Proposition 3.2
(length formula).* For all : .*
Proof. Let be a table for (). Recall the table for , given in Lemma 2.29. We have:
(L1) .
Indeed, for every , , and we have: (by Lemma 2.5).
We have:
(L2) .
Indeed, , for every , , and . By Prop. 2.18(3), . Since is an initial factor of there exists such that . Since , the following holds: when , and is a suffix of when . Hence, . Now, , where (since ). And . Hence .
We also have:
(L3) .
Indeed, is given by the table : . Consider any for , . Since is an initial factor of there exists such that . Since , the following holds: when , and is a suffix of when . Hence, . Now, , where (since ). And . Hence .
Finally, since , , we obtain:
(L4) \,\ell(f_{2}\circ f_{1})\,\leq\,\max\{\ell(Q_{2})+\ell(Q_{1}),\ .
Corollary 3.3
Let , and let be such that for . Then .
Lemma 3.4
For any and we have: .
Hence, determines . In particular, iff in .
Proof. By Coroll. 2.14(0), is a maximal joinless code, hence every element has a join with some element . Since , is actually an initial factor of . Hence, (). This proves that .
For any finite maximal joinless code , the restriction : is a table for , hence it determines . Since is a finite maximal joinless code contained in , the result follows.
Lemma 3.5
The word problem of over any finite generating set belongs to coNP.
Proof. Let be any finite generating set of . To simplify the notation we assume that is closed under inverse, i.e., . Every is represented by a table : . For any , let be the function obtained by composing the generators in (given by tables). Let : be the table of . By Prop. 3.2 and Coroll. 3.3: , where , and denotes the length of as a word over . So is a known constant, determined by the finite generating set .
For the word problem we have: in iff (the identity function on ) iff in . Since (in the table : ), we have: iff and . By Prop. 3.2, . We can further restrict to ; then by Lemma 3.4, we obtain the following coNP-formula for the word problem:
in iff .
We still need to show that the predicate , defined by
\ \Rightarrow\ ,
belongs to P. I.e., we want a deterministic polynomial-time algorithm that on input and , checks whether . To do this we apply, to , the tables of the generators that appear in . We compute \ \ \ldots\ \ . Since , every is defined. Moreover, for some and . By Prop. 3.2, . After computing we check whether .
The application of the table of to takes time proportional to (for ). So, the time complexity of verifying whether and satisfy the predicate is (up to a constant multiple) \ \leq\ . Hence the time-complexity of the predicate is quadratic in .
4 coNP-completeness of the word problem of
In this section we prove that the word problem of with , over any finite generating set, is coNP-hard with respect to polynomial-time many-one reduction. The result for all , follows quickly from the result for . We proved already in Lemma 3.5 that the word problem of belongs to coNP.
For , coNP-hardness of the word problem follows fairly directly from the coNP-hardness of the word problem of over the infinite generating set , by making use of the shift (subsection 4.6). Here, is any finite generating set of and is the set of position transpositions .
At the end of subsection 4.1 we show that the word problem of over belongs to coNP. The main difficulty is to prove that the word problem of over is coNP-hard; this is proved in subsections 4.2 - 4.4, by constructing a binary conjunctive polynomial-time reduction of the circuit equivalence problem to this word problem. An alternative proof, that gives a polynomial-time many-one reduction, appears in subsection 4.5.
4.1 Preliminaries on the word problem and complexity
We give some definitions and facts about complexity and the word problem of a group, especially when an infinite generating set is used. Here we use finite and infinite alphabets (but we always point out when an alphabet is infinite).
Definition 4.1
Let be two finite alphabets, and let be a positive integer. A polynomial-time conjunctive reduction of arity from to is a polynomial-time computable total function such that for all :
* iff ().*
Equivalently, . In other words, reduces the problem to instances of the problem , and the answers are combined by “and”.
A polynomial-time conjunctive reduction of arity 1 is called a many-one reduction.
The reductions in Def. 4.1 are a very special case of polynomial-time truth-table reductions; see e.g. [20, Def. 7.18]. It is straightforward to show that each of P, NP, and coNP, is closed under downward polynomial-time conjunctive reduction of bounded arity.
In this paper we use the following definition of coNP-hardness and coNP-completeness.
Definition 4.2
Let be a finite alphabet. A problem is coNP-hard iff for every finite alphabet and every problem there exists a polynomial-time conjunctive reduction of bounded arity that reduces to .
Moreover, is coNP-complete iff is coNP-hard and belongs to coNP.
There are many well-known coNP-complete problems, e.g., the tautology problem for boolean formulas, the integer linear programming equivalence problem, the 4-coloring problem, the connectivity lower-bound problem (see e.g. [22], [6, Introduction]). We will use the equivalence problem for acyclic boolean circuits (defined in Section 4.2).
Definition 4.3
Let be a group, and let () be a (possibly infinite) generating set for . For words we say that “ in ” iff the generator sequences and have the same value when their elements are multiplied in . In a similar way, for and , we say “ in ” iff is the value obtained when the elements of are multiplied in . We also use the notation or for this.
To simplify the notation, we will from now on take group generating sets that are closed under inverse, i.e., .
Lemma 4.4
(folklore).* Let be a finitely generated subgroup of a finitely generated group , and let be a finite generating set of for .*
(1)* If the word problem of over is decidable in deterministic (or nondeterministic, or co-nondeterministic) time , then the word problem of over is decidable in deterministic (respectively in nondeterministic, or co-nondeterministic) time .*
(2)* If a problem (where is finite) is reducible to the word problem of over by a polynomial-time conjunctive reduction of arity , then is also reducible to the word problem of over by a polynomial-time conjunctive reduction of arity .*
Proof. To simplify the notation, let us assume that and are closed under inverse.
(1) Since , for every generator there exists a word such that . Then the total function
\ \longmapsto\
is a one-to-one linear-time reduction of the word problem of over to the word problem of over ; here, “” denotes concatenation. The length of is , where . Hence, if the word problem of over has time-complexity , then the word problem of over has time-complexity for inputs of length .
(2) For every let be such that in , and let be the free monoid homomorphism from into determined by for all . Let be a polynomial-time conjunctive reduction of arity , such that for all : iff . Then
is a polynomial-time conjunctive reduction of arity , from to the he word problem of over .
For the word problem of groups, infinite generating sets cannot always be avoided, because some groups are not finitely generated, and because some finitely generated groups have interesting infinite generating sets. In order to apply the concepts of decidability or computational complexity to groups with infinite generating sets, we encode countable generating sets over a finite alphabet. We will use the following.
Definition 4.5
(encoding).**
An encoding of a countable set is an injective total function such that is a prefix code that is accepted by a finite-state automaton.
For a word of generators we define by the concatenation {\sf code}(w)\ . Hence, , which is a finite-state language.
Since the function code is injective it has an inverse function, , whose domain is . Every countable set admits an encoding of the above type (e.g., with image set ).
The word problem for a group with an infinite generating set and encoding is specified as follows.
Input: .
Precondition: . (Since is finite-state, the precondition is easy to check.)
Question: in ? (Here, 1 denotes the identity element of .)
Equivalently, the word problem is the membership problem of the language
.
From now on, by complexity of the word problem of over we mean the complexity of ; note that the problem depends on , , and code.
Lemma 4.6
Let be a subgroup of a countable group , let ) be a countable generating set of , and let be an encoding of , for . We also assume that there is a total function with the following properties (that connect the encodings and ):
* For all : in ;*
* the function \ \longmapsto\ is computable in*
linear time.
The function is extended to a free-monoid homomorphism that will also be called ; so for every we have: in .
Then the following hold:
(1)* If is in (or in , or in ), then is in (respectively in , or ).*
(2)* If (where is finite) is reducible to by a polynomial-time conjunctive reduction of arity , then is also reducible to by a polynomial-time conjunctive reduction of arity . Hence, if is hard for a complexity class (e.g., coNP), then is also hard for that complexity class.*
The functions and in the Lemma have the commuting diagram
;
equivalently, , and . Note that in general cannot be viewed as a computable function (as opposed to ), since its domain and image are arbitrary countable sets.
Proof. (1) For any we have: in over iff in over . Therefore, iff .
Thus we have the following algorithm for the membership problem of on input : First, check whether ; this can be checked in linear time, since is finite-state. Second, compute (in linear time). Finally, check whether is in , in time .
(2) Let be a polynomial-time conjunctive reduction of arity from to . Hence, iff . By the definition of and , the latter holds iff . Thus the function , where , is a polynomial-time conjunctive reduction of arity from to .
Some conventions and a fact about the Thompson group over :
We pick a finite generating set for , and for notational convenience we will assume that . We also use the set of bit position transpositions . We assume .
Definition 4.7
(size of a generator).* For any generator we define the size as follows: For we let , and for we let . For a string of generators with for , the size of is defined by .*
For the word as above, the length of is .
Lemma 4.8
The word problem of over belongs to coNP.
Proof. We have (where was defined in Def. 3.1 based on tables of elements of ). And there is a positive integer constant such that for all : . Hence, for any we have (by Prop. 3.2): .
Now the proof of Lemma 3.5 can be applied. For any , let be the right ideal morphism of generated by . Then for every we have:
in iff .
The predicate , defined by , is in P. This uses the same proof as Lemma 3.5, and the fact that is encoded over in such a way that , , and the length of the encoding, are linearly related. Hence the above -formula is a coNP-formula for the word problem of over .
Outline of the proof of coNP-hardness of the word problem of over :
In subsections 4.2 - 4.4 we follow (a part of) the strategy of [6], where another finitely presented group with coNP-complete word problem was constructed.
-
Every acyclic boolean circuit is “simulated” by a element of , represented by a word over , such that the size of is polynomially bounded by the size of (subsection 4.2, Def. 4.10 and Theorem 4.12).
-
The equivalence problem for acyclic boolean circuits is reduced (by a polynomial-time one-one reduction) to the generalized word problem of the subgroup in (subsection 4.3, Coroll. 4.16).
-
Thanks to the “commutation test”, the generalized word problem of in is reduced to two instances of the word problem of over (subsection 4.4, Lemma 4.19). This reduction is a 2-ary conjunctive linear-time reduction (“2” comes from the fact that is 2-generated).
4.2 Circuits and the Thompson group
Our first step in the proof of coNP-hardness is to represent acyclic boolean circuits by words over the generating set of .
An acyclic boolean circuit is specified by a directed acyclic graph (dag) without isolated vertices, together with a vertex labeling. This labeling associates (1) an input variable with each source vertex, (2) an output variable with each sink vertex, and (3) a gate (of type not, fork, and, or or) with each interior vertex. By definition, a source vertex is a vertex of in-degree 0; a sink vertex of out-degree 0; an interior vertex is a vertex whose in-degree and out-degree are both non-zero. A source vertex is also called input port, and a sink vertex is also called output port.
A gate is, by definition, a total function (for some ). We consider the following four types of gates, where , , and .
not; here, and ;
and ; here, and ;
or ; here, and ;
fork; here, , and .
The operation forkj makes an extra copy of . In traditional circuit theory, forks are not used separately; instead, not, and, and or are allowed to produce several copies of the output bit. However, using fork as a separate gate simplifies the conversion of a circuit into a sequence of functions. We also use the wire-crossing operation, which swaps the “wires” and (where ); this is the function
,
where, , , , , and . This operation is not a gate; it is not associated with a vertex, but follows from the incidence relation of the graph.
Note that all the gates notj are different functions for different values of ; the same applies to all andj,j+1, and all orj,j+1. However in the presence of the operations it is sufficient to use just one set of gates {not, and, or, fork}, applied to bit positions 1, or 1 and 2. E.g., noti not1 . Thus, here we view acyclic circuits as expressions over the generating set {not, and, or, fork} . Note that , with .
An acyclic circuit with sequence of input variables (with values ranging over ), and sequence of output variables (with values in ), determines an input-output function ; this is a total function. Any total function of the form is called a boolean function. In circuit theory it is proved that for every boolean function there exists an acyclic circuit whose input-output function is ; see e.g. [23, 46, 40, 21].
Two circuits and are called equivalent iff .
The equivalence problem for acyclic boolean circuits (in short, the circuit equivalence problem) is specified as follows:
Input: , (two circuits, described by dags with gate labels on the vertices);
Question: ?
In order to consider the complexity of problems about circuits we need to define the size of an acyclic boolean circuit , denoted by , and simply called circuit size; it is defined as follows: If has gates of type not or fork, gates of type and or or, and output variables, then the size of is defined to be . Equivalently, is the number of edges (or wires) between gates, or from an input to a gate, or from a gate to an output (for that reason, gates with two input variables are counted twice).
Remarks concerning circuit definitions: Acyclic circuits and their sizes are defined in a variety of ways in the literature [23, 37, 40, 46, 22, 26, 20, 27]; however, all these definitions lead to sizes that are polynomially equivalent (i.e., each one is polynomially bounded in terms of every other one). In the theory of NP- or coNP-completeness, polynomial differences are not significant.
(1) In the literature, the circuit size is usually defined as the number of vertices. Since we do not use isolated vertices in a circuit, we have (where and denote the number of vertices and edges). So and are polynomially equivalent.
(2) In the literature the input and output variables are usually not called vertices, but in that case they are nevertheless counted among the vertices in the definition of circuit size.
(3) When a circuit is described by a bitstring , the length satisfies , for some constant . Typically, such a description of lists all the edges, where each edge is given as a pair of strings (the names of two vertices, each vertex name having length ). An additional list is given that associates a gate or an input variable or an output variable with each vertex. An input variable is described by a code word (for ) and the binary representation of ; the output variables are described similarly. In any case, and are polynomially equivalent.
(4) In the literature, the fork-gate is usually not used explicitly; instead, the and-, or-, and not- gates, as well as the input variables, are allowed to have a fan-out. However, even in that case, every wire goes to a gate or an output variable, so the total of all be fan-outs is . A gate with fan-out can be replaced by a gate with fan-out 1 and fork-gates. This leads to a circuit with gates that have fan-out 1, except for fork-gates with fan-out 2; the size increase is polynomially bounded.
(5) In the literature, and and or-gates are allowed to have a fan-in . But every fan-in wire comes from a gate of an input variable, so the total of all be fan-ins is . An or-gate with fan-in can be replaced by or-gates with fan-in 2 (and similarly for and). This leads to a circuit with gates that have fan-in ; the size increase is polynomially bounded.
Remarks on complexity: The circuit equivalence problem is a well-known problem that is coNP-complete. It is fairly straightforward to prove that the problem is in coNP. Moreover, the tautology problem for boolean formulas (which is a classical coNP-complete problem) is a special case of the circuit equivalence problem (and is reduced to the circuit equivalence problem by converting a boolean formula into a circuit and asking whether a given circuit is equivalent to a circuit for the constant-1 function). See [27, Introduction] for comments on the circuit equivalence problem, see [37] for a circuit-based proof of NP-completeness of the satisfiability problem for boolean formulas, and see [23, 22] for general information.
The following well-known fact implies that every can be expressed as a composition of elements of ; the expression has linear length in terms of .
Lemma 4.9
As elements of the transpositions satisfy
\tau_{i,j}\ =\* , if .*
The word length of over is therefore .
We want to represent the circuit gates not, or, and, and fork, by elements of . For this, the main problem is that the input-output function of a circuit is not necessarily a permutation. Therefore we introduce the following notion of “simulation” of a circuit by a Thompson group element and by a word over (Def. 4.10 and Theorem 4.12 below). See the discussion in [6] for additional motivation of our definition of simulation.
Definition 4.10
(simulation).* Let be a total function. An element simulates iff for all : .*
When is represented by a word we say that simulates .
According to this definition, is faithfully described by the action of on ; but there are no constraints on the values of for input strings in . Since is an element of it is a bijection between finite maximal prefix codes, whereas need not be injective nor surjective. So there has to be a big difference between and somewhere. In subsections 4.3 and 4.4 we show that, nevertheless, the equivalence problem of circuits can be reduced to the word problem of over . In the rest of this subsection we construct .
The next Lemma follows immediately from the definition of simulation.
Lemma 4.11
Let and be any boolean functions with the same number of input variables and the same number of output variables. If and are simulated by , respectively , then we have
* iff . *
We choose the following elements of to describe the gates not, or, and, and fork:
\varphi_{\neg}\ =\ \left[\begin{array}[]{ll}0&1\\ 1&0\end{array}\right],
\varphi_{\vee}\ =\ \left[\begin{array}[]{ll}0x_{1}x_{2}&1x_{1}x_{2}\\ (x_{1}\vee x_{2})\,x_{1}x_{2}&({\overline{x_{1}\vee x_{2}}})\ x_{1}x_{2}\end{array}\right],\hskip 21.68121pt\varphi_{\wedge}\ =\ \left[\begin{array}[]{ll}0x_{1}x_{2}&1x_{1}x_{2}\\ (x_{1}\wedge x_{2})\,x_{1}x_{2}&({\overline{x_{1}\wedge x_{2}}})\ x_{1}x_{2}\end{array}\right],
where and range over . Hence, , and . In order to represent the fork function we first define
\varphi_{\rm 0f}\ =\ \left[\begin{array}[]{ccc}0&\ 10&\ 11\\ 00&\ 01&\ 1\end{array}\right];
so, , and . Then fork is simulated by
.
Indeed, for all : .
For every acyclic boolean circuit we want to find a word that simulates ; and we want the map to be polynomial-time computable (in terms of ).
A standard property of dags is that every vertex has a level (or “layer”) corresponding to its “depth” in the dag. The source vertices have level 0. A gate or an output variable has level 1 iff only input variables of the circuit feed into it. A gate or an output variable has level iff it receives input from levels only, and at least one of its inputs comes from level . Equivalently, the level of a vertex is the length of a longest path from a source to . The maximum level of any sink vertex is called the depth of the dag.
The following theorem is a simplification of [6, Thm. 3.5]. For a word we use the size, denoted by , as defined in Def. 4.7.
Theorem 4.12
(existence of simulation).* There is an injective function from the set of acyclic boolean circuits to the set of words over with the following properties:*
(1)* simulates the input-output function of ;*
(2)* the size of satisfies (for some constant );*
(3)* is computable from in polynomial time, in terms of .*
Proof. Item (1) refers to simulation as in Def. 4.10. In the proof we assume that , , , , , and , belong to . (If this were not the case, we could express them by fixed words over .)
We can assume that our acyclic circuits are strictly layered, i.e., a gate or an output variable at level only receives inputs from level . Hence, all the output variables of the circuit are at the same level , where is the depth of the circuit. If the layering of a circuit is not strict, we can insert identity gates to obtain strictness. An identity gate has one input variable and one output variable, connected by a wire; the two variables carry the same boolean value. We will count these identity gates as gates in the evaluation of circuit size. In order to make a circuit strictly layered, fewer than identity gates need to be introduced. (Indeed, for each gate we add fewer than identity gates above it; so, in total we add fewer than identity gates.)
An acyclic circuit has input variables , output variables , and internal variables which correspond to the boolean values carried by internal wires (between gates or between a gate and an input or an output port). The internal variables at level (for ) are denoted by , , , . When (output level) we have and ; when (input level) we have and .
For every level (with ) there is a circuit , called the slice of at level : The input variables of the slice are , , ; the output variables are , , ; the gates of are the gates of at level ; we use the fact that is strictly layered. In addition to gates, a slice also contains wire-swappings of its inputs, i.e., a bit-position permutation is applied to the input variables. Every permutation of wires can be written as the composite of () transpositions. And each has word length over (by Lemma 4.9), hence it has size . Thus the input-wire permutation of a slice has size . Moreover, every belongs to , so it does not need any simulation.
We use the notation \ \ldots\ (i.e., the concatenation of the variables , for , and ).
Simulation of one slice
In order to construct we first consider the special case where the circuit consists of just one slice, hence has depth 2 (the gates of the slice have depth 1, the output variables have depth 2). Identity gates are allowed. We number the gates of from left to right.
For , let consist of the first gates of a slice; so, is a one-slice circuit that has gates. When , is empty, is the empty string, and its input-output function is the identity function (). Inductively, let be a slice obtained from by adding one gate (and, or, not, identity, or fork) on the right of (with number ). Inductively we assume that satisfies the Theorem and that has been constructed. Let be the input variables and let be the output variables of . We now construct from and the gate being added.
Case 1: Suppose the slice is obtained from by adding, on the right of , an identity gate or a not gate, with new input variable and new output variable . If a not gate is added, the input-output function of is , where . The boolean function is to be simulated by a Thompson group element such that
\Phi_{f_{C}}(0\,x_{1}\ldots x_{m},x_{m+1})\ =\ .
We have , where is the simulation of , which exists by induction. We find as follows:
0\,x_{1}\ldots x_{m}\ x_{m+1}\ \stackrel{{\scriptstyle\Phi_{f_{K}}}}{{\longmapsto}}\ 0\ y_{1}y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ x_{m+1}\
\stackrel{{\scriptstyle\tau_{2,n+m+2}}}{{\longmapsto}}\ \stackrel{{\scriptstyle\varphi_{\rm f}}}{{\longmapsto}}\
\stackrel{{\scriptstyle\tau_{1,2}}}{{\longmapsto}}\
\stackrel{{\scriptstyle\varphi_{\neg}}}{{\longmapsto}}\ \stackrel{{\scriptstyle\tau_{1,2}}}{{\longmapsto}}\ ;
applying then yields
.
So, .
The case where, instead of a not gate, an identity gate is added is similar (except that we simply omit ). By Lemma 4.9, we can express and over . Then the size of is
\ \leq\
, for some constant .
Case 2: Suppose our slice is obtained by adding an and gate or an or gate to on the right, with new output variable and new input variables . We only analyze the or case, the and case being almost the same. The input-output function of is
f_{C}(x_{1},\ldots,x_{m},x_{m+1},x_{m+2})\ =\ ,
where . The function is to be simulated by a Thompson group element such that
\Phi_{f_{C}}(0\,x_{1}\ldots x_{m}\ x_{m+1}x_{m+2})\ =\
Let be such that simulates . Then we construct as follows:
0\,x_{1}\ldots x_{m}\ x_{m+1}x_{m+2}\ \stackrel{{\scriptstyle\Phi_{f_{K}}}}{{\longmapsto}}\
\stackrel{{\scriptstyle\tau_{3,n+m+4}}}{{\longmapsto}}\
(x_{m+1}\vee x_{m+2})\ x_{m+1}x_{m+2}\ y_{2}\ldots y_{n}\
\stackrel{{\scriptstyle\tau_{3,n+m+4}}}{{\longmapsto}}\ (x_{m+1}\vee x_{m+2})\ 0\ y_{1}y_{2}\ldots y_{n}\ ;
applying then yields
.
Thus is simulated by the word
of size
\ \leq\
, for some constant .
Case 3: Suppose our slice is obtained by adding a fork gate on the right of , with a new input variable and two new output variables and . The input-output function of is
,
where . The boolean function is to be simulated by a Thompson group element such that
\Phi_{f}(0\,x_{1}\ldots x_{m}x_{m+1})\ =\ .
Let and be the simulation of , which exists by induction. Then
0\,x_{1}\ldots x_{m}\ x_{m+1}\ \stackrel{{\scriptstyle\Phi_{f_{K}}}}{{\longmapsto}}\ 0\ y_{1}y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ x_{m+1}\
\stackrel{{\scriptstyle\tau_{2,n+m+2}}}{{\longmapsto}}\ 0\ x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}\ \stackrel{{\scriptstyle\varphi_{\rm f}}}{{\longmapsto}}\ 0\,x_{m+1}x_{m+1}x_{m+1}\ y_{2}\ldots y_{n}\ x_{1}\ldots x_{m}\ y_{1}\
\stackrel{{\scriptstyle\tau_{4,n+m+4}}}{{\longmapsto}}\ ;
applying and then yields
.
This simulates by a word
of size
\ +\
, for some constant .
In all three cases the slice is simulated by a word of size
.
Let now be any slice, and let be the number of interior vertices of (i.e., the vertices labeled by gates). Then if is constructed by adding ( gates to slices (starting with being the empty slice, and ending with being the desired slice ), the size of is
,
for some constant (that does not depend on ).
Moreover, as we saw when we introduced the notion of slice, in all three cases a bit-position permutation of the input wires of the slice is attached at the beginning of . This permutation belongs to and has size .
The above construction of each word from is a polynomial-time algorithm (in terms of ).
Simulation of a multi-slice circuit
Assume that is a circuit of depth ; the depth is the number of slices. In order to define we use the fact that we have already defined the word that simulates the slice of (for every , ). Each word has all the properties claimed in Theorem 4.12; in particular, represents the function
\Phi_{C_{\ell}}:\ \ 0\ Y^{\ell-1}\ \longmapsto\ .
Hence, since is a right ideal isomorphism, we also have
\ \ \stackrel{{\scriptstyle\Phi_{C_{\ell}}}}{{\longmapsto}}\ \ .
Therefore, represents the function
\ \ 0\,x_{1}\ldots x_{m}\ \ \longmapsto\ \ ,
where is the output of , and is the input of .
The length of the word () is . Indeed, the total number of variables in the circuit (i.e., ) is equal to the total number of wires (i.e., ); the “” comes from the leading bit [math].
Let \,\sigma_{i,j}=\tau_{j-1,j}\,\tau_{j-2,j-1}\ \ldots\ (for ). Then \ \pi_{1}\ =\ (\sigma_{1,|Z|})^{n}\ transforms the word into
.
Next (and this is a fundamental and crucial idea from reversible computing, see e.g., [3, 2, 21]), to the latter string we apply
in order to clear away intermediate outputs of all the internal slices. This yields
.
Finally, applying the permutation produces the desired final output
.
Therefore we define ) by
w_{C}\ =\ \pi_{2}\ (w_{C_{L-1}}\ \ldots\ w_{C_{1}})^{-1}\ \pi_{1}\ .
The word length of over is less than . Since all subscripts in are , the size of is . Since , the size of is also less than .
For the size of we have
.
We saw that \ \|w_{C_{\ell}}\|\leq c_{0}\,|C_{\ell}|^{3}\ (for ); and implies . Thus , for some positive constant .
Since was at most squared in order to obtain strict layering, the above bound becomes
,
in terms ot the original (not necessarily strictly layered) circuit .
The word can be written down in linear time, based on the words (), and we saw that each can be computed in polynomial time from .
4.3 Reduction to a generalized word problem of
(over an infinite generating set)
We first extend the classical concepts of stabilizer and fixator to the case of partial injections.
Definition 4.13
A function partially stabilizes a set iff . For a subgroup , the partial stabilizer of (in ) is
{\rm pStab}_{G}(S)\ =\* .*
A function partially fixes a set iff for every . This is also called partial pointwise stabilization. For a subgroup , the partial fixator of (in ) is
{\rm pFix}_{G}(S)\ =\ \{g\in G:\* .*
We will only use partial stabilizers and fixators for sets that are right ideals; then and are groups [6, Lemma 4.1]. When is a right ideal, where is a prefix code, we will abbreviate and by , respectively . In particular, we abbreviate to .
Lemma 4.14
We have: {\rm pFix}_{V}(0)\ \subset\ .
Proof. Obviously, . Moreover, if we had for any and , then ; the first equality holds since . But is false since a string does not start with both 0 and 1.
The following is little more than a reformulation of the definition of simulation and Lemma 4.11.
Lemma 4.15
Let and be any boolean functions such that and have the same number of input variables, and and have the same number of output variables. Suppose and are simulated by , respectively (). Then,
* iff .*
Proof. Let be the common domain of and . Then by Lemma 4.11, iff for all : . Then for all : (and ). Hence, iff .
Theorem 4.12 and Lemma 4.15 give a polynomial-time one-one reduction from the circuit equivalence problem to the generalized word problem of in , where the elements of written over . Since the circuit equivalence problem is coNP-complete, it follows that this generalized word problem is coNP-hard. Hence we have:
Corollary 4.16
(coNP-hard generalized word problem).* The generalized word problem of in over is coNP-hard. *
4.4 Reduction to the word problem of
We will give a linear-time -ary conjunctive reduction from the generalized word problem of to the word problem of over the infinite generating set . This reduction is based on a “commutation test”, that was studied in greater generality in [6, Section 5]; here we just use , based on an alphabet of size 2, which makes everything simpler.
We first need a few lemmas. Recall the notation ( and are prefix-comparable) and its negation . For and , we define .
Lemma 4.17
If but , then there exists such that
.
Hence, (), for all .
Proof. Lemma 4.17 is a special case of [6, Lemma 9.6], with a simpler proof. (Note that in [6] the notation for the prefix order was reversed; here, “” always means is a prefix of .)
We prove the contrapositive, i.e., if for all we have , then .
Case 1: .
Then , for some , so . Now, is a maximal finite prefix code (by [6, Lemma 9.4]), which contains the non-empty string . Hence contains at least one other non-empty string (by [6, Lemma 9.5]), i.e., contains (), for some . Hence (since stabilizes ), there exists such that , and and . Since is a prefix code, .
By the (contrapositive) assumption, . Hence there are two possibilities:
(1) : Then . So , which contradicts the fact that is a prefix code.
(2) : Then , for some ; and we saw that also . This implies that . Again, this contradicts that is a prefix code.
Thus, case 1 is impossible.
Case 2: .
Then , for some , hence . Now is a finite maximal prefix code, containing the non-empty string , hence it contains some other non-empty string. So there exists () with .
By the (contrapositive) assumption, . Again, we have two possibilities:
(1) : Then , and (since ). Thus, , which contradicts the fact that is a prefix code.
(2) : Then , for some ; and , for some (since ). Thus, , which implies . Again, this contradicts the fact that is a prefix code.
We conclude that case 2 is impossible.
Now, having ruled out cases 1 and 2, the only remaining possibility is that , for all . This means that .
Lemma 4.18
For every such that , there exists and such that
* and .*
Proof. This Lemma is a simplification of [6, Prop. 9.14(1)], and we adapt that proof.
Let be two prefix-incomparable strings, and let with . Then are prefix-incomparable two-by-two (as is easy to check). We now use [6, Lemma 9.7] to construct a finite maximal prefix code , with . We define by
, and
is the identity on .
So, is the domain code and image code of . Then , , and (since ). So here, plays the role of .
Lemma 4.19
(commutation test).* For all we have:*
* iff \big{(}\forall f\in{\rm pFix}_{V}(1)\big{)}\,[\,fg=gf\,].*
In words: An element belongs to the subgroup iff commutes with all the elements of the subgroup .
Proof. Suppose , for all , and hence also .
(1) We first prove that .
If for some , then for all . And . Hence, , hence by injectiveness, for all . So, and , and . Hence by Lemma 4.18, there exists such that , and (for some ). The latter inequality contradicts the fact that for all .
In a similar way one obtains a contradiction if for some .
(2) We prove next that .
Suppose for all ; we saw that then . If, by contradiction, , then by Lemma 4.17, there exists such that .
Then, . By Lemma 4.18 there exists such that , and (for some ). Then . So, , which contradicts .
Let and . Then , and , where are finite maximal prefix codes. So, is a finite maximal prefix code.
Then for every : , since ; and , since and . So, .
Similarly, for all : , since ; and , since and . So, .
Hence, on the finite maximal prefix code . Hence in .
Lemma 4.20
The subgroups and are isomorphic to .
Proof. Every element of has a table of the form , where and are finite maximal prefix codes over . An isomorphism is given by
[TABLE]
The map is obviously a bijection from onto , and it is easy to check that it is a homomorphism.
coNP**-hardness of the word problem of over :**
The commutation test of Lemma 4.19 reduces the generalized word problem of in (over ) to an infinite set of word problems of , namely .
However, is 2-generated; this follows from Lemma 4.20 and the fact that is 2-generated [45, 32, 12]. Obviously, commutes with all of iff commutes with the two generators of . This reduces the generalized word problem of in (over ) to the conjunction of two instances of the word problem of over . Hence, the word problem of over is coNP-hard with respect to 2-ary conjunctive polynomial-time reduction.
Theorem 4.21
(coNP-complete word problem).* The word problem of over the generating set is coNP-complete.*
Proof. By Lemma 4.8, this word problem belongs to coNP. By the reasoning in the above few lines, the word problem is coNP-hard.
4.5 Alternative proof of coNP-completeness
of the word problem of over
The above proof of coNP-completeness of the word problem of over was derived from a similar proof for [6] (in 2003). Since then, Stephen Jordan [27] (in 2013) proved that the equivalence problem for bijective circuits built from copies of the Fredkin gate is coNP-complete. A bijective circuit is an acyclic circuit in which every gate has a permutation of as its input-output function (for some , depending on the gate). The input-output function of such a circuit is a permutation of for some (see e.g. [43]). The Fredkin gate, on an input , is defined by
,
;
see e.g. [21]. This gate is also called the “controlled transposition” (of and ). Clearly, is the table of an element of ; moreover, with we can compute for any three different variables in an input with . Hence Jordan’s result can be recast as follows:
Theorem 4.22
(Thompson group form of Jordan’s theorem).* The subgroup of generated by has a coNP-complete word problem, with respect to many-one polynomial-time reduction. *
See [7] (and [43]) for further connections between bijective (“reversible”) circuits and the Thompson groups.
Theorem 4.22 immediately implies Theorem 4.21, as the word problem of the subgroup reduces to the word problem of over by the inclusion map. (Here we assume that ; if that is not the case we can represent by a fixed word over for the reduction; see Lemma 4.6(2).)
An advantage of our method of subsections 4.2 - 4.4 is that it is direct, whereas Jordan’s theorem is based on Barrington’s theorem [1], which is itself a deep result. However, using Jordan’s theorem has the advantage that it yields the following: The word problem of over is coNP-complete with respect to polynomial-time many-one reduction. The earlier proof only yields polynomial-time binary conjunctive reduction.
4.6 The shift, and the word problem of
For all with , we define : by
.
So, .
The shift is defined by , , and
,
for all . Hence, , for all , and .
Lemma 4.23
For all : \tau_{j,j+1}\times{\bf 1}(.)\ =\ .
Proof. For any , where with , and , we have:
(u\,x_{j}x_{j+1}v,\,y)\ \stackrel{{\scriptstyle\sigma^{-j+1}}}{{\longmapsto}}\ (x_{j}x_{j+1}\,v,\ u^{\rm rev}\,y)\ \stackrel{{\scriptstyle\tau_{1,2}\times{\bf 1}}}{{\longmapsto}}\ (x_{j+1}x_{j}\,v,\ u^{\rm rev}\,y)\ \stackrel{{\scriptstyle\sigma^{j-1}}}{{\longmapsto}}\ .
Here, denotes the reverse of .
Proof of Theorem 1.1:
By Lemma 3.5 the word problem of belongs to coNP.
By Theorem 4.21 the word problem of over is coNP-hard. By Lemma 4.23, the word problem of over , reduces to the word problem of over a finite generating set; this reduction is the one-one reduction that replaces every generator by , and replaces by , as in Lemma 4.23. We can include the set into the finite generating set of , or we can express all the elements of this set by a finite set of strings over some other finite generating set of . Thus the word problem of over a finite generating set is coNP-hard.
To show that word problem of (for ) over a finite generating set is coNP-hard, we use the fact that is a finitely generated subgroup of , and apply Lemma 4.4(2).
Remark on the distortion of in : Burillo and Cleary [18] show that is exponentially distorted in (when both and are over finite generating sets). In Lemma 4.23 we proved that has linear word length in (as a function of ); but has exponential word length in over any finite generating set. This, again, shows that the distortion of in is at least exponential.
Indeed, for all , has a table (see the beginning of subsection 4.4). It follows from Lemma 2.2 that the table of is maximally extended. So, has table-size . Therefore, by [5, Thm. 3.8], the word length of in (over any finite generating set) satisfies (for some constants ). On the other hand, the embedding of into , used in Lemma 4.23, represents by a word of length .
“Why” is the word problem of coNP-complete? The table-size of an element can be exponentially larger than the word length of (over a finite generating set); hence, the polynomial-time algorithm for the word problem of (consisting of simply composing the tables of the generators) turns into an exponential-time algorithm in . In we have the table-size formula ; in there is no such formula. However, the length-formula of Lemma 3.2 implies rather directly that the word problem of belongs to coNP.
The coNP-hardness is less intuitive. The proof that (over ) can simulate circuits is intuitive (if tedious). The commutation test, reducing a generalized word problem to a word problem, is less intuitive, and it is a priori not related to computing. The alternative proof of coNP-hardness of the word problem of over is derived from Jordan’s theorem, which is itself based on Barrington’s theorem; the latter has always been considered a surprising result.
Using the shift to represent the infinite set by a finite set is easy. But the shift is not a circuit element (although it has a computational meaning, namely, as an operation in multi-stack machines).
Acknowledgement: I would like to thank the referee for a thorough reading of the paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] D.A. Barrington, “Bounded-width polynomial-size branching programs recognize exactly those languages in NC 1 superscript NC 1 {\rm NC}^{1} ”, J. of Computer and System Sciences 38.1 (1989) 150-164.
- 2[2] C. Bennett, “Logical reversibility of computation”, IBM J. Research and Development 17 (1973) 525-532.
- 3[3] C. Bennett, “Time/Space tradeoffs for reversible computation”, SIAM J. of Computing 18 (1989) 766-776.
- 4[4] J.C. Birget, A. Ol’shanskii, E. Rips, M.V. Sapir, “Isoperimetric functions of groups and computational complexity of the word problem”, Annals of Mathematics 156.2 (Sept. 2002) 467-518.
- 5[5] J.C. Birget, “The groups of Richard Thompson and complexity”, International J. of Algebra and Computation 14(5,6) (Dec. 2004) 569-626. Preprint: https://arxiv.org/abs/math/0204292
- 6[6] J.C. Birget, “Circuits, co NP-completeness, and the groups of Richard Thompson”, International J. of Algebra and Computation 16(1) (Feb. 2006) 35-90. Preprint: https://arxiv.org/abs/math/0310335
- 7[7] J.C. Birget, “Factorizations of the Thompson-Higman groups, and circuit complexity”, International J. of Algebra and Computation 18.2 (2008) 285-320.
- 8[8] J.C. Birget, “Monoid generalizations of the Richard Thompson groups”, J. of Pure and Applied Algebra 213(2) (2009) 264-278.
