Birecurrent sets
Francesco Dolce, Dominique Perrin, Antonio Restivo and, Christophe Reutenauer, Giuseppina Rindone

TL;DR
This paper investigates birecurrent sets, which are recurrent sets with strongly connected minimal automata and their reversals, providing characterizations and properties related to their reducibility and structure.
Contribution
It characterizes completely reducible sets as linear combinations of birecurrent sets, advancing understanding of their algebraic and automata-theoretic properties.
Findings
Birecurrent sets are completely reducible.
Main result: completely reducible sets are linear combinations of birecurrent sets.
Provides new characterizations of birecurrent sets.
Abstract
A set is called recurrent if its minimal automaton is strongly connected and birecurrent if it is recurrent as well as its reversal. We prove a series of results concerning birecurrent sets. It is already known that any birecurrent set is completely reducible (that is, such that the minimal representation of its characteristic series is completely reducible). The main result of this paper characterizes completely reducible sets as linear combinations of birecurrent sets
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · DNA and Biological Computing · Mathematical Dynamics and Fractals
Birecurrent sets
Francesco Dolce1, Dominique Perrin2,
Antonio Restivo3, Christophe Reutenauer1, Giuseppina Rindone2
1 Université du Québec à Montréal, LaCIM, 2 Université Paris Est, LIGM,
3 Università di Palermo
Abstract
A set is called recurrent if its minimal automaton is strongly connected and birecurrent if it is recurrent as well as its reversal. We prove a series of results concerning birecurrent sets. It is already known that any birecurrent set is completely reducible (that is, such that the minimal representation of its characteristic series is completely reducible). The main result of this paper characterizes completely reducible sets as linear combinations of birecurrent sets
Contents
1 Introduction
A recurrent set of words is such that its minimal automaton is strongly connected. A birecurrent set is a recurrent set such that its reversal is also recurrent. The recurrent (resp. birecurrent) sets contains the submonoids generated by prefix (resp. bifix) codes. The automata recognizing birecurrent are a generalization of well known families of automata such as group automata. There are more general sets and the general form of birecurrent sets does seem easy to describe.
Our interest in birecurrent sets is motivated by their relation with completely reducible sets put in evidence in [7]. Indeed, it is shown in [7] that the syntactic representation of the characteristic series of a birecurrent set is completely reducible. This generalizes the result of Reutenauer [10] which proves that the complete reducibility holds for the submonoid generated by a bifix code.
Our main result characterizes completely reducible sets as linear combinations of birecurrent sets (Theorem 4.3.4). We also prove a number of other results concerning birecurrent sets, and in particular birecurrent sets of finite type, which are a generalization of the submonoids generated by finite bifix codes.
The paper is organized as follows.
We first give in Section 2 a number of definitions concerning words, automata and formal series.
We introduce recurrent and birecurrent sets in Section 3. In Section 3.2 we prove a result which characterizes the minimal automata of birecurrent sets (Theorem 3.2.1). This result extends a property proved in [7] and allows to construct directly birecurrent automata by choosing an appropriate set of terminal states of a strongly connected deterministic automaton.
In Section 3.3, we define birecurrent sets of finite type. A recurrent set is of the form where is a prefix code and a set of proper prefixes of . Thus a birecurrent set has the form , as above, as well as where is a suffix code and a set of proper suffixes of . We say that is of finite type if and are finite. Thus a birecurrent set of finite type is a generalization of a submonoid generated by a finite bifix code (which corresponds to the case where and are reduced to the empty word). We prove a result allowing one to build birecurrent sets of finite type (Theorem 3.4.2).
In Section 3.6, we define the degree and the index of a dense birecurrent set. We prove that the density of a dense birecurrent set with respect to a positive Bernoulli distribution is the inverse of its index (Theorem 3.6.2).
In Section 3.7, we prove that if a recognizable maximal prefix code is indecomposable, then either it is synchronized, or it is the left root of a dense birecurrent set (Theorem 3.7.1). We relate this result with an old conjecture of Schützenberger.
In Section 4, we come to the connection with complete reducibility.
We start in Section 4.1 with an introduction to the linear representations of formal series. In Section 4.2, we prove a statement (Theorem 4.2.1) which characterizes completely reducible sets by a property of their syntactic monoid. We derive from this result several corollaries and, in particular, the main result of [7] asserting that a birecurrent set is completely reducible.
We then present in Section 4.3 our main result which characterizes completely reducible sets as linear combinations of birecurrent sets (Theorem 4.3.4).
In Section 5, we consider unambiguous automata. We characterize the unambiguous automata recognizing recurrent sets (Theorem 5.2.1). We derive as a corollary a characterization of unambiguous automata recognizing birecurrent sets (Corollary 5.2.2). We also give examples of such automata with an iteration of the construction of Section 3.4, using an argument originally developped in [15].
Acknowledgements
The authors wish to thank for their help several colleagues, including Clelia De Felice, Andrew Ryzhikov and the anonymous referee.
2 Preliminaries
We recall briefly some terminology about words, automata and, in some more detail, formal series. We refer to [2] or [11] for undefined terms.
2.1 Words
Let be a finite alphabet and let be the free monoid over . The elements of are called words and the subsets of formal languages. We denote by the empty word.
The reversal of a word is the word . By extension, the reversal of is the set .
For and , we denote . Symmetrically, we denote .
A set is said to be dense in if for any word , there are words such that . A set which is not dense is said to be thin.
A set is said to be right dense in if for any , there is such that .
2.2 Automata
We denote by an automaton with as set of states, as set of initial states and as set of terminal states, given by a set of edges which are triples . The automaton is said to be finite if is finite.
The set recognized by the automaton is the set of labels of paths from to . A set is recognizable if it can be recognized by a finite automaton.
The automaton is deterministic if and for each and there is at most one edge . For and , we denote by the unique state such that there is an edge from to labeled . Thus, the set recognized by is the set of words such that . We denote and if .
The minimal automaton of a set is the deterministic automaton having for states the nonempty sets for , with as initial state and the family as terminal states.
An automaton is trim if for any there are words such that there is a path from to labeled and a path from to labeled .
A deterministic automaton on the alphabet is complete if for any and any one has . A set is right dense if and only if its minimal automaton is complete.
For a deterministic automaton and a word , we denote by the map from into . The monoid is the transition monoid of the automaton . For , denote if the image of by is , or equivalently if and . Similarly, for , we denote .
The rank of a word is the rank of the map , that is the cardinality of the set . When is a strongly connected finite automaton, the image by of the set of words of minimal nonzero rank is, together with [math], the unique [math]-minimal ideal of the monoid . It is the union of all [math]-minimal right (resp. left) ideals. It is formed of [math] and a regular -class (see [2, Corollary 1.12.10]). Each -class of which is a group is a transitive permutation group on the common image of its elements (see [2, Theorem 9.3.10]).
Let be a deterministic automaton. The domain of a word , denoted , is the set , that is, the set of such that . The kernel of a word is the partial equivalence on defined by if .
The degree of a strongly connected deterministic automaton , denoted , is the minimal nonzero rank of all words. The automaton is synchronized if its degree is .
Example 1
Let be the automaton represented in Figure 2.1.
The word is synchronizing since it has rank . The [math]-minimal ideal of the monoid is represented in Figure 2.2.
We represent the -class by an array in which the rows are the -classes and the columns are the -classes. An -class containing an idempotent is indicated by a .
2.3 Prefix and bifix codes
A set is a prefix code if no word in is a proper prefix of another word in . The set is a bifix code if and are prefix codes.
Let be a prefix code. The literal automaton of is the deterministic automaton where is the set of proper prefixes of and for and , one has
[TABLE]
It is a strongly connected deterministic automaton recognizing .
The degree of a prefix code is the degree of the minimal automaton of . The prefix code is said to be synchronized if .
Example 2
The automaton of Example 1 is both the minimal and the literal automaton of where is the synchronized prefix code .
2.4 Formal series
Let be a field. A formal series on the alphabet with coefficients in is a map . We denote by the algebra of these series. For , we denote by the value of on . We denote by the corresponding ring of polynomials, which are the series with a finite number of nonzero coefficients.
We denote by the characteristic series of a set .
Let be an integer. Let be a row -vector, let be a morphism from into the monoid of -matrices and let be a column -vector, all with coefficients in . The triple is called a linear representation. It is said to recognize a series if for every , one has
[TABLE]
Equivalently, one can define a linear representation as a triple formed, for some vector space , of a vector , a morphism and a linear form on such that Equation (2.1) holds for every . The vector is called the initial vector. The endomorphism and the linear form act on the right of their argument and thus the expression in Equation (2.1) is read parenthesized from left to right as . It can also be parenthesized from right to left considering as the endomorphism of the dual of defined by the formula .
3 Birecurrent sets
In this section, we define birecurrent sets and prove some elementary properties. We prove a characterization (Theorem 3.2.1) which is used in the next section. A recognizable set is recurrent if its minimal automaton is strongly connected.
Clearly, a recognizable set is recurrent if and only if it recognized by a strongly connected deterministic automaton. Indeed, if is a strongly connected deterministic automaton recognizing , then the minimal automaton of is also strongly connected.
As a simple and well known example, for any recognizable prefix code , the submonoid generated by is recurrent.
A recognizable set is birecurrent if and its reversal are both recurrent.
Thus, for example, the submonoid generated by a bifix code is a birecurrent set.
As a related example, let us consider a reversible automaton, that is a deterministic automaton such that every letter defines an injective map on the set of states. This class of automata has been considered frequently (see for example [1], [9] or [8]). The reversal of a reversible automaton is still deterministic and thus the set recognized by a strongly connected reversible automaton is birecurrent.
The two examples of submonoids generated by bifix codes and of sets recognized by strongly connected reversible automata partially overlap. Indeed, if is a reversible automaton with a unique terminal state equal to the initial state, the set recognized by is a submonoid generated by a bifix code. However, there are many examples of submonoids generated by bifix codes which are not recognizable by a reversible automaton (see for instance the bifix code of Example 11).
We will see that there are many other examples of birecurrent sets.
We begin by recalling some basic and well-known facts concerning the reversal of an automaton.
3.1 Reversal of an automaton
Let be an automaton. The reversal of is the automaton obtained by reversing the edges of and exchanging and . Clearly, the reversal of recognizes the reversal of the set recognized by .
For an automaton , we denote by the * determinization* of . Its states are the nonempty sets I\cdot w=\{q\in Q\mid i\stackrel{{\scriptstyle w}}{{\rightarrow}}q\mbox{ for some i\in I}\}. The initial state is and the set of terminal states is the family of states of such that .
The deterministic reversal of is the automaton obtained by determinization of the reversal of . We denote by the set of states of . Thus is the family of nonempty sets of the form
[TABLE]
The set of terminal states of is . Clearly, recognizes the reversal of the set recognized by .
The following statement is well known (see [5] p. 48).
Proposition 3.1.1
If is a trim deterministic automaton recognizing , then is the minimal automaton of .
Proof.
Since is trim, for any word , one has if and only if . Moreover, for any , one has
[TABLE]
as one may easily verify. Since the nonempty sets are the reversals of the states of the minimal automaton of , the map is a bijection which identifies with the minimal automaton of . ∎
It follows from Proposition 3.1.1 that if is the minimal automaton of a recognizable set , then is birecurrent if and only if and are strongly connected.
3.2 A characterization of birecurrent sets
The following statement characterizes birecurrent sets. One direction appears as [7, Proposition 5.7].
We say that a set is saturated by a word if is a union of classes of the kernel of . Note that is saturated by if and only if for some .
Theorem 3.2.1
Let be a recurrent set and let be its minimal automaton. Then is birecurrent if and only if is saturated by a word of minimal nonzero rank.
Proof.
Let us first show that the condition is necessary. Let be a word of nonzero minimal rank such that . Since and since is strongly connected, there is a word such that . Thus is saturated by .
Conversely, set , and let be the [math]-minimal ideal of . Assume that is saturated by a word of minimal nonzero rank. Then . Let be a word such that . We have to show that there is a word such that . This will prove that is accessible from in , which implies that is strongly connected.
Since , we have . Since , the left ideal generated by is [math]-minimal. Thus is in the -class of . Let be a word such that . Since for all , the idempotent which is a power of is such that . Since is saturated by , there is a set such that . Then . Finally, let be such that and let . Then and the proof is complete. ∎
Note that if the minimal automaton of a birecurrent set is complete and synchronized, then is the trivial automaton with only one state and . This can of course be proved directly, but it follows easily from the characterization above. Indeed, in this case, is saturated by a word of minimal rank if and only if , which implies the conclusion .
Theorem 3.2.1 gives a method to find birecurrents sets starting with a stongly connected automaton with an uspecified set of terminal states. Provided one knows a word of minimal nonzero rank, one may choose as set of terminal states a set saturated by and finally minimize the resulting automaton.
However, this does not give a substantially more efficient algorithm than the computation of the deterministic reversal. Indeed, it is shown in [12] that deciding whether a partial automaton is strongly connected as well as its deterministic reversal is PSPACE-complete.
The following examples illustrate Theorem 3.2.1.
Example 3
The automaton represented in Figure 3.1 on the left and its deterministic reversal on the right are both strongly connected.
In agreement with Theorem 3.2.1, the set of terminal states of the first automaton is the preimage of , which has rank .
We give a second example with a minimal rank larger than .
Example 4
Consider the complete deterministic automaton given in Figure 3.2 on the left with its deterministic reversal represented on the right.
The [math]-minimal ideal of the monoid is represented in Figure 3.3. We can check that the set is a class of the kernel of , which is a word of minimal rank equal to .
In the last example, we show a simple case in which is a union of several classes of the kernel of a word of minimal nonzero rank.
Example 5
Let be the deterministic automaton on and with where is the circular permutation . Choosing and , we obtain for the automaton on the set of states on which acts again as a circular permutation. Thus is strongly connected (see Figure 3.4). Actually, this holds for any strongly connected group automaton.
3.3 Birecurrent sets of finite type
In this section, we define birecurrent sets of finite type and in the next section, we prove a statement which allows to build an infinite family of such sets (Theorem 3.4.2). We first prove the following elementary statement concerning recurrent sets.
Proposition 3.3.1
A set is recurrent if and only if there is a recognizable prefix code and a recognizable set of proper prefixes of such that .
Proof.
Assume first that is recurrent. Let be the minimal automaton of . Let be the prefix code generating the submonoid recognized by the automaton and let be the set of proper prefixes of such that . Then .
Conversely, let be the literal automaton of . Recall from Section 2 that is the set of proper prefixes of and that recognizes . Then is recognized by the automaton which is a strongly connected automaton. Moreover, since and are recognizable, is recognizable. Thus is recurrent. ∎
A pair as above is called a decomposition of the recurrent set . The prefix code obtained as above using the minimal automaton of is called the left root of .
Note that a recurrent set has in general several decompositions. However, the left root of is such that for any such pair , we have , that is the submonoid is maximal for this property. Indeed, let be the literal automaton of . Then recognizes and thus there is a reduction from onto the minimal automaton of which sends to . The image by this reduction of a path in from to is a path in from to and thus .
Proposition 3.3.2
The left root of a recurrent set is finite if and only if has a decomposition with finite.
Proof.
Let and be two decompositions of a recurrent set with the left root of . Assume that is finite.
Any word of is a prefix of a word in . Indeed, let and let be some element of . Then with and . Thus is a prefix of a word in . Since , this implies that is a prefix of a word in . This shows that the length of is bounded by the maximal length of the words in . Therefore is finite.
The other implication is clear. ∎
For a birecurrent set , we have
[TABLE]
where is the left root of and is a set of proper suffixes of . The suffix code is called the right root of . When is the submonoid generated by a bifix code , then is the left root of and is its right root.
We say that a birecurrent set is of finite type if its left and right roots are finite.
For example, for every finite bifix code , the set is a birecurrent set of finite type.
On the contrary, the birecurrent set of Example 3 is not of finite type (see Example 8).
We give two examples of birecurrent sets of finite type. The first one is Example 4. Recall from Section 2 that denotes the characteristic series of a set . Note that when is a decomposition of a recurrent set , we have . Indeed, since is a prefix code, we have for the submonoid and because the product is unambiguous (see [2]).
Example 6
The birecurrent set of Example 4 is of finite type. Indeed, its left root is the finite prefix code with and its right root is the finite suffix code . One has with . One has actually . This can be checked by comparing the two automata of Figure 3.2 or directly by observing that implies that , whence since .
The second example is Example 3.6.13 in [2].
Example 7
Let be the automaton given by Table 3.1 with and .
The minimal rank is equal to as one may check by computing the minimal images which are , , and . The set is a class of the kernel of which has image and thus minimal rank. The automaton is thus strongly connected by Theorem 3.2.1 and the set recognized by is birecurrent. The left root of is the finite maximal prefix code represented in Figure 3.5 as the set of leaves of a binary tree. In this figure, as in the following ones, we represent words on the alphabet by nodes of a binary tree. The edges going up are labeled , those going down are labeled .
The deterministic reversal of has the set of states represented in Table 3.2.
The transitions of are given in Table 3.3.
The reversal of the right root of is represented in Figure 3.6.
Since is finite, is a birecurrent set of finite type.
We give now two examples of birecurrent sets such that the left root is finite but the right root is infinite and thus which are not of finite type.
Example 8
Consider again the set recognized by the automaton of Figure 3.1 on the left with its deterministic reversal on the right (Example 3). The left root of is and thus it is finite. However, the right root of is and it is infinite. Note that the two factorizations of correspond to the identity
[TABLE]
which is itself a particular case of the sumstar identity .
In the second example, the birecurrent set is dense.
Example 9
Consider the finite maximal prefix code . The minimal automaton of is given by its transitions in Table 3.4 on the left with as initial and terminal state.
The word has minimal rank and its kernel is . Keeping the same initial state and taking , we obtain a deterministic automaton recognizing the set with . The set of states of is given in Table 3.5.
Its transitions are given in Table 3.4 on the right. The right root of is the maximal suffix code where is the submonoid recognized by with as initial and terminal state. Since, in , there is a loop , the code is infinite.
3.4 A construction of birecurrent sets of finite type
Examples 6 and 7 are particular cases of a general construction that we now describe. Let be a set of words. A word is said to be a pure square for if for some and if . The following result is [2, Exercise 14.1.9 (a)].
Proposition 3.4.1
If is a finite maximal prefix code and if is a pure square for , then, denoting and , the expression
[TABLE]
is the characteristic polynomial of a finite maximal prefix code. Moreover, the expression
[TABLE]
is the characteristic polynomial of a finite maximal code.
The prefix code defined by Equation (3.2) is denoted . The maximal code defined by Equation (3.3) is denoted . If is bifix, then is a suffix code since is the reversal of .
Theorem 3.4.2
Let be a finite maximal bifix code and let be a pure square for . The set is a dense birecurrent set of finite type.
Proof.
Set and . Observe that, by defintion of and ,
[TABLE]
Thus, multiplying on the left by and on the right by , we obtain . This shows that .
Since , the set is recurrent by Proposition 3.3.1 and its left root is finite by Proposition 3.3.2.
Similarly, since , the set is birecurrent. Since finite, the right root of is also finite. Thus is birecurrent of finite type.
Since is a maximal prefix code, the submonoid is right dense. Thus is right dense. ∎
Observe that if is the set of proper prefixes of and the set of proper prefixes of , the characteristic polynomial of the set of proper prefixes of is
[TABLE]
Indeed, since and are maximal prefix codes, we have , and . Thus Equations (3.2) and (3.4) are equivalent.
In the next two examples, we show that Examples 6 and 7 can be obtained by the contruction decribed in Theorem 3.4.2.
Example 10
Let be the birecurrent set of Example 4. We have seen in Example 6 that with and . We obtain as in Theorem 3.4.2 using and . Indeed, we have and thus Equation (3.4) gives
[TABLE]
which is the characteristic polynomial of the set of proper prefixes of .
Example 11
Let be the birecurrent set of Example 7. We start from the finite maximal bifix code
[TABLE]
which is represented in Figure 3.7 on the left with its reversal on the right.
The word with is a pure square for . We have and .
The set of proper prefixes of is and the set of proper prefixes of is .
The set is represented in Figure 3.8 on the left, the set is represented in the middle, and the set on the right (the white nodes being not in the set).
Finally, the set is represented in Figure 3.9 (with the nodes of in black and those of in red). By Equation (3.4), it is the set of proper prefixes of the maximal prefix code represented in Figure 3.5.
Theorem 3.4.2 gives an infinite family of examples of birecurrent sets of finite type which are not submonoids generated by a bifix code. Indeed, for any even , the word is a pure square for the bifix code .
3.5 Multiple factorizations of noncommutative polynomials
Let be a dense birecurrent set of finite type. Then where (resp. ) is a finite maximal prefix code (resp. a finite maximal suffix code). Since all products are non ambiguous, we have the equality
[TABLE]
Let be the set of proper prefixes of and let be the set of proper suffixes of . Then
[TABLE]
Combining Equations (3.5) and (3.6), we obtain
[TABLE]
We conjecture that, with the above notation there exist sets such that
[TABLE]
This is true when is generated by a maximal bifix code and when it is obtained by the construction of Section 3.4. Indeed, in this case, we have and by Equation (3.4)
[TABLE]
and
[TABLE]
where denote the set of proper prefixes of and denote the set of proper suffixes of . Thus with and with . Moreover .
A weak form of this conjecture is proved in [4, Theorem 3.1].
3.6 Degree of a dense birecurrent set
Let be a dense birecurrent set. We define its degree as the degree of its minimal automaton. Recall from Section 2 that the degree of a prefix code is, by definition, the degree of the minimal automaton of . Thus the degree of a dense birecurrent set is the degree of its left root (this is true even if the minimal automaton of is a quotient of the minimal automaton of , see [2, Proposition 9.5.2]).
Thus, when with a maximal bifix code, the degree of is the degree of .
Note that the degree of the right root may be different from the degree of the left root, and thus of the degree of the reversal, as shown in the following example.
Example 12
Let be the dense birecurrent set recognized by the automaton represented in Figure 3.10 on the left. The automaton is represented on the right.
The degree of is since the transition monoid of is the symmetric group on elements. But the degree of the right root of is since the transition monoid of the automaton is a representation of on elements.
Let be a dense birecurrent set of degree and let be its minimal automaton. By Theorem 3.2.1, the set is saturated by a word of minimal rank. The rank of is by definition the degree of . Let be the number of classes of the kernel of whose union is . We define the index of a dense birecurrent set, denoted , as the rational number .
Since , we have .
Note that the index does not depend on the choice of the word . Indeed, since is dense, any minimal image (that is the image of a word of minimal rank) is a set of representatives of any kernel of a word of minimal rank and thus is the number of elements of in each minimal image.
When with a maximal bifix code, the index of is the degree of .
Example 13
Let be the dense birecurrent set of Example 12. Since the degree of is , since is a group and since , we have .
Example 14
Let be the birecurrent set of Example 5. Then .
Proposition 3.6.1
Let be a dense birecurrent set and let be the minimal automaton of . Then for every -class of the minimal ideal of , one has
[TABLE]
Proof.
Set and let be the degree of . Let be a word of rank which saturates . Let be the -class of and let be the number of classes of the kernel of whose union is . Let be the common image of the elements of . Since is a system of representatives of the kernel of , it contains elements of . Let be the element of such that for every . Then, for every , one has if and only if . Thus the set is a union of cosets of the subgroup of fixing . Thus ∎
Note that, as a consequence, the index of a dense birecurrent set and of its reversal are the same (contrary to the degree). Indeed, the monoids and are antiisomorphic and antiisomorphic groups are isomorphic.
By Equation (3.9), the index of a birecurrent set is a measure of its size. We will make this idea more precise using probabilities. We begin with some definitions on Bernoulli distributions (see [2] for more details).
Let be a positive Bernoulli distribution on . By definition, is a morphism from into the interval such that . For , we denote by the (possibly infinite) sum .
For any code, one has and when is a thin maximal code, one has (see [2, Theorem 2.5.19]). Moreover, if is prefix, then we have where is the average length of and where is the set of proper prefixes of (see [2, Corollary 3.7.13]).
The density of a recognizable set , denoted is the Cesaro limit of the numbers ). Thus . By [2, Theorem 13.2.9], when is a thin maximal code, we have .
The following result shows that the density of a dense birecurrent set is a rational number, independent of the Bernoulli ditribution . This surprising property was first put in evidence for recognizable bifix codes in [14].
Theorem 3.6.2
The density of a dense birecurrent set is the inverse of its index.
Proof.
Let be a dense birecurrent set and let be its minimal automaton. Set and , and let be the minimal ideal of . Let where denotes the density. Then, by [2, Theorem 13.4.7], is a probability measure on the family of subsets of . Moreover, and for every , one has
[TABLE]
where is the -class of . Let denote the family of -classes of . Then, using Proposition 3.6.1,
[TABLE]
∎
Theorem 3.6.2 implies the following result for a dense birecurrent set of finite type.
Corollary 3.6.3
Let be a dense birecurrent set of finite type with left root . Then, for any positive Bernoulli distribution ,
[TABLE]
Proof.
Since , we have by [2, Proposition 13.2.5]. Since is a finite maximal prefix code, we have by [2, Theorem 13.2.9]. Thus, by Theorem 3.6.2 we have . ∎
Note that when Equation (3.8) holds, we have . The fact that is a rational number independant of is itself a consequence of the equation which implies by left Euclidean division for polynomial and some scalar and thus .
By a well-known result, for every and any finite alphabet, there is only a finite number of finite maximal bifix codes of degree on this alphabet (see [2, Theorem 6.5.2]). There is no similar property for dense birecurrent sets of finite type, as shown by the following example.
Example 15
For , let , let and let . Then is a birecurent set (the case is Example 4). Indeed, we have and thus . The degree of is for every because and thus the rank of is .
3.7 Indecomposable prefix codes
In this section, we relate birecurrent sets with the notion of decomposition of prefix codes.
A prefix code is indecomposable if , with a prefix code, implies or . Otherwise is said to be decomposable over (see [2] for a more detailed presentation).
If is decomposable over , let be a bijection of with an alphabet , extended to a morphism from onto . Then is a prefix code on the alphabet . We denote . The prefix code is thin maximal, if and only if and are also thin maximal prefix codes. Moreover, one has [2, Proposition 11.1.2]. In particular, is synchronized (that is, of degree ) if and only if and are synchronised.
We will prove the following result. It singles out two basic building blocks for recognizable maximal prefix codes: synchronized ones on the one hand, and left roots of dense birecurrent sets on the other. Note that no nontrivial prefix code (that is, distinct of the alphabet) can belong to both families.
Theorem 3.7.1
Let be a recognizable maximal prefix code. If is indecomposable, either is synchronized or it is the left root of a dense birecurrent set.
To prove Theorem 3.7.1, we introduce the following notion, which plays a role in the solution of the road coloring problem (see [2, Lemma 10.4.3]).
Let be a finite deterministic automaton. A pair of states is called synchronizable if there is a word such that . It is called strongly synchronizable if for every the pair is synchronizable.
We note that the equivalence on defined by if are strongly synchronizable is a stable equivalence. This means that if , then for every word .
Proposition 3.7.2
Let be a strongly connected and complete finite deterministic automaton. Two states of are strongly synchronizable if and only if for every word of minimal rank.
Proof.
The condition is necessary. Indeed, let be strongly synchronizable and let be a word of minimal rank. Let be a word such that . Since generates a minimal right ideal, there is a word such that . Thus .
Conversely, assume that the condition is satisfied. Let be a word of minimal rank. For every word , the word has minimal rank and thus . Thus are strongly synchronizable. ∎
Let be a recognizable maximal prefix code and let be the minimal automaton of . If is a stable equivalence on , then the set is a submonoid generated by a prefix code with . If , then must be the equality since is minimal. Thus, if is indecomposable, must be the equality.
Proof of Theorem 3.7.1. Assume that is not synchronized. Let be the minimal automaton of . Let be a word of minimal rank and let be a class of the kernel of . Let . The set recognized by is birecurrent by Theorem 3.2.1. Let us verify that is minimal. This will imply our conclusion since then is the left root of the birecurrent set .
We first observe that since is indecomposable, two strongly synchronizable states are equal. Indeed, this follows from the observation made above that the equivalence on defined by if are strongly synchronizable is a stable equivalence.
Let be two states such that for every word , if and only if . Let be a word. Since is strongly connected, there is some word such that . Then and thus . This shows that are strongly synchronizable. Since is indecomposable, this implies . Thus is minimal.
Theorem 3.7.1 is related with an old conjecture of Schützenberger asserting that if a finite maximal prefix code is indecomposable, either it is synchronized or it is bifix. The conjecture is not true as shown by the left root of the birecurrent set of Example 7 (which appeared originally in [6]). In fact is indecomposable (see below), is not bifix and not synchronized since its degree is .
Example 16
Let us show that the left root of the birecurrent set of Example 7 is indecomposable and, more generally, that if is a finite maximal bifix code of prime degree and is a pure square for , the prefix code given by Equation (3.2) is indecomposable (this is already proved in [6], but we reproduce the proof for convenience).
We first observe that for all . Indeed, one has as for any finite maximal bifix code of degree [2, Proposition 6.5.1]. Next, since , it is odd and thus we cannot have .
Let be a prefix code such that . Then, since is prime, one has for any , either or . Set . Since , either is synchronized or is synchronized. We show that or .
Assume first that is synchronized. Then has degree . If for some , then is synchronized (some power of is a synchronizing word). This is impossible since . Therefore for every and thus .
Assume next that is synchronized. Then, since has degree , for every . Fix some . By inspection of Equation 3.2, the suffixes of which are in are of length at most and the only proper prefixes of such that are and . This implies that the only proper prefixes of which possibly belong to are and . Indeed, if and , then and thus is multiple of . But if , we have since which is impossible since the integer such that is strictly less than . This forces and shows that is indecomposable.
Note that the statement is not true for since the code of Example 10 is decomposable.
We formulate the following open problem, as an attempt to replace the conjecture of Schützenberger by a weaker statement: if the prefix code in Theorem 3.7.1 is additionnaly finite, can one prove that either it is synchronized or it is the left root of a dense birecurrent set of finite type?
4 Complete reducibility
In this section, we develop the link between birecurrent sets and completely reducible sets. We begin with an introduction to formal series (see [3] or [13] for a more detailed presentation).
4.1 Recognizable series
We consider formal series with coefficients in the field . Recall from Section 2 that we denote by the ring of formal series with coefficients in and noncommutative variables in .
A series is recognizable (or equivalently, by Schützenberger’s theorem, rational) if there is a linear representation over a finite dimensional space recognizing it. There is a unique linear representation of minimal dimension recognizing a given recognizable series (up to the choice of a basis if the representation is given in matrix terms). One can compute it following three steps.
Start from any linear representation recognizing . 2. 2.
Take the representation obtained by taking and by restricting and to the subspace generated by the vectors for . 3. 3.
Take the representation obtained by taking and by restricting and to the subspace generated by the vectors for .
Example 17
Let . The linear representation
[TABLE]
recognizes the characteristic series of the set . It is minimal. Choosing , the representation recognizes . It is not minimal because . It is easy to see that the minimal representation of is of dimension .
The computation of the minimal representation of the characteristic series of a set is closely related to the computation of the minimal automaton of and of its reversal. It can be described as follows.
Consider the vector space as containing , identifying with its characteristic function. Let be the following linear representation on the space . Set . For , define as the endomorphism of such that . Finally set where is the linear form on which is the characteristic function of . Then obviously recognizes and we have performed the first step of the algorithm given above to compute the minimal representation. Step 2 does not change the representation since each state of the minimal automaton is accessible from the initial state. To perform Step 3, we compute the set of states of the reversal of . For , the vector is precisely . Thus, the minimal representation is the restriction of to the subspace of generated by the vectors for (considered as a column vector on which each acts on the left).
We illustrate this algorithm in the following examples. In the first one, the representation given by the minimal automaton is the minimal one.
Example 18
The minimal automaton of is represented in Figure 4.1.
The linear representation built from the automaton is that of Example 17. The states of are and . Their characteristic functions are linearly independent. Thus Step 3 does not modify the representation, which is minimal.
In the second example, the representation given by the minimal automaton is not minimal.
Example 19
Let be the bifix code represented in Figure 3.7 on the right (it is the unique maximal bifix code of degree with internal part ). The minimal automaton of is defined by its transitions given in Table 4.1.
The set of states of the reversal automaton are given in Table 4.2.
The vector space generated by the corresponding characteristic vectors has dimension because . Choosing the basis formed of the minimal representation of is
[TABLE]
Let us verify for example the value of the first column of . We have . Similarly, the last column of is computed as .
Let be a morphism. We let the endomorphisms of act on the left of the vectors in . A subspace of is said to be invariant with respect to if for every and , one has . The vector space is said to be irreducible with respect to if and if its only invariant subspaces are [math] and . Finally, it is said to be completely reducible with respect to if where each is an invariant irreducible subspace of . Let be the restriction of to . The representation is the direct sum of the representations which are called the irreducible components of .
We also say that a linear representation is completely reducible if the underlying vector space is completely reducible with respect to .
Example 20
The linear representation of Example 17 is completely reducible. Indeed, the subspaces generated respectively by and by are invariant and obviously irreducible. In the basis formed by these vectors, the representation takes the following equivalent form.
[TABLE]
For , we denote by the series defined by for every .
The syntactic space of a series , denoted is the vector space generated by the series for all . The syntactic representation of is the linear representation defined for and by
[TABLE]
The linear representation defined by the triple with and defined by recognizes . Indeed, one has for every ,
[TABLE]
It can be shown that the syntactic representation of a recognizable series is its minimal representation (see [3]).
If a series has a completely reducible representation, then its syntactic representation is also completely reducible.
Example 21
The syntactic space of the series has dimension and a basis is formed by and . The corresponding linear representation is given in Example 17.
4.2 Completely reducible sets
A set is completely reducible over if the syntactic representation of the series is completely reducible. Equivalently, its syntactic algebra is semi-simple [3, Chapter 12].
Example 22
The set is completely reducible since its syntactic space has dimension . The set is also completely reducible since its syntactic representation is completely reducible by Examples 20 and 21.
We give a second example, in which is the submonoid generated by a finite bifix code.
Example 23
Consider again the set of Example 19. We have found a minimal representation of dimension . This representation is not irreducible because the space generated by is invariant by (with acting on the left). The space generated by , and is a stable complement and the representation takes in the basis , , , the form of a direct sum of two representations of dimension and of dimension . In this basis, the vector becomes (its components are the values of the linear form defined by on each vector of the basis), and , , become
[TABLE]
The value of results from the computation of . The representations and are irreducible. This is obvious for . For it can be proved directly by verifying that the matrices generate the algebra of -matrices over or deduced as a consequence of [7, Theorem 2.2]).
Let be a recognizable set, let and let be its minimal automaton. Set , and . By [2, Proposition 14.7.1] for all , one has
[TABLE]
In particular, and are isomorphic.
An element of is a linear map from into itself and, as such, it has a kernel and an image which are subspaces of .
The eventual kernel of , denoted , is the intersection of the kernels of all elements of minimal nonzero rank of .
Symmetrically, the eventual range of , denoted , is the subspace spanned by the images of all elements of minimal nonzero rank of .
Both and are invariant subspaces of . Indeed, let and let . For any of minimal nonzero rank, is either zero or has minimal nonzero rank . In both cases . Thus . Similarly, let and let . Since , we have where denotes the set of elements of minimal nonzero rank in and . Then and thus .
Theorem 4.2.1
A recurrent set is completely reducible if and only if .
We first prove the following statement.
Proposition 4.2.2
Let be a recognizable set. If is completely reducible, then .
Proof.
Set and . Let be the set of elements of of nonzero minimal rank. Set . Since is invariant, and since is completely reducible, there is a complement of which is an invariant subspace of .
Let . Then, as for any element of , we have for some . Since , there exist and such that . Then
[TABLE]
Indeed, since , we have for every . This implies that , and finally . Therefore we conclude that . ∎
Proof of Theorem 4.2.1. Let be the minimal automaton of the recurrent set . Set , , and .
Assume first that the eventual kernel of is [math].
It is enough to prove the property under the hypothesis that contains the empty word. Indeed, Let be the set recognized by the automaton for some . Then is a set recognized by a strongly connected deterministic automaton and contains the empty word. Since is strongly connected, is a residual of and is a residual of . Thus the syntactic representations of and only differ by the choice of the initial vector. Thus is completely reducible whenever is and the eventual kernel of is also [math].
Since is strongly connected, the monoid has a unique [math]-minimal ideal which is a regular -class plus [math] (see Section 3). Let be such that is an idempotent of . We may assume that . Indeed, since is strongly connected, the state is, as any state of , in the image of some word of minimal nonzero rank. Since is regular -class, the -class of contains an idempotent which has the same image as . Since is idempotent, implies that . Since is isomorphic to , is an idempotent of .
Set . We verify that the conditions of [7, Corollary 1] are satisfied by and .
- (i)
The subspace is completely reducible with respect to the restriction of to . This is true by Maschke’s Theorem asserting that any linear representation of a finite group on a field with characteristic zero is completely reducible. Indeed, is a finite group or a finite group union zero. 2. (ii)
The space is generated by . Indeed implies . Thus contains which generates . 3. (iii)
We have . Indeed, if and only if is in the kernel of . Since is [math]-minimal, we have and thus the kernel of any element of minimal rank is equal to the kernel of some . Hence we have if and only if belongs to the intersection of the kernels of the elements of .
Thus we can conclude that is completely reducible.
Conversely, since is recurrent, we have and thus by Proposition 4.2.2.
We obtain as a corollary [7, Theorem 5.2], which is a generalization of the result of [10] asserting that the submonoid generated by a bifix code is completely reducible.
Corollary 4.2.3
Any recognizable birecurrent set is completely reducible.
Proof.
Let be the minimal automaton of the set . Set and let be the linear representation associated to the automaton . Thus . The minimal representation of is, as seen before, the restriction of to the subspace of column vectors generated by the vectors . We consider as a representation acting on the right on a supplementary subspace of the ortogonal of .
By Theorem 3.2.1, is saturated by a word of minimal nonzero rank. Let be the [math]-minimal ideal of the monoid . Then . Let be an idempotent of in the -class of and let be such that . Then is an idempotent. Since and belong to the same -class, and have the same kernel. Thus is a union of classes of the kernel of , which implies and thus .
Let be such that for every . Then, for any , we have since . But since , we have and thus for all . This shows that is orthogonal to , and thus that .
We conclude that and thus that is completely reducible by Theorem 4.2.1. ∎
Note the following precision on Theorem 4.2.1. Let be a finite monoid having a unique [math]-minimal ideal . Then all -classes of which are groups are isomorphic. The Suschkevitch group of is any of them (see [2] or [11]).
Proposition 4.2.4
Let be a recurrent completely reducible set, let and let . The number of irreducible constituents of is equal to the number of irreducible contituents of its restriction to the Suschkevitch group of .
Proof.
Let be a word such that is an idempotent of the [math]-minimal ideal of the monoid . Set . The restriction of to group is a representation of on . Moreover, by [7, Corollary 1], if is a decomposition of in irreducible subspaces, then is a decomposition of in irreducible subspaces. Thus the number of irreducible components of is equal to the number of irreducible components of the restriction of to . ∎
Note that, in particular, if the Suschkevitch group of is trivial, then is irreducible.
The following example shows that the hypothesis of Theorem 4.2.1 can be satisfied by a set which is not birecurrent.
Example 24
Consider again the strongly connected automaton of Figure 3.1 on the left with its deterministic reversal on the right (Example 3). We change the automaton into an automaton by choosing this time as set of terminal states. The automata and are represented in Figure 4.2. Let be the set recognized by . Since is not strongly connected, is not birecurrent.
The [math]-minimal ideal of the monoid is represented in Figure 4.3 on the left and the set of states of in Table 4.3.
Except , all states of are classes of the kernel of an element of minimal rank (see Figure 4.3). The space generated by the characteristic functions of these states has dimension , as for the same space corresponding to . Indeed is an element of and is in the space generated by the characteristic functions of the states of since . Thus is completely reducible althought it is not birecurrent.
We now prove a second corollary of Theorem 4.2.1 (actually of Proposition 4.2.2). It shows that a completely reducible set which is dense satisfies a property which is well-known for the submonoid generated by a maximal bifix code.
Corollary 4.2.5
Let be a set recognized by a strongly connected unambiguous finite automaton . If is completely reducible and dense, then meets every -class of the minimal ideal of the monoid .
Proof.
We use the same notation as in the proof of Theorem 4.2.1. Since is strongly connected and is dense, the monoid does not contain [math] and has a unique minimal ideal which is a union of groups. Moreover meets the minimal ideal of . Since is the image of by a morphism, it has also a unique minimal ideal which meets . Let be the minimal ideal of the monoid .
For a subset of , denote .
Note that for any -class of , . Indeed, let and let be such that . Then which is if and [math] otherwise.
Let be two -classes of of the monoid contained in the same -class . Then the vector belongs to the eventual range .
Moreover, it belongs to the eventual kernel . Indeed, for every , we have since both are equal to the -class . Thus, by Theorem 4.2.1, we have .
By Proposition 4.2.2, this forces for all -classes included in the same -class.
Since satisfies the same hypotheses as (using the automaton ), the conclusion follows. ∎
We give below an example illustrating Corollary 4.2.5
Example 25
Consider again the deterministic automaton of Example 9. We change the automaton into an automaton by choosing this time as set of terminal states instead of . Let be the set recognized by . The minimal ideal of the monoid is represented in Figure 4.4.
The set of states of the automaton is represented in Table 4.4.
Except , all states of are classes of the kernel of an element of minimal rank (see Figure 4.4). The space generated by the characteristic functions of these states has dimension , as for the same space corresponding to . Indeed is an element of and is in the space generated by the characteristic functions of the states of since . Thus is completely reducible althought it is not birecurrent. To see that is not birecurrent, one may either use the fact that is not saturated by any word of minimal rank and apply Theorem 3.2.1 or compute directly (see Figure 4.5).
Note that the hypothesis of Theorem 4.2.1 is satisfied because it is already satisfied by the minimal representation of , which is the same as that of except for the vectors and .
We finally deduce from Corollary 4.2.5 the following result, originally proved in [10] (see also [2, Theorem 14.7.7]).
Corollary 4.2.6
Let be a recognizable maximal code. If is completely reducible, then is bifix.
Proof.
Let be the minimal automaton of . Since is a recognizable code, it is thin by [2, Proposition 2.5.20]. Since is a thin maximal code, is dense by [2, Theorem 2.5.5]. As for any recognizable code, there is a trim unambiguous finite automaton recognizing (see [2, Proposition 4.1.2]). Since is trim with a unique initial state equal to the unique terminal state, it is strongly connected. By Corollary 4.2.5, meets every -class of the minimal ideal of . This implies that is recurrent and thus is prefix by [2, Proposition 3.3.11]. Symmetrically, is suffix, whence the conclusion. ∎
4.3 A characterization of completely reducible sets
In this section, we consider rational series over whose set of coefficients is finite. It is known that the minimal representation of such a series satisfies the following finiteness property: the matrix monoid is finite. This follows from a theorem of Schützenberger, see [3, Corollary 2.3].
Moreover, such a series is a linear combination over of characteristic series of rational languages. This follows from the fact that for any , the set is rational. For this, see [3, Theorem 2.10], another theorem of Schützenberger.
For a rational series whose set of coefficients is finite, we may construct a special kind of deterministic automaton, called an automaton with scalar output function. It is a deterministic automaton where is a mapping called the terminal function, which recognizes in the following sense. For each word , one has . In other words, one reads on the automaton, starting from the initial state and one reaches a state . The coefficient of in is . If there is no path from labeled , we let .
The notions of trim automaton and of minimal automaton extend easily to these automata.
For a word and a series , we denote here the series defined by (this series is denoted in Section 4.1) and symmetrically the series defined by .
There is also a Nerode criterium for these series. Indeed, a series is rational (and has a finite set of coefficients) if and only if the set is finite.
We say that a rational series with a finite number of coefficients is recurrent if its minimal automaton with scalar output function is strongly connected.
Clearly, a series is recurrent if and only if it is recognized by a strongly connected automaton with scalar output function. Moreover, if is recurrent, then so are and for any word , since they are recognized by the minimal automaton with output function of .
It is called birecurrent if and are both recurrent, where is the series such that for all .
Proposition 4.3.1
Let be a birecurrent series. Then is a linear combination over of characteristic series of birecurrent sets.
Proof.
Let I=\{\alpha\in\mathbb{Q}\setminus 0\mid(S,w)=\alpha\mbox{ for some w\in A^{*}}\}. Then is finite. It is enough to show that for any , the set is recurrent.
Let be a strongly connected automaton with scalar output function recognizing . Let be the automaton defined by . Then this automaton recognizes . It is deterministic and strongly connected and thus is recurrent. ∎
Proposition 4.3.2
Let be a completely reducible series. Then is a linear combination over of birecurrent series.
Proof.
Let be a minimal representation of . Since the set of coefficients of is finite, we know that is a finite monoid.
Since the representation is completely reducible, it is isomorphic to a direct sum of representations which are irreducible over .
It follows that is the corresponding sum of irreducible series. We may therefore assume that is irreducible. Note that is finite.
The set is finite. One obtains a right action of on this set by . There exists therefore a word such that the set is a minimal invariant subset for this action.
Similarly, acts on the left on the finite set by . There exists similarly a word such that the set is minimal for this action.
Consider the linear representation . It recognizes a rational series, call it , whose set of coefficients is finite since is finite.
For this series , we may construct the following deterministic automaton with scalar output function. Its set of states is , its initial state is and its terminal function is defined by for any state .
Clearly this automaton recognizes . Since its set of states is a minimal invariant subset for the right action of , this automaton is strongly connected. Thus is recurrent.
By symmetry, is also recurrent. Hence is birecurrent.
Turning back to the representation , there exists, since it is irreducible, polynomials such that and where the monoid morphism is extended to an algebra morphism still denoted from into .
Finally, for any word , one has
[TABLE]
Thus, since , we have , and therefore . Since is birecurrent, each is birecurrent. Hence is a linear combination of birecurrent series. ∎
Proposition 4.3.3
Let be a rational series whose set of coefficients is finite. Then is completely reducible if and only if it is a linear combination over of characteristic series of birecurrent sets.
Proof.
Suppose that is completely reducible. Then the conclusion follows by Propositions 4.3.1 and 4.3.2.
Conversely, suppose that is a linear combination over of characteristic series of birecurrent sets. By Corollary 4.2.3, each such series is completely reducible. By [7, Proposition 4.1], a linear combination over of completely reducible series is completely reducible. Thus is completely reducible. ∎
We thus obtain as a main result of this section.
Theorem 4.3.4
A language is completely reducible if and only if its characteristic series is -linear combination of characteristic series of birecurrent languages.
The following example shows that the linear combination need not have coefficients in .
Example 26
Let be the automaton represented in Figure 4.6 on the left with its reversal on the right.
Let be the set recognized by . Since is strongly connected, is recurrent. Since is not strongly connected, is not birecurrent. Let be the set recognized using, instead of , the set of terminal states for with , and . Since these sets are saturated respectively by , and , which have rank , the sets are birecurrent (this can also be seen easily in Figure 4.6). Since , we have . Note that, by Proposition 4.2.4, the linear representation associated to the automaton is irreducible. Indeed, this representation is clearly minimal and since the letters are of rank , the Suschkevitch group of the monoid is trivial. Thus we obtain a decomposition of an irreducible set as a -linear combination of birecurrent sets. We conjecture that cannot be obtained as a linear combination of birecurrent sets with coefficients in .
5 Unambiguous automata
In this section, we generalize some of the results concerning birecurrent sets using unambiguous automata, which are nondeterministic automata closely linked with linear representation of series. For an introduction to this class of automata, see [2].
5.1 Unambiguous automata and linear representations
An automaton is unambiguous if for any word there is at most one path from a state to a state labeled .
A trim unambiguous automaton has the following property (used in [2] as a definition of unambiguous automata). For every word and every pair of states there is at most one path from to labeled . Indeed, since is trim, there is a path with and a path with . Since there is at most one path from to labeled , there is at most one path from to labeled .
Clearly a deterministic automaton is unambiguous.
Example 27
The automaton represented in Figure 5.1 is unambiguous.
One can check this by computing the automaton of pairs and by checking that there is no path from a pair to a pair using a pair with .
To every unambiguous automaton recognizing a set , we may associate a linear representation recognizing the characteristic series of the set . We consider, as in Section 2 a field and the vector space . Consider the representation where is the characteristic function of considered as a row vector, considered as a column vector and for , is the -matrix defined by
[TABLE]
Then extends to a morphism from into and for every . When is trim, all the matrices for have coefficients [math] or .
Example 28
The linear representation corresponding to the unambiguous automaton of Figure 5.1 is
[TABLE]
For a set of states and a word , we denote
[TABLE]
Let be an unambiguous automaton. Recall that we denote by the determinization of . Thus the states of are the nonempty sets for . The initial state is and the set of terminal states is the set of such that . Note that since is unambiguous, contains at most one element and that and recognize the same set of words.
When is trim, the action of on the states of is the same as the right multiplication by the matrices on the characteristic vectors. Indeed, one has if and only if .
Also recall that we denote by the deterministic reversal of the automaton .
Example 29
Let be the unambiguous automaton of Figure 5.1. The automaton is represented in Figure 5.2 on the left and the automaton on the right.
The rank of a word with respect to an unambiguous automaton is the rank of the linear map . This definition is consistent with the one given for a deterministic automaton in Section 3. Indeed, when is deterministic, the matrix is the matrix of a partial map from into itself and the rank of is equal to the rank of the map. The definition of rank for an unambiguous automaton given in [2] is different but equivalent to this one (see [2, Exercise 9.3.2]).
Let be a trim unambiguous automaton. Then the monoid is formed of -matrices and thus it is finite. As for a deterministic automaton, the set of elements of minimal nonzero rank of the monoid is the unique [math]-minimal ideal of . It is the union of all [math]-minimal right (resp. left) ideals. It is formed of a regular -class, plus possibly [math]. Each -class of is formed of elements which have the same set of rows. Each -class of which is a group is a transitive permutation group on the common set of rows of its elements.
Example 30
Let be the automaton represented in Figure 5.1. The minimal ideal of the monoid is represented in Figure 5.3.
There is no word of rank [math] and the minimal rank is . For example, we have
[TABLE]
the first one being of rank and the second of rank . We represent for each -class of the set of rows of the elements (representing a row as a set of states) and for each -class its set of columns.
5.2 Unambiguous automata and birecurrent sets of finite type
The following result is a generalization of Theorem 3.2.1 which gives a sufficient condition for the set recognized by an unambiguous automaton to be recurrent. The proof is quite similar.
Theorem 5.2.1
Let be a strongly connected unambiguous finite automaton. The automaton is strongly connected if and only if there is a word of minimal nonzero rank such that .
Proof.
Assume first that is strongly connected. Let be a word of minimal nonzero rank. Since has nonzero rank there exist such that . Since is strongly connected, there is a word such that for some . Then is a word of minimal nonzero rank such that . Thus is a state of . Since is strongly connected, there is a word such that . Since has minimal nonzero rank, the conclusion follows.
Conversely, assume that with of minimal nonzero rank. Consider a state of . Then for some such that . Then and thus is a word of minimal nonzero rank. Set . Since the right ideal generated by is [math]-minimal, there is a word such that . Let with be the idempotent which is a power of . Since , we have . Then . Set . Then and thus . This shows that and belong to the same strongly connected component and thus that is strongly connected. ∎
A symmetric result holds for the automaton , which is strongly conected if and only if there is a word of minimal nonzero rank such that .
Theorem 3.2.1 follows easily from the symmetric version of Theorem 5.2.1. In fact, assume that is deterministic. A set is saturated by a word of minimal nonzero rank if and only if there is a word of minimal nonzero rank such that . Indeed, the condition is sufficient. Conversely, if for some and some of minimal nonzero rank, let be a word such that is a nonzero idempotent in the right ideal generated by . Then and thus .
Corollary 5.2.2
Let be a strongly connected unambiguous finite automaton recognizing a set . If there are words of [math]-minimal rank such that and , then is birecurrent.
Proof.
By Theorem 5.2.1, since , the automaton is strongly connected and conversely. Symmetrically, since , the automaton is strongly connected and conversely. ∎
Iterating the construction of Section 3.4, one obtains examples of birecurrent sets defined by unambiguous automata as in Corollary 5.2.2. The iteration relies on the following result from [15] (see [2, Exercise 14.1.9]) where we use again the notation and introduced after Proposition 3.4.1.
Proposition 5.2.3
Let be a finite maximal prefix code and let be a pure square for . Then is a pure square for and for . The sets and satisfy
[TABLE]
and the sets and satisfy
[TABLE]
We deduce from Proposition 5.2.3 the following result.
Theorem 5.2.4
Let be a finite maximal bifix code, let be a pure square for and let . Then is a birecurrent set of finite type.
Proof.
Set , , and . Since is a finite maximal prefix code, and are finite maximal prefix codes. Symmetrically, since is a finite maximal suffix code, and are finite maximal suffix codes.
Let be as in Proposition 5.2.3. We have by Equation (3.3)
[TABLE]
and by Equation (3.2)
[TABLE]
Thus
[TABLE]
Therefore . This shows that is recurrent with a finite left root. Similarly
[TABLE]
Since and , we have
[TABLE]
showing that
[TABLE]
This implies that is birecurrent and that its right root is finite. ∎
Example 31
Let be the nondeterministic automaton with transitions given in Table 5.1 with and .
One may verify that this automaton is unambiguous. The word has rank since the matrix has distinct nonzero rows which are the characteristic vectors of , and .
This rank is minimal as one can check by computing the images of this -element set by the action of the letters. Since is a row of , the hypothesis of Theorem 5.2.1 are satisfied. The automaton is strongly connected and has also states represented in Table 5.2.
The transitions of are represented in Table 5.3.
The set is a column of (the columns of index and ). Thus, by the symmetric of Theorem 5.2.1, the automaton is also strongly connected. Thus the set recognized by is birecurrent.
The set is an instance of Theorem 5.2.4. To see this, we start with the finite maximal bifix code represented in Figure 3.7 on the right. We choose the word which is a pure square for . Then is the maximal prefix code represented in Figure 3.6. The word is a pure square for and the set is a maximal code generating the set recognized by the automaton with as initial and terminal state.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Dana Angluin. Inference of reversible languages. J. Assoc. Comput. Mach. , 29(3):741–765, 1982.
- 2[2] Jean Berstel, Dominique Perrin, and Christophe Reutenauer. Codes and Automata . Cambridge University Press, 2009.
- 3[3] Jean Berstel and Christophe Reutenauer. Noncommutative rational series with applications , volume 137 of Encyclopedia of Mathematics and its Applications . Cambridge University Press, Cambridge, 2011.
- 4[4] Véronique Bruyère and Clelia De Felice. Synchronization and decomposability for a family of codes. Internat. J. Algebra Comput. , 2(4):367–393, 1992.
- 5[5] Samuel Eilenberg. Automata, languages, and machines. Vol. A . Academic Press [A subsidiary of Harcourt Brace Jovanovich, Publishers], New York, 1974. Pure and Applied Mathematics, Vol. 58.
- 6[6] Dominique Perrin. Codes asynchrones. Bull. Soc. Math. France , 105(4):385–404, 1977.
- 7[7] Dominique Perrin. Completely reducible sets. Internat. J. Algebra Comput. , 23(4):915–941, 2013.
- 8[8] Jean-Eric Pin. On reversible automata. In LATIN ’92 (São Paulo, 1992) , volume 583 of Lecture Notes in Comput. Sci. , pages 401–416. Springer, Berlin, 1992.
