Parikh-reducing Church-Rosser representations for some classes of regular languages

Tobias Walter

arXiv:1703.10056·cs.FL·March 30, 2017

TL;DR

This paper investigates Parikh-reducing Church-Rosser systems for specific regular language classes, providing finite representations and analyzing their complexity, especially for languages with abelian group syntactic monoids.

Contribution

It demonstrates the existence of finite Parikh-reducing Church-Rosser systems for certain regular languages and constructs monoid representations with abelian subgroups.

Findings

01

Existence of finite systems for languages with abelian group syntactic monoids

02

Construction of monoid representations with all subgroups abelian

03

Analysis of the complexity of these representations

Abstract

In this paper the concept of Parikh-reducing Church-Rosser systems is studied. It is shown that for two classes of regular languages there exist such systems which describe the languages using finitely many equivalence classes of the rewriting system. The two classes are: 1.) the class of all regular languages such that the syntactic monoid contains only abelian groups and 2.) the class of all group languages over a two-letter alphabet. The construction of the systems yields a monoid representation such that all subgroups are abelian. Additionally, the complexity of those representations is studied.

Equations

$$\left|a\right|_{c}=\begin{cases}1&\text{if }a=c,\\0&\text{else.}\end{cases}$$

$$\overline{\mathbf{H}}=\left\{M\mid\text{every group in }M\text{ is in }\mathbf{H}\right\}$$

$$T^{\prime}:=\left\{c\ell\to cr\mid\ell\to r\in T\right\}\subseteq A^{*}\times A^{*}$$

$$T_{\Delta}=\left\{\delta^{t+n}\to\delta^{t}\mid\delta\in\Delta\right\}$$

$$t_{k}=2t_{k-1}+t\leq\left|pfq\right|=\left|pq\right|+\left|f\right|\leq\left|pq\right|+t$$

$$\gamma(u)=c^{3n}\gamma_{a_{1}}(u)\cdots\gamma_{a_{s}}(u)\gamma_{c}(u).$$

$$t-7n=3n(s+2)\leq\left|\gamma(u)\right|<3n(s+2)+n=t-6n.$$

$$T_{\Omega}=\left\{\omega u\omega^{\prime}\to\omega\gamma(u)\omega^{\prime}\mid t\leq\left|\omega u\omega^{\prime}\right|\leq t_{\Omega}\text{ and }\omega,\omega^{\prime}\text{ are }\Omega\text{-maximal for }\omega u\omega^{\prime}\right\}$$

$$\left|v\right|=\left|\omega v^{\prime}\omega^{\prime}\right|\geq\left|\omega\right|+\left|\mu\gamma(u^{\prime})\right|+\left|\omega^{\prime}\right|>t+2n>t.$$

$$x\ell\underset{T}{\Longrightarrow}xr=x\omega\gamma(u)\omega^{\prime}=\delta^{t+n}z_{3}\gamma(u)\omega^{\prime}\underset{T}{\Longrightarrow}\delta^{t}z_{3}\gamma(u)\omega^{\prime}=z_{1}z_{2}z_{3}\gamma(u)\omega^{\prime}$$

$$\ell^{\prime}y\underset{T}{\Longrightarrow}r^{\prime}y=\delta^{t}y=z_{1}\omega u\omega^{\prime}\underset{T}{\Longrightarrow}z_{1}\omega\gamma(u)\omega^{\prime}=z_{1}z_{2}z_{3}\gamma(u)\omega^{\prime}$$

$$x\ell\underset{T}{\Longrightarrow}xr=x\omega\gamma(u)\omega^{\prime}=\mu u^{\prime}\mu^{\prime}z_{3}\gamma(u)\omega^{\prime}\underset{T}{\Longrightarrow}\mu\gamma(u^{\prime})\mu^{\prime}z_{3}\gamma(u)\omega^{\prime}=\mu\gamma(u^{\prime})z_{1}z_{2}z_{3}\gamma(u)\omega^{\prime}$$

$$\ell^{\prime}y\underset{T}{\Longrightarrow}r^{\prime}y=\mu\gamma(u^{\prime})\mu^{\prime}y=\mu\gamma(u^{\prime})z_{1}\omega u\omega^{\prime}\underset{T}{\Longrightarrow}\mu\gamma(u^{\prime})z_{1}\omega\gamma(u)\omega^{\prime}=\mu\gamma(u^{\prime})z_{1}z_{2}z_{3}\gamma(u)\omega^{\prime}$$

$$T^{\prime}=\left\{c\ell\to cr\in A^{*}\times A^{*}\mid\ell\to r\in T\right\}$$

$$T_{\Omega}=\left\{\omega u\omega^{\prime}\to\omega v_{\varphi(u)}\omega^{\prime}\mid t\leq\left|\omega u\omega^{\prime}\right|\leq t_{\Omega}\text{ and }\omega,\omega^{\prime}\text{ are }\Omega\text{-maximal for }\omega u\omega^{\prime}\right\}$$

$$\varphi|_{B^{*}}:B^{*}\to M$$

$$K=\mathrm{IRR}_{R}(B^{*})c.$$

$$\begin{aligned}\varphi(c\ell)&=\psi(u_{1}c)\circ\dots\circ\psi(u_{n}c)\\&=\psi(\ell)=\psi(r)\\&=\psi(v_{1}c)\circ\dots\circ\psi(v_{m}c)\\&=\varphi(cv_{1}c)\circ\dots\circ\varphi(cv_{m}c)=\varphi(cr).\end{aligned}$$

$$T=\left\{c\ell\to cr\mid\ell\to r\in T^{\prime}\right\}.$$

$$\left|A^{*}/S\right|\in 2^{2^{m^{O(n^{2})}}}.$$

$$\left|A^{*}/S\right|=\left|B^{*}/R\right|+\left|B^{*}/R\right|^{2}\cdot\left|K^{*}/T\right|\leq 2\left|B^{*}/R\right|^{2}\cdot\left|K^{*}/T\right|$$

$$\left|K^{*}/T\right|\leq\left|K\right|^{O(n^{2}m)\cdot 2^{\left|K\right|^{3n}}}.$$

$$2\left|K^{*}/T\right|\leq 2^{2^{m^{cn^{2}}}}.$$

$$\mathrm{ms}(n,m)=\max\left\{\mathrm{ms}(\varphi)\mid\varphi:A^{*}\to G,\ \left|A\right|\leq m,\ G\in\mathbf{Ab},\ \left|G\right|\leq n\right\}$$

$$\mathrm{ms}(n,m)\leq\mathrm{ms}(n,m-1)^{2}\cdot 2^{2^{m^{cn^{2}}}}$$

$$\begin{aligned}\mathrm{ms}(n,m)&\leq 2^{2^{(m-1)^{cn^{2}+2}+1}}\cdot 2^{2^{m^{cn^{2}}}}\\&=2^{2^{(m-1)^{cn^{2}+2}+1}+2^{m^{cn^{2}}}}\\&\leq 2^{2^{(m-1)^{cn^{2}+2}+1+m^{cn^{2}}}}\\&\leq 2^{2^{m^{cn^{2}+2}}}.\end{aligned}$$

Taxonomy

Topics: semigroups and automata theory · Coding theory and cryptography · Chemical Synthesis and Analysis

Full text

Parikh-reducing Church-Rosser representations for some classes of regular languages

Tobias Walter (supported by the German Research Foundation (DFG) under grant DI 435/6-1)

FMI

University of Stuttgart

1 Introduction

The class of Church-Rosser congruential languages was introduced by Narendran, McNaughton and Otto in 1988, see [Nar84, MNO88]. A language is Church-Rosser congruential if it is a finite union of equivalence classes of a finite length-reducing Church-Rosser rewriting system. It is natural to ask whether every regular language is Church-Rosser congruential. After some initial progress [Nie00, NW02, RT03, DKW12], this question was answered affirmatively, see [DKRW15]. The main idea of the solution in [DKRW15] is to prove a stronger statement: instead of proving that for every regular language there exists a length-reducing Church-Rosser system which saturates the language, it is proved that for every regular language and every weight function there exists a weight-reducing Church-Rosser system which saturates the language. The original problem is the special case in which the weight function is length. This result on regular languages became possible by utilizing the concept of local divisors. In this paper we use the same technique of local divisors to study a stronger property. Instead of requiring weight-reducing systems for a given weight, we ask whether for every regular language there exists a Church-Rosser system which saturates the language and is weight-reducing for every weight function. We call such a rewriting system a Parikh-reducing Church-Rosser system. Some of the initial progress already satisfied the Parikh-reducing condition, namely the construction for aperiodic languages [DKW12], for languages of polynomial density [Nie00] and for cyclic groups of order two [NW02]. Our result comprises these results. The main result is the following: for every language whose syntactic monoid contains only abelian groups there exists a Parikh-reducing Church-Rosser system which saturates the language. Moreover, all groups appearing in the corresponding Church-Rosser representation are abelian.
Furthermore, we show the existence of such Parikh-reducing systems for all group languages over a two-letter alphabet. Having established the existence of Parikh-reducing systems, we study the size of the resulting Church-Rosser representations. Naively analyzing the construction yields a non-primitive function for this size. We introduce an alphabet reduction technique which reduces the size of the resulting Church-Rosser representations to a quadruple exponential function. On the other side of the spectrum, we prove an exponential lower bound for cyclic groups.

2 Preliminaries

Words and Languages

An alphabet is a non-empty finite set $A$. An element $a\in A$ is called a letter. A (finite) word $w=a_{1}\cdots a_{n}$ is a finite concatenation of letters $a_{1},\ldots,a_{n}\in A$. The set of finite words with letters in $A$ is denoted by $A^{*}$. The empty word is denoted by $1$. The set of finite words $A^{*}$ forms a monoid with the concatenation operation, the free monoid. Let $\left\lVert\mathinner{\cdot}\right\rVert:A\to\mathbb{N}$ be a function with $\left\lVert\mathinner{a}\right\rVert>0$ for all $a\in A$. The unique homomorphism $A^{*}\to\mathbb{N}$ which extends $\left\lVert\mathinner{\cdot}\right\rVert$ is also denoted by $\left\lVert\mathinner{\cdot}\right\rVert$ and called a weight. A special weight is length $\left|\mathinner{\cdot}\right|:A^{*}\to\mathbb{N}$, which is induced by $\left|\mathinner{a}\right|=1$ for all $a\in A$. For a letter $c\in A$ we also define $\left|\mathinner{\cdot}\right|_{c}:A^{*}\to\mathbb{N}$ to be the homomorphism which is induced by

$$\left|\mathinner{a}\right|_{c}=\begin{cases}1&\text{if }a=c,\\0&\text{else.}\end{cases}$$

We set $A^{\leq n}=\left\{w\in A^{*}\mathrel{\left|\vphantom{w\in A^{*}}\vphantom{\left|\mathinner{w}\right|\leq n}\right.}\left|\mathinner{w}\right|\leq n\right\}$ to be the set of words of length at most $n$ .

A language $L$ is a subset of $A^{*}$. Let $\varphi:A^{*}\to M$ be a homomorphism to a finite monoid $M$. A language $L\subseteq A^{*}$ is recognized by $\varphi$ if $L=\varphi^{-1}(\varphi(L))$. A language $L$ is regular if it is recognized by some homomorphism to a finite monoid.
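The condition $L=\varphi^{-1}(\varphi(L))$ can be tested on all words up to a length bound. The following is a minimal sketch, not taken from the paper: the homomorphism `phi` (counting the letter `a` modulo 2), the two sample languages and the helper names are illustrative assumptions.

```python
from itertools import product

def words_up_to(alphabet, max_len):
    """All words over `alphabet` of length at most `max_len`."""
    for n in range(max_len + 1):
        for letters in product(alphabet, repeat=n):
            yield "".join(letters)

def phi(w):
    """Hypothetical homomorphism {a,b}* -> Z/2Z: number of a's modulo 2."""
    return w.count("a") % 2

def is_recognized(in_language, hom, alphabet, max_len):
    """Check L = hom^{-1}(hom(L)), restricted to words of length <= max_len."""
    words = list(words_up_to(alphabet, max_len))
    image = {hom(w) for w in words if in_language(w)}
    return all(in_language(w) == (hom(w) in image) for w in words)

def even_a(w):
    """Words with an even number of a's (a union of phi-classes)."""
    return w.count("a") % 2 == 0

def starts_with_a(w):
    """Words starting with a (not a union of phi-classes)."""
    return w.startswith("a")

print(is_recognized(even_a, phi, "ab", 6))         # True
print(is_recognized(starts_with_a, phi, "ab", 6))  # False
```

The bounded check can only refute recognition or support it up to the chosen length; it is not a proof for all words.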

Algebra

We want to study subclasses of regular languages which are characterized by special classes of monoids. A variety $\mathbf{V}$ is a class of finite monoids which is closed under taking submonoids, homomorphic images and finite direct products. In particular, taking the empty direct product, every variety contains the trivial monoid. A variety which contains only groups is called a variety of groups. We assign to every variety $\mathbf{V}$ a corresponding language class $\mathbf{V}(A^{*})$ such that $L\in\mathbf{V}(A^{*})$ if and only if there exists a monoid $M\in\mathbf{V}$ and a homomorphism $\varphi:A^{*}\to M$ that recognizes $L$. Examples of such varieties include the variety $\mathbf{G}$ of all groups and the variety $\mathbf{Ab}$ of all abelian groups.

Let $\mathbf{H}$ be a variety of finite groups. We define

$$\overline{\mathbf{H}}=\left\{M\mid\text{every group in }M\text{ is in }\mathbf{H}\right\}$$

to be the maximal class of monoids whose subsemigroups, which are groups, are in $\mathbf{H}$. It turns out that $\overline{\mathbf{H}}$ is the maximal variety such that $\overline{\mathbf{H}}\cap\mathbf{G}=\mathbf{H}$, see [Eil76, Proposition V.10.4]. Our main result is concerned with the language class $\overline{\mathbf{Ab}}(A^{*})$.

An important concept used in this paper is that of local divisors. Let $M$ be a monoid and $c\in M$. We set $M_{c}=cM\cap Mc$ and introduce a multiplication $\circ$ on $M_{c}$ given by $uc\circ cv=ucv$. Since $uc\in cM$ and $cv\in Mc$, the result of $uc\circ cv$ is in $M_{c}$. The structure $(M_{c},\circ,c)$ forms a monoid, the local divisor of $M$ at $c$. Indeed, $M_{c}$ is a divisor of $M$, that is, a homomorphic image of a submonoid of $M$, see [DK15]. If $c\in M$ is not a unit, then $\left|\mathinner{M_{c}}\right|<\left|\mathinner{M}\right|$ since $1\not\in cM\cap Mc$.
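The local divisor construction can be tried out on a small monoid. The sketch below is an illustration under assumptions of our own: the monoid is the full transformation monoid on three points, the element `c` is an arbitrarily chosen non-unit, and the function names are hypothetical.

```python
from itertools import product

def compose(f, g):
    """Product f·g of transformations given as tuples: apply f first, then g."""
    return tuple(g[f[i]] for i in range(len(f)))

def local_divisor(M, mul, c):
    """Return the carrier set M_c = cM ∩ Mc and the multiplication ∘ on it."""
    cM = {mul(c, m) for m in M}
    Mc = {mul(m, c) for m in M}
    carrier = cM & Mc

    def circ(x, y):
        # Write y = c·v; then x ∘ y = u·c·v = x·v for any u with x = u·c.
        results = {mul(x, v) for v in M if mul(c, v) == y}
        assert len(results) == 1, "result must not depend on the choice of v"
        return results.pop()

    return carrier, circ

# Full transformation monoid on {0, 1, 2}; its identity is (0, 1, 2).
M = set(product(range(3), repeat=3))
c = (0, 0, 2)                      # not a unit, since it is not a permutation
M_c, circ = local_divisor(M, compose, c)

print(len(M), len(M_c))            # 27 4
```

In this example $c$ is the neutral element of $(M_{c},\circ)$ and $M_{c}$ is strictly smaller than $M$, as stated above for non-units.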

Combinatorics on Words

Let $x=uvw\in A^{*}$ be a word. Then we call $u$ a prefix, $v$ a factor and $w$ a suffix of $x$ . The factor $v$ is proper if $u$ and $w$ are not empty. The set of factors is given by $\mathrm{Factors}(w)=\left\{u\mathrel{\left|\vphantom{u}\vphantom{u\text{ is a factor of }w}\right.}u\text{ is a factor of }w\right\}$ . The word $a_{1}\cdots a_{n}$ , with $a_{i}\in A$ , is a subword of a word $u$ if $u\in A^{*}a_{1}A^{*}\cdots A^{*}a_{n}A^{*}$ . The word $u$ is a power of the word $v$ if $u=v^{i}$ for some $i\in\mathbb{N}$ . Let $w=a_{1}\cdots a_{n}\in A^{*}$ be a word with $a_{i}\in A$ letters. We say that $p\in\mathbb{N}$ is a period of $w$ if $a_{i}=a_{i+p}$ for all $1\leq i\leq n-p$ . The theorem of Fine and Wilf describes an important property of periods.

Theorem 2.1 (Fine and Wilf, [FW65]).

Let $p,q$ be periods of some word $w$ . If $\left|\mathinner{w}\right|\geq p+q-\gcd(p,q)$ , then $\gcd(p,q)$ is a period of $w$ .

A word $u$ is called primitive if it is only a power of itself, that is, if $u=v^{i}$ with $i\geq 1$ implies $i=1$ . The following well-known characterization of primitive words will be useful.

Lemma 2.2.

A word $u\in A^{*}$ is primitive if and only if $u$ is not a proper factor of $u^{2}$ .
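Both statements are easy to check exhaustively on short words. The following sketch is only an illustration (function names are our own, and the empty word is excluded from the primitivity checks): it implements periods and primitivity by definition, the test from Lemma 2.2, and a brute-force check of the conclusion of Theorem 2.1.

```python
from itertools import product
from math import gcd

def is_period(w, p):
    """p >= 1 is a period of w if w[i] == w[i + p] for all valid positions i."""
    return p >= 1 and all(w[i] == w[i + p] for i in range(len(w) - p))

def is_primitive(u):
    """By definition: a nonempty u is primitive if u = v^i implies i = 1."""
    n = len(u)
    if n == 0:
        return False
    return not any(n % d == 0 and u[:d] * (n // d) == u for d in range(1, n))

def is_primitive_via_square(u):
    """Lemma 2.2: a nonempty u is primitive iff u does not occur strictly inside uu."""
    return len(u) > 0 and (u + u).find(u, 1) == len(u)

def fine_wilf_holds(w):
    """Check the conclusion of Theorem 2.1 for all pairs of periods of w."""
    n = len(w)
    periods = [p for p in range(1, n + 1) if is_period(w, p)]
    return all(is_period(w, gcd(p, q))
               for p in periods for q in periods
               if n >= p + q - gcd(p, q))

words = ["".join(t) for k in range(1, 9) for t in product("ab", repeat=k)]
print(all(is_primitive(u) == is_primitive_via_square(u) for u in words))  # True
print(all(fine_wilf_holds(w) for w in words))                             # True
```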

Rewriting systems

A semi-Thue system $S$ over the alphabet $A$ is a finite subset of $A^{*}\times A^{*}$. An element $(\ell,r)\in S$ is called a rule, where $\ell$ is the left side and $r$ is the right side of the rule. The idea of a semi-Thue system is that left sides of rules can be replaced by the corresponding right sides. Thus, one often also calls a semi-Thue system a rewriting system. For a semi-Thue system $S$ we define the rewriting relation $\underset{S}{\Longrightarrow}$ given by $u_{1}\ell u_{2}\underset{S}{\Longrightarrow}u_{1}ru_{2}\text{ for }u_{1},u_{2}\in A^{*}\text{ and }(\ell,r)\in S$, that is, $u\underset{S}{\Longrightarrow}v$ if $v$ results from $u$ by replacing the left side of a rule with the right side. The reflexive transitive closure of $\underset{S}{\Longrightarrow}$ is denoted by $\overset{*}{\underset{S}{\Longrightarrow}}$ and the reflexive, symmetric and transitive closure of $\underset{S}{\Longrightarrow}$ is denoted by $\overset{*}{\underset{S}{\Longleftrightarrow}}$. We write $v\underset{S}{\Longleftarrow}u$ for $u\underset{S}{\Longrightarrow}v$. A semi-Thue system $S$ is confluent or Church-Rosser, if $u\overset{*}{\underset{S}{\Longrightarrow}}v_{1}$ and $u\overset{*}{\underset{S}{\Longrightarrow}}v_{2}$ imply that there exists a word $w\in A^{*}$ such that $v_{1}\overset{*}{\underset{S}{\Longrightarrow}}w$ and $v_{2}\overset{*}{\underset{S}{\Longrightarrow}}w$. It is locally confluent, if $u\underset{S}{\Longrightarrow}v_{1}$ and $u\underset{S}{\Longrightarrow}v_{2}$ imply that there exists a word $w\in A^{*}$ such that $v_{1}\overset{*}{\underset{S}{\Longrightarrow}}w$ and $v_{2}\overset{*}{\underset{S}{\Longrightarrow}}w$.
It is weight-reducing for a weighted alphabet $(A,\left\lVert\mathinner{\cdot}\right\rVert)$, if $\left\lVert\mathinner{\ell}\right\rVert>\left\lVert\mathinner{r}\right\rVert$ for all rules $(\ell,r)\in S$, and it is Parikh-reducing, if for all $a\in A$ and all rules $(\ell,r)\in S$ we have $\left|\mathinner{\ell}\right|_{a}\geq\left|\mathinner{r}\right|_{a}$ and for all rules $(\ell,r)\in S$ there exists a letter $a\in A$ such that $\left|\mathinner{\ell}\right|_{a}>\left|\mathinner{r}\right|_{a}$. Furthermore, $S$ is subword-reducing, if $r\neq\ell$ and $r$ is a subword of $\ell$ for each rule $(\ell,r)\in S$.

The notion Parikh-reducing comes from the connection to Parikh images. A Parikh image of a word $w\in A^{*}$ is the vector $(\left|\mathinner{w}\right|_{a})_{a\in A}$ . A semi-Thue system $S$ is Parikh-reducing if and only if the Parikh image $(\left|\mathinner{r}\right|_{a})_{a\in A}$ is smaller than $(\left|\mathinner{\ell}\right|_{a})_{a\in A}$ for every rule $(\ell,r)\in S$ . By definition every subword-reducing system is Parikh-reducing. Further, it is rather easy to see that a semi-Thue system $S\subseteq A^{*}\times A^{*}$ is Parikh-reducing if and only if it is weight-reducing for every weight $\left\lVert\mathinner{\cdot}\right\rVert:A^{*}\to\mathbb{N}$ .
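The three reduction properties can be compared on small rule sets. The sketch below is illustrative only; the rule sets and function names are assumptions chosen for the example, and rules are represented as pairs of Python strings.

```python
from collections import Counter

def parikh(w):
    """Parikh image of w, stored as a multiset of letters."""
    return Counter(w)

def is_parikh_reducing(rules, alphabet):
    """No letter count increases, and for every rule some count strictly decreases."""
    for l, r in rules:
        pl, pr = parikh(l), parikh(r)
        if any(pl[a] < pr[a] for a in alphabet):
            return False
        if not any(pl[a] > pr[a] for a in alphabet):
            return False
    return True

def is_subword(r, l):
    """True if r is a (scattered) subword of l."""
    it = iter(l)
    return all(ch in it for ch in r)

def is_subword_reducing(rules):
    return all(r != l and is_subword(r, l) for l, r in rules)

def is_weight_reducing(rules, weight):
    """`weight` maps each letter to a positive integer."""
    total = lambda word: sum(weight[a] for a in word)
    return all(total(l) > total(r) for l, r in rules)

rules_1 = [("aab", "ab"), ("abc", "ca")]
rules_2 = [("abb", "cc")]

print(is_parikh_reducing(rules_1, "abc"), is_subword_reducing(rules_1))  # True False
print(is_parikh_reducing(rules_2, "abc"))                                # False
print(is_weight_reducing(rules_2, {"a": 1, "b": 1, "c": 1}))             # True
print(is_weight_reducing(rules_2, {"a": 1, "b": 1, "c": 2}))             # False
```

The second rule set is length-reducing but fails for a weight that makes `c` heavy, in line with the characterization of Parikh-reducing systems as those that are weight-reducing for every weight.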

A classical lemma states that $S$ is confluent if it is Parikh-reducing and locally confluent, see [BO93]. In the following we study different cases which may occur when checking for local confluence. Let $(\ell,r),(\ell^{\prime},r^{\prime})\in S$ be two rules and consider the word $u\ell v\ell^{\prime}w$ . Then

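$$u\ell v\ell^{\prime}w\underset{S}{\Longrightarrow}urv\ell^{\prime}w\underset{S}{\Longrightarrow}urvr^{\prime}w\quad\text{and}\quad u\ell v\ell^{\prime}w\underset{S}{\Longrightarrow}u\ell vr^{\prime}w\underset{S}{\Longrightarrow}urvr^{\prime}w.$$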

Thus, checking for local confluence in this case is trivial. The only non-trivial cases appear when two rules overlap. There are two different kinds of overlaps:

1. $w=x\ell=\ell^{\prime}y$,

2. $w=\ell=x\ell^{\prime}y$

for rules $(\ell,r),(\ell^{\prime},r^{\prime})\in S$ . The resulting pairs $(xr,r^{\prime}y)$ and $(r,xr^{\prime}y)$ are called critical pairs. The first kind is called overlap critical and the second kind is called factor critical, see also Figure 1.

We say that a critical pair $(u,v)$ resolves if there exists a word $w\in A^{*}$ such that $u\overset{*}{\underset{S}{\Longrightarrow}}w\overset{*}{\underset{S}{\Longleftarrow}}v$ holds. Summarized, we obtain the following:

Lemma 2.3 ([KB70]).

A semi-Thue system is locally confluent if and only if all its critical pairs resolve.

Lemma 2.3 will be used without explicitly referring to it.
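The critical pair test can be sketched for rules given as pairs of strings. This is an illustration rather than the paper's procedure: it assumes nonempty left sides and a terminating (for instance Parikh-reducing) system, so that every word has finitely many descendants, and all function names are our own.

```python
def critical_pairs(rules):
    """All critical pairs of a string rewriting system given as (left, right) pairs."""
    pairs = []
    for l, r in rules:
        for l2, r2 in rules:
            # Overlap critical: w = x l = l2 y, where a proper nonempty suffix
            # of l2 is a proper prefix of l.
            for k in range(1, min(len(l), len(l2))):
                if l2[-k:] == l[:k]:
                    x, y = l2[:-k], l[k:]
                    pairs.append((x + r, r2 + y))
            # Factor critical: l = x l2 y (the trivial case of one rule is skipped).
            i = l.find(l2)
            while i != -1:
                x, y = l[:i], l[i + len(l2):]
                if (l, r) != (l2, r2):
                    pairs.append((r, x + r2 + y))
                i = l.find(l2, i + 1)
    return pairs

def descendants(word, rules):
    """All words reachable from `word`; finite because the system terminates."""
    seen, todo = {word}, [word]
    while todo:
        w = todo.pop()
        for l, r in rules:
            i = w.find(l)
            while i != -1:
                v = w[:i] + r + w[i + len(l):]
                if v not in seen:
                    seen.add(v)
                    todo.append(v)
                i = w.find(l, i + 1)
    return seen

def is_locally_confluent(rules):
    """Lemma 2.3: every critical pair must have a common descendant."""
    return all(descendants(u, rules) & descendants(v, rules)
               for u, v in critical_pairs(rules))

print(is_locally_confluent([("ccc", "")]))              # True
print(is_locally_confluent([("ab", "a"), ("ba", "b")])) # False
```

In the second system the word `bab` rewrites to both `ba` and `bb`, which have no common descendant.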

A word $w$ is irreducible in $S$ if no left-side of a rule in $S$ appears in $w$ . We denote the set of irreducible elements of $S$ by $\mathrm{IRR}_{S}(A^{*})$ . The relation $\overset{*}{\underset{S}{\Longleftrightarrow}}$ is a congruence on $A^{*}$ . Thus, one can consider the monoid $A^{*}\!/S=A^{*}\!/\!\!\overset{*}{\underset{S}{\Longleftrightarrow}}$ . The elements of $A^{*}\!/S$ are equivalence classes $[u]_{S}=\left\{v\in A^{*}\mathrel{\left|\vphantom{v\in A^{*}}\vphantom{u\overset{*}{\underset{S}{\Longleftrightarrow}}v}\right.}u\overset{*}{\underset{S}{\Longleftrightarrow}}v\right\}$ of the congruence $\overset{*}{\underset{S}{\Longleftrightarrow}}$ . The number of elements in $A^{*}\!/S$ is called index of $S$ . If $S$ is Parikh-reducing and (locally) confluent, then there is a bijection between $A^{*}\!/S$ and $\mathrm{IRR}_{S}(A^{*})$ . In this case, we denote elements of the monoid $A^{*}\!/S$ with the corresponding irreducible words. In fact, we call a locally confluent Parikh-reducing system a Parikh-reducing Church-Rosser system. Let $\varphi:A^{*}\to M$ be a homomorphism and $S\subseteq A^{*}\times A^{*}$ be a semi-Thue system. We say that $\varphi$ factorizes through $S$ if for all $u\underset{S}{\Longrightarrow}v$ it holds $\varphi(u)=\varphi(v)$ , that is, equivalence classes of $S$ map to the same element in $M$ . We also say that $S$ is $\varphi$ -invariant if $\varphi$ factorizes through $S$ . This notion is algebraically motivated. Let $S$ be a semi-Thue system such that $\varphi$ factorizes through $S$ , then $\psi:A^{*}\!/S\to\varphi(A^{*})$ given by $\psi([u]_{S})=\varphi(u)$ is a well-defined homomorphism. Let $\pi_{S}:A^{*}\to A^{*}\!/S$ be the natural projection and $L$ be some language which is recognized by $\varphi$ and $\pi_{L}$ be the syntactic homomorphism of $L$ . Then we obtain the situation in Figure 2. In particular, $\pi_{S}$ recognizes $L$ .

Since $\varphi$ factorizes through $S$ if and only if $\varphi:A^{*}\to\varphi(A^{*})$ factorizes through $S$ , we may assume that $\varphi$ is surjective. If further $S$ is a Church-Rosser system, we call $A^{*}\!/S$ a Church-Rosser representation of $\varphi$ (or $M$ ).
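For a concrete feeling of irreducible words, index and $\varphi$-invariance, consider the one-letter system $\{c^{5}\to c^{2}\}$ together with the homomorphism $c\mapsto 1\in\mathbb{Z}/3\mathbb{Z}$. This example and the helper names below are our own assumptions and are not taken from the paper.

```python
def is_irreducible(w, rules):
    """No left side of a rule occurs as a factor of w."""
    return not any(l in w for l, _ in rules)

def irreducible_words(alphabet, rules):
    """Enumerate the irreducible words by length; terminates iff there are finitely many."""
    result, layer = [], [""]
    while layer:
        result.extend(layer)
        # Irreducible words are closed under factors, so every longer
        # irreducible word extends a shorter one by a single letter.
        layer = [w + a for w in layer for a in alphabet
                 if is_irreducible(w + a, rules)]
    return result

def is_invariant(phi_letter, n, rules):
    """phi: A* -> Z/nZ is given on letters; invariance only has to be checked on rules."""
    phi = lambda w: sum(phi_letter[a] for a in w) % n
    return all(phi(l) == phi(r) for l, r in rules)

S = [("c" * 5, "c" * 2)]
irr = irreducible_words("c", S)
print(irr)                            # ['', 'c', 'cc', 'ccc', 'cccc']
print(is_invariant({"c": 1}, 3, S))   # True
```

Here the system has index 5 although the recognized group has only 3 elements, so a Church-Rosser representation may be larger than the monoid it represents.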

3 Parikh-reducing Church-Rosser systems

3.1 Outline

In this subsection we give an outline of the proof strategy which will be used in Theorem 3.2. The macro structure of the proof is as follows: Given a homomorphism $\varphi:A^{*}\to G$, we construct a system $S$ which is $\varphi$-invariant by induction on the size of $A$. The construction is based on the following lemma:

Lemma 3.1 ([DKW12, DKRW15]).

Let $A$ be an alphabet of size at least two, $\varphi:A^{*}\to M$ be a homomorphism and $B=A\setminus\left\{\mathinner{c}\right\}$ for some $c\in A$ . Assume that $R\subseteq B^{*}\times B^{*}$ is a Parikh-reducing Church-Rosser system of finite index which is $\varphi$ -invariant. Let $K=\mathrm{IRR}_{R}(B^{*})c$ be a new alphabet and $T\subseteq K^{*}\times K^{*}$ be a Parikh-reducing Church-Rosser system of finite index such that

$$T^{\prime}:=\left\{c\ell\to cr\mid\ell\to r\in T\right\}\subseteq A^{*}\times A^{*}$$

is $\varphi$ -invariant. Then

a) $S=R\cup T^{\prime}\subseteq A^{*}\times A^{*}$ is a $\varphi$-invariant Parikh-reducing Church-Rosser system of finite index.

b) All groups in $A^{*}\!/S$ are contained in $B^{*}\!/R$ or in $K^{*}\!/T$.

c) The index of $S$ is $\left|\mathinner{B^{*}\!/R}\right|+\left|\mathinner{B^{*}\!/R}\right|^{2}\left|\mathinner{K^{*}\!/T}\right|$.

Proof.

a) is proved in [DKRW15]. By [DKW12], $A^{*}\!/S$ is a so-called Rees extension monoid and the statement of b) follows from general properties of Rees extension monoids, see [AK16].

It remains to calculate the index of $S$. Every irreducible word in $S$ which contains no $c$ is contained in $B^{*}\!/R$. Conversely, every element of $B^{*}\!/R$ is irreducible in the rewriting system given by $S$. Every irreducible word in $S$ which contains at least one $c$ is of the form $ucvw$ for $u,w\in B^{*}\!/R$ and $v\in K^{*}\!/T$. By the definition of the rule set $S$ every such word $ucvw$ is also irreducible. This shows that there are exactly $\left|\mathinner{B^{*}\!/R}\right|^{2}\left|\mathinner{K^{*}\!/T}\right|$ irreducible words in $S$ which contain at least one $c$. ∎
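The index formula of Lemma 3.1 c) can be compared with a brute-force count on a toy instance. Everything specific below is an assumption made for illustration: $B=\{a\}$, $R=\{aa\to 1\}$, hence $K=\{c,ac\}$ (written `x`, `y`), and a small hypothetical system `T_abs` over $K$ in which `x` acts as a zero and `y` is idempotent.

```python
def irreducible_words(alphabet, rules):
    """Enumerate all irreducible words (assumes there are finitely many)."""
    result, layer = [], [""]
    while layer:
        result.extend(layer)
        layer = [w + a for w in layer for a in alphabet
                 if not any(l in w + a for l, _ in rules)]
    return result

def lift_rules(T, c="c"):
    """T' = { c l -> c r | l -> r in T }, with K-words spelled out over A."""
    return [(c + l, c + r) for l, r in T]

def index_formula(index_R, index_T):
    """Index of S = R ∪ T' according to Lemma 3.1 c)."""
    return index_R + index_R ** 2 * index_T

R = [("aa", "")]                                   # over B = {a}
spell = {"x": "c", "y": "ac"}                      # letters of K as words over A
T_abs = [("xx", "x"), ("xy", "x"), ("yx", "x"), ("yy", "y")]
T = [("".join(spell[k] for k in l), "".join(spell[k] for k in r)) for l, r in T_abs]
S = R + lift_rules(T)

index_R = len(irreducible_words("a", R))           # 2
index_T = len(irreducible_words("xy", T_abs))      # 3
index_S = len(irreducible_words("ac", S))          # 14
print(index_S, index_formula(index_R, index_T))    # 14 14
```

The leading `c` in the lifted rules matters: without it, a left side such as `cc` would also match inside `acc`, which as a word over $K$ is `yx` and contains no factor `xx`.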

For a fixed letter $c\in A$ we remove $c$ and obtain the alphabet $B=A\setminus\left\{\mathinner{c}\right\}$. Inductively, one obtains a system $R\subseteq B^{*}\times B^{*}$ which factorizes through $\varphi$. Now, consider a new alphabet $K=\mathrm{IRR}_{R}(B^{*})c$. By Lemma 3.1, it remains to construct a system $T\subseteq K^{*}\times K^{*}$. The system $T$ contains two kinds of rules: $\Delta$-rules and $\Omega$-rules. The idea of these rules is to deal with different kinds of words. The set $T_{\Delta}$ of $\Delta$-rules deals with long repetitions of short words. Whenever there is no long repetition of short words, this is witnessed by a marker word $\omega$. The set $T_{\Omega}$ of $\Omega$-rules contains rules of the form $\omega u\omega\to\omega\gamma(u)\omega$ for some normal forms $\gamma(u)$. Lemma 3.6 shows that such rules appear for sufficiently large words and Lemma 3.9 shows the confluence of the constructed system.

3.2 Commutative Groups

In this section we study Parikh-reducing Church-Rosser systems for abelian groups. Let $\varphi:A^{*}\to G$ be a homomorphism to an abelian group $G$. We construct a system for $G$ by sorting the letters $a$ and then reducing them modulo their order. Thus, we actually construct a Church-Rosser representation for the group $\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$. The situation obtained in Theorem 3.2 is shown in the commutative diagram in Figure 3.

Theorem 3.2.

Let $\varphi:A^{*}\to G$ be a homomorphism to a finite commutative group $G$ . Then there exists a Parikh-reducing Church-Rosser system $S$ of finite index which factorizes through $\varphi$ . Further, all groups contained in $A^{*}\!/S$ are isomorphic to some subgroup of $\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ .

Proof.

Let $n$ be the least common multiple of $\mathop{\mathrm{ord}}(\varphi(a))$ for $a\in A$. We proceed by induction on the number of letters $\left|\mathinner{A}\right|$. If $A=\left\{\mathinner{c}\right\}$, then we may set $S=\left\{\mathinner{c^{n}\to 1}\right\}$. This system is Parikh-reducing and locally confluent, and $A^{*}\!/S\simeq\mathbb{Z}/n\mathbb{Z}$. Thus, we may assume that $\left|\mathinner{A}\right|>1$. Let $A=\left\{\mathinner{a_{1},\ldots,a_{s},c}\right\}$ be the alphabet, where $c\in A$ is an arbitrary letter of $A$. We consider the alphabet $B=A\setminus\left\{\mathinner{c}\right\}$. Since $B$ is smaller than $A$, induction yields a Parikh-reducing Church-Rosser system $R\subseteq B^{*}\times B^{*}$ of finite index which factorizes through $\varphi_{|B^{*}}:B^{*}\to G$. The idea is to first reduce the words over $B^{*}$ and then work over a new alphabet $K$. Let $K=\mathrm{IRR}_{R}(B^{*})c$ be the new alphabet of irreducible words over $B^{*}$ followed by the letter $c$, which acts as a separator. We will first construct a Church-Rosser system $T\subseteq K^{*}\times K^{*}$ of finite index which is Parikh-reducing over $A^{*}$. Note that this system $T$ is not Parikh-reducing over $K^{*}$. We will use two different sets of rules: one for long repetitions of short words and one for longer words which are not repetitions of such short words. Let us first define the set of short words as $\Delta=K^{\leq n}\setminus\left\{\mathinner{1}\right\}$, that is, as the set of nonempty words of length at most $n$. Further, let

$$T_{\Delta}=\left\{\delta^{t+n}\to\delta^{t}\mid\delta\in\Delta\right\}$$

be the system of $\Delta$-rules, where $t=3n(s+4)+n$. The choice of the parameter $t$ will be explained later. For now, the fact that $t>2n$ is sufficient to obtain that $T_{\Delta}$ is a Parikh-reducing (over $K^{*}$, and thus also over $A^{*}$) Church-Rosser system by Lemma 3.3.

Lemma 3.3 ([DKRW15]).

Let $\Delta\subseteq K^{\leq n}$ be a set of nonempty words of length at most $n$ which is closed under nontrivial factors, $t>2n$ and $n\geq 1$ . Then

$$T_{\Delta}=\left\{\delta^{t+n}\to\delta^{t}\mid\delta\in\Delta\right\}$$

is a subword-reducing Church-Rosser system. In particular, $T_{\Delta}$ is Parikh-reducing and weight-reducing for every weight.
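To see the size of the $\Delta$-rules in a small case, the following sketch builds $\Delta=K^{\leq n}\setminus\{1\}$ and $T_{\Delta}$ with $t=3n(s+4)+n$ as in the proof of Theorem 3.2. The instance ($n=2$, $s=1$, $K=\{c,ac\}$) and the representation of words over $K$ as tuples of strings are assumptions made for illustration.

```python
from itertools import product

def delta_rules(K, n, s):
    """Return t, the set Delta = K^{<=n} minus {1}, and the rules delta^{t+n} -> delta^t.

    Words over K are tuples of K-letters; s is the number of letters of A other than c.
    """
    t = 3 * n * (s + 4) + n
    Delta = [w for k in range(1, n + 1) for w in product(K, repeat=k)]
    rules = [(d * (t + n), d * t) for d in Delta]
    return t, Delta, rules

K = ("c", "ac")                      # hypothetical K for B = {a}, R = {aa -> 1}
t, Delta, rules = delta_rules(K, n=2, s=1)

print(t, len(Delta), len(rules))     # 32 6 6
# Each right side is a proper prefix of its left side, hence a subword of it.
print(all(l[:len(r)] == r and len(r) < len(l) for l, r in rules))   # True
```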

Next, we will introduce marker words. They basically mark the absence of a long repetition of words in $\Delta$, i.e., a long enough word in $K^{*}$ will contain either a marker word or the left side of a rule in $T_{\Delta}$. The next lemma shows that the length of such markers can be bounded by $2n$.

Lemma 3.4 ([DKRW15]).

Let $\Delta\subseteq K^{\leq n}$ be a set and let $F=\bigcup_{\delta\in\Delta,i\in\mathbb{N}}\mathrm{Factors}(\delta^{i})$ . Then $K^{*}\setminus F$ is an ideal which is generated by a set $J\subseteq K^{\leq 2n}$ of words of length at most $2n$ , that is, $K^{*}\setminus F=K^{*}JK^{*}$ .

Thus, letting $F=\bigcup_{\delta\in\Delta,i\in\mathbb{N}}\mathrm{Factors}(\delta^{i})$ , we obtain $K^{*}\setminus F=K^{*}JK^{*}$ for some $J\subseteq K^{\leq 2n}$ . In order to ensure that we find such a marker which does not start with a $c\in K$ , we increase the length of a marker to $3n$ . Formally, let $\Omega=K^{3n}\setminus(cK^{*}\cup F)$ be the set of markers.
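The marker set can be enumerated for small parameters. The sketch below assumes $\Delta=K^{\leq n}\setminus\{1\}$ as in the proof of Theorem 3.2 and uses the observation that, for this $\Delta$, a word lies in $F$ exactly if it has a period $p$ with $1\leq p\leq n$. The instance ($n=2$, $K=\{c,ac\}$) and the function names are illustrative assumptions.

```python
from itertools import product

def has_short_period(w, n):
    """True iff w has a period p with 1 <= p <= n, i.e. w is a factor of a
    power of some nonempty word of length at most n."""
    return any(all(w[i] == w[i + p] for i in range(len(w) - p))
               for p in range(1, n + 1))

def markers(K, n, c="c"):
    """Words of length 3n over K that do not start with c and are not in F."""
    return [w for w in product(K, repeat=3 * n)
            if w[0] != c and not has_short_period(w, n)]

K = ("c", "ac")                      # hypothetical alphabet K, letters as strings over A
Omega = markers(K, n=2)
print(len(Omega))                    # 30
```

Of the 32 words of length 6 starting with `ac`, only `(ac)^6` and `(ac c)^3` have a period of at most 2, which leaves 30 markers.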

Let $\preceq$ be a total preorder on $\Omega$ with the following properties:

•

$\omega,\eta\in\Omega$ with $\omega\in K^{*}(K\setminus\left\{\mathinner{c}\right\})c^{i},\eta\in K^{*}(K\setminus\left\{\mathinner{c}\right\})c^{j}$ and $i>j$ implies $\omega\preceq\eta$ .

•

$\preceq$ is a total order on $\Omega\setminus Kc^{3n-1}$ .

•

$\omega,\eta\in\Omega\cap Kc^{3n-1}$ implies $\omega\preceq\eta$ .

Thus, the larger the block of $c$'s at the suffix of an $\omega$, the smaller it is with respect to $\preceq$. Additionally, all elements in $\Omega$ with a maximal block of $c$'s at the suffix are equivalent with respect to $\preceq$. In particular, $\omega\preceq\eta$ and $\eta\preceq\omega$ implies either $\eta=\omega$ or there exist $b_{1},b_{2}\in K$ with $\omega=b_{1}c^{3n-1}$ and $\eta=b_{2}c^{3n-1}$. Let $u\in K^{*}\omega K^{*}$ for some $\omega\in\Omega$. We say that $\omega$ is a maximal $\Omega$-factor of $u$, if $u\in K^{*}\eta K^{*}$ with $\eta\in\Omega$ implies $\eta\preceq\omega$. We want to show that every long word contains sufficiently large factors which are surrounded by "locally" maximal $\Omega$-factors. The first step is to show the existence of $\Omega$-factors.
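The text only requires some total preorder with the three listed properties. One possible choice, given here purely as an illustrative assumption, compares markers by a sort key: first the negated length of the trailing block of $c$'s, then (except for markers in $Kc^{3n-1}$) the word itself. Words over $K$ are tuples of strings, and the alphabet in the usage lines is hypothetical.

```python
def trailing_c(w, c="c"):
    """Length of the maximal block of c's at the end of the K-word w."""
    k = 0
    while k < len(w) and w[-1 - k] == c:
        k += 1
    return k

def preorder_key(w, n, c="c"):
    """Sort key for one admissible preorder: omega ⪯ eta iff key(omega) <= key(eta)."""
    i = trailing_c(w, c)
    # Markers in K c^{3n-1} are all equivalent; the remaining ones are totally ordered.
    tie_break = () if i == 3 * n - 1 else tuple(w)
    return (-i, tie_break)

# Usage with n = 1 and the hypothetical alphabet K = {c, ac, bc}.
w1 = ("ac", "c", "c")
w2 = ("bc", "c", "c")
w3 = ("ac", "bc", "c")
print(preorder_key(w1, 1) == preorder_key(w2, 1))   # True: both lie in K c^{3n-1}
print(preorder_key(w1, 1) < preorder_key(w3, 1))    # True: longer block of c's is smaller
```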

Lemma 3.5.

There exists a number $t_{0}$ such that every word $v\in K^{*}$ of length at least $t_{0}$ has a factor $\delta^{t+n}$ for some $\delta\in\Delta$ or a factor $\omega\in\Omega$.

Proof.

Let $t_{0}=(t+n+3)(n+1)$ . If $v\notin\mathrm{IRR}_{T_{\Delta}}(K^{*})$ the statement is true. Thus, we assume that for all $\delta\in\Delta$ there is no factor $\delta^{t+n}$ of $v$ . There is a factorization $v=c^{\ell}v_{1}v_{2}$ such that $v_{1}\in F$ is maximal and $v_{1}$ has no $c$ as a prefix.

Hence we obtain $\ell<t+n$ and $\left|\mathinner{v_{1}}\right|<(t+n)n$, which implies $\left|\mathinner{v_{2}}\right|\geq 3n+3>3n-1$ by definition of $t_{0}$. As $v_{1}\in F$, there is some $\delta\in\Delta$ which does not have $c$ as a prefix such that $v_{1}$ is a prefix of $\delta^{+}$. Consider the first factor $u$ of length $2n$ of $v_{1}v_{2}$ which is not in $F$. Since $v_{1}$ is a prefix of $\delta^{+}$, one must take at most $n-1$ additional letters to the left of $u$ in order to obtain a factor $u^{\prime}$ of $v_{1}v_{2}$ which is not in $F$, has length at most $3n$ and does not start with a $c$. Filling up $u^{\prime}$ with letters from the right, we obtain a factor $u^{\prime\prime}$ of $v_{1}v_{2}$ which is not in $F$, has length $3n$ and does not start with a $c$, that is, $u^{\prime\prime}\in\Omega$. ∎

Lemma 3.6.

There exists a number $t_{\Omega}$ such that every word $v\in K^{*}$ of length at least $t_{\Omega}$ contains either

•

a factor $\delta^{t+n}$ for $\delta\in\Delta$ or

•

a factor $\omega u\omega$ with $\omega\in\Omega$ , $t<\left|\mathinner{\omega u\omega}\right|\leq t_{\Omega}$ and for every $\eta\in\Omega$ with $\omega u\omega\in A^{*}\eta A^{*}$ we have $\eta\preceq\omega$ .

Proof.

Let $\Omega_{v}=\left\{\omega\in\Omega\mathrel{\left|\vphantom{\omega\in\Omega}\vphantom{v\in A^{*}\omega A^{*}}\right.}v\in A^{*}\omega A^{*}\right\}$ be the set of $\Omega$-factors of $v$ and let $t_{k}$ be defined by the recursion $t_{k}=2t_{k-1}+t$, where $t_{0}$ is the number from Lemma 3.5. A quick calculation verifies the explicit formula $t_{k}=2^{k}(t_{0}+t)-t$. We prove the following statement by induction on $k$: For every word $v$ of length at least $t_{k}$ which has at most $k$ different $\Omega$-factors, i.e., $k\geq\left|\mathinner{\Omega_{v}}\right|$, and which does not contain a factor $\delta^{t+n}$ for $\delta\in\Delta$, there exists a factor $\omega u\omega$ of $v$ such that

•

$\omega\in\Omega$ ,

•

$t<\left|\mathinner{\omega u\omega}\right|\leq t_{k}$ and

•

$\omega$ is a maximal $\Omega$ -factor of $\omega u\omega$ .

The case $k=0$ is trivial since by hypothesis every word $v$ with length at least $t_{0}$ and $\left|\mathinner{\Omega_{v}}\right|=0$ must contain a factor $\delta^{t+n}$ for $\delta\in\Delta$ . Consider the case $k>0$ . Since we require that the length of the factor $\omega u\omega$ is smaller or equal to $t_{k}$ , we consider the prefix of $v$ of length $t_{k}$ . In particular, we can assume that every proper factor of $v$ has length smaller than $t_{k}$ .

Consider the factorization $v=pfq$ with $f\in(\omega A^{*}\cap A^{*}\omega)$ such that $\omega$ is a maximal $\Omega$ -factor of $v$ and $f$ is maximal with regard to length. If $\left|\mathinner{f}\right|\leq t$ , we obtain

[TABLE]

which implies $\left|\mathinner{pq}\right|\geq 2t_{k-1}$ . Since $p$ and $q$ contain no factor $\omega$ , we can apply induction to either $p$ or $q$ . If $\left|\mathinner{f}\right|>t$ , then $f$ has the form $f=\omega u\omega$ for a word $u$ because of $t>2\max_{\omega\in\Omega}\left|\mathinner{\omega}\right|$ and $f\in(\omega A^{*}\cap A^{*}\omega)$ . The factor $f$ has the required properties since $\left|\mathinner{f}\right|\leq\left|\mathinner{v}\right|\leq t_{k}$ . This concludes the induction. We infer the statement of the lemma by setting $t_{\Omega}=t_{\left|\mathinner{\Omega}\right|}$ . ∎
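The length estimate used in the case $\left|\mathinner{f}\right|\leq t$ of the preceding proof can be spelled out as follows. This is a sketch under the assumption, arranged at the start of the induction step, that $v$ has been replaced by its prefix of length exactly $t_{k}$ :

```latex
\begin{align*}
|pq| &= |v| - |f| \\
     &\geq t_{k} - t \\
     &= (2t_{k-1} + t) - t \\
     &= 2t_{k-1}.
\end{align*}
```

Hence at least one of $p$ and $q$ has length at least $t_{k-1}$ , and since neither contains the factor $\omega$ , that word has at most $k-1$ different $\Omega$ -factors.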

In particular, Lemma 3.6 shows the existence of a number $t_{\Omega}$ such that every $v\in\mathrm{IRR}_{T_{\Delta}}(K^{*})$ with $\left|\mathinner{v}\right|\geq t_{\Omega}$ contains a factor $\omega u\omega^{\prime}$ with $\omega,\omega^{\prime}$ being $\Omega$ -maximal for this factor and $t<\left|\mathinner{\omega u\omega^{\prime}}\right|\leq t_{\Omega}$ . The idea is to reduce $u$ to a normal form $\gamma(u)$ . This is the part where commutativity of $G$ is needed. Let $a\in A$ be a letter and $\left|\mathinner{u}\right|_{a}$ be the number of occurrences of $a$ in $u$ . Define $\gamma_{a}(u)=a^{\left|\mathinner{u}\right|_{a}\mod\mathop{\mathrm{ord}}(\varphi(a))}c^{3n}$ and

[TABLE]

The mapping $\gamma$ is a normal form in the group $\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ , i.e., let $\psi:A^{*}\to\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ be the homomorphism counting the different letters $a$ modulo $\mathop{\mathrm{ord}}(\varphi(a))$ , then $\psi(u)=\psi(v)$ if and only if $\gamma(u)=\gamma(v)$ . By choice of $\gamma_{a}(u)$ we have $\gamma(u)\in K^{*}$ . Since $\left|\mathinner{\gamma_{a}(u)}\right|=3n$ for $a\in B$ and $3n\leq\left|\mathinner{\gamma_{c}(u)}\right|<4n$ , we obtain

[TABLE]
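The letter-counting homomorphism $\psi$ and the blocks $\gamma_{a}(u)$ can be illustrated with a short Python sketch. The orders in `ORD`, the value `N` and all function names are hypothetical sample choices; only the individual blocks $\gamma_{a}(u)$ are modelled here, not the full word $\gamma(u)$ .

```python
from collections import Counter

# Hypothetical sample data: letters a, b, c with assumed orders ord(phi(x)).
ORD = {"a": 2, "b": 3, "c": 6}
N = 6  # assumed value of n, here the least common multiple of the orders


def psi(u):
    """Count each letter of u modulo its order."""
    counts = Counter(u)
    return tuple(counts[x] % ORD[x] for x in sorted(ORD))


def gamma_block(u, x):
    """The block gamma_x(u) = x^(|u|_x mod ord) c^(3n), written as a plain string."""
    return x * (u.count(x) % ORD[x]) + "c" * (3 * N)


def blocks(u):
    """All blocks gamma_x(u), one per letter."""
    return tuple(gamma_block(u, x) for x in sorted(ORD))
```

In this toy setting the words `"abcab"` and `"bcb"` have the same image under `psi` and therefore the same blocks, whereas `"ab"` and `"bcb"` differ in both.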

In particular, $\varphi(u)=\varphi(\gamma(u))$ and $\gamma(u\gamma(u^{\prime}))=\gamma(uu^{\prime})=\gamma(u^{\prime}u)=\gamma(\gamma(u^{\prime})u)$ . Additionally, if $u\in K^{*}$ with $\left|\mathinner{u}\right|\geq 3n(s+2)+n=t-6n$ , then $u\mapsto\gamma(u)$ is Parikh-reducing over $A^{*}$ since at least the number of $c$ decreases. Note that the inequality $t-n\leq\left|\mathinner{\omega\gamma(u)\omega^{\prime}}\right|<t$ is actually the reason for the definition of $t$ . Let

[TABLE]

be the set of $\Omega$ -rules. By definition of $\gamma$ the set of $\Omega$ -rules is Parikh-reducing over $A^{*}$ . Note that for a $\Omega$ -rule, either $\omega$ and $\omega^{\prime}$ are minimal elements in $\Omega$ or $\omega=\omega^{\prime}$ . By Lemma 3.6 the system $T=T_{\Delta}\cup T_{\Omega}$ has only finitely many irreducible elements. It remains to prove that $T$ is Church-Rosser. By Lemma 3.3 the set $T_{\Delta}$ of $\Delta$ -rules is (locally) confluent. Next, we will study properties of $\Omega$ -rules which are crucial for showing that $T$ is Church-Rosser. First, we show that $T$ -rules preserve $\Omega$ -maximal elements.

Lemma 3.7.

Let $u\underset{T}{\Longrightarrow}v$ and let $\omega$ be a maximal $\Omega$ -factor of $u$ . Then $\eta\preceq\omega$ for every $\Omega$ -factor $\eta$ of $v$ .

Proof.

As $T=T_{\Delta}\cup T_{\Omega}$ there are two cases for the rule set of $u\underset{T}{\Longrightarrow}v$ .

In the case that $u\underset{T_{\Delta}}{\Longrightarrow}v$ there must exist a $\delta\in\Delta$ and a factorization $u=u_{1}\delta^{t+n}u_{2}$ such that $v=u_{1}\delta^{t}u_{2}$ . By construction, we have $t>3n=\left|\mathinner{\omega}\right|$ . Thus, every element of $\Omega$ is a factor of $u$ if and only if it is also a factor of $v$ . Since $\omega$ is $\Omega$ -maximal for $u$ , it is also $\Omega$ -maximal for $v$ .

If $u\underset{T_{\Omega}}{\Longrightarrow}v$ , there is a factorization $u=u_{1}\omega_{1}\hat{u}\omega_{2}u_{2}$ such that $v=u_{1}\omega_{1}\gamma(\hat{u})\omega_{2}u_{2}$ and $\omega_{1},\omega_{2}$ are maximal $\Omega$ -factors of $\omega_{1}\hat{u}\omega_{2}$ . Since every marker in $\Omega$ has fixed length $3n$ , it remains to show that $\omega_{1}\gamma(\hat{u})\omega_{2}$ has no $\Omega$ -factors larger than $\omega_{1}$ (and by $\omega_{1}\preceq\omega$ , also no $\Omega$ -factors larger than $\omega$ ). Note that $\gamma(\hat{u})$ has $c^{3n}$ as prefix and suffix. Every $\Omega$ -factor of $\omega_{1}\gamma(\hat{u})$ which is not an $\Omega$ -factor of $\gamma(\hat{u})$ has the form $\zeta c^{i}$ for some $i\geq 0$ and $\zeta$ is a suffix of $\omega_{1}$ . Since the block of $c$ ’s at the suffix of $\zeta c^{i}$ may only increase, we obtain $\zeta c^{i}\preceq\omega_{1}$ by definition of $\preceq$ . Since every element of $\Omega$ has length $3n$ and does not have $c$ as a prefix, there is no $\Omega$ -factor in $\gamma(\hat{u})\omega_{2}$ which is neither in $\gamma(\hat{u})$ nor equals $\omega_{2}$ . By construction, every $\Omega$ -factor of $\gamma(\hat{u})$ is of the form $\gamma_{a}(\hat{u})$ for some $a\in B$ . However, $\gamma_{a}(\hat{u})$ is a minimal element of $\Omega$ by construction. In particular, $\eta\preceq\omega_{1}\preceq\omega$ for every $\Omega$ -factor $\eta$ of $\omega_{1}\gamma(\hat{u})\omega_{2}$ . ∎

Next, as an intermediate step, we show local confluence in the case of a left side $\omega u\omega^{\prime}$ of a rule in $T_{\Omega}$ . In particular, we show that every word of this form can be reduced to a fixed normal form.

Lemma 3.8.

Let $\omega u\omega^{\prime}$ be a word such that $\omega$ and $\omega^{\prime}$ are maximal $\Omega$ -factors of $\omega u\omega^{\prime}$ and $\left|\mathinner{\omega u\omega^{\prime}}\right|\geq t$ . Then $\omega u\omega^{\prime}\underset{T}{\Longrightarrow}v$ implies $v\overset{*}{\underset{T}{\Longrightarrow}}\omega\gamma(u)\omega^{\prime}$ .

Proof.

The statement is clear if $v=\omega\gamma(u)\omega^{\prime}$ which is why we may assume $v\neq\omega\gamma(u)\omega^{\prime}$ . We show the lemma inductively on the length of $\omega u\omega^{\prime}$ . In order to apply the induction step we show that $v=\omega v^{\prime}\omega^{\prime}$ and $\left|\mathinner{v}\right|\geq t$ . The precondition that $\omega$ and $\omega^{\prime}$ are maximal $\Omega$ -factors of $v$ is satisfied by Lemma 3.7.

In the case of $\omega u\omega^{\prime}\underset{T_{\Omega}}{\Longrightarrow}v$ , some rule $\mu u^{\prime}\mu^{\prime}\to\mu\gamma(u^{\prime})\mu^{\prime}\in T_{\Omega}$ was applied. As such rules preserve the prefixes and suffixes of length $3n$ , the word $v$ must have the correct form. In the case of $\omega u\omega^{\prime}\underset{T_{\Delta}}{\Longrightarrow}v$ , some rule $\delta^{t+n}\to\delta^{t}$ was applied. Since $t>6n$ and elements of $\Omega$ all have length $3n$ , the $\Omega$ -factors $\omega$ and $\omega^{\prime}$ are preserved by the application of the $\Delta$ -rule $\delta^{t+n}\to\delta^{t}$ . In both cases we conclude that $v=\omega v^{\prime}\omega^{\prime}$ for some word $v^{\prime}$ .

It remains to show that $\left|\mathinner{v}\right|\geq t$ . Since $\left|\mathinner{\delta^{t}}\right|\geq t$ , the case of an application of a rule in $T_{\Delta}$ is trivial. Let $v$ stem from the application of a rule $\mu u^{\prime}\mu^{\prime}\to\mu\gamma(u^{\prime})\mu^{\prime}\in T_{\Omega}$ . If either $\mu u^{\prime}$ or $u^{\prime}\mu^{\prime}$ is a factor of $u$ , we have that either $\mu\gamma(u^{\prime})$ or $\gamma(u^{\prime})\mu^{\prime}$ is a factor of $v^{\prime}$ . Thus, using $\left|\mathinner{\gamma(u^{\prime})}\right|>t-7n$ and $\left|\mathinner{\omega}\right|=3n$ for every element $\omega\in\Omega$ , we obtain

[TABLE]

It remains to prove $\left|\mathinner{v}\right|\geq t$ for the situation which is depicted below.

[Figure: overlap diagram with segments labelled $\omega$ , $u$ , $\omega^{\prime}$ and $\mu$ , $u^{\prime}$ , $\mu^{\prime}$ .]

If $\omega\neq\omega^{\prime}$ , then there exist $b_{1},b_{2}\in K\setminus\left\{\mathinner{c}\right\}$ such that $\omega=b_{1}c^{3n-1}$ and $\omega^{\prime}=b_{2}c^{3n-1}$ . However, as no element of $\Omega$ starts with the letter $c$ , we can conclude $\omega=\mu$ and thus by $\mu^{\prime}\preceq\mu$ we obtain $\omega^{\prime}=\mu^{\prime}$ by the same argument. In this case we have $\omega u\omega^{\prime}=\mu u^{\prime}\mu^{\prime}$ and hence $v=\omega\gamma(u)\omega^{\prime}$ . The case that $\mu\neq\mu^{\prime}$ is similar: $\omega^{\prime}$ has no $c$ as prefix and thus $\mu^{\prime}=\omega^{\prime}$ . Again, $\omega=\mu$ and $v=\omega\gamma(u)\omega^{\prime}$ holds. Hence, we may assume $\omega=\omega^{\prime}$ and $\mu=\mu^{\prime}$ .

Combining both overlaps, we obtain the following picture.

[Figure: overlap diagram with segments labelled $x$ , $\omega$ , $y$ , $\mu$ , $y^{\prime}$ , $x^{\prime}$ , $\mu$ .]

In the notation of the picture above we have $u=yu^{\prime}x$ . Thus, $v=\omega y\gamma(u^{\prime})x\omega$ and by $\left|\mathinner{\gamma(u^{\prime})}\right|>t-7n$ and $\left|\mathinner{\omega}\right|=3n$ it suffices to show $\left|\mathinner{x^{\prime}}\right|=\left|\mathinner{yx}\right|\geq n$ . By $\mu y^{\prime}=x^{\prime}\mu$ we have that $\mu$ is a factor of $x^{\prime+}$ . We conclude $x^{\prime}\not\in\Delta$ which implies $\left|\mathinner{x^{\prime}}\right|>n$ . In summary, $v=\omega v^{\prime}\omega^{\prime}$ and $\left|\mathinner{v}\right|\geq t$ holds. If $\left|\mathinner{v}\right|\leq t_{\Omega}$ , then we can directly apply the $T_{\Omega}$ -rule with left side $v$ . Else, $v$ must be reducible by Lemma 3.6 and we can apply induction. ∎

Combining the previous lemmas we show that $T$ is locally confluent.

Lemma 3.9.

$T$ is locally confluent.

Proof.

Let $\ell\to r,\ell^{\prime}\to r^{\prime}\in T$ be two rules. We have to show that every overlap of the left sides of those rules resolves. The system $T_{\Delta}$ is locally confluent by Lemma 3.3. Hence, we may assume that $\ell\to r\in T_{\Omega}$ . Let $\omega u\omega^{\prime}=\ell$ and consequently $r=\omega\gamma(u)\omega^{\prime}$ . Consider first the case that $\delta^{t+n}=\ell^{\prime}\to r^{\prime}\in T_{\Delta}$ . If $\ell^{\prime}$ is a factor of $\ell$ , that is, if $\ell=x\ell^{\prime}y$ , then $\ell\overset{}{\underset{T}{\Longrightarrow}}xr^{\prime}y\overset{*}{\underset{T}{\Longrightarrow}}r$ by Lemma 3.8. By definition of $\Omega$ , the left side $\ell$ which contains an element of $\Omega$ cannot be a factor of $\delta^{t+n}$ . Hence, the system resolves in the case of factor critical pairs. Consider thus the case of an overlap critical pair $x\ell=\ell^{\prime}y$ (the case $x\ell^{\prime}=\ell y$ is symmetric). Since $\omega$ is not a factor of $\delta^{+}$ and $t\geq 3n$ by definition, we have the following situation:

[Figure: overlap diagram with segments labelled $\delta^{n}$ , $\delta^{t}$ and $\omega$ , $u\omega^{\prime}$ .]

Let $\delta^{t}=z_{1}z_{2}$ and $\omega=z_{2}z_{3}$ be the overlap, then

[TABLE]

Consider the case that $\ell^{\prime}\to r^{\prime}\in T_{\Omega}$ and let $\ell^{\prime}=\mu v\mu^{\prime}$ . Again, if $\ell^{\prime}=x\ell y$ , then $\ell^{\prime}\overset{}{\underset{T}{\Longrightarrow}}xry\overset{*}{\underset{T}{\Longrightarrow}}r^{\prime}$ by Lemma 3.8. Hence, by symmetry, it suffices to consider the case $x\ell=\ell^{\prime}y$ . If $\ell$ and $\ell^{\prime}$ overlap at most $3n$ positions,

[Figure: overlap diagram with segments labelled $\mu u^{\prime}$ , $\mu^{\prime}$ and $\omega$ , $u\omega^{\prime}$ .]

then the rules can be applied independently; again let $\mu^{\prime}=z_{1}z_{2}$ and $\omega=z_{2}z_{3}$ be the overlap, then

[TABLE]

and the system resolves in this case.

Hence, we assume that $\ell$ and $\ell^{\prime}$ overlap more than $3n$ positions. In this case $\mu^{\prime}$ is a factor of $\ell$ and $\omega$ is a factor of $\ell^{\prime}$ . This implies that $\mu$ and $\omega^{\prime}$ are maximal $\Omega$ -factors of $x\ell=\ell^{\prime}y=\mu u^{\prime\prime}\omega^{\prime}$ . We conclude $x\ell\overset{}{\underset{T}{\Longrightarrow}}xr\overset{*}{\underset{T}{\Longrightarrow}}\mu\gamma(u^{\prime\prime})\omega^{\prime}$ and $\ell^{\prime}y\overset{}{\underset{T}{\Longrightarrow}}r^{\prime}y\overset{*}{\underset{T}{\Longrightarrow}}\mu\gamma(u^{\prime\prime})\omega^{\prime}$ by Lemma 3.8. ∎

By construction, the system $T$ is $\varphi$ -invariant and thus the system

[TABLE]

is $\varphi$ -invariant. By Lemma 3.6 the system $T$ is of finite index over $K^{*}$ . We can apply Lemma 3.1 and obtain a $\varphi$ -invariant Parikh-reducing Church-Rosser system $S$ of finite index over $A^{*}$ . This concludes the proof of the first part of Theorem 3.2. It remains to study the groups in $A^{*}\!/S$ . As an intermediate step, we study the groups in $K^{*}\!/T$ .

Lemma 3.10.

Let $H\subseteq K^{*}\!/T$ be a subsemigroup which is a group and identify $H$ with the corresponding elements in $\mathrm{IRR}_{T}(K^{*})$ . Then either there exists some $\delta\in\Delta$ such that $H\subseteq\left\{\mathinner{\delta^{t},\ldots,\delta^{t+n-1}}\right\}$ is a cyclic group whose order divides $n$ or there is an injective homomorphism $\eta:H\to\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ .

Proof.

Without loss of generality, we may assume that $H$ is non-trivial. Let $e^{2}=e\in H$ be the identity element of $H$ . Note that by definition of the rules $T$ and the set $\Omega$ , the irreducible word of every word $w\in K^{*}\Omega K^{*}$ also contains an $\Omega$ -factor. Thus, by $ex=x$ and $x^{\left|\mathinner{H}\right|}=e$ for all $x\in H$ either all elements in $H\subseteq K^{*}\!/T$ contain some factor in $\Omega$ or none of the elements contains an $\Omega$ -factor. All words $x\in H$ must have length at least $t-n>2n$ by definition of the rules $T$ .

Let us first consider the case that none of the elements contain an $\Omega$ -factor. We show that there exists some $\delta\in\Delta$ such that for all $x\in H$ there exists $i\in\mathbb{N}$ such that $x=\delta^{i}$ . Let $u\delta^{t+n}v\underset{T_{\Delta}}{\Longrightarrow}u\delta^{t}v$ be an application of a rule in $T_{\Delta}$ and let $w\in J$ be a minimal factor of $u\delta^{t+n}v$ which is not in $F$ . By Lemma 3.4 $\left|\mathinner{w}\right|\leq 2n$ and since $t>2n$ , the factor $w$ is also a factor of $u\delta^{t}v$ . Thus, the number of factors in $J$ does not decrease by an application of a rule in $T_{\Delta}$ . Consider any $x\in H$ . Since the number of factors in $J$ does not decrease by some application of a rule in $T_{\Delta}$ , $x^{\left|\mathinner{H}\right|+1}=x$ and no rule in $T_{\Omega}$ is applicable, we deduce that the number of factors in $J$ of $x^{\left|\mathinner{H}\right|+1}$ and $x$ is the same. In particular, this number is zero and we obtain $x\in F$ for all $x\in H$ . Next, we show that $x=\delta^{i}$ for some $\delta\in\Delta$ . Since $x\in F$ and $\Delta$ is closed under conjugation, there exists a primitive word $\delta\in\Delta$ and $i\in\mathbb{N}$ such that $x=\delta^{i}\delta^{\prime}$ for some prefix $\delta^{\prime}$ of $\delta$ . In particular, $\left|\mathinner{\delta}\right|$ is a period of $x$ . Note that $i\geq 2$ since $\left|\mathinner{x}\right|>2n$ . Consider the word $x^{2}$ . By the above, we obtain $x^{2}\in F$ , that is, again there exists a primitive word $\hat{\delta}\in\Delta$ , a prefix $\hat{\delta}^{\prime}$ of $\hat{\delta}$ and a number $j\geq 2$ such that $x^{2}=\hat{\delta}^{j}\hat{\delta}^{\prime}$ . Therefore, $\left|\mathinner{\hat{\delta}}\right|$ is a period of $x^{2}$ and, hence, also of $x$ . Since $\left|\mathinner{x}\right|>2n$ , we may use Theorem 2.1 and conclude that $\gcd(\left|\mathinner{\delta}\right|,|\hat{\delta}|)$ is a period of $x$ . 
Since $\delta$ is primitive, this implies $\gcd(\left|\mathinner{\delta}\right|,|\hat{\delta}|)=\left|\mathinner{\delta}\right|$ . Since $\hat{\delta}$ is a prefix of $x$ , this yields that $\hat{\delta}$ is a power of $\delta$ which implies $\delta=\hat{\delta}$ by primitivity of $\hat{\delta}$ . In particular, $\left|\mathinner{\delta}\right|$ is a period of $x^{2}$ and $\delta^{\prime}\delta$ is a prefix of $\delta^{2}$ . Since $\delta$ is primitive this implies that $\delta^{\prime}$ is not a proper prefix of $\delta$ by Lemma 2.2 and we conclude that for every $x\in H$ there exists $\delta\in\Delta$ and $i\in\mathbb{N}$ such that $x=\delta^{i}$ . Thus, consider $\delta_{1}^{i},\delta_{2}^{j}\in H$ with $\delta_{1}\neq\delta_{2}$ primitive words in $\Delta$ . Again, $\left|\mathinner{\delta_{1}}\right|$ is a period of $\delta_{1}^{i}$ and there must exist a period $p\leq n$ of $\delta_{1}^{i}\delta_{2}^{j}\in F$ . By Theorem 2.1 $\gcd(\left|\mathinner{\delta_{1}}\right|,p)$ is a period of $\delta_{1}^{2}$ . By primitivity of $\delta_{1}$ , this yields that $\left|\mathinner{\delta_{1}}\right|$ is a divisor of $p$ . In particular, since $p$ is a period of $\delta_{1}^{i}\delta_{2}^{j}$ , this yields $\delta_{1}^{i}\delta_{2}^{j}=\delta_{1}^{i}\delta_{1}^{k}\delta_{1}^{\prime}$ for some $k\geq 2$ and $\delta_{1}^{\prime}$ a prefix of $\delta_{1}$ . Using Theorem 2.1 again, we see that $\gcd(\left|\mathinner{\delta_{1}}\right|,\left|\mathinner{\delta_{2}}\right|)$ is a period of $\delta_{2}^{j}$ , that is, $\left|\mathinner{\delta_{2}}\right|$ is a divisor of $\left|\mathinner{\delta_{1}}\right|$ by primitivity of $\delta_{2}$ . By symmetry, this yields $\left|\mathinner{\delta_{1}}\right|=\left|\mathinner{\delta_{2}}\right|$ and thus $\delta_{1}=\delta_{2}$ .

Fix some primitive word $\delta\in\Delta$ such that $H\subseteq\delta^{+}$ . Since $ex=x$ for all $x\in H$ and the right side of rules in $T_{\Delta}$ have length at least $t$ and since $\delta^{t+n}$ is reducible, we conclude $H\subseteq\left\{\mathinner{\delta^{t},\ldots,\delta^{t+n-1}}\right\}$ and thus $H$ is a subgroup of the cyclic group $\left\{\mathinner{\delta^{t},\ldots,\delta^{t+n-1}}\right\}$ of order $n$ which finishes this case.
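The cyclic structure of $\left\{\mathinner{\delta^{t},\ldots,\delta^{t+n-1}}\right\}$ can be made concrete by computing with exponents only. The Python sketch below is an illustration under assumed sample values of $t$ and $n$ ; the function names are hypothetical, and a word $\delta^{e}$ is represented by its exponent $e$ .

```python
def reduce_exponent(e, t, n):
    """Apply the rule delta^(t+n) -> delta^t to delta^e until it is irreducible."""
    while e >= t + n:
        e -= n
    return e


def multiply(i, j, t, n):
    """Product of delta^i and delta^j in the quotient, returned as an exponent."""
    return reduce_exponent(i + j, t, n)


def identity(t, n):
    """The unique exponent in the range t, ..., t+n-1 that is divisible by n."""
    return next(e for e in range(t, t + n) if e % n == 0)
```

For instance, with $t=7$ and $n=3$ the elements are the exponents 7, 8 and 9, the identity is 9, and the map sending an exponent to its residue modulo $n$ identifies the set with $\mathbb{Z}/3\mathbb{Z}$ .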

The second case is that all words in $H$ contain an $\Omega$ -factor. Consider the maximal $\Omega$ -factors of $e$ and factorize $e=e_{1}\omega e_{2}\omega^{\prime}e_{3}$ with $\omega,\omega^{\prime}\in\Omega$ maximal for $e$ such that $e_{1}\omega$ and $\omega^{\prime}e_{3}$ contain no other maximal $\Omega$ -factors of $e$ . Since $e^{2}=e$ , we conclude that $e_{2}$ is some normal form. By $ex=x=xe$ for all $x\in H$ and Lemma 3.8, there must exist a factorization $x=e_{1}\omega\hat{x}\omega^{\prime}e_{3}$ such that $\hat{x}=\gamma(\hat{x})$ is a normal form. In particular, $\widehat{xy}=\gamma(\hat{x}\omega^{\prime}e_{3}e_{1}\omega\hat{y})$ by Lemma 3.8. Consider the homomorphism $\psi:A^{*}\to\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ which counts the number of $a\in A$ modulo $\mathop{\mathrm{ord}}(\varphi(a))$ and the function $\eta:H\to\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ given by $\eta(x)=\psi(\hat{x})\cdot\psi(\omega^{\prime}e_{3}e_{1}\omega)$ . Note that $\psi(\widehat{xy})=\psi(\hat{x})\psi(\hat{y})\psi(\omega^{\prime}e_{3}e_{1}\omega)$ implies that $\eta$ is a homomorphism. We have $\eta(x)=\eta(y)$ if and only if $\psi(\hat{x})=\psi(\hat{y})$ . By definition of the normal forms $\gamma(\cdot)$ , we have $\psi(\hat{x})=\psi(\hat{y})$ if and only if $\hat{x}=\hat{y}$ and therefore $\eta$ is injective. ∎

By Lemma 3.1, we obtain that the subgroups in $A^{*}\!/S$ are isomorphic to subgroups of $B^{*}\!/R$ and $K^{*}\!/T$ . By induction, all groups in $B^{*}\!/R$ are isomorphic to some subgroup of $\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ . All groups in $K^{*}\!/T$ are either cyclic of order dividing $n$ or isomorphic to some subgroup of $\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ by Lemma 3.10. However, since $n$ is defined as the least common multiple of $\mathop{\mathrm{ord}}(\varphi(a))$ , the cyclic group of order $n$ , and hence every cyclic group whose order divides $n$ , is isomorphic to a subgroup of $\prod_{a\in A}\mathbb{Z}/\mathop{\mathrm{ord}}(\varphi(a))\mathbb{Z}$ . This proves the statement. ∎
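The last step of the argument relies on the fact that a product of cyclic groups contains an element whose order is the least common multiple of the factor orders. The following Python sketch checks this for the element $(1,\ldots,1)$ ; the function names and the sample orders are assumptions made for illustration.

```python
from functools import reduce
from math import gcd


def lcm(values):
    """Least common multiple of a list of positive integers."""
    return reduce(lambda x, y: x * y // gcd(x, y), values, 1)


def order_of_diagonal(orders):
    """Order of the element (1, ..., 1) in the product of the groups Z/mZ, m in orders."""
    current = tuple(1 % m for m in orders)
    steps = 1
    while any(current):
        # Add (1, ..., 1) once more, componentwise modulo the respective order.
        current = tuple((c + 1) % m for c, m in zip(current, orders))
        steps += 1
    return steps
```

The subgroup generated by $(1,\ldots,1)$ is thus cyclic of order $n$ , and every cyclic group whose order divides $n$ embeds into it.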

3.3 Group languages over an alphabet of size two

The same technique as in Subsection 3.2 can be used to obtain Parikh-reducing Church-Rosser systems which factorize through homomorphisms $\varphi:\left\{\mathinner{a,b}\right\}^{*}\to G$ for an arbitrary group $G$ . We will only sketch the proof, as it is essentially the proof of Theorem 3.2.

Theorem 3.11.

Let $A=\left\{\mathinner{a,b}\right\}$ be an alphabet of size two and let $\varphi:A^{*}\to G$ be a homomorphism into a finite group $G$ . Then there exists a Parikh-reducing Church-Rosser system $S$ of finite index which factorizes through $\varphi$ . All groups in $A^{*}/S$ are subgroups of $G$ or of $\mathbb{Z}/n\mathbb{Z}$ where $n$ is the exponent of $G$ .

Sketch of proof.

Let $n$ be the exponent of $G$ and let $R=\left\{\mathinner{a^{n}\to 1}\right\}\subseteq\left\{\mathinner{a}\right\}^{*}\times\left\{\mathinner{a}\right\}^{*}$ be the set of rules over the alphabet $\left\{\mathinner{a}\right\}$ . Set $K=\mathrm{IRR}_{R}(a^{*})b=\left\{a^{i}b\mathrel{\left|\vphantom{a^{i}b}\vphantom{0\leq i<n}\right.}0\leq i<n\right\}$ . In the remainder of the sketch, we have to construct a system over $K^{*}$ . As the set of short words we choose $\Delta=K^{\leq n^{2}}\setminus\left\{\mathinner{1}\right\}$ . The corresponding set of rules is $T_{\Delta}=\left\{\delta^{t+n}\to\delta^{t}\mathrel{\left|\vphantom{\delta^{t+n}\to\delta^{t}}\vphantom{\delta\in\Delta}\right.}\delta\in\Delta\right\}$ for $t=n^{2}(3n+7)$ . Note that since $t>2n^{2}$ the system $T_{\Delta}$ is confluent by Lemma 3.3.
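To make the alphabet $K=\left\{a^{i}b\mid 0\leq i<n\right\}$ concrete, the following Python sketch factorizes a word over $\left\{\mathinner{a,b}\right\}$ that ends in $b$ into letters of $K$ , reducing each run of $a$ with the rule $a^{n}\to 1$ . The function names and the sample value $n=3$ used below are hypothetical.

```python
def alphabet_K(n):
    """The letters a^i b of K for 0 <= i < n, as plain strings."""
    return ["a" * i + "b" for i in range(n)]


def to_K_letters(word, n):
    """Split a word over {a, b} ending in b into blocks a^i b with 0 <= i < n.

    Runs of a are reduced with the rule a^n -> 1, i.e. their length is taken modulo n.
    """
    if not word.endswith("b"):
        raise ValueError("only words ending in b factorize over K")
    letters = []
    run = 0
    for ch in word:
        if ch == "a":
            run += 1
        elif ch == "b":
            letters.append("a" * (run % n) + "b")
            run = 0
        else:
            raise ValueError("unexpected letter")
    return letters
```

For $n=3$ the word `aaaabab` factorizes into the two $K$ -letters `ab` and `ab`, so its length over $K$ is 2.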

Let $F=\bigcup_{\delta\in\Delta,i\in\mathbb{N}}\mathrm{Factors}(\delta^{i})$ and set $\Omega=K^{3n^{2}}\setminus(bK^{*}\cup F)$ . Choose a preorder $\preceq$ on $\Omega$ such that

•

$\omega,\eta\in\Omega$ with $\omega\in K^{*}(K\setminus\left\{\mathinner{b}\right\})b^{i},\eta\in K^{*}(K\setminus\left\{\mathinner{b}\right\})b^{j}$ and $i>j$ implies $\omega\preceq\eta$ .

•

$\preceq$ is a total order on $\Omega\setminus Kb^{3n^{2}-1}$ .

•

$\omega,\eta\in\Omega\cap Kb^{3n^{2}-1}$ implies $\omega\preceq\eta$ .

In order to complete the construction, it remains to choose the normal forms $v_{g}$ . Note that every representation of $g\in G$ needs fewer than $n$ occurrences of $a$ by the pigeonhole principle. Thus, for every $g\in G$ there exists a word $v_{g}=b^{3n^{2}}v_{1}b^{3n^{2}}\cdots b^{3n^{2}}v_{n-1}b^{3n^{2}}\in K^{*}$ with $\varphi(v_{g})=g$ and $v_{i}\in\left\{ab^{k},b^{k}\mathrel{\left|\vphantom{ab^{k},b^{k}}\vphantom{1\leq k\leq n}\right.}1\leq k\leq n\right\}$ . For every $g\in G$ we choose such a word $v_{g}$ such that the number of $a$ ’s is minimal. Note that by construction $\left|\mathinner{\left|\mathinner{v_{g}}\right|-\left|\mathinner{v_{h}}\right|}\right|<n^{2}$ as a word over $K$ . This is the reason for the choice of $\Delta$ . Furthermore, $t-7n^{2}<\left|\mathinner{v_{g}}\right|<t-6n^{2}$ , which explains the choice of the parameter $t$ . The choice of $v_{g}$ also yields that there are no $\Omega$ -factors in $v_{g}$ apart from $ab^{3n^{2}}$ , which is $\Omega$ -minimal.

Adapting the proof of Lemma 3.5, we prove the existence of a number $t_{0}$ such that every word $v\in K^{*}$ of length at least $t_{0}$ has a factor $\delta^{t+n}$ for a $\delta\in\Delta$ or a factor $\omega\in\Omega$ . Lemma 3.6 yields the existence of a number $t_{\Omega}$ such that every $v\in\mathrm{IRR}_{T_{\Delta}}(K^{*})$ with $\left|\mathinner{v}\right|\geq t_{\Omega}$ contains a factor $\omega\,u\,\omega^{\prime}$ with $\omega,\omega^{\prime}$ being $\Omega$ -maximal for this factor and $t<\left|\mathinner{\omega u\omega^{\prime}}\right|\leq t_{\Omega}$ . Again, let

[TABLE]

and $T=T_{\Delta}\cup T_{\Omega}$ . We want to apply Lemma 3.1 to obtain a system $S\subseteq\left\{\mathinner{a,b}\right\}^{*}\times\left\{\mathinner{a,b}\right\}^{*}$ . Confluence of $T$ follows along the lines of Lemma 3.7, Lemma 3.8 and Lemma 3.9, whereas the statement about the groups in $A^{*}/S$ is analogous to Lemma 3.10. ∎

4 Beyond Groups

In this section we apply local divisors in order to lift the construction of Church-Rosser systems for groups to the general case of monoids. Instead of directly constructing a system over $K=\mathrm{IRR}_{R}(B^{*})c$ , we obtain a system inductively by going over to the local divisor. This decreases the size of the monoid, but increases the size of the alphabet. The first part of the following theorem has been published in [DKRW15], whereas the second part is based on the use of Rees extensions, see [DKW12, DW16].

Theorem 4.1.

Let $\mathbf{H}$ be a group variety such that for every homomorphism $\varphi:A^{*}\to G$ for $G\in\mathbf{H}$ there exists a Parikh-reducing Church-Rosser system $S$ of finite index which factorizes through $\varphi$ . Let $\varphi:A^{*}\to M$ be a homomorphism with $M\in\overline{\mathbf{H}}$ .

1. There exists a $\varphi$ -invariant Parikh-reducing Church-Rosser system $S$ of finite index.

2. If every homomorphism $\varphi:A^{*}\to G$ into a group $G\in\mathbf{H}$ has a Church-Rosser representation in $\overline{\mathbf{H}}$ , then $A^{*}\!/S\in\overline{\mathbf{H}}$ .

Proof.

1. We use induction on $(\left|\mathinner{M}\right|,\left|\mathinner{A}\right|)$ , ordered lexicographically. Since $\overline{\mathbf{H}}$ is closed under taking submonoids, we can restrict ourselves to surjective homomorphisms $\varphi$ . If $M$ is a group, then $M\in\mathbf{H}$ and there exists such a system $S$ by the preconditions. Thus, we can assume that there is a letter $c\in A$ such that $\varphi(c)$ is not a unit. Let $B=A\setminus\left\{\mathinner{c}\right\}$ . By induction the restriction

[TABLE]

admits a Parikh-reducing Church-Rosser system $R\subseteq B^{*}\times B^{*}$ . Consider the set

[TABLE]

This is a prefix code and will be considered as a new alphabet. Let $\psi:K^{*}\to M_{\varphi(c)}$ be the homomorphism to the local divisor at $\varphi(c)$ induced via $\psi(uc)=\varphi(cuc)$ . We have $\left|\mathinner{M_{\varphi(c)}}\right|<\left|\mathinner{M}\right|$ and $M_{\varphi(c)}\in\overline{\mathbf{H}}$ and thus, by induction, there exists a Parikh-reducing Church-Rosser system $T^{\prime}\subseteq K^{*}\times K^{*}$ of finite index, such that $T^{\prime}$ factorizes through $\psi$ . In particular, we have $\psi(\ell)=\psi(r)$ for a rule $(\ell,r)\in T^{\prime}$ . We show that $\varphi(c\ell)=\varphi(cr)$ . For this let $\ell=u_{1}c\ldots u_{n}c$ and $r=v_{1}c\ldots v_{m}c$ . It holds

[TABLE]

Hence, the rule $c\ell\to cr$ is $\varphi$ -invariant. We set

[TABLE]

The system $S=R\cup T$ has the required properties by Lemma 3.1.
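The verification that $c\ell\to cr$ is $\varphi$ -invariant can be written out as a chain of equalities. The following is a sketch for $n,m\geq 1$ , assuming the usual multiplication $\circ$ of the local divisor $M_{\varphi(c)}=\varphi(c)M\cap M\varphi(c)$ , namely $x\varphi(c)\circ\varphi(c)y=x\varphi(c)y$ :

```latex
\begin{align*}
\varphi(c\ell) &= \varphi(cu_{1}c\,u_{2}c\cdots u_{n}c) \\
  &= \varphi(cu_{1}c)\circ\varphi(cu_{2}c)\circ\cdots\circ\varphi(cu_{n}c) \\
  &= \psi(u_{1}c)\circ\psi(u_{2}c)\circ\cdots\circ\psi(u_{n}c) = \psi(\ell) \\
  &= \psi(r) = \psi(v_{1}c)\circ\psi(v_{2}c)\circ\cdots\circ\psi(v_{m}c) \\
  &= \varphi(cv_{1}c)\circ\varphi(cv_{2}c)\circ\cdots\circ\varphi(cv_{m}c) = \varphi(cr).
\end{align*}
```

The second and the last equality use that consecutive factors share the letter $c$ , so that $\varphi(cu_{i}c)\circ\varphi(cu_{i+1}c)=\varphi(cu_{i}cu_{i+1}c)$ .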

2. The statement is clear if $M$ is a group. Otherwise, the construction above is applied. By induction we may assume that $B^{*}\!/R,K^{*}\!/T\in\overline{\mathbf{H}}$ and Lemma 3.1 implies that $A^{*}\!/S\in\overline{\mathbf{H}}$ . ∎

A direct combination of Theorem 3.2 and Theorem 4.1 yields the following corollary.

Corollary 4.2.

Let $M\in\overline{\mathbf{Ab}}$ be a monoid and $\varphi:A^{*}\to M$ be a homomorphism, then there exists a Parikh-reducing Church-Rosser system $S\subseteq A^{*}\times A^{*}$ such that $S$ factorizes through $\varphi$ and $A^{*}\!/S\in\overline{\mathbf{Ab}}$ . In particular, every language $L\subseteq A^{*}$ recognized by $\varphi$ is given as a finite union $L=\bigcup_{u\in L}[u]_{S}$ .

In particular, Theorem 4.1 shows that one can control the groups in the Church-Rosser representation. However, in general one may not preserve other properties, for instance, commutativity.

Proposition 4.3.

Let $\varphi:A^{*}\to\mathbb{Z}/2\mathbb{Z}$ be the homomorphism mapping each letter to the generator of $\mathbb{Z}/2\mathbb{Z}$ . If $\left|\mathinner{A}\right|>1$ , there is no abelian Church-Rosser representation of $\varphi$ .

Proof.

Assume that there exists a Church-Rosser system $S$ of finite index such that $A^{*}/S$ is abelian and there exists a homomorphism $\psi:A^{*}/S\to\mathbb{Z}/2\mathbb{Z}$ with $\varphi=\psi\circ\pi_{S}$ . Let $a,b\in A$ be letters such that $a\neq b$ . Since $S$ factorizes through $\varphi$ , we have $\left|\mathinner{r}\right|\equiv\left|\mathinner{\ell}\right|\bmod 2$ for every rule $(\ell,r)\in S$ and it holds $a\neq b$ in $A^{*}/S$ . Since $A^{*}/S$ is abelian, we obtain $ab=ba$ in $A^{*}/S$ . In particular, $ab\to_{S}1\leftarrow_{S}ba$ and $A^{*}/S$ must be a group. Let $2n$ be the order of $a$ and $b$ . Then $a^{n}=a^{n}b^{n}b^{n}=b^{n}$ holds in $A^{*}/S$ and thus there must be an irreducible word $w$ with $a^{n}\overset{*}{\underset{S}{\Longrightarrow}}w\overset{*}{\underset{S}{\Longleftarrow}}b^{n}$ . By the argument above, there exists a number $k<n$ such that $w\in\left\{\mathinner{a^{k},b^{k}}\right\}$ . Thus, either $a^{n-k}=1$ or $b^{n-k}=1$ which is a contradiction to the definition of $n$ . ∎

5 Complexity of Church-Rosser systems

In this section we analyze the size of a Church-Rosser representation as constructed by Theorem 4.1 and Theorem 3.2. We will restrict our analysis to the construction of the Parikh-reducing Church-Rosser representation. Similar calculations can be made for the analysis of the size of the Church-Rosser system.

Before we prove upper bounds for the size of the constructed Church-Rosser systems, we reconsider the construction. Our constructions used Lemma 3.1 as the basic building block. Let $\varphi:A^{*}\to M$ be a homomorphism. For $B=A\setminus\left\{\mathinner{c}\right\}$ and a system $R\subseteq B^{*}\times B^{*}$ one needs a system $T\subseteq K^{*}\times K^{*}$ for the alphabet $K=\mathrm{IRR}_{R}(B^{*})c$ . Now, unlike in the general case, we are able to reduce the alphabet itself by exploiting the structure of the alphabet. Let $b_{1}\cdots b_{k}c\in K$ with $b_{i}\in B$ and $k>\left|\mathinner{M}\right|$ . By the pigeonhole principle there exist $i<j$ such that $\varphi(b_{1}\cdots b_{i})=\varphi(b_{1}\cdots b_{j})$ and ${i+(k-j)\leq n}$ . Thus, we may introduce the subword-reducing rule $b_{1}\cdots b_{k}c\to b_{1}\cdots b_{i}b_{j+1}\cdots b_{k}c$ (subword-reducing when seen as a rule over $A^{*}$ , not over $K^{*}$ ). If $b_{1}\cdots b_{i}b_{j+1}\cdots b_{k}$ is reducible in $R$ , reduce it further in $R$ . Repeating this process yields a new alphabet for $K$ which is a subset of $B^{\leq n}c$ and therefore, if $\left|\mathinner{B}\right|>1$ , has at most $(\left|\mathinner{B}\right|^{n+1}-1)/(\left|\mathinner{B}\right|-1)$ elements. One can check that the proofs of Theorem 4.1 and Theorem 3.2 also work when this reduction technique for the alphabet $K$ is added. We refrained from directly adding it to the theorems, as they are already quite technical.
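The pigeonhole step of the alphabet reduction can be sketched in Python. As a simplifying assumption, the monoid is modelled by the additive group $\mathbb{Z}/m\mathbb{Z}$ with a hypothetical letter map `phi_letter`; further reduction by $R$ is not modelled, and the function names are illustrative.

```python
def shorten(word, phi_letter, modulus):
    """Cut factors b_{i+1}...b_j with phi(b_1...b_i) = phi(b_1...b_j) until none is left."""
    word = list(word)
    changed = True
    while changed:
        changed = False
        seen = {0: 0}  # image of a prefix -> length of that prefix
        total = 0
        for j, letter in enumerate(word, start=1):
            total = (total + phi_letter[letter]) % modulus
            if total in seen:
                # Two prefixes have the same image: delete the factor between them.
                del word[seen[total]:j]
                changed = True
                break
            seen[total] = j
    return "".join(word)


def count_up_to(m, n):
    """Number of words of length at most n over an alphabet with m letters."""
    return sum(m ** k for k in range(n + 1))
```

After the loop all prefixes of the result have pairwise distinct images, so the result is shorter than the size of the modelled monoid while its image is unchanged. The function `count_up_to` evaluates the geometric sum behind the bound $(\left|\mathinner{B}\right|^{n+1}-1)/(\left|\mathinner{B}\right|-1)$ .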

Proposition 5.1.

Let $\varphi:A^{*}\to G$ be a homomorphism in $G\in\mathbf{Ab}$ , $n=\left|\mathinner{G}\right|$ and $m=\left|\mathinner{A}\right|>1$ , then there exists a Parikh-reducing Church-Rosser system $S$ such that $S$ factorizes through $\varphi$ and

[TABLE]

Proof.

Let $S$ be the Parikh-reducing Church-Rosser system constructed using Theorem 3.2 and the reduction technique described above. Lemma 3.1 shows that for $m>1$ it holds

[TABLE]

where $B=A\setminus\left\{\mathinner{c}\right\}$ . In the case of Theorem 3.2, $R$ is constructed inductively whereas $T$ is constructed directly. By Lemma 3.6, every irreducible word in $\mathrm{IRR}_{T}(K^{*})$ has length at most $t_{\Omega}$ and therefore $\left|\mathinner{K^{*}\!/T}\right|\leq\left|\mathinner{K}\right|^{t_{\Omega}}$ . The construction of $t_{\Omega}$ in the proof of Lemma 3.6 shows that $t_{\Omega}\leq 2^{\left|\mathinner{\Omega}\right|}(t_{0}+t)$ whereas $t_{0}+t\in\mathcal{O}(n^{2}m)$ . Since $\Omega\subseteq K^{3n}$ we obtain

[TABLE]

Using the alphabet reduction technique, we can assume $\left|\mathinner{K}\right|\leq m^{n+1}$ . Note that $\left|\mathinner{K}\right|^{3n}\leq(m^{n+1})^{3n}=m^{(n+1)3n}$ does not yield another exponential jump. A straightforward calculation yields the existence of a constant $c\in\mathbb{N}$ such that

[TABLE]

Now let $\mathrm{ms}(\varphi)$ denote the smallest size of a Parikh-reducing Church-Rosser representation of $\varphi$ and set

[TABLE]

to be the complexity over all possible homomorphisms with $\left|\mathinner{A}\right|\leq m$ and $\left|\mathinner{G}\right|\leq n$ . We have seen that the recursion

[TABLE]

holds and show $\mathrm{ms}(n,m)\leq 2^{2^{m^{cn^{2}+2}}}$ inductively using this recursion. Note that $\mathrm{ms}(n,1)=n$ and thus the inequality is true in the base case $m=2$ . Also $\mathrm{ms}(1,m)=1$ and therefore we assume $n>1$ . For $m>2$ and $n>1$ it holds

[TABLE]

The last inequality holds since

[TABLE]

The triple exponential upper bound given by Proposition 5.1 seems huge; however, there is already a single exponential lower bound which is fairly easy to see. The lower bound comes from the fact that Church-Rosser systems cannot directly represent group identities which preserve length, such as commutation.

Proposition 5.2.

For every $n\in\mathbb{N}$ there exists a homomorphism $\varphi:A^{*}\to G$ into an abelian group $G$ of size $n$ such that for every length-reducing Church-Rosser system $S$ which factorizes through $\varphi$ all words of length smaller than $n$ are irreducible, that is, $A^{<n}\subseteq\mathrm{IRR}_{S}(A^{*})$ . In particular, if $\left|\mathinner{A}\right|>1$ :

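$$\left|\mathinner{A^{*}\!/S}\right|\geq\left|\mathinner{A^{<n}}\right|=\frac{\left|\mathinner{A}\right|^{n}-1}{\left|\mathinner{A}\right|-1}.$$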

Proof.

Consider the cyclic group $G$ of order $n$ and the homomorphism $\varphi:A^{*}\to G$ which maps all letters $a\in A$ to the same generator $g$ of $G$ . Let $S\subseteq A^{*}\times A^{*}$ be a length-reducing Church-Rosser system which factorizes through $\varphi$ . We show that every word of length less than $n$ is irreducible in $S$ . Let $w\in A^{*}$ be a word with $\left|\mathinner{w}\right|<n$ . Assume that $w\overset{}{\underset{S}{\Longrightarrow}}v$ for some word $v$ . Since $S$ is length-reducing, $\left|\mathinner{v}\right|<\left|\mathinner{w}\right|$ . However, $\varphi(w)=\varphi(v)$ implies $g^{\left|\mathinner{w}\right|-\left|\mathinner{v}\right|}=1$ . Since the order of $g$ is $n$ , this is a contradiction to $0<\left|\mathinner{w}\right|-\left|\mathinner{v}\right|<n$ and $w$ must be irreducible. ∎
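The counting behind this argument can be checked numerically with a small sketch. It assumes the setting of the proof (every letter is mapped to the generator $1$ of $\mathbb{Z}/n\mathbb{Z}$); the function names are hypothetical and not taken from the paper.

```python
def words_shorter_than(m, n):
    """Number of words of length < n over an alphabet of size m."""
    return sum(m**length for length in range(n))


def shortest_reducible_length(n):
    """Smallest |w| for which some strictly shorter v has the same image,

    assuming every letter maps to the generator 1 of Z/nZ, so that the
    image of a word is its length modulo n.
    """
    length = 1
    while True:
        if any((length - shorter) % n == 0 for shorter in range(length)):
            return length
        length += 1


for n in range(1, 6):
    print(n, shortest_reducible_length(n), words_shorter_than(2, n))
```

In this setting no left-hand side of a length-reducing rule can be shorter than $n$, so all of the counted words are irreducible. For $n=2$ the count is $\left|A\right|+1$.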

Note that this proof does not use the Church-Rosser property and thus one could expect a larger size of the Church-Rosser representation.

Example 5.3.

Niemann and Waldmann constructed an explicit Parikh-reducing system $S$ for the case $\varphi:A^{*}\to\mathbb{Z}/2\mathbb{Z}$ with $\varphi(a)=1$ for all $a\in A$ [NW02, Nie02]. Their system is given by $S=\left\{xyz\to\max(x,z)\mid x,y,z\in A,\ y=\min(x,y,z)\right\}$ for some arbitrary order on $A$. The irreducible elements in $A^{*}\!/S$ are exactly the sequences which are first strictly increasing and then strictly decreasing, that is

[TABLE]

This yields $\left|\mathinner{A^{*}\!/S}\right|=\left|\mathinner{\mathrm{IRR}_{S}(A^{*})}\right|=1+\sum_{i=1}^{\left|\mathinner{A}\right|}2^{2i-1}=(2^{2\left|\mathinner{A}\right|+1}+1)/3$ which is significantly larger than the lower bound $\left|\mathinner{A}\right|+1$ given in Proposition 5.2.
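The count in this example can be cross-checked by brute force. The following sketch enumerates all words that contain no left-hand side of $S$ as a factor, assuming the alphabet is encoded as the integers $0,\dots,\left|A\right|-1$ with their natural order; the function name is hypothetical.

```python
def irreducible_words(alphabet_size):
    """All words over {0, ..., alphabet_size - 1} without a factor xyz
    such that y = min(x, y, z), i.e. without a left-hand side of S."""

    def is_left_hand_side(x, y, z):
        return y <= x and y <= z

    level = [()]  # irreducible words of the current length
    result = []
    while level:
        result.extend(level)
        # Extend each irreducible word by one letter; only the last
        # three letters can form a new reducible factor.
        level = [
            w + (a,)
            for w in level
            for a in range(alphabet_size)
            if len(w) < 2 or not is_left_hand_side(w[-2], w[-1], a)
        ]
    return result


for size in range(1, 5):
    count = len(irreducible_words(size))
    print(size, count, (2 ** (2 * size + 1) + 1) // 3)
```

The loop terminates because irreducible words have bounded length: after the first step that does not strictly increase, every further step has to decrease strictly.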

In the monoid case, the minimal size of a Church-Rosser representation is bounded by a quadruple exponential function. This increase in complexity, compared to the group case, comes from the fact that, unlike in the group case, the system $T\subseteq K^{*}\times K^{*}$ is constructed by induction. However, this is also the reason that the alphabet reduction technique is even more powerful in this case. Consider the function $f:\mathbb{N}^{2}\to\mathbb{N}$ given by $f(1,m)=1$, $f(n,1)=n$ and $f(n,m)=2f(n,m-1)^{2}\cdot f(n-1,f(n,m-1))$ for $n,m>1$. This function gives an upper bound for the maximal size of a Church-Rosser representation of a monoid of size $n$ and an alphabet of size $m$ without any optimization. Consider further the hyperoperation function $A_{1}(n)=2n$, $A_{k}(1)=2$ and $A_{k}(n)=A_{k-1}(A_{k}(n-1))$. (The notation $A$ comes from Ackermann, since the function $A$ is a modified Ackermann function.) For fixed $k$, the function $A_{k}$ is primitive recursive; however, the two-variable function $A$ grows faster than any primitive recursive function, see e.g. [DW83]. An induction shows that $f(n,m)\geq A_{n-1}(m)$ for $n>1,m\geq 1$. Hence, without the alphabet reduction the recursive formula would yield a non-primitive recursive function.
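To get a feeling for the growth of $f$, the two recursions can be evaluated directly for very small arguments. This is only a numerical sanity check of the inequality $f(n,m)\geq A_{n-1}(m)$ on a handful of values, not a proof; the Python names `f` and `A` mirror the symbols above, and the list of test arguments is an arbitrary choice.

```python
from functools import lru_cache


@lru_cache(maxsize=None)
def f(n, m):
    """Unoptimized size bound: f(1,m)=1, f(n,1)=n, else the recursion."""
    if n == 1:
        return 1
    if m == 1:
        return n
    prev = f(n, m - 1)
    return 2 * prev**2 * f(n - 1, prev)


@lru_cache(maxsize=None)
def A(k, n):
    """Modified Ackermann hierarchy: A_1(n)=2n, A_k(1)=2."""
    if k == 1:
        return 2 * n
    if n == 1:
        return 2
    return A(k - 1, A(k, n - 1))


# Only tiny arguments are feasible: f(3, 3) already needs f(2, 2304).
small = [(2, m) for m in range(1, 7)] + [(3, 1), (3, 2), (4, 1)]
for n, m in small:
    print(n, m, f(n, m) >= A(n - 1, m))
```

Already $f(2,m)=2^{2^{m}-1}$ is doubly exponential in $m$, which is why the check stops at such small values.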

Proposition 5.4.

Let $\varphi:A^{*}\to M$ be a homomorphism into a monoid $M\in\overline{\mathbf{Ab}}$, $n=\left|\mathinner{M}\right|$ and $m=\left|\mathinner{A}\right|$. Then there exists a Parikh-reducing Church-Rosser system $S$ such that $S$ factorizes through $\varphi$ and

[TABLE]

Proof 5.5.

If $M\in\mathbf{Ab}$ , we know that there exists such a system $S$ with $\left|\mathinner{A^{*}\!/S}\right|\in 2^{2^{m^{\mathcal{O}(n^{2})}}}$ by Proposition 5.1. If $m=1$ , then there exists a system $S$ such that $\left|\mathinner{A^{*}\!/S}\right|\leq n$ . In the other case we will use the local divisor construction of Theorem 4.1. Note that by the alphabet reduction technique we may assume that $\left|\mathinner{K}\right|<m^{n+1}$ .

Let $\mathrm{ms}(\varphi)$ denote the smallest size of a Parikh-reducing Church-Rosser representation of $\varphi$ and set

[TABLE]

to be the complexity over all possible homomorphisms with $\left|\mathinner{A}\right|\leq m$ and $\left|\mathinner{M}\right|\leq n$ .

The base cases are $m=1$ or $M$ is a group. For $m=1$ there exists a system of size $n$ . In all other cases we have the following recursion formula for $\mathrm{ms}(n,m)$ :

[TABLE]

Note that $n>1$ since $M$ is not a group. Choose $c\in\mathbb{N}$ such that $\mathrm{ms}(n,m)\leq 2^{2^{m^{c(n+1)!}+n}}$ for all base cases. This is possible since the group case is in $2^{2^{m^{\mathcal{O}(n^{2})}}}$ . We show that

$$\mathrm{ms}(n,m)\leq 2^{2^{m^{c(n+1)!}+n}}$$

in general. Inductively, it holds

[TABLE]

The last inequality holds because for $n,m>1$

[TABLE]

and thus $(m-1)^{c(n+1)!}+n+1<m^{c(n+1)!}+n-1$ .

6 Conclusion

In this paper we introduced the notion of Parikh-reducing Church-Rosser representations. We were able to construct such representations in the case of languages in $\overline{\mathbf{Ab}}$ and for group languages over a two-element alphabet. Furthermore, we studied algebraic properties of such representations and the complexity of the corresponding systems. Several questions remain open as future work. Most importantly, does there exist a finite Parikh-reducing Church-Rosser representation for every homomorphism into a finite group? Note that this already implies the case for every finite monoid by Theorem 4.1. Another interesting open question is which algebraic properties can be preserved by Church-Rosser representations. For example, it seems unlikely that every homomorphism into a finite group has a Church-Rosser representation which is a group again, although it may happen in some special cases. Additionally, there is a huge gap between our lower and upper bounds for the complexity. Therefore it is interesting whether there are constructions for Church-Rosser representations which yield a better upper bound and what a good lower bound for the size of a Church-Rosser representation is.

Bibliography

[AK16] Jorge Almeida and Ondřej Klíma. On the irreducibility of pseudovarieties of semigroups. Journal of Pure and Applied Algebra, 220(4):1517–1524, 2016.
[BO93] Ron Book and Friedrich Otto. String-Rewriting Systems. Springer-Verlag, 1993.
[DK15] Volker Diekert and Manfred Kufleitner. A survey on the local divisor technique. Theoretical Computer Science, 610:13–23, 2015.
[DKRW15] Volker Diekert, Manfred Kufleitner, Klaus Reinhardt, and Tobias Walter. Regular languages are Church-Rosser congruential. J. ACM, 62:39:1–39:20, November 2015.
[DKW12] Volker Diekert, Manfred Kufleitner, and Pascal Weil. Star-free languages are Church-Rosser congruential. Theoretical Computer Science, 454:129–135, 2012.
[DW83] Martin D. Davis and Elaine J. Weyuker. Computability, Complexity, and Languages. Academic Press, 1983.
[DW16] Volker Diekert and Tobias Walter. Characterizing classes of regular languages using prefix codes of bounded synchronization delay. In Ioannis Chatzigiannakis, Michael Mitzenmacher, Yuval Rabani, and Davide Sangiorgi, editors, 43rd International Colloquium on Automata, Languages, and Programming (ICALP 2016), Leibniz International Proceedings in Informatics (LIPIcs), pages 129:1–129:13, 2016.
[Eil76] Samuel Eilenberg. Automata, Languages, and Machines, volume B. Academic Press, New York and London, 1976.
