This paper investigates the algebraic properties of tree languages that are not definable by first-order logic with ancestor predicate, providing recursive proofs of nondefinability and extending algebraic concepts.
Contribution
It introduces a recursive proof method for non-definability of tree languages and extends algebraic notions like aperiodicity within forest algebra frameworks.
Findings
01
Recursive proofs exist for all non-definable languages.
02
Extended algebraic structures generalize existing notions like aperiodicity.
03
A new algebra of mappings aids in analyzing non-definability.
Abstract
We explore from an algebraic viewpoint the properties of the tree languages definable with a first-order formula involving the ancestor predicate, using the description of these languages as those recognized by iterated block products of forest algebras defined from finite counter monoids. Proofs of nondefinability are infinite sequences of sets of forests, one for each level of the hierarchy of quantification levels that defines the corresponding variety of languages. The forests at a given level are built recursively by inserting forests from previous level at the ports of a suitable set of multicontexts. We show that a recursive proof exists for the syntactic algebra of every non-definable language. We also investigate certain types of uniform recursive proofs. For this purpose, we define from a forest algebra an algebra of mappings and an extended algebra, which we also use to…
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
Topicssemigroups and automata theory · Computability, Logic, AI Algorithms · Advanced Algebra and Logic
Full text
Proving that a Tree Language is not First-Order Definable
Martin Beaudry
Département
d’informatique, Université de Sherbrooke,
Sherbrooke (Qc) Canada, J1K 2R1, [email protected].
Work supported by NSERC of Canada.
Abstract
We explore from an algebraic viewpoint the properties of the tree languages
definable with a first-order formula involving the ancestor predicate,
using the description of these languages
as those recognized by
iterated block products of forest algebras
defined from threshold-τ, period-π counter monoids.
Ehrenfeucht-Fraïssé games, i.e.
proofs of non-definability,
are infinite sequences of sets of forests,
one for each level of the hierarchy
of quantification levels that defines the corresponding
variety of languages.
A proof is recursive when the forests at a given level
are built by inserting forests from the previous level at the ports of
a suitable set of multicontexts.
We show that a recursive proof exists for the
syntactic algebra of every non-definable language.
We also investigate certain types of uniform recursive proofs.
For this purpose, we define from a forest algebra
an “algebra of mappings” and an “extended algebra”,
which we also use to redefine the notion of aperiodicity
in a way that generalizes the existing ones.
Words and trees are used almost universally in Computer Science, and
logical formalisms are among the most convenient tools for specifying
these objects, or sets thereof.
Automata constitute another class of tools, procedural in nature,
widely used to define languages;
underlying this formalism is a rich algebraic theory, through which
further tools from other areas of Mathematics can be used to better understand
the properties of word and tree languages.
Moreover,
the most significant classes of these languages happen to have
descriptions in several formalisms. For example,
the regular word languages are exactly those that are
recognized by finite automata and monoids, and those that
are definable by monadic second-order formulas with a
unary predicate for each letter and a “left-of” binary positional
predicate.
Similarly the regular tree languages are simultaneously
those that are definable
by monadic second-order formulas with two
positional predicates (“ancestor-of” and “next-sibling-of”)
and those that are recognized by finite tree automata,
as well as various sorts of algebraic structures
(e.g. finite term algebras, finite forest algebras, finitary preclones).
In the same vein, the word languages definable with
first-order logical formulas with the “left-of” predicate
have several algebraic and combinatorial descriptions,
see [16, 23, 12, 22, 29].
In particular, these languages are
precisely those whose syntactic monoid is
aperiodic;
thanks to this property,
whether a regular word language is
first-order definable
can be determined with
a straightforward
algorithm.
In the world of tree languages,
however, none of the definitions for
aperiodicity of the
syntactic algebra tried so far has managed
to characterize precisely the
first-order definable languages
[11, 4],
and the techniques invented to show that
certain important subclasses of these languages
are indeed decidable [1, 3, 2, 17, 18] did not seem to extend to the whole class.
Forest algebras combine two monoids (horizontal
and vertical) in a way which makes it easy for researchers to
apply techniques from the theory of monoids and word languages.
An encouraging harvest of results has already been obtained with this tool [6, 3, 4].
In this paper, we look at a counterpart, in the world of trees and forests, to the description of the
aperiodic monoids as
the variety of monoids generated by iterated block products of semilattices [22].
This is a
description of the first-order definable languages,
developed in terms of the variety of
finitary preclones [9] generated by iterated
block products of preclones that count occurrences
of node labels, regardless of the actual tree
structure.
Intuitively, a block product
works as the
combination of two tree automata
where at every node x, the second automaton,
besides the label of x, also reads
the current state reached by the first automaton
after reading the subtree rooted
at x (“below” x)
and the outcome of the
processing, by the first automaton,
of the context of x within the tree (“above” x).
The description of first-order definable languages
developed in [10, 13] suggests that
the ≡τ,π threshold-τ, period-π numerical congruences
are a fundamental feature in the combinatorics of first-order definable
languages.
Consistently with this we use the same kind of counting
and the corresponding quotient, counter monoids
Nτ,π=N/≡τ,π
(in these notations,
the Boolean OR U1 and the cyclic group Zπ
are respectively N1,1 and N0,π).
In our formalism,
we denote by ODM the one-dimensional algebra where M is the horizontal monoid,
by
⟨⟨Nτ,π⟩⟩ the variety of forest algebras generated by
ODNτ,π,
by
∗∗n⟨⟨Nτ,π⟩⟩ the variety generated by iterated block
products of n algebras from ODNτ,π,
and by ∗∗⟨⟨Nτ,π⟩⟩ the closure of these varieties
over joint.
Let
FO[≺] and FOModπ[≺] denote
both the class of forest languages definable by first-order
formulas built with the
“ancestor” positional predicate and the usual
quantifiers only (for FO[≺]) or the same with the
∃iπ, 0≤i<π, modular quantifiers,
and the varieties generated by their syntactic algebras.
Using the formalism of finitary preclones they introduced in
[8], by Esík and Weil
have established in [9] the correspondences
FO[≺]=∗∗⟨⟨N1,1⟩⟩
and
FOModπ[≺]=∗∗⟨⟨N1,π⟩⟩.
We explore two ways of defining from G=(G,W)
another algebra, where multicontexts are the underlying objects.
The algebra of mappings G#, and the
“multivertical monoid” of G derived from it
enable us to
define notions of pumping and aperiodicity that generalize two
of the known necessary conditions for membership in
FO[≺], namely aperiodicity of the vertical monoid and the
“absence of vertical confusion on uniform multicontexts”
defined in [4].
The extended algebra G%=(G%,W%), where
G%
is the powerset of G, makes explicit some properties of G that are not directly visible in G or G#.
An example is described in Section 5.4:
this is a language whose syntactic algebra
has aperiodic vertical and multivertical monoids,
but where W% is divided by the group Z2.
The language is defined with a FOMod2[≺] formula
that, among other things, counts the parity of the length of
certain node-to-leaf paths;
a pair of elements of W% does precisely this counting.
An algebra G lies outside of the variety
∗∗⟨⟨Nτ,π⟩⟩
if, and only if there exists
an infinite sequence of sets S(n), one for each n≥1, of forests belonging
to different languages recognized by G, such that
the elements of
S(n) cannot be told apart by any forest algebra in ∗∗n⟨⟨Nτ,π⟩⟩.
This sequence is usually described through an Ehrenfeucht-Fraïssé game.
We call such a sequence a proof of non-membership in ∗∗⟨⟨Nτ,π⟩⟩.
An Ehrenfeucht-Fraïssé game actually builds a recursive proof,
where each forest of
S(n+1) is built by inserting
copies of the elements of S(n) at the ports of
the corresponding
element of a set M(n) of multicontexts.
Such a proof can be specified with an infinite sequence of
such sets M(n+1);
we denote by
RC(G⋆Hτ,πn)
the proposition that states the existence a M(n+1)
that has the required properties;
RC stands for “recursive construction”.
We prove that
G∈∗∗⟨⟨Nτ,π⟩⟩
if, and only if RC(G⋆Hτ,πn) holds
for every n≥1,
that is, every algebra that is not first-order has a recursive proof
of non-membership. Next, we observe that in the existing proofs, the circuit
M(n+1) is either identical to
M(n), in which case each forest of
S(n) is built from copies of the same, finite set
of multicontexts (“proof-by-copy”), or
M(n+1) is obtained by pumping a starting set
of multicontexts (“proof-by-pumping”).
The questions of the existence of a
proof-by-copy
and of a restricted form of proof-by-pumping
are both recursively enumerable.
Section 2 contains background
on forests, multicontexts and circuits,
and on forest algebras
and the varieties ∗∗⟨⟨Nτ,π⟩⟩.
In Section 3,
we define the algebras G# and G%.
and the related notions of
pumping and aperiodicity.
In Section 4, we prove that an algebra is outside
∗∗⟨⟨Nτ,π⟩⟩ if, and only if
this can be asserted with
a recursive proof; we then explore the notions of
proof-by-copy and proof-by-pumping.
In Section 5, we
discuss in our formalism
some typical examples of
non-membership proofs.
We conclude with some comments and open questions.
2 Definitions and Background
2.1 Forests, Multicontexts, and Circuits
We consistently work with a finite alphabet
A, which we assume to always contain a neutral letter e, such that for every forest homomorphism γ,
γ(e□) is the identity mapping. Let B be another alphabet, disjoint from A.
A multicontextm over (A,B) is a sequence of trees in which
a subset of the leaves consists of ports, where
every non-port node y
carries a label λ(m,y)∈A.
We denote by nodes(m) the set of all nodes in m,
by ports(m) the set of its ports and by interior(m) the set of the non-port nodes.
We work with multicontext where each port either has a label ν(x)∈B,
or has several labels, each
specified as a mapping from ports(m) to a set that is disjoint from A.
A forest is a multicontext without ports;
a context in the usual sense is a forest with a unique port that
carries the special label □∈(A∪B),
called its □-port. Throughout the paper,
this port is considered apart from the others.
Given x∈nodes(m),
we denote by Δ(m,x) the multicontext
consisting of all subtrees of m rooted at the sons of x.
The subtree rooted at x, i.e. Δ(m,x) plus the node x,
is denoted Δ+(m,x).
The
context of x within m, with notation ∇(m,x),
is built from m and x by replacing
Δ+(m,x) with a □-port.
The ancestors of this port constitute the trunk of the context.
If we deal with a set of multicontexts M instead of an individual m, we
use the notations nodes(M), λ(M,x), Δ(M,x), etc.
The sets of all forests, and contexts over A are
respectively denoted
HA and VA. We use the notations MA,B and CA,B,
respectively,
for the set of all multicontexts over (A,B) for the
set of all multicontexts with a □-port
(the contexts-in-multicontexts, so to speak).
We use the
standard representation for individual forests of multicontexts,
where nodes are listed in preorder and where
concatenation and + represent the father-son relation and
horizontal addition, respectively. For example, a(b+c)
is a tree with a root labelled a and two sons labelled b and c,
while
ab+c is a forest of two trees, where nodes a and c are roots,
and nodes b and c are leaves.
Inserting t in a context s consists in replacing the □-port of s
with a copy of t; the resulting forest is
denoted s⋅t, or st.
Insertion in multicontexts is done
here either on a wholesale basis, i.e. something
is inserted at every port,
or
on a selective basis, when insertion occurs at a
pre-specified set of ports.
The latter method is defined in Section 3;
the former is associated with circuits and the construction of witnesses,
as follows.
Let A, B and C be three sets.
A circuit over (A,B,C)
is a set M with
an element mc for every c∈C;
this component is a multicontext m∈MA,B.
We can regard M as having an input wire
for every b∈B, an output wire for every c∈C,
and mc is the result of
unraveling into a tree all those nodes of M from which
the output wire c is accessible.
A set S of forests over (A,B)
is defined similarly, with an element sb∈HA
for every b∈B. The insertion of S in
a circuit M over (A,B,C) consists in
inserting a copy of the forest sν(x)
at every x∈ports(M);
the result is a set of forests over (A,C),
denoted M⋅S.
If M and M′ are circuits over (A,B,B)
then inserting M′ in M
builds a circuit M⋅M′ over (A,B,B).
It can be verified, using standard methods, that this operation is associative.
2.2 Forest algebras
The reader is assumed to be knowledgeable with the notions of
semigroups and monoids, and their relations with regular languages,
word congruences and monoid homomorphisms (see [14, 16]).
Two types of notations are used for
the monoids discussed in the article. There is an additive, or “horizontal” notation where
the identity and operation are denoted [math] and +, respectively, although this does in no way
imply that the latter is commutative. In the multiplicative, or “vertical” notation, the neutral element is denoted
ε
and the operation is written with ⋅ or by concatenation of the arguments.
A * transformation * of a set S is a mapping S→S, i.e. an element
of the monoid SS.
A * translation * in a monoid M (with the additive notation) is a mapping
[u+ε+v]:M→M, where u,v∈M,
defined by s↦u+s+v. If M is commutative, then
the translations are of the form [u+ε].
The set M(M)={[u+ε+v]∣u,v∈M} with the composition of functions
is the translation monoid of M.
Definition 2.1
A forest algebra is
a pair H=(H,V) where (H,+) is a monoid and (V,⋅)
is a submonoid of HH which contains M(H).
Monoids H and V are the
horizontal and vertical monoids of H, respectively.
Because V is a submonoid of HH, its action on H
is faithful.
Forest algebras were introduced in [6] as pairs of abstract monoids;
in that case, faithfulness has to be specified in the definition.
A forest algebra homomorphism
from H=(H,V) to G=(G,W)
is a pair of mappings α=(αH,αV) where
αH and αV are monoid homomorphisms H→G
and V→W, respectively, and such that
αH∘w=αV(w)∘αH for every w∈V.
The free forest algebra over A is
FA=(HA,VA); since it is generated from
{a□∣a∈A},
a homomorphism α:FA→H
is completely specified once H and every αV(a□), a∈A, are known.
A forest congruence in FA is a pair of
equivalence relations, both denoted by ≈, such that in HAs≈t iff ps≈pt for every context p,
and
in VAp≈q iff pt≈qt for every forest t.
A congruence ≈refines another congruence
≃ over the same domain,
when x≈x′⇒x≃x′ for all x,x′.
A homomorphism α defines its nuclear congruence:
s≈αt⇔α(s)=α(t),
and conversely a congruence ≈ defines a homomorphism from FA to FA/≈.
A set L⊂HA is
recognized byH if there exist
a homomorphism α:FA→H
and a subset F⊂H
such that for all t∈HA,
t∈L⇔α(t)∈F. A context language K⊂VA
is recognized in the same way, with an accepting set P⊂V.
The syntactic congruences of these languages are refined by
≈α.
A variety of forest algebras is a class of finite forest algebras closed under finite direct
product and division.
Given forest algebras
(G,W) and (H,V), we say that G is a subalgebra of H iff G⊆H and W⊆V,
and that it divides H, with notation G≺H, if it is the homomorphic image of a subalgebra
of H.
A variety of forest languages is formally defined
as a mapping W such that, for every alphabet A, W(A)
is closed under finite boolean operations, inverse homomorphism of free algebras
and context quotients.
With L a language and p∈VA, the context quotient of L by p is the
set p−1L={s∣ps∈L}; a
forest algebra which recognizes L also recognizes p−1L.
The lattices of varieties of forest algebras and of varieties of forest languages are isomorphic [5].
Let G=(G,W) be a forest algebra and let g,h∈G.
An element h is accessible from g
when g=wh for some w∈W.
A set is
strongly connected when its elements are mutually accessible;
a strongly connected component of G is a subset that is maximal for this property.
Let K be such a set: we define from it the set
W−1K={g∈G:∃w∈W,wg∈K}
of all elements from which K is accessible, and
its complement I(K), which is an ideal,
that is,
a subset of G closed under the
action of every element of W.
Let K⊂G.
The leaf-completion of a multicontext m through
a mapping χ:ports(m)→K
is the forest
χ˘(m)∈HA∪K,
obtained by
labeling every port x with χ(x).
Consistently with this,
the leaf-extension of
a homomorphism γ:FA→G
to HA∪K
is built by defining γ(tk)=k,
for every h∈K and one-node tree
tk with label k.
Then
γ∘χ˘(m) is
the image by γ of the leaf-completion of m through
χ.
2.3 Block product congruences
It is known that the equivalence relation over FA where every class consists in all forests that model the same
set of formulas of quantifier depth n, is a forest congruence.
A generalized version of this congruence is
≈τ,πn, where n,τ,π≥1
are integers,
defined as follows:
∙
it is built around the threshold-τ, period-π counting
congruence over N, defined by
[TABLE]
the quotient monoid N/≡τ,π is denoted Nτ,π;
∙
given s,t∈HA, we have
s≈τ,π1t if, and only if, for every a∈A,
the number of nodes with label a in s and in t
are congruent under ≡τ,π;
the quotient algebra FA/≈τ,π1 is denoted
Hτ,π1 ; the corresponding surjective homomorphism
is ατ,π1:FA→Hτ,π1;
∙
for n≥1,
given that ≈τ,πn and the quotient algebra
Hτ,πn=(Hτ,πn,Vτ,πn)
are already known,
we define a relabeling operation
s↦sατ,πn which consists in
replacing, at every node x of s, the label λ(s,x)∈A
with the triple
[TABLE]
this defines the relabeling alphabet
Dατ,πn=A×Hτ,πn,×Vτ,πn;
the same is done in a context t∈VA; however, the new label of x
is different depending on whether x is on the trunk,
so that
tατ,πn is a context over
Eατ,πn=(Aoff∪Atrunk)×Hτ,πn,×Vτ,πn,
where Aoff and Atrunk are disjoint copies of A;
∙
for n≥1 and s,t∈HA, we have
s≈τ,πn+1t if, and only if s≈τ,πnt
and
sατ,πn≈τ,π1tατ,πn.
Example:
we have (a+□)≈τ,π1a□ and
(a+□)≈τ,π2a□, which illustrates
the distinction between trunk and off-trunk nodes.
The quotient algebra FA/≈τ,π1 is isomorphic
to the one-dimensional algebra ODNτ,π∣A∣.
A one-dimensional forest algebra111Also called flat algebras in previous works on
the topic: the homomorphic image of a forest is the image in a monoid
of a “flattened”,
“one-dimensional” version of the forest. The wording is also a
reference to the notion
that an
algebra ODM□ODN recognizes
forest languages that are “more two-dimensional” than
those recognized by ODM.
is a pair (M,M(M)) such that for every homomorphism
γ:FA→(M,M(M)) and every a∈A, there exists m∈M such that
γ(a□)=[m+ε]. In such an algebra,
the homomorphic image of a forest t∈HA is independent of its structure, that is, the algebra only
considers the string η(t) of its node labels, given in a predetermined order (e.g. in preorder).
Therefore, (M,M(M)) associates to γ a monoid homomorphism
γ^:A∗→M, such that γH=γ^∘η.
We denote by ODM the (unique)
one-dimensional algebra built from M.
The congruences ≈τ,πn can be defined algebraically, as follows.
Let ⟨Nτ,π⟩
and
∗∗1⟨⟨Nτ,π⟩⟩
denote respectively the variety of monoids generated by Nτ,π
and the variety of forest algebras generated by the algebras
ODM where M∈⟨Nτ,π⟩.
Then for every language L⊆HA,
its syntactic forest algebra G(L) belongs to ∗∗1⟨⟨Nτ,π⟩⟩
if, and only if ≈τ,π1 refines its syntactic congruence,
or equivalently, iff G(L) divides Hτ,π1.
Next, every algebra Hτ,πn+1=HA/≈τ,πn+1 is a block product
Hτ,πn□ODNτ,πN, with
N≥∣Dατ,πn∪Eατ,πn∣.
We use ∗∗n+1⟨⟨Nτ,π⟩⟩ to denote the variety
generated by block products of the form G□ODNτ,πN
with G∈∗∗n⟨⟨Nτ,π⟩⟩.
Finally, ∗∗⟨⟨Nτ,π⟩⟩=⋁n≥1∗∗n⟨⟨Nτ,π⟩⟩.
We will make abundant use of the following.
Proposition 2.2
The following statements on a finite-index congruence ≃ over FA are equivalent:
FA/≃∈∗∗n⟨⟨Nτ,π⟩⟩;
FA/≃≺Hτ,πn; the congruence
≈τ,πn refines ≃.
□
Let FO[≺] denote the variety of all forest languages
definable with first-order logic formulas with the ∀ and ∃
quantifiers and the ‘ancestor’ positional predicate; for q≥2,
let FOModq[≺] denote the variety defined in terms of the same
sort of formulas, where now the ∃iq, 0≤i<q, modular quantifiers are
also allowed.
It was proved in
[9] that the syntactic preclones of the languages in FO[≺]
generate the same variety as the iterated block products of
preclones defined in terms of counting under threshold one (i.e. the monoid U1);
adding to the generating preclones those
defined with counting under the congruence ≡0,π
yields a characterization for FOModπ[≺].
It can be verified that these equivalences
translate into
FO[≺]=⋁τ≥1∗∗⟨⟨Nτ,1⟩⟩
and
FOModπ[≺]=⋁τ≥1∗∗⟨⟨Nτ,π⟩⟩.
Remark. Actually,
FO[≺]=∗∗⟨⟨N1,1⟩⟩, where
N1,1=U1 is the Boolean OR monoid, and
similarly
FOModπ[≺]=∗∗⟨⟨N1,π⟩⟩,
so that working in terms of nontrivial thresholds τ is not mandatory.
However, doing so makes it possible to follow
more closely the counting-under-threshold that seems to be inherent to the
construction of proofs of non-membership in FO[≺],
and is reminiscent to the description of the
first-order definable forest
languages developed in [10, 13].
Note that a characterization Modπ[≺]=∗∗⟨⟨Zπ⟩⟩
also exists,
where Zπ is the cyclic group of order π;
we put aside this special case in the current version of this paper.
3 Algebras for Multicontexts
Forest algebras were designed as tools to handle trees, forests, and contexts over A.
Dealing with multicontexts over (A,B) as we do in this article demands that
a suitable algebraic structure be developed to describe how a
forest algebra works on them.
A first approach consists, given
a forest algebra K=(K,U),
in regarding a
multicontext as a specification for a multivariate mapping
from KB to K.
This defines the algebra of mappingsK#; it is used to
define the notion of pumping, which underlies the construction
of certain Ehrenfeucht-Fraïssé games,
and to associate to K a threshold and a period that are consistent with those used
so far in the literature.
A second approach consists in considering
that a port label specifies elements of K
are allowed as inputs at that port.
This leads to the definition of the extended algebraK%,
which we use to
generalize once more the notions of threshold, period,
and aperiodicity.
Necessary conditions for
first-order definability, that supersede
some of the existing ones,
are defined from the latter.
3.1 Multicontexts
We use both m and (m,ν) to denote the pair
consisting of a multicontext m, where
every interior node carries a label λ(y)∈A,
and a port labeling ν(x)∈B.
When this pair is equipped with a second port labeling
ψ, we denote the resulting tuple t=(m,ν,ψ)
when ψ is fixed and the emphasis is on t as a whole,
and ψ˘(m) when it is understood that
(m,ν) is fixed and ψ is one of several possible
second port labelings.
Next, instead of labeling a port directly with a horizontal
monoid element, as it was done in the previous section,
we take ν(x) and ψ in sets B and C,
respectively, such that
A, B and C are pairwise disjoint;
when dealing with specific algebras,
leaf extensions of the appropriate homomorphisms are
then defined on B and C.
Note that we are ultimately interested in
the recognition of languages over A,
so that B and C are artefacts used in this
process and the ultimate results should not depend on them.
The tuples over A and B×C,
along with the contexts defined from them
by replacing a leaf
with a special port □,
constitute a forest algebra
FA,B×C=(MA,B×C,CA,B×C);
those over A and B constitute
FA,B=(MA,B,CA,B);
the reader can verify that both
are free algebras.
Besides the insertion in a context, i.e. the monoid operations in
VA, CA,B and CA,B×C,
we define
an operation that does multiple, simultaneous insertions in a multicontext from MA,B.
Given sets of multicontexts M1 and M2 and Z⊂ports(M1),
with Z∩ports(m1)=∅ for every m1∈M1,
we denote by M1⋅ZM2
the set of all multicontexts mnew that can be built by taking an element m1∈M1,
inserting at each port x∈Z∩ports(m1) a multicontext m(x)∈M2
and replacing the label of x with the neutral letter e;
with this new label, x has no effect on the image by a homomorphism
of mnew, while it remains available to be used in reasonings
and proofs.
No other label is modified, so that in particular
if m(x) is a copy of m2∈M2 and y∈nodes(m2),
then the counterpart y′ of y in m(x)
satisfies λ(mnew,y′)=λ(m(x),y)=λ(m2,y).
Let M⊂MA,B and
let Z⊆ports(M).
Then with m∈M,
we use the notations Z(m) and Z(M)
for Z∩ports(m) and Z∩ports(M),
respectively. Given a congruence ≅,
we say that Z is ≅-stable
when every pair x,x′ of ports satisfies
∇(M,x)≅∇(M,x′).
Next,
let B′⊆B and let Z be the set of all
ports with label in B′.
With θ≥1 we define the set M(θ,Z)
obtained by pumping θ times the set M at Z,by:
M(1,Z)=M and
M(θ+1,Z)=M(θ,Z)⋅ZM.
This definition of pumping is consistent
with the definition of the
vertical monoid of FA/≅
(where both B and Z are singletons),
with
the
“vertical confusion” defined in [4] (where B and M are singletons),
and with the
“vertical confusion on uniform multicontexts”
also discussed in [4] (where B and M are singletons and
the ports of Z are indistinguishable
by any congruence).
3.2 The algebra of mappings
Let K=(K,U) be a finite algebra and
γ:FA→K a
surjective homomorphism.
We look for a reasonable way of extending K to
FA,B,
besides the one that consists in defining a leaf extension of γ to B.
With this in mind, we define
the algebra of mappings of K, which we denote K#.
To do so, we
show how to translate a congruence in FA,B
into a congruence in FA, and vice versa.
Define a mapping ε^K from MA,B to HA,
that turns m∈MA,B into a forest ε^K(m) by
replacing all port labels with the neutral letter e; define
ε^V from CA,B to VA in exactly the same way;
both mappings constitute a surjective homomorphism from FA,B
to FA.
Thus, given a forest congruence ≈ over FA,B, a forest congruence
is defined in a natural way over FA.
In the other direction, let B={b1,…,bN} be finite
and let ξ be simultaneously regarded as
a vector ξ∈KN a mapping ξ:B→K.
Given m∈MA,B,
define μ:ports(m)→K by μ=ξ∘ν.
From m and ξ,
we build a forest μ˘(m,ξ) over A∪K by
replacing in m every port label ν(x) with ξ(x).
We build μ˘(w,ξ) from w∈CA,B in the same way;
the □-port of w retains its label.
We extend γ to HA∪K by defining γ(h)=h
for every h∈K,
so that ξ in fact is one of the ∣K∣N leaf extensions for γ that can be
built
on B,
and define a mapping γ[m]:KN→K
by
γ[m](ξ)=γ(μ˘(m,ξ));
similarly, we define
γ[w]:KN→U by
γ[w](ξ)=γ(μ˘(w,ξ)).
Next,
we define K# and U#, the sets of all mappings
KN→K and KN→U, respectively, and
given f,f′∈K# and u,u′∈U#,
the operations and vertical action
f+f′:ξ↦f(ξ)+f′(ξ),
uu′:ξ↦u(ξ)u′(ξ), and
uf:ξ↦u(ξ)f(ξ),
so that the pair K#=(K#,U#)
constitutes a forest algebra222A
notation that mentions B, e.g.
K♯B, would actually
be more accurate, if more cumbersome.
and
γ:FA,B→K#
defined by m↦γ[m] and w↦γ[w]
is a homomorphism.
Let ≈ be the nuclear congruence of γ:
we define from it an equivalence between
nodes, also denoted ≈.
Let x,x′∈nodes(M) where M is a set of multicontexts
closed under ≈:
x≈x′ iff λ(x)=λ(x′) and
Δ(M,x)≈Δ(M,x′) and
∇(M,x)≈∇(M,x′);
nodes equivalent under this relation “cannot be told apart by K#”.
We also write (m,x)≈(m′,x′)
and (M,x)≈(M,x′)
in order to specify where the nodes are located.
Given m∈M
and h1,…,hN−1∈H we define
the mapping
[TABLE]
Since M is closed under ≈,
α[m] is the same for every m∈M
and we can use the notation γ[M].
Then with Z={x∈ports(m):ν(x)=bN}, we observe
[TABLE]
[TABLE]
Therefore, every operation
⋅Z
satisfies the compatibility property
for ≈
[7, Definition 5.1].
Proposition 3.1
Let H#n and α#n
be built from Hτ,πn=FA/≈τ,πn and ατ,πn.
Then the nuclear congruence of α#n
is refined by the congruence ≈τ,πn built recursively
over FA,B,
as in Section 2.3.
Hence, H#n≺FA,B/≈τ,πn.
Proof.
By induction on n.
Recall that a given ξ is regarded
both as a mapping ξ:B→Hτ,πn and
as a vector ξ∈(Hτ,πn)∣B∣.
For the n=1 case, we associate to every m∈MA,B
a vector p1(m) with component in Nτ,π and labels in A∪B,
where p1(m)a, a∈A, and p1(m)b, b∈B,
are respectively the number of nodes y
with label λ(y)=a and the number of ports x
with ν(x)=b. The algebra
Hτ,π1=FA/≈τ,π1
is isomorphic to (Nτ,π)∣A∣,
so that, with some abuse of notations, we can write
ατ,π1(m)=p(m),
and given a mapping ξ:B→Hτ,π1,
the image ξ(b) of b∈B
can be represented as a vector ξ1(b)∈(Nτ,π)∣A∣.
Within MA,B, there is an equivalence class
under ≈τ,π1 for every possible value of
p1(m), i.e. every vector in (Nτ,π)∣A∪B∣.
Then given m∈MA,B,
[TABLE]
From there, if
p1(m)=p1(m′), then ατ,π1[m]=ατ,π1[m′].
With n≥2,
the induction hypothesis states that if
m≈τ,πn−1m′, then for every
mapping ξ:B→Hτ,πn−1,
the leaf completions of m and m′ through ξ
satisfy
ξ˘(m)≈τ,πn−1ξ˘(m′).
Assume that
m≈τ,πnm′.
Two nodes or ports x of m and x′ of m′ receive the same label
in the versions of m and m′ relabeled according to
ατ,πn−1 iff
λ(m,x)=λ(m′,x′),
Δ(m,x)≈τ,πn−1Δ(m′,x′)
and
∇(m,x)≈τ,πn−1∇(m′,x′).
By the induction hypothesis, the last two items imply, for every ξ:
[TABLE]
which means that x and x′ receive the same label
in the versions of
ξ˘(m) and ξ˘(m′) relabeled according to
ατ,πn−1, that is,
ξ˘(m)≈τ,πnξ˘(m′).
□
The algebras H#n
and FA,B/≈τ,πn
are not isomorphic, however.
To see this, let τ=2 and π=1,
so that N2,1={0,1,∞},
n=1 and
A={a},
so that
FA/≈2,11
is isomorphic to ODN2,1,
where,
and finally
B={b}.
With m=aab and m′=aa(b+b),
we have α2,11(m)=α2,11(m′),
while
α2,11[m]=α2,11[m′]
is the constant function
that map {0,1,∞} onto ∞.
3.3 Equivalence under pumping
We use the algebra of mappings to define a
“threshold τ, period π equivalence under pumping”
congruence ↔τ,π=⋃i≥0↔τ,π(i)
within FA,B.
First, let ↔∞ denote the relation where two forests are
equivalent iff they are the same up to horizontal permutations within a sum
(we might as well use = instead of ↔∞).
Then we consider a special case of
a multicontext m∈MA,B where any two ports x,x′
satisfy ν(x)=ν(x′)⇔∇(m,x)↔∞∇(m,x′),
that is, their contexts within m are indistinguishable.
Then the ↔∞-stable sets of ports
are exactly the sets ν−1(b), b∈B;
we say that m is suitable for pumping.
Pumping333Note that this formalism also covers the case where
pumping is done “horizontally”, i.e. where we
are dealing with a multicontext of the form m=c+x1+⋯+xk
and where Z={x1,…,xk}.
the singleton {m} along
a ↔∞-stable
set of ports Z, we obtain for each
θ∈N a singleton {m}(θ,Z)={m(θ,Z)}.
We now define ↔τ,π(i), for
every i≥0.
First,
↔τ,π(0) coincides with ↔∞.
Next, the forest congruence ↔τ,π(1) is
generated by the pairs (m(θ,Z),m(θ′,Z))
and the corresponding context congruence by the pairs
(∇(m(θ,Z),x),∇(m(θ′,Z),x′)),
where θ≡τ,πθ′, x∈Z(m(θ,Z)) and x′∈Z(m(θ′,Z)).
Then recursively for i≥1, given a set M of multicontexts closed under
↔τ,π(i) and a set
Z⊂ports(M) that is ↔τ,π(i)-stable,
↔τ,π(i+1)
is the congruence generated by the pairs
(m,m′)
and
(∇(m,x),∇(m′,x′))
where θ≡τ,πθ′,
m∈M(θ,Z), m′∈M(θ′,Z),
x∈Z(m) and x′∈Z(m′).
We denote by Jτ,π=(Jτ,π,Uτ,π) the
(infinite)
quotient algebra
FA,B/↔τ,π
and by
βτ,π the corresponding surjective homomorphism.
Proposition 3.2
Every congruence of finite index over FA is refined by a congruence
↔τ,π.
Proof.
Let H=(H,V)=FA/≈=α(FA)
be finite.
Let B={b1,…,bN} and
let m∈MA,B be suitable for pumping.
Assume that
Z=ν−1(bN); we
pump the singleton {m} along Z.
Given h1,…,hN∈H we define
g=α[m](h1,…,hN) and the mapping
[TABLE]
Observe that α[m(2,Z)](h1,…,hN)=ζ(g)
and in general,
[TABLE]
The mapping ζ generates a subsemigroup
⟨ζ⟩ of HH;
from the threshold and period of
⟨ζ⟩ we obtain integers τ and π
such that, for all combination of m and Z,
we have
α[m(θ,Z)]=α[m(θ′,Z)]
as soon as
θ≡τ,πθ′.
We prove by induction on i, that with these τ and π,
for every ↔τ,π(i)-closed set M
and every ↔τ,π(i)-stable Z⊆ports(M),
every combination of multicontexts
m^∈M(θ,Z) and
m^′∈M(θ′,Z)
satisfies
α[m^]=α[m^′], whenever
θ≡τ,πθ′.
At i=0
the relation ↔τ,π(0) coincides with ↔∞.
If M is ↔∞-closed, then
every pair m,m′∈M satisfies α[m]=α[m′].
Now let Z⊆ports(M) be ↔∞-stable.
For every pair m^,m^′∈M(θ,Z) and any θ≥2,
we have α[m^]=α[m^′],
in particular when m^=m(θ,Z), which means
α[m^′]=α[m(θ,Z)] for all m^′∈M(θ,Z).
Since
M(θ,Z)∪M(θ′,Z) is ↔τ,π(1)-closed whenever
θ≡τ,πθ′, every m^′∈M(θ,Z)∪M(θ′,Z)
satisfies simultaneously m^′↔τ,π(1)m(θ,Z)
and α[m^′]=α[m(θ,Z)].
From this we can deduce that the proposition holds for ↔τ,π(1).
The induction step i≥1 is proved in the same way.
□
Following [4], we say that multicontext m
is uniform when ports(m)
is ↔∞-stable, i.e
all nodes at every depth level have the same label and
the same number of sons. We denote by mVA
the set of all
uniform multicontexts over A; it can be seen as a subset of
MA,{b} where b∈A.
Define a symbol
⊗n
for each n∈N,
and let
⊗N={⊗n:n∈N};
regarding VA as another alphabet, we can see a
uniform multicontext as the image of a word
of ⊗N(VA⊗N)∗ by a mapping
ζ such that
ζ(v)=v for every v∈VA,
ζ(⊗1w)=ζ(w),
ζ(⊗2w)=ζ(w)+ζ(w), etc.
with notation ζ(⊗nw)=⊗nζ(w), and
ζ(ww′)=ζ(w)⋅Zζ(w′),
where Z=ports(ζ(w)).
It is a standard exercise to verify that
mVA is a monoid of transformations of HA and that
ζ is a
homomorphism.
We then associate to H=(H,V)=α(FA)
its multivertical monoid,
defined as mV=α(mVA).
The proof of Proposition 3.2 shows that
the smallest integers τ and π such that
↔τ,π refines ≈
are the threshold and period of the
multivertical monoid mV.
Also, the algorithm described in
[4] to decide whether an algebra has
vertical confusion on uniform multicontexts can be adapted to compute
the values of τ and π.
We generalize the condition tested by this algorithm
to ∗∗⟨⟨Nτ,π⟩⟩
and the variety of monoids
Solτ,π,
generated by iterated block products
of elements of ⟨Nτ,π⟩,
which means a generalization from the
aperiodic to the solvable monoids,
Proposition 3.3
If an algebra G=(G,W) belongs to
∗∗⟨⟨Nτ,π⟩⟩,
then its multivertical monoid belongs to
Solτ,π.
Proof.
It suffices to prove that the
multivertical monoid of a block product of forest algebras
is a block product of their
multivertical monoids.
Let m∈mVA and let x∈nodes(m);
the subtree s=Δ+(m,x) rooted at x
belongs to a sum of 1 or more copies of
s; let
μ(x) be the number of copies of s in this sum,
other than Δ+(m,x) itself. Let
∇−(m,x) be the context in which this
sum is inserted, so that
∇(m,x)=∇−(m,x)(□+⊗μ(x)s).
Let α:FA→H
be a surjective homomorphism: the label of
x in the version mα of m relabeled
according to α is
[TABLE]
Given n∈mVA and y∈nodes(n), we have
[TABLE]
and define “above” and ”below” actions, such that
λ((m⋅Zn)α,x)=λ(mα,x).α(n)
and
λ((m⋅Zn)α,y)=α(m).λ(nα,y).
The fact that the operation ⋅Z is associative
is used to verify
that these actions are monoidal; this ensures that
we indeed have a block product.
□
3.4 The extended algebra
Let K=(K,U) be a forest algebra and let
γ:FA→K be surjective.
Given (m,ν)∈MA,B,
we build all forests s that can be obtained by insertion
of elements of HA at the ports of (m,ν)
and gather their images images γ(s) in a set γ%(m).
This is done under a restriction: we
define a mapping γ%:B→P(K)
and demand that the forest t(x) inserted at x
satisfy γ(t)∈γ%(ν(x)).
To simplify the reasonings and definitions, and
avoid having to deal with irrelevant special cases,
we assume that γ%−1(F)=∅
for every nonempty F⊆K, and
that there exists
a partition C=⋃b∈BCb
where ∣Cb∣=∣γ%(b)∣ for every b∈B,
as well as a bijective leaf extension
γ:Cb→γ%(b).
We define
the notation D=⋃b∈B{b}×Cb,
so that we can write that we work on
FA,D rather than on FA,B×C,
and say that the γ% defined above is the the
universal leaf extension of γ
to D.
We restrict our work to labelings
ψ:ports(m)→C that are consistent with ν,
in the sense that ψ(x)∈Cν(x) for every port x.
Given (m,ν)∈MA,B, we denote by
Ψ(m,ν)=Ψ(m) the set of all mappings
ψ consistent with ν, and given
ψ∈Ψ(m),
we use the notation ψ˘(m,ν) for the multicontext
(m,ν,ψ)∈MA,D.
At some point we will look at more than one algebra at once,
e.g. G and K: then we will wlog assume that
the universal extensions φ% and γ%
are defined in such a way that
(φ%−1(F)∩γ%−1(F))=∅
for every nonempty F⊆G×K.
From the universal extension of γ we define the mapping
γ%:MA,D→P(K)
by
γ%(m,ν)={γ(ψ˘(m,ν)):ψ∈Ψ(m,ν)}.
This is a homomorphism:
we verify this with the horizontal addition,
leaving the rest of the proof to the reader.
[TABLE]
The image of FA,D by γ%
is the extended algebra of K,
with the notation K%=(K%,U%).
In the special case of Hτ,πn,
since τ,π are fixed parameters
most of the time,
we use the
notations H%n and α%n
instead of Hτ,πn%
and
ατ,πn%.
Proposition 3.4
Let H and
K be finite forest algebras.
If K≺H,
then
K%≺H%.
Proof.
Let γ:FA→K and
α:FA→H be
be surjective homomorphisms.
Assume
that for all t,t′∈MA,D,
α(t)=α(t′) implies
γ(t)=γ(t′).
Then with m,m′∈MA,B, we have
[TABLE]
Inclusion in the other direction is proved in the same way,
so that if
α%(m)=α%(m′),
then
γ%(m)=γ%(m′).
□
Proposition 3.5
For every m,m′∈MA,B and n≥1, if
α%n(m)=α%n(m′),
then
m≈τ,πnm′.
Proof.
By induction on n.
For the n=1 case,
as in the proofs of Propositions 3.1,
we associate to every m∈MA,B
a vector p1(m) with component labels in A∪B,
where p1(m)a, a∈A, and p1(m)b, b∈B,
are respectively the number of nodes y
with label λ(y)=a and the number of ports x
with ν(x)=b; we define for t∈MA,D a vector
p1(t) in the same way, with component labels in
A∪C.
Let m∈MA,B: its image
α%1(m)={ατ,π1(ψ˘(m)):ψ∈Ψ(m)}
is determined by the set of all vectors
p1(ψ˘(m)) that can be obtained from p1(m).
While p1(m)a=p1(ψ˘(m))a for every a∈A,
given b∈B we have
[TABLE]
so that for any ψ′∈Ψ(m′),
p1(ψ˘(m))≡τ,πp1(ψ˘′(m′))
implies
p1(m)≡τ,πp1(m′),
from which we conclude
α%1(m)=α%1(m′)⇒m≈τ,π1m′.
Let n≥2:
the induction hypothesis
states that if
ατ,πn−1(ψ˘(m))=ατ,πn−1(ψ˘′(m′))
for some ψ∈Ψ(m) and ψ′∈Ψ(m′),
then
m≈τ,πn−1m′.
Let
pn(ψ˘(m)) denote the
vectors with a component for every
element of the appropriate relabeling alphabet,
which counts the number of its occurrences
in the version of ψ˘(m) relabeled according to
ατ,πn−1.
Then
[TABLE]
Two nodes x and x′
in ψ˘(m) and ψ˘′(m′)
carry the same
element of the relabeling alphabet
only if they carry the same label in m and m′,
and
[TABLE]
by the induction hypothesis, this implies
[TABLE]
hence x and x′
carry with the same
label in the
relabeled versions of m and m′.
Therefore,
α%n(m)=α%n(m′)⇒m≈τ,πnm′.
□
Proposition 3.6
For every n≥1,
H%n∈∗∗⟨Nτ,π⟩.
Proof.
By induction on the structure of the horizontal monoid of
H%n; the induction base
is done on H%1, however.
As in the proofs of Propositions 3.1 and 3.5, we
associate to every m∈MA,B
a vector p1(m),
where it was seen that
for every b∈B we have
[TABLE]
Then
α%1(m)=α%1(m′)
when p1(m)a≡τ,πp1(m′)a for every a∈A
and
p1(m)b≡σ,πp1(m′)b
for every b∈B, where σ≥τ⋅∣K∣, so that
having at least σ ports with
ν(x)=b ensures that
there is enough room for τ occurrences of ψ(x)=c,
for every c∈Cb.
Therefore, H%1∈∗∗1⟨⟨Nσ,π⟩⟩.
At the induction step, let I be the minimal ideal of the horizontal monoid
Hτ,πn. We assume the
existence of q≥n such that for every h∈I,
the language
Lh={m∈MA,B:h∈α%n(m)}
is recognized by an algebra K∈∗∗q⟨⟨Nσ,π⟩⟩.
Let m∈MA,B for which we know that
α%n(m)∩I=∅,
that is,
ατ,πn(ψ˘(m))∈I for at least one ψ∈Ψ(m).
We say that a node y of m is a pathhead
for H%n
when
α%n(Δ+(m,y))∩I=∅
and
α%n(Δ+(m,z))⊆Hτ,πn∖I
for every son z of y.
We build from m
a multicontext m whose
interior nodes are the pathheads and their ancestors,
and where a port x is created
along every edge (y,z) such that
y is an interior node of m and
its son z is not, so that
Δ(m,x)=Δ+(m,z).
Given k∈I
and a mapping χ:ports(m)→Hτ,πn∖I,
the algebra Hτ,πn can recognize
whether the resulting forest χ˘(m)
belongs to the set (ατ,πn)−1(k).
Next, for every pair z,z′∈ports(m) and h,h′∈I,
whether h∈α%n(Δ(m,z))
is determined by K, and the same holds for the question of
whether
h′∈α%n(Δ(m,z′)).
The two hold simultaneously iff there exist
labelings ψ∈Ψ(Δ(m,z)) and ψ′∈Ψ(Δ(m,z′))
that
ensure
ατ,πn∘ψ˘(Δ(m,z))=h and
ατ,πn∘ψ˘′(Δ(m,z′))=h′.
Since the sets
ports(Δ(m,z)) and ports(Δ(m,z′)) are disjoint,
the hypotheses on ψ and ψ′
are independent, and can be reworded as the
existence of a suitable labeling in
Ψ(Δ(m,z)+Δ(m,z′)).
From the induction hypothesis,
whether a node is a pathhead is recognized by an algebra
of the form K□ODM, with
M∈⟨Nσ,π⟩;
a further block product
(Hτ,πn×(K□ODM))□ODM′,
with
M′∈⟨Nσ,π⟩,
recognizes whether k∈α%n(m),
where k∈I.
□
Proposition 3.7
If G∈∗∗⟨⟨Nτ,π⟩⟩,
then
G%∈∗∗⟨⟨Nτ,π⟩⟩
and
mW%∈Solτ,π.
Proof.
This is a consequence of Propositions
3.3,
3.4, and 3.6.
□
Let σ and ρ be the smallest
integers such that
↔σ,ρ
refines
the canonical congruence of
G%:
given Proposition 3.7,
it makes sense to call them the threshold and period of G%.
In the special case of ∗∗⟨⟨Nτ,1⟩⟩,
the condition mW%∈A
supersedes both the aperiodicity of W
and the absence of vertical confusion on uniform multicontexts.
As mentioned earlier, Potthoff’s algebra,
described in Section 5.4,
satisfies these two conditions while
its extended algebra has a non-aperiodic vertical monoid.
4 Recursive Proofs
In this section, we consistently use the following notations.
The
algebra G=(G,W)
is the one whose membership the variety ∗∗⟨⟨Nτ,π⟩⟩ is to be decided.
We associate to G a surjective homomorphism
φ:FA→G with nuclear congruence ≃G.
Given n≥1
and the variety ∗∗n⟨⟨Nτ,π⟩⟩,
let ≈τ,πn,
Hτ,πn=FA/≈τ,πn
and ατ,πn:FA→Hτ,πn,
such that ≈τ,πn is
the finest congruence over FA that satisfies
Hτ,πn∈∗∗n⟨⟨Nτ,π⟩⟩
and
ατ,πn is a surjective homomorphism.
We use the notation ∗∗n⟨⟨Nτ,π⟩⟩
both for the variety of algebras and for the variety of
the forest or context languages that they recognize.
4.1 ≈τ,πN-tight sets
It is already folklore that non-membership in
FO[≺]
of
an algebra G=(G,W) ultimately has to do with mutually accessible elements of its
horizontal monoid, and to R-classes of W.
The following preliminaries confirm this, and introduce some
definitions and facts that are used later.
For every g∈G we define the language Lg=φ−1(g)
and, for K⊂G, LK=⋃k∈KLk.
It is a basic fact that for any alphabet A of size at least 2 and variety of algebras W,
G∈W iff for every surjective homomorphism φ:FA→G,
there exists an element g∈G such that Lg∈W.
Therefore, G∈∗∗⟨⟨Nτ,π⟩⟩ implies the existence of a partition of G into
Gout={g∈G:Lg∈∗∗⟨⟨Nτ,π⟩⟩}
and its complement Gin.
Since we always work with τ≥1,
both
L0=φ−1(0)
and the set TA of all trees over A
belong to ∗∗⟨⟨Nτ,π⟩⟩,
so that Gin=∅.
Given g∈Gout,
we define
Lgmin as the subset of Lg∩TA
where φ(Δ+(y))∈Gout
for every non-root node.
A pathhead for g
is a node x such that
Δ+(s,x)∈Lgmin; there exists
an algebra in ∗∗⟨⟨Nτ,π⟩⟩ that
can recognize whether a node of s is a
pathhead for g. We say that g∈Gout
is minimal
when at least one forest in HA contains a pathhead for g.
A subset of Gout is minimal if
it contains at least one minimal element.
There exists therefore
an algebra in ∗∗⟨⟨Nτ,π⟩⟩ which recognizes every language Lh,
as well as every Lhtree=Lh∩TA, h∈Gin
and every Lgmin for g∈Gout minimal.
We now show that in
every G∈∗∗⟨⟨Nτ,π⟩⟩ the set Gout has at least one minimal
strongly connected subset.
Proposition 4.1
In G∈∗∗⟨⟨Nτ,π⟩⟩,
at least two elements of Gout are mutually accessible.
Proof.
Assume the existence of g∈Gout such that g≤h does not hold for
any h∈Gout, h=g. This implies that g≤g+h does not hold either,
so that in particular, either g+g=g or g+g<g.
We assume that g+g=g; the other case is dealt with in a similar, and simpler, manner.
Consider a forest t=s1+⋯+sn with each si∈TA, and let t∈Lg;
there are two possible cases for t. First, none of the si’s belongs to Lgtree
and whether t∈Lg is determined by
counting, for each h∈Gin, the number of trees in t that belong to
Lhtree; this is done by a block product Hin□ODM
with M∈⟨Nτ,π⟩.
In the second case, at least one tree in t satisfies si∈Lgtree.
This means satisfying two conditions: that
si∈Lgpathhead−⋃k=gLkpathhead,
where
Lgpathhead is the set of all trees that
contain a pathhead for g;
that
the context within si of every pathhead maps g to itself,
which makes it belong to
the subset of VA generated by
[TABLE]
where
GX={h∈Gin∣g+h=g}.
Each set mentioned here is recognizable by an algebra in ∗∗⟨⟨Nτ,π⟩⟩.
Finally, a forest with at least one tree in Lgtree and no tree in
Lktree for any k∈Gout will belong to Lg iff everyone of its other trees
belongs to ⋃h∈GXLhtree.
□
Given J⊂G,
we say that a set of forests S is
diagonal for J if φ works as a bijection
from S to J, i.e.
we can write
S={sj:j∈J} with φ(sj)=j for every j∈J.
If
G∈∗∗⟨⟨Nτ,π⟩⟩, then
there exists
for every n≥1 a non-singleton
J(n)⊆G and a set of forests
S(n)={sj(n):j∈J(n)},
closed under ≈τ,πn-closed and
diagonal for J(n),
which can serve as witnesses for ≈τ,πn.
At least one J⊆G
occurs infinitely often in the sequence, and since
≈τ,πn refines every ≈τ,πm, m<n,
a sequence of witnesses exists where J(n)=J for every n.
A strongly connected J⊆G with this property is said
to be
≈τ,πN-tight.
We denote by J the set of all ≈τ,πN-tight
subsets of G.
Proposition 4.2
An algebra G=(G,W) is outside of ∗∗⟨⟨Nτ,π⟩⟩ iff
G has
a nonsingleton ≈τ,πN-tight subset.
Proof.
It suffices to prove that if G∈∗∗⟨⟨Nτ,π⟩⟩,
then G contains a ≈τ,πN-tight pair.
Assume that G∈∗∗⟨⟨Nτ,π⟩⟩ and that
in every
set of witnesses where φ(si)=k and φ(si′)=k′ for every i,
at least one of k and k′
is not accessible from the other.
Let this be k: denote by
F the strongly connected component to
which k belongs, so that
k is only accessible from elements of F∪Gin,
k′∈F∪Gin
and F=Gout.
We denote by G′ the image of G by the homomorphism
that maps G−(F∪Gin) onto an absorbing element ∞
and works as the identity on F∪Gin.
Then G′∈∗∗⟨⟨Nτ,π⟩⟩, Gout′=F∪{∞} and
the hypothesis implies that
∞ belongs to every ≈τ,πN-tight pair of G′.
Since no subset {h,k} of F is ≈τ,πN-tight,
there exists a large enough n
such that Hτ,πn=ατ,πn(FA)
recognizes Gin and,
given a forest s∈φ−1(Gout′),
determines which subset Lg∪L∞ it belongs to.
This means that for each g∈G′, there exists a first-order formula Φg(s) that takes
value true iff s∈HA satisfies
s∈Lg when g∈Gin, and
s∈Lg∪L∞ when g∈F.
Let s∈L∞ have a subtree t∈L∞
which is minimal in the sense that no strict subforest of t is in L∞;
this t is of the form x⋅t′ where x is the root of t,
λ(x)=a
and g=φ(t′),
with
(φ(a□))g=∞.
If g∈Gin, then
the formula Φg(t′)∧(λ(x)=a)
asserts that
φ(s)=∞.
Otherwise, g∈F: this holds only if for every y∈nodes(t′)
such that φ(Δ+(t′,y))∈Gout′,
there exist k∈F and b∈A
such that Φk(Δ(t′,y)) is true,
λ(y)=b and (φ(b□))k=∞.
Therefore, the existence in s of a minimal subtree
t∈L∞ can be asserted with a first-order formula.
Next, the formulas developed in this way are used to deal
with the case where
s∈L∞ has no subtree t∈L∞, i.e.
s=t1+⋯+tp with φ(ti)∈Gin∪F
for each i.
□
Proposition 4.3
Let algebra G=(G,W) be outside of ∗∗⟨⟨Nτ,π⟩⟩.
There exists an integer n0 such that, for every
strongly connected set J⊆Gout,
if there exists a set of witnesses for J and ≈τ,πn0,
then J is ≈τ,πN-tight.
Proof.
Let G∈∗∗⟨⟨Nτ,π⟩⟩ and let
J⊆Gout be strongly connected.
If J is not ≈τ,πN-tight, then
no set of witnesses exists for J and ≈τ,πn,
for some n∈N, and no such set exists either for any n′≥n.
Denote by nJ the smallest integer with this property.
Since G is finite, there is a maximum N for the value of nJ over all
subsets J that are not ≈τ,πN-tight.
Define n0=N+1: the existence of a set of witnesses
for J and ≈τ,πn0
implies that J is ≈τ,πN-tight.
□
From now on, we denote by n0 an integer that satisfies the
conditions of Proposition 4.3, and such that
Hτ,πn0
recognizes every language Lh,
as well as every Lhtree=Lh∩TA, h∈Gin
and every Lgmin for g∈Gout minimal.
4.2 Recursive proofs for non-membership
In a recursive proof, a set S(n+1) of witnesses is built by inserting
copies of elements of S(n) in the components of a circuit
M(n+1).
This circuit component is a tuple, i.e. a multicontext equipped with
suitable port labelings, whose purpose is to specify, for
every port x, which witness from S(n), or rather copy thereof, is to
be inserted at x. The
tuples in a circuit must be related to each other,
in such a way that certain of the resulting forests
mapped by φ to distinct elements of the same
≈τ,πN-tight subset of G
are undistinguishable by the congruence
≈τ,πn+1.
From this constraint we define a relation
⟨⋆Hτ,πn⟩
between tuples, such that if
t⟨⋆Hτ,πn⟩t′, then the forests
built from t and t′ are equivalent under ≈τ,πn+1.
This leads to the definition of Condition
RC(G⋆Hτ,πn), which states
the existence of a circuit suitable
to the construction of witnesses for the congruence
≈τ,πn+1,
so that RC(G⋆Hτ,πn)
implies
G∈∗∗n+1⟨⟨Nτ,π⟩⟩.
In the other direction, Lemma 4.6 shows that a circuit that
satisfies RC(G⋆Hτ,πn)
can always be extracted from a set of witnesses for ≈τ,πn+1.
As a consequence, a recursive proof of
non-membership exists for every algebra outside of
∗∗⟨⟨Nτ,π⟩⟩.
To prove
Lemma 4.6 it is convenient to work in terms of “full” proofs,
where the number of witnesses in S(n) increases with n;
this does not describe the
actual proofs that exist in the literature.
Theorem 4.7 shows that a “slender”
proof always exists for every algebra outside of
∗∗⟨⟨Nτ,π⟩⟩.
We define a mapping
ι:⋃n≥n0Hτ,πn→P(G), given by
ι(h)={j∈G:h∈ατ,πn(Lj)},
which
is not necessarily injective.
By our assumption on n0, either
ι(h)⊆Gin and is a singleton,
or
ι(h)⊆Gout
and may have two or more elements.
If ∣ι(h)∣≥2, then every forest in
the set (ατ,πn)−1(h) can be used as a witness
for ι(h) and
≈τ,πn, and therefore
by Proposition 4.3, ι(h)
is ≈τ,πN-tight.
We define
J={ι(h):∣ι(h)∣≥2} and,
for each n≥2 the set
D(n)={(h,j):h∈Hτ,πn∧j∈ι(h)}.
We work in
terms of D(n) instead of
{(h,j)∈D(n):ι(h)∈J},
that is, we
include in the discussion those h∈Hτ,πn
for which ι(h) is a singleton;
this will be useful in the proof of Lemma 4.6.
We say that a set of forests S(n) is full
when it contains a forest sk,ℓ for every
(k,ℓ)∈D(n),
with the notations
S(n)={sk,ℓ(n):(k,ℓ)∈D(n)}.
Similarly, we say that a circuit over
(A,D(n),D(n+1))
is full when it contains a multicontext
mh,j(n+1) for every
pair (h,j)∈D(n+1).
A sequence S(n), n≥1, is
a recursive proof when every forest sh,j(n+1),
h∈Hτ,πn+1, j∈ι(h)
is built by inserting copies of elements of
S(n) at the ports of a
multicontext mh,j(n+1) that belongs to a circuit M(n+1)
over (A,D(n),D(n+1)).
Let s and m be shorthands for sh,j(n+1)
and mh,j(n+1), respectively.
To every x∈ports(m), we assign two labels
μτ,πn(x)∈Hτ,πn
and
ψ(x)∈G.
The labels are consistent
when ψ(x)∈ι∘μτ,πn(x),
which is equivalent to
(φ−1∘ψ)(x)∩((ατ,πn)−1∘μτ,πn)(x)=∅,
and means that
at least one forest r exists that can be inserted at the port
in a consistent way, i.e. such that
φ(r)=ψ(x) and ατ,πn(r)=μτ,πn(x).
Combined with m, the mappings μτ,πn
and ψ define
the tuple t=(m,μτ,πn,ψ);
it is said to be consistent if the two mappings are
consistent at every port of m.
The leaf-completion of m through
each of these mappings will be used at several places.
We establish a relation between two consistent tuples
t=(m,μτ,πn,ψ) and t′=(m′,μτ,πn,ψ).
Let μ˘τ,πn(m), a forest over A∪Hτ,πn,
be the leaf-completion of m through μτ,πn(x);
define on it the leaf extension of
ατ,πn.
Then in FA∪Hτ,πn,
the nuclear congruence of ατ,πn coincides with
the congruence
≈τ,πn+1.
Next, with
h∈Hτ,πn, v∈Vτ,πn and j∈G,
let
P[Hτ,πn](t)h,v,j denote the number of
ports x∈ports(m) that satisfy
h=μτ,πn(x),
v=ατ,πn(∇(μ˘τ,πn(m),x))
and
j=ψ(x).
From this we define
P[Hτ,πn](t)≡τ,πP[Hτ,πn](t′)
as a shorthand for
[TABLE]
Finally, we define the relation ⟨⋆Hτ,πn⟩ between tuples:
[TABLE]
We now describe the meaning of this relation.
We want to determine conditions that are sufficient for two tuples
t=(m,μτ,πn,ψ) and t′=(m′,μτ,πn,ψ)
to be suitable for the construction of witnesses s and s′
such that
s≈τ,πn+1s′, while
φ(s) and φ(s′) are distinct elements of the
same ≈τ,πN-tight subset of G.
The forests s and s′ are built from
t and t′ by inserting at their ports elements of
a set S(n) of witnesses for ≈τ,πn;
the insertion at port x of
the forest s(x) is consistent, in the sense defined above.
Let
s~ and s~′ denote the relabeled versions
of s and s′, relative to
ατ,πn, and let
An=A×Hτ,πn×Vτ,πn
be the corresponding relabeling alphabet.
We assume that φ(s) and φ(s′)
have the appropriate values and we look at how to
satisfy
the constraint
s≈τ,πn+1s′.
Satisfaction is obtained when every symbol of
An occurs the same number of times
(up to ≡τ,π) in s~ and s~′.
We verify this separately on the nodes
of the multicontexts
of m and m′ and on those that belongs to the
copies of elements of S(n)
that are inserted at their ports.
Given x∈ports(m), we have by construction
[TABLE]
where the leftmost equality comes from the fact that in
s, the label of x is the neutral letter e.
Therefore, for every interior node y of m,
[TABLE]
which explains the component
μ˘τ,πn(m)≈τ,πn+1μ˘τ,πn(m′)
in the definition of
t⟨⋆Hτ,πn⟩t′.
Next, we deal with the copies of witnesses from S(n)
inserted at the ports of m and m′.
Define as above the counter
P[Hτ,πn](t)h,v,j
and the shorthand
P[Hτ,πn](t)≡τ,πP[Hτ,πn](t′).
The witness s(x) is a copy of sh,j(n);
let y be one of its nodes,
with
a=λ(sh,j(n),y),
we have
[TABLE]
and
[TABLE]
so that
λ(s~,y)=⟨a,k,vw⟩.
If P[Hτ,πn](t)≡τ,πP[Hτ,πn](t′), then for every combination of
sh,j(n)∈S(n) and v∈Vτ,πn,
the numbers of copies of sh,j(n) that are
inserted at ports x such that
v=ατ,πn(∇(μ˘τ,πn(m),x))
are the same in m and m′, up to ≡τ,π.
Definition 4.4
*We denote by
RC(G⋆Hτ,πn)
the existence of a
circuit M(n+1) over
(A,D(n),D(n+1)) such that
i.
every tuple th,j(n+1)=(m,μτ,πn,ψ)
is consistent and satisfies
ατ,πn+1(μ˘τ,πn(m))=h and
φ(ψ˘(m))=j, and
ii.
every combination of
th,j(n+1) and th,j′(n+1) satisfies condition
th,j(n+1)⟨⋆Hτ,πn⟩th,j′(n+1).
Lemma 4.5
Let n≥2: if a circuit M(n+1)
satisfies RC(G⋆Hτ,πn),
then it also satisfies RC(G⋆Hτ,πn−1),
and can be used to prove
G∈∗∗n+1⟨⟨Nτ,π⟩⟩.
Proof.
Let M(n+1) satisfy RC(G⋆Hτ,πn).
Define the surjective homomorphism σ:Hτ,πn→Hτ,πn−1
by σ=ατ,πn−1∘(ατ,πn)−1.
Then from every consistent tuple (m,μτ,πn,ψ)
in M(n+1) one can build
a consistent (m,μτ,πn−1,ψ),
where
μτ,πn−1=σ∘μτ,πn.
We define from
(m,μτ,πn−1,ψ)
its leaf completion
μ˘τ,πn−1(m);
we have
[TABLE]
and,
for every combination of h∈Hτ,πn,
v∈Vτ,πn and j∈G,
the counters
P[Hτ,πn−1](t)h,v,j satisfy
[TABLE]
Therefore, if S(n−1) is a vector of witnesses for
≈τ,πn−1, then
M(n+1)⋅S(n−1) is a vector of witnesses for
≈τ,πn.
Note that since
∣Hτ,πn+1∣>∣Hτ,πn∣,
some pairs (h,j)∈D(n)
end up with more than one witness.
Sets of witnesses
S(1),…,S(n)
can be built from a
circuit M(n+1)
that satisfies
RC(G⋆Hτ,πn),
as follows.
First, select
a set S(0) that contains a forest
sJ,j for every (J,j) and h
with J=ι(h) and (h,j)∈D(n),
such that φ(sJ,j)=j,
and build
S(1)=M⋅S(0):
for any two tuples
th,j(n+1) and th,j′(n+1)
and the corresponding forests
sh,j(1) and sh,j′(1),
having
th,j(n+1)(G⋆Hτ,πn)th,j′(n+1)
ensures that every a∈A occurs the same number of times
(up to ≡τ,π) in sh,j(1) and sh,j′(1),
so that
S(1) constitutes a set of witnesses for
≈τ,π1.
Then recursively,
S(n)=M⋅S(n−1)=Mn⋅S(0),
for every n≥2.
□
Lemma 4.6
G∈∗∗⟨⟨Nτ,π⟩⟩*
if, and only if there exists a sequence of full circuits M(n+1), n≥1,
that satisfies RC(G⋆Hτ,πn).*
Proof.
By the above discussion, satisfaction of
RC(G⋆Hτ,πn) by every M(n+1), n≥1,
ensures that this sequence of circuits
can be used to build a recursive proof.
For the only if direction,
assuming G∈∗∗⟨⟨Nτ,π⟩⟩,
we want to prove the existence of a sequence of circuits where each
M(n+1), n≥1, satisfies RC(G⋆Hτ,πn).
By Lemma 4.5, it suffices to prove this for n above a
large enough threshold: we select for this
the parameter n0 defined at the end of Section 4.1.
The proof uses the algebra
G′=G×Hτ,πn;
since
G∈∗∗⟨⟨Nτ,π⟩⟩,
we have
G′∈∗∗⟨⟨Nτ,π⟩⟩.
With the notations
G′=(G′,W′)
and Gout′={g′∈G′:Lg′∈∗∗⟨⟨Nτ,π⟩⟩},
we observe that every
≈τ,πN-tight subset of Gout′
can be written J′=J×{h},
where J⊆ι(h) and
ι(h) is ≈τ,πN-tight.
To see this, consider two elements (j,h) and (k,ℓ)
of J′.
If h=ℓ, then Hτ,πn
can tell apart the languages L(j,h) and L(k,ℓ),
which means that J′ is not ≈τ,πN-tight.
Next, if h=ℓ
but j and k do not belong to the same
≈τ,πN-tight
subset of Gout, then by
Proposition 4.3,
ατ,πn maps the
sets φ−1(j) and φ−1(k)
onto distinct subsets
(actually, singletons)
of Hτ,πn.
Meanwhile, given
h∈Hτ,πn, a contradiction argument shows that if
ι(h) is a ≈τ,πN-tight
subset of Gout, then ι(h)×{h} is also ≈τ,πN-tight.
The proof consists in showing that any full vector of witnesses
for ≈τ,πn+1 can be used to define a full circuit
M(n+1) that satisfies RC(G⋆Hτ,πn).
Let s be such a witness; we build
a multicontext m from s as follows.
In m, every leaf is a port; the interior nodes,
i.e. the set interior(m),
are exactly those y∈nodes(s)
for which Δ(s,y) contains at least one pathhead
for Gout′; in other words, every strict
ancestor of a pathhead
becomes
an interior node of m.
Next, we insert a port x along every edge (y,z) where
y∈interior(m) and z∈nodes(s)−interior(m),
and we remove the subtree of s rooted at z.
We will prove that
the resulting multicontext has the required properties.
Let y∈nodes(s) and let s~ denote the version of s relabeled
relative to ατ,πn: we show that the triple
λ(s~,y)=⟨λ(s,y),ατ,πn(Δ(s,y)),ατ,πn(∇(s,y))⟩
can be used to
determine whether y is a pathhead for Gout′.
Let TA denote the set of all trees over A,
J the set of all ≈τ,πN-tight
subsets of Gout,
and define
[TABLE]
Since TA is recognized by
every algebra
Hτ,π2 where τ≥1,
each set (ατ,πn)−1(h) is either a subset of
TA or disjoint with it.
Then the set of all forests that contain
a pathhead for Gout′ is recognized
by Hτ,πn through
ατ,πn
and the set Hph of all elements of
Hτ,πn
accessible from
HTree.
A node y is a pathhead
for Gout′ when
ατ,πn(Δ+(s,y))
belongs to Hph and
ατ,πn(Δ(s,y))
does not.
The nodes of m are those y∈nodes(s)
for which ατ,πn(Δ(s,y))∈Hph,
that is, y is a strict ancestor
of at least one pathhead.
To build the ports of m,
we take each father-son pair y,z
in s
where y∈interior(m) and z∈interior(m),
and insert a port x between the two,
so that
Δ(s,x)=Δ+(s,z)
and also
β(∇(s,x))=β(∇(s,z))
for every homomorphism β.
By construction, m and x satisfy
ατ,πn(∇(μ˘τ,πn(m),x))=ατ,πn(∇(s,z)),
ατ,πn(μ˘τ,πn(m))=ατ,πn(s)
and
φ(ψ˘(m))=φ(s).
We also observe that
φ(Δ(s,x))=φ(Δ+(s,z))
is determined by ατ,πn(Δ+(s,z)).
Indeed, either z is not a pathhead and the set
ι(ατ,πn(Δ+(s,z)))
is not ≈τ,πN-tight,
or z is a pathhead, a case
discussed in the proof of Proposition 4.1.
Hence,
the value of λ(s~,z)
determines both
μτ,πn(x)=ατ,πn(Δ+(s,z))
and
ψ(x)=φ(Δ+(s,z));
from there, it determines which counter
P[Hτ,πn](t)h,v,j the port x
contributes to.
Given a vector of integers P,
the algebra Hτ,πn+1
can therefore recognize whether
P[Hτ,πn](t)h,v,j≡τ,πPh,v,j.
As a consequence, given another witness s′
and the multicontext m′ built from s′,
if s′≈τ,πn+1s, then
the corresponding tuples
t and t′
satisfy P[Hτ,πn](t)≡τ,πP[Hτ,πn](t′).
Finally, the fact that the interior nodes of m are exactly those y∈nodes(s)
for which Δ(s,y) contains a pathhead,
means that the labels they carry in s~ are distinct from those of
the other nodes of s.
Therefore, defining
μτ,πn(x)=ατ,πn(Δ(s,x))
for every port of m, we obtain
μ˘τ,πn(m)≈τ,πn+1μ˘τ,πn(m′).
Defining further ψ(x)=φ(Δ(s,x)), we build tuples that
are equivalent under
⟨⋆Hτ,πn⟩
and thus
constitute a circuit that satisfies RC(G⋆Hτ,πn).
□
The number of forests in a full vector of witnesses S(n)
increases with n, while the existing examples of proofs
describe sequences S(n), n≥1, where all vectors have
the same size. We say that a proof is
slender when,
for every combination of a ≈τ,πN-tight
set J and of j∈J, it contains
at most one witness sh,j(n) such that ι(h)=J.
Theorem 4.7
We have \mathcal{G}\not\in{{\mathbf{**}}}\langle\!\langle\mathbb{N}_{\tau,\pi}\rangle\!\rangle\ iff
G has a slender recursive proof of non-membership
which for each n,
involves a circuit that satisfies Condition RC(G⋆Hτ,πn).
□
Proof.
It suffices to prove the only if direction.
Recall that J denotes the set of all
≈τ,πN-tight subsets of G.
Given n≥1, we define
δn(J)={h∈Hτ,πn:ι(h)∈J},
we pick a representative of every class
for the equivalence relation
δn∘ι(h),
and define
ρn:δn(J)→δn(J)
which maps every h∈δn(J) to its representative.
Let t=(m,μτ,πn,ψ) and
t′=(m′,μτ,πn,ψ) be two tuples in
M(n+1), let s and s′ denote the witnesses
built from them, with
s≈τ,πn+1s′ and
φ(s)=φ(s′).
In order to use m in
a slender proof,
we modify the labeling of every x∈ports(m),
we replacing μτ,πn(x)
with
ρn(μτ,πn(x)).
We then build from m a forest sρ
by inserting at every port x a copy of sρn(μτ,πn(x)),ψ(x)(n),
(instead of sμτ,πn(x),ψ(x)(n));
the construction of
sρ uses a slender subset
Sρ(n) of S(n),
where sh,j(n)∈Sρ(n)
iff h=ρn(h).
By construction, we have
φ(sρ)=φ(s).
We build
sρ′ from m′ in the same way.
Verifying that
sρ≈τ,πn+1sρ′
will lead us to conclude that
sρ and sρ′ are witnesses
for the same ≈τ,πN-tight set as s and s′.
This is done by induction on n,
proving for every i≤n and every pair
x,x′ of nodes or ports in m and m′ that if
Δ(s,x)≈τ,πiΔ(s′,x′)
and
∇(s,x)≈τ,πi∇(s′,x′),
then
Δ(sρ,x)≈τ,πiΔ(sρ′,x′)
and
∇(sρ,x)≈τ,πi∇(sρ′,x′)
hold as well.
As a consequence, those nodes of m
and m′ that carry the same label (element of An)
in the relabeled versions of s and s′
will also carry the same label (but possibly not the original one)
in the relabeled versions of sρ and sρ′;
the tuples tρ and tρ′ built from them
are equivalent under
⟨⋆Hτ,πn⟩.
Therefore, with this method one can start with a sequence S(n),
n≥1, of full
sets of witnesses and build from it a slender proof.
□
If I is a subset
of L such that ⋃J∈IJ
is strongly connected
and is maximal relative to this property, then
the witnesses for I constitute by themselves a recursive proof.
We can therefore from now restrict our work on
proofs that involve sets L where ⋃J∈LJ
is strongly connected.
Let
NRC(G⋆Hτ,πn)
denote the negation of
RC(G⋆Hτ,πn);
by Lemma 4.5,
it
states
that there does not exist witnesses for
G relatively to Hτ,πn.
It
is tempting to try to obtain from it a simpler formula,
such as
∀t,t′:t⟨⋆Hτ,πn⟩t′⇒φ(t)=φ(t′),
which is an equational definition for a variety of forest algebras.
This does not work, because
RC(G⋆Hτ,πn)
applies only within the strongly connected components of G.
For example,
consider the language L over A={a,b}
defined by
at least one node is an ancestor of exactly one node labelled b,
whose syntactic algebra belongs to
∗∗2⟨⟨N2,1⟩⟩, and the tuples t=ψ˘(m) and t′=ψ˘′(m) built over the
multicontext
m=a(a(x1+x2)+a(x3+x4)) with three interior nodes (labelled a)
and four ports where the mappings ψ and ψ′ are defined by
ψ(x1)=ψ(x3)=ψ′(x1)=ψ′(x2)=a
and ψ(x2)=ψ(x4)=ψ′(x3)=ψ′(x4)=b,
so that ψ˘(m)∈L, ψ˘′(m)∈L,
and
t⟨⋆H2,12⟩t′.
4.3 Uniform proofs
Recall that ↔σ,ρ
denotes the
equivalence-under-pumping congruence in FA,B,
and Jσ,ρ=(Jσ,ρ,Uσ,ρ) the quotient algebra.
We use the same notations for the corresponding objects defined over FA.
Given a tuple
t=(m,ν,ψ), for
every combination of
v∈Uσ,ρ and (J,j)∈D we define
the counter
[TABLE]
From this, we define a relation in MA,D:
[TABLE]
In a circuit M(n+1) found in a proof-by-pumping,
every subcircuit TJ(n+1) is diagonal and closed under
⟨⋆Jσ,ρ⟩: we denote by
RC(G⋆Jσ,ρ)
the proposition that states the existence of such a circuit
M(n+1).
Since
↔σ,ρ
refines ≈τ,πn+1,
the relation
⟨⋆Jσ,ρ⟩ refines
⟨⋆Hτ,πn⟩, and therefore
RC(G⋆Jσ,ρ)
implies
RC(G⋆Hτ,πn).
Consider an algebra
G∈∗∗⟨⟨Nτ,π⟩⟩ and
let σ and ρ be the threshold and period of G%.
This means that G belongs to the variety
∗∗⟨⟨Nτ,π⟩⟩∧⟨⟨Jσ,ρ⟩⟩,
where
⟨⟨Jσ,ρ⟩⟩ is the variety of
finite forest algebras
generated by Jσ,ρ.
We can repeat with the congruences
(≈τ,πn∩↔σ,ρ),
for every n≥1, the work done for Proposition 2.2 and
Theorem 4.7 and assert the existence of an algebra
Kτ,πn,σ,ρ=FA/(≈τ,πn∩↔σ,ρ)
such that
G∈∗∗⟨⟨Nτ,π⟩⟩∧⟨⟨Jσ,ρ⟩⟩
if, and only if
NRC(G⋆Kτ,πn,σ,ρ)
for some n≥1.
Since
Kτ,πn,σ,ρ≺Jσ,ρ,
this implies
NRC(G⋆Jσ,ρ).
We obtain the following.
Proposition 4.8
With σ and ρ
the threshold and period of G%,
if G∈∗∗⟨⟨Nτ,π⟩⟩,
then
NRC(G⋆Jσ,ρ).
The values of σ and ρ
are computable in finite time;
with them in hand,
determining the existence of a
circuit T that satisfies
RC(G⋆Jσ,ρ)
is a recursively enumerable problem.
If the corresponding algorithm stops,
then
G∈∗∗⟨⟨Nτ,π⟩⟩.
However, it
is not clear whether
a whole proof can be built from T,
that is, whether for every n≥2,
knowledge of T suffices to determine
every set of witnesses S(n).
If so, then this would be a uniform proof of non-membership.
The results of Section 4.2 make no mention
of uniformity. However,
the existing Ehrenfeucht-Fraïssé games are uniform
and are built along
two construction mechanisms,
which we call
proof-by-copy and proof-by-pumping,
where the former is a special case of the latter.
4.3.1 Proof-by-copy
In certain cases such as the Boolean algebra (see Section 5.1),
condition
RC(G⋆Hτ,πn)
can be satisfied
with a circuit T where each subcircuit TJ is
built over the same multicontext,
that is,
mJ,j↔∞mJ,j′
for all j,j′∈J.
Then T actually satisfies
RC(G⋆Hτ,πn)
for every n≥1 and can be used in
the construction of every set of witnesses
S(n), in the way described in the
proof of Lemma 4.5,
and
proving
G∈∗∗⟨⟨Nτ,π⟩⟩
is just a matter of creating copies of T and assembling them444Observe that
the depth of this construction increases linearly with n. into witnesses,
hence the name “proof-by-copy”.
We verify that the existence of such a proof is a
recursively enumerable problem.
Formally, we work with G=(G,W)=φ(FA),
a connected component F, the set
D={(J,j):∅=J⊆F,j∈J},
and, to make things simpler, the assumption that
F is either the minimal ideal of G,
i.e.
wF⊆F for every w∈W,
or F∪{∞} is an ideal,
where w∞=∞ for every w∈W.
Given J⊆F, we define
J∞=J∪{∞}.
We use
tuples t=(m,ν,ψ)∈MA,D
with (ν,ψ):ports(m)→D.
Since
↔∞
coincides with =, up to horizontal permutation of siblings,
we confound FA with FA/↔∞
and the canonical homomorphism with the identity mapping.
The support of a tuple t is
[TABLE]
its set of inputs is
inp(t)=inp(m)={J⊆F:∃x∈ports(m),ν(x)=J}.
Next, for
every combination of
v∈VA and (J,j)∈D we define
the counter
[TABLE]
with P[FA](t)v,J,j=0
whenever J∈inp(t)
or
v∈sup(t).
Finally, we define a relation in MA,D:
[TABLE]
A proof-by-copy for G, if it exists, can be found by an
algorithm that traverses the set of all slender
circuits T over subsets of F,
and stops when it has found one where every
subcircuit TJ is diagonal and closed under
⟨⋆τ,π⟩.
4.3.2 Proof-by-pumping
Other Ehrenfeucht-Fraïssé games
are sequences of witnesses
where each subcircuit
of T(n+1) is
built over multicontexts that are
equivalent under an
“equivalence under pumping” congruence
↔σ,ρ
that refines
≈τ,πn+1;
hence the name “proof-by-pumping”.
In the examples described in
the next section,
pumping is used in two different ways.
In Section 5.3,
the circuits consists in uniform multicontexts
where all ports belong to the same set Z
and that are obtained from
a single
m∈MA,B by pumping along Z.
In Section 5.4, work starts with a unique
multicontext whose ports are partitioned into two
sets Y and Z; pumping θ times along
Z
creates a multicontext denoted pθ(1);
copies of pθ(1) and pθ+1(1)
are then assembled, by insertion at the Y-ports,
into the actual components of the circuit
used in the construction of the witnesses.
In this section, we investigate the first of
these two techniques; then we examine another
situation where pumping is involved,
but no example of which is known to the author.
An algorithm adapted from the one
described at the end of Section 4.3.1 can verify the existence of a
suitable T where every subcircuit
TJ is diagonal and closed under
⟨⋆Jσ,ρ⟩,
for a given pair σ,ρ.
The question then arises, whether
it is sufficient to run this test
for only one such pair,
that is, whether given
a suitable circuit M for some
σ, ρ and n, such that
↔σ,ρ refines
≈τ,πn+1,
one can for every sequence of pairs
σi,ρi, i≥1, for which
↔σi,ρi refines
≈τ,πn+i,
obtain by pumping the components of M a suitable circuit where every subcircuit
is closed under
⟨⋆Jσi,ρi⟩.
We now develop tools that make it possible to explore pumping
in some depth.
Let M⊂MA,B be a set closed under
↔σ,ρ
and let Z⊂ports(M) be
↔σ,ρ-stable,
which means that pumping M along Z is possible.
Let ν(x)=J for every x∈Z
and let T be a set of tuples defined over M,
with for each j∈J at least one element t
such that φ(t)=j.
We work with a tuple
t be defined on
m^∈M(θ,Z)
and built from T through
consistent insertions, where a tuple t∈T can be
inserted at a port x only if
φ(t)=ψ(x).
Within m^ we associate to
every copy m of an element of M
the parameters depth and height, as follows.
The
depth level d=1 in m^
is a singleton mˉ1 containing
a unique copy of a multicontext of M.
Then recursively, depth level d+1 is the set mˉd+1 of all copies of
elements of M inserted at the ports of Z(mˉd),
which we call the Z-ports of mˉd.
Then m is at depth d iff m∈mˉd.
We define the height
together with the equivalence class
[[m^]]σ,ρ=[[M(θ,Z)]]σ,ρ
of
m^∈M(θ,Z) under
↔σ,ρ,
as follows.
If θ<σ,
then
[[m^]]σ,ρ=M(θ,Z),
all Z-ports of m^
are located at depth θ, and
the height of m∈mˉd is defined
as the class
of h=θ−d under the congruence ≡σ,ρ.
Otherwise θ≥σ,
and the class
[[m^]]σ,ρ
consists in the union of all
sets
M(θ′,Z) for which
θ′≡σ,ρθ plus,
because ↔σ,ρ
is a congruence,
every multicontext obtained from
an element of [[m^]]σ,ρ
by replacing a sub-multicontext
n^
with some n^′
that satisfies
n^′↔σ,ρn^.
Let m^′ be obtained from
m^ by this substitution,
and let x∈ports(mˉd) be the port in which
n^ has been replaced with n^′.
If n^∈M(δ,Z) for some δ<σ,
then
n^′∈M(δ,Z),
and the height of x and of the m∈mˉd it belongs to,
is unmodified.
Otherwise, n^∈[[M(δ,Z)]]σ,ρ
with δ≥σ, and the height of nˉ1, the top
depth level in n^, is
the class
of h(n)=θ−1 under the congruence ≡σ,ρ.
Making the induction hypothesis, that
n^′↔σ,ρn^
implies h(n′)≡σ,ρh(n), we conclude that
the height of x and of the m∈mˉd it belongs to,
is unmodified.
Let x be located in m^ at height h and depth d:
the subforest
Δ(m^,x) belongs to [[M(h,Z)]]σ,ρ
and
the context
∇(m^,x) can be seen as the result of taking
a multicontext in [[M(d,Z)]]σ,ρ
consisting in the top d depth levels of m^,
and inserting
an element of [[M(h,Z)]]σ,ρ at every Z-port of
mˉd other than x.
Next, given another multicontext
m^′∈[[M(θ′,Z)]]σ,ρ,
d′≤θ′ and x′∈Z(mˉd′′),
we have
[TABLE]
We say that the Z-ports of m^ are at
the bottom and that the other ports are on the side.
Then, given y∈ports(mˉd) and
y′∈ports(mˉd′′) located on the sides,
with m(y) and m′(y′) the copies of elements of M
they belong to, we have
[TABLE]
Let
t=(m^,ν,ψ)
and t′=(m^′,ν,ψ) satisfy
t⟨⋆Jσ,ρ⟩t′,
so that
m^↔σ,ρm^′,
and assume that m^∈[[M(δ,Z)]]σ,ρ
with δ≥σ.
Let y∈ports(t) and y′∈ports(t′) be located on the sides,
at depth levels d and d′,
and heights h and h′, respectively.
For every u∈Uσ,ρ,
they contribute to the counters
P[Jσ,ρ](t)u,I,i and
P[Jσ,ρ](t′)u,I,i,
only if
y↔σ,ρy′.
Depending on d we distinguish three cases:
if d<σ,
i.e. y is located in the top σ−1 levels of m^,
this implies d′=d;
else if
h<σ,
i.e. y is located at height less than σ in m^,
this implies h′=h.
Hence
if t⟨⋆Jσ,ρ⟩t′,
then the components of
P[Jσ,ρ](t) and
P[Jσ,ρ](t′) corresponding to the top and bottom
regions match
on a level-by-level basis.
Otherwise y is located
in the “middle region” of m^, where
d≥σ and h≥σ, and
where
the ports are partitioned according to (d−σ)modρ
and (h−σ)modρ,
and the ports within the same class contribute to the same counters.
The contributions of all ports within the same class are added up.
Until now, dealing with modular quantifier and arithmetics
was no more complex than working only on the variety
∗∗⟨⟨Nτ,1⟩⟩=FO[≺],
i.e. the case where π=1. The situation changes here,
as there will be places where
modular arithmetics adds an extra layer of complexity to the
work being done.
From now on, therefore, we restrict ourselves to the case where
π=1.
We now look at a type of circuit
where the multicontexts of TJ are obtained by
pumping at the Z-ports of a set M,
i.e. MJ⊂[[M(θ,Z)]]σ,ρ
for some θ∈N
and the multicontexts of TJ have Z-ports
that are available for further pumping.
The sets of tuples
with this property is recursively enumerable.
We show that
for every χ>σ
a set UJ closed under
⟨⋆Jχ,1⟩
can be built from TJ by pumping along Z.
If every subcircuit
of a circuit T that satisfies
RC(G⋆Jσ,1)
falls into this case, then
for every χ>σ
one can obtain by pumping T a circuit U that satisfies
RC(G⋆Jχ,1);
in other words, a uniform
proof-by-pumping for
G∈∗∗⟨⟨Nτ,1⟩⟩
can always be built from T.
Later in this section, we will
look at another type of circuits
and show that in some circumstances,
TJ cannot be used to build an entire proof.
Let the vector P[Jσ,ρ](t)Z
consist of the components of
P[Jσ,ρ](t) associated to the
Z-ports, and
P[Jσ,ρ](t)Y
consist in all the other components.
With the notations mˉ1={mtop} and
mˉ1′={mtop′},
we define tuples
ttop=(mtop,ν,ψ)
and ttop′=(mtop′,ν,ψ)
from t and t′;
we observe that if
t⟨⋆Jσ,ρ⟩t′,
then
P[Jσ,ρ](ttop)Y≡τ,πP[Jσ,ρ](ttop′)Y.
Also, since
mtop↔σ,ρmtop′,
we have
∣Z(mtop)∣≡σ,ρ∣Z(mtop′)∣.
Given a
subcircuit TJ={tj:j∈J} of a circuit that satisfies
RC(G⋆Jσ,ρ),
we define in the same way
the sets Mtop={mjtop:j∈J}
of multicontexts and Ttop={tjtop:j∈J}
of tuples,
and for every θ≥2 we build by
consistent pumping
the set
Ttop(θ,Z)={tj(θ,Z):j∈J}.
We associate to every Z-port x of MJ its depth
level θx; it satisfies θx≥σ.
Then we build UJ
in three steps, where the first two are:
i.
build the set Ttop(χ,Z);
ii.
for each j∈J, build u~j from tj(χ,Z),
by inserting
at every Z-port x a copy of tψ(x).
Every
Z-port x of u~j is a copy of some Z-port z of TJ
and is located at depth θx=χ+θz.
For all j,j′∈J, the properties of Ttop imply
[TABLE]
which means that the counters corresponding to
the top χ levels of u~j and u~j′ match
(under the ≡τ,1).
The third step is:
iii.
build uj by inserting, at every Z-port x of u~j,
a copy of tψ(x)(η(x),Z).
To determine the integer η(x), we
associate to Ttop a ∣J∣×∣J∣
matrix A with entries in Nτ,1, where
Aij=∣{x∈Z(titop):ψ(x)=j}∣.
Then (Aδ)ij is the number of Z-ports
x in ti(δ,Z) such that ψ(x)=j.
The powers of this matrix constitute a finite semigroup
with idempotent element Aω: if every η(x) is a multiple
of ω, then
[TABLE]
will hold for every pair x,x′ of Z-ports of u~j
that satisfy ψ(x)=ψ(x′).
Since for all j,j′∈J and for all t,t′∈TJ,
[TABLE]
we have
P[Jχ,1](u~j)Z≡τ,1P[Jχ,1](u~j′)Z
and from there
P[Jχ,1](uj)Z≡τ,1P[Jχ,1](uj′)Z.
We now claim that if every η(x) satisfies,
η(x)≥τ+χ, then
P[Jχ,1](uj)Y≡τ,1P[Jχ,1](uj′)Y.
This already holds for the top χ levels in
uj and uj′.
Meanwhile, the lower χ levels in
tψ(x)(η(x),Z) consist of copies of
elements of Ttop;
their numbers in uj and uj′
match, thanks to Equation (1);
this ensures a match in the bottom region.
The middle region consists in the tuples inserted at step ii
and in elements of
Ttop(ξ(x)−χ,Z), namely the
tuples inserted at step iii minus their
lowermost χ levels.
By Equation (1) and the fact that
η(x)−χ≥τ for every Z-port x of uj and uj′,
the contributions of the tuples inserted at step iii
match with one another.
Concerning the tuples inserted at step ii,
let t,t′∈TJ and consider
two non-Z-ports y,y′,
located at depth levels d,d′ and height levels h,h′
in t and t′, respectively, denoting by
m(y) and m′(y′) the copies of multicontexts from M
they belong to.
If
y↔σ,1y′,
then
∇(m(y),y)↔σ,1∇(m′(y′),y′).
The reverse implication works in t,t′ only if
d≡σ,1d′ and
h≡σ,1h′.
Let t be inserted at step ii of the construction of u~j:
the same port y
is, in u~j, at depth d+χ≥χ and height at least τ+χ+h≥χ;
then
if t′ is in the same u~j, we have
[TABLE]
Then from this, from
P[Jσ,1](t)Y≡τ,1P[Jσ,1](t′)Y, and from
the fact that the numbers of occurrences in
u~j and u~j′ of elements of TJ
match because
∣Z(tj(χ,Z))∣≡τ,1∣Z(tj′(χ,Z))∣,
we conclude that the counters corresponding to
the middle regions of uj and uj′ balance.
We now look at a circuit where TJ has been built
by pumping along Z but has no
Z-ports, and the
construction of TJ uses two sets of multicontexts M and R closed under
↔σ,1,
and is done in two steps:
pumping at the Z-ports of a set M
to obtain a set SJ={sj:j∈J},
then for each j∈J,
inserting at every Z-port x of sj
a tuple r(x)∈R, which builds tj.
This is a situation where
P[Jσ,1](s)≡τ,1P[Jσ,1](s′)
is not satisfied by some tuples s,s′∈SJ
(otherwise a suitable circuit could be built
solely with SJ,
without the tuples from R), but where
P[Jσ,1](t)≡τ,1P[Jσ,1](t′)
holds for all t,t′∈TJ.
From the latter equivalence we deduce that
P[Jσ,1](s)Y≡τ,1P[Jσ,1](s′)Y
holds for all s,s′, and we have
P[Jσ,1](s)Z≡τ,1P[Jσ,1](s′)Z for some s,s′.
This makes possible a situation where a different tuple
from R has to be inserted at every port of the sj’s
in order for the resulting tj’s to be equivalent under
⟨⋆Jσ,1⟩: this means that
tuples from R inserted at
the antichains of ports Z(sj) can be correlated
in a potentially intricate way.
We assume that using R was necessary, namely
that no set of tuples S={sj:j∈J}
built by
pumping at the Z-ports of any set M
closed under
↔σ,1
can simultaneously be diagonal
and closed under ⟨⋆Jσ,1⟩.
Equivalently, for every vector of counters p, there is a
subset J′⊂J such that, for every s∈S,
having
P[Jσ,1](s)≡τ,1p
implies φ(s)∈J′.
Moreover, the set R of tuples to be inserted at the
ports of S does not satisfy this either, for otherwise
some subset {rj:j∈J}
would be suitable to replace TJ in the construction of a circuit
that satisfies
RC(G⋆Jσ,1).
Provided that M in not itself expressible555This is
the counterpart, in the world of trees and multicontexts,
of the fact that the word language
{wθ:θ≥σ} is star-free,
unless w=xπ for some word x
and π≥2.
as
N(π,Z) for N⊂MA,B and π≥2,
one can prove by induction on the construction of
[[M(σ,Z)]]σ,1
that this set
is recognized
by a forest algebra
in ∗∗⟨⟨Nτ,1⟩⟩,
and that the same holds for every intersection666Actually,
one can prove that
[[T(σ,Z)]]σ,1
is a union of a number of such classes.
of the corresponding set of tuples
[[T(σ,Z)]]σ,1
with an equivalence class for
⟨⋆Jσ,1⟩;
let
H1∈∗∗⟨⟨Nτ,1⟩⟩
recognize everyone of these intersections.
Next, the constraint on R implies that,
for a large enough n, no set of witnesses for J
and ≈τ,1n
can be built from R.
Since
↔σ,1
refines ≈τ,1n,
this means that
no equivalence class for
⟨⋆Jσ,1⟩
contains a diagonal subset of R.
Then in
the very special case where each intersection
of such a class with R
contains elements from at most one set φ−1(j), j∈J,
and given the fact that R is a set of tuples built
over a set of multicontexts closed under
↔σ,1,
there exists an algebra
H2∈∗∗⟨⟨Nτ,1⟩⟩
that, for every t∈TJ, determines
the content of P[Jσ,1](s)Z
and the class of s for ⟨⋆Jσ,1⟩,
and therefore the subset J′⊂J for which
we have φ(t)∈J′.
This means that for every
n′∈N such that both
H1≺Hτ,1n′
and
H2≺Hτ,1n′ hold,
no set of tuples
obtained by pumping S and inserting
elements of R at the Z-ports
can constitute a subcircuit for J
in a circuit that satisfies
RC(G⋆Hτ,1n′).
This special case is a situation where
the existence of a
circuit that satisfies
RC(G⋆Jσ,1),
where
↔σ,1 refines
≈τ,1n+1,
does not imply the existence of
a uniform proof-by-pumping
for
G∈∗∗⟨⟨Nτ,1⟩⟩.
5 Examples of proofs
We show how the techniques and notations of this article
work on examples, of which three are
discussed in the literature under different formalisms
and methods.
5.1 The Boolean algebra
The Ehrenfeucht-Fraïssé game concerning this algebra, described in
[20, Theorem 4.2], is an example
of a proof-by-copy.
In the
Boolean forest algebra B, the horizontal monoid is
the direct product B={0,1}×{0,1}
of the AND and OR monoids (in the first and second component, respectively),
so that ⟨1,0⟩ is the identity and
⟨0,1⟩ is absorbing. Besides the elements of the
form ε+⟨a,b⟩,
the vertical monoid is generated by
φ(∧) and φ(∨), which work on each ⟨a,b⟩∈B
as projections on
the first and second component, respectively.
With A={∧,∨}, the Boolean algebra can be regarded as the
image of FA by the homomorphism φ; in the special case of
one-node forests, this gives
[TABLE]
We
denote by L0 and L1 the languages
φ−1(⟨0,0⟩) and
φ−1(⟨1,1⟩), respectively.
The proof described in Potthoff [20, Theorem 4.2] builds witnesses from a unique circuit that consists in
two tuples t0=(m,ν0,ψ0) and t1=(m,ν1,ψ1)
defined over the strongly connected set
J={⟨0,0⟩,⟨1,1⟩},
where the mappings ν0 and ν1 are constant
(map every port to J) while ψ0 and ψ1
are
depicted on Figure 1; this circuit satisfies
RC(B⋆Hτ,πn)
for every combination of τ and π.
5.2 The duplex Boolean formulas
The “duplex” technique is a way of building examples of algebras
that demand proofs where the
witnesses have to be built simultaneously for two ≈N-tight
sets. We describe the duplex Boolean formulas as an example.
The language consists of those
trees that satisfy the following conditions.
∙
There is at least one interior node.
∙
If a node x is a leaf, then its label
λ(x) belongs to {0,1};
otherwise
λ(x)∈{∩,∪,⊓,⊔}.
∙
The sons of a node with λ(x)∈{∩,⊓}
are either two leaves, or four interior nodes y,y′,z,z′
with λ(y)=λ(y′)=∪ and λ(z)=λ(z′)=⊔.
∙
The sons of a node with λ(x)∈{∪,⊔}
are either two leaves, or four interior nodes y,y′,z,z′
with λ(y)=λ(y′)=∩ and λ(z)=λ(z′)=⊓.
∙
Each node x has a value val(x)∈{0,1,⊥},
defined according to the following rules.
∙
A leaf evaluates to its label, e.g. if λ(x)=1, then
val(x)=1.
∙
An interior node x
with two sons w,w′, which are leaves, evaluates to
val(w)∧val(w′) if λ(x)∈{∩,⊓},
and to val(w)∨val(w′) if λ(x)∈{∪,⊔}.
∙
If x has four sons
and
λ(x)∈{∩,⊓}, then val(x)=⊥
iff none of its sons evaluates to ⊥
and val(y)∧val(y′)=val(z)∧val(z′),
in which case
val(x)=val(y)∧val(y′);
in other words, the AND of the
subforests with roots in
{∩,∪}
and {⊓,⊔} are evaluated separately,
and val(x)=⊥ iff the values coincide.
When x has four sons
and
λ(x)∈{∪,⊔},
val(x) is an OR, obtained in a similar way.
The language of those trees where
λ(root)=∩ and val(root)=1
is not first-order definable.
Witnesses for this are built from an array of forests T consisting in the trees
t(⊝,0), t(⊝,1), t(⊡,0) and t(⊡,1),
and from the circuit M, consisting in
M(⊝)={m(⊝,0),m(⊝,1)} and
M(⊡)={m(⊡,0),m(⊡,1)},
over the alphabets
A={0,1,∩,∪,⊓,⊔}
and
B={⊝,⊡}.
Each multicontext consists in one root, with label ∩ in
those of M(⊝) and ⊓ in M(⊡),
then
four nodes y,y′,z,z′
with λ(y)=λ(y′)=∪ and λ(z)=λ(z′)=⊔;
under each of them are four ports.
In m(⊝,0) and m(⊡,0), two of
the ports below node y carry the label (⊝,0)
and the other two the label (⊡,0);
the same holds for the ports located under z.
Under the node y′,
two ports carry the label (⊝,1)
and the other two (⊡,1);
the same holds for the ports located under z′.
Thus, under nodes y and y′, the
0’s and 1’s of the four port labels in
{⊝}×{0,1}
follow the same pattern as the four ports of
the m0 multicontext defined in
the previous section.
The same observation holds for the ports labels in
{⊡}×{0,1},
as well as the ports under z and z′.
Similarly, the port labels in
m(⊝,1) and m(⊡,1)
follow the pattern of the ports of m1.
Each array of witnesses S(n)=Mn⋅T
consists in
two subarrays S(n)(⊝) and S(n)(⊡),
whose elements
can be told apart by looking at the label of their roots.
5.3 Forest algebras with aperiodic vertical monoid
and uniform vertical confusion
We describe two examples of forest algebras where the
vertical monoid is aperiodic
and that have
vertical confusion on uniform multicontexts.
The first example, discussed in [21],
is the set Leven of those binary trees over a trivial alphabet A={∙}
where all leaves are located at a nonzero even depth.
In its syntactic forest algebra, the horizontal monoid is
{0,e,o,ee,oo,∞},
where [math] is the identity,
e+e=ee, o+o=oo,
and everything else evaluates to the absorbing element ∞.
The one-port context ∙□ is mapped to the
vertical monoid element that maps
[math] and ee to o,
oo to e,
and everything else to ∞.
The vertical monoid of its syntactic forest algebra777In
this example and the next ones, the action on [math] of the
generators of the vertical monoid is not defined; the
size varies depending on how this gap
is filled. The other properties that matter here are not affected.
is aperiodic,
has 19 elements; the non-idempotent ∙□
constitutes a J-class by itself; the others constitute three regular
J-classes analogous to those of the Brandt monoid B4.
Vertical confusion is verified with the multicontext with one interior node,
that is the father of two ports, i.e. ∙(□+□).
The Ehrenfeucht-Fraïssé game that proves that
Leven is not in FO[≺] uses
nontrivial uniform multicontexts
whose interior nodes constitute the set
{md:d≥1}, where md is the complete binary tree
of depth d over A, and where each node at depth d is the father of two
ports. Given a labeling λ:ports(md)→{e,o,ee,oo},
we have φ˘(md,λ)=∞ only if λ is a constant mapping
with image in {e,o}.
A suitable set S1 consists in
se and so, where
se and so are built from m2τ
and m2τ+1, respectively, by inserting e,
at every port.
Let σ(n)=2τ(2n+1+1).
The circuit
M(n+1) consists in multicontexts mσ(n).
and
mσ(n)+1, where all ports carry the same label,
satisfies
RC(G⋆Hτ,πn).
and therefore can be used
to build S(n+1).
Our second example is a language defined as follows:
(1)
it consists of trees which are either one-node with label 0, 1, 2 or ⊥,
or built recursively through insertions in the four-port uniform multicontext m=a(b(y0+y1)+b(z0+z1));
node labels are taken in
{a,b}
and
port labels are undefined;
(2)
among them, exactly those which evaluate to 0 according to the two tables below
belong to the language.
[TABLE]
Besides the identity [math] and the absorbing element ∞,
the horizontal monoid has one element for each of the following:
a tree that evaluates to 0, to 1 and to 2,
and a sum of two such trees.
The vertical monoid of its syntactic forest algebra contains three regular
J-classes, analogous to those of the Brandt monoid B6.
Vertical confusion is verified with the multicontext m given above.
5.4 The Potthoff Algebra
This is the syntactic
algebra of a language of trees over
A={0,1,⊥,△,≑} defined as follows (see [21, Definition 25]):
(1)
it is included in a set which consists in
three one-node trees with labels 0, 1 or ⊥,
and of all those trees
built recursively through insertions in the three-port multicontext m=△(z+≑(y0+y1)),
depicted on Figure 3, where the
port are labelled with variable names;
(2)
it consists of exactly those
trees which evaluate to 1 according to the following two tables.
[TABLE]
Condition (1) can be specified with a first-order formula in FO[≺];
Condition (2) is specified with a formula in FOMod2[≺]
which, at every ‘△’ node x, consider three of the paths that
start at x and end at a leaf, namely
the unique path that traverses only ‘△’ nodes,
through copies of the port z, and that we call the a-path of x,
plus the a-path of each ‘△’ node that is a grandson of x
through its ‘≑’ son. The formula
verifies the parities of their lengths and the label of the leaves
located at their end.
The reader can verify that the horizontal monoid G of this algebra G=(G,W),
consists of 10 elements:
an identity, an absorbing element, and eight mutually accessible elements, two of which
can be interpreted as ‘a △ node with output 0’ and ‘a △ node with output 1’;
we denote these elements 0^ and 1^, respectively.
Using Section 3.3 it can be verified that
↔3,1 refines the canonical congruence of G.
5.4.1 Verifying non-definability with the extended algebra
One thing that places the Potthoff algebra outside of
∗∗⟨Nτ,1⟩
for any τ≥1, is that the vertical monoid W% of
its extended algebra G%
is non-aperiodic.
To see this, we define B={b,b′},
Cb={c0} and Cb′={c0′,c1′};
the extension of φ to C is given by
φ(c0)=φ(c0′)=0
and
φ(c1′)=1.
Consider a labeling where
ν(z)=b
and
ν(y0)=ν(y1)=b′
and we do not care about ψ.
We define a sequence of tuples ti, i≥1,
where t1=(m,ν,ψ) and every
ti is obtained by inserting ti−1 at the
z port of t1.
The corresponding elements of W%
satisfy
φ%(∇(ti,z))=φ%(∇(ti+1,z))
and
φ%(∇(ti,z))=φ%(∇(ti+2,z))
for every i≥1; in particular,
φ%(∇(ti,z))
maps {0} to {0,⊥} if i is even
and to {1,⊥} if i is odd.
In other words, the group Z2 divides the vertical monoid
of G%.
The labeling used here,
where ν(y0)=ν(y1)=b′ means that the images by φ
of the lateral subtrees are not taken into account,
enables G% to test explicitly the
parity of the length of the path between the z port and
the root, i.e. the a-path of ti.
5.4.2 Verifying non-definability with a recursive proof
Potthoff wrote his proof that G is not first-order
in his doctoral thesis [19] but apparently did not publish it elsewhere.
The proof described here is similar in its main lines;
the differences come from the fact that Potthoff’s construction
builds witnesses for the congruence defined by the first-order formulas
of quantifier depth n, whereas we are interested in witnesses for
congruences of the form ≈τ,1n.
We recall that
A={0,1,⊥,△,≑} and we
work with B={by,bz},
J={0^,1^}, and a unique (up to labeling of the ports)
multicontext m, with
Y={y0,y1}=μ−1(by) and Z={z}=μ−1(bz).
The proof consists in circuits that are pairs
ψ˘(pθ(k)),ψ˘(pθ+1(k)),
for every σ≥τ and appropriate integers θ
and k,
such that
ψ˘(pθ(k))⟨⋆Jσ,1⟩ψ˘(pθ+1(k)),
{φ(pθ(k),ψ)=0^
if θ is odd, and
{φ(pθ(k),ψ)=1^
if θ is even.
The first step in the construction of
pθ(k) and pθ+1(k)
consists in pumping m along Z
to create m(θ,Z) and m(θ+1,Z),
then inserting 1^
at the end of the a-paths, and denoting
the resulting multicontexts pθ(1)
and pθ+1(1).
An induction on θ shows that a suitable ψ
exists for every θ,
for which φ(pθ(1),ψ)=⊥.
For every k≥1, we build
multicontexts pθ(k+1) and pθ+1(k+1)
by inserting
copies of pθ(1) and pθ+1(1)
at the ports of Y(pθ(k)) and Y(pθ+1(k)):
if the port y satisfies ψ(y)=0^,
then pθ(1) is inserted if θ is odd (and
φ(pθ(1),ψ)=0^),
otherwise this is pθ+1(1).
We define a coordinate system for
every node and port of pθ(k), k≥1.
With the convention that the root, a
‘△’ node, sits at depth level 1,
we give to the ‘△’ node at depth level d
in pθ(1)
and to its ‘≑’ son
the coordinates
ad(θ) and bd(θ), respectively.
If we work on
pθ(1), i.e. no multicontext is inserted at the ports of
Y(pθ(1)), then
the sons of bd(θ)
are the ports yd0(θ) and yd1(θ).
Otherwise, the two sons of
bd(θ) are ‘△’ nodes; the one
inserted at ydi
receives the coordinate adi1(θ),
its ‘≑’ son bdi1(θ),
and the two ports below it
ydi10(θ) and ydi11(θ).
Along the a-path from adi1(θ),
the ‘△’ node at depth e
receives coordinate adie(θ),
its ‘≑’ son the coordinate bdie(θ),
and the ports below it
ydie0(θ) and ydie1(θ).
In pθ(3), there ports are
replaced with ‘△’ nodes with coordinates
adie01(θ) and adie11(θ), etc.
We now look at two tuples
(pθ(k),ψ) and (pθ+1(k),ψ);
then we observe that
at every depth d≤θ along the a-paths of the roots, exactly one of
ad(θ) and ad(θ+1)
is an a1 node; the other is an a0.
When ad(θ) is an a1, then its son
bd(θ) feeds it a 0^, and this occurs exactly when
ψ(yd0(θ))=ψ(yd1(θ))
if k=1, and when only one of
ad01(θ) and ad11(θ)
is an a1 node, if k≥2.
This is the pattern denoted Q0 in Figure 5.
Otherwise, ad(θ) is a a0 node,
bd(θ) feeds it a 1^, and either k=1 and
yd0(θ) and yd1(θ) carry the same label,
or k≥2 and ad01(θ) and ad11(θ)
are both a0 or both a1 nodes; these
are the patterns Q10 and Q11.
Assume that ad(θ) is the root of a pattern Q0:
we count those ports labelled
with 0^ and those with 1^,
the result is a pair
(nd(θ,0),nd(θ,1))=(1,1)=(20,20). The same work done
on a Q10 rooted at ad(θ)
gives
(nd(θ+1,0),nd(θ+1,1))=(2,0)=(20+1,20−1);
if this is a Q11,
then
(nd(θ+1,0),nd(θ+1,1))=(0,2)=(20−1,20+1), respectively.
Observe that
the numbers
nd(θ,0) and nd(θ+1,0),
as well as
nd(θ,1) and nd(θ+1,1),
differ by 1.
Assume that
ad(θ) is an a1 node;
then for every depth e, exactly
one of ad0e(θ) and
ad1e(θ) is an a1 node
and the root of a Q0 pattern;
the other is the root of a Q10 or a Q11.
When k=2, the leaves of these patterns are
the four ports yd0e0(θ), yd0e1(θ), yd1e0(θ),
and yd1e1(θ);
their contexts within pθ(2) are equivalent under ↔σ,1,
so that it makes sense to
look at the pair of numbers obtained by counting the
0^s and 1^s at these ports; the result
is
[TABLE]
Meanwhile,
ad(θ+1) is an a0 node, so that
ψ(yd0(θ+1))=ψ(yd1(θ+1))
and at every depth e, ad0e(θ+1)
and
ad1e(θ+1) are both
a0 nodes or both
a1 nodes.
In the former case, the ports under
their ‘≑’ sons constitute two Q0 patterns;
otherwise, we can place a Q10 under one of them
and a Q11 under the other. Either way, we obtain
nd0e(θ+1,2,0)=nd1e(θ+1,2,1)=(2,2).
Iterating the process and building
multicontexts pθ(k) and pθ+1(k),
it is possible to redefine
ψ on Y(pθ(k)) and Y(pθ+1(k))
in such a way that for every set of 2k ports
whose contexts are equivalent under ↔σ,1,
the corresponding pairs of numbers
(nd…(θ,k,0),nd…(θ,k,1))
and
(nd…(θ+1,k,0),nd…(θ+1,k,1))
are
(2k−1,2k−1)
below the a0 node,
and one of
(2k−1+1,2k−1−1) and (2k−1−1,2k−1+1)
below the a1 node.
Consider two ports ydi(θ)
within pθ(1) and yd′j(θ+1) within
pθ+1(1), where i,j∈{0,1}: we have
[TABLE]
If θ≥3σ, then for every pair d,i,
the equivalence class of ∇(pθ(1),ydi(θ)) contains
the contexts of
at least four ports of pθ(1) and four of pθ+1(1),
and at least 4σ in the case where σ<d<θ−σ.
In every case, for each class the numbers of ports labelled 0^ and of ports labelled 1^ are both at least 0=20−1.
Applying the same reasoning to pθ(k)
and pθ+1(k), we find that every equivalence class
that contains the context of a port of Y(pθ(k))
contains at least 2k such contexts, and the numbers of
0^s and 1^s are both at least 2k−1−1.
For every threshold τ and iteration level
n, one can take integers σ and k such that
↔σ,1 refines
≈τ,1n+1, and
k≥log2σ,
build the multicontexts pθ(k)
and pθ+1(k): by the
above reasoning, a labelling ψ
can be defined on them, such that
(pθ(k),ψ)⟨⋆Jσ,1⟩(pθ+1(k),ψ).
These tuples constitute a circuit M(n+1)
that satisfies
RC(G⋆Hτ,1n).
Therefore, the Potthoff algebra does not belong to any
variety
∗∗⟨⟨Nτ,1⟩⟩.
6 Conclusion
This research started with the intention of understanding
the mechanisms that underlie the Ehrenfeucht-Fraïssé games
used to prove that a forest algebra lies outside of
∗∗⟨⟨Nτ,π⟩⟩.
Multicontexts and tuples, where ports have multiple labelings,
are at the center of this proof technique,
and the need for a description of how
a forest algebra works on these objects led to the
definition of the algebras G# and G%.
In parallel to this came the observation that
Ehrenfeucht-Fraïssé games
are recursive and uniform; while games with these two properties always
exist on monoids, word languages and
linear orderings, it is not clear whether the same holds
in the world of trees and forests.
We could prove the existence of a recursive proof for every
algebra outside of
∗∗⟨⟨Nτ,π⟩⟩,
while our investigation of proof-by-pumping suggests
has not solved
the question of the existence of a uniform proof.
Given an algebra G=(G,W), with σ and ρ
the threshold and period of G%, Theorem 4.7 can be combined with Propositions 3.7 and 4.8 into this statement:
[TABLE]
or conversely,
[TABLE]
All examples of algebras outside of ∗∗⟨⟨Nτ,π⟩⟩
known to the author satisfy the left hand side of
(3), so that it makes sense to ask
whether
these implications are actually equivalences.
If they are, then since
satisfaction of
RC(G⋆Jσ,ρ)
is a recursively enumerable problem,
membership of G in
∗∗⟨⟨Nτ,π⟩⟩
is decidable, by running in parallel
a search of an n≥1 such that G≺Hτ,πn,
and a test for non-membership that consists in verifying the left hand side
of (3),
knowing that one of them will stop.
Proving that
RC(G⋆Jσ,ρ)
is by itself decidable would of course yield
a procedure more elegant than this.
Satisfaction of RC(G⋆Jσ,ρ)
implies the existence of a recursive proof for
∗∗⟨⟨Nτ,π⟩⟩;
whether it also implies the
existence of a uniform one
is an open question.
If there exist algebras
outside of ∗∗⟨⟨Nτ,π⟩⟩
that have only non-uniform proofs,
then one
would ask whether this
property is decidable.
It is also worth noting that
conversely, Theorem 4.7 suggests
that if membership in
∗∗⟨⟨Nτ,π⟩⟩
is undecidable, then this
might be proved by showing that
the existence of a recursive non-membership proof
is undecidable.
Condition RC(G⋆Hτ,πn) gives a new viewpoint on the combinatorics of
first-order languages; it involves counting under a threshold and a period,
just as in the one developed in
[10, 13],
but in a
nonlocal (and messy) way along an antichain of ports.
Whether they can be deduced from one another remains to be seen.
The author thanks Howard Straubing, both for discussions
about this research and for giving him access to unpublished material.
Discussions with Andreas Krebs and Charles Paperman were also useful.
Bibliography30
The reference list from the paper itself. Each links out to its DOI / PubMed record.
1[1] M. Benedikt, L. Ségoufin, Regular tree languages definable in 𝖥𝖮 𝖥𝖮 \mathsf{FO} and in 𝖥𝖮 𝗆𝗈𝖽 subscript 𝖥𝖮 𝗆𝗈𝖽 \mathsf{FO}_{\mathsf{mod}} , ACM Trans. Comput. Log. 11 (2009) Issue 1, Article 4.
2[2] M. Bojańczyk, L. Ségoufin, Tree Languages Defined in First-Order Logic with One Quantifier Alternation Proc. of the 35th International Colloquium on Automata, Languages and Programming , Lecture Notes in Comp. Sci. 5126 , Springer-Verlag (2008), pp. 233-245.
3[3] M. Bojańczyk, L. Ségoufin, H. Straubing, Piecewise testable tree languages, Proc. of the 23th IEEE Symp. on Logics in Computer Science (2008), pp. 442-451.
4[4] M. Bojańczyk, H. Straubing, I. Walukiewicz, Wreath Products of Forest Algebras, with Application to Tree Logics, Logical Methods in Computer Science 8 4:03 (2012), pp. 1-39.
5[5] M. Bojańczyk, H. Straubing, I. Walukiewicz, Varieties of Forest Algebras, Unpublished manuscript (2012).
6[6] M. Bojańczyk, I. Walukiewicz, Forest Algebras, in Logic and automata, History and Perspectives, E.J.Flum and T.Wilke eds, Amsterdam University Press, 2008.
7[7] S. Burris, H. Sankappanavar, A Course in Universal Algebra , Millenium Edition (2000).
8[8] Z.Ésik, P.Weil, Algebraic recognizability of regular tree languages, Theoretical Computer Science 340 (2005), pp. 291-321.