Non-finitely axiomatisable modal product logics with
infinite canonical axiomatisations
Christopher Hampsona, Stanislav Kikotb, Agi Kurucza and Sérgio Marcelinoc
aDepartment of Informatics, King’s College London, U.K.
bInstitute for Information Transmission Problems, Moscow, Russia
cInstituto de Telecomunicações, Portugal
( )
Abstract
Our concern is the axiomatisation problem for modal and algebraic logics that correspond to various
fragments of two-variable first-order logic with counting quantifiers.
In particular,
we consider modal products with Diff, the propositional unimodal logic of the difference operator.
We show that the two-dimensional product logic Diff×Diff is non-finitely axiomatisable, but can be axiomatised by infinitely
many Sahlqvist axioms. We also show that its ‘square’ version
(the modal counterpart
of the substitution and equality free fragment of two-variable first-order logic with counting to two) is
non-finitely axiomatisable over Diff×Diff, but can be axiomatised by adding infinitely
many Sahlqvist axioms.
These are the first examples of products of finitely axiomatisable modal logics that are not finitely axiomatisable,
but axiomatisable by explicit infinite sets of canonical axioms.
1 Introduction
Ever since their introduction [38, 40, 9],
products of modal logics—propositional multimodal logics determined by classes of product
frames—have been extensively studied; see [8] for a comprehensive exposition and
further references.
In this paper we consider the problem of finding explicit infinite ‘nice’ axiomatisations for non-finitely axiomatisable two-dimensional modal product logics. By ‘nice’ here we mean formulas to which both the canonicity and
first-order correspondence properties of Sahlqvist formulas apply.
Canonicity is an important tool for proving Kripke completeness of propositional multimodal logics
[2, 12].
A modal logic is canonical if it is valid in all its canonical frames.
The analogous algebraic notion of canonical extension is central in
the theory of Boolean algebras with operators (BAOs) [26].
A variety of BAOs is canonical if it is closed under taking canonical extensions.
A modal formula is canonical if the modal logic axiomatised by it is canonical.
Though in general canonicity of a formula is an undecidable ‘semantical’ property [29],
there exist known syntactical classes of canonical formulas, such as Sahlqvist formulas
[37],
and their generalisations by Goranko and Vakarelov [16].
While any set of canonical formulas always axiomatises a canonical logic, Hodkinson and
Venema [24] show that there are canonical logics that are barely canonical
in the sense that every axiomatisation for such logics must contain infinitely many non-canonical
axioms. Further examples of barely canonical elementarily generated logics are given
in [14, 3, 28].
Kikot [28] also obtained the
following general dichotomy result: If a class of Kripke frames is definable by first-order formulas of the
form ∀x0∃x1…∃xn⋀Rλ(xi,xj), then the modal logic generated by such a class is either barely canonical or can be
axiomatised by a single generalised Sahlqvist formula.
In this paper we show some elementarily generated modal logics that are outside of the scope of
this dichotomy.
It is well known that the two-dimensional (2D) modal product logic S5×S5
has a finite axiomatisation with Sahlqvist axioms, describing two commuting S5-modalities [18].
(S5×S5 is the modal counterpart of the substitution and equality free fragment of two-variable first-order logic, where only relational atomic formulas of the form R(x0,x1) are allowed.)
On the other hand, for n≥3 the
n-dimensional product logic S5n is non-finitely axiomatisable [25]
and barely canonical (even though it is canonical and recursively enumerable [18]).
There are also known examples of recursively enumerable (even decidable) 2D products
of finitely axiomatisable modal logics that are not finitely axiomatisable, such as K4.3×S5 [32].
However, so far no canonical axiomatisations for non-finitely axiomatisable products of
finitely axiomatisable logics have been known.
Instead of S5 (the modal logic of all equivalence relations),
here we consider modal products with the finitely axiomatisable [39] logic Diff of all non-equality frames (W,=W).
An arbitrary frame for Diff is a pseudo-equivalence relation: its equivalence classes might contain
both reflexive and irreflexive points.
(In particular, equivalence relations are frames for Diff, and so Diff⊆S5.)
It is easy to see that, unlike equivalence relations, the class of pseudo-equivalence relations is not Horn-definable. Therefore, the general theorem of Gabbay and Shehtman [9] on axiomatising
2D products of Horn-definable logics by their commutator does not apply to Diff×Diff.
However, as pseudo-equivalence relations form an elementary class, it does follow from general results [13, 31, 9]
that Diff×Diff is canonical and recursively enumerable.
We show that the 2D product logic Diff×Diff is non-finitely axiomatisable, but can be axiomatised by infinitely
many Sahlqvist axioms. We also show that its ‘square’ version Diff×sqDiff (the modal counterpart
of the substitution and equality free fragment of two-variable first-order logic with counting to two) is
non-finitely axiomatisable over Diff×Diff, but can be axiomatised by adding infinitely
many axioms that are generalised Sahlqvist à la Goranko and Vakarelov [16].
This way we give the first examples of products of finitely axiomatisable modal logics that are not finitely axiomatisable
but axiomatisable by explicit infinite sets of canonical axioms.
By the correspondence theorem for (generalised) Sahlqvist formulas it follows that the classes
of all frames for both Diff×Diff and Diff×sqDiff are elementary
(unlike the frames for Diffn and S5n whenever n≥3, see [22, 31]).
As Diff-modalities are ‘self-reversive’, it also follows [15] that Diff×sqDiff in fact can be axiomatised by infinitely many Sahlqvist axioms.
Our results can also be formulated in an algebraic logic setting.
Given the full Boolean set algebra B(U×V) of all subsets of the Cartesian
product U×V of some non-empty sets U,V, one can define two additional unary operations
C0=, C1= on it by taking,
for every X⊆U×V,
[TABLE]
Just like usual cylindrifications are algebraisations of the existential quantifier in first-order logic,
these strict-cylindrifications algebraise the ‘there is a different’ first-order quantifier.
We define sRdf2 as the variety generated by all set algebras of this kind,
and sRdf2sq as the variety generated by those ones where U=V for the Boolean unit U×V.
Members of sRdf2 (sRdf2sq) might be referred to as two-dimensional rectangularly (square) representable diagonal-free strict-cylindric algebras.
It follows from general considerations that both sRdf2 and sRdf2sq are canonical varieties.
We show the following:
The equational theory of sRdf2 is non-finitely axiomatisable, but it has an infinite Sahlqvist axiomatisation.
The equational theory of sRdf2sq is non-finitely axiomatisable over that of sRdf2, but it has an infinite generalised Sahlqvist axiomatisation.
While our varieties are the first such among ‘full rectangular’ algebraisations of finite variable fragments of classical first-order logic, a similarly behaving ‘non-rectangular’ algebraisation has been known.
Andréka and Németi [18, 5.5.12] showed that the equational theory of
the variety Crsn of n-dimensional
relativised cylindric algebras
is non-finitely axiomatisable whenever n≥3,
while Resek and Thompson [36, 34] gave an infinite Sahlqvist axiomatisation for it, for any n.
2 Our results and proof methods
2.1 Non-finite axiomatisability
Theorem 1**.**
For any Kripke complete logic L with K⊆L⊆S5,
L×Diff is not axiomatisable using finitely many propositional variables.
Thus, Diff×Diff is not finitely axiomatisable.
Theorem 2**.**
Diff×sqDiff* is not axiomatisable over Diff×Diff using finitely many propositional variables.
*
After providing the necessary definitions in §3 and some general tools in §4,
Theorems 1 and 2 are proved in §5.
In our proofs, we will use the following pattern.
We show that every axiomatisation of a logic L must contain infinitely many propositonal
variables by providing two infinite sequences of frames Fk and Gk such that
every Fk is a frame for L, while every Gk is not,
but if k is sufficiently large compared to m, then we cannot distinguish between Fk
and Gk using m many propositional variables.
2.2 Infinite canonical axiomatisations
Theorem 3**.**
There is an infinite axiomatisation for Diff×Diff consisting of Sahlqvist formulas.
The class of all frames for Diff×Diff is elementary.
For every countable rooted frame F, F is a frame for Diff×Diff iff
F is the p-morphic image of some product of two difference frames.
Theorem 4**.**
Diff×sqDiff* can be axiomatised by adding infinitely many generalised Sahlqvist formulas to Diff×Diff.*
The class of all frames for Diff×sqDiff is elementary.
For every countable rooted frame F, F is a frame for Diff×sqDiff iff
F is the p-morphic image of some product of two difference frames of the same size.
As each generalised Sahlqvist formula is axiomatically equivalent to
a Sahlqvist formula with inverse modalities [15, 16] and
Diff-modalities are ‘self-reversive’, we have
the following (see §7.3 for more detail):
Corollary 5**.**
There is an infinite axiomatisation for Diff×sqDiff consisting of Sahlqvist formulas.
Theorems 3 and 4 are proved in the respective §6 and §7.
In our proofs, we will use the following pattern.
In order to axiomatise Logic_ofC for some class C of frames, we define a recursive
set Σ of (generalised) Sahlqvist formulas, and prove that the following hold:
All formulas in Σ are valid in every frame in C.
For every countable rooted frame F that is not the p-morphic image of some frame in C, there is some ϕF∈Σ such that
ϕF is not valid in F.
Then it follows that Logic_ofC is axiomatised by Σ.
Indeed, let L be the smallest bimodal logic containing Σ. Then we clearly have
L⊆Logic_ofC by (ax1).
On the other hand, by the (generalised) Sahlqvist completeness theorem, L is canonical,
and so Kripke complete. By the (generalised) Sahlqvist correspondence theorem,
the class of all frames for L is an elementary class. Then it is easy to see by
a Löwenheim–Skolem type argument (see e.g. [8, Thm. 1.6]) that L is the logic of its countable
frames, and so by a standard modal logic argument L is the logic of its countable rooted frames.
Now take some ψ∈/L. Then there is some countable rooted frame F such
that F is a frame for L, but ψ is not valid in F.
By (ax2), F is the p-morphic image of some frame in C, and so ψ∈/Logic_ofC.
Now the following two statements clearly follow from the above:
The class of all frames for Logic_ofC is elementary.
For every countable rooted frame F, F is a frame for Logic_ofC iff
F is the p-morphic image of some frame in C.
3 Preliminaries and basic definitions
Our notation and terminology are mostly standard.
We denote the cardinality of a set S by ∣S∣.
Natural numbers are considered as finite cardinals, and we use the usual multiplication operation
and ordering relations < and ≤ among them and the infinite cardinal ℵ0=∣ω∣.
We call S countable if ∣S∣≤ℵ0.
We denote the set of natural numbers by N and its positive members by N+.
We will also use the usual functions
min(X), max(X) and sup(X) with respect to <, for X⊆N∪{ℵ0}.
3.1 Digraphs
We assume that the reader is familiar with the basic notions about digraphs
G=(NG,→)
(see [1] for reference).
Below we summarise the notions used in the paper.
We call a node (vertex) in NG an initial node if it has no incoming → edges,
and a final node if it has no outgoing → edges.
A digraph G− is called a subgraph of G if its nodes and edges are subsets
of the nodes and edges of G, respectively. If the edges of a subgraph G−
consists of all the edges of G whose endpoints are nodes in G−,
then G− is called an induced subgraph of G.
Given two nodes z,z′, a (directed)
path in G from z to z′ is a finite sequence of subsequent edges,
the first one starting in z and the last one ending in z′. The length of a path is the number of
edges in it (we also consider paths of length [math]).
We call a path simple if it does not contain the same edge twice.
A cycle is a path starting and ending at the same node.
G is called acyclic if it does not contain any cycles.
A strongly connected component is a maximal subgraph S such that
for all nodes z and z′ in S there is a path from z to z′.
A finite sequence z0,…,zm of nodes is an undirected path between z0 and zm in G if for every i<m, either zi→zi+1 or zi+1→zi is an edge in G.
An acyclic digraph G is called a directed rooted tree (or tree, for short)
if there is some inital node r (the root) such that for every node z in G there is a unique path from r to z. For each z, the length of this unique path is the height of z.
If z→z′ is an edge in a tree, then z′ is called a child of z. A leaf in a tree is a node
without children, that is, a final node.
Given a finite digraph G and a node r in it, the
tree unravelling of G with root r is the directed rooted tree TG,r=(TG,r,⇒), where TG,r is the set of all paths in G starting at r (with the length 0 path being the root of TG,r), and P⇒P′ iff P′ can be obtained from P by adding an additional → edge to its endpoint.
It is customary to identify each path P∈TG,r with a distinct copy of its endpoint, in particular,
to identify the root of TG,r with r.
3.2 Unimodal and bimodal logics
In what follows we assume that the reader is familiar with the basic notions in propositional multimodal logic
and its possible world semantics (see [2, 4] for reference).
Below we
summarise the necessary notions and notation for the bimodal case only, but we will use them
throughout for the unimodal case as well.
We define bimodal formulas by the following grammar:
[TABLE]
where p ranges over a countably infinite set of propositional variables.
We use the usual abbreviations →, ↔,
and also
[TABLE]
for i=h,v. (The subscripts are indicative of the 2D intuition: h for ‘horizontal’ and
v for ‘vertical’.)
Bimodal formulas are evaluated in bimodal frames: relational structures of the form
F=(W,Rh,Rv), having two binary relations Rh and Rv on a non-empty set W.
A (Kripke) model on F is
a function M mapping propositional variables to subsets of W.
(With a slight abuse of notation, we identify the pair (F,M) with M.)
Given m∈N, we call a Kripke model M m-generated if
there are at most m different propositional variables p such that
M(p)=∅.
The truth relation
‘M,w⊨ϕ’, connecting points in models and formulas, is defined as usual by induction
on ϕ. If M,w⊨ϕ for some model M on F and some point w in M, then
we say that ϕ is satisfied in M, and satisfiable in F.
Given a set Σ of bimodal formulas, we write M⊨Σ if
we have M,w⊨ϕ,
for every ϕ∈Σ and every w∈W. (We write
just M⊨ϕ for M⊨{ϕ}.) We say that ϕ is valid in F,
if M⊨ϕ for every model M based on F. If every formula
in a set Σ is valid in F, then we say that F is a frame for Σ.
The usual operations on unimodal frames and models can be defined on their bimodal counterparts
as well. In particular,
given two frames F=(F,RhF,RvF) and G=(G,RhG,RvG),
a function f:F→G is called a p-morphism from F to
G if it satisfies the following conditions, for all u,v∈F,
y∈G, i=h,v:
uR_{i}^{{\mathfrak{F}}}v\ implies f(u)RiGf(v) (that is, f is a homomorphism),
f(u)RiGy implies that there is some v∈F such that f(v)=y and uRiFv
(the backward condition).
If f is onto then we say that G is a p-morphic image of F.
Similarly to the unimodal case, validity of bimodal formulas in frames is preserved
under taking p-morphic images.
For any model M on F and model N on G, a p-morphism from F to G
is called a p-morphism from M to N whenever, for all propositional
variables p and points x in F, x∈M(p) iff f(x)∈N(p).
If f is onto then we say that N is a p-morphic image of M.
Given two frames F=(F,RhF,RvF) and G=(G,RhG,RvG),
F is a subframe of G if F⊆G and RiF=RiG∩(F×F), for i=h,v.
Given some x∈F, the subframe Fx of F
generated by point x is the subframe of F with the following set Fx of
points:
[TABLE]
We say that a frame F is rooted if F=Fr for some point r. Such a point r is
called a root in F.
A set L of bimodal formulas is called a (normal) bimodal logic (or logic, for short)
if it contains all propositional tautologies and the formulas
□i(p→q)→(□ip→□iq), for i=h,v,
and is closed under the rules of Substitution, Modus Ponens and
Necessitation φ/□iφ, for i=h,v. Given a class C of frames, we always
obtain a logic by taking
[TABLE]
We say that Logic_ofC is the logic of C. It is well known that
[TABLE]
A logic L is called
Kripke complete if L=Logic_ofC for some class C.
Given a bimodal logic L and a recursive set Σ of bimodal
formulas, we say that Σ axiomatises L if
L is the smallest bimodal logic containing Σ.
A logic L is called finitely axiomatisable whenever there is some finite Σ axiomatising L.
3.2.1 Sahlqvist and generalised Sahlqvist formulas
Below we recall the definition of Sahlqvist formulas [37], and generalised (monadic) Sahlqvist formulas of
Goranko and Vakarelov [16, Def.24] for our bimodal language.
A bimodal formula is
positive (negative) if every occurrence of a propositional variable in it is under the scope of an even (odd) number of negations ¬.
A boxed atom is a formula □i1…□inp
where n∈N, i1,…,in∈{h,v} and p is a propositional variable.
A Sahlqvist antecedent is a formula built up from ⊤, ⊥, boxed atoms, and negative
formulas, using ∨, ∧, ◊h and ◊v.
A boxed formula is a formula of the form
[TABLE]
where n∈N, each □i is a finite (possibly empty) sequence of boxes □h and □v,
each ψi is a positive formula,
and p is a propositional variable. The variable p is called the head of the
boxed formula, and all variables in any of the ψi are called inessential variables.
A potential generalised Sahlqvist antecedent is a formula ϕ built up from ⊤, ⊥,
boxed formulas, and negative
formulas, using ∨, ∧, ◊h and ◊v.
Given such a formula ϕ, the dependency digraph of ϕ is a digraph
D(ϕ)=(Nϕ,⇛), where Nϕ is the set of heads of the boxed
formulas in ϕ, and q⇛p iff q is an inessential variable in a boxed formula with head p. If D(ϕ) is acyclic, then ϕ is called a generalised Sahlqvist antecedent.
A (generalised) Sahlqvist implication is of the form ϕ→ψ, where ϕ is a
(generalised) Sahlqvist antecedent and ψ is a positive formula.
A (generalised) Sahlqvist formula is a formula that is built up from (generalised) Sahlqvist implications by freely applying
□h, □v and ∧, and by applying ∨ only between formulas that do not share any
propositional variables.
The (generalised) Sahlqvist completeness theorem says that every logic axiomatised by (generalised) Sahlqvist
formulas is canonical, and so Kripke complete.
The (generalised) Sahlqvist correspondence theorem says that every
(generalised) Sahlqvist formula has a first-order correspondent.
(A first-order formula A in the language having equality and binary predicate symbols Rh and Rv
is called a correspondent of a bimodal formula ϕ, whenever for every frame F,
A is valid in F iff ϕ is valid in F.)
Kracht [29] gives a syntactical description of
first-order correspondents of Sahlqvist formulas.
Kracht’s characterisation is extended to generalised Sahlqvist formulas
by Kikot [27].
3.2.2 Some unimodal logics
The following well-known unimodal logics are mentioned in the paper:
[TABLE]
In order to avoid extensive use of ×, we denote the universal relation W×W
on any non-empty set W by ∀W.
By (1),
[TABLE]
Most of the paper is about two-dimensional modal product logics (see §3.2.3 below) where one or both component
logics is the much-studied unimodal ‘logic of elsewhere’ Diff
[6, 10, 11]. This logic was introduced by
Von Wright [41] as the set of unimodal formulas that are valid in all
difference frames, that is, in frames (W,=W), where =W is the
non-equality relation on some non-empty set W.
Segerberg [39] axiomatised Diff by the Sahlqvist formulas
[TABLE]
So an arbitrary frame for Diff is a pseudo-equivalence relation, that is,
it may contain both reflexive and irreflexive points, but it is always symmetric and
pseudo-transitive:
[TABLE]
In particular, equivalence relations are frames for Diff, and so Diff⊆S5.
It is straightforward to see that every rooted frame (W,R) for Diff is a p-morphic image of
any difference frame (U,=U) for which
[TABLE]
In particular,
[TABLE]
Note that one can express the universal, the at least two and the precisely one modalities with the help of a difference modality:
[TABLE]
3.2.3 Bimodal product frames and logics
Given unimodal frames
Fh=(Wh,Rh) and Fv=(Wv,Rv), their (modal) product is defined to be
the bimodal frame
[TABLE]
where Wh×Wv is the Cartesian product of Wh and Wv
and, for all x,x′∈Wh, y,y′∈Wv,
[TABLE]
It is easy to see that both taking point-generated subframes and p-morphic images
commute with the product construction:
[TABLE]
Given Kripke complete unimodal logics Lh and Lv in the respective unimodal languages
having ◊h,□h and ◊v,□v,
their product is defined as the (Kripke complete) bimodal logic
[TABLE]
In particular,
\mathbf{Diff}\!\times\!\mathbf{Diff}={\sf Logic\_of}\,\bigl{\{}{\mathfrak{F}}\!\times\!{\mathfrak{G}}:{\mathfrak{F}},{\mathfrak{G}}\mbox{ are frames for \mathbf{Diff}}\bigr{\}}.
We call a frame of the form (U,=U)×(V,=V), for some non-empty sets U,V,
a product of difference frames.
Then, by (1), (6) and (7), we have that
[TABLE]
If ∣U∣=∣V∣>0 then we call (U,=U)×(V,=V),
a square product of difference frames.
We define the ‘square’ version of Diff×Diff as
[TABLE]
Then by (8), we have
[TABLE]
As Theorem 2 shows, there is an infinite gap between these two logics.
(Note that it is easy to reduce the validity problem of both Diff×Diff and Diff×sqDiff to that of
two-variable first-order logic with counting, and so by the decidability of the latter [17], both
Diff×Diff and Diff×sqDiff are decidable.)
It is easy to see that the classes of (isomorphic copies of) products of difference frames and of
square products of difference frames are both closed under ultraproducts. Thus, by a general result
of [13], both Diff×Diff and Diff×sqDiff are canonical logics. Note that while (isomorphic copies of) products of difference frames form a (finitely axiomatisable) elementary class by Corollary 7 below,
it is not hard to show that the class of square products of difference frames is not closed under
elementary equivalence,
and so is not elementary (cf. [5, Thm. 4.1.12]).
Let comm_pse be the conjunction of the Sahlqvist formulas (□h□vp↔□v□hp) and
(2)–(3) for both □h and □v (with the first-order correspondent
saying that Rh and Rv are commuting pseudo-equivalence relations).
Then the logic [Diff,Diff] axiomatised by comm_pse is canonical, and so Kripke complete.
It is
straightforward to see that comm_pse is valid in every product of difference frames,
and so
[TABLE]
As Theorem 1 shows, there is an infinite gap between these two logics.
4 Rooted frames for [Diff,Diff]
In this section, we have a closer look at rooted frames for [Diff,Diff], that is,
rooted frames of the form F=(W,Rh,Rv), where
Rh and Rv are commuting pseudo-equivalence relations.
We begin with the simplest rooted frames of this kind.
A frame C=(C,Rh,Rv) is called a bi-cluster, if =C is a subset of Rj for both j=h,v.
It is straightforward to see that a bi-cluster is a rooted frame for [Diff,Diff].
For j=h,v, a point c in C is called Rj-irreflexive (Rj-reflexive)
if c¬Rjc (cRjc) holds.
So there can be four kinds of points in a bi-cluster:
both Rh- and Rv-reflexive (denoted by
),
Rh-irreflexive and Rv-reflexive (
),
Rh-reflexive and Rv-irreflexive (
), and
both Rh- and Rv-irreflexive (
).
We use * to indicate when a point is Rh-irreflexive and it does not matter whether it is
Rv-reflexive or Rv-irreflexive.
Similarly, * will be used whenever a point is Rv-irreflexive and it does not matter whether it is
Rh-reflexive or Rh-irreflexive.
(An example of a bi-cluster is depicted in Fig. 5.)
In what follows we often identify a bi-cluster with its domain. In particular,
for every bi-cluster C=(C,Rh,Rv), we denote by ∣C∣ the cardinality of its domain. We also let
[TABLE]
Next, let F=(W,Rh,Rv) be an arbitrary rooted frame for [Diff,Diff], and let Rh+ and Rv+ be the respective reflexive closures of Rh and Rv. It is easy to see that
Rh+ and Rv+ are commuting equivalence relations.
We define an equivalence relation ∼ on W
by taking, for all u,v∈W,
[TABLE]
For each u∈W, let [u] denote its ∼-class, and let W∼={[u]:u∈W}.
We say (with a slight abuse of notation)
that F is (represented as) a grid of bi-clusters (X,Y,g) whenever
g:X×Y→W∼ is a bijection for some sets X, Y
such that the following hold for all x=x′∈X and y=y′∈Y:
uRhv for all u∈g(x,y) and v∈g(x′,y);
uRvv for all u∈g(x,y) and v∈g(x,y′).
Observe that a single bi-cluster is a special case of a grid of bi-clusters when ∣X∣=∣Y∣=1.
Given two grids of bi-clusters F=(X,Y,g) and F⋆=(X⋆,Y⋆,g⋆), we say that
F is a subgrid of F⋆ if X⊆X⋆, Y⊆Y⋆, and
g=g⋆∣X×Y.
For each (x,y)∈X×Y, we will denote by Fxy the subgrid of
\bigl{(}\{x\},\{y\},g|_{\{x\}\!\times\!\{y\}}\bigr{)} of F.
Observe that Fxy is always a bi-cluster. For any bi-cluster C, we say that F contains C,
if C is isomorphic to Fxy for some x,y.
Throughout, we draw grids of bi-clusters by depicting each bi-cluster as a rectangular box, depicting X (and Rh between bi-clusters) horizontally and Y
(and Rv between bi-clusters) vertically; see, for example, Figs. 3 and 4.
Lemma** 6****.**
Every rooted frame F for [Diff,Diff] is a grid of bi-clusters.
Proof.
Suppose F=(W,Rh,Rv) is a rooted frame F for [Diff,Diff], that is,
Rh and Rv are commuting pseudo-equivalence relations.
Take any r∈W.
As Rh+ and Rv+ are commuting equivalence relations,
it is easy to see that r is a root in F, and
for all v,w∈W, if rRh+v and rRv+w then there is u with vRv+u and wRh+u.
So we let
[TABLE]
and define a function g:X×Y→W∼ by taking, for all [v]∈X, [w]∈Y,
[TABLE]
As both Rh+ and Rv+ are equivalence relations, for all v, w, u, v′, w′, u′, if
v∼v′, w∼w′, vRv+u, wRh+u, v′Rv+u′ and w′Rh+u′ then u∼u′ follows, and
so g is well-defined. It is easy to see that g is injective and both (gc1) and (gc2) hold.
Finally, we show that g is surjective: Take some [u]∈W∼. Then there exist
n∈N and u0,…,un such that u0=r, un=u and for each i<n, either uiRh+ui+1
or uiRv+ui+1. As Rh+ and Rv+ are commuting equivalence relations,
it follows that there are v,w such that
rRh+vRv+u and rRv+wRh+u, and so [v]∈X, [w]∈Y and g\bigl{(}[v],[w]\bigr{)}=[u], as required.
∎
Corollary 7**.**
For every frame F=(W,Rh,Rv), F is isomorphic to a product of difference frames iff
Rh and Rv are commuting irreflexive pseudo-equivalence relations and all bi-clusters in F
are singletons.
Because of the proof-pattern described in §2.2, we are particularly interested
in those countable grids of bi-clusters that are p-morphic images of some product of difference frames.
The following lemma provides a general characterisation for them.
Lemma** 8****.**
A countable grid of bi-clusters F is a p-morphic image of a product of two difference frames iff F is such that
each of its bi-clusters is the p-morphic image of a product of two
difference frames, and
the sizes of the product preimages for each bi-cluster ‘fit’.
More precisely, for any countable grid of bi-clusters F=(X,Y,g), we have the following:
If h:(U,=U)×(V,=V)→F is an onto p-morphism, then
there exists a function \xi_{h}:(X\cup Y)\to\bigl{(}\mathbb{N}^{+}\cup\{\aleph_{0}\}\bigr{)} such that
for every (x,y)∈X×Y, the bi-cluster Fxy in F is a p-morphic image of
(Ux,=Ux)×(Vy,=Vy) for some sets Ux,Vy with
∣Ux∣=ξh(x) and ∣Vy∣=ξh(y).
If \xi:(X\cup Y)\to\bigl{(}\mathbb{N}^{+}\cup\{\aleph_{0}\}\bigr{)} is a function such that
for every (x,y)∈X×Y, the bi-cluster Fxy in F is a p-morphic image of
(Uxy,=Uxy)×(Vxy,=Vxy) for some sets Uxy,Vxy with
∣Uxy∣=ξ(x) and ∣Vxy∣=ξ(y), then there is an onto p-morphism
hξ:(U,=U)×(V,=V)→F for some sets U,V with
∣U∣=∑x∈Xξ(x) and ∣V∣=∑y∈Yξ(y).
Proof.
(i):
We let
[TABLE]
Then it is straightforward to see that, for every (x,y)∈X×Y, the restriction of
h to Ux×Vy is a p-morphism onto Fxy. So we can define ξh by taking ξh(x)=∣Ux∣, for x∈X, and
ξh(y)=∣Vy∣, for y∈Y, as required.
(ii):
For every (x,y)∈X×Y,
suppose that hxy:(Uxy,=Uxy)×(Vxy,=Vxy)→Fxy
is an onto p-morphism.
As for every x∈X, we have ∣Uxy∣=ξ(x)=∣Uxy′∣ for any y,y′∈Y, we may assume that
Uxy and Uxy′ are the same set Ux. Similarly, for every y∈Y, we may assume that Vxy and Vx′y are
the same set Vy. We may also assume that all these sets are disjoint. Now let U=⋃x∈XUx,
V=⋃y∈YVy, and let the function hξ:U×V→F be defined by taking hξ(u,v)=hxy(u,v)
whenever u∈Ux and v∈Vy. Then it is straightforward to check that hξ is a p-morphism
from (U,=U)×(V,=V) onto F.
∎
4.1 ‘Good’ and ‘bad’ bi-clusters
By Lemma 8, if a grid of bi-clusters is not the p-morphic image of a product of difference frames,
then it is because its bi-clusters are not p-morphic images
of ‘fitting’ product preimages.
In this subsection, we have a closer look at individual bi-clusters first: which of them can or cannot be obtained
as the p-morphic image of some product of difference frames, and what size-restrictions we have on possible
product preimages.
We distinguish fifteen types of finite bi-clusters, depending on whether they contain Ri-reflexive points or not, for i=h,v (see Table 1).
In particular, finite bi-clusters of types (no1)–(no4) will be called impossible bi-clusters
throughout. Lemma 9 below claims that every countable bi-cluster that is not
impossible can be obtained as the p-morphic image of any product of difference frames
validating some constraints.
(In §6.2 we will show that the converse of Lemma 9 also holds in the sense
that whenever a countable bi-cluster C is a p-morphic image of a product of difference frames,
then C is not impossible, and the described constraints hold for the preimage product frame, see Corollary 22.)
Lemma** 9****.**
Every countably infinite bi-cluster C is a p-morphic image of
(ω,=ω)×(ω,=ω).
For every finite bi-cluster C, if C is not an impossible bi-cluster, then
C is the p-morphic image of (U,=U)×(V,=V) for any countable sets
U, V such that
the constraints of Table 1 hold for x=∣U∣ and y=∣V∣.
Proof.
We begin with a useful tool.
Given a bi-cluster C=(C,Rh,Rv),
we define an C-network to be a homomorphism
f:(U,=U)×(V,=V)→C, for some finite non-empty sets U and V.
Given C-networks f1:(U1,=U1)×(V1,=V1)→C and
f2:(U2,=U2)×(V2,=V2)→C, we write f1⊆f2 whenever
U1⊆U2, V1⊆V2 and f2∣U1×V1=f1.
We define a game G(C) between two players, ∀ and ∃.
They build a countable sequence of C-networks
f0⊆f1⊆⋯⊆fk⊆….
In round 0, ∀ picks any point r in C, and ∃ responds with
U0={u0}, V0={v0}, and f0(u0,v0)=r.
In round k (k∈N+), some sequence f0⊆⋯⊆fk−1 of C-networks
has already been built. ∀ picks a pair (c∗,z) where
c∗∈C and z∈Uk−1∪Vk−1.
There are two cases:
z∈Vk−1. Then ∃ can respond in two ways:
If either c∗ is Rh-irreflexive and there is u∈Uk−1 with fk−1(u,z)=c∗,
or c∗ is Rh-reflexive and there are u,u′∈Uk−1, u=u′ with fk−1(u,z)=fk−1(u′,z)=c∗,
then she responds with fk=fk−1. Otherwise,
she responds (if she can) with some C-network fk⊇fk−1
such that Vk=Vk−1, Uk=Uk−1∪{u+} for some fresh point u+,
and fk(u+,z)=c∗.
In other words, she needs to find a sequence \bigl{(}c_{v}:v\in V_{k-1}-\{z\}\bigr{)} of points in C such that,
for every v∈Vk−1−{z},
cvRhfk−1(u,v) for every u∈Uk−1,
cvRvc∗, and cvRvcv′ for every v′∈Vk−1−{z}, v′=v.
z∈Uk−1. Then again, ∃ can respond in two ways:
If either c∗ is Rv-irreflexive and there is v∈Vk−1 with fk−1(z,v)=c∗,
or c∗ is Rv-reflexive and there are v,v′∈Vk−1, v=v′ with fk−1(z,v)=fk−1(z,v′)=c∗,
then she responds with fk=fk−1. Otherwise,
she responds (if she can) with some C-network fk⊇fk−1
such that Uk=Uk−1, Vk=Vk−1∪{v+} for some fresh point v+,
and fk(z,v+)=c∗.
In other words,
she needs to find a sequence \bigl{(}c_{u}:u\in U_{k-1}-\{z\}\bigr{)} of points in C such that,
for every u∈Uk−1−{z},
cuRvfk−1(u,v) for every v∈Vk−1,
cuRhc∗, and cuRhcu′ for every u′∈Uk−1−{z}, u′=u.
If ∃ can respond in each round k for k∈N then she wins the play. We say that
∃ has a winning strategy in G(C) if she can win all plays, whatever moves
∀ takes in the rounds.
Claim** 9.1****.**
For every countable bi-cluster C,
player ∃ has a winning strategy in G(C) iff
C is the p-morphic image of a product of two countable difference frames.
Proof.
On the one hand, it is easy to see that ∃ can use a p-morphism from
a product of two difference frames onto C to determine her winning strategy in G(C).
For the other direction,
consider a play of the game G(C) with the following property:
For all k∈N, (u,v)∈Uk×Vk, c∗∈C, there exist ℓh,ℓv∈N such
that ℓh,ℓv>k, ∀ picks (c∗,u) in round ℓh, and
∀ picks (c∗,v) in round ℓv (since C is countable, he can do these).
If ∃ uses her strategy, then the union f:(U,=U)×(V,=V)→C of
the constructed countable ascending chain of C-networks is a p-morphism.
Indeed,
take some u∗∈U, v∗∈V, c∗∈C such that, say, f(u∗,v∗)Rhc∗.
We need to find some u∈U such that u=u∗ and f(u,v∗)=c∗.
Let k be such that (u∗,v∗)∈Uk×Vk and consider round ℓ>k
when ∀ picks (c∗,v∗). There are three cases:
If c∗ is Rh-irreflexive and there is u∈Uℓ−1 with fℓ−1(u,v∗)=c∗,
then f(u,v∗)=f(u∗,v∗), and so u=u∗.
If c∗ is Rh-reflexive and there are u,u′∈Uℓ−1 with u=u′ and fℓ−1(u,v∗)=fℓ−1(u′,v∗)=c∗, then either u=u∗ or u′=u∗.
Otherwise, there is u+∈Uℓ−Uℓ−1 with f(u+,v∗)=c∗. As u∗∈Uℓ−1,
it follows that u+=u∗.
∎
Now we can complete the proof of Lemma 9.
Item (i): Consider the game G(C).
As C is infinite and the constructed networks in each round of a play in the game have finite domains,
∃ can always respond according to the rules, and so she has a winning strategy in G(C). So by Claim 9.1, C is the p-morphic image of a product of two countable difference frames. As C contains infinitely many points that are Rj-connected, for both j=h and
j=v, both components in the product preimage must be infinite.
Item (ii):
Suppose first that C is a finite infinity bi-cluster. Then C contains at least one point a.
So in every round of a play in the game G(C), ∃ can always
use a full a-sequence as the sequence of points needed in her response, giving her
a winning strategy in G(C). So by Claim 9.1, there exists an
onto p-morphism f:(U,=U)×(V,=V)→C, for some countable sets U and V.
We claim that
[TABLE]
Indeed, for (11), let a be a point in C. Then for every
v∈V, there exist uv,uv′∈U, uv=uv′ such that
f(uv,v)=f(uv′,v)=a. Also, if v1=v2 then uv1, uv1′, uv2, uv2′
must be four distint points, and so (11) follows. The proof of (12) is similar.
For (13), let b be a point in C. Then for every
v∈V, there exist uv∈U such that
f(uv,v)=b. Also, if v1=v2 then uv1=uv2 must hold, and so ∣U∣≥∣V∣
follows. We can show ∣V∣≥∣U∣ similarly, and so we obtain (13).
Now if C is an infinity bi-cluster, then the infinity of both U and V follows from
(11)–(13).
Suppose that C is an n-element (hvstrict) bi-cluster, and take any n-element sets U and
V. Let f:U×V→C be any function such that the n×n-matrix
\bigl{(}f(u,v)\bigr{)}_{(u,v)\in U\!\times\!V} is a Latin square over the elements of C
(that is, each element of C occurs exactly once in each row and exactly once in each column).
It is straightforward to check that such an f is a p-morphism from (U,=U)×(V,=V)
onto C.
Suppose that C is an n-element (hstrict) bi-cluster, and take any n-element sets U and
V−. Let f:U×V−→C be any function such that the n×n-matrix
\bigl{(}f(u,v)\bigr{)}_{(u,v)\in U\!\times\!V^{-}} is a Latin square over the elements of C.
It is straightforward to check that such an f is a p-morphism from (U,=U)×(V−,∀V−)
onto C. Now take any set V with ∣V∣≥2n. By (5) and (7), we obtain that C is the p-morphic image of (U,=U)×(V,=V).
The proof for (vstrict) bi-clusters is similar.
The following claim will also be used in §5:
Claim** 9.2****.**
If C is a (h2vsw) bi-cluster containing n points and m points,
then for all sets U′, V with ∣U′∣=∣V∣≥n+2m there exists a p-morphism from (U′,∀U′)×(V,=V) onto C.
Proof.
Take any sets U′, V such that ∣U′∣=∣V∣=N for some N≥n+2m.
Let S be an N-element set that contains all the points of C, and at least two distinct ‘copies’
of each point in C. Then let f:U′×V→C be any function such that the N×N-matrix
\bigl{(}f(u,v)\bigr{)}_{(u,v)\in U^{\prime}\!\times\!V} is a Latin square over the elements of S.
It is straightforward to check that
such an f is a p-morphism from (U′,∀U′)×(V,=V) onto C.
∎
Now suppose that C is a (h2vsw) bi-cluster containing n points and m points,
and take any sets U,V with ∣U∣≥2⋅∣V∣ and V≥n+2m. By (5), (7),
and Claim 9.2, we obtain that C is the p-morphic image of (U,=U)×(V,=V).
The proof for (v2hsw) bi-clusters is similar.
Suppose that C is a (=sw) bi-cluster containing n points and m points,
and take any sets U, V such that ∣U∣=∣V∣=N for some N≥n+2m.
Let S be an N-element set that contains all the points of C, and at least two distinct ‘copies’
of each point in C. Then let f:U×V→C be any function such that the N×N-matrix
\bigl{(}f(u,v)\bigr{)}_{(u,v)\in U\!\times\!V} is a Latin square over the elements of S.
It is straightforward to check that such an f is a p-morphism from (U,=U)×(V,=V)
onto C.
Finally, suppose that C is an n-element (free) bi-cluster, and take any n-element sets U− and
V−. Let f:U−×V−→C be any function such that the n×n-matrix
\bigl{(}f(u,v)\bigr{)}_{(u,v)\in U^{-}\!\times\!V^{-}} is a Latin square over the elements of C.
It is straightforward to check that such an f is a p-morphism from (U−,∀U−)×(V−,∀V−)
onto C. Now take any sets U, V with ∣U∣≥2n and ∣U∣≥2n.
By (5) and (7), we obtain that C is the p-morphic image of (U,=U)×(V,=V).
∎
5 Non-finite axiomatisability
In this section we prove Theorems 1 and 2,
using the proof pattern described in §2.1.
We will also use a result of
[31, Cor. 2.5], saying that
if C is closed under ultraproducts and point-generated subframes, then
[TABLE]
In order to prove Theorem 1, we show the following more general statement, which
also generalises some results of [30]:
Theorem 10**.**
Let L be any bimodal logic such that
L* contains K×Diff, and*
for every k∈N+ there are U, V, such that ∣V∣≥k, ∣U∣≥2⋅∣V∣ and
(U,∀U)×(V,=V) is a frame for L.
Then L is not axiomatisable using finitely many propositional variables.
Proof.
For every k∈N+, k≥2, take the grids of bi-clusters Fk and Gk depicted in Fig. 1.
Lemma** 10.1****.**
Fk* is not a frame for K×Diff.*
Gk* is a p-morphic image of (U,∀U)×(V,=V), whenever ∣V∣≥k and ∣U∣≥2⋅∣V∣.*
If k,m∈N and k≥2m+1, then for every m-generated model M over Fk there is some model N over Gk that is a p-morphic image of M.
Proof.
(i):
By definition, K×Diff=Logic_ofC, where
[TABLE]
Using (6) and the fact that the ultraproduct construction also commutes with
the modal product construction,
it is not hard to see that C is closed
under point-generated subframes and ultraproducts.
Therefore, by (14),
it is enough to show that Fk=(W,Rh,Rv) is not the p-morphic image of any
(Wh,Qh)×(Wv,Qv), where Qv is a pseudo-equivalence relation.
Suppose to the contrary that there is an onto p-morphism f:(Wh,Qh)×(Wv,Qv)→Fk.
Take any point a in the k-element bi-cluster C1(k), and
any point b in the k+1-element bi-cluster C2(k). As aRhb,
there are x0,x1∈Wh, y0∈Wv such that x0Qhx1,
f(x0,y0)=a and f(x1,y0)=b.
As there are k other points in C2(k), each of them is Rv-related to b, there exist y1,…,yk∈Wv
such that y0Qvyi for all 0<i≤k and yi=yj for all i=j≤k. As Qv is a pseudo-equivalence relation, it follows that yiQvyj for all i=j≤k.
Then f(x0,yi)Rvf(x0,yj) must hold, for all i=j≤k. As
every point in C1(k) is Rv-irreflexive, this is not possible by the pigeonhole principle.
(ii):
Take any sets U, V, with ∣V∣≥k and ∣U∣≥2⋅∣V∣, and choose two disjoint subsets
U1 and U2 of U such that ∣U1∣=∣U2∣=∣V∣.
Observe that each of the two bi-clusters Ci′(k) in Gk is a (h2vsw) bi-cluster, containing
k−2 points and one point
(cf. Fig. 1 and Table 1).
So by Claim 9.2,
there exist onto p-morphisms hi:(Ui,∀Ui)×(V,=V)→Ci′(k), for i=1,2.
Let U′=U1∪U2, and
define a function h from U′×V to Gk by taking, for all u∈U′, v∈V,
[TABLE]
Then it is easy to check that h is a p-morphism from (U′,∀U′)×(V,=V) onto
Gk. As (U′,∀U′) is a p-morphic image of (U,∀U), it follows from (7)
that Gk is a p-morphic image of (U,∀U)×(V,=V).
(iii):
Let M be a model over Fk such that if M(p)=∅ for some propositional variable p
then p=pi for some i<m. We define two equivalence relations ∼1 and ∼2 on
C1(k) and on C2(k), respectively, by taking, for all a,a′ in C1(k) and b,b′ in C2(k),
[TABLE]
As k≥2m+1, by the generalised pigeonhole principle,
there is a ∼1-class containing at least two points a,a′, and
there is a ∼2-class containing at least three points b,b′,b′′.
Now define a function h from Fk onto Gk by
mapping a and a′ to the
point in C1′(k),
mapping the remaining k−2 points in C1(k) to the k−2 distinct
points in C1′(k),
mapping b,b′ and b′′ to the
point in C2′(k),
mapping the remaining k−2 points in C2(k) to the k−2 distinct
points in C2′(k).
It is easy to check that h is a p-morphism from Fk onto Gk. Now define a model N over
Gk by taking,
for any propositional variable p, {\mathfrak{N}}(p)=\{c:h(a)=c\mbox{ for some a\in{\mathfrak{M}}(p)}\}.
By the above, h is a p-morphism from M onto N.
∎
Now the proof of Theorem 10 can be completed as follows.
Suppose to the contrary that Σ axiomatises L and Σ contains only
m propositional variables, for some m∈N.
Let k≥2m+1 and let M be an arbitrary model over Fk. Let
Mm be another model over Fk that is the same
as M on propositional variables occurring in Σ, and
∅ otherwise. Then Mm is clearly m-generated
and Mm⊨Σ iff M⊨Σ.
Also, by Lemma 10.1 (iii) there is a model N over Gk that is a p-morphic image
of Mm.
As there are U, V, such that ∣V∣≥k, ∣U∣≥4⋅∣V∣ and
(U,∀U)×(V,=V) is a frame for L,
by Lemma 10.1 (ii) Gk is a frame for L. Thus, N⊨L, and so Mm⊨L.
As Σ⊆L, we obtain Mm⊨Σ, and so M⊨Σ.
As this holds for any model M over Fk, Fk is a frame for Σ.
Therefore, Logic_of{Fk} is a bimodal logic containing Σ, and so we have that
Fk is a frame for L. As L contains K×Diff, this implies
that Fk is a frame for K×Diff, contradicting Lemma 10.1 (i).
∎
Proof of Theorem 2
For every k∈N+, take the grids of bi-clusters Gk and Hk from Figs. 1 and
2, respectively.
Lemma** 10.2****.**
Hk* is not a frame for Diff×sqDiff.*
Gk* is a p-morphic image of (ω,=ω)×(ω,=ω).*
If k,m∈N and k>2m, then for every m-generated model M over Hk there is some model N over Gk that is a p-morphic image of M.
Proof.
(i):
By definition, Diff×sqDiff=Logic_of{square products of difference frames}.
Using (6) and the fact that the ultraproduct construction also commutes with
the modal product construction,
it is not hard to see that the class of all square products of difference frames is closed
under point-generated subframes and ultraproducts.
Therefore, by (14),
it is enough to show that Hk=(W,Rh,Rv) is not the p-morphic image of a square product (U,=U)×(V,=V) for any sets U,V with ∣U∣=∣V∣>0.
Suppose indirectly that it is.
As every point in Hk is Rv-irreflexive,
∣V∣=k must hold. On the other hand, as Rh is the universal relation in Hk, we must have
∣U∣≥2k, contradicting ∣U∣=∣V∣>0.
Item (ii) follows from Lemma 10.1 (ii), (5) and (7).
The proof of item (iii)
is similar to that of Lemma 10.1 (iii).
∎
Now the proof of Theorem 2 can be completed similarly to that of Theorem 10,
using Lemma 10.2 in place of Lemma 10.1.
6 Infinite canonical axiomatisation for Diff×Diff
In this section we prove Theorem 3
using the proof pattern described in §2.2 (for the class C of all products of difference frames).
So we will define a recursive
set ΣDiff×Diff of Sahlqvist formulas, and prove that the following hold:
-
All formulas in ΣDiff×Diff are valid in every product of difference frames.
2. 2.
For every countable rooted frame F that is not the p-morphic image of some product of
difference frames, there is some ϕF∈ΣDiff×Diff such that ϕF is not valid in F.
To begin with, if F is a countable rooted frame such that F⊨[Diff,Diff], then
F⊨comm_pse, and so we let
ϕF=comm_pse∈ΣDiff×Diff.
So from now on we assume that F⊨[Diff,Diff], and so F
is a grid of bi-clusters by Lemma 6.
We call a countable grid of bi-clusters F bad if it is not the p-morphic image of a product of difference frames.
In §6.1 below we discuss two kinds of ‘finitary reasons’ for a countable grid of bi-clusters being bad,
and prove that these are the only such reasons.
Then in §6.2 we provide the Sahlqvist formulas in ΣDiff×Diff ‘eliminating’ these reasons.
6.1 Bad grids of bi-clusters
The first reason for a countable grid of bi-clusters F being bad is when F contains a finite impossible bi-cluster.
This reason will be ‘eliminated’ by a Sahlqvist formula in §6.2.1, where it is also shown
that this is indeed a reason for F being bad (see Corollary 18).
So suppose that
F=(W,Rh,Rv) is a countable rooted frame for [Diff,Diff] that is represented as a grid of bi-clusters as (X,Y,g),
and
F contains no impossible bi-clusters.
We may assume that X and Y are disjoint, and consider the elements of X∪Y as distinct variables. We define a set ΓF of ‘constraints’ such that each constraint in ΓF is
one of the forms (z=n), (z≥k), or (z≥λz′),
for some z,z′∈X∪Y, n∈N+∪{ℵ0}, k∈N+, and λ=1,2.
For all x∈X and y∈Y,
[TABLE]
We assume that (z≥1)∈ΓF for every z∈X∪Y.
A solution of ΓF is a function
[TABLE]
validating all constraints in ΓF.
In other words, we are trying to solve a special kind of integer programming problem: ΓF is a
(possibly infinite) set of linear equations and inequalities (where all coefficients are positive integers or ℵ0), and we are looking for integer plus possibly (countably) infinite solutions of it.
By Lemmas 8 (ii) and 9, it is easy to see the following:
Claim** 11****.**
*If F is a countable grid of bi-clusters that contains no impossible bi-clusters and
ξ is a solution of ΓF, then
there is an onto p-morphism
hξ:(U,=U)×(V,=V)→F for some sets U,V with
∣U∣=∑x∈Xξ(x) and ∣V∣=∑y∈Yξ(y).
*
(In §6.2 we will show that the converse of Claim 11 also holds in the sense that
whenever a countable grid of bi-clusters F is a p-morphic image of a product of difference frames,
then F contains no impossible bi-clusters, and ΓF has a solution; see Corollary 22.)
In order to characterise those countable F for which ΓF has no solution,
we first introduce some notions dealing with the one-variable constraints in ΓF. For every z∈X∪Y, we let
[TABLE]
Next, in order to deal with the two-variable constraints, we define a (finite or countably infinite)
edge-labelled digraph
GF=(X∪Y,EF) by taking, for any z,z′∈X∪Y,
[TABLE]
Observe that
(i) all edges either go from some x∈X to some y∈Y, or from some y∈Y to some x∈X,
(ii) edge-labels λ can only be 1 or 2, and
(iii) if (z→1z′)∈EF for some z,z′ then (z′→1z)∈EF as well.
For some m∈N,
we call a path
z0→λ1z1→λ2…zm−1→λmzm
in GF bad, if
max(z0)<λ1⋅ … ⋅λm⋅min(zm).
(Observe that when m=0 then z0 is a bad path of length [math] whenever max(z0)<min(z0).
Note that a bad path is not necessarily simple: it may contain the same edge more than once.)
Figs. 3 and 4
show two examples of grids of bi-clusters that are bad because their graphs contain some bad paths.
In §6.2.2 we will show that if a grid of bi-clusters F is such that it does not contain impossible bi-clusters,
but GF contains a bad path,
then there is a Sahlqvist formula ‘eliminating’ this reason (and F is indeed bad).
Here we show that we have found all reasons for
ΓF not having a solution:
Lemma** 12****.**
Let F=(X,Y,g) be a countable grid of bi-clusters such that
-
F* contains no impossible bi-clusters, and*
2. 2.
there is no bad path in GF.
Then ΓF has a solution.
Proof.
Suppose F contains no impossible bi-clusters, and there is no bad path in GF.
We will define a ‘minimal’ solution ξminF such that it takes the same value on variables belonging to the same strongly connected component of GF.
To begin with,
for every strongly connected component S in GF, we let
(with a slight abuse of notation),
[TABLE]
Next, we define an
acyclic digraph GF+ as follows
(GF+ is what is called the condensation of GF=(X∪Y,EF)):
its nodes are the strongly connected components of GF,
and we define the edges by taking
[TABLE]
For n∈N, we call a path
S0⇒S1⇒…Sn−1⇒Sn
in GF+ bad, if
max(S0)<2n⋅min(Sn).
Claim** 12.1****.**
There is no bad path in GF+.
Proof.
Suppose indirectly that S0⇒S1⇒…Sn−1⇒Sn is a bad path in GF+, that is, max(S0)<2n⋅min(Sn).
Then there exist m∈N, z0∈S0, zn∈Sn and a path P of the form
z0→λ1⋯→λmzn in GF
such that
max(S0)=max(z0)
and 2n≤λ1⋅ … ⋅λm whenever m>0.
Now there are several cases:
There is z∈Sn such that min(Sn)=min(z).
Then take P and continue it with any path from zn to z. The resulting path in GF is bad,
a contradiction.
min(Sn)=ℵ0 and there is a →2 edge within Sn.
Then take any path Q from zn to zn containing this →2 edge.
Suppose Q is of the form zn→μ1⋯→μizn.
Then
μ1⋅ … ⋅μi≥2, and so there is r∈N such that
max(z0)<2n⋅(μ1⋅ … ⋅μi)r⋅min(zn).
Then the path in GF obtained by starting with P and then repeating Q r times is bad,
a contradiction.
min(Sn)=ℵ0, there is no →2 edge within Sn,
but for every i∈N
there is some wi∈Sn with min(wi)≥i.
Then choose i such that i⋅2n>max(z0).
Then the path in GF obtained by starting with P and then continuing with any path from zn to wi is bad, a contradiction again,
proving Claim 12.1.
∎
Next,
for every node S in GF+, let
[TABLE]
We define a function νminF from the nodes of GF+ to N+∪{ℵ0}
by induction on their rank by taking, for every strongly connected component S,
[TABLE]
(see Examples 14 and 32 below).
Claim** 12.2****.**
For all strongly connected components S,S′ in GF,
all n∈N+∪{ℵ0}, k∈N+, and λ∈{1,2}, we have the following:
If (z=n)∈ΓF for some z∈S, then νminF(S)=n.
If (z≥k)∈ΓF for some z∈S, then νminF(S)≥k.
If (z≥λz′)∈ΓF for some z∈S, z′∈S′, then
νminF(S)≥λ⋅νminF(S′).
Proof.
(i):
If n=ℵ0 then min(z)=ℵ0, and so νminF(S)=ℵ0.
So suppose that n∈N+. Then
[TABLE]
If any of the inequalities ≤ in (25) were <, then S would be a bad path of length [math] in
GF+, contradicting Claim 12.1. So we have min(S)=max(S)=n.
We also have that rank(S)∈N. (Otherwise,
there would exist a bad path of length >n in GF+ starting
at S.)
If rank(S)=0 then νminF(S)=min(S) by definition, and so we have νminF(S)=n.
Now suppose that rank(S)>0. By definition, νminF(S)≥min(S) always holds.
So suppose indirectly that νminF(S)>min(S).
We will construct a bad path in GF+, contradicting Claim 12.1.
To begin with,
there is S1 such that S⇒S1
and 2⋅νminF(S1)>min(S)=max(S).
(Either because S1 is such that νminF(S)=2νminF(S1) or
because νminF(S)=ℵ0.)
As νminF(S1)≥min(S1), there are two cases:
(a) νminF(S1)=min(S1).
Then 2⋅min(S1)>min(S)=max(S), and so S⇒S1 is a bad path in GF+.
(For example, this is the case when S1 is a final node in GF+.)
(b) νminF(S1)>min(S1).
Then there is S2 such that S1⇒S2 and
22⋅νminF(S2)>max(S).
(Either because S2 is such that νminF(S1)=2νminF(S2) or
because νminF(S1)=ℵ0.)
Again, there are two cases:
(b.1) νminF(S2)=min(S2).
Then
S⇒S1⇒S2 is a bad path in GF+.
(b.2) νminF(S2)>min(S2).
Then again, there are two cases. And so on, sooner or later we reach
a final node in GF+, where we only have case (a).
(ii):
If νminF(S)=ℵ0 then the statement holds. If νminF(S)∈N then
rank(S)∈N, and so
νminF(S)≥min(S)≥min(z)≥k.
(iii):
If λ=1 then S=S′, and so νminF(S)=νminF(S′), as required.
If λ=2 and S=S′, then min(S)=min(S′)=ℵ0, and so
νminF(S)=νminF(S′)=ℵ0. Thus νminF(S)≥2⋅νminF(S′) holds.
If λ=2 and S=S′, then S⇒S′. If νminF(S)=ℵ0, then νminF(S)≥2⋅νminF(S′) holds.
If νminF(S)∈N then rank(S)∈N, and so again νminF(S)≥2⋅νminF(S′) holds, as required.
∎
Now for every S in GF+ and every z in S, we define
[TABLE]
By Claim 12.2,
ξminF is a solution of ΓF, proving Lemma 12.
∎
Now by Claim 11 and Lemma 12 we obtain:
Corollary 13**.**
If a countable grid of bi-clusters F=(X,Y,g) is bad (that is, F is not the p-morphic image of a product of difference frames), then at least one of the following two reasons holds:
-
either F contains a finite impossible bi-cluster,
2. 2.
or there is a bad path in GF.
Example** 14****.**
Take the grid of bi-clusters F in Fig. 8.
We compute νminF. To begin with, we have the following strongly connected components in GF:
S1={y1},
S2={x1},
S3={y3,x2,y2},
S4={y4},
S5={x4,y5,x5,x6},
S6={x3}.
Then the edges in GF+ are S2⇒S3 and S4⇒S5.
Therefore, we have:
[TABLE]
6.2 Sahlqvist formulas
In §6.2.1 and §6.2.2 below,
we will eliminate each of the two kinds of reasons in
Corollary 13 for a countable grid of bi-clusters F being bad,
using a Sahlqvist formula ϕF in each case.
6.2.1 Eliminating impossible bi-clusters
Recall from Table 1 that a finite bi-cluster is impossible, if it is one of the types (no1)–(no4).
We define formulas for the cases of (no1), (no3) and (no4);
the case of (no2) is similar and left to reader.
So let C=(C,Rh,Rv) be a bi-cluster consisting of n=k+ℓ points for some k,ℓ∈N+, out of which a1,…,ak are Rv-irreflexive, a1 is Rh-reflexive, b1,…,bℓ are Rh-irreflexive; see Fig. 5.
(It does not matter whether any of a2,…,ak are Rh-reflexive or -irreflexive, or
whether any of b1,…,bℓ are Rv-reflexive or -irreflexive.)
We introduce fresh propositional variables ai for i=1,…k, and bj for j=1,…,ℓ,
and define
[TABLE]
It is straightforward to check that impossibleC is a Sahlqvist formula.
Lemma** 15****.**
It is decidable whether a bimodal formula is of the form impossibleC for some impossible bi-cluster C.
Proof.
Observe that impossibleC only depends on the numbers k,ℓ and the type of C.
∎
Lemma** 16****.**
impossibleC* is not valid in any grid of bi-clusters that contains the bi-cluster C.*
Proof.
Suppose F=(W,Rh,Rv) is a grid of bi-clusters containing C. We define a model M on F by taking
[TABLE]
It is straightforward to check that M,a1⊨clashC. On the other hand,
if w∈W is such
that M,w⊨⋀i=1kai∧⋀j=1ℓbj, then
w must be in C by the definition of M and grids of bi-clusters. As all the ai are Rv-irreflexive
and all the bj are Rh-irreflexive, w should be different from all of them,
a contradiction.
∎
Lemma** 17****.**
impossibleC* is valid in every product of difference frames.*
Proof.
Let M be a model over a product (U,=U)×(V,=V) of difference frames, and suppose that
M,(u0,v1)⊨clashC. By (27)–(28),
there are distinct points u0,…,un in U and distinct points v1,…,vn in V such that
[TABLE]
(see Fig. 6).
We say that a pair (u,v)∈U×V is of a-type (or of b-type) if
M,(u,v)⊨a^i for some i=1,…,k
(or M,(u,v)⊨b^j for some j=1,…,ℓ).
Take the subset Z of U×V consisting of the pairs (ui,vj) for i=0,…,n and j=1,…,n.
We claim that
[TABLE]
Indeed, suppose the contrary, that is, every pair in Z is either a-type or b-type.
For every 0≤i≤n, there can be ≤k many a-type pairs among
(ui,v1),…,(ui,vn). So there have to be ≥ℓ many b-type pairs among them.
So altogether in Z there are ≥(n+1)⋅ℓ many b-type pairs.
Thus, by the generalised pigeonhole principle, there exists 1≤s≤n such that there are >ℓ many b-type points among (u0,vs),…,(un,vs).
But for every 1≤j≤n, there can be ≤ℓ many b-type pairs among
(u0,vj),…,(un,vj), a contradiction, proving (33).
So suppose (u,v)∈Z is neither a-type nor b-type.
By (29)–(30), for every 1≤i≤k there is some zi∈V such that
zi=v and M,(u,zi)⊨a^i, and so M,(u,v)⊨ai by (27).
Similarly, by (31)–(32), for every 1≤j≤ℓ there is some wj∈U such that
wj=u and M,(wj,v)⊨b^j, and so M,(u,v)⊨bj by (28).
Therefore,
M,(u,v)⊨⋀i=1kai∧⋀j=1ℓbj,
as required.
∎
As a consequence of Lemmas 16 and 17 we also obtain:
Corollary 18**.**
For every countable grid of bi-clusters F, if F is a p-morphic image of a product of difference frames,
then F contains no impossible bi-clusters.
6.2.2 Eliminating bad paths
Let F=(W,Rh,Rv) be a countable rooted frame for [Diff,Diff] that is represented
as a grid of bi-clusters as (X,Y,g).
Suppose that F contains no impossible bi-clusters, but GF contains a bad path P of the form
[TABLE]
such that m∈N and
max(z0)<λ1⋅ … ⋅λm⋅min(zm).
Throughout this subsection, we assume that z0,zm∈Y, and define
a Sahlqvist formula bad_pathFP for this case. The other three
cases are similar and left to the reader.
The antecedent of bad_pathFP will consists of two conjuncts:
smallFP (expressing the value of max(z0)), and
◊h+◊v+largeFP (expressing that the value of λ1⋅ … ⋅λm⋅min(zm) is sufficiently large).
We begin with defining smallFP.
As max(z0)<λ1⋅ … ⋅λm⋅min(zm), we must have that
max(z0)=nP for some nP∈N+, and so (z0=nP)∈ΓF by
(18).
As z0∈Y,
[TABLE]
So, we introduce fresh propositional variables a and ai, for i=1,…,nP, and
define the formula
[TABLE]
In order to define largeFP,
we first describe the path P with a formula pathFP. To this end,
we say that a bi-cluster C corresponds to an edge z→λz′ in GF,
if C is (isomorphic to) Fzz′ whenever z∈X, z′∈Y, and
C is (isomorphic to) Fz′z whenever z′∈X, z∈Y.
Observe that only switch bi-clusters can correspond to some edge in GF. In particular, for every x∈X and every y∈Y, we have the following:
x→1y is an edge in GF iff y→1x is an edge in GF iff Fxy is a type (=sw) bi-cluster.
x→2y is an edge in GF iff Fxy is a type (h2vsw) bi-cluster.
y→2x is an edge in GF iff Fxy is a type (v2hsw) bi-cluster.
If m>0 then
let C1,…Cm be the sequence of bi-clusters corresponding to the edges in P (that is, Cj corresponds to zj−1→λjzj).
Observe that for each j=1,…,m,
if Cj is of type (v2hsw) then there is some point cj in Cj;
if Cj is of type (h2vsw) then there is some point cj in Cj; and
if Cj is of type (=sw) then there is some point cj in Cj.
So, for j=1,…,m, we introduce fresh propositional variables cj, and define
formulas
[TABLE]
We also introduce a fresh propositional variable b, and define the formulas path0, path1, …, pathm=pathFP inductively as follows.
Let path0=¬a∧□hb (where a is the same variable as in smallFP),
and for j=1,…,m, let
[TABLE]
Now we are in a position to define
largeFP, expressing that the value of λ1⋅ … ⋅λm⋅min(zm) for the endpoint zm of P
is sufficiently large.
Let kP∈N+ be such that
\textit{max}\,({z_{0}})<\mbox{\lambda_{1}\cdot\dots\cdot\lambda_{m}\cdot k_{P}} and kP≤min(zm).
We have two cases, depending on why min(zm) is ‘too large’:
-
There is xP′∈X such that
v_size(FxP′zm)≥kP;
2. 2.
or there is xP′∈X such that FxP′zm is an infinity bi-cluster
(see (19), (15), and Table 1).
We define a Sahlqvist formula largeFP for each of these two cases.
Case 1.
Then there are Rv-reflexive points b1∘,…,brP∘ and
Rv-irreflexive points b1∙,…,biP∙ in FxP′zm such that
2rP+iP≥kP.
We introduce fresh propositional variables bj∘ for j=1,…,rP, and
bs∙ for s=1,…,iP,
and define the formulas
[TABLE]
Then we let
[TABLE]
Case 2.
Now we cannot use that we have enough different points in FxP′zm like in Case 1, but instead we need to ‘generate’ them.
There are two cases:
Either
(a) FxP′zm contains some point c and some * point d
(this is when FxP′zm is of type (inf1), (inf2) or (inf4));
or
(b) FxP′zm contains some * point c and some point d
(this is when FxP′zm is of type (inf1), (inf3) or (inf4)).
In both cases, instead of the bj∘ and bj∙ variables, we introduce fresh
propositional variables c and d, and define the formulas
[TABLE]
Then we define the formulas
large1,…,largekP=largeFP inductively as follows.
Let
[TABLE]
and for j=2,…,kP, let
[TABLE]
Finally,
we define bad_pathFP by taking
[TABLE]
It is straightforward to check that bad_pathFP is a Sahlqvist formula.
Lemma** 19****.**
It is decidable whether a bimodal formula is of the form bad_pathFP for some grid of bi-clusters F and bad path P in GF.
Proof.
Observe that bad_pathFP only depends on the numbers nP, rP, iP, kP, and the types in the sequence of bi-clusters corresponding to the edges in P.
∎
Lemma** 20****.**
Suppose F=(X,Y,g) is a grid of bi-clusters that contains no impossible bi-clusters.
If P is a bad path in GF, then bad_pathFP is not valid in F.
Proof.
We use the notation introduced in the definition of bad_pathFP.
We define a model M on F by taking
[TABLE]
then in Case 1, take
[TABLE]
and in Case 2, take
[TABLE]
It is straightforward to check that M,a1⊨smallFP.
Further, it is easy to see that
in Case 1, M,bj∘⊨pathFP for all j=1,…,rP and
M,bs∙⊨pathFP for all s=1,…,iP,
and in Case 2, M,c⊨pathFP.
Using these, it is not hard to check that largeFP is satisfied in M in both cases.
On the other hand, suppose w∈W is such that a1Rv+w and
M,w⊨b. We claim that a1Rh+w
follows. Indeed, if m=0 then this is because we have some w′ in FxP′z0 with w′Rh+a1 and w′Rhw,
and if m>0 then because of a1Rhc1 and c1Rhw.
Thus, w must be in FxPz0 by the definition of grids of bi-clusters.
If M,w⊨⋀i=1nPai held as well, then w should be different from all the
ai, contradicting FxPz0={a1,…,anP}.
∎
Lemma** 21****.**
bad_pathFP* is valid in every product of difference frames.*
Proof.
Again, we use the notation introduced in the definition of bad_pathFP.
Let M be a model over a product (U,=U)×(V,=V) of difference frames, and suppose that
M,(u,v)⊨smallFP for some u,v.
Then there are distinct points v1,…,vnP in V such that
[TABLE]
Claim** 21.1****.**
If largeFP is satisfied in M, then
there exist points u1,…,ukP∈U and distinct points w1,…,wkP∈V such that
M,(uj,wj)⊨pathFP, for all j=1,…,kP.
Proof.
In Case 1 this easily follows from 2rP+iP≥kP and (37)–(38).
In Case 2(a):
We show by induction that, for all j=1,…,kP, if M,(a,b)⊨largej for some (a,b),
then there are distinct points u1,…,uj in U and
distinct points w1,…,wj in V such that
wj=b,
us=a for any s with 1≤s≤j,
M,(us,ws)⊨d^∧pathFP for all 1≤s≤j, and
M,(us,ws−1)⊨c^ for all 2≤s≤j.
As largeFP=largekP, Claim 21.1 will follow. To begin with,
the j=1 case is obvious. So suppose inductively that the statement holds for some j−1,
and suppose that M,(a,b)⊨largej. Then there are a′∈U, b1,b2∈V such that
a′=a, b,b1,b2 are all distinct, M,(a′,b)⊨d^∧pathFP, and
M,(a′,bi)⊨largej−1 for i=1,2.
By the IH, for each i=1,2,
there are distinct points u1i,…,uj−1i in U and
distinct points w1i,…,wj−1i in V such that wj−1i=bi, usi=a′ for any s,
M,(usi,wsi)⊨d^∧pathFP for all 1≤s≤j−1, and
M,(usi,ws−1i)⊨c^ for all 2≤s≤j−1. Thus, for each i=1,2,
[TABLE]
As wj−11=b1=b2=wj−12, (40) and (41) imply that
all the u11,…,uj−11, u12,…,uj−12 are distinct.
Thus, either a∈/{u11,…,uj−11} or a∈/{u12,…,uj−12}.
Let j be such that a∈/{u1j,…,uj−1j}. Then the points us=usj, ws=wsj for
s=1,…,j−1, uj=a′, wj=b are as required.
Case 2(b) is similar.
∎
Claim** 21.2****.**
\displaystyle{\mathfrak{M}},(u,v)\models\Diamond_{\mathbf{v}}^{+}\Bigl{(}\mathsf{b}\land\bigwedge_{i=1}^{n_{P}}\mathsf{a}_{i}\Bigr{)}.
Proof.
We define sets Gen(zj) and Set(zj) inductively, for j=m,…,0, such that the following hold,
for every j≤m:
Gen(zj)⊆U×V;
M,(a,b)⊨pathj for every (a,b)∈Gen(zj);
if j<m then M,(a,b)⊨c^j+1 for every (a,b)∈Gen(zj);
\textit{Set}({z_{j}})=\left\{\begin{array}[]{ll}\{a\in U:(a,b)\in\textit{Gen}({z_{j}})\mbox{ for some b}\},&\mbox{ if z_{j}\in X},\\[3.0pt]
\{b\in V:(a,b)\in\textit{Gen}({z_{j}})\mbox{ for some a}\},&\mbox{ if z_{j}\in Y};\end{array}\right.
∣Set(zm)∣=kP, and if j<m then ∣Set(zj)∣=λj+1⋅∣Set(zj+1)∣.
To begin with, we take the points from Claim 21.1 and let
[TABLE]
Now suppose inductively that (i)–(v) hold for some j.
There are several cases. Suppose first that zj−1∈X, and
Set(zj)={b1,…,bsj} for some sj.
Take some a1,…,asj∈U such that (ai,bi)∈Gen(zj) for all i=1,…,sj.
If λj=1 (that is, Cj is of type (=sw)),
then pathj=◊h(c^j∧pathj−1) by (36).
By (ii), there are a1′,…,asj′∈U such that M,(ai′,bi)⊨c^j∧pathj−1,
for all i=1,…,sj. As c^j=¬cj∧□hcj∧□vcj by (35),
all the ai′ are distinct. We let Set(zj−1)={a1′,…,asj′}.
If λj=2 (that is, Cj is of type (h2vsw)),
then \mathsf{path}_{j}=\Diamond_{\mathbf{h}}\bigl{(}\hat{\mathsf{c}}_{j}\land\mathsf{path}_{j-1}\land\Diamond_{\mathbf{h}}(\hat{\mathsf{c}}_{j}\land\mathsf{path}_{j-1})\bigr{)} by (36).
By (ii), there are a1′,…,a2sj′∈U such that M,(ai′,bi)⊨c^j∧pathj−1
and M,(asj+i′,bi)⊨c^j∧pathj−1,
for all i=1,…,sj. As c^j=¬cj∧□vcj by (35),
all the 2sj many ai′ are distinct. We let Set(zj−1)={a1′,…,a2sj′}.
The cases when zj−1∈Y are similar.
So we have (i)–(v) for all j=0,…,m, and so ∣Set(z0)∣=kP⋅λm⋅ … ⋅λ1.
As P is a bad path, ∣Set(z0)∣>nP, and so by the pigeonhole principle, there is w∈Set(z0) such that w=vi for any 1≤i≤nP.
Therefore, M,(u,w)⊨⋀i=1nPai by (39).
We claim that M,(u,w)⊨b also holds.
Indeed,
take any a∈U such that (a,w)∈Gen(z0). As path0=¬a∧□hb,
by (ii) we have M,(a,w)⊨¬a∧□hb.
Therefore, a=u by (39), and so M,(u,w)⊨b follows, as required.
∎
As by Claim 21.2 the consequent of bad_pathFP holds at (u,v) in M, the proof of
Lemma 21 is completed.
∎
As a consequence of Corollary 18 and Lemmas 12, 20 and 21 we also obtain the following ‘converse’ of Lemma 9:
Corollary 22**.**
For every countable grid of bi-clusters F, if F is a p-morphic image of a product of difference frames,
then F contains no impossible bi-clusters, and ΓF has a solution.
7 Infinite canonical axiomatisation for Diff×sqDiff
In this section we prove Theorem 4
using the proof pattern described in §2.2 (for the class C of all square products of difference frames).
So we will define a recursive
set ΣDiff×sqDiff of generalised Sahlqvist formulas containing ΣDiff×Diff,
and prove that the following hold:
-
All formulas in ΣDiff×sqDiff are valid in every square product of difference frames.
2. 2.
For every countable rooted frame F that is not the p-morphic image of some square product of
difference frames, there is some ϕF∈ΣDiff×sqDiff such that ϕF is not valid in F.
To begin with, if F is a countable rooted frame such that it is not the p-morphic image of a product of difference frames at all, then there is some ϕF∈ΣDiff×Diff such that ϕF is not valid in F. So from now on we assume that F is the p-morphic image of a product of difference frames.
In particular, F⊨[Diff,Diff], and so F is a grid of bi-clusters by Lemma 6.
We call a countable grid of bi-clusters F square-bad if it is the p-morphic image of a product of difference frames, but
it is not the p-morphic image of a square product of difference frames.
In §7.1 below we classify square-bad grids of bi-clusters into several categories. Then in
§7.2 we define the generalised Sahlqvist formulas in ΣDiff×sqDiff,
for each such category.
Finally, in §7.3 we discuss Corollary 5, that is, why Diff×sqDiff is in fact Sahlqvist axiomatisable.
7.1 Good grids of bi-clusters that are not p-morphic images of squares
Throughout, we suppose that F=(W,Rh,Rv) is square-bad and
represented as a grid of bi-clusters as (X,Y,g).
By Corollary 22, F contains no impossible bi-clusters, and the set ΓF of constraints (as defined in (15)) does have a solution.
By Claim 11,
we have
∑y∈Yξ(y)=∑x∈Xξ(x)
for any solution ξ of ΓF
(see Figs. 7 and 8 for some examples).
In particular,
[TABLE]
We begin with introducing some notions that will help us to deal with ‘upper bound’ constraints.
For any n∈N+ and any z∈X∪Y, we call z n-strict if
(z=n)∈ΓF.
We call z strict if it is n-strict for some n∈N+.
Now recall the digraph GF from (20).
We call a z∈X∪Y bounded if there is a path in GF from some strict z′ to z,
and unbounded otherwise. (In particular, if z is strict then z is bounded.)
Given a bounded z and a path P in GF of the form
z0→λ1z1⋯→λmzm where zm=z and z0 is n-strict for some n∈N+, we let
[TABLE]
Then for every bounded z∈X∪Y, we let
[TABLE]
Note that since ΓF has a solution, ubF(z)=n for any n-strict node z. Also,
it is easy to see that
[TABLE]
For any bounded z, we choose a simple path PubF(z) from z to a strict node such
that \textit{weight}\,\bigl{(}P_{\textit{ub}}^{{\mathfrak{F}}}({z})\bigr{)}=\textit{ub}^{{\mathfrak{F}}}({z}), and if z is strict then PubF(z) consists of just z.
Lemma** 23****.**
Suppose F=(X,Y,g) is a grid of bi-clusters such that ΓF is defined and has a solution, but
∑y∈Yξ(y)=∑x∈Xξ(x) for any solution ξ of ΓF.
Then one of the following cases holds:
X∪Y* is finite, and every z∈X∪Y is bounded.*
X∪Y* is finite, and
either (a) every x∈X is bounded, there is some unbounded y⋆∈Y,
and ∑x∈Xξ(x)<∑y∈Yξ(y) for every solution ξ of ΓF;
or (b)
every y∈Y is bounded, there is some unbounded x⋆∈X,
and ∑y∈Yξ(y)<∑x∈Xξ(x) for every solution ξ of ΓF.*
Either (a)
X is finite, every x∈X is bounded, Y is infinite, and
there is a finite subgrid F−=(X,Y−,g−) of F such that
∑x∈Xξ(x)<∑y∈Y−ξ(y) for every solution ξ of ΓF−;
or (b)
Y is finite, every y∈Y is bounded, X is infinite, and
there is a finite subgrid F−=(X−,Y,g−) of F such that
∑y∈Yξ(y)<∑x∈X−ξ(x) for every solution ξ of ΓF−.
Proof.
Observe that since ΓF has a solution,
at least one of X and Y must be finite by (42).
So suppose, say, that X is finite. Suppose also that there is some
unbounded x∈X. Then by (44) and
(42) we obtain that Y is finite and every y∈Y is bounded. So at least one of X or Y must be such that it is finite and all its members
are bounded.
Suppose, say, that X is finite and every x∈X is bounded. There are three cases:
If Y is finite and every y∈Y is bounded, then we have Case (i).
If Y is finite and there is some unbounded y⋆∈Y, then (as every x∈X is bounded)
the only constraints about y⋆ in ΓF are of the form (y∗≥k) or (y∗≥λx),
for some k∈N+, λ=1,2 and x∈X.
So for any solution ξ of ΓF, if we keep ξ(z) for all z=y∗, and increase ξ(y∗) arbitrarily, we obtain another solution. Therefore,
we cannot have
that ∑x∈Xξ(x)≥∑y∈Yξ(y) for any solution ξ,
and so we have Case (ii).
Finally, suppose that Y is infinite. Then let
[TABLE]
Clearly, Y′ is finite. Now take any finite Y−⊇Y′
such that ∣Y−∣>∑x∈XubF(x), and
let {\mathfrak{F}}^{-}=\bigl{(}X,Y^{-},g|_{X\!\times\!Y^{-}}\bigr{)}. By (43),
for every solution ξ of ΓF−, we have
[TABLE]
and so we have Case (iii).
∎
7.2 Generalised Sahlqvist formulas
In this subsection, we will eliminate the reasons
for a countable grid of bi-clusters F being square-bad.
For each of the cases described in Lemma 23, we use a different generalised Sahlqvist formula ϕFsq.
Throughout this subsection, we assume that
F=(W,Rh,Rv) is
represented as a grid of bi-clusters as (X,Y,g),
ΓF is defined and has a solution, but
∑y∈Yξ(y)=∑x∈Xξ(x) for any solution ξ of ΓF.
In §7.2.1 and §7.2.2 we will discuss the cases when X∪Y is finite and infinite, respectively.
7.2.1 X∪Y is finite
If X∪Y is finite then,
by Lemma 23, at least one of X and Y is such that all its members are bounded.
Suppose, say, that every x∈X is bounded, and let Yb be the set of bounded members in Y.
(The case when every y∈Y is bounded is similar.)
We will define a formula solutionF that is satisfiable in F (Lemma 27),
and ‘forces’ a solution of ΓF
when satisfied in a product of difference frames (Lemma 28).
The formula solutionF will consist of three conjuncts:
upper_boundF and lower_boundF will describe the respective upper and lower bound constraints on any possible solution, while switchF will describe the interactions among switch bi-clusters in F.
We also want solutionF to be a generalised
Sahlqvist antecedent, and so it is a problem that the digraph GF=(X∪Y,EF) might contain cycles.
The following claim says that we can always take a suitable acyclic subgraph of it:
Claim** 24****.**
If F is finite and every x∈X is bounded, then
there is an acyclic subgraph
HF=(X∪Y,EFub) of GF=(X∪Y,EF) such that the following hold:
Every initial node in HF is either strict or belongs to Y−Yb.
EFub* contains all the →2-edges in EF.*
For every edge z→1z′ in EF,
there is an undirected path
between z and z′ in HF such that all edges in the path are →1 edges.
Proof.
Observe that by (42), (24) and (23),
no strongly connected component S in GF contain any →2 edge.
So all cycles in GF consists of →1 edges only.
Observe also that if y∈Y−Yb, then (as all x∈X are bounded) there are no edges in EF of the form x→λy for any x∈X.
So either y is an isolated node in GF,
or there is an edge y→2x in EF for some (maybe several) x∈X.
In any case,
the strongly connected component y belongs to consists of just y alone.
We give an algorithm for how to construct EFub from EF.
For every strongly connected component S=(S,ES) containing only bounded nodes, we define
step-by-step a subset ES− of ES such that (S,ES−) is acyclic.
First, we choose a node zS in S as follows. If there is a strict node in S, then
let zS be any of the strict nodes in S. If there is no strict node in S,
then let zS be any node in S such that there is a →2 edge in GF starting at some bounded node and ending at zS. Let X0={zS} and E0=∅.
In the inductive step,
take some z∈S−Xn, and consider any path P within S from some node in Xn to z such that no other node in P is in Xn.
Let En+1 consist of the edges in En plus the edges in P, and let Xn+1 be obtained from Xn by adding all the nodes in P. Clearly,
if (Xn,En) is acyclic, then (Xn+1,En+1) is acyclic as well. We do this until
Xi=S for some i, and let ES−=Ei.
Now let EFub consist of the edges in ES− for each S, plus all
the →2-edges in EF. It is easy to check that HF is as required.
∎
The formula upper_boundF
We will describe the ‘bounded’ part of HF, while also
keeping track of the connections with the unbounded nodes in Y−Yb.
To begin with,
we need to describe that the rows and columns of the grid-structure are pairwise disjoint.
So for every x∈X and every y∈Y, we introduce respective fresh propositional variables x and
y, and define the formulas
[TABLE]
Next, we need to describe strict nodes in X∪Yb.
Observe that, for every n∈N+ and every n-strict x∈X, there exist some y′∈Y
and distinct Rh-irreflexive points a1x,…,anx in Fxy′.
Similarly, for every n-strict y∈Yb, there exist some x′∈X
and distinct Rv-irreflexive points a1y,…,any in Fx′y.
Thus,
for every n∈N+ and every n-strict z∈X∪Yb,
we introduce fresh propositional variables aiz for i=1,…,n.
We also need to describe the switch bi-clusters in HF. To simplify notation, for all z,z′∈X∪Y,
we will write
[TABLE]
and let
[TABLE]
Observe that for every (x,y)∈SwF, Fxy is a switch bi-cluster. Therefore,
Fxy contains a point cxy such that
(i) cxy is when Fxy is of type (h2vsw) (that is, x→2y is an edge in HF);
(ii) cxy is when Fxy is of type (v2hsw) (that is, y→2x is an edge in HF);
and (iii) cxy is when Fxy is of type (=sw) (that is, either x→1y or y→1x is an edge in HF).
Thus, for every (x,y)∈SwF, we introduce a fresh propositional variable cxy, and
define the formula
[TABLE]
Let HFb be the induced subgraph of HF on its bounded nodes, that is,
on node set X∪Yb.
Starting at each strict node as root, we unravel HFb into a forest
(a disjoint union of directed rooted trees)
TF, where each branch of each tree is continued until it reaches either a strict node different from the root or, if there is no such on the branch, a final node in HFb.
So for each node q in TF there is a unique z∈X∪Yb such that q is a(n unravelled) copy of z. (Each z∈X∪Yb might have many different copies.)
For every node q in TF, we let TF(q) denote the set of its children in TF.
For every node q in TF, now we define a formula tree(q) by induction on the structure of TF
starting at its leaves:
If q is not a root in TF, then there is a unique q∗ with q∈TF(q∗).
There are two cases:
If q is a copy of x∈X and q∗ is a copy of y∗∈Yb, then let
[TABLE]
If q is a copy of y∈Yb and q∗ is a copy of x∗∈X, then let
[TABLE]
If q is a root in TF, then q is a copy of some n-strict z∈X∪Yb for some n∈N+.
Again, there are two cases:
If z is some x∈X, then for each i=1,…,n, let
[TABLE]
and then let
[TABLE]
If z is some y∈Yb, then for each i=1,…,n, let
[TABLE]
and then let
[TABLE]
Finally,
let upper_boundF be the conjunction of ◊h+◊v+tree(q) for all roots q in the forest
TF (see Example 32 below).
The ‘interaction’ formula switchF
We use the variables introduced for the formula upper_boundF.
For every x∈X and every y∈Yb, we define the respective formulas
[TABLE]
[TABLE]
and let switchF be the conjunction of switchFz, for all z∈X∪Yb (see Example 32 below).
The formula lower_boundF
We use the cxy variables introduced for upper_boundF, and will also introduce some fresh variables.
Observe that by (19) and (42),
for every
z∈X∪Y, we have min(z)∈N+ and either \bigl{(}z=\textit{min}\,({z})\bigr{)}$$\in\Gamma^{{\mathfrak{F}}} or
\bigl{(}z\geq\textit{min}\,({z})\bigr{)}\in\Gamma^{{\mathfrak{F}}}. So if x∈X, then there is yx∈Y such that
\textit{h\_size}\bigl{(}{\mathfrak{F}}^{xy_{x}}\bigr{)}=\textit{min}\,({x}), and so there are Rh-reflexive points
b1∘(x), …, brx∘(x) and
Rh-irreflexive points b1∙(x), …, bix∙(x) in Fxyx such that
2rx+ix=min(x).
Similarly,
if y∈Y, then there is xy∈X such that \textit{v\_size}\bigl{(}{\mathfrak{F}}^{x_{y}y}\bigr{)}=\textit{min}\,({y}),
and so there are Rv-reflexive points b1∘(y),…,bry∘(y) and
Rv-irreflexive points b1∙(y),…,biy∙(y) in Fxyy such that
2ry+iy=min(y).
Now recall the function ξminF from (26). As ξminF is a solution of ΓF by
Claim 12.2, it follows from (42) that ξminF(z)∈N+, for every z∈X∪Y.
We define
[TABLE]
For every z∈Xlb∪Ylb∪(Y−Yb),
we introduce fresh propositional variables bj∘(z) for j=1,…,rz, and
bs∙(z) for s=1,…,iz,
and define the formulas
[TABLE]
For every x∈Xlb, we define
[TABLE]
and for every y∈Ylb∪(Y−Yb), we define
[TABLE]
Let lower_boundF be the conjunction of ◊h+◊v+lower_boundFz, for all z∈Xlb∪Ylb∪(Y−Yb)
(see Example 32 below).
The formula solutionF
We let
[TABLE]
Lemma** 25****.**
It is decidable whether a bimodal formula is of the form solutionF for some finite grid of bi-clusters F=(X,Y,g)
for which every X∪Y is bounded,
ΓF is defined and has a solution, but ∑y∈Yξ(y)=∑x∈Xξ(x) for any solution ξ of ΓF.
It is also decidable whether a bimodal formula is of the form solutionF for some finite grid of bi-clusters F=(X,Y,g)
for which every x∈X is bounded, there is some unbounded y⋆∈Y,
ΓF is defined and has a solution, but
∑x∈Xξ(x) <∑y∈Yξ(y) for every solution ξ of ΓF.
Proof.
It is not hard to check that solutionF only depends on
the finite acyclic digraph HF and the types of bi-clusters corresponding to its edges,
the values min(z)∈N+ for all nodes z in HF, and
which nodes in HF are strict.
An inspection of the proof of Claim 24 shows that it is decidable whether HF is
obtained from some finite edge-labelled digraph G with designated strict nodes and
min(z) values.
And it is clearly decidable whether
such a G can be obtained as GF for some finite grid of bi-clusters F=(X,Y,g) as described.
∎
Lemma** 26****.**
solutionF* is a generalised Sahlqvist antecedent.*
Proof.
It is straightforward to check that solutionF is a potential generalised Sahlqvist antecedent.
We show that the dependency digraph D(solutionF) of solutionF is acyclic. To this end, observe that
the
nodes of D(solutionF) are among the cxy variables occurring in switchF, and
we have the following edges ⇛ in D(solutionF):
[TABLE]
We claim that if Q is a path of length >0 in D(solutionF)
from some cxy to some cx′y′,
then either there is a path of length >0 in HF from x to x′,
or there is a path of length >0 in HF from y to y′. Indeed, we show this
by induction on the length ℓ of Q. If ℓ=1 then this follows from
(55)–(56). So suppose ℓ>0 and Q is Q− followed by an edge
of the form, say, cx′y′′⇛cx′y′ for some y′′∈Y. (The other case
is similar.)
By the IH, there are two cases: (i) either there is a path of length >0 in HF from x to x′, in which case we are done, (ii) or there is a path of length >0 in HF from y to y′′.
As we also have y′′→x′→y′ by (55), we have a path of length >0 in HF from y to y′, as required.
Now suppose indirectly that there is a cycle in the dependency digraph of solutionF.
Choose an arbitrary edge in this cycle of the form, say, cxy⇛cx′y for some x,x′∈X, y∈Y. (The other case
is similar.) Then x→y→x′ holds by (56). Also, either there is a path of length >0 in HF from x′ to x,
or there is a path of length >0 in HF from y to y. In both cases, we have a cycle
in HF, contradicting that it is acyclic by Claim 24.
∎
Lemma** 27****.**
If a grid of bi-clusters F1=(X1,Y1,g1) contains F as a subgrid,
then solutionF is satisfiable in F1.
Proof.
We use the notation introduced in the definition of solutionF.
We define a model M on F1 by taking
[TABLE]
It is not hard to check that solutionF is satisfied in M.
∎
Lemma** 28****.**
Suppose M,(u,v)⊨solutionF for some point (u,v) in a model M over a product frame (U,=U)×(V,=V). Then for every x∈X there is a set Set(x)⊆U, and for every y∈Y there is a set Set(y)⊆V
such that the following hold, for every z∈X∪Y:
Set(z)∩Set(z′)=∅* whenever z=z′; z,z′∈X or z,z′∈Y.*
If z∈X∪Yb then we can ‘identify’ points outside Set(z) with a positive formula. In particular:
If z=x∈X then for all a∈U−Set(x),
[TABLE]
If z=y∈Yb then for all b∈V−Set(y),
[TABLE]
ξ* is a solution of ΓF, where ξ(z)=∣Set(z)∣, for z∈X∪Y.*
Proof.
The argument uses a series of claims.
To begin with,
for every node q in TF we will define, inductively on the height of q in TF, a set Gen(q)⊆U×V
such that, for every q, and every (a,b)∈Gen(q),
[TABLE]
If q is a root and a copy of some n-strict z∈X∪Yb, then ◊h+◊v+tree(q) is a conjunct of upper_boundF, and so
there is (a,b) with M,(a,b)⊨tree(q).
So if z=x∈X, then there are distinct a1,…,an∈U such that
M,(ai,b)⊨treei(q) for i=1,…,n.
Let Gen(q)={(a1,b),…,(an,b)}. Then (57) holds for every (a,b)∈Gen(q).
Observe that
[TABLE]
Also,
[TABLE]
Similarly, if z=y∈Yb, then
there are distinct b1,…,bn∈V such that
M,(a,bi)⊨treei(u) for i=1,…,n.
Let Gen(q)={(a,b1),…,(a,bn)}. Then (58) holds for every (a,b)∈Gen(q).
If q∈TF(q∗), q is a copy of some x∈X and q∗ is a copy of some y∗∈Yb
then, by (58) of the IH, we have M,(a,b)⊨◊htree(q) for every (a,b)∈Gen(q∗).
Thus, for every (a,b)∈Gen(q∗), there is some a′∈U with M,(a′,b)⊨tree(q).
Let Gen(q)={(a′,b):(a,b)∈Gen(q∗)}. Then (57) holds for every (a,b)∈Gen(q).
Observe that
[TABLE]
Similarly,
if q∈TF(q∗), q is a copy of some y∈Yb and q∗ is a copy of some x∗∈X
then, by (57) of the IH, we have M,(a,b)⊨◊vtree(q) for every (a,b)∈Gen(q∗).
Thus, for every (a,b)∈Gen(q∗), there is some b′∈V with M,(a,b′)⊨tree(q).
Let Gen(q)={(a,b′):(a,b)∈Gen(q∗)}. Then (58) holds for every (a,b)∈Gen(q).
Observe that
[TABLE]
Next, for every node q in TF that is a copy of some x∈X, we let
[TABLE]
for every node q that is a copy of some y∈Yb, we let
[TABLE]
Observe that
[TABLE]
Claim** 28.1****.**
For every node q in TF, the following hold:
If q is a root and a copy of some n-strict x∈X,
then M,(a∗,v)⊨◊v+⋀i=1naix
for all a∗∈U−Set(q).
If q is a root and a copy of some n-strict y∈Yb, then
M,(u,b∗)⊨◊h+⋀i=1naiy
for all b∗∈V−Set(q).
If q is a copy of some x∈X, q∈TF(q′), and q′ is a copy of some y′∈Yb, then
M,(a∗,v)⊨□v+cxy′
for all a∗∈U−Set(q).
If q is a copy of some y∈Yb, q∈TF(q′), and q′ is a copy of some x′∈X, then
M,(u,b∗)⊨□h+cx′y
for all b∗∈V−Set(q).
Proof.
We prove the claim by induction on the height of q in TF.
(i): Suppose q is a root and a copy of some n-strict x∈X, and take some a∗∈U−Set(q).
By (59), there is b such that for every 1≤i≤n there is (ai,b)∈Gen(q) with
M,(ai,b)⊨treei(q). As □haix is a conjunct of treei(q),
we have M,(ai,b)⊨□haix for every 1≤i≤n.
As by (64) ai∈Set(q) for every 1≤i≤n, we have that a∗=ai for any i,
and so M,(a∗,b)⊨⋀i=1naix.
The case when q is a copy of some n-strict y∈Yb is similar.
(ii):
Suppose q∈TF(q′), q is a copy of some x∈X, q′ is a copy of some y′∈Yb.
(The case when q∈TF(q′), q is a copy of some y∈Yb and q′ is a copy of some x′∈X is similar.)
Suppose inductively that we have (i)–(ii) for q′, and take some a∗∈U−Set(q).
We claim that
[TABLE]
implying M,(a∗,v)⊨□v+cxy′, as required.
Indeed, take some b∗∈V. There are two cases, either b∗∈Set(q′) or b∗∈/Set(q′).
If b∗∈Set(q′) then there is some a such that (a,b∗)∈Gen(q′) by (65).
Thus, there is a′ such that (a′,b∗)∈Gen(q) by (61), and so
M,(a′,b∗)⊨tree(q) by (63). As c^xy′ is a conjunct of tree(q), we also have
M,(a′,b∗)⊨c^xy′.
As q∈TF(q′), we have y′→x. Thus,
□hcxy′ is a conjunct of c^xy′, and so
M,(a′,b∗)⊨□hcxy′ as well. As a∗∈/Set(q) but a′∈Set(q) by (64), it follows that a′=a∗, and so (67) holds.
If b∗∈/Set(q′) then suppose first that q′ is a root, that is, y′ is n-strict for some n∈N+. By item (i) of the IH, M,(u,b∗)⊨◊h+⋀i=1naiy′.
As y′→x holds,
M,(u,b∗)⊨□h+cxy′ follows by (52),
and so (67) holds.
Finally, suppose that q′ is not a root.
Let q′′ be such that q′∈TF(q′′), and suppose that q′′ is a copy of some x′′∈X.
Then M,(u,b∗)⊨□h+cx′′y′ by item (ii) of the IH.
As x′′→y′→x holds,
M,(u,b∗)⊨□h+cxy′ follows by (52).
Thus, (67) holds in this case as well.
∎
Claim** 28.2****.**
For all nodes q,q′ in TF, the following hold:
If q′→2q is an edge in TF then ∣Set(q′)∣≥2⋅∣Set(q)∣.
If q′→1q is an edge in TF then ∣Set(q)∣=∣Set(q′)∣.
Proof.
Suppose q′→λq is an edge in TF for some λ,
q is a copy of some x∈X, q′ is a copy of some y′∈Yb.
(The case when q is a copy of some y∈Yb and q′ is a copy of some x∈X is similar.)
By (64), for every a∈Set(q) there is ba with (a,ba)∈Gen(q).
Then M,(a,ba)⊨tree(q) by (63). As c^xy′ is a conjunct of tree(q), we also have
M,(a,ba)⊨c^xy′.
By (65), we have
[TABLE]
(i):
As q′→2q is an edge in TF, y′→2x is an edge in HF. Thus, c^xy′ implies
¬cxy′∧□hcxy′∧◊v(¬cxy′∧□hcxy′) (see (46)), and so M,(a,ba)⊨¬cxy′∧□hcxy′∧◊v(¬cxy′∧□hcxy′). So for every a∈Set(q), there is also a ba′=ba
with M,(a,ba′)⊨¬cxy′∧□hcxy′.
Therefore
[TABLE]
We claim that
[TABLE]
Indeed,
suppose indirectly that ba′∈/Set(q′) for some a∈Set(q).
There are two cases: If q′ is a root and y′ is n-strict for some n∈N+, then
M,(u,ba′)⊨◊h+⋀i=1naiy′
by Claim 28.1 (i). As y′→x holds,
M,(u,ba′)⊨□h+cxy′ follows by (52),
contradicting M,(a,ba′)⊨¬cxy′.
If q′∈TF(q′′) for some q′′ and q′′ is a copy of some x′′∈X, then by Claim 28.1 (ii)
we have that M,(u,ba′)⊨□h+cx′′y′. As x′′→y′→x holds,
M,(u,ba′)⊨□h+cxy′ follows by (52), a contradiction again, proving (70).
Now ∣Set(q′)∣≥2⋅∣Set(q)∣ follows by (68),
(69) and (70).
(ii):
As q′→1q is an edge in TF, y′→1x is an edge in HF.
So ¬cxy′∧□hcxy′ is a conjunct of c^xy′,
and so M,(a,ba)⊨¬cxy′∧□hcxy′ as well.
So if a1=a2∈Set(q) then ba1=ba2 must hold, and so
∣Set(q)∣≤∣Set(q′)∣ by (68).
On the other hand,
by (65) and (64), for every b∈Set(q′) there is ab∈Set(q) such that
(ab,b)∈Gen(q), and so M,(ab,b)⊨tree(q) by (63).
As c^xy′ is a conjunct of tree(q), we also have
M,(ab,b)⊨c^xy′.
As ¬cxy′∧□vcxy′ is a conjunct of c^xy′, we have
M,(ab,b)⊨¬cxy′∧□vcxy′ as well.
So if b1=b2∈Set(q′) then ab1=ab2 must hold, and so
∣Set(q′)∣≤∣Set(q)∣.
∎
Claim** 28.3****.**
For all nodes q1,q2 in TF,
if q1 and q2 are both copies of the same z∈X∪Yb, then Set(q1)=Set(q2).
Proof.
Clearly,
it is enough to show that Set(q1)⊆Set(q2).
Suppose that q1=q2 are both copies of the same x∈X, and there is some
a∈Set(q1)−Set(q2). (The case when q1=q2 are both copies of the same y∈Yb is similar.)
As a∈Set(q1), there is b with (a,b)∈Gen(q1) by (64).
It cannot be that both q1 and q2 are roots in TF, so there are three cases:
If q1 is a root and q2 is not a root. Then suppose x is n-strict for some n∈N+,
q2∈TF(q2′), and q2′ is a copy of some y′′∈Yb.
So by (60) there exist 1≤i≤n such that M,(a,b)⊨treei(q1).
As y′′→x holds, ◊v¬cxy′′ is a conjunct of treei(q1), and so
M,(a,b)⊨◊v¬cxy′′. On the other hand, as a∈/Set(q2), by Claim 28.1 (ii)
we have (a,v)⊨□v+cxy′′,
a contradiction.
If q2 is a root and q1 is not a root. Again, suppose x is n-strict for some n∈N+,
q1∈TF(q1′), and q1′ is a copy of some y′∈Yb.
We have M,(a,b)⊨tree(q1) by (63).
As c^xy′ is a conjunct of tree(q1), we have M,(a,b)⊨c^xy′.
As ¬cxy′ is a conjunct of c^xy′, we have M,(a,b)⊨¬cxy′.
On the other hand, as a∈/Set(q2), by Claim 28.1 (i) we have
M,(a,v)⊨◊v+⋀i=1naix.
As y′→x holds,
M,(a,v)⊨□v+cxy′ by (49),
a contradiction.
If neither q1 nor q2 is a root. Then suppose q1∈TF(q1′), q2∈TF(q2′),
q1′ is a copy of y′∈Yb, and q2′ is a copy of y′′∈Yb.
We have M,(a,b)⊨tree(q1) by (63).
As c^xy′ is a conjunct of tree(q1), we have M,(a,b)⊨c^xy′.
As ¬cxy′ is a conjunct of c^xy′, we have M,(a,b)⊨¬cxy′.
On the other hand, as a∈/Set(q2), by Claim 28.1 (ii)
we have M,(a,v)⊨□v+cxy′′. If y′=y′′, this is a contradiction.
If y′=y′′ then ◊v¬cxy′′ is a conjunct of tree(q1), and so
M,(a,b)⊨◊v¬cxy′′,
a contradiction again.
∎
Next, for every z∈X∪Y, we will define Set(z). There are two cases:
If z∈X∪Yb, then let
[TABLE]
This is well-defined, as
TF contains some copy of every z∈X∪Yb,
and the definition does not depend on the choice of the particular copy by Claim 28.3.
If y∈Y−Yb then
◊h+◊v+lower_boundFy is a conjunct of lower_boundF, and so there exist a∈U and distinct b1,…,b2ry,b1′,…,biy′∈V such that
[TABLE]
We let
[TABLE]
As 2ry+iy=ξminF(y), we have that
[TABLE]
Next, for each x∈X such that y→2x is an edge in GF, we will define a
set Setx(y) such that
[TABLE]
To this end, first we claim that
[TABLE]
Indeed, there are two cases.
If x is n-strict for some n∈N+, then choose a copy q of x that is a root in TF.
Then by (59) and (60),
there are distinct a1,…,ak∈U and b∈V such that
(ai,b)∈Gen(q), and so M,(ai,b)⊨treei(q), for all 1≤i≤n.
As ◊vc^xy
is a conjunct of treei(q) for every 1≤i≤n, (77) follows.
If x is not strict, then choose a copy q of x that is not a root in TF.
By (63), M,(a,b)⊨tree(q) for every (a,b)∈Gen(q).
As
◊vc^xy
is a conjunct of tree(q), again we have (77).
As y→2x is an edge in GF, c^xy implies
\Diamond_{\mathbf{v}}\bigl{(}\overline{\mathsf{y}}\land\neg\mathsf{c}^{{x}{y}}\land\Box_{\mathbf{h}}\mathsf{c}^{{x}{y}}\land\Diamond_{\mathbf{v}}(\overline{\mathsf{y}}\land\neg\mathsf{c}^{{x}{y}}\land\Box_{\mathbf{h}}\mathsf{c}^{{x}{y}})\bigr{)}
(see (46)).
So by (77), (64) and (71), for every a∈Set(x)
there are ba=ba′∈V such
that
[TABLE]
We let
[TABLE]
Clearly, if a1,a2∈Set(x) and a1=a2, then ba1, ba1′, ba2, and ba2′ are four distinct points, and so (76) follows, as required.
Finally, let
[TABLE]
Thus, by (75) and (76), respectively, we obtain that
[TABLE]
Claim** 28.4****.**
For all z,z′∈X∪Y, the following hold:
If z is n-strict for some n∈N+, then ∣Set(z)∣=n.
If z′→2z is an edge in GF, then ∣Set(z′)∣≥2⋅∣Set(z)∣.
If z and z′ are in the same strongly connected component of GF, then
∣Set(z)∣=∣Set(z′)∣.
∣Set(z)∣≥ξminF(z).
Proof.
Item (i) is by (66).
(ii):
If z,z′∈X∪Yb, then
there are nodes q and q′ in TF
such that q is a copy of z, q′ is a copy of z′, and q′→2q is an edge in TF.
So ∣Set(z′)∣≥2⋅∣Set(z)∣ follows by Claim 28.2 (i).
If z′=y∈Y−Yb and z=x∈X, then y→2x is an edge in GF. So
∣Set(y)∣≥2⋅∣Set(x)∣ follows by (81).
(iii):
Suppose z=z′. Then z,z′∈X∪Yb, and by Claim 24 (iii),
there is an undirected path P in HF between z and z′ such that all edges in the path are →1 edges.
We can break P up to a union of directed paths in HF (each of which has copies in the unravelling TF),
and then ∣Set(z)∣=∣Set(z′)∣ follows by (possibly repeated applications of) Claim 28.2 (ii).
(iv):
It is enough to show that for every strongly connected component S in GF,
[TABLE]
To this end, observe that for every S, we have νminF(S)∈N+
by (42), (26) and Claim 12.2, and so
by (24), (23) and (19),
[TABLE]
Therefore,
[TABLE]
Let z∗∈S be such that min(S)=min(z∗).
We prove (82) by induction on rank(S). Suppose rank(S)=0.
Then ξminF(z∗)=νminF(S)=min(S)=min(z∗).
There are two cases:
z∗ is n-strict for some n.
As ΓF has a solution, n=min(z∗) must hold by (84).
Thus,
∣Set(z∗)∣=min(z∗) by item (i),
and so (82) follows by item (iii).
z∗ is non-strict.
If z∗∈Y−Yb in S, then S={z∗} and so (82) follows by (80).
If z∗=x∗∈Xlb, then
y→x∗ holds for some y∈Yb. Also,
◊h+◊v+lower_boundFx∗ is a conjunct of lower_boundF, and so there exist b∈V and distinct
a1,…,amin(x∗)∈U such that
M,(ai,b)⊨◊v+¬cxy for every 1≤i≤min(x∗).
By Claim 28.1 (ii), ai∈Set(x∗) for every 1≤i≤min(x∗). Thus
∣Set(x∗)∣≥min(x∗)=νminF(S),
and so (82) follows by item (iii).
(The case when z∗∈(Ylb∩Yb) is similar.)
Now take some S with rank(S)>0, and suppose inductively that (82) holds for every S′ with rank(S′)<rank(S).
There are two cases: If νminF(S)=min(S), then (82) can be shown as in (a)–(b) above,
using z∗∈S,
Otherwise, by (83) there is S′ such that νminF(S)=2⋅νminF(S′) and
S⇒S′. Then there exist z1 in S and z2 in S′ such that z1∗→2z2
is an edge in GF. Therefore, by item (ii) and the IH, we have
∣Set(z1)∣≥2⋅∣Set(z2)∣≥2⋅νminF(S′)=νminF(S), and so
(82) follows by item (iii).
∎
Finally, we can complete the proof of Lemma 28:
Item (i): We claim that
[TABLE]
Indeed, there are three cases.
If y∈Yb, then b∈Set(q) for some copy q of y by (71).
So there is a such that (a,b)∈Gen(q) by (65), and so (85) follows
by (58).
If y∈Y−Yb and b∈Setx(y) for some x with y→x, then (85) follows
by (78) and (79).
If y∈Y−Yb and b∈Setlb(y), then (85) follows from (72)–(74)
and from the fact that y is a conjunct of each b^j∘(y) and b^s∙(y)
(see (53)–(54)).
Now suppose indirectly that y=y′∈Y and there is some b∈Set(y)∩Set(y′).
By (85), there are a and a′ such that
M,(a,b)⊨y and M,(a′,b)⊨y′, and so
M,(a,b)⊨□h+y and M,(a′,b)⊨¬y by (45), a contradiction.
(The case of x,x′∈X is similar, using the x variables.)
Item (ii) follows from Claim 28.1.
Item (iii):
Constraints of the form (z=n)∈ΓF
hold by Claim 28.4 (i).
Constraints of the form (z′≥2z)∈ΓF
hold by Claim 28.4 (ii).
Constraints of the form (z′≥z)∈ΓF
hold by Claim 28.4 (iii).
Finally, consider a constraint of the form (z≥k)∈ΓF, for some k∈N+.
As ∣Set(z)∣≥ξminF(z) by Claim 28.4 (iv), we have ∣Set(z)∣≥k by (26) and Claim 12.2.
∎
The consequent outF of the generalised Sahlqvist implication
We will use the positive formulas given in Lemma 28 (ii).
For every x∈X, we let
[TABLE]
Similarly,
for every y∈Yb, we let
[TABLE]
Then we let
[TABLE]
(see Example 32 below).
Using Lemma 26,
it is straightforward to check that square_badF is a generalised Sahlqvist formula.
Also, by Lemma 25, it is easy to see the following:
Lemma** 29****.**
It is decidable whether a bimodal formula is of the form square_badF for some finite grid of bi-clusters F=(X,Y,g)
for which every z∈X∪Y is bounded,
ΓF is defined and has a solution, but ∑y∈Yξ(y)=∑x∈Xξ(x) for any solution ξ of ΓF.
It is also decidable whether a bimodal formula is of the form square_badF for some finite grid of bi-clusters F=(X,Y,g)
for which every x∈X is bounded, there is some unbounded y⋆∈Y,
ΓF is defined and has a solution, but
∑x∈Xξ(x)<∑y∈Yξ(y) for every solution ξ of ΓF.
Lemma** 30****.**
square_badF* is not valid in F.*
Proof.
Let F1=F and take the model M on F1 from the proof of Lemma 27 satisfying solutionF.
It is easy to see that ¬outF is satisfied in M as well.
∎
Lemma** 31****.**
square_badF* is valid in every square product of difference frames.*
Proof.
Suppose M is a model on (U,=U)×(V,=V) for some U,V with ∣U∣=∣V∣>0, and
M,(u,v)⊨solutionF.
For every z∈X∪Y, take the set Set(z) from Lemma 28.
There are two cases:
If Yb=Y, then
Lemma 28 (i) and (iii) imply that
∑x∈X∣Set(x)∣=∑y∈Y∣Set(y)∣.
As ∣U∣=∣V∣,
either there is a∈U−⋃x∈XSet(x),
or there is b∈V−⋃y∈YSet(y).
If Y−Yb=∅, then
Lemmas 23 and 28 (i),(iii)
imply that
∑x∈X∣Set(x)∣<∑y∈Y∣Set(y)∣.
As ∣U∣=∣V∣, there is a∈U−⋃x∈XSet(x).
In both cases,
M,(u,v)⊨outF follows by Lemma 28 (ii).
∎
Note
that when X∪Y is finite and every z∈X∪Y is bounded (cf. case (i) in Lemma 23),
then it can happen that
∑x∈Xξ(x)=∑y∈Yξ(y) for any solution of ΓF, but
there are solutions ξ1 and ξ2 of ΓF
such that ∑x∈Xξ1(x)<∑y∈Yξ1(y) and ∑x∈Xξ2(x)>∑y∈Yξ2(y);
see Fig. 8 for an example.
Example** 32****.**
Take the square-bad grid of bi-clusters F in Fig. 7.
We describe the formulas upper_boundF, switchF, lower_boundF, and outF.
To begin with, we have X={x1,x2} and Yb={y1} (so F belongs to case (ii)(a) in Lemma 23).
Also, GF=HF has an isolated node x2 and two edges: y1→2x1 and y2→2x1. Thus,
the unravelling TF of the bounded part HFb of HF has two roots, y1 and x2 (both
are 6-strict), and one edge: y1→2x1. Therefore, we have:
[TABLE]
where
[TABLE]
We also have
[TABLE]
Next, we compute the solution ξminF of ΓF. Note that all strongly connected components in GF
are singletons, and so \xi_{\textit{min}}^{{\mathfrak{F}}}(z)=\nu_{\textit{min}}^{{\mathfrak{F}}}\bigl{(}\{z\}\bigr{)} for all z∈X∪Y. So we have:
[TABLE]
Thus, Xlb={x1}, Ylb=∅ and Ylb∪(Y−Yb)={y2}.
We choose Fx1y1 and Fx2y2 to ‘witness’ that
min(x1)=3 and min(y2)=4, respectively, and so we have
rx1=ix1=1, ry2=2, iy2=0, and
[TABLE]
where
[TABLE]
Finally, we have:
[TABLE]
7.2.2 X∪Y is infinite
If X∪Y is infinite then,
by Lemma 23 (iii), there are two cases. We suppose that
X is finite, every x∈X is bounded, Y is infinite, and
there is a finite subgrid F−=(X,Y−,g−) of F such that
[TABLE]
(The other case is similar.)
By (87) and the finiteness of
F−, the formula solutionF− is defined in §7.2.1.
For every x∈X, take the formula outF(x) from (86),
and let
[TABLE]
Then square_badF is clearly a generalised Sahlqvist formula.
An inspection of the proof of Lemma 23 shows that by
Lemma 29 we have the following:
Lemma** 33****.**
It is decidable whether a bimodal formula is of the form square_badF for some infinite grid of bi-clusters F=(X,Y,g)
for which X is finite, every x∈X is bounded, and
F−=(X,Y−,g−) is a finite subgrid of F such that ΓF− is defined and has a solution, but
∑x∈Xξ(x)<∑y∈Y−ξ(y) for every solution ξ of ΓF−.
Lemma** 34****.**
square_badF* is not valid in F.*
Proof.
As F− is a subgrid of F, the proof of Lemma 27 gives a model M on F satisfying solutionF−.
As the ‘X-coordinates’ of both F− and F are the same, it is easy to see that
¬◊h+⋀x∈XoutF(x)
is satisfied in M as well.
∎
Lemma** 35****.**
square_badF* is valid in every square product of difference frames.*
Proof.
Suppose M is a model on (U,=U)×(V,=V) for some U,V with ∣U∣=∣V∣>0, and
M,(u,v)⊨solutionF−.
For every z∈X∪Y−, take the set Set(z) from Lemma 28.
By Lemma 28 (i),(iii), we have ∑x∈X∣Set(x)∣<∑y∈Y−∣Set(y)∣.
As ∣U∣=∣V∣, there is a∈U−⋃x∈XSet(x), and so
M,(u,v)⊨outF follows by Lemma 28 (ii).
∎
7.3 Infinite Sahlqvist axiomatisation for Diff×sqDiff
Though in general generalised Sahlqvist formulas are more expressive than Sahlqvist
formulas [16], there are special settings when their axiomatic powers coincide [15].
Our bimodal language only has two monadic modalities ◊h and ◊v.
So our generalised Sahlqvist formulas (as defined in §3.2.1 above) are special cases of the PCFs
of [15] (and of the inductive formulas of [16]). The modalities ◊h and ◊v are
self-reversive
in the sense that the formulas p→□h◊hp and p→□v◊vp belong to Diff×sqDiff
(by (2), (9) and (10)).
Therefore, it follows from [15, Thm. 4.10] that
there is an infinite axiomatisation for Diff×sqDiff consisting of Sahlqvist formulas. Moreover,
the Sahlqvist axioms can be obtained algorithmically from
the generalised Sahlqvist formulas they are axiomatically equivalent with.
8 Discussion
We have shown that the 2D product logic Diff×Diff is non-finitely axiomatisable, and also given an
infinite set ΣDiff×Diff of Sahlqvist formulas axiomatising Diff×Diff.
We have also proved that its ‘square’ version Diff×sqDiff (the modal counterpart
of two-variable substitution and equality free first-order logic with counting to 2) is
non-finitely axiomatisable over Diff×Diff, but can be axiomatised by adding infinitely
many Sahlqvist axioms to Diff×Diff.
Here are some related issues and open problems:
-
The two-player p-morphism game we defined for bi-clusters in the proof of Lemma 9
can easily be generalised to arbitrary countable grids of bi-clusters F such that the analogue of
Claim 9.1 still holds for the game G(F). (Algebraically, this is the complete representation
game à la Hirsch and Hodkinson [19], for subdirectly irreducible atomic diagonal-free strict-cylindric algebras.)
So by Theorem 3, F validates the Sahqvist axioms in ΣDiff×Diff iff
player ∃ has a winning strategy in G(F).
However, the precise connection between particular plays of G(F) and the axioms is not clear.
2. 2.
One might also consider the ‘lopsided’ product logics S5×Diff and
[TABLE]
S5×Diff is not finitely axiomatisable by Theorem 10, and
a proof very similar to that of Theorem 2 shows that S5×sqDiff is not finitely axiomatisable over S5×Diff.
Further, using the proof pattern in §2.2, it is easy to show that S5×Diff and S5×sqDiff
are axiomatisable by adding the Sahlqvist axiom □hp→p (expressing that
Rh is reflexive) to Diff×Diff and Diff×sqDiff, respectively.
However, much simpler axioms for these logics can be obtained by actually repeating the proofs of
Theorems 3 and 4, and using that grids of bi-clusters are much simpler in these cases.
In particular, it can be shown directly (without using the algorithm of [15]) that S5×sqDiff is Sahlqvist axiomatisable:
In case of grids of bi-clusters F with reflexive Rh, there are only ‘local’ reasons for F not being the
p-morphic image of a product of a universal and a difference frame. Thus, switch bi-clusters play no role in an axiomatisation,
and so there is no need for switchF-like conjuncts in the antecedents of the axioms.
3. 3.
Both Diff×Diff and Diff×sqDiff are elementarily generated modal logics
(by Corollary 7 or Theorem 3, and Theorem 4, respectively).
Hodkinson [23] ‘synthesises’ modal axioms for such logics from the first-order defining formulas via hybrid logic formulas.
It would be interesting to consider the connections between our axioms and the axioms obtained
in this way.
Note that we did not use (or even compute) the first-order correspondents of our axioms.
4. 4.
Hirsch and Hodkinson [20, 21] give an explicit infinite axiomatisation for (the algebraic counterpart of) the n-dimensional product logic S5n, for any n∈N+. The axioms are obtained by first expressing ‘universally’ the winning strategy for ∃ in a two-player ‘representation’ game, and then turning these ‘universal expressions’ to modal formulas by using that there is a universal modality in S5n-frames.
By the negative results of [24, 3],
infinitely many of these axioms cannot be Sahlqvist/canonical whenever n≥3.
It is easy to see that the method of [20] can also be used to give
an explicit infinite axiomatisation for Diffn, for any n∈N+, so in particular for Diff×Diff.
As S5n is finitely axiomatisable over Diffn by [31, Thm. 2.14], infinitely many of the axioms obtained by the method of [20] cannot be Sahlqvist/canonical whenever n≥3.
But what about the n=2 case?
Are the axioms obtained for Diff×Diff this way Sahlqvist/canonical?
5. 5.
Our axiomatisations are connected to solutions of some special kinds of
integer programming problems.
It would be interesting to understand these connections further, and possibly use
some known integer programming solver methods in order to find simpler axioms.
Note that
Pratt-Hartmann [35] also connects the type-structures of two-variable first-order logic with counting to integer programming.
6. 6.
Here we considered the axiomatisation problem for the modal counterpart of two-variable first-order
logic with counting to 2 only, and without equality and substitution/transposition of variables.
It would be interesting to get closer to the full two-variable fragment with counting, and
study richer languages that contain (some of the) modal operators
‘simulating’ these missing features (that is, cylindric and (quasi-) polyadic algebras with ‘graded’
cylindrifications corresponding to counting quantifiers); see [18, 38, 33, 7].
Acknowledgements
Sérgio Marcelino’s research was funded by FCT/MCTES through national funds and when applicable co-funded EU funds under the project UIDB/EEA/50008/2020. We thank the anonymous referees for their careful reading and expert criticism of the manuscript.