FO = FO3 for linear orders with monotone binary relations
Marie Fortin

TL;DR
This paper proves that monadic first-order logic over linear orders with certain monotone binary relations has the three-variable property, extending known results and providing a new proof via translation to star-free PDL.
Contribution
It generalizes the three-variable property to a broader class of linear orders with monotone relations and introduces a novel proof method using translation to star-free PDL.
Findings
Monadic first-order logic has the three-variable property over these classes.
The proof involves translating formulas into star-free PDL.
The results extend known cases and answer open questions.
Abstract
We show that over the class of linear orders with additional binary relations satisfying some monotonicity conditions, monadic first-order logic has the three-variable property. This generalizes (and gives a new proof of) several known results, including the fact that monadic first-order logic has the three-variable property over linear orders, as well as over (R,<,+1), and answers some open questions mentioned in a paper from Antonopoulos, Hunter, Raza and Worrell [FoSSaCS 2015]. Our proof is based on a translation of monadic first-order logic formulas into formulas of a star-free variant of Propositional Dynamic Logic, which are in turn easily expressible in monadic first-order logic with three variables.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Algebra and Logic · Logic, Reasoning, and Knowledge
LSV, CNRS & ENS Paris-Saclay, Université [email protected]\CopyrightMarie Fortin\ccsdesc[100]Theory of computation Logic\hideLIPIcs
for linear orders with monotone binary relations
Marie Fortin
Abstract
We show that over the class of linear orders with additional binary relations satisfying some monotonicity conditions, monadic first-order logic has the three-variable property. This generalizes (and gives a new proof of) several known results, including the fact that monadic first-order logic has the three-variable property over linear orders, as well as over , and answers some open questions mentioned in a paper from Antonopoulos, Hunter, Raza and Worrell [FoSSaCS 2015]. Our proof is based on a translation of monadic first-order logic formulas into formulas of a star-free variant of Propositional Dynamic Logic, which are in turn easily expressible in monadic first-order logic with three variables.
keywords:
first-order logic, three-variable property, propositional dynamic logic
1 Introduction
Logics with a bounded number of variables have been extensively studied, in particular in the context of descriptive complexity [17, 18, 10, 21] and temporal logics [20, 7, 14, 16]. One recurring question of interest [7, 25, 19, 4, 26, 1] is to determine, in a given class of structures, whether all properties expressible in monadic first-order logic (FO) can be defined in the fragment consisting of formulas which use at most variables. (A same variable may be quantified over several times in a formula.) In fact, several non-equivalent versions of this question appear in the literature, many of which are compared in [15]. We say that has the -variable property if every formula of FO with at most free variables is equivalent over to a formula of . Note that this is strictly stronger than requiring that all sentences (without free variables) of FO are equivalent to some formulas. Indeed, Hodkinson and Simon gave an example of a class of structures where no sentence requires more than 3 variables, but which does not have the -variable property for any [15].
The problem of whether a given class of structures has the -variable property is closely related to the question of the existence of an expressively complete temporal logic (with a finite set of FO-definable modalities). A temporal logic is called expressively complete if any first-order formula with a single free variable can be expressed in it. For instance, it is well-known that linear temporal logic (LTL) over Dedekind-complete time flows, or its extension with Stavi connectives over all time flows, are expressively complete for first-order logic [20, 8]. More recently, it was shown that over the real numbers equipped with binary relations for all , metric temporal logic (MTL) is expressively complete [16]. However, the questions of having the -variable property for some or admitting an expressively complete temporal logic are incomparable in general: there exist a class of structures which admits a finite expressively complete set of temporal connectives but which does not have the -variable property for any [15], and one which has the -variable property but for which no temporal logic is expressively complete [14]. However, Gabbay established that having the -variable property implies the existence of a multi-dimensional expressively complete temporal logic, with multiple reference points [7].
Another classical approach to proving or disproving that a class of structures has the -variable property is through Ehrenfeucht-Fraïssé games, with a bounded number of pebbles [12, 25, 19, 1]. This was applied by Immerman and Kozen to linear orders and bounded-degree trees [19], and by Antonopoulos et al. to real-time signals [1].
Natural candidates for classes which might have the -variable property are classes of linearly ordered structures. Indeed, a typical counter-example to unrestricted structures having the -variable property is a formula such as “there exists distinct elements which satisfy some predicate ”. It is in general not expressible in , but it is easily expressible in if all models are equipped with a linear order . For instance for , we take the formula . As mentioned before, Immerman and Kozen showed that the class of linear orders has the -variable property [19]. However, adding a single binary relation suffices to obtain a class of linearly ordered structures which does not have the -variable property for any . Venema gave an example of a dense linear order with a single equivalence relation which does not have the -variable property for any [29]; this was adapted in [1] to give another example where the equivalence relation is replaced with a bijection. In fact, even for finite linear orders, Rossman [26] proved that the class of linearly ordered graphs does not have the -variable property for any , resolving a problem which had been open for more than 25 years [17]. Therefore, adding binary relations to linear orders while keeping the -variable property requires some restrictions on the interpretation of the relation symbols.
On the positive side, Antonopoulos et al. proved that the class of structures over (or signals) has the -variable property [1]. Such structures have been studied in the context of real-time verification. As a corollary, they also showed that has the -variable property for any linear function .
Contribution.
We consider the class of linearly ordered structures with an additional (finite or infinite) number of binary interval-preserving relations. These are binary relations such that, for all intervals , any point which is in between two points of and has a preimage by must have one in . (We also require a symmetric condition of the converse relation .) We show that FO over this class of structures also has the -variable property.
This generalizes results from [19] and [1] described above. Moreover, this answers some open questions mentioned in the conclusion of [1], which asked if the result could be extended from linear functions to polynomials over the reals, or other linear orders and families of monotone functions. In fact, all increasing or decreasing partial functions (over arbitrary linear orders) are special cases of interval-preserving relations, and thus covered by our result.
Our proof relies on different techniques than [19, 1], which were based on Ehrenfeucht-Fraïssé games. We give an effective translation from FO to which goes through a star-free variant of Propositional Dynamic Logic (PDL) with converse. Propositional dynamic logic was introduced by Fischer and Ladner [6] to reason about program schemes, and has now found a large range of applications in artificial intelligence and verification [11, 5, 23, 22, 9]. It combines local formulas containing modal operators, and path formulas using the concatenation, union and Kleene star operations. Several extensions have been studied, including PDL with converse [27], intersection [3], or negation of atomic programs [24]. The particular star-free variant of PDL we use here is in fact very similar to Tarski’s relation algebras [28], which was used as a basis for formalizing set theory. It also corresponds to a two-dimensional temporal logic in the sense of Gabbay [7].
We applied similar proof techniques in [2], where we introduced a star-free variant of PDL and proved that it is equivalent to FO over message sequence charts (MSCs) (and thus obtained a 3-variable property result for MSCs as a corollary). MSCs are discrete partial orders which represent behaviors of concurrent message passing systems. They consist of a fixed, finite number of linear orders called process orders (one for each process in the system), together with FIFO binary message relations connecting matching send and receive actions. Having a (fixed) finite number of total orders instead of a single one is not an important difference, as we could always put them one after the other to extend them into a single linear order. FIFO relations are a special case of interval-preserving relations, thus the result of the present paper can in fact be seen as a strict generalization of our previous result in [2]. More importantly, a major difference between MSCs studied in [2] and the setting we consider here is that MSCs are discrete structures, whereas here we allow arbitrary linear orders. In fact, [2] relied on the definition of formulas describing the minimum or the maximum of some binary relations. As such, it is interesting to see that the same kind of techniques can still be applied to a priori very different linear orders.
Outline.
In Section 2, we introduce interval-preserving relations and monadic first-order logic. In Section 3, we define star-free PDL, and prove some properties of its formulas. In Section 4, we give an effective translation from FO to star-free PDL, and explain its consequences. We conclude in Section 5.
2 Interval-preserving relations and first-order logic
In this section, we define the class of structures covered by our results, and recall the syntax of first-order logic.
Interval-preserving binary relations.
Let be a binary relation between sets and . We write if , and . For a subset , we also write . We define the converse of a relation as , and the composition of two binary relations and as . Finally, we write for the complement of . Note that we have the following identities:
[TABLE]
A linear order over a set is a reflexive, transitive and antisymmetric relation such that for all , we have or . Let be a linearly ordered set. For , we will also denote by the restriction of to , so that is still a linearly ordered set. Moreover, for , we write if for all , , and if for all , .
An interval of is a set such that for all with , we have . For , we denote by the interval , and similarly for the intervals , , . We call a relation between two linearly ordered sets and interval-preserving if:
- •
For all intervals of , is an interval of .
- •
For all intervals of , is an interval of .
In other terms, for all and with , for all , if there exists some such that , then there exists one in (cf. Figure 1). Note that we do not require that all elements between and are in , but only those which are in the image of . The second condition is symmetric: for all and with , for all , if there exists some such that , then there exists one in .
Example 2.1**.**
For any linear order and partial function , if is increasing or decreasing then the relation is interval-preserving.
As another example, consider a temporal structure over a set of atomic propositions AP, where indicates the set of propositions which are true at a given point. For , we let . Then is interval-preserving.
The following lemma states some simple closure properties of interval-preserving relations.
Lemma 2.2**.**
Let , , be linearly ordered sets.
For all interval-preserving relation , is interval-preserving. 2. 2.
For all interval-preserving relations , is interval-preserving. 3. 3.
For all interval-preserving relations and , is interval-preserving.
Proof 2.3**.**
Part 1 follows from the fact that .
Let us prove 2. Since , by symmetry, it suffices to prove that for all interval of , is an interval of . Let and such that and for some . If , then we are done. Otherwise, suppose for instance that (the other cases are similar). Since is interval-preserving, there exists such that . Then, since and is an interval of , we obtain . Similarly, . Hence .
Let us show that 2 implies 3. Again, by symmetry, it suffices to prove that for all interval of , is an interval of . Let denote the relation . It is an interval-preserving relation between and . Moreover, we have . Now, let be some interval of , and . Then is an interval of . Moreover, since is interval-preserving, we have , hence
[TABLE]
Then, according to 2, is an interval of , i.e., an interval of .
Models.
Let be an infinite set of monadic predicates, and be a finite or infinite set of binary relation symbols. Throughout the paper, will denote a structure where is a linear order over , is an interval-preserving relation for all , and for all .
Monadic first-order logic.
We assume an infinite supply of variables . The set of monadic first-order logic formulas over is defined as follows:
[TABLE]
We assume that all formulas are interpreted over structures defined as above. Given an formula , we denote by its set of free variables. We define the satisfaction relation as usual, where and is an interpretation of the free variables of . We say that two formulas are equivalent, written , if for all and , we have if and only if .
For , we denote by the set of first-order formulas with at most variables. Note that a same variable may be quantified over several times in the formula.
Example 2.4**.**
Let be a polynomial function, and its local extrema (we suppose that ). Fix . For convenience, we will write instead of in formulas. We focus on models of the form where is the usual ordering of the reals, and . Let us describe an formula such that for all and , we have if and only if . First, we write for the formula
[TABLE]
We can then define formulas and which state that is a local minimum (resp. maximum) of , for instance:
[TABLE]
The formula then states that there exists at least local extrema before , alternating existential quantifications over and to identify them; for instance, is the formula
[TABLE]
3 Star-free Propositional Dynamic Logic
Star-free Propositional Dynamic Logic.
Propositional dynamic logic (PDL) [6] consists of two sorts of formulas: state formulas which are evaluated at single elements, and path formulas which are evaluated at pairs of elements and allow to navigate inside the model. Here we consider a star-free variant of PDL (with converse). The syntax of star-free propositional dynamic logic over , written , is given below:
[TABLE]
where and .
Compared to classical PDL, star-free PDL uses the operators of star-free expressions, instead of the rational operators .
Let . The semantics or of a state or path formula in is defined below. The state formula is true at a point in (that is, ) if there exists some such that satisfies and is true at . The path formula is stationary and tests if the state formula is true. The semantics of other formulas is straightforward:
[TABLE]
For simplicity, we will often write or instead of and . We also write if , and if .
We will use the abbreviations , , , , and . For all formulas , we also define a state formula which holds at if and only if .
Example 3.1**.**
Suppose that , and that we consider only models over and with . Let and . The formula of metric temporal logic, which holds at time if there exists such that and for all , , can be expressed in as follows:
[TABLE]
An interval-preserving fragment of star-free PDL.
We say that a path formula is interval-preserving if for all , is interval-preserving. Notice that and (for all ) are interval-preserving. By Lemma 2.2 (and assumption on ), all formulas constructed without the boolean operators and are interval-preserving. However, the complement or the union of interval-preserving relations are not in general interval-preserving. We define below a fragment of where all path formulas are interval-preserving, and which will turn out to be as expressive as (and in fact, ) when it comes to state formulas. To do so, we will introduce several restrictions of which will turn out to be interval-preserving, and which will suffice to characterize .
Let us first look at the different reasons for which we may have , assuming that is interval-preserving. To begin with, we focus on . One first sufficient condition for having is that . Now, suppose . If is interval-preserving, there are only three possible cases in which : , or , or . We define formulas and corresponding respectively to the first two cases. We let
[TABLE]
Now, if we look at instead of , we can make the same observations, by symmetry: we have if and only if , and if is interval-preserving, there are again only four possible cases: , or , or , or .
Unfortunately, the formulas and are still not interval-preserving in general. However, if we take a more symmetric restriction of , where we look at all the possible positions of and relatively to and , we obtain four cases, illustrated in Figure 2, which we will later show correspond to interval-preserving restrictions of .
More precisely, let
[TABLE]
Notice that .
Let be the following restriction of :
[TABLE]
Lemma 3.2**.**
All formulas are interval-preserving.
Proof 3.3**.**
We proceed by induction on the formula. By assumption, is interval-preserving for all . Moreover, and are interval-preserving. For , and , we apply Lemma 2.2.
Suppose that is interval-preserving. Let us show that is interval-preserving. Notice that . So we only need to show that for all intervals , for all and such that , there exists such that . Let such that . Let us show that we can in fact take . The proof is illustrated in the picture below.
I$$J$$a_{2}$$b_{2}$$b$$b_{1}$$\llbracket{\pi}\rrbracket(a_{2})$$<$$c$$c_{2}$$\pi^{-1}$$\pi^{-1}$$\leq$$\leq$$\leq$$<
First, we have . Moreover, (since ). Now, suppose towards a contradiction that . Let such that . Since , there exists such that . We then have and . Since is interval-preserving, we obtain , a contradiction with the fact that . Thus, .
Let us show that is also interval-preserving. Similarly to the previous case, we show that for all and such that , we have .
I$$J$$a_{2}$$b_{2}$$b$$b_{1}$$\llbracket{\pi}\rrbracket(a_{2})$$<$$c_{2}$$c$$\leq$$\leq$$\leq$$<
First, , and . Suppose towards a contradiction that . Let such that , and . We have , and . Since is interval-preserving, we obtain , a contradiction with the fact that . Symmetrically, let be an interval, , and such that . Then for any such that , we also have , hence .
Since , this also implies that is interval-preserving.
Finally, the case of is symmetric to the case of : for all and such that , we have .
4 Star-free PDL is expressively equivalent to FO
Let be a state formula in , and an formula with a single free variable . We say that and are equivalent, written , if for all and elements in , we have if and only if . Similarly, for a path formula and an formula with exactly two free variables and , we write if for all and elements in , we have if and only if .
From to .
An easy induction shows that any formula in can be translated into an formula which uses at most three distinct variables:
Lemma 4.1**.**
For every state formula , there exists a formula such that . For every path formula , there exists a formula such that .
For the other direction, we will see that the fragment of defined below will be sufficient:
[TABLE]
This fragment is a restriction of , where the intersection is only used for formulas.
From to .
The main result of the paper is an effective translation of formulas into positive boolean combinations of formulas in :
Theorem 4.2**.**
Every formula with at least one free variable is equivalent to a positive boolean combination of formulas of the form , where and .
Note that the equivalent formula may also contain subformulas of the form .
Before proving Theorem 4.2, we state some of its consequences.
Corollary 4.3**.**
Every formula with a single free variable is equivalent to some state formula. Every formula with two free variables is equivalent to some path formula.
Proof 4.4**.**
If has a single free variable , it is equivalent to a positive boolean combination of formulas of the form , which are themselves equivalent to the formulas . The combination of these formulas is then a state formula of .
If has two free variables and , we obtain an equivalent positive boolean combination of formulas of the form , , , or . We can replace any subformula with , and any subformula with , where and , and similarly for formulas . We obtain an equivalent positive boolean combination of formulas of the form . Since allows union and intersection of path formulas, this is equivalent to a formula.
Another consequence is that over linear orders with interval-preserving relations has the three-variable property. More precisely:
Theorem 4.5**.**
Any formula is equivalent to a boolean combination of formulas in .
This also allows us to answer an open question from [1], namely, whether structures over the real numbers with polynomial functions have the -variable property. Suppose that is a set of polynomials . Let , where is the usual ordering of the real numbers, and for all . Given an interpretation of the monadic predicates, we denote by the structure . We say that two formulas are equivalent over , written , if for all and , we have if and only if .
Theorem 4.6**.**
For all , there exists a boolean combination of formulas in such that .
Proof 4.7**.**
Let , and its local extrema. We denote by , the (monotone) restrictions of to intervals delimitated by its local extrema, and the set of these partial functions. Let . As above, we let , where is the usual ordering of the real numbers, and . Note that is interval-preserving (cf. Example 2.1). We say that two formulas and are equivalent, written , when for all and , we have if and only if .
Let . The formula obtained by replacing each atomic formula by is equivalent to . Applying Theorem 4.5 to , we obtain another formula such that and is a boolean combination of formulas in .
Following Example 2.4, one can construct for each a formula “” of such that if and only if . Consider now the formula obtained by replacing each atomic formula in by . Then , hence . Moreover, is a boolean combination of formulas in .
The remainder of the section is devoted to the proof of Theorem 4.2.
Eliminating negations.
The fact that all path formulas are interval-preserving gives us a simple characterization of the complement of a path formula: we show below that an event is in if it is to the left or to the right of all elements of , or if it does not satisfy . We can then show that the complement of a path formula in is equivalent to a union of path formulas in . This will allow us to deal with negation in the translation from to .
Lemma 4.8**.**
For all path formulas , is equivalent to a union of formulas.
Proof 4.9**.**
We show that
[TABLE]
We denote by the right-hand-side formula. First, for all such that or , we have and . Now, suppose that and . We have if and only if . Clearly, if , then . Conversely, let us show that if then . In that case, we have either for some , or for some . Since is interval-preserving, we obtain .
Existential quantification.
The elimination of existential quantifiers relies on the simple lemma below:
Lemma 4.10**.**
Let be a linearly ordered set, and intervals of . Then if and only if for all , .
Proof 4.11**.**
We show that there exists and such that , which implies the result. We define relations and over which, intuitively, compare respectively the left and right bounds of the intervals:
[TABLE]
It is easy to check that and are transitive, an that for all and , we have or (or both), and similarly for . Thus, there exists such that for all , and such that for all . Then for all , for all , there exists such that . Since is an interval, we obtain . Hence .
The next lemma follows from an application of Lemma 4.10 to intervals of the form .
Lemma 4.12**.**
Let . For all path formulas and all state formulas in , the formula
[TABLE]
is equivalent to a positive boolean combination of formulas of the form , with and .
Proof 4.13**.**
Let , and
[TABLE]
Notice that .
Let us show that . Let , and . For all , let . Let us show that is an interval of . First, since is interval-preserving, is an interval of . Thus, is an interval of . But since , this is simply . Besides, it is easy to verify that
[TABLE]
Applying Lemma 4.10, we obtain
[TABLE]
Translation from to .
We are now ready to give the proof of Theorem 4.2.
Proof 4.14** (Proof of Theorem 4.2).**
We assume that is in prenex normal form, and prove the result by induction. The translation of atomic formulas or is straightforward; moreover, , and . Using Lemma 4.8 to eliminate negations, we obtain the result for all quantifier-free formulas.
The case reduces to the case of existential quantification, applying again Lemma 4.8 to eliminate negations.
We are left with the case . If is not free in , then (since has at least one free variable) and we are done by induction. Otherwise, assume that with and . By induction, is equivalent to a positive boolean combination of formulas of the form with . We replace with whenever , and bring the resulting formula into disjunctive normal form. Each conjunct is then of the form , where uses only variables from , with for some , and . Note that , where . Then is equivalent to a finite disjunction of formulas
[TABLE]
with and as above. If is empty, then we replace with the formula
[TABLE]
Otherwise, we apply Lemma 4.12 to . In all cases, we obtain an equivalent formula which is a positive boolean combination of formulas with and .
Remark 4.15**.**
Without the assumption that all atomic binary relations are interval-preserving, is still equivalent to . Indeed, in the proof of Theorem 4.2, the assumption that all atomic binary relations are interval-preserving is only used in Lemmas 4.8 and 4.12. But this assumption is not needed in the proof of Lemma 4.12 if uses only three variables , and . Indeed, we then have , where is the intersection of all such that , and is the intersection of all such that . Moreover, Lemma 4.8 is no longer needed if we translate an formula into a positive boolean combination of formulas, since allows to take the complement of a path formula. Note that the equivalence with is already proven in [28] (for the calculus of relations).
5 Conclusion
We proved that every formula over linear orders with interval-preserving binary relations can be translated into an equivalent positive boolean combination of path formulas in . In particular, any formula is equivalent to a boolean combination of formulas in , which shows that has the three-variable property. This generalizes several known results.
It would be interesting to see if the equivalence between and can be used as an intermediate step to prove that a temporal logic is expressively complete. It is not the case in general, since [13] provides an example of a class of structures which fits our assumptions but does not admit any expressively complete temporal logic. However, the equivalence could still be useful in more restricted settings.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Timos Antonopoulos, Paul Hunter, Shahab Raza, and James Worrell. Three variables suffice for real-time logic. In Fo S Sa CS 2015 , volume 9034 of LNCS , pages 361–374. Springer, 2015.
- 2[2] Benedikt Bollig, Marie Fortin, and Paul Gastin. It is easy to be wise after the event: Communicating finite-state machines capture first-order logic with "happened before". In CONCUR 2018 , volume 118 of LIP Ics , pages 7:1–7:17. Schloss Dagstuhl - Leibniz-Zentrum fuer Informatik, 2018.
- 3[3] Ryszard Danecki. Nondeterministic propositional dynamic logic with intersection is decidable. In Computation Theory , pages 34–53. Springer Berlin Heidelberg, 1985.
- 4[4] Anuj Dawar. How many first-order variables are needed on finite ordered structures? In We Will Show Them! (1) , pages 489–520. College Publications, 2005.
- 5[5] G. De Giacomo and M. Lenzerini. Boosting the correspondence between description logics and propositional dynamic logics. In Proceedings of the 12th National Conference on Artificial Intelligence, Seattle, WA, USA, July 31 - August 4, 1994, Volume 1. , pages 205–212. AAAI Press / The MIT Press, 1994.
- 6[6] M. J. Fischer and R. E. Ladner. Propositional Dynamic Logic of regular programs. Journal of Computer and System Sciences , 18(2):194–211, 1979.
- 7[7] D. M. Gabbay. Expressive functional completeness in tense logic. In Uwe Mönnich, editor, Aspects of Philosophical Logic: Some Logical Forays into Central Notions of Linguistics and Philosophy , pages 91–117. Springer Netherlands, Dordrecht, 1981.
- 8[8] D. M. Gabbay, I. M. Hodkinson, and M. A. Reynolds. Temporal expressive completeness in the presence of gaps , volume Volume 2 of Lecture Notes in Logic , pages 89–121. Springer-Verlag, 1993.
