A Modal Characterization Theorem for a Probabilistic Fuzzy Description Logic
Paul Wild, Lutz Schr\"oder, Dirk Pattinson, Barbara K\"onig

TL;DR
This paper establishes a probabilistic analogue of the van Benthem theorem for fuzzy description logic, characterizing its expressive power through invariance under probabilistic bisimilarity and behavioral distance.
Contribution
It proves that non-expansive probabilistic fuzzy first-order formulas can be approximated by bounded rank concepts in probabilistic fuzzy description logic, extending classical modal logic results.
Findings
Probabilistic fuzzy description logic is invariant under probabilistic bisimilarity.
Non-expansive formulas can be approximated by bounded rank concepts.
The paper provides a probabilistic van Benthem theorem analogue.
Abstract
The fuzzy modality `probably` is interpreted over probabilistic type spaces by taking expected truth values. The arising probabilistic fuzzy description logic is invariant under probabilistic bisimilarity; more informatively, it is non-expansive wrt. a suitable notion of behavioural distance. In the present paper, we provide a characterization of the expressive power of this logic based on this observation: We prove a probabilistic analogue of the classical van Benthem theorem, which states that modal logic is precisely the bisimulation-invariant fragment of first-order logic. Specifically, we show that every formula in probabilistic fuzzy first-order logic that is non-expansive wrt. behavioural distance can be approximated by concepts of bounded rank in probabilistic fuzzy description logic. For a modal logic perspective on the same result, see arXiv:1810.04722.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Modal Characterization Theorem for a Probabilistic Fuzzy
Description Logic
Paul Wild1
Lutz Schröder1
Dirk Pattinson2
Barbara König3 1Friedrich-Alexander-Universität Erlangen-Nürnberg
2Australian National University, Canberra
3Universität Duisburg-Essen
Abstract
The fuzzy modality probably is interpreted over probabilistic type spaces by taking expected truth values. The arising probabilistic fuzzy description logic is invariant under probabilistic bisimilarity; more informatively, it is non-expansive wrt. a suitable notion of behavioural distance. In the present paper, we provide a characterization of the expressive power of this logic based on this observation: We prove a probabilistic analogue of the classical van Benthem theorem, which states that modal logic is precisely the bisimulation-invariant fragment of first-order logic. Specifically, we show that every formula in probabilistic fuzzy first-order logic that is non-expansive wrt. behavioural distance can be approximated by concepts of bounded rank in probabilistic fuzzy description logic.
For a modal logic perspective on the same result, see Wild et al. (2018b).
1 Introduction
In the representation of uncertain knowledge, one will often wish to avoid mention of exact numerical probabilities, e.g. when these are not precisely known or not relevant to the representation task at hand – as a typical example, a medical practitioner will rarely name a numerical threshold for the likelihood of a diagnosis, and instead qualify the diagnosis as, say, ‘suspected’ or ‘probable’. This has led to efforts aimed at formalizing a modality probably, used alternatively to modalities ‘with probability at least ’ Larsen and Skou (1991); Heifetz and Mongin (2001); Lutz and Schröder (2010). Such a formalization may be approached in a two-valued setting via qualitative axiomatizations of likelihood Burgess (1969); Halpern and Rabin (1987) or via threshold probabilities Hamblin (1959); Herzig (2003). In a fuzzy setting, ‘probably’ leads a natural life as a fuzzy modality P, whose truth value just increases as its argument becomes more probable (this modality thus connects the otherwise well-distinguished worlds of fuzziness and probability Lukasiewicz and Straccia (2008)). The semantics of this operator, first defined by Zadeh Zadeh (1968), interprets as the expected truth value of . It appears in various fuzzy propositional Hájek (2007); Flaminio and Godo (2007), modal Desharnais et al. (1999); van Breugel and Worrell (2005), fixpoint Kozen (1985); Huth and Kwiatkowska (1997), and description logics Schröder and Pattinson (2011).
In the present paper, we pin down the exact expressiveness of the basic description logic of probably, which we briefly refer to as probabilistic fuzzy or , within a natural ambient probabilistic fuzzy first-order logic , by providing a modal characterization theorem. The prototype of such characterization theorems is van Benthem’s theorem van Benthem (1976), which states that (classical) modal logic is precisely the bisimulation-invariant fragment of first-order logic. It has been noted that in systems with numerical values, behavioural pseudometrics offer a more fine-grained measure of equivalence than two-valued bisimilarity Giacalone et al. (1990); Desharnais et al. (1999); van Breugel and Worrell (2005); Desharnais et al. (2008); Baldan et al. (2014). When propositional connectives are equipped with Zadeh semantics, is non-expansive wrt. behavioural distance; we continue to refer to this property as bisimulation invariance. In previous work Wild et al. (2018a) we have shown that relational fuzzy modal logic is the bisimulation-invariant fragment of fuzzy FOL, more precisely that every bisimulation-invariant fuzzy FO formula can be approximated by fuzzy modal formulae of bounded rank. The bound on the rank is key; without it, the statement turns into a form of the (much simpler) Hennessy-Milner theorem Hennessy and Milner (1985) (which classically states that non-bisimilar states in finitely branching systems can be distinguished by modal formulae), and indeed does not need to assume FO definability of the given bisimulation-invariant property van Breugel and Worrell (2005). Here, we establish a corresponding result for the rather more involved probabilistic setting: We show that every bisimulation-invariant formula in probabilistic fuzzy FOL can be approximated in bounded rank in probabilistic fuzzy . This means not only that, up to approximation, is as powerful as on bisimulation-invariant properties, but also that provides effective syntax for bisimulation-invariant , which itself does not Otto (2006).
Proofs are mostly omitted or only sketched; full proofs are in the appendix.
Related Work
There is widespread interest in modal characterization theorems in modal logic Dawar and Otto (2005), database theory Figueira et al. (2015), concurrency Janin and Walukiewicz (1995); Carreiro (2015), and AI Sturm and Wolter (2001); Wild and Schröder (2017); Wild et al. (2018a). The overall structure of our proof builds partly on that of our modal characterization theorem for relational fuzzy modal logic Wild et al. (2018a) (in turn based ultimately on a strategy due to Otto Otto (2004)) but deals with a much more involved logic, which instead of just the lattice structure of the unit interval involves its full arithmetic structure, via the use of probabilities and expected values, necessitating, e.g., the use of Kantorovich-Rubinstein duality. Notable contributions of our proof include new forms of probabilistic bisimulation games up-to- (different from games introduced by Desharnais et al. Desharnais et al. (2008), which characterize a different metric) and Ehrenfeucht-Fraïssé games, related to two-valued games considered in the context of topological FOL Makowsky and Ziegler (1980). (For lack of space, we omit discussion of quantitative Hennessy-Milner type results beyond the mentioned result by van Breugel and Worrell van Breugel and Worrell (2005).)
may be seen as a fuzzy variant of Halpern’s Halpern (1990) type-1 (i.e. statistical) two-valued probabilistic FOL, and uses a syntax related to coalgebraic predicate logic Litak et al. (2018) and, ultimately, Chang’s modal predicate logic Chang (1973). Van-Benthem style theorems for two-valued coalgebraic modal logic Schröder et al. (2017) instantiate to two-valued probabilistic modal logic, then establishing expressibility of bisimulation-invariant probabilistic FO formulae by probabilistic modal formulae with infinite conjunction but of bounded rank, in an apparent analogy to bounded-rank approximation in the fuzzy setting.
2 Fuzzy Probabilistic Logics
We proceed to introduce the logics featuring in our main result. We fix (w.l.o.g., finite) sets of atomic concepts and of roles; concepts of quantitative probabilistic () are defined by the grammar
[TABLE]
where , and . The intended reading of P is ‘probably’; we give examples below. Slightly deviating from standard practice, we define the rank of a concept as the maximal nesting depth of the P and atomic concepts in ; e.g. . We denote the set of all concepts of rank at most by .
Concepts are interpreted over probabilistic structures to which we neutrally refer as interpretations or, briefly, models. We allow infinite models but restrict to discrete probability distributions over successors at each state. A model
[TABLE]
consists of a domain of states or individuals, and interpretations , of atomic concepts and roles such that for each , the map
[TABLE]
is either zero or a probability mass function on , i.e.
[TABLE]
(implying that the support of is at most countable). We call a state -blocking if . At non-blocking states , thus acts as a probabilistic accessibility relation; we abuse to denote also the probability measure defined by .
The interpretation of concepts is defined recursively, extending that of atomic concepts, by
[TABLE]
At non-blocking , is thus the expected truth value of for a random -successor of . We define disjunction as the dual of as usual, so takes maxima. We use Zadeh semantics for the propositional operators, which will later ensure non-expansiveness wrt. behavioural distance; see additional comments in Section 7.
Up to minor variations, our models correspond to Markov chains or, in an epistemic reading, type spaces (e.g. Heifetz and Mongin (2001)). The logic was considered (with Łukasiewicz semantics) by Schröder and Pattinson Schröder and Pattinson (2011), and resembles van Breugel and Worrell’s quantitative probabilistic modal logic van Breugel and Worrell (2005). E.g., in a reading of as consisting of real-world individuals, the concept
[TABLE]
describes noises you hear in your tent at night as being loud and probably coming from the large and probably angry animal whose shadow just crossed the tent roof. (In this view, P can be usefully combined with crisp or fuzzy relational modalities, using off-the-shelf compositionality mechanisms Schröder and Pattinson (2011).) In an epistemic reading where the elements of are possible worlds, and the roles are understood as epistemic agents, the concept
[TABLE]
denotes the degree to which believes she is successfully bluffing by letting overestimate ’s hand.
For readability, we will restrict the technical treatment to a single role , omitted in the syntax, from now on, noting that covering multiple roles amounts to no more than additional indexing. As the first-order correspondence language of quantitative probabilistic we introduce quantitative probabilistic first-order logic (), with formulae defined by the grammar
[TABLE]
where and range over a fixed countably infinite reservoir of variables. The reading of is the expected truth value of at a random successor of . (In particular, when is crisp, then is just the probability of satisfying , similar to the weights in Halpern’s type-1 probabilistic FOL Halpern (1990).) We have the expected notions of free and bound variables, under the additional proviso that (but not !) is bound in . The (quantifier) rank of a formula is the maximal nesting depth of the variable-binding operators and P and propositional atoms in ; e.g. has rank .
Given a model and a vector of values, the semantics of the logic assigns a truth value to a formula with free variables at most . We define recursively by essentially the same clauses as in for the propositional constructs, and
[TABLE]
where takes suprema. Moreover, equality is two-valued, i.e. is if , and [math] otherwise.
E.g. the formula (‘the successor of is probably ’) denotes the access probability from to , the probability of reaching from in two independently distributed steps, and the probability of the most probable successor of .
We have a standard translation from into , indexed over a variable naming the current state. Following Litak et al. Litak et al. (2018), we define recursively by
[TABLE]
and by commutation with all other constructs.
Lemma 2.1**.**
For every -concept and state , .
thus identifies as a fragment of .
3 Behavioural Distances and Games
We next discuss several notions of behavioural distance between states: via fixed point iteration à la Wasserstein/Kantorovich, via games and via the logic. We focus mostly on depth- distances. Only for one version, we define also the unbounded distance, which will feature in the modal characterization result. We show in Section 4 that at finite depth, all these distances coincide. It has been shown in previous work Desharnais et al. (2004); van Breugel and Worrell (2005) that the unbounded-depth distances defined via Kantorovich fixed point iteration and via the logic, respectively, coincide in very similar settings; such results can be seen as probabilistic variants of the Hennessy-Milner theorem.
We recall standard notions on pseudometric spaces:
Definition 3.1** (Pseudometric spaces, non-expansive maps).**
A (bounded) pseudometric on a set is a function such that for , the following axioms hold: (reflexivity), (symmetry), (triangle inequality). If additionally implies , then is a metric. A (pseudo)metric space consists of a set and a (pseudo)metric on .
A map is non-expansive wrt. a pseudometric if for all . The space of these non-expansive functions, denoted , is equipped with the supremum (pseudo)metric ,
[TABLE]
We denote by the ball of radius around in . The space is totally bounded if for every there exists a finite -cover, i.e. finitely many elements such that .
Recall that a metric space is compact iff it is complete and totally bounded.
We next introduce the Wasserstein and Kantorovich distances, which coincide according to Kantorovich-Rubinstein duality. To this end, we first need the notion of a coupling of two probability distributions, from which the original distributions are factored out as marginals.
Definition 3.2**.**
Let and be discrete probability measures on and , respectively. We denote by the set of couplings of and , i.e. probability measures on such that and are marginals of :
- •
for all , ;
- •
for all , .
Definition 3.3** (Wasserstein and Kantorovich distances).**
Let be a pseudometric space. We generally write
[TABLE]
for the set of discrete probability distributions on . We define two pseudometrics on , the Kantorovich distance and the Wasserstein distance :
[TABLE]
where takes meets (and suprema). We extend these distances without further mention to zero functions (like the functions at blocking states ) by decreeing that the zero function has distance from all probability distributions.
The notation is meant as a mnemonic for the fact that these distances are obtained via suprema respectively via infima. If is separable (contains a countable dense subset), these pseudometrics coincide, a fact known as the Kantorovich-Rubinstein duality (e.g. Dudley (2002)):
Lemma 3.4** (Kantorovich-Rubinstein duality).**
Let be a separable pseudometric space. Then for all ,
[TABLE]
The above notions of lifting a distance on to a distance on distributions over can be used to give fixed point equations for behavioural distances on models.
Definition 3.5** (Fixed point iteration à la Wasserstein/Kantorovich).**
Given a model , we define the chains , of depth- Kantorovich and Wasserstein distances, respectively, via fixed point iteration:
[TABLE]
where is binary join. We extend this to states in different models , by taking the disjoint union of , .
In both cases, we start with the zero pseudometric, and in the next iteration lift the pseudometric from the previous step via Wasserstein/Kantorovich. This lifted metric is then applied to the probability distributions associated with . In addition we take the maximum with the supremum over the distances for all atomic .
We now introduce a key tool for our technical development, a new up-to- bisimulation game inspired by the definition of the Wasserstein distance.
Definition 3.6** (Bisimulation game).**
Given models , , and , the -bisimulation game for and is played by Spoiler () and Duplicator (), with rules as follows:
- •
Configurations: triples , with states , and maximal allowed deviation . The initial configuration is .
- •
Moves: In each round, first picks a probability measure . Then, distributes the deviation over all pairs of successors, i.e. picks a function such that . Finally, picks a pair with ; the new configuration is then .
- •
wins if both states are blocking or .
- •
wins if exactly one state is blocking and .
- •
Winning condition: for all .
The game comes in two variants, the (unbounded) bisimulation game and the -round bisimulation game, where . Player wins if the winning condition holds before every round, otherwise wins. More precisely, wins the unbounded game if she can force an infinite play and the -round game once rounds have been played (the winning condition is not checked after the last round, so in particular, any [math]-round game is an immediate win for ).
Remark 3.7**.**
The above bisimulation game differs from bisimulation games in the literature (e.g. Desharnais et al. (2008)) in a number of salient features. A particularly striking aspect is that ’s moves are not similar to those of , and moreover in fact moves before . Intuitively, is required to commit beforehand to a strategy that she will use to respond to ’s next move. Note also that the precision changes as the game is being played, a complication forced by the arithmetic nature of models.
This leads to notions of game distance:
Definition 3.8**.**
depth- game distance and (unbounded-depth) game distance are defined as
[TABLE]
where and denote the the bisimulation game and the -round bisimulation game on , respectively.
Finally we define the depth- logical distance via , restricting to concepts of rank at most :
Definition 3.9**.**
The depth- logical distance of states , in models , is defined as
[TABLE]
The equivalence of the four bounded-depth behavioural distances introduced above will be shown in Theorem 4.3.
Behavioural distance forms the yardstick for our notion of bisimulation invariance; for definiteness:
Definition 3.10**.**
A quantitative, i.e. -valued, property of states, or a formula or concept defining such a property, is bisimulation-invariant if is non-expansive wrt. game distance, i.e. for states in models , respectively,
[TABLE]
Similarly, is depth- bisimulation invariant, or finite-depth bisimulation invariant if mention of is omitted, if is non-expansive wrt. in the same sense.
It is easy to see that -concepts are bisimulation-invariant. More precisely, -concepts of rank at most are depth- bisimulation invariant (a stronger invariance since clearly ), as shown by routine induction. In contrast, many other properties of states are expressible in but not in , as they fail to be bisimulation-invariant. Examples include (probability of a self-transition) and (highest transition probability to a successor).
We are now ready to formally state our main theorem (a proof will be given in Section 6):
Theorem 3.11** (Modal characterization).**
Every bisimulation-invariant -formula of rank at most can be approximated (uniformly across all models) by -concepts of rank at most .
(The exponential bound on the rank features also in the full statement of van Benthem’s theorem.)
4 Modal Approximation at Finite Depth
We now establish the most important stepping stone on the way to the eventual proof of the modal characterization theorem: We show that every depth- bisimulation-invariant property of states can be approximated by -concepts of rank at most . We prove this simultaneously with coincidence of the various finite-depth behavioural pseudometrics defined in the previous section. To begin,
Lemma 4.1**.**
The game-based pseudometric coincides with the Wasserstein pseudometric ,
We note next that the modality P is non-expansive: We extend P to act on -valued functions by
[TABLE]
Lemma 4.2**.**
The map is non-expansive wrt. the supremum metric, that is for all .
Following our previous work Wild et al. (2018a), we prove coincidence of the remaining pseudometrics in one big induction, along with total boundedness (needed later to apply a variant of the Arzelà-Ascoli theorem and the Kantorovich-Rubinstein duality) and modal approximability of depth- bisimulation-invariant properties. We phrase the latter as density of the (semantics of) -concepts of rank at most in the non-expansive function space (Definition 3.1):
Theorem 4.3**.**
Let be a model. Then for all ,
we have on ; 2. 2.
the pseudometric space is totally bounded; 3. 3.
* is a dense subset of .*
Proof sketch.
By simultaneous induction on .
In the base case , all the behavioural distances are the zero pseudometric, so that total boundedness follows trivially and the density claim follows because non-expansive maps are just constants in and the syntax of includes truth constants .
For the inductive step, let be a model and , and assume as the inductive hypothesis that all claims in Theorem 4.3 hold for all . We begin with Item 1; is already proved (Lemma 4.1).
- •
follows by Kantorovich-Rubinstein duality (Lemma 3.4), since every totally bounded pseudometric space is separable.
- •
: By Lemma 4.2 and the inductive hypothesis, is dense in . Thus, the supremum in the definition of does not change when it is taken only over the concepts in instead of all nonexpansive properties. The proof is finished by a simple induction over propositional combinations of concepts.
Item 2: By the inductive hypothesis, the space is totally bounded. By the Arzelà-Ascoli theorem (in a version for totally bounded spaces and non-expansive maps, cf. Wild et al. (2018a)), it follows that is totally bounded wrt. the supremum pseudometric. This implies that depth- distances can be approximated up to by examining differences at only finitely many, say , concepts. As is totally bounded, is, too.
Item 3: By the Stone-Weierstraß theorem (again in a version for totally bounded spaces and non-expansive maps Wild et al. (2018a)) it suffices to give, for each , each non-expansive map , and each pair of states a concept such that
[TABLE]
To construct such a , we note that (by non-expansiveness), so there exists some such that . From , we can construct using truncated subtraction . ∎
This completes the proof of Theorem 4.3. Now that we can approximate depth- bisimulation-invariant properties by -concepts of rank on any fixed model, we need to make the approximation uniform across all models. We achieve this by means of a final model, i.e. one that realizes all behaviours. Formally:
Definition 4.4**.**
A (probabilistic) bounded morphism between models , is a map such that for each and for all , (implying that is blocking iff is blocking). A model is final if for every model , there exists a unique bounded morphism .
It follows from standard results in coalgebra Barr (1993) that a final model exists. Bounded morphisms preserve behaviour on-the-nose, that is:
Lemma 4.5**.**
Let be a bounded morphism. Then, for any , .
This entails the following lemma, which will enable us to use approximants on the final model as uniform approximants across all models:
Lemma 4.6**.**
Let be a final model, and let and be bisimulation-invariant first-order properties. Then, for any model , .
5 Locality
The proof of the modal characterization theorem now further proceeds by first establishing that every bisimulation-invariant first-order formula is local in a sense to be made precise shortly, and subsequently that is in fact even finite-depth bisimulation invariant, for a depth that is exponential in the rank of . Locality refers to a probabilistic variant of Gaifman graphs Gaifman (1982):
Definition 5.1**.**
Let be a model.
- •
The Gaifman graph of is the undirected graph on the set of vertices that has an edge for every pair with or .
- •
The Gaifman distance is graph distance in the Gaifman graph: For every , the distance is the least number of edges on any path from to , if such a path exists, and otherwise.
- •
For and , the radius neighbourhood of consists of the states reachable from in at most steps.
- •
The restriction of to is the model with set of states, and
[TABLE]
for and .
The restriction to thus makes all states at distance blocking. Restricted models have the expected relationship with games of bounded depth:
Lemma 5.2**.**
Let be a state in a model . Then wins the -round [math]-bisimulation game for and .
Locality of a formula now means that its truth values only depend on the neighbourhood of the state in question:
Definition 5.3**.**
A formula is -local for a radius if for every model and every , .
As -concepts are bisimulation-invariant, Lemma 5.2 implies
Lemma 5.4**.**
Every -concept of rank at most is -local.
To prove locality of bisimulation-invariant -formulae, we require a model-theoretic tool, an adaptation of Ehrenfeucht-Fraïssé equivalence to the probabilistic setting:
Definition 5.5**.**
Let be models, and let and be vectors of equal length over and , respectively. The Ehrenfeucht-Fraïssé game for and , played by Spoiler () and Duplicator (), is given as follows.
- •
Configurations: pairs of vectors over and over ; the initial configuration is .
- •
Moves: Each round can be played in one of two ways, chosen by :
- –
Standard round: selects a state in one model, say , and then has to select a state in the other model, say , reaching the configuration .
- –
Probabilistic round: selects an index and a fuzzy subset in one model, say . then has to select a fuzzy subset in the other model, say , such that . Then, selects an element on one side, say , such that , and subsequently selects an element on the other side, say , such that and , reaching the configuration .
- •
Winning conditions: Any player who cannot move loses. wins if a configuration is reached (including the initial configuration) that fails to be a partial isomorphism. Here, a configuration is a partial isomorphism if
- –
- –
for all and all
- –
for all .
Player wins if she reaches the -th round (maintaining configurations that are not winning for ).
For our purposes, we need only soundness of Ehrenfeucht-Fraïssé equivalence:
Lemma 5.6** (Ehrenfeucht-Fraïssé invariance).**
Let be models, and let be vectors of length over and , respectively, such that wins the -round Ehrenfeucht-Fraïssé game on . Then for every -formula with and free variables at most ,
[TABLE]
Since embeddings into disjoint unions of models are bounded morphisms, the following is immediate from Lemma 4.5:
Lemma 5.7**.**
Every bisimulation-invariant formula is also invariant under disjoint union.
We are now in a position to prove our desired locality result:
Lemma 5.8** (Locality).**
Let be a bisimulation-invariant -formula of rank with one free variable . Then is -local for .
Proof sketch.
Let be a state in a model . We need to show . Construct models that extend and , respectively, by adding disjoint copies of both and . We finish the proof by showing that
[TABLE]
The first and third equality follow by bisimulation invariance of (Lemma 5.7), and the second using Lemma 5.6, by giving a winning invariant for in the -round Ehrenfeucht-Fraïssé game for and . ∎
6 Proof of the Main Result
Having established locality of bisimulation-invariant first-order formulae and modal approximability of finite-depth bisimulation-invariant properties, we now discharge the last remaining steps in our programme: We show by means of an unravelling construction that bisimulation-invariant first-order formulae are already finite-depth bisimulation-invariant, and then conclude the proof of our main result, the modal characterization theorem.
Definition 6.1**.**
Let be a model. The unravelling of is a model with non-empty finite sequences as states, where atomic concepts and roles are interpreted by
[TABLE]
for and , where takes last elements.
As usual, models are bisimilar to their unravellings:
Lemma 6.2**.**
For any model and , has a winning strategy in the [math]-bisimulation game for and .
We next show that locality and bisimulation invariance imply finite-depth bisimulation invariance:
Lemma 6.3**.**
Let be bisimulation invariant and -local. Then is depth- bisimulation invariant.
Proof sketch.
By unravelling (Lemma 6.2) and locality (Lemma 5.2), we need only consider depth- tree models. On such models, winning strategies in -round bisimulation games automatically win also the unrestricted game. ∎
This allows us to wrap up the proof of our main result:
Proof of Theorem 3.11.
Let be a probabilistic first-order formula of rank . By Lemma 5.8 and Lemma 6.3, is depth- bisimulation-invariant for . By Theorem 4.3, for every , there exists an concept of rank at most such that on the final model . By Lemma 4.6, this approximation works over all models. ∎
7 Conclusions
We have established a modal characterization result for a probabilistic fuzzy DL , stating that every formula of quantitative probabilistic FOL that is bisimulation-invariant, i.e. non-expansive wrt. a natural notion of behavioural distance, can be approximated by -concepts of bounded modal rank, the bound being exponential in the rank of the original formula. As discussed in the introduction, the bound on the modal rank is the crucial feature making this result into a van-Benthem (rather than Hennessy-Milner) type theorem.
It remains open whether our main result can be sharpened to make do without approximation. (Similar open problems persist for the case of fuzzy modal logic Wild et al. (2018a) and two-valued probabilistic modal logic Schröder et al. (2017).) Further directions for future research include a treatment of Łukasiewicz semantics of the propositional connectives (for which non-expansiveness in fact fails). Moreover, the version of our main result that restricts the semantics to finite models, in analogy to Rosen’s finite-model version of van Benthem’s theorem Rosen (1997), remains open.
Appendix A Appendix
A.1 Coalgebraic Modelling
Universal coalgebra Rutten (2000) serves as a generic framework for modelling state-based systems, with the system type encapsulated as a set functor. Although we are only concerned with a concrete system type in the present paper, we do need coalgebraic methods to some degree. In particular, the requisite background on behavioural distances van Breugel and Worrell (2005); Baldan et al. (2014) is largely based on coalgebraic techniques, and moreover we will need the final coalgebra at one point in the development. We require only basic definitions, which we recapitulate here and then instantiate to the case of our notion of model.
Recall first that a set functor consists of an assignment of a set to every set and a map to every map , preserving identities and composition. The core example of a functor for the present purposes is the distribution functor , which assigns to a set the set of discrete probability measures on , and to a map the map that takes image measures; explicitly, is the image measure of along , given by . Functors can be combined by taking products and sums: Given set functors , the set functors are given by and , respectively, with the evident action on maps in both cases; here, denotes disjoint union as usual. Every set induces a constant functor, also denoted and given by and for every set and every map . Moreover, the identity functor is given by and for all sets and all maps .
An -coalgebra for a set functor consists of a set of states and a transition map , thought of as assigning to each state a structured collection of successors. A -coalgebra , for instance, is just a Markov chain: its transition map assigns to each state a distribution over successor states. Similarly, models in the sense defined above are coalgebras for the set functor : If , then determines the truth values of the atomic concepts at the state , and is either a discrete probability measure determining the successors of or a designated value denoting termination. The probabilistic transition systems considered by van Breugel and Worrell van Breugel and Worrell (2005), which indexes probabilistic transition relations over a set of actions and moreover uses unrestricted subdistributions, corresponds to coalgebras for the set functor – given a state and an action , is a subdistribution over successor states of , with the summand serving to absorb the weight missing to obtain total weight .
A morphism between -coalgebras and is a map such that
[TABLE]
for all states . Morphisms should be thought of as behaviour-preserving maps or functional bisimulations. E.g. is a morphism of -coalgebras (i.e. Markov chains) and if for each set and each state ,
[TABLE]
i.e. the probability of reaching from is the same as that of reaching from . Morphisms of probabilistic transition systems, viewed as coalgebras, satisfy a similar condition for the successor distributions, and additionally preserve the truth values of atomic concepts.
An -coalgebra is final if for every -coalgebra there exists exactly one morphism . Final coalgebras are unique up to isomorphism if they exist, and should be thought of as having as states all possible behaviours of states in -coalgebras. For our present purposes, we do not need an explicit description of the final coalgebra; it suffices to know that since the functor describing probabilistic transition systems is accessible (more precisely -accessible), a final coalgebra for it, i.e. a final probabilistic transition system, exists Barr (1993).
A.2 Omitted Proofs
A.2.1 Proof of Lemma 3.4
We make use of the following version of the Kantorovich-Rubinstein duality (Dudley, 2002, Proposition 11.8.1):
Lemma A.1** (Kantorovich-Rubinstein duality).**
Let be a separable metric space, and let denote the space of probability measures on the Borel -algebra such that for some . Then for ,
[TABLE]
Essentially, we only need to transfer this version of Kantorovich-Rubinstein duality to the slightly more general case of pseudometrics.
First, note that the relation is an equivalence relation on . The quotient set is made into a metric space , the metric quotient of , by taking . Let be the projection map. By construction, is an isometry. Both the Kantorovich and the Wasserstein lifting preserve isometries Baldan et al. (2014), so for all discrete probability measures on ,
[TABLE]
In the second step we have applied Lemma A.1 to the metric space , noting that every discrete probability measure can be defined on the Borel -algebra.
A.2.2 Proof of Lemma 4.1.
Induction over . The base case is clear: the [math]-round game is an immediate win for , so . We proceed with the inductive step from to .
So let and be states in a model . If and are both blocking, then . If exactly one of is blocking, then . Now assume that both and are non-blocking.
“”: Let , so wins the -round bisimulation game on . We show that . First, for every , by the winning condition. Second, suppose chooses and in the first turn. By assumption, wins the -round bisimulation game on for every , so by induction, and thus .
“”: Let . It suffices to give a winning strategy for in the -round bisimulation game on (implying ). The winning condition in the initial configuration follows immediately from the assumption. Also by the assumption, there exists such that . As and are discrete, the set
[TABLE]
is countable; so we can write . Now put and define
[TABLE]
for and for . Then
[TABLE]
so playing and constitutes a legal move for . Now, since , for all . This means that must pick some . Then
[TABLE]
so wins the -round game on .
A.2.3 Proof of Lemma 4.2.
Let ; we have to show . So let ; then
[TABLE]
as required.
A.2.4 Proof of Theorem 4.3.
We proceed by simultaneous induction on .
In the base case , all the behavioural distances are the zero pseudometric: because by the rules of the game each [math]-round game is an immediate win for ; by definition; and because each rank-[math] concept is a propositional combination of truth constants and therefore constant. Total boundedness follows directly from the fact that under the zero pseudometric every -ball is the entire space, regardless of . Finally, the density claim follows because non-expansive maps under the zero pseudometric are just constants in and the syntax of includes truth constants .
For the inductive step, let be a model and , and assume as the inductive hypothesis that all claims in Theorem 4.3 hold for all . We begin with Item 1:
- •
is Lemma 4.1.
- •
follows by Kantorovich-Rubinstein duality (Lemma 3.4), since every totally bounded pseudometric space is separable.
- •
: Let and consider the map
[TABLE]
Then is a continuous function because all of its constituents are continuous (in particular, P is continuous by Lemma 4.2).
By the induction hypothesis, and because density is preserved by continuous maps, is a dense subset of . Thus,
[TABLE]
To prove the penultimate step, we first note that “” follows immediately. To see “”, we proceed by induction over the propositional combinations of atomic concepts and concepts , where , using that for any concepts and :
[TABLE]
Item 2: We make use of the following version of the Arzelà-Ascoli theorem Wild et al. (2018a) where function spaces are restricted to non-expansive functions instead of the more general continuous functions, but the underlying spaces are only required to be totally bounded instead of compact:
Lemma A.2** **(Arzelà-Ascoli for totally bounded
spaces).
Let be a totally bounded pseudometric space. Then the space , equipped with the supremum pseudometric, is totally bounded.
By Lemma A.2, applied to the inductive hypothesis, we know that the space is totally bounded wrt. the supremum pseudometric.
Let . As is dense in , there exist finitely many such that
[TABLE]
From these concepts, together with the atomic concepts , we can construct the map
[TABLE]
Note that we assume here that the set of atomic concepts is a finite set . This is without loss of generality for the modal characterization theorem, because every formula of can only contain finitely many propositional atoms, so can be restricted to just those atoms.
It turns out that is an -isometry, that is
[TABLE]
for all . Thus, by the triangle inequality, we can take preimages to turn a finite -cover of (a compact, hence totally bounded space) into a finite -cover of .
Item 3: We make use of the following Stone-Weierstraß theorem Wild et al. (2018a) (again in a version for totally bounded spaces and non-expansive maps):
Lemma A.3** (Stone-Weierstraß for totally bounded spaces).**
Let be a totally bounded pseudometric space, and let be a subset of such that implies . Then is dense in if each can be approximated at each pair of points by functions in ; that is for all and all there exists such that
[TABLE]
We apply Lemma A.3 to with . Clearly is closed under and so, to finish the proof, it suffices to give, for each , each non-expansive map and each pair of states a concept such that
[TABLE]
To construct such a , we note that (by non-expansiveness), so there exists some such that . From , we can construct using truncated subtraction .
A.2.5 Proof of Lemma 4.5.
We show that wins the bisimulation game for by maintaining the invariant that the current configuration is of the form with , which ensures that the winning condition always holds. It remains to show that can maintain the invariant.
In each round, begins by picking if and [math] otherwise, and . We can see that , because
[TABLE]
and
[TABLE]
for all and . Also, clearly . Now any choice by leads to another configuration with .
A.2.6 Proof of Lemma 4.6.
Let be a model, and let be the unique morphism. Let . Then by Lemma 4.5, and thus and by bisimulation invariance. So
[TABLE]
A.2.7 Proof of Lemma 5.2.
Player wins by maintaining the invariant that whenever rounds have been played, the current configuration is of the form for some with . For , no configuration of this kind can be winning for , because the two states in this configuration represent the same state in different models (recall that the winning conditions are not checked after the last round has been played).
It remains to give a strategy for that maintains the invariant. It clearly holds at the start of the game, with . When the -th round is played, can pick and as follows:
[TABLE]
Clearly, , so this is a legal move. Now the new configuration chosen by necessarily satisfies the invariant.
A.2.8 Proof of Lemma 5.6.
We proceed by induction over formulae.
- •
The cases and (with ) follow immediately from the fact that the initial configuration is a partial isomorphism.
- •
The Boolean cases () follow directly by the inductive hypothesis.
- •
: Let be the current configuration. Let , let be such that
[TABLE]
and let be the winning answer for in reply to choosing . By induction, , so
[TABLE]
Because was arbitrary, it follows that . We can symmetrically show that , which proves this case.
- •
: Let be the current configuration. Suppose that picks the index and the fuzzy subset
[TABLE]
and ’s winning reply is . We show that on the support of , must be equal to
[TABLE]
Suppose there exists some with and . Then has a winning reply in case picks this , which means, by the rules of the game, that and . However, it is also true that , by the inductive hypothesis. This is a contradiction.
Now, because was a winning reply, we obtain
[TABLE]
A.2.9 Proof of Lemma 5.8.
Let be a state in a model . We need to show . Let be a new model that extends by adding disjoint copies of both and . Let be the model that extends likewise. We finish the proof by showing that
[TABLE]
The first and third equality follow by bisimulation invariance of (Lemma 5.7). The second equality follows by Ehrenfeucht-Fraïssé invariance (Lemma 5.6) once we show that has a winning strategy in the -round Ehrenfeucht-Fraïssé game for and .
Such a winning strategy can be described as follows: For , put . Then maintains the invariant that, if the configuration reached after rounds is , then there exists an isomorphism between and that maps each to the corresponding , where .
The invariant holds at the start of the game, because the neighbourhoods on both sides are just . Similarly, whenever the invariant holds, the current configuration is a partial isomorphism by restriction of the given isomorphism to the two vectors of the configuration.
Now we consider what happens during the rounds. Suppose that rounds have been played, and the current configuration is . If decides to play a standard round, playing some , then there are two cases:
- •
: In this case, the radius- neighbourhood of is fully contained in the domain of – this follows by the triangle inequality, as . Now can just reply with , and an isomorphism between and is formed by restricting the domain and codomain of appropriately.
- •
: In this case, the radius- neighbourhoods of and of do not intersect – this too follows from the triangle inequality. Now can pick a fresh copy of or in (depending on which kind of copy lies in); her reply is then just in that copy. Here, a fresh copy is one that was never visited on any of the previous rounds. By construction of and , such a copy is always available. This means that we now have two isomorphisms, one between and (by restriction of ), and one between and (by isomorphism of the respective copies of or ). Because these isomorphisms have disjoint domains and codomains, we can combine them to form the desired isomorphism .
If plays a standard round with some instead, the same argument applies.
Finally, if starts a probabilistic round by picking an index and playing some , then we first note that, by the rules of the game, the support of must be contained in , which in turn must be contained in the domain of . This means that can construct by mapping along , i.e. for all successors of , and otherwise. Now, whichever or is picked by , can just reply with or and is formed as in the first case of a standard round. Again, the same argument applies if picks a fuzzy subset on the other side.
A.2.10 Proof of Lemma 6.2.
wins by maintaining the invariant that the configuration of the game is of the form for some . To do so, she can put for all , all other values of are [math], and . Then any move by leads to a configuration where the invariant holds.
A.2.11 Proof of Lemma 6.3.
Let and be two models and let and be two states such that . It is enough to show that .
We denote by and the copies of in and , respectively. Similarly, and denote the copies of in and . By Lemma 6.2, wins the [math]-bisimulation-game for and (similarly for ) and by Lemma 5.2, she also wins the -round [math]-bisimulation game for and (similarly for ). Because behavioural distance is a pseudometric, this means that
[TABLE]
so has a winning strategy in the -round -bisimulation game for and .
In both and , the reachable states form a tree of depth at most . This implies that, after rounds of the game, the two states on either side of the current configuration are nodes at distance from the root of their respective tree. Thus, whenever rounds have been played in the game, does not have a legal move in the next round, because at that point, both nodes in the configuration are necessarily leaves and thus blocking. This in turn means that if can win the -round game, she also wins the unbounded game, so, by bisimulation invariance of , .
By locality and bisimulation invariance of , and again Lemma 6.2, we have as well as . Thus , as claimed.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Baldan et al. (2014) P. Baldan, F. Bonchi, H. Kerstan, and B. König. Behavioral metrics via functor lifting. In Found. Software Technology and Theoretical Computer Science, FSTTCS 2014 , LIP Ics , vol. 29, pp. 403–415, 2014.
- 2Barr (1993) M. Barr. Terminal coalgebras in well-founded set theory. Theor. Comput. Sci. , 114:299–315, 1993.
- 3Burgess (1969) J. Burgess. Probability logic. J. Symb. Log. , 34:264–274, 1969.
- 4Carreiro (2015) F. Carreiro. PDL is the bisimulation-invariant fragment of weak chain logic. In Logic in Computer Science, LICS 2015 , pp. 341–352. IEEE, 2015.
- 5Chang (1973) C. Chang. Modal model theory. In Cambridge Summer School in Mathematical Logic , LNM , vol. 337, pp. 599–617. Springer, 1973.
- 6Dawar and Otto (2005) A. Dawar and M. Otto. Modal characterisation theorems over special classes of frames. In Logic in Comput. Sci., LICS 2005 , pp. 21–30. IEEE, 2005.
- 7Desharnais et al. (1999) J. Desharnais, V. Gupta, R. Jagadeesan, and P. Panangaden. Metrics for labeled Markov systems. In Concurrency Theory, CONCUR 1999 , LNCS , vol. 1664, pp. 258–273. Springer, 1999.
- 8Desharnais et al. (2004) J. Desharnais, V. Gupta, R. Jagadeesan, and P. Panangaden. Metrics for labelled Markov processes. Theor. Comput. Sci. , 318:323–354, 2004.
