Imagining Probabilistic Belief Change as Imaging (Technical Report)
Gavin Rens, Thomas Meyer

TL;DR
This paper introduces Expected Distance Imaging (EDI), a flexible framework for probabilistic belief change that unifies Bayesian conditioning and imaging, with multiple instantiations for revision and update.
Contribution
The paper proposes EDI, a novel general framework for belief change that encompasses existing imaging methods and defines new instantiations for revision and update.
Findings
EDI can replicate Bayesian conditioning and other imaging forms
Four EDI instantiations are proposed for revision and update
Probabilistic rationality postulates are analyzed for these instantiations
Abstract
Imaging is a form of probabilistic belief change which could be employed for both revision and update. In this paper, we propose a new framework for probabilistic belief change based on imaging, called Expected Distance Imaging (EDI). EDI is sufficiently general to define Bayesian conditioning and other forms of imaging previously defined in the literature. We argue that, and investigate how, EDI can be used for both revision and update. EDI's definition depends crucially on a weight function whose properties are studied and whose effect on belief change operations is analysed. Finally, four EDI instantiations are proposed, two for revision and two for update, and probabilistic rationality postulates are suggested for their analysis.
Click any figure to enlarge with its caption.
Figure 1Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, Reasoning, and Knowledge · Bayesian Modeling and Causal Inference · AI-based Problem Solving and Planning
Imagining Probabilistic Belief Change as Imaging
(Technical Report)
Gavin Rens and Thomas Meyer
University of Cape Town, South Africa
{grens,tmeyer}@cs.uct.ac.za
Abstract
Imaging is a form of probabilistic belief change which could be employed for both revision and update. In this paper, we propose a new framework for probabilistic belief change based on imaging, called Expected Distance Imaging (EDI). EDI is sufficiently general to define Bayesian conditioning and other forms of imaging previously defined in the literature. We argue that, and investigate how, EDI can be used for both revision and update. EDI’s definition depends crucially on a weight function whose properties are studied and whose effect on belief change operations is analysed. Finally, four EDI instantiations are proposed, two for revision and two for update, and probabilistic rationality postulates are suggested for their analysis.
1 Introduction
From the perspective of classical (Boolean) belief change, the work of Alchourrón et al., (1985) is regarded as the foundation theory for belief revision (AGM theory). Typically, belief change (in a static world) can be categorized as expansion, revision or contraction, and is performed on a belief set, the set of sentences closed under logical consequence. Revision is when new information is (possibly) inconsistent with and is (minimally) modified so that the new remains consistent and entails . Revision is the process which takes place when an agent modifies its beliefs due to receiving new information not previously known or which is more relevant or trustworthy. Except for the movement of information, the physical world is assumed to be completely unchanging.
Whereas belief revision is considered to take place in a static environment, belief update is thought to be the change in beliefs which takes place due to a dynamic environment. Update refers to the process of bringing beliefs up to date precisely because the world has changed and the agent needs a new, ‘matching’ view on the world.
From the perspective of classical (Boolean) belief change, Katsuno and Mendelzon, (1992) developed the first serious theory of update (KM theory). Their theory is different from that of AGM in that their rationality postulates are derived from the understanding that update occurs in a dynamic environment.
However, simply applying AGM theory for revision and KM theory for update has been (indirectly) challenged (Friedman and Halpern,, 1999; Kern-Isberner,, 2001; Nayak,, 2011; Lang,, 2007). Further, how to categorize a belief change operator is more challenging when notions of uncertainty are considered, for instance, when using probabilities and rankings. The very definition of belief revision and belief update become more problematic under notions of uncertainty.
One kind of probabilistic belief change operation which could potentially ‘relax’ the tension between revision and update is imaging. David Lewis (1976) first proposed imaging to analyse conditional reasoning in probabilistic settings, and it has recently been the focus of several works on probabilistic belief change (Ramachandran et al.,, 2010; Chhogyal et al.,, 2014; Rens et al.,, 2016). Imaging is the approach of moving the belief in worlds possible at one moment to similar worlds compatible with evidence received at a next moment. In other words, the ‘belief-mass’ is shifted to the ‘images’ of the worlds currently believed possible, where the images are the worlds related via new evidence to the currently believed worlds.
One of the main benefits of imaging is that it overcomes the problem with Bayesian conditioning, namely, being undefined when evidence is inconsistent with current beliefs. Gärdenfors, (1988) and Rens et al., (2016) proposed generalizations of Lewis’s original definition. In this paper we propose a new generalization of imaging – or a family of imaging-based belief change operators – and analyse other probabilistic belief change methods with respect to it. In particular, we propose a version of imaging based on the movement of probability mass weighted by the inverse of distances between possible worlds.
Whether imaging is applicable to revision or update (or both) is still an open question. Gärdenfors, (1988) says ”…the imaging method is a general revision method because it gives nontrivial results when…the new information to be accommodated…[contradicts the current beliefs]”. Chhogyal et al., (2014) explore the use of Lewis imaging as a means to construct probabilistic belief revision. They present explicit constructions of three candidates strategies based on imaging and investigate their properties. (Rens et al.,, 2016) define an imaging operation which relaxes the unique-closest-world assumption of Lewis imaging, and they provide a method of revising (via imaging) a potentially infinite set of belief states in a finite procedure. On the other hand, some researchers have considered imaging to be the probabilistic version of update (Katsuno and Mendelzon,, 1992; Dubois and Prade,, 1993; Nayak,, 2011). Ramachandran et al., (2010) propose a version of imaging for probabilistic belief erasure. In fact, Lewis, (1976) himself never said that imaging was meant to be interpreted as one or the other. In this paper we continue to investigate the relationship of imaging to revision and update.
The paper makes four contributions. 1) We define Expected Distance Imaging (EDI) and show how Bayesian conditioning, Lewis imaging and generalized imaging can be couched as EDI operations. 2) We define four (new) instantiations of the EDI operation. 3) We define a weight function (as used in EDI), and several properties such a function might have, and explore which of these properties are satisfied by different instantiations of weight functions (for different versions of the EDI operator). 4) We propose a set of rationality postulates for probabilistic belief update.
Due to space limitations, all proofs of propositions have been omitted. Please refer to the appendix for the proofs.
2 Background
We shall work with a finitely generated classical propositional logic. Let be a finite set of atoms. Formally, a world is a unique assignment of truth values to all the atoms in . There are thus conceivable worlds. An agent may consider some non-empty subset of the conceivable worlds; is called the possible worlds. The classical notion of satisfaction is used. World satisfies (is a model of) is written . Let be all propositional formulae which can be formed from and the logical connectives and , with abbreviating tautology and abbreviating contradiction. Let be a sentence in . denotes the set of models of . entails (denoted ) iff . is equivalent to (denoted ) iff .
Often, in the exposition of this paper, a world will be referred to by its truth vector. For instance, if a two-atom vocabulary is placed in order and , then may be referred to as .
In this work, the basic semantic element of an agent’s beliefs is a probability distribution or a belief state
[TABLE]
where and is the probability (real number) that is the actual world in which the agent is, with . For parsimony, let be the probabilities that belief state assigns to where, for instance, , and . abbreviates .
It is not yet universally agreed what belief change means in a probabilistic setting. One school of thought says that probabilistic expansion (restricted revision) is equivalent to Bayesian conditioning (Gärdenfors, (1988, Chap. 5) and Voorbraak, (1999) mention this, but no not necessarily agree with it). This is evidenced by Bayesian conditioning () being defined only when , thus making expansion equivalent to revision. In other words, one could define expansion to be
[TABLE]
where can be defined as and is a sentence identifying (i.e., a complete theory for ). Note that iff . This implies that is ill-defined when .
We may write as so that we can write , where is an arbitrary belief change operator.
The technique of Lewis imaging for the revision of belief states requires a notion of similarity between worlds. In fact, he implicitly assumes the availability of a mapping of worlds to a total order on worlds for every fixed . Let be the least -world with respect to . That is, and: if , then for all , and if , then .
If we indicate Lewis’s original imaging operation with , then his definition can be stated as
[TABLE]
He calls the image of on . In words, is zero if does not model , but if it does, then retains all the probability it had and accrues the probability mass from all the non--worlds closest to it. This form of imaging only shifts probabilities around; no probabilities are magnified or shrunk. The probabilities in thus sum to 1 without the need for any normalization.
3 Non-unique-closest-world Approaches
Every world having a unique closest -world is quite a strong requirement. We now mention an approach which relaxes the uniqueness requirement.
Gärdenfors, (1988) describes his generalization of Lewis imaging (which he calls general imaging) as “… instead of moving all the probability assigned to a world by a probability function to a unique (“closest”) -world , when imaging on , one can introduce the weaker requirement that the probability of be distributed among several -worlds (that are “equally close”).” If we interpret Gärdenfors’ approach correctly, he does not provide a constructive method but insists that , where is the image (change) of on .
Rens et al., (2016) introduced generalized imaging via a constructive method. It is a particular instance of Gärdenfors’ general imaging. They use a pseudo-distance measure between worlds, as defined by Lehmann et al., (2001) and adopted by Chhogyal et al., (2014).
Definition 3.1**.**
A pseudo-distance function satisfies the following four conditions: for all worlds ,
* (Non-negativity)* 2. 2.
* (Identity)* 3. 3.
* (Symmetry)* 4. 4.
* (Triangular*
Inequality)
Note that induces a mapping from worlds to total preorders on worlds as follows. iff . Note that conditions 2 and 4 make a total preorder, and that adding conditions 1 and 3 do not necessarily make a (total) partial order. This makes it possible for a world to have more than one closest worlds. One may also want to impose a condition on a distance function such that any two distinct worlds must have some distance between them: For all , if , then (Faithfulness).
Let be the set of -worlds closest to with respect to pseudo-distance . Formally,
[TABLE]
where is some pseudo-distance measure between worlds (e.g., Hamming or Dalal distance). For instance, using Hamming distance for and vocabulary ,
- •
- •
- •
- •
Generalized imaging (denoted ) is defined as
[TABLE]
is the new belief state produced by taking the generalized image of on . In words, the probability mass of non--worlds is shifted to their closest -worlds, such that if a non--world with probability has closest -worlds (equally distant), then each of these closest -worlds gets mass from .
As an example, let belief state and let belief state for vocabulary . Then
- •
- •
- •
and
- •
- •
- •
4 Expected Distance Imaging
In this section, we define expected distance imaging (EDI) and some properties of interest, then we define two instantiations of EDI and define Bayesian conditioning, Lewis imaging and generalized imaging in terms of EDI. Each operator is considered with respect to the properties. We end with a discussion about the five operations.
4.1 Definition and Properties
Rens and Meyer, (2015) proposed determining the new probability of an -world as the weighted average of the current probabilities of all worlds , where the weights are (inversely) proportional to the distance between and the . The reason for bringing in weights is that the probability of worlds less (more) similar to the -world under consideration, should contribute less (more) to the new probability mass of . (Distance and similarity are taken to be inversely proportional in this context.) Building on that idea, we introduce an imaging framework based on weighted distances. The weight functions are defined in terms of inverse-distance functions, which are defined in terms of the pseudo-distance function.
First, we introduce potentially inverse-distance functions and propose considering the following seven postulates for such functions. For all and all pseudo-distance functions ,
(Non-negativity) 2. 2.
(Identity) 3. 3.
(Symmetry) 4. 4.
if , then (Weak Inversity) 5. 5.
if , then (Strict Inversity) 6. 6.
if , then (Equi-distance). 7. 7.
if , then (Faithfulness)
Definition 4.1**.**
A weight function is a function . We say that postulate non-negativity, identity, symmetry, weak inversity, strict inversity, equi-distance or faithfulness is satisfied by iff the respective postulate is satisfied by .
To start off with, weight functions should be in the range so as to support the notion of the expectation of probability. A weight function should thus be non-negative and have a maximum of 1. Moreover, a world’s probability should carry the maximum weight when compared with itself (identity). Symmetry seems like a very natural property to expect of a weight function, especially given the close relationship that such functions have with distance. Weak inversity seems to be the weakest property that promotes some sort of inversity. It would be a useful distinction to be able to say that a weight function is weakly inverse, but not strictly so. Hence the definition of strict inversity. Knowing whether the property of equi-distance holds also seems useful and interesting. Note that equi-distance implies symmetry due to symmetry of the pseudo-distance function, and by logic, equi-distance implies weak inversity. Finally, whether a weight function is faithfulness seems like a natural question to ask. And note that identity together with strict inversity implies faithfulness.
Definition 4.2**.**
Let be an epistemic state and a new piece of information. Then the new epistemic state changed with via EDI is defined as
[TABLE]
where is a normalizing factor.
We may write to indicate that is instantiated as . For any probabilistic belief change operator , we say that is EDI-compatible if and only if there exists a function such that for all and .
In the rest of the paper, we shall omit the subscript from as long is it is clear from the context or unnecessary to specify for which pseudo-distance function is defined.
Definition 4.3**.**
A weight function is an inverse-distance weight function iff it satisfies postulates 1-4 above.
Consider the following three properties a weight function might have.
- •
, if and , then (Evidence Relaxation)
- •
, if and , then (Non-evidence Relaxation)
- •
, if and and , then (Retention)
(In the following, an -world is a world satisfying the new evidence .) We shall call a weight function e-relaxed iff it satisfies evidence relaxation, and n-e-relaxed iff it satisfies non-evidence relaxation. We shall say that a weight function is relaxed iff it is both e-relaxed and n-e-relaxed. Later, we shall show how -worlds ‘share’ their mass with other -worlds during the change process if EDI is applied with an e-relaxed weight function. In such operations, belief mass tends to ‘spread’ among worlds consistent with the same evidence (after repeated change operations). Bayesian conditioning and the versions of imaging mentioned so far are not like this. In the latter operations, -worlds never give up their mass; they are ‘retentive’. The idea captured by the retention condition says that when collecting probability mass for (because it is an world), then do not collect anything from other worlds. This rule affects equi-distance, weak and strict inversity. In Section 5.1, the reader will see that n-e-relaxed weight functions are also useful.
Definition 4.4**.**
The EDI operation is said to be relaxed or retentive iff it satisfies the relaxation, respectively, retention property.
Observe that a weight functions which satisfy e-relaxation cannot satisfy retention, and vice versa.
4.2 Reciprocal Weights
Rens and Meyer, (2015) define the weight to be applied to when determining the new probability of as . It is, however, ill-defined because it is undefined when or (i.e., when or ).111Actually, Rens and Meyer, (2015) proposed two definitions for weights, one to be applied when “adding worlds” after a belief change and another when “removing worlds”. In this paper, we use the on for adding worlds. We thus adapt their definition based on the reciprocal of distance as follows.
Definition 4.5**.**
, for , where is a real number.
Because this definition is independent of evidence , we shall usually omit mentioning the evidence and simply write .
In the rest of this section, we use vocabulary and let , unless stated otherwise. Recall that and . Two example applications of follow.
Example 4.1**.**
- •
**
- •
**
- •
**
- •
**
Example 4.2**.**
- •
**
- •
**
- •
**
- •
**
Proposition 4.1**.**
* is an inverse-distance weight function, and also satisfies strict inversity, equi-distance and faithfulness, and is relaxed.*
Proof.
For all :
Non-negativity: and . Thus, .
Identity: By the identity constraint of , for all . Hence,
.
Inversity: Assume . Then . This implies that is weakly and strictly inverse.
Equi-distance: Assume . Then .
Symmetry is implied by equi-distance.
Faithfulness: Assume . Then, by the faithfulness of , . Thus, .
For all , . Relaxation (evidence and non-evidence) is thus satisfied. ∎
4.3 Difference Weights
A new definition with a similar meaning follows.It is also inversely proportional to distance. It is based on the difference between some particular applicable distance and the maximum distance.
Definition 4.6**.**
, for , where is a real number and .
Because this definition is independent of evidence , we shall usually omit mentioning the evidence and simply write . Two examples applications of follow.
Example 4.3**.**
- •
**
- •
**
- •
**
- •
**
Example 4.4**.**
- •
**
- •
**
- •
**
- •
**
If were allowed to equal zero, there would be cases when EDI-belief-change is undefined for instantiated as . For instance, suppose and the evidence is . Using Hamming distance for , . Hence, , which is undefined because . But as soon as , and .
Proposition 4.2**.**
* is an inverse-distance weight function, and also satisfies strict inversity, equi-distance and faithfulness, and is relaxed.*
Proof.
For all :
Non-negativity: By the definition of , , for all . And due to the faithfulness condition of the pseudo-distance function, . Thus, .
Identity: By the identity constraint of , for all . Hence, .
Equi-distance: Assume . Then .
Symmetry is implied by equi-distance.
Inversity: Assume . Then . This implies that is weakly and strictly inverse.
Faithfulness: Assume . Then, by the faithfulness of , . Thus, .
For all , . Relaxation (evidence and non-evidence) is thus satisfied. ∎
4.4 Bayesian Conditioning i.t.o. EDI
Bayesian conditioning can be nicely simulated as an EDI operator. Let be defined as follows.
[TABLE]
Because the definition of is independent of evidence , we shall usually omit mentioning the evidence and simply write .
Proposition 4.3**.**
* iff . That is, is EDI-compatible iff .*
Proof.
[TABLE]
implying that , for . ∎
Proposition 4.4**.**
* is an inverse-distance weight function satisfying equi-distance, symmetry, retention and faithfulness, but not strict inversity.*
Proof.
For all :
Clearly, is non-negative.
Identity follows directly from the definition.
Weak Inversity: Assume . There are two cases to consider: Either or . Assume . Then and , in which case, . Now assume . Then and , in which case, . By combining the two cases, one sees that the desired result follows. is thus an inverse-distance weight function.
Equi-distance: Assume . There are two cases to consider: Either or . Assume . Then and , in which case, . Now assume . Then and , in which case, .
Symmetry is implied by equi-distance.
Faithfulness follows directly from the definition.
Assume again that . Then and , in which case,
. Thus strict inversity does not hold. ∎
Proposition 4.5**.**
* is retentive.*
Proof.
For all :
Assume and and . Then, immediately, by the definition of , . ∎
4.5 Lewis Imaging i.t.o. EDI
Lewis imaging can be simulated as an EDI operator, but not as cleanly as . Let be defined as follows.
[TABLE]
where is fixed between 0 and 1, inclusive, and pseudo-distance function induces a mapping from worlds to total orders (instead of the weaker total pre-orders).
Proposition 4.6**.**
. That is, is EDI-compatible.
Proof.
Let . Then, independent of the definition of , .
Now let . Then,
[TABLE]
where
[TABLE]
Noting that the first line of the definition of is applicable only if , then the result follows when combining the cases when and . ∎
Whereas Proposition 4.6 says that there exists a weight function for simulating Lewis imaging with EDI, Proposition 4.7 says that the function cannot be inverse-distance.
Proposition 4.7**.**
There exists no inverse-distance weight function for which .
Proof.
We provide an example for which weak inversity must fail. is thus not an inverse-distance weight function for this example, nor in general.
Recall that is such that for every , for each of the worlds, a unique closest -world can be identified.
Observe that is retentive: Recall the retention property: if and and , then . Directly, the first line of the definition of is not applicable, and it is impossible for and and , making the second line inapplicable. Only the third line is applicable, and retention thus holds.
Now let . And let . Then ,
, and let . Note that must equal 1 (second line of definition) and by retention, . Therefore,
, but . ∎
Proposition 4.8**.**
* is retentive.*
Proof.
Assume and . Only the second and third lines of the definition of are applicable. We know that whenever , then . Hence, due to , the second line cannot be applicable. Therefore, . ∎
4.6 Generalized Imaging i.t.o. EDI
Generalized imaging can be simulated as an EDI operator, and where is always retentive, is only retentive under a reasonable condition. Let be defined as follows.
[TABLE]
Proposition 4.9**.**
. That is, is EDI-compatible.
Proof.
Let . Then, independent of the definition of , .
Now let . Then,
[TABLE]
where
[TABLE]
Noting that the first line of the definition of is applicable only if , then the result follows when combining the cases when and . ∎
For example, the reader can confirm that and .
Whereas Proposition 4.9 says that there exists a weight function for simulating generalized imaging with EDI, Proposition 4.10 says that the function cannot be inverse-distance.
Proposition 4.10**.**
There exists no inverse-distance weight function for which .
Proof.
Lewis imaging is a specialization of generalized imaging. The proposition is thus entailed by Proposition 4.7. ∎
Lemma 4.1**.**
Let . Then, iff is faithful.
Proof.
Recall that .
Assume is faithful. Then for all , if , then . By identity of , . Hence, .
Assume is not faithful. Then there exists a pair of worlds , such that if , then . Hence, . Therefore, it is not the case that . ∎
Proposition 4.11**.**
* is retentive iff is faithful.*
Proof.
Assume is faithful. Assume and . Only the second and third lines of the definition of are potentially applicable. By Lemma 4.1, we know that whenever , then such that . Hence, due to our assumption that , the second line cannot be applicable. Therefore, .
Assume is not faithful. Assume and . Only the second and third lines of the definition of are potentially applicable. By Lemma 4.1, we know that whenever , then such that . Hence, due to our assumption that , the second line might be applicable. Therefore, it could be that . ∎
Faithfulness is a reasonable property to expect of a distance function. is thus retentive under reasonable conditions.
4.7 Discussion
To summarize, and are relaxed, and and are retentive, and is retentive iff the associated pseudo-distance function is faithful.
Let be or . Let be a particular piece of evidence and the belief state resulting from the -th repeated application of on the resulting belief states for . That is, , , and so on. By running multiple experiments on a computer, we noticed that, as increases, settles on a particular belief state, that is, . (Belief states had eight worlds.) We ran 100 trials for each of the four combinations of choices for and or . For each trial, an initial belief state and evidence was chosen randomly. and were ignored. Per trial, operation was applied to the resulting belief state ten times with the selected for that trial. The difference between the probabilities of a world at successive applications of were recorded, and the difference of all worlds averaged. See Figure 1. The figure shows the average for the 100 trials of the average difference at each application of the change operation. Note how the differences asymptotically decrease. Note that these results do not imply that belief states become uniform distributions over -worlds. In fact, distributions are typically not uniform; distributions seem to be dictated by the mutual reinforcement of -world probabilities and their corresponding distances from each other. However, whenever exactly two worlds model , the probability is eventually divided exactly (uniformly) between the two worlds.
Proposition 4.12**.**
Let be a particular piece of evidence and the belief state resulting from the -th repeated application of on the resulting belief states for , where is a retentive inverse-distance weight function. That is, , , and so on. Then for all .
Proof.
By definition of , if , then , that is, if , then . Therefore, we are only interested in the potential change in probability of -worlds. Hence, we consider only such that . And because contributes no mass to if , we consider only such that . Then by retention and identity of an inverse-distance weight function, iff , else it equals 0. Hence, for all , for all . ∎
The difference between retention and relaxation seems important, from the perspective of (formal) epistemology. Suppose and have probabilities 0.4, respectively, 0.6. Now if is successively observed, then according to, for instance, Bayesian conditioning and Lewis imaging, the probabilities of and will not change. We argue that there are situations when their probabilities should be allowed to change. One could argue that if and are always observed together, then the probabilities of and should become equal. To put it differently, the fact that the same information is continually received should make a difference to the spread of the likelihood of the constituent parts (atoms) of the information. Nonetheless, information received in a static environment has a different flavor to information received in a dynamic environment (giving rise to belief revision, resp., belief update): Repeated observation of the same piece of information should not change beliefs therein, beyond the first revision. The first in the series of observations revises beliefs, but subsequent observations of have no effect because it is ‘old news’/already learnt. In this sense, revision corresponds to retentive belief change. On the other hand, when the same signal is observed in the environment, every new observation is a further confirmation of the truth of , thus modifying one’s belief in the atoms of with every new observation instance (of the same ). In this sense, update corresponds to evidence-relaxed belief change.
5 EDI for Revision vs. EDI for Update
In this section, we couch all belief change operations as EDI. For each of revision and update, EDI will be specialized into two proposed definitions. One of the specializations in each case will have the following pattern: Apply the classical operation to determine the newly believed worlds, and then use EDI to determine a probability distribution over them. The other specialization in each case will be a more direct use of EDI. We shall also check, for each of the four operations, which properties the corresponding weight function satisfies. We end this section with a discussion of what it takes to translate classical belief change as EDI in terms of uniform probability distributions.
Katsuno and Mendelzon, (1992) use an example involving a room with a table, a book and a magazine in it to illustrate the difference between revision and update. Suppose we only know that either the book is on the table or the magazine is on the table, but not both. Let mean the book is on the floor and mean the magazine is on the floor. Then our belief state can reasonably be represented as , that is and . Katsuno and Mendelzon, (1992) argued that after revision with , one should believe . We thus want
[TABLE]
where indicates revision. And they argued that after update with , one should believe . We thus want
[TABLE]
where indicates update. This is the case when is , and when is or after an infinite number of with . However, in this example, . The question is, What should probabilistic revision (and update) correspond to when ? For instance, what if , given the agent currently believes ? Furthermore, we would like the same operation to be applicable whether or not the evidence contradicts current beliefs.
5.1 Probabilistic Revision via Classical Revision
This approach is to determine the new knowledge base exactly as one would in classical belief revision, and then to assign probabilities to the newly believed worlds.
Let
[TABLE]
Let be a sentence modeled by all worlds in and no other worlds. Let be defined as follows.
[TABLE]
where is some (acceptable) revision operator and is a retentive inverse-distance weight function.
Unfortunately, is not well defined, because might not be a belief state. For example, suppose there are four worlds , , and , where and only (). Let be an inverse-distance weight function such that [math]. Then, intuitively, we want . By definition, for . We know that . To compute , the second line of the definition of is applicable, hence,
[TABLE]
This problem would not occur if were not allowed. We thus define as
[TABLE]
where is some (acceptable) revision operator and is a non-evidence (n-e-) relaxed and retentive inverse-distance weight function.
Proposition 5.1**.**
* is retentive.*
Proof.
Assume and . Then only the second and third lines of the definition of are applicable. If , then the third line would be applicable, making . Thus, the proposition’s veracity depends only on the value of . And due to the condition (“if ”) in the first line of the definition of , [math]. ∎
Proposition 5.2**.**
* satisfies non-negativity and identity.*
Proof.
Non-negativity and identity follow directly from the definition of . ∎
Proposition 5.3**.**
* does not satisfy symmetry, weak inversity, strong inversity nor equi-distance.*
Proof.
Suppose the vocabulary has three atoms, the distance function is Hamming and
[TABLE]
with . Note that is an n-e-relaxed and retentive inverse-distance weight function. Assume and and all other worlds do not model . Finally, assume that is defined such that . Note that .
Then , and
. Therefore, , but
. Hence, weak inversity fails for . This case also proves that equi-distance fails.
Due to strict inversity implying weak inversity, by contraposition, strict inversity fails.
To check symmetry, observe that because and . But as we see above, . Thus, symmetry fails. ∎
Corollary 5.1**.**
Because is not symmetric or weakly inverse, it is not an inverse-distance weight function.
Proposition 5.4**.**
* is faithful iff is.*
Proof.
Assume .
Assume is not faithful. It could thus happen that . Next, assume that and , implying that . Hence, is not faithful.
Now assume is faithful. Only the second and third lines are applicable. In both cases,
(particularly, , because). ∎
5.2 Probabilistic Revision with EDI Directly
The approach here is that revision should satisfy the retentive property. Let
[TABLE]
where is an inverse-distance weight function. Note that whenever and (retention).
Proposition 5.5**.**
* does not satisfy weak inversity.*
Proof.
Assume . Then, by the identity condition of , and . Assume , and . Then and . Now it could happen that , in which case, , but . ∎
does not work as desired when ( is instantiated as and ):
[TABLE]
[TABLE]
We thus propose to use in general for probabilistic belief revision:
[TABLE]
The following definition is derived from the one above.
Definition 5.1**.**
[TABLE]
where is a non-evidence (n-e-) relaxed and an inverse-distance weight function.
The reason why is n-e-relaxed is similar to the reason why of is. Recall that . Then
[TABLE]
[TABLE]
And can also be used in cases where : Let and let . Observe that contradicts the agent’s current belief state . Let
[TABLE]
with . Note that is an n-e-relaxed and retentive inverse-distance weight function.
[TABLE]
[TABLE]
Initially, the agent is certain that the book is on the floor and there is a relatively high likelihood (0.7) that the magazine is on the table. Then the agent hears from a reliable source that actually the book is definitely on the table. The agent revises its beliefs accordingly, and reasonably still believes to a relatively high degree () that the magazine is on the table.
Proposition 5.6**.**
* is retentive.*
Proof.
Assume and . Assume . Then only the second line of the definition of is applicable. That is, . Now assume . Again, only the second line is applicable, making . The retention property is thus satisfied. ∎
Proposition 5.7**.**
* satisfies non-negativity, identity and symmetry.*
Proof.
Non-negativity follows directly. If , then only the first and third lines are applicable. In both cases, identity holds.
The first line of the definition is satisfied for iff it is satisfied for . Similarly for the second line. Because is assumed to be an inverse-distance weight function, is symmetrical. It thus follows that satisfies symmetry. ∎
Proposition 5.8**.**
* does not satisfy weak inversity, strict inversity nor equi-distance.*
Proof.
For all :
Assume . If , then, by the identity condition of , and , and only the second and third lines are applicable. Assume “” holds, but that “” does not hold. Then and . Now, it could be that . Hence, if it is not in general the case that . Thus, weak inversity fails. This case also shows that equi-distance and strict inversity fail. ∎
Proposition 5.9**.**
* is faithful iff is.*
Proof.
Assume .
Assume is not faithful. Assume and . Then only the third line is applicable. It could happen that , making , making unfaithful.
Now assume is faithful. Only the second and third lines are applicable. In both cases,
(particularly, , because ). ∎
5.3 Probabilistic Update via Classical Update
This approach is the same as we took for probabilistic revision via classical revision; to determine the new knowledge base exactly as one would in classical belief update, and then to assign probabilities to the newly believed worlds.
Recall that and is a sentence modeled by all worlds in and no others. Let be defined as follows.
[TABLE]
where is some (acceptable) update operator and is a relaxed inverse-distance weight function.
Proposition 5.10**.**
If , then is relaxed.
Proof.
Assume . Assume . If , then only the first line of the definition of is applicable, and . If , then only the second and third lines are applicable. But actually, by the two assumptions, only the second line is applicable. This implies that for all , if , then . Therefore, is relaxed. ∎
Proposition 5.11**.**
* satisfies non-negativity, identity and symmetry.*
Proof.
Non-negativity and identity follow directly from the definition of . We look at the three cases (lines) which make up the definition of . If , then . If , then . But is assumed to be an inverse-distance weight function, which implies that is symmetrical. For all other cases (third line), it must be that . ∎
Proposition 5.12**.**
* does not satisfy weak inversity, strong inversity nor equi-distance.*
Proof.
Suppose the vocabulary has three atoms, the distance function is Hamming and is with . Assume and and all other worlds do not model . Finally, assume that is defined such that . Note that .
Then , and
. Therefore, , but
. Hence, weak inversity fails for . This case also proves that equi-distance fails.
Due to strict inversity implying weak inversity, by contraposition, strict inversity fails. ∎
Corollary 5.2**.**
Because is not weakly inverse, it is not an inverse-distance weight function.
Proposition 5.13**.**
* is faithful iff is.*
Proof.
Assume .
Assume is not faithful. Next, assume that . It could thus happen that , implying that is not faithful.
Now assume is faithful. Only the second and third lines are applicable. In both cases,
(particularly, , because ). ∎
5.4 Probabilistic Update with EDI Directly
Finally, we propose to use any relaxed (e-relaxed and n-e-relaxed) inverse-distance weight function for update. That is, we propose that one may use operation for updating, such that it is relaxed and where is an inverse-distance weight function.
Recall the example in Section 5.2, where , , is instantiated as and . Notice that if an agent were to update its beliefs () with , the resulting belief state would be exactly the same as for revision via : . The reader may confirm that on the second application of , the agent still believes . On the second application of , however, the agent believes .
Each of strict inversity, equi-distance and faithfulness might be satisfied, depending on the particular instantiation of .
6 Rationality Postulates for EDI
In this section, we present and propose rationality postulates – conditions which should be satisfied – for belief revision and belief update. We do not claim that these are sufficient. Some of them are necessary, but some may not be. Nonetheless, the postulates have been given due consideration. For this study, we identify three core (necessary) rationality postulates for revision and update. We prove that and satisfy the three core revision postulates, and that and satisfy the three core update postulates.
6.1 Probabilistic Revision Postulates
We adopt the rationality postulates for probabilistic belief revision from Gärdenfors, (1988) and we employ probabilistic versions of the rationality postulates for (non-probabilistic) belief update from Katsuno and Mendelzon, (1992).
First, we discuss the operation called expansion, because it is mentioned in the postulates below. Conventionally, (classical) expansion (denoted +) is the logical consequences of , where is new information and is the current belief set. Or if the current beliefs can be captured as a single sentence , expansion is defined simply as . We denote the expansion of belief state with as .
(Unless stated otherwise, it is assumed that is logically satisfiable.) The probabilistic belief revision postulates (adapted from (Gärdenfors,, 1988)) in our notation are
- () is a belief state
- ()
- () If , then
- () If , then
- () If , then
We take - to be self explanatory, and to be the three core postulates. is an interpretation of the AGM postulate which says that if the evidence is consistent with the currently held beliefs, then revision amounts to expansion.
says that if is deemed possible in the belief state revised with , then expanding the revised belief state with should be equal to revising the original belief state with the conjunction of and .
We propose adding this postulate:
- () If , then
says that the belief in an -world cannot decrease due to the reception of information that .
Recall that
[TABLE]
where is some (acceptable) revision operator and is a non-evidence relaxed and retentive inverse-distance weight function.
Proposition 6.1**.**
() is satisfied for .
Proof.
Due to normalization (via ) in the definition of EDI (Def. 4.2), is a belief state whenever there exists at least one world s.t. if , then . Assume that . It must be shown that there exists at least one world for which and .
Proof by contradiction: Let be an arbitrary world in . (Main assumption) Assume there exists no world for which and . Assume . Then the first line of the definition of is applicable and it must be that . But this is impossible because it implies that for all worlds , ( is implicitly assumed to be well-defined). Therefore, it must be that .
Let (there must exists at least one such ). Then the second line is applicable. Let (there must exist at least on such ). This implies that , which implies that . Now, either or .
Assume . Recall that . Note that the first line will eventually become applicable, making and contradicting the main assumption. Therefore, it must be the case that . Recall that is n-e-relaxed, that is, , if and , then . This also contradicts the main assumption.
There is no other way to satisfy the main assumption. It must thus be the case that there exists some world for which and , implying that there exists at least one world s.t. if , then . ∎
Proposition 6.2**.**
() is satisfied for .
Proof.
The proposition is satisfied when . It is almost a direct consequence of the definition of : , only if . ∎
Proposition 6.3**.**
() is satisfied for .
Proof.
If , then , iff and , implying that . ∎
Recall that
[TABLE]
where is a non-evidence relaxed and an inverse-distance weight function.
Proposition 6.4**.**
() is satisfied for .
Proof.
Due to normalization (via ) in the definition of EDI (Def. 4.2), is a belief state whenever there exists at least one world s.t. if , then . Assume that . It must be shown that there exists at least one world for which and .
Proof by contradiction: Let be an arbitrary world in . (Main assumption) Assume there exists no world for which and .
Either or . Assume . Then there exists at least one -world with probability greater than zero. Let be that world. Thus, when , (first line of definition) and the main assumption fails in this case. Assume . Then there exists at least one non--world with probability greater than zero. Let be that world. Hence, the first and second lines are inapplicable, because “” fails. Therefore, only the third line is applicable. is n-e-relaxed, so , if and , then . By the main assumption, this implies that for all worlds , . But this contradicts our deduction that there exists some for which .
There is no other way to satisfy the main assumption. It must thus be the case that there exists some world for which and , implying that there exists at least one world s.t. if , then . ∎
Proposition 6.5**.**
() is satisfied for .
Proof.
The proposition is satisfied when . It is almost a direct consequence of the definition of : , only if . ∎
Proposition 6.6**.**
() is satisfied for .
Proof.
If , then , iff and , implying that . ∎
6.2 Probabilistic Update Postulates
A set of probabilistic belief update postulates is now adapted from (Katsuno and Mendelzon,, 1992)’s classical postulates. Each classical postulate is translated into probabilistic counterpart (in our notation).
-
()
-
()
-
() if , then
-
() if , then
-
() if , iff
-
() if , then if , then
It is arguable which of , or are appropriate translations of (if any). is perhaps too strong and perhaps too weak. If is taken to be an appropriate translation, then we argue that in a probabilistic setting, it is too strong. For instance, it implies that if , then , but this is not the case when is e-relaxed.
- () if both and are satisfiable, then is also satisfiable
- () is a belief state
has the simpler form because is assumed satisfiable and is assumed to be a belief state.
-
() if and , then
-
() if , then
-
()
-
() if is satisfiable and , then:
Suppose is a knowledge base and implies . Then in terms of probabilities, one should expect , where is the ‘knowledge base’. Given this metaphor, one can derive from the translation ‘if is satisfiable, then for all , if , then , of which is the sentential version.
- () if and , then:
- () if and , then:
- () if and , then:
is weaker than but arguably more reasonable/rational in a probabilistic setting.
- () if is complete, then:
- () if there exists a s.t. , then:
KM’s justification for is “If some possible world results from updating a complete KB with and it also results from updating it with , then this possible world must also result from updating the KB with ”. However, we go a step farther (given a probabilistic setting), and say that the belief in any given complete belief state resulting from updating with should be no less than the least of the beliefs in after updating the belief state with and updating it with separately, and no more than the sum of the belief in after updating the belief state with and updating it with separately.
- ()
At this time, we do not have a satisfactory translation for .
, and are taken to be the three core postulates.
Recall that
[TABLE]
where is some (acceptable) update operator and is a relaxed inverse-distance weight function.
Proposition 6.7**.**
() is satisfied for .
Proof.
The proposition is satisfied when . It is almost a direct consequence of the definition of : , only if . ∎
Proposition 6.8**.**
() is satisfied for .
Proof.
We must prove that is a belief state. Due to normalization (via ) in the definition of EDI (Def. 4.2), is a belief state whenever there exists at least one world s.t. if , then . Assume that . It must be shown that there exists at least one world for which and .
Proof by contradiction: Let be an arbitrary world in . (Main assumption) Assume there exists no world for which and . Assume . Then the first line of the definition of is applicable and it must be that . But this is impossible because it implies that for all worlds , ( is implicitly assumed to be well-defined). Therefore, it must be that .
Let (there must exists at least one such ). Then the second line is applicable. Let (there must exist at least on such ). This implies that , which implies that . Now, either or .
Assume . Recall that . Note that the first line will eventually become applicable, making and contradicting the main assumption. Therefore, it must be the case that . Recall that is relaxed and thus n-e-relaxed – that is, , if and , then . This also contradicts the main assumption.
There is no other way to satisfy the main assumption. It must thus be the case that there exists some world for which and , implying that there exists at least one world s.t. if , then . ∎
Proposition 6.9**.**
() is satisfied for .
Proof.
If , then , iff and , implying that . ∎
Recall that is a relaxed inverse-distance weight function.
Proposition 6.10**.**
() is satisfied for .
Proof.
The proposition is satisfied when . It is almost a direct consequence of the definition of : , only if . ∎
Proposition 6.11**.**
() is satisfied for .
Proof.
We must prove that is a belief state. Due to normalization (via ) in the definition of EDI (Def. 4.2), is a belief state whenever there exists at least one world s.t. if , then . Assume that . It must be shown that there exists at least one world for which and .
Let (there must exist at least one such). By relaxation, . ∎
Proposition 6.12**.**
() is satisfied for .
Proof.
If , then , iff and , implying that . ∎
7 Conclusion
We have proposed probabilistic belief revision and belief update operators, both within an imaging framework. The role of a weight function – usually expected to be inversely proportional to the ’distance’ between worlds – is central to the Expected Distance Imaging (EDI) framework. In this paper, we have only begun the investigation into the behavior or properties of various instantiations of a general form of imaging. There is still much work to be done. For instance, what is the effect of retention versus relaxation and when is one more appropriate than the other? We have worked with the hypothesis that retention relates to revision and that relaxation relates to update, but to what degree is this hypothesis true? What is the meaning of the effect of the size of in and ?
We would like to make a deeper study of the probabilistic rationality postulates, especially those for update (as they have not been given much attention in the literature). We would then like to test various EDI instantiations against the postulates and use the results to perhaps design new EDI operators. We hope that these insights might lead to further insights in order to better understand the relationship between revision and update, whether classical or not.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Alchourrón et al., (1985) Alchourrón, C., Gärdenfors, P., and Makinson, D. (1985). On the logic of theory change: Partial meet contraction and revision functions. Journal of Symbolic Logic , 50(2):510–530.
- 2Chhogyal et al., (2014) Chhogyal, K., Nayak, A., Schwitter, R., and Sattar, A. (2014). Probabilistic belief revision via imaging. In Pham, D.-N. and Park, S.-B., editors, PRICAI 2014: Trends in Artif. Intell.: Thirteenth Pacific Rim Intl. Conf. on Artif. Intell. , volume 8862 of LNAI , pages 694–707. Springer Intl. Publishing, Switzerland.
- 3Dubois and Prade, (1993) Dubois, D. and Prade, H. (1993). Belief revision and updates in numerical formalisms: An overview, with new results for the possibilistic framework. In Proceedings of the Thirteenth Intl. Joint Conf. on Artif. Intell. (IJCAI-93) , pages 620–625, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc.
- 4Friedman and Halpern, (1999) Friedman, N. and Halpern, J. (1999). Modeling belief in dynamic systems. Part II: Revision and update. Journal of Artif. Intell. Research (JAIR) , 10:117–167.
- 5Gärdenfors, (1988) Gärdenfors, P. (1988). Knowledge in Flux: Modeling the Dynamics of Epistemic States . MIT Press, Massachusetts/England.
- 6Katsuno and Mendelzon, (1992) Katsuno, H. and Mendelzon, A. O. (1992). On the difference between updating a knowledge base and revising it. In Gärdenfors, P., editor, Belief Revision , pages 183–203. Cambridge University Press.
- 7Kern-Isberner, (2001) Kern-Isberner, G. (2001). Revising and updating probabilistic beliefs. In Williams, M.-A. and Rott, H., editors, Frontiers in Belief Revision , volume 22 of Applied Logic Series , pages 393–408. Kluwer Academic Publishers, Springer Netherlands.
- 8Lang, (2007) Lang, J. (2007). Belief update revisited. In de Mantaras, R. L., editor, Proceedings of the Twentieth Intl. Joint Conf. on Artif. Intell. (IJCAI-07) , pages 2517–2522, Menlo Park, CA. AAAI Press.
