Beliefs and Probability in Bacchus' l.p. Logic: A~3-Valued Logic Solution to Apparent Counter-intuition
Mieczys{\l}aw A. K{\l}opotek

TL;DR
This paper introduces a 3-valued logic to address counterintuitive issues in Bacchus' L.p. logic, aiming to better integrate logic and probability by resolving fundamental discrepancies.
Contribution
It proposes a novel 3-valued logic framework to overcome counterintuitive behaviors in Bacchus' L.p. logic, enhancing the integration of logic and probability.
Findings
The 3-valued logic reduces counterintuitive outcomes in L.p. logic.
It clarifies the discrepancy between first-order logic and statistical inference.
The approach improves the coherence of probabilistic logic systems.
Abstract
Fundamental discrepancy between first order logic and statistical inference (global versus local properties of universe) is shown to be the obstacle for integration of logic and probability in L.p. logic of Bacchus. To overcome the counterintuitiveness of L.p. behaviour, a 3-valued logic is proposed.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPhilosophy and Theoretical Science
Beliefs and Probability in Bacchus’ l.p. Logic: A 3-Valued
Logic Solution to Apparent Counter-intuition
Mieczyslaw A. Klopotek
*Institute of Computer Science Polish Academy of Sciences
Warsaw, Poland*
Abstract
Fundamental discrepancy between first order logic and statistical inference (global versus local properties of universe) is shown to be the obstacle for integration of logic and probability in L.p. logic of Bacchus. To overcome the counterintuitiveness of L.p. behaviour, a 3-valued logic is proposed.
1 Introduction
The paper of Bacchus1 aims at painless integration of two paradigms of human reasoning, that is 1) first order logics and 2) statistical inference. (see also8 ) in such a way as to avoid all the contradictions emerging in previous approaches.
Nonetheless the claim of the current paper is that also the L.p. logic of Bacchus1 fails to achieve its primary goal of becoming the tool for describing knowledge & reasoning in expert systems and other knowledge-based systems. In Section 2 we present several simple examples of basic flaws of this logic exploiting the counter intuitiveness of the L.p. logic. Section 3 demonstrates a more elaborated example pointing at weaknesses of the L.p. logic.
As a remedy we propose (in a sketchy way) a different deduction theory taking into account the gap between first-order way of thinking (global treatment of domains) and that of statistical (experimental) sciences (local treatment of domains).
2 Flaws of Theories Criticized by Bacchus and His Solution
A number of works were concerned with representational and inferential issues when probabilities of events were identified as degrees of belief. Bacchus criticized i.e. the following approaches:
Approach 1: (propositional logic) : (see.6,7 ) Probability of a sentence is the probability of selection of one of those possible worlds wherein this sentence holds. E.g. the 90 % belief that the famous Tweety flies is stated as with being greater than 0.9. However, such an approach does not make it easy to state that “Most birds fly”.
Approach 2: (first order logic) The probability of the expression: be expressed as . Following the principles of probability calculus we obtain: . Hence if , then it should hold that . However one can imagine such a set of possible worlds that in most of those worlds most of birds fly and at the same time in most of the worlds non-flying birds, exist, that is both and hold which means a contradiction.
Approach 3: Cheeseman5 proposed that the above statements be meta-expression with conditional probability of the type:
[TABLE]
However, this representation cannot be treated as a method of expression of statistical knowledge but rather as an update method for degrees of belief, as it leads to a contradiction when mixing general and particular knowledge (see1 for details).
So, both probability inside and outside the scope of quantifiers lead to contradictions. Hence Bacchus proposed an L.p. logic described in1 , where the probability is a quantifier itself (probability of the formula with the free variable is expressed as . Let us cite here from 1 :
INFERENCE RULE: (modus ponens)
R1: From infer .
DEFINITION: Conditional probability [\beta\mid\alpha]_{\overline{x}}\ ( conditioned on ):
[TABLE]
3 The Flaws of Bacchus Himself
Let us show now the major weaknesses of the L.p. logic. Let us notice the following:
many Logic-based knowledge systems express general knowledge in terms of implications, 2. 2.
all the examples of statistical knowledge representation in 1 refer to conditional probabilities instead of probabilities of implications. 3. 3.
the concept of conditional probability in L.p. is not a primary one but a concept derived from “absolute” probability in a strange way (see below), 4. 4.
the strangeness of conditional probability definition results from missing logical construct corresponding to conditional probability, (a construct of the form: with . 5. 5.
the conditional probability does not suffice to substitute this missing logical construct, for how to express a statement “in most cases whenever implies then also implies ”.
Let us demonstrate the non-suitability of implication for expressing statistical knowledge.
Example 1: What is the sum of conditional probabilities of an event and its counter-event ? The answer is: either 1 or 0!! (depending on the probability of , that is ).
Example 2: What is the conditional probability of an event conditioned on itself: ? The answer is: either 1 or 0!!! (depending on the probability of , that is ).
Example 3: Let us consider the following facts:
“With a certainty of at most 90 % if you are man then you are fertile.”
“With a certainty of at most 80 % if you are a fertile man then you will become a father”
“If you are a father then you are a man.”
What is the probability of being a woman ?
The answer is: at most 0.7. The proof is as follows:
We obtain the translation of the facts:
[TABLE]
[TABLE]
[TABLE]
Hence:
[TABLE]
[TABLE]
Hence:
[TABLE]
[TABLE]
But:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Example 4. Let us consider the following facts:
“For all x, if x is a male then x is not pregnant” and
“For all x, it is not true that if x is a male then x is pregnant”
The question is: are there any females ?
Let us use the following predicates: –male –pregnant
We obtain the translation:
[TABLE]
Hence:
[TABLE]
hence:
[TABLE]
But:
[TABLE]
Hence
[TABLE]
but
[TABLE]
[TABLE]
Hence:
[TABLE]
Hence:
[TABLE]
[TABLE]
So being a female is improbable !!!!
Before proceeding with another example let us remind a basic fact from intuitive reasoning: whenever we consider a piece of knowledge to be nearly sure, we reason with it as if it were absolutely true and when we obtain a result then we believe it to be nearly sure if the reasoning chain is not too long. We also take our experience learned in one environment and expect it to hold in a different environment if the first environment yielded significant results. When we apply a body of general knowledge to an individual case, we usually possess only partial knowledge of the case and reason as if we have had a population of cases fitting our knowledge of the individuum and obtain statistical results covering this artificial population. This is how Bayesian networks4 are used for individual diagnosis, as done in also. This is also the very nature of Miller’s Principle3 .
Let us state some claims about L.p. logic5 :
Theorem 1
*L.p. logic is equivalent to a logic Lp’ derived from L.p. by substitution of the inference rule R with R1’ and R2’:
R1’: From infer ., with vector being vector of all free variables in and .
R2’: From infer as in R1’).*
PROOF: see5
Theorem 2
*Lp’ logic is equivalent to a logic Lp” derived from Lp’ by substitution of the inference rules Ri’ with R1”, R2”, R3”:
R1”: From infer , with vector \overline{x}\ is vector of all free variables in .
R2” = ***R2’
R3”**: From infer , (\overline{x}\ as above).
PROOF: see5
Theorem 3
Given , always .
PROOF: easily seen
Theorem 4
If within the proof system Lp’ in a certain step of the proof the premise/conclusion is weakened (\varepsilon_{i}\geq 0\ and small), then in the equivalent proof in Lp” we get:
PROOF:
[TABLE]
hence:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Example 5: Let us consider the example 8 from. (Fig. 1: from1 with my interpretation for ): Let us first consider the rules:
(guilty)prisonpunishmentfinancialpunishmentFig1: Example 8 from [page 227] [1] – intrepreted
[TABLE]
Hence if is valid, then in the logic Lp’ we obtain rules:
[TABLE]
then
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Now let us imagine we verify our rules in a real world environment. Let among 100 persons appearing before court be 5 innocent ones none of which was condemned, and 95 guilty persons of which 94 were imprisoned and one had to pay a fine. Then:
[TABLE]
So in fact our rules are highly probable. Now let us apply the rules learned previously to an individuum of which we know it is innocent. So we consider a population with . Following the spirit of the previous deduction we obtain:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
[TABLE]
However, if we considered conditional probabilities instead of probabilities of inference rules we would obtain: (innocent are not condemned). So apparently the validity of THEOREM 4 is denied, so also that of Bacchus L.p. Though the reason for the flaw is obvious – inference rules are global in nature and conditional probabilities cover local properties of a universe, hence are more suitable to be transferred to another universe – but the solution is not as easy.
4 A Solution
To overcome the problems mentioned above it is necessary to find a logical construct corresponding to conditional probability. It is easily seen that enforcing the interpretation of probability of ordinary implication as conditional probability would lead to serious problems for then: , which may easily lead to a contradiction.
So we see that two-valued logics are not sufficient for our purposes. Hence let us introduce the logical construct having the following three-valued semantics: (T - =true, F=false, U=uninteresting)
pTUFqTTUUUUUUFFUU
We need also truth tables for basic logical constructs :
p\wedge q$$pTUFTTUFUUUFFFFFp\vee q$$pTUFTTTTUTUUFTUF\neg q$$qTUFFUT
Let us define two probability quantifiers: and in such a way that expresses the proportion of the expression taking value T to cases it takes value T or F. is then equivalent to conditional probability . expresses the proportion of cases where a takes values either T or F to cases it takes any of the values T,F,U. We have then the following properties of both:
1)
2)
3)
4)
The quantifier captures local properties of the universe while carries global ones. It is then easily seen that using f instead of implications and instead of in previous examples would resolve all the problems encountered there. Beside this, the statement “Almost always whenever implies then also implies ” may be properly expressed by . So, by proper axiomatization we will gain the following: if a proof is to be transferred from one universe to another one locally similar then all the steps engaging will be kept and those involving need to be verified – also with respect to Miller’s Principle. A detailed presentation of the axiomatization is given in9.
**References **
F. Bacchus: “L.p., a logic for representing and reasoning with statistical knowledge”, Computer Intelligence 6, 209-231, (1990). 2. 2.
P. Cheeseman: “An inquiry into computer understanding”, Computational Intelligence, 4(1), 58-66, (1988). 3. 3.
J.Y. Halpern: “An analysis of first-order logics of probability”, Artificial Intelligence 46(3), 311-350, (1990). 4. 4.
T. Hrycej: “Gibbbs Sampling in Bayesian Networks”, Artificial Intelligence 46 (3), 351-36, (1990). 5. 5.
M.A.Klopotek: “Bayesian Network and L.p. Logic for Statistical Inference”, A Talk at the National Workshop Cybernetics- Intelligence- Development CIR-91, Siedlce-Poland, Sept. (1991) - to appear in Proceedings. 6. 6.
C.G. Morgan: “Weak conditional comparative probability as a formal semantic theory”, Zeit. Fuer Math. Log.30, 199-212, (1984). 7. 7.
N.J. Nilsson: “Probabilistic logic ”, *Artificial Intelligence * 28, 71-87, (1986). 8. 8.
S. Watanabe: “Pattern Recognition, Human and Machine”, (1987). 9. 9.
M.A.Klopotek: “An Axiomatic System For Statistical And Logical Reasoning” - in preparation.
