Probabilistic squares and hexagons of opposition
under coherence111This is a substantially extended version of a paper ([45]) presented at SMPS 2016 (Soft Methods in Probability and Statistics 2016) conference held in Rome in September 12–14, 2016.
Niki Pfeifer
Ludwig-Maximilians-University Munich, Germany
[email protected]
Giuseppe Sanfilippo
University of Palermo, Italy
[email protected]
Abstract
Various semantics for studying the square of opposition and the hexagon of opposition have been proposed recently.
We interpret sentences by imprecise (set-valued) probability assessments on a finite sequence of conditional events. We introduce the acceptability of a sentence within coherence-based probability theory.
We analyze the relations of the square and of the hexagon in terms of acceptability. Then, we show how to construct probabilistic versions of the square and of the hexagon of opposition by forming suitable tripartitions of the set of all coherent assessments. Finally, as an application, we present new versions of the square and of the hexagon involving generalized quantifiers.
keywords:
coherence , conditional events , hexagon of opposition , imprecise probability , square of opposition , quantified sentences , tripartition
1 Introduction
There is a long history of investigations on the square of opposition spanning over two millenia [4, 38]. A square of opposition represents logical key relations among basic sentence types in a diagrammatic way.
The basic sentence types, traditionally denoted by A (universal affirmative: “Every S is P”), E (universal negative: “No S is P”), I (particular affirmative: “Some S are P”), and O (particular negative: “Some S are not P”), constitute the corners of the square. The diagonals and the sides of the square of opposition are formed by the following logical relations among the basic sentence types:
A and E are contraries (i.e., they cannot both be true), I and O are subcontraries (i.e., they cannot both be false), A and O as well as E and I are contradictories (i.e., they cannot both be true and they cannot both be false), I is a subaltern of A and O is a subaltern of E (i.e., A entails I and E entails O; for a visual representation see, e.g., Figure 3 below, and cover the probabilities for seeing the traditional square of opposition).
In the early 1950ies, the square of opposition was expanded to the hexagon of opposition, by adding a sentence at the top and another one at the bottom of the square (see, e.g., Figure 5). Recently, the square of opposition as well as the hexagon of opposition and its extensions have been investigated from various semantic points of view (see, e.g., [3, 4, 11, 20, 21, 22, 28, 34, 35, 36]).
In this paper we present a probabilistic analysis of the square of opposition under coherence, introduce the hexagon of opposition under coherence, and study the semantics of basic key relations among quantified statements.
After preliminary notions (Section 2), we introduce, based on g-coherence, a (probabilistic) notion of sentences and their acceptability and show how to construct squares of opposition under coherence from suitable tripartitions (Section 3). Then, we present an application of our square to the study of generalized quantifiers (Section 4). In Section 5 we introduce the hexagon of opposition under coherence. Section 6 concludes the paper by some remarks on future work.
2 Preliminary Notions
The
coherence-based approach to probability and to other uncertain measures has been adopted by many
authors (see, e.g.,
[5, 6, 9, 10, 14, 15, 16, 17, 18, 24, 26, 30, 31, 41, 42, 44]);
we therefore recall only selected key features of coherence and its generalizations in this section.
An event E is
a two-valued logical entity which can be either true or false.
The indicator of E is a two-valued numerical quantity which is 1,
or 0, according to whether the event E is true, or false, respectively. We
use the same symbols for events and their indicators. We denote by
⊤ the sure event (i.e., tautology or logical truth) and by ⊥ the impossible event (i.e., contradiction or logical falsehood). Moreover, given two events E and H, we
denote by E∧H (resp., E∨H) conjunction (resp., disjunction). To simplify
notation, we will use the product EH to denote the conjunction E∧H, which also denotes the indicator of E∧H. We denote by \widebarE the negation of E.
Given two events E and H, with H=⊥, the
conditional event E∣H is defined as a three-valued logical
entity which is true if EH (i.e., E∧H) is true,
false if \widebarEH is true, and indetermined (void) if H is false ([19, p. 307]).
In terms of
the betting metaphor, if you assess p(E∣H)=p, then you are willing
to pay (resp., to receive) an amount p and to receive (resp., to pay) 1, or 0, or p, according to
whether EH is true, or \widebarEH is true, or \widebarH is true (bet
called off), respectively.
For defining coherence, consider a real function p:F→R, where F is an arbitrary
family of conditional events. Consider a finite sub-family
Fn=(E1∣H1,…,En∣Hn)⊆F, and the vector Pn=(p1,…,pn),
where pi=p(Ei∣Hi),i=1,…,n. We denote by
Hn the disjunction H1∨⋯∨Hn.
With the pair (Fn,Pn) we associate the random gain
G=∑i=1nsiHi(Ei−pi),
where s1,…,sn are n arbitrary real numbers.
G represents the net gain of n transactions, where for each transaction its meaning is specified by the sign of si (plus for buying or minus for selling)
and its scaling is specified by the magnitude of si.
Denoting by GHn the set of values of G restricted to Hn, we recall
Definition 1**.**
The function p defined on F is called coherent
if and only if, for every integer n, for every finite sub-family Fn
⊆F and for every s1,…,sn, it holds that:
minGHn≤0≤maxGHn.
We say that p is incoherent if and only if p is not coherent.
As shown by Definition 1, a probability assessment is coherent if and only if, in any finite combination of n bets, it does not happen that the values in the set GHn are all positive, or all negative (no Dutch Book). Moreover, coherence of p(E∣H) requires that p(E∣H)∈[0,1] for every E∣H∈F. If p on F is coherent, we call it a conditional probability on F (see, e.g., [1, 17, 47]).
Notice that, if p is coherent, then p also satisfies all the well known properties of finitely additive conditional probability (while the converse does not hold; see, e.g., [17, Example 13] or [23, Example 8]).
In what follows F will denote finite sequence of conditional events. Let
F=(E1∣H1,…,En∣Hn). We denote by P
a (precise) probability assessment P=(p1,…,pn) on F, where
pj=p(Ej∣Hj)∈[0,1], j=1,…,n. Moreover, we denote by Π
the set of all coherent precise assessments on F. We recall that when there are no logical relations among the
events E1,H1,…,En,Hn involved in F, that is
E1,H1,…,En,Hn are logically independent, then the set Π
associated with F is the whole unit hypercube [0,1]n.
If there
are logical relations, then the set Π could be a strict
subset of [0,1]n. As it is well known Π=∅;
therefore, ∅=Π⊆[0,1]n.
If not stated otherwise, we do not make any assumptions concerning logical independence.
Definition 2**.**
An imprecise, or set-valued, assessment I on a family of conditional events F
is a (possibly empty) set of precise
assessments P on F.
Definition 2 states that
an imprecise (probability) assessment I on a sequence
of n conditional events
F is just a (possibly empty)
subset of
[0,1]n ([25, 27, 28]). For instance, think about an agent (like Pythagoras) who considers
only rational numbers to evaluate the probability of an event E∣H. Pythagoras’ evaluation can be represented by the imprecise assessment I=[0,1]∩Q on E∣H. Moreover, a constraint like p(E∣H)>0 can be represented by the imprecise assessment I=]0,1] on E∣H.
Given an imprecise assessment I we denote by
\widebarI the
complementary imprecise assessment of I, i.e.
\widebarI=[0,1]n∖I. We now recall the notions of g-coherence and total coherence in the general case of imprecise (in the sense of set-valued) probability assessments [28].
Definition 3** (g-coherence).**
Given a sequence of n conditional events F. An imprecise
assessment I⊆[0,1]n on F is g-coherent iff there exists a coherent precise assessment P
on F
such that P∈I.
Definition 4** (t-coherence).**
An imprecise assessment I on F is totally coherent
(t-coherent) iff the following two conditions are satisfied:
(i) I is non-empty; (ii) if P∈I, then P is a
coherent precise assessment on F.
Definition 5** (t-coherent part).**
Given a sequence of n conditional events F. Let Π be the set of all coherent assessments on F.
We denote by π:℘([0,1]n)→℘(Π) the function defined by π(I)=Π∩I, for any imprecise assessment I∈℘([0,1]n). Moreover, for each subset I∈℘([0,1]n) we call π(I) the t-coherent part of I.
Of course, if π(I)=∅, then I is g-coherent and π(I) is t-coherent.
3 From Imprecise Assessments to the Square of Opposition
In this section we consider imprecise assessments on a given sequence F of n conditional events. In our approach, a sentence s is a pair
(F,I), where I⊆[0,1]n is an imprecise assessment on F. We introduce the following equivalence relation under t-coherence:
Definition 6**.**
Given two sentences s1:(F,I1) and s2:(F,I2), s1 and s2 are equivalent (under t-coherence), denoted by s1≡s2, iff π(I1)=π(I2).
Definition 7**.**
Given three sentences s:(F,I), s1:(F,I1), and s2:(F,I2). We define: s1∧s2:(F,I1∩I2) (conjunction); s1∨s2:(F,I1∪I2) (disjunction); \widebars:(F,\widebarI), where \widebarI=[0,1]n∖I (negation).
Remark 1**.**
As the basic operations among sentences are defined by set-theoretical operations, they inherit the corresponding properties (including associativity, commutativity, De Morgan’s law, etc.).
Moreover, as π(I1∩I2)=π(I1)∩π(I2), by setting s1∗=(F,π(I1)), s2∗=(F,π(I2)) and (s1∧s2)∗:(F,π(I1∩I2)), it follows that
(s1∧s2)≡(s1∧s2)∗≡s1∗∧s2∗. Likewise, s1∨s2≡(s1∨s2)∗≡s1∗∨s2∗.
As we interpret the basic sentence types involved in the square of opposition by imprecise probability assessments on sequences of conditional events, we will introduce the following notion of acceptability, which serves as a semantic bridge between basic sentence types and imprecise assessments:
Definition 8**.**
A sentence s:(F,I) is (resp., is not) acceptable iff the assessment I on F is (resp., is not) g-coherent, i.e. π(I) is not (resp., is) empty.
Remark 2**.**
If s1∧s2 is acceptable, then s1 is acceptable and s2 is acceptable. However, the converse does not hold, indeed s1:(E∣H,{1}) is acceptable and s2:(E∣H),{0}) is acceptable, but s1∧s2:(E∣H,∅) is not acceptable (as π(∅)=∅).
Definition 9**.**
Given two sentences s1:(F,I1) and s2:(F,I2), we say, under coherence:
s1 and s2 are contraries iff the sentence s1∧s2 is not acceptable;222
Some definitions of contrariety
additionally require that “s1 and s2 can both be acceptable”.
For reasons stated in [28], we omit this additional
requirement.
Similarly, mutatis mutandis, in our definition of subcontrariety.
s1 and s2 are subcontraries iff \widebars1∧\widebars2 is not acceptable;
s1 and s2 are contradictories iff s1 and s2 are both, contraries and subcontraries;
s2 is a subaltern of s1 iff the sentence s1∧\widebars2 is not acceptable.
Remark 3**.**
By Remark 1, we observe that two sentences s1 and s2 are contraries if and only if π(I1∩I2)=π(I1)∩π(I2)=∅. Moreover, two sentences s1 and s2 are subcontraries if and only if π(\widebarI1∩\widebarI2)=π(\widebarI1)∩π(\widebarI2)=∅, that is (by De Morgan’s law) if and only if π(I1)∪π(I2)=Π. Then, two sentences s1 and s2 are contradictories
if and only if π(I1)∩π(I2)=∅ and
π(I1)∪π(I2)=Π, that is if and only if s2=\widebars1 (and, of course, s1=\widebars2).
Given two sentences s1,s2 we also observe that s2 is a subaltern of s1 if and only if Π∩(I1∩\widebarI2)=∅, which also amounts to say that Π∩I1⊆Π∩I2, that is if and only if π(I1)⊆π(I2). For instance, s1∨s2 is a subaltern of s1 and also of s2; similarly, s1 is a subaltern of s1∧s2, and s2 is a subaltern of s1∧s2.
Furthermore, if s1 is not acceptable, that is π(I1)=∅, then any sentence s2 is a subaltern of s1. For example, the sentence s1:(E∣\widebarE,{1}) is not acceptable because Π={0} and then any sentence s2:(E∣\widebarE,I), where I⊆[0,1], is a subaltern of s1.
Based on the relations given in Definition 9 we define a square of opposition as follows.
Definition 10**.**
Let sk:(F,Ik), k=1,2,3,4, be four sentences. We call the ordered quadruple (s1,s2,s3,s4) a square of opposition (under coherence), iff the following relations among the four sentences hold:
- (a)
s1* and s2 are contraries, i.e., π(I1)∩π(I2)=∅;*
2. (b)
s3* and s4 are subcontraries, i.e., π(I3)∪π(I4)=Π;*
3. (c)
s1* and s4 are contradictories, i.e., π(I1)∩π(I4)=∅ and
π(I1)∪π(I4)=Π;
*
s2* and s3 are contradictories, i.e., π(I2)∩π(I3)=∅ and
π(I2)∪π(I3)=Π;*
4. (d)
s3* is a subaltern of s1, i.e., π(I1)⊆π(I3);*
s4* is a subaltern of s2, i.e., π(I2)⊆π(I4).*
Figure 1 shows the square of opposition based on Definition 10.
Remark 4**.**
Based on Definition 10, we observe that in order to verify if a quadruple of sentences (s1,s2,s3,s4), where sk:(F,Ik) and k=1,2,3,4, is a square of opposition, it is necessary and sufficient to check that the quadruple (s1′,s2′,s3′,s4′), where sk′=(F,Ik′), Ik′=π(Ik),
is a square of opposition. Then, we say that two squares (s1,s2,s3,s4) and (s1′,s2′,s3′,s4′) coincide iff
π(Ik)=π(Ik′) for each k. Moreover, based on Definition 10, we observe that (s1,s2,s3,s4) is a square of opposition iff (s2,s1,s4,s3) is a square of opposition.
Definition 11**.**
An (ordered) tripartition of a set S is a triple (D1,D2,D3), where D1, D2, and D3 are subsets of S, such that the following conditions are satisfied: (i) Di∩Dj=∅, i=j for all i,j=1,2,3; (ii);
D1∪D2∪D3=S.
Theorem 1**.**
Given any sequence of n conditional events F and
a quadruple (s1,s2,s3,s4) of sentences,
with sk:(F,Ik), k=1,2,3,4. Define D1=π(I1), D2=π(I2), and D3=π(I3)∩π(I4).
Then, the quadruple (s1,s2,s3,s4) is a square of opposition if and only if
(D1,D2,D3) is a tripartition of (the non-empty set) Π such that: π(I3)=D1∪D3, π(I4)=D2∪D3.
Proof.
(⇒).
We assume that D1=π(I1), D2=π(I2), and D3=π(I3)∩π(I4). Of course, Di⊆Π, i=1,2,3. We now prove that: (i) D1∩D2=∅; (ii) D3=Π∖(D1∪D2).
(i) From condition (a) in Definition 10, as s1 and s2 are contraries, it follows that D1∩D2=∅.
(ii) We first prove that D3⊆Π∖(D1∪D2). This trivially follows when D3=∅. If D3=∅, then let x∈D3=π(I3)∩π(I4). As x∈π(I3), from condition (c) in Definition 10, we obtain x∈/π(I2). Likewise, as x∈π(I4), from condition (c) in Definition 10, we obtain x∈/π(I1). Then, x∈Π and x∈/(π(I1)∪π(I2)), that is x∈Π∖(D1∪D2). We now prove that Π∖(D1∪D2)⊆D3.
This trivially follows when Π∖(D1∪D2)=∅. If Π∖(D1∪D2)=∅, let
x∈Π∖(π(I1)∪π(I2)). As x∈Π∖π(I1), from condition (c) in Definition 10, we obtain x∈π(I4). Likewise, as x∈Π∖π(I2) from condition (c) in Definition 10, we obtain x∈π(I3). Then, x∈(π(I3)∩π(I4))=D3. Therefore (D1,D2,D3) is a tripartition of Π.
By our assumption, π(I1)=D1 and π(I2)=D2. We observe that π(I3)∩D3=D3; moreover, from conditions (c) and (d), we obtain π(I3)∩D2=π(I3)∩π(I2)=∅ and π(I3)∩D1=π(I1)∩π(I3)=π(I1)=D1; then
π(I3)=π(I3)∩(D1∪D2∪D3)=D1∪D3. Likewise, we observe that π(I4)∩D3=D3; moreover, from conditions (c),(d) in Definition 10, we obtain D1∩π(I4)=π(I1)∩π(I4)=∅ and
D2∩π(I4)=π(I2)∩π(I4)=π(I2)=D2; then
π(I4)=π(I4)∩(D1∪D2∪D3)=D2∪D3.
(⇐)
Assume that (D1,D2,D3), where D1=π(I1), D2=π(I2), D3=π(I3)∩π(I4), is a tripartition of Π such that D1∪D3=π(I3) and D2∪D3=π(I4), we prove that the quadruple (s1,s2,s3,s4) satisfies conditions (a), (b), (c), and (d) in Definition 10.
We observe that π(I1)∩π(I2)=D1∩D2=∅, which coincides with (a).
Condition (b) is satisfied because π(I3)∪π(I4)=D1∪D3∪D2∪D3=Π. Moreover, π(I1)∩π(I4)=D1∩(D2∪D3)=∅ and
π(I1)∪π(I4)=D1∪(D2∪D3)=Π; likewise, π(I2)∩π(I3)=D2∩(D1∪D3)=∅ and
π(I2)∪π(I3)=D2∪(D1∪D3)=Π. Thus, the conditions in (c) are satisfied. Finally, π(I1)=D1⊆D1∪D3=π(I3) and π(I2)=D2⊆D2∪D3=π(I4) which satisfy conditions in (d).
∎
A method to construct a square of opposition by starting from a tripartition of Π is given in the following result (see also [20]).
Corollary 1**.**
Given any sequence of n conditional events F and a tripartition (D1,D2,D3) of Π, then
the quadruple (s1,s2,s3,s4), with sk:(F,Ik), k=1,2,3,4 and π(I1)=D1, π(I2)=D2, π(I3)=D1∪D3, π(I4)=D2∪D3 is a square of opposition.
Proof.
The proof immediately follows by observing π(I3)∩π(I4)=D3 and by the (⇐) side proof of Theorem 1.
∎
The following result allows to construct a square of opposition by starting from a tripartition of the whole set [0,1]n:
Corollary 2**.**
Given a tripartition
(B1,B2,B3) of [0,1]n, let I1=B1, I2=B2, I3=B1∪B3, and I4=B2∪B3. For any sequence of n conditional events F, the quadruple (s1,s2,s3,s4), where sk:(F,Ik), k=1,2,3,4, is a square of opposition.
Proof.
Let F be any sequence of n conditional events and Π be the associated set of all coherent precise assessments. We set Di=π(Bi), i=1,2,3. Of course,
(π(B1),π(B2),π(B3)) is a tripartition of Π. Moreover, π(I1)=D1, π(I2)=D2, π(I3)=D1∪D3, π(I4)=D2∪D3. Then, by applying Corollary 1 we obtain that (s1,s2,s3,s4) is a square of opposition.
∎
Traditionally the square of opposition can be constructed based on the fragmented square of opposition which requires only the contrariety and contradiction relations (which goes back to Aristotle’s De Interpretatione 6–7, 17b.17–26, see [38, Section 2]).
This result also holds in our framework:
Theorem 2**.**
The quadruple (s1,s2,s3,s4) of sentences,
with sk:(F,Ik), k=1,2,3,4, is a square of opposition iff relations (a) and (c) in Definition 10 are satisfied.
Proof.
(⇒) It follows directly from Definition 10. (⇐) We prove that (d) and (b) in Definition 10 follow from (a) and (c). If π(I1)=∅, then of course π(I1)⊆π(I3). If π(I1)=∅,
let x∈π(I1)⊆Π, from (a) it follows that x∈/π(I2), and since (c) requires π(I2)∪π(I3)=Π, we obtain x∈π(I3). Thus, π(I1)⊆π(I3); likewise, π(I2)⊆π(I4). Therefore, (d) is satisfied. Now we prove that (b) is satisfied, i.e. π(I3)∪π(I4)=Π. Of course, π(I3)∪π(I4)⊆Π. Let x∈Π. If x∈/π(I3), then, x∈π(I2) from (c). Moreover, from (d), x∈π(I4). Then, Π⊆π(I3)∪π(I4). Therefore, (b) is satisfied.
∎
Corollary 3**.**
The quadruple (s1,s2,s3,s4) of sentences,
with sk:(F,Ik), k=1,2,3,4, is a square of opposition if and only if (s1,s2,s3,s4)=(s1,s2,\widebars2,\widebars1)
with
s1 and s2 being contraries.
Proof.
Of course, if (s1,s2,s3,s4) is a square of opposition, then s1 and s2 are contraries.
Moreover, s1 and s4 are contradictories, that is: π(I1)∩π(I4)=∅ and
π(I1)∪π(I4)=Π. Therefore, Π∖π(I4)=π(I1), which amounts to s4=\widebars1. Similary, as s2 and s3 are contradictories, it holds that s3=\widebars2.
Conversely, assume that s1 and s2 are contraries.
By instantiating Theorem 2 with s3=\widebars2 and with s4=\widebars1, it follows that the quadruple (s1,s2,\widebars2,\widebars1) is a square of opposition.
∎
4 Square of Opposition and Generalized Quantifiers
Let F be a conditional event P∣S (where S=⊥) and (B1(x),B2(x),B3(x)) be a tripartition of [0,1], where B1(x)=[x,1], B2(x)=[0,1−x], B3(x)=]1−x,x[ and x∈]21,1] (see Figure 2).
Consider the quadruple of sentences (A(x),E(x),I(x),O(x)), with A(x):(P∣S,IA(x)), E(x):(P∣S,IE(x)), I(x):(P∣S,II(x)), O(x):(P∣S,IO(x)), where
IA(x)=B1(x)=[x,1],
IE(x)=B2(x)=[0,1−x],
II(x)=B1(x)∪B3(x)=]1−x,1], and IO(x)=B2(x)∪B3(x)=[0,x[.
By applying Corollary 2 with (s1,s2,s3,s4)=(A(x),E(x),I(x),O(x)), it follows that (A(x),E(x),I(x),O(x)) is a square of opposition for any x∈]21,1] (see Figure 3).
We recall that in presence of some logical relations between P and S the set Π could be a strict subset of [0,1]. In particular, we have the following three cases (see, [29, 30]):
(i) if P∧S=⊥ and P∧S=S, then Π=[0,1]; (ii) if P∧S=S, then Π={1};
(iii) if P∧S=⊥, then Π={0}.
The quadruple (A(x),E(x),I(x),O(x)), with the threshold 21<x≤1, is a square of opposition in each of the three cases.
In particular we obtain:
case (i)
π(IA(x))=IA(x), π(IE(x))=IE(x),π(II(x))=II(x), and π(IO(x))=IO(x);
case (ii):
π(IA(x))={1}, π(IE(x))=∅,π(II(x))={1}, and π(IO(x))=∅;
case (iii):
π(IA(x))=∅, π(IE(x))={1},π(II(x))=∅, and π(IO(x))={1}.
We note that in cases (ii) and (iii) we obtain degenerated squares each, where—apart from the contradictory relations—all relations are strengthened. Specifically,
both contrary and the subcontrary become contradictory relations. Moreover, both subalternation relations become symmetric. As by coherence p(P∣S)+p(\widebarP∣S)=1, a sentence s:(P∣S,I) is equivalent to the sentence s′:(\widebarP∣S,\widebarI), where \widebarI=[0,1]∖I. Table 1
presents generalization of basic sentence types A(x), E(x), I(x), and O(x) involving generalized quantifiers Q. The generalized quantifiers are defined on a threshold x>21. The value of the threshold may be context dependent and provides lots of flexibility for modeling various instances of generalized quantifiers (like “most”, “almost all”).
Given two thresholds x1 and x2, with 21<x2<x1≤1, we analyze the relations among the same sentence types in the two squares of opposition S(x1) and S(x2), with S(xi)=(A(xi),E(xi),I(xi),O(xi)), i=1,2.
It can be easily proved that:
A(x2) is a subaltern of A(x1),
E(x2) is a subaltern of E(x1),
I(x1) is a subaltern of I(x2), and
O(x1) is a subaltern of O(x2). In the extreme case x=1 we obtain the probabilistic interpretation under coherence of the basic sentence types involved in the traditional square of opposition (A,E,I,O) (see [27, 28] for the default square of opposition, involving defaults and negated defaults).
In agreement with De Morgan (as pointed out by [20]) by the quadruple (a,e,i,o) we denotes the square of opposition obtained from (A,E,I,O) when the events P and S are replaced by \widebarP and \widebarS, respectively. Specifically, a:(\widebarP∣\widebarS,{1}), e:(\widebarP∣\widebarS,{0}), i:(\widebarP∣\widebarS,]0,1]), and o:(\widebarP∣\widebarS,[0,1[).
In the general case when P and S are logically independent it can be proved that the set of all coherent assessments on (P∣S,\widebarP∣\widebarS) is the square [0,1]2 (see e.g. [17]; see also [14, Proposition 1] [15, Theorem 4]). Thus, in the general case there are no relations between any two sentences s1 and s2, where s1∈{A,E,I,O} and s2∈{a,e,i,o}. Therefore, the two squares (A,E,I,O) and (a,e,i,o) do not form a cube of opposition (with these two squares as opposite facing sides).
5 Hexagon of Opposition
Compared to the millennia long history of investigations on the square of opposition, the hexagon of opposition was discovered fairly recently, namely in the 1950ies. The hexagon generalizes the square by adding the disjunction of the top vertices of the square to build a new vertex at the top and by adding the conjunction of the bottom vertices of the square to build a new vertex at the bottom.
According to Béziau ([2]), the hexagon of opposition was introduced by the French priest and logician Augustin Sesmat ([48]) and by the philosopher Robert Blanché ([7]), who worked out the full structure of the hexagon of opposition (for his main work on the hexagon of opposition see [8]). Jaspers and Seuren ([33]) trace the history of the hexagon back also to the American philosopher Paul Jacoby ([32], see also [20]).
In this section we will use the tools developed in Section 3, to construct a hexagon of opposition by starting from a square of opposition. More precisely, given a traditional square of opposition (A,E,I,O), by setting U=A∨E, Y=I∧O, the tuple (A,E,I,O,U,Y) defines a hexagon of opposition. Accordingly, we define the (probabilistic) hexagon of opposition in our approach as follows:
Definition 12** (Hexagon of opposition).**
Let sk:(F,Ik), k=1,2,3,4,5,6, be six sentences. We call the ordered tuple (s1,s2,s3,s4,s5,s6) a hexagon of opposition (under coherence), if and only if the following relations among the six sentences hold:
- (i)
(s1,s2,s3,s4)* is a square of opposition;*
2. (ii)
s5=s1∨s2;
3. (iii)
s6=s3∧s4.
Figure 4 shows the probabilistic hexagon of opposition as given by Definition 12.
Theorem 3**.**
Let sk:(F,Ik), k=1,2,3,4,5,6, be six sentences. The tuple (s1,s2,s3,s4,s5,s6) is a hexagon of opposition, if and only if
(s1,s2,s3,s4,s5,s6)=(s1,s2,\widebars2,\widebars1,s1∨s2,\widebars1∧\widebars2), with s1 and s2 being contraries.
Proof.
(⇒). Let (s1,s2,s3,s4,s5,s6) be a hexagon of opposition. Then, as (s1,s2,s3,s4) is a square of opposition, s1 and s2 are contraries. Moreover, by Corollary 3, it follows that (s1,s2,s3,s4)=(s1,s2,\widebars2,\widebars1). Then, by Definition 12, s5=s1∨s2 and s6=s3∧s4=\widebars1∧\widebars2. Therefore, (s1,s2,s3,s4,s5,s6)=(s1,s2,\widebars2,\widebars1,s1∨s2,\widebars1∧\widebars2).
(⇐).
Let (s1,s2,s3,s4,s5,s6)=(s1,s2,\widebars2,\widebars1,s1∨s2,\widebars1∧\widebars2), with s1 and s2 being contraries.
From Corollary 3, it follows that (s1,s2,s3,s4) is a square of opposition. Then, by relations (ii) and (iii) in Definition 12, it follows that (s1,s2,s3,s4,s5,s6) is a hexagon of opposition.
∎
Remark 5**.**
Assume that s1 and s2 are contraries. Then,
by Corollary 3, the
quadruple (s1,s2,\widebars2,\widebars1) is a square of opposition, and by Definition 12, the
tuple (s1,s2,\widebars2,\widebars1,s1∨s2,\widebars1∧\widebars2) is a hexagon of opposition.
We now consider relations among a tripartition of the set of all coherent assessments Π and a hexagon of opposition.
Remark 6**.**
Given a hexagon of opposition (s1,s2,s3,s4,s5,s6),
we observe that the sentence s6=s3∧s4 represents the pair (F,I6), where I6=I3∩I4. Moreover, by Remark 1, π(I6)=π(I3∩I4)=π(I3)∩π(I4).
Therefore, based on Theorem 1, the triple
(D1,D2,D3), where
D1=π(I1), D2=π(I2), and D3=π(I6), is a tripartition of Π. Conversely, based on Corollary 1, given a tripartition (D1,D2,D3) of Π, the sequence (s1,s2,s3,s4,s5,s6) where
sk:(F,Ik), k=1,…,6, with π(I1)=D1, π(I2)=D2, π(I3)=D1∪D3, π(I4)=D2∪D3,
π(I5)=D1∪D2, and π(I6)=D3, is a hexagon of opposition (see also [11, 20, 21]).
Next, we consider relations among a tripartition of [0,1]n and a hexagon of opposition.
Remark 7**.**
Based on Corollary 2, we can also construct a hexagon of opposition by starting from a tripartition of the whole set [0,1]n. Specifically, given a tripartition (B1,B2,B3) of [0,1]n, let I1=B1, I2=B2, I3=B1∪B3, I4=B2∪B3,
I5=B1∪B2, and
I6=B3.
For any sequence of n conditional events F, the tuple (s1,s2,s3,s4,s5,s6), where sk:(F,Ik), k=1,…,6, is a hexagon of opposition.
Theorem 4**.**
Given a hexagon of opposition (s1,s2,s3,s4,s5,s6), by Definition 12 all relations among the basic sentence types in the square (s1,s2,s3,s4) hold. Moreover, by Theorem 3 (and also by Remark 3), the following relations hold:
- (i)
s1* and s6
are contraries
(since s6=\widebars2∧\widebars1 and π(I1∩\widebarI2∩\widebarI1)=∅);*
2. (ii)
s2* and s6 are contraries (since s6=\widebars2∧\widebars1 and π(I2∩\widebarI2∩\widebarI1)=∅);*
3. (iii)
s3* is a subaltern of s6 (since s6=s3∧s4);*
4. (iv)
s4* is a subaltern of s6 (since s6=s3∧s4);*
5. (v)
s5* is a subaltern of s1
(since s5=s1∨s2);*
6. (vi)
s5* is a subaltern of s2 (since s5=s1∨s2);*
7. (vii)
s5* and s3 are subcontraries (as s5=s1∨s2 and s3=\widebars2, hence
π(I1∪I2)∪\widebarI2)=Π);*
8. (viii)
s5* and s4 are subcontraries
(as s5=s1∨s2 and s4=\widebars1, hence
π(I1∪I2)∪\widebarI1)=Π);*
9. (ix)
s5* and s6 are contradictories (as s5=s1∨s2, s6=s3∧s4=\widebars2∧\widebars1, hence
π((I1∪I2)∩(\widebarI1∩\widebarI2))=∅) and
π((I1∪I2)∪(\widebarI1∩\widebarI2))=Π).*
Figure 4 illustrates all the relations in the hexagon of opposition described in Theorem 4. This figure also shows the two triangles T1:(s1,s2,s6) and T2:(s3,s4,s5). We note that the sides of T1 consist of contrary relations, whereas the sides of T2 consist of subcontrary relations. Moreover,
the coherent part of the imprecise assessments defined by sentences in T1
(i.e., D1=π(I1), D2=π(I2) and
D3=π(I6))
forms a tripartition (D1,D2,D3) of Π. Whereas, the
imprecise assessments defined by sentences in T2 are such that
π(I3)=D1∪D3,
π(I4)=D2∪D3, and
π(I5)=D1∪D2.
By basing the hexagon of opposition on the square of opposition (A(x),E(x),I(x),O(x)) (as introduced in Section 4) we obtain the following hexagon of opposition: (A(x),E(x),I(x),O(x),U(x),Y(x)) with x∈]1/2,1], where U(x) denotes A(x)∨E(x) and Y(x) denotes I(x)∧O(x) (see Table 2). Figure 5 illustrates the hexagon (A(x),E(x),I(x),O(x),U(x),Y(x)) with x∈]1/2,1].
We now consider a generalization of the hexagon of opposition (A(x),E(x),I(x),O(x),U(x),Y(x)) by considering n conditional events. In particular, let F=(P1∣S1,…,Pn∣Sn) be a sequence of n conditional events. Exploiting Remark 7, we construct a hexagon of opposition by considering the following tripartition of [0,1]n: (B1(x),B2(x),B3(x)), with x∈]1/2,1], where
[TABLE]
We obtain the following (generalized) hexagon of opposition (A(x),E(x),I(x),O(x),U(x),Y(x)), with the quantified statements A(x):(F,IA(x)), E(x):(F,IE(x)), I(x):(F,II(x)), O(x):(F,IO(x)), U(x):(F,IU(x)), Y(x):(F,IY(x)),
where
[TABLE]
6 Concluding Remarks
Finally, we note that conditional probability interpretations of quantified statements were also proposed in psychology (see, e.g., [12, 13, 37, 39, 41, 43, 46]), since
generalized quantifiers are psychologically much more plausible compared to
the traditional logical quantifiers, as the latter are either too strict (∀ does not allow for exceptions) or too weak (∃ quantifies over at least one object) for formalizing everyday life sentences. Recent experimental data suggests that people negate conditionals and quantified statements mainly by building contraries (in the sense of inferring p(¬C∣A)=1−x from the negated p(C∣A)=x) but hardly ever by building contradictories (in the sense of inferring p(C∣A)<x from the negated p(C∣A)=x; see [40, 46]). However, this empirical result calls for further experiments. The square presented in Section 4 and the hexagon presented in Section 5 can serve as a new rationality framework for formal-normative and psychological investigations of basic relations among quantified statements.
Acknowledgement
We thank Deutsche Forschungsgemeinschaft (DFG), Fondation Maison des Sciences de l’Homme (FMSH), and Villa Vigoni for supporting joint meetings at Villa Vigoni where parts of this work originated (Project: “Human Rationality: Probabilistic Points of View”). Niki Pfeifer is supported by his DFG project PF 740/2-2 (within the SPP1516 “New Frameworks of Rationality”). Giuseppe Sanfilippo is supported by the INdAM–GNAMPA Project (2016 Grant U 2016/000391).