Aristotelian assertoric syllogistic
Mohamed A. Amer

TL;DR
This paper explores various modern approaches to Aristotelian assertoric syllogistic, analyzing their soundness, completeness, and decidability, and introduces simplified proofs that clarify longstanding logical issues.
Contribution
It systematically discusses and interrelates different semantic and syntactic approaches to syllogistic logic, providing simple proofs and insights into Leibniz's arithmetization.
Findings
Different facets of soundness, completeness, and decidability are examined.
Simplified proofs are provided without maximal or minimal conditions.
Deciphering Leibniz characteristic numbers becomes possible.
Abstract
Aristotelian assertoric syllogistic, which is currently of growing interest, has attracted the attention of the founders of modern logic, who approached it in several (semantical and syntactical) ways. Further approaches were introduced later on. These approaches (with few exceptions) are here discussed, developed and interrelated. Among other things, different facets of soundness, completeness, decidability and independence are investigated. Specifically arithmetization (Leibniz), algebraization (Leibniz and Boole), and Venn models (Euler and Venn) are closely examined. All proofs are simple. In particular there is no recourse to maximal nor minimal conditions (with only one, dispensable, exception), which makes the long awaited deciphering of the enigmatic Leibniz characteristic numbers possible. The problem was how to look at matters from the right perspective.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, Reasoning, and Knowledge · Advanced Algebra and Logic · Logic, programming, and type systems
ARISTOTELIAN ASSERTORIC SYLLOGISTIC1112010 Mathematics Subject Classification. 03A05, 03-03 (primary); 01A20, 01A45, 01A55, 01A60 (secondary). Key words and phrases. axiomatization, natural deduction, structures, models, order models, arithmetization, Venn models, soundness, completeness, decidability, sorites, independence, algebraization, inadequacy.
MOHAMED A. AMER
To Raouf Doss Who introduced modern logic to Egypt
Aristotelian assertoric syllogistic, which is currently of growing interest, has attracted the attention of the founders of modern logic, who approached it in several (semantical and syntactical) ways. Further approaches were introduced later on. These approaches (with few exceptions) are here discussed, developed and interrelated.
Among other things, different facets of soundness, completeness, decidability and independence are investigated. Specifically arithmetization (Leibniz), algebraization (Leibniz and Boole), and Venn models (Euler and Venn) are closely examined. All proofs are simple. In particular there is no recourse to maximal nor minimal conditions (with only one, dispensable, exception), which makes the long awaited deciphering of the enigmatic Leibniz characteristic numbers possible. The problem was how to look at matters from the right perspective.
**Introduction. **Aristotelian assertoric syllogistic (henceforth AAS), which is currently of growing interest (Glashoff 2005), has attracted the attention of the founders of modern logic. Leibniz, Boole, De Morgan, Venn, Peirce, Frege, Hilbert, Russell and Gödel all dealt with it. For some of them it was the starting point (cf. Boole 1948).
Modern treatment of AAS started closer to what may be currently called the semantical or model theoretic approach. This was threefold: arthimetical, algebraic, and diagramatic (or set theoretic). The first trend was developed by Leibniz (Łukasiewicz 1998, pp. 126-9; Kneale and Kneale 1966, pp. 337-8; Glashoff 2002; and Sotirov 2015). The second was developed by Leibniz (Kneale and Kneale 1966, pp. 338-45; Lenzen 2004) and after about two centuries was again developed by Boole (1948), without mentioning the work of Leibniz. The last trend was developed by Euler, then by Venn (Venn 1880).
With the rise of proof theory late nineteenth century, six syntactical formalizations of AAS were developed:
(i) Monadic first order formalization which goes back to Frege (1967, on p.28 the square of logical opposition may be found). This formalization is adopted by Hilbert and Ackermann (1950, pp. 44-54).
(ii) Sentential formalization which goes back to Peirce (Bellucci and Pietarinen 2016, p. 226) and is adopted by Gödel (Adzic and Dosen 2016, p. 479). The most elaborate study of this formalization is that of Łukasiewicz (1998).
(iii) Dyadic first order formalization which goes back to Shepherdson (1956). The novel idea of regarding categorical sentences (or propositions) as binary relational sentences (or propositions) is due to De Morgan (Valencia 2004, pp. 506-7).
It is worthwile to note here that, according to Bocheński (1968, pp. 68-70), Aristotle dealt with the logic of relations among other topics which Bocheński (1968) puts (p. 63) collectively under the title “Non-analytical laws and rules”, to be distinguished from syllogisms such as those considered in this article, which Bocheński (1968) terms (p. 42) “analytic”.
(iv) Natural deduction formalization which goes back to Corcoran (1972) and Smiley (1973).
(v) First order many-sorted formalization which goes back to Smiley (1962).
(vi) A recent formalization based on Hilbert’s epsilon and tau quantifiers (Pasquali and Retoré 2016).
All of the above will be considered below with only two exceptions. The first is the many-sorted formalization ((v) above), for it is a variant of the monadic first order formalization mentioned in (i) above; moreover it was, apparently, abandoned even by its own author (cf. Smiley 1973). The formalization based on Hilbert’s epsilon and tau quantifiers ((vi) above) will take us far off the current mainstream of logic. So it will be the second exception and will not be further considered here, though it may have intrinsic merit especially for those who are interested in formalizing natural languages.
With only one exception, the modern syntactical formalizations of AAS degraded it to the rank of a secondary logic, a subordinate or a subsidiary sublogic of a superior fundamental or principal primary logic. In contrast, the natural deduction formalization ((iv) above, cf. Bocheński (1968, pp. 3, 31, 42, 49, 52)) rehabilitates it to a full-fledged primary logic, as was probably designed by its founder: Aristotle, and as was taken for granted for over two millennia. Accordingly, this formalization will be the focus of this article. Through completeness we shall see that as far as the basic sentences (to be defined in 1.6 below) are concerned other formalizations add nothing new.
In the sequel we deal -from modern standpoints- with AAS, not with medieval nor traditional syllogistic. In contrast to Boole (1948), Glashoff (2007), Hilbert and Ackermann (1950), Russinoff (1999), Shepherdson (1956) and Sotirov (1999), term negation (or complementation, to use a modern term; cf. Bocheński (1968, p. 50)) is not here permitted. Also, in contrast to Hilbert and Ackermann (1950), Łukasiewicz (1998), Shumann (2006), Shepherdson (1956) and Sotirov (1999), Boolean combinations of categorical sentences are not here permitted. So (the extensional aspect of) AAS will be just the logic of, or the fragment of set theory which deals with, inclusion (universal affirmative sentences) and exclusion (universal negative sentences) and their contradictories (particular sentences). However, some exceptions may be appropriate as will be clear, or clarified, at the proper places.
Among other things, different facets of soundness, completeness, decidability, and independence are investigated. Particularly arithmetization (Leibniz), algebraization (Leibniz and Boole) and Venn models (Euler and Venn) are closely examined.
All proofs given here are simple. In contrast to Corcoran (1972), Glashoff (2010), Martin (1997), Shepherdson (1956), Smiley (1973) and Smith (1983), our proofs have no recourse to maximal nor minimal principles nor conditions (with only one exception, which is indirect and may be dispensed with). This makes the long awaited deciphering of the enigmatic Leibniz characteristic numbers possible. The problem was how to look at matters from the right perspective.
To specify, in section 3 below we provide a polynomial time algorithm to decide for any finite set of categorical sentences whether it is consistent and, if it is, to assign a Leibniz model (to be defined below) to it. This settles positively problem 2 of Glashoff (2002) for finite sets. The general case is discussed in section 2.
I hope that the simplicity of this exposition of AAS will help to re-incorporate it into the mainstream of mathematical logic.
After this introduction, the structure of the rest of the article is as follows:
-
Formalizations of AAS
-
Semantics of AAS
-
Decidability
-
Basic equivalence of the four formalizations
-
Venn soundness and completeness
-
Direct way to Venn models
-
Variations on
-
Direct completion of direct deduction
-
Models of revisited
-
Decidability revisited
-
Sorites
-
Independence
-
Algebraic semantics of AAS, a prelude
-
Algebraic interpretation of
-
Annihilators: Embedding the partial into a total
-
Back to algebraic interpretation
-
Leibniz and Boole
-
Inadequacy: bounds of AAS
Acknowledgements
Appendix
**1. Formalizations of AAS. **Formalizations of AAS differ with regard to permitting the subject and the predicate of a formal symbolic categorical sentence (henceforth categorical sentence) to be the same. Smith (1983) and Glashoff (2010) follow Corcoran (1972) in not permitting sameness; as accommodating sameness “would entail rather more deviation from the Aristotelian text” says Corcoran (1972, p. 696).
On the other hand Smiley (1973) left the door open for permitting sameness, noting (p. 144) that “the variables he [Aristotle] uses for the major, middle and minor terms are all distinct from one another […]; though when it comes to substituting actual terms in the resulting forms we are of course at liberty to replace different variables by the same term (64a1).”. Consequently, it seems that Aristotle excluded sameness, for technical -not philosophical- reasons. This is, possibly, why Łukasiewicz (1998) adopted sameness (see pp. 77, 88); while Martin (1997) simultaneously considered two systems, one of them is permitting sameness and the other is not.
In conformity with the current mainstream of mathematical logic, sameness is here permitted. Excluding sameness, and other variations, will be considered in section 7 below.
1.1. Monadic first order formalization of AAS. The languge here is a standard first order language, with or without equality, whose set P of non-logical constants has at least three elements, and all of its elements are unary relational symbols. In the sequel “”, “” and “” will be metalinguistic variables ranging over the elements of P. With abuse of notation, “P” will denote this language too.
ABBREVIATIONS 1.1.
“” is an abbreviation for “”
“” is an abbreviation for “”
“” is an abbreviation for “”
“” is an abbreviation for “”
DEFINITION 1.2. (P) is the theory based on P with only one non -logical axiom schema, namely, (which is equivalent to the schema ).
PROPOSITION 1.3. The following are theorem schemata of P:
-
2.
-
4.
-
6.
-
Proof. Routine.
1.2. Sentential formalization of AAS. The symbols “”, “”, “” and “” were made use of in section 1.1, in this section they will be made use of differently. This abuse of notation is benign as long as the intended denotation is clear from the context, so it will be here permitted. Such abuses of notation may be permitted later on without further notice.
Let be a set (whose elements are to correspond to categorical constants) having at least three elements, let and be four injective functions of pairwise disjoint ranges, each of domain , and let be the union of their ranges. In the sequel “”, “” and “” will be metalinguistic variables ranging over the elements of .
The language here is a standard sentential language whose set of sentential symbols is . With abuse of notation “” will denote this language too.
DEFINITION 1.4. is the theory based on with the following non - logical axiom schemata:
-
Ei 2.
-
4.
-
Ei 6.
-
*Ei
The proof machinery is modus ponens together with any standard set of sentential logical axiom schemata.
REMARK 1.5. There are two kinds of substitution: sentences for sentences and indices for indices. Each may be permitted, under some conditions, as a derived rule of inference (cf. Łukasiewiez 1998, p. 88; see section 1.5 below).
**1.3. Dyadic first order formalization of AAS. **The language here is a standard first order language, with or without, equality whose non-logical constants are four binary relation symbols “”,“”,“” and “”, together with a set of individual (or categorical) constants having at least three elements. In the sequel “”, “”, and “” will be metalinguistic variables ranging over the elements of . With abuse of notation “” will denote this language too.
DEFINITION 1.6. is the theory based on whose non-logical axioms are the universal closures of:
-
2.
-
Axx\qquad\qquad\qquad\quad\leavevmode\nobreak\ 4.
-
6.
-
1.4. Natural deduction formalization of AAS. The language here is a sublanguage of the language defined in 1.3. The alphabet is the four binary relation symbols and , together with a set of individual (or categorical) constants having at least three elements. The sentences are the equality free atomic sentences of 1.3., viz. a sentence is a string where and . By abuse of notation “” will denote this language too, and the set of all sentences will be denoted by “”. In the sequel “”, “”, “”, “”, “” and “” will be metalinguistic variables ranging over the elements of .
Sentences starting with or are called universal, those starting with or are called particular. Also, sentences starting with or are called affirmative, those starting with or are called negative. For , sentences starting with are called -sentences.
DEFINITION 1.7. is the logical system based on the language with the following deduction rules (or enrichments thereof, see sections 8 and 11 below):
-
- 1.
-
3. (Barbara)
-
(Celarent).
“Barbara” and “Celarent” are, respectively, the medieval names of the rules 3 and 4; “-, “” and “” are, respectively, abbreviations for “A-identity”, “A-partial conversion” and “E-conversion”. For simplicity, we may write “rules” instead of “deduction rules”.
DEFINITION 1.8. A direct deduction (or d-deduction) of from is a sequence such that and for each or is the consequent of some rule of whose antecedents are previous terms of the sequence. In this case we write , and is said to be a direct consequence (or theorem) of . Also is said to be a direct (or -) deduction from . From now on, the rules 0-4 given above will be called also “-rules”.
Regarding the current mainstream of mathematical logic, this definition is a typical definition. In contrast, corresponding definitions given in Corcoran (1972), Glashoff (2010), Martin (1997), Smiley (1973) and Smith (1983) are atypical, each has its own peculiarity.
To get closer to the Aristotelian tradition, a more restricted definition of direct deduction is presented in section 11 below, and its relationship to the above one is investigated there.
As usual, the contradictory of is defined as follows:
so .
A set is said to be -inconsistent (or -contradictory) if and , for some ; otherwise is said to be -consistent.
DEFINITION 1.9. The general (or -) deduction relation is defined as follows:
iff is -inconsistent.
For , “ is -inconsistent (or -contradictory)” and “ is -consistent” may be defined along the above lines, replacing “” by “”. Obviously , so if is -inconsistent, it is -inconsistent.
For and , “” will denote the closure of under , i.e.
= the smallest such that and for every , whenever .
LEMMA 1.10. Let be subsets of such that for every . For every -deduction from , there are a , a -deduction from and a strictly increasing function such that and for every , . Hence for every , whenever .
Proof. By induction on , the number of times of making use in of assumptions from .
Basis: ; take and the identity function on .
Induction step: assume the required for . Let and let be the last line in which an assumption from is made use of. The case is easier than the case , so we shall deal only with the latter.
By the induction hypothesis, there are a , a deduction , from and a strictly increasing function such that and for every .
Let be a deduction of from . Put:
for ,
where “” is the concatenation operation symbol.
Evidently . The completion of the proof is now easy.
Parts 3 and 4 of the next proposition are, respectively, reformulations of lemmata M1 and M2 of Corcoran (1972).
PROPOSITION 1.11. Let , , and , then:
-
.
-
If and , then .
-
If is -consistent, then iff .
-
is -consistent iff it is -consistent.
-
If and , then .
-
, hence .
-
; hence, for every -rule , if each antecedent of belongs to , then so also does its consequent.
*Proof.
-
The required is a corollary of the above lemma.
-
Generalizing upon the metalinguistic variable “”, the resulting sentence may be proved by course of values induction on the length of the -deduction from , noticing that is not a premise of any rule of .
-
Let , then for some , , . So, by part 2, if is -consistent then , then . So, by part 2 again, . The other direction is obvious.
-
Let be -consistent and -inconsistent, then there is a universal such that , then by part 3, . Also there is such that , so, by lemma 1.10., , hence is -inconsistent. Consequently, if is -inconsistent it is -inconsistent. The other direction is obvious.
-
Obvious if is -inconsistent, so let be -consistent and let and . There is such that . If is universal, then by part 3 and lemma 1.10, . Also, if is particular and , then by part 2, . In both cases . In the remaining case must be particular and , then by part 2, . But there is such that , then , hence , which completes the proof.
-
By induction, part 5 may be generalized to: for every finite , whenever . From this it readily follows that whenever , hence the result.
-
By part 1 and lemma 1.10, , and by part 6, . To prove the last clause, let be a -rule. If each antecedent of belongs to , then its consequent belongs to .
In view of part 4 of the above proposition, for , the prefix “-” may be deleted from “-consistent”, “-inconsistent” and “-contradictory”.
1.5. Equality / equivalence. For , and , is equivalent to each of:
-
iff all ,
-
iff all .
Thus, , imply the substitutability of for each other in universal positive sentences. This will be generalized below to all universal sentences, respectively all sentences, for , respectively . The last generalization is the essence of equality (congruence or equivalence, depending on the situation). It holds, in the respective appropriate forms, for the other formalizations as is shown in the following:
THEOREM 1.12.
- Let P, then:
for every form of P, where is a form obtained from by substituting some occurrences of “” in by “”, or vice versa.
- Let , then:
for every sentence of , where is a sentence obtained from by substituting some occurrences of “” in by “”, or vice versa.
- Let , then:
for every form of , where is a form obtained from by substituting some occurrences of “” in by “”, or vice versa; provided -for languages with equality- no substitution takes place in a form or a subform of the form , where and are terms.
- Let , and , and let , , then for all :
iff and iff , where for respectively.
Proof. The first three parts may be proved by the standard methods developed in the respective formal systems.
For the last part, part 7 of proposition 1.11 secures the required for and .
It remains to consider the cases where and . If is inconsistent the required follows by the definition of , so let be consistent. Assume , and , then by part 3 of proposition 1.11, , , hence , . But there is such that , , consequently , and, by definition, . The other cases are similar or easier.
1.6. Basic sentences. In each of the four formalizations P, , and the sentences to be made use of in the Aristotelian syllogistic will be called basic (or categorical) sentences. The sets of basic sentences will be denoted, respectively, by “P”, “”, “” and “”. That is:
and P.
(= the set of all atomic sentences of ).
the set of all (equality free) atomic sentences of .
().
1.7. Interpretation. Let , with abuse of notation (no confusion will ensue) we define another function by , for and . As usual, for , the image of under is denoted by “”; also we may write “”, “” for “”, “” respectively. The function is said to be an interpretation of in . Similarly P, and () may be interpreted in each other.
PROPOSITION 1.13. Let , let and be interpretations of in and P respectively, and let , , P. Then:
-
,
-
whenever .
Proof.
-
Easy.
-
By proposition 1.3 for P, and by the definitions for the other cases.
PROPOSITION 1.14. Let P, let be an interpretation of in the set of basic sentences of , and let . Then:
whenever .
Proof. The interpretations of the axioms of are theorems of , and the proof machinery of is not weaker than that of .
To investigate the converses of proposition 1.14 and part 2 of proposition 1.13, we first go to:
**2. Semantics of AAS. **The theories and are first order, and the theory is sentential; so each has its usual class of models with respect to which it is sound and complete.
2.1. Models of P. A model of P is an ordered pair where is a non-empty set and maps P into . is called the universe, or the base, of and may be denoted also by “|$$\mathfrak{B}$$|”.
2.2. Models of . A model of is a mapping form into 2 (), which satisfies all the axioms of . For , means that takes the value 1 under the usual extension of .
2.3. Models of . A structure of the dyadic language (or a -structure ) is a 6-tuple where is a non-empty set and and are binary relations on corresponding to the relation symbols “”,“”,“” and “” respectively, and is a mapping of into . is called the universe, or the base, of and may be denoted also by “|$$\mathfrak{B}$$|”. is a model of if it satisfies its axioms.
Since, by axioms 1 and 2, and (where “” denotes the complement with respect to ) we may -by abuse of notation- say that is a model of whenever the expansion is a model of .
PROPOSITION 2.1. Let , and . Then is a model of iff:
-
is reflexive and transitive (i.e. is a pre-ordering on ),
-
is symmetric,
-
,
-
, where is the relative product of and .
Proof.** ** satisfies axioms 1-6 iff conditions 1-3 above are satisfied.
Axiom 7 is equivalent to . So axiom 7 is satisfied iff which, in the presence of condition 2, is equivalent to condition 4.
2.4. Models of. The structures in which may be interpreted (henceforth -structures) are exactly the -structures. For we write to mean that whenever , where is an -structure and , are defined as usual.
Two -structures , are said to be -equivalent, basically equivalent, or (for short) -eq if for every , iff ; in this case we may say also that is -eq to . This notion may be extended in an obvious way to the other formalization of .
DEFINITION 2.2. An -structure is said to be a direct model (or, for short, a -model) if for every , whenever .
The proof of the following is straightforward.
PROPOSITION 2.3. An -structure is a -model iff all of the rules of inference of are valid in it (in the sense that if the antecedents are true in it, then so also is the consequent), iff:
-
is reflexive on and transitive (equivalently, is a pre-ordering on ),
-
,
-
is symmetric,
-
,
where, for a set , ); and for a binary relation , is its converse.
DEFINITION 2.4. The canonical structure corresponding to ) is the -structure satisfying:
-
,
-
, where for a set , is the identity function on ,
-
for every , .
An -structure is said to be canonical if it is equal to , for some .
The basic property of is:
iff all .
Every -structure in which is an extension of a canonical structure; namely the canonical structure corresponding to .
LEMMA 2.5. For , is a -model (of hence of ).
Proof.** **Let and let . If then , hence , consequently .
THEOREM 2.6. (Direct soundness and completeness). Direct deduction is sound and complete with respect to the class of all direct models. That is, for every ,
[TABLE]
where “” means that for every -model .
Proof.** **Soundness is immediate by the definition. To prove completeness assume then, in particular, . But , then , consequently , hence .
DEFINITION 2.7. (General models). An -structure is said to be a general model (or, for short, a -model) if for every , whenever .
LEMMA 2.8. For , is a -model (of hence of ).
Proof.** **Replace “” by “” in the proof of lemma 2.5.
THEOREM 2.9. (General soundness and completeness). General deduction is sound and complete with respect to the class of all general models. That is, for every ,
[TABLE]
where “” means that for every -model .
Proof.** **Replace “” by “” in the proof of lemma 2.6.
THEOREM 2.10. (-compactness). For every , for every ,
[TABLE]
Proof.** **By -soundness and -completeness.
In theorem 9.4 below the -models will be fully characterized. Now we confine ourselves to the following:
REMARKS and definitions 2.11.
-
Every -model is a -model (obvious) but not vice versa. For, let for some , then is a -model but not a -model.
-
Every model of is obviously a -model (hence a -model) but not vice versa. For let for some , then is a -model but not a model of .
-
For every -model in which is surjective, iff ( or ) iff for some , . Such models are called full models.
If respectively , is said to be complete (respectively consistent). Thus is not full iff it is consistent. is an example of a -model in which is bijective, while it is not complete.
These notions may be generalized to all -structures, does not have to be surjective.
-
Let be an -structure in which is surjective. Then is not complete iff for some , ( and ) iff for some , and for all , .
-
For , is consistent iff it has a consistent -model.
-
Direct deduction is sound with respect to any class of models with respect to which general deduction is sound.
2.5. Order models and Venn models. To the best of my knowledge Shepherdson (1956) was the first to make use of a version of order models; the ordering was pre-ordering (reflexive and transitive, but not necessarily antisymmetric) and the context was the semantics of a version of . In the context of the semantics of , or versions thereof, versions of order models were made use of in Martin (1997) and in Glashoff (2002). The former required a model to be some variation on a lower semi-lattice with a smallest element, the latter relaxed these conditions; none of them mentioned that Shepherdson (1956) made use of order models.
Following Shepherdson (1956), let be a pre-ordering on a non-empty set ; and following Glashoff (2002), put:
[TABLE]
Let be a function from to , then is a model of , hence a -model and a -model. Such models are said to be order models. If is a partial ordering (equivalently, antisymmetric) the order model will also be called partial (or antisymmetric). is said to be the order structure underlying the order model . Notice that if is a model of , then .
A concrete order model (henceforth c.o.m, and c.o.ms for the plural) is an order model in which is a collection of non-empty sets and is , so the c.o.ms are partial. If is defined on such a by iff , then for every is a model of . Such models are said to be Venn models.
In an order model , is determined by and , so we may write -for short- “”. For similar reasons we may write “” to denote the Venn model ; the c.o.m with the same and is denoted by “”.
Let , then for every is a c.o.m but not a Venn model and is a Venn model but not a c.o.m. This is not always the case, for if the universe is the set of all non-empty subsets of a non-empty set, then the model is both a Venn model and a c.o.m. Every Venn model is embeddable in such a model which is -eq to it. A Venn model with universe is a c.o.m iff for every there is such that whenever .
Let {the class of all Venn models, the class of all c.o.ms, the class of all partial order models, the class of all order models}, then for every there is which is -eq to it, hence (with the usual meaning). This is a corollary of the above discussion and the following observation.
Let be an order model. Define the function ′ from to by , where for a binary relation , Domain . Let be the range of ′ and define the function from to by . Then is a c.o.m which is a Venn model and ′ is a homomorphism from onto . It is an isomorphism iff is antisymmetric. In all cases and are -eq.
2.6. Models and interpretations.
2.6.1. P and . Let be an interpretation of in and let be a model of . Put:
[TABLE]
then is a model of . It is easy to see that:
-
For every positive universal , iff .
-
For every positive particular , only if . The other direction holds iff for every Range there is such that both and whenever . In this case:
[TABLE]
On the other hand, let be an interpretation of in and let be a model of . Put:
[TABLE]
then is a model of .
2.6.2. and . Let be an interpretation of in and let be a model of . Put:
[TABLE]
then is a model of .
On the other hand, let be an interpretation of in and let be a model of . Define:
[TABLE]
then is a model of and for every
[TABLE]
2.6.3. and . Let be an interpretation of in and let be a model of , then is a Venn model of and for every , iff .
On the other hand, let be an interpretation of in and let be a model of . Put:
[TABLE]
then is a model of . It is easy to see that:
-
For every positive universal , iff .
-
For every positive particular , only if . The other direction holds if is an order model, in this case:
[TABLE]
**2.7. Leibniz models. **Let be the partial ordering defined on the set of natural numbers by iff is a multiple of , and define the partial ordering on by iff and . Denote the binary operations of the greatest common divisor and the least common multiple on by and respectively, and put:
[TABLE]
The restriction of on , to be also denoted by “”, partially orders . So, for every , is an order model of , hence a -model and a -model. Such models are called Leibniz models, for they were first introduced -in a different setting- by him in 1679, as may be learned from Łukasiewicz (1998, pp. 126-9), Kneale and Kneale (1966, pp. 337-8) and Glashoff (2002). Leibniz practically defines to be , but he sets iff . To show that this gives rise to an order model as defined in 2.5 above, notice that has an -lower bound iff there is such that <m_{3},n_{3}>R$$<m_{1},n_{1}> and , which is equivalent to and . But , so the condition is equivalent to . The l.h.s. . But , so the condition is equivalent to which is equivalent to Leibniz condition. Via reductio ad absurdum Glashoff (2002) gave a different proof of the same result.
Every Leibniz model is isomorphic to a Venn model. The converse is not true, for is denumerable while there are non-denumerable Venn models.
*2.7.1. Assigning Leibniz models. *For put:
occurs in some element of having two distinct categorical constants.
will be called essentially finite if is finite. This notion may be generalized to subsets of and .
LEMMA 2.12. .
Proof. By proposition 1.11 and induction on the length of the deduction.
THEOREM 2.13. To each consistent essentially finite a Leibniz model of may be assigned (cf. Glashoff 2010, Lemma 3.4).
Proof. Let be the order structure underlying the Leibniz models. Put and let be an injective enumeration of , be an injective -sequence of primes and . Define as follows:
[TABLE]
and for , where
[TABLE]
and are square free finite products (the empty product is equal to 1). By the consistency of , for all . Therefore is a Leibniz model.
To show that :
- Let , then only if only if only if only if . So iff . Consequently, for every , if .
Moreover if for some , , then by the consistency of there are such that and , .
Again by the consistency of , , hence , consequently .
- Let be such that , then only if only if only if only if , , only if . From this and the consistency of it follows that for every , , whenever , respectively.
2.7.2. Leibniz soundness and completeness. For , -deduction is sound with respect to the set of all Leibniz models (to be denoted, henceforth, by “”) as they are order models. Regarding completeness, for put:
[TABLE]
THEOREM 2.14. If is essentially finite, then whenever .
Proof. Obvious if is inconsistent. Let be consistent and , then by theorem 2.13 is inconsistent, from which the result follows.
REMARKS 2.15.
- From Łukasiewicz (1998, pp. 126-9) it follows that:
[TABLE]
where is any sentence (not necessarily basic) of the language , and is the obvious adaptation of to . Consequently, for every :
[TABLE]
the other direction holds if is essentially finite.
- In the above remark, as well as in theorem 2.14, only square free Leibniz models (with the obvious definition) may be taken into consideration.
2.7.3. Generalization. Theorem 2.13 cannot be unconditionally generalized to infinite . For, let be an injective enumeration of some denumerable subset of . Put:
[TABLE]
then is consistent but has no Leibniz model, though it has a Venn model. The following theorem gives a sufficient condition for to have a Leibniz model if is denumerable.
THEOREM 2.16. Let be denumerable and let be an injective enumeration of it. Then has a Leibniz model if it is consistent and for every , is finite.
Proof. Along the lines of the proof of theorem 2.13 with the following modifications. Let be an injective enumeration of the primes, put:
[TABLE]
To see that the condition of the above theorem is essentially necessary, define the equivalence relation on by iff (cf. section 1.5 above).
THEOREM 2.17. If has a Leibniz model then there is a consistent extension of such that is countable and for every , with at most two exceptions, is finite.
Proof. Let be a Leibniz model of . Put:
[TABLE]
then and is countable.
If for some , is infinite, then from which the last part of the theorem follows.
REMARKS 2.18.
-
In the underlying order structure of a Leibniz model , is the greatest element and are the only minimal elements. Let . If then . Also assuming that , then implies that hence , and implies that .
-
There would be no exceptions in the above theorem had been replaced by in the definition of Leibniz models, which is equivalent to excluding and from the universe of Leibniz models.
-
Noticing that forces to be assigned the same value in any Leibniz model of , with a slight modification of its proof, theorem 2.16 may be strengthened as follows:
has a Leibniz model if there is a consistent extension of such that:
-
.
-
is countable.
-
For every is finite.
-
The above strengthening is very close to be the converse of theorem 2.17. As a matter of fact, it is its converse had been replaced by in the definition of Leibniz models.
-
The completeness theorem 2.14 may be generalized in line with the above generalizations.
2.7.4. Logico-philosophical discussion of Leibniz models. “It is strange that his [Leibniz’s] philosophic intuitions, which guided him in his research, yielded such a sound result.” says Łukasiewicz (1998, p. 126). Hopefully the above reasoning would make matters less strange.
Following is a further discussion taking into consideration the Liebnizian correlation between prime and composite numbers on one hand and atomic and composite sentences, propositions, concepts or attributes on the other hand (cf. Glashoff 2002, 2010).
If the primes correspond, respectively, to the atomic sentences , it is natural to let the composite number correspond to the composite sentence . The difficulty here is that , which is not equal to a prime, would correspond to the sentence , which is equivalent to an atomic sentence; as conjunction of sentences is idempotent, while multiplication of numbers is not. Obviously this difficulty will not arise for square free numbers.
Notice that in the definitions of given above, the values assigned by to the elements of are always ordered pairs of square free numbers. Extending this property to all elements of , after relaxing it to permit also to be taken as values, gives rise to what will be called essentially square free Leibniz models.
To investigate the relationship between the Leibniz models and the essentially square free Leibniz models, let be an injective double sequence of primes. Map the kth power of the ith prime on . This mapping may be extended in the obvious way to an injection from to such that and for , is square free (being the empty product of primes, ). The mapping may be further extended, in the obvious way, to , the extension also will be denoted by “”. It may be easily seen that and that iff for every . So for every Leibniz model , is a monomorphism from into which is essentially square free and is basically equivalent to . Moreover if in , is replaced by and by , then will be an isomorphism. Such models will be called proper Leibniz models. Since every Leibniz model is isomorphic to a proper Leibniz model, attention may be confined to the latter.
Let be an injective double sequence of atomic sentences in some sentential language. For put:
[TABLE]
As for no prime , the empty conjunction, which is always true. So is always true.
On the other hand, for every prime , so
[TABLE]
These are the only infinitary sentences to be considered.
It may be easily seen that for every proper Leibniz model , iff is a tautology, and iff is a contradiction.
As a matter of fact and are not indispensable as elements of the universe of proper Leibniz models. To keep them or not is a philosophical choice. Rejecting them is probably more compatible with the Aristotelian legacy.
Following Boole (1948, p. 49), to each the set of all truth assignments which satisfy may be appropriated. For every proper Leibniz model (hence for every Leibniz model) , induces an isomorphism of onto a Venn model which is a concrete order model.
3. Decidability.
REMARKS 3.1. Let and .
-
If , may be easily seen to be contradictory. In such a case is said to be plainly contradictory.
-
iff .
-
If then or .
-
If is not plainly contradictory, then is contradictory iff there are .
-
.
-
In a different context, Glashoff (2005) presents an algorithm which may be regarded as a prelude to the one given below. Roughly speaking, it amounts -in our terminology- to: For a finite , may be obtained from in finitely many steps.
THEOREM 3.2. There is a polynomial (of degree 8) time algorithm to decide for any essentially finite which is not plainly contradictory whether it is contradictory, and to assign a Leibniz model to it if it is not.
Proof. Let satisfy the conditions of the theorem, then is finite. Put and .
The input of the algorithm is structured as a list where , and for every , where and . may be obtained from or supplied as a secondary input. and .
The next step is to extract for each , (the set of all elements of starting with ) which may be done through a simple scanning procedure in a linear time. Then construct (with the obvious meaning) for each .
Notice that and is needed to construct each of and . To construct start with the list . At most comparisons are needed to determine all the possible applicabilities of Barbara. And for each possible applicability at most comparisons are needed to check whether the consequent is already there. If not, append it.
It is needed to repeat this process at most times to cover all the required applications of Barbara. In addition, for each at most comparisons are needed to check whether is listed; if not, append it. It is easy to see that this completes the construction of .
By simple variations on the above procedure may be constructed. Constructing is much simpler.
is contradictory iff for some , which needs at most comparisons to check.
If is consistent assign to it a Leibniz model along the lines of the proof of theorem 2.13 (in the appendix a polynomial (in of degree 6) time algorithm will be presented to generate the first n primes).
The total running time is bounded above by a polynomial (in ) of degree 8.
**4. Basic equivalence of the four formalizations. **Let , let and be bijective interpretations of in and respectively, and let , , .
THEOREM 4.1.
-
is consistent iff is.
-
iff .
Proof. From proposition 1.13, if of and only if of follow.
The other two directions for follow from the corresponding directions for , this is a consequence of proposition 1.14. So it remains to prove these two other directions for , .
Only if of (1): Assume is consistent. Let be a finite subset of , then by theorem 2.13 it has a Leibniz model, say. is a model of ; from this the consistency of follows. Since may be assumed to be the inverse image of some finite , then by subsection 2.6.3, induces a model of . From this the consistency of follows.
If of (2): Let , then is inconsistent. By part (1), is inconsistent, hence .
REMARKS 4.2.
-
As far as the basic sentences are concerned, the four formalizations are equivalent in the sense expressed by part 2 of the above theorem; so it may be said, for brevity, that they are basically equivalent.
-
In the above theorem, the only if direction of (1) and the if direction of (2) may be directly proved for .
**5. Venn soundness and completeness. **Let .
DEFINITION 5.1.** ** iff for every Venn model .
THEOREM 5.2.** **(Venn soundness and completeness). General deduction is sound and complete with respect to the class of Venn models. That is iff .
Proof. Every Venn model is a model (subsection 2.5), then a -model (2 of remarks 2.11). This guarantees soundness.
To prove completeness, let , then is consistent then, by theorem 4.1, is consistent where is a bijective interpretation of in , for some appropriate . By well known results in first order logic, has a model. By subsection 2.6.3 and 2 of remarks 2.11, has a Venn model, hence .
Alternatively theorem 6.3 below may be made use of to directly show that has a Venn model.
REMARKS 5.3.**
**
-
In view of subsection 2.5, the above theorem entails that general deduction is sound and complete with respect to each of the classes of order, partial order, and concrete order models.
-
Direct ways to Venn models on one hand, and to order and partial order models on the other hand, will be presented in sections 6 and 9 respectively.
-
For the Venn soundness and completeness of Łukasiewicz’s system, Shepherdson (1956) may be consulted.
**6. Direct way to Venn models. **Let , put , . Define the function from to by:
[TABLE]
Then is a Venn model (which is a concrete order model), denote it by “”.
LEMMA 6.1.** **For every , the following are equivalent:
-
,
-
(which is equivalent to ),
-
.
*Proof. *Straightforward.
LEMMA 6.2.** **Let , consider:
-
,
-
,
-
(which is equivalent to ),
-
,
then 1 is equivalent to 2 which implies 3 which, for consistent , implies 4.
*Proof. *The first two parts are easy to see. For the last part assume , then and there are such that or , or and or . So there are eight cases to consider. We deal only with the case ; the other cases are similar or easier. In this case , then , then , but , then , then which contradicts that ; from this and the consistency of the result follows.
THEOREM 6.3.** (Existence of Venn models). **Let be consistent, then is a Venn model (which is a concrete order model) of .
*Proof. *Let . By lemma 6.1, iff . From this it follows that for , if .
Moreover, if then then, by lemma 6.2, . Finally, if then, by lemma 6.2, then .
Lemma 6.1 syntactically characterizes , hence it syntactically characterizes . The following syntactical characterization of , hence of ,
[TABLE]
is an immediate consequence of lemma 8.3 below; the definition of “” may be found at the beginning of section 8 below.
Slightly modifying the above construction, light may be shed on the role played by the Venn models among the models of .
THEOREM 6.4.** **For every model there is a Venn model and surjection such that:
- and for every :
iff , iff ,
-
and are basically equivalent,
-
is an isomorphism iff is antisymmetric.
*Proof. *Put:
[TABLE]
[TABLE]
The rest of the proof is easy.
7. Variations on . As was promised in section 1, we follow in subsection 7.1 the long standing tradition of not permitting the subject and the predicate of a categorical sentence to be the same. The resulting formalization, , and its relationship to are discussed.
In subsection 7.2 the standpoint that requires that all are but not vice versa, will be considered.
7.1. Weak natural deduction formalization of AAS. The alphabet of the logical system , the weak natural deduction formalization of AAS, is the same as the alphabet of . The set of sentences of is defined as follows:
[TABLE]
In accordance with subsection 1.6, the set of basic sentences of is itself.
The rules of inference of are those of after dropping the first one (). The weak direct and general deduction relations are respectively denoted by “” and “” and are defined along the lines of definitions 1.8 and 1.9 respectively. The definition of the other notions introduced in the theory of may be modified in the obvious way to render the corresponding definitions for the theory of .
The theory of may be obtained from that of by making the obvious modifications. The key observations are the following, where .
PROPOSITION 7.1.
[TABLE]
Proof. The only if direction is obvious. To prove the other direction let be a -deduction of from . We show by induction that for every , if .
Distinguish between four cases:
-
, then , then .
-
, then the result follows by the induction hypothesis.
-
, then or for some . From this and part 2 the result follows by the induction hypothesis.
-
, then the result follows by the induction hypothesis noting that if is obtained via applying then the first occurrence of in the deduction must be obtained via for some . By part 2 and the induction hypothesis , hence .
PROPOSITION 7.2.
is -consistent iff is -consistent (hence iff ).
Proof. The if direction easily follows from proposition 7.1. To prove the other direction assume that is -inconsistent, then , for some and some . If the result follows by the previous proposition. Else, distinguish between two cases:
-
, then which is not permitted.
-
. In this case there is a -deduction of from . The rule made use of to justify the first occurrence of in this deduction must be for some . By the previous proposition , hence is -inconsistent.
COROLLARY 7.3.** ** is -consistent iff is -consistent iff is -consistent iff is -consistent.
Proof. is -consistent only if is -consistent only if is -consistent only if is -consistent only if is -consistent.
REMARK 7.4.** **From propositions 7.1 and 7.2 it follows that the results concerning Leibniz soundness and completeness (subsection 2.7.2) and Venn and order soundness and completeness (section 5) apply to after replacing and by and respectively.
7.2. Proper natural deduction formalization of AAS. AAS may be interpreted to require to hold iff all are but not vice versa. That is, extensionally, the denotation of “” is required to be a proper subclass of the denotation of “”.
To satisfy this requirement introduce the logical system , the proper natural deduction formalization of AAS, based on the same language as the system . So the set of sentences of is the same as . In accordance with subsection 1.6 the set of basic sentences of is itself. For “” to remain to be the contradictory of “”, it must be interpreted as some are not or (all are and vice versa). The rules of inference of are to be obtained from those of by dropping the first one and augmenting the remaining ones by (-) and (-).
The proper direct and general deduction relations are respectively denoted by “” and “” and are defined along the lines of definitions 1.8 and 1.9 respectively. The definitions of the other notions introduced in the theory of may be modified in the obvious way to render the corresponding definitions for the theory of .
PROPOSITION 7.5. For :
-
iff (iff ).
-
If is -consistent then it is -consistent (equivalently -consistent), but not always vice versa.
-
If (equivalently ) then , but not always vice versa.
*Proof.
-
The proof of part 1 is similar to that of proposition 7.1.
-
That is -consistent if it is -consistent easily follows from part 1. To see that the other direction does not always hold consider for some such that ; this proves part 2.
-
Part 3 is a direct consequence of part 2.
PROPOSITION 7.6.** ** is -consistent iff it is -consistent.
Proof. Along the lines of the proof of part 4 of proposition 1.11.
Order models, Leibniz models, and Venn models are not -models, so it does not make sense to ask whether is sound or complete with respect to any of these classes. However, with some modifications, to be shown below, everything goes as expected.
Let be a -model of such that is injective on and is antisymmetric, and let be subsets of such that and . Put:
[TABLE]
PROPOSITION 7.7.** ** is a -model of .
Proof. Assume . To show that , let then for some and some such that , hence . If then . The other cases are obvious.
To show that is a -model, assume . If not then , then , then which is absurd. So Barbara is valid. The other rules are easier to deal with.
Accordingly, it is legitimate to adopt in the sequel the following modifications:
[TABLE]
THEOREM 7.8.** **Let be -consistent, then it has a modified Venn model which is a modified c.o.m and which is also a -model. If, in addition, is essentially finite then it has also a modified Leibniz model which is a -model.
Proof. Put , then is -consistent, hence it is -consistent, hence is a -model of in which is antisymmetric.
To show that is injective let be such that and , then , then , then , then which contradicts that is -consistent.
Therefore is a modified Venn model (which is also a modified c.o.m) of . By proposition 7.7 it is a -model of , from which it may be easily seen that it is a -model of . The proof of the additional result in case is essentially finite is almost the same. The only major difference is that may not be injective. But its restriction to is injective, which is sufficient for our purpose.
REMARK 7.9.** **The last theorem shows that remark 7.4 applies to after making the obvious modifications.
**8. Direct completion of direct deduction. **In this section the five rules of inference given in definition 1.7 are augmented by five more rules, in order that may be directly obtained from in case is consistent (cf. Glashoff (2005) where related problems are dealt with by brute force via a computer program). The additional five rules are:
-
6.
-
8.
-
Taking the ten rules of inference into consideration, the -deduction relation “” may be defined along the lines of the definition of “”. Likewise, all other definitions involving “” may be modified in an obvious way to give corresponding definitions involving “”.
PROPOSITION 8.1.
-
.
-
.
*Proof. *Along the lines of the proofs of the corresponding results for : Part 1 of proposition 1.11 and lemma 2.12, respectively.
The next definition and parts 1,2 of the next lemma are essentially due to Smith (1983).
DEFINITION 8.2.** **Let . An - chain is a sequence for some , such that and . This chain is said to be a -chain, or a chain in , if ; it is said to be an chain if there is such that and .
LEMMA 8.3.** **For and :
-
iff iff there is an - chain in .
-
iff iff there is such that and .
-
iff or .
3*′*. iff for some , or
and .
- iff .
4*′*. iff for some , or
and .
*Proof.
-
If is easy to show that the first statement implies the second. By induction it may be shown that the second statement implies the third. Again by induction it may be shown that the third statement implies the first.
-
It is easy to show that the first statement implies the second and that the third implies the first. By induction it may be shown that the second statement implies the third.
Parts 3 and 4 are easy. In each of the parts 3*′* and 4*′* one direction is easy, the other may be shown by induction.
PROPOSITION 8.4.** **For :
-
If then .
-
If then .
-
is -consistent iff is -consistent iff is -consistent.
(So for the prefix “-” may be deleted from “-consistent”, “-inconsistent” and “-contradictory”).
Proof. Part 1 is obvious, and part 3 is an easy consequence of parts 1 and 2 above and part 4 of proposition 1.11.
Part 2 is immediate if is -inconsistent. To complete the proof assume that is -consistent and proceed by course of values induction. Let and let be a -deduction of from . If the annotation of () is that it belongs to or that it is the consequent of a -rule whose premises are previous sentences, the result easily follows.
It remains to assume that the annotation of is that it is the consequent of a new rule. The completion of the proof depends on the specific rule in use. Following is a proof in the case of Darii. The other cases are similar or easier.
Let and let its annotation be that it follows from by Darii. By the induction hypothesis . By part 3 of proposition 1.11, , and by the definition of , there is such that . Since is -consistent then, in view of parts 1 and 4 of lemma 8.3, there are such that . By lemma 8.3, and there is such that and , hence . In view of the -consistency of , lemma 8.3 implies that . Let (the other case is similar), then . But , then , hence .
In view of the -deduction completeness with respect to the class of Venn models, part 2 of the above proposition is an immediate consequence of:
PROPOSITION 8.5.** **The -deduction is sound with respect to the class of Venn models (hence with respect to the class of order models).
Proof. Routine.
REMARK 8.6.** **The converse of part 2 of proposition 8.4 does not always hold. For if is inconsistent then , while it is easy to find an inconsistent such that . Also the weaker statement: , does not always hold. A counter example is .
The consistency of solves the problem as the following theorem shows (cf. Smith 1983).
THEOREM 8.7.** **For consistent , .
Proof. The inclusion of in is guaranteed by part 2 of proposition 8.4. For the other direction assume that is consistent and . If is universal the result follows by part 3 of proposition 1.11 and part 1 of proposition 8.4. So it remains to deal with the particulars. The consistency of restricts what to be considered to the following:
Case 1. is for some . By the method made use of in the proof of part 2 proposition 8.4, consideration may be restricted to the following subcase only. There are such that , which implies that .
Case 2. is for some . In this case for some . Distinguish between two subcases.
Subcase 2.1. For some , . Then which implies that .
Subcase 2.2. For some , . This subcase may be divided into the following three subsubcases.
Subsubcase 2.2.1. . Then which implies that .
Subsubcase 2.2.2. and . Then there is such that and ; while or , but -by the consistency of - not both.
This subsubcase may be further divided into two subsubsubcases.
Subsubsubcase 2.2.2.1. but . Then from which follows.
Subsubsubcase 2.2.2.2. but . Similar to subsubsubcase 2.2.2.1.
Subsubcase 2.2.3. and . Then there is such that and , while or . But the consistency of implies that , then , then . In particular, , hence the result.
REMARK 8.8.** **In a different context, Smith (1983):
-
Excluded subcase 2.1 under the claim that it is impossible that .
-
Subsubcase 2.2.3 was deemed to be impossible.
**9. Models of revisited. **An -structure is said to be a -model if for every , whenever .
An immediate consequence of this definition is:
PROPOSITION 9.1.** **An -structure is a -model iff it is a -model (hence satisfying conditions 1-4 of proposition 2.3) and:
-
.
-
.
Along the lines of the proofs of lemma 2.5, theorem 2.6 and theorem 2.10, the following may be proved:
THEOREM 9.2.** **For every :
-
is a -model (of , hence of ).
-
-deduction is sound and complete with respect to the class of -models. That is iff .
-
iff for some finite .
(This is called -compactness).
REMARK 9.3.** **All remarks given in remarks and definitions 2.11 hold with “” replacing “”. All proofs of the original versions essentially go through; the only exception is the first remark, whose modified version may be proved by part 2 of proposition 8.4.
THEOREM 9.4.** **An -structure is a -model iff it is a -model and:
-
, or
-
.
Proof. Only if: By part 2 of proposition 8.4 and an obvious generalization of part 3 of remarks and definitions 2.11.
If: Every -structure which satisfies condition 2 is a -model. So, assume that is a -model which satisfies condition 1. To see that it is a -model, let , and . By remark 9.3, is consistent, hence by theorem 8.7, , hence .
Theorem 9.4 fully characterizes the class of -models, as was promised after the proof of theorem 2.10.
DEFINITIONS and remarks 9.5.**
**
- For an -structure and a relation symbol , define (the basic -theory of ), (the basic positive theory of ), (the basic negative theory of ) and (the basic theory of ) as follows:
.
.
.
.
So two -structures are -equivalent iff they have the same basic theory.
For let be an -structure.
- is said to be a substructure of and is said to be a superstructure of if and for every , . If, morever, Range Range , is said to be a core substructure of . Obviously each -structure has a unique core substructure, to be called its core substructure. is a core substructure of some -structure iff it is the core substructure of itself iff Range . In this case is said to be a core structure. Obviously every canonical structure is a core structure.
, have the same core substructure iff and .
- If is a substructure of then they have the same core substructure and the three structures have the same basic theory. Hence for if one of them is an -model, so also are the other two.
In this case is said to be an -submodel of , and is said to be an -supermodel of ; and the core substructure is said also to be the core -submodel. If a core structure is an -model, it is said to be a core -model.
- is said to be a positive semisubstructure of and is said to be a positive semisuperstructure of if , and:
for every ,
for every .
In this case and . For each if, in addition, and are both -models, it is said also that is a positive -semisubmodel of and is a positive -semisupermodel of .
THEOREM 9.6.** **For each if (defined as above) is a consistent core -model then there is an order model such that:
-
is a positive -semisupermodel of .
-
If is a partial ordering, then so also is .
-
If is complete, then it is the core -submodel of .
For , the above holds after weakening part 1 to become:
1*′*. , and ; hence .
Proof. Let and let be a consistent core -model. Put:
or , and has no -lower bound,
( may be assumed disjoint),
or ,
is reflexive on since is reflexive on . To prove the transitivity of , let . If or belongs to then , else . If then . It remains to consider the case where for some such that or , in both cases . So is transitive. Hence is an order model, which is to be denoted by “”.
To prove part 2 it suffices to notice that if then they both belong to or both belong to .
To prove parts 1, 1*′* notice that and, by the disjointness of , . Let . If has an -lower bound then it is an -lower bound, else the element is an -lower bound of the subset . In both cases , hence .
At this point the proof forks into two branches:
(i) Assume and let . To show that several cases have to be considered, following is one of them, the others are similar or easier.
There is such that . Since Range then, by theorem 9.4 and part 5 of proposition 9.1, . So . Hence .
That and is guaranteed by the consistency of . This completes the proof of 1.
(ii) The other branch is . To show that assume that there is , then , then has an -lower bound. To show that this is absurd, several cases have to be considered; following is one of them, the others are easier or similar.
There is such that . Since Range then, by parts 3, 4 of proposition 2.3, which contradicts the consistency of .
That is guaranteed by the consistency of , since . This completes the proof of 1*′* and ends the forkation.
For , if is complete then “” may be replaced by “” at the appropriate places, which proves part 3.
Taking the relationship between the -models () and their respective core -submodels into consideration, a weaker result, which holds for a wider class of -models, immediately follows:
COROLLARY 9.7.** **For , if is an -model whose core -submodel is consistent, then there is an order model such that
. Moreover,
[TABLE]
In view of the last part of subsection 2.5, the above corollary may be immediately strengthened as follows:
COROLLARY 9.8.** **In the above corollary “an order model” may be replaced by “a partial order model which is a c.o.m and a Venn model at the same time”.
Part 4 of theorem 1.12 may be extended to the case , to get a result similar to that obtained there for the case ; the result obtained (there) for the case is weaker. Call the collection of these three results “syntactical congruence”.
Syntactical congruence together with the definitions of core -models () yield semantical congruence as formulated by parts 1 and 2 of the next theorem. Part 3 of the same theorem (whose proof is straightforward) strengthens the conclusion of part 2, under some additional condition. Alternatively, semantical congruence may be directly proved by the characterizations of -models () given in propositions 2.3 and 9.1 and theorem 9.4.
THEOREM 9.9.** **Let and let be a core -model (consistent or not). Put , then:
- is a congruence relation on and is a partial ordering on .
Moreover, for :
-
is a congruence relation on . The mapping is an epimorphism from onto . Hence is a core -model which is basically equivalent to .
-
If is, in addition, an order model, then is also a partial order model.
For , semantical congruence makes it possible to replace “” in theorem 9.6 by “”. This provides, for , an alternative proof of a weaker form of corollary 9.8, where the partial order model may be neither concrete nor Venn.
The corresponding weaker result for the case may likewise be obtained, but the alternative proof is a bit more involved.
REMARKS and definitions 9.10.**
**
-
Theorem 9.6 (or corollary 9.7) and corollary 9.8 (or its weaker forms) provide, respectively, direct ways to order models and partial order models for consistent . Simply in each of them let the core -model be the canonical structure . In the case of corollary 9.8 the partial order model may be required to be a concrete order model and a Venn model at the same time.
-
Let and let be a class of -structures, then:
-
is said to be -strongly semantically complete if for every there is such that .
-
is said to be -syntactically complete if for every , whenever .
-
is said to be -consistently syntactically complete if for every -consistent and every , whenever .
-
is said to be -consistently semantically complete if every -consistent has a model in .
For , the condition given in clause implies the condition given in clause .
- Put:
= the class of all order models,
= the class of all partial order models,
= the set of all Leibniz models,
= the class of all concrete order models,
= the class of all Venn models.
And for put:
= .
Also put:
.
-
, .
-
Every element of is a -model.
Every element of is a -model.
Every element of is a -model.
- For , is -strongly semantically complete.
For , (respectively ) is -consistently semantically (respectively consistently syntactically, syntactically) complete. If is finite, the exclusion of may be dropped.
-
For and , is said to be -syntactically complete if for every , or .
-
For and , if is consistent and -syntactically complete then there is such that . If, moreover, is finite, the exclusion of may be dropped.
**10. Decidability revisited.
**
THEOREM 10.1.** **For each there is a polynomial (of degree at most 8) time algorithm to decide for any whether , provided that is essentially finite and:
.
Proof. For a proof may be obtained by slightly modifying the appropriate parts of the proof of theorem 3.2.
In view of lemma 8.3, a proof for the case may be obtained along the same lines as above.
In view of remarks 3.1, the first part of this theorem may be made use of to determine whether is inconsistent. If yes, ; else iff , by theorem 8.7.
**11. Sorites. **Soriteses are well known in Aristotelian syllogistic (see Hurley, P. J. 1982, p. 201; Rosenthal, M. and Yudin, R (eds.) 1967, p. 423; also cf. Boger, G. 1998, pp. 197-8; Smiley, T.J. 1973, pp. 139-40).
The notion of a sorites may be explicated as follows.
DEFINITION 11.1.** **Let and let . An annotation of an -deduction from is said to be an -sorites annotation if the following conditions are satisfied:
-
whenever .
-
For , is involved in the annotation of another sentence in the following and only in the following way.
2.1. If then exactly one of the following holds:
2.1.1. is annotated as the consequent of by some -rule with one premise.
2.1.2. is annotated as the consequent of , or , by some -rule with two premises.
2.1.3. is annotated as the consequent of , or , by some -rule with two premises.
2.2. If then exactly one of 2.1.1 and 2.1.2 holds.
2.3. If then exactly one of the following holds:
2.3.1. and 2.1.1 holds.
2.3.2. and exactly one of 2.1.1 and 2.1.3 holds.
An -sorites from is an -deduction from which admits a sorites annotation. An -sorites of from is an -deduction of from which is an -sorites. In case there is such a sorites, we write “”.
Condition 2 of the above definition entails that, with the exception of the last sentence, every sentence occurring in an -sorites from is made use of exactly once as a premise of some application of some -rule, and in this (hence in each) application the premise or the premises immediately precede the consequent.
For there is, obviously, a set and an -deduction from which is not an -sorites from . So the best we may hope for is to find an -sorites of from , for every such that . Even this is not always attainable.
Let and , then for , is consistent and , but not . For , adding the rule (-sub) as an additional rule of inference will solve the problem. Same holds for if, instead, (Ferison) is added.
11.1. Further extension of direct deduction. Taking into consideration the following two rules of inference.
- -sub 11. Ferison
in addition to the ten rules of inference of , the -deduction relation “” may be defined along the lines of the definition of “”. Likewise all the other definitions involving “” may be modified in an obvious way to give corresponding definitions involving “”.
PROPOSITION 11.2.** **For ,
[TABLE]
Proof. One direction is obvious, the other is easy.
DEFINITION/remark 11.3.** **The -models may be defined along the lines of the definition of the -models.
By the above proposition they are the same.
THEOREM 11.4.** **Let and then:
[TABLE]
provided one of the following conditions holds:
-
is affirmative,
-
is universal negative and is consistent,
-
is particular negative, or is consistent and .
The other direction unconditionally holds, so the two sides are equivalent if is consistent and .
Proof. Assume . Distinguish between the following cases.
- , for some . By proposition 11.2 and lemma 8.3 there is an - chain in , say. We may assume that this chain is injective. If , then there is an -sorites of from of length 1. Else ; define as follows:
[TABLE]
Then is an -sorites of from .
- , for some . If , the result is an easy consequence of part 1 of this proof and lemma 8.3. Else, by proposition 11.2 we may assume that . By lemma 8.3 it suffices to deal with the following three subcases (for some ):
2.1. and ,
2.2. and ,
2.3. .
Assume 2.1 (the other two subcases are not harder), then there are an - chain and a - chain in , let them be, respectively and . We may assume that the ranges of these two chains are disjoint, otherwise this subcase will be reduced to subcase 2.3. Also we may assume that each of these two chains is injective. The following is a (hence a )-sorites of from : .
-
, for some and is consistent. By proposition 11.2 and lemma 8.3 there are such that and there are an - chain and a - chain in , let them be, respectively, and . By the consistency of , the ranges of the two chains are disjoint. Moreover, we may assume that each of them is injective. Let (the other case is not harder), then the following is an -sorites of from : .
-
, for some . If , then there is a one line -sorites of from . Else assume that is consistent and , by proposition 11.2 and lemma 8.3 it suffices to deal with the following two subcases.
4.1. There are such that and . Making use of Bocardo and Baroco it may be shown, along the lines of part 3 of this proof, that there is a (hence a )-sorites of from .
4.2. There is such that . As in part 3 of this proof, there are such that:
[TABLE]
By lemma 8.3 it suffices to deal with the following two subsubcases.
4.2.1. For some , . By this and (), . So there are -- and - injective -chains; let them be and respectively.
By the consistency of , the range of and the union of the ranges of and are disjoint. Assume that the ranges of and have only in common (the other case is similar).
If , then there is a (hence a )-sorites of from . Else and the following is a -sorites of from .
(here -sub is made use of), .
4.2.2. and , for some . By this and (*), . So there are -- and - injective -chains; let them be and respectively.
By the consistency of , the range of and the union of the ranges of and are disjoint. If the ranges of and are not disjoint, this case will be reduced to the above case; so assume that they are disjoint.
If then there is a (hence a )-sorites of from . Else , assume (the other case is similar), then the following is a -sorites of from . (here Ferison is made use of), .
PROPOSITION 11.5.** **If is inconsistent then it is ds-inconsistent, in the sense that there is such that .
Proof. Let be inconsistent, then there is a universal such that . Distinguish between two cases:
-
, for some . In this case the result is a direct consequence of theorem 11.4.
-
, for some . As in part 3 of the proof of theorem 11.4, there are such that and there are injective -- chains in ; let them be, respectively, and .
If the ranges of these chains are disjoint, the result follows by theorem 11.4 and the methods made use of in its proof. Else there is . Then there are injective -- chains in . Along the lines of the proof of theorem 11.4 it may be shown that (and ).
To show that the consistency condition in each of the parts 2,3 of theorem 11.4 cannot be completely dispensed with, we prove:
PROPOSITION 11.6.** **Let and ; and assume that .
- If every - chain in is a chain, then occurs as an assumption in every -deduction of from ; moreover it is made use of as a premise in the deduction if it is different from .
In parts 2 and 3 below, is assumed to be the only universal negative sentence in .
-
If every - chain and every - chain in is a chain, then occurs as an assumption and is made use of as a premise in every -deduction of and every -deduction of from .
-
If every - chain, every - chain, every - chain and every - chain in is a chain, then for every injective -deduction of from there is such that is made use of as a premise at least twice.
In parts 4 and 5 below, is assumed to be the only negative sentence in .
-
If every - chain or every - chain in is a chain, then occurs as an assumption and is made use of as a premise in every -deduction of from .
-
If every - chain and every - chain in is a chain, then for every injective -deduction of from there is such that is made use of as a premise at least twice.
Proof. Generalize the first part to become:
For every , if every - chain in is a chain, then occurs as an assumption in every -deduction of from ; moreover, it is made use of as a premise in the deduction if it is different from .
The stronger statement may be easily proved by course of values induction on the length of the -deduction.
Parts 2 and 4 may be proved similarly.
Again generalize part 3 to become:
For every if every - chain, every - chain, every - chain and every - chain in is a chain, then for every injective -deduction of from there is such that is made use of as a premise at least twice.
The stronger statement may be proved by course of values induction on as follows. Assume the required for and let be an -deduction of from . Since are disjoint and , then there are only two cases to consider:
-
For some , in this case the result is immediate by the induction hypothesis.
-
For some and some it is the case that and is obtained from them as the conclusion of applying the rule .
If is made use of as a premise in a step whose conclusion is for some , the result is immediate. Also if there is some - chain in which is not a chain, the result follows by the induction hypothesis.
So it remains to assume that every - chain in is a chain and for every , is not made use of as a premise in the step which gives rise to . Put:
[TABLE]
Then for some , . Hence is an -deduction of from , so by parts 1,2 above .
Assume, towards a contradiction, that for every is made use of as a premise at most once. Then is an -deduction of from . Again by parts 1,2 above, . Hence the result.
Part 5 may be proved similarly.
EXAMPLES 11.7. To see that the consistency condition in each of the parts 2,3 of theorem 11.4 cannot be completely dispensed with, put:
,
, .
For , (in fact ), but by the above proposition .
This example still works even if is augmented by all of the Aristotelian syllogisms.
To see that consistency is not always necessary, just notice that whether is consistent or not, whenever .
Following are basic properties of sorites.
DEFINITIONS and remarks 11.8.** **Let , and , and let be an -sorites of from according to some annotation.
-
Two annotations of an -deduction from are said to be essentially the same if the only difference between them is interchanging “assumption” (i.e. the corresponding sentence belongs to ) and “A-Id” in some places. An -deduction from is said to have essentially one, or unique, annotation (of some sort) if all of its annotations (of this sort) are essentially the same.
-
For , is an -sorites from according to the restriction of the given annotation iff or the annotation of is neither “A-Id” nor “assumption”.
-
Let , then for at least one , is an -sorites from according to the restriction of the given annotation.
-
For , every -sorites from has essentially one sorites annotation. This does not apply to , for the -deduction has two -sorites annotations which are not essentially the same. Only one of them is a -sorites annotation.
-
Ferison and -sub are the only -rules which are not -rules. In every -sorites annotation at most one of them is made use of, at most once.
-
In each -sorites at most one triple of the form or occurs, at most once.
If no such triple occurs, the sorities will have an essentially unique -sorites annotation. Else all of its -sorites annotations are essentially the same, with the only exception that an occurrence of “” may be annotated as the consequence of the preceding two sentences by Ferio (which is a -rule) in some of them and by Ferison (which is not) in the others.
**12. Independence.
**
DEFINITIONS 12.1.** **Let be a deduction system and let be a rule of . The deduction system obtained from by excluding will be denoted by “”.
-
is said to be derivable in if whenever is a set of antecedents of an instance of , and is the corresponding conclusion. Otherwise is said to be independent in .
-
is said to be independent if each of its rules is independent in it.
-
is said to be weakly independent in if for some set of sentences, while .
-
is said to be weakly independent if each of its rules is weakly independent in it.
REMARKS 12.2.
-
Independence implies weak independence.
-
For , each rule of is independent in iff it is weakly independent in , hence is independent iff it is weakly independent.
-
Each of is independent (cf. Glashoff (2005) where similar results are obtained via brute force computation).
-
The independence of each of and is an immediate consequence of the independence of each of and respectively.
THEOREM 12.3.**
**
-
-sub, Ferio and Ferison are derivable in , hence in . Each of the other rules of is independent in , hence in .
-
is not independent, hence not weakly independent.
-
is weakly independent; however, it is not independent.
Proof.
- The sequence shows that -sub is derivable in . The corresponding proofs for Ferio and Ferison are not harder.
Put = Bocardo and let be three pairwise distinct elements of . Put and . It is easy to see that if then . From this the independence of in follows. Similarly the other required results may be obtained.
-
By part 1 above and part 2 of remarks 12.2.
-
Part 1 above shows that is not independent. It shows also that to prove the weak independence of it suffices to deal with -sub, Ferio and Ferison only.
Put -sub and let and , for some distinct . . To see that notice that the only rules which yield an -sentence are Ferio, Baroco, Bocardo, and Ferison. To obtain by applying Baroco or Bocardo the -sentence occurring as one of the antecedents -in the present case- will be the same as the conclusion, which is forbidden in sorites. To apply Ferio or Ferison, the antencedents -in the present case- must be and . But if is a subsequence of a sorites deduction from and the annotation of is that it is obtained from or by some rule, then . So neither Ferio nor Ferison is applicable, hence .
Next, put Ferio (Ferison) and let () and for some pairwise distinct . By a slight modification of the above technique it may be shown that , but .
12.1. Independence of and variations thereof. and have the same deduction rules, but the notion of -deduction is weaker than that of -deduction. By definition 1.9, for , iff is -inconsistent. Likewise for each rule of (equivalently of ) define the -deduction relation “” by: iff is -inconsistent. From this and remarks 12.2 it easily follows that is independent in iff it is weakly independent in , hence is independent iff it is weakly independent.
THEOREM 12.4.** ** is independent.
Proof. Let be pairwise distinct elements of , and put Barbara and . iff is -consistent. But the set of all -consequences of is , hence is -consistent. Consequently Barbara is independent in .
The proofs of the independence of the other rules are similar or easier.
To get closer to the usual deduction systems, we introduce two new deduction systems , and show that each of them is equivalent to and discuss its independence.
12.1.1. First variation on . The deduction system is obtained by augmenting the system by the rule: (contradiction, Co for short).
Let . It is easy to see that if then there is a -deduction of from in which Co is never made use of or it is made use of only at the last step; moreover, this applies to for each rule of .
THEOREM 12.5.** **The following are equivalent:
-
,
-
,
-
or is inconsistent,
-
is inconsistent.
Proof. Easy if is inconsistent; and in all cases parts 1 and 4 are equivalent by definition 1.9.
Assume is consistent. By theorem 8.7, parts 1 and 3 are equivalent, and by the definition of , part 3 implies part 2. Finally assume part 2, then there is a -deduction of from in which Co is never made use of, this implies part 3.
The following theorem settles the indepenence of .
THEOREM 12.6.**
**
-
Every rule of is independent in iff it is weakly independent in , hence is independent iff it is weakly independent.
-
iff .
-
For every rule of , iff or is -inconsistent.
-
is independent.
Proof. The proof of the first three parts is easy.
To prove the last part let , then is an instance of Co. But by lemma 8.3, . So by part 2 above . Therefore Co is independent in . To complete the proof let be some other rule of , then is a rule of . By part 3 above the independence of in may be proved by choosing a consistent set of antecedents of such that , where is the corresponding conclusion; which is always possible.
12.1.2. Second variation on . Though is closer than to the contemporary deduction systems, it is not as close to the Aristotelian spirit as . Inspired by Gentzen-type sequent systems (cf. Kleene, S.C. 1967, p. 306) we introduce a second variation on , which will hopefully be close enough to both modern and Aristotelian traditions. The deduction rules of are:
0*′*. A-Id
1*′*.
2*′*.
3*′*.
4*′*.
5*′*.
6*′*.
where , and . “Ass” and “Raa” are abbreviations for “Assumption” and “Reductio ad absurdum” respectively. “” is just a symbol, instead we could have made use of ordered pairs and write, e.g. “” in place of “”.
DEFINITION and remarks 12.7.** **Let be a set of sequents, i.e. S\subseteq\{$$\Gamma\vdash\sigma:\Gamma\cup\{\sigma\mathbf{\}\subseteq}BN(C)\mathbf{\}}.
- A -deduction from is a sequence of sequents, where and for each , or may be obtained from preceding terms of the sequence by some -deduction rule.
If , is said to be a -deduction of from . In this case we write .
-
We write “” for “”, “-deduction” for “-deduction from ” and “-deduction of (or of )” for “-deduction of (or of ) from ”.
-
The above definition and remark may be generalized to subsystems of .
-
The notions of derivability, independence and weak independence may be extended to in the obvious way.
-
A deduction rule of is independent in iff it is weakly independent in . Hence is independent iff it is weakly independent.
THEOREM 12.8.** **For every :
[TABLE]
Proof. Let then, by definition 1.9, for some . Let and be, respectively, -deductions of and from . Then and are, respectively, -deductions of and . To their concatenation (which is a -deduction) add one more line to obtain by Raa from lines and . This proves the only if direction.
To prove the other direction let , let be a -deduction of , and assume that for each . To show that we deal with as many cases as there are -deduction rules. Following we consider Celarent*′* (4*′) and Raa (6′*), the other cases are similar or easier.
Celarent*′*: There are and such that and . By the above assumption and . Hence . So by part 7 of proposition 1.11, .
Raa: There are such that ; ; , , , and . By the above assumption and . Hence . So by part 4 of proposition 1.11 and the relevant definitions, .
Notice that in the -deductions and which occur in the proof of the only if direction of the above theorem, only the rules are made use of. Moreover, if , then there is a -deduction of in which Raa is never made use of.
This is essentially sufficient to prove the following:
COROLLARY 12.9. If then there is a -deduction of in which Raa is never made use of or is made use of only in the last step.
This section is concluded by proving the independence of .
THEOREM 12.10.** **Let , be a -deduction rule, and be the corresponding -deduction rule.
-
iff ,
-
iff iff iff ,
-
iff ,
-
is independent.
Proof. The proofs of the first and the third parts are along the lines of the proof of theorem 12.8 noting that proposition 1.11 still holds after replacing “”,“” by “”,“” respectively. For part 2 it is sufficient to notice that each of the four statements holds iff is of the form for some and some .
To prove the last part we consider three cases:
Rules : Let be one of these rules and let be the corresponding -rule. Since is independent in , there is a set of antecedents of an instance of such that , where is the corresponding conclusion. Put then is a set of antecedents of and is the corresponding conclusion. By parts 2,3 of definition and remarks 12.7 and part 1 above:
iff iff iff .
But , hence is independent in .
Rule Ass: For , while, by 2 above, . Hence Ass is independent in .
Rule Raa: For distinct elements of let , , and then S is a set of antecedents of Raa and is the corresponding conclusion. By a slight modification of the proof given above for the rules it may be shown that Raa is independent in .
**13. Algebraic semantics of AAS, a prelude. **The most well known attempt to algebraically interpret Aristotelian syllogistic is that of Boole (1948, first published 1847); however, it is not the first. More than a century and a half earlier, this area of research was pioneered by Leibniz (Kneale and Kneale 1966, pp. 338-45; Lenzen 2004). Following is a discussion of the subject in general; the works of Leibniz and Boole will be briefly discussed in section 17 below.
Regarding the central role played by order models in the semantics of , they will be our starting point for algebraization. Each underlying order structure of an order model will induce an algebra which may be expanded to make the interpretation of possible.
The simplicity of order models stems from the fact that all relations are determined by only one of them, namely the interpretation of , which is compatible with the Aristotelian view that Barbara is the essential syllogism. Likewise, algebras defined in this section will each have one (partial) binary operation and no others.
DEFINITIONS and remarks 13.1.
-
Let be a non-empty set and let be a function from a subset of to , then is said to be a partial binary operation on , and is said to be a partial algebra.
-
Let be a binary relation on a set , the partial binary operation induced by on is defined by:
[TABLE]
[TABLE]
is commutative (see 3.3 below) if is antisymmetric. is called the partial algebra induced by .
-
Let be an order structure, then satisfies:
-
Right associativity:
[TABLE]
in the sense that for every if the rhs exists, so does the lhs and they are equal.
- Idempotence:
[TABLE]
If, moreover, is antisymmetric, then satisfies:
- Commutativity:
[TABLE]
in the sense that for every , if both sides exist they are equal.
Honouring Leibniz, a partial algebra satisfying conditions 1 and 2 will be called a Leibniz algebra (LA for short). If, moreover, it satisfies condition 3 it will be called a commutative Leibniz algebra (CLA for short).
So, is a LA if is an order structure; moreover, it is a CLA if is antisymmetric.
-
An idempotent partial algebra will be called a weak Leibniz algebra (WLA for short) if it satisfies:
-
Weak right associativity:
[TABLE]
in the sense that for every if and the rhs both exist, then the lhs exists and equals the rhs.
If, moreover, is commutative (in the sense of condition 3.3 above), it will be called a commutative weak Leibniz algebra (CWLA for short).
Obviously every LA (CLA) is a WLA (CWLA).
- With abuse of notation, “LA”, “CLA”, “WLA” and “CWLA” will denote also the classes of all LAs, CLAs, WLAs and CWLAs respectively; what is intended will be clear from the context.
Abuses of notations such as this may take place later on without further notice.
- Let be a partial algbra, the binary relation induced by on is defined by:
[TABLE]
so iff (in the sense that the lhs exists and equals the rhs). Obviously, is antisymmetric if is commutative.
-
Let and be, respectively, a binary relation and a partial binary operation on a set , and let , , and be as defined above. Then:
-
.
-
; moreover, is commutative if is.
-
Let be a WLA, then is an order structure, called the order structure induced by . Moreover, is antisymmetric if is commutative.
**14. Algebraic interpretation of .
**
DEFINITION 14.1. Let be a WLA and let . The structure is said to be a weak Leibniz structure (WLS for short). The reduct shall be called the WLA base of .
Leibniz structures (LS for short), commutative Leibniz structures (CLS for short) and commutative weak Leibniz structures (CWLS for short) are defined analogously.
The following definition shows how may be interpreted in these structures. So they may, and will, be considered as -structures and will be treated like other -structures when dealing with semantics. In particular, all semantical notions (such as “ is a (n algebraic) model of ” or “”, for and WLS) will be assumed to be known.
DEFINITION 14.2. Let be a WLS, and let , then:
- iff exists and equals
(iff
- iff the system of equations ,
has a solution
(iff the equation has a solution,
iff the equation has a solution).
-
iff
-
iff
REMARKS 14.3.
-
The order (partial order) model is basically equivalent to the WLS (CLS), .
-
The WLS (CLS), , is basically equivalent to the order (partial order) model .
-
Consequently, every WLS (hence every LS, every CWLS and every CLS) is an -model for .
-
In the light of remarks and definitions 9.10 it may be easily seen that:
-
For , is sound wrt WLS, hence wrt every subclass of it.
-
is CLS-syntactically complete.
-
For , is CLS-consistently syntactically complete.
-
For , is CLS-consistently semantically complete.
In clauses 2-4, CLS may be replaced by any class intermediate between it and WLS.
**15. Annihilators: Embedding the partial into a total. **An annihilator of a (partial) binary operation on a set is an element such that:
[TABLE]
Obviously has at most one annihilator.
An annihilator algebra is an ordered triple such that the reduct is a partial algebra, and is an annihilator of .
The subreduct of is the ordered pair , where:
[TABLE]
Here, and in the sequel, is assumed to be non-empty.
DEFINITIONS 15.1.
- An annihilator Leibniz algebra (ALA for short) is an annihilator algebra whose subreduct is a LA.
Annihilator commutative Leibniz algebras (ACLA for short), annihilator weak Leibniz algebras (AWLA for short) and annihilator commutative weak Leibniz algebras (ACWLA for short) are defined analogously.
- A Leibniz algebra with annihilator (LAA for short) is an annihilator algebra whose reduct is a LA.
Commutative Leibniz algebras with annihilators (CLAA for short), weak Leibniz algebras with annihilators (WLAA for short) and commutative weak Leibniz algebras with annihilators (CWLAA for short) are defined analogously.
As usual, an algebra or a structure based on an algebra is said to be total if each of its operations is total. “TLA” will stand for “total Leibniz algebra”, “TLS” will stand for “total Leibniz structure” and similarly for the other cases.
REMARKS 15.2.
- The subreduct of an annihilator algebra is a LA (respectively CLA, WLA or CWLA) if the reduct is.
Hence LAA ALA, and similarly for the other cases.
- Let be a total annihilator algebra whose subreduct also is total. Then is LAA iff it is ALA; “LA” may be replaced by “CLA”, “WLA” or “CWLA”.
[TABLE]
The order structures induced by these algebras are lower semilattices with smallest elements.
In the above equations “C”, the last “A” or both, may be dropped every- where (the corresponding parenthetic clause is to be modified accordingly).
The following definition designates to each partial algebra a total annihilator algebra in which it may be embedded.
DEFINITION and remarks 15.3.
- For , let be a partial algebra. A bijection from to is said to be an isomorphism from to if for every :
[TABLE]
and are said to be isomorphic if there is an isomorphism from one of them to the other.
-
If two partial algebras are isomorphic and one of the partial binary operations has an annihilator, then its image is an annihilator of the other.
-
Two annihilator algebras are said to be isomorphic if their reducts are.
-
Two total annihilator algebras are isomorphic iff their subreducts are.
-
Every partial algebra is the subreduct of some total annihilator algebra. For, let and .
Put:
[TABLE]
Then is a total annihilator algebra, and is its subreduct.
- Every total annihilator algebra whose subreduct is isomorphic to , is isomorphic to . This warrants calling the total annihilator algebra induced by .
The identity map on is an embedding of into .
- is a TALA iff is a LA. “LA” may be replaced by “CLA”, “WLA” or “CWLA”.
**16. Back to algebraic interpretation. **Let be a WLAA, then its reduct is a WLA. So is a WLS, for every . Obviously, for all , is satisfied in this structure. Hence none of nor is consistently semantically complete wrt any class of such structures, though every one of them is sound wrt each of these classes. Evidently expanding the structure to will not solve the problem.
As a matter of fact, the annihilator is the source of the difficulty, and we may get around it by not permitting the annihilator to be assigned as a value corresponding to any element of , nor accepting it as a solution of any of the relevant equations below. An additional advantage of this approach is to be able to consider the more general AWLA.
DEFINITION 16.1. (non-annihilator interpretation of in annihilator algebras)
- Let be an AWLA and let . The structure is called an annihilator weak Leibniz structure (AWLS for short). The reduct of is called the AWLA base of .
The structures based on the other algebras (total or not) are defined, and their names are abbreviated, analogously.
- For each :
[TABLE]
This shows how may be interpreted in the structures defined in part
- So they may, and will, be considered as -structures and will be treated like other -structures when dealing with semantics. In particular, all semantical notions (such as “ is a (total algebraic) model of ” or “”, for and AWLS) will be assumed to be known.
REMARKS 16.2.
-
and are basically equivalent, hence every AWLS is an -model for .
-
In part 4 of remarks 14.3, “WLS” and “CLS” may be, respectively, replaced by “TAWLS” and “TACLS”.
-
If is a TCLSA (equivalently TCWLSA), the provisions given in part 2 of definitions 16.1 may be simplified in the obvious way; in particular, the second provision will be equivalent to .
To investigate the relationship between TCLSA and the Venn models we make use of a (n intermediate) subclass of TCLSA, namely the subclass of those TCLSA based on OSLA which are reducts of Boolean algebras.
These reducts will be called Boolean-Leibniz algebras with annihilators, BLAA for short. As usual BLSA is a Boolean-Leibniz structure with annihilator, i.e. a LSA based on a BLAA.
PROPOSITION 16.3.** **Every TCLAA may be embedded in a BLAA.
Proof. Let be a TCLAA. The mapping:
[TABLE]
is an embedding of in the BLAA: .
will be called the BLAA corresponding to and will be denoted by “”. For , is a BLSA; it will be called the BLSA corresponding to, the TCLSA, and will be denoted by “”. The relevant definitions and part 3 of remarks 16.2 show that and are basically equivalent.
THEOREM 16.4.** **Every TCLSA is basically equivalent to a Venn model. And every Venn model is basically equivalent to a BLSA (hence to a TCLSA); moreover, the BLSA may be assumed to be based on a concrete BLAA whose universe is a power set.
Proof. Let be a TCLSA, then is a BLSA and is a Venn model. They all are basically equivalent.
On the other hand, let be a Venn model, then is a BLSA which is basically equivalent to it.
COROLLARY 16.5.** **In part 4 of remarks 14.3, “WLS” and “CLS” may be, respectively, replaced by “TWLSA” and “BLSA” (either the superclass TCLSA, or the subclass consisting of those elements each of which is based on a concrete BLAA whose universe is a power set, may replace BLSA).
**17. Leibniz and Boole. **The calculus de continentibus et contentis, or the calculus of identity and inclusion -which is an algebraic treatment of concepts- was developed by Leibniz during 1679-90 (Kneale and Kneale 1966, p. 337). As may be gathered from a passage of the same reference (pp. 340-3), or from a translation of an original text of Leibniz (Lewis 1960, pp. 297-305), this calculus is the theory of operational semilattices (OSL for short) with applications to concepts; commutativity and idempotence are explicitly stated, while associativity is implicitly taken for granted (the aforementioned passage is abbreviated with some slight changes from the aforementioned translation (Kneale and Kneale 1966, p. 343); notice that the edition of Lewis’ book referred to in Kneale and Kneale (1966) is earlier than the one referred to above).
Kneale and Kneale (1966)’s assessment of this calculus is unfavorable. It asserts (p. 337) that Leibniz “intended, no doubt, to produce something wider than traditional logic. […]. But […] he never succeeded in producing a calculus which covered even the whole theory of syllogism.”. On p. 345 this assertion is elaborated “What he [Leibniz] produced was certainly much less than he hoped to produce. For the last scheme [the calculus de continentibus et contentis], lacking as it does any provision for negation or for consideration of conjunction and disjunction together, is still a fragment. So far from including all Aristotle’s syllogistic theory as a part, it contains no principle of syllogism except the first […]”.
Likewise, Lenzen (2004)’s assessment of the calculus de continentibus et contentis is unfavorable. It asserts (p. 28) that this calculus “remains a very weak and uninteresting system […]; thus it shall no longer be considered here.”.
On the contrary, we have shown that neither negation (of terms) nor any additional operations are needed to algebraically interpret AAS. It suffices to require the OSL to possess an annihilator, i.e. to be OSLA. For the structures based on the OSLA are the TCLSA and, by corollary 16.5, AAS is both sound and complete with respect to them.
According to Kneale and Kneale (1966, p. 339) it may be seen that Leibniz practically introduced annihilators when he interpreted as (, in our terminology) is nothing.
Lenzen (2004) goes even further. It (pp. 2-3) asserts that Leibniz developed stronger calculi, the most important of them (p.3) “is L1, the full algebra of concepts […], L1 is deductively equivalent or isomorphic to the ordinary algebra of sets. Since Leibniz happened to provide a complete set of axioms for L1, he “discovered” the Boolean algebra 160 years before Boole.”.
Moreover, Lenzen (2004) asserts that Leibniz succeeded in making use of his logical theory to derive the basic laws of Aristotelian syllogism (p. 55). In particular, the Aristotelian inferences may be derived as theorems of L1, or the stronger calculus L2 (p. 56); a detailed discussion of the subject may be found in Lenzen (2004, §8, pp. 55-73). Indeed, as we have shown, AAS does not need all of this.
Boole did more than just algebraically interpreting AAS. In addition to annihilators, which are sufficient for dealing with Aristotelian syllogisms (which involve no term negation), he introduced complementation (which corresponds to term negation) and a second binary operation. This is possibly to:
-
be able to interpret all the Aristotelian categorical sentences into equations (cf. Boole 1948, pp. 20-5),
-
deal with medieval categorical sentences which may involve term negation (cf. Boole 1948, pp. 20, 27-47), or
-
deal with hypotheticals (cf. Boole 1948, pp. 48-59).
In addition to establishing the Aristotelian syllogistic rules, Boole (1948) established some non-Aristotelian ones. For example (p. 37) , where “” denotes “not-”.
Boole (1948) did not address the question of completeness, neither did he consider consequences of more than two premises. However, it discussed (pp. 76-81) a general scheme to solve arbitrarily finite systems of simultaneous equations in arbitrarily finitely many variables; applying, in particular, Lagrange’s method of indeterminate multipliers. This discussion took place after making (p. 18) the confounding assertion “[…] all the processes of common algebra are applicable to the present [Boolean] system.”.
For one more confounding assertion see below.
**18. Inadequacy: bounds of AAs. **Calling the symbols of its system “elective symbols” (p. 16), Boole (1948) makes (p. 59) another confounding assertion: “Every Proposition which language can express may be represented by elective symbols, and the laws of combination of those symbols are in all cases the same; but in one class of instances the symbols have reference to collections of objects, in the other, to the truths of constituent Proposition.”. This, probably, amounts -in modern language- to asserting: Every proposition which language can express is equivalent to a sentential combination of categorical sentences (SCCS for short).
SCCS should be taken seriously, since a stronger assertion has dominated human thought over more than two millennia: Every argument can be put in a syllogistic form. Even Bertrand Russell (1967, p. 198) asserts “Of course it would be possible to re-write mathematical arguments in syllogistic form, but this would be very artificial and would not make them any more cogent.”.
Concerning these assertions, it is worthwile to bring to the fore what Bocheński (1968) calls attention to. On p. 63 it observes that Artistotle “says explicitly that not all logical entailment is “Syllogistic”.”. Moreover it observes on the same page that Aristotle declares that some logical entailments cannot be reduced to syllogisms. So it may be concluded that Artistotle himself contradicts the aforementioned assertions of Boole and Russell, which makes making them deeply confounding, and makes it more urgent for historians of thought to investigate the matter.
Understanding SCCS depends on understanding the notion of categorical sentences. If term negation is permitted, the sentences will be called “Boolean categorical sentences” and the corresponding assertion will be denoted by “SCBCS”. Otherwise, the sentences will be called “Aristotelian categorical sentences” and the corresponding assertion will be denoted by “SCACS”.
Hilbert and Ackermann (1950) formalizes the Boolean categorical sentences (pp. 44-8) and informally refutes SCBCS (pp. 55-6).
To formally discuss SCACS (making use only of the methods developed above and the well known results of sentential logic) augment the alphabet of the language of the natural deduction formalization defined in section 1.4 above, by a ternary relation symbol , and add () to the set of sentences based on . Denote the new set of sentences by “”.
Intuitively, we like to mean that no a which is , is . This may be formalized as follows:
Interpret in a WLS by adding the following provision to the provisions of definition 14.2.
[TABLE]
In an AWLS , is interpreted by adding the following provision to the provisions of part 2 of definitions 16.1.
[TABLE]
Recall that [math] is not in the range of ; also notice that if is a TCLSA, then this provision is equivalent to
5*′*. .
The other syntactical and semantical notions remain the same, or to be appropriately modified in the obvious way.
Let and let . is said to -imply (symbolically ) if for every , whenever . is said to be -equivalent to , or are -equivalent, if each of them -implies the other. is said to be -valid if , it is said to be -consistent if for some .
The above notions may be generalized, in the obvious way, to sets of sentential combinations of elements of . If or is a singleton, it may be replaced by its unique element, e.g. “” may replace “”.
In what follows and are assumed to be pairwise distinct elements of . For every , -implies . The converse depends on . In particular it does not hold for . As a matter of fact we have the following:
THEOREM 18.1.** **Let be a sentential combination of elements of , then:
-
is not -equivalent to , hence
-
is not deductively equivalent to (i.e. one of them does not deductively entail the other), for each deductive system which is sound with respect to .
To prove this, we first prove:
LEMMA 18.2.** **Put:
and
then:
-
is -consistent.
-
is not -implied by any -consistent .
Proof. Part 1 is easy. To see part 2, assume that there is a subset which is both -consistent and -implies . Then there is which is a model of . By theorem 16.4 it may be assumed that for some .
Let for some and let where for every ,
[TABLE]
is a which is basically equivalent to , hence it is a model of ; but it is not a model of . From this the required follows.
Proof of theorem 18.1. Assume that is -equivalent to a sentential combination of elements of , say. Then ( for short) is -equivalent to ( for short) which also is a sentential combination of elements of .
By sentential logic, may be assumed to be a disjunction of conjunctions of elements of and their negations. Since is -consistent and the negation of any element of is -equivalent to some element of , may further be assumed to be a non-empty disjunction of -consistent conjunctions of elements of . Consequently is -implied by each of these conjunctions, which contradicts part 2 of lemma 18.2. From this the required follows.
**Acknowledgements. **Several friends were kind enough to provide me with references which proved to be very helpful. My deep gratitude is hereby expressed to each of them: Wafik Lotfalla, Essawy Amasha, Sharon Amasha, and Fawzy Hegab. I am most indebted to two more friends: Azza Khalifa for pointing out some misprints, and Ahmed Ghaleb for patiently and carefully proofreading the manuscript and transforming its scientific Workplace file into TEX.
**Appendix. **The following algorithm, to generate the first primes, may not be efficient, but it is simple, and its running time (see below) makes it sufficient for our purposes.
Input: (positive integer)
Output: (the strictly increasing list of the first primes)
Procedure:
Declare natural number parameters;
;
If go to
Else , ,
End If;
For1 do
For2 do
For3 do
If go to
Else
End If;
Repeat
End For3;
If go to
Else
End If;
Repeat
End For2;
Repeat
End For1;
Print ;
End Algorithm.
The termination of this algorithm is guaranteed by the respective upper bounds stipulated at the beginnings of the three For loops. The correctness is guaranteed by the well known fact which goes back to Euclid’s Elements: , together with the simple fact that is the first (odd) integer greater than , which is not a multiple of any of .
To estimate the running time, notice that (Landau 1958, p. 91) for large , . For such the For1 loop is iterated at most times, for each iteration the For2 loop is iterated at most times, and for each of these iterations the For3 loop is iterated at most times. All the steps of the algorithm are simple assignment or comparison steps, the only exception is the test which needs at most () simple steps. So the total running time is a polynomial in , of degree at most .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Adzic, M. and Dosen, K. (2016). Gödel’s Notre Dame course. The Bulletin of Symbolic Logic , 22 (4), 469-481.
- 2[2] Bellucci, F. and Pietarinen, A. (2016). Existential graphs as an Instrument of Logical analysis: part I. alpha. The Review of Symbolic Logic , 9 (2), 209-237.
- 3[3] Bocheński, I.M. (1968). Ancient Formal Logic . Amsterdam: North Holland.
- 4[4] Boger, G. (1998). Completion, reduction and analysis: three proof-theoretic processes in Aristotle’s Prior Analytics. History and Philosophy of Logic , 19 , 187-226.
- 5[5] Boole, G. (1948). The Mathematical Analysis of Logic . Oxford: Basil Blackwell.
- 6[6] Corcoran, J. (1972). Completeness of an ancient logic. The Journal of Symbolic Logic , 37 , 696-702.
- 7[7] Frege, G. (1967). Begriffsschrift, a formula language modeled upon that of arithmetic for pure thought. In van Heijenoort, J. ed., From Frege to Gödel: A Source Book in Mathematical Logic, 1879-1931 . Harvard University Press, Cambridge, MA, 1-82.
- 8[8] Glashoff, K. (2002). On Leibniz’s characteristic numbers. Studia Leibnitiana , XXXIV/2 , 161-184.
