Truth and Feasible Reducibility
Ali Enayat, Mateusz {\L}e{\l}yk, Bartosz Wcis{\l}o

TL;DR
This paper proves that three canonical truth theories based on Peano arithmetic are feasibly reducible to PA, meaning proofs in these theories can be efficiently translated into PA proofs, showing no significant speed-up.
Contribution
It establishes that these truth theories are polynomial-time reducible to PA, contrasting with the behavior over finitely axiomatizable base theories.
Findings
Feasibly reducible to PA with polynomial time translation
No significant speed-up over PA for these truth theories
Contrasts with finitely axiomatizable base theories
Abstract
Let be any of the three canonical truth theories (Compositional truth without extra induction), (Friedman--Sheard truth without extra induction), and (Kripke--Feferman truth without extra induction), where the base theory of is (Peano arithmetic). We show that is \textit{feasibly reducible to} , i.e., there is a polynomial time computable function such that for any proof of an arithmetical sentence in , is a proof of in . In particular, has at most polynomial speed-up over , in sharp contrast to the situation for for \textit{finitely axiomatizable} base theories .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Truth and Feasible Reducibility
Ali Enayat, Mateusz Łełyk, Bartosz Wcisło
Abstract
Let be any of the three canonical truth theories (Compositional truth without extra induction), (Friedman–Sheard truth without extra induction), and (Kripke–Feferman truth without extra induction), where the base theory of is PA (Peano arithmetic). We show that is feasibly reducible to PA, i.e., there is a polynomial time computable function such that for any proof of an arithmetical sentence in , is a proof of in PA. In particular, has at most polynomial speed-up over PA, in sharp contrast to the situation for for finitely axiomatizable base theories B.
Contents
1 Introduction
One of the celebrated results in the area of axiomatic theories of truth is the Krajewski-Kotlarski-Lachlan (KKL) theorem [13] that asserts that every countable recursively saturated model of PA (Peano arithmetic) is expandable to a model of (compositional truth over PA with no extra induction111This theory is referred to as in [10], in [2], and in [5].). The KKL theorem is an overtly model-theoretic result, but it is well-known that it is equivalent to the conservativity of over PA.222This equivalence follows from two key facts: (1) every countable consistent theory has a countable recursively saturated model, and (2) countable recursively saturated models are resplendent, both of which can be verified in the subsystem of second order arithmetic. Recent proofs of the KKL theorem given by Enayat and Visser [5] (using model-theoretic techniques) and Leigh [15] (using proof-theoretic machinery) show that is conservative over B for every "base theory" B (i.e., a theory B that supports a modicum of coding machinery for handling elementary syntax). Leigh’s proof makes it clear that if B is a base theory with a computable set of axioms, then is proof-theoretically reducible to B, and in particular, there is a primitive recursive function such that for any proof of a sentence in , where is a sentence in the language of B, is a proof of in B. Indeed, Leigh’s "reducing function" is readily seen to be a provably total function of the fragment of (Primitive Recursive Arithmetic) commonly known as 333 asserts the totality of the superexponential function , with and Leigh [15] refers to this function as hyper-exponentiaton.
The main result of this paper shows that is feasibly reducible to PA, i.e., there is a polynomial-time computable function such that for any proof of an arithmetical sentence in , is a proof of in . The feasible reducibility of to PA readily implies that does not exhibit significant speed-up over PA, i.e., there is a polynomial function such that for any arithmetical sentence , if is provable in by a proof of length (in some standard proof system444 The choice of the ”standard proof system” is immaterial since it is well-known that any two such systems polynomially simulate each other [18].), then is provable in by a proof of length . This solves a problem posed by Enayat in 2012 [4].
The absence of significant speed up of over PA implied by the feasible reducibility of to PA exhibits a dramatic difference between and for finitely axiomatized base theories B, since as shown by Fischer [7], has superexponential speed-up over B for finitely axiomatized base theories B, and therefore, is not feasibly reducible to B for finitely axiomatized base theories B. It is also known that (where is the axiom of internal induction) is conservative over PA ([13], [15]) but not feasibly reducible to PA since it has superexponential speed-up over PA [7].
Our proof of the feasible reduction of to PA includes the verification that PA proves the formal consistency of every finite subtheory of , thereby establishing that is a reflexive theory. This result follows from Leigh’s work [15]; and was also established by Enayat and Visser (unpublished) with help of the "low basis theorem" of computability theory to arithmetize their model-theoretic proof of conservativity of over PA. The proof presented here, however, is based on a simpler arithmetization of the Enayat–Visser construction and does not appeal to the low basis theorem; the syntactic analysis of this arithmetization forms one of the main ingredients of the proof of our main result.
We also employ the machinery developed for the proof of our main result to analyse two other prominent theories of truth, namely (Friedman–Sheard theory of truth over PA, with no extra induction), and (Kripke–Feferman theory of truth over PA with no extra induction). More specifically, we show that and are both reflexive and feasibly reducible to . These results, in turn, show that both and are interpretable in and have at most polynomial speed-up over .
A word about the organization of the paper is in order. Section 2 deals with arithmetical preliminaries and technical machinery that will be employed for establishing our principal results. Section 3 presents basic definitions and facts about the truth theories , , and , including their conservativity over PA. The main results of the paper are contained in Section 4, which contains the proofs of feasible reduction of , , and to PA; these proofs should be viewed as refined arithmetizations of the conservativity proofs presented in Section 3.3. The last subsection of Section 4, on the other hand, spells out the interpretability-theoretic ramifications of our work. Section 5 collects some open questions; and the Appendix (Section 6) consists of routine-but-technical proofs of certain results employed in the body of the paper.
2 Setting the stage: arithmetical machinery
This section discusses basic notions and fundamental machinery that can be generally described as refined arithmetization of certain parts of proof theory and model theory that play a key role in the statements and proofs of our main results in Section 4. Note, however that the material in Subsection 2.6 will be only employed in Subsection 4.4.
2.1 Arithmetized syntax
In this paper, PA denotes the theory using as non-logical function symbols, whose axioms consist of the axioms of Robinson’s Arithmetic Q together with the usual induction scheme for the whole language of PA. Its intended model are natural numbers with addition, multiplication and successor functions. We will also denote by , typically when treating it as set of indices for some construction. Sometimes in this paper, we will be referring to or when working in PA. Then these symbols simply refer to the whole universe. This should not lead to any confusion.
Crucially for our purposes, PA is capable of representing syntax. This means that in PA, one can employ recursion to define notions such as "term," "formula" or "proof in PA" similarly to how these notions are defined in Zermelo–Fränkel set theory. This is a standard topic, covered, e.g., in [11] or [9].
Every sequence is represented by a number
[TABLE]
Each formula is represented as a -string and then coded as a number. If is a binary string, then denotes its length.
Throughout the paper we will use certain formulae to represent various syntactic and technical notions. For the convenience of our reader, we gather here all the notation which might be possibly confusing.
Definition 1** (Arithmetized syntax).**
asserts: " is a sequence and its length is equal to ." 2. 2.
asserts: " is a term of a language ." For instance, asserts that is a (code of an) arithmetical term. 3. 3.
asserts: " is a closed term of a language ," i.e. a term without free variables. 4. 4.
asserts: " is a sequence of terms of the language ." 5. 5.
asserts: " is a sequence of closed terms of the language ." 6. 6.
asserts: " and is the value of the term ." For instance,
[TABLE]
holds provably in PA ( stands for the number coding the term ). 7. 7.
asserts: " is a variable." For instance, means that is a code of a variable in the coded language. Since without loss of generality we can assume that all first-order languages have the same set of variables, we omit the reference to a specific language in the subscript. 8. 8.
asserts: " is a formula of the language ." 9. 9.
asserts: " is a free variable of ," where is either a term or a formula. 10. 10.
asserts: " is a formula of the language with at most one free variable", and asserts: " is a formula of the language with exactly one free variable." 11. 11.
asserts: " is a sentence of the language ." 12. 12.
asserts: " is a (coded) sequence whose elements are (some) free variables of ," where is either a term or a formula. 13. 13.
is an assignment for a formula or a term if is a function whose domain includes the free variables of . The formula asserts: " is an assignment for ," where is a term or a formula. We will often denote it with . If is a coded set or sequence, we will write to denote that is an assignment for all elements of . We will sometimes also write or meaning , where 14. 14.
asserts: " and are assignments, is a variable, and for all variables , possibly except for which belongs to the domain of (and not necessarily to the domain of )."
The reader could expect to see in this list certain other predicates such as Proof or Con. Since we will need some more precise information about these formulae and their length, they will be only introduced in Subsection 2.3.
Next we introduce numerals. For our purposes the numeral for a number will not be a code of iterated times, since this is not an efficient notation when it comes to writing short formulae and proofs. Our numerals will be handled via binary expansions.
Definition 2** (Numerals).**
For any natural number , denotes the binary expansion of written as a term of . More precisely: let , where . We define:
[TABLE]
where , and . Thus it takes symbols to represent .
Let us introduce one more definition:
Definition 3** (Substitutions).**
Let be a formula with free variables shown and let be an assingment for . By we denote the formula in which the numeral (in the sense of Definition 2) denoting is substituted for the variable . 2. 2.
Similarly, if and , then by we mean the closed term obtained by substituting the numeral for each free variable in . 3. 3.
If and , then by we mean Notice that if is a variable and , then provably in PA. 4. 4.
If for some language , and , then is an arithmetical formula which asserts " is the effect of substituting in the formula the term for every occurrence of the variable ."
We adopt a few conventions as to how we will be dealing with formalized syntactic notions.
Convention 4**.**
If is a standard formula or a standard term, we will sometimes denote the corresponding code by to prevent ambiguity, but most of the time we skip the corners to lighten the notation. 2. 2.
We can clearly code a formula as a binary string. Then by we mean the length of this string. Note that is of size exponential in and is of size linear in . We deal with proofs in in a similar fashion. 3. 3.
Since we will sometimes skip the corners denoting formulae, is most of the time denoted by for a standard formula . 4. 4.
We will sometimes use the formulae defining syntactic notions as if they were denoting sets. For example, we will sometimes write "" rather than "." 5. 5.
We will use provably functional formulae such as , , or as if they were terms. 6. 6.
For better readability we will sometimes skip formulae denoting syntactic operations and write the effect of the operations instead. Thus, for example, we will write to denote "There exists which is the negation of the sentence and ."
2.2 Arithmetized model theory
Peano arithmetic is capable of accommodating a substantial part of the model theory of countable structures. We will make constant use of this fact throughout the whole paper. This subsection briefly introduces the reader to this topic. The rough convention is as follows: a theory is a definable set of sentences. If is a formula which defines a set of (codes of) sentences, then we call that formula a theory.
Models come in two kinds. By a full model we mean the elementary diagram of that model (or, actually, a formula defining the elementary diagram). It is given as a complete Henkinized theory. By a model , we mean a formula defining its domain and some relations on that domain (this does not mean that we only deal with models of relational languages, but rather, we construe the denotations of function and constant symbols as relations).
Definition 5**.**
A formula defines a theory in a language if for all , \phi(x)\rightarrow\bigl{(}x\in\textnormal{{Sent}}_{\mathcal{L}}\bigr{)} holds. By a full model of a theory , we mean a theory in a language expanding with some constants (possibly trivially) such that:
- •
is complete and consistent, so for any , if and only if .
- •
has all existential statements witnessed by constants which means that if , then for some constant , .
By a model of a language (or simply an -structure), we mean a formula which defines a set of (coded) sequences such that if , then:
- •
is either a symbol of the language or some fixed element which is not a symbol of .
- •
If is a relation symbol of , then the length of is the arity of plus one.
- •
If is a function symbol of , then the length of is the arity of plus two. We treat constants as functions of arity zero.
- •
If is the fixed element , then has length two.
- •
If for some , then holds. ( is in the domain of a model.)
In the above, a model is essentially defined as a particular kind of tuple: (definition of the domain, definition of the first relation, definition of the second relation ). However, since we allow models with infinite signatures, we define it in the above compact way rather than an actual tuple of formulae. However, if a model is defined with a standard number of definable relations, we can easily construct a definition in the format specified above. The second warning is as follows: although officially in this paper a full model is the same as the elementary diagram of that model, in practice, we will refer to models in the usual way, since it is clear how to transfer statements about models to statements about their elementary diagrams and vice versa.
Definition 6**.**
If is a full model, we write to say that is a constant in the language of a full model . (This means that is an element of , since we implicitly assume that all full models are built on Henkin constants.) If is a model, the expressions , means that holds. 2. 2.
If is a (full) model over a language , and , we say that is an -assignment or an -valuation for a formula if is a (coded) finite function, whose domain contains , , and for every , . We denote this by . 3. 3.
If is a full model over a language , , and is an -assignment, then the relation is defined simply as . 4. 4.
If is a model over a language , , and is an -assignment, then the relation is defined only for of standard complexity via the usual compositional conditions with quantifiers restricted to the domain of and satisfaction for base relations (of arity ) defined as follows:
[TABLE]
We define satisfaction for equalities of terms in an analogous fashion. 5. 5.
If is a full model, we will write (elementary diagram of ) instead of when we want to stress that we are thinking of a theory rather than of a structure. 6. 6.
If is a (full) model over a language , and is a formula with the displayed free variables, we will write
[TABLE]
meaning there there exists an -valuation for such that for all and 7. 7.
If is a (full) model over a language and with all free variables displayed, then by we mean the set of (tuples of) elements defined by the formula in . In other words, it is the set of tuples of the elements of such that for some (equivalently, any) such that .
Note that we haven’t yet defined what it means that a model satisfies a theory. This is not an omission. Since for general (not full) models, satisfaction is defined only for standard sentences, we only define satisfaction for standard formulae. This is actually a scheme: for each formula we define what it means that a model satisfies this formula. More precisely: for each , we define what it means that a model satisfies a formula of depth .
On the other hand, in our paper, non-full models will play a crucial role and in some specific circumstances we are going to say that a model satisfies a theory. This will be defined in some specific cases of our interest later in Definition 29.
Let us define some more notions which will be particularly important in further parts of our paper.
Definition 7**.**
Let be full models of theories in the same language. We say that is an elementary extension of if is contained in .
Recall that officially, a full model is the same as the elementary diagram of that model, so elementary submodels in our sense correspond to elementary submodels in the usual sense. In what follows, we will sometimes conflate an elementary submodel with an image of the elementary embedding. This should be understood in the obvious way: a formula defines an elementary embedding between models and if it is a injection on elements of the models (i.e., it is a relation on constants such that together imply and together imply that ) and the image is an elementary submodel of (i.e, the restriction of to the language with constants representing the image of is a full model).
We will denote both being an elementary submodel and being an image of an elementary embedding with
[TABLE]
Sometimes we only require that the elementarity relation only holds for formulae in a language such that are full models over a language containing . We then write:
[TABLE]
Crucially, in this paper we will be looking at the situation where is a full model, is a (non-full) model over a bigger language and is an -elementary extension of . This is a slightly subtler notion which will be made precise in Definition 30.
Definition 8**.**
Let be models or full models in languages , respectively. We say that is an expansion of if the following conditions are satisfied:
- •
.
- •
For every element there is an element such that , where , . (That is, the domain does not change. We write it in this slightly convoluted manner, since we want the definition to work both for models and full models).
- •
For every atomic formula and -assignment for , if and only if .
We can say that PA is capable of handling basic model theory, since it is able to capture the link between models and consistent theories. In the context of our definitions, this means that in PA every consistent theory can be extended to a complete consistent theory with Henkin constants.
Theorem 9** (Arithmetized Completeness Theorem).**
For every , PA proves that if is a consistent -theory, then it has a -full model.
By -theory, we mean a set of sentences defined both with a -formula and a -formula. -full model is defined in an analogous way. The above theorem is standard, cf. Section 13.2 of Kaye’s book [11]. Throughout the paper, we will be using the following handy conventions concerning models:
Convention 10**.**
When there is no risk of confusion, we will use the same symbol for a predicate symbol and for its denotation in a given (full) model. 2. 2.
We will sometimes denote (full) models as tuples, like . This will simply mean that is an expansion of with a predicate .
2.3 Lengths of proofs
This section summarizes the basic definitions and tools that we will need in connection with analysing lengths of proofs; much of this material is taken from Pudlák’s survey article [18].
Definition 11**.**
Recall that denotes the length of binary code of and is its Gödel number (which we denote as by our Convention 4).
Let be an -formula.
[TABLE] 2. 2.
If , we write also
[TABLE]
We treat theories simply as sets of sentences, not necessarily closed under deductions. This is because changing the axiomatization of a theory might result in facilitating proofs.
Definition 12** (Simulations, Speed-up, Reducibility).**
Let and be two theories and a family of functions . We shall say that -simulates iff there exists a function such that for every sentence that is provable in both and , and for every , we have:
[TABLE]
We say that is -reducible to if there exists a function such that for every we have:
[TABLE]
We say that has a super- speed-up over if does not -simulate .
Typical examples in the literature of speed-up (simulability) phenomena concern the cases where is the family of either polynomial or elementary functions, which respectively correspond to super-polynomial speed-up (or polynomial simulation) and super-exponential (or super-elementary) speed-up.
Remark 13**.**
The main focus of our paper is the relation of -reducibility, where is the family of all P-time computable functions. Recall that is a P-time computable function if it is a total function such that for each (the binary code of) can be computed by a deterministic Turing machine which takes as input (the binary code of) (see [9] for the precise formulation) and which runs in polynomial time. We call this relation feasible reducibility.
The proofs of the following observations are routine:
Observation 14**.**
Let be any family of functions from to .
If is -reducible to , then -simulates . Moreover if is feasibly reducible to , then polynomially simulates . 2. 2.
If is countable, then has a super speed-up over if there exists an infinite sequence of formulae provable in both and such that for every function there exists such that
[TABLE]
The most prominent role in the investigations in the lengths of proofs is played by consistency statements. We shall now discuss arithmetized provability. Recall that denotes the length of the binary expansion of .
Definition 15** (Pudlák, [17]).**
Let be a theory, be a formula and be a relation. We say that polynomially numerates in if there exists a polynomial such that for all natural numbers
[TABLE]
Theorem 16** (Pudlák, [17] Theorem 3.2).**
For any consistent theory with an NP-time set of axioms555An NP-time set or relation is one whose characteristic function can be computed by a non-deterministic Turing machine that runs in polynomial time. and any the following are equivalent:
* is an NP-time relation;* 2. 2.
* is polynomially numerable in Q;* 3. 3.
* is polynomially numerable in .*
Since on hand in our paper, we want slightly stronger results concerning feasible reducibility between theories rather than mere facts about speed-up, and on the other, we work with relatively strong theories anyway, we will use a modification of Pudlák’s result, (which actually is simpler than the original theorem).
Definition 17**.**
Let be a theory, be a formula and be a relation. We say that uniformly polynomially numerates in if there exists a P-time computable function such that for all natural numbers , holds iff is a proof of in .
In what follows, we need only the following simple fact which may be proved by a natural formalisation of Turing Machines in . It is significantly simpler than Pudlák’s result, since we do not need to consider cuts or use cut-shortening techniques.
Theorem 18**.**
For any consistent theory with a set of axioms computable in polynomial time, and any the following are equivalent:
* is a P-time relation;* 2. 2.
* is uniformly polynomially numerable in ;* 3. 3.
* is uniformly polynomially numerable in .*
Corollary 19**.**
Let be a theory with a P-time set of axioms. Then there exists a formalization of the relation
[TABLE]
and a polynomially computable function such that
[TABLE]
implies that is a proof of in .
Moreover, we define formulae Con and Pr in the usual manner as follows:
[TABLE]
and
[TABLE]
If not stated otherwise, throughout this paper , and for range over P-time axiomatizable theories.
Remark 20** (Relativized provability predicates).**
The content of this remark will be needed only in Section 4.4. The formalization of the provability predicate from Corollary 19 is of the form: "There exists an accepting computation of the Turing machine which recognizes proofs". Let be any arithmetical formula. By writing
[TABLE]
we mean the relativized version of the above predicate, i.e., the one in which the relevant Turing machine is supplied with an oracle given by and recognizes the theorems of (whatever means). We can treat as a new arithmetized theory, but then in typical cases it won’t be . This is why we decide to distinguish between the roles played by the lower and the upper indices in : the former will be reserved for theories satisfying the thesis of Corollary 19 and the latter for arbitrary formulae, playing the roles of oracles. Obviously the relativized version of Corollary 19 need not be true, but we will only demand that the following two conditions hold:
There exists a P-time computable function such that for all and all , is a PA proof of
[TABLE]
where denotes the chosen arithmetical definition of . 2. 2.
Likewise, there exists a P-time computable function such that is a PA proof of the sentence
[TABLE]
This requires that be constructed uniformly in , which can certainly be assured.
In particular, is of length polynomial in the lengths of and the chosen definition of . As usual
[TABLE]
and
[TABLE]
The following theorem gives a canonical example of a family of sentences whose proofs grow super-exponentially. Let denote the canonical sentence expressing "there is a proof of sentence from the axioms of of length at most ." Then .
Theorem 21** (Pudlák,[18], Theorem 7.2.2).**
Let be a sufficiently strong theory. Let be an increasing computable function, provably total in , whose graph has a polynomial numeration in . Then there exists a such that
[TABLE]
In particular, for , where , and , there is some some such that:
[TABLE]
2.4 Feasible truth predicates
Now we turn to arithmetized partial truth predicates, which we want to apply to arbitrary sentences of fixed complexity, defined here in a way that is different from the usual one (i.e., , ). The measure of complexity will be given by the depth of formulae, as defined below.
Definition 22**.**
The depth of a formula is the length of the longest path in its syntactic tree (which is allowed to contain arbitrary terms as leaves). denotes an arithmetical formula representing the relation "The depth of a formula is at most ." We will also write it as .
For example has depth .
To state some results in greater generality we will use Pudlák’s notion of sequential theories (since there seems to be more than one good definition of sequentiality, we include Pudlák’s definition):
Definition 23** (Pudlák, [17]).**
A theory is sequential, if
Robinson’s arithmetic Q is interpretable in relativized to some formula of and 2. 2.
there exists a formula (of two variables , ) that defines in a total function (in both variables) and such that proves
[TABLE]
Now the promised "feasible" partial truth predicates:
Theorem 24** (Pudlák, [18], Theorem 3.3.1).**
Let be a sequential theory. There is a family of formulae and a polynomial such that for every there exists a -proof of compositional Tarski’s conditions for such that the size of that proof is less than . Moreover for every of length less than , the shortest proof of
[TABLE]
is less than .
Moreover, by inspection of Pudlák’s proof one quickly sees that in fact finding the above mentioned proofs of polynomial length is feasible. This means that the above theorem can be slightly strengthened as in the theorem below.
Theorem 25**.**
Let be a sequential theory. There is a family of formulae and P-time computable functions , such that for every , is a proof of compositional Tarski’s conditions for . Moreover for every of length less than , is a -proof of
[TABLE]
In what follows, abbreviates .
Observation 26**.**
There exists a P-time computable function such that for every , is a proof in PA of
[TABLE]
The proof uses induction (in PA) on the complexity of and provable Tarski biconditionals for predicates.
In further parts of the paper, we will also need the relativized version of the Theorem 25 (we state it only for PA). If is a -model (not necessarily a full one) for a language with finitely many fresh non-arithmetical relational symbols, then by -relativized Tarski’s conditions for a formula we mean the usual statement that satisfies Tarski’s compositional truth conditions in which the condition for atomic formulae is:
[TABLE]
for an arbitrary relation in , and the condition for the existential quantifier is given by
[TABLE]
Corollary 27**.**
Let be any -model. There is a family of formulae and P-time computable functions , such that for every , is a PA-proof of the -relativized compositional Tarski’s conditions for . Moreover for every of length less than , is a PA-proof of
[TABLE]
The family can be defined essentially by relativizing predicates from Theorem 24. Since the definition of does not depend on , the length of will be polynomial in .
As above, we will write to denote satisfaction under the empty valuation. Occasionally, we will use the handy notational convention described below.
Convention 28**.**
If is a formula with standardly many many variables, we will be writing to denote
[TABLE]
where is some valuation which assigns to the variables .
Let us end this subsection with a definition of satisfaction of theories for a larger class of models.
Definition 29**.**
Let . Let be a model over a language and let be its expansion to an -structure. Suppose that is a theory over the language such that consists only of sentences of depth . We say that
[TABLE]
if for all .
The above definition is actually a scheme. We define separately for all standard what it means for an expanded structure to satisfy a theory whose axioms have depth bounded by . Note that whenever is an expansion of a full model and extends B with finitely many standard sentences , the condition means simply that for . Let us introduce one more definition in the similar spirit.
Definition 30**.**
Let , where B is a theory over language . Let be a theory over language such that consists only of sentences of depth . Suppose that is an expansion of . We say that is an -elementary extension of if is an elementary extension of . We denote it with
[TABLE]
2.5 Polynomial simulations and feasible reductions for theories extending PA.
Let us fix such that conservatively extends . It turns out that in order to verify that is feasibly reducible to it is sufficient to demonstrate that the formalized conservativity statements for finite fragments of over finite fragments of are feasibly provable in . The theorem below makes this precise.666We are grateful to Fedor Pakhomov who pointed out to us that this is the most direct way of proving our main results. Our previous proofs employed the conceptually more transparent—but technically more demanding—framework of feasible interpretations, as explained in Subsection 4.4. Before stating the theorem, we need a definition.
Definition 31**.**
Let be a theory. By we mean the set of axioms of of length at most . Abusing notation, we treat as the canonical formula representing in .
Theorem 32**.**
Let be a theory extending PA with an NP-set of axioms. If there is a polynomial such that for every ,
[TABLE]
then PA polynomially simulates . Moreover, if admits a P-time computable set of axioms and there exists a P-time computable function and a polynomial such that for all , is a PA-proof of
[TABLE]
then is feasibly reducible to PA.
Recall that is the height of the syntactic tree of and that we use this symbol for the arithmetic formula representing this function. The proof of Theorem 32 will be facilitated by the following lemma which shows that PA is feasibly strongly reflexive.
Lemma 33**.**
There exists a P-time computable function such that for every , is a PA proof of:
[TABLE]
Proof of Lemma 33.
The proof follows the usual pattern, but we have to check that each transformation at work is feasible. Here we provide the general outline; the details are verified carefully in Subsection 6.1 of the Appendix. Assume first that . Working in PA, we first prove cut-elimination for First Order Logic (this is a single sentence independent of ). Then we show that every axiom of PA of length is true. For finitely many axioms of Robinson’s Q, this is done independently of . For induction axioms of length at most we use Proposition 80, and for logical axioms we use Proposition 79. Next we apply cut-elimination over First Order Logic to show that for a sentence of depth if then there is a cut-free proof of a sequent
[TABLE]
where contains only axioms of . By the subformula property, in such a proof every formula is of depth bounded by . Then, using induction on the number of proof lines in a proof using only formulae of depth at most , we show that if
[TABLE]
is provable, then we have:
[TABLE]
where abbreviates , as in Definition 1. Since we already know that all axioms are true, we conclude that is true.
If , then it is sufficient to carry out the above proof substituting for everywhere and use Observation 26. Note that all the transformations above are uniform in , hence in particular they give rise to a function as in the thesis of the lemma. ∎
Proof of Theorem 32.
Assume and . Then in fact and is of depth at most . Let code this proof. Then, by the properties of the provability predicate we have:
[TABLE]
Hence also . By the formalized conservativity we have that ; by Lemma 33, we have ; and by feasibly provable -biconditionals (Theorem 24) we obtain . All the intermediate steps are polynomial in .
Also, if has a P-time computable axiomatization, by Corollary 19 there exists a P-time computable function such that given a proof of a formula , is the proof of the sentence
[TABLE]
Given a function as in our assumptions and the function as in Lemma 33 one easily defines the appropriate feasible reduction by concatenating the proofs given by , and as in the proof for polynomial simulation. ∎
Corollary 34**.**
Suppose that there exists a polynomial and there is a such that for every , , where expresses "Every -full model of has an -elementary extension to a full -model of ." Then PA polynomially simulates . Moreover, if is P-time, and there exists a P-time function such that for all , is a PA-proof of , then is feasibly reducible to PA.
Let us make one remark before we proceed to the proof of Corollary 34. Recall from Subsection 2.2 that in the current paper we treat full models as specific arithmetically definable sets of sentences. Note that although we cannot quantify over models in general, we can do this for models of fixed quantifier complexity using arithmetical satisfaction predicates.
Proof of Corollary 34.
Fix and polynomial . We shall prove the assumptions of Theorem 32. Fix . Start the proof by showing using a subproof of length . Then fix of depth and assume:
[TABLE]
It immediately follows that is consistent. We check that this is a -theory (this verification is polynomial in the definition of , hence polynomial in ). We prove the Arithmetized Completeness Theorem 9 for -theories; this is independent of and gives us a -full model of . Now, by , this model has an elementary extension to a full model of . By elementarity is consistent, which ends the feasible proof of the conservativity claim.
The proof of moreover part of the theorem is fully analogous: we apply it to the respective part of Theorem 32. ∎
Corollary 35**.**
Suppose there is a polynomial and there is a such that for every , , where expresses "Every -full model of has an -elementary extension to a -model of ." Then PA polynomially simulates . Moreover, if is P-time and there exists a P-time computable function such that for all , is a PA-proof of , then is feasibly reducible to PA.
Note that above -elementary extensions are understood in the sense of Definition 30.
Proof.
We show that the above assumption implies the assumption of the previous corollary. Fix . Take and as in the assumptions. Work in PA. Fix an arbitrary -full model of . Then there exists a -model which is an elementary extension of . We argue that the -theory
[TABLE]
is consistent. This will finish the proof, since by Arithmetized Completeness Theorem we will get a -full model of this theory (the length of this subproof is polynomial in as the proof of ACT is independent of ).
Take an arbitrary proof of a sentence in and prove cut-elimination for first order logic (the length of this proof is independent of ) and conclude that there exists a proof of in with the subformula property. It follows that every formula in this proof is either an arithmetical formula or is a subformula of an additional axiom of . In particular non-arithmetical formulae which occur in this proof are of depth bounded by . Define:
[TABLE]
where is a feasible relativized truth predicate from Corollary 27. By induction on the length of show that if occurs in , then for every
[TABLE]
It follows that cannot be a proof of the empty sequent, hence is consistent. ∎
The following is the ultimate corollary that best fits the proofs of our main results:
Corollary 36**.**
Suppose that is a finite extension of PA of the form , and the following sentence is provable in PA:
"If B is any finite fragment of PA, then every -full model of B has an elementary extension to a -model of ."
Then is feasibly reducible to PA.
Proof.
The assumptions of Corollary 36 clearly implies that the assumptions of Corollary 35, as the verification that B is a -finite fragment of PA can be done quickly and uniformly in . ∎
We will end this subsection with two simple observations which may be obtained by inspection of the proof of Theorem 32 and the proof of subsequent corollaries. They provide some slightly different sufficient condition for feasible reducibility. Observation 38 will be useful in Subsection 4.3.
Observation 37**.**
Suppose that is a theory with a P-time set of axioms which is PA-provably feasibly strongly reflexive, i.e., there exists a P-time computable function such that for each , is a PA proof of the sentence
[TABLE]
Then is feasibly reducible to PA.
Observation 38**.**
Suppose that satisfies assumptions of Corollary 35 (with the "moreover" part) or Corollary 36. Then is PA-provably feasibly strongly reflexive.
Proof.
By (very direct) inspection of proofs of Corollaries 35 and 36, we see that if satisfies assumptions of any of these statements, then there exists a P-time computable function and a polynomial such that for each , is a PA-proof of
[TABLE]
It follows that is PA-provably feasibly strongly reflexive. Indeed , fix the above mentioned function , polynomial and witnessing the feasible reflexivity of PA. We define . Compute first , then compute , i.e. the proof of
[TABLE]
Then after performing some fixed number of logical transformations obtain the proof of * ‣ 37. ∎
2.6 Feasible interpretability and speed-up
This section presents sufficient conditions for feasible interpretability, a notion that will be only used in Subsection 4.4. All the basic notions relied to relative interpretability can be found in [9], Chapter III. We begin with an observation of Albert Visser, presented in [7].777The formulation of the cited theorem is, however, is not quite right since it is claimed that it holds without any restrictions on , which is incorrect, e.g., let . Then the identity interpretation witnesses relative interpretability of in PA, but the former theory has super-exponential speed-up over the latter. However, with the proviso that is fintiely axiomatizable the proof goes through. Also the proof of our Proposition 42 can be easily adapted to this case.
Theorem 39**.**
Suppose that , where both and are formulated in the language of PA and is finitely axiomatizable and relatively interpretable in . Then polynomially simulates with respect to -sentences.
The crucial insight in the proof of the above theorem is that relative interpretations are always -correct if the interpreting theory is an extension of PA in the same language, i.e., for every -sentence of we have:
[TABLE]
where is the chosen relative interpretation of in .
The next definition is formulated so as to allow us to lift the -correctness in Theorem 39 to wider-and-wider class of sentences so as to establish polynomial simulations of non-finitely axiomatizable theories.
Definition 40**.**
Let , be two theories each of which has an NP-set of axioms.
An interpretation is -correct if for an arbitrary sentence of depth we have:
[TABLE] 2. 2.
An interpretation is *feasible888Feasible interpretations are thoroughly investigated in Verbrugge’s doctoral thesis [20]. In particular, as shown in Theorem 6.4.2 of [20], there is a sentence such that is interpretable in PA, and yet there is no feasible interpretation of in PA. * if there is a polynomial such that for all arbitrary sentences and all we have:
[TABLE] 3. 3.
A family of interpretations is polynomially correct if for each , is -correct and there exists a polynomial such that the following two conditions hold:
- (a)
witnesses that is a feasible interpretation, i.e., for every , and for every sentence of ,
[TABLE]
. 2. (b)
For every and every sentence of length at most ,
[TABLE] 4. 4.
A family of interpretations is uniformly polynomially correct if for each , is -correct and there exist P-time computable functions , such that the following two conditions hold:
- (a)
For every , and every -proof of is a -proof of . 2. (b)
For every , and every sentence of length at most , is a -proof of
[TABLE]
Remark 41**.**
In the context of theories with no additional rules of reasoning, condition (a) in the definition of polynomial correctness (point 3.) can equivalently be replaced with the following one
- (a)’.
For every ,
[TABLE]
for every axiom of (including the logical axioms for ) of length at most .
(Analogously for the uniform version.) However, we prefer (a) over (a)’ as it can be used in the context of theories such as which is closed under two additional rules of reasoning: NEC and CONEC.
The proposition below follows simply by unravelling the relevant definitions:
Proposition 42**.**
If there exists a polynomially correct family of interpretations , then polynomially simulates . Moreover, if is a uniformly polynomially correct family of interpretations, then is feasibly reducible to .
Proof.
Fix a polynomially correct family of interpretations and let be a polynomial witnessing this. Suppose that for some . Then
- •
is of length at most ,
- •
every axiom of which occurs in the proof is of length at most , and
Hence, by Definition 40, . Since is -correct, we have that
[TABLE]
The moreover part is fully analogous. ∎
Remark 43**.**
By the inspection of the proof one quickly realizes that in fact the requirements for the family can be relaxed even further. We do not have to demand that each interprets the whole theory , instead we can make the weaker demand that there is a polynomial such that for every we have:
[TABLE]
where denotes the set of axioms of of length at most .
3 Dramatis personæ: typed and untyped theories of truth
In this section B denotes a "base theory" for a theory of truth, i.e., a theory with a modicum of arithmetic capable of handling syntax. For example any theory extending will do. denotes a fresh unary predicate that is not in the language of B. denotes the language of B and denotes the language of B enriched with the predicate . For simplicity assume that the signature of extends the arithmetical signature with finitely many relational symbols.
In this paper, we will be dealing with theories of truth conservative over their base theories. We say that a theory in the language is conservative over if for every sentence we have:
[TABLE]
In our case, this means that adding the truth predicate and some axioms governing its behaviour does not allow us to prove new arithmetical sentences.
Below, we discuss some prominent examples of truth theories. A standard reference to the subject is Halbach’s book [10].
3.1
Definition 44**.**
is the theory extending a theory B with the following sentences:
. 2.
, for every relational symbol of . 3.
. 4.
. 5.
. 6.
\forall\phi(\bar{x})\in\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}\forall\bar{s},\bar{t}\in\textnormal{{ClTermSeq}}_{\mathcal{L}_{\textnormal{{B}}}}\ \ \bigl{(}\bar{{s}^{\circ}}=\bar{{t}^{\circ}}\rightarrow T\phi[\bar{s}/\bar{x}]\equiv T\phi[\bar{t}/\bar{x}]\bigr{)}
The last condition is sometimes called * generalized regularity*, or generalized term-extensionality. It should resemble the well known extensionality rule from deductive calculi for first-order logic, i.e.
[TABLE]
We include it since without it the quantifier axiom for behaves in an unnatural way.999It behaves decently already after adding the ungeneralized version of CT6 for single terms. For example, for we have:
[TABLE]
Obviously one can simply interchange the quantifier axiom with the following one:
[TABLE]
But then, without regularity, the following implication:
[TABLE]
becomes unprovable. With the Regularity both quantifier axioms are easily seen to be equivalent.
The above version of was claimed to be conservative over PA in [21]. However, no proof of this fact was provided and only a hint that it requires a slight modification of the Enayat and Visser construction (see [5]). This modification, however, adds a layer of technical difficulty, so in the current version we prove feasible conservativity of in full detail. A detailed proof of conservativity of this theory is provided also in [12].
3.2 and
The idea behind the untyped notion of truth is that the truth predicate can be meaningfully applied also to sentences containing it, to the effect that we could e.g., judge
[TABLE]
to be true. In this setting the following additional axiom seems desirable:
[TABLE]
where "TRP" abbreviates "TRansParency". Obviously if one wants to have a compositional theory of self-applicable truth, one cannot simply take (TRP) the axioms through and let the quantifiers range over all formulae of , since the resulting theory would be inconsistent by Tarski’s Theorem. The next two theories which we shall investigate exhibit two different directions in which one can look for a natural theory of untyped truth. In the first one the axiom is rejected and somewhat compensated. In the second one the transparency axiom is missing.
Definition 45**.**
is the -theory extending B with the following axioms:
.
.
, for every relational symbol of .
, for every relational symbol of .
\forall\bar{s},\bar{t}\in\textnormal{{ClTermSeq}}_{\mathcal{L}_{\textnormal{{B}}}}\forall\phi(\bar{x})\in\textnormal{{Form}}_{\mathcal{L}_{T}}\ \ \Big{(}{\bar{s}}^{\circ}={\bar{t}}^{\circ}\rightarrow T\phi(\bar{s})\equiv T\phi(\bar{t})\Big{)}.
.
.
KF, a theory obtained by augmenting with full induction scheme for formulae with the truth predicate, was introduced by Feferman in [6] as an axiomatisation of a theory of truth proposed by Kripke in [14]. represents an attempt to define a reasonably behaved self-applicable truth predicate guided by the following intuition: we try to mark the sentences which are definitely true. We start with the set of true equations on arithmetical sets. Then we proceed in stages, e.g. whenever and are definitely true, we mark as definitely true. Whenever is definitely true, we mark as definitely true. Whenever is definitely true for all , we mark as definitely true. Thus in the process we only enlarge the set of true sentences until it reaches a fixed point. axiomatises properties of fixed points obtained in such a way.
The desirable feature of is that it satisfies the TRP axiom. However, the idempotence of the truth predicate fails rather spectacularily in a different place. It turns out that adding both derivation rules
[TABLE]
to at the same time yields this theory inconsistent (see [10], Lemma 15.20. The Lemma is stated for the full KF, but the induction axioms are not used in the proof). Moreover, the rule (NEC) is inconsistent with the following axiom of consistency which says that no sentence is both true and false:
[TABLE]
Dually, the rule (CONEC) is inconsistent with the axiom of completeness which states that every sentence is either true or false.
The other standard candidate for a well-behaved theory of self-referential truth is Friedman–Sheard’s theory FS.
Definition 46**.**
is the extension of B in the language extending with the following axioms:
\forall s,t\in\textnormal{{ClTerm}}_{\mathcal{L}_{\textnormal{{B}}}}\ \ T(s=t)\equiv\bigl{(}{s}^{\circ}={t}^{\circ}\bigr{)}. 2.
, for every relational symbol of . 3.
. 4.
. 5.
. 6.
\forall\bar{s},\bar{t}\in\textnormal{{ClTermSeq}}_{\mathcal{L}_{\textnormal{{B}}}}\forall\phi(\bar{x})\in\textnormal{{Form}}_{\mathcal{L}_{T}}\ \ \Big{(}{\bar{s}}^{\circ}={\bar{t}}^{\circ}\rightarrow T\phi(\bar{s})\equiv T\phi(\bar{t})\Big{)}.
which additionally is closed under the rules (NEC) and (CONEC).
Note that in none of the above theories we extend the induction scheme to the full . As usual we write simply to abbreviate .
A set of axioms, which is deductively equivalent to the above was first introduced in [8]. The above list of axioms is taken from [10] with a minor variation: we supplemented the normal axiomatization with for reasons analogous to the ones for .
At first sight, seems to be much more natural than . The presence of NEC and CONEC rules compensates in a way the lack of the transparency axiom making the theory symmetric: for every it holds that
[TABLE]
This heavily contrasts with the case of . However this symmetric feature turns out to be very pricey, as the well-known McGee’s theorem shows:
Theorem 47** (McGee, [16]).**
* is -inconsistent.*
Moreover, the fully inductive versions of both theories differ dramatically in strength, when evaluated over PA: can define levels of the ramified truth hierarchy (i.e. for every . See [10] for details), while the strength of is exhausted by -many such levels. We sketch the proof of the latter fact in Subsection 3.3.3 and give a strengthening of it in Subsection 4.3.
Both and are conservative extensions of B.
Theorem 48** (Cantini, [1]).**
* is a conservative extension of B.*
The above theorem has been proved by Cantini for PA, but his proof works essentially in the same way for all base theories B with a modicum of arithmetic. Conservativity of follows from the work of Halbach. He showed that FS with full induction is reducible to the system with full induction and a stratified family of compositional truth predicates. His proof, however, does not rely on induction in the considered theories or on the specific choice of the base theory. Therefore, essentially the same argument shows that is reducible to for a wide choice of base theories B. Conservativity of can in turn be shown by using known proofs of conservativity for , so, in a sense, it was "in the air".101010However, we know of no published proof of this result. We will provide more details (including the definition of ) in Subsection 3.3.3.
Theorem 49** (Essentially due to Halbach).**
* is a conservative extension of B.*
We shall sketch both proofs in the next Subsection 3.3.
3.3 Conservativity of truth theories
The main goal of this paper is to establish that certain truth theories over PA are feasibly reducible to PA. This involves certain elaborate technical arguments in each case. However, what these proofs have in common is that they all rely on the results from Subsection 2.5 since they follow the same general pattern: Suppose that is a theory of truth over PA that is conservative over PA. Moreover, assume that the conservativity proof in fact can be formalized in PA and that it is uniform in the sense that the proof works equally well for PA and its large enough finitely axiomatized fragments B that containing . Then can be shown to be feasibly reducible to PA. Let us recall the precise formulation of this fact (it was formulated as Corollary 36):
Suppose that is a finite extension of PA of the form , , and the following sentence is provable in PA:
"If B is any finite fragment of PA, then every -full model of B has an elementary extension to a -model of ."
Then is feasibly reducible to PA.
The proofs of our feasible reducibility results will in each case consist in an appropriate arithmetization in PA of a known conservativity proof of over fragments of PA. Therefore, we are forced to pay close attention to the specific features of the arithmetical implementation of the conservativity proofs, which is bound to obscure the main idea of the proof of feasible reduction. Therefore, to provide some help to the reader, we present outlines of the relevant conservativity proofs in this section.
3.3.1 Conservativity of
In this section, we sketch the proof of the following conservativity result:
Theorem 50**.**
Fix any fragment B of PA extending . Then is conservative over B.
Sketch of a proof.
We will base our proof on the argument given by Enayat and Visser in [5]. Fix any model of B. We will construct where by first constructing a chain of models
[TABLE]
such that , and the subsets are partially defined satisfaction predicates. Each satisfies compositional conditions for all valuations from but only for formulae from . This condition is axiomatized as a scheme. For example, we require for any arithmetical formulae separately that:
[TABLE]
and for any formula
[TABLE]
Thus we require that behaves compositionally for formulae which belong to , including nonstandard ones. Additionally, we require that agrees with on formulae from and arbitrary valuations. Note that if , then a direct subformula of also belongs to . In other words: the predicate gets fixed on the formulae on which it is guaranteed to behave compositionally.
We write the compositional conditions for in a pointwise manner (formula by formula), so in order to check that such an extension exists, by compactness, we only have to check that for each finite subset of formulae from , we can find a predicate which satisfies compositional conditions for these formulae. This turns out to be possible with a fairly straightforward recursion.
Finally, we take the sum of models . In order to check that the resulting sum satisfies compositional axioms (for formulae and assignments), we take an arbitrary formula , its direct subformulae, and some fixed valuation for . We check that it satisfied compositional conditions in the model , such that and was present already in , and that the compositional conditions were preserved along the construction.
Finally, we turn the model with a satisfaction class (i.e., a set of pairs such that is an arithmetical formula and ) obtained as a sum of a chain into a model with a truth class. We define as follows
[TABLE]
This concludes the sketch of the proof. A detailed argument will be presented in Subsection 4.1. ∎
The proof as written above does not overtly formalise in PA. The problem is as follows: when we speak in PA of full models , we really speak of formulae defining elementary diagrams of . The defining formulae for the full models can in general be more and more complex as we iterate the construction, and there might be no formally correct way of defining the sum of the obtained chain of models. Actually, we cannot even define the whole chain, but only its standard initial fragments.
There are a couple of ways to circumvent this issue. The route undertaken in this paper is the simplest we know of: we do not speak directly of models, but rather, through appropriate first order theories. More specifically, we will show that for any natural number , the theory (formulated in an extension of the language of B with finitely many new predicate symbols), saying:
"There exists a chain of models satisfying the conditions from the Enayat–Visser construction."
is consistent. This will be done by formalising the inductive step in the construction by Enayat and Visser, i.e., by showing that for all numbers , if is consistent, then is consistent as well. The consistency of is a -statement, so PA will be able to verify that for any the theory is consistent. This in turn will be enough to show that the theory saying:
"There is a chain of models: of infnite length which satisfy conditions from Enayat–Visser construction."
is consistent, hence has a model. From this model we will be able to define the whole chain in a uniform way and, consequently, its sum, which will give us a model of (not a full model though). The details involve a number of intricate and technical considerations; they are presented in the next section.
3.3.2 Conservativity of
In this subsection we will outline the proof of conservativity of , where B is a fragment of PA extending . The standard proof of conservativity is motivated by the original construction of Kripke.111111What we present here more resembles a modified constructions which seems to be first formulated by Cantini in [1]. We can define truth predicate semantically as a fixed point of an operator that takes a subset of , thought of as the set of sentences (possibly containing the truth predicate) which can be already identified as true at a given stage of the construction, and replaces it with in the following way:
- •
If is a true atomic or negated atomic formula, then .
- •
If , then .
- •
If , and for a term , then .
- •
If , and , then .
- •
If , then .
- •
If or , then .
- •
If and , then .
- •
If , then .
- •
If for all , then .
If is a limit ordinal, we set . In the above construction, we enlarge the set of sentences which are definitely true with a set of sentences which are definitely true if we interpret the truth predicate as the set . Since at each stage, we only keep enlarging our set, the construction will reach its fixed point. Such fixed points can be easily shown to satisfy the axioms of . The outlined argument carries over to an arbitrary model of B thus establishing conservativity of over its base theory.
The main problem with the outlined argument is that it does not directly formalize in PA since it relies on the principle: "Every positive operator on subsets of reaches a fixpoint." which is clearly not available in PA. However, there is a rather simple fix to this problem.
Start with a *recursively saturated *model . Notice that for , the -th set obtained in the inductive procedure described above, , is arithmetically definable in (let us call the defining formula ). By definability of and recursive saturation of , we can deduce that already is a truth predicate satisfying axioms of . Essentially, this relies on the fact that in recursively saturated models holds for all if and only if holds for some and all .121212A very similar argument has been presented in [3] in the proof that any recursively saturated model of PA can be expanded to a model of with internal induction for total formulae. It seems that this reasoning appears originally in [1], where Cantini proved conservativity of with internal induction for total formulae over PA.
It turns out that this argument can be repeated in PA for a finitely axiomatized fragment B of PA extending . Namely, we first take a full model of B, then we take its recursively saturated elementary extension, a full model and we define the predicate in as the sum of all sets defined with certain formulae defining the analogues of the sets from the above construction. The details will be given in Subsection 4.2
3.3.3 Conservativity of
The proof of conservativity of over PA is analogous to the one showing the upper bounds on the proof-theoretical strength of its fully inductive version, . As an intermediate step we pass through a theory of iterated compositional truth predicate of length , .
Definition 51**.**
is the extension of B in the language extending with new predicate symbols (we stipulate that and ) satisfying the following axioms for all :
. 2.
. 3.
. 4.
. 5.
\forall\bar{s},\bar{t}\in\textnormal{{ClTermSeq}}_{\mathcal{L}_{\textnormal{{PA}}}}\forall\phi(x)\in\textnormal{{Form}}^{\leq 1}_{\mathcal{L}_{<k}}\ \ \bigl{(}{\bar{s}}^{\circ}={\bar{t}}^{\circ}\rightarrow T_{k}(\phi(\bar{s}))\equiv T_{k}(\phi(\bar{t}))\bigr{)}. 6.
\bigwedge_{i<k}\forall s\in\textnormal{{ClTerm}}_{\mathcal{L}_{\textnormal{{PA}}}}\ \bigl{(}{s}^{\circ}\in\textnormal{{Sent}}_{\mathcal{L}_{<i}}\rightarrow T_{k}(T_{i}(s))\equiv T_{i}({s}^{\circ})\bigr{)}. 7.
\forall i<k\forall s\in\textnormal{{ClTerm}}_{\mathcal{L}_{\textnormal{{PA}}}}\ \ \bigl{(}{s}^{\circ}\in\textnormal{{Sent}}_{\mathcal{L}_{<i}}\rightarrow T_{k}(T_{i}(s))\equiv T_{k}({s}^{\circ})\bigr{)}
Define
Remark 52**.**
We assume that the initially chosen coding is extended in such a way that the length of is logarithmic in (in fact, polynomial will do, so this logarythmic bound is not that important).
As in the case of , and abbreviate and respectively. Note that similarly to all the rest of theories studied in this paper, in we do not extend the scheme of induction to formulae with the truth predicate.
Now let B be our base theory. We shall now reduce the problem of conservativity of over B to the analogous problem for . Let us recall that an interpretation ∗ is an -interpretation, if for every arithmetical sentence we have
[TABLE]
In order to perform the above mentioned reduction it suffices to show that every "finite piece" of can be -interpreted in . In this context "an -piece" means "a sentence which can be deduced from B and axioms – (note that in this context is missing) using at most applications of NEC and CONEC rules." We shall denote it with . Thus is in if it can be deduced using one application of the NEC rule or one application of the CONEC rule. (But not both. Our definition differs from the original one given by Halbach.) Now the following holds:
Lemma 53** (Essentially Halbach, [10], Theorem 14.31).**
For each , is -interpretable in .
Proof.
Define a family of primitive recursive functions as follows
[TABLE]
Where abbreviates
[TABLE]
and is a natural -formula which represents in . We shall check that for every , is an -interpretation of in . It is evident that each acts as identity on arithmetical sentences. Moreover, for every and each , is a sentence of (that is, it contains truth predicates with indices at most ) and this fact is provable in B. Hence if is any axiom from through and , then
[TABLE]
Now, following the lines of Halbach’s argument, we fix and, by induction on up to , we show that for every and every 131313That this range of shrinks in the induction process is needed to deal with CONEC. we have:
[TABLE]
Note that ( ‣ 3.3.3) witnesses that the above holds for . Now inductively assume that the above holds for an and fix .
Fix a proof of in . Arguing by induction assume that the last rule used in is either NEC or CONEC. In both cases we will use the fact that for all , and every , proves
[TABLE]
If is obtained by NEC, then and by our induction assumption we know that . Since , by ( ‣ 3.3.3) we obtain . The last sentence is by definition equal to , hence this case is done.
If is obtained by CONEC, then we argue dually using applied to . ∎
In the rest of this section we sketch the proof of conservativity of over B based on Enayat-Visser construction. For starters, let us note that it suffices to construct, for an arbitrary model a chain of models such that
; 2. 2.
; 3. 3.
.
Then will be an elementary extension of satisfying . To get we basically start the Enayat-Visser construction (as sketched in Subsection 3.3.1) on for the base language . More precisely, we build an -chain of models such that
and ; 2. 2.
; 3. 3.
4. 4.
is a satisfaction class for with respect to all valuations from
Satisfying the above requirements would suffice to guarantee that in the limit model axioms through will hold. However, to account for we have to improve our satisfaction classes slightly. This can be done by requiring that makes true all the statements such that
[TABLE]
for and (i.e., for such any and for every assignment ). This, in turn, requires only a tiny modification of the original Enayat–Visser proof. Details will be presented in Section 4.3.
4 The main act: feasible reductions of truth theories
This section contains the principal results of this paper. The first three subsections are devoted, respectively, to feasible reductions of , , and to PA. The last section, on the other hand, presents an interpretability-theoretic perspective of our work.
4.1 Feasible reduction of to PA
This section is devoted to the proof of the following result:
Theorem 54**.**
* is feasibly reducible to PA.*
An immediate corollary of Theorem 54 is that does not have super-polynomial speed-up over PA. The proof of a special case of this corollary for -sentences of arithmetic was presented by Fischer [7], based on an outline suggested by Visser, but as pointed out in a footnote in Subsection 2.6 the presented proof lacks an important detail.
Our proof of Theorem 54 will be based on the verification of the veracity of the assumption of Corollary 36 for and . In fact, we shall do slightly better: let us say that a theory B is good if it is formulated in a language that extends with new finitely many relation symbols (so all terms are arithmetical) and B extends . We shall show that for every the following single sentence is provable in PA:
"If B is any -good theory, then every -full model of B has an elementary extension to a -model of ."
In the above, is to be thought as independent of the size of the proof that our reduction takes as an argument. In the case of we will need the above theorem only for . The more uniform version will be needed to handle the case.
Our proof consists in formalizing the -chain Enayat–Visser construction inside PA, according to the sketch given in Subsection 3.3. As in the conservativity proof of Enayat and Visser, we shall make a detour through partial satisfaction classes. Let us introduce one more definition that will play an intermediate role in the proof below.
Convention 55**.**
If is an arbitrary unary predicate and an arbitrary formula with one free variable, then we write for the formula .
Definition 56** ().**
Let B be a theory in a finite language extending and be a fresh unary predicate. is the theory of -restricted, extensional satisfaction class for formulated in the language and extending B with the following axioms:
\forall x,y\bigl{(}S(x,y)\rightarrow x\in\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}\upharpoonright_{P}\wedge y\in\textnormal{{Asn}}(x)\bigr{)}. 2. 2.
\forall s_{0}\ldots\forall s_{n}\in\textnormal{{Term}}_{\mathcal{L}_{\textnormal{{B}}}}\upharpoonright_{P}\forall\alpha\in\textnormal{{Asn}}(s_{0},\ldots,s_{n})\bigl{(}S(R(s_{0},\ldots,s_{n}),\alpha)\equiv R(s_{0}^{\alpha},\ldots,s_{n}^{\alpha})\bigr{)}. 3. 3.
\forall\phi,\psi\in\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}\upharpoonright_{P}\forall\alpha\in\textnormal{{Asn}}(\phi,\psi)\ \ \bigl{(}S(\phi\vee\psi,\alpha)\equiv S(\phi,\alpha)\vee S(\psi,\alpha)\bigr{)}. 4. 4.
\forall\phi\in\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}\upharpoonright_{P}\forall\alpha\in\textnormal{{Asn}}(\phi)\ \ \bigl{(}S(\neg\phi,\alpha)\equiv\neg S(\phi,\alpha)\bigr{)}. 5. 5.
\forall\phi\in\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}\upharpoonright_{P}\forall v\in\textnormal{{Var}}\upharpoonright_{P}\forall\alpha\in\textnormal{{Asn}}(\exists v\phi)\ \ \bigl{(}S(\exists v\phi,\alpha)\equiv\exists\ \ \beta\sim_{v}\alpha,\beta\in\textnormal{{Asn}}(\phi)\ \ S(\phi,\beta)\bigr{)}.
and the axiom of generalized regularity:
[TABLE]
If and , is such that , then is called a -restricted extensional satisfaction class for on . If is ""-restricted, it is called full. Note that the above notion makes sense even if is not a full model since is a finite extension of B (recall Definition 29).
Note that in the definition above we do not restrict the range of assignments (denoted by variable in the above definition). In effect, we do not assume that the assignments come from the restricted set. This is crucial to our purposes.
Convention 57**.**
Below we always assume that is either empty or defines in a universe of an elementary submodel of . This certainly can be sustained along the inductive condition from the proof below. Under this assumption, is closed under the direct subformula relation, which we denote with . More precisely
[TABLE]
The distinctive feature of Enayat-Visser technique of building truth classes is that one creates a well-behaved satisfaction class via a union of chain argument. Let us now state the proposition which will provide us with a proof of the induction step in this construction.
Lemma 58** (Arithmetized Enayat-Visser construction).**
Let be a finite language extending . The sentence expressing the following implication is provable in PA for every :
If is a -full model for such that:
; 2. 2.
* is a -restricted satisfaction class for ;*
then there exists a -full model for and an -set such that:
; 2. 2.
* is an -restricted satisfaction class for (we add a predicate for the universe of to the language);* 3. 3.
.
Remark 59**.**
We call the reader’s attention to the asymetry in the above lemma: we start with a full model but finish with a full model and two its subsets , . This will be compensated for in our inductive construction.
Proof.
We work in PA. Let be as in the antecedent of the implication. We follow the lines of the standard Enayat-Visser proof from [5], but we perform it inside PA. Moreover we have the additional technical complication caused by adding the regularity axiom to . Let us define the language :
[TABLE]
Now let us define the Enayat-Visser theory for as the sum of the following sets:
[TABLE]
We argue that this theory is consistent. Then, by ACT (Theorem 9) there will be a -model of this theory. Then putting:
[TABLE]
we easily check that the triple satisfies the claim.
To prove the consistency, we will argue by the compactness theorem. Let be a finite (in the sense of PA) fragment of this theory. For each predicate which occurs in we will find a formula such that
[TABLE]
where denotes the theory resulting from by replacing each occurrence with the corresponding formula . Note that the above makes perfect sense, since is a full model. This clearly guarantees that is consistent. Moreover from now on we do not need to bother with the sentences from , since they obviously hold in .
As in the original Enayat-Visser proof we construe ’s by induction on the appropriately defined rank. Note that we have more work to do here than in the proofs given by Enayat and Visser [5] (since in their set-up, the language of arithmetic is purely relational), and by Cieśliński [2] (since in his set-up does not include our generalized regularity axiom ). Let be the set of formulae such that the predicate occurs in a formula in . Let be an arbitrary coded set of formulae of . We put iff there exists a sequence such that the following three conditions hold (in the last condition denotes the relation of being an immediate subformula):
and . 2. 2.
For all . 3. 3.
For all for all , iff for all such that , .
We say that if is the greatest such that . This definition makes sense, since if , then where denotes the cardinality of .
Example 60**.**
If , then the , since .
The intuition behind the above definition is that is the complexity of where formulae whose some immediate subformula does not belong to are treated as atoms. The idea is that for any such formula the satisfaction set can be defined almost arbitrarily and then ’s for formulae of higher rank can be defined in terms of previously defined satisfaction sets.
Observe that if we follow the above described recursive procedure, then all the compositional axioms (i.e., counterparts of axioms for atomic formulae, disjunction, negation and quantifier) from will be satisfied. However, we have one immediate problem: it can happen that (an instance of) the axiom of regularity for and is in , but and get different ranks. In such a situation the standard procedure does not seem to guarantee that and (i.e. formulae which interpret and in ) will satisfy the regularity axiom. To simplify the notation let us define if the following is in :
[TABLE]
Note that it can happen only if the following holds in (this follows by definition of the Enayat–Visser theory):
[TABLE]
A solution to our puzzle is to complete , obtaining , to assure that we have for all ,
[TABLE]
It is convenient to extend a little bit to make it an equivalence relation. We say that is the term trivialization of , and write if the following four conditions hold:
For every occurrence of a term in , if all occurrences of variables in are free, then is a free occurrence of a variable. 2. 2.
No variable occurs in as both bounded and free, and no variable occurs as free more than once. 3. 3.
For some , a function with domain and values in , the equality holds. (Recall that denotes the result of a formal substitution of terms for free variables of and that contains also terms with free variables.) 4. 4.
The indices of free variables of are chosen in a canonical way (for example according to the tree-ordering of the syntactical tree of . This is only needed to guarantee uniqueness).
The idea behind is that if for some term substitution and some formula we have
[TABLE]
then, and there are unique term substitutions , such that:
[TABLE]
We write if .141414The idea of using such term trivializations was directly inspired by Graham Leigh’s [15]. Obviously is an equivalence relation. Moreover, is a congruence with respect to the direct subformula relation , i.e. the following lemma holds. For its proof consult the appendix.
Lemma 61** (Congruence lemma).**
For all , , it holds that
[TABLE]
By induction it follows that the congruence lemma holds for in place of , where denotes the -step transitive closure of (by stipulation is the relation of equality).
Finally, observe that for every , and are mutually interdefinable. Indeed, fix and such that . Then, having , we define with the condition:
[TABLE]
Similarly, having we define with the condition:
[TABLE]
Now, define to be the completion of if for all , if and only if there exists and such that the following pair of conditions hold:
2. 2.
.
[TABLE]
Let us observe that with the current definition of it holds that for all
[TABLE]
Indeed, suppose this is not the case. Then, assumming without loss of generality that
[TABLE]
there exists such that but no formula from is the -st direct subformula of . By the congruence lemma (and induction) there exists such that and . Since , then there are such that for some and and (possibly and —when ). Since is an equivalence relation, then . Now, by the definition of we obtain that , a contradiction.
For every , let denote the fragment of consisting of axioms for predicates for of at most and recall that if is a family of formulae with one free variable indexed with such that , then
[TABLE]
denotes the theory resulting from by replacing every occurrence of with the formula . Let be the formula asserting the following implication:
"There exists the unique family of -formulae indexed with formulae of such that:
For every , if , then:
- (a)
if , then , and 2. (b)
if is from , then , and 3. (c)
if for some , , then is defined from using () and (); 4. (d)
otherwise put . 2. 2.
."
Now, we prove by induction. This concludes the proof of Lemma 58.∎
Let us now complete the proof of Theorem 54. We shall show how, working inside PA, given an arbitrary good -theory B, we can elementarily extend an arbitrary -full model of B to a -model of . To this end, working in PA, fix a good theory B, and a -full model of B. Next, still working in PA we shall construct an unbounded -chain of -full models
[TABLE]
such that:
- R1
,
and for each we have:
- R2
, 2. R3
and is an -restricted satisfaction class for and 3. R4
.
In particular each triple will have a fixed -complexity. Let us assume that such a chain has been constructed and and are formulae defining the sequences of respective -full models and restricted satisfaction classes. For example it holds that iff is the definition of the -th full model (recall that officially full models are identified with their elementary diagrams). Then (in PA) we define the limit model with the formulae:
[TABLE]
where denotes the canonical satisfaction predicate for -formulae.151515Recall that by Convention 28, means , where is a valuation which assigns to the only variable of . Note that is really a full -model, since the chain is elementary with respect to -formulae and each is a full model for .
The rest of the argument follows along the lines of Enayat–Visser proof: we check that is a full satisfaction class on , hence is a -model of .
Let us now construct the promised chain of models: reasoning in PA, we first define a sequence of increasing theories Intuitively speaking, for each , describes a structure and the family satisfies conditions R1 –R4 for boundedly many numbers. In other words, is the initial segment of our desired chain consisting of first models.
We now give a precise description of . The non-logical symbols of consist of , together with constant symbols for every element of , unary predicate symbols , and binary predicate symbols Let denote this language.
Convention 62**.**
If is any formula (in the sense of PA), and is any of ’s then we write to denote the relativisation of to M. This means that we syntactically replace all quantifiers with , all quantifiers with and adding to a conjunct .
The official translations of R1 through R4 above are as follows:
- •
Condition R1 is translated as .
- •
Condition R2 is translated as
[TABLE]
- •
Condition R4 is expressed by the following finite set of sentences:
- •
Condition R3 is expressed by the conjunction of the universal closures of the following finitely many axioms i-i, , which directly correspond to the ones from Definition 56 (we stipulate that is always the formula ) :
- 1i
S_{i}(x,y)\rightarrow\bigl{(}\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}^{\textnormal{M}_{i-1}}(x)\wedge\textnormal{{Asn}}^{\textnormal{M}_{i}}(x,y)\bigr{)}.
- 2i
\left(\textnormal{{TermSeq}}_{\mathcal{L}_{\textnormal{{B}}}}^{\textnormal{M}_{i-1}}(\bar{s})\wedge\left(x=R(\bar{s})\right)^{\textnormal{M}_{i-1}}\wedge\textnormal{{Asn}}^{\textnormal{M}_{i}}(x,\alpha)\right)\rightarrow\bigl{(}S_{i}(x,\alpha)\equiv(R(\bar{s}^{\alpha}))^{\textnormal{M}_{i}}\bigr{)}.
- 3i
- 4i
\textnormal{S}_{i}(x,\alpha)\equiv\bigl{(}\textnormal{S}_{i}\left(y_{1},\alpha\right)\vee\textnormal{S}_{i}\left(y_{2},\alpha\right)\bigr{)}.
- 5i
\left(\textnormal{{Form}}_{\mathcal{L}_{\textnormal{{B}}}}^{\textnormal{M}_{i-1}}(x)\wedge\bigl{(}\exists v\ \ (\textnormal{{Var}}(v)\wedge x=\exists v\ y)\bigr{)}^{\textnormal{M}_{i}}\wedge\textnormal{{Asn}}^{\textnormal{M}_{i}}(x,\alpha)\right)\rightarrow
- 6i
\left.\bigl{(}S_{i}(y_{1},\alpha)\equiv S_{i}(y_{2},\alpha)\bigr{)}\right).
We can now use induction on to show that :
Base case
Recall that is a fixed -full model of B. Let Then since is definable in , the elementary diagram of is also definable. This makes it clear that holds.
Inductive step
Fix and suppose that holds. Then by Theorem 9, there is a full model of satisfying R1 through R4 above whose elementary diagram is -definable.
Let be the language of , and let be the reduct of the structure to the language in which the universe of discourse is the -interpretation of For example, since , a model of will be a structure of the form , where , , and is the domain of discourse of In this case, So in general is of the form 161616Recall the conventions from Subsection 2.2. Although officially full models are elementary diagrams, we refer to them as though they were usual structures, as it is routine to translate statements about complete Henkinized theories into statements about structures. Observe that is a full model. Typically, its domain is smaller than the domain of .
Also let be the reduct of to . Let us observe that taking reducts does not raise the complexity of (the definition of) models, so and are still -full models. To this model apply Lemma 58 for , and . We are given , a -full model for , and a -set such that is an -restricted satisfaction class and . Now we "glue" this model to the end of the chain given by . More precisely, we define a model for in the following way. The universe of is the sum of the universes of and (without loss of generality, renaming the elements of if necessary, we assume that ). is interpreted as , as and and are interpreted on elements from N as they were in . For and are interpreted as in . Thus we have obtained a structure which contains an elementary chain of models of B, with being the top one and possibly some extra elements in the domain of .
Also note that for a structure defined in this manner we do not have an elementary diagram at our disposal, hence an argument is needed to show that holds. We argue as in the proof of Corollary 35. Note that if is an alleged proof of contradiction from the axioms of which has a subformula property, then only the following types of sentences can occur in it:
- A
formulae of the form for ;
- B
subformulae of sentences of the form
[TABLE]
for , .
- C
subformulae of sentences from (formalization) of condition R4;
- D
subformulae of sentences from 1i - 6i, i.
The complexity of formulae from C and D is bounded by a standard number. This is not the case of formulae from A or B. However, to decide every such sentence we can use and and this is clearly sufficient (all formulae from B are in the universal closure of boolean closure of formulae of type for ). All in all, we can define a -truth predicate for , for sufficiently large , which would work for all formulae from the proof . It follows that cannot be a proof of contradiction. This ends the inductive step and we can conclude that holds.
We shall now define the promised chain of models as a full model of the limit of ’s. Define:
[TABLE]
Here is treated internally, it simply denotes the universe. is a consistent theory of complexity (it is computable in . It follows that it has a -full model . This model gives rise to the -chain of -full models , which can be defined as follows:
[TABLE]
The construction guarantees that under such a definition, the chain satisfies the requirements R1 through R4. This finally concludes the proof of feasible reduction of to PA.∎
4.2 Feasible reduction of to PA
In this subsection we will establish:
Theorem 63**.**
* is feasibly reducible to PA.*
Our proof of the above theorem will demonstrate that the assumption of Corollary 36 holds with the choice of and , i.e., we will prove:
Lemma 64**.**
PA* proves that for any finite fragment P of PA, every full -model of B has an elementary extension to a -model of .*
Before proving Lemma 64, we will first show that PA can formalize the proof of the existence of recursively saturated models.
Lemma 65**.**
For any , PA proves that any -full model of a finite fragment B of PA, there exists a -full model such that PA proves:
[TABLE]
Let us first make sense of the above claim. Recall that by a full model over a language , we mean an elementary diagram of that model, that is, a complete consistent Henkinized theory.
We say that a model is recursively saturated if for every Turing machine with code , and every finite sequence of elements of , , if for every finite sequence of formulae whose Gödel numbers are accepted by the machine with index , there exists an such that
[TABLE]
then there exists such that for every which is accepted by the machine with the code ,
[TABLE]
The above definition is well-known. We cite it here to ensure the reader that it really can be spelled out in PA and that the claim of recursive saturation of can be effectively produced in polynomial time given the definition of .
Let us note that the lemma itself is also well known. Its formulation and proof can be found in [19], Lemma IX.4.2. We demonstrate it here for the convenience of the reader.
Proof of Lemma 65.
We reason in PA. Let be the arithmetical language with constants added. Let be any polynomial time enumeration of all sentences of the language . Let be a full model and let be the theory whose axioms are elementary diagram of (which, according to our official definition from Subsection 2.2 is the model itself), all Henkin sentences (in the language with the new constants), and all sentences of the following shape:
[TABLE]
where , and all the constants of occurring in the formulae are of the form for , and the machine with the code accepts sentences in less than steps. By Theorem 9 (ACT) the theory has a -full model . This ends the proof of the claim * ‣ 65 in PA. ∎
Now, we proceed to the proof of Lemma 64 which will end the proof of Theorem 63.
Proof.
We work in PA. Let be any -model of B. By Lemma 65, there exists a -full recursively saturated model of B.
In our proof, we use a construction resembling the one given originally by Kripke in [14]. As we have already noted in Subsection 3.3.2, a very similar argument appeared before in [1] and [3]. By induction, we define a sequence of arithmetical formulae . That is, a sequence of elements such that . Let be a definition of the atomic diagram of . More precisely, let
[TABLE]
Having defined the formula , we set (which we also denote by ) if and only if one of the following conditions is satisfied:
- •
- •
.
- •
.
- •
.
- •
.
- •
- •
.
- •
.
Now, let be the subset of the domain of defined as the sum . In other words,
[TABLE]
Consider the expanded model Since the definition of is in the complexity of , the complexity of the resulting model is . We would like to ensure that is a model . The model satisfies B, since does, so it is enough to check that satisfies truth-theoretic axioms –.
This is obvious for . Let us check the claim for . Suppose that . Since , there exists such that
[TABLE]
Then by definition of , either (and, consequently, holds) or (and then holds). Conversely, if or , then for some , or . But then and, consequently, . This guarantees that . The proofs for axioms are similar, as are the proofs for axioms . Let us focus on axiom .
Suppose that . Then there exists such that . This implies that for all , . Therefore, for all , holds.
Conversely, suppose that for all , . In other words, for every , there exists such that . We claim that there exists such that for all , . Suppose otherwise. Then for every , the following set of arithmetical formulae is realised in by some :
[TABLE]
Therefore, by recursive saturation, there exists an such that for every ,
[TABLE]
contrary to the assumption. This implies that there exists such that for every , and therefore . We conclude that holds. The case of axiom is straightforward.
In order to prove that holds, we check by induction on (in PA) that this axiom is satisfied by formulae in . The conclusion follows immediately. This concludes the proof of the lemma. ∎
4.3 Feasible reduction of to PA
In this section we strengthen the conservativity proof from Subsection 3.3.3 by establishing the following result:
Theorem 66**.**
* is feasibly reducible to PA.*
The key step in our construction is to feasibly reduce the theory of -many truth predicates, , defined in Subsection 3.3.3, to PA. This is achieved in the following lemma.
Lemma 67**.**
* is feasibly reducible to PA.*
Proof.
We shall prove that the assumptions of Corollary 34 hold for . Fix and an arbitrary finite fragment B of PA, w.l.o.g. and assume that is a -full model. We shall build a -model of , which clearly suffices (note that now we are talking about internally). The aim is to formalize in PA the conservativity proof from Subsection 3.3.3. In order to do this we shall build a chain of uniformly definable -full models such that and for each
is a full -model of , and 2. 2.
for each .
Clearly the limit model will be a model of (even a full one—this follows by elementarity). To define the respective chain we shall implement the argument from Section 4.1: the chain will be described by a -theory formulated in the language whose non-logical symbols are:
symbols of ; 2. 2.
unary predicates: ; 3. 3.
unary predicates: .
Similarly to the proof for in Subsection 4.1, the axioms of can be divided into three groups:
is an elementary supmodel of . Formally this is expressed as an infinite set of axioms: . 2. 2.
forms an elementary chain of submodels. More precisely: for each , is an elementary submodel of . Formally this is expressed analogously to the condition (R2) from the proof for . 3. 3.
For every , is a model of . This is expressed by formally relativizing the axioms of to .
Now, by induction on we show that . We follow the lines of the sketch of the conservativity proof given in Subsection 3.3.3. For we simply use the proof from Section 4.1 to build an elementary supmodel of satisfying . For the induction step, note that, using the same reasoning as we did in Section 4.1 to verify , it is enough to build a model for which would be a full model for but will possibly leave some sentences with undefined. As in the conservativity proof for we use the fact that is deductively equivalent to the theory I below 171717”” abbreviates ”Induction” as this theory is used in the induction step of our construction.:
[TABLE]
From we obtain a model of . We build an extension satisfying I in many steps via the union of chain argument. The following is the analogue of Lemma 58 in our situation:
Lemma 68** (Arithmetized Enayat–Visser construction+).**
The sentence expressing the following implication is provable in PA for every :
If is a -full model for such that:
; 2. 2.
* is a -restricted satisfaction class for ;*
then there exists a -full model and a -set such that the following conditions hold:
; 2. 2.
* is an -restricted satisfaction class for (we add a predicate for the universe of to the language);* 3. 3.
; 4. 4.
*for every , . *
Sketch of the proof.
We indicate how to modify the proof Lemma 58. Firstly, we add the following sentences to the definition of Enayat-Visser theory of :
[TABLE]
Now we work with a finite fragment of the Enayat and Visser theory. The next step which requires a modification, is the definition of for a coded set of sentences . According to the previous definition, formula was of zero if and only if either was atomic or some immediate subformula of was outside . Now we will treat as formulae of zero all formulae from as well. For such formulae we have an obvious candidate for the definition of (i.e. the formula defining the extension for in ). We define:
[TABLE]
Note that satisfies generalized regularity, so it is sufficient to verify the truth of on numerals naming values of . The definition of is now as follows: there exists a sequence such that
and . 2. 2.
For all . 3. 3.
For all for all , iff and for all such that , .181818Note that if , this condition implies that has length .
The definitions of and (for an arbitrary ) are analogous to the ones from the original lemma. The last step which requires a modification is the definition of the formula . Below, as in the proof for , is the set of formulae such that occurs in . We define to be the formula expressing:
"There exists the unique family of -formulae indexed with formulae of such that:
For every , if , then:
- (a)
if for a relation symbol , then , and 2. (b)
if , then , and 3. (c)
if , then , and 4. (d)
if is from , then , and 5. (e)
if for some , , then is defined from using () and (); 6. (f)
otherwise . 2. 2.
."
Note that conditions (c) - (e) are the same as in the original definition. THe rest of the proof is as previously. ∎
Once we can prove , the construction of the chain and its sum is precisely the same as in Section 4.1. ∎
Now we want to finish the proof of Theorem 66. We have just shown that satisfies the assumptions of Corollary 35 (with the "moreover" part). By Observation 38, it follows that is PA-provably feasibly strongly reflexive, i.e., there exists a P-time computable function such that for all , is a PA proof of the sentence
[TABLE]
Note that there exists a P-time computable function such that for any , is a PA proof of the sentence
[TABLE]
The above is in fact an easy consequence of the proof of Halbach’s reduction of to from Lemma 53.
Finally let us observe that the relation defined:
" is a proof of "
is P-time, so, it is uniformly polynomially binumerable in . This gives us a function such that for every proof of a sentence , is a PA proof of . Our desired reduction can now be defined as follows: given an proof of a sentence compute and such that there are exactly applications of NEC and CONEC in and is of depth . Using find the proof of . Compute , i.e. the proof of (). Compute , i.e. the proof of (). Apply finitely many logical operations, to conclude . Finally apply Theorem 25 to compute the proof of
[TABLE]
Concatenation of the above proofs yields a PA proof of . ∎
4.4 Feasible interpretability of truth theories
In Section 2.5 we gave a terse proof of Theorem 32; that proof did not directly link the notions of feasible reducibility with feasible interpretability, which is how we originally conceived of—and arrived at—our main results. Since interpretations, especially of the feasible variety, are of foundational and philosophical interest in connection with axiomatic theories of truth, we now explain the interpretability-theoretic perspective of our work by establishing the following result:
Theorem 69** (Feasible interpretability of truth theories).**
Let be any of the truth theories , , and . Then there exists a uniformly polynomially correct family
**
of interpretations (in the sense of Definition 40).
Note that, by Proposition 42, the existence of a uniformly polynomially correct family of interpretations guarantees feasible reducibility. The proof of Theorem 69 can be readily read-off the second proof of Theorem 32 which we give in this section. We shall demonstrate that the assumptions of Theorem 32 imply the existence of a uniformly polynomially correct family of interpretations. This will make it clear that Theorem 69 holds since we have already already verified in Subsections 4.1, 4.2, and 4.3 that the assumptions of Theorem 32 are met when is any of the truth theories , , and . The second proof of Theorem 32 is based on a feasible version of the Arithmetized Completeness Theorem, which we now turn to. In Lemma 70 below an -set should be understood as a set that is definable by a formula of depth ("definable" in the sense of the feasible predicates).
Lemma 70** (Feasible Arithmetized Completeness Theorem, FACT).**
There exists a polynomial and a P-time computable function such that for every , is a PA proof of the sentence expressing:
"If an -theory in a language is consistent then it has a -model."
Moreover there exists a P-time computable function such that for any , is a PA proof of the sentence expressing:
"If is an -model of a -theory , then is consistent."
Remark 71** (Uniformity of FACT).**
As a corollary to the proof of the above lemma we will obtain the following proposition:
Proposition 72**.**
Suppose that is an -formula such that
[TABLE]
( are the parameters). Then there exists a formula such that
[TABLE]
Moreover, there exists a P-time computable function , such that is a proof of .
Proof of Lemma 70.
See the Appendix. ∎
Let us also mention that in addition to FACT we have also the Feasible Compactness Theorem (the proof of which is rather obvious, as we deal here with consistency in the syntactical sense)
Lemma 73** (Arithmetized Compactness Theorem).**
There exists a P-time computable function such that for every , is a PA proof of the sentence
"An -theory is consistent if and only if each bounded fragment of is consistent."
We are now ready to present the second proof of Theorem 32. We include the statement here for the benefit of the reader.
Theorem 74** (Theorem 32 redux).**
Let be a theory extending PA with an NP-set of axioms. If there is a polynomial such that for every ,
[TABLE]
Then PA polynomially simulates . Moreover, if admits a P-time computable set of axioms and there exists a P-time computable function such that for all , is a PA proof of
[TABLE]
then is feasibly reducible to PA.
Second proof of Theorem 32, Sketch.
We will construct a uniformly polynomially correct family of interpretations . Let us define the theory :
[TABLE]
where says that there is no proof of contradiction using as axioms sentences in or true sentences of depth (see Remark 20 for an explanation). Observe that the length of (the formula defining) is polynomial in and its shape depends uniformly on . Then for every , and the proof is uniform in , so in fact there exists a polynomial such that for every ,
[TABLE]
(for the precise argument see [9], Theorem 2.37). By FACT we know that there exists a formula and a polynomial such that
[TABLE]
Now is defined as a relativization to i.e. a function defined on formulae of the language of which preserves boolean operations such that for every relational symbol and all terms
[TABLE]
(recall that full models are coded as elementary diagrams) and for every existential formula ,
[TABLE]
We check by contraposition that is -correct. Work in PA. Assume , where is of length at most . We will derive . Surely, is of depth at most . Let be as in Theorem 24. Then, by provable Tarski biconditionals, with a proof of length we conclude
[TABLE]
We show that this implies . By our main assumption ( ‣ 74) and Lemma 33 we obtain a polynomial such that
[TABLE]
which directly implies that belongs to . Consequently
[TABLE]
and the proof of it has length . Now using at most many steps involving formulae of length polynomial in we obtain
[TABLE]
What is left to show is that for some polynomial and each ,
[TABLE]
for every sentence in . We use polynomial binumerability of to guarantee that there is a polynomial such that for all and ,
[TABLE]
Now, as previously, we have
[TABLE]
Hence, adding a few more steps, we also have
[TABLE]
Then, as previously we check that is satisfied.
The "moreover" part holds, since if is P-time computable, then we can feasibly find a PA proof witnessing that
[TABLE]
The rest of steps are fully analogous. ∎
5 Open Questions
The proofs of our main results in Section 4 suggest that the answers to the following questions are both in the positive; we pose them here as questions since definitive positive answers to them requires a number of technical verifications that are yet to be carried out.
Question A. Is the conservativity of , , and over PA provable in Buss’s system
Question B. Suppose B is a sequential theory that is inductive; i.e., the scheme of induction over the natural numbers of B is provable in B. Are , , and feasibly reducible to B?
6 Appendix
The bulk of the work in the following paper is inherently technical, especially since we are dealing with nuanced arithmetizations and sizes of proofs, and therefore the arguments need to be checked in a very careful manner. In order to minimally distract the reader from the main flow of the argument, we decided to relegate some of the checking to this Appendix.
6.1 Feasible reflexivity
In Section 2.5, we proved Theorem 32 which is the technical core of our paper and which provides us with a uniform way of obtaining polynomial simulations and feasible reductions. We presented two proofs of that theorem. The first of them used the following Lemma (originally Lemma 33):
Lemma 75**.**
There exists a P-time computable function such that for all is a PA proof of the:
[TABLE]
It states that we can uniformly find PA proofs of the uniform reflection for bounded fragments of PA. In the proof, we assumed that the statement holds for the axioms in these fragments, and that arithmetical satisfaction predicates enjoy certain regularity properties. Below, we formulate these results in a precise manner and prove them.
Definition 76**.**
If is a valuation and is in the domain of , then by we mean a valuation which is the same as except for the variable whose value is . 2. 2.
If is a formula, then denotes the instantiation of the induction scheme (with parameters) with formula w.r.t. , i.e. the following formula
[TABLE]
For the sake of simplicity we assume that PA is axiomatized by induction scheme with free variables treated as parameters. This is in order to avoid taking universal closures of axioms. Let us observe that, living inside PA, we know that every object can be named by a closed term. The Proposition below says that for every formula being satisfied by a sequence is equivalent to the truth of the sentence .191919For the notation , recall Definition 3.
Convention 77**.**
For the sake of simplicity let us agree that saying that for every the family is uniformly feasible in means that there exists a P-time computable function such that for each , is PA proof of .
Proposition 78**.**
The following regularity properties for predicates are uniformly feasible in :
. 2. 2.
.
Proposition 79** (Essentially Pudlák, [17]).**
The following formulae are uniformly feasible in :
** 2. 2.
**
Now we prove that for every the truth of all PA axioms of induction of depth can be feasibly established in PA.
Proposition 80**.**
The following sentences are uniformly feasible in :
[TABLE]
Proof of Proposition 80.
For the purposes of this proof, we say that is small if . Let abbreviate the following formula:
" is a small formula such that is a free variable of and is an assignment for "
Moreover let abbreviate
,
Let . The idea is that encodes a sequence of parameters used in the induction and is the varying value assigned to the variable while proving via induction. We work in PA. We start with which is an axiom of length polynomial in (since is). Using a few transformations (their number is independent of ) we obtain
[TABLE]
Let us look at . Observe that by Proposition 78
is equivalent to and 2. 2.
is equivalent to .
Hence implies
[TABLE]
Now by compositional axioms for the above is equivalent to
[TABLE]
∎
6.2 Congruence lemma
We sketch the proof of the following lemma from Section 4.1.
Lemma 81** (Congruence lemma).**
For all , , it holds that
[TABLE]
Sketch of the proof.
We prove the lemma by induction on the complexity of (carried out in which we assumed to satisfy ). The only non-trivial step is the one for . Assume . Then . Take , which, by definition, is of the form . In replace all the occurrences of maximal terms in (i.e. the ones which do not occur within a term) which contain only free variables (in ) with fresh variables, without using the same variable twice. Then rename the free variables of the resulting formula according to the procedure adopted in condition of the definition of the term trivialization. In this way we obtain the term trivialization of both and . ∎
6.3 FACT
In Subsection 4.4, an alternative proof has been provided of Theorem 32 which says that if PA proves reflection over fragments of another theory , then is feasibly reducible to PA. The second proof of that theorem used the fact that arithmetized completeness theorem can proved with a proof of size polynomial in the size of the formula defining . This was stated as Lemma 70. In this subsection, we prove this result.
Lemma 82** (Feasible Arithmetized Completeness Theorem, FACT).**
There exists a polynomial and a P-time computable function such that for every , is a PA proof of the sentence
"If an -theory in a language is consistent then it has a full model"
Moreover, there exists a P-time computable function such that for any , is a PA proof of the sentence
"If is a -full model of a -theory , then is consistent."
Proof.
To prove the second part we show by induction on the lengths of proofs that any -full model (which, recall, is the same as a complete consistent Henkinized theory) is closed under reasoning in first order logic. This argument is carried out uniformly with the only difference that we use different feasible satisfaction predicates depending on the complexity of the model.
To prove the first part we follow the "leftmost branch" strategy. The proof is routine but we present it to be on the safe side. Assume that an -theory of depth is consistent. (Note that and might contain arbitrary parameters.) Note that we need not care about the rise in -complexity of the formula defining a model for as long as the construction of the relevant formula is uniformly feasible in . Let be the set of formulae of enriched with Henkin constants (we denote the Henkin constant for the formula with and assume that the function is ).
Step : finding a complete, consistent Henkin extension Let be defined as
[TABLE]
Here is of depth polynomial in (we may assume that the length, hence also the depth, of the formula defining is polynomial in ) and of length polynomial in the length of . We check that holds: each proof of from the axioms of can be transformed into proof of
[TABLE]
Then we check that the above is equivalent to
[TABLE]
which contradicts the consistency of . Note that the above argument is uniform in .
Let be any enumeration of . For any binary sequence of length let be the theory:
[TABLE]
enumerates first elements of , adding negation at the front of the -th element if . Let be the sentence saying:
"There exist the unique such that
and 2. 2.
and 3. 3.
"
Once again is of length polynomial in the length of and this polynomial does not depend on the initial choice of . We show that is a complete and consistent theory with Henkin sentences (which, according to our definitions is the same as a full model). The whole argument was carried out uniformly in and can be produced by a P-time function .
∎
6.4 A glossary of technical notions
This paper contains a fairly large number of technical definitions; here we enclose a glossary of such terms in order to assist the reader.
- •
means that is an assignment for a formula or a term (or for a set of terms or formulae ), i.e. is a function whose domain includes the free variables of (or whose domain includes free variables of all elements of ). See Definition 1 and Convention 4.
- •
means that is a closed term of a language , see Definition 1 and Convention 4.
- •
means that is a sequence of closed terms of a language , see Definition 1 and Convention 4.
- •
is an arithmetized consistency statement for . See Corollary 19.
- •
is the compositional theory of truth over PA, see Definition 44.
- •
is the compositional theory of truth over a theory B extending , see Definition 44.
- •
is the syntactic depth of a formula , see Definition 22.
- •
(elementary diagram of ) is the same as a full model , this notation is used when is viewed as a complete Henkinized theory, rather than a structure.
- •
means that is a formula of the language , see Definition 1 and Convention 4.
- •
means that is a formula of the language with at most one free variable, see Definition 1 and Convention 4.
- •
means that is a formula of the language with exactly one free variable, see Definition 1 and Convention 4.
- •
is the Friedman–Sheard self-referential theory of truth over PA without induction, see Definition 46.
- •
is the Friedman–Sheard self-referential theory of truth over a theory B extending without induction, see Definition 46.
- •
means that is a free variable of , see Definition 1 and Convention 4.
- •
means that is a coded sequence whose elements are (some) free variables of , see Definition 1 and Convention 4.
- •
is the Kripke–Feferman self-referential theory of truth over PA without induction, see Definition 45.
- •
is the Kripke–Feferman self-referential theory of truth over a theory B extending without induction, see Definition 45.
- •
is the length of a sequence , see Definition 1 and Convention 4.
- •
-model (full model) is a (full) model defined with a formula of depth .
- •
means that there exists a proof of in the theory . See Corollary 19.
- •
means that is a proof of from the theory . See Corollary 19.
- •
means that is a sentence of the language , see Definition 1 and Convention 4.
- •
means that is a term of the language , see Definition 1 and Convention 4.
- •
means that is a sequence of terms of the language , see Definition 1 and Convention 4.
- •
-theory is a theory defined with a formula of depth .
- •
means that is (an arithmetized) variable, see 1.
- •
means and are functions, is a variable and for all variables , possibly except for which also possibly belongs only to the domain of , see Definition 1.
- •
is a formula with the numeral substitued for every every occurrence of for every free variable of , see Definition 3.
- •
denotes the formula with the term substituted for the variable , see Definition 3.
- •
is the length of the shortest proof of in , see Definition 11.
- •
means that is an immediate subformula of , see the proof of Lemma 58.
- •
is the term trivialization of , see remarks preceding Lemma 61.
- •
means that , where are reducts of to the language , see Definition 30 or 7 and the subsequent remarks.
- •
is the value of term in which every free variable has been evaluated to , see Definition 3.
- •
means that the length of the shortest proof of in is not greater than , see Definition 11.
- •
for a theory means the set of axioms of of length at most , see Definition 31.
- •
is a numeral denoting , see Definition 2 and Convention 4.
- •
means that is the value of the term , see Definition 1 and Convention 4.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Andrea Cantini. Notes on formal theories of truth. Zeitshrift für Mathematische Logik Und Grundlagen der Mathematik , 35(1):97–130, 1989.
- 2[2] Cezary Cieśliński. The Epistemic Lightness of Truth: Deflationism and its Logic . Cambridge University Press, 2018.
- 3[3] Cezary Cieśliński, Mateusz Łełyk, and Bartosz Wcisło. Models of PT − superscript PT \textnormal{PT}^{-} with internal induction for total formulae. The Review of Symbolic Logic , 10(1):187–202, 2017.
- 4[4] Ali Enayat. Question 1 in ”a list of open problems”. submitted during the conference Model Theory and Proof Theory of Arithmetic .
- 5[5] Ali Enayat and Albert Visser. New constructions of satisfaction classes. In Theodora Achourioti, Henri Galinon, José Martínez Fernández, and Kentaro Fujimoto, editors, Unifying the Philosophy of Truth . Springer-Verlag, 2015.
- 6[6] Solomon Feferman. Reflecting on incompleteness. The Journal of Symbolic Logic , 56(1):1–49, 1991.
- 7[7] Martin Fischer. Truth and speed-up. The Review of Symbolic Logic , 7(2):319–340, 2014.
- 8[8] Harvey Friedman and Michael Sheard. An axiomatic approach to self-referential truth. Annals of Pure and Applied Logic , 33:1 – 21, 1987.
