The Epsilon Calculus with Equality and Herbrand Complexity
Kenji Miyamoto, Georg Moser

TL;DR
This paper analyzes the complexity bounds of Herbrand disjunctions in epsilon calculus with equality, extending previous results and providing new upper and lower bounds for Herbrand's theorem.
Contribution
It offers the first detailed complexity analysis of Herbrand disjunctions in epsilon calculus with equality, building on and extending prior work without equality.
Findings
Established upper bounds for Herbrand disjunction length with equality.
Derived lower bounds matching the upper bounds, showing tight complexity estimates.
Extended the complexity analysis from epsilon calculus without equality to include equality.
Abstract
Hilbert's epsilon calculus is an extension of elementary or predicate calculus by a term-forming operator and initial formulas involving such terms. The fundamental results about the epsilon calculus are so-called epsilon theorems, which have been proven by means of the epsilon elimination method. It is a procedure of transforming a proof in epsilon calculus into a proof in elementary or predicate calculus through getting rid of those initial formulas. One remarkable consequence is a proof of Herbrand's theorem due to Bernays and Hilbert which comes as a corollary of extended first epsilon theorem. The contribution of this paper is the upper and lower bounds analysis of the length of Herbrand disjunctions in extended first epsilon theorem for epsilon calculus with equality. We also show that the complexity analysis for Herbrand's theorem with equality is a straightforward…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsLogic, programming, and type systems · Polynomial and algebraic computation · Advanced Combinatorial Mathematics
The Epsilon Calculus with Equality and Herbrand Complexity
Kenji Miyamoto and Georg Moser
[email protected], [email protected]
University of Innsbruck
Abstract
Hilbert’s epsilon calculus is an extension of elementary or predicate calculus by a term-forming operator and initial formulas involving such terms. The fundamental results about the epsilon calculus are so-called epsilon theorems, which have been proven by means of the epsilon elimination method. It is a procedure of transforming a proof in epsilon calculus into a proof in elementary or predicate calculus through getting rid of those initial formulas. One remarkable consequence is a proof of Herbrand’s theorem due to Bernays and Hilbert which comes as a corollary of extended first epsilon theorem. The contribution of this paper is the upper and lower bounds analysis of the length of Herbrand disjunctions in extended first epsilon theorem for epsilon calculus with equality. We also show that the complexity analysis for Herbrand’s theorem with equality is a straightforward consequence of the one for extended first epsilon theorem without equality due to Moser and Zach.
Keywords. Hilbert’s epsilon calculus, epsilon theorems, Herbrand complexity, proof complexity
1 Introduction
Hilbert’s epsilon calculus is an extension of predicate calculus by the -operator which forms for a formula a term . This operator is governed by the following two initial formulas: One is the critical formula
[TABLE]
where is an arbitrary term, and the other is the -equality formula
[TABLE]
where and are sequences of terms and and stands for the conjunction of , , , and for an arbitrary positive natural number , and the proper subterms of are only variables . Pure epsilon calculus is an extension of elementary calculus by the -operator and the critical formula. The -operator is expressive enough to encode the existential and universal quantifiers, so that they are definable as and within the epsilon calculus.
The epsilon calculus was originally developed in the context of Hilbert’s program. Early work in proof theory (before Gentzen) concentrated on the epsilon calculus, the -elimination method, and the -substitution method, and those results were carried out by Bernays [HB39] (see also [Zac03, Zac04, MZ06]), Ackermann [Ack25, Ack40] (see also [Mos06]), and von Neumann [vN27]. The correct proof of Herbrand’s theorem was first given by means of epsilon calculus [Bus94]. The theorem is commonly stated in a less general way as follows than the original: If there is a proof of a prenex existential formula for quantifier-free in predicate calculus, there is a proof of for some terms in elementary calculus. The epsilon calculus is of independent and lasting interest, however, and a study from a computational and proof-theoretic point of view is particularly worthwhile.
In the course of proving epsilon theorems and Herbrand’s theorem, the -elimination method is used to proof-theoretically transform a proof in epsilon calculus into a proof which is free from the above mentioned initial formulas. Assume there is a proof of in pure epsilon calculus, where is a finite sequence of terms possibly with occurrences of -terms, then the -elimination method generates another proof of the disjunction in elementary calculus, where are terms without the -operator. The disjunction is a so-called Herbrand disjunction for the formula , and the aim of this paper is analyses of the Herbrand complexity which is the length of the shortest Herbrand disjunction for the original formula. This paper extends the Herbrand complexity analysis by Moser and Zach [MZ06]. Their result tells us that the Herbrand Complexity of a formula is based on the proof measure speaking only about the first-order counterpart of a proof of . While they have dealt with the systems of epsilon calculus without the -equality formula, we target epsilon calculus with the -equality formula and study the upper and lower bounds analysis of the Herbrand complexity for the system with the -equality formula. Our contribution is divided into two parts. The first one is a complexity analysis for Herbrand’s theorem in first-order logic with equality. In this case, we can avoid to rely on epsilon calculus with the -equality formula, hence the result by Moser and Zach is directly applicable. The second one is the upper and lower bounds analyses for extended first epsilon theorem with the -equality formula, where the upper bound analysis depends on a measure concerning the structure of critical formulas as well as the measure for first-order ingredients of a proof.
Hilbert’s epsilon calculus is primarily a classical formalism, and we will restrict our attention to classical first-order logic. For non-classical approaches to epsilon calculus, see the work of Bell [Bel93a, Bel93b], DeVidi [DeV95], Fitting [Fit75], Mostowski [Mos63], and Shirai [Shi71]. Our study is also motivated by the recent renewed interest in the epsilon calculus and the -substitution method in, e.g., the work of Arai [Ara03, Ara05], Avigad [Avi02], Baaz et al. [BLL18], and Mints et al., [MT99, Min03]. The epsilon calculus also allows the incorporation of choice construction into logic [BG00]. The treatment of eigenvariables in the context of unsound proofs and its relation to the epsilon calculus is studied by Aguilera and Baaz [AB16]. On the semantics of epsilon calculus, see the work of Zach [Zac17].
The rest of this paper is organized in the following way. Section 2 describes the syntax of epsilon calculus without the -equality formula, Section 3 shows the embedding lemma which states that predicate calculus is a subset of pure epsilon calculus without the -equality formula. A complexity analyses of Herbrand’s theorem for a prenex existential formula comes as a simple consequence of the lemma. In Section 4, the system is extended by the -equality formula, which makes the identity schema true within the system. Section 5 clarifies the subtlety of complexity analyses of a system with equality through Yukami’s trick [Yuk84]. In Section 6 we review first and second epsilon theorems following the proof by Bernays. Section 7 and Section 8 are devoted to analysing the upper and the lower bounds, respectively, where Section 7 describes our complexity analysis for extended first epsilon theorem and the upper bound of the Herbrand complexity. Section 9 concludes this paper.
2 Epsilon Calculus
We start from defining terms and formulas of our logic. As a convention we assume range over a set of bound variables, over a set of free variables, over a set of function symbols, and over a set of predicate symbols. The symbol is reserved for the equality predicate. Each function symbol and predicate symbol has an arity, and , , , , and are disjoint. We abbreviate as and let denote its length . We define terms, formulas, and free variable occurrences. Notice the difference between free variables and free variable occurrences.
Definition 2.1** (Term and formula).**
Raw terms* and raw formulas , are simultaneously defined as follows.*
[TABLE]
Sets of free variable occurrences and are simultaneously defined, assuming , , and .
[TABLE]
A raw term is a semiterm if , and is a term if . A raw formula is a semiformula if , and is a formula if . A (semi)formula and a (semi)term are quantifier free in case neither , nor occurs in them.
We abbreviate as and also as , and the same convention applies to . Terms of the form is called -terms.
Definition 2.2** (Substitution).**
Assume and . For (semi)terms , (semi)formulas , and variables , the substitution is defined as follows.
[TABLE]
where and is fresh.
We can write for a formula with a free variable , and then is abbreviated as . This notation is extended through the vector notation and the simultaneous substitution. We employ the same for terms.
Definition 2.3** (-equivalence).**
We define the -equivalence for (semi)terms and (semi)formulas as follows.
[TABLE]
We also define the term substitution for (semi)terms through the -equivalence instead of the equality on variables, and the simultaneous substitution.
Definition 2.4** (Set induced by vector).**
For any vector , a set is defined to be via . We say a list of vectors is a split of if and for .
Definition 2.5** (Equality).**
The following formulas are referred to by .
[TABLE]
Definition 2.6** (Elementary calculus and predicate calculus).**
The system of elementary calculus is denoted by , where its initial formulas are propositional tautologies and its inference rule is modus ponens given as follows.
[TABLE]
The system of first-order predicate calculus is denoted by , where the initial formulas are propositional tautologies and the following formulas and .
[TABLE]
The inference rules of are modus ponens and the following and , where the eigenvariable may not occur in any formula in the axiom .
[TABLE]
* and extended by the initial formulas are called and , respectively. We alternatively say and for them.*
Definition 2.7** (Epsilon calculus).**
Let a formula of the form
[TABLE]
where is an arbitrary term and is a formula containing , be a critical formula, and we define the systems and by extending and by taking such critical formulas as initial formulas. We say is the critical -term of the critical formula and the critical formula belongs to .
Definition 2.8** (Proof).**
Let be a system which consists of initial formulas and inference rules, and assume a set of formulas which we call axioms. A list of formulas is a proof in from , if each formula is an initial formula of , a formula in , or a consequence of an inference rule of referring to preceding formulas in the proof. We write if and only if a formula is the last formula of the proof in system from . We omit if it is empty and if there is no confusion. An inference rule consists of one consequence and assumptions, and may be displayed using a horizontal line.
Definition 2.9** (Languages).**
Let the language be formulas and terms in Definition 2.1 and the language be without the universal and existential quantifiers. We denote by and the sublanguages of and without the -operator, respectively. Also, and are the sublanguages of and without the equality symbol, respectively. Finally, and are the sublanguages of and without the equality symbol, respectively.
We give two examples of -proofs. These formulas in the examples are meant to be -calculus versions of the independence of premise and the drinker’s formula. See also Example 3.2 and Example 3.3.
Example 2.10**.**
Consider the following formula in .
[TABLE]
This formula (1) is an instance of the critical formula, hence a proof of (1) is given as follows.
[TABLE]
Example 2.11**.**
Consider the following formula in .
[TABLE]
An -proof of this formula (2) is given as follows.
[TABLE]
We conclude this section with the following basic results.
Theorem 2.12** (Deduction theorem).**
Assume is a closed formula. iff in and in .
Lemma 2.13** (Identity schema).**
For any formula and terms in , holds for .
Proof.
By induction on the size of . ∎
Note that the above identity schema is not available in and , if the language is extended to and , respectively. In Section 4, we deal with epsilon calculus with the -equality formula, within which the identity schema is recovered for and .
3 Embedding Lemma
Hilbert introduced the epsilon operator to encode quantifiers, so that predicate calculus goes to elementary calculus extended with the critical formula. This section describes this encoding of within . The idea is to define the quantifiers by -operator as follows, and recursively apply them.
[TABLE]
Definition 3.1** (-translation).**
For a (semi)term and a (semi)formula we define its -translation and . Let stand for .
[TABLE]
Example 3.2**.**
Here is the formula of independence of premise in ,
[TABLE]
whose -translation is the formula (1) in Example 2.10.
Example 3.3**.**
Here is the drinker’s formula in ,
[TABLE]
whose -translation is the formula (2) in Example 2.11.
Remark 3.4**.**
The above two examples also show that the -translation of a formula which is not provable in intuitionistic logic can be provable in without using any classical propositional tautology.
Definition 3.5** (Regular proof).**
A proof is regular if each eigenvariable in the proof is used by at most one or .
Definition 3.6** (Proof size).**
The size of a proof is the length of the list.
If there is a proof, a regular one is always available and whose size is polynomially bounded to the original non-regular proof. This fact comes by the following theorem due to Krajíček [Kra94].
Theorem 3.7**.**
Let and be the size of the smallest sequence-proof and tree-proof of a provable first-order formula in the Hilbert style calculus, respectively. Then there exists a polynomial such that for every provable first-order formula .
In the rest of this paper we implicitly assume the regularity of proofs.
Definition 3.8** (Critical count).**
For a proof , we let be the number of critical formulas, , and in .
Lemma 3.9** (Embedding).**
Assume for a formula , then for some with .
Proof.
We refer to the formula at line in the proof by . By induction on the length . If it is trivial. We prove the case , making case analysis how the formula at the line is derived. In case it comes by modus ponens using and , which is of the form , for , by the induction hypotheses there are proofs and concluding and , respectively, hence by modus ponens. In case is derived by , is of the form for . As , it suffices to substitute for throughout the proof of which is due to the induction hypothesis. Here we assumed the regularity of the proof. In case is derived by , is of the form and for . As , it suffices to use modus ponens with a critical formula and , which comes by induction hypothesis. In case is by , is of the form and hence we prove , whose contrapositive is a critical formula. In case is by , is of the form and hence we prove that is immediate as it is a critical formula. The rest is the axioms. The rest is the cases for propositional tautologies and , which are all trivial. ∎
Theorem 3.10** (Herbrand’s theorem).**
Assume is a prenex existential formula in , namely, is quantifier free, and
[TABLE]
Then there are -free terms for such that
[TABLE]
Proof.
Assume is the -proof of . By means of Lemma 3.9, there is an -proof of , which is namely for some -terms , then the conclusion follows from extended first epsilon theorem for (cf. Theorem 16 in [MZ06]) with being propositional tautologies. ∎
4 Epsilon Calculus with the -Equality Formula
Epsilon calculus with equality was originally introduced by Hilbert [HB39]. Assuming is an -matrix, he formulated the -equality formula as follows.
[TABLE]
In this section we adopt a variant of the -equality formula which is given as follows via the vector notation.
[TABLE]
Then we define our system of epsilon calculus with equality to be extended with the initial formula (4). The -elimination method and the proofs of epsilon theorems for can be simpler than the ones for the original system by Hilbert. While the notion of closures is crucial in Hilbert and Bernays’ work, we do not need this notion in . Moreover, concerning the hyperexponential part of the upper bound analysis of the Herbrand complexity, our result for is better than the one for the system with (3), as it will be shown in Section 7.3.
Definition 4.1** (-matrix and semicolon notation).**
An -term is an -matrix iff each proper subterm of is a free variable and each free variable in occurs exactly once. The -matrix and its immediate subsemiformula can be denoted as and as , respectively, if and only if is an -matrix with its free variables . We call the free variables of an -matrix its parameters.
Conventionally, we let range over -matrices, possibly with its parameters explicitly denoted as . For any -term, its -matrix is uniquely determined modulo free variable names. If is a critical -term, the -matrix of is called a critical -matrix.
Definition 4.2** (Arity of -matrix).**
For an -matrix , we define its arity to be . Let be the maximal arity of critical -matrices of rank in , and be .
Lemma 4.3**.**
If is an -term, then for some -matrix and .
Proof.
Let be all the immediate subterms of and be fresh variables, so that . Then, is the -matrix . ∎
The epsilon calculus with equality by Hilbert and Bernays also employs the -equality formula as an initial formula.
Definition 4.4** (Epsilon calculus with the -equality formula).**
Let and be and extended with the following additional initial formula, respectively,
[TABLE]
where and are term vectors of the same length as of the parameters of -matrix . A formula of the form is an -equality formula, where and are called the critical -terms of the -equality formula. We also say that the -equality formula belongs to and to .
According to the semicolon notation, the -equality formula always belongs to critical -terms which were formed by applying substitutions to an -matrix . The next section details this constraint from a perspective of complexity analysis. Due to the -equality formula, the identity schema is available in .
Lemma 4.5** (Identity schema).**
Let a formula and terms be in , then . The same holds for in .
Proof.
By induction. ∎
We further define means of measuring complexity of terms and proofs, which are used in the next sections to study procedures of eliminating critical -terms. The rank counts the depth of nesting -semiterms, while the degree counts the depth of nesting -terms. Here we suppose that .
Definition 4.6** (Rank).**
We define the rank for a (semi)term .
[TABLE]
where subordinates iff and is a subsemiterm of . We define be , where are the critical -terms in .
The rank is stable against substitutions.
Lemma 4.7**.**
For any terms and , .
Proof.
Comparing with , nothing new is subordinating in due to the substitution of for , hence it is obvious from Definition 4.6. ∎
Lemma 4.8**.**
For an -matrix , .
Proof.
By induction on the construction of . ∎
Definition 4.9** (Degree).**
For a (semi)term , we define its degree .
[TABLE]
Definition 4.10** (Maximal critical -term).**
Let maximal critical -terms of a proof be the set of critical -terms of the greatest degree among the set of critical -terms of the greatest rank in a proof .
We conclude this section by defining measures for the proof complexity based on critical -terms, -matrices, critical formulas and -equality formulas.
Definition 4.11** (Order).**
For a proof , the number of distinct critical -terms of rank in is denoted by which we call the order, and the number of distinct -matrices is defined in the same manner and denoted by which we call the matrix order.
Definition 4.12** (Width).**
Define and by the number of distinct critical formulas belonging to in and of distinct -equality formulas belonging to in , respectively. The width is defined to be . Let be critical -terms in , then the maximal width is defined to be .
In order to measure the number of -equality formulas, we replace the notion of the critical count in Definition 3.8 by the following one.
Definition 4.13** (Critical count).**
Assume is a proof in or . The critical count of is defined to be the sum of the numbers of critical formulas, -equality formulas, , and in . We let and be the numbers of critical fomulas and of -equality formulas in , respectively.
5 Yukami’s Trick
In this short section, we clarify the need for the restriction to -matrices in the definition of -equality axioms, cf. (3) and (4) (see also Definition 4.4).
For the sake of the argument we assume, for the duration of this section only, that the restriction to -matrices is dropped. We focus on the above formulation of -equality axioms, using vector notation, as expressed in (4). However, the below given argument is equally valid for Hilbert’s original definition (if we drop the restriction to -matrices).
We will employ Yukami’s trick [Yuk84] together with folkore results in structural proof theory [Bus98, Pud98]. For additional insight into the proof theoretic strength of applications of identiy schema, see [BF01].
Theorem 5.1** (cf. [Yuk84]).**
Using two instances of the following restricted scheme of identity
[TABLE]
we can uniformly derive , from (i) , (ii) , and (iii) .
Proof.
Let where is fully indicated. Let , where in refers only to the innermost occuring term . The following equalities can be easily derived (employing in addition suitable instance of the transitivity axiom (ii) and axiom (i))
[TABLE]
if we employ the instances of (5)
[TABLE]
and
[TABLE]
Hence we have derived . Eventually, to obtain the desired result, we apply axiom (iii), as is nothing else than ( is indicated by above).
Note that the derivation is uniform for any : while for any the proof slightly differs, the number of steps and in particular the critical count is constant. ∎
Remark 5.2**.**
Using induction one can derive (5) uniformly from (i) and (ii) . That is Yukami’s trick is available in any suitable rich arithmetical theory.
The next result clarifies that the restricted identity axioms employed in Yukami’s trick are uniformly derivable if no additional restriction on the form of the -terms are enforced in (4). Let denote the extension of the -calculus with the following axioms to cover -equality:
[TABLE]
where , denote (arbitrary) -terms.
Lemma 5.3**.**
The following identity schema, generalising (5), is derivable in :
[TABLE]
where is an aribrary term in .
Proof.
Let . Consider the following two critical axioms:
[TABLE]
Thus derives (i) as well as (ii) . We exploit the following -equality axiom in
[TABLE]
Assuming , we can thus derive (within ) . Due to (i) and (ii) and equality axioms , we thus obtain in as claimed. It is important to emphasise, that the -term employed in (6) is not an -matrix. ∎
Before we can employ Yukami’s trick and the above lemma, we need some preparatory definitions and results.
Let be a theory. We say admits Herbrand’s theorem if whenever , with quantifier-free, then there exists a finite sequence of terms such that .
Let be axiomatised by purely universal formulas. Then it is well-known that admits Herbrand’s theorem, cf. [Bus98]. Due to Theorem 3.10 we can even conclude the existence of a function such that , where denotes the critical count of the proof of .
The next result improves upon this, in the sense that we also bound the term complexity of the sequence of terms in the critical count. Let denote the depth of any term , defined in the usual way. Futher let denote the formula complexity of an formula in the language of . A variant of the following result is due to Krajicek and Pudlak (see [KP88]).
Theorem 5.4**.**
Suppose is a universal theory such that so that the underlying equational theory of (if any), has positive unification type. Then there exists a primitive recursive function and a finite sequence of terms such that , where .
Proof.
Wlog. we assume that is axiomatised by quantifier-free formulas. As is provable in , there exists a conjunction of (quantifier-free) axioms in such that . By the above, we conclude the existence of terms and a primitive recursive function with , such that
[TABLE]
is a consequence of the underlying equational theory of , if any. As above, denotes the critical count of . It remains to prove the existence of the bounding function , bounding not only the number of terms , but also their term depth.
It is not difficult to formulate the property that the formula (7) is quasi-tautology as a unification problem over the corresponding equational theory, cf. [KP88, Pud98]. As is solvable there exists a most general unifier such that is a quasi-tautology, that is follows from the equational theory of . This follows as unification has a positive unification type by assumption. For each term in the range of , is bounded by a (monotone) function , depending only on the input to the unification problem, that is, the length of the term sequence and the formula complexity of .
Finally, it is easy to see how to define a bounding function such that (i) and (ii) . ∎
We say a theory that admits bounded Herbrand complexity if whenever , with quantifier-free, there exist terms such that and is bounded by a function depending only on the formula complexity of and the number of steps in . Note that according to the theorem any universal theory so that the underlying equational theory of (if any), has positive unification type admits bounded Herbrand complexity.
The next results clarifies that no theory can exists that admits bounded Herbrand complexity, while at the same time deriving the assumptions of Theorem 5.1.
Corollary 5.5**.**
Let be a universal theory whose axioms include (i) , (ii) , and (iii) and let the equational theory of (if any) be of positive unification type.
Such a theory cannot admit bounded Herbrand complexity and at the same time derives the restricted identity schema (5).
Proof.
Suppose to the contrary, a theory exists whose underlying equational theory has positive unification type. Moreover admits bounded Herbrand complexity and derives the assumed axioms. Thus due to Theorem 5.1, uniformly derive for all . Further for any , there exists a finite set of (universally quantified) axioms of , in particular including axioms (i)—(iii), such that
[TABLE]
is provable in with proofs of constant critical count. Arguing as in the proof of Theorem 5.4, employing the assumption that that admits bounded Herbrand complexity, we conclude the existence of terms of bounded depth such that
[TABLE]
is a quasi-tautology. As the depth of the terms is bounded, while is unbounded. This is absurd. Contradiction to the assumption that that admits bounded Herbrand complexity. ∎
Finally, we arrive at the main result of this section, emphasising the need for restricting the use of -matrices in the epsilon calculus with equality, cf. Definition 4.4.
Theorem 5.6**.**
Let be finitely axiomatised by the axioms (i) , (ii) , and (iii) and formalised over . Then cannot admit bounded Herbrand complexity.
Proof.
Suppose to the contrary that admits bounded Herbrand complexity, that is, whenever , with quantifier-free, there exists terms such that and is bounded by a function depending only on the formula complexity of and the number of steps in .
Further, as is axiomatised over , derives the restricted identity schema (5), cf. Lemma 5.3,
Finally, as the equational theory of is restriced to syntactic equality, the corresponding unification type is and the most general unifier of any unification problem is uniquely defined. But this contradicts Corollary 5.5, stating that no such theory can exists. ∎
6 First and Second Epsilon Theorems
Assume there is a proof in of a formula which is free from bound variables. First epsilon theorem states that there is an -proof of the same formula . The proof of the theorem is due to the epsilon elimination method, which is to replace critical -terms in a given -proof by other terms, maintaining the correctness of the proof, so that all critical formulas and -equality formulas in the proof are eliminated. Before we go into the general case, we sketch how epsilon elimination works through very simple examples. The first part is for the case only a critical formula belongs to a critical -term, and the second one is the case both a critical formula and an -equality formula belong to a critical -term.
Example 6.1**.**
Consider an -proof of the bound variable free formula involving only one critical formula of the following form.
[TABLE]
We eliminate this critical formula by generating two proofs of and of as follows. Assume is an axiom and replace all occurrences of throughout the proof by , then the above formula goes to , which is provable as is our axiom. All the other formulas in are either propositional tautology or modus ponens, hence we get a proof of without using any axiom. On the other hand, assuming is an axiom, the above critical formula is proved due to ex falso quodlibet, hence we get a proof of . Composing the above two proofs by means of excluded middle , we obtain a proof of without a critical formula, hence it is an -proof of .
Example 6.2**.**
Consider another -proof of the bound variable free formula involving one critical formula and one -equality formula of the following form, assuming and are -free terms.
[TABLE]
We first eliminate this -equality formula by generating two proofs concluding and , where we still use a critical formula which is eliminated due to the above argument. Assume is an axiom and replace throughout the proof by . The critical formula goes to , where , which is proved by means of another critical formula and the identity formulas and from the axiom . The -equality formula goes to which is trivially true, and the identity formulas are trivially true, too. As all the other formulas are either propositional tautology or modus ponens, hence we eliminate the critical formula to have an -proof of . On the other hand, assuming is an axiom, the above -equality formula becomes trivially provable, hence by the previous argument we get an -proof of . Composing those two proofs using excluded middle , we get an -proof of involving a critical formula, which is eliminated by the previous argument.
In the rest of this section, we address the general case, namely, the epsilon elimination method for proofs arbitrarily involving critical formulas and -equality formulas. In case there are at least two different critical -terms in a proof, we have to give a right order to eliminate the critical -terms, so that the above strategy works successfully. The following scenario illustrates an unsuccessful case. Consider a proof involving two different critical formulas.
[TABLE]
If we first try to eliminate by substituting , the second formula goes to the following non-critical formula which is in general not provable.
[TABLE]
The following lemmas tell us about substitutions for a non-critical -term in critical and -equality formulas. As a consequence, by eliminating a critical -term of the greatest rank, critical and -equality formulas not belonging to are kept to be critical and -equality formulas after replacing by any term.
Lemma 6.3**.**
Assume is an -term, is a substitution where is some term, and is a critical formula with . If does not belong to , is a critical formula of the rank .
Proof.
Let be . We first show that neither nor is a proper subterm of . If for some -term , for some . Then, is subordinating the critical -term and hence , which is contradictory. If for some -term , holds, which is contradictory. As the occurrence of in is as subterms among and , is which is a critical formula of the rank . ∎
Lemma 6.4**.**
Assume is an -term and is an -equality formula. If does not belong to , the formula for any term is an -equality formula of rank .
Proof.
Let be . As and , the substitution can change only , hence is , which is an -equality formula. ∎
On the other hand, elimination of one critical -term may increase the number of different critical -terms. It would be a problem if it would increase particularly the number of ones of the greatest rank, because then the termination of our procedure becomes a concern. Assume a proof involving the following critical formulas belonging to the two different critical -terms of the greatest rank.
[TABLE]
If we try to eliminate first, we afterwards have three different critical -terms, , , and , which is more than the number we had. We eliminate a critical -term of the greatest degree among -terms of the greatest rank, in order not to change any subterm of critical -terms of the greatest rank.
Lemma 6.5**.**
Let be a critical formula belonging to and be a substitution for an -term with and a term . If , is a critical formula belonging to . The same holds for an -equality formula belonging to and a substitution for an -term with and for .
Proof.
We prove that does not have an occurrence in . Suppose to the contrary that has an occurrence in , which contradicts the assumption . The case of -equality formulas is trivial. ∎
By eliminating a maximal critical -term, our -elimination method illustrated so far successfully decrease the number of different critical -terms. By repeating this procedure, we can eliminate all critical -term of the greatest rank and decrease the rank of the proof. Finally, we eliminate all critical -terms to obtain an -proof.
Lemma 6.6**.**
Assume for -free formula and let be . If is a maximum critical -term in and no -equality formula belongs to , for some such that and .
Proof.
Let be and be . All the critical formulas belonging to in can be listed by for . We form the proofs and for such that and , from which is derivable by propositional calculus. In order to make , we assume , then is provable from without those critical formulas belonging to , hence we get by Theorem 2.12. In order to make , we first replace in by , then prove for each by assuming . We use modus ponens to the tautology and . Assume is a proof of obtained by the above procedure, then it remains to prove that and . By the construction, critical formulas belonging to in don’t remain in . For any critical -term in , implies that critical and -equality formulas belonging to in are critical and -equality formulas of the same rank in due to Lemma 6.3 and Lemma 6.4. For any critical -term in , and imply that critical and -equality formulas belonging to are not affected by the substitutions for due to Lemma 6.5. Therefore, the critical -term does not occur in anymore, , and . ∎
Note that for , may be larger than and also maximal degrees of critical and -equality formulas of rank below in may be strictly larger than the corresponding original ones in . Also, may be larger than , because each premise of critical formulas may be changed by the substitutions. As we constantly decrease the order at the greatest rank, the termination is still guaranteed. We now study the epsilon elimination method for . We use the following lemma on the identity formula.
Lemma 6.7**.**
Assume has exactly one occurrence of each and for each , if it is a subterm of some term , holds. For any term there exists and , such that and .
Proof.
Let for be the immediate subterms of where the list of vectors is a split of and . For some , . Assume and are the subvectors of and corresponding to . By induction on the construction of , there is a proof of such that and for each . In case is a variable , it is trivial. In case is a function term where the list is a split of , for each , there is a proof of such as and by induction hypothesis. The claim follows by as and . In case is an -term where is an -matrix, for each , there is a proof of such as and by induction hypothesis. The claim follows by -equality formulas as and . There is due to without further use of critical nor -equality formulas by induction on , so that and . ∎
We deal with the case both critical formula and -equality formula belong to the same critical -term. By the next lemmas, we replace such a critical formula and an -equality formula, so that we obtain a proof whose order of the greatest rank is strictly smaller than the original one.
Lemma 6.8**.**
Let be an -matrix, and assume . There is an -proof from the assumption such that it concludes for arbitrary terms for each , and moreover the following conditions hold: , , and .
Proof.
For each , by Lemma 6.7. On the other hand, as it is a critical formula, and also by Lemma 6.7, hence we obtain using the deduction theorem. As , , , and which comes from the number of critical formulas belonging to . ∎
Lemma 6.9**.**
Let be an -matrix, and assume and . There is an proof such that , , , and .
Proof.
Assuming and , holds. By -equality formula and modus ponens, . Then we use Theorem 2.12. ∎
Lemma 6.10**.**
Assume for a quantifier-free and is a maximum critical -term in , then for some where for and .
Proof.
If there is no -equality formula belonging to , we apply Lemma 6.6. Otherwise, assume is of the form , and let be an -equality formula in for where . We form the proofs of and of for , from which is derived, in the following procedure. Assuming , all the -equality formulas belonging to are provable in propositional calculus. If there is no critical formulas belonging to , we already have , and otherwise we further apply Lemma 6.6 to get . Concerning , assume and substitute for throughout . Then, critical formulas of the form in goes to which is provable by Lemma 6.8. On the other hand, each -equality formula is provable in propositional calculus, and the other -equality formulas for is provable by Lemma 6.9. Let be a proof obtained by means of propositional calculus from and . Since all the critical -term in have been removed and the substitutions don’t make any other critical -terms which is different from be same as , there is no critical -term in . It remains to prove that the obtained proof satisfies for and . As it is apparent for , we consider each . Although Lemma 6.8 introduces -equality formulas belonging to a new -term, those terms are of rank strictly below . Any critical formula of rank in each belongs to , which is of the same rank , occurs in , and is distinct from . The -equality formulas of rank used in Lemma 6.9 belong to some critical -term of rank in which is different from . Therefore, and hold. ∎
Repeatedly using the results so far, the rank of the proof is diminished.
Lemma 6.11**.**
Assume for a quantifier-free and is a maximum critical -term in , then for some where .
Proof.
We make a sequence of proofs for , where and . If no -equality formula belongs to the maximal critical -term of , let be a proof obtained by applying Lemma 6.6 to , and otherwise let be a result of applying Lemma 6.10 to . As in any case the order is decreasing, and hence , therefore we let be . ∎
Theorem 6.12** (First epsilon theorem).**
If is a formula in and , then .
Proof.
Assume . We make a sequence of proofs for , where . In case , let be a proof obtained by applying Lemma 6.11 to , and otherwise let be . Then is the -proof of . ∎
We conclude this section, giving the statement of Second epsilon Theorem [HB39]. The proof is given due to the -elimination method without anything new.
Theorem 6.13** (Second epsilon theorem).**
If is a formula in and , then .
7 Extended First Epsilon Theorem
In contrast to Section 6, we consider the case the goal formula involves -terms, which leads us to extended first epsilon theorem. Assume an -proof of a formula is given, where may contain -terms. Eliminating critical formulas and -equality formulas, we obtain an -proof of a quantifier free formula for some terms , which is called the Herbrand disjunction. We say that the Herbrand complexity of is the smallest length of such a Herbrand disjunction, which is denoted by . We firstly study the -elimination method to prove extended first epsilon theorem without considering the Herbrand complexity, then the complexity analysis follows to compute an upper bound of the Herbrand complexity.
7.1 Proof of extended first epsilon theorem
We start from describing how to eliminate maximal critical -terms. In case there is a critical formula belonging to a critical -term of the maximal rank, we follow the -elimination method described in Section 6 for [HB39]. Otherwise, only -equality formulas belong to critical -terms of the maximal rank. Then we substitute a function symbol for the -matrix corresponding to the -term, in order to find a better upper bound of the Herbrand complexity.
The following lemma is for the first case, i.e. a critical formula is involved.
Lemma 7.1**.**
Assume for a formula and terms . If is the maximal -term of , there is a proof with and such that for .
Proof.
Assume is the maximal critical -term in and for and for occur in as -equality and critical formulas, resp. We form proofs of where and for is a result of replacing in by , and of , where is a result of replacing in by . To get the former proof, assume , then all the -formulas belonging to are eliminated due to ex falso quodlibet. By -elimination for , remaining critical formulas belonging to are eliminated and we get a proof of . As is maximal -term, and for are not affected by the substitution, hence follows. To get the latter proof for such that , assume and substitute in for . Due to Lemma 6.9, the modified -equality formulas in are provable after the substitution. Due to Lemma 6.8, the changed critical formulas which belonged to in are provable after the substitution. We obtain combining and , where there is no critical -term belonged to by a critical formula nor an -equality formula, , and there is no new critical -term of rank belonged by an -equality formula. ∎
Repeating the above lemma, we obtain a proof of a strictly smaller rank which concludes a Herbrand disjunction.
Definition 7.2** (Critical ranks).**
For a proof , define the set of the ranks of critical formulas by \{\mathsf{rk}(e)\mid\text{a critical formula belongs to e}\}.
Lemma 7.3**.**
Assume for and . If there is a critical formula of the rank , there is a proof such that , , and where for some .
Proof.
At most times applications of Lemma 7.1 eliminate all the critical -terms of rank in . ∎
The function symbol substitution is applicable for eliminating -equality formulas, provided no critical formula belongs to any -term of those -equality formulas.
Definition 7.4** (Function symbol substitution).**
Assume is a critical -term of the form , where is the -matrix of , and let be a function symbol of the arity which is uniquely assigned to . The substitution is the function symbol substitution for .
We are particularly interested in using the function symbol substitution for the case only -equality formulas are of the maximal rank.
Lemma 7.5**.**
Assume for and for -terms . If only -equality formula is of rank , there is an -proof of for some terms such that , , and . Moreover for any , , and .
Proof.
Let be the rank . We repeatedly replace the maximal critical -term of rank through the corresponding function symbol substitution. After times of replacements, there is no more critical -term of rank and the process terminates. After the substitutions, each -equality formula of rank in is an identity formula for a function symbol . Each critical (and -equality respectively) formula of a rank strictly smaller than is another critical (and -equality respectively) formula which belongs to the same critical -term after the substitution due to Lemma 6.5, hence what we obtained is for some terms an -proof of with the stated conditions. ∎
Lemma 7.6**.**
Assume for and . There is an -proof of the formula for such that and . Moreover, is empty or .
Proof.
Assume , namely, there are and for each , there are -equality formulas of rank in . We apply Lemma 7.5 for -times. ∎
Remark 7.7**.**
Even in case there are both critical and -equality formulas of the maximal rank, it is still possible to make use of the function symbol substitution to eliminate -equality formulas, provided there is no critical formula belonging to any critical -term of those -equality formulas. During the substitution process, it happens to have non-proofs because formulas of the form are present, but eventually these formulas will be of the form , an instance of identity formula of a function symbol.
Theorem 7.8** (Extended first epsilon theorem for ).**
Assume for and . There is a proof such that for some and .
Proof.
We make a sequence of proofs for in the following way. Let be a result of applying Lemma 7.6 to . If is not empty, let be a proof obtained by applying Lemma 7.3 and then Lemma 7.6 to . For each , . Since , is an -proof. Remaining occurrences of -terms may be replaced by free variables. ∎
7.2 Complexity analysis
We make a detailed analysis on the proofs of the previous subsection, in order to compute the numerical bound of the length of the disjunction in Theorem 7.8. To do so, we consider the property degree as a means of measuring the complexity of a critical formula. Given a critical formula , we can determine the formula . The property degree is to count the number of -terms with a free variable occurrence of . If is of the form , this number tells us at most how many -equality formulas are needed to prove the identity formula . We define the property degree and the maximal property degree for an -term as follows.
Definition 7.9** (Property degree).**
For an -term , the property degree is defined to be \max\{\mathsf{deg}(t)\mid\text{te}\}. The maximal property degree of a proof of rank is defined to be
[TABLE]
Also is defined to be
[TABLE]
We give the following results concerning the property degree.
Lemma 7.10**.**
For any and -matrix , . For a given proof , the set of critical -matrices which belong to critical formulas does not increase through the -elimination.
Proof.
The first part is trivial. Concerning the second part, we consider just Lemma 6.8, because a new critical formula is introduced only if a critical term belongs to both critical formulas and -equality formulas at the same time. The critical formulas introduced in the Lemma belongs to an -term of an -matrix in the original proof, hence the claim holds. The third part is trivial since the epsilon elimination method does not add a new -matrix. ∎
Applying the epsilon elimination method, the maximal property degree and the maximal arity are weakly decreasing. Therefore, we can compute them at the very beginning and keep referring to them as the upper bounds of the property degrees and the arities through the whole elimination procedure.
Lemma 7.11**.**
The epsilon elimination method does not increase the maximal property degree nor the maximal arity.
Proof.
Notice that the method does not introduce any new -matrix. ∎
As a consequence, the upper bounds of the critical counts in Lemma 6.7 and Lemma 6.8 depend only on an initial proof.
Lemma 7.12** (Elimination for critical formulas).**
Let be a formula in , terms of , and an -proof of where its maximal -term is belonging only to a critical formula. There are terms such that each of them is in and an -proof of for . Moreover, and hold for .
Proof.
Straightforward from Lemma 21 by Moser and Zach [MZ06]. ∎
Lemma 7.13** (Elimination for -equality formulas).**
Let be a formula in , terms of , and an -proof of where its maximal -term is belonging only to -equality formulas. There are terms such that each of them is in and an -proof of for . Moreover, and hold for and .
Proof.
Assume -equality formulas belonging to are for . There is a proof of satisfying and . On the other hand there are proofs of for with and . In order to get we replace all in by , and give a proof of . It is trivial if , and otherwise we apply Lemma 6.9. We obtain , combining and , which satisfies and . ∎
The following lemma concerns the case that the maximal critical -term is belonged to by both critical and -equality formulas at the same time.
Lemma 7.14** (Eliminating a maximal critical -term).**
We deal with the case the maximal critical -term is belonged to by both critical and -equality formulas at the same time. Assume is an -proof, is the maximal -term of , and is the -matrix of . Let be the arity of , the property degree of , and the rank of . The critical count and the maximal width of at obtained in Lemma 7.1 are bounded as follows: and .
Proof.
Assume is of the form . The proof of is due to of and of for . By removing all critical and -equality formulas belonging to using ex falso quodlibet we obtain , hence . On the other hand, each critical formula belonging to in is gone due to -elimination, . Note that each -equality formula belonging to is removed due to the premise . Therefore,
[TABLE]
On the other hand concerning the width, the following condition holds.
[TABLE]
The proof of is due to -elimination, replacing by . All -equality formulas belonging to is gone, each critical formula belonging to is replaced by a critical formula belonging to , and there are additional -equality formulas due to Lemma 6.8. Therefore,
[TABLE]
and
[TABLE]
Considering the construction of ,
[TABLE]
and
[TABLE]
Note that , , and hold. We conclude the claimed bounds due to Lemma 7.12 and Lemma 7.13. ∎
The notation of hyperexponentiation is useful to represent large numerals. We define the hyperexponentiation and give some arithmetics on the exponentiation and the hyperexponentiation.
Definition 7.15** (Hyperexponentiation).**
For natural numbers , is defined to be
[TABLE]
Lemma 7.16**.**
For natural numbers , the following formulas hold.
[TABLE]
Lemma 7.17** (Eliminating critical -terms of maximal rank).**
Assume is an -proof of in Lemma 7.3. Let , , , , and be , , , the maximal arity , and the maximal property degree , respectively. The critical count of and the length of disjunction of obtained in Lemma 7.3, such that , are bounded as follows: and for .
Proof.
Consider a list of proofs where , , and each is obtained by means of Lemma 7.14 from , eliminating a critical -term of rank . We prove by induction on that . In case , and . In case ,
[TABLE]
Also, , therefore, holds. Let . We also prove by induction on that
[TABLE]
hence . Since and , holds. The bound of the length of disjunction is because of the following calculation, hence .
[TABLE]
∎
Theorem 7.18** (Extended first epsilon theorem).**
Assume is an -proof of , an -formula where terms other than are -free. There is an -proof of the -formula for some -free terms such that , where is the critical count , is the maximal arity , and is the maximal property degree .
Proof.
We make a sequence of proofs where is obtained by applying Lemma 7.6 to , is , and is obtained by applying Lemma 7.17 and Lemma 7.6 in this order to . We prove that by induction on . It is trivial in case . In case ,
[TABLE]
We calculate the length .
[TABLE]
Since , . ∎
7.3 Alternative -equality formula and closure
We study , the system with and the following -equality formula,
[TABLE]
where is ; obtained by replacing the -th element of by . We say that is the position of the -equality formula, and call the above formula the -equality formula of position . Assuming is an -matrix, Hilbert and Bernays employed the following -equality formula
[TABLE]
and presented the -elimination method [HB39]. The formula (9) in explicitly expresses the notion of position. An upper bound of the Herbrand complexity for can be independent from the maximal arity of critical -matrices in the given proof, in contrast to the case for . The following lemma tells us that is as strong as .
Lemma 7.19**.**
* if and only if .*
Proof.
Assume is an -proof of A. We prove in by induction on the number of differences between and . If there is one difference, say and , the formula is an instance of the alternative -equality formula . Assume is an -proof of the formula where there are differences between and . For any , is an instance of the alternative -equality formula, hence by the transitivity of equality and the deduction theorem, . The other direction is trivial. ∎
We define the closure of the -matrix in the proof , which is to enumerate all the critical -terms of which may occur during the -elimination process.
Definition 7.20** (Closure).**
Let be an -proof and the maximal -matrix of . We define a set of terms which occur at the premise of the -equality formulas of the position in , and a strict order such that for , iff . We define a partially ordered set to be with the transitive closure of the strict order such that for any position , iff .
It is easy to see that a closure is a lattice.
Definition 7.21** (Strict partial order on closures).**
Let be sublattices of . Define a strict partial order as follows: if and only if the upper bounds of is a proper subset of the upper bounds of .
Due to the above strict partial order, we can choose terms among maximal -terms such that they are also maximal due to .
Definition 7.22** (-maximal critical -term).**
Assume is a proof and is a critical -term of -matrix in . If is maximal and holds for all the other critical -term of in , is a -maximal critical -term n .
In the rest of this section, we simply say maximality to mean the -maximality. We introduce an order on proofs.
Definition 7.23** (Strict partial order on proofs due to a closure).**
Let and be proofs. We define if and only if either or and for all of the rank , where and are given by the sets of critical -terms of in and , respectively.
Instead of Lemma 6.7, we use the following identity lemma in our current setting.
Lemma 7.24**.**
Assume has exactly one occurrence of each and for each , if it is a subterm of some term , holds. For any term there exists and , such that and . Moreover, concerning the -equality formulas used in , if belongs to and , and are proper subterms of and , and occur just once in and , respectively, and .
Proof.
Trivial due to the proof of Lemma 6.7. ∎
If no -equality formula belongs to the maximal critical -term to eliminate, we apply Lemma 7.12. Otherwise the following lemma applies.
Lemma 7.25** (Eliminating a maximal critical -term involving -equality).**
Let be an -proof of where is an -free formula in and is a list of -terms. Assume the maximal critical -term of is and in there are critical formulas including ones of the form for as well as -equality formulas as follows.
[TABLE]
There is a proof of for some where there is no critical occurrence of in , , , , and for any critical -matrix of rank in , where , and .
Proof.
We give proofs of for and also a proof of , such that they are all free from the critical -term . For the former ones, we apply the substitution throughout the proof . Then, the critical formulas in belonging to goes to , which is provable by the following three steps
[TABLE]
where the first and the third formulas by Lemma 7.24, and the second one is a critical formula. On the other hand, the -th -equality formulas in belonging to goes to
[TABLE]
which is provable as follows. If the positions of -th and -th -equality formulas are the same, namely, , the assumption and the premise implies , hence the above formula is proven by means of an -equality formula . Otherwise, we prove the above formula by using the two -equality formulas.
[TABLE]
In any case, the critical -term is gone. In the second case, although the critical -term may be new in the sense that it has no critical occurrence in , the obtained proof is strictly smaller than the with respect to in . On the other hand for the proof , these -equality formulas are provable due to ex falso quodlibet, hence we use Lemma 7.12 to get rid of the critical formulas.
Finally, we discuss the bounds. After the elimination process, each may have an increased number for , which is bounded by . Due to Lemma 7.12, . Therefore, . The critical counts for are bounded as and , hence . ∎
In case we deal with -equality formulas with different positions, we may introduce a critical -term which is not a critical -term in the original proof . The closure covers all the potential new critical -terms beforehand. Also note that the closure won’t be expanded through the -elimination procedure, because the maximal critical -term never matches terms in the premise of -equality formulas, hence we don’t get anything new for .
By Lemma 7.25, we get a proof without using a maximal -term in the corresponding closure, hence the size of the closure is an upper bound for the number of applications of the lemma needed to eliminate the -matrix. The size of the closure is bounded by , where is the set of positions of -equality formulas belonging to in . In the following proof the upper bound of this size is estimated only by the number of -equality formulas.
Lemma 7.26**.**
For any proof and -matrix in , .
Proof.
The number of different positions of -equality formulas belonging to is at most , and holds for each , hence is bounded by which is smaller than . ∎
Repeatedly eliminating a maximal critical -term, the rank is diminished.
Lemma 7.27** (Eliminating critical -terms of the maximal rank).**
Let be an -proof of where is an -free formula in and is a list of -terms. There is an -proof of for some quantifier free terms and bounded by such that and , where is .
Proof.
We repeatedly apply Lemma 7.25 in order to eliminate maximal critical -terms. As Lemma 7.25 eliminates an -term, which is a least upper bound of a closure, without expanding the closures of rank , this process is terminating. Let be the critical -matrices of rank in , then the number of applications of the lemma is bounded by the number of critical formulas of rank plus the number of elements in the closures , namely, by . Let be . Solving the recurrence relations and , we find , hence is a bound for the length of the Herbrand disjunction. Also solving the recurrence relations and , , hence by excluding a trivial case . ∎
Theorem 7.28** (Extended first epsilon theorem for ).**
Assume for and , then for some , where is bounded by for .
Proof.
As long as the maximal rank of critical formulas in is smaller than , we repeatedly apply Lemma 7.5. Our proof proceeds by induction on the number of different ranks of critical formulas in , namely, size of . In each step we use Lemma 7.25, and then Lemma 7.5 in the same way as mentioned above. Nullary constants can substitute -terms which still remained. The number of applications of Lemma 7.27 is which is bounded by . Solving the recurrence relations and , we find . The length of the Herbrand disjunction is bounded by , taking for the bound of , because . ∎
The property degree is independent from the critical count, and the use of in the upper bound of the length of Herbrand disjunction is an explanation of potential complexity due to the presence of the -equality formula.
8 Lower Bounds on Herbrand Disjunctions
In this section, we adapt the result by Statman to give the lower bounds on Herbrand disjunctions [Sta79]. The case without equality was studied by Moser and Zach, where Orevkov’s result was used to give the lower bound of Herbrand complexity [Ore82, MZ06]. We first define a combinatory logic, which allows to define a term whose normal form is hyperexponentially long. Then we can describe a sequence of formulas such as the critical counts of their proofs shows a linear growth in . Finally, we show that the Herbrand complexity of those formulas are hyperexponential.
Definition 8.1**.**
Let be a left associative binary function symbol. The combinators , , , and are defined as nullary constants satisfying
[TABLE]
Let denote the set of the above semi-formulas and the set of universally quantified closed formulas of .
From now, we omit and write terms in a common way, e.g. and etc.
Definition 8.2**.**
Define and , .
For a finite set of formulas , let denote .
Lemma 8.3**.**
There is a -proof such that .
Proof.
The proof involves the equality axioms, the combinatory logic axioms, and the quantifier axiom, so that the following equations are proven.
[TABLE]
Easy to find a whose critical count is 12, which is constant. ∎
Definition 8.4**.**
Let and be nullary constants. For any we define comprehension terms and a formula .
[TABLE]
Lemma 8.5**.**
In , implies for any terms of arbitrary length and for any .
Proof.
By induction on . In the base case, we assume that there exists a proof of , that is, , and prove , namely, . We apply Lemma 8.3 once and equality axioms to make a proof. In the step case, we prove that if then for any of any length. Assume and , and we prove , which unfolds into . Further assume , then the induction hypothesis, implies for any of any length, is applicable with terms , hence it suffices to show which is trivial due to the assumptions , which unfolds into , and . ∎
Lemma 8.6**.**
In , for any .
Proof.
In case , holds trivially. Assume . Unfolding .
[TABLE]
Due to its premise, follows. Lemma 8.5 implies . ∎
Lemma 8.7**.**
There is a -proof such that for any , such that is linear in .
Proof.
We prove that in , for any by induction on . The base case, for any , is proven by Lemma 8.6. In order to prove the step case, for any , it suffices to assume and to prove both and . They are straightforward by induction hypothesis and by Lemma 8.6, respectively. ∎
Assuming and , there is a linear proof of in .
Theorem 8.8**.**
There is a -proof such that and is linear in .
Proof.
Assume and . By Lemma 8.7, there is a proof of with being linear in . The formula unfolds into , hence . ∎
For a set of semi formulas, let denote a set of closed formulas obtained by instantiating each formula in . We use the following lemma, whose proof is found in Statman [Sta79], to prove the main theorem.
Lemma 8.9** (Statman [Sta79]).**
Suppose that is a finite subset of such that ; then there is a finite subset of such that , , and each term occurring in is closed and in normal form.
Theorem 8.10** (Statman [Sta79]).**
Suppose is a finite subset of such that and each term occurring in is in normal form; then .
Proof.
Suppose to the contrary that the size of is less than . Assume for some and . Because the number of possible different instantiations is at most , for some with , for any , , neither nor is same as . Let be new constants. We define to be extended with infinitely many equations, so that the following reduction rules for closed of normal form without in function position are available:
[TABLE]
We prove that . The normal form of formula is , which is not provable in because of the Church-Rosser property, cf. Theorem 3 in [Hin74]. Notice that is the normal form of .
Finally, we prove that for each . If does not contain in function position, we consider the following three cases. If is not of the form for any , , and if is and , , otherwise and then . If contains in function position, has a normal form containing [math] or and without in function position, so under .
As extends , implies . ∎
This theorem implies the following corollary which tells us that the length of Herbrand disjunction of the formula in Theorem 8.8 is hyperexponential. For a set of semi formulas , let denote a conjunction .
Corollary 8.11**.**
.
Proof.
Let be . Assume , then there exists and a Herbrand disjunction , namely, , where -free terms are in normal form. Due to Theorem 8.10, the number of instances of is at least , hence . ∎
The analysis in this section tells us that the lower bound of the Herbrand complexity is hyperexponential. Further technical investigations will lead to a better lower bound, so that the gap between the upper and lower bounds gets smaller, and also the unneccesity of the property degree may be clarified.
9 Conclusion
We studied the Herbrand complexity for epsilon calculus with the -equality axiom. For Herbrand’s theorem, which is on the prenex existential formula, the Herbrand complexity is same as the case without the -equality formula, because the embedding result does not rely on the -equality formula. The formulation of the -equality formula has to be restricted through the notion of -matrices, since otherwise as Yukami’s trick explains, we would fail to find the complexity bound. Our proof of first epsilon theorem is simpler than the original one by Bernays, as our formulation of the -equality formula due to the vector notation allows us to get rid of the notion of closures from the proof. Using this formulation, we computed the upper bound of the Herbrand complexity for the first extended epsilon theorem with the -equality axiom. While the hyperexponential part of the result, namely, the height of the tower, is the same as the case without the -equality axiom, the exponential part is quadratic with the additional parameters, the property degree and the maximal arity, rather than the linear in the case without the -equality axiom. Employing the original formulation of the -equality axiom by Bernays and Hilbert, the parameter for the maximal arity can be got rid of, although the height of the hyperexponential tower grows faster than the original result. We also gave a lower bounds analysis which tells us that it has to be at least hyperexponential, namely, non-elementary.
Future work
There are open problems for future work. The notion of property degree was introduced to compute the upper bounds of the complexity in Section 7.2. On the other hand our lower bound analysis does not count the property degree, hence it is still not clarified whether the property degree is necessary for complexity analyses or not. On the other hand, it is important to explore a better proof representation suitable for epsilon calculus, because a syntactic complication of -terms is a real practical obstacle to study epsilon calculus. Some modern formalizations of epsilon calculus are known, for example sequent calculus with function variables by Baaz, Leitsch, and Lolic [BLL18] and another formulation based on Miller’s expansion tree by Aschieri, Hetzl, and Weller [AHW18].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[AB 16] Juan P. Aguilera and Matthias Baaz. Unsound inferences make proofs shorter. Co RR , abs/1608.07703, 2016.
- 2[Ack 25] W. Ackermann. Begründung des Tertium non datur mittels der Hilbertschen Theorie der Widerspruchsfreiheit. Mathematische Annalen , 93:1–36, 1925.
- 3[Ack 40] W. Ackermann. Zur Widerspruchsfreiheit der Zahlentheorie. Mathematische Annalen , 117:162–194, 1940.
- 4[AHW 18] Federico Aschieri, Stefan Hetzl, and Daniel Weller. Expansion trees with cut. The Oberwolfach Preprints, 2018. Preprint on webpage at https://publications.mfo.de/bitstream/handle/mfo/1333/OWP 2018_01.pdf .
- 5[Ara 03] T. Arai. Epsilon substitution method for I D 1 ( Π 1 0 ∨ Σ 1 0 ) 𝐼 subscript 𝐷 1 subscript superscript Π 0 1 subscript superscript Σ 0 1 {I}{D}_{1}({\Pi}^{0}_{1}\lor{\Sigma}^{0}_{1}) . Annals of Pure and Applied Logic , 121:163–208, 2003.
- 6[Ara 05] T. Arai. Ideas in the epsilon substitution method for Π 1 0 subscript superscript Π 0 1 {\Pi}^{0}_{1} -FIX. Annals of Pure and Applied Logic , 136:3–21, 2005.
- 7[Avi 02] J. Avigad. Update procedures and the 1 1 1 -consistency of arithmetic. Mathematical Logic Quarterly , 48:3–13, 2002.
- 8[Bel 93a] J. L. Bell. Hilbert’s epsilon-operator and classical logic. Journal of Philosophical Logic , 22:1–18, 1993.
