The finitary content of sunny nonexpansive retractions
Ulrich Kohlenbach, Andrei Sipos

TL;DR
This paper employs proof mining techniques to extract explicit uniform rates of metastability for the convergence of approximants to fixed points of pseudocontractive mappings in certain Banach spaces, extending classical results.
Contribution
It introduces a novel proof mining approach to derive explicit convergence rates in Banach spaces with specific geometric properties, utilizing the existence of a modulus of uniqueness.
Findings
Derived a uniform rate of metastability for fixed point approximations
Extended proof mining techniques to Banach spaces with uniform convexity and smoothness
Produced explicit bounds interpretable in higher-type systems
Abstract
We use techniques of proof mining to extract a uniform rate of metastability (in the sense of Tao) for the strong convergence of approximants to fixed points of uniformly continuous pseudocontractive mappings in Banach spaces which are uniformly convex and uniformly smooth, i.e. a slightly restricted form of the classical result of Reich. This is made possible by the existence of a modulus of uniqueness specific to uniformly convex Banach spaces and by the arithmetization of the use of the limit superior. The metastable convergence can thus be proved in a system which has the same provably total functions as first-order arithmetic and therefore one may interpret the resulting proof in G\"odel's system of higher-type functionals. The witness so obtained is then majorized (in the sense of Howard) in order to produce the final bound, which is shown to be definable in the subsystem…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
The finitary content of sunny nonexpansive retractions
Ulrich Kohlenbacha and Andrei Sipoşa,b
aDepartment of Mathematics, Technische Universität Darmstadt,
Schlossgartenstrasse 7, 64289 Darmstadt, Germany
bSimion Stoilow Institute of Mathematics of the Romanian Academy,
Calea Griviţei 21, 010702 Bucharest, Romania
E-mails: {kohlenbach,sipos}@mathematik.tu-darmstadt.de
Abstract
We use techniques of proof mining to extract a uniform rate of metastability (in the sense of Tao) for the strong convergence of approximants to fixed points of uniformly continuous pseudocontractive mappings in Banach spaces which are uniformly convex and uniformly smooth, i.e. a slightly restricted form of the classical result of Reich. This is made possible by the existence of a modulus of uniqueness specific to uniformly convex Banach spaces and by the arithmetization of the use of the limit superior. The metastable convergence can thus be proved in a system which has the same provably total functions as first-order arithmetic and therefore one may interpret the resulting proof in Gödel’s system of higher-type functionals. The witness so obtained is then majorized (in the sense of Howard) in order to produce the final bound, which is shown to be definable in the subsystem . This piece of information is further used to obtain rates of metastability to results which were previously only analyzed from the point of view of proof mining in the context of Hilbert spaces, i.e. the convergence of the iterative schemas of Halpern and Bruck.
Mathematics Subject Classification 2010: 47H06, 47H09, 47H10, 03F10.
Keywords: Proof mining, sunny nonexpansive retractions, metastability, resolvents, pseudocontractions, functional interpretation, Halpern iteration, Bruck iteration, uniformly convex Banach spaces, uniformly smooth Banach spaces.
1 Introduction
Let be a real Banach space, be a nonempty bounded closed convex subset and be a nonexpansive mapping. For and let be the unique fixed point of the strict contraction
[TABLE]
In 1967, Browder [9] and Halpern [33] independently proved in the case where is a Hilbert space that for the path strongly converges and its limit is the fixed point of which is closest to i.e. where is the metric projection onto Both proofs for the strong convergence do not readily generalize even to the class of spaces (other than ).
That the strong convergence does hold in this case was finally shown in 1980, when Reich established in the celebrated paper [60] that it actually holds in any uniformly smooth space. Moreover, Reich showed that the limit is where is the unique sunny nonexpansive retraction . This result has subsequently been extended in many ways including the context of families of operators [1, 2, 3].
The significance of Reich’s theorem is twofold:
- •
It provides for the first time an algorithmic approach to the construction of sunny nonexpansive retractions. This aspect is highlighted e.g. in [21, 1].
- •
Many important iterative algorithms in nonlinear analysis are shown to be strongly convergent by proving that they asymptotically approach (for some suitable sequence converging to ).
We start discussing the first item in more detail. Nonexpansive retractions were first considered by Bruck in [11], who showed – using Zorn’s lemma – that is a nonexpansive retract of whenever is a real reflexive strictly convex Banach space. This result was generalized further in [12], in particular, to reflexive Banach spaces which have the conditional fixed point property for nonexpansive mappings which e.g. includes all uniformly smooth spaces. Since metric projections onto closed convex subsets are nonexpansive only in Hilbert spaces, nonexpansive retractions are, already for spaces (again, other than ), very different from metric projections and may not even exist although the metric projection does. For example, Bruck showed in [14] that no real Banach space with has a bounded smooth subset with nonempty interior which is the range of a nonexpansive retraction unless is a Hilbert space.
Retractions are called sunny if the property
[TABLE]
holds. In smooth Banach spaces, for a retraction to be nonexpansive and sunny it is necessary and sufficient for the variational inequality (where denotes the single-valued normalized duality map)
[TABLE]
to hold, which in Hilbert spaces characterizes the metric projection. Thus, the relevance of sunny nonexpansive retractions is that they are in many respects the right substitute for the metric projection outside Hilbert spaces. From this characterization it follows that there is at most one sunny nonexpansive retraction in smooth spaces (in [13], Bruck used the term ‘nonexpansive projection’ instead of the nowadays common name ‘sunny nonexpansive retraction’). If is even uniformly smooth and strictly convex and is the fixed point set of a nonexpansive mapping then the unique sunny nonexpansive retraction necessarily exists [13]. Bruck’s proof is, however, highly nonconstructive. Reich’s theorem establishes that the sunny nonexpansive retraction can be obtained as the limit of objects which are constructively available (since Banach’s fixed point theorem is constructive). Our logical analysis of the proofs due to Morales of Reich’s theorem implies that the pointwise existence of sunny nonexpansive retracts can be carried out in a logically fairly weak formal system (see Remark 6.6) which is of foundational interest.
As stated in the second item above, the great relevance of Reich’s theorem for algorithmic purposes can also be seen from the fact that it implies the strong convergence of important iterative algorithms: in [33], the so-called Halpern iteration (starting from some and using as anchor)
[TABLE]
is considered for and – in Hilbert spaces – shown to converge to for the metric projection under very restrictive conditions on In a milestone paper [67], Wittmann generalized this to much more general sequences including for the first time the case If is linear and , then coincides (for this choice of ) with the ergodic average and so Wittmann’s theorem is a nonlinear generalization of the classical von Neumann mean ergodic theorem, while remaining strongly convergent (without linearity, the usual ergodic averages are known to converge only weakly by results due to [6] and [26]). In [62], Wittmann’s theorem is generalized to uniformly smooth Banach spaces by reducing the strong convergence of to that of and then applying Reich’s theorem (in fact, [62] considers a somewhat larger class of spaces). For Halpern’s more restrictive sequences this had already been shown in [60].
Reich [60] established his theorem not only for nonexpansive mappings but even for set-valued accretive operators satisfying the range condition which, in particular, covers the important class of continuous pseudocontractions, introduced by Browder [8], which extend the class of nonexpansive mappings and which play a crucial role in the abstract formulation of Cauchy problems. For pseudocontractions one can no longer use the Halpern iterative schema but has to apply a more complicated schema due to Bruck [15]
[TABLE]
for suitable sequences , in In [18], it is shown that for Lipschitzian pseudocontractions (a class which still strictly generalizes the class of nonexpansive mappings and which contains the class of strict pseudocontractions due to [10]) the strong convergence of the Bruck iteration schema can be shown using the strong convergence of i.e. again by reduction to Reich’s theorem.
Furthermore, recently, in [4], a Halpern-type variant of the famous proximal point algorithm was shown to strongly converge by a similar reduction.
These and many other results point to the paramount significance of this result of Reich. In this paper, we give for the first time a quantitative account of it. From results of Neumann [56] on the Halpern iteration and the aforementioned connection with the convergence of (which was treated quantitatively in [46]), it follows that even for the case of Hilbert spaces, in fact, already for and there are simple computable mappings for which with the anchor point does not have a computable rate of convergence. In this situation, the next best thing one can hope for is an effective so-called rate of metastability – in the sense of Terence Tao [65, 66], the name having been suggested to him by Jennifer Chayes – i.e. a function such that
[TABLE]
where whose complexity reflects the computational content of the original convergence proof from which it is extractable by proof-theoretic methods (see [40]). Note that provides a quantitative form of
[TABLE]
which, noneffectively, is equivalent to the ordinary Cauchy property of In proof theory, the metastable version of the original Cauchy statement is known as the Kreisel no-counterexample interpretation [50, 51]. General so-called logical metatheorems due to [39, 27, 40, 23, 46, 31] guarantee the extractability of explicit effective bounds, in particular of rates of metastability, for large classes of proofs and provide algorithms for their actual extraction from a given proof based on modern variants and extensions of Gödel’s [30] famous functional (‘Dialectica’) interpretation. Moreover, these bounds only depend on , and via ‘majorizing’ data (such as moduli of smoothness on or of uniform continuity of and norm bounds on the elements of ). These developments are all part of the research program of ‘proof mining’, that aims to apply these logical techniques to proofs in a broad range of areas of mainstream mathematics, such as nonlinear analysis, convex optimization, commutative algebra, ergodic theory or topological dynamics; the standard introduction to the field is [40], while more recent surveys are [42, 43].
For the Hilbert space case of the problem at hand, such ’s of low primitive recursive complexity have already been extracted both for the Browder-Halpern theorem and for Wittmann’s theorem in [41], and an alternative way of using proof mining to derive these and related results was recently explored in [25].
However, a quantitative analysis of Reich’s generalization to Banach spaces had been a major challenge of the ‘proof mining’ paradigm for about ten years. The present paper, which for the first time succeeds in achieving such an analysis, is the technically most complex extraction of a metastability bound for a strong convergence theorem in analysis which has ever been carried out. The enormous complexity of the final bound reflects the profound combinatorial and computational content of Reich’s deep theorem.
More specifically, in the present paper, we extract for the first time a rate of metastability for the convergence of for uniformly continuous pseudocontractions within the class of Banach spaces which are uniformly smooth and uniformly convex (which covers all spaces for ). Using quantitative results extracted already in [46], this also gives the first explicit rate of metastability for the extension (due to [62]) of Wittmann’s theorem to this class of spaces as well as, using quantitative results from [49], the first rate of metastability for Bruck’s iteration for this class. All previous results only considered the class of Hilbert spaces (or geodesic generalizations of Hilbert spaces such as CAT(0) spaces [45] or CAT() spaces for [52]). As predicted by general logical metatheorems from [40, 46], the rate of metastability (in the case where ) only depends (in addition to and ) on a norm bound on the elements in , on moduli , of uniform smoothness and convexity, respectively, of and on a modulus of uniform continuity of .
Our extraction of analyzes the proof of Reich’s theorem given in 1990 by Morales [55]. This proof uses that the continuous convex function
[TABLE]
where is a sequence in which converges to attains its infimum on the closed convex bounded set since is reflexive, being uniformly smooth. (Reich’s original proof [60] produced the operator as the limit of a subsequence, which was shown to be well-defined in [61]; later developments of the idea, even to this day, generally use a simplification of this by applying a Banach limit to the original sequence – see, e.g., [17, 16, 64]; to our knowledge, the above definition – lifted from the theory of asymptotic centers [22] – was first used by Morales in this context and afterwards picked up by few other authors.) The proof then continues by forming the set of all points on which attains its infimum, showing that this set is invariant under the action of (the resolvent of) and thus (the resolvent, and therefore) has a fixed point in this set. (The detour via the resolvent is not needed for nonexpansive mappings.) In the deductive framework to which the known proof-theoretic bound extraction methods apply, it is not clear how to define as an object given as we do not have a term which assigns to a bounded sequence of reals its (in technical terms this is due to the fact the functional interpretation of having such a term has no solution by majorizable functionals; only if would be assumed as separable – which we have to avoid, however, for general reasons discussed in [40] as this prevents the extraction of uniform bounds – then using the continuity of it would be enough to define on a dense sequence and this could be done in our setting). So we aim to replace the use of as an object by
[TABLE]
where ‘’ is logically complex, namely it is a so-called statement.
This makes it difficult to formalize the above arguments in a setting which only allows one to use That is why we add the additional assumption that is a uniformly convex Banach space which yields that is a uniformly convex function. This is usually used to prove that asymptotic centers are unique in this class of spaces, and we show that one can construct (by way of Proposition 2.4) a modulus of uniqueness for the infimum problem stating that – given – there is a such that -approximate infima points are -close to each other (for more details, see e.g. its use in Claims 2 and 3 of the proof in Section 3). It is then sufficient to consider only -infima points instead of actual infima points. The resulting proof can then be shown to be formalizable with the use of arithmetical comprehension which already guarantees the extractability of a rate of metastability which is definable in the calculus where is the system of the Hilbert-Gödel [34, 30] primitive recursive functionals of finite type and is the schema of Spector’s [63] bar recursion (of lowest type). We then show that the use of real limsup’s can be replaced – using a process of arithmetization, see [36] and Remark 4.1 – by that of -limsup’s whose existence can be shown using just induction (more precisely, using -induction, to which it is equivalent and which – by Parsons [58] – has a solution in the fragment of ).
Since the existence of -infima of also requires only induction, it follows from this that one gets a rate of metastability which is primitive recursive in the extended sense of The analysis of the -infima argument shows that suffices. When the details of the extraction are all carried out, it turns out that for the particular instances of that argument used, actually suffices, which, therefore, is the complexity of our final bound. The statement with this explicit bound provides a finitary version (in the sense of Tao) of the theorem that converges to the sunny nonexpansive retraction of (and so, in particular, also of the existence of itself) since the latter can be derived from by an elementary proof. In particular, it follows that only a single instance of -comprehension is needed (or, as seen from the viewpoint of constructive mathematics and in the presence of -AC0,0 choice for numbers, only the law of excluded middle) to derive the theorem. We believe that our analysis exhibits the explicit numerical content of the existence proof for Only future research will show whether the complexity class is the best possible or whether an ordinary primitive recursive rate can be achieved (or even whether a close examination of our bound might show that it can already be defined in , see Remark 6.3).
The next section introduces the preliminary notions used to discuss and prove our result, namely on uniformly convex and uniformly smooth spaces, and on nonexpansive retractions and pseudocontractions. Highlights include the modulus of convexity for the squared norm of a uniformly convex space – which has as an immediate consequence the uniform convexity of the function discussed above – as well as the introduction of the resolvent of a continuous pseudocontraction that allows one to use nonexpansiveness arguments as needed. Section 3 details the way to an intermediate proof of the main result where the use of as an operator has been eliminated and only -infima of it are needed, which are made useful by means of the modulus of uniqueness. In Section 4 even this use of in the form of pointwise limsup’s is removed, as they are replaced with approximate limsup’s. Some care must be taken to ensure that approximate limsup’s may be shown to exist using just induction (Proposition 4.3) and that they are useful for our purposes (Lemma 4.5). Finally, in Section 5, the higher-order portions of the witness extraction are carried out, yielding a highly complex, though structured, realizer. In Section 6 this realizer is progressively majorized in order to obtain our goal, namely a rate of metastability. It is argued there both that this final bound is expressible in and that the metastability statement is a true finitization (again in the sense of T. Tao) of the full form of the original strong convergence statement. Playing the role of an epilogue, Section 7 presents two completions by means of our result of proof mining analyses which had only been carried partially (in the sense that a rate of metastability was produced assuming such a rate for be given which so far was known only in the Hilbert space case), namely the strong convergence of the iterations of B. Halpern and R. E. Bruck.
2 Preliminaries
2.1 Classes of Banach spaces
2.1.1 Uniformly convex spaces
Definition 2.1** (cf. [19, 20]).**
Let be a Banach space. We call the function , defined, for all , by:
[TABLE]
“the” modulus of convexity of .
The following result shows that this modulus can be obtained in a less strict way.
Proposition 2.2** ([53, p. 60]).**
Let be a Banach space. Then, for all ,
[TABLE]
Corollary 2.3**.**
Let be a Banach space. TFAE:
- (a)
for all , . 2. (b)
there is an (called “a” modulus of convexity) such that for all and all with , and one has that
[TABLE]
(One can, obviously, for the implication “(a) (b)”, put, for all , .) In this case, is called uniformly convex.
The following is an application of a recent proof mining result of Bačák and the first author, specifically [5, Proposition 3.2], itself a quantitative version of a theorem of Zălinescu [70, Theorem 4.1]. We remark that a similar kind of result (i.e. with a different modulus) may be obtained by adapting an argument from [69, Section 2] to work with instead of . The non-quantitative version may also be found in the statement of [68, Theorem 2], but the proof given there is highly non-constructive.
Proposition 2.4**.**
Let be a uniformly convex Banach space having as a modulus and let . Put, for all ,
[TABLE]
Then, for all :
- (a)
. 2. (b)
for all with , , , we have that
[TABLE]
Proof.
We may assume that We seek to apply [5, Proposition 3.2]. We need, then, only to pass from to and from to and then to put , , and to be the squaring function. To obtain the conclusion, one has to verify that, for an arbitrary , the squaring function has on the interval the function as a modulus of uniform convexity, as a modulus of uniform continuity and as a modulus of uniform increasingness. ∎
2.1.2 Smooth and uniformly smooth spaces
Definition 2.5**.**
Let be a Banach space. We define the normalized duality mapping of to be the map , defined, for all , by
[TABLE]
A Banach space is called smooth if for any with , we have that for any with , the limit
[TABLE]
exists. This condition has been proven to be equivalent to the fact that the normalized duality mapping of the space, , is single-valued – and we shall denote its unique section by . Therefore, for all , and . Hilbert spaces are smooth, and clearly is then simply , for any in the space. Because of this, we may consider the to be a generalized variant of the inner product, sharing some of its nice properties. We shall generally denote, for all spaces , all and , by . In addition, the homogeneity of – i.e. that for all and , – follows immediately from the definition of the duality mapping.
Remark 2.6**.**
These notions of smoothness were introduced in [20], under the name of flattening.
Lemma 2.7** (cf. [59, Lemma 1]).**
Let be a smooth space and . Then
[TABLE]
Proof.
We have that
[TABLE]
from which the conclusion follows. ∎
Definition 2.8** ([53, Definition 1.e.1.(ii)]).**
Let be a Banach space. We call the function , defined, for all , by
[TABLE]
“the” modulus of smoothness of . We remark that for all , .
The following characterization is immediate.
Proposition 2.9**.**
Let be a Banach space. TFAE:
- (a)
. 2. (b)
there is a (called “a” modulus of smoothness) such that for all and all with , one has that
[TABLE]
In this case, is called uniformly smooth.
Remark 2.10**.**
A uniformly smooth space is smooth, and this condition is equivalent to the limit in (1) being attained uniformly in the pair of variables .
Remark 2.11**.**
Unlike in the case of convexity, “the” modulus of smoothness is not “a” modulus of smoothness.
Proposition 2.12** (cf. [46, Proposition 2.5]).**
Let be a uniformly smooth Banach space with modulus . Put, for all and ,
[TABLE]
Then for all , and all with and , if then .
In the PhD thesis of Bénilan [7, p. 0.5, Proposition 0.3], it is shown that the norm-to-norm uniform continuity on bounded subsets of an arbitrary duality selection mapping is in fact equivalent to uniform smoothness. A more recent proof which uses ideas due to Giles [28] may be found in [48, Appendix A].
2.2 Classes of mappings
In this section, we fix a smooth Banach space and a closed, convex, nonempty subset.
2.2.1 Nonexpansive mappings and sunny nonexpansive retractions
Definition 2.13**.**
A map is called nonexpansive if for all , .
Let be nonempty.
Definition 2.14**.**
A map is called a retraction if for all , .
Definition 2.15**.**
A retraction is called sunny if for all and ,
[TABLE]
Proposition 2.16** ([29, Lemma 1.13.1]).**
Let be a retraction. Then is sunny and nonexpansive if and only if for all and ,
[TABLE]
Proposition 2.17**.**
There is at most one sunny nonexpansive retraction from to .
Proof.
Let and be two such retractions. Let . It follows that
[TABLE]
and
[TABLE]
Using the homogeneity of and then summing up, it follows that and therefore . ∎
2.2.2 Pseudocontractions
Definition 2.18**.**
Let . We call a function a modulus of continuity for if for all and with , we have that .
Remark 2.19**.**
A map has a modulus of continuity iff it is uniformly continuous.
Definition 2.20** ([8, Definition 1]).**
A map is called a pseudocontraction if for all and , we have that
[TABLE]
Proposition 2.21**.**
Any nonexpansive map is a pseudocontraction.
Proof.
Let be nonexpansive. Let and . We have that
[TABLE]
so
[TABLE]
Multiplying by , we obtain our conclusion. ∎
We have the following equivalence.
Proposition 2.22** ([8, Proposition 1]).**
Let . Then is a pseudocontraction if and only if for all ,
[TABLE]
Definition 2.23** (cf. [32, (2.9)]).**
Let . A map is called a -strong pseudocontraction* if for all ,*
[TABLE]
Proposition 2.24**.**
Let be a continuous pseudocontraction, and . Define the map , by putting, for all , . Then is a continuous -strong pseudocontraction.
Proof.
We have that for all , , so for all ,
[TABLE]
from which our conclusion follows. ∎
Proposition 2.25**.**
Let and be a continuous -strong pseudocontraction. Then has a unique fixed point.
Proof.
If and are fixed points of , , so . The existence of a fixed point follows from [54, Proposition 3] and the convexity of . ∎
Definition 2.26**.**
If is a pseudocontraction, we define the map , for all , by .
Proposition 2.27**.**
Let be a continuous pseudocontraction. Then for all there is a unique such that .
Proof.
Let . Define the map , for all , by . Then, by Proposition 2.24, is a continuous -strong pseudocontraction. Since we have that for all , iff , the conclusion follows by applying Proposition 2.25. ∎
Definition 2.28**.**
If is a continuous pseudocontraction, we define the map by putting, for all , to be the unique such that .
Notation 2.29**.**
For any function and for any , put .
Proposition 2.30**.**
Let be a continuous pseudocontraction. Then:
- (i)
for all , ; 2. (ii)
* is nonexpansive;* 3. (iii)
for all , ; 4. (iv)
* and have the same fixed points;* 5. (v)
if is uniformly continuous with modulus , then for all and all , with , we have that .
Proof.
- (i)
Immediately, from the definition of . 2. (ii)
Let and apply (2) for , and to obtain – using (i) – that
[TABLE] 3. (iii)
Let and apply (2) for , and to obtain – again, using (i) – that
[TABLE] 4. (iv)
One direction follows from (iii). For the other, let be such that . Then , so is a fixed point of . 5. (v)
What follows is a quantitative version of the proof of (iv). If , then . Therefore, we have that
[TABLE]
∎
Definition 2.31**.**
If is a continuous pseudocontraction, we define the map by putting if is nonexpansive and otherwise.
The map is defined purely for our convenience, as we could have used regardless of the status of , but we want to emphasize that if is nonexpansive, then the use of is sufficient.
Corollary 2.32**.**
Let be a continuous pseudocontraction. Then:
- (i)
* is nonexpansive;* 2. (ii)
for all , ; 3. (iii)
if is uniformly continuous with modulus , then for all and all , with , we have that ; 4. (iv)
* and have the same fixed points.*
3 The proof using limsup’s but only -infima
The main focus of this paper is the following theorem (here and below ).
Theorem 3.1** (cf. [60]).**
Let be a Banach space which is uniformly convex with modulus and uniformly smooth with modulus . Let a closed, convex, nonempty subset. Let be such that for all , and the diameter of is bounded by . Let be a pseudocontraction that is uniformly continuous with modulus and . For all put to be the unique point in such that (which exists by Propositions 2.24 and 2.25). Then for all such that we have that is Cauchy.
This theorem was first proven by Reich [60] without the assumption of uniform convexity. The starting point of our investigations is the proof given by Morales [55], which we shall now illustrate, after giving a preliminary lemma.
Lemma 3.2**.**
Let , be two bounded sequences of reals. Then
[TABLE]
Proof.
We have that:
[TABLE]
∎
Proof of the theorem. We first show that for all such that , there exist a and , strictly increasing, such that . Put, for all , . Then, for all ,
[TABLE]
so and therefore (by Corollary 2.32.(ii)) . Define now , for all , by . Let be the set of minimizers of .
Claim. There is a .
Proof of claim: Since is convex and continuous, is closed convex bounded nonempty, and is uniformly smooth, hence reflexive, we have that . Let and . Then:
[TABLE]
so . Since is a closed convex bounded nonempty subset of a uniformly smooth space, and it is invariant under the action of the nonexpansive mapping , we have that there is a , so by Corollary 2.32.(iv), .
We only sketch the remainder of the proof, since the details that will actually be used shall be given later. Let and put . Using the continuity of , let be small enough such that for any , . (Note that .) By Lemma 2.7, we have that . Summing up, we get that
[TABLE]
so by Lemma 3.2, . Since was chosen arbitrarily and , we easily get that there is an , strictly increasing, such that . We use that is a pseudocontraction to derive that for any ,
[TABLE]
so that , which we sum up with the previous inequality to get that .
To obtain the convergence of for any (suitable, from now on) sequence , it is clear that we need only to show that there is a such that for any there is an such that . Fix a canonical sequence, say , for any . By the previous argument, we have that there is an and a such that . Now consider a sequence . Again, by the above, there is an and a such that . What remains to be shown is that . Let . Put, for all , . Let be big enough such that and that (again, using the continuity of ) . Using arguments like before, we get that and similarly , so .
It is now clear that the least elementary principles are used in the Claim, where appeal is made to results of Banach space theory as established in a set-theoretic framework. From the point of a quantitative analysis the most difficult argument is the proof of using the reflexivity of This can be avoided as follows. In the light of the conclusion of the theorem, it is immediate that the function has a unique minimizer, namely the limit of , so a viable idea would be to get to this uniqueness in an a priori way. This is where the additional hypothesis of uniform convexity helps us, through Proposition 2.4, which gives a modulus of uniform convexity for the squared norm – and thus also for – that acts as a modulus of minimizer uniqueness for which allows one to show the existence of an actual minimizer as the limit of approximate minimizers, with this modulus providing a rate of convergence. Let us see how these concepts come into play.
Second proof of the Claim. We divide this proof into a series of claims.
Claim 1. For all there is a such that for all :
- •
;
- •
.
Proof of claim: As before, we have that . Suppose that for all there is a such that
[TABLE]
Let and put . Put, then, and recursively for all put such that
[TABLE]
Therefore,
[TABLE]
which is a contradiction. Thus, there is a such that for all
[TABLE]
Now, we have, again as before, that
[TABLE]
so, for all ,
[TABLE]
Claim 2. For all there is a such that:
- •
for all , ;
- •
.
Proof of claim: Take . Apply Claim 1 for and and put to be the resulting . We have only to show that , since from that, using Corollary 2.32.(iii), it follows that . Suppose not. Then, for all ,
[TABLE]
so, for all , by Proposition 2.4,
[TABLE]
Then, applying the defining property of , we get that
[TABLE]
which is a contradiction, since .
Again, we only sketch the remainder of this proof. For any , we fix a such that for all , and . We show that is Cauchy. Let and let , . Assume that . Then, for all , using Proposition 2.4 as before,
[TABLE]
which is a contradiction. It is then immediate that the limit of satisfies our requirements.
The next principles we can now remove from the proof are the ones that allowed us, for example, to pass to the limit in the argument above. What we do is to show that the approximate solutions obtained in Claim 2 are enough for the whole line of argument to go through, by essentially removing any ideal point that would appear in the course of the proof by an approximate one. This is made possible again by the use of Proposition 2.4, asserting that two -infima of , for sufficiently small , must be -close. Also, it is now clear that the resulting proof does no longer use the existence of as a function but only the existence of the individual limsup’s in the form
[TABLE]
As a result of this, the proof may be formalized in a deductive system to which the logical bound extraction theorems, mentioned in the Introduction, apply – which is not clear if would be needed as an object (see also Remark 3.3 below).
Proof of the theorem using only the aforementioned principles. We presuppose the truth of Claim 2 in the previous proof, i.e. that for all such that and for all there is a such that:
- •
for all , ;
- •
.
Thus, we shall start the numbering of claims at 3.
Claim 3. For all such that and for all there is a such that:
- •
for all , ;
- •
for all , .
Proof of claim: Take
Apply Claim 2 for and and put to be the resulting .
We have to show that for all , . Let and put . Apply Claim 2 for and and put to be the resulting , so in particular . We then obtain that
[TABLE]
from which we get that
[TABLE]
Suppose that , so, for all ,
[TABLE]
Then
[TABLE]
which is a contradiction. So , i.e. . From that we obtain
[TABLE]
and
[TABLE]
From (3) and (4) we derive our conclusion.
Claim 4. For all such that and for all there is a such that:
- •
for all , ;
- •
there exists , strictly increasing, such that .
Proof of claim: Put and
Apply Claim 3 for and and put to be the resulting . Denote, for all , . We have that, for all ,
[TABLE]
Put . Since , . By Lemma 2.7, we have that
[TABLE]
Since
[TABLE]
we have that
[TABLE]
and so that
[TABLE]
[TABLE]
Applying Lemma 3.2, we obtain that
[TABLE]
and therefore that there exists , strictly increasing, such that
[TABLE]
so in particular, noting also that ,
[TABLE]
Using (5), we derive , i.e. our conclusion.
Claim 5. For all there is a such that for all with , there exists , strictly increasing, such that
[TABLE]
Proof of claim: Put
[TABLE]
and, for all , .
Apply Claim 4 for and and put to be the resulting . In particular, there is , strictly increasing, such that .
Let now be chosen arbitrarily such that . Apply Claim 4 for and and put to be the resulting . In particular, there is , strictly increasing, such that .
We have that for all ,
[TABLE]
Take a sufficiently large such that
[TABLE]
We have that
[TABLE]
and that
[TABLE]
so
[TABLE]
Therefore
[TABLE]
Similarly, we obtain that
[TABLE]
Summing up, we get that , so . Since , we have
[TABLE]
i.e. our conclusion.
Claim 6. For all there is an such that for all with , we have that
[TABLE]
Proof of claim: Apply Claim 5 for and put to be the resulting . Let now be chosen arbitrarily such that .
Suppose that . Then there is an such that for all there is an such that , so there is an , strictly increasing, such that for all , . By the defining property of , we get that there is a , strictly increasing, such that
[TABLE]
so there is an such that for all ,
[TABLE]
which contradicts the defining property of .
Claim 7. For all with , we have that the sequence is Cauchy.
Proof of claim: Denote, for all , . We want to show that for all there is an such that for all , . Let . By applying Claim 6 for , we obtain an having the property that there is an such that for all ,
[TABLE]
Take . Then
[TABLE]
i.e. our conclusion.
This last claim is exactly our desired statement.
Remark 3.3** (for logicians; we use the terminology from [40]).**
An inspection of the proof of the Cauchy property and hence of the metastability of in this section shows that it can be carried out in the formal system WE-PACAar where WE-PA is defined as in [40, (17.68)] and then augmented by the normalized duality mapping and the modulus of uniform smoothness as in [46]. CAar denotes the schema of arithmetic comprehension which is needed to show the existence of From the logical metatheorems in [40, 46] and Theorems 11.11 and 11.13 in [40] it follows that one can extract a rate of metastability for the Cauchy property of which is definable in Gödel’s calculus of primitive recursive functionals of finite type augmented by Spector’s bar recursion of lowest type. In the next section we will show that even the use of can be avoided.
4 The proof using approximate limsup’s
In this section we, in particular, show that the use of limsup’s can be replaced by that of -limsup’s whose existence can be established by induction (for logicians: -induction). As a result of this, the proof can even be formalized without arithmetic comprehension and so the extractability of a primitive recursive (in the sense of Gödel’s ) rate of metastability is guaranteed (see Remark 3.3). We also exhibit the finitary content of the actual use of approximate limsup’s made in the proof.
Remark 4.1**.**
The process of eliminating ’s by an arithmetical principle in this section is in line with [36, Proposition 5.9] where such an arithmetization of the use of limsup’s to is shown to be possible in a certain restrictive deductive context and by [38, Theorem 6.1] it is optimal. In [40, Section 17.9], the approach is shown to work also within the framework of abstract spaces. In our context, however, we cannot directly apply these results as the limsup’s are used in the presence of e.g. inductions going beyond quantifier-free induction. However, as usual in the context of ordinary proofs, the arithmetization can nevertheless be carried out without problems and we suspect that this could be explained in terms of logical metatheorems by treating the inductions used as implicative assumptions and using that the method behind these arithmetizations works for arbitrary (arithmetical) formulas as long as certain monotonicity conditions are fulfilled (see [37]). Nevertheless, we leave this for future research to clarify.
4.1 The arithmetized version of limits superior
Definition 4.2**.**
Let be a sequence of reals and . A number is called an -approximate limsup* (or simply an -limsup) for if:*
- •
for all there is an such that ;
- •
there is a such that for all , .
What makes approximate limsup’s suitable for proof mining is that they admit an existence proof which uses only -induction.
Proposition 4.3** (-IA).**
For all and for all sequences of reals contained in the interval , there is a with such that is a -limsup of .
Proof.
Let , and be as in the statement.
Claim. There is a with such that it is not the case that for all there is an with implies that for all there is an with .
Proof of claim: Assume towards a contradiction that the opposite holds, i.e. for all natural numbers smaller or equal to , we have that implies , where is the statement that for all there is an such that . Since holds trivially, we have by -induction that . But that states that for all there is an such that , clearly false.
Take as in the Claim. Then and:
- (i)
for all there is an such that , so , 2. (ii)
there is a such that for all , , so ,
i.e. is a -limsup of . ∎
Remark 4.4**.**
One can even show, as mentioned in the Introduction, that this statement is equivalent to -induction. To do that, we tweak the argument used in the proof of [38, Theorem 6.1], whose statement affirms that the existence of limsup’s (without function parameters) implies -induction, to also work with rational approximate limsup’s, i.e. in the form given in Proposition 4.3. The limsup hypothesis is used two times: once in Claims 1-3 and once when it yields -induction as the first stage of a bootstrapping process. The second application does not pose any serious problems, while the first one is a bit more involved, since the statements of Claims 1-3 must be adjusted. Set, for any , . One then requires in Claims 2 and 3 from to be a rational -limsup and a rational -limsup of , respectively, while the new Claim 1 states that for any with and for any rational which is a -limsup of , the following are equivalent:
- (i)
; 2. (ii)
; 3. (iii)
for all there is an with .
The proof then goes through.
It is not sufficient that one can prove the existence of approximate limsup’s, we must also show that they can play the role that is required of them. The following lemma is crucial in this regard, as it proves that one can extract specific sequence ranks that are needed in a later analysis of a proof, whereas the values of the approximate limsup’s may be discarded.
Lemma 4.5**.**
Let . Let , and be sequences of reals and , and be -limsup’s of them, respectively. If and , then for all there is a such that and .
Proof.
By the definition of the approximate limsup, we have that:
- •
there is a such that for all , ;
- •
there is a such that for all , ;
- •
for all there is an such that , and in the following we denote this depending on as .
Let . We set . Then we have that
[TABLE]
and similarly, that . ∎
We will be using the following weaker forms of the above lemma.
Corollary 4.6**.**
Let . Let and be sequences of reals and and be -limsup’s of them, respectively. If , then for all there is a such that .
Corollary 4.7**.**
Let . Let , and be sequences of reals and , and be -limsup’s of them, respectively. If and , then there is a such that and .
4.2 Replacing limsup’s by approximate limsup’s
We consider, in this section, and such that:
- •
for all and all , ;
- •
for all , .
In the case that for all , , we may take, for all , and .
New proof of the theorem. Again, we divide the proof into a series of claims.
Claim I. Let and . Then there is a and a such that is an -limsup of and such that for all and with being an -limsup of , .
Proof of claim: Denote, for all , . Assume towards a contradiction that for all and with being an -limsup of there is a and an such that is an -limsup of and . Let now be arbitrary and be an -limsup of . Put, for all , and be the and the obtained by applying the assumption to and playing the roles of and , respectively. Then, since , and by the assumption, for each , we have that , we get that for all , . If we choose , we obtain that , contradicting the fact that is an -limsup of a sequence of nonnegative reals.
Denote, for all , . We shall prove the Cauchyness of the sequence in its “metastable” formulation, namely: for all and all there is an such that . Let, therefore, and . From this point on, we shall use the following notations (where and ):
[TABLE]
Claim II. There are , , , and , , , , , such that:
- •
,
;
- •
, ,
, ;
- •
;
- •
, .
Proof of claim:
- A.
The construction of and .
We apply Claim I for and . We obtain and such that is an -limsup of and for all and with being an -limsup of we have that .
By the above applied to and an -limsup of , we get after using Corollary 4.6 that there is a such that
[TABLE] 2. B.
The construction of and .
We apply Claim I for and . We obtain and such that is an -limsup of and for all and with being an -limsup of we have that .
By the above applied to and an -limsup of , we get after using Corollary 4.6 that there is a such that
[TABLE] 3. C.
The construction of and .
We apply Claim I for and . We obtain and such that is an -limsup of and for all and with being an -limsup of we have that .
By the above applied to and an -limsup of , we get after using Corollary 4.6 that there is an such that
[TABLE] 4. D.
The construction of .
Since is a -limsup of , it is also a -limsup of . Similarly, is a -limsup of .
Put and take to be a -limsup of .
Since is also a -limsup of ,
[TABLE]
and similarly
[TABLE]
Also take note that is a -limsup of . By Corollary 4.7, we get that there is an such that
[TABLE] 5. E.
The construction of and .
We apply Claim I for and . We obtain and such that is an -limsup of and for all and with being an -limsup of we have that .
By the above applied to and an -limsup of , we get after using Corollary 4.6 that there is an such that
[TABLE]
Put . Then
[TABLE] 6. F.
The construction of .
Since is a -limsup of , it is also a -limsup of . Similarly, is a -limsup of .
Put and take to be a -limsup of .
Since is also a -limsup of ,
[TABLE]
and similarly
[TABLE]
Also take note that is a -limsup of . By Corollary 4.7, we get that there is an such that
[TABLE]
Put . Then
[TABLE]
We are now done.
Claim III. Let , , , and , , , , , be as in Claim II. If we put , then . (One notices here that the part of the proof corresponding to this claim will not need further inspection, as it is enough to obtain a value for in an analysis of Claim II.)
Proof of claim: We will further divide the proof of this claim into sub-claims.
Sub-claim 1. We have that:
- •
, ,
, .
Proof of sub-claim: First, we remark that:
[TABLE]
so, using Corollary 2.32.(ii),
[TABLE]
and, by Corollary 2.32.(i),
[TABLE]
Now we may write:
[TABLE]
Similarly, one may show that .
Sub-claim 2. We have that:
[TABLE]
Proof of sub-claim: Suppose that . Then
[TABLE]
so
[TABLE]
which is a contradiction, since .
Similarly, one shows that .
Sub-claim 3. We have that:
- •
, ;
- •
, .
Proof of sub-claim: Since , we have, using Corollary 2.32.(iii), that . We now compute:
[TABLE]
from which we obtain
[TABLE]
Suppose now that . Then
[TABLE]
and so
[TABLE]
which is a contradiction, since .
Therefore
[TABLE]
so
[TABLE]
We have then
[TABLE]
Similarly, taking into account, when needed, that
[TABLE]
we obtain the other three inequalities.
Sub-claim 4. We have that:
[TABLE]
Proof of sub-claim: By Lemma 2.7, we have:
[TABLE]
Given that, as before, and so , and that
[TABLE]
we get that
[TABLE]
Therefore (using the first item in Claim II),
[TABLE]
In a similar way, using the fact that , we obtain the other inequality to be proven.
Sub-claim 5. We have that .
Proof of sub-claim: We know that . Since
[TABLE]
we have that
[TABLE]
and so
[TABLE]
Similarly, using as the “pivot”, we get that , so
[TABLE]
i.e.
[TABLE]
We can now compute:
[TABLE]
which is what we wanted.
It follows immediately, by the definition of , that . To finish the proof of the claim, we see that
[TABLE]
which also finishes the proof of the theorem.
5 The extraction of the witness
5.1 The logical analysis of Claim I
The first proposition in this section, Proposition 5.1, is the (partial) functional interpretation of Proposition 4.3, i.e. of the existence of -limsup’s using only functionals definable in the fragment (which only contains the recursor constants and ) of Gödel’s . This analysis was obtained with the crucial guidance of the functional interpretation of induction from [58].
We then give in Proposition 5.2 the (partial) functional interpretation of the proof of Claim I, i.e. the existence of -infima for approximate limsup’s, by functions definable in (as now also is used). Since the functional interpretation of the Claim II, which only uses the existence of approximate limsup’s and plain logic plus elementary arithmetic, can be interpreted already in this guarantees the extractability of a rate of metastability definable in
In the following we use, for any , the notation n\mathbin{\vphantom{+}\text{\ooalign{\kern-1.50696pt\cr\smash{\cdot}-\cr}}}m to denote if and [math] otherwise. We also use the usual conventions when defining higher-order functionals and write e.g. ‘JUM(b\cdot(k+1)\mathbin{\vphantom{+}\text{\ooalign{\kern-1.50696pt\cr\smash{\cdot}-\cr}}}PUM)’ instead of ‘J(U,M,b\cdot(k+1)\mathbin{\vphantom{+}\text{\ooalign{\kern-1.50696pt\cr\smash{\cdot}-\cr}}}P(U,M))’. Occasionally, we also use the -notation from functional programming, for a term depending on the variables to denote the function:
Proposition 5.1**.**
Let and be a sequence of reals contained in the interval . Define the following functionals:
[TABLE]
Then, for all , we have that and:
- (i)
; 2. (ii)
.
Proof.
We start with the following claim, analogous to the one in the proof of Proposition 4.3.
Claim. There is a with such that it is not the case that
[TABLE]
implies that
[TABLE]
Proof of claim: Assume towards a contradiction that the opposite holds, i.e. for all natural numbers smaller or equal to , we have that implies , where is the statement that
[TABLE]
Since holds trivially, we have by induction that . But that states that
[TABLE]
which clearly is false.
Take to be minimal with this property. Then , by the definition of the latter, so clearly . We prove the remaining conclusions.
- (i)
Since , we may write b\cdot(k+1)+1\mathbin{\vphantom{+}\text{\ooalign{\kern-1.50696pt\cr\smash{\cdot}-\cr}}}PUM=(b\cdot(k+1)\mathbin{\vphantom{+}\text{\ooalign{\kern-1.50696pt\cr\smash{\cdot}-\cr}}}PUM)+1, so
[TABLE]
and
[TABLE]
Thus,
[TABLE] 2. (ii)
Since
[TABLE]
and
[TABLE]
we have that
[TABLE]
The proof is finished. ∎
The following is a logical analysis of Claim I in the second proof (using approximate limsup’s) of Theorem 3.1 and uses as an ingredient the functional interpretation of the existence of approximate limsup’s. Here, and in the remainder of the paper, we shall frequently use numerical indices to refer to components of tuples.
Proposition 5.2**.**
Let . Let be a Banach space, be a set of diameter at most and . Let . Let be arbitrary. Define the following functionals (where any denotes a constant zero function):
[TABLE]
In the above, , and are the functionals defined in Proposition 5.1, customized by instantiating their free parameters with , and . We continue to use in the following the notation .
Then, for any there is an such that if we denote
[TABLE]
and
[TABLE]
we have that
[TABLE]
and that if
[TABLE]
then
[TABLE]
In order to obtain a true realizer, we now put, for any , to be the least which realizes the above (in order to define it properly as a functional, we put it to be [math] in the “impossible” case that there isn’t one, as in the definition of in Proposition 5.1) and to be .
Proof.
Assume towards a contradiction that the opposite holds, i.e. there is an such that if we denote for all ,
[TABLE]
and
[TABLE]
then for all , if
[TABLE]
then
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
Remark that for all ,
[TABLE]
In addition,
[TABLE]
We now derive that for all ,
[TABLE]
and
[TABLE]
together with the corresponding statement . Therefore, what we know is that for all , if
[TABLE]
then
[TABLE]
[TABLE]
[TABLE]
and
[TABLE]
We shall now prove by induction that for all ,
[TABLE]
It remains to show the base case (): we apply Proposition 5.1 for and . Using
[TABLE]
and - similarly -
[TABLE]
we see that what we obtain is that
[TABLE]
Using that , we obtain that
[TABLE]
i.e. what we wanted. The induction case follows immediately by our assumption. Therefore we have that for all ,
[TABLE]
so
[TABLE]
Since by construction and , what we obtain is
[TABLE]
contradicting the fact that . ∎
5.2 The logical analysis of Claim II
In the sequel, we shall denote by (here stands for the sequence )
[TABLE]
the statement that (where we write )
[TABLE]
and that if
[TABLE]
then
[TABLE]
We will also make the parameters and (though not ) in the from Proposition 5.2 explicit in what follows. Thus, Proposition 5.2 states that for any , , and , if we put
[TABLE]
i.e., in particular, is a 5-tuple – corresponding to in the above definition – whereas is a number (and also, we add for clarity, returns a 5-tuple – corresponding to – while returns a number), then
[TABLE]
5.2.1 Preparation
In the proposition below, the eight items correspond to the eight inequalities involving the sequence that must be satisfied in Claim II.
Proposition 5.3**.**
Let be a Banach space, and be a set of diameter at most . Let . Then there is a such that for any having the property that for any , , and , if we put
[TABLE]
then one has
[TABLE]
we have that for any , , , , , , , and , if we put
[TABLE]
we have that
- (i)
; 2. (ii)
; 3. (iii)
;
; 4. (iv)
; 5. (v)
; 6. (vi)
;
; 7. (vii)
; 8. (viii)
.
Take notice that:
By the discussion at the beginning of this subsection, we already have such a , but its form is not relevant for this proposition. 2. 2.
The exact form of the will be given over the course of the proof.
Proof.
We shall first derive a purely qualitative version of the above. Namely, let , , , , , , , be given. We will show that there exist , , , , , , , , , such that (i)-(viii) hold. It will then follow, by the functional interpretation, that these objects can be explicitly constructed. The first step will be to prove the “non-metastable” version of our hypothesis, which we do in the following claim.
Claim. For all and , there are and such that for all and ,
[TABLE]
Proof of claim: Suppose the opposite, so there are and such that for all and there are and , such that it is not the case that
[TABLE]
Put, for any and , to be such a and . Then, for all and ,
[TABLE]
If we now put , we contradict our hypothesis.
If we apply the Claim to , we get and such that for all and ,
[TABLE]
from which we get (i). Apply then the Claim to to get and such that for all and ,
[TABLE]
from which we get (ii). Now apply the Claim to to get and such that for all and ,
[TABLE]
Put . Then we have that for all and ,
[TABLE]
and so we get (iii). Let . From (9), we have that for all and ,
[TABLE]
from which we get (iv). Similarly, from (8), we have that for all and ,
[TABLE]
from which we get (v). Afterwards, , and – and thus (vi), (vii) and (viii) – are obtained in a similar manner.
Now we proceed to the construction of . Since the above proof used only pure logic and the basic properties of the operation of addition, it follows by the soundness theorem of the functional interpretation that can be constructed out of just -terms, and case distinction. When we shall majorize to get our final bound, the case distinction will disappear, being replaced by the maximum. This is why we do not need to solve the case distinction further (which we could, by using suitable rational approximations, in order for the to be fully constructive).
For conceptual clarity, we shall split the proof analysis into two distinct parts, the purely logical one and the “mathematical” one (which uses addition). Define the following functionals:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Claim. There exist , , , , , , , such that
- (i’)
; 2. (ii’)
; 3. (iii’)
; 4. (iv’)
; 5. (v’)
; 6. (vi’)
; 7. (vii’)
; 8. (viii’)
.
Proof of claim: Define in the following way:
[TABLE]
We first apply the hypothesis on to . Since , we have that
[TABLE]
Suppose that it is not the case that
[TABLE]
Then, by the definition of , we have that , so, by (10), we have that (11) holds, which is a contradiction. Therefore (11) holds, so and
[TABLE]
We have thus proven the first and fifth item on our list. The other three pairs of items are proven in the same way, by applying the hypothesis on to
[TABLE]
[TABLE]
and
[TABLE]
successively.
To finish the proof, we need only to use the , , , , , already obtained in the claim and then to put
[TABLE]
[TABLE]
Then the items (i)-(viii) follow from the corresponding ones in the claim by a simple verification using the above definitions and the earlier ones of , , and . ∎
Remark 5.4**.**
The case distinctions made in defining the various functions in the proof of the claim above (and also in some proofs below) serve to achieve that the value produced simultaneously satisfies two requirements. This is reminiscent of the treatment of the logical contraction axiom in Gödel’s functional (‘Dialectica’) interpretation [30]. In the end, when computing the bound we are interested in by a process of majorization (monotone functional interpretation, see [40]), we can always just take the maximum of the two values and so the case distinctions are not needed to be computed but serve to justify why the bound is correct. Alternatively, the correctness of taking the maximum can also be argued for by using the so-called bounded functional interpretation [24] which globally changes the whole interpretation whereas we prefer our local verification as this does not require to actually spell out the general interpretation.
Lemma 5.5**.**
Let , be a Banach space, be a set of diameter at most and . Assume that there are suitable , , , , , , , , , , , , , , such that:
- (i)
; ; 2. (ii)
; ; 3. (iii)
; 4. (iv)
, and are the functionals defined in Proposition 5.1, customized by instantiating their free parameters with , and ; 5. (v)
; 6. (vi)
, for an arbitrary argument , has the following value: if it is not the case that
[TABLE]
then , else if it is not the case that
[TABLE]
and
[TABLE]
then , else ; 7. (vii)
; 8. (viii)
; 9. (ix)
.
Then we have that
[TABLE]
Proof.
By the definition of , we have that
[TABLE]
and that if
[TABLE]
and
[TABLE]
then
[TABLE]
The second instance of shows that
[TABLE]
and that if
[TABLE]
and
[TABLE]
then
[TABLE]
By Proposition 5.1 and the condition on , we get that
[TABLE]
By the condition on , if it is not the case that
[TABLE]
then . By (13), it follows that (14) holds, contradicting our assumption. Therefore, indeed, (14) holds. Suppose now that it is not the case that
[TABLE]
Then . By (13), it follows that (15) holds, contradicting our assumption. Therefore, indeed, (15) holds. In particular, since ,
[TABLE]
and
[TABLE]
Thus, . Yet again, by (13), it follows that
[TABLE]
and
[TABLE]
so, since ,
[TABLE]
and
[TABLE]
Therefore, we have that
[TABLE]
and
[TABLE]
We may now compute:
[TABLE]
and similarly we obtain
[TABLE]
so we are done. ∎
The following corollary is simply the instantiation of Lemma 5.5 above for , and .
Corollary 5.6**.**
Let , be a Banach space, be a set of diameter at most and . Assume that there are suitable , , , , , , , , , , , such that:
- (i)
; 2. (ii)
; 3. (iii)
; 4. (iv)
, and are the functionals defined in Proposition 5.1, customized by instantiating their free parameters with , and ; 5. (v)
; 6. (vi)
, for an arbitrary argument , has the following value: if it is not the case that
[TABLE]
then , else ; 7. (vii)
; 8. (viii)
; 9. (ix)
.
Then we have that
[TABLE]
5.2.2 The extraction of the quantities in Claim II
We will now show how to prove Claim II in the second proof of Theorem 3.1 (the one that uses approximate limsup’s). We use the notations introduced before the statement of that claim.
We will now define .
- I.
The definition of and .
These quantities are defined analogously to the corresponding ones in (the proof of) Claim II. First put
[TABLE]
For the , we just have to extract the point (i.e. the fourth component) from and then form the sequence as before. 2. II.
The definition of and .
Consider some arbitrary for their arguments.
Let be and , and be the functionals defined in Proposition 5.1, customized by instantiating their free parameters with , and . We continue to use in the following the notation . We define , for an arbitrary argument , in the following way: if it is not the case that
[TABLE]
then put their value as , else put it as . Finally, put the value of to be and the value of to be . 3. III.
The definition of and .
Consider some arbitrary for their arguments.
Let be and , and be the functionals defined in Proposition 5.1, customized by instantiating their free parameters with , and . We continue to use in the following the notation . We define , for an arbitrary argument , in the following way: if it is not the case that
[TABLE]
then put their value as , else put it as . Finally, put the value of to be and the value of to be . 4. IV.
The definition of and .
Consider some arbitrary for their arguments.
Put
[TABLE]
Now put the value of to be
[TABLE]
and the one of to be
[TABLE] 5. V.
The definition of and .
These are defined similarly to and , with the caveat that we need access to in order to work with the value when defining the corresponding , and . 6. VI.
The definition of and .
These will play a role in the application of Lemma 5.5, so we step carefully through their definition.
Consider some arbitrary for their arguments.
Let be and , and be the functionals defined in Proposition 5.1, customized by instantiating their free parameters with , and . We continue to use in the following the notation . We define , for an arbitrary argument , in the following way: if it is not the case that
[TABLE]
then , else if it is not the case that
[TABLE]
and
[TABLE]
then put their value as , else put it as . Finally, put the value of to be and the value of to be . 7. VII.
The definition of and .
These are defined similarly to . 8. VIII.
The definition of and .
These are defined similarly to .
Now that we have defined , put
[TABLE]
and apply Proposition 5.3.
Claim II then follows by applying Corollary 5.6 four times and Lemma 5.5 two times, and then performing some simple computations similar to the ones in the original proof of the claim. The relevant fact here is that the that witnesses the metastability property is equal to
[TABLE]
6 The rate of metastability
When one has reached the end of the previous section, one can rightfully say that one is in the possession of a formula witnessing, for any and , the rank corresponding to the metastable reformulation of the Cauchy property (depending on additional parameters of the problem). It is however not an effective formula and not uniform at all as it depends on all the data of the problem. However by a process of majorization one easily obtains a bound (called a rate of metastability in the Introduction) which is both effective and highly uniform in the sense that it – in addition to and – only depends on the norm bound and the moduli , , , , and but not on , , , or themselves. In order to explain this approach, however, we need to first make a detour into the details of the calculus of functionals in which our final bound will be expressed.
The system of Hilbert-Gödel, mentioned in the Introduction, is a system of functionals of finite types. Those finite types are defined inductively in the following way: there is a primitive type of natural numbers, and if we have two types and , we have a type denoted by of functions from elements of type to elements of type . Therefore, there is e.g. a type of functions . Product types are not built into the system, but they can be emulated by currying, i.e. the identification of with . The functionals themselves are given by terms in this system, which are built up inductively by repeated application of variables and of constants for zero and successor, for basic combinatory operations, and lastly for recursion over natural numbers.
The crucial notion that we will make use of in the following is the one of majorization, introduced by Howard [35]. Majorization is a family of binary relations, i.e. on elements of each type one has a relation . It is defined inductively and, moreover, hereditarily: for two natural numbers and one has iff and if and are of type then iff for all , of type with one has . For example, to any , we associate the function , defined for any , by
[TABLE]
and it is immediate that – we say of that it majorizes or is a majorant for . Not all elements of higher types admit a majorant, but all the constants of do, and by heredity this extends to all terms (containing only variable of types ) of . As an illustrating example, if is defined recursively by a schema like the following (suppressing the type information and ignoring the definitions of and ):
[TABLE]
then if and are majorants for and , respectively, it is easy to check that the function defined by
[TABLE]
majorizes , where for functionals is defined pointwise.
There is one further issue we need to take care of. In order to formalize arguments involving e.g. Banach spaces, one needs to extend (as was first done in [39]) the type system with a new primitive type corresponding to elements of such a space. To any such extended (‘abstract’) type one can then associate an ordinary type simply by replacing all the ’s with ’s. Majorization is then defined in [27] on each abstract type as a binary relation between elements of type and those of type , as follows: first we have that for any and , iff and then one continues in the same hereditary manner as on the ordinary types.
As a consequence of all this, if one would majorize all the functionals that play a role into the definition of the witness obtained earlier, one would get a chance at finding a purely numerical (and thus uniform) rate of metastability in the sense defined above, after all the case distinctions are removed in favour of taking the corresponding maximum. This is what we will now proceed to do in a stepwise fashion.
First, we majorize the functionals introduced in Proposition 5.1. Those have three hidden parameters: the sequence itself, which is only used directly when defining , and so it completely disappears by majorization, the , which we show here explicitly, and the upper bound, which we instantiate with , since that is the greatest possible bound on the sequences for which the approximate limsup’s are obtained. Since the is trivially majorized by , we omit it, since we can replace it by this value in its further appearances. Note that only the and the will play a role in further developments.
[TABLE]
We now do the same for the functionals in Proposition 5.2. Remark the added explicit parameter . We specify that the variables , and are of type , is in and the fifth and sixth components of take values in (this will be relevant for the calibration of the exact level of recursion that is needed to define these objects). Again, the only functional which we will use later is , the rest of them only serve to define it.
[TABLE]
where for tuples is understood componentwise. Now we begin the most intricate portion of the majorization procedure, namely the treatment of the functions defined in the final part of the previous section. We make some remarks in order to convince the reader of the plausibility of the solution given below. First of all, since yields a sequence which is bounded by , it is trivially majorized, so we may omit it, like before with the . In that same vein, we may omit some parameters if the majorant does not actually depend on them. For example, a majorant for will now no longer depend on , and moreover it can be replaced by the same majorant as for , provided that and are replaced in applications by and . A case distinction may be replaced by a (pointwise) maximum (the verification is as immediate as for the recursion example given before), whereas when majorizing small real numbers the maximum is, obviously, replaced by a minimum. For all undefined quantities below, see sub-section 5.2.2, as well as the notations introduced before the statement of Claim II in Section 4.
[TABLE]
[TABLE]
We now majorize the functionals appearing in the proof of Proposition 5.3. First, we treat the arithmetical shuffling stage.
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Finally, we may treat the purely logical stage, where taking the maximum replaces the case distinctions. What we need to take care of is that the instance of majorizing is applied to its proper arguments, namely (here) .
[TABLE]
As before, one obtains the final bound of
[TABLE]
which, taking care of the dependencies in the formula just produced, we may denote as
[TABLE]
This, however, is not a rate of metastability in the sense that the notion was defined in the Introduction, but it may be easily converted into one. We remark that (suppressing the indices) depends on the only via and, moreover, the only property of that is used is that it is a majorant for . Therefore, for any such that for any , , since then is also a majorant for , is a bound on the (least) such that . The uniformity of the bound having been already taken care of, we may allow the to depend on the sequence itself, and so
[TABLE]
is a valid choice (note that this can be made constructive by using suitable rational approximations). We may then simply put
[TABLE]
to obtain our main result, which is expressed as follows.
Theorem 6.1**.**
Let be a Banach space which is uniformly convex with modulus and uniformly smooth with modulus . Let a closed, convex, nonempty subset. Let be such that for all , and the diameter of is bounded by . Let be a pseudocontraction that is uniformly continuous with modulus and . For all put to be the unique point in such that . Let , and be such that:
- •
for all and all , ;
- •
for all , .
Denote, for all , . Then, for all and there is an such that for all ,
Thus, we have obtained a rate of metastability which is definable in the subsystem of containing at most type two recursion, which we denote by . Note that in order for our bound to be properly said to be defined in that calculus, one must take care that only natural numbers and functionals thereof are used in the definition. The most prominent examples of this sort are that one cannot work with an and must instead use a natural approximation standing for , and also that the moduli of convexity, smoothness and continuity must also operate with and return natural numbers having this interpretation. This is straightforward to arrange (see also the metatheorems in [40, 46] which use the respective moduli in this form).
A closer look at the bound shows that type two recursion is only used in the definition of needed in defining because of the argument which is in turn used by . The concrete instances of to which is applied in the final stage, however, do not depend on that parameter, as it may be gleamed from a very careful examination. To see this, it is crucial to note that the functionals do not depend on the fifth components of which play the role of (That these functionals depend neither on the third nor the fourth component of is not surprising since these can be easily majorized in terms of and for the respective error which corresponds to the definition of the first and second components of the ’s.) Therefore one may replace that recursion by a (simpler) type one recursion111On the other hand, in the applications of that were used to obtain the actual realizer, the parameter played a nondisposable role in the case distinction, but one can also make the remark that cannot play another role because in the proof of Lemma 5.5, the corresponding ‘’ statements within the ’s were never used.. Note also that the primitive recursion of actually only concerns the components 3-6 (the first two ones have constant values) which are of types , , and – so that this is a recursion of type . Actually, using again that the ’s to which this recursion is applied do not depend on the type component of , one can see that in the case at hand it reduces to a recursion of type . We also remark, that in the situation at hand, the functional (and hence ) actually is constantly [math] since the respective functionals, namely the fifth components of the ’s, are [math].
Corollary 6.2**.**
The bound providing a rate of metastability for the resolvents of continuous pseudocontractive operators in Banach spaces which are uniformly convex and uniformly smooth, is definable in as a functional in the parameters , , , , , , , .
Remark 6.3**.**
A detailed analysis of the structure of our rate of metastability might actually reveal – in line with Lemma 4 in [57] – that the remaining type-1 recursions (to define and ) are applied to type-2 functionals which are so simple (w.r.t. their dependence on the function argument) that our bound could be defined already in We have to leave this for future research.
Remark 6.4**.**
In the special case where the mapping is nonexpansive and has a fixed point, we may trivially remove the boundedness condition as follows: let a closed, convex, nonempty subset. Let be nonexpansive with a fixed point and . Let be such that and . For all put to be the unique point in such that . Let , , be as before. Denote, for all , . Then, for all and there is an such that for all , To see this, put to be the intersection of with the closed ball centred on with radius . Clearly is closed, convex and nonempty. Set to be restricted to , whose image is by nonexpansiveness also in . Clearly, the diameter of is bounded by , all elements of are bounded by and , so we may apply Theorem 6.1.
We now argue that the quantitative metastability of the sequence is indeed a finitization in the sense of Tao of the following theorem (which is a somewhat restricted form of the main result in [60]).
Theorem 6.5** ([60]).**
Let be a Banach space which is uniformly convex and uniformly smooth, a closed, convex, bounded, nonempty subset, be a uniformly continuous pseudocontraction and . For all put to be the unique point in such that . Then for all such that we have that converges to a fixed point of , which we denote by . In addition, the map thus defined is a sunny nonexpansive retraction (and therefore the unique such one).
For this we now show that the metastability of implies in an elementary way the above theorem: using just logic (and quantifier-free choice from to ) the metastability of implies that is Cauchy, and therefore, since is complete and is closed, it is convergent (in the context of reverse mathematics, the latter fact uses arithmetical comprehension – in fact a single use of -comprehension – to get a fast converging subsequence as required to obtain the actual limit). It is clear that the limit does not depend on the , so we can unambiguously dub it . For the rest of the proof, we fix a and denote, for all , . That is a fixed point follows from the continuity of and the fact (proven already in the beginning off Section 3) that
[TABLE]
whose trivial proof we recall here:
[TABLE]
If is already a fixed point, then clearly for all , and therefore . We have thus shown that is a retraction. To show that is sunny and nonexpansive, we seek to apply Proposition 2.16. Let . Then
[TABLE]
Now we reuse parts of the argument from Claim 3 in the last proof from Section 3. We have that
[TABLE]
so
[TABLE]
and therefore, using that is homogeneous,
[TABLE]
By passing to the limit, using the continuity of , we get that
[TABLE]
which is what we needed to show.
Remark 6.6** (for logicians; we use the terminology from [40]).**
As mentioned already, the proof of the metastability of in Section 4 shows that it can be carried out in the formal system WE-PA Note that the noneffective definition of the function can easily be avoided by using suitable rational approximations of and From this, the proof above of the convergence of only requires classical logic, a fixed (in the parameters , , needed to define ) sequence QF-AC of instances of QF-AC0,0 (in the terminology of reverse mathematics: -CA) and (a single use of) -CA. Both QF-AC0,0 and -CA can (with classical logic) be combined into -AC
Let us now specify the amount of classical logic needed when using the intuitionistically unproblematic principle AC By applying negative translation to the above proof of the Cauchyness of we obtain in WE-HAQF-ACM
[TABLE]
(here M0 denotes the Markov principle for numbers). Hence,
[TABLE]
(which also covers M0) suffices. Using the closure of WE-HAAC-LEM under the rule of -DNE (proven similarly as in [47, Section 3]) one can conclude that even (a fixed – in the parameters mentioned – sequence -LEM- of instances of)
[TABLE]
suffices (when added to WE-HAAC) to prove the Cauchyness and – in turn – the convergence of and the variational inequality (characterizing sunny nonexpansive retractions) from Proposition 2.16.
7 Applications
The convergence of the resolvents, which form an implicit iteration schema, plays a role in proving the strong convergence of some explicit iteration schemas designed to compute fixed points of some nonlinear operators.
One such schema is the Halpern iteration [33]. If is a mapping, , and , the Halpern iteration corresponding to this data is the sequence , defined by:
[TABLE]
The convergence of this sequence for nonexpansive self-mappings of closed convex bounded nonempty subsets of uniformly smooth Banach spaces was obtained by Shioji and Takahashi [62] under Wittmann’s [67] conditions on and analyzed from the point of view of proof mining by the first author and Leuştean [46], modulo the resolvent convergence. We are now in a position to complete this analysis under the additional hypothesis that is uniformly convex.
Theorem 7.1** (cf. [46, Theorem 3.2]).**
Let be a Banach space which is uniformly convex with modulus and uniformly smooth with modulus . Let a closed, convex, nonempty subset. Let be such that for all , and the diameter of is bounded by . Let be a nonexpansive mapping and , . Put and put and to be the functions defined, for all , by and . Let be such that:
- •
* with rate of divergence ;*
- •
* with rate of convergence ;*
- •
* with Cauchy modulus .*
Denote by the Halpern iteration corresponding to this data. Let be defined by [46, Theorem 3.2]. Then, for all and there is an such that for all ,
An explicit iteration schema that is in addition amenable to pseudocontractions is the Bruck iteration [15]. If is a mapping, and , such that for all , , the Bruck iteration corresponding to this data is the sequence , defined by:
[TABLE]
The convergence of this sequence in some general framework containing the case of Lipschitzian pseudocontractive self-mappings of closed convex bounded nonempty subsets of uniformly convex and smooth Banach spaces was obtained by Chidume and Zegeye [18] under some conditions on and and then analyzed from the point of view of proof mining by Körnlein and the first author [49], again modulo the resolvent convergence. We now complete their analysis.
Theorem 7.2** (cf. [49, Corollary 2.10]).**
Let be a Banach space which is uniformly convex with modulus and uniformly smooth with modulus . Let a closed, convex, nonempty subset. Let be such that for all , and the diameter of is bounded by . Let be a Lipschitzian pseudocontraction of constant and . Let , satisfy the Chidume-Zegeye conditions. Denote by the Bruck iteration corresponding to this data. Let , , and be defined as in [49]. Put to be multiplication by and for all , . Then, for all and there is an such that for all , and for all , .
Proof.
The only issue that needs additional justification is that and are the required moduli for the auxiliary sequence used in the original relative metastability proof. This is shown in the first lines of the proof of [49, Theorem 2.8]. ∎
Another application of the rate of metastability extracted in this paper is given in [44], where it is used to construct a rate of metastability for the strongly convergent Halpern-type Proximal Point Algorithm in uniformly convex and uniformly smooth Banach spaces from [4].
8 Acknowledgements
The authors have been supported by the German Science Foundation (DFG Project KO 1737/6-1).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Aleyner, S. Reich, An explicit construction of sunny nonexpansive retractions in Banach spaces. Fixed Point Theory Appl. 2005 , no. 3, 295–305, 2005.
- 2[2] A. Aleyner, S. Reich, A note on explicit iterative constructions of sunny nonexpansive retractions in Banach spaces. J. Nonlinear Convex Anal. 6, 525–533, 2005.
- 3[3] A. Aleyner, S. Reich, Implicit and explicit constructions of sunny nonexpansive retractions in Banach spaces. J. Math. Appl. 29, 5–16, 2007.
- 4[4] K. Aoyama, M. Toyoda, Approximation of zeros of accretive operators in a Banach space. Israel J. Math. 220, no. 2, 803–816, 2017.
- 5[5] M. Bačák, U. Kohlenbach, On proximal mappings with Young functions in uniformly convex Banach spaces. J. Convex Anal. 25, 1291–1318, 2018.
- 6[6] J.-B. Baillon, Un théorème de type ergodique pour les contractions non linéaires dans un espace de Hilbert. C.R. Acad. Sci. Paris Sèr. A-B 280, 1511–1514, 1975.
- 7[7] P. Bénilan, Equations d’évolution dans un espace de Banach quelconque et applications. Thèse Orsay, 1972.
- 8[8] F. E. Browder, Nonlinear mappings of nonexpansive and accretive type in Banach spaces. Bull. Amer. Math. Soc. 73, 875–882, 1967.
