Metastability of the proximal point algorithm with multi-parameters
Bruno Dinis, Pedro Pinto

TL;DR
This paper applies proof mining techniques to analyze the convergence behavior of a generalized multi-parameter proximal point algorithm, providing explicit bounds and quantitative insights into its metastability and regularity.
Contribution
It offers the first primitive recursive bounds on the metastability of a multi-parameter proximal point algorithm using proof mining methods.
Findings
Provides explicit bounds on the metastability of the algorithm.
Quantifies the asymptotic regularity of the iteration.
Arithmetizes the $\
Abstract
In this article we use techniques of proof mining to analyse a result, due to Yonghong Yao and Muhammad Aslam Noor, concerning the strong convergence of a generalized proximal point algorithm which involves multiple parameters. Yao and Noor's result ensures the strong convergence of the algorithm to the nearest projection point onto the set of zeros of the operator. Our quantitative analysis, guided by Fernando Ferreira and Paulo Oliva's bounded functional interpretation, provides a primitive recursive bound on the metastability for the convergence of the algorithm, in the sense of Terence Tao. Furthermore, we obtain quantitative information on the asymptotic regularity of the iteration. The results of this paper are made possible by an arithmetization of the .
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Metastability of the proximal point algorithm with multi-parameters
††thanks: 2010 Mathematics Subject Classification: 90C25, 47H09, 46N10, 03F10, 03F60. Keywords: Convex optimization, proximal point algorithm, proof mining, metastability.
Bruno Dinis Departamento de Matemática, Faculdade de Ciências da Universidade de Lisboa, Campo Grande, Edifício C6, 1749-016 Lisboa, Portugal. [email protected].
Pedro Pinto Department of Mathematics, Technische Universität Darmstadt, Schlossgartenstrasse 7, 64289 Darmstadt, Germany.
Abstract
In this article we use techniques of proof mining to analyse a result, due to Yonghong Yao and Muhammad Aslam Noor, concerning the strong convergence of a generalized proximal point algorithm which involves multiple parameters. Yao and Noor’s result ensures the strong convergence of the algorithm to the nearest projection point onto the set of zeros of the operator. Our quantitative analysis, guided by Fernando Ferreira and Paulo Oliva’s bounded functional interpretation, provides a primitive recursive bound on the metastability for the convergence of the algorithm, in the sense of Terence Tao. Furthermore, we obtain quantitative information on the asymptotic regularity of the iteration. The results of this paper are made possible by an arithmetization of the .
1 Introduction
In the theory of maximal monotone operators, many problems, such as problems of minimization of a function, variational inequalities, etc., can be formulated as finding a zero of a maximal monotone operator (see e.g. [17] and the references therein). The proximal point algorithm [39] is a powerful and successful algorithm in finding a solution of maximal monotone operators. Starting from any initial guess , the generates a sequence which approximates the solution, defined by
[TABLE]
where are resolvent functions of a maximal monotone operator with parameter , and an error sequence. Ralph Rockafellar showed that () converges weakly towards a zero of the operator, provided that is bounded away from zero and is summable, i.e. . Osman Güler showed in [13], by providing a counter-example, that in general () does not converge strongly. For this reason, several modifications of the algorithm were studied (see for example [8, 15, 35]).
The Krasnosel’skiĭ-Mann () iteration is an important algorithm for the approximation of fixed points of nonexpansive maps [34]. One relevant feature that makes the iteration attractive is the fact that it is Fejér monotone relative to the set of fixed points [6, 25]. In general, one can only guarantee weak convergence for the iteration. Of course, if in practical optimization problems we restrict ourselves to a finite-dimensional context, then this limitation disappears. On the other hand, with the so-called Halpern variant iteration [14] one can actually establish a general strong convergence result. The fact that the Halpern iteration seems to be less sought after for optimization practice may rest on the fact that this iteration is no longer Fejér monotone. However, recent results brought a renewed interest to the Halpern type iteration (see e.g. [33, 7]). Through speedup techniques (as in [36]), improvements on the rate of asymptotic regularity were obtained for the Halpern iteration.
The impact of these iterations in fixed point theory motivated the following variants of ():
[TABLE]
[TABLE]
with , and for all , and is an error term. While, in general, () is only weakly convergent [16], the algorithm () is strongly convergent [16, 46]. Other strongly convergent variants of the proximal point algorithm are, for example, the hybrid projection [40], the shrinking projection [5] and the viscosity [43].
In this paper, we will focus on the following multi-parameter version of the proximal point algorithm, considered by Yonghong Yao and Muhammad Aslam Noor in [47], which is an hybrid between () and (). Let be an initial guess and define
[TABLE]
where is given, and for all it holds that , and .
Yao and Noor showed in [47] that the algorithm () is strongly convergent to the nearest projection point onto the set of zeros of the operator (see Theorem 3). The goal of this paper is to study quantitative information regarding the strong convergence of the iteration ().
Using proof-theoretical techniques, we analyse Yao and Noor’s proof, and are able to prove the strong convergence of () using only a weaker version of the metric projection principle, an arithmetization of the , and bypassing the use of a sequential weak compactness argument. Theorem 29 provides an effective bound on the metastability – in the sense of Terence Tao [45, 44] – of (), i.e. we obtain a computable function such that
[TABLE]
Note that ( ‣ 1) is equivalent (in a non-effective way) to the Cauchy property of the sequence . For a general sequence this is the best one can hope for, in the sense that it is not possible to obtain a computable rate for the Cauchy property. Furthermore, we obtain quantitative information on the asymptotic regularity of the iteration (Proposition 21 and Corollary 22). The complexity of the extracted information follows directly from the strength of the logical principles required for the proof. The fact that these results are proved using the weaker principles mentioned above is reflected in the bounds obtained, which are primitive recursive (in the sense of Kleene). The way to deal with the projection and sequential weak compactness arguments was already understood [23, 10], so the remaining obstacle was on how to deal with the . The problem is solved by an arithmetization of the (Lemma 17) tailored to deal with a key argument (Lemma 5) in Yao and Noor’s proof (see also [29] for an alternative way to deal with the ).
The methods used in this paper are set in the framework of proof mining, a program that describes the process of using proof-theoretical techniques to analyse mathematical proofs with the aim of extracting new information. This program was developed mainly by the works of Ulrich Kohlenbach and his collaborators, producing a vast number of results; analysing proofs from areas such as approximation theory, ergodic theory, fixed point theory and optimization theory (see e.g. [24, 28, 22]). In the proof mining program, proof interpretations are used as tools to extract constructive (i.e. computational) information from noneffective proofs. The output of the interpretation, which we refer to as quantitative version, gives explicit information that previously was implicit and hidden behind the use of quantifiers. The standard technique in proof mining is Kohlenbach’s monotone functional interpretation [18, 22] which is based on Kurt Gödel’s Dialectica interpretation [12, 1] that works with upper bounds for witnessing terms instead of precise witnesses. Recently, Fernando Ferreira and Paulo Oliva’s bounded functional interpretation () [11] (see also the classical version [9]) has proven to be a valid alternative for the proof mining program, providing new insight to some theoretical questions concerning the elimination of sequential weak compactness [10].
The quantitative results, as well as their proofs, obtained by the proof mining program do not presuppose any particular knowledge of logical tools because the latter are only used as an intermediate step and are not visible in the final product. Apart from the end of Section 5, where we make some remarks on the logical aspects of the analysis, our work is no exception and so, knowledge of tools from logic in general and familiarity with the in particular are not necessary to read this paper.
We would like to point out that our work comes as a natural generalization of [30, 32, 31].
The structure of the paper is the following. In Section 2 we recall some notions concerning Hilbert spaces and monotone operators. We also state the result by Yao and Noor for which we give a quantitative analysis in the subsequent sections and explain its original proof. In Section 3 we give a quantitative treatment of the principles used in the original proof: the projection argument, sequential weak compactness, and the arithmetization of the . The quantitative analysis of Yao and Noor’s proof is carried out in Section 4. We start by obtaining a partial result which depends on an additional condition (Subsection 4.1). This condition is then shown to be satisfied by a concrete functional (Subsection 4.2). Finally, we prove the main result regarding the metastability for () (Subsection 4.3). We leave some final remarks that allow to better understand some theoretical aspects of the analysis to Section 5.
2 Preliminaries
We work in a real Hilbert space with inner product and norm . We recall that a multi-valued operator is said to be monotone if and only if whenever and are elements of the graph of , it holds that . A monotone operator is maximal monotone if the graph of is not properly contained in the graph of any other monotone operator on . We denote by the set of all zeros of , i.e. . We fix a maximal monotone operator and assume henceforth to be nonempty. For , we use to denote the resolvent function of with parameter , i.e. the single-valued function defined by
[TABLE]
A mapping is called nonexpansive if
[TABLE]
The set of fixed points of a mapping is the set . The resolvent mapping is nonexpansive, and for every , the set of fixed points of is . If is nonexpansive, then is a closed and convex subset of . For a comprehensive introduction to convex analysis and the theory of monotone operators in Hilbert spaces we refer to [2].
The following lemmas are well-known.
Lemma 1** (Resolvent identity).**
For every and every it holds that
[TABLE]
Lemma 2** ([35]).**
If , then for all it holds that .
The quantitative analysis carried out in this paper makes use of the notion of monotone functional. For that matter we consider the notion of strong majorizability from [3] in the following two particular cases. In the case of functions , we have
[TABLE]
and in the case of functionals , we have
[TABLE]
We say that is monotone if and similarly for a functional . For functions , this monotone property coincides with the usual property . Quantifications over monotone functions will be abbreviated by . Due to the particularities of the , our quantitative results quantify over such monotone functions. Note however that for all , one has , where . Hence there is no real restriction in working with monotone quantifications.
2.1 A result by Yao and Noor
We present the result by Yao and Noor concerning the strong convergence of the (Theorem 3) and give a detailed description of its proof. This will hopefully guide the reader through our quantitative analysis and the several steps that it requires.
Consider the following set of conditions.
- ()
. 2. ()
. 3. ()
. 4. ()
where is some positive constant. 5. ()
. 6. ()
.
Theorem 3**.**
([47, Theorem 3.3])* Let be generated by (). Assume that - hold. Then converges strongly to a point , the nearest to .*
The proof of Theorem 3 relies on Lemma 4 and Lemma 5 below, due to Tomonari Suzuki [42], for which we give quantitative versions in Subsection 4.2 (Lemma 25 and Lemma 26).111In fact, Lemma 4 is only used to prove Lemma 5.
Lemma 4**.**
([42, Lemma 2.1])* Let and be sequences in a Banach space and let be a sequence in such that . Suppose that for all , and . Then*
[TABLE]
Lemma 5**.**
([42, Lemma 2.2 ])* Let and be bounded sequences in a Banach space and let be a sequence in with . Suppose that for all , and . Then .*
The proof of Theorem 3 is divided in the following steps:
- (1)
Show that is bounded. This is just a simple proof by induction and some easy computations. 2. (2)
. Letting , it is shown first that . Using Lemma 5 one concludes that which, by the definition of , is enough to conclude this step. 3. (3)
. From step (2) and the hypothesis of the theorem one concludes that . The conclusion follows from Lemma 2. 4. (4)
Projection argument. With the projection point of onto it is shown that . 5. (5)
Sequential weak compactness and demiclosedness. Pick a subsequence of such that and converges weakly to some . Here the following demiclosedness principle is used.
Lemma 6** (Demiclosedness principle [4]).**
Let be a closed convex subset of and let be a nonexpansive mapping such that . Assume that is a sequence in such that weakly converges to and converges strongly to . Then .
By step (4) it follows that . 6. (6)
Main combinatorial part. In this final step it is shown that the conditions of Lemma 7 below are satisfied. The application of this lemma is enough to conclude the result.
Lemma 7**.**
([46, Lemma 2.5])* Let be a sequence of nonnegative real numbers such that for all *
[TABLE]
where and are such that , and . Then converges to zero.
3 Avoiding roadblocks
From the point of view of proof mining the analysis of the original proof of Theorem 3 presents some difficulties that prevent the extraction of simple bounds. These concern the projection argument in step (4), the weak compactness in step (5) and the assumed existence of the in Suzuki’s lemmas. In Subsections 3.1 and 3.2 we adapt the way to deal with projection [23] and sequential weak compactness [10] to our context. Furthermore, in Subsection 3.2 we give a quantitative version of Lemma 7. In Subsection 3.3 we give an arithmetization of the that allows to obtain a quantitative version of Lemma 5. A more detailed explanation to why these principles are problematic will be given in Section 5.
3.1 Projection argument
In this section we deal with the following projection argument which is used in the original proof
[TABLE]
stating that there is a zero of that is the nearest to a given . The squares are added here only for an easier connection to the inner product of the space that will be required below.
As noticed by Kohlenbach [23], instead of (2), it is enough for the quantitative analysis to consider the weaker statement
[TABLE]
While the proof of (2) requires the use of countable choice, the statement (3) can be proved by a simple induction argument and this fact is reflected on the extracted bounds, which are then recursively defined.
An analysis of the projection argument via the monotone functional interpretation was previously carried out by Kohlenbach [23]. A detailed explanation of the analysis of the projection argument using the bounded functional interpretation was shown in [10]. In the latter, the assumption that one is working in a bounded set plays an important role in simplifying the interpretation. Although we do not have that assumption here, we can equivalently consider the projection statement in (2) restricted to a big enough ball centered at some zero point . This restriction was considered in [38], giving rise to the quantitative version in Proposition 9 bellow.
Notation 8**.**
From now on, we will write instead of and instead of , under the assumption that satisfies for all . For each and function we denote the -fold iteration of by . I.e. and . Let be a zero of . For any , let denote the closed ball centered at with radius . The zero point is always made clear by the context.
Proposition 9** ([38]).**
*Let be such that for some .
For any natural number and monotone function , there are and such that*
[TABLE]
and
[TABLE]
where .
In Proposition 9, we considered only the (almost) fixed points of since the fixed point set of all resolvent functions coincide. We prove the following quantitative version which requires majorizing information on the sequence of real numbers .
Lemma 10**.**
Let and satisfying . Consider a monotone function satisfying . Let be the monotone function defined by . For any and any ,
[TABLE]
Proof.
Let and assume . For , we have
[TABLE]
Hence
[TABLE]
since . ∎
3.2 Sequential weak compactness
In the proof by Yao and Noor, sequential weak compactness, together with the demiclosedness principle, is used to show that , where is the projection point of onto and is generated by (). This means that there exists (the projection point) such that
[TABLE]
Without having access to the projection point, and working only with approximations, we switch from (5) to the following weakening
[TABLE]
As it turns out, this weaker form is still enough to carry out the proof, avoiding the use of sequential weak compactness. This is similar to the arguments in the beginning of section 2 in [10].
Proposition 11** ([38]).**
Let be a natural number satisfying for some point . For any and monotone function , there are and such that
[TABLE]
with and .
In Proposition 21(iii) below (under an additional assumption () on a functional ) we compute a monotone functional satisfying
[TABLE]
Furthermore, in Lemma 18, we show that the sequence is bounded and compute an explicit natural number such that .
Proposition 12 corresponds to the elimination of the sequential weak compactness argument. It can be seen as an application of the general principle in [10, Proposition 4.3] with , where the sequence being considered is , is given by Proposition 11 and .
Proposition 12**.**
Let be a monotone function satisfying (6). For some , let be such that , and . For any and monotone function , there are and such that and
[TABLE]
where with , and .
Proof.
Let and a monotone function be given. By Proposition 11 applied to and the function , we get and such that and
[TABLE]
By (6), there is such that
[TABLE]
Hence, . Since , by (7) we conclude that
[TABLE]
Finally, by the monotonicity of the function ,
[TABLE]
As mentioned in Subsection 2.1, the final step of the proof of Theorem 3 is an application of Lemma 7. There , with the projection point of onto . However, using approximations to the projection point instead of the projection point itself, the inequality (1) only holds with in place of , for a certain sequence of errors. The following result from [38] corresponds to a quantitative version of this statement.
Lemma 13** ([38]).**
Let be a bounded sequence of non-negative real numbers and a positive upper bound on . Consider sequences of real numbers , , and and assume the existence of a monotone function satisfying , for all . For natural numbers and assume
. 2.
. 3.
**
Then
[TABLE]
with .
A direct application of Lemma 13 gives the following result which is more suitable for our analysis.
Lemma 14**.**
Let be a bounded subset of . Let be given and, for each , consider the sequences of real numbers , , and with , and such that, for all ,
[TABLE]
For a natural number and monotone functions , and , assume that:
* is a rate of divergence for , i.e. .* 2.
For all , is a positive upper bound on . 3.
For all , is a Cauchy rate for , i.e. . 4.
.
Then, for any natural number and monotone function , there are and such that
[TABLE]
where with and .
Proof.
Let and a monotone function be given. By condition , consider and such that for
[TABLE]
Define . By condition , for all , . We have and
[TABLE]
where is as in Lemma 13. Hence, for ,
[TABLE]
We are in the conditions of Lemma 13 with and , and so
[TABLE]
Noticing that, by the monotonicity of , we have , we conclude the proof. ∎
We recall that the last step in the proof by Yao and Noor is an application of Lemma 7. Similarly, in our quantitative analysis, the final step to prove metastability for () is an application of Lemma 14. As such, we need to verify each of the conditions of that result (for a specific choice of parameters). Conditions and are easy to check. The existence of a function as in condition follows from a quantitative version of . The next result ensures that condition holds.
Lemma 15**.**
Assume the conditions of Proposition 12, and that there exist satisfying and a monotone function such that . For any and monotone function , there are and such that for all ,
[TABLE]
where , , , where is the function defined in Proposition 12 and is the monotone function defined by
[TABLE]
with as in Lemma 10.
Proof.
Let and monotone be given. Applying Proposition 12 to and to the monotone function we obtain and such that
[TABLE]
and
[TABLE]
Clearly (9) implies that for one has .
Now, by (8)
[TABLE]
Hence, by (4), for ,
[TABLE]
Also so, for ,
[TABLE]
which concludes the proof. ∎
3.3 Rational approximation of the lim sup
In this section we show that the assumption of the existence of the , as in Lemma 4, can be replaced by a rational approximation. A detailed explanation on the origin of these lemmas is given in Section 5.
The idea is that, by working with approximated notions, one can relax the properties of the to something which is already satisfied by a suitable rational number. We start with the following easy result.
Lemma 16**.**
Let and be a sequence of real numbers such that . Then
[TABLE]
Proof.
Suppose towards a contradiction that (10) does not hold. Then there exist and a monotone function such that for all it holds that
[TABLE]
This implies
[TABLE]
where . One easily shows by induction on that
[TABLE]
Hence, with we conclude that
[TABLE]
Hence . Now, by (11), for and the hypothesis, we have that for all it holds that , which gives a contradiction. We conclude that (10) holds. ∎
The proof of Lemma 4 requires the following property of the
[TABLE]
We adapt Lemma 16 to this property, obtaining a result that corresponds to a quantitative version of (12).
Lemma 17**.**
Let and be a sequence of real numbers such that . Let and be monotone, let . For define and Then
[TABLE]
where .
Proof.
Let and be a given monotone function. We define, for each , the monotone functions . We apply (10) with , and , for . Then, we find, for each , and such that and . Now, there exists such that , otherwise there would be a sequence of length of natural numbers such that , which is absurd. Define the natural numbers and . Clearly and . We have that . To conclude the result it is enough to show that . Indeed, we would get, for that . We have that , and since is monotone, . Hence . ∎
4 Quantitative analysis
In this section we carry out the quantitative analysis of Theorem 3. In Subsection 4.1 we obtain intermediate results regarding asymptotic regularity and metastability depending on an additional condition. This additional condition is studied in Subsection 4.2 through the analysis of Suzuki’s lemmas (Lemmas 4 and 5). In Subsection 4.3 we prove our main result establishing the metastability for ().
We start our quantitative analysis of Theorem 3 by giving quantitative versions of the hypothesis of the theorem. We assume that there exist and monotone functions such that
- ()
. 2. ()
. 3. ()
. 4. ()
. 5. ()
. 6. ()
.
The conditions are quantitative versions of, respectively, the hypothesis ()(). Indeed, condition () states that is a rate of convergence for the sequence ; condition () postulates that is a rate of divergence for ; condition () is the quantitative version of () together with the fact that ; condition () expresses the fact that the terms of the sequence are above some positive quantity; condition () states that is a rate of convergence for the difference of terms of the sequence and condition expresses quantitatively that the sequence of the partial sums of the errors is a Cauchy sequence with Cauchy rate .
In our main result (Theorem 29) we compute an explicit bound on the metastability of the iteration () under the assumptions .
4.1 Metastability of the
We show an intermediate metastability result depending on an additional condition () in Theorem 23. Moreover, Proposition 21 and Corollary 22 give quantitative information on the asymptotic regularity of the iteration. We start by showing that the sequence generated by () is bounded and give in (14) the computational information corresponding to .
Lemma 18**.**
Let be generated by (). Assume that there exist and monotone functions such that hold. Let be such that , , and for some one has . Then , where . Moreover, with , we have and
[TABLE]
where .
Proof.
Observe that for all . By the fact that , for all and observing that each resolvent is nonexpansive, we have
[TABLE]
One easily shows by induction on that , from which we deduce that is bounded. We have that . Indeed, by () we have
[TABLE]
We have
[TABLE]
We claim that
[TABLE]
To prove the claim observe that for every it holds that . If , by the resolvent identity we have
[TABLE]
If , again by the resolvent identity we have
[TABLE]
Hence (15) holds. Then
[TABLE]
Let and . We will see that each of the terms in (16) is less than or equal to .
Since satisfies , we have that, for
[TABLE]
and
[TABLE]
Observe that . We then have
[TABLE]
Since satisfies , for it holds that
[TABLE]
Since satisfies (), for we have
[TABLE]
Combining (16)-(21) we conclude that (14) holds. ∎
Definition 19**.**
Let be sequences in . We say that a monotone function satisfies () if
[TABLE]
Remark 20**.**
The hypothesis () corresponds to the quantitative information from Suzuki’s lemma (Lemma 5). With this assumption we will compute in Theorem 23 metastability for () as well as some intermediate results regarding asymptotic regularity (Proposition 21 and Corollary 22). An explicit function satisfying () is computed in Remark 27, using Lemma 26.
The next two results give quantitative information on asymptotic regularity for the sequence .
Proposition 21**.**
Let be generated by (). Assume that there exist and monotone functions such that and hold. Let be such that , , and for some one has . Define . Let be such that and assume that there is a monotone function satisfying (). Then
; 2.
; 3.
;
where and , with the monotone function defined by .
Proof.
(i). The result is a consequence of the fact that
[TABLE]
(ii). Observe that
[TABLE]
Then
[TABLE]
We have that
[TABLE]
Indeed, for we have that
[TABLE]
Applying Part (i) to and we find such that
[TABLE]
Put . Now because clearly and . Then, from (23) and (24) we conclude that Part (ii) holds.
(iii). By Lemma 2 and () we have . Hence, Part (iii) follows from Part (ii). ∎
If () is satisfied with a rate of convergence, i.e. when does not depend on , then the properties (i)–(iii) in Proposition 21 also hold with rates of convergence.
Corollary 22**.**
Assume the conditions of Proposition 21. Assume also that , for all , i.e.
[TABLE]
Then
; 2.
; 3.
;
where .
Under the additional condition () we have the following result concerning the metastability of ().
Theorem 23**.**
Let be generated by (). Assume that there exist and monotone functions such that and hold. Let be a monotone function such that , for all . Let be such that , and for some one has . Define . Let be such that and assume that there is a monotone function satisfying (). Then
[TABLE]
where , , is as in Lemma 14, as in Lemma 15, and .
Remark 24**.**
Note that , i.e. the functional depends on the value of , , and , as well as on the functions and – and obviously on the functional .
Proof of Theorem 23.
Let . Then
[TABLE]
Then, for all
[TABLE]
where , , and .
We verify that the conditions of Lemma 14 are satisfied with , , as in Lemma 15 and .
The first condition holds by hypothesis. Since , the second condition is true with . For the third condition, using (), and the fact that we have
[TABLE]
Finally, by Lemma 15 the fourth condition of Lemma 14 is also verified.
By Lemma 14 we conclude that
[TABLE]
Let and a monotone function be given. By (26) applied to and to the function , we find and such that for ,
[TABLE]
Hence, for , and with , we have
[TABLE]
which concludes the proof. ∎
4.2 A quantitative version of Suzuki’s lemmas
We now turn to the two lemmas by Suzuki required in the original proof. The next result is a partial quantitative version of Lemma 4, which is enough for the quantitative version of Lemma 5 given in Lemma 26. As discussed in Remark 27 below, the latter allows us to obtain a concrete functional satisfying the condition ().
Lemma 25**.**
Let be sequences in a normed space . Let be a sequence of real numbers and be such that
[TABLE]
Suppose that , for all and that there exists a monotone function satisfying
[TABLE]
Let be such that . Then
[TABLE]
where and , with functions defined respectively by and . The function is as in Lemma 17.
Proof.
Let , be arbitrary and be a monotone function. Define . Considering and we apply Lemma 17 to and defined by to find and such that
[TABLE]
Furthermore, for we have
[TABLE]
From (29), (30) and the fact that we conclude that
[TABLE]
Working with given by (31), we now argue that for all .
[TABLE]
We have
[TABLE]
Hence
[TABLE]
So, holds for . To conclude we assume that holds for some and want to see that it holds for . Since
[TABLE]
we obtain that
[TABLE]
which implies that holds for all as we wanted. Instantiating with [math] in we obtain
[TABLE]
Hence
[TABLE]
Given , we conclude the result by putting in (33). ∎
Lemma 26**.**
Let be sequences in a normed space and be such that , for all . Let be a sequence of real numbers and be such that Suppose that , for all and that there exists a monotone function such that
[TABLE]
Then
[TABLE]
where , with is as in Lemma 25 and .
Proof.
Suppose towards a contradiction that there exist and a monotone function such that
[TABLE]
Define . We have that and . Applying Lemma 25 with , , and , we find and such that
[TABLE]
We have , and then
[TABLE]
Hence
[TABLE]
a contradiction. ∎
4.3 Metastability revisited
Remark 27**.**
Under the conditions of Lemma 18, the function satisfies (34). Furthermore, . Hence, Lemma 26 with , is satisfied with the function and outputs the function defined by
[TABLE]
which satisfies (). Note that having this function , the hypothesis () can be removed from Proposition 21. Moreover, the bound in Proposition 21(ii) can be simplified to , and similarly for .
Notation 28**.**
We write and , instead of and , respectively, where is the functional defined in Remark 27.
Having an explicit function satisfying () we can now prove metastability for () with the initial quantitative assumptions .
Theorem 29**.**
Let be generated by (). Assume that there exist and monotone functions such that hold. Let be a monotone function such that , for all . Let be such that , , and for some one has . Define . Let be such that . Then
[TABLE]
where , with , , as in Lemma 14, given by Lemma 15, and .
As an application we consider the special case where the sequence is constant. We note that this case was also considered in Yao and Noor’s paper [47, Corollary 3.1].
Let be a positive real number. Consider the sequence constantly equal to , and generated by (). Assume that there exist and monotone functions such that and hold and that . We consider to be the function identicaly equal to . Note that we can take either or . Clearly , and with are satisfied. The definition of in Lemma 18 simplifies to
[TABLE]
which causes changes in and consequently in , and . This simplifies the bound obtained in Theorem 29.
5 Final considerations
In Theorem 29 we obtain a bound on the metastability of () which is uniform on the parameters of the algorithm. Let us elaborate on this. The bound is uniform on the choice of the anchor point , the given initial point and a point witnessing the assumption that is nonempty, depending only on natural numbers and . The dependence on the sequences and is respectively only in the form of a rate of convergence satisfying and a rate of divergence satisfying ; a natural number satisfying ; and a natural number satisfying , a rate of convergence , satisfying and on a majorizing function . Finally, it is uniform on the error sequence , as it depends only on a Cauchy rate satisfying and on a natural number .
It is well-know that for the condition is equivalent to the condition . Although we don’t do this here, one could have worked with a rate of convergence towards zero for instead of the rate of divergence for . This is done for similar quantitative analyses in [31, 38]. In certain cases, that option may prove useful as a function may be of lower complexity than that of a function , e.g. the sequence has a liner rate but an exponential rate .
Metastability and the Cauchy property are equivalent. So, Theorem 29 and the fact that the space is complete imply that () is strongly convergent. Similarly, from Proposition 21 (with instantiation ) we conclude that and thus () converges to a zero of the maximal monotone operator , say . To see that must be the projection point first note that since we have that . Since , the conclusion of Lemma 15 is satisfied with . Finally, following the arguments in the proof of Theorem 23 with , we see that (26) holds with the point always equal to , implying that . This shows that the analysis in this paper indeed corresponds to a quantitative analysis of the original proof by Yao and Noor.
We finish with some considerations concerning the logical aspects of our analysis.
Let us start by pointing out that, in principle, the monotone functional interpretation could be used to analyse the results presented in this paper. However, our elimination of the sequential weak compactness argument can be seen as an application of the general method obtained in [10]. Also, using the enables us to make use of Proposition 9 and Lemma 13 (shown in [38] using the ) which makes our analysis easier to carry out.
As already mentioned, the original proof of Theorem 3 requires strong principles. These principles are countable choice as used in the projection argument, sequential weak compactness and the existence of the in Lemma 4, which require arithmetical comprehension. The analysis of these principles require the use of a stronger form of recursion called bar recursion [41, 37]. As shown below, the arguments in Section 3 allow us to avoid the use of bar recursion. There are already many examples in the proof mining literature where it is possible to avoid the use of arithmetical comprehension222We would like to thank Ulrich Kohlenbach for pointing this out to the first author and for providing the appropriate references.. For example, in [21, 25] the use of the existence of a limit point for a sequence in a compact geodesic space (which requires arithmetical comprehension) is replaced by a combinatorial argument. In [26, 27], a proof of an asymptotic regularity theorem that was based on countable nested uses of sequential compactness (and hence arithmetic comprehension) is analysed resulting in a simple exponential bound, by elimination of sequential compactness. Moreover, in a series of papers, Kohlenbach showed how the monotone functional interpretation can be used to replace the use of arithmetical comprehension by optimal arithmetic substitutes (see e.g. [19, 20] and [22, Section 17.9]).
The elimination of countable choice required for the projection argument is carried out in Section 3.1. The key observation is that (2) can be replaced by (3). This is in line with earlier analyses (see for example [23, 10, 38]) and it is well-known that this allows for the extracted quantitative information to be expressed in terms of Gödel’s primitive recursive functionals.
The way to deal with sequential weak compactness is explained in full detail in [10], in the context of the . The key point is that sequential weak compactness can be replaced by countable Heine/Borel compactness. The content of Section 3.2 shows that it can be adapted to our context in a similar way.
In Section 3.3 we made the modifications necessary to bypass the assumed existence of the real number in Lemma 4. In [29], Kohlenbach and Andrei Sipoş give a rational approximation to the of a certain sequence by interpreting the approximation. As described in detail below, we must deal with a similar issue.
Let and let be a sequence of real numbers contained in the interval . The existence of the can be stated as
[TABLE]
The main point is that we can weaken this statement by switching the outermost quantifiers
[TABLE]
In fact, we will show that such in (35) is already witnessed by a rational number satisfying
[TABLE]
which implies that (35) holds with .
The idea behind (36) is the following. For each , by dividing the interval into subintervals of length , there exists such that . If we take , i.e. the middle point, then it should satisfy (35) for (and hence for ). This results in statement (36).
Lemma 16, which is shown by -induction, can be seen to imply (36) using a collection argument. First note that Lemma 16 implies
[TABLE]
By a collection argument, we conclude
[TABLE]
which by (monotone) choice axiom is equivalent to (36).
Let us elaborate on (38). Clearly it is a sufficient condition to (37). Let see that (37) implies (38). Assuming that (38) fails, we obtain that for some
[TABLE]
By instantiating , one concludes that (37) must also fail.
Of course, this collection argument is fully justified by a form of induction. The reader can compare this way of proving (36), using -induction and a collection argument, to the similar [29, Proposition 4.2], where -induction was used.
Acknowledgements
We would like to thank Laurenţiu Leuştean for suggesting us to analyse Yao and Noor’s theorem and for several discussions concerning the subject of this paper. Our paper also benefits from discussions and remarks by Fernando Ferreira and Ulrich Kohlenbach.
Both authors acknowledge the support of FCT - Fundação para a Ciência e Tecnologia under the project: UID/MAT/04561/2019 and the research center Centro de Matemática, Aplicações Fundamentais e Investigação Operacional, Universidade de Lisboa. The second author also acknowledges the support of the ‘Future Talents’ short-term scholarship at Technische Universität Darmstadt.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Jeremy Avigad and Solomon Feferman. Gödel’s functional (“Dialectica”) interpretation. In Handbook of proof theory , volume 137 of Stud. Logic Found. Math. , pages 337–405. North-Holland, Amsterdam, 1998.
- 2[2] Heinz Bauschke and Patrick Combettes. Convex analysis and monotone operator theory in Hilbert spaces . CMS Books in Mathematics/Ouvrages de Mathématiques de la SMC. Springer, Cham, second edition, 2017. With a foreword by Hédy Attouch.
- 3[3] Marc Bezem. Strongly majorizable functionals of finite type: A model for barrecursion containing discontinuous functionals. J. Symb. Log. , 50(3):652–660, 1985.
- 4[4] Felix Browder. Nonexpansive nonlinear operators in a Banach space. Proc. Nat. Acad. Sci. U.S.A. , 54:1041–1044, 1965.
- 5[5] L.-C. Ceng, Q. H. Ansari, and J.-C. Yao. Hybrid proximal-type and hybrid shrinking projection algorithms for equilibrium problems, maximal monotone operators, and relatively nonexpansive mappings. Numerical Functional Analysis and Optimization , 31(7):763–797, 2010.
- 6[6] Patrick L. Combettes. Fejér monotonicity in convex optimization. In Christodoulos A. Floudas and Panos M. Pardalos, editors, Encyclopedia of Optimization , pages 1016–1024. Springer US, Boston, MA, 2009.
- 7[7] Jelena Diakonikolas. Halpern iteration for near-optimal and parameter-free monotone inclusion and strong solutions to variational inequalities. ar Xiv:2002.08872 v 2.
- 8[8] Jonathan Eckstein and Dimitri Bertsekas. On the Douglas-Rachford splitting method and the proximal point algorithm for maximal monotone operators. Math. Programming , 55(3, Ser. A):293–318, 1992.
