A proximal point algorithm revisited and extended
Gheorghe Morosanu

TL;DR
This paper revisits and extends a proximal point algorithm for finding zeros of maximal monotone operators, providing convergence analysis, practical implementation guidance, and simulation results to demonstrate its effectiveness.
Contribution
It introduces a more general proximal point algorithm with convergence guarantees and practical considerations, extending previous methods in the field.
Findings
The algorithm converges strongly under specified conditions.
It can approximate minimizers of convex functionals.
Simulations confirm practical applicability.
Abstract
This Note is inspired by the recent paper by Djafary Rouhani and Moradi [J. Optim. Theory Appl. 172 (2017) 222-235], where a proximal point algorithm proposed by Boikanyo and Moro\c{s}anu [Optim. Lett. 7 (2013) 415-420] is discussed. We start with a brief history of the subject and then propose and analyse the following more general algorithm for approximating the zeroes of a maximal monotone operator in real Hilbert space where is a given starting point, is a given sequence in , , and is the error sequence satisfying . Besides the main result on the strong convergence of , we discuss some particular cases, including the approximation of minimizers of convex functionals, explain how to use our…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11institutetext: Author’s Address:
Central European University
Department of Mathematics and its Applications
Nador u. 9
1051 Budapest, Hungary
A proximal point algorithm revisited and extended
Gheorghe Moroşanu
(Received: date / Accepted: date)
Abstract
This Note is inspired by the recent paper by Djafary Rouhani and Moradi [J. Optim. Theory Appl. 172 (2017) 222-235], where a proximal point algorithm proposed by Boikanyo and Moroşanu [Optim. Lett. 7 (2013) 415-420] is discussed. We start with a brief history of the subject and then propose and analyse the following more general algorithm for approximating the zeroes of a maximal monotone operator in real Hilbert space
[TABLE]
where is a given starting point, is a given sequence in , , and is the error sequence satisfying . Besides the main result on the strong convergence of , we discuss some particular cases, including the approximation of minimizers of convex functionals, explain how to use our algorithm in practice, and present some simulations to illustrate the applicability of our algorithm.
Keywords:
Maximal monotone operator Proximal point algorithm Convex function Strong convergence
MSC:
47J25 47H05 90C25 90C90
1 Introduction
Let be a real Hilbert space with scalar product and norm . An operator (possibly set-valued) is said to be monotone if its graph is a monotone subset of , i.e.,
[TABLE]
If in addition is not properly contained in the graph of any other monotone operator in , then is called maximal monotone. It is well-known that a monotone operator is maximal monotone if and only if the range of is all of for all (equivalently for some ). In this case the so-called resolvent operator is everywhere defined, single-valued and nonexpansive (i.e., Lipschitz with constant ). If is a proper (i.e., not identically ), convex, lower semicontiunous function then the subdifferential operator defined by
[TABLE]
is maximal monotone. For more information on monotone operators and convex functions see brezis and morosanu .
We are interested in solving the problem
[TABLE]
Denote by the solution set of (1), i.e., . One of the most important iterative methods for finding approximate solutions of (1) is the proximal point algorithm (PPA) which was introduced by Martinet martinet for a particular case of and then extended by Rockafellar rockafellar to a general maximal monotone operator. For each the PPA generates the sequence as follows
[TABLE]
where for all and is the sequence of computational errors. Unfortunately (under the suitable conditions , ) converges in general only weakly (to points of F), even in the particular case when is a subdifferential operator (see guler ). Subsequently much work has been dedicated towards modifying the PPA to obtain algorithms that generate strongly convergent sequences. Recall that, inspired by Lehdili and Moudafi’s prox-Tikhonov method (see lehmud ), Xu xu considered the following iterative scheme
[TABLE]
where , , , and proved that, under some additional conditions, converges strongly to , the metric projection of onto (which was assumed to be nonempty). The best convergence result on (3) has been reported later by Wang and Cui wangcui . Specifically, they proved that generated by (3) converges strongly to under the following conditions: , , , , and either or . In fact, under these conditions, (3) is equivalent with
[TABLE]
In bm a strong convergence result for generated by (4) was reported in the case of the alternative framework: , , , , and bounded. The same framework is reconsidered in a recent paper by Djafari Rouhani and Moradi rouhanimoradi . They use the condition (instead of the boundedness of ). In fact this condition is also easily visible from our approach in bm .
The main observation leading to this Note is that: while the convex combination in (4) is relevant when , it is not the case if . Indeed, we can consider the following more general algorithm
[TABLE]
where
is a maximal monotone operator with ; , for all , , ; ; .
Our main result (Theorem 2.1) states that under hypotheses , for every , the sequence generated by (5) converges strongly to . By chosing and with , we reobtain Theorem 1 in bm and Theorem 3.2 in rouhanimoradi . In addition if for all (or for all ) then (5) defines just a simple sequence (not an iterative method since is no longer dependent on ) which approximates and in this case Theorem 3.4 in rouhanimoradi is reobtained as a simple particular case (with ).
2 Main Result
Since we want to show that the sequences generated by (5) are convergent, we begin this section with a preliminary result stating that a necessary condition is .
Lemma 1
Assume that is a maximal monotone operator, , is a bounded sequence, for some , and is bounded. Then the sequence generated by (5) is bounded for all (equivalently, for some ) if and only if .
Proof Assume that for some the sequence generated by (5) is bounded. We have
[TABLE]
Therefore taking the limit in the obvious inequality
[TABLE]
we infer
[TABLE]
where is a weak cluster point of . By the maximality of we obtain
[TABLE]
Conversely, assume . Let and be arbitrary but fixed points. Since the resolvent operator is nonexpansive we have
[TABLE]
Therefore
[TABLE]
where is a positive constant. From (6) we derive by induction
[TABLE]
which shows that is bounded. ∎
Before stating our main theorem let us recall the following result which was proved independently by Bruck bruck and Moroşanu moro .
Lemma 2
Let be a maximal monotone operator with nonempty. Then for every , as , where denotes the metric projection of onto .
Now let us state the main result of this Note.
Theorem 2.1
Assume (see the previous section). Then for all the sequence generated by algorithm (5) converges strongly to (the metric projection of onto ).
Proof Let be an arbitrary but fixed point. By Theorem 1 the corresponding sequence generated by (5) is bounded (since there exists a natural number such that for so Lemma 1 is applicable with ). Thus we have
[TABLE]
So by and Lemma 2 we conclude that as . ∎
3 Concluding Comments
-
If where is a proper, convex, lower semicontinuous function the algorithm (5) serves as a method for appoximating minimizers of (assuming that the set of minimizers of is nonempty), since in this case if and only if is a minimizer of .
-
The error sequence is allowed to be bounded as usual in numerical analysis, or even unbounded with .
-
Theorem 2.1 is a generalization of both Theorem 3.2 in rouhanimoradi and Theorem 1 in bm .
If for all (or for all ) then (5) defines just a simple sequence, not a real iterative method, since does not depend on . In this case we have
[TABLE]
or for all . In fact, according to Lemma 2, we have
[TABLE]
Note that the second algorithm introduced and studied in (rouhanimoradi, , p. 228) is in fact a sequence of the form (7) with , .
- Let us explain how the algorithn (5) works when performing simulations. Assume conditions are fulfilled. In addition, for the sake of simplicity, is assumed to be single-valued. For a given we compute by solving for the equation
[TABLE]
and get instead of the exact solution . We do not have any error for (i.e., ) but we have a computational error for . Next we solve for the equation
[TABLE]
and get instead of the exact solution
[TABLE]
and so on. Thus using the computer we obtain satisfying
[TABLE]
where . If for all , then for all , i.e., approximates for large enough.
4 Simulations
Intuitively, in order to achieve fast convergence of the sequence generated by algorithm (5) to we need to choose a point as close as possible to and sequences and that converge fastly to and to [math], respectively.
etc., etc., … to be continued.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Brezis, H.: Opérateurs Maximaux Monotones et Semi-Groupes de Contractions dans les Espaces de Hilbert, North Holland Math. Studies, Vol. 5. North Holland, Amsterdam (1973)
- 2(2) Moroşanu, G.: Nonlinear Evolution Equations and Applications. Reidel, Dordrecht (1988)
- 3(3) Martinet, B.: Régularisation d’inéquations variationnelles par approximations succesives. Rev. Française Informat. Recherche Opérationnelle 4 (Ser. R-3), 154-158 (1970)
- 4(4) Rockafellar, R.T.: Monotone operators and the proximal point algorithm. SIAM J. Control Optim. 14 , 877-898 (1976)
- 5(5) Güler, O.: On the convergence of the proximal point algorithm for convex minimization. SIAM J. Control Optim. 29 , 403-419 (1991)
- 6(6) Lehdili, N., Moudafi, A.: Combining the proximal algorithm and Tikhonov regularization. Optimization 37 , 239-252 (1996)
- 7(7) Xu, H.K.: A regularization method for the proximal point algorithm. J. Global Optim. 36 , 115-125 (2006)
- 8(8) Wang, F., Cui, H.: On the contraction-proximal point algorithms with multi-parameters. J. Global Optim. 54 , 485-491 (2012)
