Golden ratio algorithms with new stepsize rules for variational inequalities
Dang Van Hieu, Yeol Je Cho, and Yi-bin Xiao

TL;DR
This paper introduces two novel golden ratio algorithms with adaptive stepsize rules for solving variational inequalities, eliminating the need for prior knowledge of the Lipschitz constant and demonstrating effective convergence and performance.
Contribution
The paper presents two new golden ratio algorithms with adaptive stepsize rules for variational inequalities, improving upon existing methods by removing the need for Lipschitz constant knowledge.
Findings
Algorithms converge under standard conditions.
The second algorithm's stepsizes stay separated from zero.
Numerical results show competitive performance.
Abstract
In this paper, we introduce two golden ratio algorithms with new stepsize rules for solving pseudomonotone and Lipschitz variational inequalities in finite dimensional Hilbert spaces. The presented stepsize rules allow the resulting algorithms to work without the prior knowledge of the Lipschitz constant of operator. The first algorithm uses a sequence of stepsizes which is previously chosen, diminishing and non-summable. While the stepsizes in the second one are updated at each iteration and by a simple computation. A special point is that the sequence of stepsizes generated by the second algorithm is separated from zero. The convergence as well as the convergence rate of the proposed algorithms are established under some standard conditions. Also, we give several numerical results to show the behavior of the algorithms in comparisons with other algorithms.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Optimization and Variational Analysis · Mathematical Inequalities and Applications
Golden ratio algorithms with new stepsize rules for variational inequalities
\nameDang Van Hieua, Yeol Je Chob,c and Yi-bin Xiaoc Dang Van Hieu. Email: [email protected] aApplied Analysis Research Group, Faculty of Mathematics and Statistics, Ton Duc Thang University, Ho Chi Minh City, Vietnam;
bDepartment of Mathematics Education, Gyeongsang National University, Jinju 52828, Korea;
cSchool of Mathematical Sciences, University of Electronic Science and Technology of China, Chengdu, Sichuan, 611731, P.R. China
Dedicated to Professor Pham Ky Anh on the Occasion of his 70th Birthday
Abstract
In this paper, we introduce two golden ratio algorithms with new stepsize rules for solving pseudomonotone and Lipschitz variational inequalities in finite dimensional Hilbert spaces. The presented stepsize rules allow the resulting algorithms to work without the prior knowledge of the Lipschitz constant of operator. The first algorithm uses a sequence of stepsizes which is previously chosen, diminishing and non-summable. While the stepsizes in the second one are updated at each iteration and by a simple computation. A special point is that the sequence of stepsizes generated by the second algorithm is separated from zero. The convergence as well as the convergence rate of the proposed algorithms are established under some standard conditions. Also, we give several numerical results to show the behavior of the algorithms in comparisons with other algorithms.
keywords:
Variational inequality; Pseudomonotone operator; Lipschitz continuity; Projection method.
1 Introduction
In this paper, we focus on the following variational inequality problem (shortly, (VIP)):
[TABLE]
where is a proper convex lower semi-continuous function with the domain and is an operator. The function here cannot be smooth. Recall that the proximal operator of is defined by
[TABLE]
We are also interested here in the problem (VIP) where the proximal mapping of is computable. The problem (VIP) is known as a central problem in nonlinear analysis, especially, in optimization, control theory, games theory [2, 3, 4, 1, 5, 6, 7, 8] and other fields [14, 13, 12, 11, 9, 10, 15, 16, 17]. Considering the problem (VIP) in a special case, when , the indicator operator of a nonempty closed convex set in , this problem reduces to the classical variational inequality problem [5, 18]:
[TABLE]
Moreover, the motivations of studying the problem (VIP) come from optimization point of view. Several models arising naturally can be formulated as the problem (VIP), see, e.g, in [3, 5, 6]. We restrict our interest in the following two problems. The first basic problem is a convex-concave saddle point problem:
[TABLE]
where , , , are proper convex lower semi-continuous functions, is a smooth convex-concave function. The saddle point problem (1) can be considered the problem (VIP) with , and . This is a type of example for nonsmooth problem, where the gradient method (one step) [19] cannot work. The early proposed iterative methods for this problem may be the extragradient methods (two steps) [20, 21]. Recently, many works on the problem (VIP) and related problems have been devoted to proposing different projection-like methods under various types of conditions, such as the subgradient extragradient methods [22, 23, 24, 25, 26], the modified extragradient method [27], the projected reflected gradient method [28, 29] and others [30, 31, 32, 33, 34, 35, 36, 37, 38, 39].
Another model arising in the signal processing literature is a nonsmooth convex optimization model,
[TABLE]
where and with possibly nonsmooth. This model is equivalent to the problem (VIP) with . Applying generic variational methods to solve special models as the optimization problem (2) is not a good choice. Optimization methods in general have better theoretical convergence rates because they can exploit the characteristics of the potential operator . However, this seems only true when is Lipschitz continuous. Without such a condition, the proximal gradient methods cannot hold anymore. Recently, in the case with non-Lipschitz , some notable optimization methods can be found in, for example, in [40, 41, 42]. Especially, the so-called NoLips algorithm developed in [40] is an interesting and promising method which can use a fixed stepsize when is non-Lipschitz continuous. However, the obtained results are not generic because they depend strictly on problem instances and used Bregman distances.
Methods for solving the problem (VIP) without the Lipschitz continuity of often use a linesearch proceduce which runs in each iteration of the algorithm until a stopping criterion is satisfied. Linesearch methods are thus time-consuming because they require many computations of values of as well as projections onto feasible set. Besides, the estimates of complexity in linesearch methods become not so informative. This is clear because they only show how many outer iterations needed to be done to obtain the desired accuracy while the number of inner linesearch iterations cannot be mentioned. Moreover, in the case is even Lipschitz continuous, but in general the Lipschitz constant is often unknown and in nonlinear problems, it can be difficult to approximate. Recently, without any linesearch procedure, some interesting methods for solving the classical variational inequalities can be found, for instance, in [43, 6, 44] where stepsizes are updated over iteration by some cheap computations.
Especially, an interesting idea has been developed recently by Malitsky in [6]. He has proposed a new algorithm, named the explicit golden ratio algorithm (shortly, EGRAAL), for solving problem (VIP) when is locally Lipschitz continuous. The EGRAAL has a simple and elegant structure and only requires the computations of one proximal mapping value (with function ) and one value of . His algorithm is explicit in the sense that it does not require any linesearch procedure. Its stepsizes are computed explicitly at each iteration from the previous iterates. The theoretical and numerical results in [6] are promising and also suggest some directions for studying in the future (see, [6, Sects. 5 and 6]).
In this paper, motivated by the results in [43, 6, 44], we propose two different golden ratio algorithms with new stepsize rules for solving pseudomonotone problem (VIP) with a Lipschitz condition. The stepsize strategies here are simpler than those ones in [6]. The variable stepsizes in the new algorithms are chosen previously or computed easily. Also, the resulting algorithms work without any information of Lipschitz constant of operator, i.e., the Lipschitz constant must not be the input parameters of the algorithms. More precisely, we first consider a golden ratio algorithm with a priorly taken sequence of stepsizes being diminishing and non-summable. Following to this strategy, the algorithm works well for problem (VIP) with the strong pseudomonotonicity of . For a weaker assumption of pseudomonotonicity of , we present the second golden ratio algorithm incorporated with variable stepsizes updating step-by-step. The convergence as well as the convergence rate of the proposed algorithms are established. Finally, the theoretical results are confirmed by several our numerical experiments in comparisons with other known algorithms.
The remainder of this paper is organized as follows: In Section 2, we introduce a golden ratio algorithm with a sequence of stepsizes priorly chosen. Section 3 deals with another golden ratio algorithm with a simpler stepsize rule. Finally, in Section 4, we study the numerical behaviour of the new algorithms on two test problems and compare them with others.
2 A golden ratio algorithm with diminishing stepsizes
In this section, we present a golden ratio algorithm with a sequence of stepsizes being diminishing and non-summable. The use of this new stepsize rule allows the algorithm to work without previously knowing the Lipschitz constant of the operator. We set , which is called the golden ratio. Moreover, we denote the solution set of the problem (VIP) and it is assumed to be nonempty. The following is the algorithm in details:
Algorithm 2.1* (Golden Ratio Algorithm with diminishing stepsizes).*
.
Initialization: Choose and a non-increasing sequence of stepsizes such that the following conditions hold:
[TABLE]
Iterative Steps: Assume that are known, calculate as follows:
[TABLE]
An example for the sequence satisfying the conditions (H1)-(H3) is with . In order to establish the convergence of Algorithm 2.1, we assume that the operator satisfies the following conditions:
(SP) is strongly pseudomonotone, i.e., there exists such that
[TABLE]
(LC) is Lipschitz continuous, i.e., there exists such that
[TABLE]
However, it is not necessary to know the two constants and . The unique solution of the problem (VIP) is denoted by .
We need the following two lemmas to prove the convergence of Algorithm 2.1:
Lemma 2.2**.**
[45, Proposition 12.26]* We have*
[TABLE]
Lemma 2.3**.**
[45, Corollary 2.14]* We have*
[TABLE]
Now, we have the following first main result:
Theorem 2.4**.**
Under the hypotheses (SP) and (LC), the sequence generated by Algorithm 2.1 converges to the unique solution of the problem (VIP).
Proof.
It follows from the definition of and Lemma 2.2 that
[TABLE]
which, with , implies that
[TABLE]
Thus we have
[TABLE]
This together with the equality and the hypothesis (SP) imply that
[TABLE]
Now, using the relation (3) with , we obtain
[TABLE]
Substituting into the inequality (5), we get
[TABLE]
Multiplying both sides of (6) by and noting that , we get
[TABLE]
Thus, using the identity , we have the following estimate:
[TABLE]
Adding both sides of the relations (4) and (8), we obtain
[TABLE]
Using the Lipschitz continuity of , we derive
[TABLE]
Since is a solution of the problem (VIP), we have
[TABLE]
Combining the relations (9)-(11), we get
[TABLE]
Moreover, from the definition of and Lemma 2.3, it follows that
[TABLE]
Combining the relations (LABEL:h6c) and (13), we obtain
[TABLE]
Since the sequence is non-increasing, we obtain
[TABLE]
Since , there exists such that
[TABLE]
Thus, from , it follows that
[TABLE]
which follows that
[TABLE]
From the relations (LABEL:h8), (15) and (16), it follows that, for all ,
[TABLE]
or
[TABLE]
where
[TABLE]
[TABLE]
Thus the sequence is non-increasing. It is obvious that is bounded from below by [math]. Thus the limit exists and . This implies that is bounded. Thus, from the definition of , we also see that the sequence is bounded. This together with the fact implies that as . Hence, from the definition of , it follows that
[TABLE]
Also, from the relation (17), we obtain . Thus, from the definition of , we obtain
[TABLE]
It follows from the conditions (H3) and (S1) that as . Thus, from the relation (18), we get
[TABLE]
On the other hand, from (H2) and (S2), we obtain that . This together with the relation (19) implies that or the sequence converges to , which solves uniquely the problem (VIP). This completes the proof. ∎
Remark 1*.*
It follows from (S2) that . Thus, if we choose (with ), we obtain an estimate of the convergence rate of the sequence generated by Algorithm 2.1 that
[TABLE]
3 Golden Ratio Algorithm without diminishing stepsizes
In this section, we introduce a simple stepsize rule where the stepsizes will be updated over each iteration and only uses the information on the data, the previous approximations without the prior knowledge of Lipschitz constant. Unlike the previous section, the stepsizes generated by the next algorithm are bounded from below by a positive constant. Another stepsize rule for a golden ratio algorithm can be found in [6]. For the sake of simplicity, we adopt the convention .
Now, we describe the algorithm in details as follows:
Algorithm 3.1* (Golden Ratio Algorithm without diminishing stepsizes).*
.
Initialization: Choose , ,
Iterative Steps: Assume that are known, calculate as follows:
[TABLE]
where
[TABLE]
Remark 2*.*
It follows from the definition of that this sequence is non-increasing. Moreover, from the Lipschitz continuity of and in the case , we see that
[TABLE]
Thus we obtain by the induction that and so there exists such that
[TABLE]
3.1 The convergence of Algorithm 3.1
In this subsection, we study the convergence of Algorithm 3.1. We weaken the assumptions imposed on the cost operator where it only need to satisfy the above condition (LC) and the following pseudomonotone condition (PC):
(PC) is pseudomonotone, i.e., the following implication holds:
[TABLE]
We have the following second result:
Theorem 3.2**.**
Under the conditions (LC) and (PC), the sequence generated by Algorithm 3.1 converges to a solution of the problem (VIP).
Proof.
By arguing similarly to the relation (9) with being pseudomonotone and , we obtain
[TABLE]
where with noting in (9) that . From the definition of , we see that
[TABLE]
We have the following fact (see, the relation (13)):
[TABLE]
Combining the relations (21) - (23), we get
[TABLE]
Since is non-increasing, we get
[TABLE]
Since , it follows from the relation (20) that
[TABLE]
Therefore, there exists such that
[TABLE]
From the relations (24)-(26), we derive
[TABLE]
or
[TABLE]
where
[TABLE]
[TABLE]
Thus the limit of exists and . Hence the sequences and are bounded. Morever, we also see that , which, together with the relation (20), implies that
[TABLE]
Thus, since , we obtain
[TABLE]
It is obvious that as . This together with (28) implies that
[TABLE]
From the definition of and Lemma 2.2, we see that
[TABLE]
Now, assume that is a cluster point of , i.e., there exists a subsequence of converging to . Passing to the limit in (31) when and using (20) and (29), we obtain
[TABLE]
which says that is a solution of the problem (VIP).
Now, in order to finish the proof, we prove that the whole sequence converges to . Indeed, assume that is another subsequence of converges to . Note that, as mentioned above, is also a solution of the problem (VIP). Since and the relation (30), it follows that and thus for each . We have the following equality:
[TABLE]
Thus, since and , we obtain that . Set
[TABLE]
Now, passing to the limit in (33) as , we obtain
[TABLE]
Thus or . This completes the proof. ∎
3.2 The convergence rate of Algorithm 3.1
This subsection deals with the convergence rate of Algorithm 3.1. In order to get the rate of convergence, we choose in Algorithm 3.1 such that with and assume that the operator satisfies the aforementioned conditions (SP) and (LC).
Finally, we study the convergence rate of Algorithm 3.1.
Theorem 3.3**.**
Under the conditions (SP) and (LC), the sequence generated by Algorithm 3.1 converges at least linearly to the unique solution of the problem (VIP).
Proof.
By arguing similarly to the relation (LABEL:h8), we have
[TABLE]
Thus we have
[TABLE]
Note that, from the definition of , we obtain
[TABLE]
Thus it follows from Lemma 2.3 that
[TABLE]
From the relations (35) and (36), we see that
[TABLE]
in which the last inequality follows from the following:
[TABLE]
[TABLE]
Now, if we set , then the inequality (37) can be rewritten as follows:
[TABLE]
Let be fixed. From the relation (20) and the fact , we have
[TABLE]
Thus there exists such that
[TABLE]
Combining the relations (38) and (39) and noting that , we obtain
[TABLE]
Set and . Then we can rewrite the relation (40) as follows:
[TABLE]
On the other hand, since is strongly monotone, we get
[TABLE]
Thus we have or . Let and . Now, we can rewrite the relation (41) as follows:
[TABLE]
Choose and such that and . Thus we have
[TABLE]
Consider the function
[TABLE]
Then we have
[TABLE]
because of . Thus is non-increasing on . Hence we have . Now, set and note . Then it follows from the relation (42) that
[TABLE]
Thus, by induction, we obtain
[TABLE]
and so we can reduce that
[TABLE]
or
[TABLE]
where . This completes the proof. ∎
4 Numerical experiments
In this section, we perform several experiments to show the numerical behaviour of the proposed algorithm (Agorithm 3.1) in comparison with other algorithms. All the programs are written in Matlab 7.0 and computed on a PC Desktop Intel(R) Core(TM) i5-3210M CPU @ 2.50GHz, RAM 2.00 GB.
Example 4.1 In this example, our problem of interest is a sparse logistic regression which is a popular problem in machine learning applications:
[TABLE]
where , , , .
Let be a matrix of size defined by and . Then, the objective function in our problem is with and . This problem is equivalent the considered problem with . We compare Algorithm 3.1 (GRADS) with EGRAAL in [6] and FISTA with constant stepsize in [1]. We do not include the algorithms with linesearch procedures because they require many computations over each iteration which is time-consuming. Note that the algorithm FISTA requires the Lipschitz constant of () while other algorithms are not. All entries of and are generated randomly and we choose . We choose , for Algorithm 3.1 (GRADS); , for the algorithm ERGAAL. The starting points are generated randomly in . The mapping prox is computed by the function fmincon in Matlab. The results are shown on Figure 2 and Figure 2. The execution times for the algorithms are almost equivalent. In these figures, is the most minimum value of generated by all the algorithms with the stopping criterion .
Example 4.2 Consider the nonlinear problem, which is presented by Sun [46], for the operator of the form:
[TABLE]
where and is a square matrix of order , which is given by
[TABLE]
The feasible set is . This problem is equivalent to the considered problem with . In this case, the mapping prox is the projection on the set and it is computed by the function quadprog in Matlab 7.0. Since the Lipschitz constant of is unknown, we do not include the comparison with the algorithm FISTA. We use the sequence for each to compare the computational performance of the algorithms. Figure 4 and Figure 4 describe the results in this example.
The numerical results here have illustrated that the proposed algorithm works well and also has competitive advantage over other algorithms.
5 Conclusions
In this paper, we have introduced the two golden ratio algorithms with two simple stepsize rules for solving pseudomonotone and Lipschitz variational inequalities in finite dimensional Hilbert spaces. The first algorithm uses a sequence of stepsizes taken priorly with some suitable properties while the second one itself generates variable stepsizes which are explicitly computed in each iteration and without a linesearch procedure to be run. We have established the convergence as well as the convergence rate of the new algorithms under appropriate conditions. The theoretical results have been illustrated by some our numerical experiments.
Our results can be extended to many promising directions, such as multi-value variational inequalities [47], equilibrium problems [49, 48, 50, 51], problem (VIP) incorporated with fixed point problems, systems of variational inequalities and mixed equilibrium problems [30, 26, 32, 38, 39], weak and strong convergence in Hilbert spaces as well as extensions to Banach spaces [33, 52]. This is surely our future goals.
Acknowledgement
The authors would like to thank the Associate Editor and the two anonymous referees for their valuable comments and suggestions which helped us very much in improving the original version of this paper. The research of the first author was supported by the National Foundation for Science and Technology Development (NAFOS-TED) of Vietnam under grant number 101.01-2017.315. The research work was also supported by the National Natural Science Foundation of China (11771067) and the Applied Basic Project of Sichuan Province (19YYJC0157). We also would like to thank Dr. Yura Malitsky for sending us the paper [6].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Beck A, Teboulle M. A fast iterative shrinkage-thresholding algorithm for linear inverse problem. SIAM Journal on Imaging Sciences 2009; 2 :183-202.
- 2[2] Daniele P, Giannessi F, Maugeri A. Equilibrium Problems and Variational Models . Springer: Kluwer, 2003.
- 3[3] Facchinei F, Pang JS. Finite-Dimensional Variational Inequalities and Complementarity Problems . Springer: Berlin, 2002.
- 4[4] Giannessi F, Maugeri A, Pardalos PM. Equilibrium Problems: Nonsmooth Optimization and Variational Inequality Models . Springer: Dordrecht, 2004.
- 5[5] Kinderlehrer D, Stampacchia G. An Introduction to Variational Inequalities and Their Applications . Academic Press: New York, 1980.
- 6[6] Malitsky YV. Golden ratio algorithms for variational inequalities. 2018. https://arxiv.org /abs/1803.08832.
- 7[7] Konnov IV. Combined Relaxation Methods for Variational Inequalities . Springer: Berlin, 2000.
- 8[8] Konnov IV. Equilibrium Models and Variational Inequalities . Elsevier: Amsterdam, 2007.
