An Inexact Newton-like conditional gradient method for constrained nonlinear systems
M.L.N. Goncalves, F.R. Oliveira

TL;DR
This paper introduces an inexact Newton-like conditional gradient method for solving constrained nonlinear systems, establishing local convergence and rate under general conditions, with applications to Holder-like and Smale functions, supported by numerical experiments.
Contribution
It presents a novel inexact Newton-like conditional gradient method with convergence analysis for constrained nonlinear systems, applicable to a broad class of functions including analytic ones.
Findings
Method converges locally under general conditions.
Applicable to functions with Holder-like derivatives and Smale conditions.
Numerical experiments demonstrate effectiveness on medium and large problems.
Abstract
In this paper, we propose an inexact Newton-like conditional gradient method for solving constrained systems of nonlinear equations. The local convergence of the new method as well as results on its rate are established by using a general majorant condition. Two applications of such condition are provided: one is for functions whose the derivative satisfies Holder-like condition and the other is for functions that satisfies a Smale condition, which includes a substantial class of analytic functions. Some preliminaries numerical experiments illustrating the applicability of the proposed method for medium and large problems are also presented.
| Problem | Name and souce | n | Box |
|---|---|---|---|
| Pb 1 | Chandrasekhar’s H-equation [21] | 400 | |
| Pb 2 | Discrete boundary value function [32, Problem 28] | 500 | [-100,100] |
| Pb 3 | Troesch [27, Problem 4.21] | 500 | [-1,1] |
| Pb 4 | Discrete integral [32, Problem 29] | 1000 | [-10,10] |
| Pb 5 | Trigexp 1 [27, Problem 4.4] | 1000 | [-100,100] |
| Pb 6 | Problem 74 [28] | 1000 | [0,10] |
| Pb 7 | Problem 77 [28] | 2000 | [0,10] |
| Pb 8 | Function 15 [7] | 2000 | [-10,0] |
| Pb 9 | Tridiagonal exponential [27, Problem 4.18] | 2000 | [, ] |
| Pb 10 | Trigonometric function [7, Problem 8] | 2000 | [5,15] |
| Pb 11 | Zero Jacobian function [7, Problem 19] | 2000 | [0,10] |
| Pb 12 | Trigonometric system [27, Problem 4.3] | 5000 | [, ] |
| Pb 13 | Five diagonal [27, Problem 4.8] | 5000 | [1,] |
| Pb 14 | Seven diagonal [27, Problem 4.9] | 5000 | [0,] |
| Pb 15 | Countercurrent reactors [27, Problem 4.1] | 10000 | [-1,10] |
| FD-INL-CondG | BSU-INL-CondG | BPU-INL-CondG | CoDoSol | ||||||
|---|---|---|---|---|---|---|---|---|---|
| Problem | Iter | Iter | Iter | Iter | |||||
| Pb 1 | 1 | 5 | - | 5 | - | 5 | - | 7 | - |
| 2 | 6 | - | 7 | - | 7 | - | 7 | - | |
| 3 | 5 | - | 7 | - | 7 | - | * | ||
| Pb 2 | 1 | 9 | - | 12 | - | 12 | - | 14 | - |
| 2 | 1 | - | 1 | - | 1 | - | 2 | - | |
| 3 | 9 | - | 12 | - | 12 | - | 14 | - | |
| Pb 3 | 1 | 6 | - | 13 | - | 8 | - | 9 | - |
| 2 | 7 | - | 11 | - | 9 | - | 6 | - | |
| 3 | 6 | - | 9 | - | 8 | - | 7 | - | |
| Pb 4 | 1 | 5 | - | 7 | - | 7 | - | 9 | - |
| 2 | 3 | - | 3 | - | 3 | - | 3 | - | |
| 3 | 6 | - | 7 | - | * | 9 | - | ||
| Pb 5 | 1 | 20 | - | 169 | - | 28 | - | 21 | - |
| 2 | 9 | - | 13 | - | 12 | - | 10 | - | |
| 3 | 13 | - | 19 | - | 17 | - | 23 | - | |
| Pb 6 | 1 | 5 | - | 7 | - | 7 | - | 7 | - |
| 2 | 13 | - | 42 | - | 69 | - | 9 | - | |
| 3 | 9 | - | 25 | - | 12 | - | 12 | - | |
| Pb 7 | 1 | 6 | - | 7 | - | 7 | - | 9 | - |
| 2 | 8 | - | 10 | - | 10 | - | 12 | - | |
| 3 | 9 | - | 11 | - | 12 | - | 13 | - | |
| Pb 8 | 1 | 7 | - | 11 | - | 9 | - | 13 | - |
| 2 | 6 | - | 10 | - | 8 | - | 12 | - | |
| 3 | 6 | - | 8 | - | 7 | - | 11 | - | |
| Pb 9 | 1 | 2 | - | 2 | - | 2 | - | 8 | - |
| 2 | 2 | - | 2 | - | 2 | - | 7 | - | |
| 3 | 2 | - | 2 | - | 2 | - | 7 | - | |
| Pb 10 | 1 | 7 | - | 9 | - | 16 | - | 10 | - |
| 2 | 3 | - | 4 | - | 4 | - | 4 | - | |
| 3 | 10 | - | 14 | - | 73 | - | 12 | - | |
| Pb 11 | 1 | 17 | - | 22 | - | 27 | - | 22 | - |
| 2 | 18 | - | 24 | - | 28 | - | 23 | - | |
| 3 | 19 | - | 25 | - | 30 | - | 24 | - | |
| Pb 12 | 1 | * | * | * | 18 | - | |||
| 2 | * | * | * | 17 | - | ||||
| 3 | * | * | * | 16 | - | ||||
| Pb 13 | 1 | 8 | + | 7 | + | 5 | + | 17 | - |
| 2 | 12 | + | 22 | + | * | * | |||
| 3 | 16 | + | * | 7 | + | * | |||
| Pb 14 | 0 | 4 | - | 5 | - | 5 | - | 4 | - |
| 1 | 12 | - | 17 | - | 17 | - | 17 | - | |
| 2 | 20 | - | 28 | - | * | 25 | - | ||
| Pb 15 | 1 | 11 | - | * | 21 | - | 17 | - | |
| 2 | 12 | - | 19 | - | * | 19 | - | ||
| 3 | 13 | - | 20 | - | * | 20 | - | ||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Optimization Algorithms Research · Iterative Methods for Nonlinear Equations · Advanced Image Processing Techniques
An Inexact Newton-like conditional gradient method for constrained nonlinear systems
M.L.N. Gonçalves Instituto de Matemática e Estatística, Universidade Federal de Goiás, Campus II- Caixa Postal 131, CEP 74001-970, Goiânia-GO, Brazil. (E-mails: [email protected] and [email protected]). The work of these authors was supported in part by CAPES, CNPq Grants 406250/2013-8, 444134/2014-0 and 309370/2014-0.
F.R. Oliveira 11footnotemark: 1
Abstract
In this paper, we propose an inexact Newton-like conditional gradient method for solving constrained systems of nonlinear equations. The local convergence of the new method as well as results on its rate are established by using a general majorant condition. Two applications of such condition are provided: one is for functions whose the derivative satisfies Hölder-like condition and the other is for functions that satisfies a Smale condition, which includes a substantial class of analytic functions. Some preliminaries numerical experiments illustrating the applicability of the proposed method for medium and large problems are also presented.
Keywords: constrained nonlinear systems; inexact Newton-like method; conditional gradient method; local convergence.
1 Introduction
Let be an open set, and be a continuously differentiable nonlinear function. Consider the following constrained system of nonlinear equations
[TABLE]
where is a nonempty convex compact set. Constrained nonlinear systems such as (1) appear frequently in many important areas, for instance, engineering, chemistry and economy. Due to this fact, the numerical solutions of problem (1) have been the object of intense research in the last years and, consequently, different methods have been proposed in the literature. Many of them are combinations of Newton methods for solving the unconstrained systems with some strategies taking into account the constraint set. Strategies based on projections, trust region, active set and gradient methods have been used; see, e.g., [1, 3, 9, 19, 20, 22, 23, 30, 31, 34, 38, 39, 40].
Recently, paper [15] proposed and established a local convergence analysis of a Newton conditional gradient (Newton-CondG) method for solving (1). Basically, this method consists of computing the Newton step and after applying a conditional gradient (CondG) procedure in order to get the Newton iterative back to the feasible set. It is important to point out that the CondG method, also known as the Frank-Wolfe method, is historically known as one of the earliest first methods for solving convex constrained optimization problems, see [8, 11]. The CondG method and its variants require, at each iteration, to minimizing a linear function over the constraint set, which, in general, is significantly simpler than the projection step arising in many proximal-gradient methods. Indeed, projection problems can be computationally hard for some high-dimensional problems. For instance, in large-scale semidefinite programming, each projection subproblem of the proximal-gradient methods requires to obtain the complete eigenvalue decomposition of a large matrix while each subproblem of the CondG methods requires to compute the leading singular vector of such a matrix. The latter requirement is less computationally expensive (see, for example, [18] for more details). Moreover, depending on the application, linear optimization oracles may provide solutions with specic characteristics leading to important properties such as sparsity and low-rank; see, e.g., [12, 18] for a discussion on this subject. Due to these advantages and others, the CondG method have again received much attention, see for instances [12, 16, 18, 25, 29].
It is well-know that implementations of the Newton method for medium- or large-scale problems may be expensive and difficult due to the necessity to compute all the elements of the Jacobian matrix of , as well as, the exact solution of a linear system for each iteration. For this reason, the main goal of this work is to present an extension of the Newton-CondG method in which the inexact Newton-like method is considered instead of standard Newton method. In each step of this new method, the solution of the linear system and Jacobian matrix can be computed in approximate way; see the INL-CondG method in Section 2 and comments following it. From the theoretical viewpoint, we present a local convergence analysis of the proposed method under a majorant condition. The advantage of using a general condition such as majorant condition in the analyses of Newton methods lies in the fact that it allows to study them in a unified way. Thus, two applications of majorant condition are provided: one is for functions whose the derivative satisfies Hölder-like condition and the other is for functions that satisfies a Smale condition, which includes a substantial class of analytic functions. From the applicability viewpoint, we report some preliminaries numerical experiments of the proposed method for medium and large problems and compare its performance with the constrained dogleg method [2].
This paper is organized as follows. Subsection 1.1 presents some notation and basic assumptions. Section 2 describes the inexact Newton-like conditional gradient method and presents its convergence theorem whose proof is postponed to Section 3. Two applications of the main convergence theorem are also present in Section 2. Section 4 presents some preliminary numerical experiments of the proposed method. We conclude the paper with some remarks.
1.1 Notation and basic assumptions
This subsection presents some notations and assumptions which will be used in the paper. We assume that is a continuously differentiable nonlinear function, where is an open set containing a nonempty convex compact set . The Jacobian matrix of at is denoted by . We also assume that there exists such that and is nonsingular. Let the inner product and its associated norm in be denoted by and , respectively. The open ball centered at and radius is denoted by . For a given linear operator , we also use to denote its norm, which is defined by The condition number of a continuous linear operador is denoted by cond and it is defined as .
2 The method and its local convergence analysis
In this section, we present the inexact Newton-like conditional gradient (INL-CondG) method for solving (1) as well as its local convergence theorem whose proof is postponed to Section 3. Our analysis is done by using a majorant condition, which allows to unify the convergence results for two classes of nonlinear functions, namely, one satisfying a Hölder-like condition and another one satisfying a Smale condition. The convergence results for these special cases are established in this section.
The INL-CondG method is formally described as follows.
INL-CondG method
** Step 0.**
Let and be given. Set and go to step 1.
** Step 1.**
If , then stop; otherwise, choose an invertible approximation of and compute a triple such that
(2)
** Step 2.**
Use CondG procedure to obtain as
** Step 3.**
Set , and go to step 1.
end
We now describe the subroutine CondG procedure.
CondG procedure
** P0.**
Set and .
** P1.**
Use the LO oracle to compute an optimal solution of
** P2.**
If , set and stop the procedure; otherwise, compute and as
** P3.**
Set , and go to P1.
end procedure
Remarks. 1) In our local analysis of the INL-CondG method, the invertible approximation of and the residual will satisfy classic conditions (see (5) and (6)). The inexact Newton-like method with these conditions on and was proposed in [33] and, subsequently, also studied in, for example, [6, 10]. 2) The INL-CondG method can be seen as a class of methods, depending on the choices of the invertible approximation of and residual . Indeed, by letting and , the INL-CondG method corresponds to Newton conditional gradient method which was studied in [15]. Another classical choice of would be . We also emphasize that there are some approach to built that do not involve derivatives, see, for example, [24, 26, 30]. 3) Due to existence of constraint set , the point in Step 1 may be infeasible and hence the INL-CondG method use an inexact conditional gradient method in order to obtain the new iterate in C. 4) The CondG procedure requires an oracle which is assumed to be able to minimize linear functions over the constraint set. 5) Finally, in the CondG procedure, if , then and stop. However, if , the procedure continues. In this case, the stepsize is well defined and belong to (0,1].
In the following, we state our main local convergence result for the INL-CondG method whose proof is given in Section 3.
Theorem 1**.**
Let , and Suppose that there exist a continuously differentiable function such that
[TABLE]
for all and , where
- h1.
* and ;*
- h2.
* is strictly increasing.*
Take the constants , , and such that
[TABLE]
Let the scalars , and defined as
[TABLE]
[TABLE]
[TABLE]
Let and be given in step 0 of the INL-CondG method and let also and be generated by the INL-CondG method. Assume that the invertible approximation of satisfies
[TABLE]
and the residual is such that
[TABLE]
where is a sequence of invertible matrix (preconditioners for the linear system in (2)) and is a forcing sequence. If and , then is contained in , converges to and there holds
[TABLE]
where . Additionally, given , assume that the following condition holds:
- h3.
the function (0,\,\nu)\ni t\mapsto[f(t)\big{/}f^{\prime}(t)-t]/t^{p+1} is strictly increasing.
Then, for all integer , we have
[TABLE]
Remark 1**.**
As mentioned before, the INL-CondG method as be viewed as a class of methods. Hence, the above Theorem implies, in particular, the convergence of some new methods, which are named below. We obtain, from Theorem 2, the convergence for the inexact modified Newton conditional gradient method if , the inexact Newton conditional gradient method if and (i.e., ), and the Newton-like conditional gradient method if (in this case and ). We also mention that when , and (i.e., , and ), Theorem 2 is similar to Theorem 6 in [15].
Remark 2**.**
It is worth pointing out that when , then (7) implies that the sequence converge linearly to . Additionally, if , and , then converge superlinear to . On the other hand, if is convex, i.e., h3 holds with , it follows from (8), the first inequality in (7), definition (4) and the fact that that converge linearly to . Additionally, if , , it follows from (8), that converge quadratically to .
We now specialize Theorem 1 for two important classes of functions. In the first one, satisfies a Hölder-like condition [13, 14, 17], and in the second one, is an analytic function satisfying a Smale condition [36, 37].
Corollary 2**.**
Let as defined in Theorem 1. Assume that there exist a constant and such that
[TABLE]
Take , such that and . Let
[TABLE]
Let and be given in step 0 of the INL-CondG method and let also and be generated by the INL-CondG method. Assume that the invertible approximation of satisfies
[TABLE]
and the residual is such that
[TABLE]
where is a sequence of invertible matrix (preconditioners for the linear system in (2)) and is a forcing sequence. If and , then is contained in , converges to and there hold
[TABLE]
[TABLE]
where .
Proof.
It is immediate to prove that , and defined by satisfy the inequality (3), conditions h1, h2 and h3 in Theorem 1. Moreover, in this case, it is easily seen that and , as defined in Theorem 1, satisfy
[TABLE]
as a consequence, (see (4)). Therefore, the statements of the corollary follow directly from Theorem 1. ∎
Remarks. 1) If a function is such that its derivative is L-Lipschitz continuous, i.e., , for all where , then it also satisfies condition (9) with and . Hence, we obtain the convergence of the INL-CondG method under a Lipschitz condition. In this case, converges linearly to , and if additionally and , it converges to quadratically. 2) It is worth mentioning that if and in the previous corollary, we obtain the convergence of the Newton-CondG method under a Hölder-like condition, as obtained in [15, Theorem 7].
We next specialize Theorem 1 for the class of analytic functions satisfying a Smale condition.
Corollary 3**.**
Let as defined in Theorem 1. Assume that is an analytic function and
[TABLE]
Take , such that and . Let , and
[TABLE]
Let and be given in step 0 of the INL-CondG method and let also and be generated by the INL-CondG method. Assume that the invertible approximation of satisfies
[TABLE]
and the residual is such that
[TABLE]
where is a sequence of invertible matrix (preconditioners for the linear system in (2)) and is a forcing sequence. If and , then is contained in , converges to and there holds
[TABLE]
[TABLE]
where .
Proof.
Under the assumptions of the corollary, the real function , defined by , is a majorant function for on ; see for instance, [10, Theorem 15]. Since is convex, it satisfies h3 in Theorem 1 with ; see [10, Proposition 7]. Moreover, in this case, it is easily seen that and , as defined in Theorem 1, satisfy
[TABLE]
and, as a consequence, (see (4)). Therefore, the statements of the corollary follow from Theorem 1. ∎
Remark. The convergence of the Newton-CondG method under a Smale condition, as obtained in [15, Theorem 8], follows from the previous corollary with and .
3 Proof of Theorem 1
The main goal of this section is to prove Theorem 1. First, we establish some properties involving the majorant function and its Newton iteration map. Then, some properties of the CondG procedure are discussed. Finally, the desired proof is presented.
From now on, we assume that all the assumptions of Theorem 1 hold, with the exception of h3, which will be considered to hold only when explicitly stated.
Proposition 4**.**
The constant is positive and for all As a consequence, the Newton iteration map defined by
[TABLE]
is well defined and satisfies
[TABLE]
Moreover, the constants and are positive and
[TABLE]
Proof.
Firstly, since is continuous and , it follows that the constant is positive. Hence, h2 implies that for all , from which we conclude that is well defined. On the other hand, in view of h2, we have is strictly convex in . Therefore, using , we obtain for any which, combined with and for any , proves the inequality in (11). Now, using the fact that and for all , we obtain
[TABLE]
Since , the second statement in (11) follows by taking limit in (13), as .
It remains to prove the last part of the proposition. First, as , we have . Hence, using (11), we conclude that there exists such that
[TABLE]
or, equivalently,
[TABLE]
Hence, is positive which in turn implies that is positive and (12) holds. ∎
The following lemma gives the some relationships between the majorant function and the nonlinear operator .
Lemma 5**.**
Let . Then the function is invertible and the following estimates hold:
- a)
**
- b)
**
- c)
.
Proof.
The proof follows the same pattern as the proofs of Lemmas 10, 11 and 12 in [10]. ∎
The next result presents a basic property of the CondG procedure whose the proof can be found in [15, lemma 4].
Lemma 6**.**
For any , and , we have
[TABLE]
Before presenting the proof of Theorem 1, we first establish a technical result which will be used to prove that the sequence is contained in and the sequence is strictly decreasing.
Lemma 7**.**
Assume that . Then, for every ,
[TABLE]
where is as in (10). As a consequence,
[TABLE]
Proof.
First of all, since for all , it follows from INL-CondG method that
[TABLE]
Hence, using the Lemma 6 with
[TABLE]
we obtain
[TABLE]
Now, simple calculus yields
[TABLE]
Since, , it follows from Lemma 5 that is invertible. Thus, combining the last two inequalities, we obtain
[TABLE]
which, combined with (5) and (6), yields
[TABLE]
On the other hand, using the third inequality in (6), we find
[TABLE]
Hence, it follows from (16) that
[TABLE]
Combining the last inequality with items (a), (b) and (c) of Lemma 5, we conclude that
[TABLE]
The latter inequality, definition of in (10) and the fact that imply that
[TABLE]
Hence, inequality (14) now follows by simple calculus. Since and , it follows from (12) with that
[TABLE]
which, combined with (14), yields (15). ∎
We are now ready to prove Theorem 1.
Proof of Theorem 1:.
Since , combining the first statement of Lemma 5, inequality (15) and an induction argument, it is immediate to conclude that the sequence is contained in .
Let us prove that converges to . Since, for all , the first inequality in (7) follows trivially from (15). Therefore, is a bounded strictly decreasing sequence and hence it converges to some . Moreover, taking into account that is continuous in , in particular, from (14) we have
[TABLE]
which, combined with (12), implies that and consequently .
Now, from (14) we obtain
[TABLE]
In order to prove the asymptotic rate in (7), just take the limit superior in the last inequality as and use , equality in (11) and .
It remains to show the last part of the theorem. For this purpose, let us assume that h3 holds. It follows from (15) and (14) respectively that, for all , and
[TABLE]
Therefore, the inequality (8) follows due to assumption h3, and the definition of in (10). ∎
Remark 3**.**
Similar to the analysis in [15], we could have defined a scalar sequence , associated to the majorant function , such that
[TABLE]
Indeed, if is defined as
[TABLE]
it is possible to prove that (17) holds, and is well defined, is strictly decreasing and converges to [math]. Moreover, where .
4 Numerical experiments
In this section, we present the results of some preliminaries numerical tests which show the computational feasibility of the INL-CondG method. The experiments were carried out on a set of 15 well-known box-constrained nonlinear systems (i.e., problem (1) with where and ) with dimensions between and (see Table 1). We made three implementations of our INL-CondG method which differ in the way that the approximation matrices ’s are built. In the first implementation, the matrices ’s were approximated by finite difference (FD), whereas in the second and third ones, we used the Broyden-Schubert Update (BSU) [5, 35] and Bogle-Perkins Update (BPU) [4], respectively. The three resulting methods described above are denoted by FD-INL-CondG, BSU-INL-CondG and BPU-INL-CondG, respectively. In the implementations of the BSU-INL-CondG and BPU-INL-CondG methods, the matrices ’s were approximated by finite difference when and mod. This strategy to periodically compute the Jacobian matrix seems to be crucial for the robustness of these derivative-free methods. For a comparison purpose, we also run the constrained Dogleg solver (CoDoSol) which is a MATLAB package based on the constrained Dogleg method [2], and available on the web site http://codosol.de.unifi.it.
The computational results were obtained using MATLAB R2016a on a 2.5GHz Intel(R) i5 with 6GB of RAM and Windows 7 ultimate system. The stopping criterion was used, and a failure was declared if the number of iterations was greater than or no progress was detected. The starting points were defined as where for problems having finite lower and upper bounds and with , for problems with infinite upper bound. However, since is a solution of Pb13, we used instead. In the implementations of the INL-CondG method, the error parameter was set equal to for all and the CondG Procedure stopped when either the required accuracy was obtained or the maximum of iterations were performed. The parameters of CoDoSol were set as the default choice recommended by the authors (see [2, Subsection 4.1]). It worth pointing out that the Jacobian matrices in the latter solver are approximated by finite difference.
Table 2 reports the performance of the FD-INL-CondG, BSU-INL-CondG and BPU-INL-CondG methods, and CoDoSol for solving the 15 problems considered. In table 2, “” is the scalar used to compute the starting point , “Iter” is the number of iterations of the methods and “” is the infinity norm of at the final iterate . Finally, the symbol “” indicates a failure.
From Table 2, we see that the FD-INL-CondG, BSU-INL-CondG and BPU-INL-CondG methods and CoDoSol successfully ended 42, 40, 37 and 42 times, respectively, on a total of 45 runs. The FD-INL-CondG method is comparable to or even slightly better than CoDoSol, because it required less iterations in 35 cases in which both methods successfully ended. This behavior also has been observed in [15] for some small and medium scale problems. Regarding the methods whose the is not evaluated at each iteration, the BSU-INL-CondG method solved 3 problems more than BPU-INL-CondG method, while the BPU-INL-CondG method required less (resp. more) iterations than BSU-INL-CondG method in 11 (resp. 7) cases in which both methods successfully ended. Hence, we may say that latter two methods had similar numerical performance and, for some problems, they are comparable to the methods in which all ’s are approximated by finite difference. Finally, based on the previous discussion, we conclude that the INL-CondG method seems to be a promising tool for solving medium and large box-constrained systems of nonlinear equations.
Final remarks
We proposed a method for solving constrained systems of nonlinear equations which is a combination of inexact Newton-like and conditional gradient methods. Under appropriate hypotheses and using a majorant condition, it was showed that the sequence generated by new method converge locally. Additionally, we were able to provide convergence results for two important classes of nonlinear functions, namely, one is for functions whose the derivative satisfies Hölder-like condition and the other is for functions that satisfies a Smale condition. In order to show the practical behavior of the proposed method, we tested it on medium- and large-scale problems from the literature. The numerical experiments showed that it works quite well and compares favorably with the constrained dogleg method [2].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bellavia, S., Macconi, M., and Morini, B. An affine scaling trust-region approach to bound-constrained nonlinear systems. Appl. Num. Math. 44 , 3 (2003), 257 – 280.
- 2[2] Bellavia, S., Macconi, M., and Pieraccini, S. Constrained dogleg methods for nonlinear systems with simple bounds. Comput. Optim. Appl. 53 , 3 (2012), 771–794.
- 3[3] Bellavia, S., and Morini, B. Subspace trust-region methods for large bound-constrained nonlinear equations. SIAM J. Numer. Anal. 44 , 4 (2006), 1535–1555.
- 4[4] Bogle, I. D. L., and Perkins, J. D. A new sparsity preserving quasi-Newton update for solving nonlinear equations. SIAM J. Sci. Statist. Comput. 11 , 4 (1990), 621–630.
- 5[5] Broyden, C. G. The convergence of an algorithm for solving sparse nonlinear systems. Math. Comp. 25 (1971), 285–294.
- 6[6] Chen, J., and Li, W. Convergence behaviour of inexact Newton methods under weak Lipschitz condition. J. Comput. Appl. Math. 191 , 1 (2006), 143–164.
- 7[7] Cruz, W. L., and Raydan, M. Nonmonotone spectral methods for large-scale nonlinear systems. Optim. Methods Softw. 18 , 5 (2003), 583–599.
- 8[8] Dunn, J. C. Convergence rates for conditional gradient sequences generated by implicit step length rules. SIAM J. Control Optim. 18 , 5 (1980), 473–487.
