A Fundamental Convergence Rate Bound for Gradient Based Online Optimization Algorithms with Exact Tracking
Alex Xinting Wu, Ian R. Petersen, Iman Shames

TL;DR
This paper establishes a fundamental convergence rate bound for gradient-based online optimization algorithms with integral action, demonstrating how to achieve optimal tracking of time-varying quadratic cost function optima.
Contribution
It introduces a convergence rate bound for linear gradient algorithms with integral action, based on the internal model principle, for tracking polynomially varying optima.
Findings
Derived a convergence rate bound depending on the condition number and polynomial order.
Constructed algorithms that attain the optimal convergence rate.
Achieved zero steady-state error in tracking the optimal point.
Abstract
In this paper, we consider algorithms with integral action for solving online optimization problems characterized by quadratic cost functions with a time-varying optimal point described by an th order polynomial. Using a version of the internal model principle, the optimization algorithms under consideration are required to incorporate a discrete time -th order integrator in order to achieve exact tracking. By using results on an optimal gain margin problem, we obtain a fundamental convergence rate bound for the class of linear gradient based algorithms exactly tracking a time-varying optimal point. This convergence rate bound is given by , where is the condition number for the set of cost functions under consideration. Using our approach, we also construct algorithms which achieve the optimal…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
A Fundamental Convergence Rate Bound for Gradient Based Online Optimization Algorithms with Exact Tracking
Alex (Xinting) Wu, Ian R. Petersen, Iman Shames *This work was supported by the Australian Research Council under grants DP230102443 and DP210102454. A. Wu, I. R. Petersen and I. Shames are with the CIICADA Lab, School of Engineering, The Australian National University, Canberra, ACT 2601, Australia (e-mail: [email protected]; [email protected]; [email protected]).
Abstract
In this paper, we consider algorithms with integral action for solving online optimization problems characterized by quadratic cost functions with a time-varying optimal point described by an th order polynomial. Using a version of the internal model principle, the optimization algorithms under consideration are required to incorporate a discrete time -th order integrator in order to achieve exact tracking. By using results on an optimal gain margin problem, we obtain a fundamental convergence rate bound for the class of linear gradient based algorithms exactly tracking a time-varying optimal point. This convergence rate bound is given by , where is the condition number for the set of cost functions under consideration. Using our approach, we also construct algorithms which achieve the optimal convergence rate as well as zero steady-state error when tracking a time-varying optimal point.
I INTRODUCTION
Online optimization [1, 2, 3] has emerged as an important area of research with implications across various fields, including real-time control systems, signal processing, and machine learning. An online optimization problem involves making a sequence of decisions in real time, where the objective function changes continually. The primary challenge lies in efficiently solving these problems while ensuring that the time-varying solutions converge asymptotically. This motivates the development of algorithms capable of fast convergence and zero steady-state error. However, many existing methods for online optimization such as those of [1, 3], do not achieve zero steady-state error.
In recent years, various methods have been employed for designing optimization algorithms and analyzing their convergence behavior, utilizing techniques from control theory, including the internal model principle [2], integral quadratic constraints (IQC) [4, 5], Lyapunov-based analysis [6] and dissipative systems [7]. More recently, [8, 9, 10] build on the connection between the optimal gain margin problem [11] and optimization algorithms, with a particular focus on the heavy ball method. This link has provided new insights into understanding the convergence behavior and stability of optimization algorithms by drawing parallels with control theory concepts.
In this paper, we consider a class of online optimization problems with quadratic cost functions where the optimal point varies polynomially with the time step. The internal model principle suggests that in order to achieve zero steady-state error, an algorithm needs to incorporate a suitable number of discrete time integrators [12]. The paper extends the results of [12] by using the optimal gain margin approach of [11] to establish a fundamental convergence rate bound of for all linear gradient based algorithms which achieve zero steady-state error when tracking a polynomially varying optimal point of order . Here, is the condition number for the class of cost functions, which will be defined in the sequel. In addition, for any set of cost functions which includes the set of quadratic cost functions under consideration, will have a convergence rate which satisfies this bound. Moreover, the paper shows that this convergence rate bound is tight in that we give a procedure to construct algorithms which achieve the bound.
A conference version of this paper was presented in [13]. In this paper, we extend the results of [13] by generalizing from a linearly varying optimal point to a polynomially varying optimal point and by establishing a fundamental convergence rate bound for gradient based optimization algorithms with exact tracking.
This paper is organized as follows: Section II presents the problem formulation and preliminary results. In Section III, we describe and prove our main results. Section IV presents an illustrative example. Finally, the paper is concluded in Section V.
II Problem formulation and Preliminary Results
In this section, we formulate the class of optimization problems under consideration and review some existing optimization methods and their convergence rates, alongside relevant preliminary results.
II-A Problem formulation
We consider unconstrained optimization problems of the form:
[TABLE]
where is a quadratic cost function with a unique minimum attained at .
To address this problem, we consider gradient based algorithms similar to the general algorithms considered in [8] of the following form:
[TABLE]
where and
[TABLE]
In (2), represents the current iteration point and denotes the iteration index. The number denotes the number of past iterates considered in the optimization algorithm and the number of gradient evaluations at each iteration. Since corresponds to the number of past iterates, the algorithm needs be initialized with the values . The parameters and are the algorithm parameters, and denotes the gradient of the cost function. It is straightforward to verify that this class of algorithms includes all linear gradient based optimization algorithms with constant step size.
We consider a class of time-varying quadratic cost functions defined as follows.
Definition 1
Given , let denote the class of time-varying quadratic cost functions of the following form:
[TABLE]
where the time-varying optimal point is of the form
[TABLE]
and are constant vectors. In (1),
[TABLE]
We also define .
In the sequel, we will show that in order for the algorithm (2) to track the polynomially varying optimal point defined in (5), it is necessary that it satisfies the following condition.
Condition 1
The algorithm (2) is such that
[TABLE]
for any and any , where denotes the falling factorial of order [14]; i.e., .
In the sequel, we will demonstrate that Condition 1 corresponds to the requirement that the algorithm includes at least discrete time integrators and so obtains zero steady-state error with an optimal point of the form (5) according to the internal model principle; e.g., see [2].
In this paper, we consider online optimization algorithms of the form (2) to track the minimum in (2) where . The algorithms (2) can be reformulated as uncertain linear feedback systems, enabling us to analyze their convergence behavior through the lens of an optimal gain margin problem. This work extends the findings of [8] by considering online optimization problems in which the optimal point varies polynomially with time. The paper extend the findings of [12] by considering the optimal convergence rate with respect to all algorithms of the form (2). In particular, we establish a fundamental convergence rate bound for all algorithms of the form (2) that achieve zero steady-state error for time-varying cost functions of the form . This convergence rate bound is given by the formula . In addition, we show that this bound is tight in the sense that we can construct an algorithm which achieves this bound. This algorithm is constructed using a Blaschke product solution to a corresponding Nevanlinna Pick interpolation problem [15, 16].
II-B The Heavy Ball Method
The heavy ball method [17] is one of the most well-known methods for solving a time-invariant version of optimization problem (1) in which the cost function is independent of . This approach enhances the standard gradient descent algorithm by incorporating a momentum term, resulting in an optimization algorithm of the form:
[TABLE]
The heavy ball method accelerates convergence by combining the current gradient with information from the previous step. With Polyak’s choice of parameters, this method exhibits the fastest convergence rate for time-invariant cost functions [8]. These parameters are defined as:
[TABLE]
The paper [8] has shown that, for any time-invariant cost functions , the heavy ball method yields the optimal worst-case asymptotic convergence rate, among all algorithms characterized by the following general form:
[TABLE]
where
[TABLE]
The convergence rate of the heavy ball method with parameters (9) is calculated as in [8, 17]:
[TABLE]
This formula then defines a bound on the convergence rate for all algorithms of the form (10) when applied to time-invariant cost functions; see [8, 18].
II-C Preliminary results
Let denote an algorithm corresponding to the recursion (2) depending on parameters and the set of cost functions defined by (1), (5), (6) where .
The following lemma provides a time-invariant representation of the iterative algorithm (2) in the case that Condition 1 is satisfied. This provides an alternative approach to the internal model principle for exact tracking of polynomially varying signals.
Lemma II.1
Given , , and , such that , and an algorithm of the form (2) such that Condition 1 is satisfied. Then (2) can be rewritten as:
[TABLE]
where
[TABLE]
Proof: To prove this lemma, note that it follows from (II.1) that
[TABLE]
where . Also,
[TABLE]
It follows from (II-C) that
[TABLE]
In addition,
[TABLE]
Also, it follows from (II.1) that
[TABLE]
Substituting (2) into (18), we obtain
[TABLE]
Substituting (II.1), (16) and (II-C) into (II-C) yields
[TABLE]
where
[TABLE]
Thus, we need to show that . Indeed, it follows from (5) that if
[TABLE]
for . To verify (22), we introduce Stirling numbers of the second kind; e.g., see [14, Section 6.1] which gives the following relations
[TABLE]
where denotes the Stirling numbers of the second kind and denotes the falling factorial. Therefore
[TABLE]
Similarly,
[TABLE]
It is straightforward to verify that and it follows from (7) that
[TABLE]
Since for , it follows that
[TABLE]
Substituting (II-C) into (II-C), we obtain
[TABLE]
Thus, it follows from (II-C) that the left hand side of (22) equals to the right hand side. Hence, and we can conclude that (2) is equivalent to (II.1).
Remark 1
It follows immediately from this lemma that an algorithm of the form (2) satisfying Condition 1 will achieve convergence with zero steady-state error for all if and only if the corresponding algorithm (II.1) is such that as for all .
We can now reformulate (II.1) as an uncertain linear feedback system:
[TABLE]
where , ,
[TABLE]
and
[TABLE]
Here,
[TABLE]
and is a block diagonal matrix with copies of the symmetric matrix on its diagonal,
[TABLE]
where satisfies (6), denotes the Kronecker product, is the identity matrix, and 0 denotes the zero matrix or vector of appropriate dimension.
Define , where are defined as in (30). We also define where .
The following lemma, which follows from [19, Theorem 10.1.4, p. 301] and [8, Theorem 1], relates the convergence rate of the algorithm (II.1) to the spectral radius of .
Lemma II.2
Given any , , , , such that the algorithm (II.1) is globally convergent with the convergence rate
[TABLE]
Then
[TABLE]
Here, denotes the spectral radius of a matrix.
In order to apply this lemma to calculate the convergence rate of the algorithm (II.1) and hence the algorithm (2), we first consider the characteristic polynomial of . Indeed, we compute as
[TABLE]
where
[TABLE]
Then we compute the characteristic equation of as
[TABLE]
Equation (41) can be written as , where
[TABLE]
The following lemma relates Condition 1 to the condition that the algorithm (2) incorporates at least discrete time integrations.
Lemma II.3
Condition 1 is satisfied if and only if the polynomial defined in (42) has at least roots at .
Proof: Let be given and let . Also, define
[TABLE]
From this, it follow that will have at least roots at if and only if has at least roots at .
We now consider the first derivatives of :
[TABLE]
for . Then will have at least roots at , if and only if
[TABLE]
for . This is equivalent to
[TABLE]
for . This is Condition 1, thereby verifying the lemma.
III Main Results
The main result of this paper, which is a fundamental convergence rate bound for algorithms of the form (2), is stated in the following theorem.
Theorem 1
Let and be given. Then any algorithm of the form (2) which is globally convergent with zero steady-state error for any , will have a convergence rate satisfies:
[TABLE]
The following corollary follows immediately from the above theorem.
Corollary 1.1
Let and be given. Then any algorithm which is globally convergent with zero steady-state error for any set of cost functions , will have a convergence rate which satisfies the bound (1).
The following theorem shows that the bound given in Theorem 1 is tight and gives the construction of an algorithm which achieves the bound.
Theorem 2
Let and be given. Then there exists an algorithm of the form (2) which is globally convergent with zero steady-state error for and has the convergence rate
[TABLE]
In this algorithm, and the corresponding values of and are given by
[TABLE]
for and
[TABLE]
for where
[TABLE]
In order to prove the above theorems, we introduce a series of lemmas.
Lemma III.1
The signal in (5) has a -transform of the form
[TABLE]
where is a polynomial of order such that for .
Proof: The proof proceeds by induction on . By inspection,
[TABLE]
where is a polynomial of order 1 and . Hence, the base case holds.
Now assume that for a given , the following formula holds
[TABLE]
where is a polynomial of order and . We now consider the case of and use a property of -transforms; e.g., see [20, Equation 2.9, p. 16]:
[TABLE]
where denotes the derivative of . Using (52), and the fact that is a polynomial of order , it follows that is a polynomial of order and
[TABLE]
Thus by induction, (51) holds for all . From this, the lemma follows.
The following lemma provides a necessary condition for achieving zero steady-state error, and amounts to a version of the internal model principle for the problem under consideration.
Lemma III.2
Given any , suppose an algorithm of the form (2) is globally convergent with zero steady-state error for any . Then
[TABLE]
has at least roots at . That is, has at least roots at .
Proof: In order to prove this lemma, we write the iterative algorithm (2) in terms of a Luré system by letting
[TABLE]
be the state vector, and defining the reference signal
[TABLE]
Also, let , and be matrices defined by
[TABLE]
with defined in (32),
[TABLE]
It follows that for any , (2) can be rewritten as
[TABLE]
where is defined as in (37). Since we have assumed that the algorithm guarantees convergence with zero steady-state error for any , it follows that we will obtain convergence with zero steady-state error for with and . In this case, and we can rewrite (61) as
[TABLE]
This system, can be rewritten in state space form as
[TABLE]
The system (III) is illustrated in Figure 1.
Let denote the transfer function , which takes the form
[TABLE]
where the scalar transfer function is given by
[TABLE]
where the denominator polynomial is
[TABLE]
Note that (64) follows from the fact that the system (55), (60), (III) is in controllable canonical form.
It follows from Figure 1 that the error signal is such that
[TABLE]
leading to
[TABLE]
where is the -transform of and is the -transform of .
Thus,
[TABLE]
with
[TABLE]
Applying the -transform to , we obtain
[TABLE]
It follows from Lemma III.1 that
[TABLE]
Hence,
[TABLE]
Multiplying the numerator and denominator of this expression by , we rewrite (III) as
[TABLE]
Using the Final Value Theorem [20, Theorem 2.1], the steady-state error is
[TABLE]
Therefore, to achieve zero steady-state error, it is required that
[TABLE]
Using (III), this implies that the denominator of must contain at least poles at . That is, has at least roots at . Hence, has at least roots at .
The proofs of our main results use results on the optimal gain margin problem as established in [11, Problem 2.4] and incorporate a related special case from the theoretical development in [8]. Generalizing the case in [8], our approach considers a plant that includes a discrete time -th order integrator, enabling the solution of the polynomially time-varying optimization problem with zero steady-state error. The required result is contained in the following lemma.
Lemma III.3
Consider the uncertain linear feedback system depicted in Figure 2, which comprises the plant , which is a discrete time -th order integrator, an uncertain constant gain and a compensator . Let be given and suppose is strictly proper. If the compensator places all of the poles of this system in the disk for all then
[TABLE]
where is the convergence rate of the heavy ball method (11).
Proof: The proof of the lemma follows along the same lines as the proof of [8, Lemma 2], with modifications to accommodate our problem setting.
We introduce the sensitivity function defined as
[TABLE]
The poles of at , are zeros of . Also the zeros of at , correspond to a zero of , since is strictly proper. Therefore, we conclude that
[TABLE]
with multiplicity of , and
[TABLE]
Define and , where is the set of complex numbers.
It follows from [8, Lemma 2] and [11, Lemma 2.3], that in order to place the poles of the closed loop system within the interior of the open disk for all , it is equivalent to demonstrate the existence of a sensitivity function defined in (68) which is analytic in that satisfies the conditions (69), (70) and for all . To determine the existence of such an , we reformulate the problem as a Nevanlinna Pick interpolation problem, as described in [8, 11], using the commutative diagram in Figure 3.
In this figure,
[TABLE]
and the function maps into . Similarly, it follows from [8, 11] that the function
[TABLE]
is analytic in , maps into . Also, the function
[TABLE]
should be constructed to be analytic in , and map the closed unit disk into the open unit disk .
As illustrated in the commutative diagram in Figure 3, the existence of a function satisfying (69) and (70) which is analytic in , and maps into , is equivalent to a Nevanlinna Pick interpolation problem for the function .
The Nevanlinna Pick interpolation problem is defined as follows. Given a set of points in the closed unit disk and a corresponding set of target values , find a function which is analytic in such that for all , where is the open unit disk, and satisfying the interpolation conditions with specified multiplicities.
It follows from (69) and (70) that the required interpolation values for are
[TABLE]
with multiplicity , and
[TABLE]
Note that it is straightforward to verify that .
Handling an interpolation condition with multiplicities requires imposing conditions on the derivatives of , which complicates the interpolation problem. To avoid this complexity, we introduce small perturbations to the plant’s poles by redefining the plant transfer function as
[TABLE]
where denotes the number of integrators with and the are small positive parameters. As , the plant reduces to , which represents a -th order integrator. This modification allows us to avoid multiplicities in the interpolation conditions by ensuring that the poles and zeros of are distinct, thereby simplifying the Nevanlinna-Pick interpolation problem. In this modified problem, the poles of at , are zeros of . Therefore, the interpolation conditions of become
[TABLE]
all with multiplicity one. By perturbing the interpolation points in this manner, we obtain modified interpolation conditions on , which yields
[TABLE]
all with multiplicity one.
Now we are in the position to determine the existence of such an . By [21, Theorem 2.2] there exists an interpolation function with for all that satisfies (78)-(79) if and only the Pick matrix written as
[TABLE]
is positive semi-definite. In (80), ,
[TABLE]
is a column vector of ones and is a scalar, . To satisfy the positive semi-definiteness of the Pick matrix , a necessary condition is that its determinant be non-negative. To compute the determinant of the Pick matrix , we employ the Schur complement [22], which yields
[TABLE]
Since is a scalar
[TABLE]
and
[TABLE]
Let where . Since is a vector of ones, we obtain
[TABLE]
Let , where ,
[TABLE]
and is a diagonal matrix whose diagonal entries are given by . It is clear that
[TABLE]
and
[TABLE]
Also, according to [23, Theorem 1],
[TABLE]
where
[TABLE]
and
[TABLE]
Substituting (87), (88), (86) and (84) into (82) yields
[TABLE]
Substituting (III) into (III) yields
[TABLE]
Since and , and for all . Hence, a necessary condition for the Pick matrix to be positive semi-definite is . Letting from the positive direction, we obtain
[TABLE]
It follows that a necessary condition for the Pick matrix to be positive semi-definite is
[TABLE]
which is equivalent to
[TABLE]
Since , the condition (95) is equivalent to (67). Thus, if there exists a compensator satisfying the conditions of the lemma, then (67) must be satisfied.
Now, we can prove Theorem 1.
Proof of Theorem 1: Using Lemma III.2, a necessary condition for the algorithm to track a polynomially varying optimal point with zero steady-state error, for any , is that defined in (42) has at least roots at . Moreover, Lemmas II.2 and III.3 imply that the corresponding system in (II-C) must position all poles inside the disk defined by for every in order to achieve a convergence rate . In order to meet this requirement, condition (67) must hold. Thus, any algorithm which satisfies the conditions of the theorem will have a convergence rate which satisfies (1).
Also, we are now in the position to prove Theorem 2.
Proof of Theorem 2: In order to prove this theorem, we construct an algorithm satisfying the conditions of the theorem and such that (2) is satisfied. It follows from Lemma II.2 that we want to construct an algorithm such that the characteristic polynomial of has all its roots inside the circle of radius for all . We now define
[TABLE]
and
[TABLE]
where and are defined in (42). It follows from (42) that
[TABLE]
Therefore, is the radius of the smallest disk that contains the poles of the SISO feedback control system in Figure 2.
We now construct the interpolation function defined in (72) using the finite Blaschke product approach, as described in [21, p. 53]. The Blaschke product is a classical method for constructing bounded analytic functions on the unit disk that satisfy prescribed interpolation conditions. The general form of a finite Blaschke product is given by:
[TABLE]
where , are the interpolation points inside the unit disk and denotes the complex conjugate of . Considering the interpolation conditions (78), the corresponding Blaschke product is given by:
[TABLE]
In addition, considering (79), evaluating the Blaschke product at , we obtain
[TABLE]
By letting , we obtain
[TABLE]
where . Then
[TABLE]
Therefore, the required analytic function is
[TABLE]
since . Thus, the function defined in (102) satisfies the interpolation conditions (73) and (74), for all , and inequality (67) is satisfied with equality.
Since , we can now construct
[TABLE]
In order to find , we firstly calculate :
[TABLE]
Also, we construct the inverse of the function as
[TABLE]
Substituting (104) and (105) into (103) yields
[TABLE]
Substituting (106) and into (68) and rearranging yields
[TABLE]
where
[TABLE]
Substituting (107) into yields
[TABLE]
Substituting into (III) yields
[TABLE]
where
[TABLE]
Substituting (III) into (III) and simplifying, yields
[TABLE]
where are defined in (2). Note that the transfer function is proper, since , which cancels out the leading term in the numerator. In order to find the coefficients and , we employ the binomial theorem [14], which yields
[TABLE]
[TABLE]
[TABLE]
Therefore the numerator of can be rewritten as
[TABLE]
Similarly,
[TABLE]
and the denominator of can be rewritten as
[TABLE]
Note that
[TABLE]
Therefore, we obtain the final expression for
[TABLE]
where the coefficients and in the polynomial expansions of and precisely match the corresponding coefficients obtained from the binomial expansions of and in (III) and (118), respectively. Thus, we have constructed an algorithm satisfying the conditions of the theorem with convergence rate given by (2).
Remark 2
Note that it may be useful to define an alternative form for the algorithm (2) by using an alternative state space realization
[TABLE]
rather than using the formula (2), (48). This is an area for future research.
IV Illustrative Example
This section gives an illustrative example of the use of the algorithm of the form (2) defined in Theorem 2. Consider a quadratic cost function ,
[TABLE]
where is a real source location at time defined by
[TABLE]
Equation (122) has a Taylor series expansion [24, Chapter 4]
[TABLE]
To simplify the problem, we only consider the first two terms of the Taylor series (123). It follows from Lemma III.2 that to achieve exact tracking of a polynomially varying point of order , the iterative method must incorporate at least four integrators. In order to find the corresponding algorithm parameters, it follows from (42) and (III) that
[TABLE]
where are defined in (2) and . Solving these equations, we find the corresponding optimal parameters as follows:
[TABLE]
and
[TABLE]
where
[TABLE]
Figure 4 compares the true optimal point to its estimated value obtained by using Theorem 2 and by using the standard gradient descent method with step size , where and . The yellow star represents the true optimal point, moving according to the polynomial trajectory (123) of order . The red line shows the optimal point estimated by the optimal method of Theorem 2. It can be observed that the optimal method of Theorem 2 successfully tracks the optimal point with zero steady-state error. In addition, the black line in Figure 4 shows the performance of the gradient descent method, which fails to track the time-varying optimal point with zero steady-state error.
To further verify the performance of our approach, Figure 5 illustrates the difference between the true optimal point, its estimate using the optimal algorithm obtained from Theorem 2 and the gradient descent method. Clearly, the optimal algorithm obtained from Theorem 2 achieves zero steady-state error, whereas the gradient descent method does not.
V CONCLUSIONS
In this paper, we establish a fundamental convergence rate bound for all linear gradient based optimization algorithms which optimize a time-varying cost function with zero steady-state error. We consider online optimization algorithms for solving optimization problems with time-varying quadratic cost functions. Our main results incorporate results on the optimal gain margin of linear uncertain systems. We also use a version of the internal model principle to require the inclusion of discrete time integrators in the corresponding algorithm. This approach enables the algorithm to achieve exact tracking of time-varying optimal points.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] E. Hazan, Introduction to Online Convex Optimization , 2nd ed. USA: The MIT Press, 2022.
- 2[2] N. Bastianello, R. Carli, and S. Zampieri, “Internal model-based online optimization,” IEEE Transactions on Automatic Control , vol. 69, no. 1, pp. 689–696, 2024.
- 3[3] L. Madden, S. Becker, and E. Dall’Anese, “Bounds for the tracking error of first-order online optimization methods,” Journal of Optimization Theory and Applications , vol. 189, no. 2, pp. 437–457, 2021.
- 4[4] L. Lessard, B. Recht, and A. Packard, “Analysis and design of optimization algorithms via integral quadratic constraints,” SIAM Journal on Optimization , vol. 26, no. 1, pp. 57–95, 2016.
- 5[5] C. S. Simon Michalowsky and C. Ebenbauer, “Robust and structure exploiting optimisation algorithms: an integral quadratic constraint approach,” International Journal of Control , vol. 94, no. 11, pp. 2956–2979, 2021.
- 6[6] B. Van Scoy and L. Lessard, “A tutorial on a Lyapunov-based approach to the analysis of iterative optimization algorithms,” in 2023 62nd IEEE Conference on Decision and Control (CDC) , 2023, pp. 3003–3008.
- 7[7] L. Lessard, “The analysis of optimization algorithms, a dissipativity approach,” IEEE Control Systems Magazine , vol. 42, 05 2022.
- 8[8] V. Ugrinovskii, I. R. Petersen, and I. Shames, “A robust control approach to asymptotic optimality of the heavy ball method for optimization of quadratic functions,” Automatica , vol. 155, pp. 111–129, 2023.
