Timescale Separation in Autonomous Optimization
Adrian Hauswirth, Saverio Bolognani, Gabriela Hug, Florian D\"orfler

TL;DR
This paper analyzes the stability of autonomous optimization controllers modeled after optimization algorithms, emphasizing the importance of timescale separation and providing stability bounds for various methods.
Contribution
It quantifies the necessary timescale separation for stable autonomous optimization and offers direct design prescriptions using singular perturbation analysis.
Findings
Derived stability bounds for gradient-based feedback laws
Identified robustness issues with certain optimization algorithms in autonomous settings
Provided guidelines for designing stable autonomous optimization controllers
Abstract
Autonomous optimization refers to the design of feedback controllers that steer a physical system to a steady state that solves a predefined, possibly constrained, optimization problem. As such, no exogenous control inputs such as setpoints or trajectories are required. Instead, these controllers are modeled after optimization algorithms that take the form of dynamical systems. The interconnection of this type of optimization dynamics with a physical system is however not guaranteed to be stable unless both dynamics act on sufficiently different timescales. In this paper, we quantify the required timescale separation and give prescriptions that can be directly used in the design of this type of feedback controllers. Using ideas from singular perturbation analysis, we derive stability bounds for different feedback laws that are based on common continuous-time optimization schemes. In…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Timescale Separation in Autonomous Optimization
Adrian Hauswirth, Saverio Bolognani, Gabriela Hug, and Florian Dörfler The authors are with the Department of Information Technology and Electrical Engineering, ETH Zürich, 8092 Zürich, Switzerland. Email: {hadrian,bsaverio,ghug,dorfler}@ethz.ch.This work was supported by ETH Zürich funds, by the SNF AP Energy Grant #160573, and by the Swiss Federal Office of Energy grant #SI/501708 UNICORN.Manuscript received May 20, 2019
Abstract
Autonomous optimization refers to the design of feedback controllers that steer a physical system to a steady state that solves a predefined, possibly constrained, optimization problem. As such, no exogenous control inputs such as setpoints or trajectories are required. Instead, these controllers are modeled after optimization algorithms that take the form of dynamical systems. The interconnection of this type of optimization dynamics with a physical system is however not guaranteed to be stable unless both dynamics act on sufficiently different timescales. In this paper, we quantify the required timescale separation and give prescriptions that can be directly used in the design of this type of feedback controllers. Using ideas from singular perturbation analysis, we derive stability bounds for different feedback laws that are based on common continuous-time optimization schemes. In particular, we consider gradient descent and its variations, including projected gradient, and Newton gradient. We further give stability bounds for momentum methods and saddle-point flows. Finally, we discuss how optimization algorithms like subgradient and accelerated gradient descent, while well-behaved in offline settings, are unsuitable for autonomous optimization due to their general lack of robustness.
Index Terms:
Optimization, Gradient methods, Closed-loop systems
I Introduction
Two of the first and foremost motivations for feedback control have traditionally been the stabilization of unstable dynamical systems and tracking of a reference signal in the presence of disturbances. Although prevalent control design methods often serve to accomplish both goals at the same time, the task of stabilization is generally associated with the design of a proportional controller, whereas tracking of a setpoint under constant disturbances usually requires the incorporation of an integral control component. These setpoints are, in turn, carefully designed, e.g., in a conventional setting via an offline (i.e., feedforward) optimization procedure.
Against this backdrop, we consider in this paper the concept of autonomous optimization (or feedback-based optimization), which aims at generalizing controllers beyond basic setpoint tracking. Instead, we consider the design of (integral) feedback controllers that steer a (stable) physical system to the solution of a general optimization problem without requiring an explicit solution in the form of an exogenous setpoint, hence being “autonomous”. This particular choice of words also refers to the fact that for most practical applications only time-invariant feedback controllers are of relevance.
A particular feature of feedback-based optimization are the different possibilities to incorporate constraints that need to be satisfied at steady state. These constraints can either be saturation-like in that are satisfied at all times or asymptotic, in the sense that the can be violated during the transient behavior, but need to be satisfied in the limit. As the name suggests, saturation-like constraints are generally associated with physical saturation, e.g., due to limited actuation capabilities at the input, and constraints on outputs are often formulated as asymptotic constraints.
The concept of autonomous optimization is in marked contrast with optimal control frameworks such as dynamic programming or model predictive control, since transient optimality of trajectories is not the primary goal. Instead, one aims for controllers that achieve asymptotic optimality at low computational cost and with little model information.
The problem of steering the state (or output) of a physical system to an optimal steady state has been considered in different contexts and fields (see next section). However, many previous works start from a timescale separation assumption where the physical system exhibits fast-decaying dynamics that are ignored in the control design. This simplifies the problem since the physical system can be abstracted by algebraic constraints, i.e., its steady-state behavior.
In this paper, we quantify the required timescale separation for feedback-based optimization schemes that take the simple form illustrated in Fig. 1. Namely, we consider a physical system that is interconnected with optimization dynamics that are modeled after common optimization algorithms (e.g. gradient descent, momentum methods, or saddle-point flows) and apply ideas inspired by singular perturbation analysis to derive sufficient conditions for closed-loop stability.
Throughout, we assume that the physical system is stable (or stabilized by an appropriate fast controller). By doing so, we follow a paradigm of “first stabilize, then optimize” which is in contrast to other recent works that base their designs on integral quadratic constraints [1, 2], backstepping [3], or output regulation [4]. In particular, [1, 2] pursue a holistic perspective where stabilization and tracking are considered as joint objectives. These works, however, arrive at complex and convoluted LMI conditions to certify stability that are computationally expensive at large scales and often do not directly translate into a systematic design method.
In contrast, our results—although simple and potentially conservative—give immediate design prescriptions while requiring only limited model information, that can often be estimated in practice. They can be applied to large-scale systems without redesigning existing stabilizing controllers and have an intuitive interpretation in terms of the timescale separation required between slow optimization dynamics and fast underlying system behavior. Finally, the generality of our approach allows us to consider nonlinear plants as well as a plethora of optimization algorithms.
I-A Related Work
The problem of driving a physical system to an optimal steady state has a considerable history. Early precursors can be found in process control under the name of optimizing control [5] which has evolved into the modern notion of real-time optimization [6, 7, 8]. This line of work is, however, mostly concerned with reducing the effect of inaccurate steady-state models, rather than the interactions with fast dynamics.
Further, the concept of extremum-seeking [9, 10, 11] aims at learning a gradient direction without recourse to any model information by means of a probing signal and exploitation of non-commutativity, but significant limitations arise when considering high-dimensional systems or constraints.
The historic roots of the approach pursued in this paper can be traced back to the study of communication networks where congestion control algorithms have been analyzed from an optimization perspective [12, 13, 14]. Similar ideas have recently attracted a lot of interest in power systems, where feedback-based optimization schemes have been proposed for voltage control [15, 16], frequency control [17, 18, 19], or general power flow optimization [20, 21, 22, 23]. For a survey see [24].
I-B Contributions
In this paper, we extend and generalize the results in [25]. Namely, we consider nonlinear physical systems instead of linear time-invariant (LTI) plants and we study a variety of optimization dynamics other than mere gradient flows. In particular, we study a general class of variable-metric gradient descent algorithms, including special cases such as Newton descent. Furthermore, we consider the case of projected gradient descent which, in the feedback-optimization context, can be interpreted as a model for physical input saturation. We also develop a stability bound for momentum methods (such as the heavy ball method). Finally, we provide a general result that can be applied, for instance, to saddle-point algorithms that are commonly used in autonomous optimization to enforce asymptotic constraints (that can be transiently violated) on output variables.
For our analysis, we use ideas from singular perturbation analysis to construct classes of Lyapunov functions that cannot only be used to certify stability but provide direct prescriptions for the feedback control synthesis.
Finally, through the non-examples of subgradient flows and accelerated gradient descent, we illustrate the sharpness of our analysis (in the sense that our assumptions cannot generally be avoided) and the fundamental limitations of the general framework of autonomous optimization.
I-C Organization
In Section II we fix the notation and recall basic results from nonlinear systems theory. Section III provides a comprehensive study of gradient-based feedback controllers, describes the main proof ideas, and explores specific examples and variations of gradient-based schemes. In Sections IV and V we consider momentum-based algorithms and general feedback optimization schemes, respectively. Finally, in Section VI we summarize our results and discuss open problems. In the Section -D we also provide an additional result specialized to LTI systems.
II Preliminaries
We consider the usual Euclidean setup for where denotes the canonical inner product and the associated 2-norm. The non-negative real line is denoted by . If is a matrix, denotes the induced matrix norm, namely . In particular, if is square and symmetric, then and denote the maximum and minimum eigenvalue of , respectively. If is positive definite, denoted by , we use the notation for to denote the norm on induced by . A map is called a metric on the space , in the sense that it defines a (variable) norm at every point and vector .
Let be open and consider a map . Unless noted otherwise, differentiability is understood in the usual sense (of Fréchet). Namely, denotes the matrix of partial derivatives of evaluated at . If is a subset of variables, then denotes the Jacobian with respect to . The map is -Lipschitz continuous if for all . If we call the gradient of at . In this case, is -strongly convex if for all . In particular, if is twice continuously differentiable, is -strongly convex if and only if for all .
Dynamical Systems
Given a vector field , consider the initial value problem
[TABLE]
where is an initial condition. A function is called a complete solution to (1) if is continuously differentiable, , and holds for all . A set is invariant if all solutions with remain in for all . Given a differentiable function , we denote its Lie derivative along the vector field (which is usually clear from the context) by . Stability and asymptotic stability are understood in the sense of Lyapunov. That is, a set is stable, if for every neighborhood of there exists another neighborhood such that all trajectories starting in remain in .
Nonlinear Optimization
Given two continuously differentiable functions and , let and let denote the set of active inequality constraints at . We call a regular set if for all the matrix has full row rank .111 The term regular alludes to the fact that these sets are in fact Clarke regular (or tangentially regular) [26]. Furthermore, the requirement that is known in the optimization literature as linear independence constraint qualification (LICQ). The tangent and normal cone of at are respectively
[TABLE]
Namely, and are both closed convex cones, and they are polar cones to each other. For an optimization problem where is continuously differentiable, a point is critical if it satisfies the first-order optimality conditions (KKT conditions). Namely, and . This is equivalent to the existence of and such that
[TABLE]
and for all . A point is a local minimizer if for all in a neighborhood of it holds that . A local minimizer is strict if holds for all .
II-A Nonlinear Plant Dynamics
Throughout, we consider physical plants modeled as
[TABLE]
where is the system state, is a measurable control input, is an initial condition, is a locally Lipschitz continuous vector field. Hence, the existence of a local solution for some and any initial condition is guaranteed.
Assumption II.1**.**
The function in (3) is continuously differentiable, -Lipschitz in , and -Lipschitz in . There exists a differentiable, -Lipschitz continuous map such that for all . Finally, there exist such that for every initial condition and every constant it holds that
[TABLE]
where is a solution to (3) with and .
The existence of well-defined steady-state map can for instance be guaranteed if is continuously differentiable and is invertible for all and . In this case the implicit function theorem guarantees the existence of such that for all . Lipschitz continuity of is guaranteed if is Lipschitz continuous and all eigenvalues of are bounded away from 0 with some minimal distance for all . Note that Assumption II.1 implies that trajectories are complete, i.e., can be extended to .
Remark 1*.*
For simplicity, we assume that and can take any value in and , respectively. However, if some subsets and are known to be invariant under given dynamics, Assumption II.1 can be weakened because it needs to be satisfied only on and . In Section III-C4 we illustrate this possibility for the example of projected gradient flows.
Assumption II.1 requires (3) to be exponentially stable with decay rate . This, in turn, implies the existence of Lyapunov function, as indicated by the following result.
Proposition II.1**.**
Let Assumption II.1 hold. Then, for any fixed there exists a Lyapunov function for the system (3) and parameters such that
[TABLE]
Proposition II.1 is a condensation of a standard converse Lyapunov theorem for exponentially stable systems [27, Th. 4.14]. Only the definition of (which captures a Lipschitz-type property of with respect to ) is non-standard. A proof can be found in Section -B.
II-B Variable-Metric Gradient Flows
A gradient flow is a dynamical system on defined as
[TABLE]
for some initial condition where is continuously differentiable with locally Lipschitz gradient, and is a locally Lipschitz continuous metric on , i.e., as a map from to . Namely, Lipschitz continuity of and guarantee the existence and uniqueness of local solution trajectories of (4) for any initial condition.
Although gradient flows are one of the most basic optimization dynamics, generally, one can only conclude the following:
Theorem II.2**.**
If has compact level sets, all trajectories of (4) are complete and converge to the set .
Theorem II.2 follows from the Invariance Principle [28, Prop 5.22]. The fact that trajectories are complete follows from the fact that level sets of are compact and invariant.
The use of a variable metric generalizes the class of gradient flows to include, for instance, Newton gradient flows; see Section III-C. It modifies the solution trajectories, but does not change the qualitative convergence behavior.
In general—and even if —it is not possible to conclude that trajectories converge to minimizers of [29]. One option is to assume convexity of in which case convergence to the set of global minimizers follows immediately.
Without convexity it is still possible to identify minimizers based on their stability properties as dynamic equilibria.
Theorem II.3**.**
[29]** For a critical point of (4) the following relations hold:
{SLM}$${LM}$${ASE}$${SE}
where (S)LM stands for (strict) local minimizer and (A)SE for (asymptotically) stable equilibrium.
In particular, a local minimizer of is not necessarily a stable equilibrium and vice versa. A common remedy to avoid this kind of pathological behavior is to require the objective function (and the metric) to be real analytic [29].
Nevertheless, it is an important observation that, from the dynamical systems point-of-view, asymptotic stability of an equilibrium replaces the need for second-order optimality conditions such as positive definiteness of the Hessian of .
III Gradient-Based Feedback Controllers
We now show how gradient flows lend themselves to designing nonlinear feedback controllers that can steer physical systems to an optimal steady state. In particular, we derive a basic requirement for stability of the feedback interconnection with a physical plant. Finally, we discuss our results (and their limitations) in the context of three special classes of gradient-type controllers.
III-A Gradient-Based Feedback Control
As a starting point for our control design, we consider the optimization problem
[TABLE]
where is the steady-state map of a plant satisfying Assumption II.1 and is a differentiable cost function depending on the system state and the control input.
By substituting with in the objective function, we arrive at the unconstrained optimization problem
[TABLE]
where . Adopting singular perturbation terminology, we call (6) the reduced problem since it assumes that the physical system is at steady state.
Based on (6), we can formulate a gradient flow of as
[TABLE]
where is a Lipschitz continuous metric and where we have applied the chain rule and defined
[TABLE]
A feedback controller can be obtained from (7) by replacing in the evaluation of by the measured value of . The interconnection is hence defined by
[TABLE]
Existence and uniqueness of local solutions of (8) are guaranteed for any initial condition , since , and are locally Lipschitz continuous by assumption. Completeness of solutions will be shown jointly with stability. Independently, equilibria of (8) always coincide with the critical points of (5):
Proposition III.1**.**
Every minimizer of (5) is an equilibrium point of (8). Conversely, every equilibrium point of (8) is a critical point of (5).
Proof.
First, note that is a regular set since and hence first-order optimality conditions are applicable. Given an optimizer , we have and therefore . Further, there exists such that (2) holds, more specifically
[TABLE]
Note that , and therefore (2) implies that . It follows that is an equilibrium of (8). Conversely, let be an equilibrium and therefore and . However, is spanned by , and therefore (2) holds. ∎
Remark 2*.*
The feedback law (8b) does not need to be implemented as a state-feedback controller. Assume that only output measurements are available, where is continuously differentiable. This gives rise to a differentiable input-output steady-state map . Further, instead of (5), consider the problem
[TABLE]
where is a cost function only depending on the system output and the control input.
Then, by substituting with in the objective function as before and computing the gradient of the reduced cost function, one arrives at the output-feedback law
[TABLE]
where .
If is an affine map, this feedback controller is equivalent to (8). To see this, note that with and therefore
[TABLE]
where .
Consequently, although (8b) is formulated in terms of the state , it is not necessarily a state feedback controller, and can be implemented as an output feedback law. The formulation in terms of the internal state is nevertheless important for the forthcoming stability analysis.
III-B Stability Analysis
Even though (3) and (7) are individually asymptotically stable by Assumption II.1 and Theorem II.2, respectively, the interconnection (8) is not guaranteed to be stable. However, under the following mild assumption we can derive conditions for the asymptotic stability of (8).
Assumption III.1**.**
For the objective function and the steady-state map in (5) there exists such that
[TABLE]
for all and all .
Remark 3*.*
Assumption III.1 is a weakened Lipschitz condition. It is for instance satisfied if is -Lipschitz continuous, in which case can be chosen as where is the Lipschitz constant of (which exists by Assumption II.1). However, in practice a tighter bound can often be established by exploiting the structure of and .
Our first main result establishes a sufficient condition for the asymptotic stability of (8) where we consider the metric as a design parameter. In particular, the bound illustrates the trade-off between the decay properties of the fast physical system and the gain of the slow optimization dynamics. This behavior will also be illustrated in Section III-C with the help of numerical examples.
Theorem III.2**.**
Consider (8) and let Assumptions II.1 and III.1 hold. If has compact level sets, then all trajectories of (8) are complete and converge to the set of first-order optimal points of (5) whenever
[TABLE]
where is a constant satisfying (9). Furthermore, and are constants associated with a Lyapunov function for (3) according to Proposition II.1. Finally, asymptotically stable equilibrium points of (8) are strict local minimizers of (5), and strict local minimizers are stable equilibria.
In many practical applications the righthand side of (10) can be estimated. The parameter is can be derived from model information (see Remark 3) and the parameters and can often be estimated from measurements of the decay rate of the open-loop system without explicitly formulating a Lyapunov function [28, Thm 5.17].
If where is constant, the bound (10) expresses a design condition on the global control gain .
Corollary III.3**.**
Consider the same setup as in Theorem III.2 and assume . Then, for all the system (8) is asymptotically stable.
Remark 4*.*
If the integrator of the controller is grouped together with the plant in order to make the feedback law purely proportional, then is an estimate of the input-to-state (ISS) gain of the augmented plant and is the ISS gain of the proportional feedback law. Hence, the condition (10) can also be interpreted as a small gain result: The product of the two gains has to be less than unity.
It is immediate that under the additional assumption of convexity the following stronger conclusion can be drawn.
Corollary III.4**.**
Consider the same setup as in Theorem III.2, and assume that is convex and is linear. Then, if (10) holds, all trajectories converge to the global minimizers of
[TABLE]
Proof of Theorem III.2
Our proof is similarly structured as in [25] and is inspired by ideas from singular perturbation analysis [30, 27]. Namely, we work towards an application of the LaSalle invariance principle. For this, we consider a LaSalle function of the form
[TABLE]
where is a convex combination coefficient. In this context, note that and that is essentially of the form (see Section -B) where is referred to as boundary-layer error coordinates in singular perturbation terminology and measures the deviation from the steady state.
First, we establish the requirement for to be non-increasing along the trajectories of (8). We then show that the level sets of are compact (and hence invariant) and therefore the invariance principle is applicable. Finally, we prove the connection between stability and optimality of equilibria.
Asymptotic Convergence
The following key lemma establishes an upper bound on the Lie derivative of .
Lemma III.1**.**
If for some , the 2-by-2-matrix
[TABLE]
is negative definite, then .
Furthermore, if is negative definite, then implies that and .
Proof.
The Lie derivative of along (8) is
[TABLE]
where . Each of the terms in (12) can be bounded.
Namely, for the first term we can do a rearrangement, apply Cauchy-Schwarz and Assumption III.1 (first inequality below) and use the definition of to write
[TABLE]
where is the unique positive definite square root of and .
According to Proposition II.1, we have for the second term in (12) that . Furthermore, for the third term we can apply Cauchy-Schwarz and the definition of as in (13) to arrive at
[TABLE]
Therefore the Lie derivative of is bounded by a quadratic function that can be rewritten in matricial form as
[TABLE]
where is given by (11). Clearly, if , then .
Finally, we note that if , then holds only if and . Hence the point is an equilibrium of (8), and satisfies the first-order optimality conditions of (5) by Proposition III.1. This completes the proof of Lemma III.1. ∎
In order to choose an appropriate that guarantees and therefore , we use Lemma A.1 in the appendix. Namely, by setting , , , and , we conclude that whenever we choose
[TABLE]
thus recovering the bound (10) in Theorem III.2.
Finally, we apply Lemma A.2 to find that the sublevel sets of are compact and therefore invariant. Consequently, all the requirements of the invariance principle are satisfied, and we conclude that all trajectories converge to the closure of the largest invariant subset for which . This, in turn, coincides with the set of critical points of (5).
Relation between Stability and Optimality
The fact that asymptotically stable equilibria are strict local minimizer has been shown in [25] for LTI plants and the standard metric. The proof extends to the present case without major modifications.
To show that strict local minimizers of (5) are stable, let be any compact neighborhood of in which is a strict minimizer of . We construct a neighborhood of such that every trajectory starting in remains in , thus proving stability.
Hence, consider the LaSalle function in the previous section, and let be such that where denotes the boundary of . Define which has a non-empty interior because . Furthermore, as a sublevel set of , the set is invariant since (with the proper choice of according to Lemma III.1). This establishes stability of .
III-C Examples of Gradient-Based Controllers
In the following we discuss three algorithms that, broadly speaking, can be considered variations or extensions of the basic gradient flow (4). In particular, we discuss their suitability for autonomous optimization and the limits of stability when interconnected with a dynamical system. Note that in Section -D we also present a more specific result for LTI plants.
III-C1 Basic Gradient Flows
In general, the conservativeness of the bound (10) depends largely on the specific problem. Figs. 2 and 3 illustrate this fact based on two random problem instances. In both examples, we consider, for simplicity, the case where (i.e., as in Corollary III.3), the cost function is convex quadratic, and the plant is LTI (and consequently is linear). In each case, we have (state dimension) and (input dimension).
In both cases, the interconnected gradient system (7) is stable for values of larger than . For , the feedback interconnection illustrated in Fig. 2 exhibits a similar convergence rate as the reduced system. However, for larger than instability of the interconnected system occurs.
For the second example (Fig. 3) the stability bound on is more conservative. For the interconnected system is stable, however, the convergence rate compared to the reduced system is significantly deteriorated. For this problem instance, instability occurs for values of larger than .
These examples illustrate not only the variable degree of conservativeness of our stability bound, but also the gradual performance degradation as the stability limit of the interconnected system is reached.
III-C2 Newton Gradient Flows
The classical Newton method finds widespread application in numerical optimization as a second-order method (i.e., requiring information about second-order derivatives) with superlinear convergence [31, Chap 3.3]. The continuous-time limit of the Newton method is given by a simple gradient flow of the form (4), namely,
[TABLE]
where serves to adjust the convergence rate.
For (14) to be well-defined, we may assume that is -strongly convex and twice continuously differentiable such that the metric is well-defined for all . Hence, convergence to the unique equilibrium is exponential and moreover isotropic, i.e., trajectories approach the equilibrium from all directions with the same speed. In other words, the linearization around the equilibrium point is given by .
In terms of stability, Newton flows are well-suited for the implementation as feedback controllers. Although the evaluation (or estimation) of the inverse Hessian of can pose computational problems.
Theorem III.2 can be directly applied to give a condition for asymptotic stability in closed loop. Namely, since is -strongly convex, we have that and therefore the following holds.
Corollary III.5**.**
Consider the same setup as in Theorem III.2 and assume that is -strongly convex and twice continuously differentiable. With the metric , the closed-loop system (8) is asymptotically stable and converges to the unique global minimizer of (5) whenever
[TABLE]
Compared to the previous results, the above bound on is invariant with respect to a uniform scaling of by a constant since this will scale both and by the same factor . Furthermore, the requirement that is strongly convex implies the uniqueness of the optimizer, but it does not necessarily require that the problem (5) is itself convex.
Fig. 4 illustrates, similarly to Figs. 2 and 3, the interconnection of an LTI plant with a Newton flow for a quadratic function. In this case is constant. As before, the interconnected system is stable even for larger than the theoretical bound in Corollary III.5, however, the convergence rate gradually worsens compared to the reduced system.
III-C3 Subgradient Flow (Non-Example)
Subgradient flows are the continuous-time version of subgradient descent and generalize gradient flows to the case where is not differentiable. Namely, assuming that is convex, its subgradient at is defined as the set
[TABLE]
As a set-valued map, gives rise to a dynamical system in the form of a differential inclusion .
Subgradient inclusions are well-defined (i.e., existence of generalized solutions is guaranteed under technical assumptions) and convergence to critical points is also assured. However, subgradient flows are in general not appropriate for feedback-based optimization.
Apart from issues relating to the physical implementability, Theorem III.2 is not applicable since Assumption III.1 is in general not satisfied. Namely, if is not continuously differentiable, then its gradient cannot be Lipschitz continuous.
In fact, subgradient flows in closed loop with a dynamical system are in general not asymptotically stable. To see this, consider a one-dimensional physical system in the form
[TABLE]
with and steady-state map . Further, as an objective we consider the absolute value that gives rise to a subgradient control law
[TABLE]
It is easy to see that this control law exhibits a bang-bang behavior that will not allow the closed-loop system to converge to the optimizer .
Fig. 5 illustrates this behavior for a higher dimensional setup where we minimize an objective function with an -regularization term in an attempt to promote sparsity of the minimizing state variables.
III-C4 Projected Gradient Flows
In order to model the input saturation as part of the system (3) that enforces a constraint on the inputs that cannot be violated, we resort the mathematical formalism of projected dynamical systems. For convenience, we have summarized the relevant key definitions in the appendix. Hence, instead of (5), we consider
[TABLE]
where is a regular set expressing constraints on the control inputs, e.g., limited actuation capacity. Given the gradient vector field where as before, a projected gradient flow is defined as
[TABLE]
where the projected gradient is defined according to (-C.1). Existence of so-called Carathéodory solutions is guaranteed by Theorems C.1 and C.2 (which is also an invariance principle).
A feedback implementation of (16) takes the form
[TABLE]
where . For the sake of simplicity, we do not consider the use of a variable metric , since that would require a more general definition of the projection operator in order to take into account oblique projections.
In fact, by definition of projected dynamical systems, it must holds that for all . Consequently, in the following, strictly speaking, Assumptions II.1 and III.1 have to hold only for instead of all .
Stability of (17) can be shown similarly to Theorem III.2:
Corollary III.6**.**
Consider the same setup as in Theorem III.2, but let the feedback control law be given by (17b). Then, the same conclusions as in Theorem III.2 hold. Namely, all trajectories of (17) are complete and converge to critical points of (15) whenever .
Proof.
The proof is analogous to the proof of Theorem III.2 with , but slightly different, because non-differentiable Carathéodory solutions (and their possible non-uniqueness) have to be considered instead of standard (differentiable) solutions, and Theorem C.2 has to be applied instead of the standard invariance principle for continuous dynamics. Nevertheless, the final stability bound remains the same.
The difference lies in the proof of Lemma III.1. In particular, when deriving the bound for the term with we make use of Lemma C.3 which states that
[TABLE]
where and . Hence, instead of (13) we can establish the bound
[TABLE]
where we have used Lemma C.3 to establish the last inequality. Thus, the bound is the same bound as in (13). ∎
Hence, input saturation that can be modeled by a projected dynamical system does not pose an obstacle in our timescale separation analysis, other than the fact that a specialized notion of solution and existence results inherent to projected dynamical systems have to be used.
IV Momentum-Based Controllers
We now consider a class of optimization dynamics that arises as the so-called momentum methods [32] which have recently gained renewed interest in the context of machine learning but have not yet been extensively considered for feedback-based optimization. In the following, we primarily consider a continuous-time generalization of Polyak’s heavy-ball method [33] interconnected with a physical system and derive a stability requirement analogous to Theorem III.2. With a counter-example at the end of this section we show that time-varying optimization dynamics are in general not suited for feedback-based optimization, in particular, if they do not exhibit uniform asymptotic convergence. Namely, for a continuous-time version of Nesterov’s accelerated gradient method [34] which violates our analysis assumptions, we show that the interconnection with a exponentially decaying physical system is in general not asymptotically stable. This feature is not surprising since an online implementation of this algorithm is a time-varying controller with asymptotically infinite gain.
Given a continuous metric and a differentiable objective function , as before, we consider continuous-time heavy-ball dynamics of the form
[TABLE]
where denotes a momentum variable, and is a positive definite damping matrix depending on .
Asymptotic convergence of the optimization dynamics (18) is guaranteed by the following result.
Theorem IV.1**.**
If has compact level sets, then the dynamical system (18) is asymptotically stable, and all trajectories converge to the set of points such that and . In particular, if is convex, then convergence is to the set of global optimizers of the optimization problem
[TABLE]
Proof.
Consider the LaSalle function . Its Lie derivative along the trajectories of (18) is
[TABLE]
Furthermore, note that the sublevel sets of are compact. This leads us to conclude that all trajectories of (18) converge to the largest invariant subset for which . This, in turn, implies and is constant on . Furthermore, since is constant on , we need and consequently which corresponds to being a critical point of (19). ∎
IV-A Control Design & Stability Analysis
As before, we are primarily interested in the stability of the interconnection between (18) with a physical system (3), that is, we consider systems of the form
[TABLE]
where and are defined as before.
Similarly to Theorem III.2 we derive a requirement on and that guarantees asymptotic stability of (20).
Theorem IV.2**.**
Consider (20) and Assumptions II.1 and III.1 hold. If has compact sublevel sets, then all trajectories of (20) converge asymptotically to the set of points for which and is a critical point of (5) whenever it holds that
[TABLE]
where is a constant satisfying (9). Further, and are constants associated with a Lyapunov function for (3) according to Proposition II.1.
Theorem IV.2 gives a design condition on and expresses a trade-off between the two. Namely, acts as a generalized gain in the same way as in Theorem III.2, whereas a large damping has a stabilizing effect.
Analoguous corollaries and facts as for Theorem III.2 can be developed for Theorem IV.2. For example, one can show that asymptotically stable equilibria of (20) are optimizers of .
Proof.
As in the proof of Theorem III.2, we consider a LaSalle function of the form
[TABLE]
where is a convex combination parameter and . The Lie derivative of is
[TABLE]
where and .
The four terms in the expression of can be bounded as follows. For the first two terms we have
[TABLE]
where we have used and . Note that in the fourth equation, the first and the last term cancel out.
For the third and fourth term we get as before
[TABLE]
With these bounds, we can upper-bound with a quadratic function where
[TABLE]
Thus, we can apply Lemma A.1 with , , , and which yields that whenever
[TABLE]
The remainder of the proof analogous to the proof of Theorem III.2. Namely, Lemma A.2 serves to certify that has compact (and hence invariant) sublevel sets for an appropriate choice . Thus, solutions converge to the largest invariant subset for which , which, in turn, is equivalent to the points for which , and . ∎
IV-B Non-Example: Accelerated Gradient Flows
A special and widely popular variation of (18) consists in making the damping decay over time. Namely, in [35, 34] the authors show that the ODEs of the form
[TABLE]
can be interpreted as continuous-time limit of Nesterov’s accelerated gradient descent.
As before, we can derive a feedback controller from (22). Strictly speaking, Theorem IV.2 does not apply to this type of time-varying control, but an extension is possible.
Nevertheless, as one can easily see, with a damping term that decays monotonically over time, the bound (21) eventually (i.e., for large enough) fails to hold and the feedback interconnection between a physical system and the accelerated gradient dynamics will become unstable. In other words, the feedback controller is time-varying with asymptotically infinite gain. This behavior is illustrated in Fig. 6 for a one-dimensional plant with , steady-state map , and where .
This example violates our assumptions (thus indicating the sharpness of our analysis) and fails in practice (showing a general limitation of autonomous optimization).
V General Optimization Dynamics
Next, we turn to more general optimization algorithms. A particular class that we cover with the subsequent analysis are saddle-point flows (see [36, 37] and references therein), that can be interpreted controllers with memory.
Hence, in this section, we consider the general dynamics
[TABLE]
where is an internal variable of the controller and are define the controller behavior, and is the steady-state map of the plant.
For autonomous optimization, the dynamics (23) are chosen such that their equilibria correspond to criticial points of a predefined optimization problem. As an example of (23), we later consider the case of primal-dual saddle-point flows that have been successfully applied to enforce constraints on the output variables of a physical system.
The only assumption that we require are Lipschitz continuity and a Lyapunov function for (23).
Assumption V.1**.**
The vector field is -Lipschitz in , i.e., for all , and all and one has
[TABLE]
Assumption V.2**.**
The reduced vector field , where and , is -Lipschitz, i.e., for all and it holds that
[TABLE]
Assumptions V.1 and V.2 can be relaxed or combined in several ways. For instance, if the norm of the map is bounded by , then choosing such that will satisfy Assumption V.2.
Further, Assumptions V.1 and V.2 guarantee the existence of complete solutions to both the reduced system (23) as well as the dynamic interconnection which takes the form
[TABLE]
where is a control gain and tuning parameter.
Assumption V.3**.**
The system (23) has a unique equilibrium point , and there exists a positive definite Lyapunov function according to Proposition II.1. Namely, there exist such that
[TABLE]
where .
Remark 5*.*
Assumption V.3 is in particular satisfied if the vector field is -strongly monotone, i.e.,
[TABLE]
holds for all and , and it has a unique equilibrium point . In this case, we have and .
In the same spirit as Theorems III.2 and IV.2 we can derive a requirement on that guarantees asymptotic stability of (24).
Theorem V.1**.**
Under Assumptions II.1, V.1, V.2, and V.3 all trajectories of (24) converge asymptotically to whenever is chosen such that
[TABLE]
Similarly to the bounds in Theorems III.2 and IV.2 the bound (25) contains the term . However, the generality of the bound (25) comes at the expense of another factor that deteriorates the stability bound depending on the conditioning of the reduced vector field.
Proof.
Analogously to the proofs of Theorems III.2 and IV.2, we consider a LaSalle function of the form
[TABLE]
The Lie derivative of is given by
[TABLE]
For the first two terms of can be bounded as
[TABLE]
For the third and fourth term we have
[TABLE]
Hence, we can establish a quadratic bound on as a function where
[TABLE]
Lemma A.1 with , , , and guarantees negative definiteness of for
[TABLE]
which simplifies (25). The remainder of the proof is the same as before in Theorems III.2 and IV.2. ∎
V-A Example: A weak bound for Convex Gradient Flows
Theorem V.1 can also be applied to the algorithms in the previous sections, but in this case the stability bound (25) is weaker than previous tailor-made conditions. To compare Theorem V.1 and Theorem III.2 we reconsider the case of a gradient-based feedback controller as given by the system (8) with the metric . Further, assume that has a -Lipschitz gradient and is -strongly convex. Thus, Assumptions V.2 and V.3 are satisfied with and , respectively.
Then, one can choose as the Lyapunov function according to Assumption V.3 with . The parameter of Assumption III.1 and Assumption V.1 coincide. It follows from Theorem V.1 that the feedback gradient system is asymptotically stable for which is weaker than the bound in Theorem III.2 by at least a factor 2 because is the Lipschitz constant and the modulus of strong convexity of and therefore .
V-B Example: Primal-Dual Saddle-Point Flow
A key requirement of many autonomous optimization scenarios is the satisfaction of constraints. As seen previously, constraints on the input variable can be (strictly) enforced, e.g., by projection. Incorporating constraints on the state (or output) variables is trickier and they need to be treated as constraints that can be violated during the transients. For this purpose, saddle-point flows have proven to be an adequate tool. As an illustrative example, instead of (5), we consider
[TABLE]
where and . Namely, defines a constraint on the state variables that has to be satisfied asymptotically at steady state. After eliminating from (26), the augmented Lagrangian is given by
[TABLE]
where is an augmentation parameter.
The corresponding augmented saddle point flow is given by
[TABLE]
Note that equilibria of (27) and critical points of (26) coincide.
In a feedback interconnection with a physical system we instead replace with the measured value of to arrive at
[TABLE]
Intuitively, augmented saddle-point flows, implemented as feedback controllers, provide a proportional and integral feedback of the measured constraint violation , hence acting as PI-control (on top of the integral controller that defines the optimization dynamics). Namely, the augmentation term results in the proportional component, whereas the dual variable yields the integral term.
Clearly, (28) falls into the class of systems of the form (24). Furthermore, Assumptions V.1 and V.2 are in general satisfied and and depend on the optimization problem only.
The application of Theorem V.1 hinges on Assumption V.3 and therefore on the existence of an explicit Lyapunov function for the dynamics (23). For the special case (26), this assumption is satisfied [38]. Whether this setup can be generalized, remains open and a topic of active study [39, 40].
Nevertheless, the numerical simulations of (28) of randomized problem instances suggest that the interconnection of a saddle-point flow and a dynamical plant has benign stability properties with a stability threshold on .
Fig. 7 illustrates, as before, the (stable) interconnection of an LTI plant (with and ) and saddle-point flow (27) where is a quadratic function and (# of output constraints). Namely, after an initial transient, the physical plant remains almost at steady state and the interconnected system closely tracks the trajectory of the reduced system and converges to the optimizer of (26).
VI Conclusion
We have studied the implementation of different types of optimization algorithms as feedback controllers with the goal of steering a physical system to a steady state that (locally) solves a predefined optimization problem. In particular, we have derived stability bounds inspired by singular perturbation analysis that guarantee closed-loop stability. We have illustrated the generality of our approach by treating three general classes of algorithms and several specific instances. In general, our approach only requires limited information about a Lyapunov function for plant dynamics and Lipschitz constants for the optimization problem.
Our results give immediate prescriptions for the design of feedback controllers that are easy to evaluate. The conservativeness of our bounds is domain-specific, but they are sometimes of practical relevance, for instance, in power system [25].
While our work establishes stability conditions, it does not give quantitative results on the rate of convergence, the robustness against noise, or the tracking performance for time-varying problem setups. All of these question remain open problems and are the subject of ongoing research. Further, it is unclear whether for the case of discrete-time systems corrupted by noise analogous stability results can be derived by using so-called stochastic approximations. Finally, from a practical perspective, it is highly desirable to solve the corresponding design problem, i.e., to optimize the metric with respect to a given robustness objective. Although it is relatively simple formulating such a problem, its solvability is unclear.
-A Technical Results
Lemma A.1**.**
Consider a -matrix defined as
[TABLE]
where , , and . If and , then is negative definite.
The proof of Lemma A.1 is standard [30, pp.296].
Lemma A.2**.**
Consider a system satisfying Assumption II.1 with a Lyapunov function and a steady-state map . Further, let where is continuous and has compact level sets. Then, has compact sublevel sets.
Lemma A.2 is a straightforward extension of [25, Lem. 4]. We provide a proof for completeness.
Proof.
Consider a sublevel set , for some . Since we have , implies that . But since has compact sublevel sets, there exist such that for all .
On the compact set the continuous function is also lower bounded by some value . We therefore have that in . As is positive definite, we must have that . We then have
[TABLE]
where is the Lipschitz constant of , and therefore is also bounded for all . ∎
-B Proof of Proposition II.1
We use the change of coordinates such that (3) can be written as
[TABLE]
By Lipschitz continuity of (in and ) and , we have
[TABLE]
where the last inequality follows from the triangle inequality and Cauchy-Schwarz.
Let denote the solution of (-B.1) at time that starts in for fixed . Define
[TABLE]
with . Analogously to the proof of [27, Th. 4.14], it can be shown that satisfies
[TABLE]
with , and .
Next, we proceed similarly as in the proof of [27, Lem. 9.8]. The sensitivity function of the solution with respect to changes in [27, Ch. 3.3], satisfies the ODE
[TABLE]
with . Using Lipschitz continuity of we get
[TABLE]
Applying a special case of the Gronwall inequality [41, Cor. 6.2] (because is monotone increasing) yields the bound , and we have
[TABLE]
where we have used by exponential stability and .
Finally, we can reverse the change of coordinates by defining . We immediately have the desired bounds
[TABLE]
and the time derivative of with respect to (3) as
[TABLE]
For the final bound, note that we have
[TABLE]
where . This completes the proof.
-C Projected Dynamical Systems
For convenience, we restrict ourselves to a simplified definition of projected dynamical systems that is centered around regular sets as defined in Section II. For a more comprehensive treatment the reader is referred to [42]. We define a projection operator for a regular set , , and as
[TABLE]
that is, projects a vector onto the tangent cone of at the point . Since is a closed convex set for any , the minimum norm projection of on exists and is unique, and is well-defined. Furthermore, it holds that for all since is a cone. Further, we have the following crucial property [42, Lem. 4.5]:
Lemma C.3**.**
For a regular set , , and , there exists a unique such that . Further, it holds that and .
A projected dynamical system is thus defined by applying the projection operator to a standard vector field at every point. This leads to the initial value problem
[TABLE]
where denotes an initial condition.
In general, is not continuous and standard existence results for ODEs do not apply. Instead, a (Carathéodory) solution to (-C.2) is defined as an absolutely continuous function for some and , and for which holds almost everywhere, i.e., for almost all . In particular, a solution has to remain in .
The following two theorems establish the existence of solutions to (-C.2). First, according to Corollary 5.2 in [42], we have local existence of Carathéodory solutions:
Theorem C.1** (Local Existence).**
Let be a regular set and let be a locally Lipschitz vector field. Then, for every there exists a local solution of (-C.2) for some .
Second, Proposition 8.6 in [42] provides an invariance principle for projected dynamical systems that can also serve to certify the existence of complete solutions.
Theorem C.2** (Invariance Principle).**
Consider (-C.2) with regular and locally Lipschitz. Furthermore, let be continuously differentiable with compact sublevel sets . If it holds that for all , then every solution to (-C.2) starting at is complete and will converge to the largest weakly invariant subset of .
-D LTI systems
In the following, we show how for LTI plant dynamics, the previously developed stability bounds take a particularly easy form and can, in fact, be made independent of the internal state representation. This also allows us to formulate a simple example in which our stability bound is tight.
For simplicity, we limit ourselves to the case of gradient-based controllers, although the same ideas can be extended to other classes of optimizing feedback controllers.
Hence, instead of (3) we consider the special case
[TABLE]
where , , and is a fixed, but unknown, disturbance.
For a fixed , exponential stability of (-D.1) is equivalent to being Hurwitz (i.e., only having eigenvalues with negative real part) and consequently being invertible. Hence, the steady-state map takes the explicit form
[TABLE]
where and .
Furthermore, let be such that
[TABLE]
and hence is a Lyapunov function satisfying the conditions of Proposition II.1. In particular, we have since
[TABLE]
and
[TABLE]
This allows us to directly state the following corollary to Theorem III.2:
Corollary D.3**.**
Consider the a plant of the form (-D.1) with and satyisfying (-D.2). Further let Assumption III.1 hold and have compact level sets. Then, the same conclusions as in Theorem III.2 hold whenever
[TABLE]
where satisfies (9).
In particular, (-D.3) is satisfied if it holds that
[TABLE]
where is the condition number of .
Remark 6*.*
In [25], instead of (-D.2), the dynamical system is only required to satisfy . This is more easily solvable, but yields a suboptimal estimate of the decay rate and therefore a more conservative stability bound.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Z. E. Nelson and E. Mallada, “An integral quadratic constraint framework for real-time steady-state optimization of linear time-invariant systems,” in 2018 Annual American Control Conference (ACC) , Milwaukee, WI, Jun. 2018, pp. 597–603.
- 2[2] M. Colombino, E. Dall’Anese, and A. Bernstein, “Online Optimization as a Feedback Controller: Stability and Tracking,” IEEE Trans Control Netw Syst. , 2019.
- 3[3] F. D. Brunner, H. Dürr, and C. Ebenbauer, “Feedback design for multi-agent systems: A saddle point approach,” in 2012 IEEE 51st IEEE Conference on Decision and Control (CDC) , Dec. 2012, pp. 3783–3789.
- 4[4] L. S. P. Lawrence, Z. E. Nelson, E. Mallada, and J. W. Simpson-Porco, “Optimal Steady-State Control for Linear Time-Invariant Systems,” in 2018 IEEE Conference on Decision and Control (CDC) , Miami Beach, FL, Dec. 2018, pp. 3251–3257.
- 5[5] C. E. Garcia and M. Morari, “Optimal operation of integrated processing systems: Part II: Closed-loop on-line optimizing control,” AI Ch E J. , vol. 30, no. 2, pp. 226–234, Mar. 1984.
- 6[6] S. Skogestad, “Self-optimizing control: The missing link between steady-state optimization and control,” Comput Chem Eng. , vol. 24, no. 2-7, pp. 569–575, Jul. 2000.
- 7[7] A. Marchetti, B. Chachuat, and D. Bonvin, “Modifier-Adaptation Methodology for Real-Time Optimization,” Ind Eng Chem Res. , vol. 48, no. 13, pp. 6022–6033, Jul. 2009.
- 8[8] G. Francois and D. Bonvin, “Measurement-Based Real-Time Optimization of Chemical Processes,” Adv Chem Eng. , vol. 43, pp. 1–508, 2013.
