Time Distributed Optimization for Model Predictive Control: Stability, Robustness, and Constraint Satisfaction
Dominic Liao-McPherson, Marco Nicotra, and Ilya Kolmanovsky

TL;DR
This paper introduces a systems theoretic framework for analyzing time distributed optimization in model predictive control, focusing on stability, robustness, and constraint satisfaction, and extends theoretical results for real-time iteration schemes.
Contribution
It provides a general stability analysis framework for time distributed optimization in MPC, extending existing theory for real-time iteration methods.
Findings
The framework guarantees stability and constraint satisfaction under certain conditions.
Numerical simulations validate the effectiveness of the proposed approach.
The analysis enhances understanding of robustness in time distributed optimization schemes.
Abstract
Time distributed optimization is an implementation strategy that can significantly reduce the computational burden of model predictive control by exploiting its robustness to incomplete optimization. When using this strategy, optimization iterations are distributed over time by maintaining a running solution estimate for the optimal control problem and updating it at each sampling instant. The resulting controller can be viewed as a dynamic compensator which is placed in closed-loop with the plant. This paper presents a general systems theoretic analysis framework for time distributed optimization. The coupled plant-optimizer system is analyzed using input-to-state stability concepts and sufficient conditions for stability and constraint satisfaction are derived. When applied to time distributed sequential quadratic programming, the framework significantly extends the existing…
| Name | Symbol | Value |
|---|---|---|
| Mass | ||
| Yaw Inertia | ||
| Front, Rear CG distance | ||
| Coefficient of friction | ||
| Tire parameters | ||
| Lateral Area | ||
| Air Density | ||
| Lateral Drag Coefficient | 1.5 | |
| Longitudinal Velocity |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Time Distributed Optimization for Model Predictive Control: Stability, Robustness, and Constraint Satisfaction
Dominic Liao-McPherson [email protected]
Marco M. Nicotra [email protected]
Ilya Kolmanovsky [email protected] Department of Aerospace Engineering, University of Michigan, 1221 Beal Avenue, Ann Arbor, MI 48109
Department of Electrical, Computer, and Energy Engineering, University of Colorado Boulder, 425 UCB, Boulder, CO 80309
Abstract
Time distributed optimization is an implementation strategy that can significantly reduce the computational burden of model predictive control by exploiting its robustness to incomplete optimization. When using this strategy, optimization iterations are distributed over time by maintaining a running solution estimate for the optimal control problem and updating it at each sampling instant. The resulting controller can be viewed as a dynamic compensator which is placed in closed-loop with the plant. This paper presents a general systems theoretic analysis framework for time distributed optimization. The coupled plant-optimizer system is analyzed using input-to-state stability concepts and sufficient conditions for stability and constraint satisfaction are derived. When applied to time distributed sequential quadratic programming, the framework significantly extends the existing theoretical analysis for the real-time iteration scheme. Numerical simulations are presented that demonstrate the effectiveness of the scheme.
keywords:
Real-time optimization, Model predictive control, Real-time iterations, Input-to-state stability, Constrained control, Control of nonlinear systems
††thanks: This research is supported by the Toyota Research Instituite (TRI) and by the National Science Foundation through awards CMMI 1904441 and CMMI 1562209. TRI provided funds to assist the authors with their research but this article solely reflects the opinions and conclusions of its authors and not TRI or any other Toyota entity.
, ,
1 Introduction
Model Predictive Control (MPC) [22] is a widely used control technique that computes control actions by solving an Optimal Control Problem (OCP) over a finite receding horizon. MPC can systematically handle constraints and nonlinearities but is challenging to implement since it requires the solution of a constrained and potentially non-convex OCP at each sampling instant. The development of robust and efficient quadratic and convex programming solvers, see e.g., [34, 42, 11, 43], has enabled the application of linear-quadratic MPC to a wide variety of systems. However, the implementation of MPC for systems with limited onboard computing power, fast sampling rates, and/or pronounced nonlinear dynamics remains an open problem.
One approach for reducing the computational cost of MPC is time distributed optimization (TDO). TDO distributes optimizer iterations over time by exploiting the robustness of MPC to suboptimality [2, 41, 35]. Rather than accurately solving the OCP at each sampling instant, TDO maintains a guess of the optimal solution and improves it at each timestep by performing a finite number of iterations of an optimization algorithm. TDO can be interpreted as a dynamic compensator that maintains a solution estimate as an internal state, the dynamics of which are defined by the optimizer iterations. As illustrated in Figure 1, this interpretation differs from “ideal”, or “optimal” MPC which is an implicitly defined static feedback law.
There are a variety of TDO variants proposed in the literature. The stability of input constrained TDO controllers using linearly convergent optimization algorithms are studied in [18]. Unconstrained suboptimal NMPC without terminal conditions is considered in [21]. A fixed point scheme for input constrained MPC of sampled data input affine systems is proposed in [17], a gradient based dynamic programming approach is considered in [49], a proximal gradient method for linear input-constrained MPC is studied in [51], and a continuous time gradient flow based approach is described in [38]. These methods use some combination of shifting terminal control updates and first order optimization methods. In [2, 41], a generic suboptimal MPC scheme is considered and sufficient conditions on the warmstart for robust stability are derived; the optimization algorithm is not specified and its convergence is not considered. The robustness of MPC to disturbances arising from incomplete optimizations is considered in [55, 20] and conditions for complexity certification of suboptimal state constrained linear MPC are presented in [46]. However, the treatment of the optimizer itself as a dynamic system was not pursued.
An alternative to gradient based approaches are second order methods. In particular, Time Distributed Sequential Quadratic Programming (TD-SQP) methods are attractive since they can be implemented using existing Quadratic Programming (QP) solvers. The fundamental idea behind a TD-SQP based model predictive controller is to apply a finite number of SQP iterations at each sampling instant and to warmstart the iterations with the solution estimate from the previous sampling instant. A widely used variant of TD-SQP is the Real-Time Iteration (RTI) scheme [6] which uses a Gauss-Newton Hessian approximation and performs a single SQP iteration per sampling instant. The RTI scheme has been successfully applied to a variety of applications including engines [1, 56], kites[25], cranes [52], ground vehicles [15], race cars[36], distillation columns [9] and wind turbines [19]. Software for implementing the RTI scheme is provided by the ACADO toolkit [24]. Despite its widespread success, formal stability guarantees for the RTI scheme have only been provided in the absence of inequality constraints [8].
It should be noted that TDO is distinct from so-called suboptimal solution tracking, sensitivity, or running methods, e.g., [55, 54, 32, 10, 16, 27] which are tailored numerical methods for tracking the solutions of parameterized nonlinear programs/generalized equations. These methods are typically used to accelerate or replace existing nonlinear programming solvers to reduce computation times, which is different from considering the dynamic interactions between the plant and the optimizer. Some of them, e.g., [55], consider robust stability by treating suboptimality as a bounded disturbance. This differs from our approach where we treat suboptimality as the output of a dynamic system which is coupled with the closed-loop plant.
This paper begins by presenting a system theoretic framework for analyzing a broad class of TDO algorithms. Specifically, the framework applies to any MPC feedback law that is Locally Input-to-State Stable (LISS) combined with any optimization algorithm featuring a convergence rate that is at least locally -linear. Any MPC formulation with proven LISS properties can be used, including nominal MPC with terminal constraints [35], robust MPC111A nominal MPC controller does not explicitly consider the presence of disturbances in the OCP formulation, unlike a robust MPC controller. [35], and MPC formulations with no terminal constraints [22, Chapter 6]. in this paper, we establish the existence of a joint region of attraction for the state and solution estimate, i.e., we show that if the initial state is sufficiently close to the origin and if the initial solution estimate is sufficiently accurate, the state will converge to the origin and the estimate will converge to the optimal solution. Moreover, we analyze the effect of performing more iterations, establish robustness properties, and show that, if the initial solution guess is within the convergence basin of the optimization method, TDO can recover the robust region of attraction of optimal MPC with a finite number of iterations.
The proposed theoretical framework is then specialized to the RTI scheme. Our analysis extends that in [8] as follows: (i) We explicitly consider inequality constraints and relax the terminal state constraint to a terminal set constraint; (ii) We explicitly consider the robustness properties of the RTI scheme by establishing LISS of the closed-loop system; (iii) We analyze the effect of the number of SQP iterations performed at each sampling instant and establish sufficient conditions for robust constraint satisfaction. We also provide a proof which extends the classical preconditioned fixed-point type analysis of Newton’s method, see [31, Section 5.4.2], to the setting of generalized equations and establish conditions under which discrete time optimal control problems with polyhedral constraints are strongly regular. The latter property is important since it is a sufficient condition for Lipschitz continuity of the optimal value function and thus for robust stability.
This paper builds upon the results in [33] which analyzes the stability of MPC implemented using a suboptimal semismooth predictor-corrector (SSPC) method. Specifically, we generalize the previous results for suboptimal SSPC to a wide class of optimizers which are at least q-linearly convergent. Moreover, we consider external disturbances in our analysis and analyze several different variants of the RTI scheme.
The layout of the paper is as follows. We review pertinent notation and concepts in Section 2 then describe the problem setting and the class of optimization algorithms we consider in Sections 3 and 4. We establish the ISS properties of the optimization algorithms and of the coupled plant-optimizer system in Sections 5 and 6. Next, we discuss the strong regularity assumption in Section 7, we discuss relevant SQP methods in Section 8, and we illustrate how they fit into our optimization framework in Section 9. Finally, we present simulation results in Section 10.
2 Preliminaries
We denote by the non-negative (positive) integers and by the non-negative (positive) reals. For a discrete time system
[TABLE]
given an initial state , and an input sequence we denote its solution by . For a vector, denotes the usual Euclidean norm, for we let . We use as shorthand for . Recall that a function is said to be of class if it is continuous, strictly increasing and . If it is also unbounded, then . A function is said to be of class if for each fixed and as for fixed . If we denote their composition by . We use to denote the identity matrix, and use to denote the identity function. We denote the domain of a set-valued mapping by . If is a matrix then is its th row. If is an index set, is its cardinality and denotes the row wise concatenation of . For two vectors, denotes vertical concatenation. We denote the unit ball centered at by , it is understood that . If is a closed neighbourhood of the origin, we denote its radius by , i.e., the largest such that . The normal cone mapping of a closed-convex set is defined as
[TABLE]
and set addition/subtraction is defined as
[TABLE]
We make extensive use of the concept of input-to-state stability[29]. Since MPC is a constrained control technique, meaning that it is intrinsically not “global”, it is natural to consider a local variant.
Definition 1** (LISS[28]).**
A system (1) is said to be Locally Input-to-State Stable (LISS) if there exists , , and such that, ,
[TABLE]
provided and .
Definition 2** (Asymptotic gain[29]).**
Consider system (1), we say that it has an asymptotic gain if there exists some such that
[TABLE]
for all .
3 Problem Setting and Control Strategy
Consider the following discrete time system,
[TABLE]
where , and denote the state, input, and disturbance. Throughout this paper we assume full state feedback and that the following holds.
Assumption 1**.**
The function in (4) is twice continuously differentiable in its first two arguments, Lipschitz continuous in the third, and . Moreover, the sets , and are compact and contain the origin.
We wish to control (4) using MPC and thus consider an OCP of the following form,
[TABLE]
where is the horizon length, are the constraints, is the terminal state constraint, is the state sequence, and is the control sequence. The OCP (5) is parameterized by the measured system state . We impose the following conditions on (5) to ensure that it is well posed and can be used to construct a stabilizing control law for (4).
Assumption 2**.**
All functions in (5) are twice continuously differentiable in their arguments and their second derivatives are Lipschitz continuous.
Assumption 3**.**
The stage cost satisfies , and there exists such that for all . The terminal set is a subset of , contains the origin in its interior, and is an admissible control invariant set for (4), i.e., for all there exists such that and . is a Control Lyapunov Function for (4) with , such that,
[TABLE]
for all , where .
Denote by
[TABLE]
the set of feasible parameters. Under Assumptions 1 and 2, the set is compact and, for all , a minimum of (5) exists [37]. The ideal/optimal MPC feedback policy is then
[TABLE]
where is a global minimizer of (5). To address the effects of incomplete optimization, we consider the perturbed closed loop system
[TABLE]
where the control signal is corrupted by an additive disturbance that represents suboptimality, i.e., . Before stating the stability properties of (8), recall the following notion.
Definition 3** (RPI set [35]).**
Given suitable sets and , a set is a Robust Positively Invariant (RPI) set for system (8) if for all , , . In addition, if , then is called an admissible RPI set.
The following theorem summarizes the LISS properties of nominal MPC.
Theorem 1**.**
[35, Theorem 4]** Let Assumptions 1 - 3 hold and suppose that , the optimal value function for (5), is uniformly continuous. Then, the closed-loop system (8) is LISS with respect to on a non-empty RPI set . Moreover, there exist such that if and , then is an admissible RPI set for (8).
In this paper we consider the situation where not enough computational resources are available to accurately solve (5) at each sampling instant. Instead we will approximately track solution trajectories of (5) as the measured state in (5c) varies over time. To track the solution trajectories, we use an appropriate iterative optimization algorithm, e.g. SQP, which is warmstarted at time instance with the approximate solution from . In this way we construct a dynamic system,
[TABLE]
where is an estimate of the primal-dual solution of (5) and represents a fixed number of optimizer iterations ( is formally defined in (13)). This leads to the interconnected system illustrated in Figure 1. The objective of this paper is to analyze the interconnection between the plant and the dynamic controller from a systems theoretic perspective.
Remark 1**.**
This paper focuses on a common nominal MPC formulation for the sake of clarity. However, our analysis is applicable to any MPC formulation for which it is possible to prove LISS e.g., formulations that employ exact penalty functions [5] or robust MPC formulations [35]. Moreover, note that in (7) the MPC feedback law is defined using a global optimum of (5). This requirement is not intrinsic to our analysis but rather an artifact of the specific MPC formulation. Our analysis is performed relative to a nominal “ideal” MPC feedback law. If the nominal feedback law is input-to-state stable our analysis is applicable regardless of whether the nominal feedback law is globally optimal or not. For example, one could use the dual-mode MPC formulation in [48] which does not require global optimality.
4 Optimization Strategy
In this section we describe the class of optimization algorithms considered in this paper. We start in an abstract setting to clarify which properties are essential to our analysis. Later in Sections 8 and 9 we will illustrate how SQP fits into this framework.
Suppose the first order necessary conditions for (5) can be written as a parameterized Generalized Equation (GE) of the form
[TABLE]
where is the normal cone mapping222See [14] for more background on set-valued and normal cone mappings. of a closed, convex set , is a function, are the optimization variables and is the parameter. Its solution mapping is
[TABLE]
which can be set valued. Because (10) are necessary conditions for (5) and a minimizer of (5) exists under Assumptions 1 and 2 [37] we have that .
Many optimization algorithms are designed to “solve” necessary conditions. We thus associate (10) with an iterative optimization algorithm of the form
[TABLE]
where . Multiple iterations of the algorithm can be represented by the action of the function which is defined recursively by the sequence
[TABLE]
where is the iteration number and .
Remark 2**.**
The optimality conditions of (5) can be written in multiple forms depending on the choice of (10) and (12). For example, can either be treated as a decision variable or a function of . As a result, the definition of is not unique and is chosen by the designer of the optimization algorithm. Specifically, the vector always includes the control sequence, but may also include the state sequence and/or the Lagrange multipliers associated with equality (dynamic) or inequality constraints.
In this paper we consider algorithms that converge at least q-linearly for a fixed parameter . The following definition formalizes this notion.
Definition 4** (At least q-linear convergence).**
For any and an optimization algorithm converges to if there exists such that
[TABLE]
for all . If there exists and such that
[TABLE]
for all and then is said to converge at least q-linearly over .
Remark 3**.**
The necessary conditions (10) may be satisfied at stationary points or local maxima as well as local minima. We perform a local analysis that allows us to exclude those points.
Next we consider the regularity properties of (11), we will use of the following regularity condition for generalized equations.
Definition 5** (Strong Regularity[44]).**
A set-valued mapping is said to be strongly regular at for if and there exist neighborhoods of and of such that the truncated inverse mapping is single-valued, i.e., a function, and is Lipschitz continuous on .
Strong regularity reduces to non-singularity of the Jacobian matrix if is a continuously differentiable function. Our main regularity assumption follows. It establishes that any solution trajectories are Lipschitz continuous. In Section 7, we discuss conditions for strong regularity for specific instances of (10).
Assumption 4**.**
All points satisfying that correspond to minimizers are strongly regular .
The following theorem shows that Assumption 4 ensures that the notion of tracking a local solution trajectory is well defined.
Theorem 2**.**
[13]** Let the parameter be a Lipschitz continuous function of . Then each solution trajectory is isolated and Lipschitz continuous.
A local optimization algorithm, such as SQP, can be used to track a specific “isolated branch” of the solution mapping. The branch is implicitly selected through the choice of initial guess supplied to the algorithm. Some MPC formulations require global optima while some do not, as discussed in Remark 1. In practice, local methods like SQP are often used regardless due to the prohibitive computational complexity of global methods.
Remark 4**.**
To summarize, an algorithm/optimality condition pair fits in our framework if:
- •
The optimality conditions can be written in the form (10) and satisfy Assumption 4.
- •
The algorithm can be written in the form (12).
- •
The algorithm is at least q-linearly convergent.
5 LISS of time-distributed optimization
Consider the application of TDO to problem (5). In a real-time setting it is only possible to perform a finite number of iterations per sampling instant which we denote by . In this situation the optimizer can be viewed as a dynamical system of the form,
[TABLE]
where is a surjective matrix that selects from the solution estimate, i.e., where
[TABLE]
is an isolated single valued restriction333Our analysis is performed relative to the ideal feedback law . Any choice of the restriction that renders the origin of the closed loop system (8) LISS is admissible. of (this is possible due to Theorem 2).
In this section we establish conditions under which (16) is LISS. We consider the associated error system,
[TABLE]
where , and . The error system can be explicitly constructed as follows
[TABLE]
Lemma 1**.**
Consider (16) and its error system (18) and suppose that is at least q-linearly convergent. Further, let Assumption 4 hold. Then, there exists , such that the error satisfies
[TABLE]
subject to the restriction
[TABLE]
where is the convergence radius in Definition 4 and is the Lipschitz constant of over . Further, and are monotonically decreasing with , , , and .
{pf*}
Proof. If for some fixed then the error bound (15) implies that
[TABLE]
for all , d where . Now consider any and let . Then, applying (21) with , we obtain that
[TABLE]
where we have used that is Lipschitz on with constant by Assumption 4. Recall that denotes the convergence radius of in . A sufficient condition for the restriction is then
[TABLE]
Continuing and imposing ,
[TABLE]
If , then since and . Otherwise, note that
[TABLE]
where we used that, for ,
[TABLE]
Thus,
[TABLE]
where
[TABLE]
and . Since and by assumption, , the functions and are monotonically decreasing and as . ∎
The following Theorem establishes the LISS properties of the error system.
Theorem 3**.**
Consider (16) and its error system (18) and suppose that is at least q-linearly convergent. Further, let Assumption 4 hold. Then, there exists such that the system is LISS if and , where and , and are defined in Lemma 1. Further, the asymptotic gain of (18) is of the form
[TABLE]
where and correspond to the and inputs, respectively, and monotonically as .
{pf*}
Proof. Given Lemma 1, if (20) holds for all time instants leading up to , it follows by direct computation (see e.g. [29, Example 3.4]) that
[TABLE]
where . To ensure that (20) holds, we first consider the case and note that444Recall that for any two scalars.
[TABLE]
Next, assuming (20) holds for and recalling that , we can apply (29) at iteration to show that
[TABLE]
thus ensuring that (20) also holds at due to the restriction and . Since (29) recursively enforces its restrictions, we obtain
[TABLE]
which directly establishes LISS with and . The remaining claims follow from the expression since monotonically as .∎
6 LISS properties of suboptimal MPC
Theorem 3 establishes sufficient conditions under which an at least q-linearly convergent optimizer, viewed as a dynamic system, is LISS. It also establishes that the asymptotic gain of (18) can be made arbitrarily small by increasing the number of iterations. Since the closed-loop system (8) is itself LISS, we can derive sufficient conditions under which the coupled system, as shown in Figure 2, is LISS with respect to the disturbance input .
Theorem 4**.**
Consider the dynamical systems
[TABLE]
where and is defined in (8). Let the optimization algorithm used to construct satisfy (15) and let Assumptions 1 - 4 hold. Then, there exists such that if the interconnected system (33) is LISS with respect to the input .
{pf*}
Proof. Under Assumptions 1 - 4, Theorem 1 holds555Recall that Assumption 4 is sufficient for Lipschitz continuity of the optimal value function .. Thus system is LISS, meaning that there exist asymptotic gains such that
[TABLE]
for suitably restricted and . Let denote the Lipschitz constant of , then
[TABLE]
combining this with (34) we obtain that
[TABLE]
where , and . Similarly, due to Theorem 3, there exists and positive functions and such that
[TABLE]
given . Therefore, it follows from (33b) that
[TABLE]
Combining (38) with (36) we obtain that
[TABLE]
Thus, if the contraction property
[TABLE]
is satisfied for all , (33) is LISS with suitable restrictions on the initial state and on the disturbance , as detailed in [50, Theorem 2]. Note that, since , we have where is the convergence radius666If the optimizer is globally convergent then can be chosen arbitrarily. In that scenario it may be possible to obtain a stronger result using different analysis tools. of the optimizer defined in Theorem 3. Since monotonically as , the existence of such that (40) is satisfied follows from the finiteness of , , and . ∎ Theorem 4 establishes conditions under which the interconnected plant-optimizer system is LISS. However, this result does not provide any information about the set of admissible initial conditions and does not consider constraint satisfaction. By noting that the ideal MPC feedback law admits a robust positively invariant set, we can extend our result by deriving sufficient conditions for constraint satisfaction.
Theorem 5**.**
Suppose that the assumptions of Theorem 4 hold so the interconnected system (33) is LISS. Let denote the admissible RPI set in Theorem 1, let denote the asymptotic gain of (18), and let denote the closed-loop trajectory of (33) for some initial condition and disturbance sequence . Then, if the disturbances are sufficiently small, there exists and such that, if and , then for all .
{pf*}
Proof. Due to Theorem 1, given a sufficiently small disturbance set , there exists a neighbourhood of the origin such that, if and , then . Since for a surjective matrix , there exists such that, if , then . Given the restriction and , where and are defined in Theorem 3, it follows from (32) that can be imposed by enforcing and . To enforce , we note that the set is bounded [35, Theorem 4], thus implying that (see Section 2 for a definition of set subtraction) is bounded by
[TABLE]
Since is finite and monotonically as , there exists such that . Moreover, since is finite and monotonically as there exists such that . Thus, letting and , it follows that the system is LISS with restrictions on the initial conditions and , as well as restrictions on the external disturbance .∎ Theorem 5 establishes that, if enough computational resources are available and the initial solution guess is sufficiently accurate, then TDO recovers the robustness properties of optimal MPC.
Remark 5**.**
The results presented in this section are quite general: as long as the MPC formulation is LISS, the solution mapping of the OCP is strongly regular, and the convergence rate of the iterative solver is at least -linear, Theorems 4 and 5 prove that it is possible to achieve robust stability and constraint satisfaction by performing a limited number of solver iterations per time step. Due to the generality of the framework, however, the actual values we obtain for and are likely to be conservative and would be ill-suited for, e.g., complexity certification as in [46], which, it should be noted, only considers the linear case. Despite this drawback, our results significantly extend the existing analysis of the RTI scheme [8] when our framework is applied to TD-SQP. Complexity certification is significantly more challenging in the nonlinear case due to the nonconvexity of the OCPs and is left to future work.
7 Conditions for Strong Regularity
The main results in this paper are all predicated upon Assumption 4, that for each parameter value the solution mapping of the OCP is strongly regular. In this section we discuss some common settings and derive strong regularity conditions for each.
7.1 Closed Convex Constraint Sets
If the constraint sets and in (5) are closed and convex, it is possible to write the optimality conditions without introducing dual variables for the inequality constraints. In particular, we can express (5) compactly as
[TABLE]
where and . The Lagrangian associated with (42) is
[TABLE]
where are dual variables (sometimes “co-states”). The KKT conditions of (42) are
[TABLE]
where and . Note that (44) can be reduced to (10) by choosing and . Our framework requires that (44) be necessary for optimality. To ensure this, we impose the following constraint qualification [45, Theorem 6.14]
[TABLE]
for all . The following lemma proves that(45) holds automatically in this setting.
Lemma 2**.**
The constraint qualification (45) holds at all points .
{pf*}
Proof. The constraint qualification is implied by surjectivity of the matrix . Denoting , and , the surjectivity of becomes the condition that for every the system
[TABLE]
has a solution. This condition clearly holds: pick an arbitrary sequence and determine recursively.∎
Before stating the conditions for strong regularity, we recall the following second order condition (which can be monitored numerically, see e.g., [39, Section 16.2]).
Definition 6** (SOSC).**
The Second Order Sufficient Condition (SOSC) is said to hold at if
[TABLE]
7.1.1 Convex Control Constraints
If only convex control constraints are present, Theorem 6 provides sufficient conditions for strong regularity.
Theorem 6**.**
[12, Theorem 1.2]** Suppose that , where is closed and convex, and consider any . If (46) holds, then is strongly regular at .
As a result of Theorem 6, Assumption 4 reduces to the assumption that (46) holds at all minimizers in . In this scenario, any terminal set constraints would have to be enforced through penalty functions.
7.1.2 Polyhedral State and Control Constraints
If the state and control constraints are convex polyhedra, the following theorem applies. The result was previously asserted without proof in [10, Section 3.2], we provide a proof for completeness.
Theorem 7**.**
Suppose that in (42) is polyhedral with a representation . Now consider a KKT point . If (46) holds, then is strongly regular at .
{pf*}
Proof. Strong regularity of the nonlinear GE (44) at follows from strong regularity of its partial linearization [44]. This can be written as
[TABLE]
where , , , , , and . Equation (47) is an affine GE of the form,
[TABLE]
to which we apply [14, Theorem 2E.6] to establish strong regularity of the mapping . This requires
[TABLE]
where , , and
[TABLE]
is the critical cone777See [14, Section 2E] for more details on critical cones. We’ve simplified the expression for using [14, Theorem 2E.3] and (44). of at . Next, note that thus, by the second order condition, for all . Thus
[TABLE]
which implies that for all . As a result, (49) is satisfied and (44) is strongly regular. ∎ Thus, as in the case of convex input constraints, Assumption 4 reduces to the condition that (46) holds at all minimizers in .
7.2 Nonlinear Inequality Constraints
If the constraint sets in (5) can be expressed in the form and for suitable twice continuously differentiable functions and , then (5) can be written compactly as the following Nonlinear Program (NLP),
[TABLE]
where are the decision variables. The Lagrangian associated with (51) is
[TABLE]
where and are dual variables. Its KKT conditions [26] are
[TABLE]
where is the normal cone mapping of the non-negative orthant. Comparing (53) with (10) we can identify , , and
[TABLE]
To ensure that (53) are necessary for optimality, as required by our framework, we need to impose a constraint qualification on (51).
Definition 7** (LICQ).**
The Linear Independence Constraint Qualification (LICQ) is said to hold at if
[TABLE]
where is the set of active constraint indices and is the number of equality constraints.
The following Theorem summarizes necessary and sufficient conditions for strong regularity in the context of nonlinear programming.
Theorem 8**.**
[26, Prop 1.27, 1.28]** Consider a parameterized nonlinear program of the form (51) and let be the solution mapping of its KKT conditions (53). A point satisfying is strongly regular if it satisfies the LICQ and the strong second order sufficient condition (SSOSC)
[TABLE]
where
[TABLE]
and . Moreover, if is a local minimizer of (51), the LICQ and SSOSC are also necessary conditions for strong regularity.
Thus, Assumption 4 reduces to the assumption that the SSOSC and LICQ hold at all minima in .
8 Sequential Quadratic Programming
Having defined under what conditions the solution mapping of the OCP is strongly regular, we investigate the convergence properties of two widely used SQP schemes to show that they can be used for TDO. To this effect, note that the OCPs (42) and (51) can both be solved using SQP. Specifically, for (51), given a solution estimate , the next iterate can be computed by solving the following Quadratic Program (QP)
[TABLE]
where approximates the Hessian of the Lagrangian . Specifically, if we denote the Lagrange multipliers associated with the equality and the inequality constraints by and respectively. The SQP update for (51) is . Note that (55) is fully defined by (42) or (51) except for , which will depend on the specific SQP method.
SQP applied to (42) is similar. The SQP update becomes , i.e., the inequality duals are removed from the iteration, must approximate instead of and (55c) becomes , where and satisfy .
Remark 6**.**
The convex control constraints setting (Section 7.1.1) technically allows for non-polyhedral convex constraints. In this case (55c) would need to be replaced with a convex constraint of the form but otherwise no changes are necessary.
To provide a unified formulation, we exploit that SQP can be seen as a Newton-type process for solving GEs of the form
[TABLE]
where , , is continuously differentiable and is the normal cone mapping for a closed, convex set . Newton’s method applied to (56) is
[TABLE]
where the sequence approximates . Referring to the QP subproblem (55), we note that
[TABLE]
for (51)” for (42) simply discard the third row and column. Thus, the the sequence is fully determined by the Hessian approximation sequence .
Remark 7**.**
In this paper we only consider “undamped” Newton methods, which are intrinsically local methods. More sophisticated implementations may include various type of regularization and/or globalization techniques such as trust regions or linesearches to enlarge the methods region of attraction. Nevertheless, undamped Newton methods are commonly used in practice, especially in the context of the RTI scheme, and the tools we develop in this paper are applicable to locally convergent algorithms. We leave the application of our tools to globalized SQP methods to future work and refer readers to e.g., [39] or [26], for more detailed treatments of SQP methods.
8.1 The Josephy-Newton (JN) method
Using the exact Hessian of the Lagrangian results in the Josephy-Newton method. The following theorem summarizes the convergence properties of the JN method applied to (56).
Theorem 9**.**
[26, Theorem 3.2]** Let for some fixed and suppose that Assumption 2 holds and is strongly regular. Let the sequence be generated by repeatedly solving
[TABLE]
Then, there exists and satisfying , such that, if , then is unique and converges to q-quadratically, i.e.,
[TABLE]
In general, we cannot expect to be positive semidefinite even in the vicinity of a solution. This may make solving the QP subproblems difficult and is a well known issue in the SQP literature. A detailed discussion is outside the scope of this paper, we refer interested readers to e.g., [26, 39, 4]. We will however briefly discuss two possible solutions. The first is to use an Augmented Lagrangian Hessian, i.e., to use for some . If the second order sufficient conditions hold, then will be convex if is sufficiently large [26, Section 4.2]. This will shift the multipliers associated with , see [26, Section 4.2] for details on how to recover the original multipliers. This method may not always be numerically efficient because it can negatively impact the sparsity of . The second is to use a reduced Hessian approach, see e.g., [47, 39], which maintains a basis for the null space of the active constraints and solves the QPs on this reduced space. The Hessian projected onto the reduced space (the “reduced Hessian”) is guaranteed to be positive definite in the vicinity of a solution if a second order condition holds.
8.2 The Gauss-Newton (GN) method
The Gauss-Newton method is applicable when the objective function has the form for some residual function . The Hessian of the Lagrangian is then approximated by
[TABLE]
For example if the GN Hessian approximation is . The GN method has the advantage that the Hessian approximation is guaranteed to be positive semidefinite, so the QP subproblems can be solved reliably. Because of this, the GN method is widely used in practice, see e.g., [24, 52, 19, 20, 7, 1]. The GN approximation error satisfies
[TABLE]
so the approximation error depends on the size of the residuals and on the second derivative which is related to the nonlinearity of the dynamics. We show in Theorem 10 that it is important to approximate where . The following theorem establishes sufficient conditions for q-linear convergence of the GN method by extending the classical fixed-point type analysis of Newton’s method, see [31, Section 5.4.2]. The nearest analysis we found in the literature is [10, Theorem 3.5] which considers a path tracking problem rather than a fixed one.
Theorem 10**.**
Fix some parameter , let and suppose that Assumptions 2 and 4 hold. Consider a sequence generated by repeatedly solving (57). Further, define and suppose that there exist such that for all . If the mapping
[TABLE]
is strongly regular for all , i.e., is a Lipschitz continuous function with Lipschitz constant , and , then there exists and such that if , then is unique, converges to q-linearly, and
[TABLE]
where .
{pf*}
Proof. A solution, , exists for every thanks to Assumption 4; from this point forward we will suppress the dependencies on in the subsequent expressions. The GN method can be written as
[TABLE]
where ; note that for any choice of . First consider
[TABLE]
[TABLE]
Since is Lipschitz (Assumption 2) the fundamental theorem of calculus implies that there exist such that
[TABLE]
for all , so, taking norms, we obtain that
[TABLE]
By assumption the mapping is Lipschitz continuous so satisfies
[TABLE]
for all . Now consider the update equation
[TABLE]
where we have used that . Since is strongly regular, is a function and is unique. Using (66) we have
[TABLE]
Since by assumption, it is possible to pick such that . Then converges q-linearly to if , i.e.,
[TABLE]
Theorem 10 requires that be a sufficiently good approximation of and that the GN subproblems be strongly regular. A sufficient condition for strong regularity is that the QP (55) satisfies the LICQ and SSOSC (Theorem 8). In practice, strong regularity can be achieved by a judicious choice of . For example, if the Hessian approximation is convex then it is possible to guarantee strong regularity of the subproblems by adding a regularization term, i.e., for some small . Then the mapping is strongly monotone, which implies strong regularity [14, Theorem 2F.6].
Remark 8**.**
Theorem 10 just requires the Hessian approximation be sufficiently good. One can conceive of useful approximation schemes other than the GN approximation, e.g., when is convex or for some fixed sufficiently close to .
9 Time-distributed SQP
In this section we demonstrate that the methods described in Sections 7 and 8 satisfy the condition of Remark 4 and can therefore be used within the framework presented in Section 4.
Remark 9**.**
Note that TD-SQP using the GN Hessian approximation and with corresponds to the RTI scheme [6]. As such, when specialized to the RTI scheme, Theorems 4 and 5 are a significant extension of the existing analysis [8] which does not consider inequality constraints.
Strong Regularity Assumption: As detailed in Section 7.1, in the presence of convex constraint sets Assumption 4 can be reduced to the following:
Assumption 5**.**
The second order sufficient condition (46) holds at all minimizers in .
As detailed in Section 7.2, in the nonlinear inequalities setting, Assumption 4 can instead be ensured under the following:
Assumption 6**.**
The linear independence constraint qualification (see Definition 7) and strong second order sufficient condition (see Theorem 8) hold at all minimizers in .
Algorithm Definition: Both SQP methods described in Section 8 are instances of the following iterative process
[TABLE]
for specific choices of , , and . Thus, in both cases the optimization mapping (12) can be written as
[TABLE]
Convergence Rate: If the exact Hessian is used then Theorem 9 applies and the method is at least q-linearly convergent with . If the GN Hessian approximation is used then Theorem 10 applies under some additional assumptions regarding the accuracy of the Hessian approximation, and the method is at least q-linearly convergent with . In both cases the definition of q-linear convergence requires that there be a uniform convergence constant and convergence radius over . Under the assumption that the functions and in Theorems 9 and 10 are upper and lower semicontinuous, respectively, these can be defined as and . Thus, SQP fits into the framework in Section 4 and can be used for time distributed optimization.
10 A Numerical Example
Figure 3 illustrates a bicycle model of a sedan. We only consider the lateral portion of the dynamics; the longitudinal velocity is assumed constant. The states and control inputs are,
[TABLE]
where is the lateral position, is the lateral component of velocity, is the yaw angle, is the yaw rate, is the front steering angle, and is the rear steering angle.
The equations of motion are
[TABLE]
where
[TABLE]
The tire forces are described by a Pacejka model.This model is a modified version of the one presented in [53] and roughly represents a 2017 BMW 740i. The vehicle is disturbed by normally distributed wind gusts with a mean velocity of and standard deviation of . We obtain a discrete time model using a forward Euler integration scheme with a sampling period of leading to a discrete time model of the form . The model parameters are summarized in Table 1888SI units are used and all angles are in radians unless otherwise noted..
The control objective is to perform a lane change maneuver. This can be achieved by stabilizing the origin which is chosen to coincide with the center of the target lane. The vehicle begins in the neighboring lane at the initial condition . The OCP is
[TABLE]
where is the discrete time model of the sedan. The vehicle is subject to state constraints which keep the vehicle on the road and restrict its yaw and steering angles. The state constraints on and are softened using exact penalty functions which are implemented using slack variables in order to satisfy our smoothness assumptions. The upper and lower bounds are
[TABLE]
and the weighting matrices are , and . The terminal weight is obtained by solving the discrete time algebraic Riccati equation using the linearization about the origin. The matrices encoding the terminal set, and , are computed using the MPT3 toolbox [23]. The natural residual
[TABLE]
is an error bound [40], i.e., it upper and lower bounds , where , and is commonly used as an easily computable surrogate for the error. We use it throughout this section to measure .
Figure 4 compares the RTI scheme [8], i.e., a TD-SQP scheme using the GN Hessian approximation with , with an LQR controller and the optimal MPC feedback law999All simulations were carried out in MATLAB 2017b on a 2015 Macbook Pro with 16GB of RAM and a 2.8GHz i7 processor. We solved quadratic programs using quadprog. The optimal MPC feedback law was computed using fmincon with default settings. CASADI [3] was used to compute analytic derivatives which were supplied to the optimization routines.. The RTI feedback law successfully stabilizes the origin of the plant-optimizer system and outperforms the LQR controller. The state error and the optimization residual both converge to a ball about the origin, demonstrating the expected robustness due to the LISS properties of the combined system (Theorem 4). The closed-loop trajectories generated by the RTI controller are nearly indistinguishable from those from the optimal feedback law but are an order of magnitude cheaper to compute. The RTI scheme took on average and in the worst case vs. and for the optimal feedback law. Closed-loop responses using the RTI controller for 15 different initial position and yaw angle combinations, with all other states are initialized to zero, are shown in Figure 5.
Figure 6 compares the GN and JN methods with and . In the bottom plot of Figure 6 note that if iterations are performed, the yaw angle constraint is satisfied exactly, even in the presence of disturbances, as predicted by Theorem 5. Also, note that the residuals of the computational subsystem converge faster, for a given number of iterations, if the JN method is used instead of the GN method. This is as expected since the convergence rate of the SQP algorithm is faster when the exact Hessian is used.
11 Conclusions
In this paper we presented a general framework for the stability analysis of model predictive controllers implemented using time-distributed optimization. When specialized to Sequential Quadratic Programming, our result extends the existing stability analysis of the RTI scheme by explicitly considering inequality constraints, analyzing the effect of performing additional SQP iterations, considering a wider class of Hessian approximations, and proving local input-to-state stability of the closed-loop system. Future work includes analyzing the effect of the sampling rate, applying our framework to globalized SQP methods, and developing numerical methods for estimating the the asymptotic gain functions used in the analysis.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Thivaharan Albin, Dennis Ritter, Norman Liberda, Rien Quirynen, and Moritz Diehl. In-vehicle realization of nonlinear MPC for gasoline two-stage turbocharging airpath control. IEEE Transactions on Control Systems Technology , 26(5):1606–1618, 2018.
- 2[2] Douglas A. Allan, Cuyler N. Bates, Michael J. Risbeck, and James B. Rawlings. On the inherent robustness of optimal and suboptimal nonlinear MPC. Systems and Control Letters , 106:68 – 78, 2017.
- 3[3] Joel Andersson, Johan Åkesson, and Moritz Diehl. Casadi: A symbolic package for automatic differentiation and optimal control. In Recent advances in algorithmic differentiation , pages 297–307. Springer, 2012.
- 4[4] Paul T Boggs and Jon W Tolle. Sequential quadratic programming. Acta numerica , 4:1–51, 1995.
- 5[5] G Di Pillo and L Grippo. Exact penalty functions in constrained optimization. SIAM Journal on control and optimization , 27(6):1333–1360, 1989.
- 6[6] Moritz Diehl, Hans Georg Bock, and Johannes P Schlöder. A real-time iteration scheme for nonlinear optimization in optimal feedback control. SIAM Journal on control and optimization , 43(5):1714–1736, 2005.
- 7[7] Moritz Diehl, Rolf Findeisen, and Frank Allgöwer. A stabilizing real-time implementation of nonlinear model predictive control. In Real-Time PDE-Constrained Optimization , pages 25–52. SIAM, 2007.
- 8[8] Moritz Diehl, Rolf Findeisen, Frank Allgöwer, Hans Georg Bock, and Johannes P Schlöder. Nominal stability of real-time iteration scheme for nonlinear model predictive control. IEE Proceedings-Control Theory and Applications , 152(3):296–308, 2005.
