Symmetry and Motion Primitives in Model Predictive Control
Kathrin Fla{\ss}kamp, Sina Ober-Bl\"obaum, Karl Worthmann

TL;DR
This paper explores how symmetries in mechanical systems can be leveraged within Model Predictive Control to improve stability and control strategies, demonstrated through a mobile robot example and motion primitives.
Contribution
It establishes the foundation for integrating symmetries into MPC, proving stability in a mobile robot scenario and providing guidelines for stability when symmetry is not aligned with the cost function.
Findings
Asymptotic stability of a set point in MPC with symmetries is proven.
Numerical validation using motion primitives in a parallel parking scenario.
Guidelines for stability guarantees when the optimization criterion conflicts with symmetry actions.
Abstract
Symmetries, e.g. rotational and translational invariances for the class of mechanical systems, allow to characterize solution trajectories of nonlinear dynamical systems. Thus, the restriction to symmetry-induced dynamics, e.g. by using the concept of motion primitives, may be considered as a quantization of the system. Symmetry exploitation is well-established in both motion planning and control. However, the linkage between the respective techniques to optimal control is not yet fully explored. In this manuscript, we want to lay the foundation for the usage of symmetries in Model Predictive Control (MPC). To this end, we investigate a mobile robot example in detail where our contribution is twofold: Firstly, we establish asymptotic stability of a desired set point w.r.t. the MPC closed loop, which is also demonstrated numerically by using motion primitives applied to the parallel…
| 0 | 0.0 | -2.00 | 2.0 | 4.00 | 4.000 |
|---|---|---|---|---|---|
| 1 | 0.1 | -1.80 | 1.8 | 3.24 | 3.640 |
| 2 | 0.2 | -1.62 | 1.7 | 2.626 | 3.350 |
| 3 | 0.3 | -1.45 | 1.5 | 2.105 | 3.118 |
| 4 | 0.4 | -1.30 | 1.3 | 1.690 | 2.928 |
| 5 | 0.5 | -1.17 | 1.2 | 1.371 | 2.778 |
| 6 | 0.6 | -1.05 | 1.1 | 1.105 | 2.656 |
| 7 | 0.7 | -0.94 | 1.0 | 0.886 | 2.558 |
| 8 | 0.8 | -0.84 | 0.9 | 0.708 | 2.480 |
| 9 | 0.9 | -0.75 | 0.8 | 0.565 | 2.418 |
| 10 | 1.0 | -0.67 | 0.7 | 0.451 | 2.368 |
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
| 20 | 2.0 | -0.21 | 0.3 | 0.045 | 2.192 |
| ⋮ | ⋮ | ⋮ | ⋮ | ⋮ | ⋮ |
| 35 | 3.5 | -0.00 | 0.0 | 0.000 | 2.182 |
| No. | Trim primitive | |
|---|---|---|
| 1 | rest | |
| 2 | move straight | |
| 3 | circle clockwise | |
| 4 | circle anti-clockwise | |
| 5 | turn on the spot |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Symmetry and Motion Primitives in Model Predictive Control††thanks: Funding by Deutsche Forschungsgemeinschaft (DFG, grant no. WO 2056/4-1, 6-1) and by Mathematisches Forschungsinstitut Oberwolfach is gratefully acknowledged.
Kathrin Flaßkamp111K. Flaßkamp, Center for Industrial Mathematics, University of Bremen, Germany, email: [email protected]
Sina Ober-Blöbaum222S. Ober-Blöbaum, Department of Engineering Science, University of Oxford, United Kingdom, email: [email protected]
Karl Worthmann333K. Worthmann, Institut für Mathematik, Technische Universität Ilmenau, Germany, email: [email protected]
Abstract
Symmetries, e.g. rotational and translational invariances for the class of mechanical systems, allow to characterize solution trajectories of nonlinear dynamical systems. Thus, the restriction to symmetry-induced dynamics, e.g. by using the concept of motion primitives, may be considered as a quantization of the system. Symmetry exploitation is well-established in both motion planning and control. However, the linkage between the respective techniques to optimal control is not yet fully explored. In this manuscript, we want to lay the foundation for the usage of symmetries in Model Predictive Control (MPC). To this end, we investigate a mobile robot example in detail where our contribution is twofold: Firstly, we establish asymptotic stability of a desired set point w.r.t. the MPC closed loop, which is also demonstrated numerically by using motion primitives applied to the parallel parking scenario. Secondly, if the optimization criterion is not consistent with the symmetry action, we provide guidelines to rigorously derive stability guarantees based on symmetry exploitation.
1 Introduction
In Model Predictive Control (MPC), a sequence of Optimal Control Problems (OCPs) on finite horizons is iteratively solved to approximately solve an OCP on an infinite time horizon while continuously taking into account state measurements, see, e.g. [17, 38] for further details. It has to be guaranteed though that the resulting feedback law stabilizes the system at a desired set point despite the challenging task of solving OCPs for nonlinear dynamical systems in real-time, see, e.g. [22, 47, 16, 24] and [6] for techniques to speed up the numerical solution of the OCPs to be solved in each MPC step.
The approach pursued in this paper is based on exploiting structural properties of the underlying dynamical system in MPC. While, in contrast to linear systems, nonlinear systems cannot be (globally) characterized by evaluating the spectrum at a desired set point, it is still possible to identify characteristic properties which describe global system behavior and are useful in motion planning and optimal control [12, 13]. An important system property is the existence of symmetries, namely continuous symmetries represented by Lie groups. These induce invariances, i.e. the system dynamics are invariant w.r.t. the corresponding symmetry actions. Mechanical systems, such as cars or helicopters, for instance, are typically invariant w.r.t. translations or rotations. Consequently, translations or rotations of a trajectory lead to another trajectory of the mechanical system. Further, symmetries induce the existence of basic motions, e.g. going straight at constant speed or turning with constant rotational velocity in mechanical systems. These basic motions will be called trim primitives or trims, for short, see [14]. Trims can be represented very conveniently, even if general solutions of the dynamical systems cannot be computed by hand. We will quantize the nonlinear system dynamics by choosing a finite set of basic motions, which will be called motion primitives, to which the system is restricted in order to approximately solve the original OCP.
Symmetry exploitation is well-established in control of nonlinear dynamical systems (see e.g. [3, 2, 31]), where the notion of symmetry is based on Noether’s theorem, stating that a symmetry of a dynamical system induces a first integral, i.e. a quantity that is preserved along the system trajectory. In [23], Sussmann introduced a definiton of symmetry for optimal control problems which allows to identify first integrals, i.e. quantities which are preserved along the state and adjoint trajectories (the so called biextremals). This is a useful tool to solve equations of motion for dynamical systems or control problems because finding first integrals can be used to reduce the dimension of the problem which is the main motivation in the aforementioned works.
Classical planning methods only perform geometrical path planning, see e.g. [29] for an overview, and do not take the dynamics of the control system into account. However, already Dubin showed that solutions consisting of arcs of circles and straight lines are optimal w.r.t. path length for system dynamics with constrained turning radius, [7]. This has been extended by Reeds and Shepp in [40] to explicit solution formulas for shortest paths of system dynamics that allow going forward and backward. More recently, control methodologies exploiting the multi-body system dynamics [15] or geometric mechanics [21] were proposed.
The exploitation of symmetry-induced motion primitives for planning problems of nonlinear dynamical systems has been first proposed by Frazzoli et al. [12, 14]. Moreover, in Frazzoli’s approach, optimal motion plans are searched for. Following the idea of quantization (see also [13, 28, 10]), the motion primitives are partly generated by solving optimal control problems for intermediate problems. Finding the best motion primitive sequence can then be written as a mixed-integer optimization problem. Thus, heuristic approaches for globally solving the sequencing problem can be applied, such as sampling-based road-map algorithms [28]. While quantizing itself can be seen as reformulating the dynamics as a hybrid system, the approach can also be applied to systems with intrinsic hybrid, i.e. mixed discrete-continuous, behavior [9]. Recently, the idea of motion primitives has also been applied to autonomous driving problems, see [35] for motion planning and [37, 34] for a multiobjective MPC framework. The latter work numerically exploits symmetries to reduce the computational effort for generating a library of motion primitives used within an explicit MPC framework. In contrast to this contribution, trim primitives and stability questions regarding MPC are not considered.
Besides their utility in motion planning problems, we have seen that basic motions may help in the analysis and the design of MPC schemes as well as the numerical solution of the underlying OCPs: in [11] and [20] concatenations of such basic motions allowed the construction of stabilizing terminal regions and costs for several examples with non-stabilizable linearization (with the peculiarity that the desired set point was not contained in the interior of the terminal region). In particular, the mobile robot was used as a prototype application since its nonholonomic nature ”makes the stabilization of this system challenging; see [1]” according to [11, p. 136]. More recently, basic motions were also utilized in the analysis of MPC schemes without terminal costs and constraints, see, e.g. [45]. However, a formal connection between motion primitives and the proposed techniques has not been established yet. The main goal of this work is to lay the foundation for a link between the already quite mature technique of motion primitives and MPC. To this end, we revisit the example of the mobile robot in detail in order to directly illustrate the appropriateness of the proposed methodology. Here, after introducing the notion of symmetries for optimal control problems based on the definitions for dynamical systems, the contribution of this work is twofold: Firstly, for stage cost, which are consistent with the symmetry action, we show recursive feasibility and asymptotic stability of the origin w.r.t. the MPC closed loop. The key idea is to show that the control effort is uniformly distributed, which also explains why the iterative nature of MPC leads to reduced costs in comparison to the finite horizon optimal control problems. Moreover, we prove finite time convergence if motion primitives are used. Secondly, we show that the basic motions used in [44] were trims and we show that – besides not using the wording – the characteristic properties of trims, namely that a suitable quantization of the system dynamics allows to derive sufficiently good bounds on the value function, was of key importance for the deduced results. This may, e.g., also pave the way for a verification of the distributed controllability assumption introduced in [19].
The remainder of the paper is organized as follows. In Section 2, symmetry, motion and trim primitives are introduced for dynamical control systems and illustrated for the example of the mobile robot. In Section 3, the terminology of symmetries and invariances is extended to optimal control problems. Then, in Section 4, the MPC scheme and a class of admissible control functions are introduced which guarantee the restriction to control trajectories along trim primitives. Convergence of the MPC closed loop trajectory to the origin is shown. In Section 5 we illustrate the MPC approach with motion primitives for the mobile robot.
2 Symmetries and Motion Primitives
Let the system dynamics be given by the ordinary differential equation
[TABLE]
with initial condition . Here, and denote the state and the control at time respectively, where is an -dimensional manifold. Let denote the tangent bundle of . We assume that the map is continuous and locally Lipschitz w.r.t. its first argument in order to guarantee existence and uniqueness of the solution on its maximal interval of existence for . , , denotes the space of Lebesgue-measurable and locally absolutely integrable functions. If we restrict the domain of the control function to the compact interval , and the solution uniquely exists on .
Throughout this manuscript, we consider the following example to illustrate the definitions, concepts, and results. The robot is modeled by a kinematic model for a -wheeled vehicle which autonomously moves in the 2-dimensional plane under nonholonomic constraints due to the wheels.
Example 1** (Mobile Robot; , ).**
The system dynamics of the mobile robot are governed by
[TABLE]
where and denote the position of the robot in the plane while represents its orientation and thus, . Since is globally Lipschitz continuous, finite escape times can be excluded such that existence and uniqueness of the solution , , is guaranteed.
For Example 1, a translational invariance (w.r.t. the position) can be observed, i.e.
[TABLE]
holds for all with , . This equation states that the translation commutes with the flow, i.e. we may first translate the initial state by and then compute the flow or first solve the differential equation and then apply the translation . We formalize this commutativity in the following definition. To this end, recall that a Lie group is a group , which is also a smooth manifold, for which the group operations and are smooth. If, in addition, a smooth manifold is given, we call a map a left-action of on if and only if the following properties hold:
- •
for all where denotes the neutral element of .
- •
for all and .
For convenience, we define with for and .
Definition 2** (Symmetry Group).**
Let be a smooth manifold, a Lie-group, and a left-action of on . Then, we call the triple a symmetry group of the system if the property
[TABLE]
holds for all .
Next, we compute the symmetry group of the mobile robot introduced in Example 1 to illustrate Definition 2. Here, a formal inspection reveals that rotational invariance is combined with translational invariance. In general, the symmetry group of mechanical systems is a subgroup of , where is the group of translations and is the special orthogonal group, which can be represented by the set of matrices . Since this is a subgroup of the affine group of dimensions, there are two ways to represent the elements of :
- •
Either by a pair with and . Then, the group action can by represented by .
- •
Or by the single matrix via
[TABLE]
where we have first rewritten in homogeneous coordinates (see [33]) before multiplying by . The projection on the state space can then by represented by the matrix .
In addition to the translational invariance observed in Equation (3), the planar mobile robot also has rotational symmetry as specified in the following proposition.
Proposition 3**.**
For given , we consider the flow generated by the system dynamics (2). Then,
[TABLE]
is a symmetry group of the mobile robot with the matrix multiplication as group action, i.e. , (with represented in homogeneous coordinates).
Proof.
As a subgroup of , is a Lie group. The flow of the mobile robot is given by
[TABLE]
Then, direct calculations, using the angle sum formula for sine and cosine, show the Identity (4) since both terms can be written as
[TABLE]
where the rotation matrix is denoted by . ∎
If Property (4) holds, the flow of the system is said to be equivariant w.r.t. the symmetry action . As a consequence, given a trajectory , new trajectories , , can be generated using . This family of trajectories – parametrized in the group element – forms an equivalence class.
Definition 4** (Motion Primitive).**
Let be a symmetry group in the sense of Definition 2. Then, two trajectories and are called equivalent, if there exists such that
[TABLE]
A motion primitive is the equivalence class of all trajectories equivalent to w.r.t. the left action .
Note that is an arbitrary but fixed control function in Definition 4, i.e. is identical for all members of the same motion primitive. By slight abuse of notation, we will use the term motion primitive also for a representative of the equivalence class. The symmetry action of the mobile robot is illustrated in Figure 1.
Considering constant control functions , we may link trajectories , , to (the positive part of) a corresponding one-parameter subgroup . Trajectories , which exhibit this link are called trim primitives, trims in short, and can be considered as basic motions. To be more precise, a particular element of the Lie algebra is scaled by time and then mapped via the exponential function to the Lie group to generate the corresponding one-parameter subgroup. We show this connection for the symmetry group of the mobile robot in combination with constant control functions.
Definition 5** (Trim Primitive).**
Let be a symmetry group in the sense of Definition 2. Then, a trajectory is called a trim primitive if there exists a Lie algebra element such that
[TABLE]
Proposition 6**.**
Let be the symmetry group of Proposition 3 and denote the corresponding Lie algebra. Then, for the constant control function , , defined by for all and the initial value , we get
[TABLE]
for with defined by
[TABLE]
In particular, with the translation vector
[TABLE]
for and , respectively.
Proof.
The Lie algebra for a Lie group that consists of rotation matrices is given by skew symmetric matrices [33]. The corresponding Lie algebra is one-dimensional and can be represented as
[TABLE]
The Lie algebra that corresponds to translations in and is simply and , respectively [33]. Together, we obtain that every element of the Lie algebra can be represented by the matrix
[TABLE]
using the triple . Then, since we are using the homogeneous representation, the linear scaling by a factor and the exponential can by directly calculated by first scaling the representation matrix and, then, computing the matrix exponential ( is constant). Doing so yields
[TABLE]
which shows the claimed representation of . Moreover, the left hand side of Equation (7) equals
[TABLE]
by solving the integrals in the representation (6). Then, direct calculations using the angle sum formula for sine and cosine show Equation 7. ∎
Proposition 6 shows that the mobile robot exhibits trims according to Definition 5 for all constant control functions and arbitrary initial values. If the rotational speed is constant and not equal to zero, trims result in a circular motion (otherwise it is a straight line), see e.g. the black curve in Figure 1. In general, trims are attractive since particular solutions of a nonlinear system can be computed while no analytic expression for the general solution is available.
Remark 7** (Dynamical Systems & Trims).**
*In dynamical systems without control input, motions generated by symmetry actions are called relative equilibria since these motions have to be constant in coordinates which are non-symmetric [31, 2]. Constructive approaches to find trims can be deduced from symmetry reduction methods. For mechanical systems with cyclic coordinates, this has been worked out in [10].
Note that any solution with constant control generates a trim for Example 1 since it is a kinematic model (and, thus, the velocities are directly controlled). If actuator dynamics are added, it becomes a second order mechanical system. For this system class, trims typically correspond to motions with constant velocities in body-fixed frame.*
Remark 8** (Alternative Characterization of Symmetry Groups).**
Let maps from a smooth manifold to the tangential bundle of . The symmetry action as a map can be lifted to for via
[TABLE]
Then, the vector field is said to be equivariant w.r.t. the symmetry action if . A direct calculation shows the equivalence of this condition to Property (4): Application of Equation (4) and its time derivative yields
[TABLE]
Remark 9** (Alternative proof of Proposition 3).**
Instead of showing the invariance of the flow (Property (4)) we can, alternatively, show the equivariance of the vector field to prove Proposition 3. With the lifted action is given by and thus, it follows with the vector field given in (2)
[TABLE]
3 Symmetry and Optimal Control
For given optimization horizon , , we consider the Optimal Control Problem (OCP)
[TABLE]
with continuous stage cost and functions and , . Moreover, , is the space of absolutely continuous functions on the manifold .
Recall that we identified symmetries of the system dynamics in Section 2. Now, our aim is to take advantage of these symmetries in optimal control. Therefore, we are interested in functions , , and that share the invariance properties w.r.t. a symmetry of the system dynamics, see Subsection 3.1. Then, in Subsection 3.2, we are concerned with limitations and possible remedies, which might occur if a constraint function or the stage cost do not share the invariance. Finally, in Subsection 3.3, we formulate the OCP for the example of the mobile robot. Here, we define a set of admissible control functions and show existence of an optimal control.
3.1 OCPs Consistent with the Invariance of the System Dynamics
In the following definition, we precisely state what we mean by saying that the constraints and the stage cost share the invariance of the system dynamics.
Definition 10**.**
Consider the system dynamics (1), i.e. , with symmetry group . We call a function invariant w.r.t. the symmetry, if every state can be replaced by its image under the symmetry action without changing its value - independent of the particular choice of and , i.e.
[TABLE]
If the stage cost and the constraint functions are invariant, all equivalent trajectories, i.e. each motion primitive, have the same costs and remain feasible, which directly follows from Definition 10. Then, we call (OCP) consistent (with the invariance property of the system dynamics), which is motivated by the following proposition. As a direct consequence, also optimality is preserved if (OCP) is consistent.
Proposition 11**.**
Consider (OCP) and let be a symmetry group of the system dynamics. Furthermore, let and satisfy the constraints. Then, if the stage cost and the constraint functions are invariant, all pairs , , also satisfy the constraints and yield the same costs, i.e.
[TABLE]
Next, we show that the cost function is particularly simple to evaluate along trim primitives, which also explains why is sometimes called unit cost of a trim, see [14].
Proposition 12**.**
Consider a continuous stage cost and let be a symmetry group of the system dynamics (1). Further, let a constant control function and a corresponding element of the generating Lie algebra be given. Then, for given , for each state trajectory such that holds, i.e. for each trim primitive, the invariant stage cost satisfies
[TABLE]
Proof.
We have
[TABLE]
Here, the first equality follows from the trim definition and and the second equality from the invariance of . ∎
Possible choices for invariant stage costs are the following. Note that also a weighted sum leads to an invariant cost function.
- •
Minimal control effort: with being some symmetric positive definite matrix.
- •
Minimum path length
- •
Minimum “fuel consumption”, i.e. . For some systems, e.g. when models the fuel, it might be desirable to minimize the norm of instead of the norm, or a combination of both.
- •
Minimal time, i.e. . Here, we get , i.e. the final time is free. Then, is an additional real-valued optimization variable.
3.2 Pitfalls, Inconsistency, and Remedies
In Subsection 3.1, we have seen a variety of invariant stage costs. A typical representative for invariant constraints would be a (geometrical) path bridging a certain distance, which explains why motion primitives are often employed for path planning objectives, see, e.g. [14]. However, initial value problems with a fixed desired terminal state or quadratic stage costs, which are typically used for stabilization task, lead to inconsistent OCPs as shown in the following. Moreover, we explicate a modified/shifted OCP, which allows to recover consistency of the OCP if desired.
Let us start with a (classical) initial condition, i.e. or, equivalently,
[TABLE]
Note that only the first components of the function describing the boundary conditions are used in this example. The remaining components would be typically employed to enforce meeting the terminal constraint. Plugging in (and also for completeness) yields the condition , which is for , in general, not satisfied. However, a shifted OCP with initial condition – a shift with the particular group element , – is invariant, meaning that feasibility (and optimality) are preserved for the OCP with shifted constraints. This can be explained using the representation of the symmetry action in homogeneous coordinates: the translation vector cancels out while the distance is preserved under the respective orthogonal transformation. Similar arguments apply to terminal conditions and quadratic stage costs exemplarily defined as
[TABLE]
with symmetric matrices and (with and ) and references and for state and control respectively.
Example 13** (Shifted OCP).**
The robot of Example 1 shall be controlled into the final state . First, consider as a starting point. It can be easily checked that constant control drives the system to zero at final time . Now, recall the symmetry group of the robot computed in Proposition 3 and say we choose defined by . The shifted initial point is then given by , see Figure 2.
Now, the control problem is fundamentally different: does not drive the system to , in fact we are faced with the famous parallel parking problem, see, e.g. [36]. A solution could be obtained by a sequence ”turn-move-turn”, as presented in [44], but, at the moment, our focus is on finding invariances. As proposed above, to this aim we modify the terminal condition, i.e. the shifted final state becomes . Then, the previously computed solution can drive the system from to . Consequently, solutions of the OCP would remain the same, given that the stage costs are either invariant or analogously modified.
3.3 Optimal Control Problem for the Mobile Robot
In Proposition 6 we identified characteristic motions of the mobile robot, i.e. circles and straight lines, which are formally trim primitives. Since any pair of constant control values generates a trim we now switch perspective and define a class of admissible controls which guarantees that the system evolves on a finite sequence of trims. Furthermore, we show that an optimal control function exists for this class of admissible control functions.
Let us suppose that the control has to be piece-wise constant corresponding to the use of at most , , trims (which do not have to be pairwise different), i.e., for each control function , there exists a finite partition
[TABLE]
such that the control function is constant for all . Furthermore, the boundary condition is replaced by an initial and a terminal state constraint, i.e. and . Moreover, instead of the (mixed) control-state constraint , we consider the state constraint and the control constraint for a compact and convex set and a compact set with . Then, for a given optimization horizon , , and initial value , , we define the set of admissible control functions
[TABLE]
where the set of feasible control values is given by with for all . Moreover, if the time horizon is considered as an optimization variable, the set of admissible control functions is given as . Furthermore, for weighting coefficients , , and , let the stage cost by defined as
[TABLE]
with for a positive definite, symmetric matrix and an arbitrary norm.
We consider the Optimal Control Problem
[TABLE]
for and respectively. Since the stage cost is positive definite w.r.t. , the infimum is bounded from below by zero and labeled .
The following lemmata show basic properties of the OCP (14) for Example 1; namely, existence of a feasible solution and finite cost.
Lemma 14**.**
Consider Example 1 and the OCP (14). Then, if and the (finite) set of control values contains the values , , and , there exists, for each , a time horizon , , and a control function such that holds.
Proof.
Essentially, we can proceed analogously to [44, 41]. Due to convexity of and the fact that the length of the optimization horizon is not fixed, we can simply prolong such that the sequence
turn, i.e. use until the robot points towards the origin, 2. 2.
move, i.e. use until the robot reaches the origin, and 3. 3.
turn, i.e. use until the desired orientation is attained,
becomes feasible and ensures that is non-empty. Due to compactness of and and continuity of , this ensures finite costs. If , i.e. minimal/short time is not weigthed in the stage cost, the third control value is important because it allows to stay at until the final time is reached (which is typically not an optimization variable for ). ∎
In particular, Lemma 14 ensures existence and finiteness of the infimum of OCP (14). Hence, the value function is well-defined. The following lemma shows that the optimum is attained, see, e.g. [30] for a proof.
Lemma 15**.**
Consider Example 1 and the OCP (14) and let be given. Moreover, let and the (finite) set of control values contain the values , , and .
For , there exists a time horizon , , and a control function satisfying .
For and optimization horizon such that , there exists an admissible control function with .
The preceding lemmata justify a restriction to control functions with piecewise constant control values and finitely many switches, since existence of admissible control functions can still be guaranteed. Restricting to is a first step in the direction of a finite maneuver automaton, as proposed by Frazzoli. In [14], two types of motion primitives are distinguished: trim primitives, as we defined them in Definition 5, and maneuvers. Maneuvers are arbitrarily controlled trajectories that start and end on trims. Thus, they allow to concatenate trims and maneuvers alternatingly. The resulting trajectories are admissible to the system dynamics, in particular, switching from trim to maneuver or from a preceding maneuver to a trim is continuous. The finite set of motion primitives is called a maneuver automaton. It can be represented as a graph: Trim primitives are the nodes and edges correspond to maneuvers which have been designed to connect the preceding and succeeding trim.
The robot example is special, since any fixed pair of controls leads to a trim and no maneuvers are necessary to switch from one trim to another. Therefore, we focus on constructing sequences which solely consist of trim primitives.
Remark 16** (Sequence of Trim Primitives).**
Any partition of type (9) together with an arbitrary set of control values generates a sequence of trim primitives for the mobile robot, which can be transcribed as a trajectory via
[TABLE]
with for , being the Lie algebra element defined by and according to Proposition 6, and being a short notation for .
4 Model Predictive Control for the Mobile Robot
This section is essentially split into two parts. Firstly, we demonstrate the effectiveness of the techniques proposed in the preceeding two sections by considering Model Predictive Control (MPC) for the mobile robot. To this end, we consider the following MPC scheme where the terminal state is, w.l.o.g., set to zero.
Algorithm 17**.**
*Let and be given.
Set , , and .*
Solve OCP (14) to compute a minimizer , and implement . 2. 2.
Set and 3. 3.
If stop. Otherwise increment and go to Step (1)
Algorithm 17 yields the MPC closed-loop trajectory
[TABLE]
where we emphasized the dependence of the optimal and on the initial value , which we omitted in the exposition of Algorithm 17. The concatenated control function is denoted by . Key properties in MPC are recursive feasibility and convergence of the closed-loop trajectory to the desired set point . The former is important to guarantee that the feasible set of the OCP to be solved in Step (1) of Algorithm 17 is non-empty provided it was non-empty at (initial feasibility), see, e.g. [38]. For Algorithm 17, recursive feasibility is ensured by the terminal constraint, see, e.g. [27].
For the convergence of the closed-loop trajectory, we distinguish whether the coefficient weighting the process time is present in the stage cost or not. If it is, the proof is very simple and presented in Subsection 4.3. Here, we even prove finite time convergence of the MPC closed-loop trajectory given by (15) to the origin.
Secondly, in Subsection 4.4 we give further comments on the usefullness of the proposed techniques if the stage cost is inconsistent with the invariance induced by the symmetry action. Here, we also eschew the terminal equality constraint, which is – in general – not desirable from a numerical point of view.
4.1 Minimizing Energy and Fuel Consumption
We consider the Optimal Control Problem
[TABLE]
Before we rigorously show that the origin is asymptotically stable w.r.t. the MPC closed loop (in Theorem 20), we establish that, for each optimal solution of the OCP (16), the control effort is uniformly distributed on the whole time interval in the following proposition.
Proposition 18** (Necessary Optimality Condition).**
Let be an admissible control function for the OCP (16). Then, for given weighting coefficients and , either exhibits uniform control effort, i.e.
[TABLE]
or we can construct an admissible control function with
[TABLE]
i.e. strictly smaller objective value.
Proof.
Since is piecewise constant there exists a finite partition
[TABLE]
such that is constant on each interval , . Assume, w.l.o.g., that the control function exhibits values on and on such that holds. Then, we have the costs
[TABLE]
on the time interval . In the following, we construct a piecewise constant control such that the corresponding trajectory reaches the same point at time but produces less costs — a contradiction to optimality of , which shows the claimed assertion. To this end, we exploit the property
[TABLE]
for arbitrary if is constant on , see [41].
W.l.o.g. let hold. Then, we replace by a scaled version; namely with sufficiently close to one such that all following quantities are well-defined. Moreover, we enlarge the length of the interval by the factor . This implies, using the Identity (19), that the same path — starting at — is traversed in the state space but with a slower speed. Simultaneously, we enlarge and reduce the length of the respective time interval such that the same path is traversed and the overall length of both intervals remains unchanged. The latter implies if the scaling factor used for is called . In particular, we get the equations
[TABLE]
Using the representation of displayed in (20) to rewrite the factor in front of the first summand, we get the costs
[TABLE]
Subtracting the original value (18) and showing that the resulting expression is strictly less than zero completes the proof. Hence, we have to establish the inequality
[TABLE]
Then, replacing using (20) and dividing by yields
[TABLE]
The right hand side is independent of the scaling factor and strictly larger than zero, while the left hand side converges to zero for . In conclusion, the inequality is satisfied for sufficiently small . ∎
The following corollary extends the assertion of Proposition 18 to a more general set of admissible control functions, which turns out to be helpful for the proof of the following theorem.
Corollary 19**.**
Proposition 18 also holds for except on a set of measure zero.
Proof.
Note that we have not used that there were at most switches. Hence, the line of reasoning works for arbitrary piecewise control functions — a class, which is dense in , which allows us to conclude the assertion. ∎
Proposition 18 and its extension formulated in Corollary 19 are key ingredients to prove convergence of the MPC closed-loop trajectory to the origin since they allow us to derive a decrease of the value function.
Theorem 20**.**
Let , , and an initial state be given. Then, we have recursive feasibility for the MPC algorithm 17 and, if is initially feasible, the corresponding MPC closed-loop trajectory converges to the origin, i.e. exists for all and holds.
Proof.
Since the optimal control problem contains a terminal equality constraint, recursive feasibility holds provided that initial feasibility is given. To prove the convergence, note that Proposition 18 implies that, for an optimal control function , is constant for almost all . Hence, adding this as a constraint to the optimal control problem, does not change the set of all minimizers (the minimizer).
If we now modify the optimal control problem by setting (but leaving as it is), each admissible control function remains admissible and is assigned to an objective value, which is upper bounded by its counterpart of the original OCP. In addition, we further relax the constraints by allowing arbitrary -functions, see Corollary 19. Clearly, the set of minimizers may change. So far, we get the relation
[TABLE]
for the optimal control of the original OCP and the optimal value of the modified optimal control problem. Then, we can estimate that the control effort is only decreasing for each admissible control function if we replace the cost function by where denotes the smallest eigenvalue of the positive definite matrix . Then, dropping the artificially introduced constraint on uniform control effort w.r.t. , we get
[TABLE]
where denotes the minimal value of the OCP (16) with stage costs . Then, further reducing this value by using the simplified dynamics, see Proposition 25 in Appendix A (and still using the notation for the optimal value), we have derived the Lyapunov inequality
[TABLE]
Then, standard arguments, see, e.g. [42, 18] can be used to conclude the assertion. ∎
Next, we extend Proposition 18 to the case with additional control and state constraints as a preliminary step to show that also the assertions of Theorem 20 remain valid.
Corollary 21**.**
Let control constraints with be given such that the set is closed, convex, and contains the origin in its interior. Then, if the control function exhibits neither uniform control effort, i.e. (17), nor satisfies for almost all with the threshold value
[TABLE]
then the alternative proposed in Proposition 18 holds, i.e. is not optimal.
Proof.
The proof is a direct adaptation of the arguments used in the proof of Proposition 18 since the proposed construction is still doable as long as there exists an interval, on which the boundary of the control constraints is not yet active (for which reaching/exceeding the threshold value is a necessary condition). ∎
Remark 22** (State Constraints).**
Note that adding state constraints with , , does not affect the assertions of Proposition 18 and Corollary 21 since we have shown the following in the respective proofs: Each path in the --plane remains feasible but the optimal time parametrization w.r.t. the cost functional is attained only if the proposed necessary optimality condition holds. Therefore, feasibility w.r.t. the state constraint set is maintained (the angle is also invariant on a given path).
Also Theorem 20 remains valid. The only changes needed in the proof are the following: Firstly, one has to argue that a certain minimal decrease is automatically achieved if the condition with defined by (21) holds for almost all . Furthermore, dropping the control and state constraints before Proposition 25 (see Appendix) is applied, leads to the same lower bound and is, thus, doable. Furthermore, note that all results presented in this section also hold if control functions of class are used instead of .
Example 23**.**
We consider again the mobile robot example with states and define a control problem from initial state to final state . Stage costs are defined as
[TABLE]
We transform from Lagrange to Bolza form by introducing a new state and a second auxiliary state by
[TABLE]
Then, the cost function is and we fix . A solution is computed numerically by the graphical interface WORHP Lab of the optimization software WORHP [5, 4]. The robot dynamics are transcribed by the trapezoidal rule on an equidistant time grid with time points. An optimal solution is found after 56 outer-loop iterations of an SQP method and resulting costs are . In Figure 3, the optimal trajectories and controls are shown. Our focus is on auxiliary state : The optimal control satisfies for all , which illustrates the result of Proposition 18. Thus, the quadratic part of the control effort increases linearly with time. Note, however, that the integrated stage cost, i.e. , does not increase linearly, nor do the individual controls.
4.2 Energy and Fuel Consumption for Finite Sets of Motion Primitives
Let us now focus on the motion primitives setting. That is, we further restrict in the definition of the set of admissible control functions (13). Feasible control values have to belong to a finite set , . Moreover, in accordance with our existence results, see Lemmata 14 and 15, we assume that holds if and only if .
To this end, let us observe that, for , there exists at least one interval with non-zero control in view of the terminal equality condition. Then, we reorder the optimal control such that all indices
[TABLE]
are shifted to the end of the sequence, which can be done without loss of optimality. Using this additional condition, we know that we either reach the origin within the sampling interval or use a control function, which is non-zero for each . The latter, however, implies
[TABLE]
which ensures a decrease of at least in each MPC step. Since is finite, we get finite time convergence.
4.3 Penalization of the Process Time
In this subsection, we present a convergence proof for stage costs, in which the process time is penalized. While convergence is clear, MPC may contribute to further reduce the costs while still ensuring finite time convergence as shown in the following proposition.
Proposition 24**.**
Consider the OCP (14) with and let be given. If the OCP (14) is initially feasible, i.e. if there exists a time and a control function , , the MPC closed-loop trajectory is well-defined. Moreover, there exists a time , , such that holds.
Proof.
We use the abbreviations and . Recursive feasibility can be directly concluded from the admissibility of the shifted control sequence at the successor time instant. Moreover, we have
[TABLE]
If there exists an index such that holds, we are done. Otherwise, taking into account that is positive definite, we get
[TABLE]
using a telescope sum argument. Then, for , the term grows unboundedly, which implies that the right hand side becomes smaller than zero for sufficiently large (e.g., ) — a contradiction. ∎
4.4 MPC without Terminal Constraint and Outlook
Here, we want to highlight that motion primitives (or trims if the terminology is supposed to be solely adapted to the example of the mobile robot) were already used in [45] and the follow-up paper [44] to rigorously ensure asymptotic stability for the setting in which neither terminal constraints nor terminal costs were used. The key idea was to derive the controllability assumption initially proposed by Tuna et al. in [43] in combination with the suboptimality estimates from [18], see also [39] and [46] for the extension to the continuous-time setting. In [45], bounds on the value function in dependence of the initial condition were deduced by using the simple sequence turn-move-turn as explicated in Subsection 3.3. A key element was to use a parametric representation of the solution trajectory, which nicely corresponds to the explanation provided in Section 2.
In conclusion, combining the blueprint outlined in [44, 45] and the wording and deeper insight in the use of motion primitives to quantize the nonlinear system dynamics seems to be a very promising approach to tackle systems, for which the linearization does not contain sufficient information to fulfill the stabilization task. Here, it is worth mentioning that purely quadratic costs do, in general, not work for the example of the mobile robot, see [32]. Moreover, the proposed symmetry exploiting technique can also be used to verify initial feasibility, to characterize a set of initially feasible states, and to rigorously treat obstacle avoidance problems.
5 Predictive Control based on Trim Primitives: Numerical Results
Numerical results for the mobile robot example are shown to illustrate the effect of quantizing the set of control values to trim primitives.
5.1 Quantization of the Set of Feasible Control Values
Let the initial value , the terminal set , and the stage cost be given. Moreover, we use the set
[TABLE]
with . Then, the minimal optimization horizon such that initial feasibility is ensured is . In the following, we use the time shift . Moreover, if the optimal control function is not unique, we choose a sequence with maximal costs on the interval :
At time , we get and . Hence, we have and the closed-loop costs given by . 2. 2.
At time , we get , , and . Hence, we have reduced the overall costs from to . The closed-loop costs on are . 3. 3.
At time , we get on and for (using our convention since is not unique).
The following values are summarized in Table 1.
The closed-loop cost for reaching the origin are significantly less than the costs associated to the OCP at time ( in comparison to ). The reason is that the control effort is constantly reduced by using MPC since there is some additional freedom to satisfy the terminal equality constraint after each MPC iteration. If a coarser discretization, e.g. , and – as a consequence – a smaller trim library is used, the origin is reached after steps and the closed-loop costs are . In conclusion, there is a trade-off between the numerical effort for solving the combinatorial OCP online (which is drastically increasing for a refined quantization) and the closed-loop performance.
5.2 Optimal Control with Trim Primitives
We consider again the dynamics of the mobile robot, cf. Example 1, to illustrate qualitatively different optimal solution built of trim primitive sequences. The parallel parking problem from to shall be solved in time units by optimization w.r.t. various cost functionals.
We restrict to the library of trim primitives as given in Table 2, stored as tuples and we consider sequences with at most 4 switches. Since algorithmic performance is not in the focus of this work, we globally search through all possible combinations of trim primitives and compute the optimal switching times in each case for which a solution can be found. Note that sequences with fewer switches can be found since two succeeding primitives might be identical. The rest trim plays an important role so that solutions which would not need time units can be prolonged so that they become feasible.
We show the solutions for minimizing the cost . The best solution is given in Figure 4. It uses the trim sequence (cf. Table 2), as can be seen in the right top picture. Alternative solutions, also with at most 4 switches, are given in Figure 5. When searching for time minimal solutions, the results depicted in Figure 6 are obtained. The control curves code the switching sequence of the solution. Note that the state plot is a projection to the -plane, i.e. non-smooth turns (edges) in trajectory have in fact a turning phase, such that the mobile robot does fulfill its nonholonomic constraints.
It can be seen that even a small library of primitives can generate different types of solutions. In Figure 5, the red solution is the turn-move-turn sequence. However, there exists a solution with lower costs (green). Other solutions have much higher costs and would not be chosen in the unconstrained scenario. However, they might become of importance as soon as obstacle avoidance is included in the problem.
In Figure 6, one can see that the optimal time is depending on the considered sequence of trim primitives.
5.3 MPC with Trim Primitives
We keep the library of trim primitives which was chosen in the previous subsection to solve the parallel parking problem by an MPC scheme now. Here we compute MPC steps and set . Exemplarily, we show the solution for minimizing with fixed horizon in every MPC step in Figure 7. The MPC scheme is able to stabilize the system in time instances. In every MPC step, the cost decreases. Additionally, at and a replanning occurs, i.e. another sequence of motion primitives becomes more efficient than the old solution. (Note that a previous solution can always be prolonged to a valid new solution with the help of the rest trim.) In the future, we would like to investigate the interplay between quantization and closed-loop performance. Larger libraries tend to higher computationally costs but potentially to a better closed-loop performance. Thus, one is interested in the trade-off of these two conflicting optimization goals.
6 Conclusions
We propose to exploit inherent symmetries by using motion primitives in the design, analysis, and numerical treatment of MPC schemes. The key advantage in doing so is that trajectories for genuine nonlinear systems can be easily represented by one-parameter groups. W.r.t. the design and analysis of MPC schemes, we in particular re-interpreted the results presented in [11, 20] (with terminal constraints and costs) and [45] (without terminal constraints and costs) to lay the foundation for their generalization to a larger class of systems. Hereby, it is important to check consistency of stage costs and constraints w.r.t. the symmetries of the system dynamics. Regarding algorithmic aspects, the quantization of the state space by choosing tailored maneuvers encoded by motion primitives is an essential and helpful concept to encounter, on the one hand, the curse of dimensionality in dynamic programming. On the other hand, nonlinear MPC is typically realized via local optimization methods. Thus, getting stuck in local optima is a common problem which can be circumvented by finding (approximations to) alternative solutions via globally searching on a motion primitive graph, cf. e.g. [26]. A potential next step to further enhance the compatability of motion primitives and MPC is to take tailored numerical techniques, see, e.g. [25], into account.
We studied the representative example of the mobile robot in depth in order to illustrate our findings. Here, we derived new necessary optimality conditions for the open-loop OCP and provided numerical simulations to shed some light on the trade-off between performance and numerical effort, which corresponds to setting up a suitable library of motion primitives and solving the respective mixed integer OCP.
In conclusion, motion primitives seem to be a (very) promising approach to systematically verify stability conditions like cost controllability as outlined in [44] without using the proper wording. Furthermore, trims correspond to inherent optimality/turnpike properties of trims, which are useful for the structural analysis of optimal control problems, see, e.g. [8]. Moreover, the proposed combination of motion primitives and MPC is also beneficial if (moving) obstacles or potentially non-convex constraints have to be considered as, e.g., in distributed MPC and for the efficient construction of alternative solutions in order to avoid local minima in the numerical solution of the OCP to be solved in each MPC step.
Appendix
Appendix A OCP with Simplified Dynamics
In this section, we consider an auxiliary OCP, which is needed in order to prove our main result Theorem 20. The auxiliary OCP is constrained by the system dynamics (2), in which the -dependence of the first two components is replaced by the additional control . Moreover, is not penalized in the objective function.
Proposition 25**.**
We consider the optimal control problem
[TABLE]
w.r.t. subject to the boundary conditions , and, for almost all , the differential equation
[TABLE]
Then, the value function is given by the positive definite function
[TABLE]
Proof.
Since the OCP in consideration is decoupled, we can split it into the following two optimal control problems:
[TABLE]
[TABLE]
Firstly, we solve (OCP 1): We assume , i.e. a non-zero initial condition. Otherwise the assertion holds trivially since the stage cost is bounded from below by zero. The Hamiltonian is given by
[TABLE]
Differentiation of the Hamiltonian w.r.t. the state variables , yields the adjoint equation , i.e. the adjoints and are constant. Moreover, Pontryagin’s maximum principle also yields the necessary optimality conditions
- (1)
\mathcal{H}_{u_{1}}=0\Longleftrightarrow\lambda_{0}u_{1}^{\star}(t)=-\frac{1}{2}\Big{(}\lambda_{1}\cos(u_{3}^{\star}(t))+\lambda_{2}\sin(u^{\star}_{3}(t))\Big{)} 2. (3)
\mathcal{H}_{u_{3}}=0\Longleftrightarrow u_{1}^{\star}(t)\Big{(}\lambda_{2}\cos(u^{\star}_{3}(t))-\lambda_{1}\sin(u^{\star}_{3}(t))\Big{)}=0
Firstly, we observe that the right hand side of the differential equation is equal to zero if holds. Combining this observation with the assumed non-zero initial condition and the imposed terminal constraint implies that the set
[TABLE]
has strictly positive measure .
Next, we show that , which allows us to set w.l.o.g. in the following: Suppose that holds. Moreover, let us assume that also equals zero. Then, Conditions (1) and (3) imply in view of for all — a contradiction since has non-zero measure. Analogously, we also get a contradiction for . Hence, we have . Then, Conditions (1) and (3) imply and . Combining these two equations yields — again a contradiction. Thus, let in the following.
For each , Condition (3) yields , which implies
[TABLE]
Then, the terminal conditions read
[TABLE]
which can be rewritten as
[TABLE]
by using Condition (1). Then, plugging (30) into these equations leads to
[TABLE]
using the formulas
[TABLE]
Necessarily, this leads either to for or to a contradiction otherwise. Consequently, we get
[TABLE]
for and for and . Hence, in both cases we obtain the objective value
[TABLE]
which is minimal for . Hence, holds for the optimal control, which shows that the optimal value of (OCP 1) is .
Next, we consider (OCP 2), which is a linar quadratic OCP with zero-terminal constraint. Here, the Hamiltonian is . Again, the differentiation of w.r.t. the state yields that the adjoint is constant. Moreover, the abnormal multiplier can be set to one (otherwise imposes also — a contradiction). Hence, we get from the necessary optimality condition . Then, the terminal constraint implies and, thus, .
Adding up the two computed optimal values shows the assertion. ∎
Acknowledgements
K. Flaßkamp thanks L. Lüttgens and S. Roy for helpful discussions on the mobile robot example, in particular for the derivation of the Lie algebra representation used to derive the trim primitives. K. Worthmann thanks F. Rußwurm for helpful discussions on the mobile robot example.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] A. Astolfi. Discontinuous control of nonholonomic systems. Systems & control letters , 27(1):37–45, 1996.
- 2[2] A. M. Bloch. Nonholonomic mechanics and control . Springer, 2003.
- 3[3] F. Bullo and A. D. Lewis. Geometric Control of Mechanical Systems , volume 49 of Texts in Applied Mathematics . Springer, 2004.
- 4[4] C. Büskens and M. Knauer. From WORHP to Trans WORHP. In Proceedings of the 5th International Conference on Astrodynamics Tools and Techniques , May 2012.
- 5[5] C. Büskens and D. Wassel. The ESA NLP solver WORHP. In Modeling and optimization in space engineering , pages 85–110. Springer, 2012.
- 6[6] S. Di Cairano and I. V. Kolmanovsky. Real-time optimization and model predictive control for aerospace and automotive applications. In 2018 Annual American Control Conference (ACC) , pages 2392–2409, 2018.
- 7[7] L. E. Dubins. On curves of minimal length with a constraint on average curvature, and with prescribed initial and terminal positions and tangents. American Journal of Mathematics , 79(3):497–516, 1957.
- 8[8] T. Faulwasser, K. Flaßkamp, S. Ober-Blöbaum, and K. Worthmann. Towards velocity turnpikes in optimal control of mechanical systems. In Proc. 11th IFAC Symp. Nonlinear Control Systems (NOLCOS) , 2019.
