Optimal Control Problems with Symmetry Breaking Cost Functions
Anthony Bloch, Leonardo Colombo, Rohit Gupta, Tomoki Ohsawa

TL;DR
This paper explores symmetry reduction in optimal control problems on Lie groups with partial symmetry breaking costs, using variational principles to derive continuous and discrete equations, and demonstrates applications like obstacle-aware motion planning.
Contribution
It introduces a variational framework for symmetry breaking in optimal control on Lie groups, deriving new continuous and discrete Lie-Poisson equations and applying them to practical motion planning.
Findings
Derived Euler--Poincaré equations from variational principles.
Obtained discrete-time Lie-Poisson equations via time discretization.
Demonstrated application in obstacle-aware motion planning.
Abstract
We investigate symmetry reduction of optimal control problems for left-invariant control systems on Lie groups, with partial symmetry breaking cost functions. Our approach emphasizes the role of variational principles and considers a discrete-time setting as well as the standard continuous-time formulation. Specifically, we recast the optimal control problem as a constrained variational problem with a partial symmetry breaking Lagrangian and obtain the Euler--Poincar\'e equations from a variational principle. By applying a Legendre transformation to it, we recover the Lie-Poisson equations obtained by A. D. Borum [Master's Thesis, University of Illinois at Urbana-Champaign, 2015] in the same context. We also discretize the variational principle in time and obtain the discrete-time Lie-Poisson equations. We illustrate the theory with some practical examples including a motion planning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHomotopy and Cohomology in Algebraic Topology · Nonlinear Waves and Solitons · Advanced Algebra and Geometry
Optimal Control Problems with Symmetry Breaking Cost Functions
Anthony M. Bloch
A. M. Bloch: Department of Mathematics, University of Michigan, Ann Arbor, MI 48109-1043, USA
,
Leonardo J. Colombo
L. J. Colombo: Department of Mathematics, University of Michigan, Ann Arbor, MI 48109-1043, USA
,
Rohit Gupta
R. Gupta: Department of Aerospace Engineering, University of Michigan, Ann Arbor, MI 48109-2140, USA
and
Tomoki Ohsawa
T. Ohsawa: Department of Mathematical Sciences, The University of Texas at Dallas, Richardson, TX 75080-3021, USA
Abstract.
We investigate symmetry reduction of optimal control problems for left-invariant control systems on Lie groups, with partial symmetry breaking cost functions. Our approach emphasizes the role of variational principles and considers a discrete-time setting as well as the standard continuous-time formulation. Specifically, we recast the optimal control problem as a constrained variational problem with a partial symmetry breaking Lagrangian and obtain the Euler–Poincaré equations from a variational principle. By applying a Legendre transformation to it, we recover the Lie–Poisson equations obtained by A. D. Borum [Master’s Thesis, University of Illinois at Urbana-Champaign, 2015] in the same context. We also discretize the variational principle in time and obtain the discrete-time Lie–Poisson equations. We illustrate the theory with some practical examples including a motion planning problem in the presence of an obstacle.
Key words: Euler–Poincaré equations, Lie–Poisson equations, Optimal control, Symmetry reduction
AMS s. c. (2010): 70G45, 70H03, 70H05, 37J15, 49J15
1. Introduction
Symmetry reduction of optimal control problems (OCPs) for left-invariant control systems on Lie groups has been studied extensively over the past couple of decades. Such symmetries are usually described as an invariance under an action of a Lie group and the system can be reduced to a lower-dimensional one or decoupled into subsystems by exploiting the symmetry (see, e.g., [2], [4], [12], [14], [23], [24], [28], , [30], [31]). Symmetry reduction of OCPs is desirable from a computational point of view as well. Given that solving OCPs usually involves iterative/numerical methods such as the shooting method (as opposed to solving a single initial value problem), reducing the system to a lower-dimensional one results in a considerable reduction of the computational cost as well.
The goal of this work is to study symmetry reduction of OCPs for left-invariant control systems on Lie groups with partially broken symmetries, more specifically, cost functions that break some of the symmetries but not all. Symmetry breaking is common in several physical contexts, from classical mechanics to particle physics. The simplest example is the heavy top dynamics (the motion of a rigid body with a fixed point in a gravitational field), where due to the presence of gravity, we get a Lagrangian that is -invariant but not -invariant, contrary to what happens for the free rigid body. Reduction theory for Lagrangian/Hamiltonian systems on Lie groups with symmetry breaking was developed in [17], [25], [27]. In the context of motion planning, the symmetry breaking appears naturally in the form of a barrier function as we shall see in Section 4.2.
The results in this paper are the Lagrangian/variational counterpart of those in [6], [7]; we also develop a discrete-time version of the results. From the Lagrangian point of view, we obtain the Euler–Poincaré equations from a variational principle and by using a Legendre transformation, we obtain the Lie–Poisson equations. By discretizing the variational principle in time, we obtain the discrete-time Lie–Poisson equations.
We also work out some practical examples to illustrate the theory, including an optimal control description of the heavy rigid top equations as well as a motion planning problem with obstacles; the latter provides an example of the symmetry breaking by a barrier function mentioned above.
1.1. Outline
The paper is organized as follows. In Section 2, we introduce some preliminaries about geometric mechanics on Lie groups. In Section 3, we study the Euler–Poincaré reduction of OCPs for left-invariant control systems on Lie groups, with partial symmetry breaking cost functions and we also obtain the Lie–Poisson equations by using a Legendre transformation. Furthermore, we consider the discrete-time case and obtain the discrete-time Lie–Poisson equations. In Section 4 some practical examples are considered to illustrate the theory. Finally, in Section 5, we make some concluding remarks.
2. Preliminaries
Let be an -dimensional differentiable manifold with local coordinates , the configuration space of a mechanical system. Denote by its tangent bundle with induced local coordinates . Given a Lagrangian , the corresponding Euler–Lagrange equations are
[TABLE]
which determine a system of implicit second-order differential equations.
An indispensable tool in the study of mechanical systems is symmetry reduction. Particularly, when the configuration space is a Lie group, one can reduce the Euler–Lagrange equations, which generally give a system of second-order equations, to a system of first-order ones on its Lie algebra.
Let be a Lie group and be its Lie algebra. Let be the left translation by an element , i.e., , for any . Note that the map is a diffeomorphism on and is also a left action of on itself. The tangent map of at is denoted by . Similarly, the cotangent map of at is denoted by . A vector field is called left-invariant, if , for any , . Let be a left action of on . A function is said to be -invariant, if , for any .
If we assume that the Lagrangian is -invariant under the left action of on , then we can obtain a reduced Lagrangian , where . We can now obtain the reduced Euler–Lagrange equations, commonly known as the Euler–Poincaré equations (see, e.g., [3], [18], [26]), which are given by111For an infinite-dimensional Lie algebra, one generally uses the notation instead of the notation (see, e.g., [5]).
[TABLE]
The Euler–Poincaré equations together with the reconstruction equation are equivalent to the Euler–Lagrange equations on . If we assume that the reduced Lagrangian is hyper-regular, then we can obtain the reduced Hamiltonian (by using a Legendre transformation) given by
[TABLE]
where . The Euler–Poincaré equations (2.1) can now be written as the Lie–Poisson equations (see, e.g., [3], [18], [26]), which are given by
[TABLE]
3. Optimal Control Problems on Lie Groups
We will first define a left-invariant control system on , which is assumed to be -dimensional.
Definition 1**.**
A left-invariant control system on is given by
[TABLE]
where and is a curve in the vector space . More precisely, if , then is given by
[TABLE]
where and the -tuple of control inputs take values in .
Remark**.**
If , then the left-invariant control system is under-actuated otherwise it is fully-actuated.
In what follows, we will fix , , with and . Consider the following OCP
[TABLE]
where the following assumptions hold:
- (i)
is a -invariant function (under a suitable left action of on , which will be defined shortly) and is also sufficiently regular. 2. (ii)
(potential function) is not a -invariant function and is also sufficiently regular. 3. (iii)
The potential function also depends on a parameter that may be considered to be an element of the dual space of a vector space . Hence, we define the extended potential function as , with . 4. (iv)
There is a left representation of on , i.e., is a homomorphism. Hence, there is a left representation of on , i.e., , defined as the adjoint of , i.e., for any , we define as the adjoint of , i.e., for any and
[TABLE]
As a result, there is a left action of on defined as
[TABLE] 5. (v)
The extended potential function is -invariant under (3.1), i.e., , for any , or more concretely , for any and . 6. (vi)
The potential function is invariant under the left action of the isotropy group
[TABLE]
Remark**.**
Note that is a trivial vector bundle over and define the left action of on as follows
[TABLE]
Throughout the paper, we assume that is -invariant under (3.2), i.e., , for any .
3.1. Euler–Poincaré Reduction
We can solve (P1) as a constrained variational problem using the method of Lagrange multipliers (see, e.g, [3], [23]). Define the augmented Lagrangian as follows
[TABLE]
where . Define the left action of on as follows
[TABLE]
Under the assumption (v), it follows that is -invariant under (3.3), i.e., , for any . In particular, under the assumption (vi), it follows that the augmented Lagrangian is -invariant under (3.3), i.e., , for any . We can now obtain the reduced augmented Lagrangian , which is given by
[TABLE]
where and with a slight abuse of notation, we write and . A normal extremal for (P1) satisfies the following Euler–Poincaré equations (see Theorem 1 below for a proof)
[TABLE]
where is the momentum map corresponding to the left action of on defined using the left representation of on , i.e., for any , and
[TABLE]
with being the infinitesimal generator of the left action of on and is defined as the adjoint of , which is the representation induced by the left representation of on . Note that the solution to (3.5) is given by . Also, note that using the notation of [17], we have . For more details, see [10], [17], [18]. To summarize, we have the following theorem.
Theorem 1**.**
A normal extremal for (P1), under the assumptions (i)–(vi), satisfies the following Euler–Poincaré equations
[TABLE]
Proof.
The proof follows arguments similar to the ones given in [17], [23]. Consider the following variational principles:
- (a)
The variational principle
[TABLE]
holds for all variations of (vanishing at the endpoints) and . 2. (b)
The constrained variational principle
[TABLE]
holds using variations of and of the form
[TABLE]
where vanishes at the endpoints.
We will first show that the variational principle (a) implies the constrained variational principle (b). We begin by noting that is -invariant under (3.3), i.e., , for any and , so the integrand in the variational principle (a) is equal to the integrand in the constrained variational principle (b). However, all variations of vanishing at the endpoints induce and are induced by variations of of the form , with . The relation between and is given by (see Proposition 5.1 in [5], which is Lemma 3.2 in [17]). So, if the variational principle (a) holds and if we define and , then by Proposition 5.1 in [5], we have . In addition, we have . Hence, the variational principle (a) implies the constrained variational principle (b).
A normal extremal for (P1) satisfies the variational principle (a) and hence, the constrained variational principle (b), by the above discussion. So, we have
[TABLE]
where we have used integration by parts along with the fact that and so, we have
[TABLE]
Finally, by taking the time derivative of , we have
[TABLE]
∎
There are two special cases of Theorem 1 that are of particular interest and we state them as corollaries.
Corollary 1**.**
Let and be the adjoint representation of on , i.e., , for any . Then, the Euler–Poincaré equations (3.7)–(3.8) give the following equations
[TABLE]
Proof.
We first begin by noting that and is the coadjoint representation of on , i.e., , for any . We also have , for any and it follows that , for any . From (3.6), we have
[TABLE]
which gives . ∎
Corollary 2**.**
Let and be the coadjoint representation of on , i.e., , for any . Then, the Euler–Poincaré equations (3.7)–(3.8) give the following equations
[TABLE]
Proof.
We first begin by noting that and is the adjoint representation of on , i.e., , for any . We also have , for any and it follows that , for any . From (3.6), we have
[TABLE]
which gives . ∎
The Euler–Poincaré equation (3.7) is not particularly feasible for describing the time evolution of and because the equations for them are combined into a single equation. However, assuming an additional structure on the Lie algebra , we may decouple the equations for and (see [4] for a similar approach applied to the standard Euler–Poincaré equation):
Proposition 1**.**
Assume that such that
[TABLE]
then the time evolution of and in (3.7) are given by the following equations
[TABLE]
where .
Proof.
It is easy to verify that such that
[TABLE]
It is also easy to verify that and by definition . We now have a splitting of the left hand side of (3.7) in and . We have the following
[TABLE]
where . By using (3.9), we have , , and . We can also split in and . We now have a splitting of the right hand side of (3.7) in and . So, (3.7) splits into the following equations
[TABLE]
∎
Remark**.**
Note that semisimple Lie algebras admit a Cartan decomposition, i.e., if is semisimple, then such that
[TABLE]
where is the eigenspace of the Cartan involution and is the eigenspace of the Cartan involution . In addition, the Killing form is positive definite on and negative definite on . So, connected semisimple Lie groups are potential candidates that satisfy the assumption of Proposition 1. Conversely, a Cartan decomposition determines a Cartan involution (see, e.g., [19]). For more details, see [11], [16], [19]. Also, note that the roles of and can be reversed in Proposition 1.
3.2. Legendre Transformation
If we assume that the reduced Lagrangian is hyper-regular, then we can obtain the reduced Hamiltonian (by using a Legendre transformation) given by
[TABLE]
where , with . The Euler–Poincaré equations (3.7)–(3.8) can now be written as the Lie–Poisson equations (see, e.g., [13], [17]), which are given by
[TABLE]
Remark**.**
The Lie–Poisson equations (3.10)–(3.11) are also obtained in [6], [7] using Pontryagin’s maximum principle. For more details, see [6], [7].
3.3. Discrete Lagrange–Pontryagin Principle
Consider the discrete-time version of
(P1) given by
[TABLE]
where is the time-step, is the retraction map (see, e.g., [8], [9], [21], [20], [22]) and the functions and satisfy assumptions (i)–(vi).
We can solve (P2) using a discrete analogue of the Lagrange–Pontryagin variational principle. Define the augmented cost function as follows
[TABLE]
where the augmented Lagrangian is defined as follows
[TABLE]
Define the left action of on as follows
[TABLE]
Under the assumption (v), it follows that is -invariant under (3.13), i.e., , for any . In particular, under the assumption (vi), it follows that the augmented Lagrangian is -invariant under (3.13), i.e., , for any . We can now obtain the reduced augmented Lagrangian , which is given by
[TABLE]
where , and with a slight abuse of notation, we write and . A normal extremal for (P2) satisfies the following discrete-time Lie–Poisson equations (see Theorem 2 below for a proof)
[TABLE]
where . To summarize, we have the following theorem but before proceeding further, we recall a lemma, which will be used in its proof.
Lemma 1** ([8], [9], [22]).**
The following properties hold:
[TABLE]
where .
[TABLE]
where and is the inverse right trivialized tangent of .
Theorem 2**.**
A normal extremal for (P2), under the assumptions (i)–(vi), satisfies the following discrete-time Lie–Poisson equations
[TABLE]
where .
Proof.
Consider the following variational principles:
- (a)
The variational principle
[TABLE]
holds for all variations of (vanishing at the endpoints), (induced by the variations of ) and . 2. (b)
The variational principle
[TABLE]
holds for all variations of (induced by the variations of vanishing at the endpoints), and of the form
[TABLE]
where vanishes at the endpoints.
We will first show that the variational principle (a) implies the variational principle (b). We begin by noting that is -invariant under (3.13), i.e., , for any and , so the summand in the variational principle (a) is equal to the summand in the variational principle (b). In addition, we have . Hence, the variational principle (a) implies the variational principle (b).
A normal extremal for (P2), satisfies the variational principle (a) and hence, the variational principle (b). So, we have
[TABLE]
where we have used Lemma 1, the analogue of integration by parts in the discrete-time setting along with the fact that and so, we have
[TABLE]
Finally, after a simple calculation, we have
[TABLE]
∎
As in the continuous-time case, there are are two special cases of Theorem 2 that are of particular interest and we state them as corollaries.
Corollary 3**.**
Let and be the adjoint representation of on , i.e., , for any . Then, the discrete-time Lie–Poisson equations (3.18)–(3.19) give the following equations
[TABLE]
where .
Proof.
See Corollary 1. ∎
Corollary 4**.**
Let and be the coadjoint representation of on , i.e., , for any . Then, the discrete-time Lie–Poisson equations (3.18)–(3.19) give the following equations
[TABLE]
where .
Proof.
See Corollary 2. ∎
4. Examples
We will now consider two examples.
4.1. The Heavy Top as an Optimal Control Problem
We will now show that the heavy top equations can also be obtained from OCPs for left-invariant control systems on Lie groups, with partial symmetry breaking cost functions.
We now consider (P1), with , and with the cost function given by
[TABLE]
where , for any , , with the elements of the basis of (which is semisimple) given by
[TABLE]
which satisfy
[TABLE]
is the inertia tensor of the top222It is calculated with respect to the pivot, which is not in general the center of mass., is the mass of the body, is the acceleration due to gravity, is the vertical unit vector, is the unit vector from the point of support to the direction of the body’s center of mass (constant) in body coordinates and is the length of the line segment between these two points.
Under the dual pairing, where , for any and , the elements of the basis of are given by
[TABLE]
It is easy to verify that the potential function is -invariant but not -invariant and so, the potential function breaks the symmetry partially. We will now show how the theory developed in the paper can be applied to the OCP under consideration.
Let and be the adjoint representation of on , i.e., , for any . So, is the coadjoint representation of on , i.e., , for any . Let the extended potential function be given as follows
[TABLE]
where is identified with (see, e.g., [18, Section 5.3]). If we set in the extended potential function, we recover our original potential function. It is easy to verify that the extended potential function is -invariant under (3.1), i.e., , for any . It is also easy to verify that the potential function is invariant under the left action of the isotropy group
[TABLE]
i.e., rotations about the vertical axis . We can now see that the assumptions (i)–(vi) are satisfied. By Corollary 1, the Euler–Poincaré equations (3.7)–(3.8) (under the identifications and ) give the following equations
[TABLE]
where
[TABLE]
and is the inertia matrix. For more details, see [18], [26]. So, (4.1)–(4.2) give the following equations
[TABLE]
We do not give all the details and leave it up to reader to verify that the Lie–Poisson equations (3.10)–(3.11) are given by
[TABLE]
which are the well known heavy top equations (see, e.g., [18, Section 7.4], [26, Section 1.4]).
4.2. Motion Planning of a Unicycle with Obstacles
We study the OCP for the motion planning of a unicycle with obstacles. To avoid the obstacle, we use the barrier function approach (see, e.g, [15], [29]), which plays the role of the potential function in the cost function of the OCP.
Our model for a unicycle is a homogeneous disk rolling on a horizontal plane maintaining its vertical position (see, e.g., [3, Section 1.4]). The configuration of the unicycle at any given time is completely determined by the element (as a set) given by
[TABLE]
where represents the point of contact of the wheel with the ground and represents the angular orientation of the overall system (see Figure 1). For more details, see [3], [24]. The controlled equations for the unicycle are given by
[TABLE]
We can view (4.3)–(4.5) as a left-invariant control system on (see [24]). Note that (4.3)–(4.4) are equivalent to the nonholonomic constraint .
Let the obstacle be circular in shape and be located in the - plane. Without loss of generality, assume that the obstacle has unit radius, with its center located at the point . Let the potential function be given by
[TABLE]
where and . Let us now equip with the inner product , for any and hence, the norm , for any . Equivalently, the potential function is given by
[TABLE]
where and is given below. We now consider (P1), with , and with the cost function given by
[TABLE]
where , with the elements of the basis of (which is not semisimple) given by
[TABLE]
which satisfy
[TABLE]
Under the dual pairing, where , for any and , the elements of the basis of are given by
[TABLE]
It is easy to verify that the potential function is -invariant but not -invariant and so, the potential function breaks the symmetry partially. We will now show how the theory developed in the paper can be applied to the OCP under consideration.
Let and be the coadjoint representation of on , i.e., , for any . So, is the adjoint representation of on , i.e., , for any . Let the extended potential function be given as follows
[TABLE]
If we set in the extended potential function, we recover our original potential function. It is easy to verify that the extended potential function is -invariant under (3.1), i.e., , for any . It is also easy to verify that the potential function is invariant under the left action of the isotropy group
[TABLE]
We can now see that the assumptions (i)–(vi) are satisfied and we also have such that
[TABLE]
By Corollary 2 and Proposition 1, the Euler–Poincaré equations (3.7)–(3.8) give the following equations
[TABLE]
where
[TABLE]
and
[TABLE]
For more details, see [26, Section 14.6]. So, (4.6)–(4.8) give the following equations
[TABLE]
We do not give all the details and leave it up to reader to verify that the Lie–Poisson equations (3.10)–(3.11) are given by
[TABLE]
Remark**.**
Let us consider the case, when the OCP under consideration has no potential function. In this case, the Euler–Poincaré equations (3.7) give the following equations
[TABLE]
We will now show that the Euler–Poincaré equations (4.9)–(4.11) are exactly the Euler–Lagrange equations, when we view the OCP under consideration as a constrained variational problem. The OCP under consideration is equivalent to the following constrained variational problem
[TABLE]
The Lagrangian for (P3) is given by
[TABLE]
where is the Lagrange multiplier. A solution for (P3) must satisfy the following Euler–Lagrange equations
[TABLE]
Using the facts that , , and after a few simple calculations, the Euler–Lagrange equations (4.12)–(4.14) give the following equations
[TABLE]
which are exactly the Euler–Poincaré equations (4.9)–(4.11).
Remark**.**
For the above example, in the discrete-time setting, one would choose
[TABLE]
where
[TABLE]
and . Also, in the discrete-time setting, the extended potential function can be constructed in exactly the same way as in the above example and is given by
[TABLE]
We do not give all the details again and leave it up to reader to verify that the assumptions (i)–(vi) are satisfied. By Corollary 4, the discrete-time Lie–Poisson equations (3.18)-(3.19) give the following equations
[TABLE]
where
[TABLE]
and . For numerical purposes, one first chooses an appropriate retraction map, such as the Cayley map or the exponential map, etc. (see, e.g., [8], [9], [21], [20], [22]) and then computes the quantities and . As an example, if we choose the Cayley map as the retraction map, then we have
[TABLE]
and
[TABLE]
Note that for , the matrix representation for is given by
[TABLE]
For more details, see [8], [9], [21], [20], [22].
5. Conclusions
We studied symmetry reduction of OCPs for left-invariant control systems on Lie groups, with partial symmetry breaking cost functions and we obtained the Euler–Poincaré equations. Furthermore, by using a Legendre transformation, we obtained the Lie–Poisson equations and in the discrete-time setting, we obtained the discrete-time Lie–Poisson equations. The theory was also illustrated with some practical examples.
Acknowledgments
The research of Anthony M. Bloch was supported by NSF grants DMS-, DMS-, INSPIRE- and the Simons Foundation. The research of Leonardo J. Colombo was supported by MINECO (Spain) grant MTM --P and NSF grant INSPIRE-.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1]
- 2[2] Andrei A Agrachev and Yuri L Sachkov. Control theory from the geometric viewpoint . Springer Science & Business Media, 2004.
- 3[3] Anthony M Bloch. Nonholonomic mechanics and control . Springer Science & Business Media, 2015.
- 4[4] A M Bloch and P E Crouch. Reduction of Euler-Lagrange problems for constrained variational problems and relation with optimal control problems. In Proceedings of IEEE Conference on Decision and Control , pages 2584–2590. IEEE, 1994.
- 5[5] Anthony M Bloch, P S Krishnaprasad, Jerrold E Marsden, and Tudor S Ratiu. The Euler-Poincaré equations and double bracket dissipation. Communications in Mathematical Physics , 175(1):1–42, 1996.
- 6[6] Andy D Borum. Optimal control problems on Lie groups with symmetry breaking cost functions. master’s 634 thesis, University of Illinois at Urbana-Champaign , 2015.
- 7[7] Andy D Borum and Timothy Bretl. Geometric optimal control for symmetry breaking cost functions. In Proceedings of IEEE Conference on Decision and Control , pages 5855–5861, 2014.
- 8[8] N Bou-Rabee. Hamilton-Pontryagin integrators on Lie groups . Ph D thesis, California Institute of Technology, 2007.
