The turnpike property in nonlinear optimal control -- A geometric approach
Noboru Sakamoto, Enrique Zuazua

TL;DR
This paper introduces a geometric dynamical systems approach to analyze the turnpike property in nonlinear optimal control, providing new insights and simpler proofs for existing results.
Contribution
It develops a geometric framework to study the turnpike property, extending understanding to more general conditions and removing some restrictions on initial and target states.
Findings
Turnpike-like behavior appears in systems with hyperbolic equilibrium.
Sufficient conditions for the turnpike property are established.
Simpler proofs for existing turnpike results are provided.
Abstract
This paper presents, using dynamical system theory, a framework for investigating the turnpike property in nonlinear optimal control. First, it is shown that a turnpike-like property appears in general dynamical systems with hyperbolic equilibrium and then, apply it to optimal control problems to obtain sufficient conditions for the turnpike occurs. The approach taken is geometric and gives insights for the behaviors of controlled trajectories, allowing us to find simpler proofs for existing results on the turnpike properties. Attempts to remove smallness restrictions for initial and target states are also discussed based on the geometry of (un)stable manifold and exponential stabilizability of control systems.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsControl and Stability of Dynamical Systems · Advanced Differential Equations and Dynamical Systems · Control and Dynamics of Mobile Robots
The turnpike property in nonlinear optimal control — A geometric approach
Noboru Sakamoto [email protected]
Enrique Zuazua [email protected] Faculty of Science and Engineering, Nanzan University, Yamazato-cho 18, Showa-ku, Nagoya, 464-8673, Japan
Chair in Applied Analysis, Alexander von Humboldt-Professorship, Department of Mathematics, Friedrich-Alexander-Universität Erlangen-Nürnberg 91058 Erlangen, Germany
Departamento de Matemáticas, Universidad Autónoma de Madrid, 28049 Madrid, Spain
Chair of Computational Mathematics, Fundación Deusto, Avda Universidades, 24, 48007, Bilbao, Basque Country, Spain
Abstract
This paper presents, using dynamical system theory, a framework for investigating the turnpike property in nonlinear optimal control. First, it is shown that a turnpike-like property appears in general dynamical systems with hyperbolic equilibrium and then, apply it to optimal control problems to obtain sufficient conditions for the turnpike occurs. The approach taken is geometric and gives insights for the behaviors of controlled trajectories, allowing us to find simpler proofs for existing results on the turnpike properties. Attempts to remove smallness restrictions for initial and target states are also discussed based on the geometry of (un)stable manifold and exponential stabilizability of control systems.
keywords:
Optimal control; Nonlinear system; Turnpike.
\AtAppendix\AtAppendix\AtAppendix
††thanks: This paper was not presented at any IFAC meeting. Corresponding author: Noboru Sakamoto.
a: This work was partially funded by by JSPS KAKENHI Grant Numbers JP26289128, JP19K04446 and by Nanzan University Pache Research Subsidy I-A-2 for 2019 academic year.
b: Supported, in part, by the Alexander von Humboldt-Professorship program, by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 694126-DyCon), by grant MTM2017-92996 of MINECO (Spain), by ELKARTEK project KK-2018/00083 ROAD2DC of the Basque Government, by ICON of the French ANR and Nonlocal PDEs: Analysis, Control and Beyond, and by AFOSR Grant FA9550-18-1-0242.
,
1 Introduction
The turnpike property was first recognized in the context of optimal growth by economists (see, e.g., [28]). The turnpike theorems say that for a long-run growth, regardless of starting and ending points, it will pay to get into a growth phase, called von Neumann path, in the most of intermediate stages. It is exactly like a turnpike and a network of minor roads; "if origin and destination are far enough apart, it will always pay to get on the turnpike and cover distance at the best rate of travel …" (quoted from [9]).
In control theory, independently of the turnpike theorems in econometrics, this property was investigated as dichotomy in linear optimal control [47, 38] and later extended to nonlinear systems [1]. In optimal control, the turnpike property essentially means that the solution of an optimal control problem is determined by the system and cost function and independent of time intervals, initial and terminal conditions except in the thin layers at the both ends of the time interval (see, e.g., [5, 50, 37]). In the last decades, much progress has been made in the theory of turnpike for finite or infinite dimensional and linear or nonlinear control systems. In [35], the authors study the turnpike for linear finite and infinite dimensional systems and derive a simple but meaningful inequality, for which we term turnpike inequality. Their works are extended to finite-dimensional nonlinear systems [45], the semi-linear heat equation [36], the wave equation [20, 51], periodic turnpike for systems in Hilbert spaces[44], optimal shape design [24], optimal boundary control for hyperbolic systems [19] and general evolution equations [18]. The turnpike property draws attentions of system theory researchers from the viewpoints of model predictive control [15, 10], dissipative systems (see, e.g., [48, 49]) [3, 7, 12, 16, 17], mixed-integer optimal control [13], mechanical systems [11] and the maximum hands-off control [29, 41].
In this paper, we first show that turnpike-like behaviors naturally appear in general dynamical systems with hyperbolic equilibrium. The main technique we use is the -lemma which describes trajectory behaviors near invariant manifolds such as stable and unstable manifolds. That the turnpike-like inequality holds implies that if one fixes two ends of a trajectory close to stable and unstable manifolds and designates the time duration sufficiently large, then the trajectory necessarily converges to these manifolds to spend the most of time near the equilibrium. This is exactly the turnpike property. It should be noted that the two ends, as long as they are close to the manifold, do not need to lie in the vicinity of the equilibrium, from which it may be possible to remove locality restrictions in the research of turnpike.
We apply this inequality to a class of optimal control problems in which terminal states are not specified and the steady optimal solutions are not the origin as in [35, 36, 51, 44] as well as to a class of optimal control problems in which two terminal states are specified and the steady optimal point is the origin as in [47, 1, 45]. For both classes of problems, we employ a Dynamic Programming approach with Hamilton-Jacobi equations (HJEs). The characteristic equations for HJEs are Hamiltonian systems and the stabilizability (controllability for the second class) and detectability conditions assure that the equilibrium of the Hamiltonian systems is hyperbolic. The controlled trajectories appear in these Hamiltonian systems and we apply the turnpike result for dynamical system. Then, the existence of the trajectory satisfying initial and boundary conditions is guaranteed. In this paper, we derive sufficient conditions for optimality by using the Dynamic Programming and HJEs and by imposing a condition that guarantees the existence of the solution to the HJEs (Lagrangian submanifold property, see, e.g., [26]).
The present manuscript expands upon our conference contribution [42] incorporating a new result on the relationship between nonlinear stabilizability and existence of infinite horizon optimal control [40]. It allows one to give an estimate of the existence region of a stable manifold of hyperbolic Hamiltonian system associated with an optimal control problem, from which one may be able to predict the occurrence of turnpike (see Section 4.2). The manuscript also contains a number of examples worked out to show how the proposed geometric approach is effectively applied for turnpike analysis and appendices for necessary results in the theory of algebraic Riccati equations and for stable manifold estimate in Hamiltonian systems.
The organization of the paper is as follows. In Section 2 we review some of key tools from dynamical system theory and derive the turnpike inequality. In Section 3, we apply it to optimal control problems. Section 3.1 handles the problem where boundary state is free and Section 3.2 handles the problem where initial and boundary states are fixed. Section 4 shows turnpike analyses for a class of nonlinear systems for which target in can be taken arbitrarily large and a class of nonlinear systems for which initial states can be taken arbitrarily large. Section 5 discusses possible extensions for more general turnpike using the geometric approach.
2 Turnpike in dynamical systems
Let us consider a nonlinear dynamical system of the form
[TABLE]
where is of class (). We assume that and the hyperbolicity of at [math], namely, assume that has eigenvalues with strictly negative real parts and eigenvalues with strictly positive real parts.
It is known, as the stable manifold theorem, that there exist manifolds and , called stable manifold and unstable manifold of (1) at [math], respectively, defined by
[TABLE]
where is the solution of (1) starting at . Let , be stable and unstable subspaces in of with dimension , , respectively. It is known that , are invariant under the flow of and are tangent to , , respectively, at . See, e.g., [21, 31] for more details on the theory of stable manifold.
We will consider limiting behavior of submanifolds under the flow of and need to introduce topology for maps and manifolds. Let be a compact manifold of dimension and the space of maps, , defined on . There exists a natural vector space structure on . Since is compact, we take a finite cover of by open sets and take a local chart for with such that , , where and are the balls of radius 1 and 2 at the origin of . For a map , we define a norm by
[TABLE]
where , local representation of , and is a norm for linear maps. It is known that does not depend on the choice of finite cover and we call it norm. For maps in where is a manifold, we embed in a Euclidean space with sufficiently high dimension. Let , be submanifolds of and let . We say that * and are -close* if there exists a diffeomorphism such that , where and are inclusion maps. In this case, we use the notation .
By a -dimensional (topological) disc we mean a set that is homeomorphic to . The following lemma is known as the -lemma or inclination lemma and plays a crucial role in the theory of dynamical systems (see [31, 46]).
Lemma 1** (The -lemma).**
Suppose that is a hyperbolic equilibrium for (1). Suppose also that and are , -dimensional stable and unstable manifolds of at [math], respectively. For any -dimensional disc in , any point , any -dimensional disc transversal to at and any , there exists a such that if , contains an -dimensional disc with .
Next, we show that the turnpike behavior appears in the transition of points near the stable manifold to points near the unstable manifold if the transition duration is designated large. Let and be given points. From the stable manifold theorem, it holds that
[TABLE]
where is a constant dependent on and and is a constant independent of and .
Proposition 2**.**
- (i)
There exists a such that for every there exists a such that
[TABLE]
where is the -dimensional open ball centered at with radius . Moreover, when . 2. (ii)
There exist a such that for every there exists a such that
[TABLE]
Moreover, when . 3. (iii)
For any -dimensional disc transversal to at and any -dimensional disc transversal to at , there exists a such that for any there exist an -dimensional disc transversal to at and a -dimensional disc transversal to at such that intersects at a single point.
Proof. (i), (ii) These are consequences of the properties (2) of (un)stable manifold, which can be derived using contradiction arguments. The proofs of the convergence as also use contradictions to the facts that , are submanifolds with strictly lower dimension than .
(iii) First we take an -dimensional disc in passing through 0, a -dimensional disc in passing through 0 and an arbitrarily. From the -lemma, there exists a such that for any there exists an -dimensional disc transversal to at and a -dimensional disc transversal to at such that , . Since , it is possible to take , and so that intersects at a single point.
Remark 3**.**
It should be noted that the above statements, especially (i) and (ii), are only on finite interval . This is the major difference from the trajectories on the stable and unstable manifolds.
Theorem 4**.**
For any , any , any -dimensional disc transversal to at and any -dimensional disc transversal to at , there exists a such that for every there exist , and such that and
[TABLE]
Moreover, when .
Proof. Take the largest and the smallest in Proposition 2. We rename this as . Take arbitrary and use Proposition 2-(iii) to get a disc which is -dimensional and transversal to at and a disc which is -dimensional and transversal to at satisfying and . This is possible by taking smaller and in the proof of Proposition 2-(iii). Then, there exists a single point such that (see Fig 1). Let . Then, and by Proposition 2-(i), we have
[TABLE]
Let . Then, and
[TABLE]
This shows that
[TABLE]
and consequently,
[TABLE]
Combining (3) and (4), we get the inequality in the theorem. The last assertion follows from Proposition 2-(i) and (ii).
3 Turnpike in nonlinear optimal control
Let us consider a nonlinear control system
[TABLE]
where , are of class with , is state variables and is control input. An optimal control problem or OCP is to find a control input for (5) such that the cost functional
[TABLE]
is minimized, where we set when the existence domain of solution for (5) is strictly contained in . There are several types in OCPs depending on whether or not the terminal time is specified and whether or not the state variables are specified at the terminal time. In this paper, we consider OCPs where the terminal time is specified and two types of OCPs; one in which the state variables are free at and another in which they are fixed at . For both types of OCPs, we are interested in the relationship between the solution and corresponding trajectory of an OCP and steady-state optimum pair , which will be defined more precisely later on.
Definition 5**.**
[5]** An optimal pair has the turnpike property if for any , there exists an such that
[TABLE]
for all , where depends only on , , , , and and denotes length (Lebesgue measure) of interval.
Remark 6**.**
Turnpike inequality is to require and to satisfy
[TABLE]
for some constants and independent of , which is a sufficient condition for the turnpike property in Definition 5. Also, it should be noted that requiring (6) limits ourselves to the exponential input-state turnpike defined in [17]. **
3.1 The OCP with state variables unspecified at the terminal time
For system (5), we consider the following cost functional
[TABLE]
where and is a given vector (target). We call this problem ;
[TABLE]
Associated with , we consider a steady optimization problem
[TABLE]
We assume the following.
Assumption 7**.**
(SOP) has a solution .
Also, associated with , we can derive a Hamilton-Jacobi equation
[TABLE]
for , where , . Defining a Hamiltonian
[TABLE]
we consider the corresponding characteristic equation for (9)-(10)
[TABLE]
with , Note that since the system (5) is time-invariant, the equation corresponding to is not necessary. The following fact is readily verified.
Fact. A solution of (SOP) corresponds to an equilibrium point of (11) with .
Let , .
Assumption 8**.**
The triplet is stabilizable and detectable.
Under Assumptions 7, 8, the equilibrium is hyperbolic equilibrium for the Hamiltonian system (11) and there exist stable and unstable manifolds for (11) at which are expressed as
[TABLE]
Here, , are the stable and unstable manifold of (11) in the coordinates , where , , which is re-written as
[TABLE]
We can now state the main theorem of this subsection. Let , be canonical projections.
Theorem 9**.**
Under Assumptions 7, 8, suppose that , where is the interior of a set in , and that intersects transversally. If is taken sufficiently large, then there exists a solution to (11) satisfying and . If, moreover,
[TABLE]
then
[TABLE]
is the local optimal solution for and turnpike inequality (6) holds for some constants and which are independent of .
Proof. Let and , where and take -dimensional discs , , which correspond to , , and in Theorem 4, respectively. Then the theorem implies that for a sufficiently large , there exist , and with , such that
[TABLE]
where denotes the solution of (11) starting from . This shows that the two-point boundary value problem associated with has been solved. Let . Then, the theorem also says that there exist and such that
[TABLE]
Since , (6) holds with , where supremum is taken along the trajectory. The condition (13) guarantees that there exists a Lagrangian submanifold in a neighborhood of this trajectory and this implies the existence of solution to (9)-(10) in the neighborhood. Then, the verification theorem in Dynamic Programming (see, e.g., [2]) shows that the control is locally optimal.
Remark 10**.**
The condition (13) guarantees that the solution to (9) exists in a neighborhood of the trajectory , . The optimality of is valid only in the neighborhood. This existence theory is described using the notion of Lagrangian submanifold (see, e.g., [26]) and when one seeks for larger domain of existence, the non-uniqueness issue of solution arises. We refer to [8] for general analysis of non-unique solutions and [30, 22, 23] for non-unique optimal controls for mechanical systems. **
We next show that for small , , the solution for has a solution with turnpike property using perturbation theory of stable manifold. Let , .
Assumption 11**.**
The triplet is stabilzable and detectable.
Fact. Under Assumption 11, there is a neighborhood of in such that (SOP) has a unique solution for in the neighborhood and is stabilzable and detectable.
Corollary 12**.**
Under Assumption 11, for sufficiently small and and for sufficiently large , has a solution with the turnpike property.
Proof. From the Fact above, under Assumption 11, the Hamiltonian system (11) has stable manifold and unstable manifold at . For the linear part of the Hamiltonian system is , for which we apply the eigen structure analysis in Appendix A. Apply Lemma A.1 with , and let and as in the Appendix. Then, the tangent spaces , of , at the origin can be written as
[TABLE]
From the expression of , one can take sufficiently small so that there is an -dimensional disc in that contains the origin and in its interior. From Lemma A.2, is nonsingular and therefore, intersects transversally, which implies that there is an -dimensional disc in that intersects transversally. Let be the Hamiltonian vector field (11). As , can be arbitrarily close to with topology in an appropriate compact set. The stable manifold theory (see, e.g., [31, Theorem 6.2]) ensures that there exists a small so that there are -dimensional discs , that are close enough to , , respectively, with -topology. For this , it holds that and intersects transversally. Now, all the hypotheses in Theorem 9 are satisfied.
Next Corollary is proved in [35, 44] in the study of the turnpike property for infinite dimensional systems under slightly more restrictive conditions (controllability and observability rather than stabilizability and detectability). Their proofs are based on the estimates on adjoint variables in the linear Hamiltonian system (11) which is derived as a necessary condition of optimality. Here, we give an alternative proof using the geometric picture in Theorem 9.
Corollary 13**.**
Suppose that the system (5) is linear, that is, and with real constant matrices and . Under Assumption 11, has the global solution , for any . Moreover, turnpike inequality (6) holds.
Proof. We use some of the notations in the proof of Corollary 12. The unique solution to (SOC) is expressed as . and in (12) can be written as
[TABLE]
It is readily seen that for any and intersects transversally for any . The condition (13) is equivalent to the nonsingularity of (1,1)-block in , which is proved in Lemma A.3.
Remark 14**.**
Although the problem in Corollary 13 is linear, it is not an easy task to explicitly write down the solution for (9)-(10) except for . This corollary, however, says that the solution globally exists. 2. 2.
As is discussed in [36, 45, 33], relaxing the smallness conditions in Corollary 12 is one of major challenges in the research of nonlinear turnpike. In § 4.1, we show a class of nonlinear OCPs for which turnpike occurs for all by explicitly analyzing unstable manifold.
3.2 The OCP with state variables specified at the terminal time
In this subsection, we consider an OCP for (5) with arbitrarily specified terminal states. Let be given. Let us define cost functional
[TABLE]
and consider
[TABLE]
With Assumption 11, the corresponding steady optimization problem has a unique solution around the origin. The Hamilton-Jacobi equation associated with is
[TABLE]
The Hamiltonian in this case is
[TABLE]
and the corresponding characteristic equation for (16) is
[TABLE]
with and .
Under Assumptions 11, the Hamiltonian system (17) can be written as
[TABLE]
and the origin is a hyperbolic equilibrium with stable and unstable eigenvalues. Let and be the stable and unstable manifolds of (17) at the origin.
Theorem 15**.**
Under Assumption 11, suppose that and . If is taken sufficiently large, there exists a solution to (17) satisfying and . If, moreover,
[TABLE]
then
[TABLE]
is the local optimal solution for and turnpike inequality (6) hols for some , independent of .
Proof. Let , , and which correspond to , , and in Theorem 4, respectively. Then, for sufficiently large, there exist , and with such that a solution to (17) connecting and exists. The rest of the proof is almost the same as Theorem 9.
Corollary 16**.**
Let us additionally impose the controllability of in Assumption 11. Then, for sufficiently small and and sufficiently large , the local optimal control exists and turnpike inequality (6) holds.
Proof. We again employ the eigen structure analysis (27). The tangent spaces of and at the origin are written as
[TABLE]
The latter is obtained by showing, using the controllability of , that is strictly negative definite (Lemma A.2). Therefore, and for sufficiently small , . It is seen that the condition (18) holds for these , (making them smaller if necessary) from the analysis on in the proof of Theorem 9.
Remark 17**.**
- (i)
The linear counterpart of Corollary 16 is in [47] where they use anti-stabilizing solution for the Riccati equation. In this case, the turnpike holds for all and . It can be shown that . Note that in Corollary 16 we only need the detectability condition. Corollary 16 is obtained in [1] using Hamilton-Jacobi theory under unusual nonlinear controllability and observability conditions. Compared with their conditions, we use only the linear controllability and detectability which can be easily checked. The authors of [45] obtain similar results to Corollary 16 with more generalized terminal conditions. 2. (ii)
Corollary 16 states that the turnpike occurs for small initial and terminal states under linear stabilizability and detectability. Relaxing the smallness conditions is one of major challenges in (). In § 4.2, we will give a class of nonlinear systems for which the turnpike occurs for all initial states. This is done with the aid of the result in [40] giving an estimates on the region for stable manifold in terms of nonlinear stabilizability. In the example in § 4.2, the unstable manifold is linear and a geometric condition in Theorem 15 is readily verified.
4 Examples
4.1 Problem
In this subsection, we show a class of nonlinear systems where the turnpike occurs in for all target . Let us consider the following class of nonlinear control systems
[TABLE]
where is an Hurwitz matrix, is a function and is a control input. Assume that is stabilizable and for all . The cost function is
[TABLE]
where , are constant matrices with appropriate dimensions, is detectable and , are given constant vectors.
The corresponding Hamiltonian system for this problem is
[TABLE]
Using the stabilizability and detctability of , it can be seen that there is an equilibrium , for (21). At this equilibrium, the linearized matrix is
[TABLE]
where is an symmetric matrix and therefore, it can be seen that it is a hyperbolic equilibrium.
Let , and be solutions for
[TABLE]
with , , and being Hurwitz. Using a linear coordinate transformation (see Appendix A)
[TABLE]
the Hamiltonian system (21) is rewritten as
[TABLE]
where , , are appropriately computed higher order terms. Since =0, , for all , , the unstable manifold at the equilibrium is the affine space , or
[TABLE]
Since is nonsingular, which is shown using Lemma A.2 and Sylvester’s determinant identity, for any , , intersects transversally. Now, using Theorem 9, for any , , if the initial point is close enough to , the optimal control for (19)-(20) possesses the turnpike property.
As an example of the class of systems, a turnpike trajectory for a nonlinear optimal control problem
[TABLE]
is depicted in Fig. 2, where a solution of (SOP) is .
4.2 Problem
Next, we show a class of nonlinear control systems for which estimates on (un)stable manifold of Hamiltonian systems obtained in [40] are effective for the prediction of turnpike.
Let us consider an -dimensional system represented in Byrnes-Isidori normal form [4] for globally exponentially minimum phase nonlinear systems
[TABLE]
where and is a smooth map with . We assume that is globally exponentially stable. It is known that (23) is globally exponentially stabilizable via a smooth feedback. Therefore, representing , for a cost functional
[TABLE]
the associated Hamiltonian system is hyperbolic at the origin if and the matrix defining -dynamics is a detectable pair. If, in addition, and satisfy the growth condition in Proposition B.1-(iv) with respect to , the stable manifold of the Hamiltonian system satisfies . Therefore, from Corollary 16, the OCP has a solution for all and for sufficiently small that exhibits turnpike if the zero-state detectability condition is satisfied and is taken large enough.
As a numerical example, consider (22a), which is in Byrnes-Isidori normal form (see e.g., [4]), with
[TABLE]
Introducing a cut-off function on , the result in [40] is applied to confirm that the turnpike occurs for all initial condition and terminal states , where is the unstable manifold of the Hamiltonian system at the origin. Similarly to the previous subsection, is described as
[TABLE]
Figs. 3, 4 show the turnpike trajectory of the optimal control problem (22a)-(24) with , . In Fig. 4, , , , are depicted for while the last figure shows for . From these figures, one sees that starting from at , the states and costates rapidly grow during the time span and go to the steady optimal (the origin) by the time and then, the states reach the destination at . The peak of this growth increases as increases. This growth of states is called "peaking phenomenon" of nonlinear stabilization [43] and it is interesting to see that peaking phenomenon appears in turnpike trajectory.
5 Discussions
The geometric approach proposed in the present paper may be applied to more general cases where turnpike phenomena need more sophisticated analyses. Here, we discuss two kinds of extensions.
5.1 Global analysis when (SOP) admits multiple solutions
When (SOP) admits multiple solutions, multiple equilibria appear in associated Hamiltonian systems. If they are all hyperbolic, the -lemma still applies to draw pictures of flows around stable and unstable manifolds that are separatrices dividing the phase space (see, e.g., [31, p.87 Corollary 1]).
Let us consider for
[TABLE]
Associated Hamiltonian system has three equilibrium points; , and , the first two of which are the global solution of (SOP) and hyperbolic. Fig. 5 shows stable and unstable manifolds, closed orbits around and heteroclinic orbits connecting and .
From this figure and using the geometric method in the present paper, one immediately sees that for any initial point and final point , solution for with large exists. For instance, trajectory in - space and corresponding optimal input are depicted in Fig. 6 for , . Although the input response looks like turnpike, the response of for is not stationary but steady motion with nonzero velocity.
Fig. 7 shows optimal trajectory and control response for , . Nonzero control is necessary to drive against stable vector field. The ratio of the time duration for nonzero control for the overall horizon can be arbitrarily small as and in this sense, this can be also considered turnpike phenomenon.
As for , when multiple global minimizers exist, an interesting question is raised in [32] as to which minimizer attracts turnpike for wider initial conditions. It is interesting to study how the geometry of these invariant manifolds affects turnpike occurrence and its strength in terms of the question.
5.2 Non-hyperbolic Hamiltonian systems
In [11], a concept of velocity turnpike or time-varying turnpike arising in mechanical systems is proposed combining trim primitives and turnpike properties. Motivated by that, the authors in [34] consider turnpike properties when detectability (ovservability) is not satisfied. A common feature in these cases is that associated Hamiltonian systems have zero eigenvalues. It is then interesting to consider the application of the -lemma for normally hyperbolic invariant manifolds [6] combining the classification result on Hamiltonian and symplectic matrices [25].
6 Conclusions
In this paper, using techniques from dynamical system theory such as invariant manifolds and the -lemma, we showed that turnpike-like behavior naturally appears in hyperbolic dynamical systems. This is then applied to analyze Hamiltonian systems describing controlled trajectories to obtain sufficient conditions for optimal controls yielding the turnpike to exist. The framework proposed in the paper is geometric and an alternative to existing ones. Using the framework, we showed classes of nonlinear systems for which target or initial states can be taken arbitrarily large.
Since our interests were to discover geometric nature in turnpike, we focused on OCPs without constraints and exponential turnpike. Future works include applications of this approach to more specific problems and considering OCPs with constraints, for which we mention an attempt to analyze turnpike in the maximum hands-off control [41]. Acknowledgement. The authors would like to thank Emmanuel Trélat and Lars Grüne for their comments on the early version of the manuscript. The authors are also grateful to Dario Pighin for valuable discussions.
Appendix
Appendix A Results related with Riccati equations and linear Hamiltonian systems
Let us consider Riccati equation
[TABLE]
where are constant matrices with . Suppose that is stabilizable and is detectable. The following are known (see, e.g., [14, 27, 39]).
Lemma A.1**.**
- (i)
There is a solution to (26) such that is Hurwitz. 2. (ii)
Let be a solution to a Lyapunov equation
[TABLE]
then is a symplectic matrix and its inverse is . 3. (iii)
The Hamiltonian matrix is block-diagonalized as
[TABLE]
The following lemma can be considered as dual version of Theorem 2 in [14, p.90], for which simplified proofs are given for the sake of self-containedness.
Lemma A.2**.**
* is nonsingular. If, in addition, is controllable, then (negative definite).*
Proof. Let . From (27) we have
[TABLE]
We show that the condition leads to a contradiction. It can be shown from (28) that satisfies , using and , showing that is -invariant. Thus, we may assume that is an eigenvector of with eigenvalue with . From (28b), we have and therefore . This shows that . With , the detectability of implies . This shows that with , which contradicts to Lemma A.1(ii). The second statement can also be proved in a similar way, deriving , for and a contradiction.
Lemma A.3**.**
Let
[TABLE]
where , , are matrix functions of . When is stabilizable and is detectable, is nonsingular for .
Proof. Using (27),
[TABLE]
where we have set . Since and
[TABLE]
by Lemma A.1(ii), for . If for some and , then we have and therefore . This implies , and we have .
Appendix B Existence of infinite horizon optimal control and stable manifold of Hamiltonian systems
This appendix introduces a result in [40] on the existence of infinite horizon optimal control. The main result in the paper is under simpler growth conditions than those given below, but is more restrictive to apply.
Let be an open set containing the origin. A nonlinear system (5) is said to be -exponentially stabilizable in if there exists a feedback control with such that the the closed loop system is exponentially stable with respect to . Let be a nonnegative function defined in . A system (5) with output is zero-state detectable for , or simply * is zero-state detectable for *, if the following holds. If a solution with satisfies for , then as .
For system (5), let with , , and rewrite it as
[TABLE]
where , , . Let be a cutoff function such that for and for . Define and .
Assumption 1**.**
- (i)
System (5) is -exponentially stabilizable in , where is an open set in containing the origin. 2. (ii)
For a nonnegative function , there exist positive constants , , such that for . 3. (iii)
The pair is zero-state detectable for an open set containing . 4. (iv)
For any , there exist constants , , , which may depend on , such that
[TABLE]
for sufficiently large . 5. (v)
There exist constants , and such that
[TABLE]
for all and sufficiently large .
Proposition B.1**.**
Under Assumption 1, for OPC (5) and
[TABLE]
there exists an optimal control for . Furthermore, for a Hamiltonian system associated with OCP (5)-(29), a stable manifold at the origin exists with the projection property .
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Brian D. O. Anderson and Peter V. Kokotovic. Optimal control problems over large time intervals. Automatica , 23(3):355–363, 1987.
- 2[2] Michael Athans and Peter L. Falb. Optimal Control: An Introduction to the Theory and Its Applications . Mc Grow-Hill, New York, 1966.
- 3[3] Julian Berberich, Johannes Köhler, Frank Allgöwer, and Mathias A Müller. Indefinite linear quadratic optimal control: Strict dissipativity and turnpike properties. IEEE Control Systems Letters , 2(3):399–404, 2018.
- 4[4] Christopher I. Byrnes and Albelto Isidori. Asymptotic stabilization of minimum phase nonlinear systems. IEEE Trans. Automat. Control , 36(10):1122–1137, 1991.
- 5[5] D. A. Carlson, A. Haurie, and A. Leizarowitz. Infinite Horizon Optimal Control . Springer-Verlag, Berlin Heidelberg, 2nd edition, 1991.
- 6[6] Jacky Cresson and Stephen Wiggins. A λ 𝜆 \lambda -lemma for normally hyperbolic invariant manifolds. Regular and Chaotic Dynamics , 20(1):94–108, 2015.
- 7[7] Tobias Damm, Lars Grüne, Marleen Stieler, and Karl Worthmann. An exponential turnpike theorem for dissipative discrete time optimal control problems. SIAM J. Control Optim. , 52(3):1935–1957, 2014.
- 8[8] Martin V. Day. On Lagrange manifolds and viscosity solutions. Journal of Mathematical Systems, Estimation and Control , 18:369–372, 1998. http://www.math.vt.edu/people/day/research/LMVS.pdf .
