The turnpike property in nonlinear optimal control -- A geometric   approach

Noboru Sakamoto; Enrique Zuazua

arXiv:1903.09069·math.OC·February 10, 2021·Autom.

The turnpike property in nonlinear optimal control -- A geometric approach

Noboru Sakamoto, Enrique Zuazua

PDF

Open Access

TL;DR

This paper introduces a geometric dynamical systems approach to analyze the turnpike property in nonlinear optimal control, providing new insights and simpler proofs for existing results.

Contribution

It develops a geometric framework to study the turnpike property, extending understanding to more general conditions and removing some restrictions on initial and target states.

Findings

01

Turnpike-like behavior appears in systems with hyperbolic equilibrium.

02

Sufficient conditions for the turnpike property are established.

03

Simpler proofs for existing turnpike results are provided.

Abstract

This paper presents, using dynamical system theory, a framework for investigating the turnpike property in nonlinear optimal control. First, it is shown that a turnpike-like property appears in general dynamical systems with hyperbolic equilibrium and then, apply it to optimal control problems to obtain sufficient conditions for the turnpike occurs. The approach taken is geometric and gives insights for the behaviors of controlled trajectories, allowing us to find simpler proofs for existing results on the turnpike properties. Attempts to remove smallness restrictions for initial and target states are also discussed based on the geometry of (un)stable manifold and exponential stabilizability of control systems.

Equations150

\overset{z}{˙} = f (z),

\overset{z}{˙} = f (z),

S

S

U

∥ g ∥_{r} := i max sup {∣ g (u) ∣, ∥ D g^{i} (u) ∥, \dots, ∥ D^{r} g^{i} (u) ∥ ∣ u \in B (1)},

∥ g ∥_{r} := i max sup {∣ g (u) ∣, ∥ D g^{i} (u) ∥, \dots, ∥ D^{r} g^{i} (u) ∥ ∣ u \in B (1)},

∣ φ (t, z_{0}) ∣

∣ φ (t, z_{0}) ∣

∣ φ (t, z_{1}) ∣

∣ φ (t, y) ∣ ⩽ K e^{- μ t} for t \in [0, T], y \in B (z_{0}, ρ),

∣ φ (t, y) ∣ ⩽ K e^{- μ t} for t \in [0, T], y \in B (z_{0}, ρ),

∣ φ (t, y) ∣ ⩽ K e^{μ t} for t \in [T, 0], y \in B (z_{1}, ρ) .

∣ φ (t, y) ∣ ⩽ K e^{μ t} for t \in [T, 0], y \in B (z_{1}, ρ) .

∣ φ (t, y_{0}) ∣ ⩽ K [e^{- μ t} + e^{- μ (T - t)}] for t \in [0, T] .

∣ φ (t, y_{0}) ∣ ⩽ K [e^{- μ t} + e^{- μ (T - t)}] for t \in [0, T] .

∣ φ (t, y_{0}) ∣ ⩽ K e^{- μ t} for 0 ⩽ t ⩽ T /2.

∣ φ (t, y_{0}) ∣ ⩽ K e^{- μ t} for 0 ⩽ t ⩽ T /2.

∣ φ (t, y_{1}) ∣ ⩽ K e^{μ t} for - T /2 ⩽ t ⩽ 0.

∣ φ (t, y_{1}) ∣ ⩽ K e^{μ t} for - T /2 ⩽ t ⩽ 0.

∣ φ (t + T, y_{0}) ∣ ⩽ K e^{μ t} for - T /2 ⩽ t ⩽ 0,

∣ φ (t + T, y_{0}) ∣ ⩽ K e^{μ t} for - T /2 ⩽ t ⩽ 0,

∣ φ (t, y_{0}) ∣ ⩽ K e^{- μ (T - t)} for T /2 ⩽ t ⩽ T .

∣ φ (t, y_{0}) ∣ ⩽ K e^{- μ (T - t)} for T /2 ⩽ t ⩽ T .

\overset{x}{˙} = f (x) + g (x) u, x (t_{0}) = x_{0},

\overset{x}{˙} = f (x) + g (x) u, x (t_{0}) = x_{0},

J (u) = \int_{0}^{T} L (x (t), u (t)) d t

J (u) = \int_{0}^{T} L (x (t), u (t)) d t

∣ {t ⩾ 0 ∣ ∣ u_{T} (t) - \overset{u}{ˉ} ∣ + ∣ x_{T} (t, x_{0}) - \overset{x}{ˉ} ∣ > ε} ∣ < η_{ε}

∣ {t ⩾ 0 ∣ ∣ u_{T} (t) - \overset{u}{ˉ} ∣ + ∣ x_{T} (t, x_{0}) - \overset{x}{ˉ} ∣ > ε} ∣ < η_{ε}

∣ u_{T} (t) - \overset{u}{ˉ} ∣ + ∣ x_{T} (t, x_{0}) - \overset{x}{ˉ} ∣ ⩽ K [e^{- μ t} + e^{- μ (T - t)}]

∣ u_{T} (t) - \overset{u}{ˉ} ∣ + ∣ x_{T} (t, x_{0}) - \overset{x}{ˉ} ∣ ⩽ K [e^{- μ t} + e^{- μ (T - t)}]

J_{1} (u) = \frac{1}{2} \int_{0}^{T} ∣ C x (t) - z ∣^{2} + ∣ u (t) ∣^{2} d t,

J_{1} (u) = \frac{1}{2} \int_{0}^{T} ∣ C x (t) - z ∣^{2} + ∣ u (t) ∣^{2} d t,

\displaystyle(\text{OCP}_{1})_{T}:\ \

\displaystyle(\text{OCP}_{1})_{T}:\ \

J_{1} (u) along (\ref e q n : n sy s_{g} e n er a l) is minimized over all

u \in L^{\infty} (0, T, R^{m}) .

(SOP) :

(SOP) :

over all (x, u) \in R^{n} \times R^{m} such that

f (x) + g (x) u = 0.

\displaystyle\begin{multlined}V_{t}(t,x)+V_{x}(t,x)f(x)\\ -\frac{1}{2}V_{x}(t,x)g(x)g(x)^{\top}V_{x}(t,x)^{\top}+\frac{1}{2}|Cx-z|^{2}=0,\end{multlined}V_{t}(t,x)+V_{x}(t,x)f(x)\\ -\frac{1}{2}V_{x}(t,x)g(x)g(x)^{\top}V_{x}(t,x)^{\top}+\frac{1}{2}|Cx-z|^{2}=0,

\displaystyle\begin{multlined}V_{t}(t,x)+V_{x}(t,x)f(x)\\ -\frac{1}{2}V_{x}(t,x)g(x)g(x)^{\top}V_{x}(t,x)^{\top}+\frac{1}{2}|Cx-z|^{2}=0,\end{multlined}V_{t}(t,x)+V_{x}(t,x)f(x)\\ -\frac{1}{2}V_{x}(t,x)g(x)g(x)^{\top}V_{x}(t,x)^{\top}+\frac{1}{2}|Cx-z|^{2}=0,

V_{x} (T, x) = 0,

H (x, p) = p^{⊤} f (x) - \frac{1}{2} p^{⊤} g (x) g (x)^{⊤} p + \frac{1}{2} ∣ C x - z ∣^{2},

H (x, p) = p^{⊤} f (x) - \frac{1}{2} p^{⊤} g (x) g (x)^{⊤} p + \frac{1}{2} ∣ C x - z ∣^{2},

\overset{x}{˙}_{i} = \frac{\partial H}{\partial p _{i}}, \overset{p}{˙}_{i} = - \frac{\partial H}{\partial x _{i}}, i = 1, \dots, n

\overset{x}{˙}_{i} = \frac{\partial H}{\partial p _{i}}, \overset{p}{˙}_{i} = - \frac{\partial H}{\partial x _{i}}, i = 1, \dots, n

S_{z} = \tilde{S} + {(\overset{x}{ˉ}, \overset{p}{ˉ})}, U_{z} = \tilde{U} + {(\overset{x}{ˉ}, \overset{p}{ˉ})} .

S_{z} = \tilde{S} + {(\overset{x}{ˉ}, \overset{p}{ˉ})}, U_{z} = \tilde{U} + {(\overset{x}{ˉ}, \overset{p}{ˉ})} .

\frac{d}{d t} [\tilde{x} \tilde{p}] = [A_{z} - C^{⊤} C - B_{z} B_{z}^{⊤} - A_{z}^{⊤}] [\tilde{x} \tilde{p}] + o (∣ \tilde{x} ∣ + ∣ \tilde{p} ∣) .

\frac{d}{d t} [\tilde{x} \tilde{p}] = [A_{z} - C^{⊤} C - B_{z} B_{z}^{⊤} - A_{z}^{⊤}] [\tilde{x} \tilde{p}] + o (∣ \tilde{x} ∣ + ∣ \tilde{p} ∣) .

det D_{x_{0}} x_{T} (t, x_{0}) \neq = 0 for t \in [0, T],

det D_{x_{0}} x_{T} (t, x_{0}) \neq = 0 for t \in [0, T],

u_{T} (t) := - g (x_{T} (t, x_{0}))^{⊤} p_{T} (t, x_{0})

u_{T} (t) := - g (x_{T} (t, x_{0}))^{⊤} p_{T} (t, x_{0})

φ (T, (x_{0}, p_{0})) = (x_{1}^{'}, 0),

φ (T, (x_{0}, p_{0})) = (x_{1}^{'}, 0),

∣ x_{T} (t, x_{0}) - \overset{x}{ˉ} ∣ + ∣ p_{T} (t, x_{0}) - \overset{p}{ˉ} ∣ ⩽ K^{'} [e^{- μ t} + e^{- μ (T - t)}] for 0 ⩽ t ⩽ T .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsControl and Stability of Dynamical Systems · Advanced Differential Equations and Dynamical Systems · Control and Dynamics of Mobile Robots

Full text

The turnpike property in nonlinear optimal control — A geometric approach

Noboru Sakamoto [email protected]

Enrique Zuazua [email protected] Faculty of Science and Engineering, Nanzan University, Yamazato-cho 18, Showa-ku, Nagoya, 464-8673, Japan

Chair in Applied Analysis, Alexander von Humboldt-Professorship, Department of Mathematics, Friedrich-Alexander-Universität Erlangen-Nürnberg 91058 Erlangen, Germany

Departamento de Matemáticas, Universidad Autónoma de Madrid, 28049 Madrid, Spain

Chair of Computational Mathematics, Fundación Deusto, Avda Universidades, 24, 48007, Bilbao, Basque Country, Spain

Abstract

This paper presents, using dynamical system theory, a framework for investigating the turnpike property in nonlinear optimal control. First, it is shown that a turnpike-like property appears in general dynamical systems with hyperbolic equilibrium and then, apply it to optimal control problems to obtain sufficient conditions for the turnpike occurs. The approach taken is geometric and gives insights for the behaviors of controlled trajectories, allowing us to find simpler proofs for existing results on the turnpike properties. Attempts to remove smallness restrictions for initial and target states are also discussed based on the geometry of (un)stable manifold and exponential stabilizability of control systems.

keywords:

Optimal control; Nonlinear system; Turnpike.

\AtAppendix\AtAppendix\AtAppendix

††thanks: This paper was not presented at any IFAC meeting. Corresponding author: Noboru Sakamoto.

a: This work was partially funded by by JSPS KAKENHI Grant Numbers JP26289128, JP19K04446 and by Nanzan University Pache Research Subsidy I-A-2 for 2019 academic year.

b: Supported, in part, by the Alexander von Humboldt-Professorship program, by the European Research Council (ERC) under the European Union’s Horizon 2020 research and innovation programme (grant agreement No. 694126-DyCon), by grant MTM2017-92996 of MINECO (Spain), by ELKARTEK project KK-2018/00083 ROAD2DC of the Basque Government, by ICON of the French ANR and Nonlocal PDEs: Analysis, Control and Beyond, and by AFOSR Grant FA9550-18-1-0242.

,

1 Introduction

The turnpike property was first recognized in the context of optimal growth by economists (see, e.g., [28]). The turnpike theorems say that for a long-run growth, regardless of starting and ending points, it will pay to get into a growth phase, called von Neumann path, in the most of intermediate stages. It is exactly like a turnpike and a network of minor roads; "if origin and destination are far enough apart, it will always pay to get on the turnpike and cover distance at the best rate of travel …" (quoted from [9]).

In control theory, independently of the turnpike theorems in econometrics, this property was investigated as dichotomy in linear optimal control [47, 38] and later extended to nonlinear systems [1]. In optimal control, the turnpike property essentially means that the solution of an optimal control problem is determined by the system and cost function and independent of time intervals, initial and terminal conditions except in the thin layers at the both ends of the time interval (see, e.g., [5, 50, 37]). In the last decades, much progress has been made in the theory of turnpike for finite or infinite dimensional and linear or nonlinear control systems. In [35], the authors study the turnpike for linear finite and infinite dimensional systems and derive a simple but meaningful inequality, for which we term turnpike inequality. Their works are extended to finite-dimensional nonlinear systems [45], the semi-linear heat equation [36], the wave equation [20, 51], periodic turnpike for systems in Hilbert spaces[44], optimal shape design [24], optimal boundary control for hyperbolic systems [19] and general evolution equations [18]. The turnpike property draws attentions of system theory researchers from the viewpoints of model predictive control [15, 10], dissipative systems (see, e.g., [48, 49]) [3, 7, 12, 16, 17], mixed-integer optimal control [13], mechanical systems [11] and the maximum hands-off control [29, 41].

In this paper, we first show that turnpike-like behaviors naturally appear in general dynamical systems with hyperbolic equilibrium. The main technique we use is the $\lambda$ -lemma which describes trajectory behaviors near invariant manifolds such as stable and unstable manifolds. That the turnpike-like inequality holds implies that if one fixes two ends of a trajectory close to stable and unstable manifolds and designates the time duration sufficiently large, then the trajectory necessarily converges to these manifolds to spend the most of time near the equilibrium. This is exactly the turnpike property. It should be noted that the two ends, as long as they are close to the manifold, do not need to lie in the vicinity of the equilibrium, from which it may be possible to remove locality restrictions in the research of turnpike.

We apply this inequality to a class of optimal control problems in which terminal states are not specified and the steady optimal solutions are not the origin as in [35, 36, 51, 44] as well as to a class of optimal control problems in which two terminal states are specified and the steady optimal point is the origin as in [47, 1, 45]. For both classes of problems, we employ a Dynamic Programming approach with Hamilton-Jacobi equations (HJEs). The characteristic equations for HJEs are Hamiltonian systems and the stabilizability (controllability for the second class) and detectability conditions assure that the equilibrium of the Hamiltonian systems is hyperbolic. The controlled trajectories appear in these Hamiltonian systems and we apply the turnpike result for dynamical system. Then, the existence of the trajectory satisfying initial and boundary conditions is guaranteed. In this paper, we derive sufficient conditions for optimality by using the Dynamic Programming and HJEs and by imposing a condition that guarantees the existence of the solution to the HJEs (Lagrangian submanifold property, see, e.g., [26]).

The present manuscript expands upon our conference contribution [42] incorporating a new result on the relationship between nonlinear stabilizability and existence of infinite horizon optimal control [40]. It allows one to give an estimate of the existence region of a stable manifold of hyperbolic Hamiltonian system associated with an optimal control problem, from which one may be able to predict the occurrence of turnpike (see Section 4.2). The manuscript also contains a number of examples worked out to show how the proposed geometric approach is effectively applied for turnpike analysis and appendices for necessary results in the theory of algebraic Riccati equations and for stable manifold estimate in Hamiltonian systems.

The organization of the paper is as follows. In Section 2 we review some of key tools from dynamical system theory and derive the turnpike inequality. In Section 3, we apply it to optimal control problems. Section 3.1 handles the problem where boundary state is free and Section 3.2 handles the problem where initial and boundary states are fixed. Section 4 shows turnpike analyses for a class of nonlinear systems for which target $z$ in $(\mathrm{OCP}_{1})$ can be taken arbitrarily large and a class of nonlinear systems for which initial states can be taken arbitrarily large. Section 5 discusses possible extensions for more general turnpike using the geometric approach.

2 Turnpike in dynamical systems

Let us consider a nonlinear dynamical system of the form

[TABLE]

where $f:\mathbb{R}^{N}\to\mathbb{R}^{N}$ is of $C^{r}$ class ( $r\geqslant 1$ ). We assume that $f(0)=0$ and the hyperbolicity of $f$ at [math], namely, assume that $Df(0)\in\mathbb{R}^{N\times N}$ has $k$ eigenvalues with strictly negative real parts and $N-k$ eigenvalues with strictly positive real parts.

It is known, as the stable manifold theorem, that there exist $C^{r}$ manifolds $S$ and $U$ , called stable manifold and unstable manifold of (1) at [math], respectively, defined by

[TABLE]

where $\varphi(t,z)$ is the solution of (1) starting $z$ at $t=0$ . Let $E^{s}$ , $E^{u}$ be stable and unstable subspaces in $\mathbb{R}^{N}$ of $Df(0)$ with dimension $k$ , $N-k$ , respectively. It is known that $S$ , $U$ are invariant under the flow of $f$ and are tangent to $E^{s}$ , $E^{u}$ , respectively, at $z=0$ . See, e.g., [21, 31] for more details on the theory of stable manifold.

We will consider limiting behavior of submanifolds under the flow of $f$ and need to introduce topology for maps and manifolds. Let $M$ be a compact manifold of dimension $m$ and the space $C^{r}(M,\mathbb{R}^{l})$ of $C^{r}$ maps, $0\leqslant r<\infty$ , defined on $M$ . There exists a natural vector space structure on $C^{r}(M,\mathbb{R}^{l})$ . Since $M$ is compact, we take a finite cover of $M$ by open sets $V_{1},\ldots,V_{k}$ and take a local chart $(z_{i},U_{i})$ for $M$ with $z_{i}(U_{i})=B(2)$ such that $z_{i}(V_{i})=B(1)$ , $i=1,\ldots,k$ , where $B(1)$ and $B(2)$ are the balls of radius 1 and 2 at the origin of $\mathbb{R}^{m}$ . For a map $g\in C^{r}(M,\mathbb{R}^{l})$ , we define a norm by

[TABLE]

where $g^{i}=g\circ{z_{i}}^{-1}$ , local representation of $g$ , and $\|\cdot\|$ is a norm for linear maps. It is known that $\|\cdot\|_{r}$ does not depend on the choice of finite cover and we call it $C^{r}$ norm. For maps in $C^{r}(M,N)$ where $N$ is a manifold, we embed $N$ in a Euclidean space with sufficiently high dimension. Let $L$ , $L^{\prime}$ be $C^{r}$ submanifolds of $M$ and let $\varepsilon>0$ . We say that $L$ * and $L^{\prime}$ are $\varepsilon$ $C^{r}$ -close* if there exists a $C^{r}$ diffeomorphism $h:L\to L^{\prime}$ such that $\|i^{\prime}\circ h-i\|_{r}<\varepsilon$ , where $i:L\to M$ and $i^{\prime}:L^{\prime}\to M$ are inclusion maps. In this case, we use the notation $d_{L}^{r}(L^{\prime}):=\|i^{\prime}\circ h-i\|_{r}$ .

By a $k$ -dimensional (topological) disc we mean a set that is homeomorphic to $D^{k}:=\{(x_{1},\ldots,x_{k})\in\mathbb{R}^{k}\,|\,x_{1}^{2}+\cdots+x_{k}^{2}\leqslant 1\}$ . The following lemma is known as the $\lambda$ -lemma or inclination lemma and plays a crucial role in the theory of dynamical systems (see [31, 46]).

Lemma 1 (The $\lambda$ -lemma).

Suppose that $z=0$ is a hyperbolic equilibrium for (1). Suppose also that $S$ and $U$ are $k$ , $(N-k)$ -dimensional stable and unstable manifolds of $f$ at [math], respectively. For any $(N-k)$ -dimensional disc $B$ in $U$ , any point $z\in S$ , any $(N-k)$ -dimensional disc $D$ transversal to $S$ at $z$ and any $\varepsilon>0$ , there exists a $T>0$ such that if $t>T$ , $\varphi(t,D)$ contains an $(N-k)$ -dimensional disc $\tilde{D}$ with $d_{B}^{1}(\tilde{D})<\varepsilon$ .

Next, we show that the turnpike behavior appears in the transition of points near the stable manifold to points near the unstable manifold if the transition duration is designated large. Let $z_{0}\in S$ and $z_{1}\in U$ be given points. From the stable manifold theorem, it holds that

[TABLE]

where $K>0$ is a constant dependent on $z_{0}$ and $z_{1}$ and $\mu>0$ is a constant independent of $z_{0}$ and $z_{1}$ .

Proposition 2.

(i)

There exists a $T_{0}>0$ such that for every $T>T_{0}$ there exists a $\rho=\rho(T)>0$ such that

[TABLE]

where $B(z_{0},\rho)$ is the $N$ -dimensional open ball centered at $z_{0}$ with radius $\rho$ . Moreover, $\rho\to 0$ when $T\to\infty$ . 2. (ii)

There exist a $T_{0}<0$ such that for every $T<T_{0}$ there exists a $\rho=\rho(T)>0$ such that

[TABLE]

Moreover, $\rho\to 0$ when $T\to-\infty$ . 3. (iii)

For any $(N-k)$ -dimensional disc $\bar{D}$ transversal to $S$ at $z_{0}$ and any $k$ -dimensional disc $\bar{E}$ transversal to $U$ at $z_{1}$ , there exists a $T_{0}>0$ such that for any $T>T_{0}$ there exist an $(N-k)$ -dimensional disc $D\subset\bar{D}$ transversal to $S$ at $z_{0}$ and a $k$ -dimensional disc $E\subset\bar{E}$ transversal to $U$ at $z_{1}$ such that $\varphi(T,D)$ intersects $\varphi(-T,E)$ at a single point.

Proof. (i), (ii) These are consequences of the properties (2) of (un)stable manifold, which can be derived using contradiction arguments. The proofs of the convergence $\rho\to 0$ as $T\to\pm\infty$ also use contradictions to the facts that $S$ , $U$ are submanifolds with strictly lower dimension than $N$ .

(iii) First we take an $(N-k)$ -dimensional disc $U_{0}$ in $U$ passing through 0, a $k$ -dimensional disc $S_{0}$ in $S$ passing through 0 and an $\varepsilon>0$ arbitrarily. From the $\lambda$ -lemma, there exists a $T_{0}>0$ such that for any $T>T_{0}$ there exists an $(N-k)$ -dimensional disc $D\subset\bar{D}$ transversal to $S$ at $z_{0}$ and a $k$ -dimensional disc $E\subset\bar{E}$ transversal to $U$ at $z_{1}$ such that $d_{U_{0}}^{1}(\varphi(T,D))<\varepsilon$ , $d_{S_{0}}^{1}(\varphi(-T,E))<\varepsilon$ . Since $E^{s}\cap E^{u}=\{0\}$ , it is possible to take $\varepsilon$ , $S_{0}$ and $U_{0}$ so that $\varphi(T,D)$ intersects $\varphi(-T,E)$ at a single point. $\hfill\blacksquare$

Remark 3.

It should be noted that the above statements, especially (i) and (ii), are only on finite interval $[0,T]$ . This is the major difference from the trajectories on the stable and unstable manifolds.

Theorem 4.

For any $z_{0}\in S$ , any $z_{1}\in U$ , any $(N-k)$ -dimensional disc $\bar{D}$ transversal to $S$ at $z_{0}$ and any $k$ -dimensional disc $\bar{E}$ transversal to $U$ at $z_{1}$ , there exists a $T_{0}>0$ such that for every $T>T_{0}$ there exist $\rho=\rho(T)>0$ , $y_{0}\in B(z_{0},\rho)\cap\bar{D}$ and $y_{1}\in B(z_{1},\rho)\cap\bar{E}$ such that $\varphi(T,y_{0})=y_{1}$ and

[TABLE]

Moreover, $\rho\to 0$ when $T\to\infty$ .

Proof. Take the largest $T_{0}$ and the smallest $\rho$ in Proposition 2. We rename this $T_{0}$ as $T_{0}/2$ . Take arbitrary $T>T_{0}$ and use Proposition 2-(iii) to get a disc $D$ which is $(N-k)$ -dimensional and transversal to $S$ at $z_{0}$ and a disc $E$ which is $k$ -dimensional and transversal to $U$ at $z_{1}$ satisfying $D\subset B(z_{0},\rho)$ and $E\subset B(z_{1},\rho)$ . This is possible by taking smaller $S_{0}$ and $U_{0}$ in the proof of Proposition 2-(iii). Then, there exists a single point $\zeta$ such that $\varphi(T/2,D)\cap\varphi(-T/2,E)=\{\zeta\}$ (see Fig 1). Let $y_{0}:=\varphi(-T/2,\zeta)$ . Then, $y_{0}\in D\subset B(z_{0},\rho)$ and by Proposition 2-(i), we have

[TABLE]

Let $y_{1}:=\varphi(T/2,z)$ . Then, $y_{1}\in E\subset B(x_{1},\rho)$ and

[TABLE]

This shows that

[TABLE]

and consequently,

[TABLE]

Combining (3) and (4), we get the inequality in the theorem. The last assertion follows from Proposition 2-(i) and (ii). $\hfill\blacksquare$

3 Turnpike in nonlinear optimal control

Let us consider a nonlinear control system

[TABLE]

where $f:\mathbb{R}^{n}\to\mathbb{R}^{n}$ , $g:\mathbb{R}^{n}\to\mathbb{R}^{n\times m}$ are of $C^{2}$ class with $f(0)=0$ , $x(t)\in\mathbb{R}^{n}$ is state variables and $u(t)\in\mathbb{R}^{m}$ is control input. An optimal control problem or OCP is to find a control input for (5) such that the cost functional

[TABLE]

is minimized, where we set $J(u)=+\infty$ when the existence domain of solution for (5) is strictly contained in $[0,T)$ . There are several types in OCPs depending on whether or not the terminal time $T$ is specified and whether or not the state variables are specified at the terminal time. In this paper, we consider OCPs where the terminal time $T$ is specified and two types of OCPs; one in which the state variables are free at $t=T$ and another in which they are fixed at $t=T$ . For both types of OCPs, we are interested in the relationship between the solution $u_{T}$ and corresponding trajectory $x_{T}$ of an OCP and steady-state optimum pair $(\bar{u},\bar{x})$ , which will be defined more precisely later on.

Definition 5.

[5]** An optimal pair $(u_{T},x_{T})$ has the turnpike property if for any $\varepsilon>0$ , there exists an $\eta_{\varepsilon}>0$ such that

[TABLE]

for all $T>0$ , where $\eta_{\varepsilon}$ depends only on $\varepsilon$ , $f$ , $g$ , $x_{0}$ , and $L$ and $|\cdot|$ denotes length (Lebesgue measure) of interval.

Remark 6.

Turnpike inequality is to require $x_{T}$ and $u_{T}$ to satisfy

[TABLE]

for some constants $K>0$ and $\mu>0$ independent of $T$ , which is a sufficient condition for the turnpike property in Definition 5. Also, it should be noted that requiring (6) limits ourselves to the exponential input-state turnpike defined in [17]. **

3.1 The OCP with state variables unspecified at the terminal time

For system (5), we consider the following cost functional

[TABLE]

where $C\in\mathbb{R}^{r\times n}$ and $z\in\mathbb{R}^{r}$ is a given vector (target). We call this problem $(\text{OCP}_{1})_{T}$ ;

[TABLE]

Associated with $(\text{OCP}_{1})_{T}$ , we consider a steady optimization problem

[TABLE]

We assume the following.

Assumption 7.

(SOP) has a solution $(\bar{x},\bar{u})=(\bar{x}(z),\bar{u}(z))$ .

Also, associated with $(\text{OCP}_{1})_{T}$ , we can derive a Hamilton-Jacobi equation

[TABLE]

for $V(t,x)$ , where $V_{t}=D_{t}V$ , $V_{x}=D_{x}V$ . Defining a Hamiltonian

[TABLE]

we consider the corresponding characteristic equation for (9)-(10)

[TABLE]

with $\ p_{i}(T)=0$ , $i=1,\ldots,n.$ Note that since the system (5) is time-invariant, the equation corresponding to $V_{t}$ is not necessary. The following fact is readily verified.

Fact. A solution $(\bar{x},\bar{u})$ of (SOP) corresponds to an equilibrium point $(\bar{x},\bar{p})$ of (11) with $\bar{u}=-g(\bar{x})^{\top}\bar{p}$ .

Let $A_{z}=D_{x}D_{p}H(\bar{x},\bar{p})$ , $B_{z}=g(\bar{x})$ .

Assumption 8.

The triplet $(C,A_{z},B_{z})$ is stabilizable and detectable.

Under Assumptions 7, 8, the equilibrium $(\bar{x},\bar{p})$ is hyperbolic equilibrium for the Hamiltonian system (11) and there exist stable and unstable manifolds for (11) at $(\bar{x},\bar{p})$ which are expressed as

[TABLE]

Here, $\tilde{S}$ , $\tilde{U}$ are the stable and unstable manifold of (11) in the coordinates $(\tilde{x},\tilde{p})$ , where $\tilde{x}=x-\bar{x}$ , $\tilde{p}=p-\bar{p}$ , which is re-written as

[TABLE]

We can now state the main theorem of this subsection. Let $\pi_{1}:(x,p)\mapsto x$ , $\pi_{2}:(x,p)\mapsto p$ be canonical projections.

Theorem 9.

Under Assumptions 7, 8, suppose that $x_{0}\in\mathrm{Int}(\pi_{1}(S_{z}))$ , where $\mathrm{Int}(\cdot)$ is the interior of a set in $\mathbb{R}^{n}$ , and that $U_{z}$ intersects $p=0$ transversally. If $T$ is taken sufficiently large, then there exists a solution $(x_{T}(t,x_{0}),p_{T}(t,x_{0}))$ to (11) satisfying $x_{T}(0,x_{0})=x_{0}$ and $p_{T}(T,x_{0})=0$ . If, moreover,

[TABLE]

then

[TABLE]

is the local optimal solution for $(\mathrm{OCP}_{1})_{T}$ and turnpike inequality (6) holds for some constants $K>0$ and $\mu>0$ which are independent of $T$ .

Proof. Let $\zeta_{0}=(x_{0},0)$ and $\zeta_{1}=(x_{1},0)$ , where $(x_{1},0)\in U_{z}$ and take $n$ -dimensional discs $\{(x_{0},p)\,|\,|p|<\rho\}$ , $\{(x,0)\,|\,|x-x_{1}|<\rho\}$ , which correspond to $z_{0}$ , $z_{1}$ , $B(z_{0},\rho)\cap\bar{D}$ and $B(z_{1},\rho)\cap\bar{E}$ in Theorem 4, respectively. Then the theorem implies that for a sufficiently large $T>0$ , there exist $\rho=\rho(T)>0$ , $p_{0}$ and $x_{1}^{\prime}$ with $|p_{0}|<\rho$ , $|x_{1}^{\prime}-x_{1}|<\rho$ such that

[TABLE]

where $\varphi(t,(x_{0},p_{0}))$ denotes the solution of (11) starting from $(x_{0},p_{0})$ . This shows that the two-point boundary value problem associated with $(\mathrm{OCP}_{1})_{T}$ has been solved. Let $(x_{T}(t,x_{0}),p_{T}(t,x_{0}))=\varphi(t,(x_{0},p_{0}))$ . Then, the theorem also says that there exist $K^{\prime}>0$ and $\mu>0$ such that

[TABLE]

Since $|u_{T}(t)-\bar{u}|\leqslant\sup\|g(x)\||p(t,x_{0})-\bar{p}|$ , (6) holds with $K=2K^{\prime}(1+\sup\|g(x)\|)$ , where supremum is taken along the trajectory. The condition (13) guarantees that there exists a Lagrangian submanifold in a neighborhood of this trajectory and this implies the existence of solution $V(t,x)$ to (9)-(10) in the neighborhood. Then, the verification theorem in Dynamic Programming (see, e.g., [2]) shows that the control $u^{\ast}$ is locally optimal. $\hfill\blacksquare$

Remark 10.

The condition (13) guarantees that the solution $V$ to (9) exists in a neighborhood of the trajectory $(x_{T}(t,x_{0}),p_{T}(t,x_{0}))$ , $0\leqslant t\leqslant T$ . The optimality of $u_{T}$ is valid only in the neighborhood. This existence theory is described using the notion of Lagrangian submanifold (see, e.g., [26]) and when one seeks for larger domain of existence, the non-uniqueness issue of solution arises. We refer to [8] for general analysis of non-unique solutions and [30, 22, 23] for non-unique optimal controls for mechanical systems. **

We next show that for small $x_{0}$ , $z$ , the solution for $(\mathrm{OCP}_{1})_{T}$ has a solution with turnpike property using perturbation theory of stable manifold. Let $A=f(0)$ , $B=g(0)$ .

Assumption 11.

The triplet $(C,A,B)$ is stabilzable and detectable.

Fact. Under Assumption 11, there is a neighborhood of $z=0$ in $\mathbb{R}^{r}$ such that (SOP) has a unique solution for $z$ in the neighborhood and $(C,A_{z},B_{z})$ is stabilzable and detectable.

Corollary 12.

Under Assumption 11, for sufficiently small $x_{0}$ and $z$ and for sufficiently large $T$ , $(\mathrm{OCP}_{1})_{T}$ has a solution with the turnpike property.

Proof. From the Fact above, under Assumption 11, the Hamiltonian system (11) has stable manifold $S_{z}$ and unstable manifold $U_{z}$ at $(\bar{x},\bar{p})$ . For $z=0$ the linear part of the Hamiltonian system is $\mathrm{Ham}=\left[\begin{smallmatrix}A&-BB^{\top}\\ -C^{\top}C&-A^{\top}\end{smallmatrix}\right]$ , for which we apply the eigen structure analysis in Appendix A. Apply Lemma A.1 with $R=BB^{\top}$ , $Q=C^{\top}C$ and let $P$ and $L$ as in the Appendix. Then, the tangent spaces $T_{0}S_{0}$ , $T_{0}U_{0}$ of $S_{0}$ , $U_{0}$ at the origin can be written as

[TABLE]

From the expression of $T_{0}S_{0}$ , one can take $x_{0}$ sufficiently small so that there is an $n$ -dimensional disc $D_{0}$ in $S_{0}$ that contains the origin and $x_{0}$ in its interior. From Lemma A.2, $PL+I$ is nonsingular and therefore, $T_{0}U_{0}$ intersects $p=0$ transversally, which implies that there is an $n$ -dimensional disc $E_{0}$ in $U_{0}$ that intersects $p=0$ transversally. Let $X_{H}(x,p;z)$ be the Hamiltonian vector field (11). As $z\to 0$ , $X_{H}(x,p,z)$ can be arbitrarily close to $X_{H}(x,p;0)$ with $C^{1}$ topology in an appropriate compact set. The stable manifold theory (see, e.g., [31, Theorem 6.2]) ensures that there exists a small $z$ so that there are $n$ -dimensional discs $D_{z}\subset S_{z}$ , $E_{z}\subset U_{z}$ that are close enough to $D_{0}$ , $E_{0}$ , respectively, with $C^{1}$ -topology. For this $z$ , it holds that $x_{0}\in\mathrm{Int}(\pi_{1}(D_{z}))$ and $E_{z}$ intersects $p=0$ transversally. Now, all the hypotheses in Theorem 9 are satisfied. $\hfill\blacksquare$

Next Corollary is proved in [35, 44] in the study of the turnpike property for infinite dimensional systems under slightly more restrictive conditions (controllability and observability rather than stabilizability and detectability). Their proofs are based on the estimates on adjoint variables in the linear Hamiltonian system (11) which is derived as a necessary condition of optimality. Here, we give an alternative proof using the geometric picture in Theorem 9.

Corollary 13.

Suppose that the system (5) is linear, that is, $f(x)=Ax$ and $g(x)=B$ with real constant matrices $A\in\mathbb{R}^{n\times n}$ and $B\in\mathbb{R}^{n\times m}$ . Under Assumption 11, $(\mathrm{OCP}_{1})_{T}$ has the global solution $u^{\ast}(t)$ , $0\leqslant t\leqslant T$ for any $z\in\mathbb{R}^{r}$ . Moreover, turnpike inequality (6) holds.

Proof. We use some of the notations in the proof of Corollary 12. The unique solution $(\bar{x},\bar{p})$ to (SOC) is expressed as $\left[\begin{smallmatrix}\bar{x}\\ \bar{p}\end{smallmatrix}\right]=-\mathrm{Ham}^{-1}\left[\begin{smallmatrix}0\\ C^{\top}z\end{smallmatrix}\right]$ . $U_{z}$ and $S_{z}$ in (12) can be written as

[TABLE]

It is readily seen that $x_{0}\in\mathrm{Int}(\pi_{1}(S))$ for any $x_{0}\in\mathbb{R}^{n}$ and $U$ intersects $p=0$ transversally for any $z\in\mathbb{R}^{r}$ . The condition (13) is equivalent to the nonsingularity of (1,1)-block in $\exp[t\mathrm{Ham}]$ , which is proved in Lemma A.3. $\hfill\blacksquare$

Remark 14.

Although the problem in Corollary 13 is linear, it is not an easy task to explicitly write down the solution for (9)-(10) except for $z=0$ . This corollary, however, says that the solution globally exists. 2. 2.

As is discussed in [36, 45, 33], relaxing the smallness conditions in Corollary 12 is one of major challenges in the research of nonlinear turnpike. In § 4.1, we show a class of nonlinear OCPs for which turnpike occurs for all $z$ by explicitly analyzing unstable manifold.

3.2 The OCP with state variables specified at the terminal time

In this subsection, we consider an OCP for (5) with arbitrarily specified terminal states. Let $x_{f}\in\mathbb{R}^{n}$ be given. Let us define cost functional

[TABLE]

and consider

[TABLE]

With Assumption 11, the corresponding steady optimization problem has a unique solution $(\bar{x},\bar{u})=(0,0)$ around the origin. The Hamilton-Jacobi equation associated with $(\mathrm{OCP}_{2})_{T}$ is

[TABLE]

The Hamiltonian in this case is

[TABLE]

and the corresponding characteristic equation for (16) is

[TABLE]

with $x(0)=x_{0}$ and $x(T)=x_{f}$ .

Under Assumptions 11, the Hamiltonian system (17) can be written as

[TABLE]

and the origin is a hyperbolic equilibrium with $n$ stable and $n$ unstable eigenvalues. Let $S$ and $U$ be the stable and unstable manifolds of (17) at the origin.

Theorem 15.

Under Assumption 11, suppose that $x_{0}\in\mathrm{Int}(\pi_{1}(S))$ and $x_{f}\in\mathrm{Int}(\pi_{1}(U))$ . If $T>0$ is taken sufficiently large, there exists a solution $(x_{T}(t,x_{0}),p_{T}(t,x_{0}))$ to (17) satisfying $x(0)=x_{0}$ and $x(T)=x_{1}$ . If, moreover,

[TABLE]

then

[TABLE]

is the local optimal solution for $(\mathrm{OCP}_{2})_{T}$ and turnpike inequality (6) hols for some $K>0$ , $\mu>0$ independent of $T$ .

Proof. Let $\zeta_{0}=(x_{0},0)$ , $\zeta_{1}=(x_{f},0)$ , $\{(x_{0},p)\,|\,|p|<\rho\}$ and $\{(x_{f},p)\,|\,|p|<\rho\}$ which correspond to $z_{0}$ , $z_{1}$ , $B(z_{0},\rho)\cap\bar{D}$ and $B(z_{1},\rho)\cap\bar{E}$ in Theorem 4, respectively. Then, for $T>0$ sufficiently large, there exist $\rho>0$ , $p_{0}$ and $p_{1}$ with $|p_{0}|,|p_{1}|<\rho$ such that a solution to (17) connecting $(x_{0},p_{0})$ and $(x_{f},p_{1})$ exists. The rest of the proof is almost the same as Theorem 9. $\hfill\blacksquare$

Corollary 16.

Let us additionally impose the controllability of $(A,B)$ in Assumption 11. Then, for sufficiently small $|x_{0}|$ and $|x_{f}|$ and sufficiently large $T$ , the local optimal control exists and turnpike inequality (6) holds.

Proof. We again employ the eigen structure analysis (27). The tangent spaces of $S$ and $U$ at the origin are written as

[TABLE]

The latter is obtained by showing, using the controllability of $(A,B)$ , that $L$ is strictly negative definite (Lemma A.2). Therefore, $x_{0}\in\mathrm{Int}(\pi_{1}(S))$ and $x_{f}\in\mathrm{Int}(\pi_{1}(U))$ for sufficiently small $|x_{0}|$ , $|x_{f}|$ . It is seen that the condition (18) holds for these $|x_{0}|$ , $|x_{f}|$ (making them smaller if necessary) from the analysis on $\Phi_{11}(t)$ in the proof of Theorem 9. $\hfill\blacksquare$

Remark 17.

(i)

The linear counterpart of Corollary 16 is in [47] where they use anti-stabilizing solution $P_{u}$ for the Riccati equation. In this case, the turnpike holds for all $x_{0}$ and $x_{f}$ . It can be shown that $P_{u}=(PL+I)L^{-1}$ . Note that in Corollary 16 we only need the detectability condition. Corollary 16 is obtained in [1] using Hamilton-Jacobi theory under unusual nonlinear controllability and observability conditions. Compared with their conditions, we use only the linear controllability and detectability which can be easily checked. The authors of [45] obtain similar results to Corollary 16 with more generalized terminal conditions. 2. (ii)

Corollary 16 states that the turnpike occurs for small initial and terminal states under linear stabilizability and detectability. Relaxing the smallness conditions is one of major challenges in ( $\mathrm{OCP}_{2}$ ). In § 4.2, we will give a class of nonlinear systems for which the turnpike occurs for all initial states. This is done with the aid of the result in [40] giving an estimates on the region for stable manifold in terms of nonlinear stabilizability. In the example in § 4.2, the unstable manifold is linear and a geometric condition in Theorem 15 is readily verified.

4 Examples

4.1 Problem $(\mathrm{OCP}_{1})$

In this subsection, we show a class of nonlinear systems where the turnpike occurs in $(\mathrm{OCP}_{1})$ for all target $z$ . Let us consider the following class of nonlinear control systems

[TABLE]

where $A_{1}$ is an $n_{1}\times n_{1}$ Hurwitz matrix, $A_{2}:\mathbb{R}^{n_{1}}\times\mathbb{R}^{n_{2}}\to\mathbb{R}^{n_{1}\times n_{1}}$ is a $C^{2}$ function and $u\in\mathbb{R}^{m}$ is a control input. Assume that $(A_{3},B_{2})$ is stabilizable and $A_{2}(0,x_{2})=0$ for all $x_{2}\in\mathbb{R}^{n_{2}}$ . The cost function is

[TABLE]

where $C_{1}$ , $C_{2}$ are constant matrices with appropriate dimensions, $(C_{2},A_{3})$ is detectable and $z_{1}\in\mathbb{R}^{r_{1}}$ , $z_{2}\in\mathbb{R}^{r_{2}}$ are given constant vectors.

The corresponding Hamiltonian system for this problem is

[TABLE]

Using the stabilizability and detctability of $(C_{3},A_{3},B_{2})$ , it can be seen that there is an equilibrium $(0,x_{20}(z_{2})$ , $p_{10}(z_{1}),p_{20}(z_{2}))$ for (21). At this equilibrium, the linearized matrix is

[TABLE]

where $\Gamma=\Gamma(z_{1},z_{2})$ is an $n_{1}\times n_{1}$ symmetric matrix and therefore, it can be seen that it is a hyperbolic equilibrium.

Let $P_{1}$ , $P_{3}$ and $S_{3}$ be solutions for

[TABLE]

with $P_{1}=P_{1}^{\top}$ , $P_{3}\geqslant 0$ , $S_{3}\geqslant 0$ and $A_{3}-B_{2}B_{2}^{\top}P_{3}$ being Hurwitz. Using a linear coordinate transformation (see Appendix A)

[TABLE]

the Hamiltonian system (21) is rewritten as

[TABLE]

where $\psi_{j}$ , $j=1,\ldots,4$ , are appropriately computed higher order terms. Since $\psi_{j}(0,0,x_{1}^{\prime},x_{2}^{\prime})$ =0, $j=1,\ldots,4$ , for all $x_{1}^{\prime}$ , $x_{2}^{\prime}$ , the unstable manifold $U$ at the equilibrium is the affine space $p_{1}^{\prime}=p_{2}^{\prime}=0$ , or

[TABLE]

Since $I+S_{3}P_{3}$ is nonsingular, which is shown using Lemma A.2 and Sylvester’s determinant identity, for any $z_{1}$ , $z_{2}$ , $U$ intersects $p_{1}=p_{2}=0$ transversally. Now, using Theorem 9, for any $z_{1}$ , $z_{2}$ , if the initial point $(x_{1}(0),x_{2}(0))$ is close enough to $(0,x_{20})$ , the optimal control for (19)-(20) possesses the turnpike property.

As an example of the class of systems, a turnpike trajectory for a nonlinear optimal control problem

[TABLE]

is depicted in Fig. 2, where a solution of (SOP) is $(0,z_{2},-z_{1},0)$ .

4.2 Problem $(\mathrm{OCP}_{2})$

Next, we show a class of nonlinear control systems for which estimates on (un)stable manifold of Hamiltonian systems obtained in [40] are effective for the prediction of turnpike.

Let us consider an $(n_{1}+n_{2})$ -dimensional system represented in Byrnes-Isidori normal form [4] for globally exponentially minimum phase nonlinear systems

[TABLE]

where $x\in\mathbb{R}^{n_{1}}$ and $q:\mathbb{R}^{n_{1}+1}\to\mathbb{R}^{n_{1}}$ is a smooth map with $q(0,0)=0$ . We assume that $\dot{x}=q(x,0)$ is globally exponentially stable. It is known that (23) is globally exponentially stabilizable via a smooth feedback. Therefore, representing $y=(y_{1},\ldots,y_{n_{2}})$ , for a cost functional

[TABLE]

the associated Hamiltonian system is hyperbolic at the origin if $C_{2}$ and the matrix defining $y$ -dynamics is a detectable pair. If, in addition, $|C_{1}x|^{2}$ and $q(x,y_{1})$ satisfy the growth condition in Proposition B.1-(iv) with respect to $x$ , the stable manifold $S$ of the Hamiltonian system satisfies $\pi_{1}(S)=\mathbb{R}^{n_{1}+n_{2}}$ . Therefore, from Corollary 16, the OCP has a solution for all $x_{0}$ and for sufficiently small $x_{f}$ that exhibits turnpike if the zero-state detectability condition is satisfied and $T$ is taken large enough.

As a numerical example, consider (22a), which is in Byrnes-Isidori normal form (see e.g., [4]), with

[TABLE]

Introducing a cut-off function on $x_{2}$ , the result in [40] is applied to confirm that the turnpike occurs for all initial condition $x_{0}=(x_{1}(0),x_{2}(0))$ and terminal states $x_{f}\in\pi_{1}(U)$ , where $U$ is the unstable manifold of the Hamiltonian system at the origin. Similarly to the previous subsection, $U$ is described as

[TABLE]

Figs. 3, 4 show the turnpike trajectory of the optimal control problem (22a)-(24) with $x_{0}=(12,12)$ , $x_{f}=(0,5)$ . In Fig. 4, $x_{1}(t)$ , $x_{2}(t)$ , $p_{1}(t)$ , $p_{2}(t)$ are depicted for $t\in[0,0.1]$ while the last figure shows $p_{2}(t)$ for $t\in[0.1,10]$ . From these figures, one sees that starting from $x_{0}=(12,12)$ at $t=0$ , the states and costates rapidly grow during the time span $[0,0.02]$ and go to the steady optimal (the origin) by the time $t=0.1$ and then, the states reach the destination $x_{f}=(0,5)$ at $t=10$ . The peak of this growth increases as $|x_{0}|$ increases. This growth of states is called "peaking phenomenon" of nonlinear stabilization [43] and it is interesting to see that peaking phenomenon appears in turnpike trajectory.

5 Discussions

The geometric approach proposed in the present paper may be applied to more general cases where turnpike phenomena need more sophisticated analyses. Here, we discuss two kinds of extensions.

5.1 Global analysis when (SOP) admits multiple solutions

When (SOP) admits multiple solutions, multiple equilibria appear in associated Hamiltonian systems. If they are all hyperbolic, the $\lambda$ -lemma still applies to draw pictures of flows around stable and unstable manifolds that are separatrices dividing the phase space (see, e.g., [31, p.87 Corollary 1]).

Let us consider $(\mathrm{OCP}_{2})_{T}$ for

[TABLE]

Associated Hamiltonian system has three equilibrium points; $(0,0)$ , $(1,0)$ and $(1/2,-1/4)$ , the first two of which are the global solution of (SOP) and hyperbolic. Fig. 5 shows stable and unstable manifolds, closed orbits around $(1/2,-1/4)$ and heteroclinic orbits connecting $(0,0)$ and $(0,1)$ .

From this figure and using the geometric method in the present paper, one immediately sees that for any initial point $x(0)$ and final point $x_{f}$ , solution for $(\mathrm{OCP}_{2})_{T}$ with large $T$ exists. For instance, trajectory in $x$ - $p$ space and corresponding optimal input are depicted in Fig. 6 for $x(0)=1.5$ , $x_{f}=-1$ . Although the input response looks like turnpike, the response of $x$ for $u\sim 0$ is not stationary but steady motion with nonzero velocity.

Fig. 7 shows optimal trajectory and control response for $x(0)=-0.1$ , $x_{f}=1.4$ . Nonzero control is necessary to drive $x$ against stable vector field. The ratio of the time duration for nonzero control for the overall horizon can be arbitrarily small as $T\to\infty$ and in this sense, this can be also considered turnpike phenomenon.

As for $(\mathrm{OCP}_{1})$ , when multiple global minimizers exist, an interesting question is raised in [32] as to which minimizer attracts turnpike for wider initial conditions. It is interesting to study how the geometry of these invariant manifolds affects turnpike occurrence and its strength in terms of the question.

5.2 Non-hyperbolic Hamiltonian systems

In [11], a concept of velocity turnpike or time-varying turnpike arising in mechanical systems is proposed combining trim primitives and turnpike properties. Motivated by that, the authors in [34] consider turnpike properties when detectability (ovservability) is not satisfied. A common feature in these cases is that associated Hamiltonian systems have zero eigenvalues. It is then interesting to consider the application of the $\lambda$ -lemma for normally hyperbolic invariant manifolds [6] combining the classification result on Hamiltonian and symplectic matrices [25].

6 Conclusions

In this paper, using techniques from dynamical system theory such as invariant manifolds and the $\lambda$ -lemma, we showed that turnpike-like behavior naturally appears in hyperbolic dynamical systems. This is then applied to analyze Hamiltonian systems describing controlled trajectories to obtain sufficient conditions for optimal controls yielding the turnpike to exist. The framework proposed in the paper is geometric and an alternative to existing ones. Using the framework, we showed classes of nonlinear systems for which target $z$ or initial states can be taken arbitrarily large.

Since our interests were to discover geometric nature in turnpike, we focused on OCPs without constraints and exponential turnpike. Future works include applications of this approach to more specific problems and considering OCPs with constraints, for which we mention an attempt to analyze turnpike in the maximum hands-off control [41]. Acknowledgement. The authors would like to thank Emmanuel Trélat and Lars Grüne for their comments on the early version of the manuscript. The authors are also grateful to Dario Pighin for valuable discussions.

Appendix

Appendix A Results related with Riccati equations and linear Hamiltonian systems

Let us consider Riccati equation

[TABLE]

where $A,R,Q\in\mathbb{R}^{n\times n}$ are constant matrices with $R,Q\geqslant 0$ . Suppose that $(A,R)$ is stabilizable and $(Q,A)$ is detectable. The following are known (see, e.g., [14, 27, 39]).

Lemma A.1.

(i)

There is a solution $P\geqslant 0$ to (26) such that $A_{c}:=A-RP$ is Hurwitz. 2. (ii)

Let $L\leqslant 0$ be a solution to a Lyapunov equation

[TABLE]

then $\left[\begin{smallmatrix}I&L\\ P&PL+I\end{smallmatrix}\right]$ is a symplectic matrix and its inverse is $\left[\begin{smallmatrix}LP+I&-L\\ -P&I\end{smallmatrix}\right]$ . 3. (iii)

The Hamiltonian matrix $\mathrm{Ham}=\left[\begin{smallmatrix}A&-R\\ -Q&-A^{\top}\end{smallmatrix}\right]$ is block-diagonalized as

[TABLE]

The following lemma can be considered as dual version of Theorem 2 in [14, p.90], for which simplified proofs are given for the sake of self-containedness.

Lemma A.2.

$PL+I$ * is nonsingular. If, in addition, $(A,R)$ is controllable, then $L<0$ (negative definite).*

Proof. Let $V:=PL+I$ . From (27) we have

[TABLE]

We show that the condition $\dim\mathrm{Ker}\,V\geqslant 1$ leads to a contradiction. It can be shown from (28) that $0\neq v\in\mathrm{ker}\,V$ satisfies $QLv=0$ , $VA_{c}^{\top}v=0$ using $LV=V^{\top}L$ and $Q\geqslant 0$ , showing that $\mathrm{Ker}\,V$ is $A_{c}^{\top}$ -invariant. Thus, we may assume that $v$ is an eigenvector of $A_{c}^{\top}$ with eigenvalue $\lambda$ with $\mathrm{Re}\,\lambda<0$ . From (28b), we have $ALv=-A_{c}^{\top}v=-\lambda Lv$ and therefore $(-\lambda I-A)Lv=0$ . This shows that $\left[\begin{smallmatrix}Q\\ -\lambda I-A\end{smallmatrix}\right]Lv=0$ . With $\mathrm{Re}\,(-\lambda)>0$ , the detectability of $(Q,A)$ implies $Lv=0$ . This shows that $\left[\begin{smallmatrix}L\\ V\end{smallmatrix}\right]v=0$ with $v\neq 0$ , which contradicts to Lemma A.1(ii). The second statement can also be proved in a similar way, deriving $\lambda u=A^{\top}u$ , $Ru=0$ for $0\neq u\in\mathrm{Ker}\,L$ and a contradiction. $\hfill\blacksquare$

Lemma A.3.

Let

[TABLE]

where $\Phi_{ij}(t)$ , $i,j=1,2$ , are $n\times n$ matrix functions of $t$ . When $(A,R)$ is stabilizable and $(Q,A)$ is detectable, $\Phi_{11}(t)$ is nonsingular for $t\geqslant 0$ .

Proof. Using (27),

[TABLE]

where we have set $\tilde{L}:=L-\exp[-tA_{c}]L\exp[-tA_{c}^{\top}]$ . Since $\tilde{L}(0)=0$ and

[TABLE]

by Lemma A.1(ii), $\tilde{L}(t)\geqslant 0$ for $t\geqslant 0$ . If $\Phi_{11}(t)\eta=0$ for some $t\geqslant 0$ and $\eta\in\mathbb{C}^{n}$ , then we have $(I+\tilde{L}(t)P)\eta=0$ and therefore $\eta^{\ast}P\eta+\eta^{\ast}P\tilde{L}(t)P\eta=0$ . This implies $P\eta=0$ , $\tilde{L}(t)P\eta=0$ and we have $\eta=0$ . $\hfill\blacksquare$

Appendix B Existence of infinite horizon optimal control and stable manifold of Hamiltonian systems

This appendix introduces a result in [40] on the existence of infinite horizon optimal control. The main result in the paper is under simpler growth conditions than those given below, but is more restrictive to apply.

Let $U\subset\mathbb{R}^{n}$ be an open set containing the origin. A nonlinear system (5) is said to be $C^{1}$ -exponentially stabilizable in $U$ if there exists a $C^{1}$ feedback control $u=k(x)$ with $k(0)=0$ such that the the closed loop system is exponentially stable with respect to $U$ . Let $h(x)$ be a $C^{1}$ nonnegative function defined in $\mathbb{R}^{n}$ . A system (5) with output $y=h(x)$ is zero-state detectable for $U$ , or simply $(f,h)$ * is zero-state detectable for $U$ *, if the following holds. If a solution $x(t)$ with $x(0)\in U$ satisfies $h(x(t))=0$ for $t\geqslant 0$ , then $x(t)\to 0$ as $t\to\infty$ .

For system (5), let $x=(x_{1},x_{2})$ with $x_{1}\in\mathbb{R}^{n_{1}}$ , $x_{2}\in\mathbb{R}^{n_{2}}$ , $n_{1}+n_{2}=n$ and rewrite it as

[TABLE]

where $f_{j}:\mathbb{R}^{n}\to\mathbb{R}^{n_{j}}$ , $g_{j}:\mathbb{R}^{n}\to\mathbb{R}^{n_{j}\times m}$ , $j=1,2$ . Let $\varphi_{R}:\mathbb{R}^{n_{2}}\to\mathbb{R}$ be a $C^{\infty}$ cutoff function such that $\varphi_{R}(x_{2})=1$ for $|x_{2}|<R$ and $\varphi_{R}(x_{2})=0$ for $|x_{2}|\geqslant R+1$ . Define $\tilde{f}_{R}(x_{1},x_{2}):=f(x_{1},\varphi_{R}(x_{2})x_{2})$ and $\tilde{g}_{R}(x_{1},x_{2}):=g(x_{1},\varphi_{R}(x_{2})x_{2})$ .

Assumption 1.

(i)

System (5) is $C^{1}$ -exponentially stabilizable in $\Omega$ , where $\Omega$ is an open set in $\mathbb{R}^{n}$ containing the origin. 2. (ii)

For a nonnegative $C^{1}$ function $h(x)$ , there exist positive constants $p$ , $\rho$ , $c_{h}$ such that $h(x)\geqslant c_{h}|x|^{p}$ for $|x|>\rho$ . 3. (iii)

The pair $(f,h)$ is zero-state detectable for an open set containing $|x|\leqslant\rho$ . 4. (iv)

For any $R>0$ , there exist constants $c_{f}>0$ , $c_{g}>0$ , $0\leqslant\theta<1$ , which may depend on $R$ , such that

[TABLE]

for sufficiently large $x\in\mathbb{R}^{n}$ . 5. (v)

There exist constants $c_{f2}>0$ , $c_{g2}>0$ and $0\leqslant\theta_{2}<1$ such that

[TABLE]

for all $x_{1}\in\mathbb{R}^{n_{1}}$ and sufficiently large $x_{2}\in\mathbb{R}^{n_{2}}$ .

Proposition B.1.

Under Assumption 1, for OPC (5) and

[TABLE]

there exists an optimal control for $x(0)\in\Omega$ . Furthermore, for a Hamiltonian system associated with OCP (5)-(29), a stable manifold $S$ at the origin exists with the projection property $\Omega\subset\pi_{1}(S)$ .

Bibliography51

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Brian D. O. Anderson and Peter V. Kokotovic. Optimal control problems over large time intervals. Automatica , 23(3):355–363, 1987.
2[2] Michael Athans and Peter L. Falb. Optimal Control: An Introduction to the Theory and Its Applications . Mc Grow-Hill, New York, 1966.
3[3] Julian Berberich, Johannes Köhler, Frank Allgöwer, and Mathias A Müller. Indefinite linear quadratic optimal control: Strict dissipativity and turnpike properties. IEEE Control Systems Letters , 2(3):399–404, 2018.
4[4] Christopher I. Byrnes and Albelto Isidori. Asymptotic stabilization of minimum phase nonlinear systems. IEEE Trans. Automat. Control , 36(10):1122–1137, 1991.
5[5] D. A. Carlson, A. Haurie, and A. Leizarowitz. Infinite Horizon Optimal Control . Springer-Verlag, Berlin Heidelberg, 2nd edition, 1991.
6[6] Jacky Cresson and Stephen Wiggins. A λ 𝜆 \lambda -lemma for normally hyperbolic invariant manifolds. Regular and Chaotic Dynamics , 20(1):94–108, 2015.
7[7] Tobias Damm, Lars Grüne, Marleen Stieler, and Karl Worthmann. An exponential turnpike theorem for dissipative discrete time optimal control problems. SIAM J. Control Optim. , 52(3):1935–1957, 2014.
8[8] Martin V. Day. On Lagrange manifolds and viscosity solutions. Journal of Mathematical Systems, Estimation and Control , 18:369–372, 1998. http://www.math.vt.edu/people/day/research/LMVS.pdf .

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

The turnpike property in nonlinear optimal control — A geometric approach

Abstract

keywords:

1 Introduction

2 Turnpike in dynamical systems

Lemma 1** (The λ\lambdaλ-lemma).**

Proposition 2**.**

Remark 3**.**

Theorem 4**.**

3 Turnpike in nonlinear optimal control

Definition 5**.**

Remark 6**.**

3.1 The OCP with state variables unspecified at the terminal time

Assumption 7**.**

Assumption 8**.**

Theorem 9**.**

Remark 10**.**

Assumption 11**.**

Corollary 12**.**

Corollary 13**.**

Remark 14**.**

3.2 The OCP with state variables specified at the terminal time

Theorem 15**.**

Corollary 16**.**

Remark 17**.**

4 Examples

4.1 Problem (OCP1)(\mathrm{OCP}_{1})(OCP1​)

4.2 Problem (OCP2)(\mathrm{OCP}_{2})(OCP2​)

5 Discussions

5.1 Global analysis when (SOP) admits multiple solutions

5.2 Non-hyperbolic Hamiltonian systems

6 Conclusions

Appendix A Results related with Riccati equations and linear Hamiltonian systems

Lemma A.1**.**

Lemma A.2**.**

Lemma A.3**.**

Appendix B Existence of infinite horizon optimal control and stable manifold of Hamiltonian systems

Assumption 1**.**

Proposition B.1**.**

Lemma 1 (The $\lambda$ -lemma).

Proposition 2.

Remark 3.

Theorem 4.

Definition 5.

Remark 6.

Assumption 7.

Assumption 8.

Theorem 9.

Remark 10.

Assumption 11.

Corollary 12.

Corollary 13.

Remark 14.

Theorem 15.

Corollary 16.

Remark 17.

4.1 Problem $(\mathrm{OCP}_{1})$

4.2 Problem $(\mathrm{OCP}_{2})$

Lemma A.1.

Lemma A.2.

Lemma A.3.

Assumption 1.

Proposition B.1.