Analyzing a Maximum Principle for Finite Horizon State Constrained   Problems via Parametric Examples. Part 1: Problems with Unilateral State   Constraints

Vu Thi Huong; Jen-Chih Yao; and Nguyen Dong Yen

arXiv:1901.03794·math.OC·January 15, 2019

Analyzing a Maximum Principle for Finite Horizon State Constrained Problems via Parametric Examples. Part 1: Problems with Unilateral State Constraints

Vu Thi Huong, Jen-Chih Yao, and Nguyen Dong Yen

PDF

Open Access

TL;DR

This paper examines the maximum principle for finite horizon state constrained optimal control problems using parametric examples, focusing on problems with unilateral constraints to deepen understanding and illustrate applications in economic growth models.

Contribution

It provides a detailed analysis of the maximum principle via parametric examples with unilateral constraints, linking theoretical conditions to economic growth models.

Findings

01

Establishes solution existence using Filippov's theorem.

02

Analyzes the maximum principle as a necessary condition.

03

Serves as a prototype for economic optimal growth models.

Abstract

In the present paper, the maximum principle for finite horizon state constrained problems from the book by R. Vinter [\textit{Optimal Control}, Birkh\"auser, Boston, 2000; Theorem~9.3.1] is analyzed via parametric examples. The latter has origin in a recent paper by V.~Basco, P.~Cannarsa, and H.~Frankowska, and resembles the optimal growth problem in mathematical economics. The solution existence of these parametric examples is established by invoking Filippov's existence theorem for Mayer problems. Since the maximum principle is only a necessary condition for local optimal processes, a large amount of additional investigations is needed to obtain a comprehensive synthesis of finitely many processes suspected for being local minimizers. Our analysis not only helps to understand the principle in depth, but also serves as a sample of applying it to meaningful prototypes of economic…

Figures6

Click any figure to enlarge with its caption.

Equations218

N_{Ω} (\overset{v}{ˉ}) = {v^{'} \in I R^{n} : v Ω \overset{v}{ˉ} lim sup \frac{⟨ v ^{'} , v - v ˉ ⟩}{∥ v - v ˉ ∥} \leq 0},

N_{Ω} (\overset{v}{ˉ}) = {v^{'} \in I R^{n} : v Ω \overset{v}{ˉ} lim sup \frac{⟨ v ^{'} , v - v ˉ ⟩}{∥ v - v ˉ ∥} \leq 0},

\displaystyle N_{\Omega}(\bar{v})=\big{\{}v^{\prime}\in{\rm I\!R}^{n}\,:\,\exists\mbox{ sequences }v_{k}\to\bar{v},\ v_{k}^{\prime}\rightarrow v^{\prime}\mbox{ with }v_{k}^{\prime}\in\widehat{N}_{\Omega}(v_{k})\;\mbox{for all}\;k\in{\rm I\!N}\big{\}}.

\displaystyle N_{\Omega}(\bar{v})=\big{\{}v^{\prime}\in{\rm I\!R}^{n}\,:\,\exists\mbox{ sequences }v_{k}\to\bar{v},\ v_{k}^{\prime}\rightarrow v^{\prime}\mbox{ with }v_{k}^{\prime}\in\widehat{N}_{\Omega}(v_{k})\;\mbox{for all}\;k\in{\rm I\!N}\big{\}}.

\partial\varphi(\bar{x})=\big{\{}x^{*}\in{\rm I\!R}^{n}\;:\;(x^{*},-1)\in N\big{(}(\bar{x},\varphi(\bar{x}));\mbox{\rm epi}\,\varphi\big{)}\big{\}}.

\partial\varphi(\bar{x})=\big{\{}x^{*}\in{\rm I\!R}^{n}\;:\;(x^{*},-1)\in N\big{(}(\bar{x},\varphi(\bar{x}));\mbox{\rm epi}\,\varphi\big{)}\big{\}}.

\mbox M inimi z e g (x (t_{0}), x (T)),

\mbox M inimi z e g (x (t_{0}), x (T)),

⎩ ⎨ ⎧ \overset{x}{˙} (t) = f (t, x (t), u (t)), (x (t_{0}), x (T)) \in C u (t) \in U (t), h (t, x (t)) \leq 0, \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T] \forall t \in [t_{0}, T],

⎩ ⎨ ⎧ \overset{x}{˙} (t) = f (t, x (t), u (t)), (x (t_{0}), x (T)) \in C u (t) \in U (t), h (t, x (t)) \leq 0, \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T] \forall t \in [t_{0}, T],

H (t, x, p, u) := p . f (t, x, u) = i = 1 \sum n p_{i} f_{i} (t, x, u) .

H (t, x, p, u) := p . f (t, x, u) = i = 1 \sum n p_{i} f_{i} (t, x, u) .

\displaystyle\partial^{>}_{x}h(t,x):=\mbox{\rm co}\,\big{\{}\xi\,:\,

\displaystyle\partial^{>}_{x}h(t,x):=\mbox{\rm co}\,\big{\{}\xi\,:\,

\displaystyle\ h(t_{k},x_{k})>0\mbox{ for all }k\mbox{ and }\nabla_{x}h(t_{k},x_{k})\to\xi\big{\}},

f (x) = \int_{[t_{0}, T]} x (t) d v (t),

f (x) = \int_{[t_{0}, T]} x (t) d v (t),

μ_{v} (A) := \int_{[t_{0}, T]} χ_{A} (t) d v (t),

μ_{v} (A) := \int_{[t_{0}, T]} χ_{A} (t) d v (t),

∥ f (t, x, u) - f (t, x^{'}, u) ∥ \leq k (t, u) ∥ x - x^{'} ∥, \forall x, x^{'} \in \overset{x}{ˉ} (t) + δ \overset{ˉ}{B}, \forall u \in U (t)

∥ f (t, x, u) - f (t, x^{'}, u) ∥ \leq k (t, u) ∥ x - x^{'} ∥, \forall x, x^{'} \in \overset{x}{ˉ} (t) + δ \overset{ˉ}{B}, \forall u \in U (t)

∥ h (t, x) - h (t, x^{'}) ∥ \leq K ∥ x - x^{'} ∥, \forall x, x^{'} \in \overset{x}{ˉ} (t) + δ \overset{ˉ}{B}, \forall t \in [t_{0}, T] .

∥ h (t, x) - h (t, x^{'}) ∥ \leq K ∥ x - x^{'} ∥, \forall x, x^{'} \in \overset{x}{ˉ} (t) + δ \overset{ˉ}{B}, \forall t \in [t_{0}, T] .

∥ f (t, x, u) - f (t, x^{'}, u) ∥ \leq k (t, u) ∥ x - x^{'} ∥, \forall x, x^{'} \in \overset{x}{ˉ} (t) + δ \overset{ˉ}{B}, u \in U (t), a . e .;

∥ f (t, x, u) - f (t, x^{'}, u) ∥ \leq k (t, u) ∥ x - x^{'} ∥, \forall x, x^{'} \in \overset{x}{ˉ} (t) + δ \overset{ˉ}{B}, u \in U (t), a . e .;

M := {(t, x, u) \in I R \times I R^{n} \times I R^{m} : (t, x) \in A, u \in U (t, x)},

M := {(t, x, u) \in I R \times I R^{n} \times I R^{m} : (t, x) \in A, u \in U (t, x)},

\mbox M inimi z e g (t_{0}, x (t_{0}), T, x (T))

\mbox M inimi z e g (t_{0}, x (t_{0}), T, x (T))

⎩ ⎨ ⎧ \overset{x}{˙} (t) = f (t, x (t), u (t)), (t, x (t)) \in A, (t_{0}, x (t_{0}), T, x (T)) \in B u (t) \in U (t, x (t)), \mbox a . e . t \in [t_{0}, T] \mbox f or a l l t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T],

⎩ ⎨ ⎧ \overset{x}{˙} (t) = f (t, x (t), u (t)), (t, x (t)) \in A, (t_{0}, x (t_{0}), T, x (T)) \in B u (t) \in U (t, x (t)), \mbox a . e . t \in [t_{0}, T] \mbox f or a l l t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T],

A(t)=\big{\{}x\in{\rm I\!R}^{n}\;:\;(t,x)\in A\big{\}}\quad\;(t\in A_{0})

A(t)=\big{\{}x\in{\rm I\!R}^{n}\;:\;(t,x)\in A\big{\}}\quad\;(t\in A_{0})

Q(t,x)=\big{\{}z\in{\rm I\!R}^{n}\;:\;z=f(t,x,u),\ u\in U(t,x)\big{\}}\quad\;((t,x)\in A).

Q(t,x)=\big{\{}z\in{\rm I\!R}^{n}\;:\;z=f(t,x,u),\ u\in U(t,x)\big{\}}\quad\;((t,x)\in A).

x_{1} f_{1} (t, x, u) + x_{2} f_{2} (t, x, u) + \dots + x_{n} f_{n} (t, x, u) \leq c (∥ x ∥^{2} + 1) \forall (t, x, u) \in M .

x_{1} f_{1} (t, x, u) + x_{2} f_{2} (t, x, u) + \dots + x_{n} f_{n} (t, x, u) \leq c (∥ x ∥^{2} + 1) \forall (t, x, u) \in M .

\mbox{Minimize}\ \;J(x,u)=\int_{t_{0}}^{T}\big{[}-e^{-\lambda t}(x(t)+u(t))\big{]}dt

\mbox{Minimize}\ \;J(x,u)=\int_{t_{0}}^{T}\big{[}-e^{-\lambda t}(x(t)+u(t))\big{]}dt

⎩ ⎨ ⎧ \overset{x}{˙} (t) = - a u (t), x (t_{0}) = x_{0} u (t) \in [- 1, 1], \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T],

⎩ ⎨ ⎧ \overset{x}{˙} (t) = - a u (t), x (t_{0}) = x_{0} u (t) \in [- 1, 1], \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T],

x_{2}(t):=\int_{t_{0}}^{t}\big{[}-e^{-\lambda t}(x_{1}(\tau)+u(\tau))\big{]}d\tau

x_{2}(t):=\int_{t_{0}}^{t}\big{[}-e^{-\lambda t}(x_{1}(\tau)+u(\tau))\big{]}d\tau

\mbox M inimi z e x_{2} (T)

\mbox M inimi z e x_{2} (T)

⎩ ⎨ ⎧ \overset{x}{˙}_{1} (t) = - a u (t), \overset{x}{˙}_{2} (t) = - e^{- λ t} (x_{1} (t) + u (t)), (x (t_{0}), x (T)) \in {(x_{0}, 0)} \times I R^{2} u (t) \in [- 1, 1], \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T] .

⎩ ⎨ ⎧ \overset{x}{˙}_{1} (t) = - a u (t), \overset{x}{˙}_{2} (t) = - e^{- λ t} (x_{1} (t) + u (t)), (x (t_{0}), x (T)) \in {(x_{0}, 0)} \times I R^{2} u (t) \in [- 1, 1], \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T] \mbox a . e . t \in [t_{0}, T] .

Q (t, x)

Q (t, x)

\displaystyle=\big{\{}z\in{\rm I\!R}^{2}\;:\;z=(-au,-e^{-\lambda t}(x_{1}+u)),\ u\in[-1,1]\big{\}}

\displaystyle=\big{\{}(0,-e^{-\lambda t}x_{1})\}+\{(-a,-e^{-\lambda t})u\;:\;u\in[-1,1]\big{\}}

M_{ε}

M_{ε}

= {(t, x, u) \in [t_{0}, T] \times I R^{2} \times [- 1, 1] : ∥ x ∥ \leq ε}

= [t_{0}, T] \times {x \in I R^{2} : ∥ x ∥ \leq ε} \times [- 1, 1],

∥ f (t, x, u) ∥ = ∥ (- a u, - e^{- λ t} (x_{1} + u))

∥ f (t, x, u) ∥ = ∥ (- a u, - e^{- λ t} (x_{1} + u))

\leq a + ∣ x_{1} ∣ + 1

\leq c (∥ x ∥ + 1)

H (t, x, p, u) = - a u p_{1} - e^{- λ t} (x_{1} + u) p_{2} \forall (t, x, p, u) \in [t_{0}, T] \times I R^{2} \times I R^{2} \times I R,

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAerospace Engineering and Control Systems · Optimization and Variational Analysis · Spacecraft Dynamics and Control

Full text

Analyzing a Maximum Principle for Finite Horizon State Constrained Problems via Parametric Examples. Part 1: Problems with Unilateral State Constraints111Financial supports from several research projects in Taiwan and Vietnam are gratefully acknowledged.

V.T. Huong222Institute of Mathematics, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Hanoi 10307, Vietnam; email: [email protected]; [email protected]., J.-C. Yao333Center for General Education, China Medical University, Taichung 40402, Taiwan; Email: [email protected], and N.D. Yen444Institute of Mathematics, Vietnam Academy of Science and Technology, 18 Hoang Quoc Viet, Hanoi 10307, Vietnam; email: [email protected].

(Dedicated to Professor Do Sang Kim on the occasion of his 65th birthday)

Abstract. In the present paper, the maximum principle for finite horizon state constrained problems from the book by R. Vinter [Optimal Control, Birkhäuser, Boston, 2000; Theorem 9.3.1] is analyzed via parametric examples. The latter has origin in a recent paper by V. Basco, P. Cannarsa, and H. Frankowska, and resembles the optimal growth problem in mathematical economics. The solution existence of these parametric examples is established by invoking Filippov’s existence theorem for Mayer problems. Since the maximum principle is only a necessary condition for local optimal processes, a large amount of additional investigations is needed to obtain a comprehensive synthesis of finitely many processes suspected for being local minimizers. Our analysis not only helps to understand the principle in depth, but also serves as a sample of applying it to meaningful prototypes of economic optimal growth models. Problems with unilateral state constraints are studied in Part 1 of the paper. Problems with bilateral state constraints will be addressed in Part 2.

Keywords: Finite horizon optimal control problem, state constraint, maximum principle, solution existence theorem, function of bounded variation, Borel measurable function, Lebesgue-Stieltjes integral.

2010 Mathematics Subject Classification: 49K15, 49J15.

1 Introduction

It is well known that optimal control problems with state constraints are models of importance, but one usually faces with a lot of difficulties in analyzing them. These models have been considered since the early days of the optimal control theory. For instance, the whole Chapter VI of the classical work [12, pp. 257–316] is devoted to problems with restricted phase coordinates. There are various forms of the maximum principle for optimal control problems with state constraints; see, e.g., [4], where the relations between several forms are shown and a series of numerical illustrative examples have been solved.

To deal with state constraints, one has to use functions of bounded variation, Borel measurable functions, Lebesgue-Stieltjes integral, nonnegative measures on the $\sigma-$ algebra of the Borel sets, the Riesz Representation Theorem for the space of continuous functions, and so on.

By using the maximum principle presented in [3, pp. 233–254], Phu [10, 11] has proposed an ingenious method called the method of region analysis to solve several classes of optimal control problems with one state and one control variable, which have both state and control constraints. Minimization problems of the Lagrange type were considered by the author and, among other things, it was assumed that integrand of the objective function is strictly convex with respect to the control variable. To be more precise, the author considered regular problems, i.e., the optimal control problems where the Pontryagin function is strictly convex with respect to the control variable.

In the present paper, the maximum principle for finite horizon state constrained problems from the book by Vinter [14, Theorem 9.3.1] is analyzed via parametric examples. The latter has origin in a recent paper by Basco, Cannarsa, and Frankowska [1, Example 1], and resembles the optimal growth problem in mathematical economics (see, e.g., [13, pp. 617–625]). The solution existence of these parametric examples, which are irregular optimal control problems in the sense of Phu [10, 11], is established by invoking Filippov’s existence theorem for Mayer problems [2, Theorem 9.2.i and Section 9.4]. Since the maximum principle is only a necessary condition for local optimal processes, a large amount of additional investigations is needed to obtain a comprehensive synthesis of finitely many processes suspected for being local minimizers. Our analysis not only helps to understand the principle in depth, but also serves as a sample of applying it to meaningful prototypes of economic optimal growth models.

Note that the maximum principle for finite horizon state constrained problems in [14, Chapter 9] covers many known ones for smooth problems and allows us to deal with nonsmooth problems by using the Mordukhovich normal cone and the Mordukhovich subdifferential [7, 8, 9], which are also called the limiting normal cone and the limiting subdifferential. This principle is a necessary optimality condition which asserts the existence of a multipliers set $(p,\mu,\nu,\gamma)$ consisting of an absolutely continuous function $p$ , a function of bounded variation $\mu$ , a Borel measurable function $\nu$ , and a real number $\gamma\geq 0$ , where $(p,\mu,\gamma)\neq(0,0,0)$ , such that the four conditions (i)–(iv) in Theorem 2.1 below are satisfied. The relationships between these conditions are worthy a detailed analysis. We will present such an analysis via three parametric examples of optimal control problems of the Langrange type, which have five parameters $(\lambda,a,x_{0},t_{0},T)$ , where $\lambda>0$ appears in the description of the objective function, $a>0$ appears in the differential equation, $x_{0}$ is the initial value, $t_{0}$ is the initial time, and $T$ is the terminal time. Observe that, in Example 1 of [1], $T=\infty$ , $x_{0}$ and $t_{0}$ are fixed. Problems with unilateral state constraints are studied in Part 1 of the paper. Problems with bilateral state constraints will be addressed in Part 2.

This Part 1 is organized as follows. Section 2 presents some background materials including the above-mentioned maximum principle and Filippov’s existence theorem for Mayer problems. Control problems without state constraints are considered in Section 3, while control problems with unilateral state constraints are studied in Section 4. Some concluding remarks are given in Section 5.

2 Background Materials

In this section, we give some notations, definitions, and results that will be used repeatedly in the sequel.

2.1 Notations and Definitions

The symbol ${\rm I\!R}$ (resp., ${\rm I\!N})$ denotes the set of real numbers (resp., the set of positive integers). The norm in the $n$ -dimensional Euclidean space ${\rm I\!R}^{n}$ is denoted by $\|.\|$ . For a subset $C\subset{\rm I\!R}^{n}$ , we abbreviate its convex hull to $\mbox{\rm co}\,C$ . For a set-valued map $F:{\rm I\!R}^{n}\rightrightarrows{\rm I\!R}^{m}$ , we call the set ${\rm gph}\,F:=\{(x,y)\in{\rm I\!R}^{n}\times{\rm I\!R}^{m}\,:\,y\in F(x)\}$ the graph of $F$ .

Let $\Omega\subset{\rm I\!R}^{n}$ be a closed set and $\bar{v}\in\Omega$ . The Fréchet normal cone (also called the prenormal cone, or the regular normal cone) to $\Omega\subset{\rm I\!R}^{n}$ at $\bar{v}$ is given by

[TABLE]

where $v\xrightarrow{\Omega}\bar{v}$ means $v\to\bar{v}$ with $v\in\Omega$ . The Mordukhovich (or limiting) normal cone to $\Omega$ at $\bar{v}$ is defined by

[TABLE]

Given an extended real-valued function $\varphi:{\rm I\!R}^{n}\rightarrow{\rm I\!R}\cup\{-\infty,+\infty\}$ , one defines the epigraph of $\varphi$ by $\mbox{\rm epi}\,\varphi=\{(x,\mu)\in{\rm I\!R}^{n}\times{\rm I\!R}\,:\,\mu\geq\varphi(x)\}$ . The Mordukhovich subdifferential (or limiting subdifferential) of $\varphi$ at $\bar{x}\in{\rm I\!R}^{n}$ with $|\varphi(\bar{x})|<\infty$ is defined by

[TABLE]

If $|\varphi(x)|=\infty$ , then one puts $\partial\varphi(\bar{x})=\emptyset$ . The reader is referred to [7, Chapter 1] and [9, Chapter 1] for comprehensive treatments of the Fréchet normal cone, the limiting normal cone, the limiting subdifferential, and the related calculus rules.

For a given segment $[t_{0},T]$ of the real line, we denote the $\sigma$ -algebra of its Lebesgue measurable subsets (resp., the $\sigma$ -algebra of its Borel measurable subsets) by $\mathcal{L}$ (resp., $\mathcal{B}$ ). The Sobolev space $W^{1,1}([t_{0},T],{\rm I\!R}^{n})$ is the linear space of the absolutely continuous functions $x:[t_{0},T]\to{\rm I\!R}^{n}$ endowed with the norm $\|x\|_{W^{1,1}}=\|x(t_{0})\|+\displaystyle\int_{t_{0}}^{T}\|\dot{x}(t)\|dt$ (see, e.g., [5, p. 21] for this and another equivalent norm).

As in [14, p. 321], we consider the following finite horizon optimal control problem of the Mayer type, denoted by $\mathcal{M}$ ,

[TABLE]

over $x\in W^{1,1}([t_{0},T],{\rm I\!R}^{n})$ and measurable functions $u:[t_{0},T]\to{\rm I\!R}^{m}$ satisfying

[TABLE]

where $[t_{0},T]$ is a given interval, $g:{\rm I\!R}^{n}\times{\rm I\!R}^{n}\to{\rm I\!R}$ , $f:[t_{0},T]\times{\rm I\!R}^{n}\times{\rm I\!R}^{m}\to{\rm I\!R}^{n}$ , and $h:[t_{0},T]\times{\rm I\!R}^{n}\to{\rm I\!R}$ are given functions, $C\subset{\rm I\!R}^{n}\times{\rm I\!R}^{n}$ is a closed set, and $U:[t_{0},T]\rightrightarrows{\rm I\!R}^{m}$ is a set-valued map.

A measurable function $u:[t_{0},T]\to{\rm I\!R}^{m}$ satisfying $u(t)\in U(t)$ a.e. $t\in[t_{0},T]$ is called a control function. A process $(x,u)$ consists of a control function $u$ and an arc $x\in W^{1,1}([t_{0},T];{\rm I\!R}^{n})$ that is a solution to the differential equation in (2.2). A state trajectory $x$ is the first component of some process $(x,u)$ . A process $(x,u)$ is called feasible if the state trajectory satisfies the endpoint constraint $(x(t_{0}),x(T))\in C$ and the state constraint $h(t,x(t))\leq 0$ for all $t\in[t_{0},T]$ .

Due to the appearance of the state constraint, the problem $\mathcal{M}$ in (2.1)–(2.2) is said to be an optimal control problem with state constraints. But, if the inequality $h(t,x(t))\leq 0$ is fulfilled for every $(t,x(t))$ with $t\in[t_{0},T]$ and $x\in W^{1,1}([t_{0},T];{\rm I\!R}^{n})$ (for example, when $h$ is constant function having a fixed nonpositive value), i.e., the condition $h(t,x(t))\leq 0$ for all $t\in[t_{0},T]$ can be removed from (2.2), then one says that $\mathcal{M}$ an optimal control problem without state constraints.

The Hamiltonian $\mathcal{H}:[t_{0},T]\times{\rm I\!R}^{n}\times{\rm I\!R}^{n}\times{\rm I\!R}^{m}\to{\rm I\!R}$ of (2.2) is defined by

[TABLE]

Definition 2.1.

A feasible process $(\bar{x},\bar{u})$ is called a $W^{1,1}$ local minimizer for $\mathcal{M}$ if there exists $\delta>0$ such that $g(\bar{x}(t_{0}),\bar{x}(T))\leq g(x(t_{0}),x(T))$ for any feasible processes $(x,u)$ satisfying $\|\bar{x}-x\|_{W^{1,1}}\leq\delta$ .**

Definition 2.2.

A feasible process $(\bar{x},\bar{u})$ is called a $W^{1,1}$ global minimizer for $\mathcal{M}$ if, for any feasible processes $(x,u)$ , one has $g(\bar{x}(t_{0}),\bar{x}(T))\leq g(x(t_{0}),x(T))$ .**

Definition 2.3 (See [14, p. 329]).

The partial hybrid subdifferential $\partial^{>}_{x}h(t,x)$ of $h(t,x)$ w.r.t. $x$ is given by

[TABLE]

where the symbol $(t_{k},x_{k})\overset{h}{\rightarrow}(t,x)$ means that $(t_{k},x_{k})\rightarrow(t,x)$ and $h(t_{k},x_{k})\rightarrow h(t,x)$ as $k\to\infty$ .**

2.2 A Maximum Principle for State Constrained Problems

Due to the appearance of the state constraint $h(t,x(t))\leq 0$ in $\mathcal{M}$ , one has to introduce a multiplier that is an element in the topological dual $C^{*}([t_{0},T];{\rm I\!R})$ of the space of continuous functions $C([t_{0},T];{\rm I\!R})$ with the supremum norm. By the Riesz Representation Theorem (see, e.g., [5, Theorem 6, p. 374] and [6, Theorem 1, pp. 113–115]), any bounded linear functional $f$ on $C([t_{0},T];{\rm I\!R})$ can be uniquely represented in the form

[TABLE]

where $v$ is a function of bounded variation on $[t_{0},T]$ which vanishes at $t_{0}$ and which are continuous from the right at every point $\tau\in(t_{0},T)$ , and $\displaystyle\int_{[t_{0},T]}x(t)dv(t)$ is the Riemann-Stieltjes integral of $x$ with respect to $v$ (see, e.g., [5, p. 364]). The set of the elements of $C^{*}([t_{0},T];{\rm I\!R})$ which are given by nondecreasing functions $v$ is denoted by $C^{\oplus}(t_{0},T)$ .

Every $v\in C^{*}([t_{0},T];{\rm I\!R})$ corresponds to a finite regular measure, denoted by $\mu_{v}$ , on the $\sigma$ -algebra ${\mathcal{B}}$ of the Borel subsets of $[t_{0},T]$ by the formula

[TABLE]

where $\chi_{A}(t)=1$ for $t\in A$ and $\chi_{A}(t)=0$ if $t\notin A$ . Due to the correspondence $v\mapsto\mu_{v}$ , we call every element $v\in C^{*}([t_{0},T];{\rm I\!R})$ a “measure” and identify $v$ with $\mu_{v}$ . Clearly, the measure corresponding to each $v\in C^{\oplus}(t_{0},T)$ is nonnegative.

The integrals $\displaystyle\int_{[t_{0},t)}\nu(s)d\mu(s)$ and $\displaystyle\int_{[t_{0},T]}\nu(s)d\mu(s)$ of a Borel measurable function $\nu$ in next theorem are understood in the sense of the Lebesgue-Stieltjes integration [5, p. 364].

Theorem 2.1 (See [14, Theorem 9.3.1]).

Let $(\bar{x},\bar{u})$ be a $W^{1,1}$ local minimizer for $\mathcal{M}$ . Assume that for some $\delta>0$ , the following hypotheses are satisfied:

(H1)

$f(.,x,.)$ * is $\mathcal{L}\times\mathcal{B}^{m}$ measurable, for fixed $x$ . There exists a Borel measurable function $k(.,.):[t_{0},T]\times{\rm I\!R}^{m}\to{\rm I\!R}$ such that $t\mapsto k(t,\bar{u}(t))$ is integrable and*

[TABLE]

for almost all $t\in[t_{0},T]$ ; 2. (H2)

$\mbox{\rm gph}\,U$ * is a Borel set in $[t_{0},T]\times{\rm I\!R}^{m}$ ;* 3. (H3)

$g$ * is Lipschitz continuous on the ball $(\bar{x}(t_{0}),\bar{x}(T))+\delta\bar{B}$ ;* 4. (H4)

$h$ * is upper semicontinuous and there exists $K>0$ such that*

[TABLE]

Then there exist $p\in W^{1,1}([t_{0},T];{\rm I\!R}^{n})$ , $\gamma\geq 0$ , $\mu\in C^{\oplus}(t_{0},T)$ , and a Borel measurable function $\nu:[t_{0},T]\to{\rm I\!R}^{n}$ such that $(p,\mu,\gamma)\neq(0,0,0)$ , and for $q(t):=p(t)+\eta(t)$ with $\eta(t):=\displaystyle\int_{[t_{0},t)}\nu(s)d\mu(s)$ if $t\in[t_{0},T)$ and $\eta(T):=\displaystyle\int_{[t_{0},T]}\nu(s)d\mu(s)$ , the following holds true:

(i)

$\nu(t)\in\partial^{>}_{x}h(t,\bar{x}(t))\ \mu-\mbox{a.e.};$ ** 2. (ii)

$-\dot{p}(t)\in\mbox{\rm co}\,\partial_{x}\mathcal{H}(t,\bar{x}(t),q(t),\bar{u}(t))$ * a.e.;* 3. (iii)

$(p(t_{0}),-q(T))\in\gamma\partial g(\bar{x}(t_{0}),\bar{x}(T))+N_{C}(\bar{x}(t_{0}),\bar{x}(T))$ ; 4. (iv)

$\mathcal{H}(t,\bar{x}(t),q(t),\bar{u}(t))=\max_{u\in U(t)}\mathcal{H}(t,\bar{x}(t),q(t),u)$ * a.e.*

Applying Theorem 2.1 to unconstrained optimal control problems, one has next proposition.

Proposition 2.1 (See [14, Theorem 6.2.1]).

Suppose that $\mathcal{M}$ is an optimal control problem without state constraints. Let $(\bar{x},\bar{u})$ be a $W^{1,1}$ local minimizer for $\mathcal{M}$ . Assume that for some $\delta>0$ , the following hypotheses are satisfied.

(H1)

For every $x\in{\rm I\!R}^{n}$ , the function $f(.,x,.):[t_{0},T]\times{\rm I\!R}^{m}\to{\rm I\!R}^{n}$ is $\mathcal{L}\times\mathcal{B}^{m}$ measurable. In addition, there exists a Borel measurable function $k:[t_{0},T]\times{\rm I\!R}^{m}\to{\rm I\!R}$ such that $t\mapsto k(t,\bar{u}(t))$ is integrable and

[TABLE] 2. (H2)

$\mbox{\rm gph}\,U$ * is an $\mathcal{L}\times\mathcal{B}^{m}$ measurable set in $[t_{0},T]\times{\rm I\!R}^{m}$ ;* 3. (H3)

$g$ * is locally Lipschitz continuous.*

Then there exist $p\in W^{1,1}([t_{0},T];{\rm I\!R}^{n})$ and $\gamma\geq 0$ such that $(p,\gamma)\neq(0,0)$ and the following holds true:

(i)

$-\dot{p}(t)\in\mbox{\rm co}\,\partial_{x}\mathcal{H}(t,\bar{x}(t),p(t),\bar{u}(t))$ * a.e.;* 2. (ii)

$(p(t_{0}),-p(T))\in\gamma\partial g(\bar{x}(t_{0}),\bar{x}(T))+N_{C}(\bar{x}(t_{0}),\bar{x}(T))$ ; 3. (iii)

$\mathcal{H}(t,\bar{x}(t),p(t),\bar{u}(t))=\max_{u\in U(t)}\mathcal{H}(t,\bar{x}(t),p(t),u)$ .

2.3 Solution Existence in State Constrained Optimal Control

To recall a solution existence theorem for optimal control problems with state constraints of the Mayer type, we will use the notations and concepts given in [2, Section 9.2]. Let $A$ be a subset of ${\rm I\!R}\times{\rm I\!R}^{n}$ and $U:A\rightrightarrows{\rm I\!R}^{m}$ be a set-valued map defined on $A$ . Let

[TABLE]

and $f=(f_{1},f_{2},\dots,f_{n}):M\to{\rm I\!R}^{n}$ be a single-valued map defined on $M$ . Let $B$ be a given subset of ${\rm I\!R}\times{\rm I\!R}^{n}\times{\rm I\!R}\times{\rm I\!R}^{n}$ and $g:B\to{\rm I\!R}$ be a real function defined on $B$ . Consider the optimal control problem of the Mayer type

[TABLE]

over $x\in W^{1,1}([t_{0},T];{\rm I\!R}^{n})$ and measurable functions $u:[t_{0},T]~{}\to~{}{\rm I\!R}^{m}$ satisfying

[TABLE]

where $[t_{0},T]$ is a given interval. The problem (2.5)–(2.6) will be denoted by $\mathcal{M}_{1}$ .

A feasible process for $\mathcal{M}_{1}$ is a pair of functions $(x,u)$ with $x:[t_{0},T]\to{\rm I\!R}^{n}$ being absolutely continuous on $[t_{0},T]$ , $u:[t_{0},T]\to{\rm I\!R}^{m}$ being measurable, such that all the requirements in (2.6) are satisfied. If $(x,u)$ is a feasible process for $\mathcal{M}_{1}$ , then $x$ is said to be a feasible trajectory, and $u$ a feasible control function for $\mathcal{M}_{1}$ . The set of all feasible processes for $\mathcal{M}_{1}$ is denoted by $\Omega$ .

Let $A_{0}=\big{\{}t\in\mathbb{R}\,:\,\exists x\in\mathbb{R}^{n}\ {\rm s.t.}\ (t,x)\in A\big{\}}$ , i.e., $A_{0}$ is the projection of $A$ on the $t-$ axis. Set

[TABLE]

and

[TABLE]

The forthcoming statement is called Filippov’s Existence Theorem for Mayer problems.

Theorem 2.2 (see [2, Theorem 9.2.i and Section 9.4]).

Suppose that $\Omega$ is nonempty, $B$ is closed, $g$ is lower semicontinuous on $B$ , $f$ is continuous on $M$ and, for almost every $t\in[t_{0},T]$ , the sets $Q(t,x)$ , $x\in A(t)$ , are convex. Moreover, assume either that $A$ and $M$ are compact or that $A$ is not compact but closed and the following three conditions hold

(a)

For any $\varepsilon\geq 0$ , the set $M_{\varepsilon}:=\{(t,x,u)\in M\;:\;\|x\|\leq\varepsilon\}$ is compact; 2. (b)

There is a compact subset $P$ of $A$ such that every feasible trajectory $x$ of $\mathcal{M}_{1}$ passes through at least one point of $P$ ; 3. (c)

There exists $c\geq 0$ such that

[TABLE]

Then, $\mathcal{M}_{1}$ has a $W^{1,1}$ global minimizer.

Clearly, condition (b) is satisfied if the initial point $(t_{0},x(t_{0}))$ or the end point $(T,x(T))$ is fixed. As shown in [2, p. 317], the following condition implies (c):

( $c_{0}$ )

There exists $c\geq 0$ such that $\|f(t,x,u)\|\leq c(\|x\|+1)$ for all $(t,x,u)\in M$ .

3 Control Problems without State Constraints

Denote by $(FP_{1})$ the finite horizon optimal control problem of the Lagrange type

[TABLE]

over $x\in W^{1,1}([t_{0},T],{\rm I\!R})$ and measurable function $u:[t_{0},T]\to{\rm I\!R}$ satisfying

[TABLE]

with $a>\lambda>0$ , $T>t_{0}\geq 0$ , and $x_{0}\in{\rm I\!R}$ being given.

To treat $(FP_{1})$ in (3.7)–(3.8) as a problem of the Mayer type, we set $x(t)=(x_{1}(t),x_{2}(t))$ , where $x_{1}(t)$ plays the role of the state variable $x(t)$ in $(FP_{1})$ , and

[TABLE]

for all $t\in[0,T]$ . Then $(FP_{1})$ is equivalent to the problem

[TABLE]

over $x=(x_{1},x_{2})\in W^{1,1}([t_{0},T],{\rm I\!R}^{2})$ and measurable functions $u:[t_{0},T]\to{\rm I\!R}$ satisfying

[TABLE]

The problem (3.9)–(3.10) is abbreviated to $(FP_{1a})$ .

3.1 Solution Existence

Clearly, $(FP_{1a})$ is of the form $\mathcal{M}_{1}$ (see Subsection 2.3) with $n=2$ , $m=1$ , $A=[t_{0},T]~{}\times~{}{\rm I\!R}^{2}$ , $U(t,x)=[-1,1]$ for all $(t,x)\in A$ , $B=\{t_{0}\}\times\{(x_{0},0)\}\times{\rm I\!R}\times{\rm I\!R}^{2}$ , $g(t_{0},x(t_{0}),T,x(T))=x_{2}(T)$ , $M=A\times[-1,1]$ , $f(t,x,u)=(-au,-e^{-\lambda t}(x_{1}+u))$ for all $(t,x,u)\in M$ . We are going to show that $(FP_{1a})$ satisfies all the assumptions of Theorem 2.2.

Clearly, the pair $(x,u)$ , where $u(t)=0$ , $x_{1}(t)=x_{0}$ , and $x_{2}(t)=-x_{0}\displaystyle\int_{t_{0}}^{t}e^{-\lambda\tau}d\tau$ for all $t\in[t_{0},T]$ , is a feasible process for $(FP_{1a})$ . Thus, the set $\Omega$ of feasible processes is nonempty. Besides, $B$ is closed, $g$ is lower semicontinuous on $B$ , $f$ is continuous on $M$ . Moreover, by the formula for $A$ , one has $A_{0}=[t_{0},T]$ and $A(t)={\rm I\!R}^{2}$ for all $t\in A_{0}$ . In addition, from the formulas for $M$ , $U$ , and $f$ , one gets

[TABLE]

for any $(t,x)\in A$ . Thus, for every $t\in[t_{0},T]$ , the sets $Q(t,x)$ , $x\in A(t)$ , are line segments; hence they are convex. Since $A$ is closed, but not compact, we have to check the conditions (a)–(c) in Theorem 2.2.

Condition (a): For any $\varepsilon\geq 0$ , since

[TABLE]

one sees that $M_{\varepsilon}$ is compact.

Condition (b): Obviously, $P:=\{t_{0}\}\times\{(x_{0},0)\}$ is a compact subset of $A$ , and every feasible trajectory passes through the unique point of $P$ . Thus, condition (b) is fulfilled.

Condition (c): Choosing $c=a+1$ , we have

[TABLE]

for any $(t,x,u)\in M$ , because $u\in[-1,1]$ and $e^{-\lambda t}\leq 1$ for $t\geq t_{0}\geq 0$ . Thus, condition ( $c_{0}$ ), which implies (c), is satisfied.

By Theorem 2.2, $(FP_{1a})$ has a $W^{1,1}$ global minimizer. Therefore, $(FP_{1})$ has a $W^{1,1}$ global minimizer by the equivalence of $(FP_{1a})$ and $(FP_{1})$ .

3.2 Necessary Optimality Conditions

To obtain necessary conditions for $(FP_{1a})$ , we note that $(FP_{1a})$ is in the form of $\mathcal{M}$ with $g(x,y)=y_{2}$ , $f(t,x,u)=(-au,-e^{-\lambda t}(x_{1}+u))$ , $C=\{(x_{0},0)\}\times{\rm I\!R}^{2}$ , $U(t)=[-1,1]$ , and $h(t,x)=0$ for all $x=(x_{1},x_{2})\in{\rm I\!R}^{2}$ , $y=(y_{1},y_{2})\in{\rm I\!R}^{2}$ , $t\in[t_{0},T]$ , and $u\in{\rm I\!R}$ . Since $(FP_{1a})$ is an optimal control problem without state constraints, we can apply both Proposition 2.1 Theorem 2.1 to this problem. In accordance with (2.3), the Hamiltonian of $(FP_{1a})$ is given by

[TABLE]

while by (2.3) we have $\partial^{>}_{x}h(t,x)=\emptyset$ for all $(t,x)\in[t_{0},T]\times{\rm I\!R}^{2}$ . Let $(\bar{x},\bar{u})$ be a $W^{1,1}$ local minimizer of $(FP_{1a})$ .

3.2.1 Necessary Optimality Conditions for $(FP_{1a})$ in Terms of Proposition 2.1

It is clear that the assumptions (H1)–(H3) of Proposition 2.1 are satisfied for $(FP_{1a})$ . So, there exist $p\in W^{1,1}([t_{0},T];{\rm I\!R}^{2})$ and $\gamma\geq 0$ such that $(p,\gamma)\neq(0,0)$ , and conditions (i)–(iii) of Proposition 2.1 hold true. Let us analyze these conditions.

Condition (i): By (3.11), $\mathcal{H}$ is differentiable in $x$ and $\partial_{x}\mathcal{H}(t,x,p,u)=\{(-e^{-\lambda t}p_{2},0)\}$ for all $(t,x,p,u)\in[t_{0},T]\times{\rm I\!R}^{2}\times{\rm I\!R}^{2}\times{\rm I\!R}$ . Thus, condition (i) implies that $\dot{p}_{1}(t)=e^{-\lambda t}p_{2}(t)$ for a.e. $t\in[t_{0},T]$ and $p_{2}(t)$ is a constant function.

Condition (ii): By the formulas for $g$ and $C$ , we have $\partial g(\bar{x}(t_{0}),\bar{x}(T))=\{(0,0,0,1)\}$ and $N_{C}(\bar{x}(t_{0}),\bar{x}(T))={\rm I\!R}^{2}\times\{(0,0)\}$ . Thus, condition (ii) implies that

[TABLE]

hence $p_{1}(T)=0$ and $p_{2}(T)=-\gamma$ . As $p_{2}(t)$ is a constant function, we have $p_{2}(t)=-\gamma$ for all $t\in[t_{0},T]$ . So, the above analysis of condition (i) gives $p_{1}(t)=\dfrac{\gamma}{\lambda}\big{(}e^{-\lambda t}-e^{-\lambda T}\big{)}$ for all $t\in[t_{0},T]$ . Since $(p,\gamma)\neq(0,0)$ , we must have $\gamma>0$ .

Condition (iii): Due to (3.11), condition (iii) means that

[TABLE]

for a.e. $t\in[t_{0},T]$ . Equivalently,

[TABLE]

Setting $\varphi(t):=ap_{1}(t)+e^{-\lambda t}p_{2}(t)$ for $t\in[t_{0},T]$ , we have

[TABLE]

for a.e. $t\in[t_{0},T]$ . As $\dfrac{a}{\lambda}>1$ , we see that $\varphi$ is decreasing on ${\rm I\!R}$ . In addition, it is clear that $\varphi(T)=-\gamma e^{-\lambda T}<0$ , and $\varphi(t)=0$ if and only if $t=\bar{t}$ , where $\bar{t}:=T-\dfrac{1}{\lambda}\ln\dfrac{a}{a-\lambda}$ .

We have the following cases.

Case A: $t_{0}\geq\bar{t}$ . Then $\varphi(t)<0$ for all $t\in(t_{0},T]$ . Therefore, condition (3.12) implies $\bar{u}(t)=1$ for all $t\in[t_{0},T]$ . Hence, by (3.10), $\bar{x}_{1}(t)=x_{0}-a(t-t_{0})$ for a.e. $t\in[t_{0},T]$ .

Case B: $t_{0}<\bar{t}$ . Then $\varphi(t)>0$ for $t\in[t_{0},\bar{t})$ and $\varphi(t)<0$ for $t\in(\bar{t},T]$ . Thus, (3.12) yields $\bar{u}(t)=-1$ for $t\in[t_{0},\bar{t})$ and $\bar{u}(t)=1$ for a.e. $t\in(\bar{t},T]$ ; hence $\bar{x}_{1}(t)=x_{0}+a(t-t_{0})$ for every $t\in[t_{0},\bar{t}]$ and $\bar{x}_{1}(t)=x_{0}-a(t+t_{0}-2\bar{t})$ for every $t\in(\bar{t},T]$ .

3.2.2 Necessary Optimality Conditions for $(FP_{1a})$ in Terms of Theorem 2.1

Since the assumptions (H1)–(H4) of Theorem 2.1 are satisfied for $(FP_{1a})$ , by that theorem one can find $p\in W^{1,1}([t_{0},T];{\rm I\!R}^{2})$ , $\gamma\geq 0$ , $\mu\in C^{\oplus}(t_{0},T)$ , and a Borel measurable function $\nu:[t_{0},T]\to{\rm I\!R}^{2}$ such that $(p,\mu,\gamma)\neq(0,0,0)$ , and for $q(t):=p(t)+\eta(t)$ with $\eta:[t_{0},T]\to{\rm I\!R}^{2}$ being given by $\eta(t):=\displaystyle\int_{[t_{0},t)}\nu(s)d\mu(s)$ if $t\in[t_{0},T)$ and $\eta(T):=\displaystyle\int_{[t_{0},T]}\nu(s)d\mu(s)$ , conditions (i)–(iv) in Theorem 2.1 hold true. Since $\partial^{>}_{x}h(t,\bar{x}(t))=\emptyset$ for all $t\in[t_{0},T]$ , the inclusion $\nu(t)\in\partial^{>}_{x}h(t,\bar{x}(t))$ is violated at every $t\in[t_{0},T]$ . Hence, condition (i) forces $\mu=0$ . We see that condition (iv) is fulfilled and the conditions (ii)–(iv) in Theorem 2.1 recover the conditions (i)–(iii) of Proposition 2.1.

Going back to the original problem $(FP_{1})$ , we can put the obtained results in the following theorem.

Theorem 3.1.

Given any $a,\lambda$ with $a>\lambda>0$ , define $\rho=\dfrac{1}{\lambda}\ln\dfrac{a}{a-\lambda}>0$ and $\bar{t}=T-\rho$ . Then, problem $(FP_{1})$ has a unique local solution $(\bar{x},\bar{u})$ , which is a global solution, where $\bar{u}(t)=-a^{-1}\dot{\bar{x}}(t)$ for almost everywhere $t\in[t_{0},T]$ and $\bar{x}(t)$ can be described as follows:

(a)* If $t_{0}\geq\bar{t}$ (i.e., $T-t_{0}\leq\rho$ ), then*

[TABLE]

(b)* If $t_{0}<\bar{t}$ (i.e., $T-t_{0}>\rho$ ), then*

[TABLE]

Proof.

The assertions (a) and (b) are straightforward from the results obtained in Case A and Case B of Subsection 3.2.1, because $\bar{x}_{1}(t)$ in $(FP_{1a})$ coincides with $\bar{x}(t)$ in $(FP_{1})$ . ∎

4 Control Problems with Unilateral Constraints

By $(FP_{2})$ we denote the finite horizon optimal control problem of the Lagrange type

[TABLE]

over $x\in W^{1,1}([t_{0},T],{\rm I\!R})$ and measurable functions $u:[t_{0},T]\to{\rm I\!R}$ satisfying

[TABLE]

with $a>\lambda>0$ , $T>t_{0}\geq 0$ , and $x_{0}\leq 1$ being given.

We transform this problem into one of the Mayer type by setting $x(t)=(x_{1}(t),x_{2}(t))$ , where $x_{1}(t)$ plays the role of $x(t)$ in (4.13)–(4.14) and

[TABLE]

for all $t\in[0,T]$ . Thus, $(FP_{2})$ is equivalent to the problem

[TABLE]

over $x=(x_{1},x_{2})\in W^{1,1}([t_{0},T],{\rm I\!R}^{2})$ and measurable functions $u:[t_{0},T]\to{\rm I\!R}$ satisfying

[TABLE]

We denote problem (4.16)–(4.17) by $(FP_{2a})$ .

4.1 Solution Existence

To check that $(FP_{2a})$ is of the form $\mathcal{M}_{1}$ (see Subsection 2.3), we choose $n=2$ , $m=1$ , $A=[t_{0},T]~{}\times(-\infty,1]\times{\rm I\!R}$ , $U(t,x)=[-1,1]$ for all $(t,x)\in A$ , $B=\{t_{0}\}\times\{(x_{0},0)\}\times{\rm I\!R}\times{\rm I\!R}^{2}$ , $g(t_{0},x(t_{0}),T,x(T))=x_{2}(T)$ , $M=A\times[-1,1]$ , $f(t,x,u)=(-au,-e^{-\lambda t}(x_{1}+u))$ for all $(t,x,u)\in M$ . In comparison with the problem $(FP_{1a})$ , the only change in this formulation of $(FP_{2a})$ is that we have $A=[t_{0},T]~{}\times(-\infty,1]\times{\rm I\!R}$ instead of $A=[t_{0},T]~{}\times{\rm I\!R}^{2}$ . Thus, to show that $(FP_{2a})$ satisfies all the assumptions of Theorem 2.2, we can use the arguments in Subsection 3.1, except those related to the convexity of the sets $Q(t,x)$ and the compactness of $M_{\varepsilon}$ , which have to be verified in a slightly different manner.

By the above formula for $A$ , we have $A_{0}=[t_{0},T]$ and $A(t)=(-\infty,1]\times{\rm I\!R}$ for all $t\in A_{0}$ . As in Subsection 3.1, we have

[TABLE]

for any $(t,x)\in A$ . Thus, the assumption of Theorem 2.2 on the convexity of the sets $Q(t,x)$ , $x\in A(t)$ , for almost every $t\in[t_{0},T]$ , is satisfied. Since $M=[t_{0},T]~{}\times(-\infty,1]\times{\rm I\!R}\times[-1,1]$ , for any $\varepsilon\geq 0$ , one has

[TABLE]

As $M_{\varepsilon}$ is closed and contained in the compact set $[t_{0},T]\times\{x\in{\rm I\!R}^{2}\;:\;\|x\|\leq\varepsilon\}\times[-1,1]$ , it is compact.

It follows from Theorem 2.2 that $(FP_{2a})$ has a $W^{1,1}$ global minimizer. Therefore, by the equivalence of $(FP_{2})$ and $(FP_{2a})$ , we can assert that $(FP_{2})$ has a $W^{1,1}$ global minimizer.

4.2 Necessary Optimality Conditions

In order to apply Theorem 2.1 for solving $(FP_{2})$ , we observe that $(FP_{2a})$ is in the form of $\mathcal{M}$ with $g(x,y)=y_{2}$ , $f(t,x,u)=(-au,-e^{-\lambda t}(x_{1}+u)),$ $C=\{(x_{0},0)\}\times{\rm I\!R}^{2}$ , $U(t)=[-1,1]$ , and $h(t,x)=x_{1}-1$ for all $t\in[t_{0},T]$ , $x=(x_{1},x_{2})\in{\rm I\!R}^{2}$ , $y=(y_{1},y_{2})\in{\rm I\!R}^{2}$ and $u\in{\rm I\!R}$ .

The forthcoming two propositions describe a fundamental properties of the local minimizers of the problem $(FP_{2a})$ , which is obtained from the optimal control problem of the Lagrange type $(FP_{2})$ by introducing the artificial variable $x_{2}$ . Similar statements as those in the first proposition are valid for any optimal control problem of the Mayer type, which is obtained from an optimal control problem of the Lagrange type in the same manner. While, the claims in the second proposition hold true for every optimal control problem of the Mayer type, whose objective function does not depend on the initial point.

Proposition 4.1.

Suppose that $(\bar{x},\bar{u})$ is a $W^{1,1}$ local minimizer for $(FP_{2a})$ . Then, for any $\tau_{1},\tau_{2}\in[t_{0},T]$ with $\tau_{1}<\tau_{2}$ , the restriction of $(\bar{x},\bar{u})$ on $[\tau_{1},\tau_{2}]$ , i.e., the process $(\bar{x}(t),\bar{u}(t))$ with $t\in[\tau_{1},\tau_{2}]$ , is a $W^{1,1}$ local minimizer for the following optimal control problem of the Mayer type

[TABLE]

over $x=(x_{1},x_{2})\in W^{1,1}([\tau_{1},\tau_{2}],{\rm I\!R}^{2})$ and measurable functions $u:[\tau_{1},\tau_{2}]\to{\rm I\!R}$ satisfying

[TABLE]

which is denoted by $(FP_{2a})|_{[\tau_{1},\tau_{2}]}$ . In another words, for any $\tau_{1},\tau_{2}\in[t_{0},T]$ with $\tau_{1}<\tau_{2}$ , the restriction of a $W^{1,1}$ local minimizer for $(FP_{2a})$ on the time segment $[\tau_{1},\tau_{2}]$ is a $W^{1,1}$ local minimizer for the Mayer problem $(FP_{2a})|_{[\tau_{1},\tau_{2}]}$ , which is obtained from $(FP_{2a})$ by replacing $t_{0}$ with $\tau_{1}$ , $T$ with $\tau_{2}$ , and $C$ with $\widetilde{C}:=\{(\bar{x}_{1}(\tau_{1}),\bar{x}_{2}(\tau_{1}))\}\times\{\bar{x}_{1}(\tau_{2})\}\times{\rm I\!R}$ .

Proof.

Since $(\bar{x},\bar{u})$ is a $W^{1,1}$ local minimizer for $(FP_{2a})$ , by Definition 2.1 there exists $\delta>0$ such that the process $(\bar{x},\bar{u})$ minimizes the quantity $g(x(t_{0}),x(T))=x_{2}(T)$ over all feasible processes $(x,u)$ of $(FP_{2a})$ with $\|\bar{x}-x\|_{W^{1,1}}\leq\delta$ .

Clearly, the restriction of $(\bar{x},\bar{u})$ on $[\tau_{1},\tau_{2}]$ satisfies the conditions given in (4.1). Thus, it is a feasible process for $(FP_{2a})|_{[\tau_{1},\tau_{2}]}$ .

Let $(x(t),u(t))$ , $t\in[\tau_{1},\tau_{2}]$ , be an arbitrary feasible process of $(FP_{2a})|_{[\tau_{1},\tau_{2}]}$ satisfying

[TABLE]

Consider the pair of functions $(\widetilde{x},\widetilde{u})$ , where $\widetilde{x}=(\widetilde{x}_{1},\widetilde{x}_{2})$ , which is given by

[TABLE]

and

[TABLE]

Clearly, $(\widetilde{x},\widetilde{u})$ is a feasible process of $(FP_{2a})$ satisfying $\|\bar{x}-\widetilde{x}\|_{W^{1,1}([t_{0},T],{\rm I\!R}^{2})}\leq\delta$ . Thus, one must have $g(\widetilde{x}(T))\geq g(\bar{x}(T))$ or, equivalently,

[TABLE]

where $\omega(\tau):=-e^{-\lambda\tau}(\bar{x}_{1}(\tau)+\bar{u}(\tau))$ . Hence, one obtains the inequality $x_{2}(\tau_{2})\geq\bar{x}(\tau_{2})$ proving that the restriction of $(\bar{x},\bar{u})$ on $[\tau_{1},\tau_{2}]$ is a $W^{1,1}$ local minimizer for $(FP_{2a})|_{[\tau_{1},\tau_{2}]}$ . ∎

Proposition 4.2.

Suppose that $(\bar{x},\bar{u})$ is a $W^{1,1}$ local minimizer for $(FP_{2a})$ . Then, for any $\tau_{1}\in[t_{0},T)$ , the restriction of the process $(\bar{x},\bar{u})$ on the time segment $[\tau_{1},T]$ , i.e., the process $(\bar{x}(t),\bar{u}(t))$ with $t\in[\tau_{1},T]$ , is a $W^{1,1}$ local minimizer for the following optimal control problem of the Mayer type

[TABLE]

over $x=(x_{1},x_{2})\in W^{1,1}([\tau_{1},T],{\rm I\!R}^{2})$ and measurable functions $u:[\tau_{1},T]\to{\rm I\!R}$ satisfying

[TABLE]

which is denoted by $(FP_{2b})$ . In another words, for any $\tau_{1}\in[t_{0},T)$ , the restriction of a $W^{1,1}$ local minimizer for $(FP_{2a})$ on the time segment $[\tau_{1},T]$ is a $W^{1,1}$ local minimizer for the Mayer problem $(FP_{2b})$ , which is obtained from $(FP_{2a})$ by replacing $t_{0}$ with $\tau_{1}$ .

Proof.

For a fixed $\tau_{1}\in[t_{0},T)$ , let $(FP_{2b})$ be defined as in the formulation of the lemma. It is clear that the process $(\bar{x}(t),\bar{u}(t))$ , $t\in[\tau_{1},T]$ , is feasible for $(FP_{2b})$ . Since $(\bar{x},\bar{u})$ is a $W^{1,1}$ local minimizer of $(FP_{2a})$ , by Definition 2.1 there exists $\delta>0$ such that the process $(\bar{x},\bar{u})$ minimizes the quantity $g(x(t_{0}),x(T))=x_{2}(T)$ over all feasible processes $(x,u)$ of $(FP_{2a})$ with $\|\bar{x}-x\|_{W^{1,1}}\leq\delta$ . Let $(x(t),u(t))$ , $t\in[\tau_{1},T]$ , be an arbitrary feasible process of $(FP_{2b})$ satisfying $\|\bar{x}-x\|_{W^{1,1}([\tau_{1},T])}\leq\delta$ . Consider the pair of functions $(\widetilde{x},\widetilde{u})$ given by

[TABLE]

Clearly, $(\widetilde{x},\widetilde{u})$ is a feasible process of $(FP_{2a})$ satisfying $\|\bar{x}-\widetilde{x}\|_{W^{1,1}([t_{0},T])}\leq\delta$ . Thus, one must have $g(\widetilde{x}(T))\geq g(\bar{x}(T))$ . Since $\widetilde{x}(T)=x(T)$ , one obtains the inequality $g(x(T))\geq g(\bar{x}(T))$ , which justifies the assertion of the proposition. ∎

In accordance with (2.3), the Hamiltonian of $(FP_{2a})$ is given by

[TABLE]

By (2.3), the partial hybrid subdifferential of $h$ at $(t,x)\in[t_{0},T]\times{\rm I\!R}^{2}$ is given by

[TABLE]

From now on, let $(\bar{x},\bar{u})$ be a $W^{1,1}$ local minimizer for $(FP_{2a})$ .

Since the assumptions (H1)–(H4) of Theorem 2.1 are satisfied for $(FP_{2a})$ , by that theorem one can find $p\in W^{1,1}([t_{0},T];{\rm I\!R}^{2})$ , $\gamma\geq 0$ , $\mu\in C^{\oplus}(t_{0},T)$ , and a Borel measurable function $\nu:[t_{0},T]\to{\rm I\!R}^{2}$ such that $(p,\mu,\gamma)\neq(0,0,0)$ , and for $q(t):=p(t)+\eta(t)$ with

[TABLE]

and

[TABLE]

conditions (i)–(iv) in Theorem 2.1 hold true.

Condition (i): Note that

[TABLE]

Since $\bar{x}_{1}(t)\leq 1$ for every $t$ , combining this with (4.19) gives

[TABLE]

So, from (i) it follows that

[TABLE]

and

[TABLE]

Condition (ii): By (4.18), $\mathcal{H}$ is differentiable in $x$ and $\partial_{x}\mathcal{H}(t,x,p,u)=\{(-e^{-\lambda t}p_{2},0)\}$ for all $(t,x,p,u)\in[t_{0},T]\times{\rm I\!R}^{2}\times{\rm I\!R}^{2}\times{\rm I\!R}$ . Thus, (ii) implies that $-\dot{p}(t)=(-e^{-\lambda t}q_{2}(t),0)$ for a.e. $t\in[t_{0},T]$ . Hence, $\dot{p}_{1}(t)=e^{-\lambda t}q_{2}(t)$ for a.e. $t\in[t_{0},T]$ and $p_{2}(t)$ is a constant for all $t\in[t_{0},T]$ .

Condition (iii): By the formulas for $g$ and $C$ , $\partial g(\bar{x}(t_{0}),\bar{x}(T))=\{(0,0,0,1)\}$ and $N_{C}(\bar{x}(t_{0}),\bar{x}(T))={\rm I\!R}^{2}\times\{(0,0)\}$ . Thus, (iii) yields

[TABLE]

which means that $q_{1}(T)=0$ and $q_{2}(T)=-\gamma$ .

Condition (iv): By (4.18), from (iv) one gets

[TABLE]

or, equivalently,

[TABLE]

Thanks to Proposition 4.1 and the above analysis of Conditions (i)–(iv), we will be able to prove next statement.

Proposition 4.3.

Suppose that $[\tau_{1},\tau_{2}]$ is a subsegment of $[t_{0},T]$ with $h(t,\bar{x}(t))<0$ for all $t\in[\tau_{1},\tau_{2}]$ . Then, the curve $t\mapsto\bar{x}_{1}(t)$ , $t\in[\tau_{1},\tau_{2}]$ , cannot have more than one turning point. To be more precise, the curve must be of one of the following three categories C1 $-$ C3:

[TABLE]

and

[TABLE]

where $t_{\zeta}$ is a certain point in $(\tau_{1},\tau_{2})$ (see Fig. 1–3).

Proof.

Suppose that $[\tau_{1},\tau_{2}]$ is a subsegment of $[t_{0},T]$ with $h(t,\bar{x}(t))<0$ for all $t\in[\tau_{1},\tau_{2}]$ , i.e., $\bar{x}_{1}(t)<1$ for all $t\in[\tau_{1},\tau_{2}]$ . Then, it follows from Proposition 4.1 that the restriction of $(\bar{x},\bar{u})$ on $[\tau_{1},\tau_{2}]$ is a $W^{1,1}$ local minimizer for $(FP_{2a})|_{[\tau_{1},\tau_{2}]}$ . Since the latter satisfies the assumptions (H1)–(H4) of Theorem 2.1, by that theorem one finds $\widetilde{p}\in W^{1,1}([\tau_{1},\tau_{2}];{\rm I\!R}^{2})$ , $\widetilde{\gamma}\geq 0$ , $\widetilde{\mu}\in C^{\oplus}(\tau_{1},\tau_{2})$ , and a Borel measurable function $\widetilde{\nu}:[\tau_{1},\tau_{2}]\to{\rm I\!R}^{2}$ with the property $(\widetilde{p},\widetilde{\mu},\widetilde{\gamma})\neq(0,0,0)$ , and for $\widetilde{q}(t):=\widetilde{p}(t)+\widetilde{\eta}(t)$ with

[TABLE]

and

[TABLE]

the conditions (i)–(iv) in Theorem 2.1 hold true, provided that $t_{0},T,p,\mu,\gamma,\nu,\eta$ , and $q$ are changed respectively to $\tau_{1},\tau_{2},\widetilde{p},\widetilde{\mu},\widetilde{\gamma},\widetilde{\nu},\widetilde{\eta}$ , and $\widetilde{q}$ .

By Condition (i), one has

[TABLE]

By Condition (ii), $\dot{\widetilde{p}_{1}}(t)=e^{-\lambda t}\widetilde{q}_{2}(t)$ for a.e. $t\in[\tau_{1},\tau_{2}]$ and $\widetilde{p}_{2}(t)$ is a constant for all $t\in[\tau_{1},\tau_{2}]$ .

Since $N_{\widetilde{C}}(\bar{x}(\tau_{1}),\bar{x}(\tau_{2}))={\rm I\!R}^{3}\times\{0\}$ , by Condition (iii) one has

[TABLE]

This amounts to saying that $\widetilde{q}_{2}(\tau_{2})=-\widetilde{\gamma}$ .

Condition (iv) means that

[TABLE]

Since $\bar{x}_{1}(t)<1$ for all $t\in[\tau_{1},\tau_{2}]$ , (4.30) yields $\widetilde{\mu}([\tau_{1},\tau_{2}])=0$ , i.e., $\widetilde{\mu}=0$ . Combining this with (4.28) and (4.29), one gets $\widetilde{\eta}(t)=0$ for all $t\in[\tau_{1},\tau_{2}]$ . Thus, the relation $\widetilde{q}(t)=\widetilde{p}(t)+\widetilde{\eta}(t)$ implies that $\widetilde{q}(t)=\widetilde{p}(t)$ for every $t\in[\tau_{1},\tau_{2}]$ . Therefore, together with the Lebesgue Theorem [5, Theorem 6, p. 340], the properties of $\widetilde{p}(t)$ and $\widetilde{q}(t)$ established in the above analyses of the conditions (ii) and (iii) give $\widetilde{p}_{2}(t)=\widetilde{q}_{2}(t)=-\widetilde{\gamma}$ and $\widetilde{p}_{1}(t)=\widetilde{q}_{1}(t)=\dfrac{\widetilde{\gamma}}{\lambda}e^{-\lambda t}+\zeta$ for all $t\in[\tau_{1},\tau_{2}]$ , where $\zeta$ is a constant. Substituting these formulas for $\widetilde{q}_{1}(t)$ and $\widetilde{q}_{2}(t)$ to (4.31), we have

[TABLE]

or, equivalently,

[TABLE]

Set $\widetilde{\varphi}(t)=\widetilde{\gamma}(\dfrac{a}{\lambda}-1)e^{-\lambda t}+a\zeta$ for all $t\in[\tau_{1},\tau_{2}]$ .

If $\widetilde{\gamma}=0$ , then $\widetilde{\varphi}(t)\equiv a\zeta$ on $[\tau_{1},\tau_{2}]$ . Since $a>0$ , the condition $(\widetilde{p},\widetilde{\mu},\widetilde{\gamma})\neq(0,0,0)$ implies that $\zeta\neq 0$ . If $\zeta>0$ , then $\widetilde{\varphi}(t)>0$ for all $t\in[\tau_{1},\tau_{2}]$ . If $\zeta<0$ , then $\widetilde{\varphi}(t)<0$ for all $t\in[\tau_{1},\tau_{2}]$ . Thus, if $\zeta>0$ , then (4.32) implies that $\bar{u}(t)=-1$ a.e. $t\in[\tau_{1},\tau_{2}]$ . Similarly, if $\zeta<0$ , then $\bar{u}(t)=1$ a.e. $t\in[\tau_{1},\tau_{2}]$ . Hence, applying the Lebesgue Theorem [5, Theorem 6, p. 340] to the absolutely continuous function $\bar{x}_{1}(t)$ , one has

[TABLE]

in the first case, and

[TABLE]

in the second case.

If $\widetilde{\gamma}>0$ then, due to the assumption $a>\lambda>0$ , $\widetilde{\varphi}$ is strictly decreasing on $[\tau_{1},\tau_{2}]$ . When there exists $t_{\zeta}\in(\tau_{1},\tau_{2})$ such that $\widetilde{\varphi}(t_{\zeta})=0$ , one has $\widetilde{\varphi}(t)>0$ for $t\in(\tau_{1},t_{\zeta})$ and $\widetilde{\varphi}(t)<0$ for $t\in(t_{\zeta},\tau_{2})$ . Hence, (4.32) forces $\bar{u}(t)=-1$ a.e. $t\in[\tau_{1},t_{\zeta}]$ and $\bar{u}(t)=1$ a.e. $t\in[t_{\zeta},\tau_{2}]$ . Thus, by the cited above Lebesgue Theorem,

[TABLE]

As $\bar{x}_{1}(t)<1$ for all $t\in[\tau_{1},\tau_{2}]$ , one must have $\bar{x}_{1}(t_{\zeta})<1$ , i.e., $t_{\zeta}<\tau_{1}+a^{-1}(1-\bar{x}_{1}(\tau_{1}))$ . When $\widetilde{\varphi}(t)>0$ for all $t\in(\tau_{1},\tau_{2})$ , condition (4.32) implies that $\bar{u}(t)=-1$ a.e. $t\in[\tau_{1},\tau_{2}]$ . So, $\bar{x}_{1}(t)$ is defined by (4.33). When $\widetilde{\varphi}(t)<0$ for all $t\in(\tau_{1},\tau_{2})$ , condition (4.32) implies that $\bar{u}(t)=1$ a.e. $t\in[\tau_{1},\tau_{2}]$ . Hence, $\bar{x}_{1}(t)$ is defined by (4.34).

In summary, for any $\tau_{1},\tau_{2}$ with $t_{0}\leq\tau_{1}<\tau_{2}\leq T$ and $\bar{x}_{1}(t)<1$ for all $t\in[\tau_{1},\tau_{2}]$ , the curve $t\mapsto\bar{x}_{1}(t)$ , $t\in[\tau_{1},\tau_{2}]$ , cannot have more than one turning point. Namely, the curve must be of one of the three categories (4.25)–(4.27). ∎

To proceed furthermore, put ${\mathcal{T}}_{1}:=\{t\in[t_{0},T]\;:\;\bar{x}_{1}(t)=1\}.$ Since $\bar{x}_{1}(t)$ is a continuous function, ${\mathcal{T}}_{1}$ is a compact set (which may be empty).

Case 1: ${\mathcal{T}}_{1}=\emptyset$ , i.e., $\bar{x}_{1}(t)<1$ for all $t\in[t_{0},T]$ . Then, by (4.22) one has $\mu([t_{0},T])=0$ , i.e., $\mu=0$ . Combining this with (4.20) and (4.21), one gets $\eta(t)=0$ for all $t\in[t_{0},T]$ . Thus, the relation $q(t)=p(t)+\eta(t)$ allows us to have $q(t)=p(t)$ for every $t\in[t_{0},T]$ . Therefore, together with the Lebesgue Theorem [5, Theorem 6, p. 340], the properties of $p(t)$ and $q(t)$ established in the above analyses of the conditions (ii) and (iii) give

[TABLE]

and

[TABLE]

for all $t\in[t_{0},T]$ . Now, observe that substituting $q(t)=p(t)$ into (4.24) yields

[TABLE]

Setting $\varphi(t)=ap_{1}(t)+e^{-\lambda t}p_{2}(t)$ for $t\in[t_{0},T]$ and using the above formulas of $p_{1}(t)$ and $p_{2}(t)$ , we have

[TABLE]

for $t\in[t_{0},T]$ . Due to the condition $(p,\gamma,\mu)\neq 0$ , one must have $\gamma>0$ . Moreover, the assumption $a>\lambda>0$ implies $\dfrac{a}{\lambda}>1$ . Thus, the function $\varphi(t)$ is decreasing on $[t_{0},T]$ . In addition, it is clear that $\varphi(T)=-\gamma e^{-\lambda T}<0$ , and $\varphi(t)=0$ if and only if $t=\bar{t}$ , where

[TABLE]

The assumption $a>\lambda>0$ implies that $\bar{t}<T$ . Note that the number $\rho:=\dfrac{1}{\lambda}\ln\dfrac{a}{a-\lambda}$ does not depend on the initial time $t_{0}$ and the terminal time $T$ .

If $t_{0}\geq\bar{t}$ , then one has $\varphi(t)<0$ for all $t\in(t_{0},T)$ . This situation happens if and only if $T-t_{0}\leq\rho$ (the time interval of the optimal control problem is rather small). Clearly, condition (4.35) forces $\bar{u}(t)=1$ a.e. $t\in[t_{0},T]$ . Since (4.17) is fulfilled for $x(t)=\bar{x}(t)$ and $u(t)=\bar{u}(t)$ , applying the Lebesgue Theorem [5, Theorem 6, p. 340] to the absolutely continuous function $\bar{x}_{1}(t)$ , one has

[TABLE]

for all $t\in[t_{0},T]$ . In addition, by (4.15) one finds that

[TABLE]

for all $t\in[t_{0},T]$ .

If $t_{0}<\bar{t}$ , then $\varphi(t)>0$ for $t\in(t_{0},\bar{t})$ and $\varphi(t)<0$ for $t\in(\bar{t},T)$ . This situation happens if and only if $T-t_{0}>\rho$ (the time interval of the optimal control problem is large enough). Condition (4.35) yields $\bar{u}(t)=-1$ for a.e. $t\in[t_{0},\bar{t}]$ and $\bar{u}(t)=1$ for a.e. $t\in[\bar{t},T]$ . Hence, by the above-cited Lebesgue Theorem, one has

[TABLE]

Therefore, from (4.15), we have

[TABLE]

Noting that $\bar{x}(t)<1$ for all $t\in[t_{0},T]$ by our assumption, we must have $\bar{x}_{1}(\bar{t})<1$ , i.e., $\bar{t}<t_{0}+a^{-1}(1-x_{0})$ . Since $\bar{t}=T-\rho$ , the last inequality is equivalent to $T-t_{0}<\rho+a^{-1}(1-x_{0})$ .

Thus, if ${\mathcal{T}}_{1}=\emptyset$ and $T-t_{0}\leq\rho$ , then the unique process $(\bar{x},\bar{u})$ suspected for a $W^{1,1}$ local optimizer of $(FP_{2a})$ is the one with $\bar{u}(t)=1$ a.e. $t\in[t_{0},T]$ , $\bar{x}(t)=(\bar{x}_{1}(t),\bar{x}_{2}(t))$ , where $\bar{x}_{1}(t)$ and $\bar{x}_{2}(t)$ are given respectively by (4.37) and (4.38). Otherwise, if ${\mathcal{T}}_{1}=\emptyset$ and

[TABLE]

then the unique process $(\bar{x},\bar{u})$ serving as a $W^{1,1}$ local optimizer of $(FP_{2a})$ is the one with $\bar{u}(t)=-1$ for a.e. $t\in[t_{0},\bar{t}]$ and $\bar{u}(t)=1$ for a.e. $t\in[\bar{t},T]$ , $\bar{x}(t)=(\bar{x}_{1}(t),\bar{x}_{2}(t))$ , where $\bar{x}_{1}(t)$ and $\bar{x}_{2}(t)$ are defined respectively by (4.39) and (4.40). The situation where ${\mathcal{T}}_{1}=\emptyset$ and $T-t_{0}\geq\rho+a^{-1}(1-x_{0})$ cannot occur. The situation where ${\mathcal{T}}_{1}=\emptyset$ and $x_{0}\geq 1-a(\bar{t}-t_{0})$ also cannot occur.

Now, suppose that ${\mathcal{T}}_{1}\neq\emptyset$ , i.e., there exists $t\in[t_{0},T]$ with the property $\bar{x}(t)=1$ . Setting

[TABLE]

we have $t_{0}\leq\alpha_{1}\leq\alpha_{2}\leq T$ . The following situations can occur.

Case 2: $t_{0}<\alpha_{1}=\alpha_{2}=T$ , i.e., $\bar{x}_{1}(t)<1$ for $t\in[t_{0},T)$ and $\bar{x}_{1}(T)=1$ . Clearly, (4.22) means that $\mu([t_{0},T))=0$ . Moreover, if $\nu(T)\neq(1,0)$ , then from (4.23) it follows that $\mu(\{T\})=0$ . So, we have $\mu([t_{0},T])=\mu([t_{0},T))+\mu(\{T\})=0$ , i.e., $\mu=0$ . Hence, we can repeat the arguments already used in Case 1 to prove that either $\bar{x}_{1}(t)=x_{0}-a(t-t_{0})$ for all $t\in[t_{0},T]$ , or

[TABLE]

In particular, either we have $\bar{x}_{1}(T)=x_{0}-a(T-t_{0})<1$ , or $\bar{x}_{1}(T)=\bar{x}_{1}(\bar{t})-a(T-\bar{t})<1$ . Both instances are impossible, because $\bar{x}_{1}(T)=1$ . So, the situation $\nu(T)\neq(1,0)$ is excluded; thus $\nu(T)=(1,0)$ .

From (4.20) and (4.21), one gets $\eta(t)=0$ for $t\in[t_{0},T)$ and $\eta(T)=(\mu(T)-\mu(T-0),0),$ where $\mu(T-0)$ denotes the left limit of $\mu$ at $T$ . Therefore, the relation $q(t)=p(t)+\eta(t)$ , which holds for every $t\in[t_{0},T]$ , yields $q_{1}(t)=p_{1}(t)$ for $t\in[t_{0},T)$ , $q_{1}(T)=p_{1}(T)+\mu(T)-\mu(T-0)$ , and $q_{2}(t)=p_{2}(t)$ for $t\in[t_{0},T]$ . Combining this with the above results of our analyses of the conditions (ii) and (iii), we have $p_{2}(t)=-\gamma$ and $p_{1}(t)=\dfrac{\gamma}{\lambda}e^{-\lambda t}+\zeta$ for all $t\in[t_{0},T]$ , with $\zeta$ being a constant. Since $q(t)$ equals to $p(t)$ everywhere on $[t_{0},T]$ , except possibly for $t=T$ , condition (4.24) implies that

[TABLE]

As in Case 1, we set $\varphi(t)=ap_{1}(t)+e^{-\lambda t}p_{2}(t)$ for every $t\in[t_{0},T]$ . Here one has

[TABLE]

for all $t\in[t_{0},T]$ . Since $\dfrac{a}{\lambda}>1$ , the function $\varphi(t)$ is decreasing on $[t_{0},T]$ . Besides, since $\mu(T)-\mu(T-0)\geq 0,$ $q_{1}(T)=p_{1}(T)+\mu(T)-\mu(T-0)$ , and $q_{1}(T)=0$ , we have $p_{1}(T)\leq 0$ . So, $\varphi(T)=ap_{1}(T)-\gamma e^{-\lambda T}<0$ . If $\varphi(t)<0$ for all $t\in(t_{0},T)$ , then by (4.41) one has $\bar{u}(t)=1$ for a.e. $t\in[t_{0},T]$ . So, as it has been done in (4.37), we have $\bar{x}_{1}(t)=x_{0}-a(t-t_{0})$ for all $t\in[t_{0},T]$ . This yields $\bar{x}_{1}(T)<x_{0}<1$ . We have arrived at a contraction. Now, suppose that there exists $\bar{t}_{\zeta}\in[t_{0},T)$ satisfying $\varphi(\bar{t}_{\zeta})=0$ . Then $\varphi(t)>0$ for $t\in(t_{0},\bar{t}_{\zeta})$ and $\varphi(t)<0$ for $t\in(\bar{t}_{\zeta},T)$ . Thus, (4.35) yields $\bar{u}(t)=-1$ for a.e. $t\in[t_{0},\bar{t}_{\zeta}]$ and $\bar{u}(t)=1$ for a.e. $t\in[\bar{t}_{\zeta},T]$ . Hence, applying the Lebesgue Theorem [5, Theorem 6, p. 340] to the absolutely continuous function $\bar{x}_{1}(t)$ , one has $\bar{x}_{1}(t)=a(t-t_{0})+x_{0}$ for all $t\in[t_{0},\bar{t}_{\zeta}]$ and $\bar{x}_{1}(t)=-a(t-\bar{t}_{\zeta})+\bar{x}_{1}(\bar{t}_{\zeta})$ for every $t\in[\bar{t}_{\zeta},T]$ . As $\bar{x}(t)<1$ for all $t\in[t_{0},T]$ by our assumption, we must have $\bar{x}_{1}(\bar{t}_{\zeta})<1$ . Then we get $\bar{x}_{1}(T)=-a(T-\bar{t}_{\zeta})+\bar{x}_{1}(\bar{t}_{\zeta})<1,$ which is impossible.

Case 3: $t_{0}=\alpha_{1}=\alpha_{2}<T$ , i.e., $x_{0}=1$ and $\bar{x}_{1}(t)<1$ for $t\in(t_{0},T]$ . Let $\bar{\varepsilon}>0$ be such that $t_{0}+\bar{\varepsilon}<T$ . For any $k\in{\rm I\!N}$ with $k^{-1}\in(0,\bar{\varepsilon})$ , by Proposition 4.2 we know that the restriction of $(\bar{x},\bar{u})$ on $[t_{0}+k^{-1},T]$ is a $W^{1,1}$ local minimizer for the Mayer problem $(FP_{2b})$ , which is obtained from $(FP_{2a})$ by replacing $t_{0}$ with $t_{0}+k^{-1}$ . Since $\bar{x}_{1}(t)<1$ for all $t\in[t_{0}+k^{-1},T]$ , we can repeat the arguments already used in Case 1 to get that either $\bar{x}_{1}(t)=\bar{x}_{1}(t_{0}+k^{-1})-a(t-t_{0}-k^{-1})$ for all $t\in[t_{0}+k^{-1},T]$ , or

[TABLE]

with $\bar{t}=T-\rho$ , $\bar{t}\in[t_{0}+k^{-1},T]$ , and $\bar{x}_{1}(\bar{t})<1$ . By the Dirichlet principle, there must exist an infinite number of indexes $k$ with $k^{-1}\in(0,\bar{\varepsilon})$ such that $\bar{x}_{1}(t)$ has the first form (resp., the second form). Without loss of generality, we may assume that this happens for all $k$ with $k^{-1}\in(0,\bar{\varepsilon})$ . If the first situation occurs, then by letting $k\to\infty$ we can assert that $\bar{x}_{1}(t)=1-a(t-t_{0})$ for all $t\in[t_{0},T]$ . If the second situation occurs, then we have

[TABLE]

Since $\bar{x}_{1}(\bar{t})+a(t_{0}-\bar{t})\leq\bar{x}_{1}(\bar{t})<1$ and $x_{0}=1$ , we have arrived at a contradiction.

Case 4: $t_{0}<\alpha_{1}\leq\alpha_{2}<T$ . Then, $\bar{x}_{1}(\alpha_{1})=\bar{x}_{1}(\alpha_{2})=1$ , $\bar{x}_{1}(t)<1$ for $t\in[t_{0},\alpha_{1})\cup(\alpha_{2},T]$ . To find a formula for $(\bar{x},\bar{u})$ on $[\alpha_{2},T]$ , observe from Proposition 4.2 that the restriction of $(\bar{x},\bar{u})$ on $[\alpha_{2},T]$ is a $W^{1,1}$ local minimizer for the Mayer problem obtained from $(FP_{2a})$ by replacing $t_{0}$ with $\alpha_{2}$ . Thus, the result in Case 3 applied to the process $(\bar{x}(t),\bar{u}(t))$ , $t\in[\alpha_{2},T]$ , implies that $\bar{x}_{1}(t)=1-a(t-\alpha_{2})$ and $\bar{x}_{2}(t)=\displaystyle\int_{\alpha_{2}}^{t}\big{[}-e^{-\lambda\tau}\big{(}1-a(\tau-\alpha_{2})+1\big{)}\big{]}d\tau$ for all $t\in[\alpha_{2},T]$ . To obtain a formula for $(\bar{x},\bar{u})$ on $[t_{0},\alpha_{2}]$ , consider the following two subcases.

Subcase 4a: $t_{0}<\alpha_{1}=\alpha_{2}<T$ . Here we have $\bar{x}_{1}(\alpha_{1})=1$ and $\bar{x}_{1}(t)<1$ for all $t\in[t_{0},T]\setminus\{\alpha_{1}\}$ . To find a formula for $\bar{x}_{1}(.)$ on $[t_{0},\alpha_{1}]$ , we temporarily fix a value $\alpha\in(t_{0},\alpha_{1})$ (later, we will let $\alpha$ converge to $\alpha_{1}$ ). Since $\bar{x}_{1}(t)<1$ for all $[t_{0},\alpha]$ , applying Proposition 4.3 with $\tau_{1}:=t_{0}$ and $\tau_{2}:=\alpha$ , we can assert that the restriction of $\bar{x}_{1}(.)$ on $[t_{0},\alpha]$ is defined by one of next three formulas:

[TABLE]

and

[TABLE]

where $t_{\zeta}\in(t_{0},\alpha)$ . Hence, the graph of $\bar{x}_{1}(.)$ on $[t_{0},\alpha]$ is of one of the following types: C1) Going up as in Fig. 5; C2) Going down as in Fig. 5; C3) Going up first and then going down as in Fig. 6.

Now, let $\alpha=\alpha^{(k)}$ with $\alpha^{(k)}:=\alpha_{1}-\dfrac{1}{k}$ , where $k\in{\rm I\!N}$ is as large as $\alpha\in(t_{0},\alpha_{1})$ . Since for each $k$ the restriction of the graph of $\bar{x}_{1}(.)$ on $[t_{0},\alpha^{(k)}]$ must be of one of the three types C1–C3, by the Dirichlet principle we can find a subsequence $\{k^{\prime}\}$ of $\{k\}$ such that the corresponding graphs belong to a fixed category. If the latter is C2, then by (4.43) and the continuity of $\bar{x}_{1}(.)$ one has

[TABLE]

This is impossible, because $\bar{x}_{1}(\alpha_{1})=1$ . Similarly, the situation where the fixed category is C3 can be excluded by using (4.44). If the graphs belong to the category C1, from (4.42) we deduce that

[TABLE]

Then, the condition $\bar{x}_{1}(\alpha_{1})=1$ is satisfied if and only if $\alpha_{1}=t_{0}+a^{-1}(1-x_{0}).$

Subcase 4b: $t_{0}<\alpha_{1}<\alpha_{2}<T$ . Then, one has $\bar{x}_{1}(\alpha_{1})=\bar{x}_{1}(\alpha_{2})=1$ and $\bar{x}_{1}(t)<1$ for $t\in[t_{0},\alpha_{1})\cup(\alpha_{2},T]$ . We are going to show that this situation cannot occur.

Suppose first that $\bar{x}_{1}(t)=1$ for all $t\in(\alpha_{1},\alpha_{2})$ . Since $(\bar{x},\bar{u})$ is a $W^{1,1}$ local minimizer of $(FP_{2a})$ , by Definition 2.1 we can find $\delta>0$ such that the process $(\bar{x},\bar{u})$ minimizes the quantity $g(x(t_{0}),x(T))=x_{2}(T)$ over all feasible processes $(x,u)$ of $(FP_{2a})$ with $\|\bar{x}-x\|_{W^{1,1}}\leq\delta$ . By the result given before Subcase 4a, one has $\bar{x}_{1}(t)=1-a(t-\alpha_{2})$ for all $t\in[\alpha_{2},T]$ . Fixing a number $\alpha\in(\alpha_{1},\alpha_{2})$ , we consider the pair of functions $(\widetilde{x}^{\alpha},\widetilde{u}^{\alpha})$ defined by

[TABLE]

It is easy to check that $(\widetilde{x}^{\alpha},\widetilde{u}^{\alpha})$ is a feasible process of $(FP_{2a})$ . Besides, by direct computing, we have

[TABLE]

Thus, the condition $\alpha<\alpha_{2}$ yields $\bar{x}_{2}(T)>\widetilde{x}_{2}^{\alpha}(T)$ . Since $\displaystyle\lim_{\alpha\to\alpha_{2}}\|\bar{x}-\widetilde{x}^{\alpha}\|_{W^{1,1}}=0$ , one has $\|\bar{x}-\widetilde{x}^{\alpha}\|_{W^{1,1}}\leq\delta$ for all $\alpha\in(\alpha_{1},\alpha_{2})$ sufficiently close to $\alpha_{2}$ . This contradicts the assumed $W^{1,1}$ local optimality of the process $(\bar{x},\bar{u})$ .

Now, suppose that there exists $\hat{t}\in(\alpha_{1},\alpha_{2})$ such that $\bar{x}_{1}(\hat{t})<1$ . By the continuity of $\bar{x}_{1}(.)$ , the constants $\hat{\alpha}_{1}:=\max\{t\in[\alpha_{1},\hat{t}]\,:\,\bar{x}_{1}(t)=1\}$ and $\hat{\alpha}_{2}:=\min\{t\in[\hat{t},\alpha_{2}]\,:\,\bar{x}_{1}(t)=1\}$ are well defined. Note that $\hat{t}\in\big{(}\hat{\alpha}_{1},\hat{\alpha}_{2}\big{)}$ and $\bar{x}_{1}(t)<1$ for every $t\in(\hat{\alpha}_{1},\hat{\alpha}_{2})$ . If $\varepsilon>0$ is small enough, then $\hat{\alpha}_{1}+\varepsilon\in\big{(}\hat{\alpha}_{1},\hat{t}\big{)}$ . Using the result given in Subcase 4a for the restriction of the function $\bar{x}_{1}(t)$ on the segment $[\hat{\alpha}_{1}+\varepsilon,\hat{\alpha}_{2}]$ (thus, $\hat{\alpha}_{1}+\varepsilon$ plays the role of $t_{0}$ and $\hat{\alpha}_{2}$ takes the place of $\alpha_{1}$ ), one finds that

[TABLE]

In particular, the function $\bar{x}_{1}(t)$ is strictly increasing on $[\hat{\alpha}_{1}+\varepsilon,\hat{\alpha}_{2}]$ . Since $\hat{t}\in\big{(}\alpha_{1}+\varepsilon,\hat{\alpha}_{2}\big{)}$ , this implies that $\bar{x}_{1}(\hat{\alpha}_{1}+\varepsilon)<\bar{x}_{1}(\hat{t})$ . Then, by the continuity of $\bar{x}_{1}(t)$ we obtain

[TABLE]

As $\bar{x}_{1}(\hat{\alpha}_{1})=1$ , we have arrived at a contradiction.

Since Subcase 4b cannot happen, we conclude that the formula for $\bar{x}_{1}(t)$ in this case is given by

[TABLE]

with $\alpha_{1}:=t_{0}+a^{-1}(1-x_{0})$ . One must have $\alpha_{1}\leq\bar{t}$ , where $\bar{t}$ is defined by (4.36). Indeed, suppose on the contrary that $\alpha_{1}>\bar{t}$ . For an arbitrarily given $\alpha\in(\bar{t},\alpha_{1})$ , we consider the problem $(FP_{1b})$ (resp., the problem $(FP_{2b})$ ) which is obtained from the problem $(FP_{1a})$ in Section 3 (resp., from the above problem $(FP_{2b})$ ) by letting $\alpha$ play the role of the initial time $t_{0}$ . Since $\alpha>\bar{t}$ , it follows from Theorem 3.1 that $(FP_{1b})$ has a unique global solution $(\bar{x}^{\alpha},\bar{u}^{\alpha})$ , where $\bar{u}(t)=1$ for almost everywhere $t\in[\alpha,T]$ , $\bar{x}_{1}^{\alpha}(t)=\bar{x}_{1}(\alpha)-a(t-\alpha)$ for all $t\in[\alpha,T]$ , and $\bar{x}_{2}^{\alpha}(t)=\int_{\alpha}^{t}\big{[}-e^{-\lambda\tau}(x_{1}(\tau)+u(\tau))\big{]}d\tau$ for all $t\in[\alpha,T]$ . Clearly, the restriction of $(\bar{x},\bar{u})$ on $[\alpha,T]$ is a feasible process for $(FP_{1b})$ . Thus, we have

[TABLE]

Besides, by Proposition 4.2, the restriction of $(\bar{x},\bar{u})$ on $[\alpha,T]$ is a $W^{1,1}$ local solution for $(FP_{2b})$ . So, there exits $\delta>0$ such that the restriction of $(\bar{x},\bar{u})$ on $[\alpha,T]$ minimizes the quantity $x_{2}(T)$ over all feasible processes $(x,u)$ of $(FP_{2b})$ with $\|x-\bar{x}\|_{W^{1,1}([\alpha,T];{\rm I\!R}^{n})}\leq\delta$ . Clearly, $(\bar{x}^{\alpha},\bar{u}^{\alpha})$ is a feasible process of $(FP_{2b})$ . Therefore, since $\|\bar{x}^{\alpha}-\bar{x}\|_{W^{1,1}([\alpha,T];{\rm I\!R}^{n})}\leq\delta$ for all $\alpha$ sufficiently close to $\alpha_{1}$ , we have $\bar{x}_{2}^{\alpha}(T)\geq\bar{x}_{2}(T)$ for those $\alpha$ . This contradicts (4.46).

Going back to the original problem $(FP_{2})$ , we can summarize the results obtained in this section as follows.

Theorem 4.4.

Given any $a,\lambda$ with $a>\lambda>0$ , define $\rho=\dfrac{1}{\lambda}\ln\dfrac{a}{a-\lambda}>0$ , $\bar{t}=T-\rho$ $\bar{x}_{0}=1-a(\bar{t}-t_{0})$ , and $\alpha_{1}=t_{0}+a^{-1}(1-x_{0})$ . Then, problem $(FP_{2})$ has a unique local solution $(\bar{x},\bar{u})$ , which is a global solution, where $\bar{u}(t)=-a^{-1}\dot{\bar{x}}(t)$ for almost everywhere $t\in[t_{0},T]$ and $\bar{x}(t)$ can be described as follows:

(a)* If $t_{0}\geq\bar{t}$ (i.e, $T-t_{0}\leq\rho$ ), then*

[TABLE]

(b)* If $t_{0}<\bar{t}$ and $x_{0}<\bar{x}_{0}$ (i.e, $\rho<T-t_{0}<\rho+a^{-1}(1-x_{0})$ ), then*

[TABLE]

(c)* If $t_{0}<\bar{t}$ and $x_{0}\geq\bar{x}_{0}$ (i.e, $T-t_{0}\geq\rho+a^{-1}(1-x_{0})$ ), then*

[TABLE]

Proof.

To obtain the assertions (a)–(c), it suffices to combine the results formulated in Case 1, Case 3, and Case 4, having in mind that $\bar{x}_{1}(t)$ in $(FP_{2a})$ plays the role of $\bar{x}(t)$ in $(FP_{2})$ . ∎

5 Conclusions

We have analyzed a maximum principle for finite horizon state constrained problems via two parametric examples of optimal control problems of the Langrange type, which have five parameters. These problems resemble the optimal growth problem in mathematical economics. The first example is related to control problems without state constraints. The second one belongs to the class of irregular control problems with unilateral state constraints. We have proved that the control problem in each example has a unique local solution, which is a global solution. Moreover, we are able to present an explicit description of the optimal process with respect to the five parameters.

The obtained results allows us to have a deep understanding of the maximum principle in question.

It seems to us that, following the approach adopted in this paper, one can study economic optimal growth models by advanced tools from functional analysis and optimal control theory.

Bibliography14

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V. Basco, P. Cannarsa, H. Frankowska, Necessary conditions for infinite horizon optimal control problems with state constraints , Preprint, 2018.
2[2] L. Cesari, Optimization Theory and Applications , 1st edition, Springer-Verlag, New York, 1983.
3[3] A. D. Ioffe, V. M. Tihomirov, Theory of Extremal Problems , North-Holland Publishing Company, Amsterdam, 1979.
4[4] R. F. Hartl, S. P. Sethi, R. G. Vickson, A survey of the maximum principles for optimal control problems with state constraints , SIAM Rev. 37 (1995), 181–218.
5[5] A. N. Kolmogorov, S. V. Fomin, Introductory Real Analysis. Revised English edition. Translated from the Russian and edited by R. A. Silverman, Dovers Publications, Inc., New York, 1970.
6[6] D. G. Luenberger, Optimization by Vector Space Methods . John Wiley & Sons, New York, 1969.
7[7] B. S. Mordukhovich, Variational Analysis and Generalized Differentiation , Vol. I: Basic Theory, Springer, New York, 2006.
8[8] B. S. Mordukhovich, Variational Analysis and Generalized Differentiation , Vol. II: Applications, Springer, New York, 2006.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Analyzing a Maximum Principle for Finite Horizon State Constrained Problems via Parametric Examples. Part 1: Problems with Unilateral State Constraints111Financial supports from several research projects in Taiwan and Vietnam are gratefully acknowledged.

1 Introduction

2 Background Materials

2.1 Notations and Definitions

Definition 2.1**.**

Definition 2.2**.**

Definition 2.3** (See [14, p. 329]).**

2.2 A Maximum Principle for State Constrained Problems

Theorem 2.1** (See [14, Theorem 9.3.1]).**

Proposition 2.1** (See [14, Theorem 6.2.1]).**

2.3 Solution Existence in State Constrained Optimal Control

Theorem 2.2** (see [2, Theorem 9.2.i and Section 9.4]).**

3 Control Problems without State Constraints

3.1 Solution Existence

3.2 Necessary Optimality Conditions

3.2.1 Necessary Optimality Conditions for (FP1a)(FP_{1a})(FP1a​) in Terms of Proposition 2.1

3.2.2 Necessary Optimality Conditions for (FP1a)(FP_{1a})(FP1a​) in Terms of Theorem 2.1

Theorem 3.1**.**

Proof.

4 Control Problems with Unilateral Constraints

4.1 Solution Existence

4.2 Necessary Optimality Conditions

Proposition 4.1**.**

Proof.

Proposition 4.2**.**

Proof.

Proposition 4.3**.**

Proof.

Theorem 4.4**.**

Proof.

5 Conclusions

Definition 2.1.

Definition 2.2.

Definition 2.3 (See [14, p. 329]).

Theorem 2.1 (See [14, Theorem 9.3.1]).

Proposition 2.1 (See [14, Theorem 6.2.1]).

Theorem 2.2 (see [2, Theorem 9.2.i and Section 9.4]).

3.2.1 Necessary Optimality Conditions for $(FP_{1a})$ in Terms of Proposition 2.1

3.2.2 Necessary Optimality Conditions for $(FP_{1a})$ in Terms of Theorem 2.1

Theorem 3.1.

Proposition 4.1.

Proposition 4.2.

Proposition 4.3.

Theorem 4.4.