On the proof of Michel of the maximum Pontryagin Principle

Jo\"el Blot (SAMM); Hasan Yilmaz

arXiv:1904.01254·math.OC·April 3, 2019

On the proof of Michel of the maximum Pontryagin Principle

Jo\"el Blot (SAMM), Hasan Yilmaz

PDF

Open Access

TL;DR

This paper improves the Pontryagin maximum principle for optimal control problems with final constraints, using advanced functional analysis tools and needlelike variations in a Banach space setting.

Contribution

It introduces a novel proof of the maximum principle incorporating piecewise differentiability and recent multiplier rules, enhancing previous Michel's approach.

Findings

01

Enhanced maximum principle with broader applicability

02

Inclusion of piecewise differentiable state functions

03

Use of functional analysis tools in proof structure

Abstract

We provide an improvment of the maximum principle of Pon-tryagin of the optimal control problems, for a system governed by an ordinary differential equation, in presence of final constraints, in the setting of the piece-wise differentiable state functions (valued in a Banach space) and of piecewise continuous control functions (valued in a metric space). As Michel we use the needlelike variations, but we introduce tools of functional analysis and a recent multiplier rule of the static optimization to make our proofs. Mathematical Subject Classification 2010: 49K15, 47H10

Equations76

({\mathcal{B}})\left\{\begin{array}[]{cl}{\rm Maximize}&\int_{0}^{T}f^{0}(t,x(t),u(t))dt+g^{0}(x(T))\\ {\rm subject\;to}&x\in PC^{1}([0,T],\Omega),u\in PC^{0}([0,T],U)\\ \hbox{}\hfil&x^{\prime}(t)=f(t,x(t),u(t)),\;x(0)=\xi_{0}\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(x(T))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(x(T))=0.\end{array}\right.

({\mathcal{B}})\left\{\begin{array}[]{cl}{\rm Maximize}&\int_{0}^{T}f^{0}(t,x(t),u(t))dt+g^{0}(x(T))\\ {\rm subject\;to}&x\in PC^{1}([0,T],\Omega),u\in PC^{0}([0,T],U)\\ \hbox{}\hfil&x^{\prime}(t)=f(t,x(t),u(t)),\;x(0)=\xi_{0}\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(x(T))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(x(T))=0.\end{array}\right.

(QC,i)\left\{\begin{array}[]{l}{\rm If}\;\;(c_{\alpha})_{i\leq\alpha\leq m}\in{\mathbb{R}}_{+}^{1-i+m},(d_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}{\rm\;\;\;satisfy}\\ (\forall\alpha=1,...,m,\;c_{\alpha}g^{\alpha}(x(T))=0),{\rm and}\\ \sum_{\alpha=i}^{m}c_{\alpha}Dg^{\alpha}(x(T))+\sum_{\beta=1}^{q}d_{\beta}Dh^{\beta}(x(T))=0,{\rm then}\\ (\forall\alpha=i,...,m,\;c_{\alpha}=0)\;\;{\rm and}\;\;(\forall\beta=1,...,q,\;d_{\beta}=0).\end{array}\right.

(QC,i)\left\{\begin{array}[]{l}{\rm If}\;\;(c_{\alpha})_{i\leq\alpha\leq m}\in{\mathbb{R}}_{+}^{1-i+m},(d_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}{\rm\;\;\;satisfy}\\ (\forall\alpha=1,...,m,\;c_{\alpha}g^{\alpha}(x(T))=0),{\rm and}\\ \sum_{\alpha=i}^{m}c_{\alpha}Dg^{\alpha}(x(T))+\sum_{\beta=1}^{q}d_{\beta}Dh^{\beta}(x(T))=0,{\rm then}\\ (\forall\alpha=i,...,m,\;c_{\alpha}=0)\;\;{\rm and}\;\;(\forall\beta=1,...,q,\;d_{\beta}=0).\end{array}\right.

N P C^{0} ([0, T], A, (τ_{i})_{0 \leq i \leq k + 1}) := N P C^{0} ([0, T], A) \cap P C^{0} ([0, T], A, (τ_{i})_{0 \leq i \leq k + 1}) .

N P C^{0} ([0, T], A, (τ_{i})_{0 \leq i \leq k + 1}) := N P C^{0} ([0, T], A) \cap P C^{0} ([0, T], A, (τ_{i})_{0 \leq i \leq k + 1}) .

\underline{d}x(t):=\left\{\begin{array}[]{ccl}x^{\prime}(t)&{\rm if}&t\in[0,T]\setminus\{\tau_{i}:i\in\{0,...,k+1\}\}\\ x^{\prime}(\tau_{i}+)&{\rm if}&t=\tau_{i},i\in\{0,...,k\}\\ x^{\prime}(T-)&{\rm if}&t=T.\end{array}\right.

\underline{d}x(t):=\left\{\begin{array}[]{ccl}x^{\prime}(t)&{\rm if}&t\in[0,T]\setminus\{\tau_{i}:i\in\{0,...,k+1\}\}\\ x^{\prime}(\tau_{i}+)&{\rm if}&t=\tau_{i},i\in\{0,...,k\}\\ x^{\prime}(T-)&{\rm if}&t=T.\end{array}\right.

({\mathcal{B}^{\prime}})\left\{\begin{array}[]{cl}{\rm Maximize}&J(x,u):=\int_{0}^{T}f^{0}(t,x(t),u(t))dt+g^{0}(x(T))\\ {\rm subject\;to}&x\in PC^{1}([0,T],\Omega),u\in NPC^{0}([0,T],U)\\ \hbox{}\hfil&\underline{d}x(t)=f(t,x(t),u(t)),\;x(0)=\xi_{0}\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(x(T))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(x(T))=0.\end{array}\right.

({\mathcal{B}^{\prime}})\left\{\begin{array}[]{cl}{\rm Maximize}&J(x,u):=\int_{0}^{T}f^{0}(t,x(t),u(t))dt+g^{0}(x(T))\\ {\rm subject\;to}&x\in PC^{1}([0,T],\Omega),u\in NPC^{0}([0,T],U)\\ \hbox{}\hfil&\underline{d}x(t)=f(t,x(t),u(t)),\;x(0)=\xi_{0}\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(x(T))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(x(T))=0.\end{array}\right.

\overline{u}(t):=\left\{\begin{array}[]{ccl}u(t)&{\rm if}&t\in(\tau_{i},\tau_{i+1}),i\in\{0,...,k\}\\ \ u(\tau_{i}+)&{\rm if}&t=\tau_{i},i\in\{0,...,k\}\\ u(T-)&{\rm if}&t=T;\end{array}\right.

\overline{u}(t):=\left\{\begin{array}[]{ccl}u(t)&{\rm if}&t\in(\tau_{i},\tau_{i+1}),i\in\{0,...,k\}\\ \ u(\tau_{i}+)&{\rm if}&t=\tau_{i},i\in\{0,...,k\}\\ u(T-)&{\rm if}&t=T;\end{array}\right.

\forall ϵ > 0, \exists δ_{ϵ} > 0, \forall x \in X, \forall z \in K, d (x, z) \leq δ_{ϵ} ⟹ d (ϕ (x), ϕ (z)) \leq ϵ .

\forall ϵ > 0, \exists δ_{ϵ} > 0, \forall x \in X, \forall z \in K, d (x, z) \leq δ_{ϵ} ⟹ d (ϕ (x), ϕ (z)) \leq ϵ .

J (i) = J (i, S) := {j \in {1, ..., i - 1} : t_{j} = t_{i}}

J (i) = J (i, S) := {j \in {1, ..., i - 1} : t_{j} = t_{i}}

b_{i}(a)=b_{i}(a,S):=\left\{\begin{array}[]{ccl}0&{\rm if}&J(i)=\emptyset\\ \sum_{j\in J(i)}a_{j}&{\rm if}&J(i)\neq\emptyset.\end{array}\right.

b_{i}(a)=b_{i}(a,S):=\left\{\begin{array}[]{ccl}0&{\rm if}&J(i)=\emptyset\\ \sum_{j\in J(i)}a_{j}&{\rm if}&J(i)\neq\emptyset.\end{array}\right.

I_{i} (a) = I_{i} (a, S) := [t_{i} + b_{i} (a), t_{i} + b_{i} (a) + a_{i}),

I_{i} (a) = I_{i} (a, S) := [t_{i} + b_{i} (a), t_{i} + b_{i} (a) + a_{i}),

u_{a}(t)=u_{a}(t,S):=\left\{\begin{array}[]{ccl}v_{i}&{\rm if}&t\in I_{i}(a),1\leq i\leq N\\ u_{0}(t)&{\rm if}&t\in[0,T]\setminus\cup_{1\leq i\leq N}I_{i}(a).\end{array}\right.

u_{a}(t)=u_{a}(t,S):=\left\{\begin{array}[]{ccl}v_{i}&{\rm if}&t\in I_{i}(a),1\leq i\leq N\\ u_{0}(t)&{\rm if}&t\in[0,T]\setminus\cup_{1\leq i\leq N}I_{i}(a).\end{array}\right.

\underline{d} x_{a} (t) = f (t, x_{a} (t), u_{a} (t)), x_{a} (0) = ξ_{0} .

\underline{d} x_{a} (t) = f (t, x_{a} (t), u_{a} (t)), x_{a} (0) = ξ_{0} .

\int_{0}^{T} ∥ f (t, x_{0} (t), u_{a} (t)) - f (t, x_{0} (t), u_{0} (t)) ∥ d t \leq k ∥ a ∥.

\int_{0}^{T} ∥ f (t, x_{0} (t), u_{a} (t)) - f (t, x_{0} (t), u_{0} (t)) ∥ d t \leq k ∥ a ∥.

u_{0}^{i}(t):=\left\{\begin{array}[]{ccl}u_{0}(t)&{\rm if}&t\in[\tau_{i},\tau_{i+1})\\ u_{0}(\tau_{i+1}-)&{\rm if}&t=\tau_{i+1}.\end{array}\right.

u_{0}^{i}(t):=\left\{\begin{array}[]{ccl}u_{0}(t)&{\rm if}&t\in[\tau_{i},\tau_{i+1})\\ u_{0}(\tau_{i+1}-)&{\rm if}&t=\tau_{i+1}.\end{array}\right.

M := (0 \leq i \leq k ⋃ u_{0}^{i} ([τ_{i}, τ_{i + 1}]) \cup {v_{i} : 1 \leq i \leq N}) .

M := (0 \leq i \leq k ⋃ u_{0}^{i} ([τ_{i}, τ_{i + 1}]) \cup {v_{i} : 1 \leq i \leq N}) .

Γ := {(t, x_{0} (t)) : t \in [0, T]} .

Γ := {(t, x_{0} (t)) : t \in [0, T]} .

\left.\begin{array}[]{r}\forall\epsilon>0,\exists\delta_{\epsilon}\in(0,\gamma),\forall(t,\xi,\zeta)\in K,\forall(t_{1},\xi_{1},\zeta_{1})\in[0,T]\times\Omega\times U,\\ d((t,\xi,\zeta),(t_{1},\xi_{1},\zeta_{1}))=|t-t_{1}|+\|\xi-\xi_{1}\|+d(\zeta,\zeta_{1})\leq\delta_{\epsilon}\Longrightarrow\\ \|D_{2}f(t,\xi,\zeta)-D_{2}f(t_{1},\xi_{1},\zeta_{1})\|\leq\epsilon.\end{array}\right\}

\left.\begin{array}[]{r}\forall\epsilon>0,\exists\delta_{\epsilon}\in(0,\gamma),\forall(t,\xi,\zeta)\in K,\forall(t_{1},\xi_{1},\zeta_{1})\in[0,T]\times\Omega\times U,\\ d((t,\xi,\zeta),(t_{1},\xi_{1},\zeta_{1}))=|t-t_{1}|+\|\xi-\xi_{1}\|+d(\zeta,\zeta_{1})\leq\delta_{\epsilon}\Longrightarrow\\ \|D_{2}f(t,\xi,\zeta)-D_{2}f(t_{1},\xi_{1},\zeta_{1})\|\leq\epsilon.\end{array}\right\}

\begin{array}[]{rcl}\|D_{2}f(t,\xi,\zeta)\|&\leq&\|D_{2}f(t,x_{0}(t),\zeta)\|+\epsilon\\ \hbox{}\hfil&\leq&\sup_{(t_{1},\xi_{1},\zeta_{1})\in K}\|D_{2}f(t_{1},\xi_{1},\zeta_{1})\|+\epsilon\Longrightarrow\end{array}

\begin{array}[]{rcl}\|D_{2}f(t,\xi,\zeta)\|&\leq&\|D_{2}f(t,x_{0}(t),\zeta)\|+\epsilon\\ \hbox{}\hfil&\leq&\sup_{(t_{1},\xi_{1},\zeta_{1})\in K}\|D_{2}f(t_{1},\xi_{1},\zeta_{1})\|+\epsilon\Longrightarrow\end{array}

r_{1} := r e^{- L \cdot T}, X := \overline{B} (x_{0}, r_{1}) .

r_{1} := r e^{- L \cdot T}, X := \overline{B} (x_{0}, r_{1}) .

Φ_{a} : X \to C^{0} ([0, T], E), Φ_{a} (x) := [t \mapsto ξ_{0} + \int_{0}^{t} f (s, x (s), u_{a} (s)) d s] .

Φ_{a} : X \to C^{0} ([0, T], E), Φ_{a} (x) := [t \mapsto ξ_{0} + \int_{0}^{t} f (s, x (s), u_{a} (s)) d s] .

∥ Φ_{a} (x_{0}) - x_{0} ∥_{b} \leq e^{- L \cdot T} r_{1} .

∥ Φ_{a} (x_{0}) - x_{0} ∥_{b} \leq e^{- L \cdot T} r_{1} .

\forall t \in [0, T], ∥ f (t, x_{0} (t), u_{a} (t)) - f (t, x (t), u_{a} (t)) ∥ \leq L ∥ x (t) - x_{0} (t) ∥.

\forall t \in [0, T], ∥ f (t, x_{0} (t), u_{a} (t)) - f (t, x (t), u_{a} (t)) ∥ \leq L ∥ x (t) - x_{0} (t) ∥.

\forall x \in X, ∥ Φ_{a} (x) - Φ_{a} (x_{0}) ∥_{b} \leq (1 - e^{- L \cdot T}) r_{1} .

\forall x \in X, ∥ Φ_{a} (x) - Φ_{a} (x_{0}) ∥_{b} \leq (1 - e^{- L \cdot T}) r_{1} .

\left.\begin{array}[]{cl}f(t,x(t),u_{a}(t))=&1_{[0,t_{1})}f(t,x_{0}(t),u_{0}(t))+\\ \hbox{}\hfil&\sum_{i=1}^{N}1_{[t_{i}+b_{i}(a),t_{i}+b_{i}(a)+a_{i})}f(t,x(t),v_{i})+\\ \hbox{}\hfil&\sum_{i=1}^{N-1}1_{[t_{i}+b_{i}(a)+a_{i},t_{i+1}+b_{i+1}(a))}f(t,x(t),u_{0}(t))\\ \hbox{}\hfil&+1_{[t_{N}+b_{N}(a)+a_{N},T]}f(t,x_{0}(t),u_{0}(t)).\end{array}\right\}

\left.\begin{array}[]{cl}f(t,x(t),u_{a}(t))=&1_{[0,t_{1})}f(t,x_{0}(t),u_{0}(t))+\\ \hbox{}\hfil&\sum_{i=1}^{N}1_{[t_{i}+b_{i}(a),t_{i}+b_{i}(a)+a_{i})}f(t,x(t),v_{i})+\\ \hbox{}\hfil&\sum_{i=1}^{N-1}1_{[t_{i}+b_{i}(a)+a_{i},t_{i+1}+b_{i+1}(a))}f(t,x(t),u_{0}(t))\\ \hbox{}\hfil&+1_{[t_{N}+b_{N}(a)+a_{N},T]}f(t,x_{0}(t),u_{0}(t)).\end{array}\right\}

n \to + \infty lim ∥ f (t, x (t), u_{a^{n}} (t)) - f (t, x (t), u_{\overset{a}{^}} (t)) ∥ = 0, μ - a . e . t \in [0, T] .

n \to + \infty lim ∥ f (t, x (t), u_{a^{n}} (t)) - f (t, x (t), u_{\overset{a}{^}} (t)) ∥ = 0, μ - a . e . t \in [0, T] .

\exists σ \in R_{+*}, \forall n \in N, \forall t \in [0, T], ∥ f (t, x (t), u_{a^{n}} (t)) - f (t, x (t), u_{\overset{a}{^}} (t)) ∥ \leq σ .

\exists σ \in R_{+*}, \forall n \in N, \forall t \in [0, T], ∥ f (t, x (t), u_{a^{n}} (t)) - f (t, x (t), u_{\overset{a}{^}} (t)) ∥ \leq σ .

n \to + \infty lim \int_{0}^{T} ∥ f (t, x (t), u_{a^{n}} (t)) - f (t, x (t), u_{\overset{a}{^}} (t)) ∥ d t = 0.

n \to + \infty lim \int_{0}^{T} ∥ f (t, x (t), u_{a^{n}} (t)) - f (t, x (t), u_{\overset{a}{^}} (t)) ∥ d t = 0.

\varrho(a)=\left\{\begin{array}[]{ccl}\frac{1}{\|a\|}(x_{a}(T)-x_{0}(T)-\Lambda a)&{\rm if}&a\neq 0\\ 0&{\rm if}&a=0.\end{array}\right.

\varrho(a)=\left\{\begin{array}[]{ccl}\frac{1}{\|a\|}(x_{a}(T)-x_{0}(T)-\Lambda a)&{\rm if}&a\neq 0\\ 0&{\rm if}&a=0.\end{array}\right.

(\mathcal{F}_{S}):=\left\{\begin{array}[]{cl}{\rm Maximize}&g^{0}(x_{a}(T))\\ {\rm subject}\>\>{\rm to}&a\in{B}(0,r_{4})\cap{\mathbb{R}}^{N}_{+}\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(x_{a}(T))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(x_{a}(T))=0.\end{array}\right.

(\mathcal{F}_{S}):=\left\{\begin{array}[]{cl}{\rm Maximize}&g^{0}(x_{a}(T))\\ {\rm subject}\>\>{\rm to}&a\in{B}(0,r_{4})\cap{\mathbb{R}}^{N}_{+}\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(x_{a}(T))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(x_{a}(T))=0.\end{array}\right.

(\mathcal{F}_{S}^{1}):=\left\{\begin{array}[]{cl}{\rm Maximize}&g^{0}(\kappa(a))\\ {\rm subject}\;\;{\rm to}&a\in B(0,r_{4})\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(\kappa(a))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(\kappa(a))=0\\ \hbox{}\hfil&\forall i=1,...,N,\;\;b^{*}_{i}a\geq 0\end{array}\right.

(\mathcal{F}_{S}^{1}):=\left\{\begin{array}[]{cl}{\rm Maximize}&g^{0}(\kappa(a))\\ {\rm subject}\;\;{\rm to}&a\in B(0,r_{4})\\ \hbox{}\hfil&\forall\alpha=1,...,m,\;\;g^{\alpha}(\kappa(a))\geq 0\\ \hbox{}\hfil&\forall\beta=1,...,q,\;\;h^{\beta}(\kappa(a))=0\\ \hbox{}\hfil&\forall i=1,...,N,\;\;b^{*}_{i}a\geq 0\end{array}\right.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsOptimization and Variational Analysis · Stability and Controllability of Differential Equations · Quantum chaos and dynamical systems

Full text

On the proof of Michel of the Maximum Pontryagin principle

Jo ${\rm\ddot{e}}$ l Blot & Hasan Yilmaz

Joël Blot: Laboratoire SAMM EA 4543,

Université Paris 1 Panthéon-Sorbonne, centre P.M.F.,

90 rue de Tolbiac, 75634 Paris cedex 13, France.

[email protected]

Hasan Yilmaz: Laboratoire LPSM UMR 8001

Université Paris-Diderot, Sorbonne-Paris-Cité

bâtiment Sophie Germain, 8 place Aurélie Nemours,

75013 Paris, France.

[email protected]

(Date: April, 1, 2019)

Abstract.

We provide an improvment of the maximum principle of Pontryagin of the optimal control problems, for a system governed by an ordinary differential equation, in presence of final constraints, in the setting of the piecewise differentiable state functions (valued in a Banach space) and of piecewise continuous control functions (valued in a metric space). As Michel we use the needlelike variations, but we introduce tools of functional analysis and a recent multiplier rule of the static optimization to make our proofs.

Mathematical Subject Classification 2010: 49K15, 47H10

Key Words: Pontryagin maximum principle, piecewise continuous functions, fixed point theorem

1. Introduction

The paper deals with the maximum principle of Pontryagin for a problem of Bolza in the following form.

[TABLE]

In the special case where $f^{0}$ is equal to zero, the problem is called a problem of Mayer and it is denoted by ( ${\mathcal{M}}$ ). $T\in(0,+\infty)$ is fixed. $E$ denotes a real Banach space, $\Omega$ is a nonempty open subset of $E$ , $U$ denotes a nonempty metric space, and $\xi_{0}\in\Omega$ is fixed; we use the mappings $f^{0}:[0,T]\times\Omega\times U\rightarrow{\mathbb{R}}$ and $f:[0,T]\times\Omega\times U\rightarrow E$ . The real valued functions $g^{\alpha}$ and $h^{\beta}$ are defined on $\Omega$ , and $m$ and $q$ are fixed integer numbers.

$PC^{0}([0,T],U)$ denotes the space of the piecewise continuous functions from $[0,T]$ into $U$ , and $PC^{1}([0,T],\Omega)$ denotes the space of the piecewise differentiable functions from $[0,T]$ into $\Omega$ . The precise definitions of these notions are given in Section 2. When $(x,u)$ is an admissible process for $(\mathcal{B})$ or $(\mathcal{M})$ , we consider the following condition of qualification, $i\in\{0,1\}$ . (QC, 0) is due to Michel, [9].

[TABLE]

The main theorems of the paper are the following ones.

Theorem 1.1.

Let $(x_{0},u_{0})$ be a solution of problem ( ${\mathcal{B}}$ ). We assume that the following assumptions are fulfilled.

(A1)

For all $\alpha\in\{0,...,m\}$ , $g^{\alpha}$ is Fréchet differentiable at $x_{0}(T)$ .

(A2)

For all $\beta\in\{1,...,q\}$ , $h^{\beta}$ is continuous on a neighborhood of $x_{0}(T)$ and is Fréchet differentiable at $x_{0}(T)$ .

(A3)

$f^{0}$ * is continuous on $[0,T]\times\Omega\times U$ , the partial differential with respect to the second vector variable $D_{2}f^{0}(t,\xi,\zeta)$ exists for all $(t,\xi,\zeta)\in[0,T]\times\Omega\times U$ , and $D_{2}f^{0}$ is continuous on $[0,T]\times\Omega\times U$ .*

(A4)

$f$ * is continuous on $[0,T]\times\Omega\times U$ , the partial differential with respect to the second vector variable $D_{2}f(t,\xi,\zeta)$ exists for all $(t,\xi,\zeta)\in[0,T]\times\Omega\times U$ , and $D_{2}f$ is continuous on $[0,T]\times\Omega\times U$ .*

*Then there exists $(\lambda_{\alpha})_{0\leq\alpha\leq m}\in{\mathbb{R}}^{1+m}$ , $(\mu_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}$ and $p\in PC^{1}([0,T],E^{*})$ which satisfy the following conditions.

Part (I)*

(NN)

$(\lambda_{\alpha})_{0\leq\alpha\leq m}$ * and $(\mu_{\beta})_{1\leq\beta\leq q}$ are not simultaneously equal to zero.*

(Si)

For all $\alpha\in\{0,...,m\}$ , $\lambda_{\alpha}\geq 0$ .

(S ${\ell}$ )

For all $\alpha\in\{1,...,m\}$ , $\lambda_{\alpha}g^{\alpha}(x_{0}(T))=0$ .

(TC)

$\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{0}(T))=p(T)$ .

(AE.B)

$p^{\prime}(t)=-D_{2}H_{B}(t,x_{0}(t),u_{0}(t),p(t),\lambda_{0})$ * for all $t\in[0,T]$ except at most when $t$ is a discontinuity point of $u_{0}$ .*

(MP.B)

*For all $t\in[0,T]$ , for all $\zeta\in U$ , *

$H_{B}(t,x_{0}(t),u_{0}(t),p(t),\lambda_{0})\geq H_{B}(t,x_{0}(t),\zeta,p(t),\lambda_{0})$ .

(CH.B)

$\bar{H}_{B}:=[t\mapsto H_{B}(t,x_{0}(t),u_{0}(t),p(t),\lambda_{0})]\in C^{0}([0,T],{\mathbb{R}})$ .

Part (II)* If in addition we assume that, for all $(t,\xi,\zeta)\in[0,T]\times\Omega\times U$ , the partial derivatives with respect to the first variable $\partial_{1}f^{0}(t,\xi,\zeta)$ and $\partial_{1}f(t,\xi,\zeta)$ exist and $\partial_{1}f^{0}$ and $\partial_{1}f$ are continuous on $[0,T]\times\Omega\times U$ , then $\bar{H}_{B}\in PC^{1}([0,T],{\mathbb{R}})$ and, for all $t\in[0,T]$ which is a continuity point of $u_{0}$ , $\bar{H}_{B}^{\prime}(t)=\partial_{1}H_{B}(t,x_{0}(t),u_{0}(t),p(t),\lambda_{0})$ .

Part (III) If we assume that (QC, 1) is fulfilled for $(x,u)=(x_{0},u_{0})$ then, for all $t\in[0,T]$ , $(\lambda_{0},p(t))$ is never equal to zero.*

In this statement, $E^{*}$ denotes the topological dual space of $E$ , (NN) is a condition of non nullity, (Si) is a sign condition, (S ${\ell}$ ) is a slackness condition, (TC) is the transversality condition, (AE.B) is the adjoint equation where the Hamiltonian of the problem of Bolza is defined as $H_{B}(t,x,u,p,\lambda):=\lambda f^{0}(t,x,u)+p\cdot f(t,x,u)$ . (MP.B) is the maximum principle and (CH.B) is a condition of continuity on the Hamiltonian.

Theorem 1.2.

*Let $(x_{0},u_{0})$ be a solution of $(\mathcal{M})$ . Under (A1), (A2), and (A4) there exist $(\lambda_{\alpha})_{0\leq\alpha\leq m}\in{\mathbb{R}}^{1+m}$ , $(\mu_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}$ and $p\in PC^{1}([0,T],E^{*})$ such that the following conditions hold.

Part (I)*

(NN)

$(\lambda_{\alpha})_{0\leq\alpha\leq m}$ * and $(\mu_{\beta})_{1\leq\beta\leq q}$ are not simultaneously equal to zero.*

(Si)

For all $\alpha\in\{0,...,m\}$ , $\lambda_{\alpha}\geq 0$ .

(S ${\ell}$ )

For all $\alpha\in\{1,...,m\}$ , $\lambda_{\alpha}g^{\alpha}(x_{0}(T))=0$ .

(TC)

$\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{0}(T))=p(T)$ .

(AE.M)

$p^{\prime}(t)=-D_{2}H_{M}(t,x_{0}(t),u_{0}(t),p(t))$ * for all $t\in[0,T]$ except at most when $t$ is a discontinuity point of $u_{0}$ .*

(MP.M)

*For all $t\in[0,T]$ , for all $\zeta\in U$ , *

$H_{M}(t,x_{0}(t),u_{0}(t),p(t))\geq H_{M}(t,x_{0}(t),\zeta,p(t))$ .

(CH.M)

$\bar{H}_{M}:=[t\mapsto H_{M}(t,x_{0}(t),u_{0}(t),p(t))]\in C^{0}([0,T],{\mathbb{R}})$ .

Part (II)* If we assume that, for all $(t,\xi,\zeta)\in[0,T]\times\Omega\times U$ , the partial derivative with respect to the first variable $\partial_{1}f(t,\xi,\zeta)$ exists and $\partial_{1}f$ is continuous on $[0,T]\times\Omega\times U$ , then we have $\bar{H}_{M}\in PC^{1}([0,T],{\mathbb{R}})$ and, for all $t\in[0,T]$ which is a continuity point of $u_{0}$ , $\bar{H}_{M}^{\prime}(t)=\partial_{1}H_{M}(t,x_{0}(t),u_{0}(t),p(t))$ .

Part (III) If in addition of (A1), (A2), (A4), we assume that (QC, 0) is fulfilled when $(x,u)=(x_{0},u_{0})$ , then $p(t)$ is never equal to zero when $t\in[0,T]$ .*

In this statement the Hamiltonian of the problem of Mayer is defined as

$H_{M}(t,x,u,p):=pf(t,x,u)$ . To prove these statements, we build a variation of the proof of Michel [9] (on the problem of Mayer) by introducing functional analytic arguments. Notably we consider special function spaces of piecewise continuous functions, operators on these function spaces and fixed point theorems. We also use a recent result of multiplier rule on static optimization problems. The main contributions of the paper are the following ones.

•

Our assumptions on the $g^{\alpha}$ are only their Fréchet diffferentiability, and on the $h^{\beta}$ are their continuity and their Fréchet differentiability, not their continuous differentiability as in [9], [1] (p. 321) and [7] (p. 132).

•

In [1]( p. 321) and [7] (p. 132) the first conclusion of the theorem of Pontryagin is that $(\lambda_{\alpha})_{0\leq\alpha\leq m}$ , $(\mu_{\beta})_{1\leq\beta\leq q}$ , and $p$ are not simultaneously equal to zero. In our Theorem 1.1, the first conclusion is that $(\lambda_{\alpha})_{0\leq\alpha\leq m}$ and $(\mu_{\beta})_{1\leq\beta\leq q}$ are not simultaneously equal to zero; it is an improvment.

•

As in [9] we do not demand the finiteness of the dimension of the space $E$ ; in [1] and in [7], $E$ is finite-dimensional. Moreover we use an open subset of $E$ instead of $E$ ; it is another difference with [9]. Ever about [9], we prove a condition of continuity of the needlike variations with respect to the thickness of the needles, which is useful but omitted in [9].

Note that there exist statements of theorem of Pontryagin without assumptions of continuous differentiablity, by using locally Lipschitzean mappings and generalized differential calculus on these mappings, e.g. in [5]. A mapping which is Fréchet differentiable at a point is not necessarily locally Lipschitzean, and conversely a mapping which is locally Lipschitzean is not necessarily Fréchet differentiable at a given point; hence our result is not comparable with the statements of the locally Lipschitzean setting.

2. Function spaces

When $X$ and $Y$ are metric spaces, $C^{0}(X,Y)$ denotes the space of the continuous mappings from $X$ into $Y$ . When $X$ is an open subset of a real normed vector space or an interval of ${\mathbb{R}}$ , $C^{1}(X,Y)$ denotes the space of the continuously Fréchet differentiable mappings from $X$ into $Y$ . When $X$ and $Y$ are real normed vector spaces, ${\mathcal{L}}(X,Y)$ denotes the space of the bounded linear mappings from $X$ into $Y$ , and $Isom(X,X)$ denotes the space of the topological isomorphisms from $X$ onto $X$ . When $X$ is a metric space, $x\in X$ and $r\in{\mathbb{R}}_{+*}:=(0,+\infty)$ , the closed ball (respectively open ball) centered at $x$ with a radius equal to $r$ is denoted by $\overline{B}(x,r)$ (respectively $B(x,r)$ ).

2.1. Piecewise continuous functions.

Let $Y$ be a metric space. A function $u:[0,T]\rightarrow Y$ is called piecewise continuous when $u\in C^{0}([0,T],Y)$ or when there exists a subdivision $0=\tau_{0}<\tau_{1}<...<\tau_{k}<\tau_{k+1}=T$ such that

•

For all $i\in\{0,...,k\}$ , $u$ is continuous on $(\tau_{i},\tau_{i+1})$ .

•

For all $i\in\{0,...,k\}$ , the right-hand limit $u(\tau_{i}+)$ exists in $Y$ .

•

For all $i\in\{1,...,k+1\}$ , the left-hand limit $u(\tau_{i}-)$ exists in $Y$ .

In other words, such a function is a regulated function (cf. [4], chapter 2 ) which possesses at most a finite number of discontinuity points. Their space is denoted by $PC^{0}([0,T],Y)$ . $PC^{0}([0,T],Y,(\tau_{i})_{0\leq i\leq k+1})$ denotes the space of the $u\in PC^{0}([0,T],Y)$ such that the set of the discontinuity points of $u$ is included in $\{\tau_{i}:i\in\{0,...,k+1\}\}$ . When $A$ is a subset of $Y$ , $PC^{0}([0,T],A)$ (respectively $PC^{0}([0,T],A,(\tau_{i})_{0\leq i\leq k+1})$ ) denotes the space of the $u\in PC^{0}([0,T],Y)$ (respectively $PC^{0}([0,T],A,(\tau_{i})_{0\leq i\leq k+1})$ ) such that the closure $\overline{u([0,T])}\subset A$ .

Definition 2.1.

A function $u\in PC^{0}([0,T],A)$ is called a normalized piecewise continuous function when moreover $u$ is right continuous on $[0,T)$ and when $u(T-)=u(T)$ .

The space of such functions is denoted by $NPC^{0}([0,T],A)$ . When $(\tau_{i})_{0\leq i\leq k+1}$ is a subdivision of $[0,T]$ , we set

[TABLE]

2.2. Piecewise continuously differentiable functions.

When $E$ is a real Banach space, a function $x:[0,T]\rightarrow E$ is called piecewise continuously differentiable when $x\in C^{0}([0,T],E)$ and when $x\in C^{1}([0,T],E)$ or when there exists a subdivision $(\tau_{i})_{0\leq i\leq k+1}$ of $[0,T]$ such that the following conditions are fulfilled.

•

For all $i\in\{0,...,k\}$ , $x$ is $C^{1}$ on $(\tau_{i},\tau_{i+1})$

•

For all $i\in\{0,...,k\}$ , $x^{\prime}(\tau_{i}+)$ exists in $E$

•

For all $i\in\{1,...,k+1\}$ , $x^{\prime}(\tau_{i}-)$ exists in $E$ .

The $\tau_{i}$ are the corners of the function $x$ . We denote by $PC^{1}([0,T],E)$ the space of such functions; this space is denoted by $KC^{1}([0,T],E)$ in [1] (p. 66, Section 1.4). When $\Omega$ is an open subset of $E$ , $PC^{1}([0,T],\Omega)$ is the set of the $x\in PC^{1}([0,T],E)$ such that $x([0,T])\subset\Omega$ . When $(\tau_{i})_{0\leq i\leq k+1}$ is a subdivision of $[0,T]$ , we denote by $PC^{1}([0,T],E,(\tau_{i})_{0\leq i\leq k+1})$ the set of the $x\in PC^{1}([0,T],E)$ such that the set of the corners of $x$ is included in $\{\tau_{i}:i\in\{0,...,k+1\}\}$ .

When $x\in PC^{1}([0,T],E,(\tau_{i})_{0\leq i\leq k+1})$ , we define the function $\underline{d}x:[0,T]\rightarrow E$ by setting

[TABLE]

Note that $\underline{d}x\in NPC^{0}([0,T],E,(\tau_{i})_{0\leq i\leq k+1})$ .

2.3. Rewording of the problems

We consider the following problem.

[TABLE]

We denote by ( ${\mathcal{M}}^{\prime}$ ) the special case of ( ${\mathcal{B}}^{\prime}$ ) where $f^{0}=0$ . We denote by $Adm({\mathcal{B}})$ (respectively $Adm({\mathcal{B}}^{\prime})$ ) the set of the admissible processes of $({\mathcal{B}})$ (respectively $({\mathcal{B}}^{\prime})$ ). When $(x,u)\in Adm({\mathcal{B}})$ , and when the discontinuity points of $u$ are in the values of the subdivision $(\tau_{i})_{0\leq i\leq k+1}$ of $[0,T]$ , we introduce the fonction

[TABLE]

we have $\overline{u}\in NPC^{0}([0,T],U)$ .

Note that $f(t,x(t),u(t))$ and $f(t,x(t),\overline{u}(t))$ can to be diffferent only when $t\in\{\tau_{i}:0\leq i\leq k+1\}$ and so we have $\underline{d}x(t)=f(t,x(t),\overline{u}(t))$ for all $t\in[0,T]$ . Also note that $f^{0}(t,x(t),u(t))$ and $f^{0}(t,x(t),\overline{u}(t))$ can to be diffferent only when $t\in\{\tau_{i}:0\leq i\leq k+1\}$ and so we have $\int_{0}^{T}f^{0}(t,x(t),u(t))dt=\int_{0}^{T}f^{0}(t,x(t),\overline{u}(t))dt$ . Consequently we obtain $J(Adm({\mathcal{B}}))=J(Adm({\mathcal{B}}^{\prime}))$ . When $(x_{0},u_{0})$ is a solution of $({\mathcal{B}}^{\prime})$ then it is also a solution of $({\mathcal{B}})$ . Conversely, when $(x_{0},u_{0})$ is a solution of $({\mathcal{B}})$ , building $\overline{u_{0}}$ by using (2.2) where $u$ is $u^{0}$ , we obtain that $(x_{0},\overline{u_{0}})$ is a solution of $({\mathcal{B}}^{\prime})$ . It is why we can say that the problems $({\mathcal{B}})$ and $({\mathcal{B}}^{\prime})$ are equivalent problems. A similar reasoning is valid to show that the problems $(\mathcal{M})$ and $(\mathcal{M}^{\prime})$ are equivalent.

3. The needlelike variations

3.1. Two results of the metric spaces theory

The first result is a generalization of the theorem of Heine on the uniform continuity of a continuous mapping on a compact metric space; it is useful to avoid an assumption of local compactness, and specially, in normed vector spaces, to avoid an assumption of finiteness of the dimension.

Theorem 3.1.

([12] p. 355, note (**)) Let $X$ and $Y$ be two metric spaces, $\phi\in C^{0}(X,Y)$ , and $K\subset X$ be a compact. Then we have

[TABLE]

The following result is a theorem of fixed points in presence of parameters.

Theorem 3.2.

([12] p. 103, Theorem 46-bis ) Let $X$ be a complete metric space, $\Lambda$ be a metric space, and $\phi:X\times\Lambda\rightarrow X$ be a mapping. We assume that the following conditions are fulfilled.

(a)

$\forall x\in X,\phi(x,\cdot)\in C^{0}(\Lambda,X).$ **

(b)

$\exists k\in[0,1),\forall\lambda\in\Lambda,\forall x,z\in X,d(\phi(x,\lambda),\phi(z,\lambda))\leq kd(x,z).$ **

Then we have

(i)

$\forall\lambda\in\Lambda,\exists!x_{\lambda}\in X,\phi(x_{\lambda},\lambda)=x_{\lambda}.$ **

(ii)

$[\lambda\mapsto x_{\lambda}]\in C^{0}(\Lambda,X)$ .

3.2. Definitions of the needlelike variations

We follow the definition of Michel of the needlelike variations which is given in [9]; Michel himself refers to [11] for this approach. Let $(x_{0},u_{0})$ be a solution of $({\mathcal{M}}^{\prime})$ . When $N\in{\mathbb{N}}_{*}:={\mathbb{N}}\setminus\{0\}$ , we consider $S:=((t_{i},v_{i}))_{1\leq i\leq N}$ where $t_{i}\in[0,T]$ satisfying $0<t_{1}\leq t_{2}\leq...\leq t_{N}<T$ , and where $v_{i}\in U$ . We denote by ${\mathbb{S}}$ the set of such $S$ . When $S\in{\mathbb{S}}$ and $a=(a_{1},...,a_{N})\in{\mathbb{R}}^{N}_{+}$ , we define the following objects

[TABLE]

When $a$ is small enough, we have $I_{i}(a)\subset[0,T]$ and $I_{i}(a)\cap I_{j}(a)=\emptyset$ when $i\neq j$ .

We will prove the existence of a solution, denoted by $x_{a}$ (which depends on $S$ and $a$ ) of the following Cauchy problem on $[0,T]$ :

[TABLE]

In the sequel of this section, we arbitrarily fix a $S=(t_{i},v_{i})_{1\leq i\leq N}$ in ${\mathbb{S}}$ .

3.3. Properties of continuity

In this subsection, we establish the existence of $x_{a}$ on $[0,T]$ all over and we establish the continuity of the mapping $[a\mapsto x_{a}]$ . To do that we introduce an appropriate function space and a nonlinear operator from which $x_{a}$ appears as a fixed point of this operator. The continuity of $[a\mapsto x_{a}]$ will be a consequence of the fixed point theorem with parameters.

Lemma 3.3.

([9], Proposition 2) There exists $k\in{\mathbb{R}}_{+*}:={\mathbb{R}}_{+}\setminus\{0\}$ , there exists $\rho\in{\mathbb{R}}_{+*}$ such that, for all $a\in{\mathbb{R}}^{N}_{+}$ satisfying $\|a\|\leq\rho$ , we have

[TABLE]

We consider the subdivision $(\tau_{i})_{0\leq i\leq k+1}$ of $[0,T]$ where the $\tau_{i}$ are the discontinuity points of $u_{0}$ . For all $i\in\{0,...k\}$ we consider the function $u_{0}^{i}:[\tau_{i},\tau_{i+1}]\rightarrow U$ defined by

[TABLE]

Hence $u_{0}^{i}\in C^{0}([\tau_{i},\tau_{i+1}],U)$ , and consequently $u_{0}^{i}([\tau_{i},\tau_{i+1}])$ is compact. We set

[TABLE]

$M$ is compact as a finite union of compacts. We set

[TABLE]

Since $x_{0}\in C^{0}([0,T],\Omega)$ , $\Gamma$ is compact.

Lemma 3.4.

*There exist $L\in{\mathbb{R}}_{+*}$ and $r\in{\mathbb{R}}_{+*}$ such that, $\forall t\in[0,T]$ ,

$\forall\xi,\xi_{1}\in\overline{B}(x_{0}(t),r)$ , $\forall\zeta\in M$ , we have $\|f(t,\xi,\zeta)-f(t,\xi_{1},\zeta)\|\leq L\|\xi-\xi_{1}\|.$ *

Proof.

Since $\Omega$ is open in $E$ , since $x_{0}([0,T])$ is compact and included in $\Omega$ , there exists $\gamma>0$ such that $\{\xi\in E:d(\xi,x_{0}([0,T]))<\gamma\}\subset\Omega$ , where $d(\xi,x_{0}([0,T]):=\inf_{0\leq t\leq T}\|\xi-x_{0}(t)\|$ . We set $K:=\Gamma\times M$ ; $K$ is compact as a product of compacts. Using Theorem 3.1 and (A4), we have

[TABLE]

Arbitrarily fix an $\epsilon>0$ . Let $t\in[0,T]$ , $\zeta\in M$ and $\xi\in\overline{B}(x_{0}(t),\delta_{\epsilon})$ . From (3.6) we obtain

[TABLE]

$L:=\sup\{\|D_{2}f(t,\xi_{1},\zeta_{1})\|:t\in[0,T],\xi_{1}\in\overline{B}(x_{0}(t),\delta_{\epsilon}),\zeta_{1}\in M\}$

$\leq\sup_{(t_{1},\xi_{1},\zeta_{1})\in K}\|D_{2}f(t_{1},\xi_{1},\zeta_{1})\|+\epsilon<+\infty$ .

We set $r:=\delta_{\epsilon}$ . If $t\in[0,T]$ , $\xi,\xi_{1}\in\overline{B}(x_{0}(t),r)$ and $\zeta\in M$ , using the Mean Value Inequality of the differential calculus theory we obtain

$\|f(t,\xi,\zeta)-f(t,\xi_{1},\zeta)\|\leq L\|\xi-\xi_{1}\|$ . ∎

When $\varphi\in C^{0}([0,T],E)$ , we set $\|\varphi\|_{b}:=\sup_{t\in[0,T]}(e^{-Lt}\|\varphi(t)\|)$ . $\|\cdot\|_{b}$ is called the norm of Bielecki ([6] p. 25-27) and $(C^{0}([0,T],E),\|\cdot\|_{b})$ is a complete normed vector space. We define

[TABLE]

Note that $(\mathcal{X},\|\cdot\|_{b})$ is a complete metric space. Note that when $x\in\overline{B}(x_{0},r_{1})$ we have, for all $t\in[0,T]$ , $e^{-L\cdot T}\|x(t)-x_{0}(t)\|\leq e^{-L\cdot t}\|x(t)-x_{0}(t)\|\leq e^{-L\cdot T}r$ which implies $\|x(t)-x_{0}(t)\|\leq r<\gamma$ , and so $x(t)\in\Omega$ . For all $a\in\overline{B}(0,\rho)\cap{\mathbb{R}}^{N}_{+}$ , we consider the operator

[TABLE]

Lemma 3.5.

*The constants $k$ and $\rho$ come from Lemma 3.3; the constant $L$ comes from Lemma 3.4 and the constant $r_{1}$ comes from (3.7).

We set $r_{2}:=\min\{\rho,e^{-L\cdot T}r_{1}k^{-1}\}$ . When $a\in{\mathbb{R}}^{N}_{+}$ , if $\|a\|\leq r_{2}$ then $\Phi_{a}(\mathcal{X})\subset\mathcal{X}$ .*

Proof.

Note that, for all $t\in[0,T]$ , we have $x_{0}(t)=\xi_{0}+\int_{0}^{t}f(s,x_{0}(s),u_{0}(s))ds$ ; consequently we have $\|\Phi_{a}(x_{0})(t)-x_{0}(t)\|=\|\int_{0}^{t}(f(s,x_{0}(s),u_{a}(s))-f(s,x_{0}(s),u_{0}(s)))ds\|$

$\leq\int_{0}^{t}\|f(s,x_{0}(s),u_{a}(s))-f(s,x_{0}(s),u_{0}(s))\|ds\;\;\;\;\Longrightarrow$

$e^{-L\cdot t}\|\Phi_{a}(x_{0})(t)-x_{0}(t)\|\leq e^{-L\cdot t}\int_{0}^{t}\|f(s,x_{0}(s),u_{a}(s))-f(s,x_{0}(s),u_{0}(s))\|ds$

$\leq e^{-L\cdot t}\int_{0}^{T}\|f(s,x_{0}(s),u_{a}(s))-f(s,x_{0}(s),u_{0}(s))\|ds\leq e^{-L\cdot t}k\|a\|$ using Lemma 3.3. Hence taking the sup on the $t\in[0,T]$ , we obtain

$\|\Phi_{a}(x_{0})-x_{0}\|_{b}\leq\sup_{t\in[0,T]}e^{-L\cdot t}k\|a\|\leq k\|a\|$ , and so we have, for $a\in{\mathbb{R}}^{N}_{+}$ ,

[TABLE]

Let $x\in\mathcal{X}$ ; then for all $t\in[0,T]$ , we have $e^{-L\cdot t}\|x(t)-x_{0}(t)\|\leq r_{1}$ which implies $\|x(t)-x_{0}(t)\|\leq r$ , and we can use Lemma 3.4 to assert that we have

[TABLE]

Now for all $t\in[0,T]$ we have

$\|(\Phi_{a}(x)-\Phi_{a}(x_{0}))(t)\|=\|\int_{0}^{t}(f(s,x(s),u_{a}(s))-f(s,x_{0}(s),u_{a}(s)))ds\|$

$\leq\int_{0}^{t}\|f(s,x(s),u_{a}(s))-f(s,x_{0}(s),u_{a}(s))\|ds\;\;\;\;\Longrightarrow$

$e^{-L\cdot t}\|(\Phi_{a}(x)-\Phi_{a}(x_{0}))(t)\|\leq e^{-L\cdot t}\int_{0}^{t}\|f(s,x(s),u_{a}(s))-f(s,x_{0}(s),u_{a}(s))\|ds$

$\leq e^{-L\cdot t}\int_{0}^{t}(L\|x(t)-x_{0}(t)\|)ds$ (after (3.10))

$=Le^{-L\cdot t}\int_{0}^{t}(e^{L\cdot s}e^{-L\cdot s}\|x(s)-x_{0}(s)\|)ds\leq Le^{-L\cdot t}\int_{0}^{t}(e^{L\cdot s}\|x-x_{0}\|_{b})ds$

$=Le^{-L\cdot t}\frac{e^{L\cdot t}-1}{L}\|x-x_{0}\|_{b}=(1-e^{L\cdot t})\|x-x_{0}\|_{b}\leq(1-e^{-L\cdot T})r_{1}$ .

Taking the sup on the $t\in[0,T]$ , we have proven

[TABLE]

Using (3.9), we obtain $\|\Phi_{a}(x)-\Phi_{a}(x_{0})\|_{b}\leq$

$\|\Phi_{a}(x)-\Phi_{a}(x_{0})\|_{b}+\|\Phi_{a}(x_{0})-x_{0}\|_{b}\leq(1-e^{-L\cdot T})r_{1}+e^{-L\cdot T}r_{1}=r_{1}$ , hence $\Phi_{a}(x)\in\mathcal{X}$ . ∎

Lemma 3.6.

The constant $r_{2}$ comes from Lemma 3.5. Let $a\in{\mathbb{R}}^{N}_{+}$ . If $\|a\|\leq r_{2}$ , then, for all $x,z\in\mathcal{X}$ , we have $\|\Phi_{a}(x)-\Phi_{a}(z)\|_{b}\leq(1-e^{-L\cdot T})\|x-z\|_{b}$ .

Proof.

Let $x,z\in\mathcal{X}$ . Since, for all $t\in[0,T]$ , we have $e^{-L\cdot t}\|x(t)-x_{0}(t)\|\leq r_{1}$ and $e^{-L\cdot t}\|z(t)-x_{0}(t)\|\leq r_{1}$ , we obtain $\|x(t)-x_{0}(t)|\leq r$ and $\|z(t)-x_{0}(t)\|`\leq r$ , and using Lemma 3.4, we have

$e^{-L\cdot t}\|(\Phi_{a}(x)-\Phi_{a}(z))(t)\|\leq e^{-L\cdot t}\int_{0}^{t}\|f(s,x(s),u_{a}(s))-f(s,z(s),u_{a}(s))\|ds$

$\leq e^{-L\cdot t}\int_{0}^{t}(L\|x(s)-z(s)\|)ds=Le^{-L\cdot t}\int_{0}^{t}(e^{L\cdot s}e^{-L\cdot s}\|x(s)-z(s)\|)ds$

$\leq Le^{-L\cdot t}\int_{0}^{t}(e^{L\cdot s}\|x-z\|_{b})ds\leq Le^{-L\cdot t}\frac{e^{L\cdot t}-1}{L}\|x-z\|_{b}\leq(1-e^{-L\cdot T})\|x-z\|_{b}$ . ∎

Lemma 3.7.

For all $x\in\mathcal{X}$ , the mapping $[a\mapsto\Phi_{a}(x)]$ is continuous from $\overline{B}(0,r_{2})\cap{\mathbb{R}}^{N}_{+}$ into $\mathcal{X}$ .

Proof.

Lemma 3.3 ensures the continuity of this mapping at $a=0$ . Now we fix $\hat{a}\neq 0$ . Let $(a^{n})_{n\in{\mathbb{N}}}$ be a sequence in $\overline{B}(0,r_{2})\cap{\mathbb{R}}^{N}_{+}$ which converges toward $\hat{a}$ . Note that we have

[TABLE]

We denote by $\mu$ the positive measure of Borel-Lebesgue of $[0,T]$ . We have

$\lim_{n\rightarrow+\infty}1_{[t_{i}+b_{i}(a^{n}),t_{i}+b_{i}(a^{n})+a_{i}^{n})}(t)=1_{[t_{i}+b_{i}(\hat{a}),t_{i}+b_{i}(\hat{a})+\hat{a}_{i})}(t)$ , $\mu$ -a.e. $t\in[0,T]$ since the pointwise convergence is clear when $t\in(t_{i}+b_{i}(\hat{a}),t_{i}+b_{i}(\hat{a})+\hat{a}_{i})$ and when $t\in[0,T]\setminus[t_{i}+b_{i}(\hat{a}),t_{i}+b_{i}(\hat{a})+\hat{a}_{i}]$ , and a finite set is a $\mu$ -null set. Similarly we obtain $\lim_{n\rightarrow+\infty}1_{[t_{i}+b_{i}(a^{n})+a_{i}^{n},t_{i+1}+b_{i+1}(a^{n}))}(t)=1_{[t_{i}+b_{i}(\hat{a})+\hat{a}_{i},t_{i+1}+b_{i+1}(\hat{a}))}(t)$ , $\mu$ -a.e. $t\in[0,T]$ , $\lim_{n\rightarrow+\infty}1_{[t_{N}+b_{N}(a^{n})+a^{n}_{N},T]}(t)=1_{[t_{N}+b_{N}(\hat{a})+\hat{a}_{N},T]}(t)$ , $\mu$ -a.e. $t\in[0,T]$ . Since a finite union of $\mu$ -null sets is a $\mu$ -null set, using (3.12) we obtain

[TABLE]

Let $(\tau_{j})_{0\leq j\leq k+1}$ be a subdivision of $[0,T]$ such that the discontinuity points of $u_{0}$ belong to $\{\tau_{j}:0\leq j\leq k\}$ . When $j\in\{0,...,k\}$ we use the function $u_{0}^{j}$ defined in (3.3) and then $\{f(t,x(t),u_{0}^{j}(t)):t\in[\tau_{j},\tau_{j+1}]\}$ is compact an an image of a compact set by a continuous function. Since $\{f(t,x(t),u_{0}(t)):t\in[0,T]\}$ is included in the finite union of compact sets $\bigcup_{0\leq j\leq k}\{f(t,x(t),u_{0}^{j}(t)):t\in[\tau_{j},\tau_{j+1}]\}$ , it is bounded. For all $i\in\{1,...,N\}$ , the set $\{f(t,x(t),v_{i}):t\in[0,T]\}$ is compact under (A4). Note that $\{f(t,x(t),u_{a}(t)):t\in[0,T],a\in\overline{B}(0,r_{2})\cap{\mathbb{R}}_{+}^{N}\}$ is included in $\{f(t,x(t),u_{0}(t)):t\in[0,T]\}\cup(\cup_{1\leq i\leq N}\{f(t,x(t),v_{i}):t\in[0,T]\})$ . This last set is bounded as a finite union of bounded sets, hence there exists $\sigma\in{\mathbb{R}}_{+*}$ such that, for all $t\in[0,T]$ and for all $a\in\overline{B}(0,r_{2})\cap{\mathbb{R}}_{+}^{N}$ , $\|f(t,x(t),u_{a}(t))\|\leq\frac{\sigma}{2}$ . Hence we have

[TABLE]

Note that the constant $\sigma$ is $\mu$ -integrable on $|0,T]$ , and that the functions $[t\mapsto\|f(t,x(t),u_{a^{n}}(t))-f(t,x(t),u_{\hat{a}}(t))\|]$ is a Borel function on $[0,T]$ as a composition of Borel functions. Hence, using (3.13) and (3.14), we can use the theorem of the dominated convergence of Lebesgue and assert that we have

[TABLE]

For all $n\in{\mathbb{N}}$ , for all $t\in[0,T]$ , we have

$e^{-L\cdot t}\|(\Phi_{a^{n}}(x)-\Phi_{\hat{a}}(x))(t)\|\leq e^{-L\cdot t}\int_{0}^{t}\|f(s,x(s),u_{a^{n}}(s))-f(s,x(s),u_{\hat{a}}(s))\|ds$

$\leq\int_{0}^{T}\|f(t,x(t),u_{a^{n}}(t))-f(t,x(t),u_{\hat{a}}(t))\|dt$ , then taking the sup on the $t\in[0,T]$ , and using (3.15), we obtain $\lim_{n\rightarrow+\infty}\|\Phi_{a^{n}}(x)-\Phi_{\hat{a}}(x)\|_{b}=0$ . ∎

Proposition 3.8.

The following assertions hold.

(i)

For all $a\in\overline{B}(0,r_{2})\cap{\mathbb{R}}^{N}_{+}$ , there exists a solution $x_{a}$ of the Cauchy problem (3.2) which is defined on $[0,T]$ all over.

(ii)

The mapping $[a\mapsto x_{a}]$ , from $\overline{B}(0,r_{2})\cap{\mathbb{R}}^{N}_{+}$ into $\mathcal{X}$ , is continuous.

Proof.

From Lemma 3.5, Lemma 3.6 and Lemma 3.7 we can use Theorem 3.2 and assert that, for each $a\in\overline{B}(0,r_{2})\cap{\mathbb{R}}^{N}_{+}$ , there exists a unique fixed point $x_{a}$ of $\Phi_{a}$ in $\mathcal{X}$ , and moreover we know that the mapping $[a\mapsto x_{a}]$ is continuous. From the definition (3.5), we have $x_{a}(t)=\xi_{0}+\int_{0}^{t}f(s,x_{a}(s),u_{a}(s))ds$ for all $t\in[0,T]$ . From (A4), we can see that the function $[s\mapsto f(s,x_{a}(s),u_{a}(s))]$ belongs to $NPC^{0}([0,T],E)$ , and consequently the function $[t\mapsto\int_{0}^{t}f(s,x_{a}(s),u_{a}(s))ds]$ belongs to $PC^{1}([0,T],E)$ , and using a classical result on the differentiation of the primitives functions ([4], chapter 2, Corollary 1, FVR. II6), we obtain that $\underline{d}x_{a}$ is well defined on $[0,T]$ and we have $\underline{d}x_{a}(t)=f(t,x_{a}(t),u_{a}(t))$ on $[0,T]$ . We also have $x_{a}(0)=\xi_{0}$ , and so $x_{a}$ is a solution of the Cauchy problem (3.2). Hence the assertion (i) is proven, and the assertion (ii) results from the continuity of the fixed point with respect to $a$ . ∎

3.4. Properties of differentiability

In this subsection we establish the Fréchet differentiability of the mapping $[a\mapsto x_{a}(T)]$ at the origine. First we recall some properties of the resolvents. We consider the linear ODE $\underline{d}y(t)=D_{2}f(t,x_{0}(t),u_{0}(t))y(t)$ when $t\in[0,T]$ . Following the indications which are given in [10] (Chapter 18) we can assert that, denoting by $R(t,s)$ the resolvent of this linear equation, we have $R(t_{3},t_{1})=R(t_{3},t_{2})R(t_{2},t_{1})$ , $R(s,s)=id_{E}$ , $R(s,t)=R(t,s)^{-1}$ , $R(\cdot,s)\in PC^{1}([0,T],\mathcal{L}(E,E))$ . We define $\underline{d}_{1}R(t,s):=\underline{d}R(\cdot,s)(t)$ and we have, for all $t\in[0,T]$ , $\underline{d}_{1}R(t,s)=D_{2}f(t,x_{0}(t),u_{0}(t))R(t,s)$ , and from $R(t,s)=R(s,t)^{-1}$ , we obtain that $R(t,\cdot)\in PC^{1}([0,T],\mathcal{L}(E,E))$ . We set $\underline{d}_{2}R(t,s):=\underline{d}R(t,\cdot)(s)$ . The second step is the following fundamental result due to Michel.

Lemma 3.9.

*([9] Lemma 1) There exist $r_{3}\in(0,r_{2})$ , $\Lambda\in\mathcal{L}({\mathbb{R}}^{N},E)$ and a mapping $\varrho:\overline{B}(0,r_{3})\cap{\mathbb{R}}^{N}_{+}\rightarrow E$ such that $\lim_{a\rightarrow 0}\varrho(a)=0$ , and such that, for all $a\in\overline{B}(0,r_{3})\cap{\mathbb{R}}^{N}_{+}$ , we have $x_{a}(T)=x_{0}(T)+\Lambda a+\|a\|\varrho(a)$ .

More precisely, $\Lambda a=\sum_{i=1}^{N}a_{i}R(T,t_{i})[f(t_{i},x_{0}(t_{i}),v_{i})-f(t_{i},x_{0}(t_{i}),u_{0}(t_{i}))]$ .*

The following result proves that the mapping $[a\mapsto x_{a}(T)]$ is a restriction of a mapping (defined on a neighborhood of the origine in ${\mathbb{R}}^{N}$ ) which is Fréchet differentiable at the origine.

Proposition 3.10.

The constant $r_{3}$ and the linear mapping $\Lambda$ are provided by Lemma 3.9. There exist $r_{4}\in(0,r_{3}]$ and a mapping $\kappa\in C^{0}(\overline{B}(0,r_{4}),\Omega)$ which is Fréchet differentiable at $a=0$ and which satisfies, for all $a\in\overline{B}(0,r_{3})\cap{\mathbb{R}}^{N}_{+}$ , $\kappa(a)=x_{a}(T)$ , and $D\kappa(0)=\Lambda$ .

Proof.

As a norm on ${\mathbb{R}}^{N}$ we choose the norm associated to the usual inner product. We denote by $\pi$ the best approximation projector from ${\mathbb{R}}^{N}$ on the closed convex cone ${\mathbb{R}}^{N}_{+}$ , [2] (p. 18, Theorem 1). We know that $\pi$ is $1$ -Lipschitzean. It is easy to verify that $\pi(\overline{B}(0,r_{3}))\subset(\overline{B}(0,r_{3})\cap{\mathbb{R}}^{N}_{+})$ . Using Proposition 3.8 note that the mapping $\varrho$ is continuous on $\overline{B}(0,r_{3})\cap{\mathbb{R}}^{N}_{+}$ since we have

[TABLE]

We set $\overline{\varrho}:=\varrho\circ\pi\in C^{0}(\overline{B}(0,r_{3}),E)$ . We define $\kappa:\overline{B}(0,r_{3})\rightarrow E$ by setting $\kappa(a):=x_{0}(T)+\Lambda a+\|a\|\overline{\varrho}(a)$ . Then $\kappa$ is continuous since $\Lambda$ and $\overline{\varrho}$ are continuous. We have also $\lim_{a\rightarrow 0}\overline{\varrho}(a)=\varrho(0)=0$ which implies that $\kappa$ is Fréchet differentiable at [math], and that $D\kappa(0)=\Lambda$ . Since $x_{0}(T)\in\Omega$ with $\Omega$ open, since $\lim_{a\rightarrow 0}(\Lambda a+\|a\|\overline{\varrho}(a))=0$ , reducing $r_{3}$ to $r_{4}\in(0,r_{3}]$ we can assert that $\kappa(\overline{B}(0,r_{4}))\subset\Omega$ . ∎

4. Proof of the principle for the problem of Mayer

We describe the general method. When we fix $S=((t_{i},v_{i}))_{1\leq i\leq N}\in{\mathbb{S}}$ , we reduce the initial dynamic problem of Mayer to a finite-dimensional static optimization problem where the unknow is the vector $a$ of the thicknessess of the needles. Using a multiplier rule on this static problem we obtain a list of multipliers which is dependent on $S$ . This is the matter of the first subsection.

In the second subsection we prove that we can choose such a list of multipliers which is independent of $S\in{\mathbb{S}}$ , and from this particular list we build the multipliers and the adjoint function of Theorem 1.2.

4.1. Reduction to the finite dimension

We arbitrarily fix $S\in{\mathbb{S}}$ . Since $(x_{0},u_{0})$ is optimal for $(\mathcal{M}^{\prime})$ , [math] is a solution of the following finite-dimensional optimization problem

[TABLE]

Using the mapping $\kappa$ of Proposition 3.10 and $(b^{*}_{i})_{1\leq i\leq N}$ , the dual basis of the canonical basis of ${\mathbb{R}}^{N}$ , [math] is also solution of the following finite-dimensional optimization problem

[TABLE]

since, when $a\in B(0,r_{4})$ is admissible for $(\mathcal{F}_{S}^{1})$ then necessarily we have $a\in B(0,r_{4})\cap{\mathbb{R}}^{N}_{+}$ . The interest to introduce $(\mathcal{F}_{S}^{1})$ is that this problem enters into the setting of the multiplier rule of [3] while it is not the case for $(\mathcal{F}_{S})$ .

Note that Michel in [9] works on $(\mathcal{F}_{S})$ , not on $(\mathcal{F}_{S}^{1})$ . To do that, he uses a multiplier rule given in [8], which concerns problems on a convex cone.

Lemma 4.1.

Let $S=((t_{i},v_{i}))_{1\leq i\leq N}\in{\mathbb{S}}$ . There exist $(\lambda_{\alpha})_{0\leq\alpha\leq m}\in{\mathbb{R}}^{1+m}$ and $(\mu_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}$ which satisfy the following conditions.

(a)

$(\lambda_{\alpha})_{0\leq\alpha\leq m}$ * and $(\mu_{\beta})_{1\leq\beta\leq q}$ are not simulteanous equal to zero.*

(b)

$\forall\alpha=0,...,m$ , $\lambda_{\alpha}\geq 0$ .

(c)

$\forall\alpha=1,...,m$ , $\lambda_{\alpha}g^{\alpha}(x_{0}(T))=0$ .

(d)

$\forall i=1,...,N$ , $p(t_{i})[f(t_{i},x_{0}(t_{i}),v_{i})-f(t_{i},x_{0}(t_{i}),u_{0}(t_{i}))]\leq 0$ , where

$p(t):=(\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{0}(T)))R(T,t)$ , $R(t,s)$ being defined just before Lemma 3.9.

Proof.

Using Proposition 3.10, (A1) and (A2), the assumptions of Theorem 3.2 in [3] are fulfilled, and so we know that there exist $(\lambda_{\alpha})_{0\leq\alpha\leq m}\in{\mathbb{R}}^{1+m}$ , $(\mu_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}$ , and $(\nu_{i})_{1\leq i\leq N}\in{\mathbb{R}}^{N}$ such that the following conditions are fulfilled.

(i)

$(\lambda_{\alpha})_{0\leq\alpha\leq m}$ , $(\mu_{\beta})_{1\leq\beta\leq q}$ and $(\nu_{i})_{1\leq i\leq N}$ are not simultaneously equal to zero.

(ii)

$\forall\alpha=0,...,m$ , $\lambda_{\alpha}\geq 0$ .

(iii)

$\forall i=1,...,N$ , $\nu_{i}\geq 0$ .

(iv)

$\forall\alpha=1,...,m$ , $\lambda_{\alpha}g^{\alpha}(x_{0}(T))=0$ .

(v)

$\forall i=1,...,N$ , $\nu_{i}b^{*}_{i}0=0$ .

(vi)

$\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))D\kappa(0)+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{0}(T))D\kappa(0)+\sum_{i=1}^{N}\nu_{i}b^{*}_{i}=0$ .

To prove (a), we proceed by contradiction, we assume that $(\lambda_{\alpha})_{0\leq\alpha\leq m}$ and $(\mu_{\beta})_{1\leq\beta\leq q}$ are equal to zero. Hence, using (i), we have $(\nu_{i})_{1\leq i\leq N}$ different to zero. Using (vi) we obtain $\sum_{i=1}^{N}\nu_{i}b^{*}_{i}=0$ , and since the $b^{*}_{i}$ are linearly independent we obtain that $(\nu_{i})_{1\leq i\leq N}$ is equal to zero: this is a contradiction. Consequently (a) is proven. Assertion (b) comes from (i) and (c) comes from (iv). When $a\in{\mathbb{R}}^{N}_{+}$ , using (iii), we have $\nu_{i}a_{i}\geq 0$ , and from (vi) we obtain

[TABLE]

which implies the following relation, for all $a\in{\mathbb{R}}^{N}_{+}$ ,

[TABLE]

Since $D\kappa(0)a=\sum_{i=1}^{N}a_{i}R(T,t_{i})[f(t_{i},x_{0}(t_{i}),v_{i})-f(t_{i},x_{0}(t_{i}),u_{0}(t_{i}))]$ , the relation 4.1) is equivalent to

[TABLE]

which is equivalent to the conclusion (d). ∎

4.2. End of the proof of Part (I)

In this subsection we follow [9]. Since the set of the lists of multipliers is a cone, we can normalized them by adding the condition $\sum_{\alpha=0}^{m}|\lambda_{\alpha}|+\sum_{\beta=1}^{q}|\mu_{\beta}|=1$ . When $S\in{\mathbb{S}}$ , we define $K(S)$ as the set of the $((\lambda_{\alpha})_{0\leq\alpha\leq m},(\mu_{\beta})_{1\leq\beta\leq q})$ which verify the conclusions (a, b, c, d) of Lemma 4.1 and the additional condition $\sum_{\alpha=0}^{m}|\lambda_{\alpha}|+\sum_{\beta=1}^{q}|\mu_{\beta}|=1$ . Denoting by $\Sigma(0,1)$ the unit sphere of ${\mathbb{R}}^{1+m+q}$ , we have $K(S)\subset\Sigma(0,1)$ , $K(S)$ is closed since it is defined by wide inequalities and equalities, If $(S^{\ell})_{1\leq\ell\leq n}=((t_{i}^{\ell},v_{i}^{\ell})_{1\leq i\leq N^{\ell}})_{1\leq{\ell}\leq n}$ is a finite family of elements of ${\mathbb{S}}$ , then setting $N:=\sum_{{\ell}=1}^{n}N^{\ell}$ , we can build $0<s_{1}\leq s_{2}\leq...\leq s_{N}<T$ and $w_{1},w_{2},...,w_{N}\in U$ such that $\bar{S}=(s_{j},w_{j})_{1\leq j\leq N}\in{\mathbb{S}}$ and such that, for all ${\ell}\in\{1,...,n\}$ , for all $i\in\{1,...,N^{\ell}\}$ , there exists a unique $j\in\{1,...,N\}$ verifying $t^{\ell}_{i}=s_{j}$ ; and then we take $w_{j}:=v^{\ell}_{i}$ . Note that, for all ${\ell}\in\{1,...,n\}$ , the values of $S^{\ell}$ belong to the values of $\bar{S}$ . If $((\lambda_{\alpha})_{0\leq\alpha\leq m},(\mu_{\beta})_{1\leq\beta\leq q})\in K(\bar{S})$ , the conclusions (a, b, c, d) of Lemma 4.1 are satisfied for the values of ${S}$ , they are also satisfied for the values of $S^{\ell}$ for alll ${\ell}\in\{1,...,n\}$ , which implies that $((\lambda_{\alpha})_{0\leq\alpha\leq m},(\mu_{\beta})_{1\leq\beta\leq q})\in\bigcap_{1\leq\ell\leq n}K(S^{\ell})\neq\emptyset$ . Hence, this last finite intersection is nonempty.

Since $\Sigma(0,1)$ is compact, the finite intersection property of the closed subsets of $\Sigma(0,1)$ implies that $\bigcap_{S\in{\mathbb{S}}}K(S)\neq\emptyset$ , [6] (p. 154, Appendix). Now we choose an element $((\lambda_{\alpha})_{0\leq\alpha\leq m},(\mu_{\beta})_{1\leq\beta\leq q})$ in $\bigcap_{S\in{\mathbb{S}}}K(S)$ , and we consider $p$ defined in the conclusion (d) of Lemma 4.1 for this chosen $((\lambda_{\alpha})_{0\leq\alpha\leq m},(\mu_{\beta})_{1\leq\beta\leq q})$ . After the building of the $K(S)$ , we see that the conclusions (NN), (Si) and (S $\ell$ ) are proven. We take $t\in(0,T)$ and $v\in U$ , and then we have $(t,v)\in{\mathbb{S}}$ . Then the conclusion (d) of Lemma 4.1 implies $p(t)[f(t,x_{0}(t),v)-f(t,x_{0}(t),u_{0}(t))]\leq 0$ . Doing $t\rightarrow 0+$ and $t\rightarrow T-$ , we obtain the inequality for all $t\in[0,T]$ . Hence the conclusion (MP.M) is proven. Now we want to prove that $p$ is a solution of the adjoint equation. Using the differentiability of $R(\cdot,s)$ outside of a finite set, $R(t,s)=R(s,t)^{-1}$ , the Fréchet differentiability of the inversion operator $\mathcal{I}:Isom(E,E)\rightarrow Isom(E,E)$ , $\mathcal{I}(L):=L^{-1}$ , and the chain rule we obtain the following formula.

[TABLE]

Differentiating $p(t)=(\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{à}(T)))R(T,t)$ with respect to $t$ , we obtain

$\underline{d}p(t)=(\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{à}(T)))\underline{d}_{2}R(T,t)$ and using (4.2), we obtain

$\underline{d}p(t)=(\sum_{\alpha=0}^{m}\lambda_{\alpha}Dg^{\alpha}(x_{0}(T))+\sum_{\beta=1}^{q}\mu_{\beta}Dh^{\beta}(x_{à}(T)))(-R(T,t)D_{2}f(t,x_{0}(t),u_{0}(t))$

$=-p(t)D_{2}f(t,x_{0}(t),u_{0}(t))=-D_{2}H_{M}(t,x_{0}(t),u_{0}(t),p(t))$ , and so $p$ satisfies (AE). From the equality $R(T,T)=id_{E}$ and from the formula which defines $p$ we see that the conclusion (TC) holds. To prove (CH.M) we need the following result.

Lemma 4.2.

Let $\phi\in C^{0}([0,T]\times U,{\mathbb{R}})$ and $u\in NPC^{0}([0,T],U)$ such that $\phi(t,u(t))=\max_{\zeta\in U}\phi(t,\zeta)$ for all $t\in[0,T]$ . Then $\bar{\phi}:=[t\mapsto\phi(t,u(t))]\in C^{0}([0,T],{\mathbb{R}})$ .

Proof.

Since $u$ is right continuous on $[0,T)$ and $\phi$ is continuous, $\bar{\phi}$ is reght continuous on $[0,T)$ . Since $u$ is left continuous at $T$ and $\phi$ is continuous, we have $\bar{\phi}$ is left continuous at $T$ . Now we ought to prove that $\bar{\phi}$ is left continuous on $(0,T)$ . Let $t\in(0,T)$ ; for all $h\in(-t,0)$ , we have $\phi(t,u(t+h))\leq\phi(t,u(t))$ and $\phi(t+h,u(t))\leq\phi(t+h,u(t+h))$ , and doing $h\rightarrow 0-$ , we obtain $\phi(t,u(t-))\leq\phi(t,u(t))$ and $\phi(t,u(t))\leq\phi(t,u(t-))$ . Hence we have $\phi(t,u(t-))=\phi(t,u(t))$ , i.e. $\bar{\phi}(t-)=\bar{\phi}(t)$ . ∎

If we set $\phi(t,\zeta):=H_{M}(t,x_{0}(t),\zeta,p(t))$ , from (MP.M) we have $\bar{\phi}=\bar{H}_{M}$ and the conclusion (CH.M) is proven. Hence Part (I) of Theorem 1.2 is completely proven for the problem of Mayer.

4.3. Proof of part (II)

We need of the following result.

Lemma 4.3.

Let $\phi\in C^{0}([0,T]\times U,{\mathbb{R}})$ such that, for all $(t,\zeta)\in[0,T]\times U$ , the partial derivative with respect to the first variable $\partial_{1}\phi(t,\zeta)$ exists, and $\partial_{1}\phi$ is continuous on $[0,T]\times U$ . Let $u\in NPC^{0}([0,T],U)$ such that $\bar{\phi}(t):=\phi(t,u(t))=\max_{\zeta\in U}\phi(t,\zeta)$ . Then the two following assertions hold.

(i)

When $t$ is a continuity point of $u$ , then $\bar{\phi}$ is differentiable at $t$ and we have $\bar{\phi}^{\prime}(t)=\partial_{1}\phi(t,u(t))$ .

(ii)

$\bar{\phi}\in PC^{1}([0,T],{\mathbb{R}})$ .

Proof.

From Lemma 4.2 we know that $\bar{\phi}\in C^{0}([0,T],{\mathbb{R}})$ . Let $t$ be a continuity point of $u$ . For all $h>0$ small enough, we set $\Delta(h):=\bar{\phi}(t+h)-\bar{\phi}(t)$ . We have $\phi(t+h,u(t))-\phi(t,u(t))\leq\phi(t+h,u(t+h))-\phi(t,u(t))=\Delta(h)$ and $\phi(t+h,u(t+h))-\phi(t,u(t+h))\geq\phi(t+h,u(t+h))-\phi(t,u(t))=\Delta(h)$ . Using a classical theorem of Lagrange for the functions of one real variable ([1], p. 142), we know that there exist $\theta^{h}_{1}$ and $\theta^{h}_{2}$ in $(0,1)$ such that $\partial_{1}\phi(t+\theta^{h}_{1}h,u(t))h\leq\Delta(h)\leq\partial_{1}\phi(t+\theta^{h}_{2}h,u(t+h))h$ which implies $\partial_{1}\phi(t+\theta^{h}_{1}h,u(t))\leq\frac{1}{h}\Delta(h)\leq\partial_{1}(t+\theta^{h}_{2}h,u(t+h))$ , and doing $h\rightarrow 0+$ and using the continuity of $\partial_{1}\phi$ and the continuity of $u$ at $t$ , we obtain $\lim_{h\rightarrow 0+}\frac{\Delta(h)}{h}=\partial_{1}\phi(t,u(t)$ . These last inequalities imply that the right derivative $\bar{\phi}^{\prime}_{R}(t)$ exists and is equal to $\partial_{1}\phi(t,u(t))$ . Doing a similar reasonning, we obtain that the left derivative $\bar{\phi}^{\prime}_{L}(t)$ exists and is equal to $\partial_{1}\phi(t,u(t))$ . Hence assertion (i) is proven.

Assertion (ii) is a consequence of assertion (i) using the continuity of $\partial_{1}\phi$ and the normalized piecewise continuity of $u$ . ∎

Setting $\phi(t,\zeta):=H_{M}(t,x_{0}(t),\zeta,p(t))$ , we have $\bar{\phi}=\bar{H}_{M}$ and Part (II) is a corollary of Lemma 4.3.

4.4. Proof of Part (III)

We proceed by contradiction; if there exists $t_{0}\in[0,T]$ such that $p(t_{0})=0$ , since (AE) is linear, by using the uniqueness of the solution of the Cauchy problem ((AE), $p(t_{0})=0$ ), we obtain that $p(t)=0$ for all $t\in[0,T]$ , notably $p(T)=0$ . Hence using (TC), (Si) and (S ${\ell}$ ), (QC, 0) implies that $(\forall\alpha=0,...,m,\;\lambda_{\alpha}=0)$ and $(\forall\beta=1,...,q,\mu_{\beta}=0)$ which is a contradiction with (NN). Hence Part (III) is proven.

5. Proof of the principle for the problem of Bolza

It is well known that we can transform a problem of Bolza into a problem of Mayer [10] (p. 393, Chapter 18). We realize such a transformation to deduce Theorem 1.1 from Theorem 1.2. We introduce an additional state variable denoted by $\sigma$ . We set $X:=(\sigma,x)\in{\mathbb{R}}\times\Omega$ as a new state variable; we set $F(t,(\sigma,x),u):=(f^{0}(t,x,u),f(t,x,u))$ as the new vectorfield; we set $G^{0}(\sigma,x):=\sigma+g^{0}(x)$ , $G^{\alpha}(\sigma,x):=g^{\alpha}(x)$ when $\alpha=1,...,m$ , and we set $H^{\beta}(\sigma,x):=h^{\beta}(x)$ when $\beta=1,...,q$ . We formulate te new following problem of Mayer:

[TABLE]

5.1. Proof of Part (I)

We denote by $\varpi_{1}:{\mathbb{R}}\times E\rightarrow{\mathbb{R}}$ and by $\varpi_{2}:{\mathbb{R}}\times E\rightarrow E$ the two projections. When $(x,u)$ is an admissible process for $(\mathcal{B})$ , setting $\sigma(t):=\int_{0}^{t}f(s,x(s),u(s))ds$ , we see that $((\sigma,x),u)$ is an admissible process for $(\mathcal{MB})$ and we have $G^{0}((\sigma,x))(T)=\int_{0}^{T}f^{0}(t,x(t),u(t))dt+g^{0}(x(T))$ . Conversely when $(X,u)$ is an admissible process for $(\mathcal{MB})$ , setting $x:=\varpi_{2}\circ X$ , we see that $(x,u)$ is an admissible process for $(\mathcal{B})$ , and setting $\sigma:=\varpi_{1}\circ X$ , we have $\int_{0}^{T}f^{0}(t,x(t),u(t))dt+g^{0}((x(T))=\sigma(T)+g^{0}(x(T))=G^{0}(X(T))$ . Hence since $(x_{0},u_{0})$ is optimal for $(\mathcal{B})$ , we obtain that $(X_{0},u_{0})=((\sigma_{0},x_{0}),u_{0})$ is optimal for $(\mathcal{MB})$ . The assumptions of Theorem 1.1 imply that the assumptions of Theorem 1.2 are fulfilled, where $(\mathcal{M})$ is replaced by $(\mathcal{MB})$ . Hence there exist $(\Lambda_{\alpha})_{0\leq\alpha\leq m}\in{\mathbb{R}}^{1+m}$ , $(M_{\beta})_{1\leq\beta\leq q}\in{\mathbb{R}}^{q}$ and $P\in PC^{1}([0,T],({\mathbb{R}}\times E)^{*})$ such that the conclusions of Theorem 1.2 hold. When $P\in({\mathbb{R}}\times E)^{*}$ , we define $p_{0}\in{\mathbb{R}}$ and $p\in E^{*}$ by setting $p_{0}:=P(1,0)$ and $p\xi:=P(0,\xi)$ for all $\xi\in E$ , and so we have $P(r,\xi)=p_{0}r+p\xi$ for all $(r,\xi)\in{\mathbb{R}}\times E$ . The Hamiltonian of $(\mathcal{MB})$ is $H_{M}(t,(\sigma,x),u,(p_{0},p)):=(p_{0},p)F(t,(\sigma,x),u)=p_{0}f^{0}(t,x,u)+pf(t,x,u)$ . The conclusions of Theorem 1.2 provide the following conditions.

(i)

$(\Lambda_{\alpha})_{0\leq\alpha\leq m}$ and $(M_{\beta})_{1\leq\beta\leq q}$ are not simulteanously equal to zero.

(ii)

$\forall\alpha=0,...,m$ , $\Lambda_{\alpha}\geq 0$ .

(iii)

$\forall\alpha=1,...,m$ , $\Lambda_{\alpha}G^{\alpha}(X_{0}(T))=0$ .

(iv)

$P(T)=\sum_{\alpha=0}^{m}\Lambda_{\alpha}DG^{\alpha}(X_{0}(T))+\sum_{\beta=1}^{q}M_{\beta}DH^{\beta}(X_{0}(T))$ .

(v)

$\underline{d}P(t)=-D_{2}H_{M}(t,X_{0}(t),u_{0}(t),P(t))$ for all $t\in|0,T]$ .

(vi)

$H_{M}(t,X_{0}(t),u_{0}(t),P(t))\geq H_{M}(t,X_{0}(t),\zeta,P(t))$ for all $t\in[0,T]$ and for all $\zeta\in U$ .

(vii)

$[t\mapsto H_{M}(t,X_{0}(t),u_{0}(t),P(t))]\in PC^{1}([0,T],{\mathbb{R}})$ .

We set $\lambda_{\alpha}:=\Lambda_{\alpha}$ for all $\alpha=0,...,m$ , and $\mu_{\beta}:=M_{\beta}$ for all $\beta=1,...,q$ . Hence (i) and (ii) imply that (NN) and (Si) of Theorem 1.1 hold. From (iii) we obtain $\lambda_{\alpha}g^{\alpha}(x_{0}(T))=0$ for all $\alpha=1,...,m$ , and so (S ${\ell}$ ) of Theorem 1.1 holds. About the partial differentials, note that we have, for the partial differentials with respect to the first variable: $D_{1}G^{0}(\sigma,x_{0}(T))=id_{{\mathbb{R}}}$ , $D_{1}G^{\alpha}(\sigma,x_{0}(T))=0$ when $\alpha=1,...,m$ , $D_{1}H^{\beta}(\sigma,x_{0}(T))=0$ when $\beta=1,...,q$ , and for the partial differentials with respect to the second variable: $D_{2}G^{0}(\sigma,x_{0}(T))=Dg^{0}(x_{0}(T))$ , $D_{2}G^{\alpha}(\sigma,x_{0}(T))=Dg^{\alpha}(x_{0}(T))$ when $\alpha=1,...,m$ , and $D_{2}H^{\beta}(\sigma,x_{0}(T))=Dh^{\beta}(x(T))$ when $\beta=1,...,q$ . Hence from (iv) we deduce the two following relations.

[TABLE]

This last equatility is just the conclusion (TC) of Theorem 1.1. From (v) we obtain that $\underline{d}p_{0}(t)=0$ for all $t\in[0,T]$ , and then using (5.1) we have the following relation.

[TABLE]

From (v) we also deduce that, for all $t\in[0,T]$ , we have

$\underline{d}p(t)=\lambda_{0}D_{2}f^{0}(t,x_{0}(t),u_{0}(t))+p(t)D_{2}f(t,x_{0}(t),u_{0}(t))$ which is (AE.B) of Theorem 1.1. From (vi) we deduce that, for all $t\in[0,T]$ and for all $\zeta\in U$ , we have

$\lambda_{0}f^{0}(t,x_{0}(t),u_{0}(t))+p(t)f(t,x_{0}(t),u_{0}(t))\geq\lambda_{0}f^{0}(t,x_{0}(t),\zeta)+p(t)f(t,x_{0}(t),\zeta)$ which is the conclusion (MP.B) of Theorem 1.1.

From (vii), since $H_{M}(t,X_{0}(t),u_{0}(t),P(t))=H_{B}(t,x_{0}(t),u_{0}(t),p(t),\lambda_{0})$ we obtain (CH.B). Hence Part (I) of Theorem 1.1 is completely proven.

5.2. Proof of Part (II)

Using Part (II) of Theorem 1.2 on ( $\mathcal{MB}$ ), the existence and the continuity of $\partial_{1}f^{0}$ and of $\partial_{1}f$ imply the existence and the continuity of $\partial_{1}F$ . We obtain that $[t\mapsto H_{B}(t,x_{0}(t),u_{0}(t),p(t),\lambda_{0})=H_{M}(t,X_{0}(t),u_{0}(t),P(t))]\in PC^{1}([0,T],{\mathbb{R}})$ , and when $t$ is a continuity point of $u_{0}$ , we have $\bar{H}^{\prime}_{B}(t)=\bar{H}^{\prime}_{M}(t)=\lambda_{0}\partial_{1}f^{0}(t,x_{0}(t),u_{0}(t))+p(t)\partial_{1}f(t,x_{0}(t),u_{0}(t))$ . Hence Part (II) is proven.

5.3. Proof of Part (III)

We procced by contradiction assuming that there exists $t_{*}\in[0,T]$ such $(\lambda_{0},p(t_{*}))=(0,0)$ . Since $\lambda_{0}=0$ , (AE.B) becomes an homogeneous linear equation, and using the uniqueness of the cauchy problem ((AE.B), $p(t_{*})=0$ ), we obtain that $p$ is equal to zero on $[0,T]$ , notably we have $p(T)=0$ . Hence using (TC), (Si), (S ${\ell}$ ), (QC, 1) implies that $(\forall\alpha=1,...,m,\lambda_{\alpha}=0)$ and $(\forall\beta=1,...,q,\mu_{\beta}=0)$ . Since $\lambda_{0}=0$ , we have $(\forall\alpha=0,...,m,\lambda_{\alpha}=0)$ and $(\forall\beta=1,...,q,\mu_{\beta}=0)$ which is a contradiction with (NN).

Bibliography12

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] V.M. ALEXEEV, V.M. TIHOMIROV, and S.V. FOMIN, Commande optimale, french edition, MIR, Moscow, 1982.
2[2] J.-P. AUBIN, Applied functional analysis, John Wiley and Sons, Inc., New York, 1979.
3[3] J. BLOT, On the multipliers rules , Optimization, 65 (2) 2018, 947-955.
4[4] N. BOURBAKI, Fonctions d’une variable réelle; théorie élémentaire, Hermann, Paris 1976.
5[5] F.H. CLARKE, Yu.S. LEDYAEV, R.J. STERN, and P.R. WOLENSKI, Nonsmooth analysis and control theory, Springer-Verlag New York Inc., New York, 1998.
6[6] J. DUGUNDJI and A. GRANAS, Fixed point theory; volume 1, PWN-Polish Scientific Publishers, Warsawa, 1982.
7[7] A.D. IOFFE and V.M. TIHOMIROV, Theory of extremal problems, english edition, North-Holland Pub. Co., Amsterdam, 1979.
8[8] P. MICHEL, Problèmes des inégalités et application à la programmation dans le cas où l’espace d’arrivée est de dimension finie , C.R. Acad. Sc. Paris, t. 273, série B, 1974, 389-391.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

On the proof of Michel of the Maximum Pontryagin principle

Abstract.

1. Introduction

Theorem 1.1**.**

Theorem 1.2**.**

2. Function spaces

2.1. Piecewise continuous functions.

Definition 2.1**.**

2.2. Piecewise continuously differentiable functions.

2.3. Rewording of the problems

3. The needlelike variations

3.1. Two results of the metric spaces theory

Theorem 3.1**.**

Theorem 3.2**.**

3.2. Definitions of the needlelike variations

3.3. Properties of continuity

Lemma 3.3**.**

Lemma 3.4**.**

Proof.

Lemma 3.5**.**

Proof.

Lemma 3.6**.**

Proof.

Lemma 3.7**.**

Proof.

Proposition 3.8**.**

Proof.

3.4. Properties of differentiability

Lemma 3.9**.**

Proposition 3.10**.**

Proof.

4. Proof of the principle for the problem of Mayer

4.1. Reduction to the finite dimension

Lemma 4.1**.**

Proof.

4.2. End of the proof of Part (I)

Lemma 4.2**.**

Proof.

4.3. Proof of part (II)

Lemma 4.3**.**

Proof.

4.4. Proof of Part (III)

5. Proof of the principle for the problem of Bolza

5.1. Proof of Part (I)

5.2. Proof of Part (II)

5.3. Proof of Part (III)

Theorem 1.1.

Theorem 1.2.

Definition 2.1.

Theorem 3.1.

Theorem 3.2.

Lemma 3.3.

Lemma 3.4.

Lemma 3.5.

Lemma 3.6.

Lemma 3.7.

Proposition 3.8.

Lemma 3.9.

Proposition 3.10.

Lemma 4.1.

Lemma 4.2.

Lemma 4.3.