Second order necessary and sufficient optimality conditions for singular solutions of partially-affine control problems

M. Soledad Aronna

arXiv:1703.00875·math.OC·January 15, 2019


TL;DR

This paper develops second order necessary and sufficient optimality conditions for singular solutions in partially-affine control problems, enhancing understanding of optimality in systems with mixed affine and nonlinear controls.

Contribution

It introduces new second order conditions and Goh pointwise conditions for singular solutions in partially-affine control problems, expanding theoretical tools for optimal control analysis.

Findings

1. Derived second order necessary and sufficient conditions for weak optimality.

2. Established Goh pointwise necessary optimality conditions.

3. Provided an illustrative example demonstrating the theoretical results.

Abstract

In this article we study optimal control problems for systems that are affine with respect to some of the control variables and nonlinear in relation to the others. We consider finitely many equality and inequality constraints on the initial and final values of the state. We investigate singular optimal solutions for this class of problems, for which we obtain second order necessary and sufficient conditions for weak optimality in integral form. We also derive Goh pointwise necessary optimality conditions. We show an example to illustrate the results.

Equations

$$\dot{x}=f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u),\quad \text{a.e. on } [0,T]. \tag{1}$$

$$k_{1}(x)=\mathcal{O}(k_{2}(x)),$$

$$k_{1}(x)=o(k_{2}(x)).$$

$$\varphi_{0}(x(0),x(T))\to\min,$$

$$\dot{x}=F(x,u,v),\quad \text{a.e. on } [0,T], \tag{2}$$

$$\eta_{j}(x(0),x(T))=0,\quad \text{for } j=1,\dots,d_{\eta}, \tag{3}$$

$$\varphi_{i}(x(0),x(T))\leq 0,\quad \text{for } i=1,\dots,d_{\varphi}, \tag{4}$$

$$u(t)\in U,\quad v(t)\in V,\quad \text{a.e. on } [0,T], \tag{5}$$

$$F(x,u,v):=f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u).$$

$$\|x-\hat{x}\|_{\infty}<\varepsilon,\quad \|u-\hat{u}\|_{\infty}<\varepsilon,\quad \|v-\hat{v}\|_{\infty}<\varepsilon.$$

$$H[\lambda](x,u,v,t):=p(t)\Big(f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u)\Big),$$

$$\ell[\lambda](x_{0},x_{T}):=\sum_{i=0}^{d_{\varphi}}\alpha_{i}\varphi_{i}(x_{0},x_{T})+\sum_{j=1}^{d_{\eta}}\beta_{j}\eta_{j}(x_{0},x_{T}),$$

$$\mathcal{L}[\lambda](w):=\ell[\lambda](x(0),x(T))+\int_{0}^{T}p\Big(f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u)-\dot{x}\Big)\mathrm{d}t. \tag{6}$$

$$\varphi_{i}(\hat{x}(0),\hat{x}(T))=0,\quad \text{for all } i=1,\dots,d_{\varphi}.$$

$$|\alpha|+|\beta|=1, \tag{7}$$

$$\alpha=(\alpha_{0},\alpha_{1},\dots,\alpha_{d_{\varphi}})\geq 0, \tag{8}$$

$$-\dot{p}(t)=H_{x}[\lambda](\hat{x}(t),\hat{u}(t),\hat{v}(t),t), \tag{9}$$

$$p(0)=-D_{x_{0}}\ell[\lambda](\hat{x}(0),\hat{x}(T)),\qquad p(T)=D_{x_{T}}\ell[\lambda](\hat{x}(0),\hat{x}(T)), \tag{10}$$

$$\left\{\begin{array}{l}H_{u}[\lambda](\hat{x}(t),\hat{u}(t),\hat{v}(t),t)=0,\\ H_{v}[\lambda](\hat{x}(t),\hat{u}(t),\hat{v}(t),t)=0,\end{array}\right.\quad \text{a.e. on } [0,T]. \tag{11}$$

$$\dot{\bar{x}}=F_{x}\bar{x}+F_{u}\bar{u}+F_{v}\bar{v},\quad \text{a.e. on } [0,T], \tag{12}$$

$$\bar{x}(0)=\bar{x}_{0}. \tag{13}$$

$$D\eta_{j}(\hat{x}(0),\hat{x}(T))(\bar{x}(0),\bar{x}(T))=0,\quad \text{for } j=1,\dots,d_{\eta}, \tag{14}$$

$$D\varphi_{i}(\hat{x}(0),\hat{x}(T))(\bar{x}(0),\bar{x}(T))\leq 0,\quad \text{for } i=0,\dots,d_{\varphi}. \tag{15}$$

$$\mathcal{C}_{2}:=\{\bar{w}(\cdot)\in\mathcal{W}_{2}:\ \text{(12)-(13) and (14)-(15) hold}\}, \tag{16}$$

$$\mathcal{C}:=\mathcal{C}_{2}\cap\mathcal{W}. \tag{17}$$

$$\begin{split}\Omega[\lambda](\bar{x},\bar{u},\bar{v}):={}&\tfrac{1}{2}D^{2}\ell[\lambda](\hat{x}(0),\hat{x}(T))(\bar{x}(0),\bar{x}(T))^{2}+\int_{0}^{T}\Big(\tfrac{1}{2}\bar{x}^{\top}H_{xx}[\lambda]\bar{x}\\ &+\bar{u}^{\top}H_{ux}[\lambda]\bar{x}+\bar{v}^{\top}H_{vx}[\lambda]\bar{x}+\tfrac{1}{2}\bar{u}^{\top}H_{uu}[\lambda]\bar{u}+\bar{v}^{\top}H_{vu}[\lambda]\bar{u}\Big)\mathrm{d}t.\end{split}$$

$$\mathcal{L}[\lambda](w)=\mathcal{L}[\lambda](\hat{w})+\Omega[\lambda](\delta x,\delta u,\delta v)+\omega[\lambda](\delta x,\delta u,\delta v)+\mathcal{R}(\delta x,\delta u,\delta v),$$

$$\omega[\lambda](\delta x,\delta u,\delta v):=\int_{0}^{T}\big[H_{vxx}[\lambda](\delta x,\delta x,\delta v)+2H_{vux}[\lambda](\delta x,\delta u,\delta v)+H_{vuu}[\lambda](\delta u,\delta u,\delta v)\big]\mathrm{d}t,$$

$$\mathcal{R}(\delta x,\delta u,\delta v)=L_{\ell}|(\delta x(0),\delta x(T))|^{3}+LK(1+\|v\|_{\infty})\|(\delta x,\delta u)\|_{\infty}\|(\delta x,\delta u)\|_{2}^{2}.$$

$$\Omega[\lambda](\bar{w})=\tfrac{1}{2}D^{2}\mathcal{L}[\lambda](\hat{w})\,\bar{w}^{2}.$$

$$\max_{\lambda\in\Lambda}\Omega[\lambda](\bar{x},\bar{u},\bar{v})\geq 0,\quad \text{for all } (\bar{x},\bar{u},\bar{v})\in\mathcal{C}. \tag{20}$$

$$\max_{\lambda\in\Lambda}\Omega[\lambda](\bar{x},\bar{u},\bar{v})\geq 0,\quad \text{for all } (\bar{x},\bar{u},\bar{v})\in\mathcal{C}_{2}. \tag{21}$$

$$\mathcal{H}_{2}:=\{(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{W}_{2}:\ \text{(12) holds}\},$$

$$({\rm co}\,\Lambda)^{\#}:=\{\lambda\in{\rm co}\,\Lambda:\ \Omega[\lambda]\ \text{is weakly-l.s.c. on } \mathcal{H}_{2}\}.$$


Full text

This article has been accepted for publication in Discrete Contin. Dyn. Syst. Ser. S.

Second order necessary and sufficient optimality conditions for singular solutions of partially-affine control problems

M. Soledad Aronna


Escola de Matemática Aplicada, Fundação Getulio Vargas, Praia de Botafogo 190, 22250-900 Rio de Janeiro - RJ, Brazil


Abstract.

In this article we study optimal control problems for systems that are affine with respect to some of the control variables and nonlinear in relation to the others. We consider finitely many equality and inequality constraints on the initial and final values of the state. We investigate singular optimal solutions for this class of problems, for which we obtain second order necessary and sufficient conditions for weak optimality in integral form. We also derive Goh pointwise necessary optimality conditions. We show an example to illustrate the results.

Key words and phrases:

optimal control, singular control, second order optimality condition, Goh condition, Legendre-Clebsch, shooting algorithm

1. Introduction

The purpose of this paper is to investigate optimal control problems governed by systems of ordinary differential equations of the form

$$\dot{x}=f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u),\quad \text{a.e. on } [0,T]. \tag{1}$$

Here $x:[0,T]\to\mathbb{R}^{n}$ is the state variable, $v_{i}:[0,T]\to\mathbb{R}$ are the affine controls for $i=1,\dots,m,$ while $u:[0,T]\to\mathbb{R}^{l}$ is the vector of nonlinear controls and $f_{i}:\mathbb{R}^{n+l}\to\mathbb{R}^{n}$ is a vector field, for each $i=0,\dots,m.$

Many models that fit into this framework can be found in practice and, in particular, in the existing literature. Among these we can mention: Goddard’s problem in three dimensions [24] analyzed in Bonnans et al. [11]; several models concerning the motion of rockets, such as the ones treated in Lawden [33], Bell and Jacobson [8], Goh [26, 29], Oberle [40], Azimov [7] and Hull [31]; a hydrothermal electricity production problem studied in Bortolossi et al. [13]; the problem of atmospheric flight considered by Oberle in [41]; and the optimal production processes studied in Cho et al. [16] and Maurer et al. [36]. All the systems investigated in these cited articles are partially-affine in the sense that they have at least one affine and at least one nonlinear control.

The subject of second order optimality conditions for these partially-affine problems has been studied by Goh in [26, 27, 28, 29], Dmitruk in [21], Dmitruk and Shishov in [22], Bernstein and Zeidan [9], Frankowska and Tonon [23], and Maurer and Osmolovskii [37]. The first works were by Goh, who introduced a change of variables in [27] and used it to obtain necessary optimality conditions in [27, 26, 25], always assuming normality of the optimal solution. The necessary conditions we present imply those by Goh [25] when there is only one multiplier (see Corollary 5.2). Recently, Dmitruk and Shishov [22] analyzed the quadratic functional associated with the second variation of the Lagrangian function, and provided a set of necessary conditions for the nonnegativity of this quadratic functional. Their results are a consequence of a second order necessary condition that we present (see Theorem 5.3). In [21], Dmitruk proposed, without proof, necessary and sufficient conditions for a problem having a particular structure: the affine control variable applies to a term depending only on the state variable, i.e. the affine and nonlinear controls are uncoupled or, equivalently, $H_{uv}$ is identically zero, where $H$ denotes the unmaximized Hamiltonian. This hypothesis is not used in our work. Nevertheless, the conditions established here coincide with those suggested in Dmitruk [21], when the latter are applicable. In [9], Bernstein and Zeidan derived the Riccati equation for the singular linear-quadratic regulator, which is a modification of the classical linear-quadratic regulator where only some components of the control enter quadratically in the cost function. Frankowska and Tonon proved in [23] second order necessary conditions for problems with closed control constraints and optimal controls containing arcs along which the second order derivative $H_{uu}$ of the unmaximized Hamiltonian vanishes. The necessary conditions given in [23] hold for problems either with no endpoint constraints, or with smooth endpoint constraints and additional hypotheses such as calmness and the abnormality of Pontryagin’s Maximum Principle. All the articles mentioned in this paragraph use Goh’s transformation to derive their optimality conditions, as is done in the current paper, while none of them proved second order sufficient conditions, which are the main contribution of this article. It is worth mentioning that sufficient conditions were shown by Maurer and Osmolovskii in [37], but for the case of a scalar control subject to bounds and bang-bang optimal solutions (i.e. no singular arc). This structure is not studied here since no closed control constraints are considered and thus our optimal control is supposed to be singular along the whole interval.

The contributions of this article are as follows. We provide a pair of necessary and sufficient conditions in integral form for weak optimality of singular solutions of partially-affine problems (Theorems 5.3 and 6.2). These conditions are ‘no gap’ in the sense that the sufficient condition is obtained from the necessary one by strengthening an inequality. We consider fairly general endpoint constraints and we do not assume uniqueness of the multiplier. The main result is the sufficient condition of Theorem 6.2, which, to our knowledge, cannot be found in the existing literature, and has important practical applications. As a product of the necessary condition of Theorem 5.3 we get the pointwise Goh conditions in Corollary 5.2, thus extending previous results (see [25, 23]) to problems with general endpoint constraints, and removing the hypothesis of vanishing $H_{uu}$ imposed in [23]. In order to obtain the sufficient condition we impose a regularity assumption on the optimal controls that, in some practical situations, is a consequence of the generalized Legendre-Clebsch condition (see Remark 6.4). We provide a simple example to illustrate our results.

As a main application of the sufficient condition provided in this article we can mention the proof of convergence of an associated shooting algorithm, as stated in Aronna [4] and shown in detail in the technical report Aronna [5]. It is worth mentioning that, for practical interest, this shooting algorithm and its proof of convergence can also be used to solve partially-affine problems with bounds on the control and associated bang-singular solutions.

The article is organized as follows. In Section 2 we present the problem, the basic definitions and first order optimality conditions. In Section 3 we give the tools for second order analysis and establish a second order necessary condition. We introduce Goh’s transformation in Section 4. In Section 5 we show a new second order necessary condition. In Section 6 we present the main result of this article that is a second order sufficient condition. We show an example to illustrate our results in Section 7, while Section 8 is devoted to the conclusions and possible extensions. Finally, we include an Appendix containing some proofs of technical results that are omitted throughout the article.

Notations. Given a function $h$ of variable $(t,x),$ we write $D_{t}h$ or $\dot{h}$ for its derivative in time, and $D_{x}h$ or $h_{x}$ for the differentiations with respect to space variables. The same convention is extended to higher order derivatives. We let $\mathbb{R}^{k}$ denote the $k$-dimensional real space, i.e. the space of column real vectors of dimension $k,$ and $\mathbb{R}^{k,*}$ its corresponding dual space, which consists of $k$-dimensional real row vectors. By $L^{p}(0,T;\mathbb{R}^{k})$ we mean the Lebesgue space with domain equal to the interval $[0,T]\subset\mathbb{R}$ and with values in $\mathbb{R}^{k}.$ The notation $W^{q,s}(0,T;\mathbb{R}^{k})$ refers to the Sobolev spaces (see e.g. Adams [1]). Given $A$ and $B$ two $k\times k$ symmetric real matrices, we write $A\succeq B$ to indicate that $A-B$ is positive semidefinite. Given two functions $k_{1}:\mathbb{R}^{N}\rightarrow\mathbb{R}^{M}$ and $k_{2}:\mathbb{R}^{N}\rightarrow\mathbb{R}^{L},$ we say that $k_{1}$ is a big-O of $k_{2}$ around 0 and write

$$k_{1}(x)=\mathcal{O}(k_{2}(x)),$$

if there exist positive constants $\delta$ and $M$ such that $|k_{1}(x)|\leq M|k_{2}(x)|$ for $|x|<\delta.$ It is a small-o if $M$ goes to 0 as $|x|$ goes to 0, and in this case we write

$$k_{1}(x)=o(k_{2}(x)).$$

2. Statement of the problem and assumptions

2.1. Statement of the problem.

We study the optimal control problem (P) given by

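$$\varphi_{0}(x(0),x(T))\to\min,$$

$$\dot{x}=F(x,u,v),\quad \text{a.e. on } [0,T], \tag{2}$$

$$\eta_{j}(x(0),x(T))=0,\quad \text{for } j=1,\dots,d_{\eta}, \tag{3}$$

$$\varphi_{i}(x(0),x(T))\leq 0,\quad \text{for } i=1,\dots,d_{\varphi}, \tag{4}$$

$$u(t)\in U,\quad v(t)\in V,\quad \text{a.e. on } [0,T], \tag{5}$$

where the function $F\colon\mathbb{R}^{n+l+m}\to\mathbb{R}^{n}$ can be written as

$$F(x,u,v):=f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u).$$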

Here $f_{i}\colon\mathbb{R}^{n+l}\rightarrow\mathbb{R}^{n}$ for $i=0,\ldots,m,$ $\varphi_{i}\colon\mathbb{R}^{2n}\rightarrow\mathbb{R}$ for $i=0,\ldots,d_{\varphi},$ $\eta_{j}\colon\mathbb{R}^{2n}\rightarrow\mathbb{R}$ for $j=1,\ldots,d_{\eta}.$ The sets $U$ and $V$ are open domains of $\mathbb{R}^{l}$ and $\mathbb{R}^{m},$ respectively. The control $u(\cdot)$ is called nonlinear, while $v(\cdot)$ is named affine control. We consider the function spaces $\mathcal{U}:=L^{\infty}(0,T;\mathbb{R}^{l})$ and $\mathcal{V}:=L^{\infty}(0,T;\mathbb{R}^{m})$ for the controls, and $\mathcal{X}:=W^{1,\infty}(0,T;\mathbb{R}^{n})$ for the state. When needed, we use $w(\cdot):=(x,u,v)(\cdot)$ to refer to a point in $\mathcal{W}:=\mathcal{X}\times\mathcal{U}\times\mathcal{V}.$ We call trajectory an element $w(\cdot)\in\mathcal{W}$ that satisfies the state equation (2). If in addition, the endpoint constraints (3) and (4) and the control constraint (5) hold for $w(\cdot),$ then we say that it is a feasible trajectory of problem (P).
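The partially-affine structure $F(x,u,v)=f_{0}(x,u)+\sum_{i}v_{i}f_{i}(x,u)$ can be illustrated with a minimal Python sketch. The vector fields `f0` and `f1` below are hypothetical and chosen only for illustration (they do not come from the article); the helper `is_affine_in_v` checks numerically that $F$ is affine in $v$ for fixed $(x,u)$.

```python
import math

# Hypothetical toy vector fields (n = 2, l = 1, m = 1); not taken from the article.
def f0(x, u):
    return [x[1], -math.sin(x[0]) + u[0] ** 2]

def f1(x, u):
    return [0.0, math.cos(u[0])]

def F(x, u, v, fields=(f1,)):
    """Evaluate F(x,u,v) = f0(x,u) + sum_i v_i * f_i(x,u)."""
    out = list(f0(x, u))
    for v_i, f_i in zip(v, fields):
        out = [o + v_i * c for o, c in zip(out, f_i(x, u))]
    return out

def is_affine_in_v(x, u, v_a, v_b, theta=0.3, tol=1e-12):
    """Check F(x,u,theta*v_a+(1-theta)*v_b) = theta*F(x,u,v_a)+(1-theta)*F(x,u,v_b)."""
    v_mix = [theta * a + (1 - theta) * b for a, b in zip(v_a, v_b)]
    lhs = F(x, u, v_mix)
    rhs = [theta * a + (1 - theta) * b
           for a, b in zip(F(x, u, v_a), F(x, u, v_b))]
    return all(abs(p - q) <= tol for p, q in zip(lhs, rhs))
```

The same check applied to the $u$ argument would fail for this toy example, since `f0` and `f1` depend nonlinearly on `u`.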

We consider the following regularity hypothesis throughout the article.

Assumption 2.1.

All data functions have Lipschitz-continuous second order derivatives.

In this paper we study optimality conditions for weak minima of problem (P). A feasible trajectory $\hat{w}(\cdot)=(\hat{x},\hat{u},\hat{v})(\cdot)$ is said to be a weak minimum if there exists $\varepsilon>0$ such that the cost function attains at $\hat{w}(\cdot)$ its minimum in the set of feasible trajectories $w(\cdot)=(x,u,v)(\cdot)$ satisfying

$$\|x-\hat{x}\|_{\infty}<\varepsilon,\quad \|u-\hat{u}\|_{\infty}<\varepsilon,\quad \|v-\hat{v}\|_{\infty}<\varepsilon.$$

For the remainder of the article, we fix a nominal feasible trajectory $\hat{w}(\cdot):=(\hat{x},\hat{u},\hat{v})(\cdot)$ for which we provide optimality conditions. We assume that the controls $\hat{u}(\cdot)$ and $\hat{v}(\cdot)$ do not accumulate at the boundaries of $U$ and $V,$ respectively. That is, letting $\mathbb{B}$ denote the closed unit ball of $\mathbb{R}^{l+m},$ we impose:

Assumption 2.2.

There exists $\delta>0$ such that $(\hat{u},\hat{v})(t)+\delta\mathbb{B}\subset U\times V,$ for almost all $t\in[0,T].$

An element $\delta w(\cdot)\in\mathcal{W}$ is termed feasible variation for $\hat{w}(\cdot)$ if $\hat{w}(\cdot)+\delta w(\cdot)$ is a feasible trajectory for (P). For $\lambda=(\alpha,\beta,p(\cdot))$ in the space $\mathbb{R}^{d_{\varphi}+1,*}\times\mathbb{R}^{d_{\eta},*}\times W^{1,\infty}(0,T;\mathbb{R}^{n,*}),$ we define the following functions:

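• the pre-Hamiltonian (or unmaximized Hamiltonian) function $H[\lambda]\colon\mathbb{R}^{n}\times\mathbb{R}^{l}\times\mathbb{R}^{m}\times[0,T]\to\mathbb{R}$ given by

$$H[\lambda](x,u,v,t):=p(t)\Big(f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u)\Big),$$

• the endpoint Lagrangian function $\ell[\lambda]\colon\mathbb{R}^{2n}\to\mathbb{R},$

$$\ell[\lambda](x_{0},x_{T}):=\sum_{i=0}^{d_{\varphi}}\alpha_{i}\varphi_{i}(x_{0},x_{T})+\sum_{j=1}^{d_{\eta}}\beta_{j}\eta_{j}(x_{0},x_{T}),$$

• and the Lagrangian function $\mathcal{L}[\lambda]\colon\mathcal{W}\to\mathbb{R},$

$$\mathcal{L}[\lambda](w):=\ell[\lambda](x(0),x(T))+\int_{0}^{T}p\Big(f_{0}(x,u)+\sum_{i=1}^{m}v_{i}f_{i}(x,u)-\dot{x}\Big)\mathrm{d}t. \tag{6}$$

We assume, for the sake of notational simplicity, that whenever some argument of $F,$ $f_{i},$ $H,$ $\ell,$ $\mathcal{L}$ or their derivatives is omitted, they are evaluated at $\hat{w}(\cdot).$ If we further want to make explicit that they are evaluated at time $t,$ we write $F[t],$ $f_{i}[t],$ etc. The same notational conventions hold for other functions of the state, control and multiplier that we define throughout the article. We assume, without any loss of generality, that

$$\varphi_{i}(\hat{x}(0),\hat{x}(T))=0,\quad \text{for all } i=1,\dots,d_{\varphi}.$$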

2.2. Lagrange multipliers

We introduce here the concept of multiplier. The second order conditions that we prove in this article are expressed in terms of the second variation of the Lagrangian function $\mathcal{L}$ given in (6) and the set of Lagrange multipliers associated with $\hat{w}(\cdot)$ that we define below.

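Definition 2.3.

An element $\lambda=(\alpha,\beta,p(\cdot))\in\mathbb{R}^{d_{\varphi}+1,*}\times\mathbb{R}^{d_{\eta},*}\times W^{1,\infty}(0,T;\mathbb{R}^{n,*})$ is a Lagrange multiplier associated with $\hat{w}(\cdot)$ if it satisfies the following conditions:

$$|\alpha|+|\beta|=1, \tag{7}$$

$$\alpha=(\alpha_{0},\alpha_{1},\dots,\alpha_{d_{\varphi}})\geq 0, \tag{8}$$

the function $p(\cdot)$ is a solution of the costate equation

$$-\dot{p}(t)=H_{x}[\lambda](\hat{x}(t),\hat{u}(t),\hat{v}(t),t), \tag{9}$$

it satisfies the transversality conditions

$$p(0)=-D_{x_{0}}\ell[\lambda](\hat{x}(0),\hat{x}(T)),\qquad p(T)=D_{x_{T}}\ell[\lambda](\hat{x}(0),\hat{x}(T)), \tag{10}$$

and the stationarity conditions

$$\left\{\begin{array}{l}H_{u}[\lambda](\hat{x}(t),\hat{u}(t),\hat{v}(t),t)=0,\\ H_{v}[\lambda](\hat{x}(t),\hat{u}(t),\hat{v}(t),t)=0,\end{array}\right.\quad \text{a.e. on } [0,T]. \tag{11}$$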

We let $\Lambda$ denote the set of Lagrange multipliers associated with $\hat{w}(\cdot).$

The following result constitutes a first order necessary condition and yields the existence of Lagrange multipliers.

Theorem 2.4.

If $\hat{w}(\cdot)$ is a weak minimum for (P), then the set $\Lambda$ is nonempty and compact.

Proof.

The existence of a Lagrange multiplier follows from Milyutin-Osmolovskii [39, Thm. 2.1] or equivalent results proved in Alekseev et al. [3] and Kurcyusz-Zowe [32]. In order to prove the compactness, observe that $\Lambda$ is closed and that $p(\cdot)$ may be expressed as a linear continuous mapping of $(\alpha,\beta).$ Thus, since the normalization (7) holds, $\Lambda$ is necessarily a finite-dimensional compact set. ∎

In view of Theorem 2.4, note that $\Lambda$ can be identified with a compact subset of $\mathbb{R}^{s},$ where $s:=d_{\varphi}+d_{\eta}+1.$ The main results of this article are stated on a restricted subset of $\Lambda$ for which the matrix $D^{2}_{(u,v)^{2}}H[\lambda](\hat{w},t)$ is singular and, consequently, the pairs $(\hat{w},\lambda)$ turn out to be singular extremals. We comment again on this fact in Remark 3.6 below.
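As a worked illustration of where the singularity comes from (a sketch derived only from the definition of the pre-Hamiltonian $H[\lambda]$ given in Section 2.1, with the block notation chosen here for convenience), recall that $H[\lambda]$ is affine in $v.$ Differentiating,

$$H_{v_{i}}[\lambda]=p\,f_{i}(x,u),\qquad H_{v_{i}v_{j}}[\lambda]=0,\qquad H_{v_{i}u}[\lambda]=p\,D_{u}f_{i}(x,u),$$

so that

$$D^{2}_{(u,v)^{2}}H[\lambda]=\begin{pmatrix}H_{uu}[\lambda]&H_{vu}[\lambda]^{\top}\\ H_{vu}[\lambda]&0\end{pmatrix}.$$

The zero block alone does not force singularity: for $l=m=1,$ the values $H_{uu}=0$ and $H_{vu}=1$ give an invertible matrix. This is consistent with the results being stated on a restricted subset of $\Lambda.$ Whenever the coupling term $H_{vu}[\lambda]$ vanishes, the matrix reduces to ${\rm diag}(H_{uu}[\lambda],0)$ and is singular for every $m\geq 1.$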

Given $(\bar{x}_{0},\bar{u}(\cdot),\bar{v}(\cdot))\in\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V},$ consider the linearized state equation

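$$\dot{\bar{x}}=F_{x}\bar{x}+F_{u}\bar{u}+F_{v}\bar{v},\quad \text{a.e. on } [0,T], \tag{12}$$

$$\bar{x}(0)=\bar{x}_{0}. \tag{13}$$

The solution $\bar{x}(\cdot)$ of (12)-(13) is called the linearized state variable.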
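The role of the linearized state variable can be checked numerically. The sketch below assumes the linearized state equation is the usual first-order linearization $\dot{\bar{x}}=F_{x}\bar{x}+F_{u}\bar{u}+F_{v}\bar{v},$ $\bar{x}(0)=\bar{x}_{0},$ and uses a hypothetical scalar system $F(x,u,v)=-x+u^{2}+vu$ with constant controls, integrated by an explicit Euler scheme. None of these choices come from the article. It compares the difference quotient of the perturbed state with $\bar{x}(T).$

```python
# Hypothetical scalar example: f0(x,u) = -x + u^2, f1(x,u) = u.
def F(x, u, v):
    return -x + u ** 2 + v * u

def F_partials(x, u, v):
    """Return (F_x, F_u, F_v) for the toy dynamics above."""
    return -1.0, 2.0 * u + v, u

def simulate(x0, u, v, T=1.0, steps=1000):
    """Explicit Euler integration of x' = F(x,u,v) with constant controls."""
    x, dt = x0, T / steps
    for _ in range(steps):
        x += dt * F(x, u, v)
    return x

def simulate_linearized(x0, xbar0, u_hat, v_hat, ubar, vbar, T=1.0, steps=1000):
    """Euler integration of the linearized equation along the nominal trajectory."""
    x, xbar, dt = x0, xbar0, T / steps
    for _ in range(steps):
        Fx, Fu, Fv = F_partials(x, u_hat, v_hat)
        xbar_next = xbar + dt * (Fx * xbar + Fu * ubar + Fv * vbar)
        x += dt * F(x, u_hat, v_hat)
        xbar = xbar_next
    return xbar

def linearization_gap(eps):
    """|(x_eps(T) - x_hat(T)) / eps - xbar(T)| for a fixed direction."""
    u_hat, v_hat, x0 = 1.0, 0.5, 0.0       # nominal data (illustrative)
    xbar0, ubar, vbar = 0.2, 1.0, 1.0      # perturbation direction (illustrative)
    x_nom = simulate(x0, u_hat, v_hat)
    x_pert = simulate(x0 + eps * xbar0, u_hat + eps * ubar, v_hat + eps * vbar)
    xbar_T = simulate_linearized(x0, xbar0, u_hat, v_hat, ubar, vbar)
    return abs((x_pert - x_nom) / eps - xbar_T)
```

In this toy setting the gap shrinks roughly proportionally to `eps`, which is the first-order behaviour one expects from a linearization.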

2.3. Critical cones

We define here the sets of critical directions associated with $\hat{w}(\cdot),$ both in the $L^{\infty}$- and the $L^{2}$-norms. Even if we are working with control variables in $L^{\infty}$ and hence the control perturbations are naturally taken in $L^{\infty},$ the second order analysis involves quadratic mappings that require continuously extending the cones to $L^{2}.$

Set $\mathcal{X}_{2}:=W^{1,2}(0,T;\mathbb{R}^{n}),$ $\mathcal{U}_{2}:=L^{2}(0,T;\mathbb{R}^{l})$ and $\mathcal{V}_{2}:=L^{2}(0,T;\mathbb{R}^{m}),$ and write $\mathcal{W}_{2}:=\mathcal{X}_{2}\times\mathcal{U}_{2}\times\mathcal{V}_{2}$ to refer to the corresponding product space. Given $\bar{w}(\cdot)\in\mathcal{W}_{2}$ satisfying the linearized state equation (12)-(13), consider the linearization of the endpoint constraints and cost function,

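$$D\eta_{j}(\hat{x}(0),\hat{x}(T))(\bar{x}(0),\bar{x}(T))=0,\quad \text{for } j=1,\dots,d_{\eta}, \tag{14}$$

$$D\varphi_{i}(\hat{x}(0),\hat{x}(T))(\bar{x}(0),\bar{x}(T))\leq 0,\quad \text{for } i=0,\dots,d_{\varphi}. \tag{15}$$

The critical cones in $\mathcal{W}_{2}$ and $\mathcal{W}$ are given, respectively, by

$$\mathcal{C}_{2}:=\{\bar{w}(\cdot)\in\mathcal{W}_{2}:\ \text{(12)-(13) and (14)-(15) hold}\}, \tag{16}$$

$$\mathcal{C}:=\mathcal{C}_{2}\cap\mathcal{W}. \tag{17}$$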

The following density result holds.

Lemma 2.5.

The critical cone $\mathcal{C}$ is dense in $\mathcal{C}_{2}$ with respect to the $\mathcal{W}_{2}$ -topology.

The proof of the previous lemma follows from the following technical result (due to Dmitruk [20, Lemma 1]).

Lemma 2.6 (on density of cones).

Consider a locally convex topological space $X,$ a finite-faced cone $Z\subset X,$ and a linear space $Y$ dense in $X.$ Then the cone $Z\cap Y$ is dense in $Z.$

Proof of Lemma 2.5. Set $X:=\{\bar{w}(\cdot)\in\mathcal{W}_{2}:\text{(12)-(13) hold}\},$ $Y:=\{\bar{w}(\cdot)\in\mathcal{W}:\text{(12)-(13) hold}\},$ and $Z:=\mathcal{C}_{2}$ and apply Lemma 2.6. The desired density follows. $\square$

3. Second order analysis

We begin this section by giving an expression of the second order derivative of the Lagrangian function $\mathcal{L},$ in terms of derivatives of $\ell$ and $H.$ We let $\Omega$ denote this second variation. All the second order conditions we present are established in terms of either $\Omega$ or some transformed form of $\Omega.$ The main result of the current section is the necessary condition in Theorem 3.9, which is applied in Section 5 to get the stronger condition given in Theorem 5.3.

3.1. Second variation

Let us consider the quadratic mapping

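$$\begin{split}\Omega[\lambda](\bar{x},\bar{u},\bar{v}):={}&\tfrac{1}{2}D^{2}\ell[\lambda](\hat{x}(0),\hat{x}(T))(\bar{x}(0),\bar{x}(T))^{2}+\int_{0}^{T}\Big(\tfrac{1}{2}\bar{x}^{\top}H_{xx}[\lambda]\bar{x}\\ &+\bar{u}^{\top}H_{ux}[\lambda]\bar{x}+\bar{v}^{\top}H_{vx}[\lambda]\bar{x}+\tfrac{1}{2}\bar{u}^{\top}H_{uu}[\lambda]\bar{u}+\bar{v}^{\top}H_{vu}[\lambda]\bar{u}\Big)\mathrm{d}t.\end{split}$$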

The result that follows gives an expression of the Lagrangian $\mathcal{L}$ at the nominal trajectory $\hat{w}(\cdot).$ For the sake of simplicity, the time variable is omitted in the statement.

Lemma 3.1 (Lagrangian expansion).

Let $w(\cdot)=(x,u,v)(\cdot)\in\mathcal{W}$ be a trajectory and set $\delta w(\cdot)=(\delta x,\delta u,\delta v)(\cdot):=w(\cdot)-\hat{w}(\cdot).$ Then, for every multiplier $\lambda\in\Lambda,$ the following expansion of the Lagrangian holds

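$$\mathcal{L}[\lambda](w)=\mathcal{L}[\lambda](\hat{w})+\Omega[\lambda](\delta x,\delta u,\delta v)+\omega[\lambda](\delta x,\delta u,\delta v)+\mathcal{R}(\delta x,\delta u,\delta v),$$

where $\omega$ is a cubic mapping given by

$$\omega[\lambda](\delta x,\delta u,\delta v):=\int_{0}^{T}\big[H_{vxx}[\lambda](\delta x,\delta x,\delta v)+2H_{vux}[\lambda](\delta x,\delta u,\delta v)+H_{vuu}[\lambda](\delta u,\delta u,\delta v)\big]\mathrm{d}t,$$

and $\mathcal{R}$ satisfies the estimate

$$\mathcal{R}(\delta x,\delta u,\delta v)=L_{\ell}|(\delta x(0),\delta x(T))|^{3}+LK(1+\|v\|_{\infty})\|(\delta x,\delta u)\|_{\infty}\|(\delta x,\delta u)\|_{2}^{2}.$$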

Here $L_{\ell}$ is a Lipschitz constant for $D^{2}\ell[\lambda]$ uniformly with respect to $\lambda\in\Lambda,$ $L$ is a Lipschitz constant for $D^{2}f_{i}$ uniformly in $i=0,\dots,m,$ and $K:=\displaystyle\sup_{\lambda\in\Lambda}\|p(\cdot)\|_{\infty}.$

Proof.

See Appendix A.1. ∎

Remark 3.2.

From the previous lemma one gets the identity

$$\Omega[\lambda](\bar{w})=\tfrac{1}{2}D^{2}\mathcal{L}[\lambda](\hat{w})\,\bar{w}^{2}.$$

3.2. Second order necessary condition

The following result is a classical second order condition for weak minima.

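Theorem 3.3 (Second order necessary condition).

If $\hat{w}(\cdot)$ is a weak minimum of problem (P), then

$$\max_{\lambda\in\Lambda}\Omega[\lambda](\bar{x},\bar{u},\bar{v})\geq 0,\quad \text{for all } (\bar{x},\bar{u},\bar{v})\in\mathcal{C}. \tag{20}$$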

A proof of Theorem 3.3 can be found in Levitin, Milyutin and Osmolovskii [34]. Nevertheless, for the sake of completeness, we give a proof in Appendix A.2 that uses techniques of optimization in abstract spaces.

An extension of the condition (20) to the cone $\mathcal{C}_{2}$ can be easily proved and gives the following, stronger, second order condition.

Theorem 3.4.

If $\hat{w}(\cdot)$ is a weak minimum of problem (P), then

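$$\max_{\lambda\in\Lambda}\Omega[\lambda](\bar{x},\bar{u},\bar{v})\geq 0,\quad \text{for all } (\bar{x},\bar{u},\bar{v})\in\mathcal{C}_{2}. \tag{21}$$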

Proof.

Observe first that $\Omega[\lambda]$ can be extended to the space $\mathcal{W}_{2}$ since all the coefficients are essentially bounded. The result follows by the density property of Lemma 2.5 and the compactness of the Lagrange multipliers set $\Lambda$ proved in Theorem 2.4. ∎

3.3. Strengthened second order necessary condition

In the sequel we aim at strengthening the necessary condition of Theorem 3.4 by proving that the maximum in (21) remains nonnegative when taken in a possibly smaller set of multipliers, whenever $\Lambda$ is convex.

Let ${\rm co}\,\Lambda$ denote the convex hull of $\Lambda.$ Observe that if $\lambda=(\alpha,\beta,p(\cdot))$ is in ${\rm co}\,\Lambda$ then it verifies (8)-(11) and, if $\hat{w}(\cdot)$ is a weak minimum, also the second order condition (21) is fulfilled for $\lambda.$ However, $\lambda$ may not verify the nontriviality condition (7), thus ${\rm co}\,\Lambda$ may contain the trivial (i.e. identically zero) multiplier.

Set

$$\mathcal{H}_{2}:=\{(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{W}_{2}:\ \text{(12) holds}\},$$

and consider the subset of ${\rm co}\,\Lambda$ given by

$$({\rm co}\,\Lambda)^{\#}:=\{\lambda\in{\rm co}\,\Lambda:\ \Omega[\lambda]\ \text{is weakly-l.s.c. on } \mathcal{H}_{2}\}.$$

Next we prove that $({\rm co}\,\Lambda)^{\#}$ can be characterized in a quite simple way (see Lemma 3.5 below). Theorem 3.9 stated afterwards yields a new necessary optimality condition.

Lemma 3.5.

[TABLE]

Remark 3.6 (About singular solutions).

From now on we restrict ourselves to the set $({\rm co}\,\Lambda)^{\#}$ or to some subset of it and, therefore, $H_{uv}[\lambda]\equiv 0$ along the nominal trajectory $\hat{w}(\cdot).$ Consequently,

[TABLE]

The latter assertion together with the stationarity condition (11) imply that $(\hat{w},\lambda)$ is a singular extremal (as defined in Bryson-Ho [15, Page 246]). That is, if we write $\nu:=(u,v)$ for the control, we say that $(\hat{w},\lambda)$ is a singular extremal if $H_{\nu}[\lambda]=0$ and $H_{\nu\nu}[\lambda]$ is singular a.e. on $[0,T]$ .

Let us comment on the terminology used in the literature for the class of problems where $H_{\nu\nu}$ is a singular matrix. In Bell-Jacobson [8, Definition 1.2] and Ruxton-Bell [44] they refer to singular extremals (as defined above) as totally singular, while they use the term partially singular to refer to controls for which $H_{\nu}=0$ only on some subintervals of $[0,T],$ which is not the class of controls studied here. The same definition is adopted in Poggiolini and Stefani [43]. On the other hand, O’Malley in [42] calls partially singular the linear-quadratic problems in which the matrix $H_{\nu\nu}$ is (singular but) not of constant non-zero rank, that is a framework included in our class of problems.

In order to prove Lemma 3.5 we shall notice that $\Omega[\lambda]$ can be written as the sum of two maps: the first one being a weakly-continuous function on the space $\mathcal{H}_{2}$ given by

[TABLE]

and the second one being the quadratic operator

[TABLE]

The weak-continuity of the mapping in (24) follows easily. Additionally, in view of Hestenes [30, Theorem 3.2], the following characterization holds.

Lemma 3.7.

The mapping in (25) is weakly-lower semicontinuous on $\mathcal{U}\times\mathcal{V}$ if and only if the matrix

[TABLE]

is positive semidefinite almost everywhere on $[0,T].$

Remark 3.8.

The fact that the matrix in (26) is positive semidefinite is known as the Legendre-Clebsch necessary optimality condition for the extremal $(\hat{w},\lambda)$ (see e.g. Bliss [10] in the framework of Calculus of Variations, and Bryson-Ho [15], Agrachev-Sachkov [2] or Corollary 3.12 below for Optimal Control).

We can now prove Lemma 3.5.

Proof of Lemma 3.5. It follows from the decomposition given in (24)-(25) and the characterization of weak-lower semicontinuity stated in Lemma 3.7. $\square$

Theorem 3.9 (Strengthened second order necessary condition).

If $\hat{w}(\cdot)$ is a weak minimum of problem (P), then

[TABLE]

Remark 3.10 (On unqualified solutions).

Notice that it may occur that $0\in({\rm co}\,\Lambda)^{\#}$ and, in this case, the second order condition in Theorem 3.9 above does not provide any information. This situation may arise when the endpoint constraints are not qualified, in the sense of the constraint qualification condition (73) introduced in the Appendix, which is a natural generalization of the Mangasarian-Fromovitz condition [35] to the infinite-dimensional framework.

In order to achieve Theorem 3.9, let us recall the following result on quadratic forms (taken from Dmitruk [18, Theorem 5]).

Lemma 3.11.

Given a Hilbert space $H,$ and $a_{1},a_{2},\ldots,a_{p}$ in $H,$ set

[TABLE]

Let $M$ be a convex and compact subset of $\mathbb{R}^{s},$ and let $\{Q^{\psi}:\psi\in M\}$ be a family of continuous quadratic forms over $H,$ the mapping $\psi\rightarrow Q^{\psi}$ being affine. Set $M^{\#}:=\{\psi\in M:\ Q^{\psi}\ \text{is weakly-l.s.c.}\text{ on }H\}$ and assume that

[TABLE]

Then

[TABLE]

We are now able to show Theorem 3.9 as desired.

Proof of Theorem 3.9. It is a consequence of Theorem 3.4, Lemmas 3.5 and 3.11.

$\square$

We finish this section with the following extension of the classical second order pointwise Legendre-Clebsch condition, which follows as a corollary of Theorem 3.9.

Corollary 3.12 (Legendre-Clebsch condition).

If $\hat{w}(\cdot)$ is a weak minimum of (P) with a unique associated Lagrange multiplier $\hat{\lambda},$ then $(\hat{w},\hat{\lambda})$ satisfies the Legendre-Clebsch condition, that is, the matrix in (26) is positive semidefinite and, consequently,

[TABLE]

Proof.

It follows easily from Theorem 3.9. In fact, as the Lagrange multiplier is unique, ${\rm co}\,\Lambda=\Lambda=\{\hat{\lambda}\},$ and the inequality in (27) implies that $({\rm co}\,\Lambda)^{\#}\neq\emptyset.$ Therefore, $({\rm co}\,\Lambda)^{\#}=\Lambda^{\#}=\{\hat{\lambda}\}$ and (31) necessarily holds. ∎

4. Goh Transformation

In this section we introduce the Goh transformation, which is a linear change of variables usually applied to a linear differential equation, and which is motivated by the facts explained in the sequel. In the previous section we were able to provide a necessary condition involving the nonnegativity on $\mathcal{C}_{2}$ of the maximum of $\Omega[\lambda]$ over the set $({\rm co}\,\Lambda)^{\#}$ (Theorem 3.9). Our next step is finding a sufficient condition. To achieve this, one would naturally try to strengthen the inequality (27) to convert it into a condition of strong positivity. However, since no quadratic term in $\bar{v}(\cdot)$ appears in $\Omega,$ the latter cannot be strongly positive with respect to the norm of the controls. Thus, what we do here to find the desired sufficient condition is to transform $\Omega$ into a new quadratic mapping that may turn out to be strongly positive on an appropriate transformed critical cone. For historical interest, we recall that Goh introduced this change of variables in [27] and employed it to derive necessary conditions in [27, 25]. Since then, many optimality conditions have been obtained by using that transformation, as already mentioned in the Introduction.

For the remainder of the article, we consider the following regularity hypothesis on the controls.

*Assumption 4.1**.*

The controls $\hat{u}(\cdot)$ and $\hat{v}(\cdot)$ are smooth.

This hypothesis is not restrictive since it is a consequence of the strengthened generalized Legendre-Clebsch condition, as explained in Aronna [5, 4], where it is shown that, whenever this generalized condition holds, one can write the controls as smooth functions of the state and costate variables. See also Remark 6.4 below.

Consider hence the linearized state equation (12) and the Goh transformation defined by

[TABLE]

Observe that $\bar{\xi}(\cdot)$ defined in that way satisfies the linear equation

[TABLE]

where

[TABLE]

Here $B$ is an $n\times m$-matrix whose $i$th column is given by

[TABLE]

where $[f_{i},f_{j}]^{x}:=({\rm D}_{x}f_{i})f_{j}-({\rm D}_{x}f_{j})f_{i}$ is referred to as the Lie bracket with respect to $x$ of the vector fields $f_{i}$ and $f_{j}.$
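As a minimal illustration of this bracket convention, the following Python sketch evaluates $[f_{i},f_{j}]^{x}$ with Jacobians approximated by central differences. The vector fields `f_i` and `f_j` and all helper names are hypothetical and chosen only for illustration; they are not taken from the article.

```python
def jacobian(f, x, eps=1e-6):
    """Central-difference approximation of D_x f at x, stored as a list of rows."""
    n = len(x)
    out_dim = len(f(x))
    jac = [[0.0] * n for _ in range(out_dim)]
    for k in range(n):
        xp, xm = list(x), list(x)
        xp[k] += eps
        xm[k] -= eps
        fp, fm = f(xp), f(xm)
        for r in range(out_dim):
            jac[r][k] = (fp[r] - fm[r]) / (2 * eps)
    return jac


def matvec(a, v):
    """Matrix-vector product for a matrix stored as a list of rows."""
    return [sum(a_rk * v_k for a_rk, v_k in zip(row, v)) for row in a]


def lie_bracket_x(f_i, f_j, x):
    """[f_i, f_j]^x = (D_x f_i) f_j - (D_x f_j) f_i, the sign convention of the text."""
    left = matvec(jacobian(f_i, x), f_j(x))
    right = matvec(jacobian(f_j, x), f_i(x))
    return [l - r for l, r in zip(left, right)]


# Hypothetical vector fields on R^2 (assumption, for illustration only).
def f_i(x):
    return [x[1], 0.0]


def f_j(x):
    return [0.0, x[0]]


# For these fields the bracket is (x_1, -x_2), so at (2, 3) it is close to (2, -3).
bracket = lie_bracket_x(f_i, f_j, [2.0, 3.0])
```

Note that some references use the opposite sign for the Lie bracket, so the order of the two terms matters when comparing with other texts.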

4.1. Transformed critical cones

In this paragraph we present the critical cones obtained after Goh’s transformation. We first recall the linearized endpoint constraints (14)-(15) and the critical cones (16)-(17). Let $(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{C}$ be a critical direction. Define $(\bar{\xi},\bar{y})(\cdot)$ by Goh’s transformation (32) and set $\bar{h}:=\bar{y}(T).$ From (14)-(15) we get

[TABLE]

Recall the definition of the linear space $\mathcal{W}_{2}$ given in paragraph 2.3. Let $\mathcal{Y}$ denote the Sobolev space $W^{1,\infty}(0,T;\mathbb{R}^{m}),$ and consider the cones

[TABLE]

*Remark 4.2**.*

Observe that $\mathcal{P}$ is the cone obtained from $\mathcal{C}$ via Goh’s transformation (32).
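For orientation, the change of variables behind this remark can be sketched as follows. The computation is a reconstruction under stated assumptions, not a quotation of (12) and (32)-(34): we assume that the linearized state equation reads $\dot{\bar{x}}=F_{x}\bar{x}+F_{u}\bar{u}+F_{v}\bar{v}$ and that Goh’s transformation is $\bar{y}(t):=\int_{0}^{t}\bar{v}(s){\rm d}s,$ $\bar{\xi}:=\bar{x}-F_{v}\bar{y},$ which is consistent with the way (32) is used in Appendix B. Differentiating $\bar{\xi}$ and using $\dot{\bar{y}}=\bar{v}$ gives

$$
\dot{\bar{\xi}}=\dot{\bar{x}}-\dot{F}_{v}\bar{y}-F_{v}\bar{v}
=F_{x}\bar{x}+F_{u}\bar{u}-\dot{F}_{v}\bar{y}
=F_{x}\bar{\xi}+F_{u}\bar{u}+\left(F_{x}F_{v}-\dot{F}_{v}\right)\bar{y}.
$$

Under these assumptions $\bar{v}$ no longer appears in the transformed equation, the coefficient of $\bar{y}$ is an $n\times m$-matrix as in the description of $B$ above, and the time derivative $\dot{F}_{v}$ is well defined thanks to the smoothness of the controls in Assumption 4.1.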

The next result shows the density of $\mathcal{P}$ in $\mathcal{P}_{2}.$ This fact is used afterwards when we extend a necessary condition stated in $\mathcal{P}$ to the bigger cone $\mathcal{P}_{2}$ by continuity arguments, as it was done for $\mathcal{C}$ and $\mathcal{C}_{2}$ in Section 3.

Lemma 4.3.

$\mathcal{P}$ is a dense subspace of $\mathcal{P}_{2}$ in the $\mathcal{W}_{2}\times\mathbb{R}^{m}$-topology.

Proof.

Notice that the inclusion $\mathcal{P}\subset\mathcal{P}_{2}$ is immediate. In order to prove the density, consider the linear spaces

[TABLE]

and the cone

[TABLE]

Notice that $Y$ is a dense linear subspace of $X$ (Dmitruk-Shishov [22, Lemma 6] or Aronna et al. [6, Lemma 8.1]), and $Z$ is a finite-faced cone of $X.$ The desired density follows by Lemma 2.6. ∎

4.2. Transformed second variation

Next we write the quadratic mapping $\Omega$ in the variables $(\bar{\xi}(\cdot),\bar{u}(\cdot),\bar{y}(\cdot),\bar{v}(\cdot),\bar{h}).$ Set, for $\lambda\in({\rm co}\,\Lambda)^{\#},$

[TABLE]

where

[TABLE]

Observe that, in view of Assumptions 2.1 and 4.1, all the functions defined above are continuous in time.

*Remark 4.4**.*

We can see that $M$ is an $m\times n$-matrix whose $i$th row is given by the formula

[TABLE]

$E$ is an $m\times l$-matrix with $E_{ij}=p\displaystyle\frac{\partial^{2}F}{\partial u_{j}\partial x}f_{i}-p\frac{\partial f_{i}}{\partial x}\frac{\partial F}{\partial u_{j}},$ the $m\times m$-matrices $S$ and $G$ have entries $S_{ij}=\displaystyle\frac{1}{2}p\left(\frac{\partial f_{i}}{\partial x}f_{j}+\frac{\partial f_{j}}{\partial x}f_{i}\right),$ and

[TABLE]

respectively. The components of the matrix $R$ have a rather long expression, which simplifies for some multipliers, as detailed in equation (50) in the next section.

The identity between $\Omega$ and $\Omega_{\mathcal{P}}$ stated in the following lemma holds.

Lemma 4.5.

Let $\lambda\in({\rm co}\,\Lambda)^{\#},$ $(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{H}_{2}$ (given in (22)) and $(\bar{\xi},\bar{y})(\cdot)$ be defined by Goh’s transformation (32). Then

[TABLE]

The proof of this lemma is merely technical and we defer it to Appendix A.3.

Finally, let us recall the strengthened necessary condition of Theorem 3.9. Observe that, by applying Goh’s transformation in (27) and in view of Remark 4.2, we obtain the following form of the second order necessary condition.

Corollary 4.6.

If $\hat{w}(\cdot)$ is a weak minimum of problem (P), then

[TABLE]

5. New second order necessary condition

We aim at removing the dependence on $\bar{v}$ in the formulation of the second order necessary condition of Corollary 4.6 above. Note that in the inequality (45), $\bar{v}=\dot{\bar{y}}$ appears only in the term $\bar{v}^{\top}G[\lambda]\bar{y}.$ We prove in the sequel that we can restrict the maximum in (45) to the subset of $({\rm co}\,\Lambda)^{\#}$ consisting of the multipliers for which $G[\lambda]$ vanishes.

Let $G({\rm co}\,\Lambda)^{\#}$ refer to the subset of $({\rm co}\,\Lambda)^{\#}$ for which $G[\lambda]$ vanishes, i.e.

[TABLE]

Hence, the following optimality condition holds.

Theorem 5.1 (New necessary condition).

If $\hat{w}(\cdot)$ is a weak minimum of problem (P), then

[TABLE]

Theorem 5.1 is an extension of similar results given in Dmitruk [17], Milyutin [38] and recently in Aronna et al. [6]. The proof given in Aronna et al. [6, Theorem 4.6] holds for Theorem 5.1 with minor modifications and hence we do not include it in the present article.

Notice that when $\hat{w}(\cdot)$ has a unique associated multiplier, from Theorem 5.1 one can deduce that $G({\rm co}\,\Lambda)^{\#}$ is not empty, and since the latter is a singleton, the corollary below follows. This result gives an extension of the necessary conditions stated by Goh in [25] to the present framework.

Corollary 5.2 (Goh conditions).

Assume that $\hat{w}(\cdot)$ is a weak minimum having a unique associated multiplier. Then the following conditions hold.

(i)

$G\equiv 0$ or, equivalently, the matrix $H_{vx}F_{v}$ is symmetric, which, in view of (44), can be written as

[TABLE]

where $p(\cdot)$ is the unique associated adjoint state.

(ii)

The matrix

[TABLE]

is positive semidefinite.
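Condition (i) can be read as a statement about the skew-symmetric part of $H_{vx}F_{v}.$ The Python sketch below splits a square matrix into its symmetric and skew-symmetric parts and tests whether the skew part vanishes. The matrices `m_sample` and `m_other` are made-up numerical stand-ins for $H_{vx}F_{v}$ at a fixed time, and the factor $\frac{1}{2}$ is an assumption: the normalization of $G$ in (41) may differ.

```python
def symmetric_skew_split(m):
    """Return (S, G) with m = S + G, S symmetric and G skew-symmetric."""
    n = len(m)
    s = [[0.5 * (m[i][j] + m[j][i]) for j in range(n)] for i in range(n)]
    g = [[0.5 * (m[i][j] - m[j][i]) for j in range(n)] for i in range(n)]
    return s, g


def skew_part_vanishes(m, tol=1e-12):
    """True when the skew-symmetric part of m is zero up to tol, i.e. m is symmetric."""
    _, g = symmetric_skew_split(m)
    return all(abs(entry) <= tol for row in g for entry in row)


# Hypothetical data (assumption): one symmetric and one non-symmetric matrix.
m_sample = [[2.0, 1.0], [1.0, 3.0]]
m_other = [[2.0, 1.0], [0.0, 3.0]]

s_part, g_part = symmetric_skew_split(m_other)
```

For `m_sample` the skew part is zero, so the analogue of condition (i) holds, whereas `m_other` has a nonzero skew part and would violate it.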

We aim now at stating a necessary condition that does not depend on $\bar{v}(\cdot).$ Let us note that, for $\lambda\in G({\rm co}\,\Lambda)^{\#},$ the quadratic form $\Omega[\lambda]$ does not depend on $\bar{v}(\cdot)$ since the coefficients of the terms in $\bar{v}(\cdot)$ vanish. We can then consider its continuous extension to $\mathcal{P}_{2}$ for multipliers $\lambda\in G({\rm co}\,\Lambda)^{\#},$ given by

[TABLE]

where the involved matrices and the function $g$ were defined in (40)-(43). Observe that, since $G[\lambda]\equiv 0,$ one has that $H_{vx}[\lambda]F_{v}$ is symmetric and, therefore, the $ij$ entry of $R[\lambda]$ can be written as

[TABLE]

for each $i,j=1,\dots,m.$

From Theorem 5.1, it follows:

Theorem 5.3 (Second order necessary condition in new variables).

If $\hat{w}(\cdot)$ is a weak minimum of problem (P), then

[TABLE]

6. Second order sufficient condition for weak minimum

In this section we present the main contribution of the article: a second order sufficient condition for strict weak optimality. The optimality to be investigated here is with respect to the following $\gamma$-order:

[TABLE]

defined for $(\bar{x}(0),\bar{u}(\cdot),\bar{y}(\cdot),\bar{h})\in\mathbb{R}^{n}\times\mathcal{U}_{2}\times\mathcal{V}_{2}\times\mathbb{R}^{m}.$ Let us note that $\gamma_{\mathcal{P}}$ can also be considered as a function of $(\bar{x}(0),\bar{u}(\cdot),\bar{v}(\cdot))\in\mathbb{R}^{n}\times\mathcal{U}_{2}\times\mathcal{V}_{2}$ by setting

[TABLE]

with $\bar{y}(\cdot)$ being the primitive of $\bar{v}(\cdot)$ defined as in Goh’s transformation (32).

This $\gamma$-order was proposed in Dmitruk [21] for a simpler partially-affine problem and it is a natural extension of the order suggested (for control-affine problems) in Dmitruk [17].
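To see how this order differs from the $L^{2}$-norm of the affine control, here is a minimal numerical sketch. It assumes that $\gamma_{\mathcal{P}}$ is the squared norm of $\mathbb{R}^{n}\times\mathcal{U}_{2}\times\mathcal{V}_{2}\times\mathbb{R}^{m},$ that is, $|\bar{x}(0)|^{2}+\int_{0}^{T}(|\bar{u}|^{2}+|\bar{y}|^{2}){\rm d}t+|\bar{h}|^{2}$ with $\bar{h}=\bar{y}(T),$ which is consistent with the normalization used in the proof of Theorem 6.2. The scalar controls, the grid and the function name `gamma_order` are illustrative choices, not part of the article.

```python
import math


def gamma_order(x0, u, v, T):
    """Approximate |x0|^2 + int_0^T (u^2 + y^2) dt + y(T)^2 for scalar u and v.

    u and v are lists of samples on a uniform grid of [0, T]; y is the primitive
    of v with y(0) = 0, computed by the trapezoidal rule (assumed form of gamma).
    """
    n = len(v) - 1
    dt = T / n
    y = [0.0]
    for k in range(n):
        y.append(y[-1] + 0.5 * dt * (v[k] + v[k + 1]))
    integrand = [u[k] ** 2 + y[k] ** 2 for k in range(n + 1)]
    integral = sum(0.5 * dt * (integrand[k] + integrand[k + 1]) for k in range(n))
    return sum(c * c for c in x0) + integral + y[-1] ** 2


# Hypothetical data: zero nonlinear control and a rapidly oscillating affine control.
T, n, freq = 1.0, 20000, 10
grid = [T * k / n for k in range(n + 1)]
v_osc = [math.cos(2 * math.pi * freq * t) for t in grid]
u_zero = [0.0] * (n + 1)

gamma_value = gamma_order([0.0], u_zero, v_osc, T)
l2_norm_sq = sum(0.5 * (T / n) * (v_osc[k] ** 2 + v_osc[k + 1] ** 2) for k in range(n))
```

In this example the squared $L^{2}$-norm of the oscillating control equals $1/2,$ while the assumed $\gamma$ value is $1/(800\pi^{2})\approx 1.27\times 10^{-4},$ which illustrates why growth with respect to $\gamma$ is a weaker requirement than growth with respect to the $L^{2}$-norm of $\bar{v}.$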

*Definition 6.1**.*

[$\gamma$-growth] We say that $\hat{w}(\cdot)$ satisfies the $\gamma$-growth condition in the weak sense if there exist $\varepsilon,\rho>0$ such that

[TABLE]

for every feasible trajectory $w(\cdot)$ with $\|w(\cdot)-\hat{w}(\cdot)\|_{\infty}<\varepsilon.$

Theorem 6.2 (Sufficient condition for weak optimality).

(i)

Assume that there exists $\rho>0$ such that

[TABLE]

Then $\hat{w}(\cdot)$ is a weak minimum satisfying $\gamma$-growth in the weak sense.

(ii)

Conversely, if $\hat{w}(\cdot)$ is a weak solution satisfying $\gamma$-growth in the weak sense and such that $\alpha_{0}>0$ for every $\lambda\in G(\mathrm{co}\,\Lambda)^{\#},$ then (55) holds for some positive $\rho.$

In the absence of the nonlinear control $u,$ Theorem 6.2 was proved in Dmitruk [17]. In Aronna et al. [6] the same result was shown for the case of scalar control subject to bounds.

As a consequence of Theorem 6.2 and standard results on positive quadratic mappings due to Hestenes [30] we get the following pointwise condition.

Corollary 6.3.

If $\hat{w}(\cdot)$ satisfies the uniform positivity in (55) and it has a unique associated multiplier, then the matrix in (48) is uniformly positive definite, i.e.

[TABLE]

where $I$ refers to the identity matrix.

*Remark 6.4**.*

Under suitable hypotheses, Goh in [26] proved that the strengthened generalized Legendre-Clebsch condition is a consequence of the uniform positivity in (55) (see Goh [26, Section 4.8] and Aronna [5, Remark 8.2]). Thus, in that situation, the controls can be expressed as smooth functions of the state and costate variable, as was assumed here.

The remainder of this section is devoted to the proof of Theorem 6.2. Several technical lemmas used in the proof are stated and proved in Appendix B.

Proof of Theorem 6.2. (i) We shall prove that if (55) holds for some $\rho>0,$ then $\hat{w}(\cdot)$ satisfies $\gamma$-growth in the weak sense. Arguing by contradiction, let us assume that the $\gamma$-growth condition (54) is not satisfied. Consequently, there exists a sequence of feasible trajectories $\{w_{k}(\cdot)=(x_{k}(\cdot),u_{k}(\cdot),v_{k}(\cdot))\}$ converging to $\hat{w}(\cdot)$ in the weak sense, such that

[TABLE]

with

[TABLE]

Let $(\bar{\xi}_{k}(\cdot),\bar{u}_{k}(\cdot),\bar{y}_{k}(\cdot))$ be the transformed directions defined by Goh’s transformation (32). We divide the remainder of the proof of item (i) into the following two steps:

(A)

First we prove that the sequence given by

[TABLE]

where $\bar{h}_{k}:=\bar{y}_{k}(T),$ contains a weakly convergent subsequence whose weak limit is an element

$(\mathring{\xi}(\cdot),\mathring{u}(\cdot),\mathring{y}(\cdot),\mathring{h})$ of $\mathcal{P}_{2}.$

(B)

Afterwards, making use of the latter sequence and its weak limit, we show that the uniform positivity hypothesis (55) together with (56) lead to a contradiction.

We begin with Part (A). For this we take an arbitrary Lagrange multiplier $\lambda$ in $(\mathrm{co}\,\Lambda)^{\#}.$ By multiplying the inequality (56) by $\alpha_{0},$ and adding the nonpositive term

[TABLE]

to its left-hand side, we get

[TABLE]

Note that the elements of the sequence $(\mathring{\xi}_{k}(0),\mathring{u}_{k}(\cdot),\mathring{y}_{k}(\cdot),\mathring{h}_{k})$ have unit $\mathbb{R}^{n}\times\mathcal{U}_{2}\times\mathcal{V}_{2}\times\mathbb{R}^{m}$ -norm. The Banach-Alaoglu Theorem (see e.g. Brézis [14, Theorem III.15]) implies that, extracting if necessary a subsequence, there exists $(\mathring{\xi}(0),\mathring{u}(\cdot),\mathring{y}(\cdot),\mathring{h})\in\mathbb{R}^{n}\times\mathcal{U}_{2}\times\mathcal{V}_{2}\times\mathbb{R}^{m}$ such that

[TABLE]

where the two limits indicated with $\rightharpoonup$ are considered in the weak topology of $\mathcal{U}_{2}$ and $\mathcal{V}_{2},$ respectively. Let $\mathring{\xi}(\cdot)$ denote the solution of the equation (33) associated with $(\mathring{\xi}(0),\mathring{u}(\cdot),\mathring{y}(\cdot)).$ Hence, it follows easily that $\mathring{\xi}(\cdot)$ is the limit of $\mathring{\xi}_{k}(\cdot)$ in (the strong topology of) $\mathcal{X}_{2}.$

With the aim of proving that $(\mathring{\xi}(\cdot),\mathring{u}(\cdot),\mathring{y}(\cdot),\mathring{h})$ belongs to $\mathcal{P}_{2},$ it remains to check that the linearized endpoint constraints (35)-(36) are verified. Observe that, for each index $0\leq i\leq d_{\varphi},$ one has

[TABLE]

In order to prove that the right-hand side of (60) is nonpositive, we consider the following first order Taylor expansion of $\varphi_{i}$ around $(\hat{x}(0),\hat{x}(T)):$

[TABLE]

The previous equation and Lemmas B.2 and B.4 imply

[TABLE]

Thus, the following approximation for the right-hand side of (60) holds,

[TABLE]

Since $w_{k}(\cdot)$ is a feasible trajectory, it satisfies the final inequality constraint (4) and, therefore, equations (60) and (61) yield, for $1\leq i\leq d_{\varphi},$

[TABLE]

Now, for $i=0,$ use (56) to get the corresponding inequality. Analogously, one has

[TABLE]

Thus $(\mathring{\xi}(\cdot),\mathring{u}(\cdot),\mathring{y}(\cdot),\mathring{h})$ satisfies (35)-(36), and hence it belongs to $\mathcal{P}_{2}.$

Let us now pass to Part (B). Notice that from the expansion of $\mathcal{L}$ given in (103) of Lemma B.5, and the inequality (58) we get

[TABLE]

and thus

[TABLE]

Let us consider the subset of $G({\rm co}\,\Lambda)^{\#}$ defined by

[TABLE]

By applying Lemma 3.11 to the inequality of uniform positivity (55) one gets

[TABLE]

Let us take the multiplier $\mathring{\lambda}\in\Lambda^{\#,\rho}$ that attains the maximum in (66) for the direction $(\mathring{\xi}(\cdot),\mathring{u}(\cdot),\mathring{y}(\cdot),\mathring{h})$ of $\mathcal{P}_{2}.$ We get

[TABLE]

since $\Omega_{\mathcal{P}_{2}}[\mathring{\lambda}]-\rho\gamma_{\mathcal{P}}$ is weakly lower semicontinuous, $\gamma_{\mathcal{P}}(\mathring{\xi}_{k}(0),\mathring{u}_{k},\mathring{y}_{k},\mathring{h}_{k})=1$ for every $k$ and inequality (64) holds. This leads us to a contradiction since $\rho>0.$ Therefore, the desired result follows, that is, the uniform positivity (55) implies strict weak optimality with $\gamma$-growth.

(ii) Let us now prove the second statement of the theorem. Assume that $\hat{w}(\cdot)$ is a weak solution satisfying $\gamma$-growth in the weak sense for some constant $\rho^{\prime}>0,$ and such that $\alpha_{0}>0$ for every multiplier $\lambda\in G({\rm co}\,\Lambda)^{\#}.$ Let us consider the modified problem

[TABLE]

and rewrite it in the Mayer form

[TABLE]

We will next apply the second order necessary condition of Theorem 5.3 to ($\breve{P}$) at the point $(w(\cdot)=\hat{w}(\cdot),y(\cdot)=\hat{y}(\cdot),\pi_{1}(\cdot)\equiv 0,\pi_{2}(\cdot)\equiv 0).$ Simple computations show that at this solution each critical cone (see (37)) is the projection of the corresponding critical cone of ($\breve{P}$), and that the same holds for the set of multipliers. Furthermore, the second variation of ($\breve{P}$) evaluated at a multiplier $\breve{\lambda}\in G({\rm co}\,\breve{\Lambda})^{\#}$ is given by

[TABLE]

where $\lambda\in G({\rm co}\,{\Lambda})^{\#}$ is the corresponding multiplier for problem (37). Hence, the necessary condition in Theorem 5.3 (see Remark 6.5 below) implies that for every $(\bar{\xi}(\cdot),\bar{u}(\cdot),\bar{y}(\cdot),\bar{h})\in\mathcal{P}_{2},$ there exists $\lambda\in G({\rm co}\,{\Lambda})^{\#}$ such that

[TABLE]

Setting $\displaystyle\rho:=\min_{G({\rm co}\,{\Lambda})^{\#}}\alpha_{0}\rho^{\prime}>0$ the desired result follows. This completes the proof of the theorem. $\square$

*Remark 6.5**.*

Since the dynamics of ($\breve{P}$) are not autonomous, what we applied above is an extension of Theorem 5.3 to time-dependent dynamics. The latter follows easily by adding a state variable $\kappa$ with dynamics $\dot{\kappa}=1$ and $\kappa(0)=0.$
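The state-augmentation device of this remark can be sketched in a few lines of Python. The helper names `make_autonomous` and `euler`, the explicit Euler scheme and the example dynamics are assumptions made only for illustration; the sketch shows how a time-dependent right-hand side becomes autonomous once the clock $\kappa$ is treated as an additional state.

```python
def make_autonomous(f_time_dependent):
    """Turn x' = f(t, x) into z' = g(z), where z = (x, kappa) and kappa' = 1."""
    def g(z):
        x, kappa = z[:-1], z[-1]
        # The last component is the clock: kappa' = 1, so kappa(t) = t when kappa(0) = 0.
        return f_time_dependent(kappa, x) + [1.0]
    return g


def euler(g, z0, T, steps):
    """Explicit Euler integration of the autonomous system z' = g(z) on [0, T]."""
    z = list(z0)
    dt = T / steps
    for _ in range(steps):
        dz = g(z)
        z = [zi + dt * dzi for zi, dzi in zip(z, dz)]
    return z


# Hypothetical time-dependent dynamics x' = 2 t, with exact solution x(t) = x(0) + t^2.
def f_example(t, x):
    return [2.0 * t]


g_example = make_autonomous(f_example)
z_final = euler(g_example, [0.0, 0.0], 1.0, 1000)  # initial condition kappa(0) = 0
```

The last component of `z_final` recovers the final time, and the first component approximates $x(1)=1$ up to the discretization error of the Euler scheme.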

7. Example

We consider the following example from Dmitruk-Shishov [22]:

[TABLE]

Let us use $p_{1},p_{2},p_{3}$ to denote the costate variables associated to (PE). Observe that $\dot{p}_{3}(\cdot)\equiv 0$ and $p_{3}(T)=1,$ thus $p_{3}(\cdot)\equiv 1.$ Note as well that the linearized state equation implies $\dot{\bar{x}}_{2}=\bar{v},\,\bar{x}_{1}(0)=\bar{x}_{2}(0)=\bar{x}_{3}(0)=0.$ Consequently, $\bar{y}(\cdot)=\bar{x}_{2}(\cdot),$ $\bar{\xi}_{1}(0)=\bar{\xi}_{2}(0)=\bar{\xi}_{3}(0)=0,$ and

[TABLE]

where the first equality follows from Goh’s transformation (32).

Recalling the definitions given in (40)-(43), the second variation $\Omega_{\mathcal{P}_{2}}$ (defined in (49)) on the critical cone $\mathcal{P}_{2}$ of (PE) gives:

[TABLE]

We see that $\Omega_{\mathcal{P}_{2}}$ verifies the sufficient condition (55). We should now look for a feasible solution that verifies the first order optimality conditions.

In Aronna [4] we used the shooting algorithm to solve problem (PE) numerically. The numerical tests converged to the optimal solution $(\hat{u},\hat{v})(\cdot)\equiv 0$ for arbitrary guesses of the initial values of the costate variables. It is immediate to check that $\hat{w}(\cdot)\equiv 0$ is a feasible trajectory that verifies the first order optimality conditions. Since the second variation at this $\hat{w}$ verifies the sufficient condition of Theorem 6.2, we conclude that $\hat{w}(\cdot)$ is a strict weak optimal trajectory that satisfies $\gamma$-growth.

8. Conclusion and possible extensions

We studied optimal control problems in the Mayer form governed by systems that are affine in some components of the control variable. A set of ‘no gap’ necessary and sufficient second order optimality conditions was provided. These conditions apply to a weak minimum, consider fairly general endpoint constraints and do not assume uniqueness of multiplier. We further derived the Goh conditions when we assume uniqueness of multiplier.

The main result of the article is Theorem 6.2. The interest of this result is that it can be applied either to prove optimality of some candidate solution of a given problem, or to show convergence of an associated shooting algorithm, as stated in Aronna [4] and proved in detail in the technical report Aronna [5]. This algorithm and its proof of convergence apply also to partially-affine problems with bounds on the control and bang-singular solutions, and hence its convergence has strong practical interest.

The results presented here admit many interesting extensions. One of the most important is the derivation of optimality conditions for bang-singular solutions of problems containing closed control constraints.

Acknowledgments

Part of this work was done during my Ph.D. under the supervision of Frédéric Bonnans, whom I thank for his great guidance.

I also acknowledge the anonymous referee for his careful reading and useful remarks.

Appendix A Proofs of technical results

We include in this part the proofs that were omitted throughout the article.

A.1.

Proof of Lemma 3.1. We shall omit the dependence on $\lambda$ for the sake of simplicity of notation. Let us consider the following second order Taylor expansions, written in a compact form,

[TABLE]

Observe that, in view of the transversality conditions (10) and the costate equation (9), one has

[TABLE]

In the definition of $\mathcal{L}$ given in (6), replace $\ell(x(0),x(T))$ and $f_{i}(x,u)$ by their Taylor expansions (70)-(71) and use the identity (72). This yields

[TABLE]

Finally, to obtain (19), remove the first order terms by the stationarity conditions (11), and use the Cauchy-Schwarz inequality in the last integral. This completes the proof. $\square$

A.2. Proof of Theorem 3.3

Let us write problem (P) in an abstract form defining, for $j=1,\dots,d_{\eta}$ and $i=0,\dots,d_{\varphi},$

[TABLE]

where $x(\cdot)\in\mathcal{X}$ is the solution of (2) associated with $(x(0),u(\cdot),v(\cdot)).$ Hence, (P) can be written as the following problem in the space $\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V},$

[TABLE]

Notice that if $\hat{w}(\cdot)$ is a weak solution of (P) then $(\hat{x}(0),\hat{u}(\cdot),\hat{v}(\cdot))$ is a local solution of (AP).

*Definition A.1**.*

We say that the endpoint equality constraints are qualified if

[TABLE]

When (73) does not hold, we say that the constraints are not qualified, or unqualified.

The proof of Theorem 3.3 is divided into two cases: qualified and not qualified endpoint equality constraints. In the latter case the condition (20) follows easily, as shown in Lemma A.2 below. The proof for the qualified case is done by means of an auxiliary linear problem and duality arguments.

Lemma A.2.

If the equality constraints are not qualified then (20) holds.

Proof.

Observe that since $D\bar{\eta}(\hat{x}(0),\hat{u},\hat{v})$ is not onto there exists $\beta\in\mathbb{R}^{d_{\eta},*}$ with $|\beta|=1$ such that $\sum_{j=1}^{d_{\eta}}\beta_{j}D\bar{\eta}_{j}(\hat{x}(0),\hat{u},\hat{v})=0$ and consequently,

[TABLE]

Set $\lambda:=(p(\cdot),\alpha,\beta)$ with $p(\cdot)\equiv 0$ and $\alpha=0.$ Then both $\lambda$ and $-\lambda$ are in $\Lambda.$ Observe that

[TABLE]

Thus, either $\Omega[\lambda](\bar{x},\bar{u},\bar{v})$ or $\Omega[-\lambda](\bar{x},\bar{u},\bar{v})$ is necessarily nonnegative. The desired result follows. ∎

Let us now deal with the qualified case. Take a critical direction $\bar{w}(\cdot)=(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{C}$ and consider the problem in the variables $\tau\in\mathbb{R}$ and $r=(r_{x_{0}},r_{u},r_{v})\in\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V}$ given by

[TABLE]

Proposition A.3.

Assume that $\hat{w}(\cdot)$ is a weak solution of (AP) for which the endpoint equality constraints are qualified. Let $\bar{w}(\cdot)\in\mathcal{C}$ be a critical direction. Then the problem (QP$_{\bar{w}}$) is feasible and has nonnegative value.

Proof of Proposition A.3. Step I. Let us first show feasibility. Since $D\bar{\eta}(\hat{x}(0),\hat{u},\hat{v})$ is onto, there exists $r\in\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V}$ for which the equality constraint in (QP$_{\bar{w}}$) is satisfied. Set

[TABLE]

Then $(\tau,r)$ is feasible for (QP$_{\bar{w}}$).

Step II. Let us now prove that (QP$_{\bar{w}}$) has nonnegative value. Suppose on the contrary that there is $(\tau,r)\in\mathbb{R}\times\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V}$ feasible for (QP$_{\bar{w}}$) with $\tau<0.$ We shall look for a family of feasible solutions of (AP), referred to as $\{r(\sigma)\}_{\sigma},$ with the following properties: it is defined for small positive values of $\sigma$ and it satisfies

[TABLE]

The existence of such a family $\{r(\sigma)\}_{\sigma}$ will contradict the local optimality of $(\hat{x}(0),\hat{u},\hat{v}).$ Consider hence

[TABLE]

Let $0\leq i\leq d_{\varphi}$ and observe that

[TABLE]

where the last inequality holds since $(\bar{x},\bar{u},\bar{v})(\cdot)$ is a critical direction and in view of the definition of $\tau$ in (74). Analogously, one has

[TABLE]

Since $D\bar{\eta}(\hat{x}(0),\hat{u},\hat{v})$ is onto, there exists $r(\sigma)\in\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V}$ such that $\|r(\sigma)-\tilde{r}(\sigma)\|_{\infty}=o(\sigma^{2})$ and $\bar{\eta}(r(\sigma))=0.$ This follows by applying the Implicit Function Theorem to the mapping

[TABLE]

On the other hand, by taking $\sigma$ sufficiently small in estimate (76), we obtain

[TABLE]

since $\tau<0.$ Hence $r(\sigma)$ is feasible for (AP) and verifies (75). This contradicts the optimality of $(\hat{x}(0),\hat{u},\hat{v}).$ We conclude then that all the feasible solutions of (QP$_{\bar{w}}$) have $\tau\geq 0$ and, therefore, its value is nonnegative.

$\square$

We shall now proceed to prove Theorem 3.3.

Proof of Theorem 3.3. The unqualified case is covered by Lemma A.2 above. Hence, for this proof, assume that (73) holds.

Given $\bar{w}(\cdot)\in\mathcal{C},$ note that (QP$_{\bar{w}}$) can be regarded as a linear problem in the variables $(\tau,r),$ whose associated dual is given by

[TABLE]

Proposition A.3 above and the linear duality result Bonnans [12, Theorem 3.43] imply that (77)-(79) has finite nonnegative value (the reader is referred to Shapiro [45] and references therein for a general theory on linear duality). Consequently, there exists a feasible solution $(\bar{\alpha},\bar{\beta})\in\mathbb{R}^{d_{\varphi}+d_{\eta}+1}$ to (77)-(79), with associated nonnegative and finite value. Set $(\alpha,\beta):=(\bar{\alpha},\bar{\beta})/(\sum_{i=0}^{d_{\varphi}}|\bar{\alpha}_{i}|+\sum_{j=1}^{d_{\eta}}|\bar{\beta}_{j}|),$ where the denominator is not zero in view of (79). We get that $(\alpha,\beta)\in\mathbb{R}^{d_{\varphi}+d_{\eta}+1}$ verifies (7)-(8), (78) and

[TABLE]

For this $(\alpha,\beta),$ let $p(\cdot)$ be the solution of (9) with final condition

[TABLE]

We shall prove that $\lambda:=(\alpha,\beta,p(\cdot))$ is in $\Lambda,$ i.e. that also the first line in (10) and the stationarity conditions (11) hold. Let $(\tilde{x},\tilde{u},\tilde{v})(\cdot)\in\mathcal{W}$ be the solution of the linearized state equation (12). In view of (78),

[TABLE]

Hence, rewriting in terms of the endpoint Lagrangian $\ell$ and using (81)-(82), one has

[TABLE]

By regrouping terms in the previous equation, we get

[TABLE]

where we used (9) and (12) in the last equality. Since (83) holds for all $(\tilde{x}(0),\tilde{u}(\cdot),\tilde{v}(\cdot))$ in $\mathbb{R}^{n}\times\mathcal{U}\times\mathcal{V},$ the first line in (10) and the stationarity conditions in (11) are necessarily verified. Thus, $\lambda$ is an element of $\Lambda.$ On the other hand, simple computations yield that (80) is equivalent to

[TABLE]

and, therefore, the result follows.

$\square$

A.3.

Proof of Lemma 4.5. First recall that the term $\bar{v}^{\top}H_{vu}[\lambda]\bar{u}$ in $\Omega[\lambda]$ vanishes since we are taking $\lambda\in\Lambda^{\#}$ and, in view of Lemma 3.5, $H_{vu}[\lambda]\equiv 0.$ In the remainder of the proof we omit the dependence on $\lambda$ for the sake of simplicity. Replacing $\bar{x}$ in the definition of $\Omega$ in equation (18) by its expression in (32) yields

[TABLE]

In view of (33) one gets

[TABLE]

The decomposition of $H_{vx}\,F_{v}$ introduced in (41) followed by an integration by parts leads to

[TABLE]

The result follows by substituting (85) and (86) into (84).

$\square$

Appendix B Technical lemmas used in the proof of the main Theorem 6.2

Recall first the following classical result for ordinary differential equations.

Lemma B.1 (Gronwall’s Lemma).

Let $a(\cdot)\in W^{1,1}(0,T;\mathbb{R}^{n}),$ $b(\cdot)\in L^{1}(0,T)$ and $c(\cdot)\in L^{1}(0,T)$ be such that $|\dot{a}(t)|\leq b(t)+c(t)|a(t)|$ for a.a. $t\in(0,T).$ Then

[TABLE]
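As a quick sanity check of this type of estimate, the Python sketch below compares the supremum of an explicit solution with a Gronwall bound. The displayed inequality of the lemma is not reproduced here; the sketch assumes the standard form $\|a\|_{\infty}\leq e^{\|c\|_{1}}\left(|a(0)|+\|b\|_{1}\right)$ for nonnegative $b$ and $c,$ and the scalar equation $\dot{a}=b+ca$ with constant coefficients is a hypothetical test case.

```python
import math


def gronwall_bound(a0, b_l1, c_l1):
    """Assumed standard bound: sup|a| <= exp(||c||_1) * (|a(0)| + ||b||_1)."""
    return math.exp(c_l1) * (abs(a0) + b_l1)


def solution(t, a0, b, c):
    """Closed-form solution of a' = b + c a with constants b >= 0 and c > 0."""
    return a0 * math.exp(c * t) + (b / c) * (math.exp(c * t) - 1.0)


# Hypothetical data: constant coefficients on [0, T], so ||b||_1 = b T and ||c||_1 = c T.
a0, b, c, T = 1.0, 0.5, 2.0, 1.0
sup_a = max(abs(solution(T * k / 1000, a0, b, c)) for k in range(1001))
bound = gronwall_bound(a0, b * T, c * T)
```

Here the solution satisfies $|\dot{a}|=b+c|a|$ exactly, and its supremum stays below the assumed bound, as expected.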

For the lemma below recall the definition of the space $\mathcal{H}_{2}$ given in (22).

Lemma B.2.

There exists $\rho>0$ such that

[TABLE]

for every linearized trajectory $(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{H}_{2}.$ The constant $\rho$ depends on $\|A\|_{\infty},$ $\|F_{v}\|_{\infty},$ $\|E\|_{\infty}$ and $\|B\|_{\infty}.$

Proof.

Throughout this proof, whenever we write $\rho_{i}$ we refer to a positive constant depending on $\|A\|_{\infty},$ $\|F_{v}\|_{\infty},$ $\|E\|_{\infty},$ and/or $\|B\|_{\infty}.$ Let $(\bar{x},\bar{u},\bar{v})(\cdot)\in\mathcal{H}_{2}$ and $(\bar{\xi},\bar{y})(\cdot)$ be defined by Goh’s transformation (32). Thus $(\bar{\xi},\bar{u},\bar{y})(\cdot)$ is a solution of (33). Gronwall’s Lemma B.1 and the Cauchy-Schwarz inequality yield

[TABLE]

with $\rho_{1}=\rho_{1}(\|A\|_{1},\|E\|_{\infty},\|B\|_{\infty}).$ This last inequality together with the relation between $\bar{\xi}(\cdot)$ and $\bar{x}(\cdot)$ provided by (32) imply

[TABLE]

for $\rho_{2}=\rho_{2}(\rho_{1},\|F_{v}\|_{\infty}).$ On the other hand, (32) and estimate (88) lead to

[TABLE]

Then, in view of Young’s inequality ‘ $2{ab}\leq{a^{2}+b^{2}}$ ’ for real numbers $a,b,$ one gets

[TABLE]

for some $\rho_{3}=\rho_{3}(\rho_{1},\|F_{v}\|_{\infty}).$ The desired estimate follows from (89) and (90). ∎

Notice that Lemma B.2 above gives an estimate of the linearized state in the order $\gamma.$ The following result shows that the analogous property holds for the variation of the state variable as well and it is a natural extension of a similar result given in Dmitruk [19] for control-affine systems.

Lemma B.3.

Given $C>0,$ there exists $\rho>0$ such that

[TABLE]

for every $(x,u,v)(\cdot)$ solution of the state equation (2) having $\|v(\cdot)\|_{2}\leq C,$ and where $\delta w(\cdot):=w(\cdot)-\hat{w}(\cdot).$ The constant $\rho$ depends on $C,$ $\|B\|_{\infty},$ $\|\dot{B}\|_{\infty}$ and the Lipschitz constants of $f_{i}.$

Proof.

In order to simplify the notation we omit the dependence on $t.$ Consider a solution $(x,u,v)(\cdot)$ of (2) with $\|v(\cdot)\|_{2}\leq C.$ Let $\delta w(\cdot):=w(\cdot)-\hat{w}(\cdot),$ $\delta y(t):=\int_{0}^{t}\delta v(s){\rm d}s,$ and $\xi(\cdot):=\delta x(\cdot)-B[\cdot]\delta y(\cdot),$ with $y(t):=\int_{0}^{t}v(s)\mathrm{d}s.$ Note that

[TABLE]

In view of the Lipschitz-continuity of $f_{i},$

[TABLE]

for some $L>0.$ Thus, from (92) it follows

[TABLE]

Applying Gronwall’s Lemma B.1 one gets

[TABLE]

and the Cauchy-Schwarz inequality applied to the previous estimate yields

[TABLE]

for $\rho_{1}=\rho_{1}(L,C,\|F_{v}\|_{\infty},\|\dot{F}_{v}\|_{\infty}).$ Since $\|\delta x\|_{2}\leq\|\xi\|_{2}+\|F_{v}\|_{\infty}\|\delta y\|_{2},$ by the previous estimate and the Cauchy-Schwarz inequality, the result follows. ∎

Finally, the following lemma gives an estimate for the difference between the variation of the state variable and the linearized state.

Lemma B.4.

Consider $C>0$ and $w(\cdot)=(x,u,v)(\cdot)\in\mathcal{W}$ a trajectory with $\|w(\cdot)-\hat{w}(\cdot)\|_{\infty}\leq C.$ Set $(\delta x,\delta u,\delta v)(\cdot):=w(\cdot)-\hat{w}(\cdot)$ and let $\bar{x}(\cdot)$ be the linearization of $\hat{x}(\cdot)$ associated with $(\delta x,\delta u,\delta v)(\cdot).$ Define

[TABLE]

Then, $\vartheta(\cdot)$ is a solution of the differential equation

[TABLE]

where the remainder $\zeta(\cdot)$ is given by

[TABLE]

and $L$ is a Lipschitz constant for $D^{2}f_{i},$ uniformly in $i=0,\dots,m.$ Furthermore, $\zeta(\cdot)$ satisfies the estimates

[TABLE]

where $\rho_{1}=\rho_{1}(C,\|D^{2}f\|_{\infty},L,\|v\|_{\infty}+1).$

If in addition, $C\rightarrow 0,$ the following estimates for $\vartheta(\cdot)$ hold

[TABLE]

Proof.

We shall note first that

[TABLE]

Consider the following second order Taylor expansions for $f_{i},$

[TABLE]

Combining (100) and (101) yields

[TABLE]

with the remainder being given by (97). The linearized equation (12) together with (102) lead to (96). In view of (97) and Lemma B.3, it can be seen that the estimates in (98) hold.

On the other hand, applying Gronwall’s Lemma B.1 to (96) and then using the Cauchy-Schwarz inequality leads to

[TABLE]

for some positive $\rho_{3},\rho_{4}$ depending on $\|\hat{v}\|_{\infty}$ and $\|Df\|_{\infty}.$ Finally, using the estimate in Lemma B.3 and (98) just obtained, the inequalities in (99) follow. ∎

In view of Lemmas 3.1, B.2, B.3 and B.4 we can justify the following technical result that is an essential point in the proof of the sufficient condition of Theorem 6.2.

Lemma B.5.

Let $w(\cdot)\in\mathcal{W}$ be a trajectory. Set $(\delta x,\delta u,\delta v)(\cdot):=w(\cdot)-\hat{w}(\cdot),$ and $\bar{x}(\cdot)$ its corresponding linearized state, i.e. the solution of (12)-(13) associated with $(\delta x(0),\delta u(\cdot),\delta v(\cdot)).$ Assume that $\|w(\cdot)-\hat{w}(\cdot)\|_{\infty}\rightarrow 0.$ Then

[TABLE]

for every $\lambda\in{\rm co}\,\Lambda.$

Proof.

For the sake of simplicity of notation, we shall omit the dependence on $\lambda.$

Let us recall the expansion of the Lagrangian function given in Lemma 3.1, and observe that it also holds for any $\lambda$ in ${\rm co}\,\Lambda.$ Next, notice that, by Lemma B.3, $\mathcal{L}(w)=\mathcal{L}(\hat{w})+\Omega(\delta x,\delta u,\delta v)+o(\gamma).$ Hence,

[TABLE]

with $\Delta\Omega:=\Omega(\delta x,\delta u,\delta v)-\Omega(\bar{x},\delta u,\delta v).$ The next step is using Lemmas B.2, B.3 and B.4 to prove that

[TABLE]

Note that $\mathcal{Q}(a,a)-\mathcal{Q}(b,b)=\mathcal{Q}(a+b,a-b),$ for any symmetric bilinear mapping $\mathcal{Q},$ and any pair $a,b$ of elements in its domain. Set $\vartheta(\cdot):=\delta x(\cdot)-\bar{x}(\cdot)$ as it is done in Lemma B.4. Hence,

[TABLE]
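The identity for $\mathcal{Q}$ used here can be checked by expanding the right-hand side with bilinearity; the cross terms cancel when $\mathcal{Q}$ is symmetric. A short verification, given as a sketch:

```latex
\begin{aligned}
\mathcal{Q}(a+b,\,a-b)
&= \mathcal{Q}(a,a)-\mathcal{Q}(a,b)+\mathcal{Q}(b,a)-\mathcal{Q}(b,b)\\
&= \mathcal{Q}(a,a)-\mathcal{Q}(b,b)
\qquad\text{whenever } \mathcal{Q}(a,b)=\mathcal{Q}(b,a).
\end{aligned}
```

With $a=\delta x$ and $b=\bar{x}$ one has $a-b=\vartheta$ and $a+b=\delta x+\bar{x},$ so each quadratic term in the state contributes a term that is linear in $\vartheta.$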

The estimates in Lemmas B.2, B.3 and B.4 yield $\Delta\Omega=\int_{0}^{T}\delta v^{\top}C\vartheta\mathrm{d}t+o(\gamma).$ Integrating by parts in the latter expression and using (99) leads to

[TABLE]

and hence the desired result follows. ∎
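The integration by parts invoked at the end of the proof has the following generic shape. This is a sketch under assumptions: $y(t):=\int_{0}^{t}\delta v(s)\,\mathrm{d}s$ is a label introduced here for the primitive of $\delta v,$ and $C$ is assumed to be absolutely continuous in time, so that $\dot{C}$ is defined almost everywhere.

```latex
% Integration by parts with \dot{y} = \delta v and y(0) = 0 (y is a label assumed here):
\int_0^T \delta v^{\top} C\,\vartheta\,\mathrm{d}t
= y(T)^{\top} C(T)\,\vartheta(T)
- \int_0^T y^{\top}\big(\dot{C}\,\vartheta + C\,\dot{\vartheta}\big)\,\mathrm{d}t .
```

Every term on the right-hand side pairs $y$ with $\vartheta$ or $\dot{\vartheta},$ which is the form to which estimates on $\vartheta$ such as those in (99) can be applied.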

