Necessary Optimality Conditions For Average Cost Minimization Problems

Piernicola Bettiol; Nathalie Khalil

arXiv:1901.04213·math.OC·January 15, 2019

Necessary Optimality Conditions For Average Cost Minimization Problems

Piernicola Bettiol, Nathalie Khalil

PDF

TL;DR

This paper establishes necessary optimality conditions for average cost control problems with unknown parameters affecting dynamics, cost, and constraints, accommodating non-compact parameter spaces, thus broadening the theoretical framework for uncertain control systems.

Contribution

It introduces new necessary optimality conditions for control problems with unknown parameters in a general metric space, extending previous results to non-compact parameter sets.

Findings

01

Derived necessary conditions for optimality in average cost problems.

02

Extended the theory to include non-compact parameter spaces.

03

Applicable to control systems with uncertainties in dynamics and constraints.

Abstract

Control systems involving unknown parameters appear a natural framework for applications in which the model design has to take into account various uncertainties. In these circumstances the performance criterion can be given in terms of an average cost, providing a paradigm which differs from the more traditional minimax or robust optimization criteria. In this paper, we provide necessary optimality conditions for a nonrestrictive class of optimal control problems in which unknown parameters intervene in the dynamics, the cost function and the right end-point constraint. An important feature of our results is that we allow the unknown parameters belonging to a mere complete separable metric space (not necessarily compact).

Equations384

⎩ ⎨ ⎧ minimize J_{Ω} ((u (.), {x (., ω)})) := \int_{Ω} g (x (T, ω); ω) d μ (ω) over measurable functions u : [0, T] \to R^{m} and W^{1, 1} arcs {x (., ω) : [0, T] \to R^{n} ∣ ω \in Ω} such that u (t) \in U (t) a.e. t \in [0, T] and, for each ω \in Ω, \overset{x}{˙} (t, ω) = f (t, x (t, ω), u (t), ω) a.e. t \in [0, T], x (0, ω) = x_{0} and \int_{Ω} d_{C (ω)} (x (T, ω)) d μ (ω) = 0.

⎩ ⎨ ⎧ minimize J_{Ω} ((u (.), {x (., ω)})) := \int_{Ω} g (x (T, ω); ω) d μ (ω) over measurable functions u : [0, T] \to R^{m} and W^{1, 1} arcs {x (., ω) : [0, T] \to R^{n} ∣ ω \in Ω} such that u (t) \in U (t) a.e. t \in [0, T] and, for each ω \in Ω, \overset{x}{˙} (t, ω) = f (t, x (t, ω), u (t), ω) a.e. t \in [0, T], x (0, ω) = x_{0} and \int_{Ω} d_{C (ω)} (x (T, ω)) d μ (ω) = 0.

u (t) \in U (t) a.e. t \in [0, T]

u (t) \in U (t) a.e. t \in [0, T]

\overset{x}{˙} (t, ω) = f (t, x (t, ω), u (t), ω) a.e. t \in [0, T], x (0, ω) = x_{0} .

\overset{x}{˙} (t, ω) = f (t, x (t, ω), u (t), ω) a.e. t \in [0, T], x (0, ω) = x_{0} .

\int_{Ω} d_{C (ω)} (x (T, ω)) d μ (ω) = 0 .

\int_{Ω} d_{C (ω)} (x (T, ω)) d μ (ω) = 0 .

\int_{Ω} g (\overset{x}{ˉ} (T, ω); ω) d μ (ω) \leq \int_{Ω} g (x (T, ω); ω) d μ (ω)

\int_{Ω} g (\overset{x}{ˉ} (T, ω); ω) d μ (ω) \leq \int_{Ω} g (x (T, ω); ω) d μ (ω)

∥ \overset{x}{ˉ} (., ω) - x (., ω) ∥_{W^{1, 1}} \leq ϵ for all ω \in supp (μ) .

∥ \overset{x}{ˉ} (., ω) - x (., ω) ∥_{W^{1, 1}} \leq ϵ for all ω \in supp (μ) .

∣ f (t, x, u, ω) - f (t, x^{'}, u, ω) ∣ \leq k_{f} (t, u) ∣ x - x^{'} ∣

∣ f (t, x, u, ω) - f (t, x^{'}, u, ω) ∣ \leq k_{f} (t, u) ∣ x - x^{'} ∣

μ = j = 1 \sum N α_{j} δ_{ω_{j}}, j = 1 \sum N α_{j} = 1, α_{j} \in (0, 1] .

μ = j = 1 \sum N α_{j} δ_{ω_{j}}, j = 1 \sum N α_{j} = 1, α_{j} \in (0, 1] .

j = 1 \sum N α_{j} g (x (T, ω_{j}); ω_{j}),

j = 1 \sum N α_{j} g (x (T, ω_{j}); ω_{j}),

⎩ ⎨ ⎧ minimize j = 1 \sum N α_{j} g (x (T, ω_{j}); ω_{j}) over controls u (.) such that u (t) \in U (t) a.e. t \in [0, T] and arcs x (., ω_{j}) such that for each j = 1, \dots, N \overset{x}{˙} (t, ω_{j}) = f (t, x (t, ω_{j}), u (t), ω_{j}) a.e. t \in [0, T] x (0, ω_{j}) = x_{0} and x (T, ω_{j}) \in C (ω_{j}) .

⎩ ⎨ ⎧ minimize j = 1 \sum N α_{j} g (x (T, ω_{j}); ω_{j}) over controls u (.) such that u (t) \in U (t) a.e. t \in [0, T] and arcs x (., ω_{j}) such that for each j = 1, \dots, N \overset{x}{˙} (t, ω_{j}) = f (t, x (t, ω_{j}), u (t), ω_{j}) a.e. t \in [0, T] x (0, ω_{j}) = x_{0} and x (T, ω_{j}) \in C (ω_{j}) .

p (., ω_{j}) := \frac{p ( . , ω _{j} )}{α _{j}} .

p (., ω_{j}) := \frac{p ( . , ω _{j} )}{α _{j}} .

∣ f (t, x, u, ω) - f (t, x^{'}, u, ω) ∣ \leq k_{f} (t) ∣ x - x^{'} ∣ and ∣ f (t, x, u, ω) ∣ \leq c

∣ f (t, x, u, ω) - f (t, x^{'}, u, ω) ∣ \leq k_{f} (t) ∣ x - x^{'} ∣ and ∣ f (t, x, u, ω) ∣ \leq c

∣ g (x, ω_{1}) - g (x, ω_{2}) ∣ \leq θ (ρ_{Ω} (ω_{1}, ω_{2})) \mbox f or a l l ω_{1}, ω_{2} \in Ω,

∣ g (x, ω_{1}) - g (x, ω_{2}) ∣ \leq θ (ρ_{Ω} (ω_{1}, ω_{2})) \mbox f or a l l ω_{1}, ω_{2} \in Ω,

∣ d_{C (ω_{1})} (x) - d_{C (ω_{2})} (x) ∣ \leq θ (ρ_{Ω} (ω_{1}, ω_{2})) \mbox f or a l l ω_{1}, ω_{2} \in Ω .

∣ d_{C (ω_{1})} (x) - d_{C (ω_{2})} (x) ∣ \leq θ (ρ_{Ω} (ω_{1}, ω_{2})) \mbox f or a l l ω_{1}, ω_{2} \in Ω .

\int_{0}^{T} x \in \overset{x}{ˉ} (t, ω) + δ B, u \in U (t) sup ∣ f (t, x, u, ω_{1}) - f (t, x, u, ω_{2}) ∣ d t \leq θ_{f} (ρ_{Ω} (ω_{1}, ω_{2})) .

\int_{0}^{T} x \in \overset{x}{ˉ} (t, ω) + δ B, u \in U (t) sup ∣ f (t, x, u, ω_{1}) - f (t, x, u, ω_{2}) ∣ d t \leq θ_{f} (ρ_{Ω} (ω_{1}, ω_{2})) .

P (ω)

P (ω)

- \overset{q}{˙} (t, ω) \in co \partial_{x} [q (t, ω) \cdot f (t, \overset{x}{ˉ} (t, ω), \overset{u}{ˉ} (t), ω)] a.e. t \in [0, T],

\displaystyle\text{and }\ -q(T,\omega)\in\lambda\partial_{x}g(\bar{x}(T,\omega);\omega)+N^{1}_{C(\omega)}(\bar{x}(T,\omega))\Bigg{\}}.

- \overset{p}{˙} (t, ω) \in co \partial_{x} [p (t, ω) \cdot f (t, \overset{x}{ˉ} (t, ω), \overset{u}{ˉ} (t), ω)] a.e. t \in [0, T],

- \overset{p}{˙} (t, ω) \in co \partial_{x} [p (t, ω) \cdot f (t, \overset{x}{ˉ} (t, ω), \overset{u}{ˉ} (t), ω)] a.e. t \in [0, T],

and - p (T, ω) \in λ \partial_{x} g (\overset{x}{ˉ} (T, ω); ω) + N_{C (ω)}^{1} (\overset{x}{ˉ} (T, ω)) .

- \overset{q}{˙} (t, ω) \in co \partial_{x} [q (t, ω) \cdot f (t, \overset{x}{ˉ} (t, ω), \overset{u}{ˉ} (t), ω)] a.e. t \in [0, T],

- \overset{q}{˙} (t, ω) \in co \partial_{x} [q (t, ω) \cdot f (t, \overset{x}{ˉ} (t, ω), \overset{u}{ˉ} (t), ω)] a.e. t \in [0, T],

- q (T, ω) \in λ \partial_{x} g (\overset{x}{ˉ} (T, ω); ω) + N_{C (ω)}^{1} (\overset{x}{ˉ} (T, ω)) .

- q (T, ω) \in λ \partial_{x} g (\overset{x}{ˉ} (T, ω); ω) + N_{C (ω)}^{1} (\overset{x}{ˉ} (T, ω)) .

(λ, {q (., ω) : ω \in Ω}) \neq = (0, 0)

(λ, {q (., ω) : ω \in Ω}) \neq = (0, 0)

\int_{Ω} d_{C (ω)} (x (T, ω)) d μ (ω) = 0

\int_{Ω} d_{C (ω)} (x (T, ω)) d μ (ω) = 0

\left\{\begin{array}[]{lll}{\bf minimize}\;\max_{\omega\in\Omega:=[-1,1]}-|x(1)-\omega|\\ \dot{x}(t)=u(t)\quad\textrm{a.e. }t\in[0,1]\\ u(t)\in[-1,1]\quad\mbox{a.e.}\;t\in[0,1]\\ \mbox{and }x(0)=0.\end{array}\right.

\left\{\begin{array}[]{lll}{\bf minimize}\;\max_{\omega\in\Omega:=[-1,1]}-|x(1)-\omega|\\ \dot{x}(t)=u(t)\quad\textrm{a.e. }t\in[0,1]\\ u(t)\in[-1,1]\quad\mbox{a.e.}\;t\in[0,1]\\ \mbox{and }x(0)=0.\end{array}\right.

D (ω) := {z \in R^{K} : (ω, z) \in D} and D_{i} (ω) := {z \in R^{K} : (ω, z) \in D_{i}} for all i = 1, 2, \dots .

D (ω) := {z \in R^{K} : (ω, z) \in D} and D_{i} (ω) := {z \in R^{K} : (ω, z) \in D_{i}} for all i = 1, 2, \dots .

d η_{i} (ω) = γ_{i} (ω) d μ_{i} (ω) i = 1, 2, \dots

d η_{i} (ω) = γ_{i} (ω) d μ_{i} (ω) i = 1, 2, \dots

γ_{i} (ω) \in D_{i} (ω) μ_{i} - a.e.

γ_{i} (ω) \in D_{i} (ω) μ_{i} - a.e.

i \to \infty lim sup D_{i} \subset D,

i \to \infty lim sup D_{i} \subset D,

η_{i} ⇀ * η

η_{i} ⇀ * η

d η (ω) = γ (ω) d μ (ω)

d η (ω) = γ (ω) d μ (ω)

γ (ω) \in D (ω) μ - a.e. ω \in Ω .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Necessary Optimality Conditions For Average Cost Minimization Problems

Piernicola Bettiol111 *Laboratoire de Mathématiques, Université de Bretagne Occidentale, 6 Avenue Victor Le Gorgeu, 29200 Brest, France, e-mail: * [email protected] , Nathalie Khalil222 *MODAL’X, Université Paris Ouest Nanterre La Défense, 200 Avenue de la République, 92001 Paris Nanterre, France, e-mail: * [email protected]

Abstract

Control systems involving unknown parameters appear a natural framework for applications in which the model design has to take into account various uncertainties. In these circumstances the performance criterion can be given in terms of an average cost, providing a paradigm which differs from the more traditional minimax or robust optimization criteria. In this paper, we provide necessary optimality conditions for a nonrestrictive class of optimal control problems in which unknown parameters intervene in the dynamics, the cost function and the right end-point constraint. An important feature of our results is that we allow the unknown parameters belonging to a mere complete separable metric space (not necessarily compact).

1 Introduction

In this paper we consider a class of optimal control problems in which uncertainties appear in the data in terms of unknown parameters belonging to a given metric space. Though the state evolution is governed by a deterministic control system and the initial datum is fixed (and well-known), the description of the dynamics depends on uncertain parameters which intervene also in the cost function and the right end-point constraint. Taking into consideration an average cost criterion, a crucial issue is clearly to be able to characterize optimal controls independently of the unknown parameter action: this allows to find a sort of ‘best trade-off’ among all the possible realizations of the control system as the parameter varies. In this context we provide, under non-restrictive assumptions, necessary optimality conditions. More precisely, we consider the following average cost minimization problem:

[TABLE]

Here, $d_{C}(x)$ is the Euclidean distance of a point $x$ from the set $C$ . The data for this problem comprise a time interval $[0,T]$ , a probability measure $\mu$ defined on a metric space $\Omega$ , functions $g:\mathbb{R}^{n}\times\Omega\rightarrow\mathbb{R}$ and $f:[0,T]\times\mathbb{R}^{n}\times\mathbb{R}^{m}\times\Omega\rightarrow\mathbb{R}^{n}$ , a nonempty multifunction $U:[0,T]\leadsto\mathbb{R}^{m}$ , and a family of closed sets $\{C(\omega)\subset\mathbb{R}^{n}\ |\ \omega\in\Omega\}$ . A measurable function $u:[0,T]\rightarrow\mathbb{R}^{m}$ that satisfies

[TABLE]

is called a control function. The set of all control functions is written $\mathcal{U}$ . A process $(u,\{x(.,\omega):\omega\in\Omega\})$ is a control function $u$ coupled with a family of arcs $\{x(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n}):\omega\in\Omega\}$ , satisfying, for each $\omega\in\Omega$ , the dynamic constraint:

[TABLE]

A process is said to be feasible if, in addition, the arcs $x(.,\omega)$ ’s satisfy the averaged right end-point constraint

[TABLE]

If the integral cost term in (LABEL:intprob) does not exist for a feasible process $(u,\{x(.,\omega):\omega\in\Omega\}),$ then we set $J_{\Omega}(u(.),\{x(.,\omega)\})=+\infty$ . To underline the dependence on a given control $u(.)\in\mathcal{U}$ , sometimes we shall employ the notation $x(.,u,\omega)$ for the feasible arc belonging to the family of trajectories $\{x(.,\omega)\ :\ \omega\in\Omega\}$ , associated with the control $u(.)$ and the element $\omega\in\Omega$ .

A feasible process $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ is said to be a $W^{1,1}-$ local minimizer for (LABEL:intprob) if there exists $\epsilon>0$ such that

[TABLE]

for all feasible processes $(u,\{x(.,\omega):\omega\in\Omega\})$ such that

[TABLE]

Control systems involving unknown parameters have been well-studied in literature finding widespread applications particularly from the point of view of the robust (worst-case) control, see for instance the monographs [1], [20] and [6] (and the references therein), and the paper [18] on minimax optimal control. In the introductory section of [20, Chapter IX], control problems with uncertainties are considered comparing the conservative approach (minimax) with an alternative approach in which one might minimize, for instance, an “expected value” (which corresponds to the average cost problem studied in our paper). Then, in [20, Chapters IX and X] Warga investigates the so-called “conflicting/adverse control problems” providing necessary conditions for this broad class of problems which covers minimax problems (under some regularity assumptions), but which does not cover optimal control problems having the average cost criterion studied in our paper. (See [21] for further developments on adverse control problems in the nonsmooth context; cf. the recent papers [13] on adverse control problems and [11] on state-constrained minimax problems.)

A growing interest has recently emerged in considering an ‘averaged’ (or ‘expected’ with respect to a given measure) approach, exploring various issues, directions and applications: see for instance a recent series of papers on aerospace systems [15], [16], [7], and the articles [2] and [22] on averaged controllability (from different viewpoints); see also [17] for results on heterogeneous systems.

Therefore, motivated not only by theoretical reasons but also by a recent growing interest in applications (such as aerospace engineering, see in particular [15] and [16]), in our paper we consider the ‘average cost’ paradigm rather than the more ‘classical’ criteria employed in the minimax/robust or adverse optimization framework.

For the general (nonsmooth) case we derive necessary optimality conditions ensuring the existence of a costate function $p(.,.):[0,T]\times\Omega\to\mathbb{R}^{n}$ which satisfies an averaged (on $\Omega$ ) maximality condition. Moreover, the costate arcs $p(.,\omega)$ ’s satisfy also the somewhat expected adjoint system and transversality condition, when $\omega$ belongs at least to a countable dense subset $\widehat{\Omega}$ of supp $(\mu)$ . We show that these last two necessary conditions extend to the whole supp $(\mu)$ for free right end-point problems, if we impose (suitable) regularity assumptions on the dynamics and the cost function. We also prove that a further (non-trivial) case, in which the conditions of maximum principle extend to the whole supp $(\mu)$ , is when the measure $\mu$ is purely atomic (not necessarily with finite support).

This paper is organized as follows. We first study the simpler case in which the measure $\mu$ has a finite support (Section 2), which constitutes a discretization model for the general case of an arbitrary measure on a complete separable metric space (which is investigated successively). The main results are displayed in Section 3, and their proofs are given in Section 5. Section 4 is devoted to recall some fundamental theorems in measure theory and provide a limit-taking lemma which play a crucial role in our analysis. The approach that we suggest in our paper consists in approximating the measure $\mu$ by measures with finite support (convex combination of Dirac measures). Owing to Ekeland’s variational principle, we construct a suitable family of auxiliary optimal control problems, the solutions of which approximate the reference problem (LABEL:intprob). Invoking the maximum principle (applicable in a more traditional version) for the approximating minimizers, we obtain properties which, taking the limit (in a suitable sense), allow us to derive the desired necessary conditions. The most difficult part in our proof is to show the maximality condition: this requires non-trivial consideration of multifunction representation and selection theorems. This part becomes simpler for the ‘purely atomic’ case and the ‘smooth’ case.

An important source of inspiration for the techniques here employed is represented by Vinter’s paper [18] (which is devoted to minimax optimal control but, in fact, contains flexible and effective analytical tools that can be extended or adapted to our case). As one may expect, the necessary conditions that we obtain differ from those ones in the minimax context (in particular for the general nonsmooth case and the purely atomic case), for the nature of the minimization criterion is different. For instance, for the general (nonsmooth) case the most evident difference with respect to the costate arcs characterization given in [18] is that (avoiding a formulation which might involve somewhat complicate sets) we show that the ‘expected’ adjoint system and transversality conditions are satisfied by a family of costate arcs $p(.,\omega)$ ’s, at least when the parameter $\omega$ belongs to a countable dense set $\widehat{\Omega}\subset\text{supp}(\mu)$ . We highlight that an important feature of our paper is the unrestrictive nature of our assumptions: indeed, we allow not only nonsmooth data (on the dynamics, the cost function and the averaged right end-point constraint), but we also provide results for unknown parameters belonging to a mere complete separable metric space $\Omega$ . This aspect is particularly relevant for applications (cf. [15]) where $\Omega$ (and the support of the reference measure $\mu$ ) need not to be compact. Our techniques could be used to generalize the conditions in [18] and might provide some insights into dealing with adverse/conflicting control problems with non-compact parameter sets (in [20] and [21] parameter sets are assumed to be compact.)

Notation Let $(\Omega,\rho_{\Omega})$ be a metric space. Denote by $\mathcal{B}_{\Omega}$ the $\sigma$ -algebra of Borel sets in $\Omega$ . A probability measure $\mu$ on the measurable space $(\Omega,\mathcal{B}_{\Omega})$ takes non-negative values, verifies the $\sigma$ -additivity property and is such that $\mu(\Omega)=1$ . The family of all probability measures on $(\Omega,\mathcal{B}_{\Omega})$ is denoted by $\mathcal{M}(\Omega)$ . Recall that a sequence $\{\mu_{i}\}$ of measures in $\mathcal{M}(\Omega)$ is said to converge weakly∗ to a measure $\mu\in\mathcal{M}(\Omega)$ (in symbol $\mu_{i}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ ), if $\int_{\Omega}hd\mu_{i}\rightarrow\int_{\Omega}hd\mu$ for every bounded continuous function $h$ on $\Omega$ . The support of a measure $\mu$ defined on $\Omega$ is written supp( $\mu$ ). $\mathcal{L}$ denotes the Lebesgue subsets of $[0,T]$ , while $\mathcal{B}^{m}$ are the Borel subsets of $\mathbb{R}^{m}$ . $\mathcal{L}\times\mathcal{B}^{m}$ (respectively $\mathcal{L}\times\mathcal{B}^{m}\times\mathcal{B}_{\Omega}$ ) is the product $\sigma-$ algebra of $\mathcal{L}$ and $\mathcal{B}^{m}$ (respectively $\mathcal{L}$ , $\mathcal{B}^{m}$ and $\mathcal{B}_{\Omega}$ ). The Euclidean norm is written $|.|$ . We shall employ the following norm on $W^{1,1}([0,T];\mathbb{R}^{n})$ : $\|x(.)\|_{W^{1,1}}~{}:=~{}|x(0)|+\|\dot{x}(.)\|_{L^{1}(0,T)}$ . We write $\partial\varphi(x)$ the limiting subdifferential of the (possibly extended valued) function $\varphi:\mathbb{R}^{n}\rightarrow\mathbb{R}\cup\{+\infty\}$ at $x\in\text{dom}\varphi.$ If $\varphi=\varphi(x,y)$ , then $\partial_{x}\varphi(x,y)$ is the partial limiting subdifferential with respect to the variable $x$ . $\mathbb{B}$ is the closed unit ball in Euclidean space. $N_{C}(x)$ is the limiting normal cone of a closed set $C$ at a point $x\in C$ , and $N^{1}_{C}(x):=N_{C}(x)\cap\mathbb{B}$ . (We refer the reader to [4], [9], [10], and [19] and the references therein for these nonsmooth analytical tools.)

2 Average on measures with finite support

We start considering the particular and simple case of optimal control problems of the form (LABEL:intprob), where the probability measure $\mu$ of the integral functional has a finite support: it is a convex combination of unit Dirac measures. This constitutes also a preliminary step to derive necessary conditions for the general case.

The following assumptions will be needed throughout this section. For a given $W^{1,1}-$ local minimizer $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ and for some $\delta>0$ , we shall suppose:

(H1)
(i)

The function $f(.,x,.,\omega)$ is $\mathcal{L}\times\mathcal{B}^{m}$ measurable for each $(x,\omega)\in\mathbb{R}^{n}\times\Omega.$ 2. (ii)

The multifunction $t\leadsto U(t)$ has nonempty values, and $\textrm{Gr }U(.)$ is a $\mathcal{L}\times\mathcal{B}^{m}$ measurable set. 2. (H2)

There exists a $\mathcal{L}\times\mathcal{B}^{m}$ measurable function $k_{f}:[0,T]\times\mathbb{R}^{m}\rightarrow\mathbb{R}$ such that $t\rightarrow k_{f}(t,\bar{u}(t))$ is integrable, and for each $\omega\in\Omega$ ,

[TABLE]

for all $x,x^{\prime}\in\bar{x}(t,\omega)+\delta\mathbb{B}$ , $u\in U(t)$ , a.e. $t\in[0,T]$ . 3. (H3)

The function $g(.,\omega)$ is Lipschitz continuous on $\bar{x}(T,\omega)+\delta\mathbb{B}$ for all $\omega\in\text{supp}(\mu)$ .

Proposition 2.1.

Let $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ be a $W^{1,1}-$ local minimizer for (LABEL:intprob). Assume that $\mu$ is a given probability measure with finite support and that for some $\delta>0$ , hypotheses (H1)-(H3) are satisfied. Then, there exist a family of arcs $\{p(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n}):\omega\in\Omega\}$ and a number $\lambda\geq 0$ such that

(a)

$(\lambda,p(.,\omega))\neq(0,0)$ * for all $\omega\in\Omega$ ;* 2. (b)

$\begin{aligned} \int_{\Omega}p(t,\omega)\cdot&f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)\ d\mu(\omega)=\max_{u\in U(t)}\int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),u,\omega)\ d\mu(\omega)\qquad\text{a.e. }t\in[0,T]\ ;\end{aligned}$ ** 3. (c)

$-\dot{p}(t,\omega)\in\textrm{co }\partial_{x}[p(t,\omega)\cdot f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)]$ * for $\mu-\textrm{a.e. }\omega\in\Omega$ ;* 4. (d)

$-p(T,\omega)\in\lambda\partial_{x}g(\bar{x}(T,\omega);\omega)+N_{C(\omega)}(\bar{x}(T,\omega))$ * for $\mu-\textrm{a.e. }\omega\in\Omega$ .*

Proof.

The measure $\mu$ can be written as a convex combination of Dirac measures at points $\omega_{j}\in\Omega$ , for $j=1,\ldots,N$ , where $N$ is a suitable integer, as follows:

[TABLE]

As a consequence the integral functional to minimize (LABEL:intprob) reduces to the following finite sum:

[TABLE]

and, the minimization problem (LABEL:intprob) turns out to be easily treated, for it can be equivalently written as a standard optimal control problem:

[TABLE]

Observe that in writing ( $P_{N}$ ), we can restrict attention only to elements $\omega$ belonging to the $\text{supp}(\mu)=\{\omega_{1},\ldots,\omega_{N}\}$ . Under the stated assumptions (H1)-(H3) and using the sum rule (cf. [19, Theorem 5.4.1]), the necessary conditions for ( $P_{N}$ ) can be derived from the nonsmooth maximum principle [19, Theorem 6.2.1] which guarantees the existence of a multiplier $\lambda\geq 0$ and arcs $\widetilde{p}(.,\omega_{j})\in W^{1,1}([0,T],\mathbb{R}^{n})$ , for $j=1,\ldots,N$ such that

(i)

$(\lambda,\widetilde{p}(.,\omega_{1}),\ldots,\widetilde{p}(.,\omega_{N}))\neq(0,\ldots,0)$ ; 2. (ii)

$-\dot{\widetilde{p}}(t,\omega_{j})\in\textrm{co }\partial_{x}[\widetilde{p}(t,\omega_{j})\cdot f(t,\bar{x}(t,\omega_{j}),\bar{u}(t),\omega_{j})]$ for all $j=1,\ldots,N$ ; 3. (iii)

$-\widetilde{p}(T,\omega_{j})\in\lambda\alpha_{j}\partial_{x}g(\bar{x}(T,\omega_{j});\omega_{j})+N_{C(\omega_{j})}(x(T,\omega_{j}))$ for all $j=1,\ldots,N$ ; 4. (iv)

$\sum\limits_{j=1}^{N}\widetilde{p}(t,\omega_{j})\cdot f(t,\bar{x}(t,\omega_{j}),\bar{u}(t),\omega_{j})=\max\limits_{u\in U(t)}\sum\limits_{j=1}^{N}\widetilde{p}(t,\omega_{j})\cdot f(t,\bar{x}(t,\omega_{j}),u,\omega_{j})\quad$ a.e. $t\in[0,T]$ .

For each $j$ , we set

[TABLE]

We deduce, therefore, conditions (a)-(d) of the proposition statement. This concludes the proof.

∎

3 Main results

We take now a probability space $(\Omega,\mathcal{B}_{\Omega},\mu)$ where $\mu$ is a (general) probability measure. For a given $W^{1,1}-$ local minimizer $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ and for some $\delta>0$ , we shall suppose:

(A1)

$(\Omega,\rho_{\Omega})$ is a complete separable metric space. 2. (A2)

(i)

The function $f(.,x,.,.)$ is $\mathcal{L}\times\mathcal{B}^{m}\times\mathcal{B}_{\Omega}$ measurable for each $x\in\mathbb{R}^{n}$ . 2. (ii)

The multifunction $t\leadsto U(t)$ has nonempty values and $\textrm{Gr }U(.)$ is a $\mathcal{L}\times\mathcal{B}^{m}$ measurable set. 3. (iii)

The set $f(t,x,U(t),\omega)$ is closed for all $x\in\bar{x}(t,\omega)+\delta\mathbb{B}$ , and $(t,\omega)\in[0,T]\times\Omega.$ 3. (A3)

There exist a constant $c>0$ and an integrable function $k_{f}:[0,T]\rightarrow\mathbb{R}$ such that

[TABLE]

for all $x,x^{\prime}\in\bar{x}(t,\omega)+\delta\mathbb{B}$ , $u\in U(t)$ , $\omega\in\Omega$ a.e. $t\in[0,T]$ . 4. (A4)

(i)

The function $g$ is $\mathcal{B}^{n}\times\mathcal{B}_{\Omega}$ measurable. 2. (ii)

There exist positive constants $k_{g}\geq 1$ and $M\geq\delta$ such that for all $\omega\in\Omega$ we have $|g(x,\omega)|\leq M$ and $d_{C(\omega)}(x)\leq M$ for all $x\in\bar{x}(T,\omega)+\delta\mathbb{B}$ , $|g(x,\omega)-g(x^{\prime},\omega)|\leq k_{g}|x-x^{\prime}|$ for all $x,x^{\prime}\in\bar{x}(T,\omega)+\delta\mathbb{B}\ .$ 3. (iii)

There exists a modulus of continuity $\theta(.)$ such that for all $\omega\in\Omega$ and $x\in\bar{x}(T,\omega)+\delta\mathbb{B}$ we have

[TABLE]

and

[TABLE] 5. (A5)

There exists a modulus of continuity $\theta_{f}(.)$ such that for all $\omega,\omega_{1},\omega_{2}\in\Omega$ ,

[TABLE]

(We say that $\theta:[0,\infty)\rightarrow[0,\infty)$ is a modulus of continuity if $\theta(s)$ is increasing and $\lim\limits_{s\downarrow 0}\theta(s)=0.$ )

The first result provides necessary optimality conditions for the general nonsmooth case.

Theorem 3.1.

Let $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ be a $W^{1,1}-$ local minimizer for (LABEL:intprob) in which $\mu\in\mathcal{M}(\Omega)$ is given. Assume that, for some $\delta>0$ , hypotheses (A1)-(A5) are satisfied. Then, there exist $\lambda\geq 0$ , a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable function $p(.,.):[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ and a countable dense subset $\widehat{\Omega}$ of supp $(\mu)$ such that

(i)

$p(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n})$ * for all $\omega\in\widehat{\Omega}\ ;$ * 2. (ii)

$\begin{aligned} \int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)\ d\mu(\omega)=\max_{u\in U(t)}\int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),u,\omega)\ d\mu(\omega)\quad\text{a.e. }t\in[0,T]\ ;\end{aligned}$ ** 3. (iii)

$p(.,\omega)\in\text{co }\mathcal{P}(\omega)$ * for all $\omega\in\widehat{\Omega}$ where *

[TABLE]

Moreover, we consider two special cases in which condition (iii) becomes much simpler and the desired properties involving the costate arcs extend to the whole $\mbox{supp}(\mu)$ : when the measure $\mu$ is purely atomic, and the smooth right end-point free case.

Theorem 3.2 (Purely atomic case).

Let $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ be a $W^{1,1}-$ local minimizer for (LABEL:intprob) in which $\mu\in\mathcal{M}(\Omega)$ is a purely atomic measure such that each atom is a singleton. Assume that, for some $\delta>0$ , hypotheses (A1)-(A5) are satisfied. Then, there exist $\lambda\geq 0$ , a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable function $p(.,.):[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ and a (at most) countable set $\widehat{\Omega}=$ supp $(\mu)$ such that

(i)

$p(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n})$ * for all $\omega\in\widehat{\Omega}\ ;$ * 2. (ii)

$\begin{aligned} \int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)\ d\mu(\omega)=\max_{u\in U(t)}\int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),u,\omega)\ d\mu(\omega)\quad\text{a.e. }t\in[0,T]\ ;\end{aligned}$ ** 3. (iii)

$(\lambda,\{p(.,\omega):\omega\in\widehat{\Omega}\})\neq(0,0)$ , and for all $\omega\in\widehat{\Omega}=$ supp $(\mu)$

[TABLE]

Theorem 3.3 (Smooth case).

Let $(\bar{u},\{\bar{x}(.,\omega):\omega\in\Omega\})$ be a $W^{1,1}-$ local minimizer for (LABEL:intprob) where $\mu\in\mathcal{M}(\Omega)$ is given. Suppose that, for some $\delta>0$ , hypotheses (A1)-(A3), (A4)(i) and (A5) are satisfied. In addition, assume that

(C1)

$g(.,\omega)$ * is differentiable on $\bar{x}(T,\omega)+\delta\mathbb{B}$ , for each $\omega\in\Omega$ , and $\nabla_{x}g(.,.)$ is continuous;* 2. (C2)

$f(t,.,u,\omega)$ * is continuously differentiable on $\bar{x}(t,\omega)+\delta\mathbb{B}$ for all $u\in U(t)$ and $\omega\in\Omega$ a.e. $t\in[0,T]$ , and $\omega\rightarrow\nabla_{x}f(t,x,u,\omega)$ is uniformly continuous with respect to $(t,x,u)\in\{(t^{\prime},x^{\prime},u^{\prime})\in[0,T]\times\mathbb{R}^{n}\times\mathbb{R}^{m}\ |\ u^{\prime}\in U(t^{\prime})\}.$ * 3. (C3)

$C(\omega):=\mathbb{R}^{n}$ .

Then, there exists a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable function $p(.,.):[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ such that

(i)′

$p(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n})$ * for all $\omega\in$ supp $(\mu)$ ; * 2. (ii)′

$\begin{aligned} \int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)\ d\mu(\omega)=\max_{u\in U(t)}\int_{\Omega}p(t,\omega)\cdot f(t,\bar{x}(t,\omega),u,\omega)\ \textrm{d}\mu(\omega)\quad\text{a.e. }t\in[0,T]\ ;\end{aligned}$ ** 3. (iii)′

$-\dot{p}(t,\omega)=[\nabla_{x}f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)]^{T}p(t,\omega)$ * a.e. $t\in[0,T]$ , for all $\omega\in$ supp* $(\mu)$ ; 4. (iv)′

$-p(T,\omega)=\nabla_{x}g(\bar{x}(T,\omega);\omega),$ * for all $\omega\in$ supp $(\mu)$ .*

Comments

Condition (iii) of Theorem 3.1 is interpreted in the following sense: for each $\omega\in\widehat{\Omega}$ , one considers functions $q(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n})$ (such that $\|q(.,.)\|_{L^{\infty}}$ is uniformly bounded by a constant) satisfying the adjoint system

[TABLE]

and the transversality condition

[TABLE]

Then, from this set of functions, one takes into account only the $q(.,.)$ ’s such that

[TABLE]

to generate the family of arcs sets of $\{\mathcal{P}(\omega)\}_{\omega\in\widehat{\Omega}}$ .

In optimal control theory, necessary optimality conditions results are usually provided avoiding the ‘trivial’ case, which is given by the couple $(\lambda,p(.,.))=(0,0)$ , where $\lambda$ is the multiplier associated with the cost. However, in literature dealing with optimal control problems with unknown parameters in the non-smooth context, results are often written including possible trivial cases which are not considered so relevant for the general properties expressed in the results statement; cf. [18] on nonsmooth minimax problems and [21] on nonsmooth adverse problems, in which the operator ‘ ${\rm co}$ ’ (convexifying over sets of costate arcs) is considered possibly bringing trivial cases. (The fact that in [18] and [21] the multiplier associated with the cost $\lambda$ does not appear in the necessary conditions should not be so surprising: this multiplier is somewhat hidden in the analysis and, in these contexts, the situation ‘ $p\equiv 0$ ’ alone might be considered as ‘trivial’). In our case, we might have a trivial couple $(\lambda=0,p(.,.)=0)$ which satisfies the conditions of Theorem 3.1, indeed, employing the convexification operator ‘ ${\rm co}$ ’ on the set of costate arcs, it may happen that, taking $\lambda=0$ , even if $p(.,\hat{\omega})\neq 0$ , with $\hat{\omega}\in\widehat{\Omega}$ , also $-p(.,\hat{\omega})$ is an admissible costate arc; convexifying, $p\equiv 0\in\text{co }\mathcal{P}(\hat{\omega})$ . We decided to be consistent with part of previous (nonsmooth) literature on problems with unknown parameters and provide a general nonsmooth result (Theorem 3.1), which allows (in some particular circumstances) a trivial case, but at the same time covers a number of non-restrictive non-trivial cases. For instance, (iii) of Theorem 3.1 immediately implies a non-triviality condition for the pair $(\lambda,p(.,.))$ when

(a)

the right end-point constraints are absent ( $C(\omega)\equiv\mathbb{R}^{n}$ ); 2. (b)

the given measure $\mu$ has a nonatomic component, the averaged right end-point constraints

[TABLE]

are imposed but the normal cone to the end-point constraint ${\rm co}N_{C(\omega)}(\bar{x}(T,\omega))$ is pointed for all $\omega\in\Omega$ (or even for $\omega$ belonging to a countable dense subset of the support of the nonatomic component of $\mu$ ). We recall that a convex cone $K\subset\mathbb{R}^{n}$ is said to be ‘pointed’ if for any nonzero elements $d_{1},\ d_{2}\in K,$ $d_{1}+d_{2}\neq 0$ .

Concerning (b), the abnormal situation (i.e. $\lambda=0$ ) is admissible, but the fact that ${\rm co}N_{C(\omega)}(\bar{x}(T,\omega))$ is pointed ensures that $p\equiv 0\notin\text{co }\mathcal{P}(\hat{\omega})$ for all $\hat{\omega}\in\widehat{\Omega}$ .

The ‘degeneracy issue’ (i.e. the necessary conditions are satisfied by any control) is a longstanding issue which has been widely investigated in optimal control. It is well-known that this issue may arise, for instance, in presence of state constraints for ‘standard’ (in the sense that parameters are absent) optimal control problems (cf. [19, Chapter X] and the references therein). Rather less is known for the case when unknown parameters intervene in the dynamics and the cost: minimax, adverse, and average optimal control problems. (See [11] for a non-degeneracy result on state constrained minimax problems avoiding the degeneracy caused by the state constraint; see also [18] for a link between minimax and state-constrained problems). In our context degeneracy might occur for the general nonsmooth case (Theorem 3.1) when the given measure $\mu$ has a nonatomic component. Indeed, our construction of the costate arcs $p(t,\omega)$ for $\omega\in\Omega$ is based on a limit-taking procedure starting from the information provided by (non-trivial) costate arcs $p(t,\hat{\omega})$ for $\hat{\omega}\in\widehat{\Omega}$ (cf. (5.21) below). If $\mu$ has a nonatomic component, we have no reason to expect (under the general assumptions considered in Theorem 3.1) that the non-degenerate property of the costate arcs $p(t,\hat{\omega})$ ( $\hat{\omega}\in\widehat{\Omega}$ ) always propagates on $\Omega$ as desired: there might be some degenerate situations in which for a full-measure subset of $\Omega$ the limit we take in the proof of Theorem 3.1 does not exist, and $p(.,.)$ extends with the value zero on $\Omega\setminus\widehat{\Omega}$ , obtaining a degeneracy issue. However, under some circumstances, the information provided on the set $\widehat{\Omega}$ does propagate: if there is no right end-point constraint and, in addition, we impose regularity assumptions on the dynamics and the terminal cost function, properties (i) and (iii) of Theorem 3.1 extend to the whole parameter set $\Omega$ , as stated in Theorem 3.3. Theorems 3.2 and 3.3 do provide non-degenerate results.

Nonsmooth results on optimal control problems with unknown parameters, such as adverse and minimax problems (see [21] and [18]), are concerned with a ‘degenerate issue’ which is not far from the one of our nonsmooth result Theorem 3.1, maybe, in a more ‘dramatic’ way, for the measure -appearing there as a multiplier in the necessary conditions- is not uniquely determined, and may have a support with degenerate effects on the necessary conditions. Consider for instance the simple example [18, Example 4.1] in the context of minimax problem:

[TABLE]

A minimax minimizer is: $(\bar{x}\equiv 0,\bar{u}\equiv 0)$ . In [18] there is a detailed discussion comparing [18, Proposition 2.1] (finite parameter sets case) and [18, Theorem 3.1] (general nonsmooth case), and the necessity of convexifying the set of costate arcs in the general nonsmooth case, for, otherwise, the necessary conditions would not be valid. In particular, in [18] the (Dirac) measure $\delta_{\omega=0}$ concentrated at $\omega=0$ (point at which the reference minimizer attains its maximum) is considered, for which “an arbitrary collection of $W^{1,1}$ functions such that $p(.;\omega=0)\equiv 0$ ” satisfies the necessary conditions of [18, Theorem 3.1]. The counterpart of this choice is that it is degenerate: any control satisfies the necessary conditions of [18, Theorem 3.1].

One might go a little bit further in this direction, observing that degeneracy is -in fact- much more dramatic for this particular example: indeed, for any probability measure $\mu$ on the parameter set $\Omega=[-1,1]$ the maximality conditions of [18, Theorem 3.1] are necessarily degenerate for the reference minimax minimizer $(\bar{x}\equiv 0,\bar{u}\equiv 0)$ (and the trivial case $p\equiv 0$ is also admitted). On the other hand, if one is interested in the different performance criterion given by the average cost $\int_{\omega\in[-1,1]}-|x(1)-\omega|d\omega/2$ with the same dynamics, these dramatic issues of triviality and degeneracy disappear. (To see this, we can take, for instance, the average minimizer $(\bar{x}(t)=t,\bar{u}\equiv 1)$ associated with the costate $p\equiv 1$ .)

At first glance our results might look similar to some statements on necessary conditions appearing in [20] and [21]. Not only these results do not cover the class of average cost minimization problems (in the sense of our paper), but we also highlight a crucial aspect concerning the completely different role of the measures entering in the picture of the necessary conditions: in Warga’s framework the existence of a positive Radon measure (on the set of ‘adverse’ relaxed controls) is a necessary condition, and this measure plays the role of a ‘multiplier’. In our context (of average control problems) the probability measure $\mu$ is a given datum, and we underline the fact that our objective is to give necessary conditions w.r.t. the given measure $\mu$ .

We finally observe that the construction of the countable set $\widehat{\Omega}$ proposed in this paper could be useful for applications: it provides a constructive way to approximate the reference measure $\mu$ by means of a sequence of convex combinations of Dirac measures concentrated at points of $\widehat{\Omega}$ . Therefore the set $\widehat{\Omega}$ can be considered as a reference set of parameters $\omega$ ’s for which one starts computing the costate arcs $p(.,\omega)$ and, eventually, derives conditions for optimal controls.

4 Preliminary results in measure theory

This section is devoted to display results which will be relevant for the proofs of Theorems 3.1, 3.2 and 3.3. We shall make repeatedly use of the following theorem (also referred to as Portmanteau Theorem, cf. [3, Theorem 4.5.1] or [14, Theorem 6.1. pp. 40]) which provides conditions characterizing the weak∗ convergence of probability measures on a metric space $(\Omega,\rho_{\Omega})$ .

Theorem 4.1.

Let $(\Omega,\rho_{\Omega})$ be a metric space. Take a sequence of measures $\{\mu_{i}\}$ in $\mathcal{M}(\Omega)$ and a measure $\mu\in\mathcal{M}(\Omega)$ . The following conditions are equivalent:

(a)

$\int_{\Omega}h\textrm{d}\mu_{i}\rightarrow\int_{\Omega}h\textrm{d}\mu$ * for any bounded continuous function $h$ on $\Omega$ (i.e. $\mu_{i}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ ) ;* 2. (b)

$\int_{\Omega}h\textrm{d}\mu_{i}\rightarrow\int_{\Omega}h\textrm{d}\mu$ * for any bounded uniformly continuous function $h$ on $\Omega$ ;* 3. (c)

$\lim\mu_{i}(B)=\mu(B)$ * for every Borel set $B$ whose boundary has * $\mu-$ *measure zero. (Such sets are also referred to as * $\mu-$ continuity sets) ; 4. (d)

$\limsup\mu_{i}(C)\leq\mu(C)$ * for every closed set $C$ in $\Omega$ ;* 5. (e)

$\liminf\mu_{i}(E)\geq\mu(E)$ * for every open set $E$ in $\Omega$ .*

We recall that $\mu\in\mathcal{M}(\Omega)$ is said to be tight if for each $\varepsilon>0$ , there exists a compact set $K_{\varepsilon}\subset\Omega$ such that $\mu(\Omega\setminus K_{\varepsilon})<\varepsilon$ . A very well-known result asserts that when $(\Omega,\rho_{\Omega})$ is a complete separable metric space, then every $\mu\in\mathcal{M}(\Omega)$ is tight (cf. [14, Theorem 3.2. pp. 29]). We shall invoke also a generalized version of the Prokhorov’s Theorem [5, Theorem 8.6.2] which provides a useful characterization of the relatively compact subsets of Borel measures on $\Omega$ , when $\Omega$ is a complete separable metric space. This result will be crucial to derive measure convergence properties (see Lemma 4.3 below).

Theorem 4.2 (Generalized Prokhorov Theorem).

Let $(\Omega,\rho_{\Omega})$ be a complete separable metric space and consider a family $\Upsilon$ of Borel measures on $\Omega$ . Then, $\Upsilon$ is relatively compact if and only if $\Upsilon$ is uniformly tight and uniformly bounded in the variation norm; in particular a sequence of measures $\{\mu_{i}\}$ admits a weakly∗ convergent subsequence if and only if the sequence $\{\mu_{i}\}$ is uniformly tight and uniformly bounded in the variation norm.

We consider now subsets $D$ and $D_{i}$ , for $i=1,2,\ldots$ , of $\Omega\times\mathbb{R}^{K}$ . We denote respectively by $D(.),\ D_{i}(.):\Omega\leadsto\mathbb{R}^{K}$ the multifunctions defined as

[TABLE]

Let $\{\mu_{i}\}$ be a weak∗ convergent sequence of measures in $\mathcal{M}(\Omega)$ . Our aim is to justify the limit-taking of sequences like

[TABLE]

in which $\{\gamma_{i}(\omega)\}$ is a sequence of Borel measurable functions satisfying

[TABLE]

The required convergence result is provided by Lemma 4.3 below, which represents an extension of [19, Proposition 9.2.1] and [18, Proposition 6.1] to the case in which $\Omega$ is an arbitrary complete separable metric space (not necessarily compact).

Lemma 4.3.

Let $\Omega$ be a complete separable metric space. Consider a sequence of measures $\{\mu_{i}\}$ in $\mathcal{M}(\Omega)$ such that $\mu_{i}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ for some $\mu\in\mathcal{M}(\Omega)$ , a sequence of sets $\{D_{i}\subset\Omega\times\mathbb{R}^{K}\}$ such that

[TABLE]

for some closed set $D\subset\Omega\times\mathbb{R}^{K}$ , and a sequence $\{\gamma_{i}:\Omega\rightarrow\mathbb{R}^{K}\}$ of Borel functions. Suppose that

(i)

$D(\omega)$ * is convex for each $\omega\in\text{dom }D(.)$ ;* 2. (ii)

the multifunctions $\omega\leadsto D(\omega)$ and $\omega\leadsto D_{i}(\omega)$ , for all $i$ , are uniformly bounded; 3. (iii)

for each $i=1,2,\ldots$ , $\gamma_{i}(\omega)\in D_{i}(\omega)\quad\mu_{i}-\text{a.e.}\text{ and }\text{supp}(\mu_{i})\subset\text{dom }D_{i}(.).$

Define, for each $i$ , the vector of signed measures $\eta_{i}:=\gamma_{i}\mu_{i}$ . Then, along a subsequence, we have

[TABLE]

where $\eta$ is a vector-valued Borel measure on $\Omega$ such that

[TABLE]

for some Borel measurable function $\gamma:\Omega\rightarrow\mathbb{R}^{K}$ satisfying

[TABLE]

(The upper limit in (4.1) above must be understood in the Kuratowski sense, cf. [4] or [19].)

Proof.

Since $\Omega$ is a complete separable metric space, the sequence $\{\mu_{i}\}$ turns out to be uniformly tight as result of Theorem 4.2. We also know that $\gamma_{i}(\omega)\in D_{i}(\omega)\ \mu_{i}-$ a.e. and $D_{i}(\omega)$ is uniformly bounded for all $i$ . It follows that there exists a constant $M>0$ such that

[TABLE]

For each $i$ , the vector-valued measure $\eta_{i}=\gamma_{i}\mu_{i}$ can be expressed as $\eta_{i}=(\eta_{i,1},\ldots,\eta_{i,K})$ . From the tightness of $\{\mu_{i}\}$ and (4.2), it immediately follows that, for all $k\in\{1,\ldots,K\}$ , $\{\eta_{i,k}\}$ is a family of uniformly tight, possibly signed measures. Therefore according to Theorem 4.2, for each $k\in\{1,\ldots,K\}$ one can extract a subsequence $\{\eta_{i,k}\}$ (we do not relabel) which converges weakly∗ to some $\eta_{k}$ . We show that $\eta:=(\eta_{1},\ldots,\eta_{K})$ is absolutely continuous with respect to $\mu$ . Let $\eta_{i,k}=\eta_{i,k}^{+}-\eta_{i,k}^{-}$ and $\eta_{k}=\eta_{k}^{+}-\eta_{k}^{-}$ be the Jordan decompositions of $\eta_{i,k}$ and $\eta_{k}$ , where $\eta_{k}^{+}$ and $\eta_{k}^{-}$ are respectively the weak∗ limits of $\eta_{i,k}^{+}$ and $\eta_{i,k}^{-}$ . Let $B_{\eta,\mu}$ be the common family of continuity sets (in the sense of (c) of Theorem 4.1) for the measures $\eta_{1}^{+},\ldots,\eta_{K}^{+}$ , $\eta_{1}^{-},\ldots,\eta_{K}^{-}$ and $\mu$ . Take any Borel set $B$ in $B_{\eta,\mu}$ , we have

[TABLE]

But since $B_{\eta,\mu}$ generates all the Borel sets of $\Omega$ (cf. [12, Chapter 7, Appendix]), it follows that $\eta$ is absolutely continuous with respect to $\mu$ . Therefore, by the Radon-Nikodym Theorem, there exists a $\mathbb{R}^{K}$ -valued, Borel measurable and $\mu$ -integrable function $\gamma$ on $\Omega$ such that for any Borel subset $B$ of $\Omega$ we have

[TABLE]

equivalently,

[TABLE]

It remains to show that $\gamma(\omega)\in D(\omega)\;\mu-$ a.e. $\omega\in\Omega.$ For all $j\in\mathbb{N}$ fixed, following the approach suggested in [19, Proposition 9.2.1], we define $D^{j}(\omega):=D(\omega)+\frac{1}{j}\mathbb{B}\subset\mathbb{R}^{K}$ . We fix $q\in\mathbb{R}^{K}$ . Since $D(\omega)$ is uniformly bounded and $D$ is closed, the multifunction $D^{j}(.)$ is upper semicontinuous. Then, for $\bar{R}>0$ large enough, the marginal function defined by

[TABLE]

turns out to be upper semicontinuous and bounded on $\Omega$ , owing to the Maximum Theorem (cf. [4, Theorem 1.4.16]). From standard results on semicontinuous maps (cf. [3, A6.6]), there exists a sequence of bounded continuous functions $\{\psi_{q}^{\ell}:\Omega\rightarrow\mathbb{R}\ ,\ \ell=1,2,\ldots\}$ such that:

[TABLE]

Recalling that the sets $D(\omega)$ and $D_{i}(\omega)$ for $i=1,2,\ldots,$ are uniformly bounded, and owing to (4.1), we have that, for all $j\in\mathbb{N}$ , there exists $i_{j}$ such that for all $i\geq i_{j}$ , $D_{i}(\omega)\subset D^{j}(\omega)\ .$ Then for $q\in\mathbb{R}^{K}$ and for any Borel subset $B$ of $\Omega$ , for all $i\geq i_{j}$ , we have

[TABLE]

The last inequality is a consequence of (4.3). Before passing to the limit, we observe that

[TABLE]

Indeed, take any open set $E\subset\Omega\setminus\text{dom }D^{j}(.)$ . Since $\text{supp}(\eta_{i})\subset\text{dom }D^{j}(.)$ for $i$ sufficiently large, and for all $j$ , from (e) of Theorem 4.1, we have

[TABLE]

We deduce that $\eta_{k}^{+}(E)=0$ for all $k=1,\ldots,K$ . Following the same reasoning, one can conclude that $\eta_{k}^{-}(E)=0$ for all $k\in 1,\ldots,K$ . Hence, $\eta(E)=0$ for all open subsets $E\subset\Omega\setminus\text{dom }D^{j}(.)$ and $\text{supp}(\eta)\subset\text{dom }D^{j}(.)$ . The inclusion (4.5) is therefore proved. By passing to the limit in (4) as $i\rightarrow\infty$ , since $\psi_{q}^{\ell}(.)$ is bounded continuous on $\Omega$ , we obtain for any Borel set $B\in B_{\eta,\mu}$

[TABLE]

As $\int_{B}d\eta(\omega)=\int_{B}\gamma(\omega)\ d\mu(\omega)$ , for any $B\in B_{\eta,\mu}$ , we have

[TABLE]

Recalling that $B_{\eta,\mu}$ generates the Borel $\sigma-$ algebra $\mathcal{B}_{\Omega}$ , we deduce that (4.6) is actually valid for all Borel subsets of $\Omega$ . As a consequence, $q\cdot\gamma(\omega)\leq\psi^{\ell}_{q}(\omega)\quad\mu-\textrm{a.e. },$ and letting $\ell\rightarrow\infty$ , we obtain

[TABLE]

Inequality (4.7) holds for all $q\in\mathbb{R}^{K}$ with $|q|=1$ . (Indeed, from the continuity of the map $q\mapsto\max\{q\cdot d:d\in D^{j}(\omega)\}$ , it is enough to establish inequality (4.7) for $q\in\mathbb{Q}^{K}$ , and subsequently use the density of $\mathbb{Q}^{K}$ in $\mathbb{R}^{K}$ .)

Since $D^{j}(\omega)$ is a closed and convex set, for each $\omega\in\text{dom }D(.)$ , invoking the Hahn-Banach separation theorem, we obtain that

[TABLE]

Taking the limit as $j\rightarrow\infty$ , we deduce that $\gamma(\omega)\in\bigcap\limits_{j\in\mathbb{N}}D^{j}(\omega)=D(\omega)\quad\mu-$ a.e. $\omega\in\Omega$ which concludes the proof.

∎

5 Proofs of Theorem 3.1, Theorem 3.2 and Theorem 3.3

We first employ a standard hypotheses reduction argument establishing that we can, without loss of generality, replace assumptions (A3)-(A5) by the stronger conditions in which $\delta=+\infty$ (i.e. the conditions are satisfied globally).

(A3)′

There exist a constant $c>0$ and an integrable function $k_{f}:[0,T]\rightarrow\mathbb{R}$ such that

[TABLE]

for all $x,x^{\prime}\in\mathbb{R}^{n},\ u\in U(t),\ \omega\in\Omega,\ \text{a.e. }t\in[0,T]\ .$ 2. (A4)′

(i)

There exist positive constants $k_{g}\geq 1$ and $M$ such that for all $\omega\in\Omega$

$|g(x,\omega)|\leq M$ and $d_{C(\omega)}(x)\leq M$ for all $x\in\mathbb{R}^{n}$ , $|g(x,\omega)-g(x^{\prime},\omega)|\leq k_{g}|x-x^{\prime}|$ for all $x,x^{\prime}\in\mathbb{R}^{n}\ .$ 2. (ii)

There exists a modulus of continuity $\theta(.)$ such that we have

[TABLE]

and

[TABLE]

for all $\omega_{1},\omega_{2}\in\Omega$ and $x\in\mathbb{R}^{n}$ . 3. (A5)′

There exists a modulus of continuity $\theta_{f}(.)$ such that for all $\omega_{1},\omega_{2}\in\Omega$ ,

[TABLE]

This is possible if we consider the “truncation” function $tr_{y,\delta}:\mathbb{R}^{n}\rightarrow\mathbb{R}^{n}$ , defined to be

[TABLE]

and we replace $f,\ g$ and $d$ above by their local expression $\widetilde{f}$ , $\widetilde{g}$ and $\widetilde{d}$ defined as follows

[TABLE]

Indeed, the problems involving the functions $(f,g,d)$ and $(\widetilde{f},\widetilde{g},\widetilde{d})$ do coincide in a neighbourhood of the $W^{1,1}-$ local minimizer $(\bar{u},\{\bar{x}(.,\omega)\ |\ \omega\in\Omega\})$ for (LABEL:intprob). Therefore, $(\bar{u},\{\bar{x}(.,\omega)\ |\ \omega\in\Omega\})$ does remain a $W^{1,1}-$ local minimizer for the problem (LABEL:intprob) when we substitute the pair $(f,g,d)$ with $(\widetilde{f},\widetilde{g},\widetilde{d})$ . Furthermore, the assertions of the theorem are unaffected by changing the data in this way.

We provide two technical lemmas which will be employed in the approximation techniques used in the theorems proof. These preliminary results establish the uniform continuity of trajectories with respect to $\omega$ and the existence of a sequence of suitable finite support measures approximating the reference measure $\mu$ . Throughout this section, $d_{\mathbcal}{E}(.,.)$ denotes the Ekeland metric defined on the control set $\mathcal{U}$ as

[TABLE]

We recall that, given a control $u(.)$ , to make clearer which control is used we shall employ the alternative notation $x(.,u,\omega)$ for the feasible arc belonging to the family of trajectories $\{x(.,\omega)\ :\ \omega\in\Omega\}$ associated with the control $u(.)$ .

Lemma 5.1.

Let $(\Omega,\rho_{\Omega})$ be a metric space. Suppose that assumptions (A2)(i)-(ii), LABEL:A2'_nco_general_case and LABEL:A5'_nco_general_case are satisfied. Then,

(i)

we can find $\beta>0$ such that

[TABLE]

for all $u(.),u^{\prime}(.)\in\mathcal{U}$ . 2. (ii)

for all $\widetilde{\varepsilon}>0$ , we can find $\widetilde{r}>0$ , such that for any given $u(.)\in\mathcal{U}$ ,

[TABLE]

Proof.

(i) Write

[TABLE]

Fix any $\varepsilon>0$ . Take any $u(.),u^{\prime}(.)\in\mathcal{U}$ . Owing to Filippov Existence Theorem [19, Theorem 2.4.3] (recall that we have the same initial datum $x_{0}$ ), for each $\omega\in\Omega$ , we obtain

[TABLE]

The last inequality is a consequence of the bound on the dynamic (assumption LABEL:A2'_nco_general_case). The particular choice $\beta$ allows to conclude.

(ii) Fix now any $\widetilde{\varepsilon}>0$ . Take a control $u(.)\in\mathcal{U}$ . Owing to assumption LABEL:A5'_nco_general_case, we choose $\widetilde{r}>0$ such that

[TABLE]

Take $\omega,\ \omega^{\prime}\in\Omega$ such that $\rho_{\Omega}(\omega,\omega^{\prime})<\widetilde{r}.$ Taking two different trajectories $x(.,u,\omega)$ and $x(.,u,\omega^{\prime})$ with the same initial point $x_{0}$ and the same control $u(.)$ , for all $t\in[0,T]$ we have,

[TABLE]

Taking into account assumptions LABEL:A2'_nco_general_case and LABEL:A5'_nco_general_case, we conclude that

[TABLE]

Applying Gronwall Lemma, for all $t\in[0,T]$ , we deduce

[TABLE]

The particular choice of $\widetilde{r}$ as in (5.2) and the fact that $\rho_{\Omega}(\omega,\omega^{\prime})<\widetilde{r}$ allow to conclude the proof.

∎

Lemma 5.2.

Suppose that conditions (A1), (A2)(i)-(ii), LABEL:A2'_nco_general_case-LABEL:A5'_nco_general_case are satisfied, and $\mu\in\mathcal{M}(\Omega)$ . Then, there exist a sequence of finite subsets of $\Omega$ , $\{\Omega^{\ell}:=\{\omega_{j}^{\ell}\ :\ j=0,\ldots,N_{\ell}\}\}_{\ell\geq 1}$ and a sequence of convex combinations of Dirac measures $\{\mu_{\ell}\}_{\ell\geq 1}$ , such that the following properties are satisfied.

(i)

$\Omega^{\ell}\subset\Omega^{\ell+1}$ * for all integer $\ell\geq 1$ , and $\widehat{\Omega}:=\bigcup\limits_{\ell\geq 1}\Omega^{\ell}$ is a countable dense subset of $\text{supp}(\mu)$ ;* 2. (ii)

$\mu_{\ell}=\sum_{j=0}^{N_{\ell}}\alpha_{j}^{\ell}\delta_{\omega_{j}^{\ell}}$ , where $\alpha_{j}^{\ell}\in(0,1]$ and $\sum_{j=0}^{N_{\ell}}\alpha_{j}^{\ell}=1$ , and $\mu_{\ell}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ ; 3. (iii)

for each $\varepsilon>0$ , we can find $\ell_{\varepsilon}\in\mathbb{N}$ such that for all $\ell\geq\ell_{\varepsilon}$ ,

[TABLE]

and

[TABLE]

for all $u(.)\in\mathcal{U}$ .

Moreover, if the measure $\mu$ has a purely atomic component such that each atom is a singleton, then the countable set $\widehat{\Omega}$ can be constructed in such a manner that $\widehat{\Omega}$ contains all the atoms of $\mu$ .

Proof.

(i). Since $\Omega$ is a complete separable metric space, the measure $\mu$ is tight. As a consequence, for all integer $\ell\geq 1$ , there exists a compact set $K_{\ell}\subset\Omega$ such that $\mu(\Omega\setminus K_{\ell})<\frac{1}{\ell}.$ Write $\Omega_{0}^{\ell}:=(\Omega\setminus K_{\ell})\cap\text{supp}(\mu)$ . Therefore, employing an iterative argument, a suitable choice of the compact set $K_{\ell}$ allows us to obtain, for each $\ell\geq 1$ , a family of disjoint Borel subsets $\{\Omega_{j}^{\ell}\}_{j=0,\ldots,N_{\ell}}$ , for some $N_{\ell}\in\mathbb{N}$ , such that the following conditions are satisfied:

(a)

$\text{supp}(\mu)=\bigcup\limits_{j=0}^{N_{\ell}}\Omega_{j}^{\ell}$ ; 2. (b)

for each $j\in\{1,\ldots,N_{\ell}\}$ , ${\Omega}_{j}^{\ell}\subset K_{\ell}$ , and diam $(\Omega_{j}^{\ell})\leq\frac{1}{\ell}$ . (Recall that diam $(\Omega_{j}^{\ell})=\sup\limits_{a,b\in\Omega_{j}^{\ell}}\rho_{\Omega}(a,b)$ .) 3. (c)

$\mu(\Omega_{0}^{\ell})<\frac{1}{\ell}$ and $\Omega_{0}^{\ell}\supset\Omega_{0}^{\ell+1}$ .

We can also choose elements $\omega_{j}^{\ell}\in\Omega_{j}^{\ell}$ , for all $j=0,1,\ldots,N_{\ell}$ , in such a manner that we have $\{\omega_{j}^{\ell}\}_{j=0,\ldots,N_{\ell}}\subset\{\omega_{j}^{\ell+1}\}_{j=0,\ldots,N_{\ell+1}}$ . If $\text{supp}(\mu)$ is compact, then we can always assume that $\Omega_{0}^{\ell}=\emptyset$ for all integer $\ell\geq 1$ . In this case, we can relabel the elements chosen in the Borel sets $\Omega_{j}^{\ell}$ ’s, taking

[TABLE]

and we replace $N_{\ell}$ with $\widetilde{N}_{\ell}:=N_{\ell}-1$ . In any case, we obtain, for each $\ell\geq 1$ , a finite set $\Omega^{\ell}:=\{\omega_{j}^{\ell}\}_{j}$ such that $\Omega^{\ell}\subset\Omega^{\ell+1}$ . From the standard properties of complete separable metric spaces, it is easy to see that the sequence of sets $\{\Omega^{\ell}\}$ can be constructed in such a way that $\widehat{\Omega}:=\bigcup\limits_{\ell\geq 1}\Omega^{\ell}$ is (countable) dense in $\text{supp}(\mu)$ .

(ii). We assume here that $\text{supp}(\mu)$ is not compact (the compact case can be treated in a similar and easier way). Consider, for each $\ell\geq 1$ , the family of Borel disjoint subsets of $\Omega$ , $\{\Omega_{j}^{\ell}\}_{j=0,\ldots,N_{\ell}}$ and the finite sequence of elements $\{\omega_{j}^{\ell}\}_{j=0,\ldots,N_{\ell}}$ , with $\omega_{j}^{\ell}\in\Omega^{\ell}_{j}$ , provided in the proof of (i). We define the measure $\mu_{\ell}$

[TABLE]

Owing to Theorem 4.1, we can check the weak∗ convergence of the sequence $\{\mu_{\ell}\}$ on the set of bounded real valued uniformly continuous functions on $(\Omega,\rho_{\Omega})$ (instead of the set of bounded continuous functions). Take any bounded uniformly continuous function $h:\Omega\to\mathbb{R}$ . Write $M:=\sup\limits_{\omega\in\Omega}|h(\omega)|$ . Fix any $\varepsilon>0$ . Then, there exists $r_{\varepsilon}>0$ such that

[TABLE]

Let $\ell_{\varepsilon}\in\mathbb{N}$ such that $\frac{1}{\ell_{\varepsilon}}\leq\min\{r_{\varepsilon};\frac{\varepsilon}{4M}\}$ . Then for all $\ell\geq\ell_{\varepsilon}$ , we have

[TABLE]

For each $j\in\{1,\ldots,N_{\ell}\}$ , we define

[TABLE]

Therefore, we can find $y_{j}^{\ell},\ z_{j}^{\ell}\in\Omega_{j}^{\ell}$ such that

[TABLE]

Then for all $\ell\geq\ell_{\varepsilon}$ , using also (5.4) and the fact that $\text{diam}(\Omega_{j}^{\ell})\leq\frac{1}{\ell}$ , it follows that

[TABLE]

As a consequence, for all $\ell\geq\ell_{\varepsilon}$ , from (5.5) we deduce that

[TABLE]

Then, from inequality (5.6) and the choice of $\ell_{\varepsilon}$ , for all $\ell\geq\ell_{\varepsilon}$ , we obtain

[TABLE]

Setting $\alpha_{j}^{\ell}:=\mu(\Omega^{\ell}_{j})>0$ , for $j=0,\ldots,N_{\ell}$ , we conclude the proof of (ii).

(iii). Fix any $\varepsilon>0$ . Choose $r_{0}>0$ such that

[TABLE]

Take any $\omega_{1},\ \omega_{2}\in\Omega$ such that $\rho_{\Omega}(\omega_{1},\omega_{2})<r_{0}$ . Then, from assumption 0(A4)′(ii)

[TABLE]

Take any $u(.)\in\mathcal{U}$ . From Lemma 5.1(ii), there exists $\widetilde{r}>0$ such that for all $\omega_{1},\omega_{2}\in\Omega$ verifying $\rho_{\Omega}(\omega_{1},\omega_{2})<\tilde{r}$ , we have

[TABLE]

Write $r_{\varepsilon}:=\min\{\widetilde{r},r_{0}\}$ . For all $\omega_{1},\omega_{2}\in\Omega$ verifying $\rho_{\Omega}(\omega_{1},\omega_{2})\leq r_{\varepsilon}$ , from assumption 0(A4)′(i), we deduce

[TABLE]

Similarly, $|d_{C(\omega_{1})}(x(T,u,\omega_{1}))-d_{C(\omega_{2})}(x(T,u,\omega_{2}))|\leq\frac{\varepsilon}{2}$ . Therefore, for each $u(.)\in\mathcal{U}$ , the maps $\omega\mapsto g(x(T,u,\omega);\omega)$ and $\omega\mapsto d_{C(\omega)}(x(T,u,\omega))$ are uniformly continuous, and from LABEL:A4'_nco_general_case (uniformly) bounded by the constant $M$ (observe that $M$ and $r_{\varepsilon}$ above do not depend on $u(.)$ ). Invoking the same argument employed in the proof of (ii) we conclude that, whenever we fix $\varepsilon>0$ , we can find $\ell_{\varepsilon}\in\mathbb{N}$ such that for all $\ell\geq\ell_{\varepsilon}$ , we have

[TABLE]

and

[TABLE]

This confirms property (iii).

Finally, if the measure $\mu$ has a purely atomic component such that each atom is a singleton, then at each step of the iterative argument employed in (i), the compact set $K_{\ell}\subset\Omega$ , for all $\ell\geq 1$ , is such that it contains a finite number of atoms of $\mu$ which will be included in $\Omega^{\ell}$ .

∎

Proof of Theorem 3.1. The proof is build up in four parts. The first part consists in approximating the reference problem with a given probability measure by an auxiliary problem which involves measures with finite support. This is possible invoking the result on the weak∗ convergence established in Lemma 5.2 and the Ekeland’s variational Principle. In the second part, we apply necessary optimality conditions (cf. Proposition 2.1 previously obtained) for the auxiliary problem. In the third part, we pass to the limit a first time to obtain optimality conditions on a countable dense subset of supp $(\mu)$ . The last part of the proof is devoted to deriving, via a second limit-taking process, all the desired necessary conditions of the theorem statement. Since it is not restrictive to assume that supp $(\mu)=\Omega$ , we shall consider this assumption throughout the proof. 1. Take a $W^{1,1}-$ local minimizer $(\bar{u},\{\bar{x}(.,\omega)\ :\ \omega\in\Omega\})$ for problem (LABEL:intprob). Then there exists $\bar{\varepsilon}>0$ such that

[TABLE]

for all feasible processes $(u,\{x(.,\omega)\ :\ \omega\in\Omega\})$ such that

[TABLE]

Take a decreasing sequence $\epsilon_{i}\downarrow 0$ such that $\beta\epsilon_{i}\leq\frac{\bar{\varepsilon}}{4}$ for all $i\geq 1$ , where $\beta>0$ is the number provided by Lemma 5.1. For each $i$ , we define the functional $J_{i}:({\mathbcal U},d_{\mathbcal E})\rightarrow\mathbb{R}$ as follows:

[TABLE]

It is clear that $J_{i}(u)\geq 0$ , for all controls $u(.)$ . Moreover, we have $J_{i}(u)>0$ for all controls $u\in\mathcal{U}_{\bar{\varepsilon}}$ , where

[TABLE]

Otherwise, there would exist $\hat{u}\in\mathcal{U}_{\bar{\varepsilon}}$ such that $J_{\Omega}((\hat{u}(.),\{\hat{x}(.,\omega)\}))<J_{\Omega}((\bar{u}(.),\{\bar{x}(.,\omega)\}))$ , contradicting the fact that $(\bar{u},\{\bar{x}(.,\omega)\ :\ \omega\in\Omega\})$ is a $W^{1,1}-$ local minimizer for (P). Observe also that

[TABLE]

which means that $\bar{u}$ is an $\epsilon_{i}^{2}-$ minimizer for $J_{i}$ on ${\mathbcal U}$ . Then, since $J_{i}$ is a continuous function on the complete metric space $({\mathbcal U},d_{\mathbcal E})$ (it suffices to use here the Lipschitz continuity of $g(.,\omega)$ and $d_{C(\omega)}(.)$ and Lemma 5.1(i)), we deduce from Ekeland’s Theorem (cf. [19, Theorem 3.3.1]) that, for each $i\geq 1$ , there exists $v_{i}\in{\mathbcal U}$ such that

[TABLE]

Consider the sequence of convex combinations of Dirac measures $\{\mu_{\ell}\}$ provided by Lemma 5.2. Recall, in particular, that $\mu_{\ell}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\mu$ and

[TABLE]

where $\alpha_{j}^{\ell}\in(0,1]$ , for all $j=0,\ldots,N_{\ell}$ and $\sum_{j=0}^{N_{\ell}}\alpha_{j}^{\ell}=1$ . We can find a decreasing sequence $\rho_{i}\downarrow 0$ , with $\beta\rho_{i}\leq\frac{\bar{\varepsilon}}{4}$ for all $i\geq 1$ , and an increasing sequence $\{\ell_{i}\in\mathbb{N}\}_{i\geq 1}$ such that, setting

[TABLE]

(we write $\Omega^{i}:=\Omega^{N_{\ell_{i}}}\subset\widehat{\Omega}\subset\Omega$ , $\mu_{i}:=\mu_{\ell_{i}}$ for the corresponding convex combination of $(N_{i}+1=N_{\ell_{i}}+1)$ Dirac measures which approximate $\mu$ , and $\omega_{j}^{i}:=\omega_{j}^{\ell_{i}}$ , $j=0,1,\ldots,N_{i}$ , so that $\Omega^{i}=\{\omega_{j}^{i}\}_{j=0}^{N_{i}}$ ), we have

[TABLE]

and

[TABLE]

Therefore, $v_{i}$ is a $\rho_{i}^{2}-$ minimizer on ${\mathbcal U}$ for

[TABLE]

Invoking Ekeland’s theorem one more time, we deduce that there exists $u_{i}\in{\mathbcal U}$ which minimizes

[TABLE]

such that $d_{\mathcal{E}}(u_{i},v_{i})\leq\rho_{i}$ . As a consequence we obtain

[TABLE]

Write $(u_{i},\{x_{i}(.,\omega)\ :\ \omega\in\Omega\})$ the process associated with the control $u_{i}$ . Therefore, from Lemma 5.1 (i) we have that

[TABLE]

Bearing in mind (5.9) it immediately follows that $\widetilde{J}_{i}(u_{i})>0$ .

Now we introduce two $\mathcal{L}\times\mathcal{B}^{m}-$ measurable functions

[TABLE]

Therefore we can write:

[TABLE]

The minimizing property (5.10) can be expressed in terms of the following auxiliary optimal control problem

[TABLE]

whose minimizer is the family $(u_{i},(\gamma_{i},\zeta_{i}\equiv 0,\{x_{i}(.,\omega)\}))$ verifying, as $i\rightarrow\infty$ , $d_{\mathbcal E}(u_{i},\bar{u})\rightarrow 0$ and

[TABLE]

2. The second step of the proof consists in applying necessary optimality conditions (cf. Proposition 2.1) to problem (Pi) for each $i$ sufficiently large: for all $\omega\in\Omega^{i}$ (that is for $\mu_{i}-$ a.e. $\omega\in\Omega$ ), there exist $W^{1,1}-$ arcs $p_{i}(.,\omega)$ (associated with the state variable $x$ ), $q_{i}(.)$ (associated with the variable $\gamma$ ), and $z_{i}(.)$ (associated with the variable $\zeta$ ) such that

[TABLE]

and satisfying the necessary conditions below:

The transversality condition (owing to the Max Rule [19, Theorem 5.5.2]), for suitable $\lambda_{i}\in[0,1]$ , leads to

[TABLE]

(Here, $\alpha^{i}_{j}:=\alpha^{\ell_{i}}_{j}$ , for $j=0,1,\dots,N_{i}$ .) The adjoint system gives $-\dot{q}_{i}(t)\equiv 0$ and $-\dot{z}_{i}(t)\equiv 0$ , which implies that $q_{i}(t)\equiv-\epsilon_{i}\ ,$ and $z_{i}(t)\equiv-\rho_{i}\ .$ Moreover,

[TABLE]

From the maximality condition, we obtain, for a.e. $t\in[0,T]$

[TABLE]

This implies that for a.e. $t\in[0,T]$ and for every $u\in U(t)$

[TABLE]

From (5.11) we deduce that

[TABLE]

Moreover, taking note of the fact that $d_{\mathcal{E}}(u_{i},\bar{u})\leq\rho_{i}^{\prime}$ and, owing to Lemma 5.1 (i), we can also deduce that

[TABLE]

Therefore, for each $i$ , and $\mu_{i}-$ a.e. $\omega\in\Omega$ , from the optimality conditions (5.14)-(5.17), we have

(a1)

$p_{i}(.,\omega)\neq 0$ ; 2. (a2)

$-\dot{p}_{i}(t,\omega)\in\textrm{co }\partial_{x}[p_{i}(t,\omega)\cdot f(t,x_{i}(t,\omega),\bar{u}(t),\omega)]$ for all $t\in A_{\rho_{i}^{\prime}}$ ; 3. (a3)

$-p_{i}(T,\omega)\in\alpha_{j}^{i}\lambda_{i}\partial_{x}g(x_{i}(T,\omega);\omega)+\alpha_{j}^{i}(1-\lambda_{i})\partial d_{C(\omega)}(x_{i}(T,\omega))\ ;$ 4. (a4)

$\sum_{\omega\in\Omega^{i}}p_{i}(t,\omega)\cdot[f(t,x_{i}(t,\omega),u,\omega)-f(t,x_{i}(t,\omega),\bar{u}(t),\omega)]\leq\rho_{i}^{\prime}$ for all $t\in A_{\rho_{i}^{\prime}}$ and for any $u\in U(t)\ .$

Following the idea of Proposition 2.1, and dividing each term of the family of the costate arcs across by the corresponding coefficient $\alpha_{j}^{i}(>0)$ (without relabelling), we obtain that for each $i$ large enough and $\mu_{i}-$ a.e. $\omega\in\Omega$ ,

(a1)′

$p_{i}(.,\omega)\neq 0$ ; 2. (a2)′

$-\dot{p}_{i}(t,\omega)\in\textrm{co }\partial_{x}[p_{i}(t,\omega)\cdot f(t,x_{i}(t,\omega),\bar{u}(t),\omega)]$ for all $t\in A_{\rho_{i}^{\prime}}$ ; 3. (a3)′

$-p_{i}(T,\omega)\in\lambda_{i}\partial_{x}g(x_{i}(T,\omega);\omega)+(1-\lambda_{i})\partial d_{C(\omega)}(x_{i}(T,\omega))\ ;$ 4. (a4)′

$\int_{\Omega}p_{i}(t,\omega)\cdot[f(t,x_{i}(t,\omega),u,\omega)-f(t,x_{i}(t,\omega),\bar{u}(t),\omega)]\ d\mu_{i}(\omega)\leq\rho_{i}^{\prime}$ for all $t\in A_{\rho_{i}^{\prime}}$ and for any $u\in U(t)\ .$

3. We derive now consequences of the limit-taking for conditions (a1)′-(a3)′ of the previous step. Recall that from Lemma 5.2, we have a countable dense subset $\widehat{\Omega}$ of $\Omega$ , such that $\widehat{\Omega}\ =\ \bigcup_{i\geq 1}\Omega^{i}\ ,$ where $\Omega^{i}=\{\omega_{j}^{i}\ :\ j=0,\ldots,N_{i}\}$ provides an increasing sequence of finite subsets of $\Omega$ : $\Omega^{1}\subset\ldots\subset\Omega^{i}\subset\Omega^{i+1}\subset\ldots$ . Since $\widehat{\Omega}$ is a countable set, we can write it as the collection of the elements of a sequence $\{\omega_{k}\}_{k\geq 1}$ such that

[TABLE]

Fix $i\in\mathbb{N}$ . When we take $\omega_{k}\in\widehat{\Omega}$ , two possible cases may occur: either $\omega_{k}\in\Omega^{i}$ for the fixed $i\in\mathbb{N}$ ; or $\omega_{k}\in\widehat{\Omega}\setminus\Omega^{i}$ . In the first case, it means that there exists $j\in\{0,\ldots,N_{i}\}$ such that $\omega_{k}=\omega_{j}^{i}$ and the corresponding adjoint arc $p_{i}(.,\omega_{j}^{i})$ satisfies conditions (a1)′-(a4)′. So, we can define the arc $p_{i}(.,\omega_{k})$ as follows:

[TABLE]

Therefore, by iterating on $i$ , associated with each $\omega_{k}\in\widehat{\Omega}$ , we can construct a sequence of families of arcs $\{p_{i}(.,\omega_{k}):\omega_{k}\in\widehat{\Omega}\}_{i\geq 1}$ . Observe that there exists always $i_{k}\in\mathbb{N}$ such that, for all $i\geq i_{k}$ , $p_{i}(.,\omega_{k})$ is an adjoint arc for which (a1)′-(a4)′ hold true. From (a3)′ and (A4)′ it immediately follows that the sequence $\{p_{i}(T,\omega_{k})\}$ is uniformly bounded by $k_{g}+1$ . On the other hand (a2)′ and (A3)′ imply that $\{\dot{p}_{i}(.,\omega_{k})\}$ are uniformly integrably bounded. Then, the hypotheses are satisfied under which the Compactness Theorem [19, Theorem 2.5.3] is applicable to

[TABLE]

We conclude that, along some subsequence (we do not relabel),

[TABLE]

for some $\widehat{p}(.,\omega_{k})\in W^{1,1}$ which satisfies (for the fixed $k$ )

[TABLE]

We can also take the subsequence in such a manner that $\{\lambda_{i}\}$ converges to some $\lambda\in[0,1]$ . Moreover, from the closure of the graph of the limiting subdifferential and the normal cone (seen as multifunctions), we have that

[TABLE]

But $\widehat{\Omega}=\{\omega_{k}\}_{k}$ is a countable set. Then, we can repeat the similar analysis for each $\omega_{k}\in\widehat{\Omega}$ , taking into account the subsequence obtained for the previous element $\omega_{k-1}$ . As a consequence, we have a collection of subsequences $\{\widetilde{p}_{i}(.,\omega)\}$ verifying the convergence properties (5.19) to a collection of adjoint arcs $\{\widetilde{p}(.,\omega)\}$ which satisfies, for all $\omega\in\widehat{\Omega}$

[TABLE]

and

[TABLE]

Furthermore, since for all $i$ , $\widetilde{p}_{i}(.,.)$ is $\mathcal{L}\times\mathcal{B}_{\widehat{\Omega}}$ measurable, we obtain that its limit $\widetilde{p}(.,.)$ is also $\mathcal{L}\times\mathcal{B}_{\widehat{\Omega}}$ measurable. The final step is represented by the extension of $\widetilde{p}(.,.)$ to a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable function $p(.,.)$ on $[0,T]\times\Omega$ which satisfies conditions (5.20) and (5.22) below when restricted to $\widehat{\Omega}$ . This can be done as follows. Writing explicitly the coordinates of $\widetilde{p}(.,.)=(\widetilde{p}^{1}(.,.),\dots,\widetilde{p}^{n}(.,.))$ , for each $j=1,\dots,n$ , we have the decomposition into the positive and negative parts: $\widetilde{p}^{j}=\widetilde{p}^{j+}-\widetilde{p}^{j-}$ . Consider a sequence of simple functions $\widetilde{\phi}_{k}(.,.)$ (for $\mathcal{L}\times\mathcal{B}_{\widehat{\Omega}}$ ) which approximates from below $\widetilde{p}^{j+}(.,.)$ : $0\leq\widetilde{\phi}_{k}\uparrow\widetilde{p}^{j+}$ . Let $\phi_{k}(.,.)$ be the simple function which provides an extension of $\widetilde{\phi}_{k}(.,.)$ to $\mathcal{L}\times\mathcal{B}_{\Omega}$ . Then, define

[TABLE]

Then, we obtain the desired extension setting $p^{j}=p^{j+}-p^{j-}$ and $p(.,.)=(p^{1}(.,.),\dots,p^{n}(.,.))$ . Clearly we have the following transversality condition:

[TABLE]

Finally, we derive a non-triviality condition for $\{p(.,\omega)\;:\;\omega\in\widehat{\Omega}\}$ . This is immediate if the $\lambda=\lim\lambda_{i}>0$ , so we continue examining the case in which $\lambda=0$ . Choose $i_{0}\in\mathbb{N}$ such that for all $i\geq i_{0}$ , $(k_{g}+1)\lambda_{i}<\frac{1}{2}$ . In particular, for all $i\geq i_{0}$ , from the Max Rule we have $1-\lambda_{i}>0$ , and using the fact that $\widetilde{J}_{i}(u_{i})>0$ , it follows that

[TABLE]

Then there exists $j_{i}\in\{0,1,\dots,N_{i}\}$ and $\nu\in\mathbb{R}^{n}$ such that $|\nu|=1$ and

[TABLE]

Recalling that $k_{g}>0$ is the Lipschitz constant of $g(.,\omega)$ , we have

[TABLE]

And from the choice of $i_{0}\in\mathbb{N}$ , we obtain that

[TABLE]

and so

[TABLE]

We deduce that

[TABLE]

In any case, we obtain the non-triviality condition

[TABLE]

4. In the last part of the proof, we want to use also the information contained in the maximality condition (a4)′ (or in its alternative version (5.17)) as $i\rightarrow\infty$ . This task requires to use Castaing’s Representation Theorem (cf. [8, Theorem III.7], the Aumann’s Measurable Selection Theorem (cf. [8, Theorem III.22]), and Lemma 4.3 which has a central role for the limit-taking of all the necessary conditions obtained in Step 2 at the same time. Write

[TABLE]

Owing to assumption (A2) and the Lipschitz continuity of $f(t,.,u,\omega)$ , we obtain that $(t,\omega)\leadsto F(t,\omega)$ is a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable with closed values. Using the Castaing’s Representation Theorem, we know that there exists a countable family of $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable functions $\{f_{j}(t,\omega)\}_{j\geq 0}$ , such that

[TABLE]

in which $E\subset[0,T]\times\Omega$ is a set of full-measure. We can also assume that $f_{0}(t,\omega)=f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)$ . For all $j\geq 1$ , define the multifunction

[TABLE]

The graph of $\widetilde{U}_{j}(.,.)$ is a $\mathcal{L}\times\mathcal{B}_{\Omega}\times\mathcal{B}^{m}$ measurable set. Indeed, we have

[TABLE]

which is the union of two $\mathcal{L}\times\mathcal{B}_{\Omega}\times\mathcal{B}^{m}$ measurable sets. Now invoking Aumann’s Measurable Selection Theorem, we deduce that $\widetilde{U}_{j}(.,.)$ has a measurable selection $v_{j}(t,\omega)\in\widetilde{U}_{j}(t,\omega)$ .

Let now $\mathcal{D}$ be a countable and dense subset of $[0,T]$ . Consider the sequence of intervals $\{[s_{i},t_{i}]\}_{i\geq 1}$ having extrema in $\mathcal{D}:\ \bigcup\limits_{i\geq 1}\{s_{i},t_{i}\}=\mathcal{D}$ . We construct now a further countable family of controls $\{\widetilde{v}_{j,i}(t,\omega)\}_{j\geq 1,\ i\geq 1}$ as follows

[TABLE]

Writing $\{\widetilde{u}_{k}(t,\omega)\}_{k\geq 0}=\{\widetilde{v}_{j,i}(t,\omega)\}_{j\geq 1,i\geq 1}\cup\{\bar{u}(.)\}$ , in such a manner that (up to a reordering) $\widetilde{u}_{0}(.,\omega)=\bar{u}(.)$ , we obtain

[TABLE]

Following an effective technique proposed by Vinter [18], for a fixed integer $K$ , we introduce the operators $\Psi_{k}(.,.)$ and $\Psi^{i}_{k}(.,.)$ on $W^{1,1}([0,T],\mathbb{R}^{n})\times\Omega$ (linear with respect to their first variable): for $k=1,\ldots,K$ , we set

[TABLE]

and, for all integers $i\geq 1$ ,

[TABLE]

Define also the subsets $D_{i}$ , for all $i\geq 1$ , and $D$ of $\Omega\times\mathbb{R}^{K}$ as follows:

[TABLE]

where $\{\Omega^{i}\}$ is the increasing sequence of (finite) subsets introduced in Step 3 (cf. Lemma 5.2), and

[TABLE]

in which $\epsilon_{i}^{\prime}:=\beta\rho_{i}^{\prime}$ . The set $D$ is written

[TABLE]

where $\widehat{\Omega}$ is the countable dense subset of $\Omega$ ( $=\mbox{supp}(\mu)$ in our assumptions) provided by Lemma 5.2 and

[TABLE]

Now, we define the multifunctions $D_{i}(.)$ , for $i=1,2,\ldots$ , and $D(.)$ on $\Omega$ , taking values in the subsets of $\mathbb{R}^{K}$ as follow:

[TABLE]

The multifunctions $\omega\leadsto D(\omega)$ and $\omega\leadsto D_{i}(\omega)$ , for all $i$ , are uniformly bounded. The necessary optimality conditions (a1)′-(a3)′ corresponding to the auxiliary problem (Pi) of Step 2 guarantee that the set $D_{i}(\omega)$ is non-empty : indeed there exist $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable functions $p_{i}:[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ such that $p_{i}(.,\omega)\in\mathcal{P}_{i}(\omega)$ $\mu_{i}-$ a.e. $\omega\in\Omega$ and so

[TABLE]

Moreover, the linearity of the operator $\Psi_{k}$ with respect to the first variable $p$ and the convexity of the set $\text{co }\mathcal{P}(\omega)$ guarantee the convexity of the set $D(\omega)$ for each $\omega\in\text{dom }D(.)$ . It follows that hypotheses (i)-(iii) of Lemma 4.3 are satisfied. We claim that

[TABLE]

Indeed, take any $(\omega,\xi)\in\limsup\limits_{i\rightarrow\infty}D_{i}$ . From the definition of the limsup in the Kuratowski sense, there exists a subsequence $i_{h}\rightarrow\infty$ and $(\omega_{i_{h}},\xi_{i_{h}})\in D_{i_{h}}$ such that

[TABLE]

We shall show that $(\omega,\xi)\in D$ . Since $(\omega_{i_{h}},\xi_{i_{h}})\in D_{i_{h}}$ , there exists a sequence of $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable functions $p_{i_{h}}:[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ such that $p_{i_{h}}(.,\omega)\in\mathcal{P}_{i_{h}}(\omega)$ for all $\omega\in\Omega^{i_{h}}$ . From the analysis of Step 3, we have established the existence of a map $p$ on $[0,T]\times\Omega$ which is $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable, verifying conditions (5.20), (5.22), and (5.24) for all $\omega\in\widehat{\Omega}$ . Moreover, the uniform convergence of $\{p_{i_{h}}(.,\omega)\ :\ \omega\in\widehat{\Omega}\}$ , Lemma 5.1 and assumption LABEL:A2'_nco_general_case guarantee that, for $k=1,\ldots,K$ and for all $\omega\in\widehat{\Omega}$ ,

[TABLE]

converges, as $i_{h}\rightarrow\infty$ , to

[TABLE]

Therefore, $(\omega,\xi)\in D$ and the claim is confirmed. Consequently, all required hypotheses of Lemma 4.3 are satisfied for $\gamma_{i}(\omega)=(\gamma_{i,1}(\omega),\ldots,\gamma_{i,K}(\omega))$ where for $k=1,\ldots,K$ ,

[TABLE]

which is $\mu_{i}-$ measurable. Defining, for each $i$ , the vector-valued measure $\eta_{i}:=\gamma_{i}\mu_{i}$ , and applying Lemma 4.3, we obtain, along a subsequence (we do not relabel) $\eta_{i}\stackrel{{\scriptstyle*}}{{\rightharpoonup}}\eta$ where $\eta$ is a vector-valued Borel measure on $\Omega$ such that $d\eta(\omega)=\gamma(\omega)\ d\mu(\omega)$ , for some Borel measurable function $\gamma:\Omega\rightarrow\mathbb{R}^{K}$ satisfying

[TABLE]

In addition, from the definition of the set $D$ (associated with each $K\in\mathbb{N}$ ), there exists a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable function $p_{K}:[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ such that $p_{K}(.,\omega)\in\text{co }\mathcal{P}(\omega)$ for all $\omega\in\widehat{\Omega}$ , and $\gamma(\omega):=\big{(}\Psi_{k}(p_{K}(.,\omega),\omega)\big{)}_{k=1,\ldots,K}$ verifying

[TABLE]

In other terms, for each $k=1,\ldots,K$

[TABLE]

The maximality condition (a4)′ of Step 2, after inserting $u=\widetilde{u}_{k}(t,\omega)$ , gives

[TABLE]

Since in (5.28) the integrand function is $\mathcal{L}\times\mathcal{B}_{\Omega}-$ measurable, and the integral function is $\mathcal{L}-$ measurable, making use of Fubini-Tonelli, we obtain

[TABLE]

Therefore, letting $i\to\infty$ and invoking (5), we have that

[TABLE]

For each $K\in\mathbb{N}$ , the map $\omega\rightarrow p_{K}(.,\omega)$ can be interpreted as a $\mathcal{B}_{\Omega}-$ measurable element of the $\mu-$ a.e. equivalence class in the Hilbert space

[TABLE]

endowed with the inner product

[TABLE]

Now consider $\widehat{\mathcal{P}}$ to be the set of $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable functions $\widehat{q}$ of $\mathcal{H}$ defined on $[0,T]\times\Omega$ such that $\widehat{q}(.,\omega)\in\text{co }\mathcal{P}(\omega)$ for all $\omega\in\widehat{\Omega}$ :

[TABLE]

Note that $\widehat{\mathcal{P}}$ is nonempty since $p_{K}(.,\omega)\in\text{co }\mathcal{P}(\omega)$ for all $\omega\in\widehat{\Omega}$ . Moreover, it is a straightforward task to prove that $\widehat{\mathcal{P}}$ is a closed and convex subset in $\mathcal{H}$ (owing to the convexity and the closure of the set $\text{co }\mathcal{P}(\omega)$ for all $\omega\in\widehat{\Omega}$ ). Therefore, $\widehat{\mathcal{P}}$ is weakly closed, as well. Moreover, the sequence $\{\omega\rightarrow p_{K}(.,\omega)\}_{K=1}^{\infty}$ is (uniformly) bounded, w.r.t. the norm induced by $\big{<}.,.\big{>}_{\mu}$ because it belongs to the bounded set $\text{co }\mathcal{P}(\omega)$ for all $\omega\in\widehat{\Omega}$ . By subsequence extraction (without relabelling), there exists a weakly convergent subsequence to $\{\omega\rightarrow p(.,\omega)\}$ for some $p\in\widehat{\mathcal{P}}$ . The weak convergence $p_{K}\rightharpoonup p$ in the Hilbert space $(\mathcal{H},\big{<}.,.\big{>}_{\mu})$ , employed in inequality (5.29), implies that

[TABLE]

We observe that condition (5.26) yields the following inclusion for all $t\in\mathcal{S}$

[TABLE]

where $\mathcal{S}$ is a set of full measure in $[0,T]$ . Define now the set $\mathbcal S^{\prime}\subset\mathbcal S$ , still of full measure in $[0,T]$ , containing the Lebesgue points for the map $\Gamma:[0,T]\rightarrow\mathbb{R}$ defined as

[TABLE]

for all $k$ . Take any $t\in\mathcal{S}^{\prime}$ and $u\in U(t)$ . Owing to (5.31), there exists a subsequence $\{k_{\ell}\}_{\ell}$ such that

[TABLE]

In other words, for a sequence $\beta_{\ell}\downarrow 0$ (possibly taking a subsequence of $\widetilde{u}_{k_{\ell}}$ ), we have

[TABLE]

For the Lebesgue point $t\in\mathbcal S^{\prime}$ , we can also consider a sequence of intervals $\{[s_{i},t_{i}]\}_{i\geq 1}$ , having extrema in a countable dense set $\mathcal{D}$ of $[0,T]$ (in the sense of (5.25)) and such that $s_{i}\uparrow t$ and $t_{i}\downarrow t$ . Recalling the definition (5.25) of $\widetilde{v}_{j,i}$ and replacing in (5.30) $\widetilde{u}_{k}$ by $v_{j}(t,\omega)$ on $[s_{i},t_{i}]\times\Omega$ , and by $\bar{u}(t)$ on $([0,T]\setminus[s_{i},t_{i}])\times\Omega$ , using Fubini-Tonelli (since the integrand is $\mathcal{L}\times\mathcal{B}_{\Omega}-$ measurable) and dividing across by $|t_{i}-s_{i}|$ , we obtain

[TABLE]

Since $t$ is a Lebesgue point for the map $\Gamma$ , we deduce

[TABLE]

Therefore, owing to (5.32)-(5), we have

[TABLE]

for any $\beta_{\ell}\downarrow 0$ and any $u\in U(t)$ . We conclude that

[TABLE]

for any $u\in U(t)$ and for all $t\in\mathcal{S}^{\prime}$ , a set of full measure in $[0,T]$ . Therefore, now all the assertions stated in Theorem 3.1 are confirmed (included the maximality condition (ii)), which completes the proof. ∎

Proof of Theorem 3.2. A purely atomic measure has necessarily at most a countable support. We can therefore choose $\widehat{\Omega}$ in such a manner that $\widehat{\Omega}=\text{supp}(\mu)$ . The properties (i) and (iii) follow immediately considering Steps 1, 2 and 3 of Theorem 3.1 proof and the obtained costate arc $p(.,.)$ . On the other hand, the maximality condition (ii) can be deduced by contradiction, avoiding any use of the technical procedure of Step 4 of Theorem 3.1 proof which requires the construction of appropriate multifunctions and the use of selection theorems. We provide here the details of this ‘new step 4’ which allows to obtain (ii).

Consider the function

[TABLE]

Using a standard argument, one can easily show that

[TABLE]

Therefore, setting, for $j\in\mathbb{N}$

[TABLE]

we have that $E_{j}$ is a $\mathcal{L}\times\mathcal{B}^{m}-$ measurable set. Define

[TABLE]

Then $\{B_{j}\}_{j\geq 1}$ is an increasing sequence of $\mathcal{L}-$ measurable sets. Consider the following $\mathcal{L}\times\mathcal{B}^{m}-$ measurable set $E$

[TABLE]

and denote by $E_{t}$ the $t-$ section of $E$ , i.e.

[TABLE]

Then, $E_{t}:=\cup_{j\geq 1}B_{j}$ .

Now assume, by contradiction, that (ii) of Theorem 3.2 is violated. Therefore, meas $(E_{t})>0$ . Write $\delta:=\text{meas}(E_{t}).$ Since, $\text{meas}(E_{t})=\lim\limits_{j\to\infty}\text{meas}(B_{j})$ , there exists $j_{0}\in\mathbb{N}$ such that $\text{meas}(B_{j})\geq\frac{\delta}{2}$ for all $j\geq j_{0}$ . Therefore, for all $t\in B_{j_{0}}$ , there exists $u_{t}\in U(t)$ such that $\Psi(t,u_{t})\geq\frac{1}{j_{0}}$ . Take $i_{0}\in\mathbb{N}$ such that

[TABLE]

(here $c>0$ is the upper bound for $|f|$ (see (A3)′) and $M_{p}>0$ is an upper bound for $||p(.,\omega)||_{L^{\infty}}$ ), and

[TABLE]

(Recall that $\beta>0$ is the number provided by Lemma 5.1 (i) and $\{\rho_{i}^{\prime}\}$ is the decreasing sequence appearing in Step 2 of the proof of Theorem 3.1.)

For all $i\geq i_{0}$ and for all $t\in B_{j_{0}}\cap A_{\rho_{i}}$ , we have

[TABLE]

Condition (a4)′ established in Step 2 of Theorem 3.1 proof implies that the first term on the right-hand side of (5) satisfies

[TABLE]

Concerning the second term on the right-hand side of (5) we make use of the boundedness of $f$ and $\|p(.,\omega)\|_{L^{\infty}}$ , and the estimate (5.12): we obtain

[TABLE]

Take $i_{1}\geq i_{o}$ large enough such that for all $i\geq i_{1}$

[TABLE]

Therefore, owing to the choice made in (5.35), we have $S\leq 2k_{f}(t)\beta M_{p}\rho_{i}^{\prime}+\frac{1}{8j_{0}}+\frac{1}{4j_{0}}$ . Then, from (5), we obtain that

[TABLE]

By integrating over the measurable set $B_{j_{0}}\cap A_{\rho_{i}^{\prime}}$ , taking into account that $\frac{\delta}{2}\leq\mbox{meas}(B_{j_{0}})\leq\delta$ and meas $([0,T]\setminus A_{\rho^{\prime}_{i}})\leq\rho_{i}^{\prime}$ , we arrive at

[TABLE]

a contradiction. Therefore, also the maximality condition (ii) of Theorem 3.2 holds true.

∎

Proof of Theorem 3.3. A scrutiny of Theorem 3.1 proof reveals that Steps 1, 2 and 3 are applicable providing a simplified result. Indeed, taking into account hypotheses (C1)-(C2) on $f(t,.,u,\omega)$ and $g(.,\omega)$ , we obtain a family of costate arcs $\widetilde{p}(.,\omega)$ , for $\omega\in\widehat{\Omega}$ ( $\widehat{\Omega}$ is a countable dense subset of $\mbox{supp}(\mu)$ ), satisfying the properties listed at the end of the Step 3 of the proof of Theorem 3.1, where (5.20) and (5.22) read now as

[TABLE]

and

[TABLE]

for all $\omega\in\hat{\Omega}$ . Notice, that the multiplier $\lambda$ cannot take the value [math], for otherwise we would obtain a contradiction with the nontriviality condition. Then, normalizing we can take $\lambda=1$ .

We claim now that we can extend in a unique way the family of arcs $\widetilde{p}(.,\omega)$ , for $\omega\in\widehat{\Omega}$ , to a $\mathcal{L}\times\mathcal{B}_{\Omega}$ measurable function $p(.,.):[0,T]\times\Omega\rightarrow\mathbb{R}^{n}$ such that for all $\omega\in\mbox{supp}(\mu)$ we have:

(i)′′

$p(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n})$ ; 2. (ii)′′

$-\dot{p}(t,\omega)=[\nabla_{x}f(t,\bar{x}(t,\omega),\bar{u}(t),\omega)]^{T}p(t,\omega)$ a.e. $t\in[0,T]$ ; 3. (iii)′′

$-p(T,\omega)=\nabla_{x}g(\bar{x}(T,\omega);\omega)$ .

Indeed, take any $\omega\in\Omega\setminus\widehat{\Omega}$ . If $\omega\in\Omega\setminus\mbox{supp}(\mu)$ we set $p(.,\omega)=0$ . So we continue the analysis considering the case $\omega\in\mbox{supp}(\mu)\setminus\widehat{\Omega}$ . Then, since $\widehat{\Omega}$ is dense in $\mbox{supp}(\mu)$ , there exists a sequence $\{\widehat{\omega}_{i}\}\subset\widehat{\Omega}$ converging to $\omega$ . Assumptions LABEL:A2'_nco_general_case and LABEL:A4'_nco_general_case guarantee that $|\nabla_{x}f(t,\bar{x}(t,\omega),\bar{u}(t),\widehat{\omega}_{i})|\leq k_{f}(t)$ a.e. $t\in[0,T]$ and $|\nabla_{x}g|\leq k_{g}$ . From (5.38) we deduce that $\{\dot{\widetilde{p}}(.,\widehat{\omega}_{i})\}$ is uniformly integrally bounded, and (5.39) guarantees that $|\widetilde{p}(T,\widehat{\omega}_{i})|\leq k_{g}$ . Then, by a standard compactness argument, taking a subsequence (we do not relabel), there exists $p(.,\omega)\in W^{1,1}([0,T],\mathbb{R}^{n})$ such that

[TABLE]

and

[TABLE]

(The last two equalities are a consequence of Lemma 5.1 (ii).)

This, being true for any sequence $\{\widehat{\omega}_{i}\}\subset\widehat{\Omega}$ converging to $\omega\in\mbox{supp}(\mu)\setminus\widehat{\Omega}$ , since the limit arc satisfies the same conditions (5.40)-(5.41), we conclude that we can extend the family of arcs $\widetilde{p}(.,\omega)$ simply taking the limit:

[TABLE]

confirming the claim above. It remains to prove the Weierstrass condition (ii)′. We follow exactly the same analysis of Step 4 of Theorem 3.1 proof, taking now the simplified version of the definition of the set $D$ in which we take into account the regularity of functions $f$ and $g$ , the fact that $\lambda=1$ and we do not have end-point constraints:

[TABLE]

where now, we set

[TABLE]

The uniqueness of solutions to systems appearing in $\mathcal{P}_{S}(\omega)$ allows to conclude.

∎

Acknowledgements. The authors are thankful to Richard B. Vinter for his suggestion to study necessary conditions for average cost optimal control problems, and to the referees for their many helpful comments.

Bibliography22

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] J. Ackermann, Robust Control: the Parameter Space Approach , Springer Science & Business Media, 2012.
2[2] A. Agrachev, Y. Baryshnikov and A. Sarychev, Ensemble controllability by Lie algebraic methods, ESAIM: Control, Optimisation and Calculus of Variations , 22.4 (2016), 921–938.
3[3] R. B. Ash, Measure, Integration, and Functional Analysis , Academic Press, 2014.
4[4] J-P. Aubin and H. Frankowska, Set-Valued Analysis , Springer Science & Business Media, 2009.
5[5] V. I. Bogachev, Measure Theory , Springer Science & Business Media, 2007.
6[6] V. G. Boltyanskii and A. S. Poznyak, The Robust Maximum Principle: Theory and Applications , Birkhauser. New York, 2012.
7[7] J-B. Caillau, M. Cerf, A. Sassi, E. Trélat and H. Zidani, Solving chance constrained optimal control problems in aerospace via Kernel Density Estimation, Optimal Control Applications and Methods , 39.5 (2018), 1833–1858.
8[8] C. Castaing and M. Valadier, Convex Analysis and Measurable Multifunctions , Springer, 2006.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Necessary Optimality Conditions For Average Cost Minimization Problems

Abstract

1 Introduction

2 Average on measures with finite support

Proposition 2.1**.**

Proof.

3 Main results

Theorem 3.1**.**

Theorem 3.2** (Purely atomic case).**

Theorem 3.3** (Smooth case).**

4 Preliminary results in measure theory

Theorem 4.1**.**

Theorem 4.2** (Generalized Prokhorov Theorem).**

Lemma 4.3**.**

Proof.

5 Proofs of Theorem 3.1, Theorem 3.2 and Theorem 3.3

Lemma 5.1**.**

Proof.

Lemma 5.2**.**

Proof.

Proposition 2.1.

Theorem 3.1.

Theorem 3.2 (Purely atomic case).

Theorem 3.3 (Smooth case).

Theorem 4.1.

Theorem 4.2 (Generalized Prokhorov Theorem).

Lemma 4.3.

Lemma 5.1.

Lemma 5.2.