Singular Optimal Controls for Stochastic Recursive Systems under Convex Control Constraint

Liangquan Zhang

arXiv:1812.11655·math.OC·December 22, 2020

TL;DR

This paper develops second-order necessary conditions and a verification theorem for singular optimal controls in stochastic systems governed by FBSDEs, broadening the theoretical framework and linking maximum principle with dynamic programming.

Contribution

It introduces new second-order necessary conditions and a viscosity solution-based verification theorem for singular controls in stochastic systems, extending existing theories.

Findings

01

Derived pointwise second-order necessary conditions for stochastic SOCs.

02

Established a verification theorem for SOCs using viscosity solutions.

03

Connected maximum principle with dynamic programming without smoothness assumptions.

Equations

\left\{\begin{array}[]{lll}\mathrm{d}X^{t,x;v,\xi}&=&b\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s+\sigma\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}W\left(s\right)+G\left(s\right)\mathrm{d}\xi\left(s\right),\\ X^{t,x;v,\xi}\left(t\right)&=&x,\qquad 0\leq t\leq s\leq T,\end{array}\right.

J (t, x; v, ξ) = E [\int_{t}^{T} l (s, X^{t, x; v, ξ} (s), v (s)) d s + \int_{t}^{T} K (s) d ξ (s)],

l (\cdot, \cdot, \cdot)

K (\cdot)

J (v (\cdot)) = E \int_{0}^{T} l (x (t), v (t)) d t + E (h (T)),

\left\{\begin{array}[]{rcl}\text{d}x\left(t\right)&=&g\left(t,x\left(t\right),v\left(t\right)\right)\text{d}t+\sigma\left(t,x\left(t\right),v\left(t\right)\right)\text{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},\end{array}\right.

\left\{\begin{array}[]{rcl}\text{d}x\left(t\right)&=&f\left(t,x\left(t\right),v\left(t\right)\right)\text{d}t+\sigma\left(t,x\left(t\right),v\left(t\right)\right)\text{d}W\left(t\right),\\ \text{d}y\left(t\right)&=&g\left(t,x\left(t\right),v\left(t\right)\right)\text{d}t+z\left(t\right)\text{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},y\left(T\right)=y,\end{array}\right.

J (v (\cdot)) = E [\int_{0}^{T} l (t, x (t), y (t), v (t)) d t + h (x (T)) + γ (y (0))],

\left\{\begin{array}[]{rcl}\text{d}x\left(t\right)&=&f\left(t,x\left(t\right),v\left(t\right)\right)\text{d}t+\sigma\left(t,x\left(t\right)\right)\text{d}W\left(t\right),\\ \text{d}y\left(t\right)&=&g\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\text{d}t+z\left(t\right)\text{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},y\left(T\right)=h\left(x\left(T\right)\right).\end{array}\right.

\left\{\begin{array}[]{rcl}\text{d}x\left(t\right)&=&f\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\text{d}t+\sigma\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\text{d}W\left(t\right),\\ \text{d}y\left(t\right)&=&-g\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\text{d}t+z\left(t\right)\text{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},\quad\quad y\left(T\right)=\xi,\end{array}\right.

J (v (\cdot)) = E [\int_{0}^{T} L (t, x (t), y (t), z (t), v (t)) d t + Φ (x (T)) + h (y (0))] .

\left\{\begin{array}[]{rcl}\text{d}x_{t}&=&b\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\text{d}t+\sigma\left(t,x\left(t\right),y\left(t\right),z\left(t\right)\right)\text{d}B_{t},\\ \text{d}y_{t}&=&-f\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\text{d}t+z\left(t\right)\text{d}B_{t},\\ x\left(0\right)&=&x_{0},\quad y\left(T\right)=h\left(x\left(T\right)\right).\end{array}\right.

J (v (\cdot)) = E [\int_{0}^{T} l (t, x (t), y (t), z (t), v (t)) d t + Φ (x (T)) + γ (y (0))] .

\left\{\begin{array}[]{rcl}\mathrm{d}X^{t,x;v,\xi}\left(s\right)&=&b\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s+\sigma\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}W\left(s\right)+G\left(s\right)\mathrm{d}\xi\left(s\right),\\ \mathrm{d}Y^{t,x;v,\xi}\left(s\right)&=&-f\left(t,X^{t,x;v,\xi}\left(s\right),Y^{t,x;v,\xi}\left(s\right),Z^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s\\ &&+Z^{t,x;v,\xi}\left(s\right)\mathrm{d}W\left(s\right)-K\mathrm{d}\xi\left(s\right),\\ X^{t,x;v,\xi}\left(t\right)&=&x,\text{ }Y^{t,x;v,\xi}\left(T\right)=\Phi\left(X^{t,x;v,\xi}\left(T\right)\right),\qquad 0\leq t\leq s\leq T,\end{array}\right.

J (t, x; v, ξ) = Y^{t, x; v, ξ} (s) |_{s = t} .

S^{2} (0, T; R) ≜

M^{2} (0, T; R) ≜

∣ b (t, 0, x) ∣ + ∣ σ (t, 0, u) ∣

b_{(x, u)^{2}} (t, x_{1}, u_{1}) - b_{(x, u)^{2}} (t, x_{2}, u_{2})

σ_{(x, u)^{2}} (t, x_{1}, u_{1}) - σ_{(x, u)^{2}} (t, x_{2}, u_{2})

∣ f (t, x, y, z, u) ∣ \leq C (1 + ∣ x ∣ + ∣ y ∣ + ∣ z ∣),

∣ f_{x} (t, x, y, z, u) ∣ + ∣ f_{y} (t, x, y, z, u) ∣ + ∣ f_{z} (t, x, y, z, u) ∣ + ∣ f_{u} (t, x, y, z, u) ∣ \leq C,

\begin{array}[]{l}\left|f_{xx}\left(t,x,y,z,u\right)\right|+\left|f_{xu}\left(t,x,y,z,u\right)\right|+\left|f_{yu}\left(t,x,y,z,u\right)\right|\\ \qquad+\left|f_{yy}\left(t,x,y,z,u\right)\right|+\left|f_{zz}\left(t,x,y,z,u\right)\right|+\left|f_{zu}\left(t,x,y,z,u\right)\right|\\ \qquad+\left|f_{uu}\left(t,x,y,z,u\right)\right|\leq C,\end{array}

\begin{array}[]{l}\left|f_{\left(x,y,z,u\right)^{2}}\left(t,x_{1},y_{1},z_{1},u_{1}\right)-f_{\left(x,y,z,u\right)^{2}}\left(t,x_{2},y_{2},z_{2},u_{2}\right)\right|\\ \qquad\leq C\left(\left|x_{1}-x_{2}\right|+\left|y_{1}-y_{2}\right|+\left|z_{1}-z_{2}\right|+\left|u_{1}-u_{2}\right|\right),\end{array}

\begin{array}[]{l}\Phi\left(x\right)\leq C\left(1+\left|x\right|^{2}\right),\text{ }\Phi_{x}\left(x\right)\leq C\left(1+\left|x\right|\right),\\ \Phi_{xx}\left(x\right)\leq C,\text{ }\left|\Phi_{xx}\left(x_{1}\right)-\Phi_{xx}\left(x_{2}\right)\right|\leq C\left|x_{1}-x_{2}\right|.\end{array}

J (t, x; v (\cdot), ξ (\cdot)) = Y_{s}^{t, x; v, ξ} |_{s = t}, (t, x) \in [0, T] \times R^{n} .

u (t, x)

y^{i} (t) = ξ^{i} + \int_{t}^{T} f^{i} (s, y^{i} (s), z^{i} (s)) d s - \int_{t}^{T} z^{i} (s) d W (s),

E (\int_{t}^{T} ∣ f^{i} (s, y^{i} (s), z^{i} (s)) ∣ d s)^{β} < \infty;

E [\sup_{0 \leq t \leq T} ∣ y^{1} (t) - y^{2} (t) ∣^{β} + (\int_{0}^{T} ∣ z^{1} (s) - z^{2} (s) ∣^{2} d s)^{\frac{β}{2}}]

\mathbb{E}\left[\sup_{0\leq t\leq T}\left|y^{1}\left(t\right)\right|^{\beta}+\left(\int_{0}^{T}\left|z^{1}\left(s\right)\right|^{2}\mathrm{d}s\right)^{\frac{\beta}{2}}\right]\leq C_{\beta}\mathbb{E}\Bigg{[}\left|\xi^{1}\right|^{\beta}+\left(\int_{t}^{T}\left|f^{1}\left(s,0,0\right)\right|\mathrm{d}s\right)^{\beta}\Bigg{]}.

D_{θ} ξ = \sum_{j = 1}^{k} \frac{\partial φ}{\partial x_{j}} (W (h^{1}), W (h^{2}), \dots, W (h^{k})) h_{θ}^{j}, 0 \leq θ \leq T .

Taxonomy

Topics: Stochastic processes and financial applications · Markov Chains and Monte Carlo Methods · Risk and Portfolio Optimization

Full text

Singular Optimal Controls for Stochastic Recursive Systems under

Convex Control Constraint

Liangquan Zhang

School of Science,

Beijing University of Posts and Telecommunications,

Beijing 100876, China

L. Zhang acknowledges partial financial support from the National Natural Science Foundation of China (Grant No. 11701040, 61871058 & 11871010) and the Fundamental Research Funds for the Central Universities (No. 2019XD-A11). E-mail: [email protected].

Abstract

In this paper, we study two kinds of singular optimal control (SOC for short) problems in which the systems are governed by forward-backward stochastic differential equations (FBSDEs for short) and the control has two components: the regular control and the singular one. Both drift and diffusion terms may involve the regular control variable. The regular control domain is postulated to be convex. Under certain assumptions, in the framework of the Malliavin calculus, we derive the pointwise second-order necessary conditions for stochastic SOC in the classical sense. This condition is described by two adjoint processes and a maximum condition on the Hamiltonian, and it is supported by an illustrative example. A new necessary condition for optimal singular control is obtained as well. Besides, as a by-product, a verification theorem for SOCs is derived via viscosity solutions without involving any derivatives of the value functions. It is worth pointing out that this theorem has wider applicability than the restrictive classical verification theorems. Finally, we focus on the connection between the maximum principle and the dynamic programming principle for such SOC problems without the assumption that the value function is smooth enough.

AMS subject classifications: 93E20, 60H15, 60H30.

**Key words:** Dynamic programming principle (DPP for short), Forward-backward stochastic differential equations (FBSDEs for short), Malliavin calculus, Maximum principle (MP for short), Singular optimal controls, Viscosity solution, Verification theorem.

1 Introduction

The singular stochastic control problem is a fundamental topic in stochastic control. It was first introduced by Bather and Chernoff [11] in 1967, who considered a simplified model for the control of a spaceship. It was then found that there is a connection between singular control and optimal stopping problems. This link was established by relating the derivative of the value function of the initial singular control problem to the value function of the corresponding optimal stopping problem. Subsequently, it was considered by Beneš, Shepp and Witsenhausen (see [6]) and Karatzas and Shreve (see [53, 54, 55, 56, 57]).

The state process is described by an $n$-dimensional SDE of the following type:

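$$\left\{\begin{array}{rcl}\mathrm{d}X^{t,x;v,\xi}\left(s\right)&=&b\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s+\sigma\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}W\left(s\right)+G\left(s\right)\mathrm{d}\xi\left(s\right),\\ X^{t,x;v,\xi}\left(t\right)&=&x,\qquad 0\leq t\leq s\leq T,\end{array}\right.$$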

on some filtered probability space $\left(\Omega,\mathcal{F},P\right)$, where $b\left(\cdot,\cdot,\cdot\right):\left[0,T\right]\times\mathbb{R}^{n}\times\mathbb{R}^{k}\rightarrow\mathbb{R}^{n},$ $\sigma\left(\cdot,\cdot,\cdot\right):\left[0,T\right]\times\mathbb{R}^{n}\times\mathbb{R}^{k}\rightarrow\mathbb{R}^{n\times d},$ $G\left(\cdot\right):\left[0,T\right]\rightarrow\mathbb{R}^{n\times m}$ are given deterministic functions, $\left(W_{s}\right)_{s\geq 0}$ is a $d$-dimensional Brownian motion, $\left(t,x\right)$ are the initial time and state, $v\left(\cdot\right):\left[0,T\right]\rightarrow\mathbb{R}^{k}$ is a regular control process, and $\xi\left(\cdot\right):\left[0,T\right]\rightarrow\mathbb{R}^{m}$, nondecreasing and left-continuous with right limits, stands for the singular control (SC for short; so called because the measure $\mathrm{d}\xi_{s}$ may be singular with respect to the Lebesgue measure $\mathrm{d}s$). To avoid the risk of confusion, we shall also introduce other definitions of singular control in various senses; the shared name is merely a coincidence of terminology.

The aim is to minimize the cost functional:

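$$J\left(t,x;v,\xi\right)=\mathbb{E}\left[\int_{t}^{T}l\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s+\int_{t}^{T}K\left(s\right)\mathrm{d}\xi\left(s\right)\right],$$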

where

$l\left(\cdot,\cdot,\cdot\right)$ and $K\left(\cdot\right)$

are given deterministic functions; $l\left(\cdot\right)$ represents the running cost rate of the problem and $K$ the cost rate of applying the singular control.
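To see why $\xi$ generalizes an ordinary control, it may help to look at the special case in which $\xi$ is absolutely continuous. The sketch below assumes a hypothetical nonnegative rate process $\eta$, which is not part of the original formulation; under that assumption the singular terms are absorbed into the drift and the running cost:

```latex
% Assumption (illustration only): \xi has a density \eta \ge 0, i.e.
% \xi(s) = \int_t^s \eta(r)\,\mathrm{d}r, with X = X^{t,x;v,\xi}.
\begin{aligned}
\mathrm{d}X(s) &= \big[\, b(s,X(s),v(s)) + G(s)\,\eta(s) \,\big]\,\mathrm{d}s
                 + \sigma(s,X(s),v(s))\,\mathrm{d}W(s), \\
J(t,x;v,\xi)   &= \mathbb{E}\int_t^T \big[\, l(s,X(s),v(s)) + K(s)\,\eta(s) \,\big]\,\mathrm{d}s .
\end{aligned}
```

A genuinely singular $\xi$ (for instance one that increases only on a Lebesgue-null set of times, like a local time) admits no such density, which is why it has to be treated as a separate control component.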

We mention that there are four approaches to singular control. The first, based on partial differential equations (PDEs for short) and variational arguments, can be found in the works of Alvarez [1, 2], Chow, Menaldi, and Robin [24], Karatzas [54], Karatzas and Shreve [57], and Menaldi and Taksar [64]. The second is related to probabilistic methods; see Baldursson [7], Boetius [8, 9], Boetius and Kohlmann [10], El Karoui and Karatzas [31, 32], Karatzas [53], and Karatzas and Shreve [55, 56]. The third, the DPP, has been studied in a general context, for example, by Boetius [9], Haussmann and Suo [43], Fleming and Soner [33] and Zhang [93]. The fourth is the maximum principle for optimal singular controls (see, for example, Cadenillas and Haussmann [21], Dufour and Miller [28], Dahl and Øksendal [29], and the references therein).

Singular controls are used in diverse fields such as mathematical finance (see Baldursson and Karatzas [12], Chiarolla and Haussmann [22], Kobila [58], Karatzas and Wang [59], Davis and Norman [26] and Pagès and Possamaï [76]), manufacturing systems (see Shreve, Lehoczky, and Gaver [79]), and queuing systems (see Martins and Kushner [65]).

Completely different from the singular control introduced above, to the best of our knowledge, there are two other types of singular optimal controls, for which the first-order necessary conditions turn out to be trivial. We list them briefly as follows:

•

Singular optimal control in the classical sense (SOCCS for short) is an optimal control for which the gradient of the corresponding Hamiltonian with respect to the control variable vanishes and its Hessian degenerates.

•

Singular optimal control in the sense of the Pontryagin-type maximum principle (SOCSPMP for short) is an optimal control for which the corresponding Hamiltonian is equal to a constant on the control region.

When an optimal control is singular in one of the senses above (SOCCS or SOCSPMP), the first-order necessary condition usually does not carry sufficient information for further theoretical analysis and numerical computation, and consequently it is necessary to investigate second-order necessary conditions. In the deterministic setting, the reader may refer to the many articles in this direction (see [5, 34, 39, 40, 50, 51, 52] and the references therein).
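A standard deterministic toy problem, not taken from this paper, illustrates why the first-order condition can be uninformative. The symbols $x$, $u$, $p$ and $H$ below are introduced only for this illustration, with the convention $H=l+pf$ for a minimization problem:

```latex
% Illustrative problem (assumption, not from the paper):
% minimize \int_0^1 x(t)^2 dt  subject to  \dot{x}(t) = u(t),  x(0) = 0,  u(t) \in [-1,1].
\begin{aligned}
H(t,x,u,p) &= x^{2} + p\,u, \\
\dot{p}(t) &= -H_{x} = -2x(t), \qquad p(1) = 0, \\
\bar{u} \equiv 0 \ &\Longrightarrow\ \bar{x} \equiv 0 \ \Longrightarrow\ \bar{p} \equiv 0, \\
H(t,\bar{x}(t),u,\bar{p}(t)) &= 0 \quad \text{for every } u \in [-1,1], \\
H_{u} = \bar{p}(t) &= 0, \qquad H_{uu} = 0 .
\end{aligned}
```

Here $\bar{u}\equiv 0$ is optimal because the cost is nonnegative and vanishes along $\bar{x}\equiv 0$, yet every admissible $u$ satisfies the first-order condition along $(\bar{x},\bar{p})$. The control is singular in both senses, so only higher-order information can single it out.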

As for the second-order necessary conditions for stochastic singular optimal controls (SOCCS and SOCSPMP), some works should be mentioned, for instance [89, 90] (note that the singular control $\xi\left(\cdot\right)$ does not appear in the systems of these articles). Tang [81] obtained a pointwise second-order maximum principle for stochastic singular optimal controls in the sense of the Pontryagin-type maximum principle when the control variable $u$ does not enter the diffusion term. Tang also addressed an integral-type second-order necessary condition for stochastic optimal controls with convex control constraints. Zhang and Zhang [89] established certain pointwise second-order necessary conditions for stochastic singular optimal controls in the classical sense (SOCCS), in which both drift and diffusion terms may depend on the control variable $u$ with convex control region $U$, by making use of Malliavin calculus techniques. Later, adopting the same idea but with considerably more complicated analysis, Zhang et al. [90] extended this research to the general case in which the control region is nonconvex.

The theory of backward stochastic differential equations (BSDEs for short) can be traced back to Bismut [3, 4], who studied linear BSDEs motivated by stochastic control problems. Pardoux and Peng [74] (1990) proved the well-posedness of nonlinear BSDEs. Duffie and Epstein (1992) introduced the notion of recursive utilities in continuous time, which is actually a type of BSDE where the generator $f$ is independent of $z$. El Karoui et al. (1997, 2001) extended the recursive utility to the case where $f$ contains $z$. The term $z$ can be interpreted as an ambiguity aversion term in the market (see Chen and Epstein [25], 2002). In particular, the celebrated Black-Scholes formula provides an effective way of representing the option price (which is the solution to a kind of linear BSDE) through the solution to the Black-Scholes equation (a parabolic partial differential equation). Since then, BSDEs have been extensively studied and used in applied probability and optimal stochastic control, particularly in financial engineering (cf., for instance, [48]).
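As a concrete instance of the link between option pricing and linear BSDEs, consider the classical Black-Scholes market. The notation below (interest rate $r$, drift $\mu$, volatility $\sigma$, strike $\kappa$, amount $\pi$ invested in the stock) is assumed for this sketch and is not taken from the paper:

```latex
% Stock: dS = \mu S dt + \sigma S dW;  bond rate r;  \theta = (\mu - r)/\sigma.
% Y = value of a self-financing portfolio holding the amount \pi in the stock.
\begin{aligned}
\mathrm{d}Y(t) &= \big[\, r\,Y(t) + \pi(t)\,(\mu - r) \,\big]\,\mathrm{d}t + \pi(t)\,\sigma\,\mathrm{d}W(t) \\
               &= \big[\, r\,Y(t) + \theta\,Z(t) \,\big]\,\mathrm{d}t + Z(t)\,\mathrm{d}W(t),
                  \qquad Z(t) := \sigma\,\pi(t), \\
Y(T) &= \big(S(T) - \kappa\big)^{+} .
\end{aligned}
```

The replicating price of the call is the $Y$-component of this linear BSDE, whose generator is $f(y,z)=-(ry+\theta z)$ in the convention $\mathrm{d}Y=-f\,\mathrm{d}t+Z\,\mathrm{d}W$.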

By means of BSDE, Peng (1990) [72] considered the following type of stochastic optimal control problem: Minimize a cost function

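$$J\left(v\left(\cdot\right)\right)=\mathbb{E}\int_{0}^{T}l\left(x\left(t\right),v\left(t\right)\right)\mathrm{d}t+\mathbb{E}\left(h\left(x\left(T\right)\right)\right),$$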

subject to

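$$\left\{\begin{array}{rcl}\mathrm{d}x\left(t\right)&=&g\left(t,x\left(t\right),v\left(t\right)\right)\mathrm{d}t+\sigma\left(t,x\left(t\right),v\left(t\right)\right)\mathrm{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},\end{array}\right.$$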

over an admissible control domain which need not be convex, and the diffusion coefficient depends on the control variable. In his paper, by the spike variational method and second-order adjoint equations, Peng [72] obtained a general stochastic maximum principle for the above optimal control problem. It was precisely the adjoint equations in stochastic optimal control problems that motivated the famous theory of BSDEs (cf. [74]).

Later, Peng [73] first studied a stochastic optimal control problem where the state variables are described by the system of FBSDEs:

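$$\left\{\begin{array}{rcl}\mathrm{d}x\left(t\right)&=&f\left(t,x\left(t\right),v\left(t\right)\right)\mathrm{d}t+\sigma\left(t,x\left(t\right),v\left(t\right)\right)\mathrm{d}W\left(t\right),\\ \mathrm{d}y\left(t\right)&=&g\left(t,x\left(t\right),v\left(t\right)\right)\mathrm{d}t+z\left(t\right)\mathrm{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},\quad y\left(T\right)=y,\end{array}\right.$$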

where $x_{0}$ and $y$ are given deterministic constants. The optimal control problem is to minimize the cost function:

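$$J\left(v\left(\cdot\right)\right)=\mathbb{E}\left[\int_{0}^{T}l\left(t,x\left(t\right),y\left(t\right),v\left(t\right)\right)\mathrm{d}t+h\left(x\left(T\right)\right)+\gamma\left(y\left(0\right)\right)\right],$$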

over an admissible control domain which is convex. Later, Xu [86] studied the following non-fully coupled forward-backward stochastic control system:

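$$\left\{\begin{array}{rcl}\mathrm{d}x\left(t\right)&=&f\left(t,x\left(t\right),v\left(t\right)\right)\mathrm{d}t+\sigma\left(t,x\left(t\right)\right)\mathrm{d}W\left(t\right),\\ \mathrm{d}y\left(t\right)&=&g\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+z\left(t\right)\mathrm{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},\quad y\left(T\right)=h\left(x\left(T\right)\right).\end{array}\right.$$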

The optimal control problem is to minimize the cost function $J\left(v\left(\cdot\right)\right)=\mathbb{E}\gamma\left(y_{0}\right)$ over $\mathcal{U}_{ad}$, where the control domain is non-convex. Wu [84] first gave the maximum principle for the optimal control problem of a fully coupled forward-backward stochastic system:

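$$\left\{\begin{array}{rcl}\mathrm{d}x\left(t\right)&=&f\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+\sigma\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}W\left(t\right),\\ \mathrm{d}y\left(t\right)&=&-g\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+z\left(t\right)\mathrm{d}W\left(t\right),\\ x\left(0\right)&=&x_{0},\quad y\left(T\right)=\xi,\end{array}\right.$$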

where $\xi$ is a random variable, and the cost function is

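$$J\left(v\left(\cdot\right)\right)=\mathbb{E}\left[\int_{0}^{T}L\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+\Phi\left(x\left(T\right)\right)+h\left(y\left(0\right)\right)\right].$$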

The optimal control problem is to minimize the cost function $J\left(v\left(\cdot\right)\right)$ over an admissible control domain which is convex. Ji and Zhou [47] obtained a maximum principle for stochastic optimal control of a non-fully coupled forward-backward stochastic system with terminal state constraints. Shi and Wu [80] studied the maximum principle for the fully coupled forward-backward stochastic system:

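$$\left\{\begin{array}{rcl}\mathrm{d}x_{t}&=&b\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+\sigma\left(t,x\left(t\right),y\left(t\right),z\left(t\right)\right)\mathrm{d}B_{t},\\ \mathrm{d}y_{t}&=&-f\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+z\left(t\right)\mathrm{d}B_{t},\\ x\left(0\right)&=&x_{0},\quad y\left(T\right)=h\left(x\left(T\right)\right).\end{array}\right.$$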

and the cost function is

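$$J\left(v\left(\cdot\right)\right)=\mathbb{E}\left[\int_{0}^{T}l\left(t,x\left(t\right),y\left(t\right),z\left(t\right),v\left(t\right)\right)\mathrm{d}t+\Phi\left(x\left(T\right)\right)+\gamma\left(y\left(0\right)\right)\right].$$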

The control domain is non-convex but the forward diffusion does not contain the control variable.

Subsequently, in order to study the backward linear-quadratic optimal control problem, Kohlmann and Zhou [60] and Lim and Zhou [63] developed a new method for handling this problem. The term $z$ is regarded as a control process and the terminal condition $y_{T}=h\left(x_{T}\right)$ as a constraint, so that the Ekeland variational principle can be used to obtain the maximum principle. Adopting this idea, Yong [88] and Wu [85] independently established the maximum principle for the recursive stochastic optimal control problem (where the diffusion term contains the control variable and the control region is non-convex). Nonetheless, the maximum principle derived by this method involves two unknown parameters. Therefore, the following hard questions arise: What is the second-order variational equation for the BSDE? How can the second-order adjoint equation be obtained, given the quadratic form with respect to the variation of $z$? Both questions seem to be extremely complicated.

Hu [44] overcame the above difficulties by introducing two new adjoint equations, from which the second-order variational equation for the BSDE and the maximum principle are obtained. The main difference between his variational equations and those in Peng [72] is the term $\left\langle p\left(t\right),\delta\sigma\left(t\right)\right\rangle I_{E_{\varepsilon}}\left(t\right)$ in the variation of $z$. Owing to this term, Hu obtained a global maximum principle which is novel and different from those in Wu [85], Yong [88] and previous work, and which completely solves Peng’s open problem. Furthermore, Hu’s maximum principle is stronger than those in Wu [85] and Yong [88]. For the general case, the reader may refer to [45].

Motivated by the above work, in this paper we consider singular control problems of the following type:

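$$\left\{\begin{array}{rcl}\mathrm{d}X^{t,x;v,\xi}\left(s\right)&=&b\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s+\sigma\left(s,X^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}W\left(s\right)+G\left(s\right)\mathrm{d}\xi\left(s\right),\\ \mathrm{d}Y^{t,x;v,\xi}\left(s\right)&=&-f\left(s,X^{t,x;v,\xi}\left(s\right),Y^{t,x;v,\xi}\left(s\right),Z^{t,x;v,\xi}\left(s\right),v\left(s\right)\right)\mathrm{d}s\\ &&+Z^{t,x;v,\xi}\left(s\right)\mathrm{d}W\left(s\right)-K\mathrm{d}\xi\left(s\right),\\ X^{t,x;v,\xi}\left(t\right)&=&x,\quad Y^{t,x;v,\xi}\left(T\right)=\Phi\left(X^{t,x;v,\xi}\left(T\right)\right),\qquad 0\leq t\leq s\leq T,\end{array}\right.\tag{8}$$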

with the similar cost functional

$$J\left(t,x;v,\xi\right)=\left.Y^{t,x;v,\xi}\left(s\right)\right|_{s=t}.$$
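A short formal computation indicates how this recursive cost contains the classical one. The sketch integrates the backward equation of the controlled FBSDE from $t$ to $T$ and takes expectations, assuming that the stochastic integral is a true martingale and that $Y^{t,x;v,\xi}(t)$ is deterministic; the shorthand $(X,Y,Z)$ is introduced here only for readability:

```latex
% Shorthand (assumption for this sketch): (X,Y,Z) = (X^{t,x;v,\xi}, Y^{t,x;v,\xi}, Z^{t,x;v,\xi}).
\begin{aligned}
Y(t) &= \Phi(X(T)) + \int_t^T f\big(s,X(s),Y(s),Z(s),v(s)\big)\,\mathrm{d}s
        - \int_t^T Z(s)\,\mathrm{d}W(s) + \int_t^T K\,\mathrm{d}\xi(s), \\
J(t,x;v,\xi) &= \mathbb{E}\Big[\, \Phi(X(T)) + \int_t^T f\big(s,X(s),Y(s),Z(s),v(s)\big)\,\mathrm{d}s
        + \int_t^T K\,\mathrm{d}\xi(s) \Big].
\end{aligned}
% Special case f = l(s,x,v) (no dependence on y, z) and \Phi = 0:
% J(t,x;v,\xi) = E[ \int_t^T l(s,X(s),v(s)) ds + \int_t^T K d\xi(s) ],
% which has the form of the classical singular-control cost.
```

When the generator $f$ does depend on $(y,z)$, the right-hand side involves $Y$ and $Z$ themselves, so the cost is defined only implicitly through the BSDE; this is the recursive feature of the problem.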

Wang [83] first introduced and studied a class of singular control problems with recursive utility, where the cost function is determined by a BSDE. Under certain assumptions, the author proved that the value function is a nonnegative, convex solution of the H-J-B equation. However, the FBSDEs in Wang [83] do not contain the regular control, and the generator is not of general form. In our work, using some properties of BSDEs and analysis techniques, we extend the MP for SOC in Zhang and Zhang [89] to the recursive control problem. To the best of our knowledge, such singular optimal control problems for FBSDEs (8) involving two kinds of singular controls have not been explored before. We shall establish some pointwise second-order necessary conditions for stochastic optimal controls of FBSDEs. Both drift and diffusion terms may contain the control variable $u$, and we assume that the control region $U$ is convex. We consider the pointwise second-order necessary condition, which is easier to verify in practical applications.

As claimed in [89], in contrast to the deterministic setting, there are essential difficulties in deriving the pointwise second-order necessary condition from an integral-type one whenever the diffusion term depends on the control variable, even when the control domain is convex. We overcome these difficulties by means of techniques from the Malliavin calculus. The general case, in which the control region is non-convex, can be found in [90].

In this paper, we are interested in studying singular optimal controls for FBSDEs (8). Compared with the above literature, our paper has several new features. The novelty of the formulation and the contribution of this paper may be stated as follows:

•

Our control systems are governed by FBSDEs, which extends the work of Zhang and Zhang [89] to recursive utilities. To our knowledge, this is the first work to establish the pointwise second-order necessary condition for stochastic singular optimal control in the classical sense for FBSDEs; a new necessary condition for singular control is obtained as well. In this sense, our paper considers two kinds of singular control problems simultaneously.

•

We derive a new verification theorem for optimal singular controls via viscosity solutions, which responds to the question raised in Zhang [93]. Meanwhile, we study the relationship between the adjoint equations and the value function, which extends the smooth case considered by Cadenillas and Haussmann [21] to the framework of viscosity solutions for stochastic recursive systems.

The rest of this paper is organized as follows: after some preliminaries in Section 2, we devote Section 3 to the MP for two kinds of singular optimal controls, concluding with a concrete example. Then, in Section 4, we study the verification theorem for singular optimal controls via viscosity solutions. Finally, we establish the relationship between the DPP and the MP for viscosity solutions. Some proofs of lemmas are given in Appendix 5.

2 Preliminaries and Notations

Throughout this paper, we denote by $\mathbb{R}^{n}$ the $n$-dimensional Euclidean space and by $\mathbb{R}^{n\times d}$ the space of matrices of order $n\times d$. Let $(\Omega,\mathcal{F},\{\mathcal{F}_{t}\}_{t\geq 0},P)$ be a complete filtered probability space on which a one-dimensional standard Brownian motion $W(\cdot)$ is defined, with $\{\mathcal{F}_{t}\}_{t\geq 0}$ being its natural filtration, augmented by all the $P$-null sets. Given a subset $U$ (nonempty, bounded, and convex) of $\mathbb{R}^{k}$, we denote by $\mathcal{U}\left[0,T\right]=\mathcal{U}_{1}\times\mathcal{U}_{2}$ the class of measurable, adapted processes $\left(v,\xi\right):\left[0,T\right]\times\Omega\rightarrow U\times\left[0,\infty\right)^{m}$, with $\xi$ nondecreasing, left-continuous with right limits and $\xi_{0}=0$, such that $\mathbb{E}\left[\sup\limits_{0\leq t\leq T}\left|v\left(t\right)\right|^{2}+\left|\xi\left(T\right)\right|^{2}\right]<\infty.$ The process $\xi$ is called the singular control. For each $t>0$, we denote by $\left\{\mathcal{F}_{s}^{t},t\leq s\leq T\right\}$ the natural filtration of the Brownian motion $\left\{W\left(s\right)-W\left(t\right)\right\}_{t\leq s\leq T}$, augmented by the $P$-null sets of $\mathcal{F}$. The superscript $\top$ denotes the transpose of a matrix. In what follows, $C$ represents a generic constant, which can be different from line to line.

We now introduce the following spaces of processes:

[TABLE]

and denote $\mathcal{N}^{2}\left[0,T\right]=\mathcal{S}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{S}^{2}(0,T;\mathbb{R})\times\mathcal{M}^{2}(0,T;\mathbb{R}^{n}).$ Clearly, $\mathcal{N}^{2}\left[0,T\right]$ forms a Banach space.

For any $v\left(\cdot\right)\times\xi\left(\cdot\right)\in\mathcal{U}_{1}\times\mathcal{U}_{2},$ we study the stochastic control systems governed by FBSDEs (8).

We assume that the following conditions hold:

(A1)

The coefficients $b:[0,T]\times\mathbb{R}^{n}\times\mathbb{R}^{k}\rightarrow\mathbb{R}^{n},$ $\sigma:[0,T]\times\mathbb{R}^{n}\times\mathbb{R}^{k}\rightarrow\mathbb{R}^{n}$ are twice continuously differentiable with respect to $x;$ $b,$ $b_{x},$ $b_{xx},$ $\sigma,$ $\sigma_{x},$ $\sigma_{xx}$ are continuous in $\left(x,u\right);$ $b_{x},$ $b_{xx},$ $\sigma_{x},$ $\sigma_{xx}$ are bounded; $b$, $\sigma$ are bounded by $C\left(1+\left|x\right|+\left|u\right|\right)$ for some positive constant $C.$ Moreover, for any $\left(t,x_{1},u_{1}\right),$ $\left(t,x_{2},u_{2}\right)\in\left[0,T\right]\times\mathbb{R}^{n}\times\mathbb{R}^{k},$

[TABLE]

(A2)

The coefficients $f:[0,T]\times\mathbb{R}^{n}\times\mathbb{R}\times\mathbb{R}\times\mathbb{R}^{k}\rightarrow\mathbb{R},$ $\Phi:\mathbb{R}^{n}\rightarrow\mathbb{R}$ are twice continuously differentiable with respect to $\left(x,y,z\right).$ $K$ is a given deterministic matrix. $f,$ $Df,$ $D^{2}f$ are continuous in $\left(x,y,z,u\right)$. There exists a constant $C>0$ such that for any $\left(t,x_{1},y_{1},z_{1},u_{1}\right),$ $\left(t,x_{2},y_{2},z_{2},u_{2}\right)\in\left[0,T\right]\times\mathbb{R}^{n}\times\mathbb{R}\times\mathbb{R}\times\mathbb{R}^{k},$

[TABLE]

and

[TABLE]

Under the above assumptions (A1)-(A2), for any $v\left(\cdot\right)\times\xi\left(\cdot\right)\in\mathcal{U}_{1}\times\mathcal{U}_{2}$, it is easy to check that FBSDEs (8) admit a unique $\mathcal{F}_{t}$-adapted solution denoted by the triple $(X_{\cdot}^{t,x;v,\xi},Y_{\cdot}^{t,x;v,\xi},Z_{\cdot}^{t,x;v,\xi})\in\mathcal{N}^{2}\left[0,T\right]$ (see Pardoux and Peng [74]).

Like Peng [75], given any control processes $v\left(\cdot\right)\times\xi\left(\cdot\right)\in\mathcal{U}_{1}\times\mathcal{U}_{2}$ , we introduce the following cost functional:

[TABLE]

We are interested in the value function of the stochastic optimal control problem:

[TABLE]

Since the value function (11) is defined through the solution of the controlled FBSDEs (8), the existence and uniqueness of that solution imply that $u$ is well-defined.

The following estimate, whose proof can be found in Briand et al. 2003 [19], is very useful.

Lemma 1.

Let $\left(y^{i},z^{i}\right),$ $i=1,2,$ be the solution to the following

[TABLE]

where $\xi^{i}\in\mathcal{F}_{T}$ and $\mathbb{E}\left[\left|\xi^{i}\right|^{\beta}\right]<\infty,$ whilst $f^{i}\left(s,y^{i},z^{i}\right)$ satisfies the conditions (A2), and

[TABLE]

Then, for some $\beta\geq 2,$ there exists a positive constant $C_{\beta}$ such that

[TABLE]

In particular, putting $\xi^{2}=0$ and $f^{2}=0,$ one has

[TABLE]

Now let us recall briefly the notion of differentiation on Wiener space (see the expository papers by Nualart 1995 [66], Nualart and Pardoux [67] and Ocone 1988 [69]).

•

$C_{b}^{k}\left(\mathbb{R}^{k},\mathbb{R}^{q}\right)$ will denote the set of functions of class $C^{k}$ from $\mathbb{R}^{k}$ into $\mathbb{R}^{q}$ whose partial derivatives of order less than or equal to $k$ are bounded.

•

Let $\mathcal{S}$ denote the set of random variables $\xi$ of the form $\xi=\varphi(W\left(h^{1}\right),W\left(h^{2}\right),\cdots,W\left(h^{k}\right)),$ where $\varphi\in C_{b}^{\infty}\left(\mathbb{R}^{k},\mathbb{R}\right)$ , $h^{1},h^{2},\cdots h^{k}\in L^{2}\left(\left[0,T\right];\mathbb{R}^{n}\right),$ and $W\left(h^{i}\right)=\int_{0}^{T}\left\langle h_{s}^{i},\mathrm{d}W\left(s\right)\right\rangle$ .

•

If $\xi\in\mathcal{S}$ is of the above form, we define its derivative as being the $n$ -dimensional process

[TABLE]

For $\xi\in\mathcal{S},$ $p>1,$ we define the norm

[TABLE]

It can be shown (Nualart 1995) that the operator $\mathcal{D}$ has a closed extension to the space $\mathbb{D}^{1,p}$, the closure of $\mathcal{S}$ with respect to the norm $\left\|\cdot\right\|_{1,p}$. Observe that if $\xi$ is $\mathcal{F}_{t}$-measurable, then $\mathcal{D}_{\theta}\xi=0$ for $\theta\in\left(t,T\right]$. We denote by $\mathcal{D}_{\theta}^{i}\xi$ the $i$th component of $\mathcal{D}_{\theta}\xi$.
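As a small illustration of the derivative operator just recalled, the following sketch treats a one-dimensional smooth functional. It assumes the standard definition $\mathcal{D}_{\theta}\xi=\sum_{i}\partial_{i}\varphi\left(W(h^{1}),\ldots,W(h^{k})\right)h_{\theta}^{i}$ for $\xi\in\mathcal{S}$; the choices $\varphi=\sin$ and $h^{1}=I_{[0,t]}$ are illustrative and not taken from the paper.

```latex
% Illustrative choice: k = 1, \varphi = \sin \in C_b^\infty(\mathbb{R},\mathbb{R}),
% h^1 = I_{[0,t]} for a fixed t \in (0,T], so that W(h^1) = W(t).
\[
\xi=\sin\bigl(W(t)\bigr),\qquad
\mathcal{D}_{\theta}\xi=\cos\bigl(W(t)\bigr)\,I_{[0,t]}(\theta),\qquad \theta\in[0,T].
\]
% Consistency check with the observation on F_t-measurable random variables:
% \xi is F_t-measurable and indeed D_\theta \xi = 0 for \theta \in (t,T].
\[
\int_{0}^{T}\bigl|\mathcal{D}_{\theta}\xi\bigr|^{2}\,\mathrm{d}\theta
 = t\,\cos^{2}\bigl(W(t)\bigr)\le t .
\]
```

In particular, the $L^{2}\left(\left[0,T\right]\right)$-norm of $\theta\mapsto\mathcal{D}_{\theta}\xi$ is bounded here, so this $\xi$ has finite $\left\|\cdot\right\|_{1,p}$-norm for every $p>1$ under the usual form of that norm.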

Let $\mathbb{L}^{1,p}\left(\mathbb{R}^{d}\right)$ denote the set of $\mathbb{R}^{d}$ -valued progressively measurable processes $\{u\left(t,\omega\right),0\leq t\leq T;\omega\in\Omega\}$ such that

•

For a.e. $t\in\left[0,T\right],$ $u\left(t,\cdot\right)\in\mathbb{D}^{1,p}\left(\mathbb{R}^{n}\right);$

•

$\left(t,\omega\right)\rightarrow\mathcal{D}u\left(t,\omega\right)\in\left(L^{2}\left(\left[0,T\right]\right)\right)^{n\times d}$ admits a progressively measurable version;

•

We have

[TABLE]

Note that for each $\left(\theta,t,\omega\right),$ $\mathcal{D}_{\theta}u\left(t,\omega\right)$ is an $n\times d$ matrix. Hence, $\left|\mathcal{D}_{\theta}u\left(t\right)\right|^{2}=\sum_{i,j}\left|\mathcal{D}_{\theta}^{i}u_{j}\left(t\right)\right|^{2}.$ Obviously, $\mathcal{D}_{\theta}u\left(t,\omega\right)$ is defined uniquely up to sets of $\mathrm{d}\theta\otimes\mathrm{d}t\otimes\mathrm{d}P$ measure zero. Moreover, denote by $\mathbb{L}_{\mathbb{F}}^{1,p}\left(\mathbb{R}^{d}\right)$ the set of all adapted processes in $\mathbb{L}^{1,p}\left(\mathbb{R}^{d}\right).$

We define the following notations from Zhang and Zhang [89]:

[TABLE]

and

[TABLE]

Denote $\mathbb{L}_{2}^{1,p}\left(\mathbb{R}^{d}\right)=\mathbb{L}_{2+}^{1,p}\left(\mathbb{R}^{d}\right)\cap\mathbb{L}_{2-}^{1,p}\left(\mathbb{R}^{d}\right).$ For any $\varphi\left(\cdot\right)\in\mathbb{L}_{2}^{1,p}\left(\mathbb{R}^{d}\right),$ denote $\nabla\varphi\left(\cdot\right)=\mathcal{D}^{+}\varphi\left(\cdot\right)+\mathcal{D}^{-}\varphi\left(\cdot\right).$ Whenever $\varphi$ is adapted, it follows that $\mathcal{D}_{s}\varphi\left(t\right)=0$ for $t<s.$ Furthermore, $\nabla\varphi\left(\cdot\right)=\mathcal{D}^{+}\varphi\left(\cdot\right)$ since $\mathcal{D}^{-}\varphi\left(\cdot\right)=0.$ Put $\mathbb{L}_{2,\mathbb{F}}^{1,p}\left(\mathbb{R}^{d}\right)$ as the set of all adapted processes in $\mathbb{L}_{2}^{1,p}\left(\mathbb{R}^{d}\right).$

3 Maximum Principle of Singular Optimal Controls

This section studies the two components of the optimal control separately. Due to the special structure of the control system, we shall first consider the singular control part, deriving the necessary condition, and subsequently the regular part. The initial condition will be fixed to be $\left(0,x\right),$ $x\in\mathbb{R}^{n}.$ To begin with, let us suppose that $\left(\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)\in\mathcal{U}_{1}\times\mathcal{U}_{2}$ is an optimal control and denote by $\left(X^{0,x;\bar{u},\bar{\eta}}\left(\cdot\right),Y^{0,x;\bar{u},\bar{\eta}}\left(\cdot\right),Z^{0,x;\bar{u},\bar{\eta}}\left(\cdot\right)\right)$ the optimal solution of (8). Our maximum principle will be proved in two steps. The first variational inequality is derived from the fact

[TABLE]

where $u^{\varepsilon}\left(\cdot\right)$ is a convex perturbation of the optimal control. The second variational inequality is obtained from the inequality

[TABLE]

where $\xi^{\varepsilon}\left(\cdot\right)$ is a convex perturbation of $\xi.$

3.1 Optimal Singular Control

For $l=b\left(\cdot\right),$ $\sigma\left(\cdot\right),$ $f\left(\cdot\right),$ we denote

[TABLE]

Let us introduce the following

Proposition 2.

Let (A1)-(A2) hold, and let $\left(X^{0,x;\bar{u},\bar{\eta}}\left(\cdot\right),Y^{0,x;\bar{u},\bar{\eta}}\left(\cdot\right),Z^{0,x;\bar{u},\bar{\eta}}\left(\cdot\right)\right)\in\mathcal{N}^{2}(0,T;\mathbb{R}^{n})$ be an optimal solution. Then, the following FBSDEs:

[TABLE]

admit an adapted solution $\left(\mathfrak{p}\left(\cdot\right),\mathfrak{q}\left(\cdot\right),\mathfrak{k}\left(\cdot\right)\right)\in\mathcal{N}^{2}(0,T;\mathbb{R}^{n}).$

Theorem 3.

Let (A1)-(A2) hold. If $\left(X^{\bar{u},\bar{\eta}}\left(\cdot\right),Y^{\bar{u},\bar{\eta}}\left(\cdot\right),Z^{\bar{u},\bar{\eta}}\left(\cdot\right),\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ is an optimal solution of (8), then there exists a unique pair of adapted processes $\left(\mathfrak{p}\left(\cdot\right),\mathfrak{q}\left(\cdot\right)\right)$ satisfying (15) such that

[TABLE]

and

[TABLE]

Before the proof, we need some lemmas. At the beginning, we introduce the convex perturbation

[TABLE]

where $\alpha\in\left[0,1\right]$ and $\xi\left(\cdot\right)$ is an arbitrary element of $\mathcal{U}_{2}.$ We now introduce the following variational equations of (8):

[TABLE]

From (A1)-(A2) it is easy to check that (18) has a unique strong solution. Moreover, we have

Lemma 4.

Under the Assumptions (A1)-(A2), we have

[TABLE]

The proof can be found in the Appendix.

Proof of Theorem 3.

Applying Itô’s formula to $\left\langle\mathfrak{p}\left(\cdot\right),x^{1}\left(\cdot\right)\right\rangle+\mathfrak{q}\left(\cdot\right)y^{1}\left(\cdot\right)$ on $\left[0,T\right]$ yields

[TABLE]

In particular, let $\xi\in\mathcal{U}_{2}$ be a process satisfying $P\left\{\sum_{i}\int_{0}^{T}G\left(s\right)\mathrm{d}\xi_{s}^{\left(i\right)}<\infty\right\}$ and such that (22) and

[TABLE]

holds where $\xi_{s}^{\left(i\right)}$ denotes the $i$ th component. Then,

[TABLE]

Thus

[TABLE]

which proves (17). Next we show that (16) is valid. For that, let us define the events:

[TABLE]

where $t\in\left[0,T\right],$ $1\leq i\leq m.$

Define the stochastic process $\breve{\xi}^{\left(i\right)}:\left[0,T\right]\times\Omega\rightarrow\left[0,\infty\right)$ by

[TABLE]

Then one can easily check that $\breve{\xi}=\left(\breve{\xi}^{\left(1\right)},\breve{\xi}^{\left(2\right)},\ldots,\breve{\xi}^{\left(m\right)}\right)$ is a measurable, adapted process which is nondecreasing left-continuous with right limits and $\breve{\xi}\left(0\right)=0$ , and which satisfies

[TABLE]

Further, we have

[TABLE]

which obviously contradicts (22), unless for any $i$ we have $\left(Leb\otimes P\right)\left\{\mathcal{A}^{\left(i\right)}\right\}=0.$ This completes the proof. $\Box$

Remark 5.

One can easily check that

[TABLE]

which implies that $\mathfrak{q}\left(r\right)>0,$ $r\in\left[0,T\right]$ , $P$ -a.s. So $-\mathfrak{p}\left(\cdot\right)/\mathfrak{q}\left(\cdot\right)$ makes sense. Clearly, our Theorem 3 for optimal singular control is completely different from [13]. Ours contains two variables $\left(\mathfrak{p}\left(\cdot\right),\mathfrak{q}\left(\cdot\right)\right)$ . As a matter of fact, we have

[TABLE]

We claim that $-\mathfrak{p}\left(\cdot\right)/\mathfrak{q}\left(\cdot\right)$ is the partial derivative of value function, which will be studied in Section 4.3.

3.2 Optimal Regular Control

In this subsection, we study the optimal regular controls for systems driven by FBSDEs (8) and derive necessary conditions of Pontryagin type, namely, a maximum principle for optimal control. To this end, we fix $\bar{\xi}\in\mathcal{U}_{2}$ and introduce the following convex perturbation of the control. Taking $u\left(\cdot\right)\in\mathcal{U}_{1},$ we define $v\left(\cdot\right)=u\left(\cdot\right)-\bar{u}\left(\cdot\right),$ $u^{\varepsilon}\left(\cdot\right)=\bar{u}\left(\cdot\right)+\varepsilon v\left(\cdot\right),$ where $\varepsilon>0$ is sufficiently small. Since $U$ is convex, $u^{\varepsilon}\left(\cdot\right)\in\mathcal{U}\left(0,T\right).$ Let $\left(x^{\varepsilon},y^{\varepsilon},z^{\varepsilon},u^{\varepsilon}\right)$ be the trajectory of the control system (8) corresponding to the control $u^{\varepsilon}.$ Put $\delta x\left(\cdot\right)=x^{\varepsilon}\left(\cdot\right)-\bar{x}\left(\cdot\right).$ When $l=b,$ $\sigma$ and $\Phi,$ we denote

[TABLE]
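The admissibility of the convex perturbation $u^{\varepsilon}$ introduced above can be checked in one line. The sketch below assumes $\varepsilon\in\left(0,1\right]$ (which is what "sufficiently small" requires here) and uses only the convexity of $U$; the numerical instance with $U=\left[-1,1\right]$ is purely illustrative.

```latex
% For every (s,\omega), both \bar{u}(s) and u(s) lie in the convex set U, hence
\[
u^{\varepsilon}(s)=\bar{u}(s)+\varepsilon\bigl(u(s)-\bar{u}(s)\bigr)
 =(1-\varepsilon)\,\bar{u}(s)+\varepsilon\,u(s)\in U,
 \qquad \varepsilon\in(0,1].
\]
% Measurability and adaptedness are inherited from \bar{u} and u, and
\[
\bigl|u^{\varepsilon}(s)\bigr|^{2}
 \le(1-\varepsilon)\bigl|\bar{u}(s)\bigr|^{2}+\varepsilon\bigl|u(s)\bigr|^{2}
\]
% by convexity of |\cdot|^2, so the integrability condition is preserved.
% Illustration (hypothetical): U = [-1,1], \bar{u} \equiv 0, u \equiv 1
% gives u^\varepsilon \equiv \varepsilon \in U.
```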

Let us introduce the following two kinds of variational equations, mainly taken from [17]. For simplicity, we omit the superscript.

[TABLE]

and

[TABLE]

From Lemma 3.5 and Lemma 3.11 in [17], we have the following result.

Lemma 6.

Assume that (A1)-(A2) are in force. Then, we have, for any $\beta\geq 2,$

[TABLE]

where

[TABLE]

We shall introduce the so-called variational equations for FBSDEs (8), beginning with the following two adjoint equations:

[TABLE]

and

[TABLE]

where $\Gamma\left(\cdot\right),$ $\Pi\left(\cdot\right)$ are two unknown processes to be determined. Next we will derive two kinds of adjoint equations. The main idea is borrowed from [44]. First of all, we observe that

[TABLE]

which inspires us to use the adjoint equations to expand the following:

[TABLE]

Itô’s formula applied to (27) yields for $t\in\left[0,T\right],$

[TABLE]

where

[TABLE]

Remark 7.

Note that $\Gamma\left(t\right)$ and $\Pi\left(t\right)$ do not appear in the $\mathrm{d}W\left(s\right)$ -term.

Define

[TABLE]

Let

[TABLE]

Clearly, from Lemma 6, we have

[TABLE]

After some tedious computations, we have

[TABLE]

Put

[TABLE]

then we attain

[TABLE]

where

[TABLE]

Next we are going to seek $\Gamma\left(\cdot\right),$ $\Pi\left(\cdot\right),$ determined by the optimal quadruple $(\bar{x}\left(\cdot\right),\bar{y}\left(\cdot\right),\bar{z}\left(\cdot\right),\bar{u}\left(\cdot\right)),$ such that

[TABLE]

where

[TABLE]

in which $o\left(\varepsilon^{2}\right)$ does not involve the terms $x_{1}\left(\cdot\right)$ and $x_{2}\left(\cdot\right).$ Note that in BSDE (30), there appears the term $x_{1}^{\top}\left(s\right)\Lambda_{3}\left(s\right)x_{1}\left(s\right).$ Hence, we make use of Taylor’s expansion to

[TABLE]

where the Hessian matrix $\mathbf{H}_{1}$ is with respect to $\left(x,y,z\right).$

Then, we obtain

[TABLE]

where $I_{n\times n}$ denotes the identity matrix.

In the classical theory of optimal control for FBSDEs (cf. [84, 85]), there generally appear two groups of first-order adjoint equations, for instance $\left(\mathfrak{p}\left(\cdot\right),\mathfrak{q}\left(\cdot\right)\right)$ in Eqs. (15). The following proposition establishes the relationship between them and $p\left(\cdot\right)$ from (25), which is very useful for studying the connection between the maximum principle and dynamic programming (see Theorem 40 below).

Proposition 8.

Suppose that Assumptions (A1)-(A2) are in force. Then we have

[TABLE]

where $p\left(\cdot\right)$ and $\left(\mathfrak{p}\left(\cdot\right),\mathfrak{q}\left(\cdot\right)\right)$ are solutions to FBSDEs (25) and (15), respectively.

The proof follows by applying Itô’s formula to $-\mathfrak{p}^{T}\left(s\right)/\mathfrak{q}\left(s\right),$ so we omit it.
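For readers who wish to carry out the omitted computation, the following sketch records Itô’s formula for a quotient of two scalar Itô processes. The coefficient names $\mu_{p},\nu_{p},\mu_{q},\nu_{q}$ are placeholders introduced here for illustration; they stand for whatever drift and diffusion coefficients the equations for $\mathfrak{p}$ and $\mathfrak{q}$ in (15) provide, and the vector case is obtained componentwise.

```latex
% Generic scalar Ito processes with q > 0 (placeholder coefficients):
%   dp = \mu_p ds + \nu_p dW,   dq = \mu_q ds + \nu_q dW.
\[
\mathrm{d}\Bigl(\frac{p}{q}\Bigr)
 =\Bigl[\frac{\mu_{p}}{q}-\frac{p\,\mu_{q}}{q^{2}}
        -\frac{\nu_{p}\nu_{q}}{q^{2}}+\frac{p\,\nu_{q}^{2}}{q^{3}}\Bigr]\mathrm{d}s
 +\Bigl[\frac{\nu_{p}}{q}-\frac{p\,\nu_{q}}{q^{2}}\Bigr]\mathrm{d}W(s).
\]
% Derivation: d(1/q) = -q^{-2} dq + q^{-3} \nu_q^2 ds, then the product rule
%   d(p \cdot q^{-1}) = q^{-1} dp + p\, d(q^{-1}) + d<p, q^{-1}>,
% with d<p, q^{-1}> = -\nu_p \nu_q q^{-2} ds.
% Sanity check: for p = q both brackets vanish, as they must since p/q = 1.
```

Applying this with a minus sign gives the dynamics of $-\mathfrak{p}/\mathfrak{q}$, which can then be compared with the equation satisfied by $p\left(\cdot\right)$.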

We define the classical Hamiltonian function:

[TABLE]

where $\left(t,x,y,z,u,p,q\right)\in\left[0,T\right]\times\mathbb{R}^{n}\times\mathbb{R}\times\mathbb{R}\times\mathbb{R}^{k}\times\mathbb{R}^{n}\times\mathbb{R}^{n}.$

Then, we have

[TABLE]

Namely,

[TABLE]

Remark 9.

Note that FBSDEs (33) are somewhat different from (22) in Hu [44]. Specifically, the term $A_{4}x_{1}\left(s\right)I_{E_{\varepsilon}}\left(s\right)$ disappears in (22) since

[TABLE]

in [44] by using the spike variational approach. Nevertheless, the corresponding term in our paper is just $\varepsilon^{2}x_{1}^{\top}\left(s\right)\Lambda_{2}\left(s\right)$. We will see shortly that this term is needed to define an extended “Hamiltonian function” as follows.

Define

[TABLE]

where

[TABLE]

We now give the adjoint equation for BSDE (33) as follows:

[TABLE]

Lemma 10.

Under the Assumptions (A1)-(A2), SDE (34) admits a unique adapted strong solution $\chi\left(t\right)\in\mathcal{S}^{2}(0,T;\mathbb{R}).$ Moreover, we have

[TABLE]

Proof.

The first inequality can be obtained from Theorem 6.16 of [87]. We deal with the second one. By Itô’s formula, we have

[TABLE]

It follows that

[TABLE]

But by the B-D-G inequality, we get

[TABLE]

Thus

[TABLE]

The second estimation comes from the Hölder inequality. We complete the proof. $\Box$

Set

[TABLE]

We are able to give the variational equations as follows:

[TABLE]

and

[TABLE]

Obviously, we have

[TABLE]

Lemma 11.

Under the Assumptions (A1)-(A2), we have the following estimate

[TABLE]

Proof.

To prove (37), we consider (33) again. From assumptions (A1)-(A2), one can check that the adjoint equations (25) and (26) have a unique adapted strong solution, respectively. Furthermore, by classical approach, we are able to get the following estimates for $\beta\geq 2,$

[TABLE]

Applying Lemma 1 to (33), we get the desired result. Indeed, since there appears a term

[TABLE]

in BSDE (33), we have the estimate with $O\left(\varepsilon^{2}\right).$ This completes the proof. $\Box$

We shall derive a variational inequality which is crucial for establishing the necessary condition for optimal control. Before this, we introduce another type of singular control, defined via the Hamiltonian function:

Definition 12 (Singular control in the classical sense).

We call a control $\breve{u}\left(\cdot\right)\in\mathcal{U}\left(0,T\right)$ a singular control in the classical sense if $\breve{u}\left(\cdot\right)$ satisfies

[TABLE]

where $\left(\breve{x}\left(\cdot\right),\breve{y}\left(\cdot\right),\breve{z}\left(\cdot\right)\right)$ denotes the state trajectories driven by $\breve{u}\left(\cdot\right).$ Moreover, $\left(\breve{p}\left(\cdot\right),\breve{q}\left(\cdot\right)\right)$ and $\left(\breve{P}\left(\cdot\right),\breve{Q}\left(\cdot\right)\right)$ denote the adjoint processes given respectively by (25) and (26) with $\left(\bar{x}\left(t\right),\bar{y}\left(t\right),\bar{z}\left(t\right),\bar{u}\left(t\right)\right)$ replaced by $(\breve{x}\left(\cdot\right),\breve{y}\left(\cdot\right),\breve{z}\left(\cdot\right),\breve{u}\left(\cdot\right))$. If this $\breve{u}\left(\cdot\right)$ is also optimal, then we call it a singular optimal control in the classical sense.

Remark 13.

Hu [44] first considers the forward-backward stochastic control problem in which the diffusion term $\sigma\left(t,x,u\right)$ depends on the control variable $u$ and the control domain is non-convex. In order to establish the stochastic maximum principle, he introduces the $\mathcal{H}$-function of the following type:

[TABLE]

Note that this Hamiltonian function is slightly different from that of Peng 1990 [72]. The main difference between these variational equations and those in Peng 1990 [72] appears in the term $p\left(t\right)\delta\sigma\left(t\right)I_{E_{\varepsilon}}\left(t\right)$ (the analogous term in our paper is $\varepsilon p\left(t\right)\sigma_{u}\left(t\right)v\left(t\right)+\frac{\varepsilon^{2}}{2}p\left(t\right)\sigma_{uu}\left(t\right)v^{2}\left(t\right)$) in the variational equation for the BSDE, and in the maximum principle through the definition of $p\left(t\right)$ in the variation of $z$, which is $O(\varepsilon)$ for any order of expansion of $f$. So it is not helpful to use the second-order Taylor expansion for treating this term. The stochastic maximum principle (see [44]) says that if $\left(\breve{x}\left(t\right),\breve{y}\left(t\right),\breve{z}\left(t\right),\breve{u}\left(t\right)\right)$ is an optimal pair, then

[TABLE]

Apparently, Definition 12 says that a singular control in the classical sense is the real one that fulfils trivially the first and second-order necessary conditions in classical optimization theory dealing with the maximization problem (40), namely,

[TABLE]

It is easy to verify that (39) is equivalent to (41). Certainly, one could investigate stochastic singular optimal controls for forward-backward stochastic systems in other senses, say, in the sense of processes in Skorohod space, which can be found in Zhang [93] via the viscosity solution approach (Hamilton-Jacobi-Bellman inequality), or in the sense of a Pontryagin-type maximum principle (cf. Tang [81]). As a complete treatment of these various topics would be much longer than the present paper, it will be reported elsewhere.

Lemma 14 (Variational inequality).

Under the Assumptions (A1)-(A2), it holds that

[TABLE]

where

[TABLE]

Proof.

Applying Itô’s formula to $\left\langle\chi\left(s\right),\hat{y}^{\varepsilon}\left(s\right)\right\rangle$ on $\left[0,T\right],$ we get the desired result. $\Box$

Theorem 15.

Assume that (A1)-(A2) hold. If $\bar{u}\left(\cdot\right)\in\mathcal{U}\left(0,T\right)$ is a singular optimal control in the classical sense, then

[TABLE]

for any $v\left(\cdot\right)=u\left(\cdot\right)-\bar{u}\left(\cdot\right),$ $u\left(\cdot\right)\in\mathcal{U}\left(0,T\right).$

Proof.

According to the definition of value function, we have

[TABLE]

Letting $\varepsilon\rightarrow 0+,$ we get the desired result from Definition 12 and Lemma 14. $\Box$

Remark 16.

Clearly, if $f$ does not depend on $\left(y,z\right)$ , then $\chi\left(\cdot\right)\equiv 1.$ Consequently, (43) reduces to

[TABLE]

which is just the case studied in Zhang et al. [89] for classical stochastic control problems. Meanwhile, our result actually extends Peng [73] to the second-order case.

Remark 17.

Recall that, for deterministic systems, it is possible to derive pointwise necessary conditions for optimal controls from suitable integral-type necessary conditions, and normally there are no obstacles to establishing the pointwise first-order necessary condition for optimal controls whenever an integral-type one is at hand. Nevertheless, the classical approach of passing from the integral-type condition to the pointwise one cannot be employed directly for the pointwise second-order condition in the general stochastic setting, because of certain features of stochastic systems. In order to derive the second-order variational equations for the BSDE, Hu [44] introduces two kinds of adjoint equations and a new Hamiltonian function. The main difference between these variational equations and those in Peng 1990 [72] lies in the term $p\left(t\right)\delta\sigma\left(t\right)I_{E_{\varepsilon}}\left(t\right).$ Then, it is possible to get the maximum principle based on one variational equation. Note that the order of the difference between the perturbed state and the sum of the optimal state and the first- and second-order states is $o\left(\varepsilon\right).$

As observed in Theorem 15, there appears a term $\mathbb{H}\left(s\right)x_{1}\left(s\right)v\left(s\right)$ . In order to deal with it, we give the expression of $x_{1}\left(\cdot\right),$ mainly taken from Theorem 1.6.14 in Yong and Zhou [87]. To this end, consider the following matrix-valued stochastic differential equation:

[TABLE]

where $I$ denotes the identity matrix in $\mathbb{R}^{n\times n}$ . Then,

[TABLE]
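The representation of $x_{1}$ through $\Psi$ can be made fully explicit in the scalar case with constant coefficients. The sketch below assumes that (44) has the standard linear form $\mathrm{d}\Psi=b_{x}\Psi\,\mathrm{d}s+\sigma_{x}\Psi\,\mathrm{d}W$, $\Psi\left(0\right)=I$, as in Yong and Zhou [87], and that $x_{1}$ solves the first-order variational equation with $x_{1}\left(0\right)=0$; the constants $a_{0},c_{0},a_{1},c_{1}$ are hypothetical stand-ins for $b_{x},\sigma_{x},b_{u},\sigma_{u}$.

```latex
% Scalar illustration (n = k = 1) with hypothetical constants a_0, c_0, a_1, c_1:
%   d\Psi = a_0 \Psi ds + c_0 \Psi dW,  \Psi(0) = 1,
%   dx_1  = (a_0 x_1 + a_1 v) ds + (c_0 x_1 + c_1 v) dW,  x_1(0) = 0.
\[
\Psi(t)=\exp\Bigl\{\bigl(a_{0}-\tfrac12 c_{0}^{2}\bigr)t+c_{0}W(t)\Bigr\},
\qquad
\mathrm{d}\Psi^{-1}=\bigl(-a_{0}+c_{0}^{2}\bigr)\Psi^{-1}\,\mathrm{d}s-c_{0}\Psi^{-1}\,\mathrm{d}W .
\]
% Ito's product rule applied to \Psi^{-1} x_1 cancels every x_1-term and leaves
\[
x_{1}(t)=\Psi(t)\Bigl[\int_{0}^{t}\Psi^{-1}(s)\bigl(a_{1}-c_{0}c_{1}\bigr)v(s)\,\mathrm{d}s
 +\int_{0}^{t}\Psi^{-1}(s)\,c_{1}\,v(s)\,\mathrm{d}W(s)\Bigr].
\]
% The correction -c_0 c_1 v in the ds-integral comes from the cross-variation
% d<\Psi^{-1}, x_1> = -c_0 \Psi^{-1} (c_0 x_1 + c_1 v) ds.
```

The stochastic integral in this representation is the source of the Itô integral that appears once $x_{1}$ is substituted into the variational inequality.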

Substituting the explicit representation (45) of $x_{1}$ into (43) yields

[TABLE]

Clearly, (46) contains an Itô integral. Next we shall borrow the spike variation method from [89] to check its order with respect to the perturbed control. More precisely, let $\varepsilon>0$ and let $E_{\varepsilon}\subset\left[0,T\right]$ be a Borel set with Borel measure $\left|E_{\varepsilon}\right|=\varepsilon,$ and define

[TABLE]

where $u\left(\cdot\right)\in\mathcal{U}\left(0,T\right).$ This $u^{\varepsilon}$ is called a spike variation of the optimal control $\bar{u}$ . For our aim, we only need to use $E_{\varepsilon}=\left[l,l+\varepsilon\right]$ for $l\in\left[0,T-\varepsilon\right]$ and $\varepsilon>0$ . Let

[TABLE]

Then, inserting it into (46), we have

[TABLE]

By Hölder inequality and Burkholder-Davis-Gundy inequality, we have

[TABLE]

since $\sup_{s\in\left[0,T\right]}\left|\chi\left(s\right)\right|^{2}<\infty$ from classical estimate for stochastic differential equations.
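For reference, the two inequalities invoked in the estimate above can be stated in the following generic form. This is a sketch of the standard tools only, not a reconstruction of the paper’s display; the integrands $g$ and $\phi$ are placeholders for any suitably integrable, progressively measurable processes.

```latex
% Hoelder (Cauchy-Schwarz) on a time set E_\varepsilon of measure \varepsilon:
\[
\Bigl|\int_{E_{\varepsilon}}g(s)\,\mathrm{d}s\Bigr|
 \le\varepsilon^{1/2}\Bigl(\int_{E_{\varepsilon}}|g(s)|^{2}\,\mathrm{d}s\Bigr)^{1/2}.
\]
% Burkholder-Davis-Gundy: for every \beta > 0 there is a constant C_\beta with
\[
\mathbb{E}\Bigl[\sup_{0\le t\le T}\Bigl|\int_{0}^{t}\phi(s)\,\mathrm{d}W(s)\Bigr|^{\beta}\Bigr]
 \le C_{\beta}\,\mathbb{E}\Bigl[\Bigl(\int_{0}^{T}|\phi(s)|^{2}\,\mathrm{d}s\Bigr)^{\beta/2}\Bigr].
\]
% If \phi vanishes outside E_\varepsilon and is bounded by a constant M there,
% the right-hand side is at most C_\beta M^\beta \varepsilon^{\beta/2}.
```

This is the mechanism by which a spike variation supported on a set of measure $\varepsilon$ produces powers of $\varepsilon^{1/2}$ in the stochastic-integral terms.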

Lemma 18 (Martingale representation theorem).

Suppose that $\phi\in L_{\mathbb{F}}^{2}\left(\Omega;L^{2}\left(\left[0,T\right];\mathbb{R}^{n}\right)\right).$ Then, there exists a $\kappa\left(\cdot,\cdot\right)\in L^{2}\left(\left[0,T\right];L_{\mathbb{F}}^{2}\left(\left[0,T\right]\times\Omega;\mathbb{R}^{n}\right)\right)$ such that

[TABLE]

The proof can be found in Zhang et al. [89].

Lemma 19.

Assume that (A1)-(A2) hold. Then,

[TABLE]

Proof.

We shall prove that

[TABLE]

From (A1)-(A2), we have

[TABLE]

Besides,

[TABLE]

Hence,

[TABLE]

From Lemma 10 and the classical estimation in (38), we finish the proof. $\Box$

Therefore, by our assumption (A1)-(A2) and Lemma 18, for any $u\in U$ , there exists a

[TABLE]

such that for a.e. $t\in\left[0,T\right]$

[TABLE]

Using (47), we are able to assert the following:

Theorem 20.

Suppose that (A1)-(A2) are in force. Let $\bar{u}\left(\cdot\right)$ be a singular optimal control in the classical sense, then we have

[TABLE]

where

[TABLE]

where $\psi^{u}\left(s,t\right)$ is obtained by (47), and $\Psi$ is determined by (44).

The proof repeats the argument of Theorem 3.10 in [89], so we omit it.

Note that Theorem 20 is pointwise with respect to the time variable $t$ (although it still involves an integral form). Now if both $\chi\left(\cdot\right)\mathbb{H}\left(\cdot\right)$ and $\bar{u}\left(\cdot\right)$ are regular enough, then the function $\psi^{u}\left(\cdot,\cdot\right)$ admits an explicit representation.

Suppose the following:

(A3)

$\bar{u}\left(\cdot\right)\in\mathbb{L}_{2,\mathbb{F}}^{1,2}\left(\mathbb{R}^{k}\right),$ $\chi\left(\cdot\right)\mathbb{H}^{\top}\left(\cdot\right)\in\mathbb{L}_{2,\mathbb{F}}^{1,2}\left(\mathbb{R}^{k\times n}\right)\cap L^{\infty}\left(\left[0,T\right]\times\Omega;\mathbb{R}^{k\times n}\right).$

Theorem 21.

Suppose that the Assumptions (A1)-(A3) are in force. Let $\bar{u}\left(\cdot\right)$ be a singular optimal control in the classical sense, then we have

[TABLE]

Observe that the expression (43) is similar to (3.17) in [89]. Therefore, the proof follows the same lines as that of Theorem 3.13 in Zhang and Zhang [89].

3.2.1 Example

We provide a concrete example to illustrate our theoretical result (Theorem 21). If the FBSDEs considered in this paper are linear, it is possible to implement our principles directly. For convenience, we still adopt the notations introduced in Section 3.2.

Example 22.

Consider the following FBSDEs with $n=1$ and $U=\left[-1,1\right].$

[TABLE]

One can easily get the solutions to (34),

[TABLE]

Set $\left(\bar{x}\left(t\right),\bar{y}\left(t\right),\bar{z}\left(t\right),\bar{u}\left(t\right)\right)=\left(0,0,0,0\right).$ The corresponding adjoint equations are (25) and (26), namely,

[TABLE]

and

[TABLE]

It follows immediately that the solutions to (49) and (50) are

[TABLE]

respectively. Hence, we have

[TABLE]

Therefore, $\bar{u}\left(t\right)=0$ is a singular control in the classical sense. Moreover, we compute

[TABLE]

Consequently, we get

[TABLE]

which indicates that Theorem 21 always holds and $\bar{u}\left(r\right)=0$ is a singular optimal control.

4 Singular Optimal Controls via Dynamic Programming Principle

In this section, we study our control problem from the viewpoint of the dynamic programming principle (DPP). From now on, we focus on the following

[TABLE]

Since the value function is defined through the solution of the controlled BSDE (51), the existence and uniqueness of that solution imply that $u$ defined in (11) is well-defined.

Remark 23.

We assume that $G_{n\times m}$ and $K_{1\times m}$ are deterministic matrices. On the one hand, from the derivations in Theorem 5.1 of [43], it is convenient to exhibit the “inaction” region for the singular control. On the other hand, we may regard $Y_{s}^{t,x;v,\xi}+K\xi_{s}$ together as a solution; in this way, we are able to apply the classical Itô’s formula, avoiding the appearance of jumps. We believe these assumptions can be suitably relaxed, but at present we consider only constant matrices in our paper. Moreover, in order to get the uniqueness of the solution to the H-J-B inequality (52), we add the assumption $K^{i}>k_{0}>0,$ $1\leq i\leq m.$ For more details, see Theorem 2.2 in [93].

Set

[TABLE]

4.1 Verification Theorem via Viscosity Solutions

Zhang [93] has given a verification theorem for smooth solution of the following H-J-B inequality:

[TABLE]

Lemma 24.

Define

[TABLE]

Then the optimal state process $X^{t,x;\hat{v},\hat{\xi}}$ is continuous whenever $\left(r,X_{r}^{t,x;\hat{v},\hat{\xi}}\right)\in\mathcal{D}_{r}\left(u\right)$ . To be precise, we have

[TABLE]

The proof can be seen in Zhang [93].

Proposition 25.

Suppose that $V$ is a classical solution of the H-J-B inequality (52) such that for some $l>1,$

[TABLE]

Then for any $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}$ and $\left(v,\xi\right)\in\mathcal{U}:$

[TABLE]

Furthermore, if there exists $\left(\hat{v},\hat{\xi}\right)\in\mathcal{U}$ such that

[TABLE]

and

[TABLE]

Then

[TABLE]

In this section, we remove the unrealistic smoothness assumption on the value function by means of viscosity solutions. (Footnote 2: In the classical optimal stochastic control theory, the value function is a solution to the corresponding H-J-B equation whenever it has sufficient regularity (Fleming and Rishel [35], Krylov [49]). Nevertheless, when it is only known that the value function is continuous, the value function is a solution to the H-J-B equation in the viscosity sense (see Lions [23]).) We will recall the definition of a viscosity solution for the H-J-B variational inequality (52) from [23]. Below, $\mathbb{S}^{n}$ will denote the set of $n\times n$ symmetric matrices.

Let us begin by introducing the following parabolic superjet:

Definition 26.

Let $V\left(t,x\right)\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ and $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}$ . We denote by $\mathcal{P}^{2,+}V\left(t,x\right)$ , the “parabolic superjet” of $V$ at $\left(t,x\right)$ the set of triples $\left(p,q,X\right)\in\mathbb{R}\times\mathbb{R}^{n}\times\mathbb{S}^{n}$ which are such that

[TABLE]

Similarly, we denote by $\mathcal{P}^{2,-}V\left(t,x\right),$ the “parabolic subjet” of $V$ at $\left(t,x\right)$ the set of triples $\left(p,q,X\right)\in\mathbb{R}\times\mathbb{R}^{n}\times\mathbb{S}^{n}$ which are such that

[TABLE]

Lemma 27.

Let $V\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ and $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}$ be given. Then:

1) $\left(p,q,X\right)\in\mathcal{P}^{2,+}V\left(t,x\right)$ if and only if there exists a function $\varphi\in C^{1,2}\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ such that $V-\varphi$ attains a strict maximum at $\left(t,x\right)$ and

[TABLE]

2) $\left(p,q,X\right)\in\mathcal{P}^{2,-}V\left(t,x\right)$ if and only if there exists a function $\varphi\in C^{1,2}\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ such that $V-\varphi$ attains a strict minimum at $\left(t,x\right)$ and

[TABLE]

More details can be found in Lemmas 5.4 and 5.5 in Yong and Zhou [87].
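To make the notion of parabolic jets concrete, the following sketch works out a one-dimensional example. It assumes the standard two-sided defining inequality for the superjet (the subjet uses the reversed inequality), which may differ in detail from the displays of Definition 26; the function $V\left(t,x\right)=-\left|x\right|$ and the base point $\left(t,0\right)$ with $0<t<T$ are illustrative choices not taken from the paper.

```latex
% Assumed superjet inequality at (t,0), as (s,y) -> (t,0), with n = 1:
\[
V(s,y)\le V(t,0)+p\,(s-t)+q\,y+\tfrac12 X y^{2}+o\bigl(|s-t|+|y|^{2}\bigr).
\]
% Taking y = 0 and letting s approach t from both sides forces p = 0.
% For |q| < 1 and arbitrary X \in \mathbb{R}:
\[
-|y|-q\,y\le-(1-|q|)\,|y|\le\tfrac12 X y^{2}
 \quad\text{whenever } |X|\,|y|\le 2\,(1-|q|),
\]
\[
\text{hence}\quad \{0\}\times(-1,1)\times\mathbb{R}\subset\mathcal{P}^{2,+}V(t,0).
\]
% Subjet: the reversed inequality at s = t, divided by |y|, reads
%   -1 \ge q\,\mathrm{sgn}(y) + o(1),
% which gives q \le -1 (y \downarrow 0) and q \ge 1 (y \uparrow 0): impossible.
\[
\mathcal{P}^{2,-}V(t,0)=\emptyset .
\]
```

The example shows that at a concave kink the superjet is large while the subjet is empty, so only the subsolution inequality is tested at such a point.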

Define

[TABLE]

Definition 28.

(i) We say that $V\left(t,x\right)\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ is a viscosity subsolution of (52) if $V\left(T,x\right)\geq\Phi\left(x\right),$ $x\in\mathbb{R}^{n}$, and at any point $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}$, for any $\left(p,q,X\right)\in\mathcal{P}^{2,+}V\left(t,x\right)$,

[TABLE]

In other words, at any point $\left(t,x\right),$ we have both $qG+K\geq 0$ and

[TABLE]

(ii) We say that $V\left(t,x\right)\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ is a viscosity supersolution of (52) if $V\left(T,x\right)\leq\Phi\left(x\right),$ $x\in\mathbb{R}^{n}$, and at any point $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}$, for any $\left(p,q,X\right)\in\mathcal{P}^{2,-}V\left(t,x\right)$,

[TABLE]

In other words, at any point where $qG+K\geq 0$ , we have

[TABLE]

(iii) We say that $V\left(t,x\right)\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ is a viscosity solution of (52) if it is both a viscosity subsolution and a viscosity supersolution.

We have the following result:

Proposition 29.

Assume that (A1)-(A2) are in force. Then there exists at most one viscosity solution of H-J-B inequality (52) in the class of bounded and continuous functions.

We need a generalized Itô’s formula. Define

[TABLE]

For any $\Psi\in C^{1,2}\left(\left[0,T\right]\times\mathbb{R}^{n};\mathbb{R}\right)$ , by virtue of Doléans-Dade-Meyer formula (see [43, 21]), we have

[TABLE]

We begin to introduce a useful lemma.

Lemma 30.

Assume that (A1)-(A2) are in force. Let $\left(t,x\right)\in\left[0,T\right)\times\mathbb{R}^{n}$ be fixed and let $\left(X^{t,x;u}\left(\cdot\right),u\left(\cdot\right)\right)$ be an admissible pair. Define processes

[TABLE]

Then

[TABLE]

The proof can be found in [87].

Lemma 31.

Let $g\in C\left(\left[0,T\right]\right).$ Extend $g$ to $\left(-\infty,+\infty\right)$ with $g\left(t\right)=g\left(T\right)$ for $t>T,$ and $g\left(t\right)=g\left(0\right)$ for $t<0.$ Suppose that there is an integrable function $\rho\in L^{1}\left(\left[0,T\right];\mathbb{R}\right)$ and some $h_{0}>0,$ such that

[TABLE]

Then

[TABLE]

The proof can be seen in Zhang [92].

The main result in this section is the following.

Theorem 32 (Verification Theorem).

Suppose that the Assumptions (A1)-(A2) are in force. Let $V\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ be a viscosity solution of the H-J-B equations (52), satisfying the following conditions:

[TABLE]

Then we have

[TABLE]

for any $\left(t,x\right)\in\left(0,T\right]\times\mathbb{R}^{n}$ and any $u\left(\cdot\right)\times\xi\left(\cdot\right)\in\mathcal{U}\left(t,T\right).$

Furthermore, let $\left(t,x\right)\in\left(0,T\right]\times\mathbb{R}^{n}$ be fixed and let

[TABLE]

be an admissible pair such that there exist a function $\varphi\in C^{1,2}\left(\left[0,T\right];\mathbb{R}^{n}\right)$ and a triple

[TABLE]

satisfying

[TABLE]

and

[TABLE]

where $\overline{\varphi}\left(s\right)=\varphi\left(s,\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(s\right)\right)$ and $\mathcal{G}$ is defined in (58). Then $(\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right))$ is an optimal pair.

In order to prove Theorem 32, we need the following lemma:

Lemma 33.

Let $v$ be a viscosity subsolution of the H-J-B equations (52) satisfying (63). Then we have

[TABLE]

where $\rho\in L^{1}\left(\left[t,T\right];\mathbb{R}\right).$

The proof is given in the Appendix.

Proof of Theorem 32.

We have (64) from the uniqueness of viscosity solutions of the H-J-B equations (52). It remains to show that $\left(\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ is optimal. We fix $t_{0}\in\left[t,T\right]$ such that (65) and (66) hold at $t_{0}$ for $z_{1}\left(\cdot\right)=\overline{b}\left(\cdot\right),$ $z_{2}\left(\cdot\right)=\overline{\sigma}\left(\cdot\right)\overline{\sigma}\left(\cdot\right)^{\ast},$ $z_{3}\left(\cdot\right)=\overline{f}\left(\cdot\right).$ By Lemma 7 in [92], the set of such points is of full measure in $\left[t,T\right].$ Now we fix $\omega_{0}\in\Omega$ such that the regular conditional probability $P\left(\left.\cdot\right|\mathcal{F}_{t_{0}}^{t}\right)\left(\omega_{0}\right)$, given $\mathcal{F}_{t_{0}}^{t}$, is well defined. In this new probability space, the random variables $\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(t_{0}\right),\overline{p}\left(t_{0}\right),\overline{q}\left(t_{0}\right),\overline{\Theta}\left(t_{0}\right)$ are almost surely deterministic constants, equal to

[TABLE]

respectively. We remark that in this probability space the Brownian motion $W$ is still a standard Brownian motion, although now $W\left(t_{0}\right)=W\left(t_{0},\omega_{0}\right)$ almost surely. The space is now equipped with a new filtration $\left\{\mathcal{F}_{r}^{t}\right\}_{t\leq r\leq T}$, and the control process $\overline{u}\left(\cdot\right)$ is adapted to this new filtration. For $P$-a.s. $\omega_{0}$, the process $\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right)$ is a solution of (1.1) on $\left[t_{0},T\right]$ in $\left(\Omega,\mathcal{F},P\left(\left.\cdot\right|\mathcal{F}_{t_{0}}^{t}\right)\left(\omega_{0}\right)\right)$ with the initial condition $\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(t_{0}\right)=\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(t_{0},\omega_{0}\right).$

Then, on the probability space $\left(\Omega,\mathcal{F},P\left(\left.\cdot\right|\mathcal{F}_{t_{0}}^{t}\right)\left(\omega_{0}\right)\right)$, we apply Itô’s formula to $\varphi$ on $\left[t_{0},t_{0}+h\right]$ for any $h>0$:

[TABLE]

Taking the conditional expectation $\mathbb{E}^{\mathcal{F}_{t_{0}}^{t}}\left(\cdot\right)\left(\omega_{0}\right),$ dividing both sides by $h$, and using (66), we have

[TABLE]

We now handle the last two terms. Note that

[TABLE]

and

[TABLE]

Thus

[TABLE]

We now deal with the term

[TABLE]

Combining (70) and (71), we have

[TABLE]

Letting $h\rightarrow 0$ and employing a delicate argument similar to that in the proof of Theorem 4.1 of Gozzi et al. [41], we have

[TABLE]

By Lemma 33, there exist $\rho\in L^{1}\left(t_{0},T;\mathbb{R}\right)$ and $\rho_{1}\in L^{1}\left(\Omega;\mathbb{R}\right)$ such that

[TABLE]

and

[TABLE]

hold, respectively.

By virtue of Fatou’s Lemma, noting (74), we obtain

[TABLE]

for a.e. $t_{0}\in\left[t,T\right].$ Then the rest of the proof goes exactly as in [41].

We apply Lemma 8 in [92] to $g\left(t\right)=\mathbb{E}\left[v\left(t,\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(t\right)\right)\right],$ using (73), (67) and (75), to get

[TABLE]

From this we claim that

[TABLE]

where

[TABLE]

Thus, combining the above with the first assertion (64), we conclude that $\left(\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\overline{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ is an optimal pair. The proof is thus completed. $\Box$

Remark 34.

Condition (67) is equivalent to the following:

[TABLE]

where $\overline{\varphi}\left(t\right)$ is defined in Theorem 32. This is easily seen by recalling the fact that $v$ is the viscosity solution of (52):

[TABLE]

which yields (76) under (67).

Remark 35.

Clearly, Theorem 32 is expressed in terms of the parabolic superjet. One may naturally ask whether a similar result holds for the parabolic subjet. The answer is positive in the deterministic case (in terms of the first-order parabolic subjet, see Theorem 3.9 in [87]). Unfortunately, as pointed out by Yong and Zhou [87], the statement of Theorem 32 is no longer valid when the parabolic superjet in (66) is replaced by the parabolic subjet.
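For reference, the second-order parabolic superjet of a continuous function $v$ at $\left(t,x\right)$ is usually defined as below. This is the standard definition from viscosity solution theory, and the paper's own definition may differ in details such as one-sided limits in time:

$$
\mathcal{P}^{2,+}v\left(t,x\right)=\left\{\left(p,q,\Theta\right)\in\mathbb{R}\times\mathbb{R}^{n}\times\mathbf{S}^{n}:\ \limsup_{\left(s,y\right)\rightarrow\left(t,x\right)}\frac{v\left(s,y\right)-v\left(t,x\right)-p\left(s-t\right)-\left\langle q,y-x\right\rangle-\frac{1}{2}\left\langle\Theta\left(y-x\right),y-x\right\rangle}{\left|s-t\right|+\left|y-x\right|^{2}}\leq 0\right\},
$$

and the parabolic subjet is $\mathcal{P}^{2,-}v\left(t,x\right)=-\mathcal{P}^{2,+}\left(-v\right)\left(t,x\right).$ If $v$ is of class $C^{1,2}$ near an interior point $\left(t,x\right)$, then $\left(v_{t},v_{x},v_{xx}\right)\left(t,x\right)$ belongs to both sets, so the two formulations coincide with the classical one in the smooth case.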

Now let us present a non-smooth version of the necessity part of Theorem 32. However, we obtain only a “partial” result.

Theorem 36.

Assume that (A1)-(A2) hold. Let $v\in C\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ be a viscosity solution of the H-J-B equations (52) and let $\left(\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ be an optimal singular control. Let $\left(\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{Y}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{Z}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ be an admissible pair such that there exist a function $\varphi\in C^{1,2}\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$ and a triple

[TABLE]

satisfying

[TABLE]

Then, it holds that

[TABLE]

Proof.

On the one hand, let $s\in\left[t,T\right]$ and $\omega\in\Omega$ be such that

[TABLE]

By Lemma 27, there exists a test function $\varphi\in C^{1,2}\left(\left[0,T\right]\times\mathbb{R}^{n}\right)$, with $\left(s,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}$ and $\left(p,q,\Theta\right)\in\mathbb{R}\times\mathbb{R}^{n}\times\mathbf{S}^{n}$, such that $v-\varphi$ achieves its minimum at $\left(s,\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(s\right)\right)$ and

[TABLE]

holds. Then, for sufficiently small $\theta>0$ and a.e. $s\in\left[t,T\right]$,

[TABLE]

The last inequality follows from the derivation in the proof of Theorem 32 by means of condition (77). On the other hand, since $\left(\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{Y}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{Z}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ is optimal, the DPP yields

[TABLE]

which implies that

[TABLE]

Therefore, it follows from (78) that

[TABLE]

where

[TABLE]

We thus complete the proof. $\Box$

4.2 Optimal Feedback Controls

In this subsection, we describe a method for constructing optimal feedback controls by means of the verification theorem (Theorem 32). First, let us recall the definition of admissible feedback controls.

Definition 37.

A measurable function $\left(\mathbf{u,\xi}\right)$ from $\left[0,T\right]\times\mathbb{R}^{n}$ to $U\times\left[0,\infty\right)^{m}$ is called an admissible feedback control pair if for any $\left(t,x\right)\in\left[0,T\right)\times\mathbb{R}^{n}$ there is a weak solution $X^{t,x;u,\xi}\left(\cdot\right)$ of the following SDE:

[TABLE]

where $M^{t,x;\mathbf{u},\mathbf{\xi}}$ is an $\mathbb{R}$-valued $\mathbb{F}^{t,x;\mathbf{u},\mathbf{\xi}}$-adapted càdlàg (right continuous with left limits) martingale vanishing at $t=0$, which is orthogonal to the driving Brownian motion $W.$ Here $\mathbb{F}^{t,x;\mathbf{u},\mathbf{\xi}}=\left(\mathcal{F}_{s}^{X^{t,x;\mathbf{u},\mathbf{\xi}}}\right)_{s\in\left[t,T\right]}$ is the filtration generated by $X^{t,x;\mathbf{u},\mathbf{\xi}}$, that is, the smallest filtration with respect to which $X^{t,x;\mathbf{u},\mathbf{\xi}}$ is adapted. Obviously, $M^{t,x;\mathbf{u},\mathbf{\xi}}$ is part of the solution of the BSDE in (79). We also suppose that $f$ satisfies the Lipschitz condition with respect to $\left(x,y,z\right)$. An admissible feedback control pair $\left(\mathbf{u}^{\star},\mathbf{\xi}^{\star}\right)$ is called optimal if

[TABLE]

is optimal for each $\left(t,x\right)$, where $X^{t,x;\mathbf{u}^{\star},\mathbf{\xi}^{\star}}\left(\cdot\right)$ is a solution of (79) corresponding to $\left(\mathbf{u}^{\star},\mathbf{\xi}^{\star}\right).$

Theorem 38.

Let $\left(\mathbf{u}^{\star},\mathbf{\xi}^{\star}\right)$ be an admissible feedback control pair and let $p^{\star},q^{\star},$ and $\Theta^{\star}$ be measurable functions satisfying $\left(p^{\star}\left(t,x\right),q^{\star}\left(t,x\right),\Theta^{\star}\left(t,x\right)\right)\in\mathcal{P}^{2,+}v\left(t,x\right)$ for all $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n}.$ If

[TABLE]

and $q^{\star}\left(t,x\right)G+K\geq 0$ for all $\left(t,x\right)\in\left[0,T\right]\times\mathbb{R}^{n},$ then $\left(\mathbf{u}^{\star},\mathbf{\xi}^{\star}\right)$ is a singular optimal control pair.

Proof.

From Theorem 32, we get the desired result. $\Box$
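To make the notion of a feedback control pair in Definition 37 more concrete, the following minimal sketch simulates only the forward component of the state equation, $\mathrm{d}X=b\,\mathrm{d}s+\sigma\,\mathrm{d}W+G\,\mathrm{d}\xi$, with an Euler scheme in dimension one. Everything specific in it is an assumption made for illustration and is not derived from Theorem 38: the function name `simulate_feedback`, the coefficients, the mean-reverting regular feedback, and the barrier-type singular feedback that pushes the state back to a lower level `LOWER`. The backward component of (79) is not simulated.

```python
import math
import random


def simulate_feedback(x0, t0, T, n_steps, b, sigma, u_fb, xi_fb, G=1.0, seed=0):
    """Euler scheme for dX = b ds + sigma dW + G dxi under feedback (u_fb, xi_fb).

    xi_fb(s, x) must return a nonnegative push (a singular-control increment).
    Returns the lists of times, states, and cumulative singular control.
    """
    rng = random.Random(seed)
    dt = (T - t0) / n_steps
    s, x, xi = t0, x0, 0.0
    times, xs, xis = [s], [x], [xi]
    for _ in range(n_steps):
        u = u_fb(s, x)                      # regular control in feedback form
        dW = rng.gauss(0.0, math.sqrt(dt))  # Brownian increment over one step
        x_pred = x + b(s, x, u) * dt + sigma(s, x, u) * dW
        s += dt
        d_xi = xi_fb(s, x_pred)             # singular increment, state dependent
        if d_xi < 0.0:
            raise ValueError("singular control increments must be nonnegative")
        x = x_pred + G * d_xi
        xi += d_xi
        times.append(s)
        xs.append(x)
        xis.append(xi)
    return times, xs, xis


# Hypothetical scalar example: the state is kept above the level LOWER.
LOWER = 0.0
times, xs, xis = simulate_feedback(
    x0=1.0, t0=0.0, T=1.0, n_steps=1000,
    b=lambda s, x, u: u,                     # drift equals the regular control
    sigma=lambda s, x, u: 0.5,               # constant diffusion coefficient
    u_fb=lambda s, x: -x,                    # mean-reverting regular feedback
    xi_fb=lambda s, x: max(LOWER - x, 0.0),  # minimal push back to LOWER
)
```

In this sketch the cumulative singular control `xis` is nondecreasing by construction and acts only when the uncontrolled Euler step would fall below `LOWER`, which mirrors the reflection-type behaviour often associated with singular controls. Whether such a pair is optimal for a given cost would still have to be checked against the conditions of Theorem 38.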

Remark 39.

In the FBSDE (79), $Y^{t,x;u}\left(\cdot\right)$ is actually determined by $\left(X^{t,x;u}\left(\cdot\right),u\left(\cdot\right),\xi\left(\cdot\right)\right).$ Hence, we need to investigate the conditions imposed in Theorem 32 to ensure the existence and uniqueness of $X^{t,x;u}\left(\cdot\right)$ in law and the measurability of the multifunction $\left(t,x\right)\rightarrow\mathcal{P}^{2,+}v\left(t,x\right)$, in order to obtain $\left(p^{\star}\left(t,x\right),q^{\star}\left(t,x\right),\Theta^{\star}\left(t,x\right)\right)\in\mathcal{P}^{2,+}v\left(t,x\right)$ that minimizes (80). This can be done by virtue of the celebrated Filippov’s Lemma (cf. [87]).

4.3 The Connection between DPP and MP

In Section 3, we obtained the first- and second-order adjoint equations. In this part, we investigate the connection between the general DPP and the MP for such singular control problems without assuming that the value function is sufficiently smooth. By means of the associated adjoint equations and delicate estimates, it is possible to establish set inclusions among the super- and sub-jets of the value function, the first-order and second-order adjoint processes, and the generalized Hamiltonian function.

Theorem 40.

Assume that (A1)-(A2) are in force. Suppose that $\left(\bar{u},\bar{\xi}\right)$ is a singular optimal control, $v\left(\cdot,\cdot\right)$ is the value function, and $\left(\bar{X}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{Y}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{Z}^{t,x;\bar{u},\bar{\xi}}\left(\cdot\right),\bar{u}\left(\cdot\right),\bar{\xi}\left(\cdot\right)\right)$ is the corresponding optimal trajectory. Let $\left(p,q\right)\in\mathcal{S}^{2}(0,T;\mathbb{R}^{n})\times\mathcal{M}^{2}(0,T;\mathbb{R}^{n})$ and $\left(P,Q\right)\in\mathcal{S}^{2}(0,T;\mathbb{R}^{n\times n})\times\mathcal{M}^{2}(0,T;\mathbb{R}^{n\times n})$ be the solutions of the adjoint equations (25) and (26), respectively. Then, we have

[TABLE]

Proof.

From Theorem 3 and Proposition 8, we get the first part of (81). From Theorem 3.1 in Nie, Shi and Wu [68], we get the second and third results of (81). $\Box$

5 Concluding remarks

In this paper, we have derived a second-order pointwise necessary condition for singular optimal controls, in the classical sense, of FBSDEs with a convex control domain by means of the variational equations and two adjoint equations. This extends the work of Zhang and Zhang [89] to the stochastic recursive case and that of Hu [44] to the pointwise case, in the framework of Malliavin calculus. A new necessary condition for singular control has also been obtained. Moreover, we have investigated the verification theorem for optimal controls via viscosity solutions and established the connection between the adjoint equations and the value function, also in the viscosity solution sense.

Several interesting topics remain for future work:

• As an important issue, the existence of optimal singular controls has not yet been explored. Haussmann and Suo [42] apply the compactification method to study the classical and singular control problem for stochastic differential equations of Itô type, where the problem is reformulated as a martingale problem on an appropriate canonical space after the relaxed form of the classical control is introduced. Under some mild continuity assumptions on the data, they obtain the existence of optimal controls by purely probabilistic arguments. Note that, in the framework of BSDEs with singular control, the trajectory of $Y$ seems to be a càdlàg process (from the French for right continuous with left-hand limits). Hence, we may consider $Y$ in some space with an appropriate topology, for instance the Skorokhod $M_{1}$ topology or the Meyer-Zheng topology (see [36]), to obtain the convergence of the probability measures induced by $Y$ under relaxed controls. Related work based on PDE techniques can be found in [14, 18] and the references therein. Following Wang [83], one may construct the optimal control via the existence of diffusions with reflections (see [24]). It would be interesting to extend this result to FBSDEs.

• The matrices $K,G$ are deterministic. It is also interesting to relax this restriction to time-varying matrices, or even to let the coefficients $b,\sigma,f$ involve the singular control. Whenever the coefficients are random, the H-J-B inequality becomes a stochastic PDE, and stochastic viscosity solutions will no doubt be needed. For this direction, the reader may refer to Buckdahn and Ma [15, 16], Peng [71] and Qiu [77].

• The general case, in which the control region is non-convex and both the drift and diffusion terms depend on the control variable, is also of interest. From the viewpoint of applications, such a model is more reasonable and urgent in many real-life problems (for instance, finance models in which the controls may affect the uncertainty). In the near future, we shall remove the condition of a convex control region, employing the idea developed by Zhang et al. [90]. It is worth mentioning that the analysis in [90] is much more complicated: some new and useful tools, such as multilinear function valued stochastic processes and the BSDEs for these processes, are introduced there. Hence, it will be interesting to borrow these tools to investigate singular optimal control problems for FBSDEs, which should promote and enrich the theory of FBSDEs.

Appendix A Proofs of Lemmas

Proof of Lemma 4.

We first prove the continuous dependence of the solution on the parameter.

Set

[TABLE]

It can be shown that

[TABLE]

by standard estimates and the Burkholder-Davis-Gundy inequality, so we omit it.

Next, set

[TABLE]

Note that (19) has been obtained in [13]. We will prove (20) and (21).

Then,

[TABLE]

where

[TABLE]

Simple calculation yields

[TABLE]

where

[TABLE]

and

[TABLE]

From classical theory of BSDE, one can show

[TABLE]

By using (82) and (84), the dominated convergence theorem, Lemma 1 and Gronwall’s lemma, we get the desired result by letting $\alpha\rightarrow 0$ . $\Box$

Proof of Lemma 33.

From (63) and (6) in Gozzi et al. [41], we have that if $\left(p,q,P\right)\in\mathcal{P}^{2,+}v\left(t,x\right),$ then

[TABLE]

We shall deal with $I_{1},$ $I_{2},$ $I_{3}$ separately. For $I_{1},$ we have $\mathbb{E}\left(1+\left|X^{t,x;\bar{u},\bar{\xi}}\left(t+h\right)\right|^{m}\right)h\leq C\left(1+\left|x\right|^{m}\right)h$ by classical estimates and the assumption $\mathbb{E}\left[\left|\xi_{T}\right|^{2}\right]<\infty.$ For $I_{2},$ from (7) in [41] and Hölder’s inequality, we have

[TABLE]

since $\mathbb{E}\left[\left|\xi_{T}\right|^{2}\right]<\infty$ and the fact $\left(1+\left|x\right|^{2}\right)^{\frac{1}{2}}\leq 1+\left|x\right|.$

Finally,

[TABLE]

By the Itô isometry and classical estimates for SDEs, we complete the proof. $\Box$

Bibliography

[1] L. Alvarez, A class of solvable singular stochastic control problems, Stochastics Stochastics Rep. 67 (1999) 83–122.
[2] L. Alvarez, Singular stochastic control, linear diffusions, and optimal stopping: A class of solvable problems, SIAM J. Control Optim. 39 (2001) 1697–1710.
[3] J. M. Bismut, Théorie Probabiliste du Contrôle des Diffusions, Memoirs of the American Mathematical Society, 176, Providence, Rhode Island, 1973.
[4] J. Bismut, An introductory approach to duality in optimal stochastic control, SIAM Review 20 (1978) 62–78.
[5] D. J. Bell and D. H. Jacobson, Singular Optimal Control Problems, Math. Sci. Eng. 117, Academic Press, New York, 1975.
[6] V. E. Beneš, L. A. Shepp, H. S. Witsenhausen, Some solvable stochastic control problems, Stochastics 4 (1980) 39–83.
[7] F. Baldursson, Singular stochastic control and optimal stopping, Stochastics 21 (1987) 1–40.
[8] F. Boetius, Bounded variation singular stochastic control and associated Dynkin game, in Mathematical Finance, Trends Math., M. Kohlmann and S. Tang, eds., Birkhäuser, Basel, 2001, 111–120.
