Mild solutions to the dynamic programming equation for stochastic   optimal control problems

Viorel Barbu; Chiara Benazzoli; Luca Di Persio

arXiv:1706.06824·math.PR·June 22, 2017·Autom.

Mild solutions to the dynamic programming equation for stochastic optimal control problems

Viorel Barbu, Chiara Benazzoli, Luca Di Persio

PDF

TL;DR

This paper establishes the existence and uniqueness of mild solutions to the 1-D dynamic programming equation in stochastic optimal control with multiplicative noise, using nonlinear semigroup theory, and extends results to higher dimensions.

Contribution

It introduces a novel approach using nonlinear semigroup theory to analyze the dynamic programming equation in stochastic control, including multidimensional cases.

Findings

01

Unique mild solution in 1D for the dynamic programming equation

02

Solution regularity in $C([0,T];W^{1,inity})$ and $ ext{second derivative}$ in $C([0,T];L^1)$

03

Extension of results to n-dimensional stochastic control problems

Abstract

We show via the nonlinear semigroup theory in $L^{1} (R)$ that the $1$ -D dynamic programming equation associated with a stochastic optimal control problem with multiplicative noise has a unique mild solution $φ \in C ([0, T]; W^{1, \infty} (R))$ with $φ_{xx} \in C ([0, T]; L^{1} (R))$ . The $n$ -dimensional case is also investigated.

Equations243

\text{\text@underline{Minimize}}\quad\mathbb{E}\biggl{\{}\int_{0}^{T}\Bigl{(}g\bigl{(}X(t)\bigr{)}+h\bigl{(}u(t)\bigl{)}\Bigr{)}\,dt+g_{0}\bigl{(}X(T)\bigr{)}\biggr{\}},

\text{\text@underline{Minimize}}\quad\mathbb{E}\biggl{\{}\int_{0}^{T}\Bigl{(}g\bigl{(}X(t)\bigr{)}+h\bigl{(}u(t)\bigl{)}\Bigr{)}\,dt+g_{0}\bigl{(}X(T)\bigr{)}\biggr{\}},

{d X = f (X) d t + u σ (X) d W, for t \in (0, T) X (0) = X_{0}

{d X = f (X) d t + u σ (X) d W, for t \in (0, T) X (0) = X_{0}

∣ σ (x) ∣ \geq ρ > 0, \forall x \in R .

∣ σ (x) ∣ \geq ρ > 0, \forall x \in R .

H (u) = h (u) + I_{[0, \infty)} (u) = {h (u) if u \geq 0 + \infty otherwise

H (u) = h (u) + I_{[0, \infty)} (u) = {h (u) if u \geq 0 + \infty otherwise

H^{*} (p) = sup {p u - H (u) : u \in R}, \forall p \in R .

H^{*} (p) = sup {p u - H (u) : u \in R}, \forall p \in R .

(H^{*})^{''} \in L^{\infty} (R), 0 \leq (H^{*})^{'} (p) \leq C (∣ p ∣ + 1), \forall p \in R .

(H^{*})^{''} \in L^{\infty} (R), 0 \leq (H^{*})^{'} (p) \leq C (∣ p ∣ + 1), \forall p \in R .

j (r) = \int_{0}^{r} H^{*} (p) d p, \forall r \in R .

j (r) = \int_{0}^{r} H^{*} (p) d p, \forall r \in R .

\begin{cases}\varphi_{t}(t,x)+\min_{u}\bigl{\{}\frac{1}{2}\sigma^{2}\,\varphi_{xx}(t,x)\,u+H(u)\bigr{\}}\\ \hskip 28.45274pt+f(x)\,\varphi_{x}(t,x)+g(x)=0,\quad\forall t\in[0,T],x\in\mathbb{R}\\ \varphi(T,x)=g_{0}(x),\quad x\in\mathbb{R},\end{cases}

\begin{cases}\varphi_{t}(t,x)+\min_{u}\bigl{\{}\frac{1}{2}\sigma^{2}\,\varphi_{xx}(t,x)\,u+H(u)\bigr{\}}\\ \hskip 28.45274pt+f(x)\,\varphi_{x}(t,x)+g(x)=0,\quad\forall t\in[0,T],x\in\mathbb{R}\\ \varphi(T,x)=g_{0}(x),\quad x\in\mathbb{R},\end{cases}

\begin{cases}\varphi_{t}(t,x)-H^{*}\bigl{(}-\frac{1}{2}\sigma^{2}\,\varphi_{xx}(t,x)\bigr{)}+f(x)\,\varphi_{x}(t,x)\\ \hskip 71.13188pt+g(x)=0,\quad\forall(t,x)\in[0,T]\times\mathbb{R}\\ \varphi(T,x)=g_{0}(x),\quad x\in\mathbb{R}\,.\end{cases}

\begin{cases}\varphi_{t}(t,x)-H^{*}\bigl{(}-\frac{1}{2}\sigma^{2}\,\varphi_{xx}(t,x)\bigr{)}+f(x)\,\varphi_{x}(t,x)\\ \hskip 71.13188pt+g(x)=0,\quad\forall(t,x)\in[0,T]\times\mathbb{R}\\ \varphi(T,x)=g_{0}(x),\quad x\in\mathbb{R}\,.\end{cases}

u(t)=\arg\min_{u}\Bigl{\{}\frac{1}{2}\,\sigma^{2}\,\varphi_{xx}\bigl{(}t,X(t)\bigr{)}\,u+H(u)\Bigr{\}}\,,

u(t)=\arg\min_{u}\Bigl{\{}\frac{1}{2}\,\sigma^{2}\,\varphi_{xx}\bigl{(}t,X(t)\bigr{)}\,u+H(u)\Bigr{\}}\,,

_{X} ⟨ v_{1} - v_{2}, η ⟩_{X^{'}} \geq 0,

_{X} ⟨ v_{1} - v_{2}, η ⟩_{X^{'}} \geq 0,

y (t, x) = - φ_{xx} (T - t, x), \forall t \in [0, T], x \in R,

y (t, x) = - φ_{xx} (T - t, x), \forall t \in [0, T], x \in R,

\begin{cases}y_{t}(t,x)-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y(t,x)\bigr{)}\Bigr{)}_{xx}+f^{\prime\prime}(x)\varphi_{x}(T-t,x)\\ \hskip 14.22636pt-2f^{\prime}(x)y(t,x)-f(x)y_{x}(t,x)=-g^{\prime\prime}(x),\\ \hskip 156.49014pt\text{in }(0,T)\times\mathbb{R}\\ y(0,x)=-g^{\prime\prime}_{0}(x),\quad x\in\mathbb{R}.\end{cases}

\begin{cases}y_{t}(t,x)-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y(t,x)\bigr{)}\Bigr{)}_{xx}+f^{\prime\prime}(x)\varphi_{x}(T-t,x)\\ \hskip 14.22636pt-2f^{\prime}(x)y(t,x)-f(x)y_{x}(t,x)=-g^{\prime\prime}(x),\\ \hskip 156.49014pt\text{in }(0,T)\times\mathbb{R}\\ y(0,x)=-g^{\prime\prime}_{0}(x),\quad x\in\mathbb{R}.\end{cases}

- Ψ^{''} = z, in D^{'} (R),

- Ψ^{''} = z, in D^{'} (R),

\varphi(t,x)=-\Phi\bigl{(}y(T-t,x)\bigr{)}\in W^{1,\infty}(\mathbb{R}),\quad\forall t\in[0,T].

\varphi(t,x)=-\Phi\bigl{(}y(T-t,x)\bigr{)}\in W^{1,\infty}(\mathbb{R}),\quad\forall t\in[0,T].

B y = - f^{''} (Φ (y))^{'} - 2 f^{'} y, \forall y \in L^{1} (R),

B y = - f^{''} (Φ (y))^{'} - 2 f^{'} y, \forall y \in L^{1} (R),

∣∣ B y ∣ ∣_{1} \leq C ∣∣ y ∣ ∣_{1}, \forall y \in L^{1} (R) .

∣∣ B y ∣ ∣_{1} \leq C ∣∣ y ∣ ∣_{1}, \forall y \in L^{1} (R) .

\begin{cases}y_{t}-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y\bigr{)}\Bigr{)}_{xx}-f\,y_{x}+B\,y=g_{1},\,\text{in }[0,T]\times\mathbb{R}\\ y(0)=y_{0}\in L^{1}(\mathbb{R})\,,\end{cases}

\begin{cases}y_{t}-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y\bigr{)}\Bigr{)}_{xx}-f\,y_{x}+B\,y=g_{1},\,\text{in }[0,T]\times\mathbb{R}\\ y(0)=y_{0}\in L^{1}(\mathbb{R})\,,\end{cases}

y (t) = ϵ \to 0 lim y_{ϵ} (t) in L^{1} (R), \forall t \in [0, T],

y (t) = ϵ \to 0 lim y_{ϵ} (t) in L^{1} (R), \forall t \in [0, T],

y_{\epsilon}(t)=y_{\epsilon}^{i},\text{ for }t\in[i\,\epsilon,(i+1)\,\epsilon],\,i=0,1,\dots,N=\Bigl{[}\frac{T}{\epsilon}\Bigr{]}\,,

y_{\epsilon}(t)=y_{\epsilon}^{i},\text{ for }t\in[i\,\epsilon,(i+1)\,\epsilon],\,i=0,1,\dots,N=\Bigl{[}\frac{T}{\epsilon}\Bigr{]}\,,

\frac{1}{\epsilon}\,(y_{\epsilon}^{i+1}-y_{\epsilon}^{i})-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y_{\epsilon}^{i+1}\bigr{)}\Bigr{)}^{\prime\prime}\\ -f(y_{\epsilon}^{i+1})^{\prime}+B\,y_{\epsilon}^{i+1}=g_{1},\quad\text{in }\mathcal{D}^{\prime}(\mathbb{R}),

\frac{1}{\epsilon}\,(y_{\epsilon}^{i+1}-y_{\epsilon}^{i})-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y_{\epsilon}^{i+1}\bigr{)}\Bigr{)}^{\prime\prime}\\ -f(y_{\epsilon}^{i+1})^{\prime}+B\,y_{\epsilon}^{i+1}=g_{1},\quad\text{in }\mathcal{D}^{\prime}(\mathbb{R}),

y_{ϵ}^{0} = y_{0}, y_{ϵ}^{i} \in L^{1} (R), i = 0, 1, \dots, N .

y_{ϵ}^{0} = y_{0}, y_{ϵ}^{i} \in L^{1} (R), i = 0, 1, \dots, N .

\varphi\in C\bigl{(}[0,T];W^{1,\infty}(\mathbb{R})\bigr{)}\;,\;\varphi^{\prime\prime}\in C\bigl{(}[0,T];L^{1}(\mathbb{R})\bigr{)}\,,

\varphi\in C\bigl{(}[0,T];W^{1,\infty}(\mathbb{R})\bigr{)}\;,\;\varphi^{\prime\prime}\in C\bigl{(}[0,T];L^{1}(\mathbb{R})\bigr{)}\,,

φ (t) = ϵ \to 0 lim φ_{ϵ} (t) in W^{1, \infty} (R), \forall t \in [0, T],

φ (t) = ϵ \to 0 lim φ_{ϵ} (t) in W^{1, \infty} (R), \forall t \in [0, T],

φ_{ϵ} (t) = Ψ (y_{ϵ}^{i}), t \in [T - (i + 1) ϵ, T - i ϵ],

φ_{ϵ} (t) = Ψ (y_{ϵ}^{i}), t \in [T - (i + 1) ϵ, T - i ϵ],

{\frac{d y}{d t} + A y + B y = g_{1}, in [0, T] y (0) = y_{0},

{\frac{d y}{d t} + A y + B y = g_{1}, in [0, T] y (0) = y_{0},

A\,y=-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y\bigr{)}\Bigr{)}^{\prime\prime}-f\,y^{\prime}\quad\text{ in }\mathcal{D}^{\prime}(\mathbb{R}),\,\forall y\in D(A)\,,

A\,y=-\Bigl{(}H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y\bigr{)}\Bigr{)}^{\prime\prime}-f\,y^{\prime}\quad\text{ in }\mathcal{D}^{\prime}(\mathbb{R}),\,\forall y\in D(A)\,,

\displaystyle D(A)=\Bigl{\{}y\in L^{1}(\mathbb{R}):H^{*}\bigl{(}\frac{\sigma^{2}\,y}{2}\bigr{)}\in L^{\infty}(\mathbb{R}),

\displaystyle D(A)=\Bigl{\{}y\in L^{1}(\mathbb{R}):H^{*}\bigl{(}\frac{\sigma^{2}\,y}{2}\bigr{)}\in L^{\infty}(\mathbb{R}),

A y \in L^{1}

λ y + A y = η .

λ y + A y = η .

∣∣ y (η) - y (\overset{η}{ˉ}) ∣ ∣_{1} \leq (λ - λ_{0})^{- 1} ∣∣ η - \overset{η}{ˉ} ∣ ∣_{1},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Mild solutions to the dynamic programming equation for stochastic optimal control problems

Viorel Barbu A.I. Cuza University, Iasi, Romania

Chiara Benazzoli Dept. of Mathematics, University of Trento, Italy

Luca Di Persio Dept. of Computer Science, University of Verona, Italy

Abstract

We show via the nonlinear semigroup theory in $L^{1}(\mathbb{R})$ that the $1$ -D dynamic programming equation associated with a stochastic optimal control problem with multiplicative noise has a unique mild solution $\varphi\in C([0,T];W^{1,\infty}(\mathbb{R}))$ with $\varphi_{xx}\in C([0,T];L^{1}(\mathbb{R}))$ . The $n$ -dimensional case is also investigated.

Keyword: stochastic process; optimal control;

$m$ -accretive operator; Cauchy problem.

1 Introduction

Consider the following stochastic optimal control problem

[TABLE]

subject to $u\in\mathcal{U}$ and to state equation

[TABLE]

where $\mathcal{U}$ is the set of all $\{\mathcal{F}_{t}\}_{t\geq 0}$ -adapted processes $u:(0,T)\rightarrow\mathbb{R}^{+}=[0,+\infty]$ and $W:\mathbb{R}\rightarrow\mathbb{R}$ is an $1$ -D Wiener process in a probability space $(\Omega,\mathcal{F},\mathbb{P})$ , provided the natural filtration $\{\mathcal{F}_{t}\}_{t\geq 0}$ . Here $X_{0}\in\mathbb{R}$ , while $X:[0,T]\rightarrow\mathbb{R}$ is the strong solution to (2).

We would like to underline that the studied optimization problem is related to the so called stochastic volatility models, used in the financial framework, whose relevance has raised exponentially during last years. In fact such models, contrarily to the constant volatility ones as, e.g., the standard Black and Scholes approach, the Vasicek interest rate model, or the Cox-Ross-Rubistein model, allow to consider the more realistic situation of volatility levels changing in time. As an example, the latter is the case of the Heston model, see [9], where the variance is assumed to be a stochastic process following a Cox-Ingersoll-Ross (CIR) dynamic, see [10] or [4] and references therein for more recent related techniques, as well as the case of the Constant Elasticity of Variance (CEV) model, see [5], where the volatility is expressed by a power of the underlying level, which is often referred as a local stochastic volatility model. Other interesting examples, which is the object of our ongoing research particularly from the numerical point of view, include the Stochastic Alpha, Beta, Rho (SABR) model, see, e.g., [8], and models which are used to estimate the stochastic volatility by exploiting directly markets data, as happens using the GARCH approach and its variants.

Within latter frameworks and due to several macroeconomic crises that have affected different (type of) financial markets worldwide, governments decided to become active players of the game, as, e.g., in the recent case of the Volatility Control Mechanism (VCM) established for the securities, resp. for the derivatives, market established in August 2016, resp. in January 2017, within the Hong Kong Stock Exchange (HKEX) framework, see, e.g., [12, 13] and references therein for other applications and examples.

Hypotheses:

$h:\mathbb{R}\rightarrow\mathbb{R}$ is convex, continuous and $h(u)\geq\alpha_{1}\,|u|^{2}+\alpha_{2}$ , $\forall u\in\mathbb{R}$ , for some $\alpha_{1}>0,\alpha_{2}\geq 0$ . 2. 2.

$f\in C_{b}^{2}(\mathbb{R})$ , $f^{\prime\prime}\in L^{1}(\mathbb{R})$ , $g,g_{0}\in W^{2,\infty}(\mathbb{R})$ . 3. 3.

$\sigma\in C_{b}^{1}(\mathbb{R})$ , and

[TABLE]

We set

[TABLE]

and we denote by $H^{*}$ the Legendre conjugate of $H$ , namely,

[TABLE]

We have $(H^{*})^{\prime}(p)=(\partial h+N_{[0,\infty)})^{-1}p\in Lip(\mathbb{R})$ , where $\delta h$ is the subdiffential of $h$ , and $N_{[0,\infty)}$ is the normal cone to $[0,\infty)$ . This yields

[TABLE]

We denote also by $j$ the potential of $H^{*}$ , that is

[TABLE]

The dynamic programming equation corresponding to the stochastic optimal control problem (1) is given by (see, e.g., [7],[11]),

[TABLE]

or equivalently

[TABLE]

Moreover, if $\varphi$ is a smooth solution to (6) the associated feedback controller

[TABLE]

is optimal for problem (1).

Up to our knowledge, in literature the rigorous treatment of existence theory for equation (6) has been shown, so far within the theory of viscosity solutions only. (See, e.g., [6].) Here we shall exploit a different approach, namely we use a suitable transformation aiming at reducing (6) to an one dimensional Fokker-Planck equation which is then treated as a nonlinear Cauchy problem in $L^{1}(\mathbb{R})$ . The $n$ -dimensional case is also studied in section 4. As regards the non-degenerate hypothesis (3) it will be later on dispensed by assuming more regularity on function $\sigma$ . (See section 4 below.)

1.1 Notation and basic results

We shall use the standard notation for functional spaces on $\mathbb{R}$ . In particular $C^{k}_{b}(\mathbb{R})$ is the space of functions $y:\mathbb{R}\rightarrow\mathbb{R}$ , differentiable of order $k$ and with bounded derivatives until order $k$ . By $L^{p}(\mathbb{R})$ , $1\leq p\leq\infty$ , we denote the classical space of Lebesgue-measurable $p$ -integrable functions on $\mathbb{R}$ with the norm $\left\lVert\cdot\right\rVert_{p}$ and by $H^{k}(\mathbb{R}^{n})$ , $W^{k,p}(\mathbb{R}^{n})$ , $k=1,2$ , the standard Sobolev spaces on $\mathbb{R}^{n}$ , $n=1,2$ . We set also $y_{x}=y^{\prime}=\partial y/\partial x$ , $y_{t}=\partial y/\partial t$ , $y_{xx}=\partial^{2}y/\partial x^{2}$ , for $x\in\mathbb{R}$ and $\Delta y(x)=\sum_{i=1}^{n}\frac{\partial^{2}y}{\partial x_{i}^{2}}$ , for $x\in\mathbb{R}^{n}$ . By $\mathcal{D}^{\prime}(\mathbb{R}^{n})$ we denote the space of Schwartz distributions on $\mathbb{R}^{n}$ .

Definition 1.1 (Accretive operator)

Given a Banach space $X$ , a nonlinear operator $A$ from $X$ to itself, with domain $D(A)$ , is said to be accretive if $\forall u_{i}\in D(A),\forall v_{i}\in A\,u_{i}$ , $i=1,2$ , there exists $\eta\in J(u_{1}-u_{2})$ such that

[TABLE]

*where $X^{\prime}$ is the dual space of $X$ , ${}_{X}\langle\cdot,\cdot\rangle_{X^{\prime}}$ is the duality pairing and $J:X\rightarrow X^{\prime}$ is the duality mapping of $X$ . (See, e.g., [1].)

An accretive operator $A$ is said to be $m$ -accretive if $\mathbb{R}(\lambda\,I+A)=X$ for all (equivalently some) $\lambda>0$ , while it is said to be $quasi-$ m $-accretive$ if there is $\lambda_{0}\in\mathbb{R}$ such that $\lambda_{0}\,I+A$ is $m$ -accretive.*

We refer to [1] for basic results on $m$ -accretive operators in Banach spaces and the corresponding associated Cauchy problem.

2 Existence results

We set

[TABLE]

and we rewrite eq. (7) as

[TABLE]

We recall (see [3] for details), that, for $z\in L^{1}(\mathbb{R})$ , the equation

[TABLE]

has a unique solution $\Psi=\Phi(z)\in W^{1,\infty}(\mathbb{R})$ and $\|\Psi\|_{W^{1,\infty}(\mathbb{R})}\leq C\|z\|_{1}$ . Then by (10) we have

[TABLE]

Setting

[TABLE]

and taking into account that $f^{\prime}\in L^{\infty}(\mathbb{R})$ , $f^{\prime\prime}\in L^{1}(\mathbb{R})$ , and $\|(\Phi(y))^{\prime}\|_{\infty}\leq\|\Phi\|_{W^{1,\infty}(\mathbb{R})}\leq C\|y\|_{1}$ , we obtain for operator $B$ the estimate

[TABLE]

Therefore eq. (11) can be rewritten as follows

[TABLE]

where $y_{0}=-g_{0}^{\prime\prime}$ and $g_{1}=-g^{\prime\prime}$ in $\mathcal{D}^{\prime}(\mathbb{R})$ .

Definition 2.1

The function $y\colon[0,T]\times\mathbb{R}\to\mathbb{R}$ is said to be a mild solution to equation (16) if $y\in C([0,T];L^{1}(\mathbb{R}))$ and

[TABLE]

We have

Theorem 2.2

Under hypotheses (1)-(3) eq. (11) has a unique mild solution $y$ . Assume further that $j(\frac{\sigma^{2}}{2}\,y_{0})\in L^{1}(\mathbb{R})$ . Then $j(\frac{\sigma^{2}}{2}\,y_{\epsilon})\in L^{\infty}([0,T];L^{1}(\mathbb{\mathbb{R}}))$ and $\left(H^{*}(\frac{\sigma^{2}}{2}\,y)\right)_{x}\in L^{2}([0,T]\times\mathbb{R})$ .

Theorem 2.2 will be proven by using the standard existence theory for the Cauchy problem in Banach spaces with nonlinear quasi- $m$ -accritive operators. Now taking into account that for $y\in C([0,T];L^{1}(\mathbb{R}))$ equation (12) uniquely defines the function $\varphi\in C([0,T];W^{1,\infty}(\mathbb{R}))$ , by Theorem 2.2 we obtain the following existence result for the dynamic programming equation (6).

Theorem 2.3

Under hypothesis (1)-(3) there is a unique mild solution

[TABLE]

to equation (6). Moreover, if $h(\lambda u)\leq C_{\lambda}h(u)$ $\forall u\in\mathbb{R}$ , $\lambda>0$ and $j(-\frac{\sigma^{2}}{2}\,g^{\prime\prime}_{0})\in L^{1}(\mathbb{R})$ , then $H^{*}\bigl{(}-\frac{\sigma^{2}}{2}\,\varphi_{xx}(T-t,x)\bigr{)}\in L^{2}([0,T]\times\mathbb{R})$ .

According to the Definition 2.1 and (13), by mild solution $\varphi$ to equation (6), we mean a function $\varphi\in C([0,T];W^{1,\infty}(\mathbb{R}))$ defined by

[TABLE]

for $i=0,1,\dots,N=\left[\frac{T}{\epsilon}\right]$ and $\{y_{\epsilon}^{i}\}$ is the solution to (19).

In particular, the mild solution $\varphi$ to equation (6) is in $H_{\text{loc}}^{2}(\mathbb{R})\cap W^{1,\infty}(\mathbb{R})$ . Therefore, the feedback controller (8) is well defined on $[0,T]$ .

Remark 2.4

The principal advantage of Theorem 2.2 compared with standard existence results expressed in terms of viscosity solutions is the regularity of $\varphi$ and the fact that the optimal feedback controller can be computed explicitly by the finite difference scheme (21)-(22). This will be treated in a forthcoming paper.

3 Proof of Theorem 2.2

The idea is to write equation (16) as a Cauchy problem of the form

[TABLE]

in the space $L^{1}(\mathbb{R})$ , where $A$ is a suitable nonlinear quasi- $m$ -accretive operator. The operator $A:D(A)\subset L^{1}(\mathbb{R})\rightarrow L^{1}(\mathbb{R})$ is defined as follows

[TABLE]

where the derivatives are taken in the $\mathcal{D}^{\prime}(\mathbb{R})$ sense.

Lemma 3.1

For each $\eta\in L^{1}(\mathbb{R})$ and $\lambda\geq\lambda_{0}=||f^{\prime}||_{\infty}$ there exists a unique solution $y=y(\eta)$ to equation

[TABLE]

Moreover, it holds

[TABLE]

$\forall\eta,\bar{\eta}\in L^{1}(\mathbb{R}),\lambda>\lambda_{0}$ , hence $A$ turns to be quasi- $m$ -accretive in $L^{1}(\mathbb{R})$ .

Proof. [Proof of Lemma 3.1] Assume first that $\eta\in L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ . For each $\nu>0$ consider the equation

[TABLE]

in $\mathcal{D}^{\prime}(\mathbb{R})$ . Equivalently,

[TABLE]

where $z=\bigl{(}\nu\,I-\frac{d^{2}}{dx^{2}}\bigr{)}^{-1}\,y$ is defined by equation

[TABLE]

Note that by Hypothesis (2) the operator $\Gamma\,y=(\lambda-\nu^{2})\bigl{(}\nu I-\frac{d^{2}}{dx^{2}}\bigr{)}^{-1}y-\bigl{(}\nu I-\frac{d^{2}}{dx^{2}}\bigr{)}^{-1}(fy^{\prime})+\nu y$ is linear continuous in $L^{2}(\mathbb{R})$ and by (29) we have that

[TABLE]

Here $||\cdot||_{2}$ and $\langle\cdot,\cdot\rangle_{2}$ are the norm and the scalar product in $L^{2}(\mathbb{R})$ , respectively, and by $||\cdot||_{p}$ , $1\leq p\leq\infty$ we denote the norm of $L^{p}(\mathbb{R})$ . We note that Hypothesis (1) and (4) imply that the function $H^{*}$ is continuous, monotonically non–decreasing, and

[TABLE]

Furthermore, by (29)-(31), we have

[TABLE]

The latter yields

[TABLE]

where $C$ is dependent on $\nu$ . By assumption (3) we have that the operator $y\rightarrow\mathcal{H}(y)\equiv H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y\bigr{)}$ is maximal monotone in $L^{2}(\mathbb{R})$ , hence, by (33), $\Gamma$ is maximal monotone and coercive, i.e. positively definite, therefore we have

[TABLE]

for $\lambda\geq\lambda^{*}=C(\frac{1}{\nu}+\nu^{2})$ . Consequently, for each $\nu>0$ and $\lambda\geq\lambda^{*}$ , eq. (28) (equivalently eq. (27)) has a unique solution $y=y_{\lambda,\nu}\in L^{2}(\mathbb{R})$ , with $H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,y_{\lambda,\nu}\bigr{)}\in L^{2}(\mathbb{R})$ .

We have also

[TABLE]

so that $z_{\lambda,\nu}\in H^{2}(\mathbb{R})$ .

Since by assumption (3) the operator $z\rightarrow\nu z+H^{*}\bigl{(}\frac{\sigma^{2}}{2}\,z\bigr{)}$ is invertible in $L^{2}(\mathbb{R})$ , and its inverse maps inverse $H^{1}(\mathbb{R})$ into itself, we infer that $y_{\lambda,\nu}\in H^{1}(\mathbb{R})$ .

It is worth to mention that by (27), we have

[TABLE]

$\forall\eta,\bar{\eta}\in L^{1}(\mathbb{R})$ , so that

[TABLE]

$\forall\eta,\bar{\eta}\in L^{1}(\mathbb{R})$ , for $\lambda\geq\max\left(\lambda_{0},\lambda^{*}\right)$ and where $\lambda_{0}=\left\lVert f^{\prime}\right\rVert_{\infty}$ . To get (34), we simply multiply the equation

[TABLE]

by $\zeta\in L^{\infty}(\mathbb{R})$

[TABLE]

where $\operatorname{sgn}r=\frac{r}{\mid r\mid}$ for $r\neq 0$ , $\operatorname{sgn}0=[-1,1]$ and we integrate on $\mathbb{R}$ , taking into account that

[TABLE]

For a rigorous proof of these relations we replace $\operatorname{sgn}y$ by $X_{\delta}(y)$ , where $X_{\delta}$ is a smooth approximation of signum function, while $\delta\to 0$ , see , e.g., [1], p. 115. If $\eta\in L^{1}(\mathbb{R})$ and $\{\eta_{n}\}_{n=1}^{\infty}\subset L^{1}(\mathbb{R})\cap L^{2}(\mathbb{R})$ is strongly convergent to $\eta\in L^{1}(\mathbb{R})$ , we can proceed as above to obtain for the corresponding solution $y_{n}$ to (27) the estimate (34), namely,

[TABLE]

Hence there exists $y\in L^{1}(\mathbb{R})$ such that

[TABLE]

By (28), we have

[TABLE]

By (12) and (29) , we have

[TABLE]

Let $\theta_{n}:=\left(\nu I-\frac{d^{2}}{dx^{2}}\right)^{-1}\left(fy_{n}^{\prime}\right)$ ,

that is $\nu\theta_{n}-\theta_{n}^{{}^{\prime\prime}}=fy_{n}^{\prime}=\left(fy_{n}\right)^{\prime}-f^{\prime}y_{n}$ in $\mathcal{D}^{\prime}\left(\mathbb{R}^{n}\right)$ . Equivalently

[TABLE]

This yields

[TABLE]

and then

[TABLE]

On the other hand, by (38), we have

[TABLE]

Hence

[TABLE]

This yields

[TABLE]

and therefore, by (36), we derive the estimate

[TABLE]

Since, by hypothesis (1) $H^{*}(v)v\geq 0,\forall v\in\mathbb{R}$ , the latter implies that

[TABLE]

where $C_{1}$ is still independent of $n$ as well as on $\nu$ .

By (35) and (42), it follows that

[TABLE]

strongly in $L^{1}(\mathbb{R})$ , and therefore $y=y_{\lambda,\nu}\in L^{\infty}(\mathbb{R})\cap L^{1}(\mathbb{R})$ solves (27). Furthermore, by (34) and (42), we have

[TABLE]

$\forall\lambda>\max\left(\lambda^{*},\lambda_{0}\right)$ , where $C_{1}$ is independent of $\nu$ . We also obtain that inequality (34) holds for solution $y_{\lambda,\nu}$ to (27), with $\eta\in L^{1}(\mathbb{R})$ only. Now we are going to extend the solution $y_{\lambda,\nu}$ to (27) for all $\lambda>\lambda_{0}$ . To this end we set $G^{\nu}_{\lambda}=\Gamma+\mathcal{H}$ , rewriting (27) as follows $G^{\nu}_{\lambda}=\eta$ . For every $\lambda>0$ , we can equivalently write this as

[TABLE]

By (34) we also have

[TABLE]

then, by contraction principle, (45) has a unique solution $y=y_{\lambda,\nu}\in L^{1}(\mathbb{R})$ , for all $\lambda>\lambda_{0}$ . Estimate (44) extends for all $\lambda>\lambda_{0}$ . In order to complete the proof of Lemma 3.1, we are going to let $\nu\to 0$ in equation (27), or, more precisely, in (28) which holds for all $\lambda>\lambda_{0}$ . As noted before, for all $z\in L^{1}(\mathbb{R})$ , we have

[TABLE]

and

[TABLE]

consequently

[TABLE]

and

[TABLE]

We set $u_{\nu}=\left(\nu I-\frac{d^{2}}{dx^{2}}\right)^{-1}y_{\lambda,\nu}$ . Then, for $\nu\to 0$ , we have $\nu u_{\nu}\to 0$ in $L^{1}(\mathbb{R})$ and

[TABLE]

Hence

[TABLE]

strongly in $W^{1,\infty}(\mathbb{R})$ , and

[TABLE]

strongly in $L^{\infty}(\mathbb{R})$ , where $y\in L^{1}(\mathbb{R})$ , and

[TABLE]

for $\lambda>\lambda_{0}$ . Moreover, by (34), the map $\eta\to y$ is Lipschitz in $L^{1}(\mathbb{R})$ , with Lipschitz constant $(\lambda-\lambda_{0})^{-1}$ , then $y$ solves (25), and (26) follows. This completes the proof of Lemma 3.1.

Proof. [Proof of Theorem 2.2 (continued)] Coming back to equation (23), by Lemma 3.1 and (14), it follows that the operator $A+B$ is quasi-m-accretive in $L^{1}(\mathbb{R})$ . Then by the Crandall & Ligget theorem, see [1], p. 147, the Cauchy problem (23) has a unique mild solution $y\in C([0,T];L^{1}(\mathbb{R}))$ , that is

[TABLE]

The function $y$ is a mild solution to (16) in the sense of Definition 2.1.

Assume now that $j(\lambda v)\leq C_{\lambda}j(v)$ $\forall v\in\mathbb{R}$ and $\lambda>0$ . Taking into account that $j(v)\leq j(2v)-vH^{*}(v),\forall v\in\mathbb{R}\;,$ it is easily seen that this implies that

[TABLE]

Assume also that $j(\frac{\sigma^{2}}{2}\,y_{0})\in L^{1}(\mathbb{R})$ . Then, if we take in (19), $z^{i}=\frac{\sigma^{2}}{2}\,y^{i}_{\epsilon}$ and get

[TABLE]

Multiplying by $H^{*}(z^{i+1})$ and integrating on $\mathbb{R}$ we get

[TABLE]

Integrating by parts in $\int_{\mathbb{R}}f\left(\frac{z^{i+1}}{\sigma^{2}}\right)^{\prime}H^{*}\left(z^{i+1}\right)dy$ , summing up, after some calculation involving (14) and (47), we get the estimate $\forall k$

[TABLE]

which implies the desired conclusion

[TABLE]

4 A multi-dimensional case

Consider the problem (1) in $\mathbb{R}^{n}$ with the drift $f\equiv 0$ , namely

[TABLE]

subject to $u\in\mathcal{U}$ , and to stochastic differential equation

[TABLE]

Here $W\colon[0,T]\to\mathbb{R}^{m}$ is a Wiener process, $h:\mathbb{R}\to\mathbb{R}$ satisfies assumption (1) and

(i)

$g,g_{0}\in W^{2,\infty}(\mathbb{R}^{n};\mathbb{R})$ 2. (ii)

$\sigma(x)=\sigma_{0}(x)a$ , where $\sigma_{0}\in C^{1}_{b}(\mathbb{R})$ satisfies condition (3), while the matrix $a=\left\lVert a_{ij}\right\rVert^{n,m}_{i,j=1}$ is such that $b=aa^{T}$ is positive defined.

Let $\mathcal{L}$ be the elliptic second order operator

[TABLE]

where $b_{ij}=\sum_{k=1}^{m}a_{ik}a_{jk}$ . The corresponding dynamic programming equation for (48) reads as follows

[TABLE]

If

[TABLE]

equation (52) reduces to

[TABLE]

see (11), where $y_{0}=-\mathcal{L}g_{0}$ , $g_{1}=-\mathcal{L}g$ . By [3], for $z\in L^{1}(\mathbb{R}^{n})$ the elliptic equation $-\mathcal{L}\psi=z$ in $\mathcal{D}^{\prime}(\mathbb{R}^{n})$ has a unique solution $\psi$ which satisfies $\psi\in W^{1,\infty}(\mathbb{R})$ if $n=1$ , $\psi\in W^{1,1}_{\text{loc}}(\mathbb{R}^{2})$ if $n=2$ and $\psi\in L^{1}_{\text{loc}}(\mathbb{R})\cap M^{\frac{n}{n-2}}(\mathbb{R}^{n})$ if $n=3$ , where here $M^{\frac{n}{n-2}}(\mathbb{R}^{n})$ is the Marcinkiewicz space. The latter implies that any solution $y\in C([0,T];L^{1}(\mathbb{R}^{n}))$ to (53) leads to a unique solution $\varphi\in C([0,T];W^{1,\infty}(\mathbb{R}))$ for $n=1$ , $\varphi\in C([0,T];W^{1,1}_{\text{loc}}(\mathbb{R}^{2}))$ , for $n=2$ , and, respectively, $\varphi\in C([0,T];M^{\frac{n}{n-2}}(\mathbb{R}^{n}))$ for $n\geq 3$ . Concerning the existence of a solution to eq. (53), we have a result similar to the one stated in Theorem 2.2, namely

Theorem 4.1

Under assumption (i)-(ii)-(iii) there is a unique mild solution $y\in C([0,T];L^{1}(\mathbb{R}^{n}))$ , in the sense of Definition 2.1.

Proof. We shall proceed as in the proof of Theorem 2.2. In particular, we consider the operator $A\colon D(A)\subset L^{1}(\mathbb{R}^{n})\to L^{1}(\mathbb{R}^{n})$

[TABLE]

and we write equation (53) as

[TABLE]

Lemma 4.1

The operator $A$ is m-accretive in $L^{1}(\mathbb{R}^{n})$ .

Proof. Since the operator $-\Delta$ is m-accretive in $L^{1}(\mathbb{R}^{n})$ , see, e.g., [2, 3], then the same holds for the operator $-\mathcal{L}$ , moreover, taking into account that $\left|\sigma_{0}(x)\right|\geq\rho>0$ , it follows the m-accretivety of the operator $A$ , as claimed. Indeed, equation

[TABLE]

is equivalent to

[TABLE]

where $\beta=\frac{1}{\sigma^{2}_{0}}z$ and this implies the conclusion.

Again invoking the Crandall & Ligget Theorem, we get that the eq. (55) has a unique mild solution $y\in C([0,T];L^{1}(\mathbb{R}^{n}))$ , which is given by

[TABLE]

hence completing the proof of Theorem 4.1.

By Theorem 4.1 it follows the existence and uniqueness of a solution $\varphi\in C([0,T];L^{1}_{\text{loc}}(\mathbb{R}^{n})\cap M^{\frac{n}{n-2}}(\mathbb{R}^{n}))$ .

Remark 4.2

In the general $n$ -dimensional case, where $f\in C^{2}_{b}(\mathbb{R}^{n})$ , the dynamic programming equation corresponding to (1) reduces to

[TABLE]

where

[TABLE]

therefore eq. (56) can be treated analogously to what we have seen in the 1-dimensional case, at least if the operator $B$ is continuous in $L^{1}(\mathbb{R}^{n})$ , which happens under some additional conditions on $f=\{f_{k}\}_{k=1}^{n}$ . We note that, for $\mathcal{L}=\Delta$ , the linear Fokker-Planck equation (56), has been treated in [2].

5 The degenerate 1-D case

Consider here equation (16), that is

[TABLE]

where $\sigma$ is assumed to satisfy the condition $\sigma\in C_{b}^{2}(\mathbb{R})$ only. Moreover, if we consider, as above, the operator $A:D(A)\subset L^{1}(\mathbb{R})\rightarrow L^{1}(\mathbb{R})$ , such that

[TABLE]

we have the following holds

Lemma 5.1

$A$ * is quasi- $m$ -accretive in $L^{1}(\mathbb{R})$ .*

Proof. For each $\epsilon>0$ we consider the operator

[TABLE]

which is quasi- $m$ -accretive, seen Lemma 3.1. Hence, for each $\eta\in L^{1}(\mathbb{R})$ and $\lambda\geq\lambda_{0}$ the equation

[TABLE]

has a unique solution $y_{\epsilon}\in L^{1}(\mathbb{R})$ , with $H^{*}\Bigl{(}\frac{\sigma^{2}+\epsilon}{2}\,y_{\epsilon}\Bigr{)}\in L^{\infty}(\mathbb{R})$ .

Dynamic estimates. As in the proof of Lemma 3.1, we have

[TABLE]

that is for $\lambda>\left\lVert f^{\prime}\right\rVert_{\infty}$

[TABLE]

Assume now that $\eta\in L^{1}(\mathbb{R})\cap L^{\infty}(\mathbb{R})$ , then, by (60) we see that for each $M>0$

[TABLE]

Moreover, by (5), we also have

[TABLE]

for $M$ and $\lambda$ large enough (independently of $\epsilon$ ). This yields

[TABLE]

Hence $y_{\epsilon}\leq M$ in $\mathbb{R}$ for $\lambda>\left\lVert f^{\prime}\right\rVert_{\infty}$ . Similarly, it follows that

[TABLE]

if $M$ is large enough, but independent of $\epsilon$ . Therefore, if multiply the equation by $(y_{\epsilon}+M)^{-}$ and integrate on $\mathbb{R}$ , we get $\left\lVert(y_{\epsilon}+M)^{-}\right\rVert_{1}\geq 0$ which implies $y_{\epsilon}\geq-M$ in $\mathbb{R}$ .

By (60), we see that $\Bigl{\{}\Bigl{(}H^{*}\Bigl{(}\frac{\sigma^{2}+\epsilon}{2}\,y_{\epsilon}\Bigr{)}\Bigr{)}^{\prime}+f\,y_{\epsilon}^{\prime}\Bigr{\}}_{\epsilon>0}$ is bounded in $W^{1,\infty}(\mathbb{R})$ .

Hence $\Bigl{(}H^{*}\Bigl{(}\frac{\sigma^{2}+\epsilon}{2}\,y_{\epsilon}\Bigr{)}\Bigr{)}^{\prime}$ bounded in $L^{1}(\mathbb{R})\cap L^{\infty}(\mathbb{R})$ , so that $\Bigl{\{}\eta_{\epsilon}=H^{*}\Bigl{(}\frac{\sigma^{2}+\epsilon}{2}\,y_{\epsilon}\Bigr{)}\Bigr{\}}$ is compact in $C(\mathbb{R})$ . It follows that on a subsequence $\epsilon\rightarrow 0$ , we have

[TABLE]

where $\zeta=H^{*}\Bigl{(}\frac{\sigma^{2}}{2}\,y\Bigr{)}$ in $\mathbb{R}$ . Letting $\epsilon\rightarrow 0$ in (60), we get

[TABLE]

Next for $\eta\in L^{1}(\mathbb{R})$ we choose $\{\eta_{n}\}\subset L^{1}(\mathbb{R})\cap L^{\infty}(\mathbb{R})$ , $\eta_{n}\rightarrow\eta$ in $L^{1}(\mathbb{R})$ and we have

[TABLE]

getting

[TABLE]

Hence, for $\lambda>\left\lVert f^{\prime}\right\rVert_{\infty}$ we have for $n\rightarrow\infty$

[TABLE]

This yields

[TABLE]

Hence for $\lambda\geq\lambda_{0}$ , $y\in L^{1}(\mathbb{R})$ is the solution to equation $\lambda\,y+A\,y=\eta$ as claimed. As seen earlier this implies that the operator $A+B$ is quasi- $m$ -accretive in $L^{1}(\mathbb{R})$

Then by the existence theorem for the equation

[TABLE]

we get

Theorem 5.1

There is a unique mild solution $y\in C([0,T];\mathbb{R})$ to equation (57).

As in previous case Theorem 5.1 implies via (13) the existence of a mild solution $\varphi$ to equation (1) satisfying (20). We omit the details.

6 Conclusions

In this paper it is shown, via nonlinear semigroup theory in $L^{1}$ , both the existence and the uniqueness of a mild solution for the dynamic programming equation for stochastic optimal control problem with control in the volatility term. Latter problem is related to the analysis of controlled stochastic volatility models, within the financial frameworks, whose related computational study is the subject of our ongoing research.

Bibliography13

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Viorel Barbu. Nonlinear differential equations of monotone types in Banach spaces . Springer Science & Business Media, 2010.
2[2] Viorel Barbu. Generalized solutions to nonlinear fokker–planck equations. Journal of Differential Equations , 261(4):2446–2471, 2016.
3[3] Philippe Benilan, Haim Brezis, and Michael G Crandall. A semilinear equation in l 1 ( ℝ n ) superscript 𝑙 1 superscript ℝ 𝑛 l^{1}(\mathbb{R}^{n}) . Annali della Scuola Normale Superiore di Pisa-Classe di Scienze , 2(4):523–555, 1975.
4[4] Francesco. Cordoni and Luca Di Persio. Transition density for cir process by lie symmetries and application to zcb pricing. International Journal of Pure and Applied Mathematics , 88(2):239–246, 2013.
5[5] John C. Cox. Notes on option pricing i: Constant elasticity of diffusions. Stanford University , Unpublished draft(2), 1975.
6[6] Michael G Crandall, Hitoshi Ishii, and Pierre-Louis Lions. User’s guide to viscosity solutions of second order partial differential equations. Bulletin of the American Mathematical Society , 27(1):1–67, 1992.
7[7] Wendell H Fleming and Raymond W Rishel. Deterministic and stochastic optimal control , volume 1. Springer Science & Business Media, 2012.
8[8] P. Hagan, A. Lesniewski, and D. Woodward. Probability distribution in the sabr model of stochastic volatility. volume 110, pages 1–35, 2015.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Mild solutions to the dynamic programming equation for stochastic optimal control problems

Abstract

1 Introduction

1.1 Notation and basic results

Definition 1.1** (Accretive operator)**

2 Existence results

Definition 2.1

Theorem 2.2

Theorem 2.3

Remark 2.4

3 Proof of Theorem 2.2

Lemma 3.1

4 A multi-dimensional case

Theorem 4.1

Lemma 4.1

Remark 4.2

5 The degenerate 1-D case

Lemma 5.1

Theorem 5.1

6 Conclusions

Definition 1.1 (Accretive operator)