Non-Parametric Robust Model Risk Measurement with Path-Dependent Loss Functions

Yu Feng

arXiv:1903.00590·q-fin.MF·March 6, 2019

Open Access

TL;DR

This paper develops a comprehensive non-parametric framework for dynamic, path-dependent model risk measurement using $f$-divergences, extending existing entropic methods to more general settings.

Contribution

It generalizes the relative-entropic approach to dynamic, path-dependent losses under any $f$-divergence, providing a unified theory for model risk quantification.

Findings

01

Unified treatment of worst-case risk and divergence budget

02

Extension of entropic methods to path-dependent, dynamic settings

03

Applicable to various $f$-divergences in model risk measurement

Abstract

Understanding and measuring model risk is important to financial practitioners. However, a non-parametric approach to model risk quantification in a dynamic setting and with path-dependent losses has been lacking. We propose a complete theory generalizing the relative-entropic approach of Glasserman and Xu to the dynamic case under any $f$-divergence. It provides a unified treatment for measuring both the worst-case risk and the $f$-divergence budget that originate from the model uncertainty of an underlying state process.

Equations

\mathscr{F}^{0}_{t}\coloneqq\bigvee_{s\in[0,t]}\bigl\{X(s)^{-1}(U)\bigm|U\in\mathscr{B}(\mathbb{R}^{d})\bigr\}=\bigvee_{s\in[0,t]}\bigcup_{U\in\mathscr{B}(\mathbb{R}^{d})}\{\omega\in\Omega\mid\omega(s)\in U\},

\mathscr{F}^{0}_{0}\coloneqq\bigl\{X(0)^{-1}(U)\bigm|U\in\mathscr{B}(\mathbb{R}^{d})\bigr\}=\bigcup_{U\in\mathscr{B}(\mathbb{R}^{d})}\{\omega\in\Omega\mid\omega(0)\in U\}.

\textsf{P}\bigl(X(0)^{-1}(U)\bigr)=\textsf{P}(\{\omega\in\Omega\mid\omega(0)\in U\})=\begin{cases}1&\text{if $0\in U$};\\ 0&\text{if $0\notin U$},\end{cases}

(t,\omega)\sim(t^{\prime},\omega^{\prime})\text{ if and only if }t=t^{\prime}\text{ and }\omega_{t}=\omega^{\prime}_{t^{\prime}},

d_{\infty}\bigl((t,\omega),(t^{\prime},\omega^{\prime})\bigr)\coloneqq\sup_{s\in[0,T]}|\omega(s\wedge t)-\omega^{\prime}(s\wedge t^{\prime})|+|t-t^{\prime}|=\|\omega_{t}-\omega^{\prime}_{t^{\prime}}\|_{\infty}+|t-t^{\prime}|,

\mathscr{M}_{+}(1)\coloneqq\{Z\in\mathscr{M}\mid Z\geqslant 0\text{ and }Z(0)=1\}

Z(t)\coloneqq\textsf{E}\biggl(\frac{\mathrm{d}\textsf{Q}}{\mathrm{d}\textsf{P}}\biggm|\mathscr{F}^{0}_{t}\biggr),

D_{f}(\textsf{Q}\|\textsf{P})\coloneqq\textsf{E}\biggl(f\biggl(\frac{\mathrm{d}\textsf{Q}}{\mathrm{d}\textsf{P}}\biggr)\biggr)

\mathcal{Z}_{\eta}\coloneqq\{Z\in\mathscr{M}_{+}(1)\mid D_{f}(\textsf{Q}_{Z}\|\textsf{P})\leqslant\eta\},

\textsf{P}\bigl(X(0)^{-1}\{0\}\bigr)=\textsf{P}(\{\omega\in\Omega\mid\omega(0)=0\})=1,

\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)\qquad\text{and}\qquad\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)-\textsf{E}\bigl(\ell(T,\,\cdot\,)\bigr).

\mathcal{L}(Z,\vartheta,\eta)\coloneqq\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)-\frac{D_{f}(\textsf{Q}_{Z}\|\textsf{P})-\eta}{\vartheta}=\textsf{E}^{\textsf{Q}_{Z}}\biggl(\ell(T,\,\cdot\,)-\frac{f\bigl(Z(T)\bigr)}{\vartheta Z(T)}\biggr)+\frac{\eta}{\vartheta},

d(\vartheta,\eta)\coloneqq\sup_{Z\in\mathscr{M}_{+}(1)}\mathcal{L}(Z,\vartheta,\eta)

\widehat{\ell}_{\vartheta}(t,Z)\coloneqq\ell(t,\,\cdot\,)-\frac{f\bigl(Z(t)\bigr)}{\vartheta Z(t)}

\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)=\inf_{\vartheta\in(0,\infty)}d(\vartheta,\eta)

Z^{*}=\arg\max_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}_{\vartheta^{*}}(T,Z)\bigr)

Z^{*}=\arg\max_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)

D_{f}(\textsf{Q}_{\lambda Z_{1}+(1-\lambda)Z_{2}}\|\textsf{P})=D_{f}\bigl(\lambda\textsf{Q}_{Z_{1}}+(1-\lambda)\textsf{Q}_{Z_{2}}\bigm\|\textsf{P}\bigr)

\leqslant\textsf{E}\bigl(\lambda f\bigl(Z_{1}(T)\bigr)+(1-\lambda)f\bigl(Z_{2}(T)\bigr)\bigr)

=\lambda\textsf{E}\bigl(f\bigl(Z_{1}(T)\bigr)\bigr)+(1-\lambda)\textsf{E}\bigl(f\bigl(Z_{2}(T)\bigr)\bigr)

=\lambda D_{f}(\textsf{Q}_{Z_{1}}\|\textsf{P})+(1-\lambda)D_{f}(\textsf{Q}_{Z_{2}}\|\textsf{P}),

\textsf{E}^{\textsf{Q}_{\lambda Z_{1}+(1-\lambda)Z_{2}}}\bigl(\ell(T,\,\cdot\,)\bigr)

=\lambda\textsf{E}\bigl(Z_{1}(T)\ell(T,\,\cdot\,)\bigr)+(1-\lambda)\textsf{E}\bigl(Z_{2}(T)\ell(T,\,\cdot\,)\bigr)

=\lambda\textsf{E}^{\textsf{Q}_{Z_{1}}}\bigl(\ell(T,\,\cdot\,)\bigr)+(1-\lambda)\textsf{E}^{\textsf{Q}_{Z_{2}}}\bigl(\ell(T,\,\cdot\,)\bigr),

S\subseteq\{Z\in\mathscr{M}_{+}(1)\mid D_{f}(\textsf{Q}_{Z}\|\textsf{P})<\eta\}\subseteq\mathcal{Z}_{\eta}

\inf_{\vartheta\in(0,\infty)}d(\vartheta,\eta)\leqslant d(\vartheta^{*},\eta)

=\sup_{Z\in\mathscr{M}_{+}(1)}\mathcal{L}(Z,\vartheta^{*},\eta)

=\sup_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}_{\vartheta^{*}}(T,Z)\bigr)+\frac{\eta}{\vartheta^{*}}

=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\widehat{\ell}_{\vartheta^{*}}(T,Z^{*})\bigr)+\frac{\eta}{\vartheta^{*}}

=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\ell(T,\,\cdot\,)\bigr)-\frac{\textsf{E}\bigl(f\bigl(Z^{*}(T)\bigr)\bigr)-\eta}{\vartheta^{*}}

=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\ell(T,\,\cdot\,)\bigr)

\leqslant\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)

\inf_{\vartheta\in(0,\infty)}d(\vartheta,\eta)=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\ell(T,\,\cdot\,)\bigr)=\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)

\max_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}_{\vartheta}(T,Z)\bigr)

\mathcal{Z}(t,\bar{Z})\coloneqq\{Z\in\mathscr{M}_{+}(1)\mid Z(t)=\bar{Z}(t)\}.

Z(s)=\textsf{E}\bigl(Z(t)\bigm|\mathscr{F}^{0}_{s}\bigr)=\textsf{E}\bigl(\bar{Z}(t)\bigm|\mathscr{F}^{0}_{s}\bigr)=\bar{Z}(s),

Taxonomy

Topics: Risk and Portfolio Optimization · Monetary Policy and Economic Impact · Market Dynamics and Volatility

Full text

Non-Parametric Robust Model Risk Measurement with Path-Dependent Loss Functions

Yu Feng

Finance Discipline Group

University of Technology Sydney

P.O. Box 123

Broadway, NSW 2007

Australia

[email protected]

Abstract.

Understanding and measuring model risk is important to financial practitioners. However, a non-parametric approach to model risk quantification in a dynamic setting and with path-dependent losses has been lacking. We propose a complete theory generalizing the relative-entropic approach of Glasserman and Xu (2014) to the dynamic case under any $f$-divergence. It provides a unified treatment for measuring both the worst-case risk and the $f$-divergence budget that originate from the model uncertainty of an underlying state process.

1. Introduction

As a working definition, model risk refers to the quantification of unanticipated losses resulting from the use of inappropriate models to value and manage financial securities, including widely traded securities like stocks and bonds, for which market prices are readily available, and less traded derivatives written on such securities. Unlike other financial risks, which are concerned with the impact of randomness within the paradigm of a chosen model, model risk is concerned with the possibility that the wrong modelling paradigm was chosen in the first place. This makes it a much more challenging proposition, both conceptually and in terms of implementation. It is thus unsurprising that model risk continues to languish behind its more traditional counterparts, such as price risk, interest rate risk and credit risk, both in terms of identifying an appropriate theoretical methodology and in the development of specific metrics.

A simple approach to accounting for model uncertainty is to assign weights to alternative models and then calculate the average market risk (Branger and Schlag 2004). Perhaps a better way is to separate the model risk component from the market risk component. In addition, from the risk management point of view, one may be more interested in the worst-case scenario than in the average scenario. Kerkhof et al. (2002) proposed a risk-differencing measure that separates the market risk under the worst-case model from the nominal market risk. Following the worst-case approach, Cont (2006) formulated a quantitative framework for measuring the model risk in derivative pricing. This approach applies to a parametric set of alternative measures which price some benchmark instruments within their respective bid-ask spreads. Following Cont’s work, Gupta et al. (2010) proposed defining the spread of a contingent claim as the set of prices given by all legitimate models. Bannör and Scherer (2013) proposed a parametric risk framework that unifies the proposals of Cont (2006), Gupta et al. (2010) and Lindström (2010). This approach incorporates a distribution of parameter values to capture the risk of parameter uncertainty, resulting in bid-ask spreads in instruments that face parameter risk. Detering and Packham (2016) approached the problem of model risk measurement based on the residual profit and loss from hedging in the reference model. Kerkhof et al. (2010) proposed a procedure to take model risk into account when computing capital reserves. Instead of formulating model risk in terms of a collection of probability measures, they consider the reality that practitioners may evaluate risk based on models of different natures. From a practical point of view, Boucher et al. (2014) proposed an approach that incorporates model risk into the usual market risk measures.

The approaches described above are parametric in the sense that they consider alternative models parametrised by a finite set of parameters. To go beyond that, Glasserman and Xu (2014) proposed a non-parametric approach. Under this framework, a worst-case model is found among alternative models in a neighborhood of a reference model. Glasserman and Xu adopted the relative entropy (or the Kullback-Leibler divergence) to measure the distance between the probability measure given by the reference model and an (equivalent) alternative measure. By imposing a constraint on the relative entropy budget, the set of legitimate alternative models is defined in a non-parametric fashion, and the worst-case scenario can then be solved analytically within a finite distance of the reference model. This approach is formulated with respect to the distribution of a state variable, and is thus less applicable when the state variable evolves dynamically. In this paper, we extend it conceptually to the problem of measuring model risk with respect to a state process. We solve the problem in a dual formulation and handle its path-dependency with the help of the functional Itô calculus (Cont 2016). The constraint that defines the legitimate alternative models is expressed in terms of the $f$-divergence, a more general choice than the Kullback-Leibler divergence.

2. Problem Formulation

Fix $T\in(0,\infty)$ and $d\in\mathbb{N}$ , and let $\Omega\coloneqq D([0,T],\mathbb{R}^{d})$ denote the set of càdlàg paths $\omega:[0,T]\rightarrow\mathbb{R}^{d}$ . Let $[0,T]\ni t\mapsto X(t)$ be the canonical process on $\Omega$ , which means to say that $X(t)(\omega)\coloneqq\omega(t)$ , for all $(t,\omega)\in[0,T]\times\Omega$ . Let $\mathfrak{F}^{0}=(\mathscr{F}^{0}_{t})_{t\in[0,T]}$ denote the filtration on $\Omega$ generated by $X$ , which is to say that

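$\displaystyle\mathscr{F}^{0}_{t}\coloneqq\bigvee_{s\in[0,t]}\bigl\{X(s)^{-1}(U)\bigm|U\in\mathscr{B}(\mathbb{R}^{d})\bigr\}=\bigvee_{s\in[0,t]}\bigcup_{U\in\mathscr{B}(\mathbb{R}^{d})}\{\omega\in\Omega\mid\omega(s)\in U\},$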

for all $t\in[0,T]$ . In particular,

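$\displaystyle\mathscr{F}^{0}_{0}\coloneqq\bigl\{X(0)^{-1}(U)\bigm|U\in\mathscr{B}(\mathbb{R}^{d})\bigr\}=\bigcup_{U\in\mathscr{B}(\mathbb{R}^{d})}\{\omega\in\Omega\mid\omega(0)\in U\}.$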

Fix a reference probability measure P on $(\Omega,\mathscr{F}^{0}_{T})$ , subject to the condition

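$\displaystyle\textsf{P}\bigl(X(0)^{-1}(U)\bigr)=\textsf{P}(\{\omega\in\Omega\mid\omega(0)\in U\})=\begin{cases}1&\text{if $0\in U$};\\ 0&\text{if $0\notin U$},\end{cases}$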

for all $U\in\mathscr{B}(\mathbb{R}^{d})$ , which is to say that almost all paths start at zero under P. Note that this condition ensures that $\textsf{P}(A)=0$ or $\textsf{P}(A)=1$ , for all $A\in\mathscr{F}^{0}_{0}$ .

To be consistent with the notation in Cont (2016), we shall write $\omega_{t}\coloneqq\omega(t\wedge\cdot)\in\Omega$ to denote the path $\omega\in\Omega$ stopped at time $t\in[0,T]$ . We impose an equivalence relation $\sim$ on $[0,T]\times\Omega$ , by specifying that

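$\displaystyle(t,\omega)\sim(t^{\prime},\omega^{\prime})\quad\text{if and only if}\quad t=t^{\prime}\text{ and }\omega_{t}=\omega^{\prime}_{t^{\prime}},$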

for all $(t,\omega),(t^{\prime},\omega^{\prime})\in[0,T]\times\Omega$ . That is to say, two pairs, each consisting of a time and a path, are equivalent if the times are equal and the corresponding stopped paths are the same. The quotient set $\Lambda_{T}^{d}\coloneqq[0,T]\times\Omega\,/\!\sim$ forms a complete metric space, when endowed with the metric $d_{\infty}:(\Lambda_{T}^{d})^{2}\rightarrow\mathbb{R}_{+}$ , defined by

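$\displaystyle d_{\infty}\bigl((t,\omega),(t^{\prime},\omega^{\prime})\bigr)\coloneqq\sup_{s\in[0,T]}|\omega(s\wedge t)-\omega^{\prime}(s\wedge t^{\prime})|+|t-t^{\prime}|=\|\omega_{t}-\omega^{\prime}_{t^{\prime}}\|_{\infty}+|t-t^{\prime}|,$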

for all $(t,\omega),(t^{\prime},\omega^{\prime})\in\Lambda_{T}^{d}$ . We refer to $(\Lambda_{T}^{d},d_{\infty})$ as the space of stopped paths.
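The stopped-path metric can be made concrete on a time grid. The following is a minimal sketch for $d=1$, assuming paths sampled at equally spaced grid points; the function names `stopped` and `d_inf`, the grid step `dt` and the two sample paths are hypothetical illustrations, not part of the paper.

```python
def stopped(path, k):
    """Freeze a sampled path at grid index k: omega_t(s) = omega(min(s, t))."""
    return [path[min(i, k)] for i in range(len(path))]


def d_inf(k1, path1, k2, path2, dt):
    """Discrete analogue of d_infinity between (t1, omega1) and (t2, omega2), t = k * dt."""
    a = stopped(path1, k1)
    b = stopped(path2, k2)
    sup_gap = max(abs(x - y) for x, y in zip(a, b))
    return sup_gap + abs(k1 - k2) * dt


# Two hypothetical paths on the grid {0, 0.25, 0.5, 0.75, 1} that agree up to index 2.
dt = 0.25
omega_a = [0.0, 1.0, 0.5, 2.0, 3.0]
omega_b = [0.0, 1.0, 0.5, -1.0, 0.0]

same_class = d_inf(2, omega_a, 2, omega_b, dt)  # same time, same stopped path
later_time = d_inf(2, omega_a, 4, omega_a, dt)  # same path, different stopping times
```

Here `same_class` vanishes because the two pairs are equivalent under $\sim$, while `later_time` picks up both the sup-norm gap between the stopped paths and the time gap.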

A measurable function $F:\Lambda_{T}^{d}\rightarrow\mathbb{R}$ is called a non-anticipative functional, where $\Lambda_{T}^{d}$ is endowed with the Borel sigma-algebra generated by $d_{\infty}$ and $\mathbb{R}$ is endowed with the Borel sigma-algebra generated by the usual Euclidean metric. Since $(t,\omega)\sim(t,\omega_{t})$ , for all $(t,\omega)\in[0,T]\times\Omega$ , we may regard a non-anticipative functional $F:\Lambda_{T}^{d}\rightarrow\mathbb{R}$ as an appropriately measurable function $F:[0,T]\times\Omega\rightarrow\mathbb{R}$ that satisfies the condition $F(t,\omega)=F(t,\omega_{t})$ . That is to say, the value of a non-anticipative functional, when applied to a particular time and path, depends only on the behaviour of the path up to the time. Note that $({F}(t,\,\cdot\,))_{t\in[0,T]}$ is a progressively measurable process, adapted to the filtration $\mathfrak{F}^{0}$ .
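As an illustration of the condition $F(t,\omega)=F(t,\omega_{t})$, the sketch below compares a running maximum (which depends only on the path up to time $t$) with a functional that reads the terminal value (which does not). The discretisation and the names `running_max` and `terminal_value` are assumptions made for this example only.

```python
def stopped(path, k):
    """Freeze a sampled path at grid index k."""
    return [path[min(i, k)] for i in range(len(path))]


def running_max(k, path):
    """F(t, omega) = max of omega(s) over s <= t; uses the path only up to index k."""
    return max(path[: k + 1])


def terminal_value(k, path):
    """G(t, omega) = omega(T); ignores t and looks into the future."""
    return path[-1]


omega = [0.0, 1.0, 0.5, 2.0, 3.0]  # hypothetical sampled path
k = 2

# Non-anticipative: the value is unchanged when the path is stopped at k.
running_max_ok = running_max(k, omega) == running_max(k, stopped(omega, k))
# Anticipative: stopping the path changes the value.
terminal_ok = terminal_value(k, omega) == terminal_value(k, stopped(omega, k))
```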

Let $\mathscr{M}$ denote the family of (right-continuous versions of) martingales on the filtered probability space $(\Omega,\mathscr{F}^{0}_{T},\mathfrak{F}^{0},\textsf{P})$ , over the compact time-interval $[0,T]$ , and let

$\displaystyle\mathscr{M}_{+}(1)\coloneqq\{Z\in\mathscr{M}\mid Z\geqslant 0\text{ and }Z(0)=1\}$

denote the sub-family of non-negative martingales starting at one. Each $Z\in\mathscr{M}_{+}(1)$ defines a probability measure $\textsf{{Q}}_{Z}$ on $(\Omega,\mathscr{F}^{0}_{T})$ satisfying $\textsf{{Q}}_{Z}\ll\textsf{P}$ (i.e. $\textsf{{Q}}_{Z}$ is absolutely continuous w.r.t P), according to the recipe $\textsf{{Q}}_{Z}(A)\coloneqq\textsf{E}\bigl{(}\mathbf{1}_{A}Z(T)\bigr{)}$ , for all $A\in\mathscr{F}^{0}_{T}$ . Conversely, each probability measure Q on $(\Omega,\mathscr{F}^{0}_{T})$ satisfying $\textsf{{Q}}\ll\textsf{P}$ can be written as $\textsf{{Q}}=\textsf{{Q}}_{Z}$ , where $Z\in\mathscr{M}_{+}(1)$ is determined by

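$\displaystyle Z(t)\coloneqq\textsf{E}\biggl(\frac{\mathrm{d}\textsf{Q}}{\mathrm{d}\textsf{P}}\biggm|\mathscr{F}^{0}_{t}\biggr),$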

for all $t\in[0,T]$ .
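The correspondence between densities and measures can be checked by hand on a finite example. The sketch below uses a hypothetical two-step tree with four paths, a uniform reference measure and an arbitrary alternative measure; none of these numbers come from the paper.

```python
from itertools import product

# Hypothetical two-step tree: each path is a pair of +/-1 moves.
paths = list(product((-1, 1), repeat=2))
p = {w: 0.25 for w in paths}                                     # reference measure P
q = {(-1, -1): 0.1, (-1, 1): 0.2, (1, -1): 0.3, (1, 1): 0.4}     # alternative measure Q
z_T = {w: q[w] / p[w] for w in paths}                            # Z(T) = dQ/dP


def density_at(t, prefix):
    """Z(t) = E(dQ/dP | F_t) on the event that the first t moves equal `prefix`."""
    block = [w for w in paths if w[:t] == prefix]
    mass = sum(p[w] for w in block)
    return sum(p[w] * z_T[w] for w in block) / mass


def q_measure(event):
    """Q_Z(A) = E(1_A Z(T)) for an event A given as a list of paths."""
    return sum(p[w] * z_T[w] for w in event)


up_first = [w for w in paths if w[0] == 1]
z0 = density_at(0, ())
z1_up = density_at(1, (1,))
q_up = q_measure(up_first)
```

In this example `z0` equals one (the density starts at one), and `q_up` recovers the probability that Q assigns to an up-move at the first step.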

Consider a twice-differentiable strictly convex function $f:\mathbb{R}_{+}\rightarrow\mathbb{R}$ satisfying $f(1)=0$ . For any probability measure Q on $(\Omega,\mathscr{F}^{0}_{T})$ satisfying $\textsf{{Q}}\ll\textsf{P}$ , the $f$ -divergence of Q with respect to P is defined by

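$\displaystyle D_{f}(\textsf{Q}\|\textsf{P})\coloneqq\textsf{E}\biggl(f\biggl(\frac{\mathrm{d}\textsf{Q}}{\mathrm{d}\textsf{P}}\biggr)\biggr)$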

(see Basseville 2013, Section 2). Intuitively, $f$ -divergence provides a measure of the distance between two probability measures. Hence, the set

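$\displaystyle\mathcal{Z}_{\eta}\coloneqq\{Z\in\mathscr{M}_{+}(1)\mid D_{f}(\textsf{Q}_{Z}\|\textsf{P})\leqslant\eta\},$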

where $\eta\geqslant 0$ , corresponds to the family of absolutely continuous probability measures that are close to the reference probability measure P.
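For a finite sample space the definition of the $f$-divergence reduces to a weighted sum. The sketch below evaluates it for two common generators, $f(x)=x\log x$ (Kullback-Leibler) and $f(x)=(x-1)^{2}$ (chi-square); both satisfy $f(1)=0$ and are strictly convex. The function names and the two-point distributions are assumptions chosen for illustration.

```python
import math


def f_divergence(q, p, f):
    """D_f(Q || P) = E_P[f(dQ/dP)] on a finite sample space with p > 0 everywhere."""
    return sum(pi * f(qi / pi) for qi, pi in zip(q, p))


def f_kl(x):
    """Generator of the Kullback-Leibler divergence, with the convention 0 log 0 = 0."""
    return x * math.log(x) if x > 0 else 0.0


def f_chi2(x):
    """Generator of the chi-square divergence."""
    return (x - 1.0) ** 2


p = [0.5, 0.5]    # hypothetical reference measure
q = [0.75, 0.25]  # hypothetical alternative measure

kl = f_divergence(q, p, f_kl)
chi2 = f_divergence(q, p, f_chi2)
in_budget = chi2 <= 0.3  # membership of Q in the chi-square ball with eta = 0.3
```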

Finally, fix a non-anticipative functional $\ell:\Lambda_{T}^{d}\rightarrow\mathbb{R}$ satisfying $\ell(0,0)=0$. We shall interpret $\ell(t,\omega)$ as the cumulative realized loss up to time $t$, incurred by a portfolio of financial securities. The state of the portfolio is completely determined by the path $\omega\in\Omega$. The condition imposed on the reference probability measure guarantees that

$\displaystyle\textsf{P}\bigl(X(0)^{-1}\{0\}\bigr)=\textsf{P}(\{\omega\in\Omega\mid\omega(0)=0\})=1.$

It follows that $\ell(0,\,\cdot\,)=0$ P-a.s. That is to say, the initial realized loss incurred by the portfolio is zero under the reference probability measure. If we interpret P as the probability measure associated with a nominal model for the dynamics of the portfolio, then $\textsf{E}\bigl{(}\ell(T,\,\cdot\,)\bigr{)}$ gives the expected total loss under the nominal model. In financial applications, we usually set the terminal time $T$ as the point when the entire portfolio gets liquidated, thus realizing the cumulative loss.

Suppose, now, that there is some uncertainty about which model best describes the portfolio. In particular, suppose that each probability measure determined by a member of $\mathcal{Z}_{\eta}$, for some $\eta\geqslant 0$, corresponds to a plausible model for the dynamics of the portfolio. [Footnote 1: The idea here is that all absolutely continuous probability measures close enough to the reference measure (in the sense of $f$-divergence) correspond to models that are plausibly close to the reference model.] In that case, a risk manager would be interested in the following quantities:

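$\displaystyle\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)\qquad\text{and}\qquad\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)-\textsf{E}\bigl(\ell(T,\,\cdot\,)\bigr).$ (2.2)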

The former expression may be regarded as the worst-case expected loss suffered by the portfolio under all plausible models, while the latter expression quantifies the difference between the worst-case expected loss and the expected loss under the default model. As such, it serves as a measure of model risk.
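The two quantities can be computed by brute force in the simplest possible setting. The sketch below assumes a two-point sample space, the Kullback-Leibler divergence as the choice of $f$-divergence, and a grid search over alternative measures; all names and numbers are hypothetical and the grid search is used only because it mirrors the definition directly.

```python
import math


def kl(q, p):
    """Kullback-Leibler divergence of Q from P on a finite sample space."""
    return sum(qi * math.log(qi / pi) for qi, pi in zip(q, p) if qi > 0)


def worst_case_loss(losses, p, eta, grid=10001):
    """Largest E^Q(loss) over Q = (1 - u, u) on a grid, subject to KL(Q || P) <= eta."""
    best = sum(pi * li for pi, li in zip(p, losses))  # Q = P is always feasible
    for i in range(grid):
        u = i / (grid - 1)
        q = (1.0 - u, u)
        if kl(q, p) <= eta:
            best = max(best, sum(qi * li for qi, li in zip(q, losses)))
    return best


p = (0.5, 0.5)       # nominal model
losses = (0.0, 1.0)  # terminal loss in each state
nominal = sum(pi * li for pi, li in zip(p, losses))

worst = worst_case_loss(losses, p, eta=0.05)
model_risk = worst - nominal  # second quantity: worst case minus nominal expectation
```

With a zero budget the search returns the nominal expected loss, and the model-risk measure grows as the budget $\eta$ is enlarged.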

The problem defined in (2.2) may be formulated in dual form (Glasserman and Xu 2014). We first define the Lagrangian $\mathcal{L}:\mathscr{M}_{+}(1)\times(0,\infty)\times(0,\infty)\to\mathbb{R}$ by

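$\displaystyle\mathcal{L}(Z,\vartheta,\eta)\coloneqq\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)-\frac{D_{f}(\textsf{Q}_{Z}\|\textsf{P})-\eta}{\vartheta}=\textsf{E}^{\textsf{Q}_{Z}}\biggl(\ell(T,\,\cdot\,)-\frac{f\bigl(Z(T)\bigr)}{\vartheta Z(T)}\biggr)+\frac{\eta}{\vartheta}.$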

The Lagrangian leads to a dual function defined by

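$\displaystyle d(\vartheta,\eta)\coloneqq\sup_{Z\in\mathscr{M}_{+}(1)}\mathcal{L}(Z,\vartheta,\eta)=\sup_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\biggl(\ell(T,\,\cdot\,)-\frac{f\bigl(Z(T)\bigr)}{\vartheta Z(T)}\biggr)+\frac{\eta}{\vartheta}.$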

Given $t\in[0,T]$ and $Z\in\mathscr{M}_{+}(1)$ ,

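$\displaystyle\widehat{\ell}_{\vartheta}(t,Z)\coloneqq\ell(t,\,\cdot\,)-\frac{f\bigl(Z(t)\bigr)}{\vartheta Z(t)}$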

defines an $\mathscr{F}^{0}_{t}$-measurable function $\widehat{\ell}_{\vartheta}(t,Z):\Omega\to\mathbb{R}$. As with $\ell:[0,T]\times\Omega\to\mathbb{R}$, $\widehat{\ell}_{\vartheta}(\cdot,Z)$ may be regarded as a non-anticipative functional.

If the primal problem is convex and the constraint satisfies Slater’s condition (Slater 2014), then strong duality holds, giving

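$\displaystyle\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)=\inf_{\vartheta\in(0,\infty)}d(\vartheta,\eta).$ (2.4)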

This is proved in the following lemma.

Lemma 2.1.

The following statements are true:

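(1) The set $\mathcal{Z}_{\eta}$ is convex.

(2) The function $\mathcal{Z}_{\eta}\ni Z\mapsto\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)$ is convex.

(3) Strong duality (Eq. 2.4) holds.

(4) Given $\vartheta^{*}\in(0,\infty)$, suppose that $Z^{*}\in\mathscr{M}_{+}(1)$ satisfies

$\displaystyle Z^{*}=\arg\max_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}_{\vartheta^{*}}(T,Z)\bigr).$

Then

$\displaystyle Z^{*}=\arg\max_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr)$

with $\eta:=\textsf{E}\left(f\left(Z^{*}(T)\right)\right)$.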

Proof.

(1) Given $Z_{1},Z_{2}\in\mathcal{Z}_{\eta}$ , observe that

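$\displaystyle D_{f}(\textsf{Q}_{\lambda Z_{1}+(1-\lambda)Z_{2}}\|\textsf{P})=D_{f}\bigl(\lambda\textsf{Q}_{Z_{1}}+(1-\lambda)\textsf{Q}_{Z_{2}}\bigm\|\textsf{P}\bigr)$

$\displaystyle\leqslant\textsf{E}\bigl(\lambda f\bigl(Z_{1}(T)\bigr)+(1-\lambda)f\bigl(Z_{2}(T)\bigr)\bigr)$

$\displaystyle=\lambda\textsf{E}\bigl(f\bigl(Z_{1}(T)\bigr)\bigr)+(1-\lambda)\textsf{E}\bigl(f\bigl(Z_{2}(T)\bigr)\bigr)$

$\displaystyle=\lambda D_{f}(\textsf{Q}_{Z_{1}}\|\textsf{P})+(1-\lambda)D_{f}(\textsf{Q}_{Z_{2}}\|\textsf{P}),$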

for all $\lambda\in[0,1]$, by virtue of the convexity of $f$ and Jensen’s inequality. Since $D_{f}(\textsf{Q}_{Z_{1}}\|\textsf{P})\leqslant\eta$ and $D_{f}(\textsf{Q}_{Z_{2}}\|\textsf{P})\leqslant\eta$, the inequality above leads to $D_{f}(\textsf{Q}_{\lambda Z_{1}+(1-\lambda)Z_{2}}\|\textsf{P})\leqslant\eta$. This implies that $\lambda Z_{1}+(1-\lambda)Z_{2}\in\mathcal{Z}_{\eta}$, by virtue of the fact that $\lambda Z_{1}+(1-\lambda)Z_{2}\in\mathscr{M}_{+}(1)$.

(2) Given $Z_{1},Z_{2}\in\mathcal{Z}_{\eta}$ , observe that

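$\displaystyle\textsf{E}^{\textsf{Q}_{\lambda Z_{1}+(1-\lambda)Z_{2}}}\bigl(\ell(T,\,\cdot\,)\bigr)$

$\displaystyle=\lambda\textsf{E}\bigl(Z_{1}(T)\ell(T,\,\cdot\,)\bigr)+(1-\lambda)\textsf{E}\bigl(Z_{2}(T)\ell(T,\,\cdot\,)\bigr)$

$\displaystyle=\lambda\textsf{E}^{\textsf{Q}_{Z_{1}}}\bigl(\ell(T,\,\cdot\,)\bigr)+(1-\lambda)\textsf{E}^{\textsf{Q}_{Z_{2}}}\bigl(\ell(T,\,\cdot\,)\bigr),$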

for all $\lambda\in[0,1]$ . Hence, the function $\mathcal{Z}_{\eta}\ni Z\mapsto\textsf{E}^{\textsf{{Q}}_{Z}}\bigl{(}\ell(T,\,\cdot\,)\bigr{)}$ is linear and therefore also convex.

(3) For a given $\eta\in(0,\infty)$, the constant process $Z=1$ satisfies $D_{f}(\textsf{Q}_{Z}\|\textsf{P})=D_{f}(\textsf{P}\|\textsf{P})=0<\eta$. It is also an interior point of the subset $\mathcal{Z}_{\eta}\subseteq\mathscr{M}_{+}(1)$. [Footnote 2: To see this, consider the continuous function $\mathcal{H}:\mathscr{M}_{+}(1)\to\mathbb{R}$ defined by $\mathcal{H}(Z)=\textsf{E}\left(f\left(Z(T)\right)\right)$, where we endow $\mathscr{M}_{+}(1)$ with the topology induced by the metric $d(Z_{1},Z_{2})=\textsf{E}(|f(Z_{1}(T))-f(Z_{2}(T))|)$. The continuity ensures that $S:=\mathcal{H}^{-1}\left((-\eta,\eta)\right)$ is an open subset of $\mathscr{M}_{+}(1)$. Furthermore,

$\displaystyle S\subseteq\{Z\in\mathscr{M}_{+}(1)\mid D_{f}(\textsf{Q}_{Z}\|\textsf{P})<\eta\}\subseteq\mathcal{Z}_{\eta},$

so that $S\subseteq\mathsf{int}(\mathcal{Z}_{\eta})$. As an element of $S$, the constant process $Z=1$ is an interior point of $\mathcal{Z}_{\eta}$.] By Slater’s condition (Slater 2014), strong duality holds.

(4) Let $\eta:=\textsf{E}\left(f\left(Z^{*}(T)\right)\right)$ , and observe that

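$\displaystyle\inf_{\vartheta\in(0,\infty)}d(\vartheta,\eta)\leqslant d(\vartheta^{*},\eta)$

$\displaystyle=\sup_{Z\in\mathscr{M}_{+}(1)}\mathcal{L}(Z,\vartheta^{*},\eta)$

$\displaystyle=\sup_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}_{\vartheta^{*}}(T,Z)\bigr)+\frac{\eta}{\vartheta^{*}}$

$\displaystyle=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\widehat{\ell}_{\vartheta^{*}}(T,Z^{*})\bigr)+\frac{\eta}{\vartheta^{*}}$

$\displaystyle=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\ell(T,\,\cdot\,)\bigr)-\frac{\textsf{E}\bigl(f\bigl(Z^{*}(T)\bigr)\bigr)-\eta}{\vartheta^{*}}$

$\displaystyle=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\ell(T,\,\cdot\,)\bigr)$

$\displaystyle\leqslant\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr).$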

Lemma 2.1(3) then ensures that

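$\displaystyle\inf_{\vartheta\in(0,\infty)}d(\vartheta,\eta)=\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\ell(T,\,\cdot\,)\bigr)=\sup_{Z\in\mathcal{Z}_{\eta}}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\ell(T,\,\cdot\,)\bigr),$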

and the result follows. ∎
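Parts (1) and (2) of Lemma 2.1 can be checked numerically on a finite sample space. The sketch below assumes a single period (so that a density is just a non-negative random variable with unit mean under P), the Kullback-Leibler generator $f(x)=x\log x$, and two hypothetical densities `z1` and `z2`; it verifies the Jensen-type inequality for the divergence and the linearity of the expected loss along the segment joining them.

```python
import math


def f_kl(x):
    """Generator f(x) = x log x, with the convention 0 log 0 = 0."""
    return x * math.log(x) if x > 0 else 0.0


def divergence(z, p, f):
    """D_f(Q_Z || P) = E(f(Z(T))) on a finite sample space."""
    return sum(pi * f(zi) for pi, zi in zip(p, z))


def expected_loss(z, p, losses):
    """E^{Q_Z}(loss) = E(Z(T) * loss)."""
    return sum(pi * zi * li for pi, zi, li in zip(p, z, losses))


p = [0.25, 0.25, 0.25, 0.25]   # hypothetical reference measure
losses = [-1.0, 0.0, 0.5, 2.0]  # hypothetical terminal losses
z1 = [0.4, 0.8, 1.2, 1.6]       # density with unit mean under P
z2 = [1.6, 1.2, 0.8, 0.4]       # another density with unit mean under P

checks = []
for k in range(11):
    lam = k / 10
    z = [lam * a + (1 - lam) * b for a, b in zip(z1, z2)]
    bound = lam * divergence(z1, p, f_kl) + (1 - lam) * divergence(z2, p, f_kl)
    convex_ok = divergence(z, p, f_kl) <= bound + 1e-12
    mixed = lam * expected_loss(z1, p, losses) + (1 - lam) * expected_loss(z2, p, losses)
    linear_ok = abs(expected_loss(z, p, losses) - mixed) < 1e-12
    checks.append((convex_ok, linear_ok))
```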

For the primal problem formulated in Eq. 2.2, Lemma 2.1(4) implies the existence of a solution $Z^{*}$ that lies on the boundary of $\mathcal{Z}_{\eta}$ for given $\eta>0$ (i.e. $\textsf{E}\left(f\left(Z^{*}(T)\right)\right)=\eta$), as long as $Z^{*}$ solves

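$\displaystyle\max_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}_{\vartheta}(T,Z)\bigr)$ (2.5)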

for some $\vartheta\in(0,\infty)$. In what follows, we will consider the dual problem formulated in Eq. 2.5 instead of the primal problem. For simplicity, we will regard $\vartheta>0$ as given and write $\widehat{\ell}$ for $\widehat{\ell}_{\vartheta}$.
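The relationship between the primal and dual problems can be illustrated in a static, finite setting where the inner maximisation has a closed form. The sketch below is not the paper's dynamic solution: it assumes a single period, a finite sample space and two specific generators. For $f(x)=x\log x$ the maximiser is the exponentially tilted density $Z\propto e^{\vartheta\ell}$, and for $f(x)=(x-1)^{2}$ it is $Z=1+\tfrac{\vartheta}{2}(\ell-\textsf{E}\ell)$, valid only while this stays non-negative. All function names and numbers are hypothetical.

```python
import math


def mean(values, p):
    """Expectation under the reference measure P."""
    return sum(pi * v for pi, v in zip(p, values))


def kl_tilt(losses, p, theta):
    """Exponentially tilted density Z = exp(theta * loss) / E(exp(theta * loss))."""
    w = [math.exp(theta * l) for l in losses]
    m = mean(w, p)
    return [wi / m for wi in w]


def kl_of(z, p):
    """E(Z log Z), the relative entropy of Q_Z with respect to P."""
    return mean([zi * math.log(zi) for zi in z], p)


def kl_dual(losses, p, theta, eta):
    """Dual function for f(x) = x log x: (log E(exp(theta * loss)) + eta) / theta."""
    return (math.log(mean([math.exp(theta * l) for l in losses], p)) + eta) / theta


def kl_theta_star(losses, p, eta, lo=1e-8, hi=50.0, iters=200):
    """Bisection for the multiplier at which the tilted density exhausts the budget."""
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        if kl_of(kl_tilt(losses, p, mid), p) < eta:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)


p = [0.25, 0.25, 0.25, 0.25]
losses = [-1.0, 0.0, 0.5, 2.0]
eta = 0.1

# Relative-entropy case: the primal value at Z* coincides with the dual value at theta*.
theta_star = kl_theta_star(losses, p, eta)
z_star = kl_tilt(losses, p, theta_star)
primal = mean([z * l for z, l in zip(z_star, losses)], p)
dual = kl_dual(losses, p, theta_star, eta)

# Chi-square case: linear density, worst case equals mean + sqrt(eta * variance).
mu = mean(losses, p)
var = mean([(l - mu) ** 2 for l in losses], p)
half_theta = math.sqrt(eta / var)
z_chi2 = [1.0 + half_theta * (l - mu) for l in losses]
chi2_worst = mean([z * l for z, l in zip(z_chi2, losses)], p)
```

In both cases the optimal density sits on the boundary of the budget set, in line with the remark above; the dual value at any other multiplier is an upper bound on the worst-case expected loss.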

3. Characterising the Worst-Case Expected Loss

This section provides an implicit characterisation of the solution to the worst-case expected loss problem formulated in (2.2).

Given $t\in[0,T]$ and $\bar{Z}\in\mathscr{M}_{+}(1)$ , define the family of $\bar{Z}$ -consistent martingale densities up to time $t$ by

$\displaystyle\mathcal{Z}(t,\bar{Z})\coloneqq\{Z\in\mathscr{M}_{+}(1)\mid Z(t)=\bar{Z}(t)\}.$

Note that $\mathcal{Z}(0,\bar{Z})=\mathscr{M}_{+}(1)$, since $Z(0)=1=\bar{Z}(0)$ for all $Z\in\mathscr{M}_{+}(1)$. Note also that the martingale property of the members of $\mathcal{Z}(t,\bar{Z})$ ensures that

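$\displaystyle Z(s)=\textsf{E}\bigl(Z(t)\bigm|\mathscr{F}^{0}_{s}\bigr)=\textsf{E}\bigl(\bar{Z}(t)\bigm|\mathscr{F}^{0}_{s}\bigr)=\bar{Z}(s),$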

for all $Z\in\mathcal{Z}(t,\bar{Z})$ and all $s\in[0,t]$ . In other words, $\mathcal{Z}(t,\bar{Z})$ is the set of processes in $\mathscr{M}_{+}(1)$ that are consistent with $\bar{Z}$ over the interval $[0,t]$ . Moreover, we observe that

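$\displaystyle\textsf{Q}_{Z}(A)=\textsf{E}\bigl(\mathbf{1}_{A}Z(T)\bigr)=\textsf{E}\bigl(\mathbf{1}_{A}Z(t)\bigr)=\textsf{E}\bigl(\mathbf{1}_{A}\bar{Z}(t)\bigr)=\textsf{Q}_{\bar{Z}}(A),$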

for all $Z\in\mathcal{Z}(t,\bar{Z})$ and all $A\in\mathscr{F}^{0}_{t}$. That is to say, the probability measures associated with members of $\mathcal{Z}(t,\bar{Z})$ agree with each other on all $\mathscr{F}^{0}_{t}$-measurable events. In other words, $\mathcal{Z}(t,\bar{Z})$ is the set of alternative measures that remain feasible when looking forward from time $t$.
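The agreement of consistent densities on $\mathscr{F}^{0}_{t}$-measurable events can be seen on a small tree. The sketch below assumes a hypothetical two-step tree with a uniform reference measure and two terminal densities, `z_bar` and `z_alt`, chosen so that their time-1 values coincide; the names and numbers are illustrative only.

```python
from itertools import product

paths = list(product((-1, 1), repeat=2))  # hypothetical two-step tree
p = {w: 0.25 for w in paths}              # reference measure P

# Two terminal densities with the same conditional mean given the first move.
z_bar = {(-1, -1): 0.4, (-1, 1): 0.8, (1, -1): 1.2, (1, 1): 1.6}
z_alt = {(-1, -1): 1.0, (-1, 1): 0.2, (1, -1): 0.6, (1, 1): 2.2}


def density_at_1(z_T, first):
    """Z(1) = E(Z(T) | F_1) on the event that the first move equals `first`."""
    block = [w for w in paths if w[0] == first]
    return sum(p[w] * z_T[w] for w in block) / sum(p[w] for w in block)


def q_measure(z_T, event):
    """Q_Z(A) = E(1_A Z(T))."""
    return sum(p[w] * z_T[w] for w in event)


up_first = [w for w in paths if w[0] == 1]  # an F_1-measurable event
up_twice = [(1, 1)]                         # F_2-measurable but not F_1-measurable

consistent = all(
    abs(density_at_1(z_bar, s) - density_at_1(z_alt, s)) < 1e-12 for s in (-1, 1)
)
agree_on_f1 = abs(q_measure(z_bar, up_first) - q_measure(z_alt, up_first)) < 1e-12
agree_on_f2 = abs(q_measure(z_bar, up_twice) - q_measure(z_alt, up_twice)) < 1e-12
```

The two measures coincide on events decided by time 1 but are free to differ on later events, which is exactly the freedom left to the forward-looking optimisation.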

Given $\bar{Z}\in\mathscr{M}_{+}(1)$ , we now define the $\mathfrak{F}^{0}$ -adapted process $(\widehat{L}(t,\bar{Z}))_{t\in[0,T]}$ by

[TABLE]

for all $t\in[0,T]$ , assuming the maximum always exists. Since $\widehat{\ell}(\,\cdot\,,Z)$ is a non-anticipative functional satisfying $\widehat{\ell}(0,Z)=0$ P-a.s. and $Z(0)=1$ implies that $\textsf{{Q}}_{Z}|_{\mathscr{F}^{0}_{0}}=\textsf{P}|_{\mathscr{F}^{0}_{0}}$ , it follows that $\widehat{\ell}(0,Z)=0$ $\textsf{{Q}}_{Z}$ -a.s. as well. Consequently,

[TABLE]

where the second equality follows from the fact that $\mathscr{F}^{0}_{0}$ and $\mathscr{F}^{0}_{T}$ are independent sigma-algebras with respect to $\textsf{Q}_{Z}$. [Footnote 3: First observe that $Z(0)=1$ implies that $\textsf{Q}_{Z}(A)=\textsf{P}(A)=0$ or $\textsf{Q}_{Z}(A)=\textsf{P}(A)=1$, for all $A\in\mathscr{F}^{0}_{0}$. Consequently, given $A\in\mathscr{F}^{0}_{0}$ and $B\in\mathscr{F}^{0}_{T}$, we obtain

$\displaystyle 0\leqslant\textsf{Q}_{Z}(A\cap B)\leqslant\textsf{Q}_{Z}(A)=0=\textsf{Q}_{Z}(A)\textsf{Q}_{Z}(B),$

in the case when $\textsf{Q}_{Z}(A)=0$, while

$\displaystyle\textsf{Q}_{Z}(A)\textsf{Q}_{Z}(B)=\textsf{Q}_{Z}(B)\geqslant\textsf{Q}_{Z}(A\cap B)=\textsf{Q}_{Z}\bigl((A^{\mathsf{c}}\cup B^{\mathsf{c}})^{\mathsf{c}}\bigr)=1-\textsf{Q}_{Z}(A^{\mathsf{c}}\cup B^{\mathsf{c}})$

$\displaystyle\geqslant 1-\bigl(\textsf{Q}_{Z}(A^{\mathsf{c}})+\textsf{Q}_{Z}(B^{\mathsf{c}})\bigr)$

$\displaystyle=1-\textsf{Q}_{Z}(B^{\mathsf{c}})$

$\displaystyle=\textsf{Q}_{Z}(B)$

$\displaystyle=\textsf{Q}_{Z}(A)\textsf{Q}_{Z}(B),$

in the case when $\textsf{Q}_{Z}(A)=1$.] This is simply the problem given in Eq. 2.5.

Definition 3.1.

A worst-case density process is some $Z^{*}\in\mathscr{M}_{+}(1)$ that solves the maximisation problem (3.1) with respect to the family of $Z^{*}$-consistent martingale densities:

[TABLE]

for each $t\in[0,T]$ .

Suppose $Z^{*}\in\mathscr{M}_{+}(1)$ is a worst-case density process according to the definition above. Then $Z^{*}$ solves the problem formulated in Eq. 2.5. This is confirmed by substituting Eq. 3.2 into Eq. 3.3, which leads to $\textsf{E}^{\textsf{Q}_{Z^{*}}}\bigl(\widehat{\ell}(T,Z^{*})\bigr)=\max_{Z\in\mathscr{M}_{+}(1)}\textsf{E}^{\textsf{Q}_{Z}}\bigl(\widehat{\ell}(T,Z)\bigr)$. In the proposition below, we characterise such a worst-case density process by a martingale property.

Proposition 3.2.

Fix $\bar{Z}\in\mathscr{M}_{+}(1)$ and suppose the maximum in (3.1) exists for each $t\in[0,T]$ . Then the process $[0,T]\ni t\mapsto\widehat{L}(t,\bar{Z})+\widehat{\ell}(t,\bar{Z})$ is a $\textsf{{Q}}_{\bar{Z}}$ -supermartingale. It is a $\textsf{{Q}}_{\bar{Z}}$ -martingale iff $\bar{Z}$ is a worst-case density process.

Proof.

Given an arbitrary $t\in[0,T]$ , we suppose $Z^{\prime}\in\mathcal{Z}(t,\bar{Z})$ solves the maximisation problem (Eq. 3.1). Applying the law of iterated expectation, we have

[TABLE]

for all $s\in[0,t]$ . By virtue of $Z^{\prime}(s)=\bar{Z}(s)$ , $\widehat{\ell}(s,Z^{\prime})=\widehat{\ell}(s,\bar{Z})$ for all $s\in[0,t]$ . The same condition also leads to $Z^{\prime}\in\mathcal{Z}(s,\bar{Z})$ . According to the definition of $\widehat{L}$ (Eq. 3.1), we have the following inequality

[TABLE]

for all $s\in[0,t]$. In the last equality, we replace $\textsf{Q}_{Z^{\prime}}$ by $\textsf{Q}_{\bar{Z}}$ because $\widehat{\ell}(t,\bar{Z})$, $\widehat{\ell}(s,\bar{Z})$ and $\widehat{L}(t,\bar{Z})$ are all $\mathscr{F}^{0}_{t}$-measurable. [Footnote 4: The conditional expectation of an $\mathscr{F}^{0}_{t}$-measurable function $X:\Omega\to\mathbb{R}$ with respect to a sub-$\sigma$-algebra $\mathscr{F}^{0}_{s}\subseteq\mathscr{F}^{0}_{t}$ is

$\displaystyle\textsf{E}^{\textsf{Q}_{Z^{\prime}}}\left(\left.X\,\right|\mathscr{F}^{0}_{s}\right)=\textsf{E}\left(\left.\frac{Z^{\prime}(T)}{Z^{\prime}(s)}X\,\right|\mathscr{F}^{0}_{s}\right)=\textsf{E}\left(\left.\frac{Z^{\prime}(t)}{Z^{\prime}(s)}\textsf{E}\left(\left.\frac{Z^{\prime}(T)}{Z^{\prime}(t)}X\,\right|\mathscr{F}^{0}_{t}\right)\,\right|\mathscr{F}^{0}_{s}\right)$

$\displaystyle=\textsf{E}\left(\left.\frac{Z^{\prime}(t)}{Z^{\prime}(s)}\textsf{E}^{\textsf{Q}_{Z^{\prime}}}\left(\left.X\,\right|\mathscr{F}^{0}_{t}\right)\,\right|\mathscr{F}^{0}_{s}\right)$

$\displaystyle=\textsf{E}\left(\left.\frac{\bar{Z}(t)}{\bar{Z}(s)}X\,\right|\mathscr{F}^{0}_{s}\right)$

$\displaystyle=\textsf{E}^{\textsf{Q}_{\bar{Z}}}\left(\left.X\,\right|\mathscr{F}^{0}_{s}\right).$]

Since $t\in[0,T]$ is chosen arbitrarily, Eq. 3 holds for any $s$ and $t$ that satisfy $0\leqslant s\leqslant t\leqslant T$.

By re-arranging Eq. 3, we obtain the supermartingale property of the $\mathfrak{F}^{0}$ -adapted process $[0,T]\ni t\mapsto\widehat{L}(t,\bar{Z})+\widehat{\ell}(t,\bar{Z})$ :

[TABLE]

The process is a $\textsf{Q}_{\bar{Z}}$-martingale iff equality holds for all $0\leqslant s\leqslant t\leqslant T$. If $\bar{Z}$ is a worst-case density process, then according to Definition 3.1 $\bar{Z}$ solves Eq. 3.1 for all $t\in[0,T]$. We may therefore set $Z^{\prime}=\bar{Z}$ in Eq. 3, so that its first line holds with equality for all $s\in[0,t]$. Conversely, if equality holds for all $0\leqslant s\leqslant t\leqslant T$, then in particular it holds for all $0\leqslant s\leqslant t=T$. Taking equality in Eq. 3.6 and replacing $t$ by $T$, we get

[TABLE]

for all $s\in[0,T]$ , confirming that $\bar{Z}$ is a worst-case density process by Definition 3.1. ∎

Proposition 3.2 can be regarded as a generalization of the dynamic programming equation. Indeed, given an optimal martingale density $Z^{*}\in\mathscr{M}_{+}(1)$, we take an arbitrary $\bar{Z}\in\mathcal{Z}(s,Z^{*})$ and substitute it into Eq. 3.6. Since $\bar{Z}\in\mathcal{Z}(s,Z^{*})$ matches $Z^{*}$ up to time $s$, Eq. 3.6 becomes

[TABLE]

The inequality holds for all $\bar{Z}\in\mathcal{Z}(s,Z^{*})$ . It takes the equal sign when $\bar{Z}=Z^{*}$ . This leads to the following dynamic programming equation with respect to the density process,

[TABLE]

for all $s$ and $t$ that satisfy $0\leqslant s\leqslant t\leqslant T$.

4. General Result of Model Risk Measurement

We have shown in Proposition 3.2 that the $\mathfrak{F}^{0}$-adapted process $[0,T]\ni t\mapsto\widehat{L}(t,{Z}^{*})+\widehat{\ell}(t,{Z}^{*})$ is a $\textsf{Q}_{{Z}^{*}}$-martingale iff ${Z}^{*}$ is a worst-case density process. In this section, we show that such a $Z^{*}$ indeed exists under certain conditions and is characterized by an equation. This leads to a complete solution to the problem formulated in Eq. 2.2. First we prove a lemma.

Lemma 4.1.

Fix a martingale density $\bar{Z}\in\mathscr{M}_{+}(1)$ . A measurable process $C:[0,T]\times\Omega\to\mathbb{R}$ , satisfying

[TABLE]

for all $t\in[0,T]$ and all $Z\in\mathcal{Z}(t,\bar{Z})$ , admits a progressively measurable modification, i.e. there exists a progressively measurable process $\tilde{C}:[0,T]\times\Omega\to\mathbb{R}$ , regarded as a non-anticipative functional, satisfying $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)=\tilde{C}(t,\omega)\}\bigr{)}=1$ for every $t\in[0,T]$ .

Proof.

The $\mathscr{F}^{0}_{t}$-measurable functions $u(t,\cdot):=\textsf{E}^{\textsf{Q}_{\bar{Z}}}\bigl(C(t,\cdot)\,|\,\mathscr{F}^{0}_{t}\bigr)$ form an $\mathfrak{F}^{0}$-adapted process $\left(u(t,\cdot)\right)_{t\in[0,T]}$. It admits a progressively measurable modification $\bigl(\tilde{C}(t,\cdot)\bigr)_{t\in[0,T]}$ (Karatzas and Shreve 1991). We would like to show that $\textsf{Q}_{\bar{Z}}\bigl(\{\omega\in\Omega\,|\,C(t,\omega)=\tilde{C}(t,\omega)\}\bigr)=1$ for every $t\in[0,T]$.

We prove this lemma by contradiction. Suppose there exists a $t\in[0,T]$ such that $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)=\tilde{C}(t,\omega)\}\bigr{)}<1$ ; then $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)=u(t,\omega)\}\bigr{)}<1$ (see footnote 5).

Footnote 5. We only need to prove that $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)={u}(t,\omega)\}\bigr{)}=1$ leads to $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)=\tilde{C}(t,\omega)\}\bigr{)}=1$ . In fact, assuming $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)={u}(t,\omega)\}\bigr{)}=1$ we have

$\displaystyle\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)=\tilde{C}(t,\omega)\}\bigr{)}=$ $\displaystyle\,1-\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)\neq\tilde{C}(t,\omega)\}\bigr{)}$

$\displaystyle\geqslant$ $\displaystyle\,1-\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)\neq u(t,\omega)\}\cup\{\omega\in\Omega\,|\,u(t,\omega)\neq\tilde{C}(t,\omega)\}\bigr{)}$

$\displaystyle\geqslant$ $\displaystyle\,1-\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)\neq u(t,\omega)\}\bigr{)}-\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,u(t,\omega)\neq\tilde{C}(t,\omega)\}\bigr{)}$

$\displaystyle=$ $\displaystyle\,1$

(End of footnote 5.)

This implies that either $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)<{u}(t,\omega)\}\bigr{)}>0$ or $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)>{u}(t,\omega)\}\bigr{)}>0$ . Without loss of generality, we assume $\textsf{{Q}}_{\bar{Z}}\bigl{(}\{\omega\in\Omega\,|\,C(t,\omega)<{u}(t,\omega)\}\bigr{)}>0$ .

For notational simplicity, in the rest of the proof we use $C$ to denote the random variable $C(t,\cdot)$ and $u$ to denote the $\mathscr{F}^{0}_{t}$ -measurable function $u(t,\cdot)$ . We construct an alternative martingale density $Z^{\prime}\in\mathcal{Z}(t,{\bar{Z}})$ by

[TABLE]

To show that indeed $Z^{\prime}\in\mathcal{Z}(t,{\bar{Z}})$ , we need to prove that $Z^{\prime}(0)=1$ , $Z^{\prime}\geqslant 0$ , $Z^{\prime}(t)={\bar{Z}}(t)$ and $Z^{\prime}$ is a P-martingale. The first three conditions are obvious from the definition. The martingale property of $\left(Z^{\prime}(s)\right)_{s\in[0,t]}$ is clear. The martingale property of $\left(Z^{\prime}(s)\right)_{s\in[t,T]}$ is confirmed by

[TABLE]

for all $s\in[t,T]$ and $r\in[t,T]$ satisfying $s\leqslant r$ .

Because $\textsf{E}^{\textsf{{Q}}_{\bar{Z}}}\bigl{(}\textsf{E}^{\textsf{{Q}}_{\bar{Z}}}\left(\left.\mathbf{1}_{C<u}\,\right|\mathscr{F}^{0}_{t}\right)\bigr{)}=\textsf{{Q}}_{\bar{Z}}(C<u)>0$ , there exists an $\omega\in\Omega$ such that $\textsf{E}^{\textsf{{Q}}_{\bar{Z}}}\left(\left.\mathbf{1}_{C<u}\,\right|\mathscr{F}^{0}_{t}\right)(\omega)>0$ . We define

[TABLE]

then the LHS of Eq. 4.1 (with $Z$ replaced by $Z^{\prime}$ ) satisfies

[TABLE]

Note that the inequality follows from Chebyshev’s sum inequality, which states that for $w_{1},w_{2}>0$ with $w_{1}+w_{2}=1$ , one has $(w_{1}a_{1}+w_{2}a_{2})(w_{1}b_{1}+w_{2}b_{2})<w_{1}a_{1}b_{1}+w_{2}a_{2}b_{2}$ if $a_{1}<a_{2}$ and $b_{1}<b_{2}$ . This inequality can be easily proved by expanding the left-hand side. In Eq. 4, we have $w_{l}>0$ , $w_{u}>0$ (footnote 6: $w_{u}=0$ would lead to $\textsf{E}^{\textsf{{Q}}_{\bar{Z}}}(C|\mathscr{F}^{0}_{t})(\omega)=\textsf{E}^{\textsf{{Q}}_{\bar{Z}}}(C\mathbf{1}_{C<u}|\mathscr{F}^{0}_{t})(\omega)<u(\omega)$ , in contradiction with the definition of $u$ ) and

[TABLE]

and

[TABLE]

Therefore Chebyshev’s sum inequality is applicable.
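For completeness, the two-point form of Chebyshev’s sum inequality quoted above can be checked by a direct expansion. The following short derivation uses only $w_{1}+w_{2}=1$ and the symbols of the statement; it is a sketch added for the reader and is not part of the original proof.

$\displaystyle w_{1}a_{1}b_{1}+w_{2}a_{2}b_{2}-(w_{1}a_{1}+w_{2}a_{2})(w_{1}b_{1}+w_{2}b_{2})=w_{1}(1-w_{1})a_{1}b_{1}+w_{2}(1-w_{2})a_{2}b_{2}-w_{1}w_{2}(a_{1}b_{2}+a_{2}b_{1})$

$\displaystyle=w_{1}w_{2}\,(a_{1}b_{1}+a_{2}b_{2}-a_{1}b_{2}-a_{2}b_{1})=w_{1}w_{2}\,(a_{2}-a_{1})(b_{2}-b_{1})>0,$

where the second line uses $1-w_{1}=w_{2}$ and $1-w_{2}=w_{1}$ , and the strict positivity follows from $w_{1},w_{2}>0$ , $a_{1}<a_{2}$ and $b_{1}<b_{2}$ .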

We further apply Jensen’s inequality to the following expression twice ( $x\ln x$ is a convex function while $\ln x$ is a concave function),

[TABLE]

Following the inequality above, we take expectation w.r.t $\mathscr{F}^{0}_{t}$ and under the alternative measure generated by the Radon-Nikodym derivative

[TABLE]

By further assigning $x=e^{C}$ , we get the following inequality

[TABLE]

The LHS is simply $c_{l}$ . Substituting the inequality into Eq. 4 one gets

[TABLE]

This violates the condition stated in Eq. 4.1. We therefore conclude that $\bigl{(}C(t,\cdot)\bigr{)}_{t\in[0,T]}$ admits a progressively measurable modification $\bigl{(}\tilde{C}(t,\cdot)\bigr{)}_{t\in[0,T]}$ . ∎

A process $C$ that satisfies the conditions in Lemma 4.1 admits a progressively measurable modification $\tilde{C}$ w.r.t $\textsf{{Q}}_{\bar{Z}}$ , but not necessarily w.r.t the reference measure P. However, if it also holds w.r.t P, then we get the converse of Lemma 4.1. In fact, for $\bar{Z}\in\mathscr{M}_{+}(1)$ and any $Z\in\mathcal{Z}(t,\bar{Z})$ , both $\textsf{{Q}}_{\bar{Z}}$ and $\textsf{{Q}}_{Z}$ are absolutely continuous w.r.t P, implying $\tilde{C}$ is a modification of $C$ w.r.t $\textsf{{Q}}_{\bar{Z}}$ and $\textsf{{Q}}_{Z}$ . This results in

[TABLE]

The progressively measurable process $\tilde{C}$ is adapted to the filtration $\mathfrak{F}^{0}$ . Therefore

[TABLE]

for all $t\in[0,T]$ and $Z\in\mathcal{Z}(t,{\bar{Z}})$ . We use Lemma 4.1 to prove the following proposition.

Proposition 4.2.

$Z^{*}\in\mathscr{M}_{+}(1)$ is a worst-case martingale density iff the random variable

[TABLE]

equals a constant $\textsf{{Q}}_{Z^{*}}$ -a.s., and is dominated by the same constant P-a.s.

Proof.

Suppose ${Z^{*}}\in\mathscr{M}_{+}(1)$ is a worst-case martingale density. According to Definition. 3.1,

[TABLE]

for all $t\in[0,T]$ . Given any $t\in[0,T]$ and any $Z\in\mathcal{Z}(t,{Z^{*}})$ , we construct a new martingale density that lies between ${Z^{*}}$ and $Z$ by

[TABLE]

where $\lambda\in[0,1]$ . $Z_{\lambda}\in\mathcal{Z}(t,{Z^{*}})$ for all $\lambda\in[0,1]$ due to the convexity of $\mathcal{Z}(t,{Z^{*}})$ . Since ${Z^{*}}$ solves Eq. 4.5, the maximum value of

[TABLE]

is reached when $\lambda=0$ . Taking the first and second derivatives with respect to $\lambda$ , we get

[TABLE]

Notice that the twice-differentiable function $f:\mathbb{R}_{+}\to\mathbb{R}$ is convex as required by the non-negativity of the $f$ -divergence (Ali and Silvey, 1966). This implies that $f^{\prime\prime}(z)>0$ for all $z\in\mathbb{R}_{+}$ . Combined with Eq. 4.8, this condition leads to $K^{\prime\prime}(\lambda)<0$ for all $\lambda\in[0,1]$ . For $K(0)=\max_{\lambda\in[0,1]}K(\lambda)$ to hold, the first derivative at $\lambda=0$ must satisfy

[TABLE]

where the process $C_{Z^{*}}:[0,T]\times\Omega\to\mathbb{R}$ is defined by

[TABLE]

The inequality above holds for all $t\in[0,T]$ and all $Z\in\mathcal{Z}(t,{Z^{*}})$ . According to Lemma. 4.1, $C_{Z^{*}}$ admits a progressively measurable modification, say $\tilde{C}_{Z^{*}}$ . In particular, at $t=0$

[TABLE]

takes a constant value $c:=\tilde{C}_{Z^{*}}(0,0)$ , $\textsf{{Q}}_{Z^{*}}$ -a.s. In fact, $\tilde{C}_{Z^{*}}$ is regarded as a non-anticipative functional so that $\tilde{C}_{Z^{*}}(0,\omega)=\tilde{C}_{Z^{*}}(0,0)=c$ for all $\omega\in\Omega$ satisfying $(0,\omega)\sim(0,0)$ . As a result,

[TABLE]

Next we prove $\textsf{{P}}\bigl{(}C_{Z^{*}}(0,\cdot)\leqslant c\bigr{)}=1$ by contradiction. Suppose on the contrary that $\textsf{{P}}\bigl{(}C_{Z^{*}}(0,\cdot)>c\bigr{)}>0$ . We construct a martingale density $Z^{\prime}\in\mathcal{Z}(0,{Z^{*}})=\mathscr{M}_{+}(1)$ by setting

[TABLE]

for all $t\in[0,T]$ . This leads to

[TABLE]

Because we have already shown that $C_{Z^{*}}(0,\,\cdot\,)=c$ , $\textsf{{Q}}_{Z^{*}}$ -a.s. (Eq. 4),

[TABLE]

According to Eq. 4.9, $K^{\prime}(0)>0$ (where the generic density process $Z$ is replaced by the constructed process $Z^{\prime}\in\mathcal{Z}(0,Z^{*})$ ). This contradicts the assumption that $Z^{*}$ is a worst-case martingale density.

Conversely, given a process $Z^{*}\in\mathscr{M}_{+}(1)$ , suppose $C_{Z^{*}}(0,\,\cdot\,):\Omega\to\mathbb{R}$ takes a constant value, say $c$ , $\textsf{{Q}}_{Z^{*}}$ -a.s., and $C_{Z^{*}}(0,\,\cdot\,)\leqslant c$ P-a.s. Given any $t\in[0,T]$ and any $Z\in\mathcal{Z}(t,{Z^{*}})$ , $C_{Z^{*}}(0,\,\cdot\,)\leqslant c$ $\textsf{{Q}}_{Z}$ -a.s. due to the absolute continuity of $\textsf{{Q}}_{Z}$ w.r.t. P. These properties lead to conditional expectations

[TABLE]

Noticing that $C_{Z^{*}}(t,\,\cdot\,)=C_{Z^{*}}(0,\,\cdot\,)-\ell(t,\cdot)$ where $\ell(t,\cdot)$ is $\mathscr{F}^{0}_{t}$ -measurable, we have

[TABLE]

According to Eq. 4.9, $K^{\prime}(0)\leqslant 0$ . Because $K^{\prime\prime}(\lambda)<0$ (Eq. 4.8) for all $\lambda\in[0,1]$ , $K(0)\geqslant K(1)$ . According to the definition of $K(\lambda)$ (Eq. 4.6), we have

[TABLE]

This inequality applies to every $t\in[0,T]$ and every $Z\in\mathcal{Z}(t,{Z^{*}})$ . As a result, ${Z^{*}}$ solves Eq. 4.5 for all $t\in[0,T]$ and is indeed a worst-case martingale density. ∎

Note that Proposition 3.2 is a general result that works for any $\mathfrak{F}^{0}$ -adapted process $(\widehat{\ell}(t,Z))_{t\in[0,T]}$ , irrespective of its actual formulation (Eq. 2.3). On the other hand, Proposition 4.2 makes use of the formulation, thus specifying the condition of a worst-case martingale density w.r.t the function $f(x)$ . Any worst-case density process ${Z^{*}}\in\mathscr{M}_{+}(1)$ solves the original problem formulated in Eq. 2.5. Assuming the existence of such ${Z^{*}}$ , we regard Eq. 2.5 as the initial value (at $t=0$ ) of a particular process, termed the value process. In general, we define three $\mathfrak{F}^{0}$ -adapted processes as below.

Definition 4.3.

Given $\vartheta\in(0,\infty)$ and a worst-case martingale density ${Z^{*}}\in\mathscr{M}_{+}(1)$ , the value process, $U:[0,T]\times\Omega\to\mathbb{R}$ , the worst-case risk, $V:[0,T]\times\Omega\to\mathbb{R}$ , and the budget process $\eta:[0,T]\times\Omega\to\mathbb{R}$ (footnote 7: we name it the budget process as it measures the remaining budget of the fictitious adversary (Glasserman and Xu, 2014); $\eta(0,\cdot)$ is referred to as the relative entropy budget in Glasserman and Xu (2014)), regarded as non-anticipative functionals, are defined by

[TABLE]

where $\left(F(t,{Z^{*}})\right)_{t\in[0,T]}$ is the $\textsf{{Q}}_{Z^{*}}$ -martingale that satisfies $F(T,{Z^{*}})=f({Z^{*}}(T))/{Z^{*}}(T)$ .

Intuitively, $U(t,\cdot)$ gives the worst-case expected loss, subtracting the on-going cost of perturbing the nominal model from time $t$ to $T$ . According to the definition of the worst-case martingale density (Eq. 3.3),

[TABLE]

The second term is the penalization term for perturbing the nominal model from time $t$ onwards. For continuity it is defined to be zero in the limiting case of ${Z^{*}}(t)=0$ . According to Definition 4.3, $V(t,\cdot)$ is the worst-case expected loss,

[TABLE]

The difference between $V(t,\cdot)$ and $U(t,\cdot)$ gives the cost for perturbing the nominal model (measured by the $f$ -divergence), characterized by the process $\eta$ :

[TABLE]

We may further consider the terminal and initial values of the three processes. The value process, $U(t,\cdot)$ , measures the target formulated in Eq. 2.5 from backwards, in the sense that

[TABLE]

The worst-case risk process measures the model risk, Eq. 2.2, from backwards. According to Lemma. 2.1(4), the worst-case density $Z^{*}$ solves the primal problem with $\eta:=\eta(0,\cdot)=\textsf{E}\left(f({Z^{*}}(T))\right)$ . Therefore

[TABLE]

The cumulative budget $\eta$ (i.e. relative entropy budget in Glasserman and Xu (2014)) is measured by the budget process from backwards,

[TABLE]

To solve the problem formulated in Eq. 2.5, Eq. 4.12 suggests solving the process $U$ by backward induction. In a similar way, the model risk, Eq. 2.2, and its corresponding cumulative budget, $\eta$ , may be quantified by solving the processes $V$ and $\eta$ by backward induction. The full procedure is given by the following theorem.

Theorem 4.4.

Given $\vartheta\in(0,\infty)$ , suppose there exists a function $z:\mathbb{R}\to\mathbb{R}_{+}$ that satisfies

[TABLE]

where $c\in\mathbb{R}$ is a constant such that $\textsf{E}\bigl{(}z\circ\ell(T,\cdot)\bigr{)}=1$ and $\textsf{{P}}\left(\ell(T,\cdot)<\sup I_{c}\right)=1$ . Then the value process, $U$ , the worst-case risk, $V$ , and the budget process, $\eta$ , satisfy the following equations

[TABLE]

for all $t\in[0,T]$ and a.a. $\omega\in\{Z(t)>0\}$ , where $(Z,\,M,\,W)$ is a $\mathfrak{F}^{0}$ -adapted P-martingale that satisfies the following terminal condition:

[TABLE]

Proof.

The function $z$ defined by Eq. 4.13 provides a martingale density $Z\in\mathscr{M}_{+}(1)$ by composition:

[TABLE]

for all $t\in[0,T]$ . $Z$ is exactly the first element of the vectorized process defined in Eq. 4.16. It is indeed an element of $\mathscr{M}_{+}(1)$ , for $Z(T)=z\circ\ell(T,\cdot)\geqslant 0$ and $Z(0)=\textsf{E}\bigl{(}z\circ\ell(T,\cdot)\bigr{)}=1$ . The random variable

[TABLE]

is equal to the constant $c$ $\textsf{{Q}}_{Z}$ -a.s. In fact, $c\in\mathbb{R}$ is selected such that

[TABLE]

by virtue of $z(x)=0$ for all $x\notin I_{c}$ . Since $C_{Z}(0,\omega)=c$ for all $\omega\in\Omega$ satisfying $\ell(T,\omega)\in I_{c}$ , we have

[TABLE]

Next we need to show that $C_{Z}(0,\cdot)\leqslant c$ P-a.s. Notice that the function $f^{\prime}:(0,\infty)\to\mathbb{R}$ is continuous and strictly increasing due to the convexity of $f$ , implying that $\text{range}(f^{\prime})=(f^{\prime}(0_{+}),f^{\prime}(\infty_{-}))$ . We conclude that $\text{range}(f^{\prime})$ is an open interval and denote it by $(a,b)$ , where $a$ and $b$ can be either real numbers or $\pm\infty$ . According to the assumption, we have

[TABLE]

We extend the function $f^{\prime}$ continuously to zero by assigning $f^{\prime}(0)=a$ .

[TABLE]

We conclude that $C_{Z}(0,\cdot)=c$ $\textsf{{Q}}_{Z}$ -a.s. and $C_{Z}(0,\cdot)\leqslant c$ P-a.s. According to Proposition. 4.2, $Z$ defined in Eq. 4.17 is a worst-case density process.

The second component of Eq. 4.16 is a P-martingale given by

[TABLE]

for all $t\in[0,T]$ . Substituting Eq. 4.18 into Eq. 4, we have

[TABLE]

By virtue of $C_{Z}(0,\cdot)=c$ $\textsf{{Q}}_{Z}$ -a.s., the equation above holds $\textsf{{Q}}_{Z}$ -a.s. More precisely, it holds for a.a. $\omega\in\{Z(t)>0\}$ (see footnote 8).

Footnote 8. According to the definition of $C_{Z}(0,\cdot)$ (Eq. 4.18), $C_{Z}(0,\omega)=c$ for all $\omega\in\Omega$ satisfying $\ell(T,\omega)\in I_{c}$ . It follows from $Z(T)(\omega)=z\circ\ell(T,\omega)=0$ for a.a. $\omega\in\{\omega\in\Omega\,|\,\ell(T,\omega)\notin I_{c}\}$ that

$\displaystyle\textsf{E}^{\textsf{{Q}}_{Z}}(C_{Z}(0,\cdot)\,|\,\mathscr{F}^{0}_{t})=$ $\displaystyle\,\textsf{E}^{\textsf{{Q}}_{Z}}(C_{Z}(0,\cdot)\mathbf{1}_{\ell(T,\cdot)\in I_{c}}\,|\,\mathscr{F}^{0}_{t})+\textsf{E}^{\textsf{{Q}}_{Z}}(C_{Z}(0,\cdot)\mathbf{1}_{\ell(T,\cdot)\notin I_{c}}\,|\,\mathscr{F}^{0}_{t})$

$\displaystyle=$ $\displaystyle\,Z(t)^{-1}\textsf{E}(Z(T,\cdot)C_{Z}(0,\cdot)\mathbf{1}_{\ell(T,\cdot)\in I_{c}}\,|\,\mathscr{F}^{0}_{t})+Z(t)^{-1}\textsf{E}(Z(T,\cdot)C_{Z}(0,\cdot)\mathbf{1}_{\ell(T,\cdot)\notin I_{c}}\,|\,\mathscr{F}^{0}_{t})$

$\displaystyle=$ $\displaystyle\,cZ(t)^{-1}\textsf{E}(Z(T,\cdot)\mathbf{1}_{\ell(T,\cdot)\in I_{c}}\,|\,\mathscr{F}^{0}_{t})$

$\displaystyle=$ $\displaystyle\,cZ(t)^{-1}\textsf{E}(Z(T,\cdot)\,|\,\mathscr{F}^{0}_{t})=c$

for a.a. $\omega\in\{Z(t)>0\}$ . (End of footnote 8.)

The third element of Eq. 4.16, $W(t)=\textsf{E}\bigl{(}\left.z\circ\ell(T,\cdot)\times\ell(T,\cdot)\,\right|\mathscr{F}^{0}_{t}\bigr{)}=\textsf{E}\bigl{(}\left.Z(T)\ell(T,\cdot)\,\right|\mathscr{F}^{0}_{t}\bigr{)}$ , characterizes the worst-case risk by

[TABLE]

for all $\omega\in\{\omega\in\Omega\,|\,Z(t)(\omega)>0\}$ . Thus the equation above holds $\textsf{{Q}}_{Z}$ -a.s. Following the expressions for $U(t,\cdot)$ and $V(t,\cdot)$ , we get the formula for the budget process

[TABLE]

∎

In the proof above, we introduced the inverse of the function $f^{\prime}$ , denoted by $g:\text{range}(f^{\prime})\to(0,\infty)$ . Using this inverse function, we have the following proposition, which states that certain integrability conditions guarantee the existence of the solution, given by Theorem 4.4, to the problem of model risk quantification.

Proposition 4.5.

Denote by $g:(a,b)\to(0,\infty)$ the inverse function of $f^{\prime}$ . If $f^{\prime}(\infty_{-})=\infty$ and, for every $c\in\mathbb{R}$ , the random variable $g\bigl{(}\vartheta(\ell(T,\cdot)-c)\bigr{)}\mathbf{1}_{\ell(T,\cdot)\in I_{c}}$ is integrable under the reference measure P, then the assumptions in Theorem 4.4 hold.

Proof.

We need to prove the existence of $c\in\mathbb{R}$ and $z:\mathbb{R}\to\mathbb{R}_{+}$ such that Eq. 4.13 holds for all $x\in I_{c}$ , $z(x)=0$ for all $x\notin I_{c}$ , $\textsf{E}\bigl{(}z\circ\ell(T,\cdot)\bigr{)}=1$ and $\textsf{{P}}\left(\ell(T,\cdot)<\sup I_{c}\right)=1$ .

We have shown in the proof of Theorem 4.4 that $\text{range}(f^{\prime})=(a,b)$ . Here $b$ takes $\infty$ as the strictly increasing function $f^{\prime}$ diverges at infinity. For a given $c\in\mathbb{R}$ , the implicit equation Eq. 4.13 gives

[TABLE]

for all $x\in I_{c}=(\vartheta^{-1}a+c,\infty)$ . For all $x\notin I_{c}$ , $z(x)=0$ which gives

[TABLE]

We would like to show that the function $K:\mathbb{R}\to\mathbb{R}$ defined by

[TABLE]

takes value of one for some $c\in\mathbb{R}$ .

First we will show that $K$ is continuous. Fix an arbitrary $c_{0}\in\mathbb{R}$ and $\varepsilon\in(0,\infty)$ . By the continuity of $g$ , the function $y(\cdot,\omega):(-\infty,c_{0}\,]\to\mathbb{R}$ defined by

[TABLE]

is continuous for every $\omega\in\Omega$ . Therefore, the function $Y:(-\infty,c_{0}]\to\mathbb{R}$ , defined by $Y(c):=\textsf{E}\left(y(c,\cdot)\right)$ , is continuous at $c_{0}$ (see footnote 9).

Footnote 9. It follows from the dominated convergence theorem that $Y$ is continuous at $c_{0}$ . In fact, the sequence $\{y(c_{0}-1/n,\cdot)\}_{n=1}^{\infty}$ of real-valued measurable functions converges pointwise to $y(c_{0},\cdot)$ by virtue of its continuity. The sequence is dominated by $y(c_{0}-1,\cdot)$ due to the fact that $g$ increases monotonically. $y(c_{0}-1,\cdot)$ is integrable as

$\displaystyle\textsf{E}\bigl{(}|y(c_{0}-1,\cdot)|\bigr{)}\leqslant\textsf{E}\left(g\bigl{(}\vartheta(\ell(T,\cdot)-c_{0}+1)\bigr{)}\mathbf{1}_{\ell(T,\cdot)>\vartheta^{-1}a+c_{0}}\right)\leqslant\textsf{E}\bigl{(}g\bigl{(}\vartheta(\ell(T,\cdot)-c_{0}+1)\bigr{)}\mathbf{1}_{\ell(T,\cdot)\in I_{c_{0}-1}}\bigr{)}<\infty$

The dominated convergence theorem guarantees the convergence of the expectation

$\displaystyle\lim_{n\to\infty}\textsf{E}\bigl{(}y(c_{0}-1/n,\cdot)\bigr{)}=\textsf{E}\bigl{(}y(c_{0},\cdot)\bigr{)}=0$

This means that given an arbitrary $\varepsilon>0$ , there exists $n_{0}\in\mathbb{N}$ such that $\bigl{|}\textsf{E}\bigl{(}y(c_{0}-1/n,\cdot)\bigr{)}\bigr{|}<\varepsilon$ for all $n\geqslant n_{0}$ . Due to the fact that $g$ increases monotonically, for every $c\in[c_{0}-1/n_{0},c_{0}]$ we have

$\displaystyle 0\leqslant\textsf{E}\bigl{(}y(c,\cdot)\bigr{)}-\textsf{E}\bigl{(}y(c_{0},\cdot)\bigr{)}=\textsf{E}\bigl{(}y(c,\cdot)\bigr{)}\leqslant\textsf{E}\bigl{(}y(c_{0}-1/n_{0},\cdot)\bigr{)}<\varepsilon$

This proves that $Y$ is continuous at $c_{0}$ . (End of footnote 9.)

Its continuity implies the existence of $\delta>0$ such that $|Y(c)|=|Y(c)-Y(c_{0})|<\varepsilon/2$ for all $c\in\mathbb{R}$ satisfying $c_{0}-\delta<c\leqslant c_{0}$ . Let

[TABLE]

Then for all $c_{0}-\delta_{-}<c\leqslant c_{0}$ we have

[TABLE]

We may prove in a similar way that there exists $\delta_{+}>0$ such that $K(c)-K(c_{0})\in(-\varepsilon,0\,]$ for all $c_{0}<c<c_{0}+\delta_{+}$ . Combining the two arguments, $|K(c)-K(c_{0})|$ is less than $\varepsilon$ for all $c\in\mathbb{R}$ satisfying $|c-c_{0}|<\min(\delta_{+},\delta_{-})$ . This proves that the function $K$ , defined in Eq. 4.19, is continuous.

Next we need to prove that there exist $c_{+},c_{-}\in\mathbb{R}$ such that $K(c_{+})\leqslant 1$ and $K(c_{-})\geqslant 1$ . In fact, the limit $\lim_{c\to-\infty}\textsf{{P}}\bigl{(}\ell(T,\cdot)>\vartheta^{-1}a+c\bigr{)}=1$ implies the existence of $c\in\mathbb{R}$ such that $\textsf{{P}}\bigl{(}\ell(T,\cdot)>\vartheta^{-1}a+c\bigr{)}\geqslant 1/\xi$ for some $\xi>1$ . Defining

[TABLE]

we have

[TABLE]

On the other hand, the following limit (footnote 10: the convergence is guaranteed by the dominated convergence theorem, by the same argument as in the preceding footnote)

[TABLE]

implies the existence of $c\in\mathbb{R}$ such that

[TABLE]

Letting $c_{+}=\max(0,c)$ , we have

[TABLE]

According to the intermediate value theorem, there exists $c\in\mathbb{R}$ such that the continuous function $K$ , defined in Eq. 4.19, takes the value of one (footnote 11: such $c\in\mathbb{R}$ is also unique, since the function $K$ is strictly decreasing).

The condition $\textsf{{P}}\left(\ell(T,\cdot)<\sup I_{c}\right)=1$ holds irrespective of the actual measure P, for

[TABLE]

has probability one. As a result, the assumptions stated in Theorem 4.4 are valid, which guarantees the existence of the worst-case solution provided by the theorem. ∎
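The existence argument above is constructive in spirit: since $K$ is continuous and strictly decreasing, the constant $c$ can be located numerically by bisection. The sketch below illustrates this under assumptions that are not in the paper: the terminal loss takes finitely many values, Eq. 4.13 is read as $z(x)=g(\vartheta(x-c))$ on $I_{c}$ (consistent with the statement of Proposition 4.5), and the function name `solve_c` and the example $f(x)=x^{2}-1$ are purely illustrative.

```python
import math

def solve_c(losses, probs, theta, g, a, tol=1e-12):
    """Bisection for c such that K(c) = E[g(theta*(loss - c)) 1{loss in I_c}] = 1.

    Illustrative assumptions: finite-state terminal loss, K continuous and
    strictly decreasing where positive, I_c = (a/theta + c, infinity).
    """
    def K(c):
        # Only states with theta*(x - c) > a, i.e. x in I_c, contribute.
        return sum(p * g(theta * (x - c))
                   for x, p in zip(losses, probs) if theta * (x - c) > a)

    lo, hi = min(losses) - 1.0, max(losses) + 1.0
    while K(lo) < 1.0:   # widen to the left until K(lo) >= 1
        lo -= hi - lo
    while K(hi) > 1.0:   # widen to the right until K(hi) <= 1
        hi += hi - lo
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if K(mid) > 1.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

losses = [0.0, 1.0, 2.0]
probs = [0.25, 0.5, 0.25]
theta = 1.0

# Hypothetical quadratic example: f(x) = x**2 - 1, f'(x) = 2x, a = 0, g(y) = y/2.
c_quad = solve_c(losses, probs, theta, g=lambda y: y / 2.0, a=0.0)

# Kullback-Leibler case: f(x) = x ln x, a = -infinity, g(y) = exp(y - 1).
c_kl = solve_c(losses, probs, theta, g=lambda y: math.exp(y - 1.0), a=-math.inf)
```

In the Kullback-Leibler case the result can be compared with the closed form $c=\vartheta^{-1}\bigl(\ln\textsf{E}(e^{\vartheta\ell(T,\cdot)})-1\bigr)$ , which follows from $\textsf{E}\bigl(e^{\vartheta(\ell(T,\cdot)-c)-1}\bigr)=1$ .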

We consider a special class of $f$ -divergence, including the renowned Kullback-Leibler divergence, for which the function $\mathbb{R}\ni x\mapsto xf^{\prime}(x)-f(x)$ is linear (or equivalently $x\mapsto xf^{\prime\prime}(x)$ is constant). This type of $f$ -divergence is particularly convenient when applying Theorem 4.4, because the process

[TABLE]

can be calculated directly from $Z(t)$ . Therefore in practice we only need to apply backward induction to the two-dimensional P-martingale $(Z(t),W(t))_{t\in[0,T]}$ . By substituting Eq. 4 into Eq. 4.4, we have the following corollary.

Corollary 4.6.

Suppose in Theorem 4.4 there exists $d\in(0,\infty)$ such that $xf^{\prime\prime}(x)=d$ for all $x\in\mathbb{R}_{+}$ . Then the value process, $U$ , the worst-case risk, $V$ , and the budget process, $\eta$ , satisfy the following equations

[TABLE]

for all $t\in[0,T]$ and all $\omega\in\Omega$ such that $Z(t)(\omega)>0$ , where $(Z,\,W)$ is a $\mathfrak{F}^{0}$ -adapted P-martingale that satisfies the following terminal condition:

[TABLE]

Corollary 4.6 applies to the Kullback-Leibler divergence. In particular, the calculation of the constant $c$ is straightforward. We illustrate this in the following corollary.

Corollary 4.7.

Under the Kullback-Leibler divergence, suppose $\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\right)<\infty$ . Then there exists a unique solution to the problem of model risk quantification, given by

[TABLE]

where $\bigl{(}\tilde{Z},\,\tilde{W}\bigr{)}$ is a $\mathfrak{F}^{0}$ -adapted P-martingale that satisfies the terminal condition:

[TABLE]

Proof.

The Kullback-Leibler divergence adopts $f^{\prime}(x)=(x\ln x)^{\prime}=\ln x+1$ for all $x\in(0,\infty)$ . $f^{\prime}$ diverges at $\infty$ . The inverse function $g:\mathbb{R}\to(0,\infty)$ is given by $g(x)=e^{x-1}$ . Since $\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\right)<\infty$ , we have

[TABLE]

for all $c\in\mathbb{R}$ . Proposition 4.5 guarantees the existence of a unique $c\in\mathbb{R}$ and $z:\mathbb{R}\to\mathbb{R}_{+}$ satisfying $\textsf{E}\bigl{(}z\circ\ell(T,\cdot)\bigr{)}=1$ , therefore a unique solution to the problem of model risk quantification.

More specifically, we calculate the function $z:\mathbb{R}\to\mathbb{R}_{+}$ from Eq. 4.13:

[TABLE]

for all $x\in\mathbb{R}$ . The constant $c\in\mathbb{R}$ is given by

[TABLE]

The corollary defines two P-martingales by

[TABLE]

The processes $Z$ and $W$ in Corollary 4.6 are simply normalized versions of $\tilde{Z}$ and $\tilde{W}$ ,

[TABLE]

Substituting the equations above into Eq. 4.6, we have

[TABLE]

Note that $Z(T)(\omega)=e^{\vartheta(\ell(T,\omega)-c)-1}>0$ for all $\omega\in\Omega$ . $Z(t)=\textsf{E}(Z(T)\,|\mathscr{F}^{0}_{t})>0$ , implying that the equations above hold for all $t\in[0,T]$ and all $\omega\in\Omega$ . ∎
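The Kullback-Leibler case lends itself to a small numerical illustration of the backward induction on the pair $(\tilde{Z},\tilde{W})$ . The sketch below is not taken from the paper. It assumes a non-recombining binomial tree under the nominal measure, a hypothetical path-dependent loss (the running maximum of a random walk), the terminal values $\tilde{Z}(T)=e^{\vartheta\ell(T,\cdot)}$ and $\tilde{W}(T)=\ell(T,\cdot)e^{\vartheta\ell(T,\cdot)}$ , and the relations $U=\vartheta^{-1}\ln\tilde{Z}$ , $V=\tilde{W}/\tilde{Z}$ and $\eta=\vartheta(V-U)$ , which is how we read the corollary.

```python
import math
from itertools import product

def running_max_loss(path):
    """Hypothetical path-dependent loss: running maximum of a +/-1 walk from 0."""
    level, peak = 0, 0
    for step in path:
        level += step
        peak = max(peak, level)
    return float(peak)

def kl_model_risk_tree(n_steps, p_up, theta, loss):
    """Return, per time step, a dict mapping each partial path to (U, V, eta)."""
    # Terminal layer (assumed): Z~(T) = exp(theta*loss), W~(T) = loss*exp(theta*loss).
    layer = {}
    for path in product((1, -1), repeat=n_steps):
        tilt = math.exp(theta * loss(path))
        layer[path] = (tilt, loss(path) * tilt)
    layers = [layer]
    # Backward induction: both components are martingales under the nominal
    # measure, so each node is the probability-weighted average of its children.
    for k in range(n_steps - 1, -1, -1):
        parent = {}
        for path in product((1, -1), repeat=k):
            z_up, w_up = layer[path + (1,)]
            z_dn, w_dn = layer[path + (-1,)]
            parent[path] = (p_up * z_up + (1 - p_up) * z_dn,
                            p_up * w_up + (1 - p_up) * w_dn)
        layer = parent
        layers.append(layer)
    layers.reverse()
    result = []
    for layer in layers:
        values = {}
        for path, (z, w) in layer.items():
            u = math.log(z) / theta   # value process
            v = w / z                 # worst-case conditional expected loss
            values[path] = (u, v, theta * (v - u))  # remaining budget
        result.append(values)
    return result

tree = kl_model_risk_tree(n_steps=3, p_up=0.5, theta=0.8, loss=running_max_loss)
u0, v0, eta0 = tree[0][()]
```

At the root, `eta0` coincides with the relative entropy $\textsf{E}\bigl(Z(T)\ln Z(T)\bigr)$ of the exponentially tilted measure, and at the terminal nodes $U$ and $V$ both reduce to the realised loss. The tree is enumerated path by path, so the cost grows as $2^{n}$ ; this is acceptable for an illustration but not for production use.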

5. Model Risk Measurement with Continuous Semimartingales

The last section provides the general theory on quantifying the model risk. In this section, we focus on the class of continuous semimartingales. It has an important property formulated by the functional Ito formula. To introduce the formula we need to briefly review the functional Ito calculus (Bally et al., 2016). First we define the horizontal derivative and the vertical derivative of a non-anticipative functional $F:\Lambda_{T}^{d}\to\mathbb{R}$ . Its horizontal derivative at $(t,\omega)\in\Lambda_{T}^{d}$ is defined by the limit

[TABLE]

if it exists. Intuitively, it describes the rate of change w.r.t time, assuming no change of the state variable from $t$ onwards, and conditional on its history up to $t$ given by the stopped path $\omega_{t}$ . On the other hand, the vertical derivative describes the rate of change w.r.t the state variable from $t$ onwards. Formally, the vertical derivative at $(t,\omega)\in\Lambda_{T}^{d}$ , denoted by $\nabla_{\omega}F(t,\omega)$ , is defined as the gradient of the function $\mathbb{R}^{d}\ni x\mapsto F\bigl{(}t,\omega_{t}+x\mathbf{1}_{[t,T]}\bigr{)}$ at $x=0$ , assuming its existence. The horizontal and vertical derivatives of a non-anticipative functional are also non-anticipative functionals.
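Two standard one-dimensional examples may help fix the two notions. They are added here as an illustration and assume the usual form of the horizontal derivative, $\mathcal{D}F(t,\omega)=\lim_{h\to 0^{+}}h^{-1}\bigl(F(t+h,\omega_{t})-F(t,\omega_{t})\bigr)$ , together with the vertical derivative as defined above.

For the running integral $F(t,\omega)=\int_{0}^{t}\omega(s)\,ds$ , extending the stopped path $\omega_{t}$ beyond $t$ adds the constant value $\omega(t)$ to the integrand, while a vertical bump on $[t,T]$ changes the integrand on $[0,t]$ only at the single point $s=t$ :

$\displaystyle F(t+h,\omega_{t})-F(t,\omega_{t})=h\,\omega(t)\quad\Rightarrow\quad\mathcal{D}F(t,\omega)=\omega(t),\qquad\nabla_{\omega}F(t,\omega)=0.$

For the path-independent functional $F(t,\omega)=\omega(t)^{2}$ , the stopped path is frozen after $t$ , so that

$\displaystyle\mathcal{D}F(t,\omega)=0,\qquad\nabla_{\omega}F(t,\omega)=\frac{d}{dx}\bigl(\omega(t)+x\bigr)^{2}\Big|_{x=0}=2\,\omega(t),\qquad\nabla_{\omega}^{2}F(t,\omega)=2.$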

We define the left-continuous non-anticipative functionals by noticing that the space of stopped paths, $\Lambda_{T}^{d}$ , is endowed with a metric $d_{\infty}$ . Suppose $F:\Lambda_{T}^{d}\to\mathbb{R}$ is a non-anticipative functional. $F$ is left-continuous if for every $(t,\omega)\in\Lambda_{T}^{d}$ and $\varepsilon>0$ , there exists $\delta>0$ such that $|F(t,\omega)-F(t^{\prime},\omega^{\prime})|<\varepsilon$ for all $(t^{\prime},\omega^{\prime})\in\Lambda_{T}^{d}$ satisfying $t^{\prime}<t$ and $d_{\infty}((t,\omega),(t^{\prime},\omega^{\prime}))<\delta$ . We may further impose a boundedness condition on a non-anticipative functional $F$ . It states that for any compact $K\subset\mathbb{R}^{d}$ and $t_{0}<T$ , there exists a $C>0$ such that $|F(t,\omega)|\leqslant C$ for all $t\leqslant t_{0}$ and all $\omega\in\Omega$ satisfying $\omega([0,t])\subset K$ . Suppose a non-anticipative functional $F$ is horizontally differentiable and vertically twice-differentiable for all $(t,\omega)\in\Lambda_{T}^{d}$ , and $\mathcal{D}F$ , $\nabla_{\omega}F$ and $\nabla_{\omega}^{2}F$ satisfy the boundedness condition above. In addition, $F$ , $\nabla_{\omega}F$ and $\nabla_{\omega}^{2}F$ are left-continuous, and $\mathcal{D}F$ is continuous for all $(t,\omega)\in\Lambda_{T}^{d}$ . Then we call $F$ a regular functional.

Suppose the canonical process $X$ on $\Omega$ is a continuous semimartingale and $F:\Lambda_{T}^{d}\to\mathbb{R}$ is a regular functional. The $\mathbb{R}$ -valued process $(Y(t))_{t\in[0,T]}$ , defined by $Y(t)=F(t,\cdot)$ for all $t\in[0,T]$ , follows the functional Ito formula P-a.s. (Bally et al. 2016, pp. 190–191)

[TABLE]

If we further impose the constraint that $\int_{0}^{T}\xi(t)dX(t)=0$ for all bounded predictable processes $\xi$ satisfying $\int_{0}^{T}\xi(t)dt=0$ , then the canonical process $X$ is a strong solution to the SDE (Revuz and Yor, 2013)

[TABLE]

where $(W(t))_{t\in[0,T]}$ is a $\mathbb{R}^{d}$ -valued standard Wiener process on the underlying filtered probability space (assuming its existence). $(\mu(t))_{t\in[0,T]}$ is a $\mathbb{R}^{d}$ -valued predictable process, and $(\sigma(t))_{t\in[0,T]}$ is a $\mathbb{R}^{d^{2}}$ -valued predictable process. We may identify their elements, say $(\mu_{i}(t))_{t\in[0,T]}$ and $(\sigma_{ij}(t))_{t\in[0,T]}$ , with non-anticipative functionals. The SDE Eq. 5.1 may be regarded as a path-dependent generalisation of the renowned Ito diffusion process. The existence and uniqueness of its solutions have been given in the literature by imposing various conditions (e.g. boundedness and Lipschitz properties, see Bally et al. (2016)). Now if $X$ satisfies Eq. 5.1 P-a.s., then it follows from the functional Ito formula that the process $Y$ is a strong solution to the SDE

[TABLE]

Note that the square of $\sigma(t)$ is in the sense of matrix multiplication, i.e. $\sigma(t)^{2}=\sigma(t)\sigma(t)^{T}$ . For simplicity we may define a nonlinear differential operator $\mathcal{A}$ that sends a regular functional to a non-anticipative functional by

[TABLE]

Then the process $Y$ , defined by $Y(t)=F(t,\cdot)$ , is a strong solution to

[TABLE]

Suppose $Y$ is a P-martingale, then the regular functional $F$ satisfies $\mathcal{A}F=0$ P-a.s. Applying this property, we may convert the martingale statement in Theorem 4.4 to an analytical statement. This is formulated in the following corollary.

Corollary 5.1.

Given $\vartheta\in(0,\infty)$ , suppose there exist $c\in\mathbb{R}$ and $z:\mathbb{R}\to\mathbb{R}_{+}$ defined in Theorem 4.4. If the canonical process $X$ satisfies Eq. 5.1 for some $\mathbb{R}^{d}$ -valued predictable process $(\mu(t))_{t\in[0,T]}$ and $\mathbb{R}^{d^{2}}$ -valued predictable process $(\sigma(t))_{t\in[0,T]}$ , then the value process, $U$ , the worst-case risk, $V$ , and the cost process, $\eta$ , satisfy the following equations

[TABLE]

for all $t\in[0,T]$ and all $\omega\in\Omega$ such that $Z(t)(\omega)>0$ , where $Z$ , $M$ and $W$ are identified by the solutions to the equation $\mathcal{A}F=0$ (P-a.s.), subject to their respective terminal conditions:

[TABLE]

In practice, we are more interested in the type of $f$ -divergence that gives the constant function $x\mapsto xf^{\prime\prime}(x)$ . Such $f$ -divergence allows us to solve $U$ and $V$ directly using path-dependent partial differential equations.

Proposition 5.2.

Suppose there exists $d\in(0,\infty)$ such that $xf^{\prime\prime}(x)=d$ for all $x\in\mathbb{R}_{+}$ , and the function $f^{\prime}$ diverges at infinity. In addition, the inverse function, $g:\mathsf{Im}f^{\prime}\to(0,\infty)$ , provides a twice-differentiable function $\mathbb{R}\ni x\mapsto g(x)\mathbf{1}_{x\in\mathsf{Im}f^{\prime}}$ . The value process and the worst-case risk, identified with the regular functionals $U_{t}:=U(t,\cdot)$ and $V_{t}:=V(t,\cdot)$ , solve the following path-dependent partial differential equations $\textsf{{Q}}_{Z}$ -a.s.

[TABLE]

subject to the terminal condition $U_{T}=V_{T}=\ell(T,\cdot)$ . The cost process is given by $\eta_{t}=\vartheta(V_{t}-U_{t})$ for all $t\in[0,T]$ . Defining $I_{c}\coloneqq\{\vartheta^{-1}y+c\,|\,y\in\mathsf{Im}f^{\prime}\}$ , the solution exists if $g\bigl{(}\vartheta(\ell(T,\cdot)-c)\bigr{)}\mathbf{1}_{\ell(T,\cdot)\in I_{c}}$ is integrable for every $c\in\mathbb{R}$ .

Proof.

Footnote 12. We have shown in the proof of Proposition 4.5 that the divergence of $f^{\prime}$ at infinity implies that $\mathsf{Im}f^{\prime}$ is an open interval of the form $(a,\infty)$ . Then

$\displaystyle U(t,\omega)=\vartheta^{-1}{f^{\prime}(Z(t)(\omega))}+c>\vartheta^{-1}a+c,\qquad\text{i.e. }U(t,\omega)\in I_{c},$

for all $\omega\in\{\omega\in\Omega\,|\,Z(t)(\omega)>0\}$ . On the other hand, for all $\omega\in\{\omega\in\Omega\,|\,Z(t)(\omega)=0\}$ ,

$\displaystyle 0=Z(t)(\omega)=$ $\displaystyle\,\textsf{E}^{\textsf{{Q}}_{Z}}(Z(T)\mathbf{1}_{\ell(T,\cdot)\leqslant\vartheta^{-1}a+c}\,|\,\mathscr{F}_{t}^{0})(\omega)+\textsf{E}^{\textsf{{Q}}_{Z}}(Z(T)\mathbf{1}_{\ell(T,\cdot)>\vartheta^{-1}a+c}\,|\,\mathscr{F}_{t}^{0})(\omega)$

$\displaystyle\geqslant$ $\displaystyle\,\textsf{E}^{\textsf{{Q}}_{Z}}(g\left(\vartheta\left(\ell(T,\cdot)-c\right)\right)\mathbf{1}_{\ell(T,\cdot)>\vartheta^{-1}a+c}\,|\,\mathscr{F}_{t}^{0})(\omega)$

This implies that $\textsf{E}^{\textsf{{Q}}_{Z}}(\mathbf{1}_{\ell(T,\cdot)>\vartheta^{-1}a+c}\,|\,\mathscr{F}_{t}^{0})(\omega)=0$ by virtue of $\mathsf{Im}g=(0,\infty)$ , which gives

$\displaystyle U(t,\omega)=\textsf{E}^{\textsf{{Q}}_{Z}}(\ell(T,\cdot)\,|\,\mathscr{F}_{t}^{0})(\omega)=\textsf{E}^{\textsf{{Q}}_{Z}}(\ell(T,\cdot)\mathbf{1}_{\ell(T,\cdot)\leqslant\vartheta^{-1}a+c}\,|\,\mathscr{F}_{t}^{0})(\omega)\leqslant\vartheta^{-1}a+c,\qquad\text{i.e. }U(t,\omega)\notin I_{c}.$

(End of footnote 12.)

It follows from Corollary 4.6 (see footnote 12) that

[TABLE]

for all $t\in[0,T]$ , where $\mathfrak{g}$ denotes the twice-differentiable function $\mathbb{R}\ni x\mapsto g(x)\mathbf{1}_{x\in\mathsf{Im}f^{\prime}}$ . Since $(Z(t))_{t\in[0,T]}$ is a P-martingale that can be identified with a solution to the equation $\mathcal{A}F=0$ (P-a.s.), we have

[TABLE]

For all $\omega\in\Omega$ such that $U(t,\omega)\in I_{c}$, the equation is equivalent to [Footnote 13: For all $x\in(a,\infty)$, $\mathfrak{g}^{\prime}(x)={g}^{\prime}(x)>0$ (due to the convexity of $f$), and for all $x\in(-\infty,a\,]$,

$\displaystyle\mathfrak{g}^{\prime}(x)=\lim_{h\to 0^{-}}\frac{\mathfrak{g}(x)-\mathfrak{g}(x-h)}{h}=0$

Therefore, $\mathfrak{g}(x)=g(x)\mathbf{1}_{x\in(a,\infty)}$ implies that $\mathfrak{g}^{\prime}(x)=g^{\prime}(x)\mathbf{1}_{x\in(a,\infty)}$, which in turn implies $\mathfrak{g}^{\prime\prime}(x)=g^{\prime\prime}(x)\mathbf{1}_{x\in(a,\infty)}$. For all $\omega\in\{\omega\in\Omega\,|\,U(t,\omega)\in I_{c}\}$, $\vartheta\left(U(t,\omega)-c\right)\in(a,\infty)$ and thus

$\displaystyle\mathfrak{g}^{\prime}\bigl{(}\vartheta\left(U(t,\omega)-c\right)\bigr{)}={g}^{\prime}\bigl{(}\vartheta\left(U(t,\omega)-c\right)\bigr{)}>0\qquad\text{and}\qquad\mathfrak{g}^{\prime\prime}\bigl{(}\vartheta\left(U(t,\omega)-c\right)\bigr{)}={g}^{\prime\prime}\bigl{(}\vartheta\left(U(t,\omega)-c\right)\bigr{)}.$ End of Footnote 13.]

[TABLE]

Noticing that $\{\omega\in\Omega\,|\,U(t,\omega)\in I_{c}\}$ has measure one under $\textsf{{Q}}_{Z}$ [Footnote 14: $\textsf{{Q}}_{Z}(U(t,\cdot)\in I_{c})=\textsf{{Q}}_{Z}(Z(t)>0)=\textsf{E}(Z(T)\mathbf{1}_{Z(t)>0})=\textsf{E}(Z(t)\mathbf{1}_{Z(t)>0})=\textsf{E}(Z(t))=1$], the equation above holds $\textsf{{Q}}_{Z}$-a.s.

It follows from Eq. 5.3 that the P-martingale $(Z(t))_{t\in[0,T]}$ solves the SDE

[TABLE]

We may define a process $(Y(t))_{t\in[0,T]}$ by the stochastic integral

[TABLE]

for all $t\in[0,T]$ . This transforms the SDE above into

[TABLE]

suggesting that the process $\left(Z(t)\right)_{t\in[0,T]}$ is a Doléans-Dade exponential, i.e. $Z=\mathcal{E}\left(Y\right)$. Note that the SDE above only ensures that $\left(Z(t)\right)_{t\in[0,T]}$ is a local martingale. To guarantee that it is indeed a martingale, we assume Novikov's condition,

[TABLE]

According to the Girsanov theorem, the Brownian motion under $\textsf{{Q}}_{Z}$ is given by adding an extra drift term. Noticing that $U_{t}\in I_{c}$ $\textsf{{Q}}_{Z}$ -a.s., the Girsanov theorem transforms the SDE of the canonical process under P (Eq. 5.1) to the following SDE (in the sense that $\left(X(t)\right)_{t\in[0,T]}$ is a strong solution of the following under $\textsf{{Q}}_{Z}$ ),

[TABLE]
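The change of measure used in the last few steps can be checked numerically in a toy setting. The sketch below is only an illustration under stated assumptions: it takes a one-dimensional Brownian motion, a hypothetical bounded deterministic integrand `theta_t` (so Novikov's condition holds trivially), and sets $Y(t)=\int_0^t\theta_s\,dW(s)$. It then estimates $\textsf{E}(Z(T))$, which should be close to one for a true martingale, and $\textsf{E}(Z(T)W(T))$, which by the Girsanov theorem should be close to the extra drift $\int_0^T\theta_s\,ds$ accumulated under $\textsf{{Q}}_{Z}$. None of the names or parameter values come from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)
n_paths, n_steps, T = 100_000, 50, 1.0
dt = T / n_steps
grid = np.arange(n_steps) * dt

# Hypothetical bounded, deterministic integrand (an assumption for illustration).
theta_t = 0.5 * np.cos(grid)

# Brownian increments under the nominal measure P.
dW = rng.normal(0.0, np.sqrt(dt), size=(n_paths, n_steps))

Y_T = dW @ theta_t                      # Y(T) = sum_k theta_k * dW_k
qv_T = np.sum(theta_t**2) * dt          # quadratic variation <Y>_T
Z_T = np.exp(Y_T - 0.5 * qv_T)          # Doleans-Dade exponential E(Y)(T)
W_T = dW.sum(axis=1)                    # terminal value of the P-Brownian motion

mean_Z = Z_T.mean()                     # martingale check: should be near 1
drift_under_Q = np.mean(Z_T * W_T)      # E^Q[W(T)] estimated by reweighting
expected_drift = np.sum(theta_t) * dt   # Girsanov drift: integral of theta

print(mean_Z, drift_under_Q, expected_drift)
```

Reweighting P-samples by `Z_T` is used here instead of simulating under $\textsf{{Q}}_{Z}$ directly, so the same paths serve both checks.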

The functional Itô formula, Eqs. 5.2-5.3, applies under the alternative measure $\textsf{{Q}}_{Z}$ as well. Following the definition of the operator $\mathcal{A}$, we have

[TABLE]

for some regular functional $F:\Lambda_{T}^{d}\to\mathbb{R}$ and all $t\in[0,T]$ . The worst-case model risk, $\textsf{E}^{\textsf{{Q}}_{Z}}\left(\ell(T,\cdot)\,|\,\mathscr{F}_{t}\right)$ , is a $\textsf{{Q}}_{Z}$ -martingale. Identified with the regular functional $V$ , it satisfies the following equation $\textsf{{Q}}_{Z}$ -a.s.

[TABLE]

Combined with the terminal condition $U_{T}=V_{T}=\ell(T,\cdot)$ , Eq. 5.8 and Eq. 5.7 provide the path-dependent partial differential equations that govern the value process and the worst-case risk, respectively. It follows from Proposition 4.5 that the solution indeed exists if $g\bigl{(}\vartheta(\ell(T,\cdot)-c)\bigr{)}\mathbf{1}_{\ell(T,\cdot)\in I_{c}}$ is integrable for every $c\in\mathbb{R}$ . ∎

The Kullback-Leibler divergence makes Proposition 5.2 particularly convenient to apply in practice. The function $f^{\prime}(x)=\ln x+1$ diverges at $\infty$, and its inverse $g:\mathbb{R}\to(0,\infty)$ given by $g(x)=e^{x-1}$ is twice-differentiable. In addition, the worst-case martingale density $Z(T)=e^{\vartheta(\ell(T,\cdot)-c)-1}>0$ defines a measure $\textsf{{Q}}_{Z}$ that is equivalent to the reference measure P. Combining Corollary 4.7 with Proposition 5.2, and substituting $g(x)=e^{x-1}$ into Eq. 5.4, we get the following corollary for the Kullback-Leibler divergence.

Corollary 5.3.

Under the Kullback-Leibler divergence, suppose $\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\right)<\infty$. Then there exists a unique solution to the problem of model risk quantification. The value process and the worst-case risk, identified with regular functionals $U_{t}:=U(t,\cdot)$ and $V_{t}:=V(t,\cdot)$, solve the following path-dependent partial differential equations P-a.s.

[TABLE]

subject to the terminal condition $U_{T}=V_{T}=\ell(T,\cdot)$. The cost process is $\eta_{t}=\vartheta(V_{t}-U_{t})$ for all $t\in[0,T]$.
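At $t=0$ the Kullback-Leibler case can be evaluated directly by Monte Carlo, which gives a useful sanity check before attempting the path-dependent equations. Assuming a deterministic initial state (so that $Z(0)=1$) and $\vartheta>0$, normalising $Z(T)=e^{\vartheta(\ell(T,\cdot)-c)-1}$ to have unit expectation and using $U(t)=\vartheta^{-1}f^{\prime}(Z(t))+c$ gives $U_{0}=\vartheta^{-1}\ln\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\right)$, $V_{0}=\textsf{E}\left(\ell(T,\cdot)e^{\vartheta\ell(T,\cdot)}\right)/\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\right)$ and $\eta_{0}=\vartheta(V_{0}-U_{0})$. The sketch below estimates these three quantities from simulated losses. The function name `kl_worst_case`, the choice of a running-maximum loss on a discretised Brownian path, and all parameter values are assumptions made for illustration, not part of the paper.

```python
import numpy as np

def kl_worst_case(losses, theta):
    """Estimate (U_0, V_0, eta_0) under the Kullback-Leibler divergence.

    losses : samples of the cumulative loss l(T, .) under the nominal measure P
    theta  : the multiplier (assumed positive)
    """
    losses = np.asarray(losses, dtype=float)
    a = theta * losses
    shift = a.max()                      # shift exponents for numerical stability
    w = np.exp(a - shift)
    log_mgf = shift + np.log(w.mean())   # ln E[exp(theta * loss)]
    z = w / w.mean()                     # samples of Z(T), empirical mean 1
    u0 = log_mgf / theta                 # value process at time 0
    v0 = np.mean(z * losses)             # worst-case expected loss
    eta0 = theta * (v0 - u0)             # relative entropy of Q_Z w.r.t. P
    return u0, v0, eta0

# Illustrative path-dependent loss: running maximum of a Brownian path on [0, T].
rng = np.random.default_rng(0)
n_paths, n_steps, T = 100_000, 50, 1.0
dW = rng.normal(0.0, np.sqrt(T / n_steps), size=(n_paths, n_steps))
paths = np.cumsum(dW, axis=1)
running_max = np.maximum(paths.max(axis=1), 0.0)   # include the start X(0) = 0

u0, v0, eta0 = kl_worst_case(running_max, theta=0.5)
print(u0, v0, eta0)
```

For a loss equal to a standard normal variable, the same estimator should return values near $\vartheta/2$, $\vartheta$ and $\vartheta^{2}/2$, which is a convenient closed-form check.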

In practice, the path-dependent partial differential equations, Eq. 5.8, are generally difficult to solve. However, we may convert Eq. 5.8 into standard (non-path-dependent) non-linear partial differential equations for a special type of path dependency, formulated by

[TABLE]

for some functions $h:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $h_{i}:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}\ (i=1,2)$. We further restrict the canonical process $X$ to the class of Itô diffusions. This means that the process is Markovian, and there exist functions $\mu:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d}$ and $\sigma:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}^{d^{2}}$ such that $\mu_{t}=\mu(t,X(t))$ and $\sigma_{t}=\sigma(t,X(t))$. The path-dependent partial differential equations, Eq. 5.8, then reduce to standard partial differential equations.

Corollary 5.4.

Under the Kullback-Leibler divergence, suppose that $\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\right)<\infty$, that the canonical process $(X(t))_{t\in[0,T]}$ solves the SDE $dX(t)=\mu(t,X(t))dt+\sigma(t,X(t))dW(t)$, and that the cumulative loss $\ell(T,\cdot)$ takes the form of Eq. 5.9. If there exists a function $\tilde{u}:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}$ that solves the partial differential equation

[TABLE]

and a function $\tilde{v}:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}$ that solves the partial differential equation

[TABLE]

subject to the terminal condition $\tilde{u}(T,\cdot)=\tilde{v}(T,\cdot)=h_{0}(T,\cdot)$ , then the value process, the worst-case risk and the cost process, identified with regular functionals, follow

[TABLE]

and $\eta_{t}=\vartheta\bigl{(}\tilde{v}(t,X(t))-\tilde{u}(t,X(t))\bigr{)}$ for all $t\in[0,T]$ .

Proof.

We first define regular functionals $\tilde{U},\tilde{V}:\Lambda_{T}^{d}\to\mathbb{R}$ by

[TABLE]

The horizontal and vertical derivatives can be derived from Eq. 5.12,

[TABLE]

Substituting the equations above into Eq. 5.8, we transform Eq. 5.8 to

[TABLE]

and

[TABLE]

If there exists a function $\tilde{u}:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}$ that solves the partial differential equation

[TABLE]

and a function $\tilde{v}:[0,T]\times\mathbb{R}^{d}\to\mathbb{R}$ that solves

[TABLE]

then the regular functionals defined by $\tilde{U}_{t}:=\tilde{u}(t,X(t))$ and $\tilde{V}_{t}:=\tilde{v}(t,X(t))$, for all $t\in[0,T]$, satisfy Eqs. 5.13 and 5.14. The terminal condition $\tilde{U}_{T}=\tilde{V}_{T}=h_{0}(T,X(T))$ is satisfied if $\tilde{u}(T,x)=\tilde{v}(T,x)=h_{0}(T,x)$ holds for all $x\in\mathbb{R}^{d}$. ∎

Note that Eqs. 5.10-5.11 are non-linear parabolic partial differential equations and in general have to be solved numerically.
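As a rough indication of what such a numerical solution might look like, the sketch below treats only the simplest special case: one dimension ($d=1$) and a loss that depends on the terminal state alone, $\ell(T,\cdot)=h_{0}(T,X(T))$. In that case $U(t)=\vartheta^{-1}\ln\textsf{E}\left(e^{\vartheta\ell(T,\cdot)}\,|\,\mathscr{F}_{t}\right)$, and the substitution $w=e^{\vartheta\tilde{u}}$ turns the linear backward Kolmogorov equation for $w$ into $\partial_{t}\tilde{u}+\mu\,\partial_{x}\tilde{u}+\tfrac{1}{2}\sigma^{2}\bigl(\partial_{xx}\tilde{u}+\vartheta(\partial_{x}\tilde{u})^{2}\bigr)=0$, while the Girsanov drift gives $\partial_{t}\tilde{v}+(\mu+\vartheta\sigma^{2}\partial_{x}\tilde{u})\,\partial_{x}\tilde{v}+\tfrac{1}{2}\sigma^{2}\partial_{xx}\tilde{v}=0$. These two equations are written here as assumptions for this special case and may differ in form from Eqs. 5.10-5.11. The explicit finite-difference scheme, the linear-extrapolation boundary treatment and the names `solve_kl_pdes`, `_derivs` and `_linear_edges` are likewise illustrative choices.

```python
import numpy as np

def _derivs(w, dx):
    """Central first and second differences; edge entries are left at zero."""
    wx, wxx = np.zeros_like(w), np.zeros_like(w)
    wx[1:-1] = (w[2:] - w[:-2]) / (2.0 * dx)
    wxx[1:-1] = (w[2:] - 2.0 * w[1:-1] + w[:-2]) / dx**2
    return wx, wxx

def _linear_edges(w):
    """Assumed boundary treatment: extrapolate linearly at both ends."""
    w[0] = 2.0 * w[1] - w[2]
    w[-1] = 2.0 * w[-2] - w[-3]
    return w

def solve_kl_pdes(h0, mu, sigma, theta, T, x, n_steps):
    """Step backwards from t = T to t = 0 with an explicit Euler scheme.

    Returns grid values of u(0, x) and v(0, x). The scheme is only stable
    when dt is small relative to dx**2 / sigma**2.
    """
    dx, dt = x[1] - x[0], T / n_steps
    u = np.array(h0(x), dtype=float)     # terminal condition u(T, x) = h0(x)
    v = u.copy()                         # terminal condition v(T, x) = h0(x)
    for k in range(n_steps, 0, -1):
        t = k * dt
        m, s2 = mu(t, x), sigma(t, x) ** 2
        ux, uxx = _derivs(u, dx)
        vx, vxx = _derivs(v, dx)
        u_new = u + dt * (m * ux + 0.5 * s2 * (uxx + theta * ux**2))
        v_new = v + dt * ((m + theta * s2 * ux) * vx + 0.5 * s2 * vxx)
        u, v = _linear_edges(u_new), _linear_edges(v_new)
    return u, v

# Toy check: dX = vol dW and h0(x) = x, where u and v are known in closed form.
x = np.linspace(-2.0, 2.0, 81)
theta, vol, T = 0.5, 0.2, 1.0
u0, v0 = solve_kl_pdes(
    h0=lambda y: y,
    mu=lambda t, y: np.zeros_like(y),
    sigma=lambda t, y: vol * np.ones_like(y),
    theta=theta, T=T, x=x, n_steps=100,
)
eta0 = theta * (v0 - u0)                 # cost process at time 0 on the grid
```

In this toy case the exact solutions are $\tilde{u}(0,x)=x+\tfrac{1}{2}\vartheta\sigma^{2}T$ and $\tilde{v}(0,x)=x+\vartheta\sigma^{2}T$, so $\eta_{0}=\tfrac{1}{2}\vartheta^{2}\sigma^{2}T$, which the scheme should reproduce on the grid. An implicit or Crank-Nicolson scheme would be a more robust choice for less regular terminal conditions.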

6. Concluding Remarks

This paper provides a theoretical framework for formulating and solving the problem of model risk quantification in a path-dependent setting. We need several ingredients to formulate the problem, including a terminal time $T$, a (path-dependent) loss function $\ell$, a nominal model (i.e. a canonical process $(X_{t})_{t\in[0,T]}$ under a nominal measure P) and some $f$-divergence. The non-parametric nature of this approach relies on the $f$-divergence to restrict the set of proper alternative models. This is, however, only applicable to measures that are absolutely continuous w.r.t. the nominal measure. More general distance measures, such as the Wasserstein metric, may be applied instead (Feng and Schlögl, 2018). Despite this limitation, $f$-divergences, especially the Kullback-Leibler divergence, are the most tractable and yield simple results for path-dependent problems.

Bibliography

Ali, S. M. and S. D. Silvey (1966). A general class of coefficients of divergence of one distribution from another. Journal of the Royal Statistical Society. Series B (Methodological), 131–142.
Artzner, P., F. Delbaen, J.-M. Eber, and D. Heath (1999). Coherent measures of risk. Mathematical Finance 9 (3), 203–228.
Bally, V., L. Caramellino, and R. Cont (2016). Functional Kolmogorov equations. In Stochastic Integration by Parts and Functional Itô Calculus, pp. 183–207. Springer.
Bannör, K. F. and M. Scherer (2013). Capturing parameter risk with convex risk measures. European Actuarial Journal 3, 97–132.
Basseville, M. (2013). Divergence measures for statistical data processing—An annotated bibliography. 93 (4), 621–633.
Boucher, C. M., J. Danielsson, P. S. Kouontchou, and B. B. Maillet (2014). Risk models–at–risk. Journal of Banking & Finance 44, 72–92.
Branger, N. and C. Schlag (2004). Model risk: A conceptual framework for risk measurement and hedging.
Cont, R. (2006). Model uncertainty and its impact on the pricing of derivative instruments. Mathematical Finance 16 (3), 519–547.
