Optimal execution with dynamic risk adjustment

Xue Cheng; Marina Di Giacinto; and Tai-Ho Wang

arXiv:1901.00617·q-fin.MF·July 16, 2019·J. Oper. Res. Soc.


Open Access

TL;DR

This paper develops a dynamic risk-adjusted optimal liquidation model in financial markets, deriving explicit solutions under quadratic risk measures to quantify the impact of risk preferences on trading strategies.

Contribution

It introduces a novel continuous-time stochastic control framework incorporating dynamic risk measures for optimal liquidation, with explicit solutions under quadratic specifications.

Findings

01

Closed-form solutions for optimal liquidation policies.

02

Quantification of risk-adjustment impact on P&L.

03

Framework based on g-conditional risk measures.

Abstract

This paper considers the problem of optimal liquidation of a position in a risky security in a financial market, where price evolution is risky and trades have an impact on price as well as uncertainty in order fills. The problem is formulated as a continuous time stochastic optimal control problem aiming at maximizing a generalized risk-adjusted profit and loss function. The expression of the risk adjustment is derived from the general theory of dynamic risk measures and is selected in the class of $g$-conditional risk measures. The resulting theoretical framework is nonclassical since the target function depends on backward components. We show that, under a quadratic specification of the driver of a backward stochastic differential equation, it is possible to find a closed form solution and an explicit expression of the optimal liquidation policies. In this way it is immediate…

Equations

$$\begin{cases} dX(t) = -v(t)\,dt + m\,dB_{1}(t), & t\in[0,T],\\ X(0) = x_{0} > 0, \end{cases}$$

$$\begin{cases} dS(t) = \gamma\,dX(t) + \sigma\,dB_{2}(t), & t\in[0,T],\\ S(0) = s_{0} > 0, \end{cases}$$

$$\begin{cases} dS(t) = -\gamma v(t)\,dt + \gamma m\,dB_{1}(t) + \sigma\,dB_{2}(t), & t\in[0,T],\\ S(0) = s_{0} > 0. \end{cases}$$

$$\widetilde{S}(t) = S(t) - \eta v(t), \qquad t\in[0,T].$$

$$\Pi^{0}(t) := X(t)\left(S(t)-S(0)\right) + \int_{0}^{t}\left(S(0)-\widetilde{S}(u)\right)dX(u).$$

$$\Pi^{0}(t) = \frac{\gamma}{2}\left(X^{2}(t)-x_{0}^{2}\right) + \frac{\gamma}{2}m^{2}t - \eta\int_{0}^{t}v^{2}(u)\,du + \eta m\int_{0}^{t}v(u)\,dB_{1}(u) + \sigma\int_{0}^{t}X(u)\,dB_{2}(u),$$

$$\Pi^{0}(t) = \gamma m^{2}t - \int_{0}^{t}\left(\gamma v(u)X(u)+\eta v^{2}(u)\right)du + m\int_{0}^{t}\left(\gamma X(u)+\eta v(u)\right)dB_{1}(u) + \sigma\int_{0}^{t}X(u)\,dB_{2}(u).$$

$$f(x) := \beta x^{2}, \qquad \beta>0. \tag{2.4}$$

$$h(v(t)) := \lambda_{1}\left(v(t)-\overline{v}\right)^{2}, \qquad t\in[0,T]. \tag{2.5}$$

$$\mathcal{R}(t,\xi(T)) := \frac{1}{\lambda_{2}}\ln\mathbb{E}\left[\exp\left(-\lambda_{2}\xi(T)\right)\mid\mathcal{F}_{t}\right], \qquad t\in[0,T]$$

$$g(Z_{1}(t),Z_{2}(t)) := \frac{\lambda_{2}}{2}\left(Z_{1}^{2}(t)+Z_{2}^{2}(t)\right), \qquad t\in[0,T]$$

$$\begin{cases} d\mathcal{R}(t,\xi(T)) = -\dfrac{\lambda_{2}}{2}\left(Z_{1}^{2}(t)+Z_{2}^{2}(t)\right)dt + Z_{1}(t)\,dB_{1}(t) + Z_{2}(t)\,dB_{2}(t), & t\in[0,T],\\ \mathcal{R}(T,\xi(T)) = -\xi(T). \end{cases} \tag{3.1}$$

$$\mathcal{R}(t,\mathcal{R}(\tau,\xi(T))) = \mathcal{R}(t,\xi(T)) \qquad \mathbb{P}\text{-a.s.}$$

$$\Pi^{t}(T) := \Pi^{0}(T) - \Pi^{0}(t).$$

$$\begin{cases} -dY(u) := d\Pi^{u}(T) - d\mathcal{R}(u,f(X(T))) - d\left(\displaystyle\int_{u}^{T}h(v(r))\,dr\right), & u\in[t,T],\\ Y(T) := -f(X(T)) = -\beta X^{2}(T), \end{cases}$$

$$\begin{cases} dX(u) = -v(u)\,du + m\,dB_{1}(u), & u\in[t,T],\\ -dY(u) = \widetilde{g}(X(u),\widetilde{Z}_{1}(u),\widetilde{Z}_{2}(u),v(u))\,du - \widetilde{Z}_{1}(u)\,dB_{1}(u) - \widetilde{Z}_{2}(u)\,dB_{2}(u), & u\in[t,T],\\ X(t) = x, \quad Y(T) = -\beta X^{2}(T), \end{cases} \tag{4.1}$$

$$\widetilde{g}(X(u),\widetilde{Z}_{1}(u),\widetilde{Z}_{2}(u),v(u)) := (\eta+\lambda_{1})v^{2}(u) + \frac{\lambda_{2}}{2}\left(Z_{1}^{2}(u)+Z_{2}^{2}(u)\right) + \left(\gamma X(u)-2\lambda_{1}\overline{v}\right)v(u) - \left(\gamma m^{2}-\lambda_{1}\overline{v}^{2}\right), \qquad u\in[t,T], \tag{4.2}$$

$$\begin{pmatrix}\widetilde{Z}_{1}\\ \widetilde{Z}_{2}\end{pmatrix}(u) = \begin{pmatrix} Z_{1}(u) + \gamma m X(u) + m\eta v(u)\\ Z_{2}(u) + \sigma X(u)\end{pmatrix}, \qquad u\in[t,T].$$

$$\mathcal{V}_{\text{ad}}[t,T] := \left\{v\colon[t,T]\times\Omega\rightarrow\mathbb{R} \mid v\in\mathcal{H}^{2}_{\mathbb{F}^{t}}(t,T;\mathbb{R})\right\}.$$

$$J(t,x;v(\cdot)) := Y(t;t,x;v(\cdot)),$$

$$\text{maximize } J(t,x;v(\cdot)) \text{ over } v(\cdot)\in\mathcal{V}_{\text{ad}}[t,T],$$

$$\begin{cases} W(t,x) := \sup\limits_{v(\cdot)\in\mathcal{V}_{\text{ad}}[t,T]} J(t,x;v(\cdot)), & \forall (t,x)\in[0,T]\times\mathbb{R},\\ W(T,x) := -\beta x^{2}, & \forall x\in\mathbb{R}. \end{cases}$$

$$\begin{cases} w_{t}(t,x) + \sup\limits_{v\in\mathbb{R}}\mathcal{H}_{cv}(x,w_{x},w_{xx};v) = 0, & (t,x)\in[0,T]\times\mathbb{R},\\ w(T,x) = -\beta x^{2}, \end{cases} \tag{4.6}$$

$$\mathcal{H}_{cv}(x,q,Q;v) := \operatorname{tr}\left[\Sigma\Sigma^{\top}Q\right] + \langle b(v),q\rangle - \widetilde{g}(x,\Sigma^{\top}q,v), \qquad (x,q,Q)\in\mathbb{R}\times\mathbb{R}\times\mathbb{R}\times\mathcal{S}^{2},$$

$$\Sigma := \begin{pmatrix} m & \gamma m\\ 0 & \sigma\end{pmatrix}, \qquad b(v) := \begin{pmatrix} -v\\ -\gamma v\end{pmatrix},$$

$$v^{\star}(x,q,Q) = -\frac{q+\gamma x-2\lambda_{1}\overline{v}}{2(\eta+\lambda_{1})}.$$

$$w_{t} + \frac{1}{2}m^{2}w_{xx} - \frac{1}{2}\lambda_{2}m^{2}w_{x}^{2} + \gamma m^{2} - \lambda_{1}\overline{v}^{2} + \frac{1}{4(\eta+\lambda_{1})}\left(w_{x}+\gamma x-2\lambda_{1}\overline{v}\right)^{2} = 0,$$

$$w(T,x) = -\beta x^{2}.$$

$$\kappa := 2m^{2}(\eta+\lambda_{1}).$$

$$\begin{cases} \dot{a}(t) = \left[m^{2}\lambda_{2}-\dfrac{1}{2(\eta+\lambda_{1})}\right]a^{2}(t) - 2m^{2}\lambda_{2}\gamma\,a(t) + m^{2}\lambda_{2}\gamma^{2}, & a(T) = -2\beta+\gamma,\\[2mm] \dot{b}(t) = \left[m^{2}\lambda_{2}-\dfrac{1}{2(\eta+\lambda_{1})}\right]a(t)b(t) - m^{2}\lambda_{2}\gamma\,b(t) + \dfrac{\lambda_{1}\overline{v}}{\eta+\lambda_{1}}a(t), & b(T) = 0,\\[2mm] \dot{c}(t) = \dfrac{1}{2}\left[\lambda_{2}m^{2}-\dfrac{1}{2(\eta+\lambda_{1})}\right]b^{2}(t) + \dfrac{\lambda_{1}\overline{v}}{\eta+\lambda_{1}}b(t) - \dfrac{1}{2}m^{2}a(t) - \dfrac{1}{2}m^{2}\gamma + \lambda_{1}\overline{v}^{2} - \dfrac{\lambda_{1}^{2}\overline{v}^{2}}{\eta+\lambda_{1}}, & c(T) = 0, \end{cases}$$


Taxonomy

Topics: Stochastic processes and financial applications · Risk and Portfolio Optimization · Economic theories and models

Full text

Optimal execution with dynamic risk adjustment

Xue Cheng, Marina Di Giacinto and Tai-Ho Wang

Xue Cheng

Department of Mathematical Finance

Peking University

Beijing, China


Marina Di Giacinto

Dipartimento di Economia e Giurisprudenza

Università degli studi di Cassino e del Lazio Meridionale, Cassino (FR), Italy

and

Dipartimento di Matematica per le Scienze economiche, finanziarie ed attuariali

Università Cattolica del Sacro Cuore, Milano, Italy


Tai-Ho Wang

Department of Mathematics

Baruch College, The City University of New York

1 Bernard Baruch Way, New York, NY 10010

Abstract.

This paper considers the problem of optimal liquidation of a position in a risky security in a financial market, where price evolution is risky and trades have an impact on price as well as uncertainty in order fills. The problem is formulated as a continuous time stochastic optimal control problem aiming at maximizing a generalized risk-adjusted profit and loss function. The expression of the risk adjustment is derived from the general theory of dynamic risk measures and is selected in the class of $g$-conditional risk measures. The resulting theoretical framework is nonclassical since the target function depends on backward components. We show that, under a quadratic specification of the driver of a backward stochastic differential equation, it is possible to find a closed form solution and an explicit expression of the optimal liquidation policies. This makes it straightforward to quantify the impact of risk adjustment on the profit and loss and on the expression of the optimal liquidation policies.

1. Introduction

Trading algorithms are nowadays widespread among financial agents. They are typically used by brokers for the execution of large orders. Unlike in standard inventory models, once a given market move has been decided, the success of the market operation crucially depends on the strategy chosen to execute the orders in the market. In fact, in addition to price fluctuation risk, optimal execution must also take into account the so-called market impact effect, i.e., the feedback effect that the order execution may have on the execution price. Practitioners have developed interesting models to quantify price impact and its implications, the simplest being the optimization of the Volume Weighted Average Price (VWAP) during the execution. A general introduction to the modeling problems and empirical evidence on algorithmic and high frequency trading can be found in [13]. More sophisticated models to quantify and manage price impact include [7], [3] in a discrete time model and [2] as its continuous time variant, and transient impact models such as [24], [1], and [20].

In order to restate the problem of optimal execution as a variational problem, a crucial decision without a simple solution is the definition of the objective function to be optimized. While it is obvious that a risk neutral trader is willing to maximize the final profit and loss, the choice of the risk function that quantifies the tradeoff between profit and loss and execution risk is far from unique. In fact, the risk quantification is based on the full liquidation path and thus is essentially dynamic in nature.

In this paper we rely on the theory of dynamic convex risk measures and formulate the optimal liquidation problem as a stochastic control problem where the riskiness of the optimal strategy is quantified by a $g$-conditional risk measure. In this way, borrowing results from the general theory (see, e.g., [6]), the riskiness of the liquidation strategy is locally described by the driver of a backward stochastic differential equation (BSDE).

The resulting optimal stochastic control problem is more general than a classical one due to the involvement of the $g$-expectation, which introduces additional terms depending nonlinearly on the volatilities of the backward components. Remarkably, we verify that for linear-quadratic expressions of the driver, the solution of this highly nonlinear controlled problem can be reduced to the solution of a system of Riccati ordinary differential equations (ODEs) and solved in closed form.

The solution provides a closed form description of the liquidation policy and of the impact that the risk aversion of the agent has on it. The explicit expression of the optimal control is useful to quantify the impact of the dynamic risk adjustment on the liquidation policy, and it offers new insights showing the potential relevance of forward-backward optimal control strategies in financial applications. Remarkably, the analysis of the resulting control policies highlights the critical interaction between the uncertainty of order fills and the minimization of the dynamic risk functional. This interaction introduces a new characteristic trading time scale, depending both on the microstructural parameters in the model and on the risk aversion, that modifies in a nonlinear way the optimal scheduled liquidation program and the policy used to implement it. As a consequence, the resulting optimal policy differs substantially from those analyzed in the literature that consider only a forward risk adjustment component.

Many of these strategies can be recovered as limiting cases within our general framework. In the [3] framework, by applying the technique of integration by parts, one can show that the optimal strategy for a risk neutral trader is always the VWAP strategy, no matter what the driving noise is so long as it is a martingale. However, for transient impact models such as [24], [1], and [28], many of the optimal strategies are U-shaped, like modified VWAP strategies: block trades at the beginning and the end of the trading period and a VWAP in between. See, for instance, [24, Table 1, p. 20].

On the other hand, for risk averse traders the choice of risk factors for penalization varies case by case for the sake of tractability. In the classical [3] model, the authors use the quadratic variation as the risk factor, whereas [21] use time-average VaR as a risk factor to penalize the P&L or cost of trading in the determination of optimal strategies (in fact, the authors claim that the same rationale applies to the use of general coherent risk measures, see Remark 2.2 on p. 356). More recently, in [10], the asset is assumed to follow a displaced diffusion process and the authors compare the optimal trading strategies with various risk functions such as Value at Risk (VaR) and Expected Shortfall (ES). The authors also propose a new risk function called Squared Asset Expectation (SAE) in order to test the robustness of the optimal trading strategies. In [18], the problem of optimal execution is formalized as an exponential utility maximization program where the trader is subject to a VaR constraint that has to be instantaneously satisfied by the P&L for the entire execution period. That paper in a sense attempts to blend two risk limiting elements (risk averse utility + VaR limit) in one control problem. [14] and [5] discuss a control problem with a linear quadratic objective function close to the one treated in this paper, with the crucial difference that they do not consider the risk adjustment that induces the backward component. Finally, it is worth mentioning that in a discrete time model [23] obtain an optimal trading strategy that is time-consistent and deterministic by minimizing a dynamic coherent risk of the cost of trading.

The latter paper is the one closest in spirit to ours, since it also uses a dynamic risk measure to quantify the riskiness of a strategy. On the other hand, the present analysis relies on continuous time and on a more realistic modeling of the profit and loss function where uncertainty in order fills, i.e., the risk for an order to be filled either incompletely or in excess, is taken into account. In addition, while their treatment relies on a recursive discrete time approach, our formulation involves a continuous time stochastic control problem that is solvable in closed form.

In fact, our approach is grounded on the formulation of general time-consistent dynamic convex risk measures relying on the notion of $g$-expectations developed by Peng (see, e.g., [27]) and his collaborators in the 1990s. See [6] and the references therein for more details on the relationship between dynamic risk measures, $g$-expectations, and $g$-conditional risk measures.

Optimal strategies with uncertain order fills have been introduced by [15] and, more recently, treated in [11] and [29]. In particular, in the [15] framework the magnitude of the uncertain order fills is assumed to be proportional to the order size, while [29] develop a discrete time model showing that the volume uncertainty is independent of the trader’s decisions and that the optimal strategy for a risk-averse investor is a trade-off between early and late trades to balance the risk associated with both price and volume.

Furthermore, our model includes a quadratic running penalty to discourage large deviations of the trading rate from a given target chosen by the investment committee.

The use of a quadratic penalization for deviations of the trading velocity from a pre-specified target is also found in other classes of trading problems, such as hedging via risk minimization with constraints (see [22]) or preventing the agent from trading too quickly with either market or limit orders (see [11]).

The paper is organized as follows. Section 2 sets up the model and provides more detailed discussions and motivations on the problem formulation. Section 3 introduces the general notion of convex risk measures and the way these are used to “risk adjust” the target functional form. Section 4 discusses the stochastic control problem and its solution. In Section 5 we briefly analyze the applicative implications of the result and conclude.

2. The model

To set up the mathematical model, let $[0,T]$, $T>0$, be the fixed finite trading horizon and let $\left(\Omega,\mathcal{F},\mathbb{P}\right)$ be a complete probability space that carries a two-dimensional uncorrelated Brownian motion $\left(B_{1},B_{2}\right):=\{B_{1}(t),B_{2}(t)\}_{t\in[0,T]}$. The information structure is described by the filtration $\mathbb{F}:=\left\{\mathcal{F}_{t}\right\}_{t\in[0,T]}$ generated by the trajectories of the two-dimensional Brownian motion and completed with all the $\mathbb{P}$-null sets of $\mathcal{F}$. We denote by $\mathcal{M}^{2}_{\mathbb{F}}(0,T;\mathbb{R}^{2})$ the set of all two-dimensional real-valued $\mathbb{F}$-predictable processes $\left\{H(t)\right\}_{t\in[0,T]}$ satisfying $\mathbb{E}\left[\int_{0}^{T}\left|H(t)\right|^{2}dt\right]<+\infty$, by $\mathcal{H}^{2}_{\mathbb{F}}(0,T;\mathbb{R})$ the set of all real-valued $\mathbb{F}$-progressively measurable processes $\left\{H(t)\right\}_{t\in[0,T]}$ satisfying $\mathbb{E}\left[\int_{0}^{T}\left|H(t)\right|^{2}dt\right]<+\infty$, by $L^{2}_{\mathcal{A}}(\Omega;\mathbb{R})$ the set of all real-valued $\mathcal{A}$-measurable square integrable random variables, by $L^{\infty}_{\mathcal{A}}(\Omega;\mathbb{R})$ the set of all bounded real-valued $\mathcal{A}$-measurable random variables, where $\mathcal{A}\subseteq\mathcal{F}$ is a sub-$\sigma$-algebra, and by $\mathcal{S}^{n}$ the set of all $n\times n$ symmetric real matrices.

An issue often overlooked in the existing literature on optimal execution is that the pre-scheduled transactions might be under- or over-executed. As discussed in [15], in modern electronic markets order execution is a complex process involving potentially different types of orders, and realized transactions may deviate from scheduled ones. We refer to such a deviation as the uncertainty of order fills. To take this kind of risk into consideration, we add a noise term to the evolution of the investor’s position $X$ and therefore suppose that it satisfies the following stochastic differential equation:

$$\begin{cases} dX(t) = -v(t)\,dt + m\,dB_{1}(t), & t\in[0,T],\\ X(0) = x_{0} > 0, \end{cases}$$

where the $\mathbb{F}$-progressively measurable process $v\colon\Omega\times[0,T]\rightarrow\mathbb{R}$ is the trading rate of market orders, regarded as the control variable, while $m\geq 0$ measures the magnitude of the uncertainty of order fills.

Modeling the uncertainty of order fills by a diffusion driven by a Brownian motion serves as a ‘zeroth order approximation’ to the uncertainty, and the implications seem plausible as long as the size of the uncertainty is small, i.e., as $m\rightarrow 0$. A more accurate model of the uncertainty of order fills would add a pure jump spectrally negative Lévy process. Alternatively, the term $m\,dB_{1}$ can also be interpreted as the retail exposure of the whole financial institution. In other words, the initial amount $x_{0}$ to be liquidated by the head of the desk is subject to small random variations induced by additional liquidation or redemption orders that may arrive from other desks of the institution in the meantime.

The fair price $S$ of the stock follows the dynamics:

$$\begin{cases} dS(t) = \gamma\,dX(t) + \sigma\,dB_{2}(t), & t\in[0,T],\\ S(0) = s_{0} > 0, \end{cases}$$

i.e.,

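$$\begin{cases} dS(t) = -\gamma v(t)\,dt + \gamma m\,dB_{1}(t) + \sigma\,dB_{2}(t), & t\in[0,T],\\ S(0) = s_{0} > 0. \end{cases}$$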

In other words, the fair price $S$ is driven by an Arithmetic Brownian motion with drift equal to zero and volatility $\sigma>0$ , along with a linear permanent impact with parameter $\gamma\geq 0$ . The choice of considering a Bachelier price evolution (see, e.g., [4, 16]) is made to keep the problem tractable. It restricts the applicability of the model to securities-trading in a low volatility environment (quite common in recent years).
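To make the dynamics concrete, the following is a minimal simulation sketch, assuming an Euler-Maruyama discretization of the position $X$ (with $dX=-v\,dt+m\,dB_{1}$) and of the fair price $S$ (with $dS=\gamma\,dX+\sigma\,dB_{2}$). The function name, the trading-rate rule `v`, and all numerical values are hypothetical choices for illustration and do not come from the paper.

```python
import math
import random


def simulate_position_and_price(x0, s0, v, m, gamma, sigma, T, n_steps, seed=0):
    """Euler-Maruyama sketch of dX = -v dt + m dB1 and dS = gamma dX + sigma dB2.

    B1 and B2 are simulated as independent Brownian motions.
    `v(t, x)` is a user-supplied trading-rate rule (an assumption of this sketch).
    """
    rng = random.Random(seed)
    dt = T / n_steps
    sqrt_dt = math.sqrt(dt)
    xs, ss = [x0], [s0]
    for k in range(n_steps):
        t = k * dt
        dB1 = rng.gauss(0.0, sqrt_dt)  # increment of B1 (order-fill noise)
        dB2 = rng.gauss(0.0, sqrt_dt)  # increment of B2 (price noise)
        dX = -v(t, xs[-1]) * dt + m * dB1
        dS = gamma * dX + sigma * dB2  # permanent impact plus Bachelier noise
        xs.append(xs[-1] + dX)
        ss.append(ss[-1] + dS)
    return xs, ss


# Illustrative parameters (hypothetical): liquidate 100 units at a constant rate.
x0, T = 100.0, 1.0
xs, ss = simulate_position_and_price(
    x0=x0, s0=50.0, v=lambda t, x: x0 / T,
    m=2.0, gamma=0.01, sigma=0.5, T=T, n_steps=1000,
)
print(f"terminal position X(T) = {xs[-1]:.3f}, terminal fair price S(T) = {ss[-1]:.3f}")
```

With $m>0$ the terminal position is generally not exactly zero even under a constant-rate schedule, which is the effect that the terminal penalty introduced later is meant to control.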

Following [3], the transacted price $\widetilde{S}$ consists of the fair price and a slippage referred to as temporary impact:

$$\widetilde{S}(t) = S(t) - \eta v(t), \qquad t\in[0,T].$$

That is, the transacted price reflects a temporary impact given by a linear function of the current trading rate of market order $v$ with size $\eta>0$ .

Following [3, Section 2.4, p. 10], the profit and loss (P&L) $\Pi^{0}(t)$ of a trading strategy earned over the time interval $[0,t]$ , $t\leq T$ , is defined as:

$$\Pi^{0}(t) := X(t)\left(S(t)-S(0)\right) + \int_{0}^{t}\left(S(0)-\widetilde{S}(u)\right)dX(u).$$

The first term on the right-hand side captures the change in fair value of the remaining untransacted shares, while the second term measures the transaction costs resulting from selling shares in the presence of a temporary price impact component. More details can be found in Appendix A, Subsection A.1.

Taking into account (2.1)–(2.2), the computation given in Appendix A (Subsection A.2) shows that:

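$$\Pi^{0}(t) = \frac{\gamma}{2}\left(X^{2}(t)-x_{0}^{2}\right) + \frac{\gamma}{2}m^{2}t - \eta\int_{0}^{t}v^{2}(u)\,du + \eta m\int_{0}^{t}v(u)\,dB_{1}(u) + \sigma\int_{0}^{t}X(u)\,dB_{2}(u),$$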

or, equivalently:

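$$\Pi^{0}(t) = \gamma m^{2}t - \int_{0}^{t}\left(\gamma v(u)X(u)+\eta v^{2}(u)\right)du + m\int_{0}^{t}\left(\gamma X(u)+\eta v(u)\right)dB_{1}(u) + \sigma\int_{0}^{t}X(u)\,dB_{2}(u).$$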

Depending on the circumstances, we will use the first or the second expression interchangeably.
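As a sketch of how such expressions arise (the full computation is in the paper's Appendix A, which is not reproduced here), one can apply the Itô product rule to $X(t)(S(t)-S(0))$. The steps below assume that the integrand in the definition of the P&L is $S(0)-\widetilde{S}(u)$ with $\widetilde{S}=S-\eta v$, and that $B_{1}$ and $B_{2}$ are independent, so that $d\langle X,S\rangle(t)=\gamma m^{2}\,dt$.

```latex
\begin{aligned}
d\big[X(t)\,(S(t)-S(0))\big]
  &= (S(t)-S(0))\,dX(t) + X(t)\,dS(t) + \gamma m^{2}\,dt,\\
d\Pi^{0}(t)
  &= d\big[X(t)\,(S(t)-S(0))\big] + \big(S(0)-S(t)+\eta v(t)\big)\,dX(t)\\
  &= X(t)\,dS(t) + \gamma m^{2}\,dt + \eta v(t)\,dX(t)\\
  &= \gamma m^{2}\,dt - \big(\gamma v(t)X(t)+\eta v^{2}(t)\big)\,dt
     + m\big(\gamma X(t)+\eta v(t)\big)\,dB_{1}(t) + \sigma X(t)\,dB_{2}(t),\\
\gamma\int_{0}^{t}X(u)\,dX(u)
  &= \frac{\gamma}{2}\big(X^{2}(t)-x_{0}^{2}\big) - \frac{\gamma}{2}m^{2}t .
\end{aligned}
```

Integrating the fourth line gives the form with the drift $-(\gamma vX+\eta v^{2})$. Writing instead $X\,dS=\gamma X\,dX+\sigma X\,dB_{2}$ in the third line and using the last identity gives the form that starts with $\frac{\gamma}{2}(X^{2}(t)-x_{0}^{2})+\frac{\gamma}{2}m^{2}t$.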

We must ensure that at $t=T$ the initial position is fully liquidated despite the uncertainty of order fills. To this end, we add to the P&L at the terminal time $T$ a penalty term for the final block trade $f\colon\mathbb{R}\rightarrow[0,+\infty)$ so that anything but complete liquidation is undesirable. Following [15], we take into consideration as a penalty term for the final block trade $x\in\mathbb{R}$ a continuous function defined as:

$$f(x) := \beta x^{2}, \qquad \beta>0. \tag{2.4}$$

Observe that any final block trade is discouraged: the penalty is zero if and only if the initial position is fully liquidated, and it is strictly positive if the position is under- or over-liquidated. Notice that this choice of the terminal condition penalizes both negative and positive final positions of the investor. This is consistent with the task to be accomplished by the desk that is in charge of the full portfolio liquidation.

Finally, we introduce a running penalty $h\colon\mathbb{R}\rightarrow[0,+\infty)$ with the following quadratic specification:

$$h(v(t)) := \lambda_{1}\left(v(t)-\overline{v}\right)^{2}, \qquad t\in[0,T], \tag{2.5}$$

where $\lambda_{1}\geq 0$ is the cost to be paid for a unit deviation per unit of time from the desired target $\overline{v}>0$ . This target speed of execution represents the ideal liquidation rate which is exogenously set by the investment committee. Note that the above function penalizes during liquidation any deviation of the selected trading rate from $\overline{v}$ and prevents either overly large trading rates or the placement of buy side orders.

3. Risk adjustment of the target profit and loss function

It is a well-known result of financial economics that in a static single period market model, the preference of a risk averse non-satiated agent can be represented using a concave and increasing functional, a utility function. In fact, it is easy to verify that the degree of concavity of the utility function is proportional to the additional amount of money (risk compensation), on top of the expected payoff, that the agent requires to play a fair lottery. The quantification of this risk compensation is critical to interpret the best risk-return tradeoff achievable in the market and thus sort out the best investment opportunities.

A similar dynamic risk-return tradeoff is faced by investors that are willing to liquidate their portfolios in the market and are subject to the uncertainty induced by price impact and by limited liquidation possibilities due to the microstructural frictions that typically affect real markets. While conventional approaches to optimal liquidation maximize the profit and loss function that accounts for the costs and profits generated by the execution problem, the main innovation of this paper is the formulation and the solution of an optimization program that evaluates the liquidation policy by taking into account also the risk aversion of the investor. In other words, we specify a functional that represents both the cost-benefit tradeoff of a liquidation strategy while penalizing liquidation paths that are particularly risky from the point of view of the investor. A natural way to include this risk contribution in a dynamic functional to be optimized is to rely on the theory of $g$ -conditional risk measures, i.e., dynamic risk measures characterized by the solution to BSDEs associated with a convex driver $g$ .

In Appendix C we briefly review the basic definitions and results on dynamic risk measures that are relevant to our analysis.

We will focus our analysis on the case of the so-called dynamic entropic risk measure defined, for any $\xi(T)\in L_{\mathcal{F}_{T}}^{2}(\Omega;\mathbb{R})$, as follows:

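$$\mathcal{R}(t,\xi(T)) := \frac{1}{\lambda_{2}}\ln\mathbb{E}\left[\exp\left(-\lambda_{2}\xi(T)\right)\mid\mathcal{F}_{t}\right], \qquad t\in[0,T],$$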

where $\lambda_{2}\geq 0$ is the risk aversion coefficient, that is, everything else equal, a change of $\lambda_{2}$ modifies the risk attitude of the investor. The next Proposition establishes a well-known connection between the entropic risk measure and the solution of the one-dimensional BSDE with the following quadratic driver $g$ :

$$g(Z_{1}(t),Z_{2}(t)) := \frac{\lambda_{2}}{2}\left(Z_{1}^{2}(t)+Z_{2}^{2}(t)\right), \qquad t\in[0,T],$$

where $\left(Z_{1},Z_{2}\right)^{\top}:=\{\left(Z_{1}(t),Z_{2}(t)\right)^{\top}\}_{t\in[0,T]}$ is a two-dimensional BSDE control process corresponding to the two-dimensional uncorrelated Brownian motion $\left(B_{1},B_{2}\right)$.

Proposition 1.

Let $\left(Z_{1},Z_{2}\right)^{\top}\in\mathcal{M}_{\mathbb{F}}^{2}(0,T;\mathbb{R}^{2})$ . For any $\xi(T)\in L^{2}_{\mathcal{F}_{T}}(\Omega;\mathbb{R})$ , the entropic risk measure is the unique solution to the following one-dimensional BSDE:

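$$\begin{cases} d\mathcal{R}(t,\xi(T)) = -\dfrac{\lambda_{2}}{2}\left(Z_{1}^{2}(t)+Z_{2}^{2}(t)\right)dt + Z_{1}(t)\,dB_{1}(t) + Z_{2}(t)\,dB_{2}(t), & t\in[0,T],\\ \mathcal{R}(T,\xi(T)) = -\xi(T). \end{cases} \tag{3.1}$$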

Proof.

See Appendix B. ∎
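As a quick numerical illustration of the entropic risk measure at time $0$, the sketch below compares a Monte Carlo estimate of $\frac{1}{\lambda_{2}}\ln\mathbb{E}[\exp(-\lambda_{2}\xi)]$ with the closed form $-\mu+\frac{\lambda_{2}}{2}s^{2}$, which holds when $\xi$ is Gaussian with mean $\mu$ and standard deviation $s$. The Gaussian payoff, the function names, and the parameter values are assumptions made for illustration only and are not taken from the paper.

```python
import math
import random


def entropic_risk_mc(samples, lam):
    """Monte Carlo estimate of (1/lam) * ln E[exp(-lam * xi)] at time 0."""
    # Log-sum-exp trick: factor out the largest exponent for numerical stability.
    exponents = [-lam * x for x in samples]
    top = max(exponents)
    mean_exp = sum(math.exp(e - top) for e in exponents) / len(exponents)
    return (top + math.log(mean_exp)) / lam


def entropic_risk_gaussian(mu, s, lam):
    """Closed form for xi ~ N(mu, s^2): -mu + lam * s^2 / 2."""
    return -mu + 0.5 * lam * s * s


# Hypothetical payoff distribution and risk aversion.
rng = random.Random(42)
mu, s, lam = 1.0, 0.5, 2.0
samples = [rng.gauss(mu, s) for _ in range(200_000)]
mc = entropic_risk_mc(samples, lam)
exact = entropic_risk_gaussian(mu, s, lam)
print(f"Monte Carlo: {mc:.4f}, closed form: {exact:.4f}")
```

The closed form shows the role of the risk aversion coefficient directly: the risk measure equals minus the expected payoff plus a variance charge that grows linearly in $\lambda_{2}$.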

Remark 1.

For any time $t\in[0,T]$ and $\tau\in[t,T]$, $\mathcal{R}$ corresponds to the solution flow to the above BSDE (3.1) and has the following semigroup property (the relation is verified in Appendix B):

$$\mathcal{R}(t,\mathcal{R}(\tau,\xi(T))) = \mathcal{R}(t,\xi(T)) \qquad \mathbb{P}\text{-a.s.}$$

The same type of relationship can be extended to a larger class of dynamic convex risk measures by considering a more general expression for the driver of a BSDE (see Appendix C, Proposition 3).

These results show that the driver of a BSDE is a natural object to describe locally (i.e., over small intervals of time) the dynamic risk-return tradeoff faced by an investor who liquidates her position in the market and measures the risk considering a $g$ -conditional risk measure.

In our framework, an important motivation for using a dynamic risk measure is that the success of the trader’s liquidation policy depends on two countervailing tasks: on the one hand, her ability to complete the liquidation of the portfolio with maximum profit (equivalently, minimum loss) by the final date; on the other hand, the liquidation policy is conditioned on the unfolding of the uncertainty drivers that affect both prices and quantities. At each time, the dynamic risk assessment of the final cost to be paid takes into account both the cost of a final partial liquidation and the evaluation of this potential cost conditional on the information available, adjusted to account for the risk aversion of the trader.

4. The optimal control problem

The optimal execution is formulated and studied as a stochastic optimal control problem. For any initial time $t\in[0,T]$ and initial position $X(t):=x\in\mathbb{R}$, the trader maximizes her expected risk-adjusted total P&L of liquidation within the time horizon $[t,T]$, penalized by the final block trade and by the cumulative deviation from the pre-specified target speed $\overline{v}$.

4.1. The state equation and the objective functional

Let $\mathcal{F}^{t}_{u}$ be the $\sigma$ -algebra generated by $\left\{B_{1}(r)-B_{1}(t),B_{2}(r)-B_{2}(t)\right\}_{r\in[t,u]}$ and $\mathbb{F}^{t}:=\left\{\mathcal{F}^{t}_{u}\right\}_{u\in[t,T]}$ the filtration augmented by all $\mathbb{P}$ -null measure sets of $\mathcal{F}$ . In order to write up the state equation and the objective functional, let $\Pi^{t}(T)$ denote the P&L earned over the time interval $[t,T]$ , i.e.,

$$\Pi^{t}(T) := \Pi^{0}(T) - \Pi^{0}(t).$$

Setting:

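$$\begin{cases} -dY(u) := d\Pi^{u}(T) - d\mathcal{R}(u,f(X(T))) - d\left(\displaystyle\int_{u}^{T}h(v(r))\,dr\right), & u\in[t,T],\\ Y(T) := -f(X(T)) = -\beta X^{2}(T), \end{cases}$$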

where $f$ is the cost function given by (2.4) and $h$ is the running penalty defined in (2.5), the state equation is described by the following decoupled quadratic growth forward-backward stochastic differential equation (qgFBSDE):

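$$\begin{cases} dX(u) = -v(u)\,du + m\,dB_{1}(u), & u\in[t,T],\\ -dY(u) = \widetilde{g}(X(u),\widetilde{Z}_{1}(u),\widetilde{Z}_{2}(u),v(u))\,du - \widetilde{Z}_{1}(u)\,dB_{1}(u) - \widetilde{Z}_{2}(u)\,dB_{2}(u), & u\in[t,T],\\ X(t) = x, \quad Y(T) = -\beta X^{2}(T), \end{cases} \tag{4.1}$$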

with

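$$\widetilde{g}(X(u),\widetilde{Z}_{1}(u),\widetilde{Z}_{2}(u),v(u)) := (\eta+\lambda_{1})v^{2}(u) + \frac{\lambda_{2}}{2}\left(Z_{1}^{2}(u)+Z_{2}^{2}(u)\right) + \left(\gamma X(u)-2\lambda_{1}\overline{v}\right)v(u) - \left(\gamma m^{2}-\lambda_{1}\overline{v}^{2}\right), \qquad u\in[t,T], \tag{4.2}$$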

and

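$$\begin{pmatrix}\widetilde{Z}_{1}\\ \widetilde{Z}_{2}\end{pmatrix}(u) = \begin{pmatrix} Z_{1}(u) + \gamma m X(u) + m\eta v(u)\\ Z_{2}(u) + \sigma X(u)\end{pmatrix}, \qquad u\in[t,T].$$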
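The driver $\widetilde{g}$ and the shifted control process $(\widetilde{Z}_{1},\widetilde{Z}_{2})$ can be recovered by a formal term-by-term computation, sketched below. It assumes that $d\Pi^{u}(T)=-d\Pi^{0}(u)$ (since $\Pi^{u}(T)=\Pi^{0}(T)-\Pi^{0}(u)$), that the running-penalty integral contributes $+h(v(u))\,du$, and that $(Z_{1},Z_{2})$ denotes the control process of the entropic BSDE (3.1) with $\xi(T)=f(X(T))$. This is a reader's sketch consistent with the surrounding formulas, not the authors' own derivation.

```latex
\begin{aligned}
-dY(u) &= -d\Pi^{0}(u) - d\mathcal{R}(u,f(X(T))) + h(v(u))\,du,\\
-d\Pi^{0}(u) &= \big[-\gamma m^{2} + \gamma v(u)X(u) + \eta v^{2}(u)\big]\,du
   - m\big(\gamma X(u)+\eta v(u)\big)\,dB_{1}(u) - \sigma X(u)\,dB_{2}(u),\\
-d\mathcal{R}(u,f(X(T))) &= \tfrac{\lambda_{2}}{2}\big(Z_{1}^{2}(u)+Z_{2}^{2}(u)\big)\,du
   - Z_{1}(u)\,dB_{1}(u) - Z_{2}(u)\,dB_{2}(u),\\
h(v(u)) &= \lambda_{1}v^{2}(u) - 2\lambda_{1}\overline{v}\,v(u) + \lambda_{1}\overline{v}^{2}.
\end{aligned}
```

Collecting the $du$ terms gives $(\eta+\lambda_{1})v^{2}+\frac{\lambda_{2}}{2}(Z_{1}^{2}+Z_{2}^{2})+(\gamma X-2\lambda_{1}\overline{v})v-(\gamma m^{2}-\lambda_{1}\overline{v}^{2})$, while the $dB_{1}$ and $dB_{2}$ coefficients are $-(Z_{1}+\gamma mX+m\eta v)$ and $-(Z_{2}+\sigma X)$, respectively.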

Remark 2.

In this framework, the terminal condition in (4.1) allows us to measure at any time the riskiness of the final penalty.

Here, the set of admissible controls is a space of processes defined as follows:

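$$\mathcal{V}_{\text{ad}}[t,T] := \left\{v\colon[t,T]\times\Omega\rightarrow\mathbb{R} \mid v\in\mathcal{H}^{2}_{\mathbb{F}^{t}}(t,T;\mathbb{R})\right\}.$$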

Observe that for any $v(\cdot)\in\mathcal{V}_{\text{ad}}[t,T]$, the above decoupled qgFBSDE (4.1) admits a unique strong solution. As a matter of fact, existence and uniqueness of the solution to the forward component is a rather standard result (see, e.g., [30, Theorem 6.3, p. 42]), while existence and uniqueness of the solution to the BSDE with quadratic growth driver and unbounded terminal condition in (4.1) is guaranteed by, e.g., [9].

Denoting by $(Y,\widetilde{Z}_{1},\widetilde{Z}_{2})^{\top}(\cdot;t,x,v(\cdot))$ the solution to the backward component of (4.1), when $X(\cdot;t,x,v(\cdot))$ is the solution to the forward part of (4.1) starting from $x\in\mathbb{R}$ at time $t\in[0,T]$ and control $v(\cdot)\in\mathcal{V}_{\text{ad}}[t,T]$ , the objective functional is given by:

$$J(t,x;v(\cdot)) := Y(t;t,x;v(\cdot)),$$

and the trader’s optimal liquidation policy consists in finding for any $(t,x)\in[0,T]\times\mathbb{R}$ the solution to the following problem:

$$\text{maximize } J(t,x;v(\cdot)) \text{ over } v(\cdot)\in\mathcal{V}_{\text{ad}}[t,T],$$

while the associated value function $W$ is thus defined as:

[TABLE]

Notice that the expression of $\mathcal{R}(t,f(X(T)))$ depends on the backward component and makes the full stochastic optimal control problem non-standard.

4.2. The HJB equation

In the context of stochastic optimal control problems with finite horizon, it is well-known that the value function is associated to a second-order partial differential equation (PDE) with terminal boundary condition – the so-called Hamilton-Jacobi-Bellman (HJB) equation – which we aim to derive.

Following, e.g., [26], it is possible to derive the generalized HJB equation associated with the controlled state equation (4.1). It is given by:

[TABLE]

where the Hamiltonian current value $\mathcal{H}_{cv}$ reads as:

[TABLE]

with $\Sigma$ and $b\colon\mathbb{R}\rightarrow\mathbb{R}^{2}$ which represent the volatility matrix and the drift term of the forward diffusion process in (4.1), respectively, i.e.,

[TABLE]

and $\widetilde{g}$ is specified in (4.2).

The construction of the above generalized HJB equation (4.6) follows by repeating the well-known argument used to derive the HJB equation in standard control problems. The additional prescription that the backward control variable $\left(Z_{1},Z_{2}\right)^{\top}$ is represented by $\Sigma^{\top}Dw$ follows from the standard Feynman-Kac representation for FBSDEs. Optimality of the solution to the generalized HJB equation (4.6) for the original control problem will be proved in the verification argument.

The function $\mathcal{H}_{cv}$ has a unique maximum point on $\mathbb{R}$ given by:

[TABLE]

Therefore, the Hamilton-Jacobi-Bellman equation related to the stochastic control (4.5) can be rewritten as:

[TABLE]

with terminal condition:

[TABLE]

4.3. Solution to the HJB equation and the verification theorem

The value function and the optimal trading strategy are presented in the verification theorem below. In order to state and prove the result, we start with the following lemma making use for convenience of the following notation:

[TABLE]

Lemma 1.

Let $\beta>\dfrac{\gamma}{2}$ and $\lambda_{2}<\dfrac{1}{\kappa}$ . Then the deterministic functions $a,b,c\colon[0,T]\rightarrow\mathbb{R}$ uniquely solve the following system of Riccati ODEs:

[TABLE]

i.e.,

[TABLE]

and the following concave function:

[TABLE]

satisfies the HJB equation (4.10)–(4.11) for $x\in\mathbb{R}$ .

Proof.

See Appendix B. ∎

Taking into account (4.9) and Lemma 1, the feedback map coming from the optimization of the Hamiltonian current value $\mathcal{H}_{cv}$ defined in (4.7), when $\operatorname{D}w$ is plugged in place of the formal argument $q$, reads as:

[TABLE]

and applying (4.3) along with Lemma 1 the backward control process in state feedback form is given by:

[TABLE]

Thus, the corresponding closed loop equation:

[TABLE]

with $\widetilde{g}$ given by (4.2), has a unique solution (as previously mentioned, see, e.g., [30, Theorem 6.3, p. 42] for the forward part and [9] for the backward component). Moreover, denoting by $X^{\star}(\cdot):=X^{\star}(\cdot;t,x,v^{\star}(\cdot))$ the solution to the forward process of the above closed loop equation (4.15), the feedback strategy defined as:

[TABLE]

is Lipschitz continuous. Therefore $v^{\star}(\cdot)$ is admissible, that is, $v^{\star}(\cdot)\in\mathcal{V}_{\text{ad}}[t,T]$ .

Now we are ready to prove the following verification theorem.

Theorem 1.

Let $\beta>\dfrac{\gamma}{2}$ , $\lambda_{2}<\dfrac{1}{\kappa}$ , and $w$ be the function defined in (4.14). Then, for any $(t,x)\in[0,T]\times\mathbb{R}$ , the optimization problem (4.5) has a unique solution corresponding to the value function $W(t,x)=w(t,x)$ , i.e.,

[TABLE]

The unique optimal scheduled trading rate $v^{\star}(\cdot)$ in state feedback form is given by:

[TABLE]

and the unique two-dimensional optimal backward control process $(\widetilde{Z}_{1}^{\star},\widetilde{Z}_{2}^{\star})^{\top}$ in feedback form reads as:

[TABLE]

Proof.

See Appendix B. ∎

An immediate consequence of the above result is the following.

Corollary 1.

The unique two-dimensional optimal backward control process $\left(Z_{1}^{\star},Z_{2}^{\star}\right)^{\top}$ in state feedback form related to the dynamic entropic risk measure (3.1) is given by:

[TABLE]

Proof.

See Appendix B. ∎

5. Optimal liquidation strategies

The above solution provides a number of insights on the optimal liquidation policy in relation to the exogenous parameters that define the economic setting. As a general observation, the selection of a dynamic risk measure and the inclusion of a running penalty substantially differentiate the optimal liquidation strategies found in this analysis from those computed with static measures of risk. To better understand these differences, it is useful to restate the optimal control policy in an economically sound way. This is done in the following proposition.

Proposition 2.

The optimal trading policy (4.16) can be specified as follows:

[TABLE]

where:

[TABLE]

the coefficient $a(\cdot)$ is specified in (4.13), that we recall for convenience:

[TABLE]

and the function $\ell\colon[0,T]\rightarrow\mathbb{R}$ reads as:

[TABLE]

Proof.

See Appendix B. ∎

In general, the agent will set a liquidation speed deviating from the target one, $\overline{v}^{\ell}$ , by an amount proportional to the deviation of the position with respect to a scheduled liquidation program which is defined by the function $\ell\left(T-\cdot\right)$ . The mean reversion rate is proportional to $-a(\cdot)$ , i.e., the opposite of $a(\cdot)$ , which is positive under the sufficient assumptions for the value function to be concave. Indeed, the coefficient $a(\cdot)$ may be equivalently rewritten as:

[TABLE]

which is clearly negative, for any $t\in[0,T]$ , when $\beta>\frac{\gamma}{2}$ and $\lambda_{2}<\frac{1}{\kappa}$ .

Recalling that the effective drift term of the optimal position $X^{\star}(\cdot)$ is proportional to $-v(\cdot)$, this implies that the resulting optimal policy enforces mean reversion toward a scheduled liquidation program. To gain intuition on relation (5.1), it is worth considering first its limiting expression as the risk aversion parameter $\lambda_{2}\rightarrow 0$. We obtain:

[TABLE]

i.e., the target liquidation program for a risk neutral agent corresponds to a constant speed liquidation program. The target velocity does correspond to the one set by the investment committee $\overline{v}$ reduced by a factor $\frac{\lambda_{1}}{\lambda_{1}+\eta}$ that takes into account the effect of the transitory price impact on trades. As expected, in the same limit the mean reversion rate increases with the cost of the final block trade $\beta$ and decreases with the size of permanent impact $\gamma$ . Indeed:

[TABLE]

which is positive considering the admissibility condition $\beta>\frac{\gamma}{2}$ . In the limit as $\lambda_{2}\rightarrow 0$ , the tracking error induced by the optimal strategy is determined by the ratio $\frac{\beta-\frac{\gamma}{2}}{\left(\eta+\lambda_{1}\right)}$ .
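The two risk-neutral quantities named in the text can be evaluated directly. The sketch below only encodes the reduction factor $\frac{\lambda_{1}}{\lambda_{1}+\eta}$ applied to $\overline{v}$ and the ratio $\frac{\beta-\frac{\gamma}{2}}{\eta+\lambda_{1}}$; the function name and return layout are hypothetical, and the full limiting expressions of the omitted displays are not reproduced.

```python
def risk_neutral_summary(v_bar, beta, gamma, eta, lam1):
    """Risk-neutral (lambda_2 -> 0) quantities quoted in the text.

    Returns (target_speed, tracking_ratio), where
      target_speed   = v_bar * lam1 / (lam1 + eta)
      tracking_ratio = (beta - gamma / 2) / (eta + lam1)
    """
    if beta <= gamma / 2:
        # Admissibility condition beta > gamma / 2 from Lemma 1.
        raise ValueError("requires beta > gamma / 2")
    if eta + lam1 <= 0:
        raise ValueError("requires eta + lam1 > 0")
    target_speed = v_bar * lam1 / (lam1 + eta)
    tracking_ratio = (beta - gamma / 2) / (eta + lam1)
    return target_speed, tracking_ratio
```

For instance, with all inputs equal to one the committee speed is halved and the ratio equals one quarter, so a larger temporary impact $\eta$ lowers both quantities.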

For levels of risk aversion that satisfy the sufficient condition for concavity $\lambda_{2}<\frac{1}{\kappa}$ , which is equivalent to $\lambda_{2}<\frac{1}{2m^{2}\left(\eta+\lambda_{1}\right)}$ by (4.12), the optimal policy is essentially modified as follows: a positive (negative) deviation from the target liquidation policy $\overline{v}$ implies a corresponding increase (decrease) of the trading speed with coefficient $-a\left(\cdot\right)$ divided by $\left(\eta+\lambda_{1}\right)$ . Note that for finite levels of risk aversion $\lambda_{2}$ , the target liquidation program differs substantially from a linear liquidation program. The policy ceases to be linear and the non-linearity increases with the time to the final liquidation. We can identify two well-defined regimes: the late stage liquidation regime and the early stage liquidation regime.
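The sufficient conditions for concavity can be checked numerically before any further computation. The sketch assumes $\kappa=2m^{2}\left(\eta+\lambda_{1}\right)$, which is inferred from the equivalence stated above rather than read from (4.12) itself; the function names are hypothetical.

```python
def kappa(m, eta, lam1):
    """kappa = 2 m^2 (eta + lam1), as inferred from the stated equivalence."""
    return 2.0 * m**2 * (eta + lam1)

def satisfies_concavity_conditions(beta, gamma, lam2, m, eta, lam1):
    """True when beta > gamma / 2 and lam2 < 1 / kappa.

    The second condition is tested as lam2 * kappa < 1 so that m = 0
    (no fill uncertainty) does not cause a division by zero.
    """
    return beta > gamma / 2 and lam2 * kappa(m, eta, lam1) < 1.0
```

Writing the bound in product form also makes it visible that higher fill volatility $m$ shrinks the range of admissible risk aversion levels.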

5.1. Late stage liquidation regime

In the late liquidation stage corresponding to the limit as $(T-t)\rightarrow 0$ we may assume:

[TABLE]

In this case, for any $t\in[0,T]$ , we obtain:

[TABLE]

This regime corresponds to the one where, setting $\lambda_{1}=\lambda_{2}=0$ , one recovers the Adaptive Value Weighted Average Price (AVWP) strategy found in [15]. The contribution induced by $\lambda_{1}>0$ changes the benchmark liquidation policy, whereas the parameter $\lambda_{2}>0$ in this regime operates simply as a renormalization of the reference strategy, progressively reducing the liquidation velocity far from the final block liquidation date while raising the mean reversion rate.

5.2. Early stage liquidation regime

In the early liquidation stage, corresponding to the limit $(T-t)\rightarrow\infty$, we have:

[TABLE]

This is the regime where the impact of the backward component and of the penalization for deviations of the trading velocity is more evident. Far from the final block liquidation the presence of the risk averse component drives the optimal control policy into a steady policy depending on the size of the permanent impact $\gamma$ , but independent from the cost $\beta$ of the final block order. In this regime, the optimal evolution for the process $X^{\star}(\cdot)$ is given by:

[TABLE]

where:

[TABLE]

i.e.,

[TABLE]

which corresponds to a standard mean reverting Ornstein-Uhlenbeck process with rate of mean reversion $-\frac{a_{\infty}}{2\left(\eta+\lambda_{1}\right)}$ vanishing in the limit as $\lambda_{2}m\gamma\rightarrow 0$ . In other words, if the final block trade is sufficiently far in the future, the investor is simply controlling the quantity of security held with a tracking error that is decreasing with increasing permanent impact $\gamma$ , risk aversion $\lambda_{2}$ and volatility of the position $m$ .
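The Ornstein-Uhlenbeck behaviour of the early stage regime can be illustrated with an exact-in-distribution discretization. In this sketch the rate is computed as $-\frac{a_{\infty}}{2\left(\eta+\lambda_{1}\right)}$ as stated above, while the value of `a_inf`, the long-run `level`, and the use of $m$ as the noise coefficient are assumptions supplied by the caller, since the corresponding displays are not reproduced here.

```python
import numpy as np

def early_stage_rate(a_inf, eta, lam1):
    """Mean reversion rate -a_inf / (2 (eta + lam1)); positive when a_inf < 0."""
    return -a_inf / (2.0 * (eta + lam1))

def simulate_ou(x0, level, rate, m, T, n_steps, seed=0):
    """Sample dX = -rate (X - level) dt + m dB on a uniform grid.

    Uses the exact one-step transition of the OU process, so the grid
    size does not introduce discretization bias. Requires rate > 0.
    """
    if rate <= 0:
        raise ValueError("rate must be positive")
    rng = np.random.default_rng(seed)
    dt = T / n_steps
    decay = np.exp(-rate * dt)
    step_std = m * np.sqrt((1.0 - decay**2) / (2.0 * rate))
    X = np.empty(n_steps + 1)
    X[0] = x0
    for k in range(n_steps):
        X[k + 1] = level + (X[k] - level) * decay + step_std * rng.normal()
    return X
```

The stationary variance of such a process is $m^{2}/(2\,\text{rate})$, which is one way to read the statement that the tracking error shrinks as the rate of mean reversion grows.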

Notice that in this regime the higher the price impact the higher the optimal reaction of the trader to a deviation from the optimal path. In fact, the presence of the uncertainty of order filling generates a cost that is increasing with increasing price impact. In the absence of the backward component, i.e., $\lambda_{2}=0$ , the trader would not be penalized for this term. In other words, it is possible to interpret the penalization component induced by the backward part, as a term that penalizes those strategies inducing high tracking errors in the liquidation strategy relative to the benchmark set by the investment committee.

In summary, it is interesting to remark that under the conditions $\lambda_{1}>0$ and $\lambda_{2}>0$ , the liquidation policy smoothly interpolates between a stationary investment committee policy when $(T-t)\rightarrow+\infty$ , and a liquidation policy with constant rate of liquidation set by the investment committee with a mean reversion rate that increases with increasing risk aversion $\lambda_{2}$ .

In order to better illustrate the impact of the backward component and of the risk aversion parameter $\lambda_{2}$ on the liquidation strategy, we provide a graphical illustration of the target liquidation schedule and of the mean reversion rate for different levels of $\lambda_{2}$ as a function of time.

In Figure 1 we perform a numerical illustration using parameters similar to those used in [15], which are closely related to those discussed in [3]:

[TABLE]

As expected, a higher risk aversion implies that the agent tries to reduce the tracking error by implementing a policy in which mean reversion rises earlier and faster, tightening the policy reaction to deviations from the scheduled liquidation program. Indeed, the mean reversion rate increases with time and with the parameter $\lambda_{2}$.

A critical innovation of the present approach compared to previous liquidation models is that the parameter $\lambda_{2}$ determines directly the characteristic duration of the trade $\frac{\gamma\sqrt{\kappa\lambda_{2}}}{2\left(\eta+\lambda_{1}\right)}$ jointly with the size of the permanent impact $\gamma$ , the volatility of the order fills $m$ , and the quantity $(\eta+\lambda_{1})$ .

While in the conventional framework of [3] the characteristic “trade time” is set by a specific parameter, in this case this characteristic time is a function of a number of parameters, including trader’s subjective risk aversion $\lambda_{2}$ and the volatility coefficient $m$ that quantifies uncertainty of order fills.
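The dependence of the quantity $\frac{\gamma\sqrt{\kappa\lambda_{2}}}{2\left(\eta+\lambda_{1}\right)}$ on the model parameters can be explored with a short helper. The sketch assumes $\kappa=2m^{2}\left(\eta+\lambda_{1}\right)$, inferred from the concavity condition in Section 5. Because this quantity multiplies $(T-t)$ in the proof of Lemma 1, the sketch treats it as a rate and reports its reciprocal as a time scale; that reading, like the function names, is an assumption of this example.

```python
import math

def characteristic_rate(gamma, lam2, m, eta, lam1):
    """gamma * sqrt(kappa * lam2) / (2 (eta + lam1)) with kappa = 2 m^2 (eta + lam1)."""
    kappa = 2.0 * m**2 * (eta + lam1)
    return gamma * math.sqrt(kappa * lam2) / (2.0 * (eta + lam1))

def characteristic_time(gamma, lam2, m, eta, lam1):
    """Reciprocal of the rate; infinite when the rate vanishes."""
    rate = characteristic_rate(gamma, lam2, m, eta, lam1)
    return math.inf if rate == 0.0 else 1.0 / rate
```

Under the assumed form of $\kappa$ the rate simplifies to $\gamma m\sqrt{\lambda_{2}/\left(2\left(\eta+\lambda_{1}\right)\right)}$, so it vanishes when any of $\gamma$, $m$, or $\lambda_{2}$ is zero.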

In Figure 2, to make the target liquidation paths comparable across different levels of $\lambda_{2}$, we set the target $\overline{v}$ to a value that normalizes the asymptotic early stage position to a reference value equal to $1$, corresponding to a $100\%$ notional amount to be liquidated. In addition, we consider a longer liquidation horizon to magnify the relationship between the level of risk aversion and the concavity of the scheduled liquidation program, which converges to a linear liquidation program as the final block trade date is approached. The dynamic nature of the risk adjustment raises the importance of the deviation from the scheduled target, while reducing the relative importance of the cost paid by the trader in the final block trade. This implies an increase in the concavity of the liquidation path, which signals the transition between early and late liquidation regimes.

Note that a joint interpretation of the evidence from Figures 1 and 2 indicates that minimizing the volatility of the backward component induced by the dynamic risk measure determines a policy and a liquidation schedule that tighten progressively and non-linearly as the final liquidation is approached. In light of this consideration and the above results, an interesting question for future research is the economic interpretation of the characteristic “trading time” resulting from the interaction between the uncertainty of order fills, the risk aversion parameter of the dynamic risk measure, and the characteristic half-life of the trade.

Acknowledgement

We wish to thank two anonymous referees and the editors for careful scrutiny and helpful suggestions. We thank Claudio Tebaldi for valuable discussions and for carefully reading the draft of this paper. Holger Kraft, Athena Picarelli and Emanuela Rosazza-Gianin deserve special mention for their useful comments. Finally, we are grateful to all participants in the seminars and conferences where the work was presented. The usual disclaimer applies.

Appendix A Some technical details on the profit and loss function

We provide further technical details on the profit and loss function.

A.1. Decomposition of the P&L functional

It is important to observe that a properly defined P&L must admit a decomposition into two contributions: one can be regarded as a modified self-financing strategy proposed in [12] and the other corresponds to slippage. The formula must recover the classical self-financing condition in the absence of trading frictions (we thank an anonymous referee for pointing this out).

The P&L defined in (2.3) that we recall for convenience:

[TABLE]

can be decomposed in a self-financing strategy contribution and a slippage component. Moreover, it is consistent with the self-financing condition introduced by [12, p. 731]. They generalize the usual self-financing relationships of frictionless markets to make it compatible with markets with frictions, including the presence of the uncertainty in the order fills as defined in our model.

In the following, we show how to decompose the above P&L formula by adding and subtracting the term $\int_{0}^{t}S(u)dX(u)$ on its right-hand side. We obtain:

[TABLE]

that can be decomposed as:

[TABLE]

In fact, integration by parts implies:

[TABLE]

that is the value accrued by trading on $S(u)$ using a self-financing strategy $X(u)$ , for any $u\in[0,t]$ , according to the definition extended by [12] to take into account trading frictions. Correspondingly, the amount:

[TABLE]

can be interpreted as a slippage component since it properly vanishes as soon as the price impact is set to zero, i.e., $\widetilde{S}(u)=S(u)$ , for any $u\in[0,t]$ .
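The integration-by-parts step can be sanity-checked on a discrete grid. The sketch below verifies only the generic product rule $X(t)S(t)-X(0)S(0)=\int X\,dS+\int S\,dX+[X,S]$ using left-point (Itô-type) sums; it does not reproduce the paper's specific decomposition, and the function name is hypothetical.

```python
import numpy as np

def integration_by_parts_terms(X, S):
    """Left-point sums approximating int X dS, int S dX and the covariation [X, S].

    On any grid these three terms add up exactly to X[-1]*S[-1] - X[0]*S[0].
    """
    X = np.asarray(X, dtype=float)
    S = np.asarray(S, dtype=float)
    dX = np.diff(X)
    dS = np.diff(S)
    int_X_dS = np.sum(X[:-1] * dS)   # position held times price change
    int_S_dX = np.sum(S[:-1] * dX)   # price times change in position
    covariation = np.sum(dX * dS)    # cross-variation term
    return int_X_dS, int_S_dX, covariation
```

The covariation term is the part that vanishes for smooth paths and survives when both the position and the price carry Brownian noise, as is the case here because of the uncertain order fills.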

A.2. Computation of the P&L functional

We have:

[TABLE]

Since:

[TABLE]

then:

[TABLE]

Equivalently, taking into account the last equality of (A.1) we obtain:

[TABLE]

Appendix B Technical proofs

Here we provide the technical proofs.

Proof of Proposition 1

It follows straightforwardly from the result shown in [6, Proposition 3.12, p. 123] and the comparison theorem presented in [9, Theorem 5, p. 554]. ∎

Semigroup property of Remark 1

We have:

[TABLE]

Proof of Lemma 1

The claim follows by direct computations and observing that:

[TABLE]

or, equivalently,

[TABLE]

is clearly negative if $\beta>\dfrac{\gamma}{2}$ and $\lambda_{2}<\dfrac{1}{\kappa}$ , since $\frac{\gamma\sqrt{\kappa\lambda_{2}}}{2\left(\eta+\lambda_{1}\right)}(T-t)\geq 0$ , for any $t\in[0,T]$ .

Regarding the solution to the system of Riccati ODEs, we point out that the functions $a(\cdot)$ and $c(\cdot)$ are obtained directly by solving the associated ODEs, while $b(\cdot)$ is recovered from the explicit solution to the ODE derived for the function $\ell\colon[0,T]\rightarrow\mathbb{R}$, which is defined as:

[TABLE]

Indeed, the flow generated by the ODEs for $a(\cdot)$ and $b(\cdot)$ induces a linear ODE for $\ell(T-\cdot)$ that can be solved by variation of constants. Details of the (long) computation are available upon request. ∎

Proof of Theorem 1

We know that the function $w$ given in (4.14) satisfies the generalized HJB equation (4.10)-(4.11) for any $x\in\mathbb{R}$ and want to prove that this solution in this case is the unique value function for the optimal control problem. Let us consider $x\in\mathbb{R}$ and $v\left(\cdot\right)\in\mathcal{V}_{ad}\left[t,T\right]$ with the associated state trajectory $X(\cdot):=X(\cdot;t,x,v)$ . Apply the Dynkin formula to the functions $(t,x)\mapsto\frac{1}{2}\left(a(t)-\gamma\right)x^{2}$ and $(t,x)\mapsto b(t)x$ with the process $X(\cdot)$ , respectively. For any $u\in[t,T]$ , we obtain:

[TABLE]

then:

[TABLE]

and

[TABLE]

i.e.,

[TABLE]

Moreover:

[TABLE]

i.e.,

[TABLE]

Recalling (4.4), the objective functional can be recast as:

[TABLE]

Thus, by substituting (B.1) into the above we have the following:

[TABLE]

from which, applying (B.2), recalling (B.3), and reorganizing all the terms, we obtain:

[TABLE]

For any fixed time $u\in[t,T]$ , let us define the following function:

[TABLE]

which is by definition equal to the integrand on the right-hand side of (B.4) at the given time $u\in[t,T]$. It can be interpreted as the Lagrangian function of the following constrained optimization problem:

[TABLE]

that corresponds to the maximization of the contribution driven by the P&L component of the objective functional for a fixed maximum threshold of risk. Note that the above is a static constrained optimization problem, where the risk aversion parameter $\lambda_{2}$ plays the role of the Lagrangian parameter, $v\in\mathbb{R}$ is the control variable, and the two-dimensional vector $(\widetilde{Z}_{1},\widetilde{Z}_{2})^{\top}\in\mathbb{R}^{2}$ appears as an independent variable.

Let the optimal control variable corresponding to $\lambda_{2}$ be denoted by $v^{\star}(\lambda_{2})$, and let $(Z_{1}^{\star}(\lambda_{2}),Z_{2}^{\star}(\lambda_{2}))^{\top}$ be the corresponding risk, which satisfies the constraint with equality. The first order conditions for the Lagrangian function $L^{u}$ defined in (B.5) read as:

[TABLE]

which imply:

[TABLE]

Since the target function is concave in $v$ , we observe that the above first order conditions are necessary and sufficient to select the (constrained) maximum. Moreover, arguing as in [25, Section 5.2, p. 2972], we show that:

[TABLE]

Hence, the maximization of the target function subject to a maximum risk constraint is equivalent to the unconstrained maximization of $L^{u}(v,\lambda_{2})$ for each given $\lambda_{2}>0$ .

Now, recall that starting from $x\in\mathbb{R}$ at time $t\in[0,T]$ , for each control $v(\cdot)\in\mathcal{V}_{\text{ad}}[t,T]$ , there is a unique choice of the process $(\widetilde{Z}^{\star}_{1},\widetilde{Z}^{\star}_{2})^{\top}(\cdot;t,x;v(\cdot))$ that makes the solution $Y^{\star}(\cdot;t,x;v(\cdot))$ to the backward part of the controlled state equation (4.1) adapted, when $X^{\star}(\cdot):=X^{\star}(\cdot;t,x;v(\cdot))$ is the solution to the forward part of (4.1). By [9, Section 5] we observe that the backward component in (4.1) admits a Feynman-Kac representation; thus, denoting by $w^{v}:[0,T]\times\mathbb{R}\times\mathbb{R}\rightarrow\mathbb{R}$ the solution to a proper semi-linear parabolic PDE, we notice that $w^{v}(t,x):=Y^{\star}(t;t,x,v)$ is a deterministic function, and by the Markov property of the diffusion process we have:

[TABLE]

where $\Sigma$ is defined in (4.8).

Remarkably, at every fixed time $u\in[t,T]$ , it is straightforward to verify that $(\widetilde{Z}_{1}^{\star}(\lambda_{2}),\widetilde{Z}_{2}^{\star}(\lambda_{2}))^{\top}=(\widetilde{Z}_{1}^{v^{\star}},\widetilde{Z}_{2}^{v^{\star}})^{\top}(u)$ , i.e., the first order conditions for the Lagrangian problem are verified by $w^{v^{\star}}(t,x)$ and

[TABLE]

Hence, the determination of the pointwise optimal conditions confirms that $v^{\star}(\cdot)$ defined in (4.16) is an optimal control strategy, and consequently $(\widetilde{Z}_{1}^{\star},\widetilde{Z}_{2}^{\star})^{\top}(\cdot)$ defined in (4.17) is a two-dimensional optimal control process that makes the backward component in (4.1) an adapted process.

The uniqueness of the optimal solution is a direct consequence of the uniqueness of the solution to the closed loop equation (4.15). ∎

Proof of Corollary 1

The result is obtained simply recalling (4.3), and applying the optimal backward process (4.17) and the optimal trading strategy (4.16). ∎

Proof of Proposition 2

The result follows from explicit computation, taking into account the solution to the Riccati ODE for $a(\cdot)$ and the solution to the linear ODE derived for $\ell(T-\cdot)$, as specified in the proof of Lemma 1. ∎

Appendix C Dynamic risk measures

Definition 1.

A dynamic convex risk measure is a family of continuous semimartingales which maps, for any bounded stopping time $T$ , a random variable $\xi(T)\in L_{\widetilde{\mathcal{F}}_{T}}^{2}\left(\Omega;\mathbb{R}\right)$ onto a process $\left\{\mathcal{R}\left(t,\xi(T)\right)\right\}_{t\in\left[0,T\right]}$ and satisfies the following axioms:

Convexity: For any stopping time $S\leq T$, for any $\xi_{1}(T),\xi_{2}(T)$, for any $\alpha\in[0,1]$,

[TABLE]

Decreasing monotonicity: For any stopping time $S\leq T$, for any $\xi_{1}(T),\xi_{2}(T)$ such that $\xi_{1}(T)\geq\xi_{2}(T)$ $\mathbb{P}$-a.s., the operator is decreasing, i.e.,

[TABLE]

Translation invariance: For any stopping time $S\leq T$, for any $\eta(S)\in\mathcal{F}_{S}$, for any $\xi(T)$,

[TABLE]

Semigroup property or time consistency property: For any three bounded stopping times $S\leq T\leq U$, for any $\xi(U)$,

[TABLE]

Arbitrage free: For any stopping time $S\leq T$, for any $\xi_{1}(T),\xi_{2}(T)$ such that $\xi_{1}(T)\leq\xi_{2}(T)$ $\mathbb{P}\text{-a.s.}$,

[TABLE]
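Three of these axioms can be illustrated numerically with a static stand-in. The sketch below uses the standard static entropic risk measure $\rho(\xi)=\frac{1}{\lambda}\log\mathbb{E}\left[e^{-\lambda\xi}\right]$ evaluated on samples; this is an assumption chosen for illustration, not the dynamic measure (3.1), and the sign convention $\rho(\xi+c)=\rho(\xi)-c$ for translation invariance is likewise assumed.

```python
import numpy as np

def entropic_risk(xi, lam=1.0):
    """Sample estimate of (1/lam) * log E[exp(-lam * xi)].

    The maximum exponent is subtracted before exponentiating
    (log-sum-exp trick) to avoid overflow for large losses.
    """
    xi = np.asarray(xi, dtype=float)
    z = -lam * xi
    zmax = z.max()
    return (zmax + np.log(np.mean(np.exp(z - zmax)))) / lam

rng = np.random.default_rng(0)
xi1 = rng.normal(size=1000)
xi2 = xi1 - np.abs(rng.normal(size=1000))   # xi1 >= xi2 sample by sample
alpha = 0.4

# Decreasing monotonicity: the larger payoff carries the smaller risk.
monotone = entropic_risk(xi1) <= entropic_risk(xi2)
# Translation invariance: adding a known amount lowers the risk by that amount.
shift_gap = abs(entropic_risk(xi1 + 0.3) - (entropic_risk(xi1) - 0.3))
# Convexity: diversification does not increase risk.
mixed = entropic_risk(alpha * xi1 + (1 - alpha) * xi2)
convex = mixed <= alpha * entropic_risk(xi1) + (1 - alpha) * entropic_risk(xi2) + 1e-12
```

The time consistency and arbitrage-free axioms involve conditioning on intermediate stopping times and are not covered by this static example.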

In our framework, a generalized result regarding the strict relationship between dynamic convex risk measures and one-dimensional BSDEs is stated by the following proposition.

Proposition 3.

Let $\left(Z_{1},Z_{2}\right)^{\top}:=\{\left(Z_{1},Z_{2}\right)^{\top}(t)\}_{t\in[0,T]}$ be the two-dimensional BSDE control process corresponding to the two-dimensional correlated Brownian motion $\left(B_{1},B_{2}\right)$ . If $g$ is a convex driver of a BSDE depending only on $\left(Z_{1},Z_{2}\right)^{\top}\in\mathcal{H}_{\mathbb{F}}^{2}(0,T;\mathbb{R}^{2})$ then, for $\xi(T)\in L^{2}_{\mathcal{F}_{T}}(\Omega;\mathbb{R})$ , the solution $\mathcal{R}(t,\xi(T))$ to:

[TABLE]

characterizes a dynamic convex risk measure.

Proof.

It follows straightforwardly from the result in [6, Theorem 3.21, p. 125] and the comparison theorem in [9, Theorem 5, p. 554]. ∎

Another simple example of $g$ -conditional risk measure corresponding to a conventional mean-variance description of this risk return tradeoff is given by:

[TABLE]

where $\theta$ can be interpreted as the correlation with the market.

Bibliography

[1] Aurélien Alfonsi and Alexander Schied, Optimal trade execution and absence of price manipulations in limit order book models. SIAM Journal on Financial Mathematics, 1, pp. 490–522, 2010.
[2] Robert Almgren, Optimal execution with nonlinear impact functions and trading-enhanced risk. Applied Mathematical Finance, 10(1), pp. 1–18, 2003.
[3] Robert Almgren and Neil Chriss, Optimal execution of portfolio transactions. Journal of Risk, 3(2), pp. 5–39, 2000.
[4] Louis Bachelier, Théorie de la spéculation. Annales scientifiques de l’École Normale Supérieure, série 3, 17, pp. 21–86, 1900.
[5] Peter Bank and Moritz Voß, Linear quadratic stochastic control problems with stochastic terminal constraint. SIAM Journal on Control and Optimization, 56(2), pp. 672–699, 2018.
[6] Pauline Barrieu and Nicole El Karoui, Pricing, hedging, and optimally designing derivatives via minimization of risk measures. In R. Carmona (Ed.), Indifference pricing: Theory and applications, pp. 77–146, Princeton Series in Financial Engineering, Princeton University Press, 2009.
[7] Dimitris Bertsimas and Andrew W. Lo, Optimal control of execution costs. Journal of Financial Markets, 1(1), pp. 1–50, 1998.
[8] Tomas Björk, Arbitrage Theory in Continuous Time. Second Edition, Oxford University Press, 2009.
