Skewed target range strategy for multiperiod portfolio optimization   using a two-stage least squares Monte Carlo method

Rongju Zhang; Nicolas Langren\'e; Yu Tian; Zili Zhu; Fima Klebaner,; Kais Hamza

arXiv:1704.00416·q-fin.PM·July 11, 2019

Skewed target range strategy for multiperiod portfolio optimization using a two-stage least squares Monte Carlo method

Rongju Zhang, Nicolas Langren\'e, Yu Tian, Zili Zhu, Fima Klebaner,, Kais Hamza

PDF

Open Access

TL;DR

This paper introduces a new multiperiod portfolio optimization strategy called Skewed Target Range Strategy (STRS) that aims to maximize expected returns within a specified range, effectively controlling downside risk and volatility.

Contribution

The paper develops a two-stage least squares Monte Carlo method to improve portfolio optimization with difficult payoffs, demonstrating STRS's advantages over classical methods.

Findings

01

STRS effectively contains portfolio values within targeted ranges.

02

STRS achieves a better downside risk-return trade-off than CRRA utility.

03

Numerical results show substantial improvements over classical LSMC.

Abstract

In this paper, we propose a novel investment strategy for portfolio optimization problems. The proposed strategy maximizes the expected portfolio value bounded within a targeted range, composed of a conservative lower target representing a need for capital protection and a desired upper target representing an investment goal. This strategy favorably shapes the entire probability distribution of returns, as it simultaneously seeks a desired expected return, cuts off downside risk and implicitly caps volatility and higher moments. To illustrate the effectiveness of this investment strategy, we study a multiperiod portfolio optimization problem with transaction costs and develop a two-stage regression approach that improves the classical least squares Monte Carlo (LSMC) algorithm when dealing with difficult payoffs, such as highly concave, abruptly changing or discontinuous functions. Our…

Tables2

Table 1. Table 5.1: Risky assets and return predictors

Assets	Underlying	Data source
U.S. Bonds	AGG (ETF)	Yahoo Finance
U.S. Shares	SPY (ETF)	Yahoo Finance
International Shares	IFA (ETF)	Yahoo Finance
Emerging Market Shares	EEM (ETF)	Yahoo Finance
Japanese shares	NIKKEI225	Yahoo Finance
U.K. shares	FTSE100	Yahoo Finance
Australian shares	ASX200	Yahoo Finance
Gold	Spot Price	World Gold Council
Crude Oil	Spot Price	U.S. Energy Info. Admin.
U.S. Dollar	USD Index	Federal Reserve
Japanese Yen	JPYUSD	Federal Reserve
Euro	USDEUR	Federal Reserve
Australian Dollar	USDAUD	Federal Reserve

Table 2. Table 5.2: Two-stage LSMC v.s. classical LSMC for STRS

		Classical LSMC				Two-Stage LSMC				Two-Stage LSMC + $σ (z, w)$
$L_{W}$	$U_{W}$	${\hat{v}}_{0}$	$𝔼 [W_{T}]$	$SD [W_{T}]$	$ℙ [W_{T} < 1]$	${\hat{v}}_{0}$	$𝔼 [W_{T}]$	$SD [W_{T}]$	$ℙ [W_{T} < 1]$	${\hat{v}}_{0}$	$𝔼 [W_{T}]$	$SD [W_{T}]$	$ℙ [W_{T} < 1]$
1	1.1	0.0058	1.1571	0.1847	0.1244	0.0574	1.0596	0.0272	0.0028	0.0475	1.0499	0.0318	0.0095
1	1.2	0.0292	1.1609	0.1709	0.1077	0.0922	1.0883	0.0405	0.0128	0.0904	1.0867	0.0405	0.0122
1	1.3	0.0608	1.1631	0.1542	0.0832	0.1190	1.1126	0.0588	0.0178	0.1239	1.1164	0.0609	0.0192
1	1.4	0.0918	1.1663	0.1597	0.0656	0.1393	1.1296	0.0832	0.0244	0.1446	1.1351	0.0893	0.0286
1	1.5	0.1199	1.1692	0.1625	0.0503	0.1578	1.1449	0.1078	0.0299	0.1596	1.1491	0.1165	0.0321
1	1.6	0.1455	1.1721	0.1641	0.0454	0.1718	1.1563	0.1264	0.0352	0.1728	1.1596	0.1359	0.0413
1	$\infty$	0.1903	1.1743	0.1652	0.0483	0.1934	1.1684	0.1635	0.0423	0.1938	1.1688	0.1625	0.0446

Equations57

\sup_{\boldsymbol{\alpha}}\mbox{$\mathbb{E}$}\left[f\left(W_{T}\right)\right],

\sup_{\boldsymbol{\alpha}}\mbox{$\mathbb{E}$}\left[f\left(W_{T}\right)\right],

f (w) = (w - L_{W}) \mathbbm 1 {L_{W} \leq w \leq U_{W}},

f (w) = (w - L_{W}) \mathbbm 1 {L_{W} \leq w \leq U_{W}},

v_{t} (z, w)

v_{t} (z, w)

W_{t_{n + 1}}

W_{t_{n + 1}}

v_{t_{N}} (z, w)

v_{t_{N}} (z, w)

v_{t_{n}} (z, w)

CV_{t_{n}}^{j} (z, w)

CV_{t_{n}}^{j} (z, w)

v_{t_{n}} (z, w) = α_{t_{n}} \in A sup E [v_{t_{n + 1}} (Z_{t_{n + 1}}, W_{t_{n + 1}}) ∣ Z_{t_{n}} = z, W_{t_{n}} = w] \approx a_{j} \in A^{d} max CV_{t_{n}}^{j} (z, w) .

v_{t_{n}} (z, w) = α_{t_{n}} \in A sup E [v_{t_{n + 1}} (Z_{t_{n + 1}}, W_{t_{n + 1}}) ∣ Z_{t_{n}} = z, W_{t_{n}} = w] \approx a_{j} \in A^{d} max CV_{t_{n}}^{j} (z, w) .

\hat{W}_{t_{n + 1}}^{m, (n, j)}

\hat{W}_{t_{n + 1}}^{m, (n, j)}

\hat{W}_{t_{n + 2}}^{m, (n, j)}

\hat{W}_{t_{N}}^{m, (n, j)}

{\hat{β}_{k, t_{n}}^{j}}_{1 \leq k \leq K}

{\hat{β}_{k, t_{n}}^{j}}_{1 \leq k \leq K}

\overset{σ}{^}_{t_{n}}^{j}

\hat{W}_{t_{N}}^{(n, j)} = \overset{μ}{^}_{t_{n}}^{j} (z, w) + \overset{σ}{^}_{t_{n}}^{j} ε,

\hat{W}_{t_{N}}^{(n, j)} = \overset{μ}{^}_{t_{n}}^{j} (z, w) + \overset{σ}{^}_{t_{n}}^{j} ε,

\hat{CV}_{t_{n}}^{j} (z, w)

\hat{CV}_{t_{n}}^{j} (z, w)

\hat{α}_{t_{n}} (z, w) = ar g a_{j} \in A^{d} max \hat{CV}_{t_{n}}^{j} (z, w)

\hat{α}_{t_{n}} (z, w) = ar g a_{j} \in A^{d} max \hat{CV}_{t_{n}}^{j} (z, w)

\hat{W}_{t_{N}}^{(n, j)}

\hat{W}_{t_{N}}^{(n, j)}

ε

\overset{μ}{^}_{t_{n}}^{j} (z, w)

\overset{σ}{^}_{t_{n}}^{j} (z, w)

L (η Z_{t_{n}}, \tilde{W}_{t_{n}}, \hat{W}_{t_{N}}^{(n, j)}) = m = 1 \sum M ⎩ ⎨ ⎧ - k = 1 \sum K^{'} η_{k, t_{n}}^{j} ψ_{k} (Z_{t_{n}}^{m}, \tilde{W}_{t_{n}}^{m}) - \frac{( ε ^ ^{m} ) ^{2}}{2} exp - 2 k = 1 \sum K^{'} η_{k, t_{n}}^{j} ψ_{k} (Z_{t_{n}}^{m}, \tilde{W}_{t_{n}}^{m}) ⎭ ⎬ ⎫,

L (η Z_{t_{n}}, \tilde{W}_{t_{n}}, \hat{W}_{t_{N}}^{(n, j)}) = m = 1 \sum M ⎩ ⎨ ⎧ - k = 1 \sum K^{'} η_{k, t_{n}}^{j} ψ_{k} (Z_{t_{n}}^{m}, \tilde{W}_{t_{n}}^{m}) - \frac{( ε ^ ^{m} ) ^{2}}{2} exp - 2 k = 1 \sum K^{'} η_{k, t_{n}}^{j} ψ_{k} (Z_{t_{n}}^{m}, \tilde{W}_{t_{n}}^{m}) ⎭ ⎬ ⎫,

\overset{ε}{^}^{m}

\overset{ε}{^}^{m}

Γ (z) = \int_{0}^{\infty} t^{z - 1} exp (- t) d t

Γ (z) = \int_{0}^{\infty} t^{z - 1} exp (- t) d t

z^{(n)} = \frac{Γ ( z + n )}{Γ ( z )}

z^{(n)} = \frac{Γ ( z + n )}{Γ ( z )}

_{1} F_{1} (a, b, z)

_{1} F_{1} (a, b, z)

Ψ (a, b, z)

Ψ (a, b, z)

\hat{CV}_{t_{n}}^{j} (z, w)

\hat{CV}_{t_{n}}^{j} (z, w)

f (w) = \mathbbm 1 {L_{W} \leq w \leq U_{W}} .

f (w) = \mathbbm 1 {L_{W} \leq w \leq U_{W}} .

v_{t} (z, w)

v_{t} (z, w)

\hat{CV}_{t_{n}}^{j} (z, w)

\hat{CV}_{t_{n}}^{j} (z, w)

f_{B} (w, b) := (w - b) \mathbbm 1 {L_{W} \leq w - b \leq U_{W}},

f_{B} (w, b) := (w - b) \mathbbm 1 {L_{W} \leq w - b \leq U_{W}},

f_{B} (w, b) := \mathbbm 1 {L_{W} \leq w - b \leq U_{W}},

f_{B} (w, b) := \mathbbm 1 {L_{W} \leq w - b \leq U_{W}},

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Monetary Policy and Economic Impact · Stochastic processes and financial applications

Full text

Skewed target range strategy for multiperiod portfolio optimization

using a two-stage least squares Monte Carlo method

Rongju Zhang , Nicolas Langrené22footnotemark: 2, Yu Tian33footnotemark: 3, Zili Zhu22footnotemark: 2, Fima Klebaner33footnotemark: 3 and Kais Hamza33footnotemark: 3 Corresponding author. Email: [email protected]CSIRO Data61, RiskLab AustraliaSchool of Mathematical Sciences, Monash University, Australia

Abstract

In this paper, we propose a novel investment strategy for portfolio optimization problems. The proposed strategy maximizes the expected portfolio value bounded within a targeted range, composed of a conservative lower target representing a need for capital protection and a desired upper target representing an investment goal. This strategy favorably shapes the entire probability distribution of returns, as it simultaneously seeks a desired expected return, cuts off downside risk and implicitly caps volatility and higher moments. To illustrate the effectiveness of this investment strategy, we study a multiperiod portfolio optimization problem with transaction costs and develop a two-stage regression approach that improves the classical least squares Monte Carlo (LSMC) algorithm when dealing with difficult payoffs, such as highly concave, abruptly changing or discontinuous functions. Our numerical results show substantial improvements over the classical LSMC algorithm for both the constant relative risk-aversion (CRRA) utility approach and the proposed skewed target range strategy (STRS). Our numerical results illustrate the ability of the STRS to contain the portfolio value within the targeted range. When compared with the CRRA utility approach, the STRS achieves a similar mean–variance efficient frontier while delivering a better downside risk–return trade-off.

Keywords: target-based portfolio optimization; alternative performance measure; multiperiod portfolio optimization; least squares Monte Carlo; two-stage regression

JEL Classification: G11, D81, C63, C34, MSC Classification: 91G10, 91G80, 91G60

First version: August 19, 2016

This revised version: September 10, 2018

Final version: Journal of Computational Finance, 2019

1 Introduction

A crucial and long-standing problem in the theory and practice of portfolio optimization is the choice of an effective and transparent performance criterion that balances risk and return. In this paper, we propose a novel portfolio optimization criterion that aims to combine to some extent the respective strengths of the classical criteria considered in the literature.

The origin of the literature corresponds to the notion of decision making under uncertainty. From there, von Neumann and Morgenstern (1944) proposed the expected utility approach for which the investment preferences are captured by a utility function. The shortcomings of this approach include the abstract nature of utility functions, which can make them impractical, and its omission of several practical aspects of actual decision making, as identified by Tversky and Kahneman (1992)’s cumulative prospect theory, see for example Barberis (2012).

The mean-variance framework of Markowitz (1952), which uses variance to measure risk, can well approximate the quadratic utility case. When asset returns are assumed to be normally distributed, many other risk measures have been found equivalent to variance (for example, the equivalence to the first and second-order lower partial moments has been proved by Klebaner, Landsman, Makov, and Yao (2017)), but the mean-variance framework greatly benefits from its simple quadratic formulation.

Some may argue that variance is an inadequate measure of portfolio risk as asset returns usually exhibit the so-called leptokurtic property, meaning that higher moments may need to be incorporated into the optimization. We refer to Lai (1991) and Konno, Shirakawa, and Yamazaki (1993) for the skewness component and Davis and Norman (1990) for both skewness and kurtosis. Another approach to address the issue of non-normality of asset returns is to use a downside risk measure. The most common downside risk measures are the lower-partial moments (e.g., semivariance introduced in Markowitz (1959)), Value at Risk (VaR, Longerstaey 1996) and Conditional Value at Risk (CVaR, Rockafellar and Uryasev 2000, a.k.a. expected shortfall). These measures can replace variance to form a mean-downside risk approach, see Harlow (1991) for a mean-lower-partial moment framework, Alexander and Baptista (2002) for the mean-VaR framework and Agarwal and Naik (2004) for the mean-CVaR framework.

The last main strand of literature corresponds to target-based strategies that aim to track a prespecified investment target. A popular target-based strategy is to maximize the probability of achieving a return target, see Browne (1999a) for a fixed absolute target and Browne (1999b), Pham (2003), Gaivoronski, Krylov, and van der Wijst (2005) and Morton, Popova, and Ivilina (2006) for relative benchmark targets. Alternatively, one can minimize the probability of an undesirable outcome, see for example Hata, Nagai, and Sheu (2010), Nagai (2012) and Milevsky, Moore, and Young (2006). Using an explicitly specified investment target in portfolio optimization makes it easier to understand and monitor in practice. However, choosing a suitable investment target that properly balances risk and return remains a challenging task.

Building upon these classical investment criteria, we propose in this paper the so-called Skewed Target Range Strategy (STRS), which maximizes the expected portfolio value bounded within a prespecified target range, composed of a conservative lower target representing a need for capital protection and a desired upper target corresponding to an ideal return level the investor wishes to achieve. Implicitly, the optimization can be described as maximizing the probability that the realized return lies within the targeted range and as close to the upper target as possible.

There are three main motivations behind the proposed STRS. The first motivation traces back to the primary purpose of an investment objective function, which is to carve a desirable shape for the probability distribution of returns. The STRS, seeking a desirable expected return while chopping off most of the tails of the distribution beyond the targeted range, restrains the entire return distribution. The second motivation comes from the difficulty of specifying a single return target for classical target-based strategies, which cannot simultaneously serve the pursuit of a desired investment target and downside protection. The STRS solves this dilemma by using an upper target which accounts for return-seeking preference, combined with a lower target which accounts for loss-aversion preference. Finally, performance criteria such as utility functions depending on abstract parameters with unforeseeable practical effects are unlikely to be adopted by investors. Our proposition of two explicit targets labeled in terms of returns, with intuitive purposes (capital protection for the lower target and desired investment return for the upper target), serves as a more practical investment criterion.

To test the effectiveness of the proposed STRS (formulated in Section 2), we study a multiperiod portfolio optimization problem with proportional transaction costs. To do so, we modify the classical Least Squares Monte Carlo (LSMC) algorithm to use a two-stage regression technique, which makes the problem of approximating the abrupt STRS objective function (equation (2.1)) as easy as approximating a linear function. The LSMC literature and the details of the proposed two-stage LSMC method are further discussed in Section 3. We show that this two-stage LSMC method is numerically more stable than the classical LSMC method for both the smooth constant relative risk aversion (CRRA) utility approach and the abrupt STRS. We find that an appropriate level for the lower target is the initial portfolio value, as it marginally minimizes the standard deviation and the downside risk of the terminal portfolio value. Importantly, we show that the STRS criterion does behave as expected from its design: the portfolio value is well targeted within the specified range, and the downside risk is robust with respect to the choice of the upper target. We numerically show that the STRS achieves a similar mean-variance efficient frontier while delivering a better downside risk-return trade-off when compared to the CRRA utility optimization approach. We also provide two simple extensions of the STRS, described in Section 4. The first extension, dubbed Flat Target Range Strategy (FTRS), corresponds with pure probability maximization of achieving a targeted range, without a further attempt to pursue a higher return. The FTRS is useful for problems where maintaining solvency is more important than seeking high returns, for example for long-term pension schemes, retirement funds and life-cycle management. The second extension, dubbed Relative Target Range Strategy (RTRS), focuses relative returns: it involves a return target range defined in terms of excess return over a stochastic benchmark, such as stock market index, interest rate or inflation rate. All the numerical results are presented in Section 5.

2 Skewed Target Range Strategy

In this section, we define the skewed target range strategy (STRS) for portfolio optimization problems and discuss potential benefits of this strategy. We consider a portfolio optimization problem with $d$ risky assets available over a finite time horizon $T$ . Let $\boldsymbol{\alpha}_{t}=\{\alpha_{t}^{i}\}_{1\leq i\leq d}$ be the portfolio weight in each risky asset at time $t$ , and denote by $W_{t}$ the portfolio value (or wealth). Assume that the investor aims to maximize the expectation of some function of the terminal portfolio value $\mbox{$ \mathbb{E} $}\left[f(W_{T})\right]$ . Then, the objective function simply reads

[TABLE]

where the investment preference is characterized by the function $f\left(\cdot\right)$ . In this paper, we propose the following parametric shape:

[TABLE]

where $L_{{\scriptscriptstyle\!W}}\in\mathbb{R}$ represents a conservative lower target, $U_{{\scriptscriptstyle\!W}}\in\mathbb{R}$ represents a desired upper target, and the indicator function $\mathbbm{1}\{L_{{\scriptscriptstyle\!W}}\leq w\leq U_{{\scriptscriptstyle\!W}}\}$ returns one if $L_{{\scriptscriptstyle\!W}}\leq w\leq U_{{\scriptscriptstyle\!W}}$ and returns zero otherwise. We refer to the shape (2.2) and the corresponding objective (2.1) as the STRS. Throughout this paper, we normalize the portfolio value $W$ and the bounds $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ by the initial portfolio value $W_{0}$ . Indeed, the formula (2.2) shows that $f(w;L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}})=W_{0}\times f(\frac{w}{W_{0}};\frac{L_{{\scriptscriptstyle\!W}}}{W_{0}},\frac{U_{{\scriptscriptstyle\!W}}}{W_{0}})$ , so we can assume without loss of generality that $W_{0}=1$ and set the bounds $L_{{\scriptscriptstyle\!W}}$ and $U_{{\scriptscriptstyle\!W}}$ in the vicinity of $1$ . Figure 2.1 shows an example of equation (2.2) with $L_{{\scriptscriptstyle\!W}}=1.0$ and $U_{{\scriptscriptstyle\!W}}=1.2$ .

From equation (2.2), one can see that the objective is to maximize the expected terminal portfolio value within the interval $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ , while the values outside this interval are penalized down to zero. This strategy implicitly combines two objectives: maximizing the expected terminal portfolio value and maximizing the probability that the terminal portfolio value lies within the chosen target range $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ .

On the left side of the skewed shape in equation (2.2), the function is convex at the lower target $L_{{\scriptscriptstyle\!W}}$ . This is consistent with the notion from Tversky and Kahneman (1992)’s cumulative prospect theory that investors tend to be risk-seeking when losing money. By contrast, on the right side of the skewed shape, the function is discontinuous and jumps down to zero at the upper target $U_{{\scriptscriptstyle\!W}}$ . This is the distinctive feature of the STRS compared to classical utility functions as well as cumulative prospect theory. In particular, the foregoing of the upside potential beyond the upper target $U_{{\scriptscriptstyle\!W}}$ seems to conflict with the non-satiation axiom that people prefer more to less. The following explains the importance of this upper threshold.

Everything else being equal (ceteris paribus assumption), one would expect people to prefer more to less. This axiom in the context of dynamic stochastic portfolio optimization can be interpreted as follows: the downside risk being fixed (the left tail of the return distribution), investors would prefer higher upside potential (a longer right tail of the return distribution). However, *after extensive numerical experiments, we came to the conclusion that *non-decreasing utility functions are unable to decouple upside potential from downside risk. Indeed, pursuing higher upside potential leads to riskier portfolio decisions, which may result in a return distribution with a large right tail (gains) as well as a large left tail (losses). As the ceteris paribus assumption does not apply in this stochastic context, one cannot rule our the existence of a satiation level. Such a level is determined by the investor’s preference with respect to risk and return.

As upside potential and downside risk are naturally intertwined, the proposed upper target is able to curtail downside risk by addressing its main cause - namely the pursuit of excessive upside potential. As a result, the realized returns can be well contained within the targeted range with a high degree of confidence, which in several contexts is more important than allowing for the possibility of rare windfall returns at the cost of higher downside risk.

3 Multiperiod Portfolio Optimization

In this section, we consider a multiperiod portfolio optimization problem and formulate it as a discrete-time dynamic programming problem, for which we develop a two-stage LSMC method to solve it. The LSMC algorithm, originally developed by Carriere (1996), Longstaff and Schwartz (2001) and Tsitsiklis and Van Roy (2001) for pricing American options, has been extended to solve dynamic portfolio optimization problems by several researchers. Brandt, Goyal, Santa-Clara, and Stroud (2005) consider a CRRA utility function and determine a semi-closed-form solution by solving the first order condition of the Taylor series expansion of the value function. Cong and Oosterlee (2016a) and Cong and Oosterlee (2016b) consider a target-based mean-variance objective function and use a suboptimal strategy to perform the forward simulation of control variables which are iteratively updated in the backward recursive programming. Later, Cong and Oosterlee (2017) combine Jain and Oosterlee (2015)’s stochastic bundling technique with Brandt et al. (2005)’s method. Zhang, Langrené, Tian, Zhu, Klebaner, and Hamza (2019) consider a CRRA utility function and adopt Kharroubi, Langrené, and Pham (2014)’s control randomization technique for a portfolio optimization problem with switching costs including transaction costs, liquidity costs and market impact.

The aforementioned works solve problems with a continuous payoff function for which the classical LSMC method can be very effective. By contrast, highly nonlinear, abruptly changing or discontinuous payoffs can be more difficult to handle for the LSMC algorithm (Zhang et al. (2019), Balata and Palczewski (2018), Andreasson and Shevchenko (2018)). The STRS (2.2), with its abrupt drop at the upper bound $U_{{\scriptscriptstyle\!W}}$ , is such a difficult function. In addition, as the terminal wealth outside the targeted range are truncated to zero in the value function, a direct regression on these zeros would forego the original information from the wealth variable. In this section, we propose a two-stage LSMC method to overcome these issues.

3.1 Dynamic programming

Denote by $R^{f}$ the cumulative return of the risk-free asset over one single period. Denote by $\mathbf{R}_{t}=\left\{R_{t}^{i}\right\}_{1\leq i\leq d}$ the excess returns of the risky assets over the risk-free rate and denote by $\mathbf{Z}_{t}$ the vector of return predictors. The optimization problem in equation (2.1) can be formulated as a stochastic control problem with exogenous state variables $\mathbf{Z}_{t}$ and one endogenous state variable $W_{t}$ . Let $\mathcal{A}\subseteq\mathbb{R}^{d}$ be the set of admissible portfolio weights. The value function in equation (2.1) can now be rewritten as

[TABLE]

Consider an equidistant discretization of the investment horizon $[0,T]$ , denoted by $0=t_{0}<\cdots<t_{N}=T$ . The wealth process evolves as

[TABLE]

and the value function satisfies the following dynamic programming principle

[TABLE]

where $f(w)=(w-L_{{\scriptscriptstyle\!W}})\mathbbm{1}\{L_{\!{\scriptscriptstyle W}}\leq w\leq U_{{\scriptscriptstyle\!W}}\}$ .

3.2 Classical least squares Monte Carlo

The first part of the LSMC algorithm is the forward simulation of all the stochastic state variables. Let $M$ denote the number of Monte Carlo simulations. The return predictors $\{\mathbf{Z}_{t_{n}}^{m}\}_{0\leq n\leq N}^{1\leq m\leq M}$ and the asset excess returns $\{\mathbf{R}_{t_{n}}^{m}\}_{0\leq n\leq N}^{1\leq m\leq M}$ are generated through some predetermined return dynamics. By contrast, the wealth process is an endogenous state variable depending on the realization of the portfolio weights. We follow the control randomization approach of Kharroubi et al. (2014): we randomly generate uniform portfolio weights $\{\tilde{\boldsymbol{\alpha}}_{t_{n}}^{m}\}_{0\leq n\leq N}^{1\leq m\leq M}$ , and then compute the corresponding portfolio values $\{\tilde{W}_{t_{n}}^{m}\}_{0\leq n\leq N}^{1\leq m\leq M}$ according to equation (3.2).

The second part of the LSMC algorithm uses a discretization procedure. We discretize the control space as $\mathcal{A}^{\text{d}}=\{\mathbf{a}_{1},...,\mathbf{a}_{J}\}$ . We define the continuation value function $\text{CV}{}_{t_{n}}^{j}$ as the expectation of the subsequent value function conditional on making the decision $\boldsymbol{\alpha}_{t_{n}}=\mathbf{a}_{j}\in\mathcal{A}^{\text{d}}$ , i.e.,

[TABLE]

Therefore, the value function can be approximated by

[TABLE]

To compute this value function, we proceed by backward dynamic programming. At time $t_{N}$ , the value function is equal to $\hat{v}_{t_{N}}(z,w)=(w-L_{{\scriptscriptstyle\!W}})\mathbbm{1}\{L_{{\scriptscriptstyle\!W}}\leq w\leq U_{{\scriptscriptstyle\!W}}\}$ . At time $t_{n}$ , assume that the continuation value functions $\{\hat{\text{CV}}{}_{t_{n^{\prime}}}^{j}(z,w)\}_{n+1\leq n^{\prime}\leq N-1}^{1\leq j\leq J}$ have been estimated. We evaluate the continuation value function at the current time $\text{CV}_{t_{n}}^{j}$ for each decision $\mathbf{a}_{j}\in\mathcal{A}^{\text{d}}$ . We then reset the portfolio weights $\{\boldsymbol{\alpha}_{t_{n}}^{m}\}_{1\leq m\leq M}$ to $\mathbf{a}_{j}$ , and recompute the endogenous wealth from $t_{n}$ to $t_{N}$ :

[TABLE]

where $\hat{W}_{t_{n^{\prime}}}^{m,(n,j)}:=\left.\hat{W}_{t_{n^{\prime}}}^{m}\right|_{W_{t_{n}}^{m}=\tilde{W}_{t_{n}}^{m},\boldsymbol{\alpha}_{t_{n}}=\mathbf{a}_{j}}$ , $n^{\prime}=n,\ldots,N$ is the recomputed wealth from $t_{n}$ to $t_{N}$ , using the portfolio weights $\mathbf{a}_{j}$ at time $t_{n}$ and the estimated optimal portfolio weights at times $t_{n+1},\ldots,t_{N-1}$ .

To approximate the continuation value function $\text{CV}{}_{t_{n}}^{j}(z,w)$ , the classical LSMC algorithm regresses the payoffs $\{f(\hat{W}_{t_{N}}^{m,(n,j)})\}_{1\leq m\leq M}$ on $\{\psi_{k}(\mathbf{Z}_{t_{n}}^{m},\tilde{W}_{t_{n}}^{m})\}_{1\leq m\leq M}^{1\leq k\leq K}$ , where $\{\psi_{k}(z,w)\}_{1\leq k\leq K}$ is the vector of basis functions of the state variables. However, the major difficulty here lies in the abrupt upper bound $U_{{\scriptscriptstyle\!W}}$ , which can cause large numerical errors in the regression according to our numerical exploration.

As $f$ censors the values of $\hat{W}_{t_{N}}^{m,(n,j)}$ outside the targeted range $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ , our regression problem looks similar to a censored regression problem, for which a common estimation approach is maximum likelihood estimation (MLE). However, the main difference between our problem and a censored regression problem is that we have access to both the censored samples $\{f(\hat{W}_{t_{N}}^{m,(n,j)})\}_{1\leq m\leq M}$ and the uncensored samples $\{\hat{W}_{t_{N}}^{m,(n,j)}\}_{1\leq m\leq M}$ . Thus, MLE would ignore the information of the uncensored values $\hat{W}_{t_{N}}^{m,(n,j)}$ which are also observable in this estimation problem. The availability of this extra piece of information motivates us to propose a two-stage regression that takes advantages of this information. We now describe this technique in detail.

3.3 Two-stage least squares Monte Carlo

This two-stage regression works as follows:

Instead of regressing the payoffs $\{f(\hat{W}_{t_{N}}^{m,(n,j)})\}_{1\leq m\leq M}$ , we regress the wealth $\{\hat{W}_{t_{N}}^{m,(n,j)}\}_{1\leq m\leq M}$ on $\{\psi_{k}(\mathbf{Z}_{t_{n}}^{m},\tilde{W}_{t_{n}}^{m})\}_{1\leq m\leq M}^{1\leq k\leq K}$ to obtain

[TABLE]

As a result, the terminal wealth can be modeled as

[TABLE]

where $\varepsilon$ is the regression residual, which for demonstrative purposes we assume Gaussian. (Remark that an assumption for the distribution of the residuals is also required by MLE.) Let $\phi(x)=\frac{1}{\sqrt{2\pi}}\exp(\frac{x^{2}}{2})$ represent the standard normal probability density function, and $\Phi(x)=\int_{-\infty}^{x}\phi(x)dx$ represent the standard normal cumulative distribution function. 2. 2.

Plug equation (3.7) into the continuation value formula (3.4) to obtain a closed-form estimate. By combining equations (3.4), (3.5), (3.6) and (3.7), we obtain the following closed-form estimate of the continuation value function for each $\mathbf{a}_{j}\in\mathcal{A}^{\text{d}}$ at time $t_{n}$ :

[TABLE]

where the last equality is obtained by direct integration. 3. 3.

The mappings $\hat{\boldsymbol{\alpha}}_{t_{n}}:(z,w)\mapsto\hat{\boldsymbol{\alpha}}_{t_{n}}(z,w)$ and $\hat{v}_{t_{n}}:(z,w)\mapsto\hat{v}_{t_{n}}(z,w)$ are estimated by

[TABLE]

In summary, thanks to the censored linear shape of the skewed target range function in equation (2.2), the conditional expectations in the dynamic programming equations (3.3) can be estimated by the closed-form formula (3.8). Due to the linearity of the regressand $\hat{W}_{t_{N}}^{m,(n,j)}$ in equation (3.6), this two-stage regression is much more robust and stable than a direct regression of $f(\hat{W}_{t_{N}}^{m,(n,j)})$ . Subsection 4.1 describes a similar closed-form conditional value for the CRRA utility approach, and Subsection 5.3 illustrates the numerical improvements provided by this two-stage LSMC method.

More generally, the approach proposed here (linear approximation in (3.7) + decensored corrections in (3.8)) can be adapted to the situations where residuals are non-Gaussian: this would simply modify the correction terms in (3.8). There is no restriction on the choice of the residual distribution, nor on the estimation methods (empirical distribution, kernel estimation, mixture normal, etc.). Nevertheless, without loss of generality, it can be reasonable to assume normality of residuals for low-frequency trading such as monthly returns with monthly rebalancing considered in our numerical experiments in Section 5. In addition, the properties of the wealth distribution can be well captured by regressing $\{\hat{W}_{t_{N}}^{m,(n,j)}\}_{1\leq m\leq M}$ on basis functions of $\{\tilde{W}_{t_{n}}^{m}\}_{1\leq m\leq M}$ , yielding regression residuals close to normal. Based on our numerical experiments, the residuals are indeed very close to normal. For these reasons and for demonstration purposes, we henceforth assume normality of residuals and focus on the analysis of the effects of the new investment objective (2.2).

3.4 State-dependent standard deviation

An important assumption made in the previous subsection is that $\hat{\sigma}_{t_{n}}^{j}$ only depends on the portfolio decision $\boldsymbol{a}^{j}$ , but not on the state variables $(\mathbf{Z}_{t_{n}},W_{t_{n}})$ . This subsection describes how to improve the standard deviation estimate to incorporate state variables. Similar to the approximation of $\hat{\mu}_{t_{n}}^{j}(z,w)$ , the state-dependent standard deviation $\hat{\sigma}_{t_{n}}^{j}(z,w)$ can be approximated by the exponential of a linear combination of basis functions of state variables, $\hat{\sigma}_{t_{n}}^{j}(z,w)=\exp(\sum_{k=1}^{K^{\prime}}\hat{\eta}_{k,t_{n}}^{j}\psi_{k}\left(z,w\right))$ . The purpose of the exponential transform is to avoid the possibility of negative standard deviation estimates. Then, the two-stage regression becomes

[TABLE]

Note that a standard least squares regression cannot be used to estimate an unobservable variable such as standard deviation. Instead, we use MLE. We first perform a least squares regression to approximate the mean $\hat{\mu}_{t_{n}}^{j}(z,w)$ , and then approximate the logarithmic standard deviation $\log\hat{\sigma}_{t_{n}}^{j}(z,w)$ by maximizing the following log-likelihood function:

[TABLE]

where

[TABLE]

We use the Broyden–Fletcher–Goldfarb–Shanno algorithm to perform the maximization of this log-likelihood function. In Subsection 5.3, we compare the results obtained with and without state-dependency in the standard deviation estimate.

3.5 Upper target as stop-profit

As discussed in Section 2, the main purpose of the upper target $U_{{\scriptscriptstyle\!W}}$ in the performance measure is to reduce downside risk. However, in multiperiod optimization, a paradox might occur when the realized wealth overshoots the upper target: by default, the portfolio optimizer might tell the fund manager to pick the assets most likely to fall. It is trivial to see that, when $W_{t}\geq U_{{\scriptscriptstyle\!W}}R_{f}^{-(T-t)}$ , one can outperform the upper target for certain by henceforth investing $U_{{\scriptscriptstyle\!W}}R_{f}^{-(T-t)}$ amount of wealth into the risk-free asset and taking out the balance amount $W_{t}-U_{{\scriptscriptstyle\!W}}R_{f}^{-(T-t)}$ from the problem. To implement such a correction, two approaches are possible:

One can replace $T$ by $\min\{T,\tau\}$ in the value function in equation (2.1), where $\tau$ is the first (stopping) time such that $W_{\tau}\geq U_{{\scriptscriptstyle\!W}}R_{f}^{-(T-\tau)}$ . At time $\tau$ (if it occurs before $T$ ), the dynamic optimization stops: the amount $U_{{\scriptscriptstyle\!W}}R_{f}^{-(T-\tau)}$ is invested in the risk-free asset, and the balance amount $W_{\tau}-U_{{\scriptscriptstyle\!W}}R_{f}^{-(T-\tau)}$ is taken out. 2. 2.

Alternatively, one can add an extra dynamic control to the problem: dynamic withdrawal/consumption, see Dang, Forsyth, and Vetzal (2017) for example.

For simplicity, we use the first approach in this paper. Based on our numerical experiments, we find that imposing this stop-profit rule does not significantly affect the terminal wealth distribution, as usually only a very small portion of wealth realizations overshoot the upper bound. For example, we show in the numerical section that about 1% of the realizations overshoot the upper bound for $[L_{\!{\scriptscriptstyle W}}=1.0,U_{{\scriptscriptstyle\!W}}=1.1]$ , and virtually 0% for $[L_{\!{\scriptscriptstyle W}}=1.0,U_{{\scriptscriptstyle\!W}}=1.2]$ .

4 Extensions

This section adapts the two-stage LSMC method to alternative investment objectives. We first describe how to use the two-stage LSMC method to deal with the CRRA utility approach, then we adapt the formulation of the STRS to the Flat Target Range Strategy (FTRS) which purely maximizes the probability of achieving a prespecified target range without further attempts to rally for profits, and to target range strategies based on a stochastic benchmark, for which the absolute fixed target range is replaced by a relative target range.

4.1 CRRA utility

In the classical LSMC approach, a conditional expected utility of the type $\mathbb{E}[\mathcal{U}(W_{T})|\mathbf{Z}_{t_{n}}=z,W_{t_{n}}=w]$ would be approximated by $\beta\cdot\psi(z,w)$ , which may lead to large numerical errors when the utility function $\mathcal{U}$ is highly non-linear, see Van Binsbergen and Brandt (2007), Garlappi and Skoulakis (2009), Denault and Simonato (2017), Zhang et al. (2019) and Andreasson and Shevchenko (2018). The proposed two-stage regression avoids this non-linearity problem and greatly improves the stability of the LSMC method. In this subsection, we derive the two-stage continuation value estimates for the CRRA utility approach. These estimates involve the following special functions:

•

Gamma function:

[TABLE]

•

Rising factorial:

[TABLE]

•

Confluent hypergeometric function of the first kind:

[TABLE]

•

Confluent hypergeometric function of the second kind:

[TABLE]

Assume that the conditional mean of the terminal wealth $\hat{\mu}_{t_{n}}^{j}(z,w)$ and the standard deviation $\hat{\sigma}_{t_{n}}^{j}$ have been estimated according to equations (3.6) and (3.7). Then, using the general formula for the real moments of a Gaussian distribution (Winkelbauer (2014)), the continuation value function in the CRRA utility approach is given by

[TABLE]

We use this closed-form formula for the numerical comparisons in Subsection 5.3.

4.2 Flat target range strategy

The return distribution produced by the STRS (2.2) is skewed towards the upper return target. Yet, there exists some other types of portfolio optimization problems (such as life-cycle and insurance-related investments) for which the ability to remain solvent prevails over the appetite for high expected return. For such problems, one can adjust the skewed target range shape (2.2) to a flat target range shape given by

[TABLE]

Figure 4.1 illustrates the above equation (4.2) with $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]=[1.0,1.2]$ .

Then the portfolio optimization problem becomes

[TABLE]

which is a pure probability maximizing strategy.

The conservative FTRS can be deemed more flexible than the classical Value-at-Risk (VaR) minimization approach: when $U_{{\scriptscriptstyle\!W}}=+\infty$ , the FTRS (4.3) and VaR minimization achieve comparable investment outcomes, the difference being a fixed, absolute cut-off level for the former and an implicit, relative cut-off level for the latter. In particular, the FTRS minimizes the probability of being below a particular loss level, while the VaR procedure minimizes a particular loss quantile. When $U_{{\scriptscriptstyle\!W}}$ is finite, the FTRS provides greater flexibility for investors to devise their risk preferences, as the lower return target $L_{{\scriptscriptstyle\!W}}$ in such circumstances is an explicit input from the investor, and the option to fix an upper target $U_{{\scriptscriptstyle\!W}}$ broadens the range of possible risk profiles.

Assuming that the conditional mean of the terminal wealth $\hat{\mu}_{t_{n}}^{j}(z,w)$ and the standard deviation $\hat{\sigma}_{t_{n}}^{j}$ have been estimated according to equations (3.6) and (3.7), the continuation value function is simply given by

[TABLE]

4.3 Target range over a stochastic benchmark

It is also possible to define the return thresholds $L_{{\scriptscriptstyle\!W}}$ and $U_{{\scriptscriptstyle\!W}}$ relatively to a stochastic benchmark, be it stock market index, inflation rate, exchange rate or interest rate. We refer to Franks (1992), Browne (1999a), Brogan and Stidham Jr. (2005) and Gaivoronski et al. (2005) for classical investment strategies that aim to outperform a stochastic benchmark.

Denote by $B$ the stochastic benchmark of interest, and define the relative excess wealth as $W-B$ . We can then modify the target range function as:

[TABLE]

for STRS, and

[TABLE]

for FTRS.

The stochastic benchmark $B$ can be simply modeled as one additional exogenous state variable, so that this new problem can be solved using the same approach developed in Section 3.

5 Numerical Experiments

In this section, we test the skewed target range strategy (STRS), and illustrate how it can achieve the investor’s range objective. Table 5.1 summarizes the asset classes and the exogenous state variables used for our numerical experiments. We consider a portfolio invested in five assets: risk-free cash, U.S. bonds (AGG), U.S. shares (SPY), international shares (IFA) and emerging market shares (EEM), the other assets listed in Table 5.1 being used as return predictors.

The annual interest rate on the cash component is set to be $2\%$ . We assume $0.1\%$ proportional transaction costs and we refer to Zhang et al. (2019) on how to deal with switching costs in the LSMC algorithm with endogenous variables. A first-order vector autoregression model is calibrated to the monthly log-returns of the assets listed in Table 5.1 from September 2003 to March 2016. By bootstrapping the residuals, 10,000 simulation paths are generated for one year with monthly time steps. The two-stage regression method approximates a linear wealth $W_{T}$ , but not a concave utility $\mathcal{U}(W_{T})$ ; as a result, a sample of 10,000 paths can be deemed sufficient to reach numerical stability, as reported in Van Binsbergen and Brandt (2007) and Zhang et al. (2019). For the same reason, we use a simple second-order multivariate polynomial as the basis functions for the linear least squares regressions in the algorithm. For simplicity, all the reported distributions are simulated in-sample, which might in theory make the estimation upward-biased. In the numerical experiments, we use a mesh of 0.2 increment for the discrete control grid and we do not allow short-selling and borrowing. Apart from Subsection 5.3 where a state-dependent standard deviation is tested, the state-independent standard deviation is used for all the other numerical experiments. The program is coded in Python 3.4.3, and it takes approximately two hours on a 2.2 GHz Intel Core i7 CPU to complete the computation for $M=10,000$ paths, 12 time steps, 13 state variables, a second-order polynomial basis, and a control mesh of 0.2 for a five-dimensional portfolio.

5.1 Wealth distribution

Figure 5.1 provides some examples of estimated distribution of terminal portfolio value when using the STRS. We recall that the portfolio value $W$ and the bounds $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ are scaled by the initial wealth, so that without loss of generality we assume $W_{0}=1.00$ . The lower target $L_{{\scriptscriptstyle\!W}}$ is set to the initial wealth level $1.00$ , a natural choice representing the preference of investors for capital protection. Four different upper targets $U_{{\scriptscriptstyle\!W}}$ are tested: $1.05$ , $1.10$ , $1.20$ and $1.30$ .

Several comments can be made about the shape of the terminal wealth distribution produced by the STRS in Figure 5.1. The most striking observation is that the STRS does confine most of the wealth realizations within the predefined target range, and for low upper target levels $U_{{\scriptscriptstyle\!W}}=1.05$ and $U_{{\scriptscriptstyle\!W}}=1.10$ , the wealth distributions mimics to some extent the shape of the skewed target range function (2.2), making downside risk negligible. This suggests the two-stage LSMC algorithm is indeed capable of handling an abrupt discontinuous payoff function properly. There are some wealth realizations lying above the upper bound, which, in spite of the first correction described in Subsection 3.5, may occur due to the discrete-time nature of monthly rebalancing (a large upward jump can occur during one single month, after which the risky investment is immediately stopped as described in Subsection 3.5).

As expected, setting the upper target $U_{{\scriptscriptstyle\!W}}$ to a higher level produces a higher expected terminal wealth with higher standard deviation and greater downside risk (as measured by the probability of losing capital). At the same time, the higher the upper target $U_{{\scriptscriptstyle\!W}}$ , the harder it is for the terminal wealth distribution to be skewed towards the upper target. Regarding the tails beyond the targeted range, the two low upper target levels $U_{{\scriptscriptstyle\!W}}=1.05$ and $U_{{\scriptscriptstyle\!W}}=1.10$ produce larger right tails, while the two higher levels $U_{{\scriptscriptstyle\!W}}=1.20$ and $U_{{\scriptscriptstyle\!W}}=1.30$ produce larger left tails, which is consistent with the fact that the greater $U_{{\scriptscriptstyle\!W}}$ , the higher the risk that the investor is willing to take to achieve a higher return. This illustrates the capability of the STRS to cater to different risk appetites.

An interesting quantity to monitor is the ratio $\mathcal{R}:=(\mathbb{E}[W_{T}]-L_{{\scriptscriptstyle\!W}})/(U_{{\scriptscriptstyle\!W}}-L_{{\scriptscriptstyle\!W}})$ which measures the location of the expected performance $\mathbb{E}[W_{T}]$ relative to the targeted range: $\mathcal{R}=0\%$ means $\mathbb{E}[W_{T}]=L_{{\scriptscriptstyle\!W}}$ , while at the opposite $\mathcal{R}=100\%$ means $\mathbb{E}[W_{T}]=U_{{\scriptscriptstyle\!W}}$ . In our experiments from Figure 5.1, $\mathcal{R}$ is a decreasing function of $U_{{\scriptscriptstyle\!W}}$ , from $\mathcal{R}=72\%$ for $U_{{\scriptscriptstyle\!W}}=1.05$ down to $\mathcal{R}=38\%$ for $U_{{\scriptscriptstyle\!W}}=1.30$ . This illustrates the natural fact that the higher the desired upper target, the harder it is to achieve it. One visible drawback of the proposed strategy is the relatively long left tail when both the upper and lower targets are set to relatively high levels, for example, $L_{{\scriptscriptstyle\!W}}\geq 1.00$ and $U_{{\scriptscriptstyle\!W}}\geq 1.20$ .

Figure 5.2 shows the time evolution of the wealth distribution (0.05 percentile to 99.95 percentile) over the whole investment horizon, for the STRS with $[L_{{\scriptscriptstyle\!W}}=1.0,U_{\negmedspace{\scriptscriptstyle W}}=1.1]$ (top-left panel), $[L_{{\scriptscriptstyle\!W}}=1.0,U_{\negmedspace{\scriptscriptstyle W}}=1.2]$ (top-right panel), $[L_{{\scriptscriptstyle\!W}}=1.0,U_{\negmedspace{\scriptscriptstyle W}}=\infty]$ (bottom-left panel) and $[L_{{\scriptscriptstyle\!W}}=0,U_{\negmedspace{\scriptscriptstyle W}}=\infty]$ (bottom-right panel), where the last strategy is equivalent to maximizing the expected terminal wealth without taking risk into account. The results show that the wealth distributions in the top panel are well tightened within the prespecified target ranges over the whole investment process, as opposed to the case $U_{\negmedspace{\scriptscriptstyle W}}=\infty$ in the bottom panel. Once again, as upside potential and downside risk are naturally intertwined, one cannot protect against downside risk very well when the upper target is set to a very high level, as shown by the $[L_{{\scriptscriptstyle\!W}}=1.0,U_{\negmedspace{\scriptscriptstyle W}}=\infty]$ example (bottom-left panel).

5.2 Sensitivity analysis and choice of $L_{{\scriptscriptstyle\!W}}$

The next experiment is a sensitivity analysis of the expected terminal wealth, standard deviation and downside risk with respect to the bounds of the STRS. Figure 5.3 shows how the expected terminal wealth ( $\mathbb{E}[W_{T}]$ , first row), the standard deviation of the terminal wealth ( $\text{SD}[W_{T}]$ , second row) and the downside risk ( $\mathbb{P}[W_{T}<1]$ , third row) are affected by changes in the upper bound $U_{\!{\scriptscriptstyle W}}$ (left column) and by changes in the lower bound $L_{\!{\scriptscriptstyle W}}$ (right column).

The left column of Figure 5.3 shows how the expectation $\mathbb{E}[W_{T}]$ , standard deviation $\text{SD}[W_{T}]$ and downside risk $\mathbb{P}[W_{T}<1]$ increase with $U_{{\scriptscriptstyle\!W}}$ , though a plateau is reached around $U_{\!{\scriptscriptstyle W}}=1.5$ for $\mathbb{P}[W_{T}<1]$ and around $U_{\!{\scriptscriptstyle W}}=1.8$ for $\mathbb{E}[W_{T}]$ .

On the right column, one can see that the standard deviation $\text{SD}[W_{T}]$ and downside risk $\mathbb{P}[W_{T}<1]$ both increase when $L_{{\scriptscriptstyle\!W}}$ moves away from the initial wealth $W_{0}=1.0$ . When $L_{{\scriptscriptstyle\!W}}>1.0$ , both risk measures increase with $|L_{{\scriptscriptstyle\!W}}-W_{0}|$ due to the additional risk required at the beginning of the trading period to force the portfolio value to grow from $W_{0}=1.0$ to the lower target $L_{{\scriptscriptstyle\!W}}>W_{0}=1.0$ . When $L_{{\scriptscriptstyle\!W}}<1.0$ , both risk measures also increase with $|W_{0}-L_{{\scriptscriptstyle\!W}}|$ due to the lack of immediate loss penalization. Nevertheless, the net effect of $L_{{\scriptscriptstyle\!W}}$ on $\mathbb{E}[W_{T}]$ is mostly negligible. As a result, these observations suggest that $L_{{\scriptscriptstyle\!W}}=W_{0}=1.0$ is an appropriate choice for the lower bound of the targeted interval, from which the upper bound $U_{\!{\scriptscriptstyle W}}$ can be set according to the risk preference and the return requirement of the investor.

5.3 Model validation

The following experiment aims at validating the two-stage LSMC method via a comparison to the classical LSMC method. We first study a CRRA utility optimization example. It has been noted that a simulation-and-regression approach can generate large numerical errors when the utility function is highly nonlinear (high risk aversion), see for example Van Binsbergen and Brandt (2007), Garlappi and Skoulakis (2009) and Denault and Simonato (2017). We apply the two-stage LSMC method and the classical LSMC method to CRRA utility optimization, and then compare the resulting initial value function estimates $\hat{v}_{0}=\frac{1}{M}\sum_{m=1}^{M}(\hat{W}_{t_{N}})^{1-\gamma}/(1-\gamma)$ for a one-year time horizon with monthly rebalancing. Following Zhang et al. (2019), we choose $M=10,000$ sample paths to ensure numerical stability of the solution. For the classical LSMC method, we include the utility function itself as part of the regression basis, so that the regression basis can be adjusted to some extent to the risk-aversion parameter. Figure 5.4 shows that the classical LSMC method becomes unstable when the value of $\gamma$ is high, while the two-stage LSMC method converges quite well. In our experiment, the two-stage LSMC method can approximate the CRRA utility optimization approach well up to $\gamma=100$ .

We then compare our two-stage LSMC to the classical LSMC for solving the STRS. To check the possibility of heteroskedastic residuals, we calibrate a state-dependent standard deviation $\sigma\left(z,w\right)$ as described in Subsection 3.4 and compare it with the original two-stage LSMC method in which the standard deviation only depends on the portfolio decision. In particular, we use a simple linear basis to approximate the logarithmic standard deviation. Figure 5.2 shows that the two-stage LSMC method substantially improves the estimates $\hat{v}_{0}$ and the return distributions, compared to the classical LSMC approach, while using a state-dependent standard deviation does not significantly improve the results, suggesting that the assumption of homoskedastic residuals is reasonable.

5.4 STRS and CRRA

We now compare the STRS to the CRRA utility optimization approach. Our main finding regarding this comparison is that for each risk aversion level $\gamma$ of the CRRA utility approach, one can find a target range $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ such that the STRS delivers a similar expectation, but with a lower standard deviation and a lower downside risk. As an illustration, Figure 5.5 shows how the STRS with $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]=[0.93,1.53]$ outperforms the CRRA utility approach with $\gamma=10$ . Despite the better statistical moments of the STRS, the shorter right tail of the STRS compared to the CRRA utility approach can be deemed a shortcoming of our approach, though giving up some upside potential is the reason for the improved downside risk protection compared to the CRRA utility approach.

To provide a more comprehensive comparison, we now report two risk-return trade-offs: the mean-variance efficient frontier and the trade-off between return and downside risk. Figure 5.6 displays the efficient frontiers of the STRS (for different combinations of $L_{{\scriptscriptstyle\!W}}$ and $U_{{\scriptscriptstyle\!W}}$ ) and the CRRA utility approach (for different $\gamma$ levels) for a three-month investment horizon. The results show that the STRS and the CRRA utility approach trace out a similar mean-variance efficient frontier, while the STRS delivers a better downside risk-return trade-off. Remark that the STRS and the CRRA utility approach produce similar results when the risk-aversion parameter is either very small (risk-neutral) or very high, while the STRS is preferable for intermediate risk-aversion levels.

A theoretical proof of the higher efficiency of the STRS over the classical utility strategies would be desirable to corroborate our numerical findings. However, given for example the difficulty in deriving an explicit optimal allocation for a single trading period with a simpler downside risk minimization objective (Klebaner et al. 2017), a theoretical proof of the higher efficiency of the STRS over the classical utility strategies might be out of reach. We thus leave this question for further research.

5.5 Extensions

This subsection discusses the wealth distributions produced by the modified target range strategies described in Section 4. Figure 5.7 provides examples for the flat target range strategy (FTRS) with $L_{\!{\scriptscriptstyle W}}=1.0$ and $U_{\!{\scriptscriptstyle W}}=1.05$ , $1.10$ , $1.20$ and $+\infty$ . The main observation is that, as expected, the probability of the terminal wealth lying outside the predefined range $[L_{{\scriptscriptstyle\!W}},U_{{\scriptscriptstyle\!W}}]$ is smaller than for the STRS (refer to Figure 5.1 for comparison). This is the main strength of the FTRS: downside risk is kept to a minimum, while the price to pay for this safety is the inability to generate high returns. Finally, the wealth distribution is less sensitive to the choice of $U_{\!{\scriptscriptstyle W}}$ : the distribution is tight even when $U_{\!{\scriptscriptstyle W}}=\infty$ , given the absence of incentive to chase high returns.

In theory, if one wants to maximize the probability that the terminal wealth lies within the targeted range with the lower bound $L_{\!{\scriptscriptstyle W}}=1.0$ and a large enough upper bound $U_{\!{\scriptscriptstyle W}}$ , the optimal decision should be to allocate all the capital to the risk-free asset. Numerically though, it is difficult to guarantee a full allocation in the risk-free asset at all times and for all paths. Intuitively, the reason for this is the following: for the portfolios allocated mostly to the risk-free asset, most, if not all, of the terminal wealth realizations will lie within the targeted range, which makes the value function flat and almost invariant among these convervative portfolio allocations.

Figure 5.8 provides some examples for the relative target range strategy (RTRS) with a passive equal-weight portfolio as benchmark. The probability that the portfolio value underperforms the benchmark portfolio remains small (around $6\%-8\%$ for the excess return distributions), though higher than those provided by absolute targets. The reason for this is that the passive equal-weight benchmark already delivers a high expected return, therefore outperforming it requires taking more risk than what was necessary in the previous absolute return target examples.

6 Conclusions

This paper introduces the skewed target range strategy (STRS) for portfolio optimization problems. The STRS maximizes the expected portfolio value while simultaneously restraining the bulk of the return distribution within a predefined range. This joint goal is achieved with an unconstrained optimization formulation, which achieves, in a simpler manner, similar results to those that can be expected from more complex constrained optimization methods. To illustrate the effectiveness of the STRS, we study a multiperiod portfolio optimization problem and propose a two-stage least squares Monte Carlo (LSMC) method to handle the new objective function. The two-stage regression method can also be adopted for general investment objectives such as the smooth constant relative risk aversion (CRRA) utility. We show that this regression method substantially improves the numerical stability of the LSMC algorithm compared to direct regression. We show that the STRS achieves a similar mean-variance efficient frontier while delivering a better downside risk-return trade-off, compared to the CRRA utility approach. We find that the recommended level for the lower bound of the target range is the initial portfolio value, at which the standard deviation and the downside risk of the terminal portfolio value are marginally minimized. From there, the upper bound of the target range can be set based on risk preferences.

Going further, the unconstrained optimization formulation used by the STRS, built upon an indicator function, has the potential to incorporate additional range constraints on other dynamic risk measures such as realized volatility or maximum drawdown. This is an area we wish to investigate in future research.

Acknowledgments

The authors are grateful to Dr. Wen Chen and the two anonymous referees for their valuable comments and remarks.

Bibliography41

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Agarwal and Naik (2004) Agarwal, V. and N. Y. Naik (2004). Risks and portfolio decisions involving hedge funds. Review of Financial Studies 17 (1), 63–98.
2Alexander and Baptista (2002) Alexander, G. J. and A. M. Baptista (2002). Economic implications of using a mean-Va R model for portfolio selection: A comparison with mean-variance analysis. Journal of Economic Dynamics and Control 26 (7-8), 1159–1193.
3Andreasson and Shevchenko (2018) Andreasson, J. and P. Shevchenko (2018). Bias-corrected least-squares Monte Carlo for utility based optimal stochastic control problems. SSRN:2985828.
4Balata and Palczewski (2018) Balata, A. and J. Palczewski (2018). Regress-Later Monte Carlo for optimal control of Markov processes.
5Barberis (2012) Barberis, N. (2012). A model of casino gambling. Management Science 58 (1), 35–51.
6Brandt et al. (2005) Brandt, M., A. Goyal, P. Santa-Clara, and J. Stroud (2005). A simulation approach to dynamic portfolio choice with an application to learning about return predictability. Review of Financial Studies 18 , 831–873.
7Brogan and Stidham Jr. (2005) Brogan, A. J. and S. Stidham Jr. (2005). A note on separation in mean-lower-partial-moment portfolio optimization with fixed and moving targets. IIE Transactions 37 (10), 901–906.
8Browne (1999 a) Browne, S. (1999 a). Beating a moving target: Optimal portfolio strategies for outperforming a stochastic benchmark. Finance and Stochastics 3 (3), 275–294.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Skewed target range strategy for multiperiod portfolio optimization

Abstract

1 Introduction

2 Skewed Target Range Strategy

3 Multiperiod Portfolio Optimization

3.1 Dynamic programming

3.2 Classical least squares Monte Carlo

3.3 Two-stage least squares Monte Carlo

3.4 State-dependent standard deviation

3.5 Upper target as stop-profit

4 Extensions

4.1 CRRA utility

4.2 Flat target range strategy

4.3 Target range over a stochastic benchmark

5 Numerical Experiments

5.1 Wealth distribution

5.2 Sensitivity analysis and choice of LWL_{{\scriptscriptstyle\!W}}LW​

5.3 Model validation

5.4 STRS and CRRA

5.5 Extensions

6 Conclusions

Acknowledgments

5.2 Sensitivity analysis and choice of $L_{{\scriptscriptstyle\!W}}$