Active and Passive Portfolio Management with Latent Factors

Ali Al-Aradi; Sebastian Jaimungal

arXiv:1903.06928·q-fin.PM·March 19, 2019

Active and Passive Portfolio Management with Latent Factors

Ali Al-Aradi, Sebastian Jaimungal

PDF

TL;DR

This paper develops a convex analysis-based method for optimal portfolio selection that combines active and passive strategies in a latent factor market model, incorporating filtering for hidden states and demonstrating improved performance through backtests.

Contribution

It introduces a closed-form solution for portfolio optimization with latent Markovian factors, including a filtering approach for unobservable states, and validates it with historical backtests.

Findings

01

Optimal portfolio strategy is the posterior average of state-specific strategies.

02

The solution is unique and derived in closed form.

03

Backtests show improved portfolio performance.

Abstract

We address a portfolio selection problem that combines active (outperformance) and passive (tracking) objectives using techniques from convex analysis. We assume a general semimartingale market model where the assets' growth rate processes are driven by a latent factor. Using techniques from convex analysis we obtain a closed-form solution for the optimal portfolio and provide a theorem establishing its uniqueness. The motivation for incorporating latent factors is to achieve improved growth rate estimation, an otherwise notoriously difficult task. To this end, we focus on a model where growth rates are driven by an unobservable Markov chain. The solution in this case requires a filtering step to obtain posterior probabilities for the state of the Markov chain from asset price information, which are subsequently used to find the optimal allocation. We show the optimal strategy is the…

Figures8

Click any figure to enlarge with its caption.

Tables2

Table 1. Table 1: Summary of generalizations achieved using the convex analysis approach over the dynamic programming approach.


	Dynamic Programming	Convex Analysis

Growth Rates
	Bounded, deterministic,
differentiable growth rates	Stochastic, unobservable growth rates (possibly rank-dependent)

Noise component
	Deterministic, differentiable
volatility with Brownian noise	$𝕃^{2}$ (or even $𝕃^{1}$ ) martingale noise
(possibly with stochastic volatility)

Benchmarks
	Benchmarks are Markovian in $𝑿$ ,
differentiable maps	Benchmarks are $𝔉$ -adapted

Penalty matrices
	Deterministic penalty
weighting matrices	Stochastic penalty
weighting matrices

Preference parameters
	Constant subjective
preference parameters	Stochastic subjective preference parameters (e.g. wealth-dependent)

Table 2. Table 2: HMM estimation results based on applying the EM algorithm to data in the period 2010-2015; estimate of days spent in each state is based on the Viterbi algorithm.

State	Average Daily Growth Rate Across Industries	Number of Days Spent in State	% of Days Spent in State	Expected Sojourn Time
Very Bad	-250 bps	44	3 %	1 day
Bad	-25 bps	28	2 %	1 day
Normal	7 bps	1305	86 %	60 days
Good	10 bps	89	6 %	1 day
Very Good	210 bps	44	3 %	1 day

Equations272

X_{t}^{i} = X_{0}^{i} exp (\int_{0}^{t} γ_{s}^{i} d s + M_{t}^{i})

X_{t}^{i} = X_{0}^{i} exp (\int_{0}^{t} γ_{s}^{i} d s + M_{t}^{i})

d lo g X_{t}^{i} = γ_{t}^{i} d t + d M_{t}^{i}, \forall i \in N .

d lo g X_{t}^{i} = γ_{t}^{i} d t + d M_{t}^{i}, \forall i \in N .

d lo g X_{t} = γ_{t} d t + d M_{t},

d lo g X_{t} = γ_{t} d t + d M_{t},

\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{\log{\boldsymbol{X}}_{t}}=\left(\log X^{1}_{t},...,\log X^{n}_{t}\right)^{\intercal}\,,\qquad\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{{\boldsymbol{\gamma}}_{t}}=\left(\gamma^{1}_{t},...,\gamma^{n}_{t}\right)^{\intercal}\,,\qquad\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{{\boldsymbol{M}}_{t}}=\left(M^{1}_{t},...,M^{n}_{t}\right)^{\intercal}\,.

\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{\log{\boldsymbol{X}}_{t}}=\left(\log X^{1}_{t},...,\log X^{n}_{t}\right)^{\intercal}\,,\qquad\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{{\boldsymbol{\gamma}}_{t}}=\left(\gamma^{1}_{t},...,\gamma^{n}_{t}\right)^{\intercal}\,,\qquad\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{{\boldsymbol{M}}_{t}}=\left(M^{1}_{t},...,M^{n}_{t}\right)^{\intercal}\,.

where L^{p}

where L^{p}

L^{\infty, M}

ε ∥ x ∥^{2} \leq x^{⊺} Σ_{t} x \leq C ∥ x ∥^{2}, \forall t \geq 0.

ε ∥ x ∥^{2} \leq x^{⊺} Σ_{t} x \leq C ∥ x ∥^{2}, \forall t \geq 0.

π_{t}^{1} + \dots + π_{t}^{n} = 1 \mbox a . s .

π_{t}^{1} + \dots + π_{t}^{n} = 1 \mbox a . s .

{\mathcal{A}}^{2}=\big{\{}{\boldsymbol{\pi}}:\Omega\times[0,T]\rightarrow{\mathbb{R}}^{n}\text{ s.t. }{\boldsymbol{\pi}}\in{\mathbb{L}}^{2},~{}{\mathfrak{F}}\text{-adapted}\text{ and }{\boldsymbol{\pi}}_{t}^{\intercal}\mathbf{1}=1,\text{ for }t\geq 0~{}~{}{\mathbb{P}}\text{-a.s.}\big{\}}

{\mathcal{A}}^{2}=\big{\{}{\boldsymbol{\pi}}:\Omega\times[0,T]\rightarrow{\mathbb{R}}^{n}\text{ s.t. }{\boldsymbol{\pi}}\in{\mathbb{L}}^{2},~{}{\mathfrak{F}}\text{-adapted}\text{ and }{\boldsymbol{\pi}}_{t}^{\intercal}\mathbf{1}=1,\text{ for }t\geq 0~{}~{}{\mathbb{P}}\text{-a.s.}\big{\}}

{\mathcal{A}}^{\infty}=\big{\{}{\boldsymbol{\pi}}:\Omega\times[0,T]\rightarrow{\mathbb{R}}^{n}\text{ s.t. }{\boldsymbol{\pi}}\in{\mathbb{L}}^{\infty,M},~{}{\mathfrak{F}}\text{-adapted}\text{ and }{\boldsymbol{\pi}}_{t}^{\intercal}\mathbf{1}=1,\text{ for }t\geq 0~{}~{}{\mathbb{P}}\text{-a.s.}\big{\}}

{\mathcal{A}}^{\infty}=\big{\{}{\boldsymbol{\pi}}:\Omega\times[0,T]\rightarrow{\mathbb{R}}^{n}\text{ s.t. }{\boldsymbol{\pi}}\in{\mathbb{L}}^{\infty,M},~{}{\mathfrak{F}}\text{-adapted}\text{ and }{\boldsymbol{\pi}}_{t}^{\intercal}\mathbf{1}=1,\text{ for }t\geq 0~{}~{}{\mathbb{P}}\text{-a.s.}\big{\}}

d lo g Z_{t}^{π} = γ_{t}^{π} d t + π_{t}^{⊺} d M_{t},

d lo g Z_{t}^{π} = γ_{t}^{π} d t + π_{t}^{⊺} d M_{t},

w h er e γ_{t}^{π}

w h er e γ_{t}^{π}

\frac{d Z _{t}^{π}}{Z _{t}^{π}} = i = 1 \sum n π_{t}^{i} \frac{d X _{t}^{i}}{X _{t}^{i}} .

\frac{d Z _{t}^{π}}{Z _{t}^{π}} = i = 1 \sum n π_{t}^{i} \frac{d X _{t}^{i}}{X _{t}^{i}} .

\frac{d X _{t}^{i}}{X _{t}^{i}} = (γ_{t}^{i} + \frac{1}{2} ⟨ M^{i} ⟩_{t}) d t + d M_{t}^{i},

\frac{d X _{t}^{i}}{X _{t}^{i}} = (γ_{t}^{i} + \frac{1}{2} ⟨ M^{i} ⟩_{t}) d t + d M_{t}^{i},

\frac{d Z _{t}^{π}}{Z _{t}^{π}} = d lo g Z_{t}^{π} + \frac{1}{2} d ⟨ lo g Z^{π} ⟩_{t} .

\frac{d Z _{t}^{π}}{Z _{t}^{π}} = d lo g Z_{t}^{π} + \frac{1}{2} d ⟨ lo g Z^{π} ⟩_{t} .

⟨ lo g Z^{π} ⟩_{t} = i, j = 1 \sum n π_{t}^{i} π_{t}^{j} ⟨ lo g X^{i}, lo g X^{j} ⟩_{t} = π_{t}^{⊺} Σ_{t} π_{t}

⟨ lo g Z^{π} ⟩_{t} = i, j = 1 \sum n π_{t}^{i} π_{t}^{j} ⟨ lo g X^{i}, lo g X^{j} ⟩_{t} = π_{t}^{⊺} Σ_{t} π_{t}

γ_{t} = γ_{t}^{(Θ_{t})},

γ_{t} = γ_{t}^{(Θ_{t})},

d\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{\vphantom{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}{\boldsymbol{\gamma}}^{(i)}_{t}}=\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{\vphantom{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}{\boldsymbol{\phi}}}(t,{\boldsymbol{\gamma}}_{t},i)~{}dt+\underset{\color[rgb]{1,0,0}\footnotesize n\times k}{\vphantom{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}{\boldsymbol{\Phi}}}(t,{\boldsymbol{\gamma}}_{t},i)~{}d\underset{\color[rgb]{1,0,0}\footnotesize k\times 1}{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}\,.

d\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{\vphantom{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}{\boldsymbol{\gamma}}^{(i)}_{t}}=\underset{\color[rgb]{1,0,0}\footnotesize n\times 1}{\vphantom{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}{\boldsymbol{\phi}}}(t,{\boldsymbol{\gamma}}_{t},i)~{}dt+\underset{\color[rgb]{1,0,0}\footnotesize n\times k}{\vphantom{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}{\boldsymbol{\Phi}}}(t,{\boldsymbol{\gamma}}_{t},i)~{}d\underset{\color[rgb]{1,0,0}\footnotesize k\times 1}{{\boldsymbol{W}}^{\boldsymbol{\gamma}}_{t}}\,.

d lo g X_{t}^{i} = (γ^{i} + j = 1 \sum n g^{j} \mathds 1_{{r_{t}^{i} = j}}) d t + j = 1 \sum n σ^{j} \mathds 1_{{r_{t}^{i} = j}} d W_{t}^{j}

d lo g X_{t}^{i} = (γ^{i} + j = 1 \sum n g^{j} \mathds 1_{{r_{t}^{i} = j}}) d t + j = 1 \sum n σ^{j} \mathds 1_{{r_{t}^{i} = j}} d W_{t}^{j}

d Y_{t}^{π, ρ} = (γ_{t}^{π} - γ_{t}^{ρ}) d t + (π_{t} - ρ_{t})^{⊺} d M_{t},

d Y_{t}^{π, ρ} = (γ_{t}^{π} - γ_{t}^{ρ}) d t + (π_{t} - ρ_{t})^{⊺} d M_{t},

Y_{t}^{π, ρ} = Y_{0}^{π, ρ} + \int_{0}^{T} (γ_{t}^{π} - γ_{t}^{ρ}) d t + \int_{0}^{T} (π_{t} - ρ_{t})^{⊺} d M_{t} .

Y_{t}^{π, ρ} = Y_{0}^{π, ρ} + \int_{0}^{T} (γ_{t}^{π} - γ_{t}^{ρ}) d t + \int_{0}^{T} (π_{t} - ρ_{t})^{⊺} d M_{t} .

π \in A^{c} sup H (π)

π \in A^{c} sup H (π)

H (π) = E [ζ^{0} Y_{t}^{π, ρ} - \frac{1}{2} \int_{0}^{T} ζ_{s}^{1} (π_{s} - η_{s})^{⊺} Ω_{s} (π_{s} - η_{s}) d s - \frac{1}{2} \int_{0}^{T} ζ_{s}^{2} π_{s}^{⊺} Q_{s} π_{s} d s] .

H (π) = E [ζ^{0} Y_{t}^{π, ρ} - \frac{1}{2} \int_{0}^{T} ζ_{s}^{1} (π_{s} - η_{s})^{⊺} Ω_{s} (π_{s} - η_{s}) d s - \frac{1}{2} \int_{0}^{T} ζ_{s}^{2} π_{s}^{⊺} Q_{s} π_{s} d s] .

ε ∥ x ∥^{2} \leq x^{⊺} Ω_{t} x \leq C ∥ x ∥, and ε ∥ x ∥^{2} \leq x^{⊺} Q_{t} x \leq C ∥ x ∥, \forall t \geq 0.

ε ∥ x ∥^{2} \leq x^{⊺} Ω_{t} x \leq C ∥ x ∥, and ε ∥ x ∥^{2} \leq x^{⊺} Q_{t} x \leq C ∥ x ∥, \forall t \geq 0.

H (π) = E [{\int_{0}^{T} ζ^{0} (γ_{t}^{π} - γ_{t}^{ρ}) - \frac{1}{2} ζ_{t}^{1} (π_{t} - η_{t})^{⊺} Ω_{t} (π_{t} - η_{t}) - \frac{1}{2} ζ_{t}^{2} π_{t}^{⊺} Q_{t} π_{t}} d t] .

H (π) = E [{\int_{0}^{T} ζ^{0} (γ_{t}^{π} - γ_{t}^{ρ}) - \frac{1}{2} ζ_{t}^{1} (π_{t} - η_{t})^{⊺} Ω_{t} (π_{t} - η_{t}) - \frac{1}{2} ζ_{t}^{2} π_{t}^{⊺} Q_{t} π_{t}} d t] .

γ_{t} : = E [γ_{t} ∣ F_{t}] .

γ_{t} : = E [γ_{t} ∣ F_{t}] .

Σ_{t} : = E [Σ_{t} ∣ F_{t}] .

Σ_{t} : = E [Σ_{t} ∣ F_{t}] .

ε ∥ x ∥^{2} \leq x^{⊺} Σ_{t} x \leq C ∥ x ∥^{2}

ε ∥ x ∥^{2} \leq x^{⊺} Σ_{t} x \leq C ∥ x ∥^{2}

(E [γ_{t}^{i} F_{t}])^{2}

(E [γ_{t}^{i} F_{t}])^{2}

⟹ E [(E [γ_{t}^{i} F_{t}])^{2}]

⟹ E [(γ_{t}^{i})^{2}]

M_{t} : = lo g X_{t} - \int_{0}^{t} γ_{s} d s

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Active and Passive Portfolio Management with Latent Factors

Ali Al-Aradi

[email protected]

Sebastian Jaimungal

[email protected]

Department of Statistical Sciences, University of Toronto

Abstract

We address a portfolio selection problem that combines active (outperformance) and passive (tracking) objectives using techniques from convex analysis. We assume a general semimartingale market model where the assets’ growth rate processes are driven by a latent factor. Using techniques from convex analysis we obtain a closed-form solution for the optimal portfolio and provide a theorem establishing its uniqueness. The motivation for incorporating latent factors is to achieve improved growth rate estimation, an otherwise notoriously difficult task. To this end, we focus on a model where growth rates are driven by an unobservable Markov chain. The solution in this case requires a filtering step to obtain posterior probabilities for the state of the Markov chain from asset price information, which are subsequently used to find the optimal allocation. We show the optimal strategy is the posterior average of the optimal strategies the investor would have held in each state assuming the Markov chain remains in that state. Finally, we implement a number of historical backtests to demonstrate the performance of the optimal portfolio.

keywords:

Active portfolio management; Convex analysis; Stochastic Portfolio Theory; Functionally generated portfolios; Rank-based models; Growth optimal portfolio; Hidden Markov models; Partial information.

t1t1footnotetext: The authors would like to thank NSERC for partially funding this work.

1 Introduction

Problems in portfolio management can be divided into two types: active and passive. In the former, investors aim to achieve superior portfolio returns; in the latter, the investors’ goal is to track a preselected index; see, for example, Buckley and Korn (1998) or Pliska and Suzuki (2004). One can further separate active portfolio management objectives into two types: absolute and relative. There is a great deal of literature dedicated to solving various portfolio selection problems with absolute goals via stochastic control theory. The seminal work of Merton (1969) introduced the dynamic asset allocation and consumption problem, utilizing stochastic control techniques to derive optimal investment and consumption policies. Extensions can be found in Merton (1971), Magill and Constantinides (1976), Davis and Norman (1990), Browne (1997) and more recently Blanchet-Scalliet et al. (2008), Liu and Muhle-Karbe (2013) and Ang et al. (2014) to name a few. The focus in these papers is generally on maximizing the utility of discounted consumption and terminal wealth or minimizing shortfall probability, or other related absolute performance measures that are independent of any external benchmark or relative goal. Works on optimal active portfolio management with relative goals (i.e. attempting to outperform a given benchmark) can be found in Browne (1999a), Browne (1999b) and Browne (2000), Pham (2003) and, more recently, Oderda (2015).

There are also several works that address the question of achieving absolute portfolio selection goals when parameters are stochastic, including cases where the investor only has access to partial information and must rely on Bayesian learning or filtering techniques to solve for their optimal allocation. Merton (1971) solves for the portfolio that maximizes expected terminal wealth assuming that the instantaneous expected rate of return follows a mean-reverting diffusive process. Lakner (1998) extends this to the case where the drift processes are unobservable. In Rieder and Bäuerle (2005) the assets’ drift switches between various quantities according to an unobservable Markov chain; Frey et al. (2012) extends this to incorporate expert opinions, in the form of signals at random discrete times, into the filtering problem by using this observable information to obtain posterior probabilities for the state of the Markov chain. Bäuerle and Rieder (2007) introduces jumps to the asset price dynamics by including Poisson random measures with unobservable intensity processes. Latent models are also central to the work of Casgrain and Jaimungal (2018b) and Casgrain and Jaimungal (2018a) in the context of algorithmic trading and mean field games.

Many of the concepts discussed in this work, particularly the notion of functionally generated portfolios (FGPs) and rank-based models, are key concepts in Stochastic Portfolio Theory (SPT) (see Fernholz (2002) and Karatzas and Fernholz (2009) for a thorough overview). SPT is a flexible framework for analyzing portfolio behavior and market structure which takes a descriptive, rather than a normative, approach to addressing these issues, and emphasizes the use of observable quantities to make its predictions and conclusions. The appeal of SPT partially lies in the fact that it relies on a minimal set of assumptions that are readily satisfied in real equity markets and that the techniques it employs construct relative arbitrage portfolios that outperform the market almost surely without the need for parameter estimation. This is primarily done through the machinery of portfolio generating functions (PGFs), which are portfolio maps that give rise to investing strategies that depend only on prevailing market weights. A discussion of the relative arbitrage properties of FGPs and related approaches to achieving outperformance vis-à-vis the market portfolio can be found in Pal and Wong (2013), Wong (2015) and Pal and Wong (2016).

Although SPT focuses on almost sure outperformance, i.e. relative arbitrage with respect to the benchmark portfolio, we deviate from this criterion in favor of maximizing the expected growth rate differential. We present two justifications for this choice. First, certain rank-based models such as the first-order models admit equivalent martingale measures over all horizons implying the non-existence of relative arbitrage opportunities. This forces the investor to select an alternative performance criterion. Secondly, Fernholz (2002) argues for the use of functionally-generated portfolios such as diversity-weighted portfolios as benchmarks for active equity portfolio management given their passive, rule-based nature and ease of implementation. However, Wong (2015) notes that under certain reasonable conditions relative arbitrage opportunities do not exist with respect to these portfolios. Therefore, once again the investor must seek a substitute for almost sure outperformance if they decide to have a performance benchmark of this sort. One SPT-inspired work that uses an expectation-based objective function is Samo and Vervuurt (2016), in which machine learning techniques are utilized to achieve outperformance in expectation by maximizing the investor’s Sharpe ratio.

Active managers often dynamically invest in markets with the goal of achieving optimal relative returns against a performance benchmark while anchoring their portfolio to a tracking benchmark (in the sense of incurring minimal active risk/tracking error). They also often have in mind additional constraints on the investor’s portfolio, e.g. penalizing large positions in certain assets or excessive volatility in the investor’s wealth. In Al-Aradi and Jaimungal (2018), the authors formulate these goals and constraints by posing a portfolio optimization problem with log-utility of relative wealth, together with running penalty terms that incorporate the investor’s constraints on tracking a benchmark and total risk. They solve the problem in closed-form using dynamic programming under the assumptions that the benchmarks are differentiable maps that are Markovian in the asset values; this encompasses the market portfolio and, more broadly, the class of (time-dependent) functionally generated portfolios.

A shortcoming of Al-Aradi and Jaimungal (2018) is that when the investor values outperformance, the optimal solution relies crucially on the asset growth rate estimates, which are assumed to be bounded, differentiable, deterministic functions of time. However, returns are notoriously difficult to estimate robustly and the deterministic assumption does not provide adequate estimates. To address this shortcoming, here, we allow for growth rates to be stochastic and be driven by latent factors. This is essential to making the strategy robust to differing market environments. Our formulation also accommodates rank-based models; such models exploit the stability of capital distribution in the market to arrive at estimates of asset growth rates based on asset ranks.

Our modeling assumption is similar to that adopted in Casgrain and Jaimungal (2018a), who study the mean-field version of an algorithmic trading problem, where assets are driven by two components: a drift term and a martingale component both of which are adapted to an unobservable filtration. The investor’s strategy, on the other hand, is restricted to be adapted to a smaller filtration; namely, the natural filtration generated by the price process.

The approach we take to solve the stochastic control problem is based on techniques from convex analysis as in Bank et al. (2017) and Casgrain and Jaimungal (2018a), however these techniques date as far back as Cvitanić and Karatzas (1992). The reason we deviate from the dynamic programming approach taken in Al-Aradi and Jaimungal (2018) centers around the difficulty of extending that approach to more general market models. Although possible, it would be a difficult task to ensure that all the additional state variables (which would include various semimartingale local times in the case of rank-based models) satisfy the conditions for a Feynman-Kac representation to the HJB equation that arises from the control problem, which is a central aspect of the proof. A number of additional (possibly restrictive) assumptions would have to be made on the market model and, as such, the approach we adopt in the current work allows for a more succinct solution to a more general problem with fewer assumptions.

2 Model Setup

2.1 Market Model

We adopt a market model that generalizes the one in Al-Aradi and Jaimungal (2018) and is a multidimensional version of the one used in Casgrain and Jaimungal (2018a). Let $(\Omega,{\mathcal{G}},{\mathfrak{G}},{\mathbb{P}})$ be a filtered probability space, where ${\mathfrak{G}}=\{{\mathcal{G}}_{t}\}_{t\geq 0}$ is the natural filtration generated by all processes in the model. We assume that the market consists of $n$ assets defined as follows:

Definition 1

The stock price process for asset $i$ , $X^{i}=\left(X^{i}_{t}\right)_{t\geq 0}$ for all $i\in{\mathfrak{N}}\coloneqq\{1,\dots,n\}$ , is a positive semimartingale satisfying:

[TABLE]

where $\gamma^{i}=\left(\gamma^{i}_{t}\right)_{t\geq 0}$ is a ${\mathfrak{G}}$ -adapted process representing the asset’s (total) growth rate and $M^{i}=\left(M^{i}_{t}\right)_{t\geq 0}$ is a ${\mathfrak{G}}$ -adapted martingale with $M^{i}_{0}=0$ representing the asset’s noise component.

It is convenient to work with the logarithmic representation of asset dynamics:

Proposition 1

The logarithm of prices, $\log X^{i}$ , satisfies the stochastic differential equation:

[TABLE]

This can also be expressed in vector notation as follows:

[TABLE]

where

[TABLE]

We make the following assumption on the growth rate and noise component of asset prices:

Assumption 1

The growth rate and martingale noise processes satisfy one of the two following conditions:

(a)

${\boldsymbol{\gamma}}$ , ${\boldsymbol{M}}\in{\mathbb{L}}^{2}$ ; 2. (b)

${\boldsymbol{\gamma}}\in{\mathbb{L}}^{\infty,M}$ * and ${\boldsymbol{M}}\in{\mathbb{L}}^{1}$ ,*

[TABLE]

In the assumption above $\|{\boldsymbol{x}}\|_{p}\coloneqq\left(\sum_{i=1}^{n}|x_{i}|^{p}\right)^{1/p}$ and $\|{\boldsymbol{x}}\|_{\infty}\coloneqq\underset{i\in{\mathfrak{N}}}{\max}\hskip 2.84544pt|x_{i}|$ for ${\boldsymbol{x}}\in{\mathbb{R}}^{n}$ denote the $p$ -norm and $\infty$ -norm on ${\mathbb{R}}^{n}$ , respectively. Furthermore, we will make use of the shorthand notation $\|{\boldsymbol{x}}\|\coloneqq\|{\boldsymbol{x}}\|_{2}$ to denote the usual Euclidean norm.

We also assume the quadratic co-variation processes associated with the noise component satisfy

Assumption 2

Let $\mathbf{\Sigma}$ be the matrix whose $ij$ -th element is the quadratic covariation process between $M_{i}$ and $M_{j}$ , $\Sigma_{ij}\coloneqq\langle M_{i},M_{j}\rangle_{t}$ . We assume that, for each ${\mathbf{x}}\in{\mathbb{R}}^{n}$ , there exists $\varepsilon>0$ and $C<\infty$ such that

[TABLE]

This is an extension of the usual non-degeneracy and bounded variance conditions.

Remark 1

The constant $M$ in ${\mathbb{L}}^{\infty,M}$ of Assumption 1 may depend on the constants $\varepsilon$ and $C$ that appear in Assumption 2, but provided that $M$ is sufficiently large we can ensure that the candidate optimal solution that we obtain is in fact in the set of admissible controls.

2.2 Portfolios and Observable Information

The investor does not have access to the latent processes driving asset prices and observes asset prices alone (it is possible to allow other processes in addition to the price process, but here we restrict to this case). Let the filtration ${\mathfrak{F}}=\left\{{\mathcal{F}}_{t}\right\}_{t\geq 0}$ where ${\mathcal{F}}_{t}=\sigma\left(\left\{{\boldsymbol{X}}_{s}\right\}_{s\in[0,t]}\right)$ denotes the investor’s filtration.

Definition 2

A portfolio is a measurable, ${\mathfrak{F}}$ -adapted, vector-valued process ${\boldsymbol{\pi}}=({\boldsymbol{\pi}}_{t})_{t\geq 0}$ , where ${\boldsymbol{\pi}}_{t}=\left(\pi^{1}_{t},...,\pi^{n}_{t}\right)^{\intercal}$ such that for all $t\geq 0$ , ${\boldsymbol{\pi}}_{t}$ satisfies:

[TABLE]

Furthermore, we define the set of admissible portfolios as follows:

(a)

if Assumption 1(a) is enforced, we assume ${\boldsymbol{\pi}}\in{\mathbb{L}}^{2}$ and define :

[TABLE] 2. (b)

if Assumption 1(b) is enforced, we assume ${\boldsymbol{\pi}}\in{\mathbb{L}}^{\infty,M}$ and define:

[TABLE]

In the sequel, we write ${\mathcal{A}}^{c}$ to denote either ${\mathcal{A}}^{2}$ or ${\mathcal{A}}^{\infty}$ depending on which part of Assumption 1 is being enforced.

Remark 2

The cost of allowing for ${\mathbb{L}}^{1}$ noise processes is that both growth rate processes and admissible portfolios are ${\mathbb{L}}^{\infty,M}$ rather than ${\mathbb{L}}^{2}$ processes.

In the definition above, portfolios are adapted to the filtration ${\mathfrak{F}}\subseteq{\mathfrak{G}}$ , which is the information set generated by the asset price paths and not the full information set ${\mathfrak{G}}$ . The latter includes the noise component ${\boldsymbol{M}}$ as well as its quadratic covariation process $\mathbf{\Sigma}$ , both assumed unobservable. This ensures that strategies depend only on fully observable quantities which in our context are limited to the asset price processes.

Given the model dynamics, and portfolio assumptions, we next derive the dynamics of wealth associated with an arbitrary portfolio ${\boldsymbol{\pi}}$ :

Proposition 2

The logarithm of the portfolio value process $Z^{\boldsymbol{\pi}}=(Z^{\boldsymbol{\pi}}_{t})_{t\geq 0}$ satisfies the SDE:

[TABLE]

and $\gamma^{\boldsymbol{\pi}}$ and $\Gamma^{\boldsymbol{\pi}}$ are the portfolio growth rate and excess growth rate processes, respectively.

Proof. The proof follows the same steps as the proof of Proposition 1.1.5. of Fernholz (2002). The proportional change in the value of portfolio ${\boldsymbol{\pi}}$ is a weighted average of the simple return of each asset held in the portfolio:

[TABLE]

From the asset dynamics in (2.2) and an application of Itô’s lemma we have:

[TABLE]

where $\langle M^{i}\rangle_{t}=\langle M^{i},M^{i}\rangle_{t}$ is the quadratic variation process of $\log X^{i}$ . Another application of Itô’s lemma on the portfolio wealth process dynamics gives

[TABLE]

The result follows by noting that the quadratic variation $\langle\log Z^{\boldsymbol{\pi}}\rangle_{t}$ is given by:

[TABLE]

and then rearranging terms.

2.3 Market Model Examples

Al-Aradi and Jaimungal (2018) assume growth rates and volatilities are bounded, differentiable, deterministic functions and the only driver of asset prices was a multidimensional Wiener process. In this section we present two market models satisfying the assumptions in this paper that allow for more general asset growth rates. The two models are presented with the goal of improved growth rate estimation in mind.

2.3.1 Diffusion-Switching Growth Rate Process

The diffusion-switching model assumes that asset growth rates switch between a number of possible diffusion processes according to an underlying Markov chain. That is:

[TABLE]

where $\Theta=(\Theta_{t})_{t\geq 0}$ is a continuous-time Markov chain with state space ${\mathfrak{M}}\coloneqq\{1,...,m\}$ and ${\boldsymbol{\gamma}}^{(i)}_{t}$ is the growth rate diffusion process associated with state $i\in{\mathfrak{M}}$ given as the solution to the SDE:

[TABLE]

In this formulation, ${\boldsymbol{W}}^{\boldsymbol{\gamma}}$ is a $k$ -dimensional Wiener process driving the growth rate diffusions and ${\boldsymbol{\phi}}$ and ${\boldsymbol{\Phi}}$ are the drift and volatility functions of the growth rate. We require ${\boldsymbol{\phi}}$ and ${\boldsymbol{\Phi}}$ to be chosen so that ${\boldsymbol{\gamma}}^{(i)}\in{\mathbb{L}}^{2}$ for all $i$ . A sufficient set of conditions for this are the usual Lipschitz and polynomial growth conditions that guarantee the existence of a unique, square-integrable strong solution to the SDE (see Theorem 2.9 in Chapter 5 of Karatzas and Shreve (1998)). Figure 1 shows a simulation of this process when the possible diffusions are Ornstein-Uhlenbeck (OU) processes.

In Section 4, we take both ${\boldsymbol{\phi}}$ and ${\boldsymbol{\Phi}}$ to be identically zero. This recovers the hidden Markov model (HMM) used in Rieder and Bäuerle (2005), where the growth rate switches between a number of possible constants rather than diffusion processes. This simplifies the calibration process and this is the model we employ in the implementation.

2.3.2 Second-Order Rank-Based Model

An alternative model that may be considered is the second-order rank-based model of equity markets as described in Fernholz et al. (2013). In this model, an asset’s price dynamics depend on the rank of the asset’s market weights; typically, smaller assets have higher growth rates and volatilities than larger assets. The goal of this modeling approach is to better capture observed long-term characteristics of capital distribution in equity markets, such as average rank occupation times, by exploiting the inherent stability in the capital distribution curve.

Let $r^{i}_{t}$ be the rank of asset $i$ at time $t$ , the asset price is assumed to satisfy the SDE:

[TABLE]

That is, $\gamma^{i}$ is the “name”-based growth rate of asset $i$ and $g^{j}$ is the additional growth an asset experiences when its capitalization occupies rank $j$ . Similarly, $\sigma^{j}$ is the volatility of the asset in rank $j$ . We assume the model parameters satisfy the requirements for the market to form an asymptotically stable system; see Fernholz et al. (2013), which also provides an outline for parameter estimation for this class of models.

It is important to notice that when this model is assumed, the rank processes for each of the stocks must be incorporated in the optimization problem as state variables. This can vastly complicate the proof of optimality when using a dynamic programming approach. The approach we take in the present work does not suffer from these issues involving local times and non-differentiability. Finally, we note that it is possible to create a hybrid model that is rank-dependent and driven by an unobservable Markov chain, but this may lead to difficulties in the parameter estimation.

3 Stochastic Control Problem

3.1 Description

The stochastic control problem we consider is similar to the one posed in Al-Aradi and Jaimungal (2018). The investor fixes two portfolios against which they measure their outperformance and their active risk, respectively. That is, the investor chooses a performance benchmark ${\boldsymbol{\rho}}$ , which they wish to outperform, and a tracking benchmark ${\boldsymbol{\eta}}$ , which they penalize deviations from. The objective is to determine the portfolio process ${\boldsymbol{\pi}}$ that maximizes the expected growth rate differential relative to ${\boldsymbol{\rho}}$ over the investment horizon $T$ . Moreover, the investor is penalized for taking on excessive levels of active risk (measured against ${\boldsymbol{\eta}}$ ). An additional penalty independent of the two benchmarks is also included to control absolute risk (as measured by quadratic variation of wealth) or penalize allocation to certain assets as discussed in Section 4 of Al-Aradi and Jaimungal (2018).

The main state variable in our optimization problem is the logarithm of the ratio of the wealth of an arbitrary portfolio ${\boldsymbol{\pi}}$ relative to a preselected performance benchmark ${\boldsymbol{\rho}}$ . Let $Y^{{\boldsymbol{\pi}},{\boldsymbol{\rho}}}_{t}=\log\left(\frac{Z^{\boldsymbol{\pi}}_{t}}{Z^{\boldsymbol{\rho}}_{t}}\right)$ denote the logarithm of relative portfolio wealth for the portfolios ${\boldsymbol{\pi}}$ and ${\boldsymbol{\rho}}$ . Then this process satisfies the SDE:

[TABLE]

which in turn implies

[TABLE]

Our main stochastic control problem is to find the optimal portfolio ${\boldsymbol{\pi}}^{*}$ which, if the supremum is attained in the set of admissible strategies, achieves

[TABLE]

where $H({\boldsymbol{\pi}})$ is the performance criteria of a portfolio ${\boldsymbol{\pi}}$ given by:

[TABLE]

Here, $\zeta^{0}$ is a constant and ${\boldsymbol{\zeta}}=\left({\boldsymbol{\zeta}}_{t}\right)_{t\geq 0}$ with ${\boldsymbol{\zeta}}_{t}=\left(\zeta^{0},\zeta^{1}_{t},\zeta^{2}_{t}\right)$ is an ${\mathfrak{F}}$ -adapted process defined on $[0,\overline{\zeta}]^{3}$ for some fixed $\overline{\zeta}<\infty$ . The vector ${\boldsymbol{\zeta}}_{t}$ represents the subjective preference parameters set by the investors to reflect their emphasis on three goals:

The first term is a terminal reward term which corresponds to the investor wishing to maximize the expected growth rate differential between their portfolio and the performance benchmark ${\boldsymbol{\rho}}$ . It is also equivalent to maximizing the expected utility of relative wealth assuming a log-utility function. 2. 2.

The second term is a running penalty term which penalizes deviations from the tracking benchmark. When $\mathbf{\Omega}_{t}=\mathbf{\Sigma}_{t}$ , the investor is penalizing risk-weighted deviations from the tracking benchmark, with deviations in riskier assets being penalized more heavily. Thus, this can be seen as the investor aiming to minimize tracking error/active risk. 3. 3.

The final term is a general quadratic running penalty term that does not involve either benchmark. One possible choice for ${\mathbf{Q}}_{t}$ is the covariance matrix $\mathbf{\Sigma}_{t}$ , which can be adopted to minimize the absolute risk of the portfolio measured in terms of the quadratic variation of the portfolio wealth process, $Z^{\boldsymbol{\pi}}_{t}$ . Another option is to let ${\mathbf{Q}}_{t}$ be a constant diagonal matrix, which has the effect of penalizing allocation in each asset according to the magnitude of the corresponding diagonal entry. The investor can use this choice of ${\mathbf{Q}}$ as a way of imposing a set of “soft” constraints on allocation to each asset.

The reader is referred to Al-Aradi and Jaimungal (2018) for further interpretation of these terms.

Remark 3

The two preference parameters $\zeta_{t}^{1,2}$ can be stochastic; e.g., they may depend on the investor’s wealth level or other factors. Furthermore, the preference parameters are restricted to $[0,\overline{\zeta}]^{3}$ for two reasons: firstly, it simplifies the proof of optimality; secondly, from Al-Aradi and Jaimungal (2018), the results are driven by the relative weights, rather than absolute weights, therefore restriction to the cube results in no loss of generality.

Remark 4

The benchmarks may be non-Markovian; if they are Markovian and can be represented as ${\boldsymbol{\rho}}_{t}=\rho(t,{\boldsymbol{X}}_{t})$ and ${\boldsymbol{\eta}}_{t}=\eta(t,{\boldsymbol{X}}_{t})$ , the functions $\rho$ and $\eta$ are not restricted to be differentiable. This allows for a much wider class of benchmarks including rank-based portfolios and portfolios constructed using additional information not related to asset prices, e.g., factor portfolios based on company fundamentals. Benchmarks from the class of functionally generated portfolios are allowed, including the market portfolio, as well as portfolios generated by rank-dependent portfolio generating functions, such as large-cap portfolios.

We also require the following assumption on the relative and absolute penalty matrices $\mathbf{\Omega}$ and ${\mathbf{Q}}$ :

Assumption 3

The penalty matrices $\mathbf{\Omega}$ and ${\mathbf{Q}}$ are ${\mathfrak{F}}$ -adapted matrix-valued stochastic processes such that, for each $\mathbf{x}\in{\mathbb{R}}^{n}$ , there exists constants $\varepsilon>0$ and $C<\infty$ satisfying

[TABLE]

These bounds play an analogous role to the nondegeneracy and bounded variance assumptions made on the quadratic covariation $\mathbf{\Sigma}$ , and ensure that the candidate optimal control we derive later is in fact admissible.

Allowing for stochastic penalty matrices is useful as it opens the door for stochastic volatility models in the case of $\mathbf{\Omega}$ (when choosing $\mathbf{\Omega}=\mathbf{\Sigma}$ ) and stochastic transaction costs in the case of ${\mathbf{Q}}$ .

We next rewrite the control problem in terms of running reward/penalty terms. When either of the conditions in Assumption 1 is enforced, the expected value of the last integral in (3.2) is zero as the stochastic integral is in fact a martingale. Further, assuming that $Z^{\boldsymbol{\pi}}_{0}=Z^{\boldsymbol{\rho}}_{0}$ , the performance criteria becomes

[TABLE]

The generalizations achieved thus far compared to Al-Aradi and Jaimungal (2018) are summarized in Table 3.1 below.

Bibliography44

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Al-Aradi and Jaimungal (2018) Al-Aradi, A. and S. Jaimungal (2018). Outperformance and tracking: Dynamic asset allocation for active and passive portfolio management. Applied Mathematical Finance, Forthcoming .
2Ang et al. (2014) Ang, A., D. Papanikolaou, and M. M. Westerfield (2014). Portfolio choice with illiquid assets. Management Science 60 (11), 2737–2761.
3Bank et al. (2017) Bank, P., H. M. Soner, and M. Voß (2017). Hedging with temporary price impact. Mathematics and Financial Economics 11 (2), 215–239.
4Bäuerle and Rieder (2007) Bäuerle, N. and U. Rieder (2007). Portfolio optimization with jumps and unobservable intensity process. Mathematical Finance 17 (2), 205–224.
5Baum et al. (1970) Baum, L. E., T. Petrie, G. Soules, and N. Weiss (1970). A maximization technique occurring in the statistical analysis of probabilistic functions of markov chains. The annals of mathematical statistics 41 (1), 164–171.
6Biernacki et al. (2000) Biernacki, C., G. Celeux, and G. Govaert (2000, July). Assessing a mixture model for clustering with the integrated completed likelihood. IEEE Trans. Pattern Anal. Mach. Intell. 22 (7), 719–725.
7Bishop (2006) Bishop, C. M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics) . Berlin, Heidelberg: Springer-Verlag.
8Blanchet-Scalliet et al. (2008) Blanchet-Scalliet, C., N. E. Karoui, M. Jeanblanc, and L. Materllini (2008). Optimal investment decisions when time-horizon is uncertain. Journal of Mathematical Economics 44 (11), 1100–1113.