Incorporating Black-Litterman Views in Portfolio Construction when Stock Returns are a Mixture of Normals
Burak Kocuk, G\'erard Cornu\'ejols

TL;DR
This paper develops a method for portfolio construction that combines market equilibrium insights with a mixture of normals return model, improving risk management while maintaining competitive returns.
Contribution
It introduces a convex programming approach to CVaR minimization under a mixture of normals and integrates Black-Litterman views via inverse optimization.
Findings
The proposed method effectively reduces portfolio risk compared to market-only strategies.
The mixture of normals model accurately captures return distributions.
Empirical results show improved risk profiles with comparable returns.
Abstract
In this paper, we consider the basic problem of portfolio construction in financial engineering, and analyze how market-based and analytical approaches can be combined to obtain efficient portfolios. As a first step in our analysis, we model the asset returns as a random variable distributed according to a mixture of normal random variables. We then discuss how to construct portfolios that minimize the Conditional Value-at-Risk (CVaR) under this probabilistic model via a convex program. We also construct a second-order cone representable approximation of the CVaR under the mixture model, and demonstrate its theoretical and empirical accuracy. Furthermore, we incorporate the market equilibrium information into this procedure through the well-known Black-Litterman approach via an inverse optimization framework by utilizing the proposed approximation. Our computational experiments on a…
| Normal | Mixture (, ) | ||||||
|---|---|---|---|---|---|---|---|
| Sector | |||||||
| Energy | 1.1769 | 6.2823 | -0.0686 | 1.4687 | 8.5162 | 5.5799 | |
| Consumer Discretionary | 1.5112 | 5.3195 | 0.4788 | 1.7532 | 8.2673 | 4.3019 | |
| Consumer Staples | 1.3905 | 4.1470 | 0.6265 | 1.5696 | 6.7426 | 3.2166 | |
| Real Estate | 1.1514 | 7.2451 | 0.3782 | 1.3327 | 11.9825 | 5.5492 | |
| Industrials | 1.2887 | 5.1505 | 0.1638 | 1.5523 | 8.1720 | 4.0790 | |
| Financials | 1.3322 | 6.2810 | 0.7178 | 1.4762 | 10.0766 | 4.9656 | |
| Telecommunications Services | 1.0318 | 5.4780 | 0.3681 | 1.1873 | 8.7565 | 4.3408 | |
| Information Technology | 1.7264 | 7.1032 | 1.3907 | 1.8051 | 11.8056 | 5.4234 | |
| Materials | 1.3898 | 5.6955 | 0.0673 | 1.6998 | 8.4971 | 4.7437 | |
| Health Care | 1.4164 | 4.6432 | 1.3203 | 1.4389 | 6.9005 | 3.9212 | |
| Utilities | 1.0140 | 4.2801 | 0.6794 | 1.0924 | 6.1977 | 3.6764 | |
| Average | |||
|---|---|---|---|
| Arithmetic | 2.40 | 0.45 | 0.76 |
| Geometric | 0.75 | 0.17 | 0.42 |
| Avg | St Dev | 1% CVaR | Avg/St Dev | Avg/1% CVaR | |
|---|---|---|---|---|---|
| Market | 1.31 | 4.17 | 13.01 | 0.31 | 0.10 |
| 1.31 | 3.86 | 11.79 | 0.34 | 0.11 | |
| 1.30 | 3.79 | 11.50 | 0.34 | 0.11 | |
| 1.29 | 3.61 | 10.67 | 0.36 | 0.12 | |
| 1.20 | 3.41 | 9.38 | 0.35 | 0.13 | |
| 1.31 | 4.04 | 12.50 | 0.32 | 0.10 | |
| 1.31 | 3.79 | 11.52 | 0.34 | 0.11 | |
| 1.29 | 3.52 | 10.33 | 0.37 | 0.12 | |
| 1.24 | 3.39 | 9.62 | 0.37 | 0.13 |
| Avg | St Dev | 1% CVaR | ||
|---|---|---|---|---|
| 1.31 | 3.49 | 9.67 | ||
| 1.38 | 3.65 | 10.22 | ||
| 1.31 | 3.46 | 9.86 | ||
| 1.38 | 3.62 | 10.37 |
| Avg | St Dev | 1% CVaR | 0.1% CVaR | 0.05% CVaR | Avg/St Dev | Avg/1% CVaR | |
|---|---|---|---|---|---|---|---|
| Market | 1.31 | 4.08 | 12.46 | 17.59 | 18.74 | 0.32 | 0.10 |
| 1.31 | 3.80 | 11.27 | 15.79 | 16.73 | 0.35 | 0.12 | |
| 1.31 | 3.72 | 10.94 | 15.34 | 16.10 | 0.35 | 0.12 | |
| 1.30 | 3.57 | 10.29 | 14.36 | 15.10 | 0.36 | 0.13 | |
| 1.24 | 3.49 | 9.75 | 13.90 | 14.78 | 0.35 | 0.13 | |
| 1.31 | 3.95 | 11.91 | 16.90 | 17.88 | 0.33 | 0.11 | |
| 1.31 | 3.72 | 10.94 | 15.53 | 16.25 | 0.35 | 0.12 | |
| 1.29 | 3.51 | 10.03 | 14.16 | 14.88 | 0.37 | 0.13 | |
| 1.25 | 3.43 | 9.58 | 13.69 | 14.52 | 0.37 | 0.13 |
| Avg | St Dev | 1% CVaR | |
|---|---|---|---|
| 1.31 | 3.83 | 11.12 | |
| 1.22 | 3.52 | 9.98 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Incorporating Black-Litterman Views
in Portfolio Construction
when Stock Returns are a Mixture of Normals
Burak Kocuk and Gérard Cornuéjols Corresponding author. [email protected], Industrial Engineering Program, Sabancı University, Istanbul, Turkey 34956. [email protected], Tepper School of Business, Carnegie Mellon University, Pittsburgh, PA 15213 USA.
Abstract
In this paper, we consider the basic problem of portfolio construction in financial engineering, and analyze how market-based and analytical approaches can be combined to obtain efficient portfolios. As a first step in our analysis, we model the asset returns as a random variable distributed according to a mixture of normal random variables. We then discuss how to construct portfolios that minimize the Conditional Value-at-Risk (CVaR) under this probabilistic model via a convex program. We also construct a second-order cone representable approximation of the CVaR under the mixture model, and demonstrate its theoretical and empirical accuracy. Furthermore, we incorporate the market equilibrium information into this procedure through the well-known Black-Litterman approach via an inverse optimization framework by utilizing the proposed approximation. Our computational experiments on a real dataset show that this approach with an emphasis on the market equilibrium typically yields less risky portfolios than a purely market-based portfolio while producing similar returns on average.
Keywords: portfolio selection, finance, the Black-Litterman model, mixture of normals, conditional value-at-risk
1 Introduction
Portfolio construction is one of the most fundamental problems in financial engineering: Given risky assets, historical information about their returns and market capitalization of these assets, construct a portfolio that will produce the maximum expected return at a minimum risk. Maximizing expected return and minimizing risk are often conflicting objectives, and hence a compromise should be made by investors based on their risk aversion.
We consider two paradigms for solving this problem: an “analytical” approach and one that is “market-based”. In the analytical approach, key parameters such as the vector of mean asset returns, denoted by , and covariance between the asset returns, denoted by , are estimated from historical data. Then, a combination of the expected portfolio return and a risk measure is optimized by solving a problem of the form:
[TABLE]
Here, is the risk of portfolio under a chosen risk measure, is a positive, predetermined risk aversion factor and is the set of feasible portfolios. The earliest example of this approach is \citeAmarkowitz1952, in which a mean-variance (MV) optimization problem111Although variance and standard deviation are technically deviation measures, by adapting a slight abuse of terminology, we will be referring to them as risk measures as well. is proposed with assets, and .
In the market-based approach, one merely invests in a “market portfolio” proportional to the current market capitalization of the assets. The logic behind this market-based approach is the Efficient-Market Hypothesis Fama (\APACyear1970), which loosely states that the price of an asset captures all the information about that asset.
There are advantages and disadvantages to each approach. The major advantage of the analytical approach is that if the parameter predictions are accurate, then it can yield provably optimal portfolios. Unfortunately, this almost never happens in practice. In particular, the estimation of the mean return vector is error-prone and even a small perturbation in the parameter estimation can yield completely different portfolios due to what is called the “error-maximization property” in \citeAMichaud1989. Robust optimization techniques are proposed to circumvent the difficulties caused by parameter estimation in \citeAgoldfarb2003, ghaoui2003, tutuncu2004, ceria2006, demiguel2009, zhu2009worst. Another issue with the analytical approach is the determination of the risk aversion parameter . Choosing smaller values of puts more emphasis on the expected return and since the estimates of are generally inaccurate, it may lead to poor portfolios in practice. One may alleviate this issue by simply choosing infinitely large, thus reducing the generic portfolio optimization (1) to a “risk minimizing” portfolio problem (see, for instance, \citeAdemiguel2009).
Another issue regarding the analytical approach is to specify an adequate risk measure. Although variance (or standard deviation) is a typical risk measure, others are also used, for instance, Value-at-Risk (VaR) and Conditional Value-at-Risk (CVaR), which can better quantify the downside risk. One difficulty with these measures is that either a distributional assumption should be made, which requires modeling stock returns as random variables, or sampling-based simulation methods should be utilized based on the historical observations. A first choice is a multivariate normal distribution, which allows easy-to-solve risk minimization problems for both VaR and CVaR measures. However, the normal distribution typically does not provide a good fit to the stock return data due to heavy tails. Other probabilistic models have been proposed including stable distribution Mandelbrot (\APACyear1963), -distribution Blattberg \BBA Gonedes (\APACyear1974), and mixture of multivariate normals Chin \BOthers. (\APACyear1999); Buckley \BOthers. (\APACyear2008); Chen \BBA Yu (\APACyear2013); Wang \BBA Taaffe (\APACyear2015) among others. As the probabilistic model becomes more complex, it might be difficult to solve optimization problems involving VaR or CVaR terms. In such cases, return scenarios can be generated through Monte Carlo simulation to estimate these measures without any distributional assumption Krokhmal \BOthers. (\APACyear2002).
The main advantage of the market-based approach is its simplicity since it does not require any parameter estimation, optimization or distribution fitting procedure. One can, for instance, simply track Standard & Poor’s 500 index, which is arguably a representative proxy for the United States stock market. The disadvantage of the market-based approach is its inflexibility. For instance, if an investor believes that a certain stock will outperform another, this approach does not allow to incorporate this view.
Several studies try to combine the two approaches. For instance, in the early 1990s, \citeAblack1991, black1992 proposed a way of combining the market portfolio and investor views into the classical MV approach. In practice, portfolios obtained using the Black-Litterman (BL) methodology tend to be more robust to data perturbations. But there are also some issues with the BL methodology: For instance, the derivation of the estimates is not very intuitive and a number of papers including \citeAhe1999, satchell2000, drobetz2001avoid, meucci2010black attempt to clarify it from different perspectives. Moreover, the BL derivation is based on strong assumptions, one of them being the normality of the random return vector. Furthermore, parameters have to be determined exogenously to incorporate the confidence in the investor views.
Recently, the BL model has been generalized using an interpretation as an inverse optimization problem in \citeAbertsimas2012. Different from the classical derivation of the BL model, mean and covariance of the returns are determined as the solution of a conic program. This approach is flexible enough to eliminate some of the shortcomings of the BL methodology such as its inability to include views on variance, necessity to exogenously choose several parameters etc. Other recent extensions of the BL methodology include \citeAjia2016 in which inverse optimization incorporates views on the variance, \citeApang2018 in which a closed-form solution is derived when the risk measure is chosen as CVaR and the stock returns are assumed to follow an elliptical distribution, and \citeAsilva2017 in which views are created using Verbal Decision Analysis.
In this paper, we address several issues raised above and extend recent work. We start our analysis by focusing on the vector of stock returns modeled as a random variable. Using Standard & Poor’s (S&P) 500 dataset, we show that the returns are not normally distributed via statistical tests, and we propose an enhanced probabilistic model, namely a mixture of normal random variables. Then, we discuss how to construct risk minimizing portfolios under different probabilistic models (normal and mixture of normals) and risk measures (standard deviation and CVaR). We also propose a BL-type approach, which incorporates market-based information into CVaR minimizing portfolios.
\citeA
buckley2008 model the stock returns as a mixture of two normal random variables, considering several objective functions for the mean and variance of each “regime”. The main difference in our work is that we optimize portfolios directly with respect to the CVaR of a mixture distribution via a convex program.
Our key contributions in this paper are summarized below:
We analyze how to construct portfolios that minimize CVaR under the mixture model. Although CVaR minimization under the normal distribution is straightforward, resulting in a second-order cone program, the case with the mixture is less obvious since the CVaR of a mixture distribution does not have a closed form expression. We analyze how it can be numerically computed and optimized via a convex program. We also propose a closed form, second-order cone representable approximation of CVaR in this case. 2. 2.
We extend the work on the BL approach via inverse optimization to CVaR minimization under both normal and mixture distribution models. In the latter case, we propose a sophisticated approach, which combines our previous two contributions. In particular, our model is governed by the market equilibrium equation, and the parameters of the mixture distribution are treated as investor views. 3. 3.
We present computational results applied to the S&P 500 dataset. Empirically, we observe that, as expected, market-based portfolios typically have higher reward and higher risk than risk minimizing portfolios. However, we show on the same dataset that a certain combination of the market-based and risk minimizing portfolios obtained through a BL-type approach may yield portfolios with similar rewards and smaller risk.
The rest of the paper is organized as follows: In Section 2, we provide a statistical analysis of the S&P 500 dataset and model the stock returns as a mixture of normal random variables. In Section 3, we present our portfolio optimization problem with different risk measures. In Section 4, we propose a new approach to combine CVaR minimizing portfolios with market information to obtain BL-type portfolios. In Section 5, we present our computational experiments on the S&P 500 dataset, and compare market-based, risk minimizing and combination portfolios. Finally, Section 6 concludes our paper with further remarks.
2 Statistical Analysis of Stock Returns
As opposed to the standard Markowitz approach, which does not require any distribution information on the asset returns to construct the portfolios, the VaR and CVaR measures need either explicit forms of the distributions, or the use of sampling based methods. It is not uncommon in the finance literature to model the stock returns as normally distributed random variables. However, stock returns are rarely normally distributed, and typically have heavier tails. Therefore, it is crucial to use a different probabilistic model to capture the heavy tail effect, especially the left tail, which is closely related to the risk of a portfolio when the VaR and CVaR measures are used.
In this section, we use a real dataset, specifically the stocks in Standard & Poor’s (S&P) 500 index over a 30-year time span. Since the statistical evidence suggest that the stock returns do not come from a normal distribution, we propose an alternative probabilistic model, namely, we model the stock returns as a mixture of normal random variables and explain how the parameters of the mixture distribution can be estimated.
2.1 Data Collection and Normality Tests
We first collected historical stock returns and market capitalization information from the Wharton Research Database Services (WRDS). Since working with tens of thousands of different stocks is not appropriate for this study, we focused on stocks in the S&P 500 index. Following \citeAbertsimas2012, we further simplified our analysis by focusing on 11 sectors according to the Global Industrial Classification Standard (GICS). As a consequence of this simplification, we do not need to keep track of the assets that enter or leave the S&P 500, rather we concentrate on the overall performance of each sector as the average performance of its constituents. Using WRDS, we collected the return and market capitalization information for all the stocks that have been in the S&P 500 between January 1987-December 2016, spanning a 30-year period. We then computed the return of a sector for each month as a weighted average of the returns of the companies in S&P 500 in that particular time period, where the weights are taken as the market capitalization of each stock in that sector. This procedure gave us 360 sector return vectors of size 11, denoted by , . We also recorded the percentage market capitalization of each sector in month , denoted by , , .
We use an R package called MVN Korkmaz \BOthers. (\APACyear2014) to formally test the multivariate normality of the sector return vectors. We also test whether the returns of individual sectors are normally distributed. According to our extensive tests, we conclude that there is significant evidence that the neither the sector return vector nor the returns of individual sectors are normally distributed as expected.
2.2 Modeling Returns as Mixture of Normal Random Variables
The fact that the vector of sector returns is unlikely to come from a multivariate normal distribution motivates us to search for an alternate probabilistic model that better explains the randomness of the stock returns. We will try to construct such a model using mixtures of (multivariate) normal random variables.
This choice for our model can be explained from two perspectives. First, we note that the stock returns have typically heavier left tails, which can be considered as the most critical part since it directly relates to the investment risk. This was previously observed by many researchers, including the J.P. Morgan Asset Management group Sheikh \BBA Qiao (\APACyear2010). In this paper, we try to capture this effect by introducing a mixture of random variables. An intuitive explanation for this phenomenon is offered as follows: Under “regular” conditions, the market, in fact, behaves following an approximate normal distribution. However, every once in awhile, a “shock” happens and shifts the mean of the return distribution to the left with possibly higher variance. This can explain the relatively heavier left tails of the empirical distribution. Second, as we will demonstrate below, introducing this more sophisticated probabilistic model greatly improves the fit to the data. However, this better fit comes with a cost of more complicated procedures for data fitting and for portfolio optimization. The data fitting issue will be addressed in the remainder of this section. As for the portfolio optimization procedures under these more complicated distributions, they will be discussed in Section 3.3.
Formally, let us assume that the random return is distributed as a mixture of two multivariate normal random variables, that is, with some probability , returns are normally distributed with mean and covariance matrix , for . In other words, we have
[TABLE]
Note that if , , , , are given, we can compute the expectation and covariance of as follows:
[TABLE]
A direct approach to estimate the parameters of the mixture distribution is to use maximum-likelihood estimation, which can be achieved by the Expectation-Maximization (EM) Algorithm proposed in \citeAdempster. The EM algorithm can be seen as a clustering method in the sense that it partitions a given dataset into a predetermined number of subsets. For our purposes, we call the EM algorithm with 2 clusters and obtain the estimates of the parameters of the mixture distribution. We also gather statistical evidence that the proposed mixture model improves the fit to the S&P 500 dataset through Likelihood Ratio Test.
There are a couple of points to be clarified in the implementation of the EM algorithm. For instance, the likelihood function of a mixture distribution is not concave. Therefore, the EM algorithm may give some estimates that are only locally optimal. One can initialize the algorithm with random starting points to try to overcome this issue. Another important point, which might be more subtle, is the meaningfulness of the estimates: Suppose that and represent the mean return vectors for regular (good) and non-regular (bad) market conditions respectively. We would expect that . Although the algorithm itself does not guarantee such an outcome, we do actually obtain and , which is in accordance with our intuition (see Table 1).
3 Risk Minimizing Portfolio Construction
In this section, we discuss how to obtain risk minimizing portfolios under different probabilistic models and risk measures. We start with standard deviation minimization in Section 3.1, which does not require any distribution information on the returns. Then, we consider CVaR minimization when the sector returns are modeled as multivariate normal distributions in Section 3.2. These two optimization problems are well-known in the financial engineering literature. Our main contribution is presented in Section 3.3, in which we show how CVaR can be minimized when the sector returns are modeled as a mixture of normals.
Throughout the paper, we follow the standard convention in the financial industry in which the risk measures VaR and CVaR are defined with respect to the left tail of the return distributions. For completeness, we now provide the proper definitions of these risk measures and some basic facts.
Definition 1**.**
Let . The -level VaR of a random variable is defined as
[TABLE]
In other words, is the negative of the -quantile of the random variable .
Definition 2**.**
Let . The -level CVaR of a random variable is defined as
[TABLE]
We would like to point out that the random variable in Definitions 1 and 2 represents the return of a portfolio. One can equivalently define VaR and CVaR with respect to the loss of a portfolio as well.
Due to \citeArockafellar2002, can be computed as the optimal value of the following convex minimization problem
[TABLE]
where . We also note that the minimizer of the above optimization problem gives .
The availability of the explicit computation of VaR and CVaR depends on the underlying probability distribution. For instance, for a normal random variable with mean and variance , the -level VaR and CVaR of , , can be computed as
[TABLE]
where and are respectively the probability density function (pdf) and the cumulative distribution function (cdf) of the standard normal distribution. If , we will set for notational convenience.
3.1 Standard Deviation Minimization
Assuming that the covariance matrix is estimated from historical data, we can solve the following problem to minimize the standard deviation of the portfolio return:
[TABLE]
We note that problem (4) can be solved efficiently either as a quadratic program (after squaring the objective function) or as a second-order cone program (SOCP) in a lifted space.
3.2 CVaR Minimization under Normal Distribution
Let us assume that the vector of sector returns, denoted by , is modeled to come from a multivariate normal distribution with mean parameter and covariance matrix , estimated from historical data. Then, we can obtain a portfolio minimizing CVaR by solving
[TABLE]
where the -level CVaR of the return of a portfolio is computed as
[TABLE]
We again note that problem (5) can be formulated as an SOCP in a lifted space.
3.3 CVaR Minimization under Mixture Distribution
Now, let us assume that the vector of sector returns, denoted by , is modeled to come from a mixture of normal distributions with parameters , , , , obtained from the historical data using the technique proposed in Section 2.2. In this case, we would like to obtain a CVaR minimizing portfolio by solving the following optimization problem:
[TABLE]
As CVaR is a convex function Pflug (\APACyear2000), the optimization problem (7) is again a convex program. Since the CVaR of a mixture distribution does not have a closed form expression, we will utilize the expression (3) to obtain an explicit convex program which can be used to the CVaR minimization problem (7). For notational purposes, let us define , , and .
3.3.1 Computing and Optimizing CVaR under Mixture Distribution
We first note that can be computed analytically if is at hand (also derived in \citeABroda2011) through
[TABLE]
Here, and are respectively the pdf and cdf of the normal distribution with mean and variance evaluated at the point .
However, does not have a closed form expression either. Fortunately, one does not need to have the exact value of to solve the problem (7). Adapting the generic definition of CVaR in equation (3) to the special case of mixture of normals, we obtain
[TABLE]
where the optimal solution corresponds to . Finally, the resulting CVaR minimization problem can be stated explicitly as follows:
[TABLE]
3.3.2 Approximating CVaR under Mixture Distribution
In the previous section, we mentioned that can be computed by solving the convex program (9). In this section, we develop an explicit and second-order cone representable over-approximation of the same quantity, for the reasons that will become clearer in Section 4.2.2 when we incorporate the Black-Litterman views into the portfolio construction procedure via inverse optimization. We also provide the approximation guarantee of the proposed approximation together with some empirical evidence of its accuracy when applied to the S&P 500 dataset.
Below, we first provide under and over-approximations of the function , which will be the key in the approximation of the CVaR function.
Proposition 1**.**
Let . Then, .
Proof.
For any , we have
[TABLE]
This implies that , hence the result follows. ∎
We note that Proposition 1 is guaranteed to provide a non-trivial lower bound on for , which is a very loose assumption.
Let us define the set
Proposition 2**.**
Let , , and \theta\in{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}\tilde{\Delta}_{2}}. Then, .
Proof.
The result follows since we have
[TABLE]
∎
We note that by restricting to be positive for each , we eliminate the possibility of obtaining a trivial upper bound on in Proposition 2.
We now give an over-approximation of the function .
Proposition 3**.**
Let . Then, .
Proof.
For notational purposes, let us define . Note that for all due to Proposition 1. Let denote the pdf of the random variable . Then, we have
[TABLE]
where the inequality follows due to the fact that for all and the integrals involved are nonpositive. ∎
We note that the requirement in Proposition 3 can be satisfied for risk averse investors by selecting a small enough . However, this condition can be restrictive in cases when the decision maker is not very risk averse.
We also derive an explicit under-approximation of .
Proposition 4**.**
Let . Then, .
Proof.
Applying the CVaR identity (3) to the mixture random variable at the level , we have
[TABLE]
Here, the first inequality follows from the fact that the minimum of the sum of two functions is at least the sum of the minimum of these functions while the second inequality follows from the definition of and the fact that .
Since the same argument can be repeated for , the statement of the proposition follows. ∎
We now give a guarantee for the approximation proposed in Proposition 3.
Proposition 5**.**
Let . Then, where .
Proof.
The desired result follows from Proposition 4 since
[TABLE]
where by definition. ∎
We remark that Propositions 1–5 can be extended in a straightforward manner to the mixture of random variables, , that are not necessarily normal. In particular, Proposition 1 yields a non-trivial bound when , Proposition 3 remains valid if , and the approximation factor in Proposition 5 satisfies the condition .
We note that the over-approximation in Proposition 3 has an approximation factor of in the worst case. In practical settings related to the application in this paper, the accuracy of this approximation is higher as the contribution to the CVaR of the mixture distribution due to one of the normals (in particular, the one on the “left”) is larger than the other.
We also point out that the approximation guarantee proven in Proposition 5 is valid for any portfolio vector . Since we are primarily interested in the performance comparison of the optimal solutions obtained from the portfolio optimization problems with the exact and approximate CVaR objectives, we designed the following experiment to test the empirical behavior of the proposed approximation: Let be an optimal solution of the exact CVaR minimization problem (10), which is obtained by using the interior point solver IPOPT Wächter \BBA Biegler (\APACyear2006). Also, let be an optimal solution of the approximate CVaR minimization problem
[TABLE]
which is obtained by using the conic interior point solver MOSEK MOSEK (\APACyear2013) after transforming this optimization problem into an explicit SOCP.
We use the 360-month S&P dataset mentioned in Section 2.1 for this illustration. For each month , we estimate the parameters of the mixture distribution using the historical return vectors in the interval , where is a rolling horizon window. We compute the “percentage error” of each of the 180 pairs of solutions as
[TABLE]
Finally, we report the arithmetic and geometric average for three different values, namely 60, 120 and 180, in Table 2. We can see that the approximation is quite accurate, especially when is chosen as 120 or 180.
In Section 4.2.2, we use the approximate CVaR minimization problem (14) in the adaptation of the Black-Litterman approach for the CVaR minimization under the mixture distribution.
4 Incorporating Market Information into Risk Minimizing Portfolios
In the previous section, we presented ways to obtain risk minimizing portfolios under different probabilistic models and risk measures. In this section, our objective is to combine market information with these risk minimizing portfolios, which will be achieved through a BL-type approach. After reviewing the basic construction of the BL approach through Theil’s mixed estimation procedure, we also summarize a modern interpretation via inverse optimization in Section 4.1. Then, we extend the inverse optimization model from MV optimization to CVaR minimization under both normal and mixture distributions in Section 4.2.
4.1 Review of the Black-Litterman Model
The classical BL model is driven by two factors: market equilibrium and investor views. Let us first discuss how market equilibrium is obtained through “reverse optimization”. Consider the MV optimization problem with :
[TABLE]
Assuming that is estimated from the historical data as , is predetermined and the percentage market capitalization vector is given as , we can compute the “implied returns” which induce this market as
[TABLE]
by writing down the optimality conditions for the unconstrained optimization problem (15). This derivation is the basis of the capital asset pricing model (CAPM). In the absence of any other views, an investor should invest proportional to .
The second force that drives the portfolio away from the market equilibrium is the investor views. The BL model incorporates these views in the portfolio allocation via a certain “mixing” procedure. The basic construction is as follows: Suppose that the random return vector is distributed normally with mean and covariance matrix , that is, . However, according to this model, itself is random with
[TABLE]
where is chosen exogenously, and a smaller value implies strong confidence in the market equilibrium. The investor views are expressed with linear equations of the form with certain confidence level quantified by the covariance matrix . To be more precise, the BL model assumes that
[TABLE]
where is a diagonal matrix (so that views are independent) with positive diagonal entries. Again, smaller values indicate stronger confidence in the views.
At this point, we remind the reader the solution of the generalized least squares problem, which is a key to obtain the BL estimates:
Proposition 6**.**
Aitken (\APACyear1936)** Consider the generalized least squares problem , where with . Then,
[TABLE]
minimizes the norm of the error, where .
In order to obtain an estimate for , one can use Theil’s Mixed Estimation Theil \BBA Goldberger (\APACyear1961); Theil (\APACyear1963) procedure, which is just a corollary of Proposition 6. In particular, we rewrite equations (17)–(18) as
[TABLE]
The BL estimate for the mean return vector is obtained by
[TABLE]
Alternative derivations for and can be seen in \citeAhe1999, satchell2000 based on a Bayesian interpretation and in \citeAbertsimas2012 via inverse optimization. Here, we will outline the latter approach since it has motivated the extension presented in the next section. Recall that the optimality condition of the MV problem (15) can be written as . In the inverse optimization problem, given a solution , we search for the parameters that make this solution optimal. Assuming that , is given and , the solution of the inverse problem satisfies the following linear equation in and :
[TABLE]
This is the market equilibrium in the BL model. Similarly, investor views are expressed as linear equations in as
[TABLE]
In this approach, the aim is to solve the system (21)-(22) and simultaneously to obtain estimates for and in a certain least squares sense. In particular, one solves the following norm minimization problem:
[TABLE]
The flexibility of this approach allows us to use different norms, which can lead to tractable conic programs. For instance, if the underlying norm is where and , then the optimal solution of the problem (23) is precisely in (20). Hence, the BL estimate is just a special case of the inverse optimization approach. Although this derivation allows one to specify views on the variance estimate through, for instance, factor models, we will use historical variance estimates directly not to over-complicate our analysis.
4.2 Extension of the Black-Litterman Model to CVaR Minimization Problems
In this section, we extend the BL Model from MV optimization to CVaR minimization problems under both normal and mixture distributions. We use the inverse optimization framework in \citeAbertsimas2012 outlined in the previous section, and also present an equivalent interpretation in terms of Theil’s Mixed Estimation principle.
4.2.1 Normal Distribution
Let us consider the CVaR minimization problem under a normal distribution defined in (5), which can be explicitly stated as
[TABLE]
where is the vector of ones and . Let us associate dual variables and to the equality and inequality constraints in problem (24). Noting that the problem is a convex program, first-order necessary and sufficient conditions are:
[TABLE]
Let us now consider the inverse problem, where we will treat a market portfolio as an optimal primal solution to problem (24), and we will search for the parameters , and dual variables , that satisfy the optimality conditions in (25). At this point, we will assume that the covariance matrix is estimated by the sample covariance matrix . There are two main reasons for this assumption: First, it is generally accepted in the financial engineering literature that the estimation of the covariance matrix is reasonably accurate as opposed to the mean estimation. Second, this assumption allows us to have an inverse problem that is linear in the remaining unknowns. A further simplification can be made by taking into account the fact that each sector is invested a positive amount in the market portfolio since the percentage market capitalization is positive for each sector, i.e., . Hence, we have that . Therefore, the inverse problem is given below, where and are the only remaining unknowns:
[TABLE]
We will call (26) the market equilibrium equation under normal distribution.
In our approach, the investor views are expressed simply as
[TABLE]
where is the sample average estimated from the data. We then solve the following optimization problem to obtain adjusted estimates
[TABLE]
where , with . Note that as goes to zero, we put more emphasis on the market equilibrium and investor views become less important. In the other extreme, as gets larger, then the investor views become dominant and we recover the sample estimate. We would like to point out that our model allows more general investor views than the one used above. For instance, similar to the original BL model, linear equations in terms of the mean return vector given as can be incorporated into our model as well. In this case, the following optimization problem can be solved to find adjusted estimates
[TABLE]
where and the confidence of investor views is expressed as in (18).
Note that problem (28) has a closed form solution. Below, we also provide an equivalent interpretation in terms of Theil’s Mixed Estimation principle.
Proposition 7**.**
The solution of (28) is equivalent to the solution of the following mixed estimation problem:
[TABLE]
4.2.2 Mixture Distribution
Let us now consider the CVaR minimization problem under the mixture distribution defined in (10). Our aim is again to analyze the corresponding inverse optimization problem, which might lead us to the BL extension under the mixture model. Unfortunately, the optimality conditions of this minimization problem yield highly nonlinear expressions in terms of the mean return parameters and . Therefore, it seems very unlikely that we would be able to solve the related inverse optimization problem in closed form. We remark that this has been the main motivation behind the CVaR approximation developed in Section 3.3.2. Due to Proposition 3, we have an explicit expression that over-approximates , whose accuracy is demonstrated theoretically by Proposition 5 and empirically by the experiments summarized in Table 2. Therefore, we will base our construction on the approximate CVaR minimization problem (14). To be more precise, let us consider the following convex program
[TABLE]
where z_{i}{\color[rgb]{0,0,0}\definecolor[named]{pgfstrokecolor}{rgb}{0,0,0}\pgfsys@color@gray@stroke{0}\pgfsys@color@gray@fill{0}(\rho_{i})}:=\frac{\phi\left(\Phi^{-1}(\alpha/\rho_{i})\right)}{\alpha/\rho_{i}}. Let us again associate dual variables and to the equality and inequality constraints, and write down the first-order necessary and sufficient conditions as:
[TABLE]
Using the inverse optimization framework, we will again treat a market portfolio as an optimal primal solution to problem (29), and we will search for the parameters , , , , and dual variables , that satisfy the optimality conditions (30). Since our preliminary experiments have demonstrated that the estimation of ’s is quite stable across different data subsets, we make the assumption that in addition to the covariance matrices ’s, the probabilities ’s are also estimated from the historical data, using the technique developed in Section 2.2. We will denote these estimated probabilities as , . By the same reasoning as above, we have that . Finally, the inverse problem is given below, where , , and are the remaining unknowns:
[TABLE]
We will call (31) the market equilibrium equation under a mixture distribution.
Similarly, the investor views are expressed as
[TABLE]
where ’s are estimated from the historical data. We then solve the following optimization problem to obtain adjusted estimates
[TABLE]
where , with .
We point out a difference between the two models under normal and mixture distributions. In the former case, as approaches zero, we recover the market portfolio. However, in the latter, we do not necessarily recover the market portfolio due to added flexibility in the mixture model. On the other hand, as gets larger, the investor views become more important and we recover the estimates , as before.
We provide an equivalent interpretation of the problem (33) below.
Proposition 8**.**
Solution of (33) is equivalent to the solution of the following mixed estimation problem:
[TABLE]
5 Computational Results
In this section, we present the results of our computational experiments on the S&P 500 dataset. We explain the experimental setting in Section 5.1, including a proper definition of two types of market-based portfolios, our criterion for preferable portfolios and the rolling horizon based back-testing. In Section 5.2, we provide a comparison of these two market portfolios with three other risk minimizing portfolios obtained through the analytical approaches outlined in Section 3. In Section 5.3, we apply the BL-approach proposed in Section 4.2 for CVaR minimization and show that the resulting portfolios may dominate the market-based approach in terms of both expected return and risk measures. Finally, in Section 5.4, we conduct extensive simulations to replicate the results on a synthetic dataset.
5.1 Experimental Setting
Let us start by explaining the details of our experimental setting. We use the 360-month S&P dataset mentioned in Section 2.1. Five portfolios, two market-based and three risk minimizing, are constructed for each month , using only the historical return and market capitalization data in the interval , where is a rolling horizon window. This procedure gives us 180 portfolios for each strategy. For a fixed strategy , these portfolios, denoted by , are evaluated using the return information at , by simply computing . The performance of a portfolio construction strategy is evaluated with respect to average reward, standard deviation and 1% CVaR measures by using the dataset . We also report two risk-adjusted performance measures: the ratio of average reward to standard deviation (also known as the Sharpe ratio) and 1% CVaR.
We now discuss how these five portfolios are obtained. We construct two market-based portfolios for using the market capitalization information :
- •
Last Market Portfolio (): We simply choose .
- •
Average Market Portfolio (): We compute .
We construct three risk minimizing portfolios for using the returns :
- •
Standard Deviation Minimizing Portfolio (): We obtain by solving (4), where is estimated from the data.
- •
CVaR Minimizing Portfolio under Normal Distribution (): We obtain by solving (5) with , where and are estimated from the data.
- •
CVaR Minimizing Portfolio under Mixture Distribution (): We obtain by solving (10) with , where , and are estimated from the data using the EM algorithm.
We use MOSEK MOSEK (\APACyear2013) to solve problems (4) and (5), and IPOPT Wächter \BBA Biegler (\APACyear2006) to solve problem (10).
5.2 Market-Based vs. Risk Minimizing Portfolios
We first compare the performance of two market-based and three risk minimizing portfolios with respect to different values in Figure 1. We have several observations: First and foremost, market-based approaches result in higher reward, higher risk portfolios compared to risk minimizing portfolios. An interesting observation is that performs better with increasing values of in terms of both average return and risk measures, and that it has a higher average return than with typically slightly larger risk (except for CVaR when ). This suggests that historical data can be useful even in market-based portfolio construction. We also observe that most of the risk minimizing portfolios, especially , and , have better risk-adjusted performance compared to the market-based ones.
If we focus on the analytical approaches, we observe a general trend that larger values give rise to portfolios with a significantly higher average return, especially for , while the performances of and are less affected. The associated risks for the , and methods are more or less stable. Finally, we note that the risk-adjusted performance measures typically improve with larger values under the mixture model .
5.3 Combining Risk Minimizing Portfolios with Market Portfolios via the Black-Litterman Approach
In the previous subsection, we had two main observations: (i) The risk minimizing approaches, in fact, do generate lower risk portfolios than market-based approaches, but also with a lower reward. (ii) The performance of all portfolios is better in terms of the average return when is large. In this subsection, we will focus on , and apply the BL approach developed in Section 4.2, which may result in portfolios with higher average return, lower risk than the market-based portfolios.
Note that our previous results with , and are not affected. They will serve as benchmarks in this subsection. We modify the other two portfolio construction methods as follows:
- •
: First, solve problem (28) with respect to a given market capitalization to update the mean estimate , and then solve problem (5) to obtain a BL portfolio.
- •
: First, solve problem (33) with respect to a given market capitalization to update the mean estimates , , and then solve problem (10) to obtain a BL portfolio.
In Figure 2, we choose the last market portfolio as for each . Although not shown in the figure, we verify that matches when . Interestingly, outperforms for in both average return and risk measures. Arguably, the best performing method is , which has higher average return and lower risk than and when and , respectively. We also note the success of risk-minimizing portfolios in terms of both of the risk-adjusted performance measures, especially the approach. A final observation is that when is large enough, we can recover the risk minimizing portfolios as expected.
In Figure 3, we choose the average market portfolio as for each . We again verify that matches when . This time does not perform better than in terms of average return, but its performance is very close for , with a sharp decrease in both risk measures. is again the winner as it performs better than both market-based portfolios in similar ranges mentioned above with even better average returns. It is also interesting to note that the approach outperforms the other portfolios in terms of the risk-adjusted measures for a large range of values considered.
Our general conclusion regarding the experiments in this subsection is that one can benefit from the BL modification the most if the market portfolio is chosen according to the average percentage market capitalization and the value of is chosen small enough, for instance, close to . This typically results in portfolios with similar average return and lower risk compared to market portfolios. Such a strategy may even lead to portfolios that dominate the market-based portfolios when the method is used. Moreover, the risk-minimizing portfolio constructions methods, especially the approach, typically yield portfolios which are better in terms of the risk-adjusted measures.
Finally, we would like to discuss the choice of the rolling horizon window in our results. When we repeated the above experiments with and , we observed that and consistently give portfolios with similar average returns and lower risk compared to the market portfolios for small . We again observe that the risk-minimizing portfolios have better risk-adjusted performance compared to the market-based portfolios. However, it is interesting to note that the methods based on the mixture distribution typically have worse risk-adjusted performance than the ones based on the normal distribution when is selected as 60. This is somewhat expected since the mixture models have a larger number of parameters to be estimated, hence, they may require a reasonably large dataset to have more accurate estimators than the normal models.
5.4 Replicating the Results on a Synthetic Dataset
In the previous subsections, our experiments were based on a single stream of data, namely the monthly S&P 500 dataset over a 30-year period. Now, we will try to replicate these results using a synthetic dataset. In order to carry out a reasonable simulation, we assume that the true distribution for the return vector is a mixture of two normal random variables with the parameters given in Table 1.
5.4.1 True Distribution is Known
Under the assumption that the true distribution is known to the investors, we can obtain an ideal CVaR minimizing portfolio by solving problem (10), or an approximately ideal portfolio under a normal distribution with parameters computed using (2) by solving problem (5). We compare the performance of these portfolios ( and ) with an equally weighted market portfolio, that is, in Table 3 and six other portfolios obtained through the BL approach. By construction, the portfolio has the smallest CVaR value but its expected return is also much smaller than that of the market portfolio. However, when combined with the market information through the BL approach, the portfolios retain a similar expected return while having a significantly reduced risk compared to the market portfolio. For example, the expected return of the portfolio is only 0.66% smaller in relative terms than that of the equally weighted market portfolio whereas the standard deviation and CVaR measures are improved by about 9.1% and 11.6%, respectively. This is also reflected in the risk-adjusted performance measures.
We observe that the approach, which is obtained via a normal approximation, also produces a lower-reward, lower-risk portfolio compared to the market portfolio. Interestingly, its average reward is slightly better than the portfolio with higher risk in terms of the CVaR measure. We also see that the performance of the portfolios with the BL modification is very similar compared to their counterparts, especially when , suggesting that the BL approach may not be very sensitive to the exact distribution.
Because we assume here that the true distribution of the returns is known, it is interesting to observe what happens when we add an expected return constraint of the form to the CVaR minimization problems. In Table 4, we report the performance of the optimal portfolios under this additional constraint with two different values of . These results demonstrate that one can obtain portfolios with higher expected returns and smaller risk than the market portfolio under the assumption that the true distribution is known. We once again observe the similarity of the performances of the and portfolios.
5.4.2 True Distribution is not Known
So far in this subsection, we assumed that the true distribution of the returns is known. Of course this is not the case in reality. We now relax this assumption and design the following experiment: Suppose that we are given 181 random return vectors drawn from the mixture distribution. We use the first 180 of these vectors to estimate the parameters of the true distribution under the mixture and normal models. Based on these estimated parameters we obtain the and portfolios and their BL versions with varying values using the optimization algorithms described earlier. Finally, we evaluate these portfolios together with the equally weighted market portfolio using the last return vector. We repeat this experiment 10000 times and report the statistics in Table 5. We again note that the BL version of the approach yields portfolios with practically the same average return as the market portfolio for all the values of considered with a significantly reduced risk. For example, the average return of the portfolio is only 0.40% lower than that of the market portfolio in relative terms whereas the standard deviation and 1% CVaR measures are improved by about 12.4% and 17.4%, respectively. We note that the reduction in the CVaR measure is even more dramatic when smaller values of are considered. We also point out that the statistics of the and portfolios with the BL modification are almost indistinguishable for while the performance of the portfolio is slightly better than that of the portfolio.
As a final test, we repeat the above experiment with an expected return constraint and report the results in Table 6. When compared to the case without the expected return constraint (Table 5), the portfolio has higher average return and higher risk. Also, its performance is similar to the market portfolio in terms of the average return and it has a smaller risk. Nevertheless, a BL modified portfolio without the expected return constraint, such as , would still be preferable.
On the other hand, the portfolio with an expected return constraint produces counterintuitive results: Its average return is smaller compared to the case without the expected return constraint and the risk is higher. This observation suggests that when the true parameters are not known and an additional expected return constraint is enforced, the approach, which assumes a normal distribution for the return vector, may produce poor results and its behavior can be fundamentally different than that of the approach, which assumes a mixture distribution.
6 Conclusion
In this paper, we addressed the basic question of portfolio construction, combining analytical and market-based approaches. As an important component of the analytical approaches, we proposed to model stock returns as the mixture of two normals. We then explored the possibility of constructing risk minimizing portfolios under different probabilistic models (including the mixture distribution) and risk measures (including Conditional Value-at-Risk). We also proposed a Black-Litterman type approach using an inverse optimization framework to incorporate market information into a portfolio that minimizes the Conditional Value-at-Risk under the mixture distribution. Our computational experiments showed that market portfolios typically yield high-reward, high-risk portfolios, and that risk can be reduced by combining analytical and market-based approaches while achieving similar average returns.
Acknowledgments
We wish to thank Dr. Rob Stubbs for his insightful suggestions on an earlier version of this paper, and two anonymous reviewers for their constructive comments which have helped us to improve our paper. This work was supported in part by NSF grant CMMI1560828 and ONR grant N00014-12-10032.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aitken ( \APA Cyear 1936) \APA Cinsertmetastar aitken 1936 {APA Crefauthors} Aitken, A \BPBI C. \APA Cref Year Month Day 1936. \BBOQ \APA Crefatitle IV. – On Least Squares and Linear Combination of Observations. IV. – On least squares and linear combination of observations. \BBCQ \APA Cjournal Vol Num Pages Proceedings of the Royal Society of Edinburgh 5542–48. \Print Back Refs \Current Bib
- 2Bertsimas \B Others . ( \APA Cyear 2012) \APA Cinsertmetastar bertsimas 2012 {APA Crefauthors} Bertsimas, D., Gupta, V. \BCBL \BBA Paschalidis, I \BPBI C. \APA Cref Year Month Day 2012. \BBOQ \APA Crefatitle Inverse optimization: A new perspective on the Black-Litterman model Inverse optimization: A new perspective on the Black-Litterman model. \BBCQ \APA Cjournal Vol Num Pages Operations Research 6061389–1403. \Print Back Refs \Current Bib
- 3Black \BBA Litterman ( \APA Cyear 1991) \APA Cinsertmetastar black 1991 {APA Crefauthors} Black, F. \BCBT \BBA Litterman, R \BPBI B. \APA Cref Year Month Day 1991. \BBOQ \APA Crefatitle Asset allocation: Combining investor views with market equilibrium Asset allocation: Combining investor views with market equilibrium. \BBCQ \APA Cjournal Vol Num Pages Journal of Fixed Income 127–18. \Print Back Refs \Current Bib
- 4Black \BBA Litterman ( \APA Cyear 1992) \APA Cinsertmetastar black 1992 {APA Crefauthors} Black, F. \BCBT \BBA Litterman, R \BPBI B. \APA Cref Year Month Day 1992. \BBOQ \APA Crefatitle Global portfolio optimization Global portfolio optimization. \BBCQ \APA Cjournal Vol Num Pages Financial Analysts Journal 48528–43. \Print Back Refs \Current Bib
- 5Blattberg \BBA Gonedes ( \APA Cyear 1974) \APA Cinsertmetastar blattberg 1974 {APA Crefauthors} Blattberg, R \BPBI C. \BCBT \BBA Gonedes, N \BPBI J. \APA Cref Year Month Day 1974. \BBOQ \APA Crefatitle A comparison of the stable and student distributions as statistical models for stock prices A comparison of the stable and student distributions as statistical models for stock prices. \BBCQ \APA Cjournal Vol Num Pages The Journal of Business 472244–280. \Print Back Refs \Current Bib
- 6Broda \BBA Paolella ( \APA Cyear 2011) \APA Cinsertmetastar Broda 2011 {APA Crefauthors} Broda, S \BPBI A. \BCBT \BBA Paolella, M \BPBI S. \APA Cref Year Month Day 2011. \BBOQ \APA Crefatitle Expected shortfall for distributions in finance Expected shortfall for distributions in finance. \BBCQ \B In P. Cizek, W \BPBI K. Härdle \BCBL \BBA R. Weron ( \BEDS ), \APA Crefbtitle Statistical Tools for Finance and Insurance Statistical tools for finance and insurance ( \BPGS 57–99).
- 7Buckley \B Others . ( \APA Cyear 2008) \APA Cinsertmetastar buckley 2008 {APA Crefauthors} Buckley, I., Saunders, D. \BCBL \BBA Seco, L. \APA Cref Year Month Day 2008. \BBOQ \APA Crefatitle Portfolio optimization when asset returns have the Gaussian mixture distribution Portfolio optimization when asset returns have the Gaussian mixture distribution. \BBCQ \APA Cjournal Vol Num Pages European Journal of Operational Research 18531434–1461. \Print Back Refs \Current Bib
- 8Ceria \BBA Stubbs ( \APA Cyear 2006) \APA Cinsertmetastar ceria 2006 {APA Crefauthors} Ceria, S. \BCBT \BBA Stubbs, R \BPBI A. \APA Cref Year Month Day 2006. \BBOQ \APA Crefatitle Incorporating estimation errors into portfolio selection: Robust portfolio construction Incorporating estimation errors into portfolio selection: Robust portfolio construction. \BBCQ \B In \APA Crefbtitle Asset Management Asset management ( \BPGS 270–294). \APA Caddress Publisher Springer. \Print Back Re
