Sampling Distributions of Optimal Portfolio Weights and Characteristics in Low and Large Dimensions
Taras Bodnar, Holger Dette, Nestor Parolya, Erik Thors\'en

TL;DR
This paper derives the exact and asymptotic sampling distributions of estimated optimal portfolio weights and their characteristics, providing tools for efficient sampling and high-dimensional analysis in portfolio optimization.
Contribution
It introduces a stochastic representation for the sampling distribution of optimal portfolio weights, enabling efficient sampling and high-dimensional asymptotic normality analysis.
Findings
The asymptotic distribution of portfolio weights is multivariate normal.
The paper provides a consistent estimator for high-dimensional settings.
Simulation results show robustness of the asymptotic approximation.
Abstract
Optimal portfolio selection problems are determined by the (unknown) parameters of the data generating process. If an investor wants to realise the position suggested by the optimal portfolios, he/she needs to estimate the unknown parameters and to account for the parameter uncertainty in the decision process. Most often, the parameters of interest are the population mean vector and the population covariance matrix of the asset return distribution. In this paper, we characterise the exact sampling distribution of the estimated optimal portfolio weights and their characteristics. This is done by deriving their sampling distribution by its stochastic representation. This approach possesses several advantages, {e.g.} (i) it determines the sampling distribution of the estimated optimal portfolio weights by expressions, which could be used to draw samples from this distribution efficiently;…
| Portfolio | Additional quantities | |
|---|---|---|
| GMV | ||
| MV | -target expected return | |
| EU | is the risk-aversion coefficient | |
| T | is the risk-free return | |
| SR | ||
| MVaR | ||
| MCVaR | ||
| MVoR | is the target value-at-risk | |
| MCVoR | is the target conditional value-at-risk |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
**Corrigendum to
”Sampling Distributions of Optimal Portfolio Weights and Characteristics in Small and Large Dimensions”**
Taras Bodnara, Holger Detteb, Nestor Parolyac and Erik Thorséna
a* Department of Mathematics, Stockholm University, Stockholm, Sweden *
b* Department of Mathematics, Ruhr University Bochum, D-44870 Bochum, Germany
c Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands *
We would like to thank Raymond Kan, Nathan Lassance and Xiaolu Wang for drawing our attention to the errors present in the paper “Sampling Distributions of Optimal Portfolio Weights and Characteristics in Small and Large Dimensions”. The correct statements of the theorems are presented below, in which we used the notations from the original paper. For further important stochastic representations of estimated optimal portfolio weights and related asymptotic theory we refer to [48].
Although the proof of Lemma A.1 in [10] is correct, a typographical error is present in its formulation. The corrected version of this lemma is given below:
Lemma A.1**.**
Under the conditions of Theorem 2.1 in [10], the distribution of (\hat{V}_{GMV},\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top} is determined by
- (i)
* is independent of (\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top};* 2. (ii)
; 3. (iii)
\left(\begin{array}[]{c}\hat{\mbox{\boldmath\theta}}\\ \tilde{R}_{GMV}\end{array}\right)\sim t_{k+1}\left(n-p+1,\left(\begin{array}[]{c}\mbox{\boldmath\theta}\\ \breve{R}_{GMV}\end{array}\right),\frac{V_{GMV}}{n-p+1}\breve{\mathbf{G}}\right), with \breve{\mathbf{G}}=\begin{pmatrix}\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}&\mathbf{L}\mathbf{Q}\tilde{\mbox{\boldmath\mu}}\\ \tilde{\mbox{\boldmath\mu}}^{\top}\mathbf{Q}\mathbf{L}^{\top}&\tilde{\mbox{\boldmath\mu}}^{\top}\mathbf{Q}\tilde{\mbox{\boldmath\mu}}\end{pmatrix}=\begin{pmatrix}\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}&\breve{s}\breve{\mbox{\boldmath\eta}}\\ \breve{s}\breve{\mbox{\boldmath\eta}}^{\top}&\breve{s}\end{pmatrix} 4. (iv)
* and \tilde{\mbox{\boldmath\eta}} are conditionally independent given \hat{\mbox{\boldmath\theta}} and * 5. (v)
; 6. (vi)
\tilde{\mbox{\boldmath\eta}}|\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV}\sim t_{k}\left(n-p+3,\breve{\mbox{\boldmath\eta}}+\mathbf{h},\frac{(n-p+3)^{-1}\tilde{\mathbf{F}}}{\breve{s}\left(1+\frac{(\tilde{R}_{GMV}-\breve{R}_{GMV})^{2}}{V_{GMV}\breve{s}}\right)^{2}}\right),* where*
[TABLE]
The new formulation of Lemma A.1 impacts the statements of Theorem 2.1 in [10], which is corrected to
Theorem 2.1**.**
Let be independent and normally distributed with mean vector and covariance matrix , i.e. \mathbf{x}_{i}\sim\mathcal{N}_{p}(\mbox{\boldmath\mu},\mbox{\boldmath\Sigma}) for with . Define \mathbf{M}=(\mathbf{L}^{\top},\tilde{\mbox{\boldmath\mu}},\mathbf{1})^{\top} and assume that . Let be positive definite. Then, a joint stochastic representation of , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}} is given by
- (i)
; 2. (ii)
; 3. (iii)
[TABLE] 4. (iv)
* with*
[TABLE] 5. (v)
[TABLE]
where , , \xi_{3}\sim\chi^{2}_{p-k-1;n\mbox{\boldmath\mu}^{\top}\mbox{\boldmathA}\mbox{\boldmath\mu}}, , , , , and are mutually independent with \mbox{\boldmathA}=\mathbf{Q}-\mathbf{Q}\mathbf{L}^{\top}\left(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}\right)^{-1}\mathbf{L}\mathbf{Q}.
Proof of Theorem 2.1:.
The new formulation of Lemma 6.1 has an influence on the derivation of the stochastic representation of \hat{\mbox{\boldmath\eta}} only, while the remaining parts of the proof are correct. The derivation of the new stochastic representation of \hat{\mbox{\boldmath\eta}} is given by
[TABLE]
where and is independent of and . Moreover, due to Lemma 6.1.i and 6.1.iv we get that , , , , and are mutually independent. ∎
The new statement of Theorem 2.1 leads to the corrected version of Theorem 3.1 given by
Theorem 3.1**.**
Under the conditions of Theorem 2.1 in [10], it holds that
[TABLE]
where the joint stochastic representation of , and is given in (i)-(v) of Theorem 2.1.
The corrected version of Corollary 4.1 is the following one
Corollary 4.1**.**
Under the conditions of Theorem 2.1 and Assumption (A1) in [10], it holds that
[TABLE]
with
[TABLE]
for as where
[TABLE]
Proof of Corollary 4.1:.
Only, the expression of \boldsymbol{\Xi}_{s,\mbox{\boldmath\eta}} is changed. The new formula is obtained from Theorem 4.1 in [10] and it is given by
[TABLE]
∎
The improved statement of Theorem 4.2 is given by
Theorem 4.2**.**
Let be differentiable with first order continuous derivatives. Then, under the conditions of Theorem 2.1 and Assumption (A1) in [10], we get
[TABLE]
for as with
[TABLE]
As a special case of Theorem 4.2 we get the formula of the asymptotic covariance matrix expressed as
[TABLE]
The correct version of Theorem 4.4 is given by
Theorem 4.4**.**
Let \mbox{\boldmath\lambda}=(R_{GMV},V_{GMV},s)^{\top}. Then, under the conditions of Theorems 4.2 and 4.3 in [10], it holds that
- (a)
\sqrt{n-p}\left(\mathbf{L}\hat{\mathbf{w}}_{g;c}-\mathbf{L}\mathbf{w}_{g}\right)\stackrel{{\scriptstyle d}}{{\rightarrow}}N_{k}(\mathbf{0},\mbox{\boldmath\Omega}_{\mathbf{L},g,c})* for as with*
[TABLE] 2. (b)
for as holds
[TABLE]
where
[TABLE]
Finally, a consistent estimator for the covariance matrix of the estimated weights of the EU portfolio is given by:
[TABLE]
where and .
Sampling Distributions of Optimal Portfolio Weights and Characteristics in Small and Large Dimensions
Taras Bodnara, Holger Detteb, Nestor Parolyac and Erik Thorséna
a* Department of Mathematics, Stockholm University, Stockholm, Sweden *
b* Department of Mathematics, Ruhr University Bochum, D-44870 Bochum, Germany
c Delft Institute of Applied Mathematics, Delft University of Technology, Delft, The Netherlands *
Abstract
Optimal portfolio selection problems are determined by the (unknown) parameters of the data generating process. If an investor wants to realise the position suggested by the optimal portfolios, he/she needs to estimate the unknown parameters and to account for the parameter uncertainty in the decision process. Most often, the parameters of interest are the population mean vector and the population covariance matrix of the asset return distribution. In this paper, we characterise the exact sampling distribution of the estimated optimal portfolio weights and their characteristics. This is done by deriving their sampling distribution by its stochastic representation. This approach possesses several advantages, e.g. (i) it determines the sampling distribution of the estimated optimal portfolio weights by expressions, which could be used to draw samples from this distribution efficiently; (ii) the application of the derived stochastic representation provides an easy way to obtain the asymptotic approximation of the sampling distribution. The later property is used to show that the high-dimensional asymptotic distribution of optimal portfolio weights is a multivariate normal and to determine its parameters. Moreover, a consistent estimator of optimal portfolio weights and their characteristics is derived under the high-dimensional settings. Via an extensive simulation study, we investigate the finite-sample performance of the derived asymptotic approximation and study its robustness to the violation of the model assumptions used in the derivation of the theoretical results.
Keywords— sampling distribution, optimal portfolio, parameter uncertainty, stochastic representation, high-dimensional asymptotics.
1 Introduction
The solution to the optimal portfolio selection problems are determined by the parameters of the data generating process. In many cases, the optimal portfolio weights and their characteristics, like the portfolio mean, the portfolio variance, the value-at-risk (VaR), the conditional VaR (CVaR), etc., can be computed by using only the mean vector and the covariance matrix of the asset return distribution. More precisely, these relationships are summarized by the following five quantities:
[TABLE]
where \mbox{\boldmath\mu}=E(\mathbf{x}) and \mbox{\boldmath\Sigma}=Var(\mathbf{x}) are the mean vector and the covariance matrix of the -dimensional asset return vector and
[TABLE]
The five quantities in (1.1) have an interesting financial interpretation. The components of the -dimensional vector define the weights of the global minimum variance (GMV) portfolio, i.e. of the portfolio with the smallest variance, while and are the expected return and the variance of the GMV portfolio. The quantity is the slope parameter of the efficient frontier, the set of all optimal portfolios following Markowitz’s approach. This parameter, together with and , fully determine the location and the shape of the efficient frontier, which is a parabola in the mean-variance space. Finally, the components of the -dimensional vector define the weights of the so-called self-financing portfolio (cf. Korkie and Turtle, [52]), i.e. the sum of its weights is equal to zero, that is .
The five quantities in (1.1) determine the structure of many optimal portfolios, like the GMV portfolio, the mean-variance (MV) portfolio, the expected maximum exponential utility (EU) portfolio, the tangency (T) portfolio, the optimal portfolio that maximizes the Sharpe ratio (SR), the minimum VaR (MVaR) portfolio, and the minimum CVaR (MCVaR) portfolio, maximum value-of-return (MVoR) portfolio, maximum conditional value-of-return (MCVoR) portfolio, among others (see, e.g., Markowitz, [54], Ingersoll, [44], Jobson and Korkie, [47], Alexander and Baptista, [3], Alexander and Baptista, [4], Okhrin and Schmid, [58], Kan and Zhou, [51], Frahm and Memmel, [38], Bodnar et al., [24], Adcock, [1], Woodgate and Siegel, [66], Bodnar et al., 2018b [20], Bodnar et al., 2018a [14], Simaan et al., [63], Bodnar et al., 2019e [19], Bodnar et al., 2019b [12]). On the other hand, the quantities (1.1) cannot be directly used to compute the weights and the characteristics of these portfolios, since both and are unobservable parameters in practice. As a result, an investor determines the optimal portfolios by replacing and in (1.1) with the corresponding sample estimators given by
[TABLE]
given a sample of asset returns . This approach leads to the sample or the so-called plug-in estimators of the optimal portfolios, which are based on the corresponding sample estimators of (1.1), expressed as
[TABLE]
with
[TABLE]
as well as to the sample (plug-in) estimators of the optimal portfolio weights.
The notion of the sampling distribution in portfolio allocation has recently been given large attention. Investors and researchers realize that the uncertainty, introduced by using historical data, needs to be integrated into the optimal portfolio decision process as well as properly assessed. The sampling distribution of the mean-variance portfolio was investigated as early as Jobson and Korkie, [47], Britten-Jones, [26], Okhrin and Schmid, [58], where the distributions of estimated optimal portfolio weights were derived under the assumption of an independent sample of asset returns taken from a multivariate normal distribution. Moreover, both the asymptotic and finite-sample distributions of the estimated efficient frontier, the set of all mean-variance optimal portfolios, were obtained by Jobson, [46], Bodnar and Schmid, [22], Kan and Smith, [49], and Bodnar and Schmid, [23], among others, while Siegel and Woodgate, [62] and Bodnar and Bodnar, [8] presented its improved estimators and proposed a test of its existence. Some of these results were later extended to the high-dimensional setting in Frahm and Memmel, [38], Glombeck, [40], Bodnar et al., 2018b [20], Bodnar et al., 2019e [19], whereas several limiting results related to the estimation of optimal portfolios under high-dimensional settings are present in Ao et al., [5], Kan et al., [50], Cai et al., [29], Ding et al., [34], Bodnar et al., [11], among others.
The sample mean vector and the sample covariance matrix given by (1.3) have been used extensively in previous research (see, e.g., Britten-Jones, [26], Memmel and Kempf, [56], Okhrin and Schmid, [59]) for estimating the asset return vector and its covariance matrix. These estimators appear to be consistent and the corresponding estimated optimal portfolios have desirable asymptotic properties when the portfolio dimension is considerably smaller than the sample size. However, they can no longer be used when a high-dimensional portfolio is constructed, due to their performance when the portfolio dimension is comparable to the sample size. One of the issues lies in that the quantities (1.4) depend on the inverse covariance matrix. Its sample counterpart is not a consistent estimator in the high-dimensional settings (see, e.g., Bodnar et al., [13]). To cope with these limitations, a number of improved estimators have been considered in the literature (cf., Efron and Morris, [35], Jagannathan and Ma, [45], Golosnoy and Okhrin, [41], Frahm and Memmel, [38], DeMiguel et al., [32], Rubio et al., [61], Yao et al., [67]).
We contribute to the existent literature by deriving the joint sampling distribution of the estimated five quantities in (1.4), which solely determine the structure of optimal portfolios. These results are then used to establish a unified approach for characterizing the sampling distributions of the estimated weights and the corresponding estimated characteristics of optimal portfolios. The goal is achieved by presenting the joint distribution of in terms of a very useful stochastic representation. A stochastic representation is a computationally efficient tool in statistics and econometrics to characterize the distribution of a random variable/vector, which is widely used in both conventional and Bayesian statistics. While it plays a special rule in the theory of elliptical distributions (c.f., Gupta et al., [43]), the stochastic representation is also a very popular method to generate random variables/vectors in computational statistics (see, e.g., Givens and Hoeting, [39]). The applications of stochastic representations in the determination of the posterior distributions of estimated optimal portfolios can be found in Bodnar et al., [15] and Bauder et al., [7]. Finally, Zellner and Ando, [68], among others, argued that the direct Monte Carlo approach based on stochastic representations is a computationally efficient method to calculate Bayesian estimation. In the present paper, we employ the derived stochastic representation for in the derivation of their high-dimensional asymptotic distribution, as well as in obtaining the high-dimensional asymptotic distribution of estimated optimal portfolios.
The theoretical results derived in the paper are based on the assumption that the asset returns are independent and normally distributed. Although this assumption is crucial for the derivation, especially of the finite-sample distributions of the estimated portfolio weights and their estimated characteristics, it is not obviously fulfilled in practical applications, especially, when financial data of daily or higher frequency are considered. On the other side, such assumptions are still appropriate for data taken of weekly or lower frequency, especially when the data is obtained from developed financial markets. For that reason, in the numerical part of the paper, we investigate the robustness of the derived high-dimensional asymptotic results to the violation of both normality and independence considering multivariate -distributions and the CCC-GARCH model (see, Bollerslev, [25]). The results of the simulation study indicate the presence of overestimation for the slope parameter of the efficient frontier and underestimation of the variance of the global minimum variance portfolio under the -model, while the asymptotic normality can still be used for the rest of the desired quantities. Moreover, the presence of autocorrelation between squared asset returns has only minor influence on the derived high-dimensional distributions. It means that, when the data follows a heavy tailed distribution, it is not fully clear what happens with the derived asymptotic distributions, and this case should be considered with care. We expect that the asymptotic results will depend on the fourth moments of the noise variables, and we will study this case deeper in the future papers.
The rest of the paper is organized as follows. In Section 2, we derive the finite-sample joint distribution of . This result is then used to establish the sampling distributions of the estimated optimal portfolio weights and their estimated characteristics in Section 3. Section 4 presents the asymptotic distributions of the estimated weights derived under the large-dimensional asymptotics. The results of the finite-sample performance of the asymptotic distributions and the robustness analysis to the distributional assumptions imposed on the data-generating process are investigated in Section 5, while final remarks are given in Section 6. The technical derivations are moved to the appendix.
2 Exact sampling distribution of , , , , and
Throughout the paper we assume that the -dimensional vectors of asset returns are independent and normally distributed with mean vector and covariance matrix , i.e. \mathbf{x}_{i}\sim\mathcal{N}_{p}(\mbox{\boldmath\mu},\mbox{\boldmath\Sigma}) for . While Fama, [36] argued that the distribution of monthly asset returns can be well approximated by the normal distribution, Tu and Zhou, [64] found no significant impact of heavy tails on the performance of optimal portfolios.
The stochastic representation of , \hat{\mbox{\boldmath\theta}}, , , and \hat{\mbox{\boldmath\eta}} is derived in a more general case, namely by considering linear combinations of \hat{\mbox{\boldmath\theta}} and \hat{\mbox{\boldmath\eta}} expressed as
[TABLE]
where is a matrix of constant with and .In Sections 2 and 3 we assume that , while a stronger condition on is imposed in the derivation of the high-dimensional asymptotic results given in Section 4. Here, we assume that remains finite, while the portfolio dimension increases together with the sample size . The matrix represents the investors interest and views of the portfolio weights. If is selected to be the row vector with one on the th element and zero otherwise, for instance , then the investor would be interested in the th weight and its distribution. If the distribution of the th weight is centered around zero, then one can start to question whether or not the true weight is actually zero and in turn if the asset should be present in the portfolio.
In the same manner as above, we define the population counterparts of \hat{\mbox{\boldmath\theta}} and \hat{\mbox{\boldmath\eta}} given by
[TABLE]
Since \hat{\mbox{\boldmath\mu}} and \hat{\mbox{\boldmath\Sigma}} are independently distributed (cf. Rencher, [60]), the conditional distribution of , \hat{\mbox{\boldmath\theta}}^{\top}, , , \hat{\mbox{\boldmath\eta}}^{\top})^{\top} under the condition \hat{\mbox{\boldmath\mu}}=\tilde{\mbox{\boldmath\mu}} is equal to the distribution of (\hat{V}_{GMV},\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top} with
[TABLE]
while their population counterparts we denote by:
[TABLE]
Let the symbol denote the equality in distribution. In Lemma 6.1 we derive the joint distribution of (\hat{V}_{GMV},\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top}, whose result is an intermediate step towards Theorem 2.1. For that reason, Lemma 6.1 is placed in the Appendix. In Theorem 2.1 we present a joint stochastic representation of , \hat{\mbox{\boldmath\theta}}, , , and \hat{\mbox{\boldmath\eta}}, which will be used in the next section to characterize the distribution of portfolio weights on the efficient frontier. Furthermore, we use the notation to indicate that a random vector of size follows a standardized multivariate t distribution with degrees of freedom. Since the multivariate t distribution is not uniquely defined, we state the density to be used. The density and further information can be found in Gupta et al., [43]. If , then we imply that it has density
[TABLE]
where is the gamma function. We omit the subindex in when it is a one dimensional t distribution. The proof of Theorem 2.1 is given in the appendix.
Theorem 2.1**.**
Let be independent and normally distributed with mean vector and covariance matrix , i.e. \mathbf{x}_{i}\sim\mathcal{N}_{p}(\mbox{\boldmath\mu},\mbox{\boldmath\Sigma}) for with . Define \mathbf{M}=(\mathbf{L}^{\top},\tilde{\mbox{\boldmath\mu}},\mathbf{1})^{\top} and assume that . Let be positive definite. Then, a joint stochastic representation of , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}} is given by
- (i)
; 2. (ii)
; 3. (iii)
[TABLE] 4. (iv)
* with*
[TABLE] 5. (v)
[TABLE]
where , , \xi_{3}\sim\chi^{2}_{p-k-1;n\mbox{\boldmath\mu}^{\top}\mbox{\boldmathA}\mbox{\boldmath\mu}}, , , , , and are mutually independent with
[TABLE]
The results of Theorem 2.1 provides a simple way to simulate observations from the distribution of , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}}. To simulate observations from the joint distribution, we only need to simulate random variables from well-known distributions. Moreover, the total dimension of independently simulated variables is equal to , which is considerably small when direct simulation would imply simulating matrices from a Wishart distribution and a -dimensional vector from a normal distribution. To this end, we point out that both the square roots in (iii) and (v) can be computed analytically, which will further facilitate speeding up the simulation study. This observation is based on the following two equalities
[TABLE]
where is a symmetric square root of , and
[TABLE]
where . Hence, it holds that
[TABLE]
and
[TABLE]
In equations (2) and (2.9) the matrix inverse and square roots are functions of population quantities. They only need to be computed once, independently of the length of the generated sample. The same argument cannot be performed when simulations of the joint distribution are obtained through simulating the sample covariance matrix and the sample mean vector directly. Hence, Theorem 2.1 provides an efficient algorithm to generate samples of arbitrary large size from the joint distribution of , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}} relative to simulating \hat{\mbox{\boldmath\Sigma}} and \hat{\mbox{\boldmath\mu}} directly. The findings of Theorem 2.1 also leads to an efficient way of sampling from the sample distribution of the optimal portfolio weights and their estimated characteristics, which will be discussed in detail in the next section.
3 Exact sampling distribution of optimal portfolio weights
The weights of the optimal portfolios that belong to the efficient frontier have the following structure
[TABLE]
with their linear combinations expressed as
[TABLE]
where the function determines a specific type of an optimal portfolio. This function depends on and only through the three quantities , , and , which fully determine the efficient frontier in the mean-variance space. By considering the general form of (3.2), we are able to cover a number of well-known optimal portfolios: the global minimum variance (GMV) portfolio, the mean-variance (MV) portfolio, the expected maximum exponential utility (EU) portfolio, the tangency (T) portfolio, the optimal portfolio that maximizes the Sharpe ratio (SR), the minimum Value-at-Risk (MVaR) portfolio, and the minimum conditional Value-at-Risk (MCVaR) portfolio, the maximum Value-of-Return (MVoR) portfolio, the maximum conditional Value-of-Return (MCVoR) portfolio, among others. The specific choices of for each of these optimal portfolios are provided in Table 1.
Let denote the sample estimator of the optimal portfolio weights given in the general form as in (3.2), which is obtained by plugging the sample mean vector and the sample covariance matrix instead of the unknown population counterparts. The linear combinations of the optimal portfolio weights are estimated by
[TABLE]
By Theorem 2.1 the exact sampling distribution of (3.3) is derived in terms of its stochastic representation. The results are summarized in Theorem 3.1, whose proof follows from Theorem 2.1.
Theorem 3.1**.**
Under the conditions of Theorem 2.1, it holds that
[TABLE]
where the joint stochastic representation of , and is given in (i)-(v) of Theorem 2.1.
From Theorem 3.1 we can derive a number of important results. First, it provides a complete characterization of the sampling distribution of the estimators for the optimal portfolio weights. This distribution can be assessed by drawing samples with independent observations from the derived stochastic representation of a relatively large size and then applying the well-established statistical methods for estimating the distribution function, the density, the moments, etc. Second, the obtained stochastic representation in Theorem 3.1 provides an efficient way for generating samples from the finite-sample distribution of following the discussion provided in Section 2 after Theorem 2.1, which is based on drawing independent realizations from well-known univariate and multivariate distributions. To this end, we note that the two square roots in (3.4) should be computed as given by (2) and (2). Similarly to the prior discussion, using these simplifications the derived stochastic representation can be rewritten to include matrix inverses and square roots of population quantities. Once more, these objects should only be computed once during the whole simulation study. Third, for the chosen values of the population quantities used in the simulation study, we can construct concentration sets of optimal portfolio weights. Fourth, an important probabilistic result about the sampling distribution of follows directly from the derived stochastic representation, namely that the finite-sample distribution of depends on the population mean vector and the population covariance matrix through , , , , , and . To sample from the distribution of we only need to fix these seven quantities. In particular, in the case of a single linear combination, i.e. when , we only have to fix six univariate quantities independently of the dimension of the data-generating process.
In a similar way, we derive statistical inference for the estimated characteristics of optimal portfolio with weights as given by (3.1). The expected return of the optimal portfolio with the weights (3.1) is given by
[TABLE]
while its variance is
[TABLE]
Similarly, the Value-at-Risk (VaR), the Conditional Value-at-Risk (CVaR), the Value-of-Return (VoR) and the Conditional Value-of-Return (CVoR) are computed by
[TABLE]
and by symmetry
[TABLE]
Equation (3.7) and (3.8) describe the VaR and CVaR respectively for a specific portfolio and not a single asset. Since risk measures simply measure the risk of random objects, the interpretation of the VaR remains the same as in the case of the univariate case. The VaR measures the loss at a certain confidence level and specifies a quantile. The CVaR specifies the mean loss when a loss larger than (or equal to) the VaR occurs, a tail-conditional expectation, hence its name Conditional Value-at-Risk. By a change of sign, the same holds for the CVoR and VoR.
Inserting the sample mean vector and the sample covariance matrix in (3.5)-(3.10) instead of the population counterparts, we get the sample estimators of the optimal portfolio characteristics. The application of Theorem 2.1 leads to the statement of their (joint) sampling distribution, which is presented in Theorem 3.2
Theorem 3.2**.**
*Under the conditions of Theorem 2.1, the stochastic representation of the estimated characteristic of optimal portfolio are obtained as in (3.5)-(3.10) where , , and are replaced by their sample counterparts , , and with *
[TABLE]
where , , are mutually independent.
The proof of Theorem 3.2 is given in the appendix. It has to be noted that the joint distribution of all six estimators is completely determined by three mutually independent random variables , , and with the standard marginal univariate distribution. Moreover, it depends on the unknown population mean vector and covariance matrix over three univariate quantities , , and . These uniquely determine the whole efficient frontier in the mean-variance space. To this end, the stochastic representation derived for the estimated optimal portfolio characteristics appears to be simpler than the one obtained in Theorem 3.1 for the corresponding estimator of the optimal portfolio weights. Similarly, the independent realizations from the joint distribution of can be drawn efficiently by employing the results of Theorem 3.2.
Another interesting financial application of the derived theoretical findings of Theorem 3.2 is present in the case of the EU portfolio, whose sample expected return and sample variance possess the following stochastic representations:
[TABLE]
It appears that and are conditionally independent, given the estimated slope parameter of the efficient frontier . In the limit case, when the risk aversion coefficient tends to infinity and the EU portfolio tends towards the vertex of the efficient frontier, the two estimated portfolio characteristics are unconditionally independent. In all other cases, the dependence between them is fully captured by the estimated geometry of the efficient frontier.
4 High-dimensional asymptotic distributions
The derived stochastic representations of Sections 3 and 4 are also very useful in the derivation of the asymptotic distributions of the estimators of optimal portfolio weights and their estimated characteristics. To this end, we note that the same approach can be used independently whether the dimension of the data generating process is assumed to be fixed or allowed to grow together with the sample size . These two regimes have been intensively discussed in statistical literature. The former asymptotic regime, i.e. with fixed , is called the “standard asymptotics” (see, e.g., Le Cam and Yang, [53]). Here, both the sample mean and the sample covariance matrix are proven to be consistent estimators for the corresponding population counterparts. Challenges arise when is comparable to , i.e. both the dimension and the sample size tend to infinity while their ratio tends to a positive constant , the so-called concentration ratio. It is called “large dimensional asymptotics” or “Kolmogorov asymptotics” (c.f., Bühlmann and Van De Geer, [27], Cai and Shen, [28]), while the case corresponds to the standard asymptotics.
Although, a large amount of research has been done on the asymptotic behavior of functionals which only include the sample mean vector or the sample covariance matrix under the high-dimensional asymptotics (see, e.g., Bai and Silverstein, [6], Cai et al., [30], Wang et al., [65], Bodnar et al., [13], Bodnar et al., 2019d [17], Bodnar et al., 2019a [9]), the situation becomes more complicated when both the sample mean vector and the sample (inverse) covariance matrix are present in the expressions. The problem is still unsolved and attracts both researchers and practitioners. In this section, we show how the derived stochastic representations of Sections 2 and 3 can be employed in the derivation of the high-dimensional asymptotic distributions of the estimated optimal portfolios and their characteristics. The main advantage of the suggested approach based on the stochastic representations is the clear separation between the deterministic quantities and the stochastic ones. By using the stochastic representation we can determine the joint asymptotic distributions of the latter.
Throughout this section, we will impose the following technical conditions on the functions involving the population mean vector and the population covariance matrix:
(A1)
There exist and , such that
[TABLE]
uniformly in . Moreover, for a linear combination of optimal portfolio weights determined by the -dimensional vector it holds uniformly in that
[TABLE]
Assumption (A1) ensures that the efficient frontier , , and as well as the components of linear combinations of optimal portfolio weights are all finite numbers in higher dimensions. The financial interpretation shows that even though we have an infinite amount of assets, we should not be able to gain an infinite amount of return for any amount of risk taken, i.e., the slope of the efficient frontier is bounded. It also states that the variance of the GMV portfolio is finite and bounded from zero, which can also be interpreted as investing in a market (regardless of how big it is), should imply some risk, but it can be neither infinite nor zero. Mathematically, it may happen depending on and that some quantities of , , , and tend to infinity as increases. In such cases, one should replace the constants and in (4.1) and (4.2) by and for some . This approach would lead only to minor changes in the expressions of the derived asymptotic covariance matrices in this section, where some terms might disappear (see, e.g., Bodnar et al., 2019c [16] for a similar discussion).
To this end, by an abuse of notations, we use the same notations for the functions involving the population mean vector and the population covariance matrix and their corresponding deterministic limits. For instance, \mbox{\boldmath\mu}^{\top}\mbox{\boldmath\Sigma}^{-1}\mbox{\boldmath\mu} will also be used to denote the limit \lim_{p\to\infty}\mbox{\boldmath\mu}^{\top}\mbox{\boldmath\Sigma}^{-1}\mbox{\boldmath\mu}. The interpretation of the quantities becomes clear from the text where they are used.
4.1 High-dimensional asymptotic distribution of , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}}
Before presenting the high-dimensional asymptotic results for the estimated optimal portfolio weights and their characteristic, we derive the asymptotic stochastic representation for the five quantities , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}}. It is presented in Theorem 4.1 in terms of several independently normally distributed random variables/vectors. Such a presentation allows also to characterize the asymptotic dependence structure , , \hat{\mbox{\boldmath\theta}}, , and \hat{\mbox{\boldmath\eta}} as well as to derive the expression of the asymptotic covariance matrix which is given after Theorem 4.1.
Theorem 4.1**.**
Under the conditions of Theorem 2.1 and Assumption (A1), it holds that
- (i)
, 2. (ii)
, 3. (iii)
\sqrt{n-p}\left(\hat{\mbox{\boldmath\theta}}-\mbox{\boldmath\theta}\right)\stackrel{{\scriptstyle d}}{{\rightarrow}}\sqrt{V_{GMV}}\left(\frac{su_{5}}{\sqrt{s+c}}\mbox{\boldmath\eta}+\left(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}-\frac{s^{2}}{s+c}\mbox{\boldmath\eta}\mbox{\boldmath\eta}^{\top}\right)^{1/2}\mathbf{u}_{6}\right), 4. (iv)
\sqrt{n-p}\left(\hat{s}-\frac{(s+p/n)(1-1/n)}{1-p/n+2/n}\right)\stackrel{{\scriptstyle d}}{{\rightarrow}}\frac{1}{1-c}\Bigg{(}\sqrt{2(1-c)\left(c+2\mbox{\boldmath\mu}^{\top}\mbox{\boldmathA}\mbox{\boldmath\mu}\right)}u_{2}+2s\sqrt{(1-c)}\mbox{\boldmath\eta}^{\top}(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top})^{-1/2}\mathbf{u}_{3}+\sqrt{2}(s+c)u_{7}\Bigg{)}, 5. (v)
\sqrt{n-p}\left(\hat{\mbox{\boldmath\eta}}-\frac{s}{s+p/n}\mbox{\boldmath\eta}\right)\stackrel{{\scriptstyle d}}{{\rightarrow}}\frac{1}{\sqrt{s+c}}\left(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}-\frac{s^{2}}{s+c}\mbox{\boldmath\eta}\mbox{\boldmath\eta}^{\top}\right)^{1/2}\mathbf{u}_{8}+\frac{\sqrt{1-c}}{(s+c)}\Bigg{(}\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}-2\frac{s^{2}}{s+c}\mbox{\boldmath\eta}\mbox{\boldmath\eta}^{\top}\Bigg{)}(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top})^{-1/2}\mathbf{u}_{3}
-\frac{s\sqrt{2(1-c)\left(c+2\mbox{\boldmath\mu}^{\top}\mbox{\boldmathA}\mbox{\boldmath\mu}\right)}u_{2}}{(s+c)^{2}}\mbox{\boldmath\eta}**
for as where are mutually independent, and .
Several interesting results are summarized in the statement of Theorem 4.1, whose proof is given in the appendix. We observe that three quantities related to the estimators of the weights and of the characteristics of the GMV portfolio, the vertex point on the efficient frontier, are asymptotically independent of the estimated slope parameter of the efficient frontier and the self-financing portfolio \hat{\mbox{\boldmath\eta}}. However, these two are not asymptotically independent. This may not be so surprising since it is implicitly present in the estimated weights of the self-financing portfolio from its definition (1.4). Moreover, the sample variance of the GMV portfolio appears to be asymptotically independent of its estimated expected return and the estimator of the weights \hat{\mbox{\boldmath\theta}} following the finite-sample findings of Theorem 2.1. However, it is surprising that the covariance between and is partly determined by the self-financing portfolio due to the deterministic expression close to in the asymptotic stochastic representations of and \sqrt{n-p}\left(\hat{\mbox{\boldmath\theta}}-\mbox{\boldmath\theta}\right). Finally, the direct application of the derived stochastic representations in Theorem 4.1 leads to the expression of the asymptotic covariance matrix as given in Corollary 4.1.
Corollary 4.1**.**
Under the conditions of Theorem 2.1 and Assumption (A1), it holds that
[TABLE]
with
[TABLE]
for as where
[TABLE]
4.2 High-dimensional asymptotic distribution of optimal portfolio weights
The results of Theorem 4.1 are used to derive the high-dimensional asymptotic distribution of linear combinations of the estimated optimal portfolio weights as well as of the corresponding estimated characteristics of the portfolios given in Section 3. Throughout this section we assume that the number of linear combinations is finite.
Let
[TABLE]
where the results of Theorem 4.1 show that
[TABLE]
where for as .
Throughout this section it is assumed that the function is differentiable with first order continuous derivatives and define
[TABLE]
The asymptotic distribution of is given in Theorem 4.2, with the proof presented in the appendix.
Theorem 4.2**.**
Let be differentiable with first order continuous derivatives. Then, under the conditions of Theorem 2.1 and Assumption (A1), we get
[TABLE]
for as with
[TABLE]
The results of Theorem 4.2 are derived for the finite number of linear combinations of the estimated optimal portfolio weights and, consequently, they cannot be used to verify the consistency of the whole vector . Several consistency results about estimated optimal portfolio weights can be found in Ao et al., [5].
In the special case of the EU portfolio we get , , and
[TABLE]
As a result, the asymptotic covariance matrix of is expressed as
[TABLE]
In the same way, the high-dimensional asymptotic distribution of the estimated optimal portfolio characteristics is obtained. Following (3.5)-(3.10), are functions of , , and only. On the other hand, Theorem 4.1 determines the joint high-dimensional asymptotic distribution of , , and expressed as
[TABLE]
for as with
[TABLE]
which shows that are asymptotically independently distributed.
The characteristics in equations (3.5) through (3.10) can be viewed as transformations of . In a similar fashion to Theorem 4.2, we can also construct high-dimensional asymptotic distributions for these. Let denote the -th characteristic of the optimal portfolio with the weights and let h_{g,i}\left(\hat{\mbox{\boldmath\lambda}}\right) denote its corresponding estimate, where is defined in (4.5). In the example of characteristics given by equations (3.5) through (3.10), would range from to . Let the -th first order partial derivative of at be denoted by h_{g,i;j}\left(\mbox{\boldmath\lambda}\right). We get the following result of the high-dimensional distribution of estimated optimal portfolio characteristic, whose proof is obtained from the proof of Theorem 4.2.
Theorem 4.3**.**
Let , , be differentiable with first order continuous derivatives. Then, under the conditions of Theorem 2.1 and Assumption (A1), we get for as
[TABLE]
with where
[TABLE]
4.3 Interval estimation and high-dimensional test theory
The results of Theorems 4.2 and 4.3 indicate that both and h_{g,i}\left(\hat{\mbox{\boldmath\lambda}}\right), , are not consistent estimators for and , , respectively. While the asymptotic bias of the sample estimator of linear combinations of optimal portfolio weights is \left(\frac{s}{s+c}g\left(\mbox{\boldmath\lambda}\right)-g\left(R_{GMV},V_{GMV},s\right)\right)\mbox{\boldmath\eta}, the asymptotic bias in the estimator of the -th portfolio characteristic is h_{g,i}\left(\mbox{\boldmath\lambda}\right)-h_{g,i}\left(R_{GMV},V_{GMV},s\right).
On the other hand, the results of Theorem 4.1 already provide consistent estimators for , , , , and . Namely, they are given by
[TABLE]
Combining these equalities, we derive consistent estimators for and expressed as
[TABLE]
In Theorem 4.4, the asymptotic covariance matrices of the consistent estimators of optimal portfolio weights and their characteristics are present.
Theorem 4.4**.**
Let \mbox{\boldmath\lambda}=(R_{GMV},V_{GMV},s)^{\top}. Then, under the conditions of Theorems 4.2 and 4.3, it holds that
- (a)
\sqrt{n-p}\left(\mathbf{L}\hat{\mathbf{w}}_{g;c}-\mathbf{L}\mathbf{w}_{g}\right)\stackrel{{\scriptstyle d}}{{\rightarrow}}N_{k}(\mathbf{0},\mbox{\boldmath\Omega}_{\mathbf{L},g,c})* for as with*
[TABLE] 2. (b)
for as holds
[TABLE]
where
[TABLE]
Since both \mbox{\boldmath\Omega}_{\mathbf{L},g,c} and depend on unobservable quantities, we have to estimate them consistently under the high-dimensional asymptotic regime when confidence regions for the optimal portfolio weights and for the optimal portfolio characteristics are derived.
Consistent estimators for , , , , and are given in (4.10)-(4.14). Similarly, to construct a consistent estimator for , we have that
[TABLE]
First, and are replaced by their consistent estimators and \hat{\mbox{\boldmath\theta}}_{c}. Second, note that a consistent estimator for \mathbf{l}_{i}^{\top}\mbox{\boldmath\Sigma}^{-1}\mathbf{l}_{j} with deterministic vectors and satisfying Assumption (A1) is given by \left(1-p/n\right)\mathbf{l}_{i}^{\top}\hat{\mbox{\boldmath\Sigma}}^{-1}\mathbf{l}_{j} (c.f., Bodnar et al., 2019e [19, Lemma 5.3]). As a result, is consistently estimated by with given in (1.5) and, hence, , , , , , and with their consistent estimators in (a) and (4.17), we obtain consistent estimators for \mbox{\boldmath\Omega}_{\mathbf{L},g,c} and denoted by \hat{\mbox{\boldmath\Omega}}_{\mathbf{L},g,c} and . For instance, a consistent estimator for the covariance matrix of the estimated weights of the EU portfolio is given by:
[TABLE]
where .
The suggested consistent estimators of \mbox{\boldmath\Omega}_{\mathbf{L},g,c} and are then used to derive asymptotic confidence intervals for the population optimal portfolio weights and their characteristics. In the case of linear combination of the optimal portfolio weights we get
[TABLE]
where denotes the quantile from the -distribution with degrees of freedom.
Finally, using the duality between the interval estimation and the test theory (c.f., Aitchison, [2]), a test on the equality of -linear combination of optimal portfolio weights to a preselected vector can be derived. Namely, one has to reject the null hypothesis in favour to the alternative hypothesis at significance level as soon as does not belong to the confidence interval , as given in (4.19). Similar results are also obtained in the case of optimal portfolio characteristics.
5 Finite-sample performance and robustness analysis
The finite-sample performance of the derived high-dimensional asymptotic approximation of the sampling distribution of the estimated optimal portfolio weights is investigated via an extensive Monte Carlo study in this section. Additionally, we study the robustness of the obtained asymptotic distributions to the violation of the assumption of normality used in their derivation. The following four simulation scenarios will be considered in the simulation study:
Scenario 1:
Multivariate normal distribution
Sample of asset returns are generated independently from N_{p}(\mbox{\boldmath\mu},\mbox{\boldmath\Sigma});
Scenario 2:
CAPM model
Sample of asset returns are generated independently from the CAPM model. That is, we sample observations from where are generated independently from N_{p}(\mbox{\boldmath\mu},\mbox{\boldmath\Sigma}) and are generated independently from a standard normal distribution;
Scenario 3:
Multivariate -distribution
Sample of asset returns are generated independently from multivariate -distribution with degrees of freedom , location parameter , scale matrix \frac{d-2}{d}\mbox{\boldmath\Sigma}. This choice of the scale matrix ensures that the covariance matrix of is ;
Scenario 4:
CCC-GARCH model (c.f., Bollerslev, [25])
The asset returns are assumed to be conditionally normally distributed with \mathbf{x}_{t}\Big{|}\mbox{\boldmath\Sigma}_{t}\sim N_{p}(\mbox{\boldmath\mu},\mbox{\boldmath\Sigma}_{t}). The conditional covariance matrix is specified by \mbox{\boldmath\Sigma}_{t}=\mathbf{D}_{t}^{1/2}\mathbf{C}\mathbf{D}_{t}^{1/2} with and
[TABLE]
where , \mbox{\boldmath\mu}=(\mu_{1},...,\mu_{p})^{\top}, and for .
Scenario 1 corresponds to the assumption used in the derivation of the theoretical results of the paper. Scenario 2 corresponds to a one-factor model of asset returns, which is a popular approach in financial literature. Although, following this model the largest eigenvalue of the covariance matrix \mbox{\boldmath\Sigma}+\beta^{2}\mathbf{1}\mathbf{1}^{\top} is of order (see, e.g., Fan et al., [37]), it still fulfills the assumptions used in the derivation of the theoretical findings of Sections 2 and 3. Scenario 3 violates the key assumption of normality by allowing heavy tails in the distribution of the asset returns. Finally, Scenario 4 is used to investigate the performance of the high-dimensional asymptotic approximation of the sampling distribution of the estimated optimal portfolio weights when the assumption of independence does not hold by introducing non-zero autocorrelation between the squared values of the asset returns.
In order to make the results more flexible, the model parameters are not fixed to some preselected values, but are randomly simulated. In the considered scenarios the components of are generated from , where stands for the uniform distribution on . The eigenvalues of the covariance matrix are fixed, such that 20% of them are equal to 0.2, 40% are equal to 1, and 40% are equal to 5, while its eigenvectors are simulated from the Haar distribution. In Scenario 3 we sample , . The coefficients of CCC-GARCH model in Scenario 4 are generated according to and . By such a construction, we always ensure , i.e., the stationarity condition is fulfilled. Finally, is chosen such that the unconditional covariance matrix of the CCC-GARCH process is equal to . Finally, we put and in all four scenarios.
The results of the simulation study are illustrated in the case of five quantities , \mbox{\boldmath\theta}_{GMV}, , , and , and the first weight of the EU portfolio with and . We will compute the sample and consistent estimators for these quantities in all scenarios, and compare their finite-sample distributions to the corresponding high-dimensional ones derived in Section 4. In Figures 1 to 8, we display QQ-plots for each of the six estimated quantities, both ordinary sample estimator and consistent ones, against the quantiles obtained from their corresponding high-dimensional distribution. On the first row of each figure, we see the estimators for and \hat{\mbox{\boldmath\theta}}, where the sample and consistent estimators coincide. On the second to fifth row, the first column corresponds to the sample estimators of the considered quantities, while the second column presents the results for the consistent estimators.
In the first and second scenarios, we employ the stochastic representations of Theorems 2.1 and 3.1. We sample from the finite-sample distribution of each estimated quantity through independent draws , \hat{\mbox{\boldmath\theta}}_{GMV}^{(b)}, , , \hat{\mbox{\boldmath\eta}}^{(b)}, and for . To this end, we note that the application of Theorems 2.1 and 3.1 provides an efficient way to generate the sample , \hat{\mbox{\boldmath\theta}}_{GMV}^{(b)}, , , \hat{\mbox{\boldmath\eta}}^{(b)}, and , which also avoids the computation of the inverse sample covariance matrix, which might be an ill-defined object in large dimensions, especially when . In Scenarios 3 and 4, the stochastic representations derived in Theorems 2.1 and 3.1 can no longer be used, since the assumptions used for their derivation are no longer fulfilled. In these two cases, we calculate all quantities explicitly for each simulation run. This increases the computational cost immensely, since we need to invert a matrix for each generated sample.
In Figures 1 and 2 the results obtained under Scenario 1 are depicted. We observe in the figures that the high-dimensional asymptotic distributions provide a good approximation for the moderate value of the concentration ratio and its large value . The approximation seems to be worst off in the context of approximating the distribution of when , as the tails become much heavier than the approximation seems to be able to account for. Also, there seems to be some instability in the sample estimator for \hat{\mbox{\boldmath\eta}}, as there is one observation which is very large. We do not see the same behaviour for the consistent estimator. Similar findings are also present in Figures 3 and 4, where we simulated from Scenario 2, that is from the CAPM model. The sample estimator for \hat{\mbox{\boldmath\eta}} seems to have some deviating observations which are no longer observable when the consistent estimator is used.
In Figures 5 and 6 we display the high-dimensional asymptotic approximations of the sampling distributions of the sample and consistent estimators of , , , , , and . The high-dimensional distributions approximate the corresponding sampling distributions reasonably well when the asset returns are assumed to be multivariate -distributed. There are some deviations from normality in the sampling distributions of the estimators for and , regardless of which estimator we use. There are also some observations in the finite-sample distribution of the sample estimator \hat{\mbox{\boldmath\eta}} that might indicate the presence of heavy tails or skewness, while the same behaviour is not seen in its consistent estimator. There is a small bias for two quantities and which could be explained by the influence of heavy tails in the data-generating model on the estimation of the inverse of the high-dimensional covariance matrix. On the other hand, the asymptotic variances seem to be well approximated by the results of Theorems 4.1 and 4.2. All other quantities show a good performance despite the violation of the distributional assumption. We also observe the same type of skewness as in Scenarios 1 and 2 in the case of , when the asset universe becomes large.
In Figures 7 and 8 the results of the simulation are shown in the case of Scenario 4. In contrast to the previous three scenarios, here it is assumed that the square asset returns are autocorrelated, which violates the assumption of independence imposed in the derivation of the theoretical results. The empirical findings of both figures also document here that the high-dimensional asymptotic distributions provide a good approximation of the corresponding finite-sample distributions. The only inconsistency we observe is the same type of heavy tail behaviour in the finite-sample distribution of the sample estimator for which disappear in its consistent counterpart. No significant impact of dependence on the performance of the derived high-dimensional distribution is observable in other plots of Figures 7 and 8.
6 Summary
In this paper we derive the exact sampling distribution of the estimators for a large class of optimal portfolio weights and their estimated characteristics. The results are present in terms of stochastic representations, which provides an easy way to assess the sampling distribution of the estimated optimal portfolio weights. Another important application of the derived stochastic representations is that it presents an efficient way to sample from the corresponding (joint) distribution. The largest computational efficiency comes from the fact that it excludes the inversion of the sample covariance matrix in each simulation run. Furthermore, the derived stochastic representation simplifies the study of the asymptotic properties of the estimated quantities under the high-dimensional asymptotic regime.
In the derivation of the theoretical results we assume that the asset returns are independent and normally distributed. These assumptions seem to be appropriate when asset returns are taken at weekly or lower frequency, while they are not fulfilled for financial data of daily and higher frequency. In our Monte Carlo simulations, we study the finite-sample performance of the obtained asymptotic distributions in comparison to the exact sampling distributions. We investigate the implications of violations to the model assumptions, namely departures from the assumption of normality and independence. We observe a good performance of the asymptotic distributions for finite samples when the data are simulated from the normal distribution. If the assumption of normality is violated, the asymptotic distribution fails to capture some structure in the data. These are biases that appear in the means and variances for a selected few of the estimated quantities. For the rest, the normal approximations still seem to provide a good fit. Assessing the biases in the asymptotic means and in the asymptotic (co)variances of the estimated optimal portfolio weights and their characteristic is an important challenge which will be treated in the consequent paper. Finally, we also find minor impact of the presence of autocorrelation between squared asset returns on the performance of the derived asymptotic distributions.
Acknowledgement
The authors would like to thank Professor Wang Zhou and two anonymous reviewers for their helpful suggestions. We also want to thank M. Stein who typed parts of this manuscript with considerable technical expertise. This research was partly supported by the Swedish Research Council (VR) via the project “Bayesian Analysis of Optimal Portfolios and Their Risk Measures”.
Appendix
In this section, the proofs of the theoretical results are given. In Lemma 6.1 we derive the conditional distribution of (\hat{V}_{GMV},\hat{\mbox{\boldmath\theta}}^{\top},\hat{R}_{GMV},\hat{s},\hat{\mbox{\boldmath\eta}}^{\top})^{\top} under the condition \hat{\mbox{\boldmath\mu}}=\tilde{\mbox{\boldmath\mu}}, i.e. the distribution of (\hat{V}_{GMV},\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top}.
Lemma 6.1**.**
Under the conditions of Theorem 2.1, the distribution of (\hat{V}_{GMV},\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top} is determined by
- (i)
* is independent of (\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV},\tilde{s},\tilde{\mbox{\boldmath\eta}}^{\top})^{\top};* 2. (ii)
; 3. (iii)
\left(\begin{array}[]{c}\hat{\mbox{\boldmath\theta}}\\ \tilde{R}_{GMV}\end{array}\right)\sim t_{k+1}\left(n-p+1,\left(\begin{array}[]{c}\mbox{\boldmath\theta}\\ \breve{R}_{GMV}\end{array}\right),\frac{V_{GMV}}{n-p+1}\breve{\mathbf{G}}\right),
with \breve{\mathbf{G}}=\begin{pmatrix}\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}&\mathbf{L}\mathbf{Q}\tilde{\mbox{\boldmath\mu}}\\ \tilde{\mbox{\boldmath\mu}}^{\top}\mathbf{Q}\mathbf{L}^{\top}&\tilde{\mbox{\boldmath\mu}}^{\top}\mathbf{Q}\tilde{\mbox{\boldmath\mu}}\end{pmatrix}=\begin{pmatrix}\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}&\breve{s}\breve{\mbox{\boldmath\eta}}\\ \breve{s}\breve{\mbox{\boldmath\eta}}^{\top}&\breve{s}\end{pmatrix} 4. (iv)
* and \tilde{\mbox{\boldmath\eta}} are conditionally independent given \hat{\mbox{\boldmath\theta}} and * 5. (v)
; 6. (vi)
\tilde{\mbox{\boldmath\eta}}|\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV}\sim t_{k}\left(n-p+3,\breve{\mbox{\boldmath\eta}}+\mathbf{h},\frac{(n-p+3)^{-1}\tilde{\mathbf{F}}}{\breve{s}\left(1+\frac{(\tilde{R}_{GMV}-\breve{R}_{GMV})^{2}}{V_{GMV}\breve{s}}\right)^{2}}\right),* where*
[TABLE]
Proof of Lemma 6.1:.
Under the assumption of independent and normally distributed sample of the asset returns, we get that
- (a)
\hat{\mbox{\boldmath\mu}}\sim\mathcal{N}_{p}\left(\mbox{\boldmath\mu},\dfrac{1}{n}\mbox{\boldmath\Sigma}\right); 2. (b)
(n-1)\hat{\mbox{\boldmath\Sigma}}\sim\mathcal{W}_{p}(n-1,\mbox{\boldmath\Sigma}) (-dimensional Wishart distribution with degrees of freedom and covariance matrix ); 3. (c)
\hat{\mbox{\boldmath\mu}} and \hat{\mbox{\boldmath\Sigma}} are independent.
As a result, the conditional distribution of a random variable defined as a function of \hat{\mbox{\boldmath\mu}} and \hat{\mbox{\boldmath\Sigma}} given \hat{\mbox{\boldmath\mu}}=\tilde{\mbox{\boldmath\mu}} is equal to the distribution of a random variable defined by the same function where \hat{\mbox{\boldmath\mu}} is replaced by \tilde{\mbox{\boldmath\mu}}.
Let \tilde{\mathbf{M}}=(\mathbf{L}^{\top},\tilde{\mbox{\boldmath\mu}},\mathbf{1})^{\top} and define
[TABLE]
with
[TABLE]
and
[TABLE]
with
[TABLE]
Also, let
[TABLE]
and
[TABLE]
with \tilde{\mathbf{G}}_{22}=\tilde{\mbox{\boldmath\mu}}^{\top}\hat{\mathbf{Q}}\tilde{\mbox{\boldmath\mu}} and \mathbf{G}_{22}=\tilde{\mbox{\boldmath\mu}}^{\top}\mathbf{Q}\tilde{\mbox{\boldmath\mu}}.
In using the definitions of and , we get
[TABLE]
Moreover, from Muirhead, [57, Theorem 3.2.11] we get and, consequently, (see, Gupta and Nagar, [42, Theorem 3.4.1] (-dimensional inverse Wishart distribution with degrees of freedom and parameter matrix ). The application of Theorem 3 in Bodnar and Okhrin, [18] leads to
- (i)
is independent of and and, consequently,
[TABLE] 2. (ii)
We get that . Hence,
[TABLE] 3. (iii)
Let \Gamma_{l}\Big{(}\frac{m}{2}\Big{)}=\pi^{l(l-1)/4}\prod_{i=1}^{l}\Gamma\Big{(}\frac{m-i+1}{2}\Big{)} be the multivariate gamma function. Then, the density of \tilde{\mathbf{H}}_{12}/\tilde{\mathbf{H}}_{22}=\begin{pmatrix}\hat{\mbox{\boldmath\theta}}^{\top}&\tilde{R}_{GMV}\end{pmatrix}^{\top} is given by
[TABLE]
where the last equality is obtained by the use of the Sylvester determinant identity. The density presented in (6.4) corresponds to a -dimensional distribution with degrees of freedom, location parameter \mathbf{H}_{12}/\mathbf{H}_{22}=\begin{pmatrix}\mbox{\boldmath\theta}^{\top}&\breve{R}_{GMV}\end{pmatrix}^{\top} and scale matrix .
In the proof of parts (iv)-(vi) we use the following result (see Theorem 3.f of Bodnar and Okhrin, [18])
[TABLE]
where
[TABLE]
with
Hence,
- (iv)
and \tilde{\mbox{\boldmath\eta}}={\tilde{\mathbf{G}}_{12}}/{\tilde{\mathbf{G}}_{22}} are conditionally independent given \hat{\mbox{\boldmath\theta}}^{\top} and .
- (v)
It holds that (n-1)^{-1}\tilde{\mathbf{G}}_{22}|\hat{\mbox{\boldmath\theta}}^{\top},\tilde{R}_{GMV}\sim\mathcal{W}_{1}^{-1}\left(n-p+4,\tilde{\mathbf{B}}_{22}\right). Hence,
[TABLE]
- (vi)
Finally, similarly to the proof of part (iii), we get
[TABLE]
where
[TABLE]
∎
Proof of Theorem 2.1:.
From Theorem 6.1.ii we get
[TABLE]
where . Moreover, Theorem 6.1.iii implies that \hat{\mbox{\boldmath\theta}} and are jointly multivariate -distributed and, hence, it holds that (see, e.g., Ding, [33]) and
[TABLE]
As a result, we get
[TABLE]
and
[TABLE]
where , are independent and also they are independent of .
Similarly, the application of Theorem 6.1.v leads to
[TABLE]
Finally, the application of Theorem 6.1.vi leads to
[TABLE]
where and is independent of and . Moreover, due to Theorem 6.1.i and 6.1.iv we get that , , , , and are mutually independent.
Next, we derive stochastic representations for the linear and quadratic forms in \hat{\mbox{\boldmath\mu}}, namely of \mathbf{1}^{\top}\mbox{\boldmath\Sigma}^{-1}\hat{\mbox{\boldmath\mu}}, \mathbf{L}\mathbf{Q}\hat{\mbox{\boldmath\mu}} and \hat{\mbox{\boldmath\mu}}^{\top}\mathbf{Q}\hat{\mbox{\boldmath\mu}} which are present in the derived above stochastic representations. Let and \mbox{\boldmathA}=\mathbf{Q}-\mathbf{P}\mathbf{P}^{\top}=\mathbf{Q}-\mathbf{Q}\mathbf{L}^{\top}\left(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top}\right)^{-1}\mathbf{L}\mathbf{Q}. Then
[TABLE]
Moreover, the equality implies \left(\begin{array}[]{c}\mathbf{1}^{\top}\mbox{\boldmath\Sigma}^{-1}\\ \mathbf{P}^{\top}\end{array}\right)\mbox{\boldmath\Sigma}\mbox{\boldmathA}=\left(\begin{array}[]{c}\mathbf{1}^{\top}\mbox{\boldmathA}\\ \mathbf{P}^{\top}\mbox{\boldmath\Sigma}\mbox{\boldmathA}\end{array}\right)=\left(\begin{array}[]{c}\mathbf{0}^{\top}\\ \mathbf{P}^{\top}-\mathbf{P}^{\top}\end{array}\right)=\mathbf{O} and, consequently, we get from Theorem 5.5.1 in Mathai and Provost, [55] that \hat{\mbox{\boldmath\mu}}^{\top}\mbox{\boldmathA}\hat{\mbox{\boldmath\mu}} is independent of \mathbf{1}^{\top}\mbox{\boldmath\Sigma}^{-1}\hat{\mbox{\boldmath\mu}} and \mathbf{P}^{\top}\hat{\mbox{\boldmath\mu}}, while Corollary 5.1.3a in Mathai and Provost, [55] implies that
[TABLE]
Finally, the identity \mathbf{1}^{\top}\mbox{\boldmath\Sigma}^{-1}\mbox{\boldmath\Sigma}\mathbf{P}=\mathbf{0} ensures that \mathbf{1}^{\top}\mbox{\boldmath\Sigma}^{-1}\hat{\mbox{\boldmath\mu}} and \mathbf{P}^{\top}\hat{\mbox{\boldmath\mu}} are independent (c.f., Rencher, [60, Chapter 2.2]) with
[TABLE]
where and are independent. Inserting (6.11) – (6.14) in (6.6) – (6.10) and performing some algebra, we get the statement of the theorem. ∎
Proof of Theorem 3.1:.
The statement of the theorem follows directly from the results of Theorem 2.1. ∎
Proof of Theorem 3.2:.
The mutual independence of , , and follows from Theorem 2.1, while Theorem 2.1.i provides the stochastic representation for .
Next, we derive the joint stochastic representation for and . Let , then the distribution of is obtained as a transformation of with the Jacobian matrix given by
[TABLE]
which implies that .
Let denote the marginal density of the distribution of . Ignoring the normalizing constants, we get the joint density of expressed as
[TABLE]
We now notice that
[TABLE]
where the first factor is the kernel of a normal distribution. Hence,
[TABLE]
where (6.15) determines the conditional distribution of given which is a normal distribution with mean and variance . The expression in (6.16) specifies the marginal distribution of which is the integral representation of the density of the ratio of two independent variables and with and (c.f., Mathai and Provost, [55, Theorem 5.1.3]). Hence, has a noncentral -distribution with and degrees of freedom and noncentrality parameter . ∎
Proof of Theorem 4.1.
If , then it holds that (see, e.g., Bodnar and Reiß, [21, Lemma 3])
[TABLE]
for .
Throughout the proof of the theorem the asymptotic results are derived under the high-dimensional asymptotic regime, that is under as . The applications of Slutsky’s lemma (c.f., DasGupta, [31, Theorem 1.5]) and Theorem 2.1, and the fact that a t-distribution with increasing degrees of freedom tends to the standard normal distribution yield the following results:
- (i)
The application of Theorem 2.1.i and (6.17) with leads to
[TABLE]
where . 2. (ii)
Using (6.17) with and \delta=n\mbox{\boldmath\mu}^{\top}\mbox{\boldmathA}\mbox{\boldmath\mu}, we get
[TABLE]
and, hence, \sqrt{n-p}\left(f-(s+p/n)\right)\stackrel{{\scriptstyle d}}{{\rightarrow}}\sqrt{2(1-c)\left(c+2\mbox{\boldmath\mu}^{\top}\mbox{\boldmathA}\mbox{\boldmath\mu}\right)}u_{2}+2s\sqrt{(1-c)}\mbox{\boldmath\eta}^{\top}(\mathbf{L}\mathbf{Q}\mathbf{L}^{\top})^{-1/2}\mathbf{u}_{3}, where and which are independent of following Theorem 2.1. Furthermore, the application of (6.18) yields
[TABLE]
where , and , , , , independent. 3. (iii)
Furthermore, by the stochastic representation of as given in Theorem 2.1.iii we have that in distribution \sqrt{n-p}\left(\hat{\mbox{\boldmath\theta}}-\mbox{\boldmath\theta}\right)
[TABLE]
where and is independent of , , , , and . 4. (iv)
The application of Theorem 2.1.iv and (6.17) leads to
[TABLE]
where and is independent of , , , , , and . 5. (v)
Similarly, from Theorem 2.1.v we get
[TABLE]
where and , , , , , , , are mutually independent distributed.
∎
Proof of Theorem 4.2:.
The application of Theorem 4.1 and of the continuous mapping theorem (c.f., DasGupta, [31, Theorem 1.14]) leads to \mathbf{L}\hat{\mathbf{w}}_{g}\stackrel{{\scriptstyle a.s.}}{{\rightarrow}}\mbox{\boldmath\theta}+\frac{sg(R_{GMV},(1-c)V_{GMV},(s+c)/(1-c))}{s+c}\mbox{\boldmath\eta} for as .
Let \hat{}\mbox{\boldmath\lambda} and be defined as in (4.5). Then, the first order Taylor series expansion yields
[TABLE]
Hence, from Theorem 4.1 we get
[TABLE]
Using that , , , , , , , are mutually independent and standard (multivariate) normally distributed, the expression of the asymptotic covariance matrix of is obtained. ∎
Proof of Theorem 4.4:.
Using (4.10)-(4.14) together with a first order Taylor expansion we get that
[TABLE]
The rest of the proof of part (a) follows from the proof of Theorem 4.2. Similarly, the statement of part (b) is obtained. ∎
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Adcock, [2015] Adcock, C. (2015). Statistical properties and tests of efficient frontier portfolios. In Quantitative Financial Risk Management , pages 242–269. Wiley Online Library.
- 2Aitchison, [1964] Aitchison, J. (1964). Confidence-region tests. Journal of the Royal Statistical Society: Series B (Methodological) , 26(3):462–476.
- 3Alexander and Baptista, [2002] Alexander, G. J. and Baptista, A. M. (2002). Economic implications of using a mean-var model for portfolio selection: A comparison with mean-variance analysis. Journal of Economic Dynamics and Control , 26(7-8):1159–1193.
- 4Alexander and Baptista, [2004] Alexander, G. J. and Baptista, A. M. (2004). A comparison of var and cvar constraints on portfolio selection with the mean-variance model. Management science , 50(9):1261–1273.
- 5Ao et al., [2019] Ao, M., Yingying, L., and Zheng, X. (2019). Approaching mean-variance efficiency for large portfolios. The Review of Financial Studies , 32(7):2890–2919.
- 6Bai and Silverstein, [2010] Bai, Z. and Silverstein, J. W. (2010). Spectral Analysis of Large Dimensional Random Matrices . Springer, New York.
- 7Bauder et al., [2019] Bauder, D., Bodnar, R., Bodnar, T., and Schmid, W. (2019). Bayesian estimation of the efficient frontier. Scandinavian Journal of Statistics , 46:802–830.
- 8Bodnar and Bodnar, [2010] Bodnar, O. and Bodnar, T. (2010). On the unbiased estimator of the efficient frontier. International Journal of Theoretical and Applied Finance , 13(07):1065–1073.
