Discrete time portfolio optimisation managing value at risk under heavy tail return distribution
Subhojit Biswas, Diganta Mukherjee

TL;DR
This paper develops a discrete-time portfolio optimization model that maximizes expected utility while managing Value at Risk under heavy-tailed return distributions, using Markov Decision Processes and numerical methods.
Contribution
It introduces a novel approach combining Markov Decision Processes with both parametric and non-parametric heavy-tailed distributions for portfolio optimization under VaR constraints.
Findings
Optimal strategies derived for heavy-tailed distributions.
Numerical methods effectively handle non-parametric cases.
Model enhances risk management in portfolio optimization.
Abstract
We consider an investor, whose portfolio consists of a single risky asset and a risk free asset, who wants to maximize his expected utility of the portfolio subject to the Value at Risk assuming a heavy tail distribution of the stock prices return. We use Markov Decision Process and dynamic programming principle to get the optimal strategies and the value function which maximize the expected utility for parametric as well as non parametric distributions. Due to lack of explicit solution in the non parametric case, we use numerical integration for optimization
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Stochastic processes and financial applications · Financial Markets and Investment Strategies
Discrete Time Portfolio Optimization managing Value at Risk
under heavy tail return distribution
Subhojit Biswas
*Indian Statistical Institute, Kolkata
*Diganta Mukherjee
*Sampling and Official Statistics Unit
Indian Statistical Institute, Kolkata
[email protected]* We thank Mrinal K. Ghosh for helpful comments and suggestions which has helped improving the exposition considerably. The usual caveat applies.
Abstract
We consider an investor, whose portfolio consists of a single risky asset and a risk free asset, who wants to maximize his expected utility of the portfolio subject to the Value at Risk assuming a heavy tail distribution of the stock price return. We use Markov Decision Process and dynamic programming principle to get the optimal strategies and the value function which maximize the expected utility for parametric as well as non parametric distributions. Due to lack of explicit solution in the non-parametric case, we use numerical integration for optimization.
Highlights
- •
Used Markov Decision Process and dynamic programming to get recursive relation for optimal strategy of known distribution
- •
Once it approaches the terminal position we will build up more on the risky asset or will liquidate the risky asset
- •
In the non-parametric case, we used numerical integration to find the recursive relation for optimal strategy
- •
In this case the value function we obtained has wide range of fluctuations and the same is also true for the portfolio wealth
Keywords: Portfolio Optimization, Markov Decision Process, Parametric distribution, Non-parametric distribution
AMS Classification: 91G10, 91G80
Biographical Notes: Mr. Subhojit Biswas, completed his M.Tech in Quality, Reliability and Operations Research at Indian Statistical Institute. Previously, he has completed his B.Tech in Instrumentation Engineering from Indian Institute of Technology, Kharagpur. Research interests are Mathematical Finance, Optimization and Applied Mathematics. Worked at Barclays for 3 years before coming for his masters. Currently working at JPMorgan Chase.
Dr. Diganta Mukherjee, Professor at Sampling and Official Statistics Unit of Indian Statistical Institute, Kolkata completed his BStat, MStat, PhD (Economics) all from Indian Statistical Institute, Kolkata. Research interests are Welfare & Development Economics and Finance. Ex-faculty, Jawaharlal Nehru University, Essex University, ICFAI Business School. Has over 80 publications in national and international journals and authored four books. Has been involved in projects with large corporate houses and various ministries of GoI and WBGovt. Acting as technical advisor to MCX, RBI, SEBI, NSSO, NAD (CSO).
1 Introduction
1.1 Background and Motivation
Risk management occurs everywhere in the financial world. It is done when an investor buys low-risk government bonds over riskier corporate bonds, bank performing a credit check on an individual before issuing a personal line of credit, stockbrokers buying assets like options futures in their portfolio and money managers using strategies like portfolio and investment diversification to mitigate or effectively manage risk. Inadequate risk management can result in severe consequences such as the sub prime mortgage meltdown in 2007 that helped trigger the Great Recession stemming from poor risk-management decisions. In the financial world the performance measure of the portfolio associated with risk and portfolio management is actually risk management. A common definition of investment risk is a deviation from an expected outcome, which we can benchmark with the market parameters. The deviation can be positive or negative. How Do Investors Measure Risk? Investors use a variety of tactics to ascertain risk. One of the most commonly used risk metrics, Value at Risk (VaR), is a statistical measure of the riskiness of financial entities or portfolios of assets. It is defined as the maximum dollar amount expected to be lost over a given time horizon, at a pre-defined confidence level. There are also other risk measure metrics used in the market such as Sharpe’s Ratio or Expected Shortfall (ES). As mentioned, our main focus in this paper will be Value at Risk (VaR).
Consider an investor who is interested to optimize the portfolio return while managing VaR. It is well documented that returns on financial assets often exhibit heavy tails, so much so that even the first moment may not exist for the return distribution (see Campbell et al., 1997). In this paper we consider investment strategies that maximize the median return of the portfolio such that VaR is controlled for a given lower quantile level. We incorporate the effect of transaction cost and do the analysis for known and unknown distribution of returns.
1.2 Literature Review
Interest rate risk immunization is one of the key concerns for fixed income portfolio management. In recent years, risk measures (e.g. value-at-risk and conditional value-at-risk)is used as tools for the formation of an optimum investment portfolio has received much attention. The article by Mato(2005) aims to discuss this issue. The paper by Harmantzis et al. (2006) empirically test the performance of different models in measuring VaR and ES in the presence of heavy tails in returns using historical data. Daily returns are modelled with empirical (or historical), Gaussian or Generalized Pareto (peak over threshold (POT) technique of extreme value theory (EVT)) distributions. Assessing financial risk and portfolio optimization using a multivariate market model with returns assumed to follow a multivariate normal tempered stable distribution (i.e. a mixture of the multivariate normal distribution and the tempered stable subordinator) can be seen in Kim et al.(2012). Several authors have considered the optimal portfolio problem under drawdown constraint. The first to comprehensively study this problem over infinite time horizon in a market setting with single risky asset modelled as a geometric Brownian motion with constant volatility (log normal model) was Grossman and Zhou(1993). Dynamic programming was used to solve the maximization problem of long term growth rate of the expected utility of wealth. Cvitanic and Karatzas(1995) streamlined the analysis of Grossman and Zhou(1993) and extended the results to the case when there are multiple risky assets. The pioneering paper by Samuelson(1975) gives an idea of portfolio selection by stochastic dynamic programming. The paper by Aït-Sahalia and Lo(1998) gives an idea of using a non parametric estimator for the State Price Densities implicit in option prices. Mean-VaR portfolio optimization is studied in the paper by Lwin et al.(2017) in a non-parametric setting. In this paper the authors have investigated the portfolio optimization problem with six practical constraints widely used in real life trading scenarios. The paper by Wozaba(2012) gives an idea of VaR constrained Markowitz style portfolio selection problem, here the distribution of the returns of considered assets are given in the form of finitely many scenarios. Numerical analysis is used to solve the problem. When the utility function is quadratic then how to use the Mean-VaR portfolio optimization is described in the paper by Sukono et al.(2017). In the paper by Chow(2014), the authors have devised an algorithm to solve conditional variance problem by Markov Decision Process. The book by Bhatnagar et al.(2013) gives us the idea of how to use stochastic recursive algorithm for optimization problems. The paper by Bauerle and Rieder(2019) considers statistical Markov Decision Processes where the decision maker is risk averse against model ambiguity. The model ambiguity is given by an unknown parameter which influences the transition law and the cost functions. Risk aversion is either measured by the entropy risk measure or by the average Value at Risk. Minimax optimization is used to solve this problem. The paper by Tang(2018) uses the approximate dynamic programming to set up a Markov decision model for the multi-time segment portfolio with transaction cost. The paper by Archibald and Possani(2019) analyses the contract between an entrepreneur and an investor, using a non-zero sum game in which the entrepreneur is interested in company survival and the investor in maximizing expected net present value. This paper looks into a different setup for a finance company. The paper by Li et al.(2018) analyses the data of the CSI 300 Index in the past five years, and uses Monte Carlo simulation and historical simulation to calculate the VaR of the five-year index and test its validity. It combines the result with China’s market economy and puts forward some suggestions on financial risk management in China’s financial market. The main contribution of the paper by Regis and Artes(2016) is to analyze the application of multi-state Markov models to evaluate credit card risk by investigating the characteristics of different state transitions in client-institution relationships over time, thereby generating score models for various purposes. It gives a different direction of application of the Markov Decision Process in the financial market. The paper by Fu(2019) considers the variance optimization problem of average reward in continuous-time Markov decision process (MDP). It assumes that the state space is countable and the action space is a Borel measurable space. The main purpose of the paper is to find the policy with the minimal variance in the deterministic stationary policy space. The paper by Perez et al.(2016) studies a portfolio optimization problem combining a continuous-time jump market and a defaultable security and provides a numerical solutions through the conversion into a Markov decision process. The paper also analyse allocation strategies under several families of utility functions and compares it with previously obtained results.
1.3 Our Contribution
In this article we consider an investor who is worried about when to build up on stocks or liquidate the stock when dealing with heavy tail distribution of the return of the stock prices, while controlling Value at Risk (VaR). The investor’s portfolio has one risky asset and a risk free asset. We consider this problem in discrete time and then apply the general Markov Decision Problem formulation. Markov decision process theory and algorithms for finite horizon problems primarily concerns with determining a policy with the largest expected total reward. We try to determine the policies for two scenarios where the distribution is known and when the distribution is not known. Pareto, Weibull and Inverse Gaussian distributions are used for the parametric scenario and for the non-parametric case we have used the kernel density estimator to fit our data and carry out the numerical integration.
1.4 Organization of the paper
In section 2 we outline our approach for portfolio optimization where we discretized the wealth equation and applied the general Markov Decision Problem formulation. So it has become a dynamic programming problem. We tried to find the optimal strategy considering two scenarios where the distribution is known (section 3) and when the distribution is not known (section 4). The candidate known distributions are Pareto, Weibull and Inverse Gaussian. We also make a comparative goodness of fit discussion among these three and two other distributions commonly used in the literature (mixture normal and variance gamma, as discussed above) in a separate subsection 3.4 of section 3. For non-parametric case, kernel density estimator is used to fit our data and carry out the numerical integration. We interpret the results under different circumstances, with and without transaction cost. Section 5 concludes the article.
2 Methodology
We work with one risky asset and a risk free bank account with return r. Again at a certain point of time we are investing of the total investible wealth in the risky asset and the rest in the bank account (so ). We can express the equation of wealth as
[TABLE]
The heavy tail distribution doesn’t allow us to use the properties of Brownian motion and the Hamilton-Jacobi-Bellman (HJB) equation to solve the stochastic differential equations to obtain the optimal strategy and the value function in continuous time. To obtain a solution in such a situation, we convert this problem into a discrete domain and then apply the general Markov Decision Problem (MDP) formulation. Since we have a fixed time horizon, it is treated as a Finite Horizon Markov Decision Process. Markov decision process theory and algorithms for finite horizon problems is primarily concerned with determining a policy with the largest expected total reward. We seek a policy for which (see Puterman,2005)
[TABLE]
where is the state of nature. Finite-horizon policy evaluation algorithm is used to compute . To start with let denote the total expected reward obtained by using policy at decision epochs . If the history at decision epochs is then is defined as
[TABLE]
where denotes the reward. The difference between and is that includes rewards over the entire future, whereas only incorporates rewards from decision epoch t onward. Here we can use the algorithm from (Puterman,2005) as below. Consider the formulation
[TABLE]
The expected value of policy over periods , when the history of epoch t is , equals the immediate reward received by selecting the action plus the expected reward over the remaining time periods. The second term contains the product of the probability of being in state j at decision epoch if action is used and the median total reward obtained using the policy over periods t, t+1,…, N when the history at epoch t is . Summing over all possible j give the desired expectation expressed in terms of
[TABLE]
where determines the state at time t. The solution of these equations correspond to optimal value functions and they also provide a basis for determining the optimal policies.
[TABLE]
where HR is the randomized history dependent policy. Expanding the R.H.S of the above equation, we get
[TABLE]
In our scenario we have policy which is a mapping between the state action couple. The action has two alternatives - whether to put money in the stock or withdraw money from the stocks - therefore making it finite and at the same time the state is different across time periods, so we can replace the supremum by maximum and our optimization problem takes the following form {maxi}|l| π:s->a,s∈(t, t+1,…T-1){u_t-1^π(h_t-1)} \addConstraintP (L_t-1 > Q_0.05 | L_t > Q_0.05) ≥0.95 ,
which can be further rewritten as, {maxi}|l| π:s->a,s∈(t, t+1,…T-1){W_t-1(s_t-1, a) + ∑_j∈sp_t-1(j|s_t-1, a)u_t^π(h_t-1, a, j)} \addConstraintP (L_t-1 > Q_0.05 | L_t > Q_0.05) ≥0.95 .
We have considered a quantile of 0.05 for VaR in this paper throughout. The constraint equation will help to identify the recursive relation between each optimal strategy at each time instant and this optimal strategy will help to maximize the value function. Basically we are going to use (2) for the value function as mentioned before and the reward is the change in the wealth from one time period to another time period.
[TABLE]
and for each obtained for a period the maximum value of is calculated which in turn helps to maximizes .
[TABLE]
[TABLE]
3 Examples and numerical results for known distribution of return
For the numerical illustration, data used are daily closing price of the stock “Entergy Corporation” in the time range 31st August, 2009 till 30th August, 2013 (Quantopian, 2018). The return is calculated for this data using the following expression
[TABLE]
.
We chose three benchmark distributions for our parametric illustration. The first candidate is the Pareto distribution which is a the canonical example of power-law that is used in description of social, scientific, geophysical, actuarial and other observable phenomena. The focus being to describe the distribution of wealth in a society, fitting the trend that a large portion of wealth is held by a small fraction of the population. We next use the Weibull distribution which is one of the three canonical extreme value distributions (Frechet and Gumbell being the other two. For details refer to Embrechts et. al., 1997). The last candidate is the Inverse Gaussian which is a common candidate for modelling processes with jumps. As financial asset returns are nowadays thought of as having discontinuities (in the form of discrete jumps), the use of Levy process based distributions have gained currency. For details see Kyprianou et. al., 2005. Inverse Gaussian is a popular candidate of this class of distributions.
3.1 Pareto Distribution
For a random variable with a Pareto (Type I) distribution, the probability that X is greater than some number x is given by
[TABLE]
The Pareto Type I distribution is characterized by a scale parameter and a shape parameter , which is known as the tail index. When this distribution is used to model the distribution of wealth, then the parameter is called the Pareto index. We are going to fit a Pareto distribution to the data-set and estimate the scale and shape parameters of the Pareto distribution using the Maximum Likelihood Estimator (MLE). The likelihood function for the Pareto distribution parameters and , given an independent sample , is
[TABLE]
Therefore, the logarithmic likelihood function is
[TABLE]
It can be seen that is monotonically increasing with , that is, the greater the value of , the greater the value of the likelihood function. Hence, since , we conclude that
[TABLE]
To find the estimator for we compute the corresponding partial derivative and determine where it is zero,
[TABLE]
Thus the maximum likelihood estimator for is:
[TABLE]
The expected standard error of the parameters are
[TABLE]
Now calculating the parameters along with the standard error after fitting the returns in Pareto distribution, we calculate the quantile for the distribution. The constraint is,
[TABLE]
The scale and shape parameters calibrated from data are as follows (refer Table 1).
Table 1 should be here
For judging the out of sample performance of our calibration exercise, once the parameters are estimated using 700 return data points, we check whether the next 50 data points are coming from the same distribution. For this we used the one-sample Kolmogorov-Smirnov statistic to test whether the sample comes from the Pareto distribution. The Kolmogorov–Smirnov statistic quantifies a distance between the empirical distribution function of the sample and the reference cumulative distribution function. Assuming the null hypothesis holds and the data follows a Pareto distribution, we use the following test statistic:
The empirical distribution function for n independent and identically distributed (i.i.d.) ordered observations is defined as
[TABLE]
where is the indicator function, equal to 1 if and equal to 0 otherwise.
The Kolmogorov–Smirnov statistic for a given cumulative distribution function F(x) is
[TABLE]
We obtain the test statistic value , which is less than the critical value at 5% level . Since , we cannot reject the null hypothesis and conclude that there is no significant difference between distribution of the data and Pareto distribution (with = 85.34364 and = 10346.37374).
The rate of interest (daily) is taken to be one of three alternative values. These are 0.00008 (equivalent to an annual rate of 2%), 0.00014 (3.5%) and 0.00024 (6%).
Putting the value of the quantile in equation (4) and putting the value of the wealth from (1) in to the same equation we get,
[TABLE]
Taking M = 1 we get
[TABLE]
For simplicity let us denote the upper and lower limits by the following constants,
[TABLE]
Then the constraint becomes
[TABLE]
Solving (5) and putting the values of and we get the recursive relation between the optimal strategies for t to t-1,
[TABLE]
The expression for other values of r are similarly obtained. Plotting (6) for the last 26 days of the historical data we have taken, in R-Studio, we can see how the optimal strategy varies for different values of r. Refer [Figure (1(a))] we note that as the time increases from the initial point the fluctuations in are present but as it reaches the terminal point it increases steeply with time. The value it reaches is maximum when the rate of interest is least and is minimum when the rate of interest r is maximum. This is intuitively reasonable as a lower rate of interest would push the investment in the risky asset higher.
[Figure (1(a))] should be placed here
The change in the optimal strategy may or may not be associated with transaction cost. We have implemented both the scenarios for all the numerical calculations in R-Studio trying to get some interpretation from the graphs.
3.1.1 Without transaction cost
Putting the values of obtained from (6) into (1) and simultaneously calculating the maximum reward for the transition from t to t-1 we calculate the value function defined in (3). If we plot the value function for the last 26 days of the the historical data (Figure (2(a))) we can interpret that as it moves towards terminal position the fluctuations in the value functions increases more and for the least rate of interest the fluctuations are maximum and for maximum rate of interest the fluctuations are least.
[Figure (2(a))] should be placed here
Finally when we plot the portfolio wealth for the last 26 days of the historical data it can be interpreted that the wealth gradually increases and decreases as it approaches the terminal wealth but before attaining the terminal position the value of the portfolio wealth increases and then decreases. For maximum rate of interest the portfolio wealth is maximum and for least rate on interest the portfolio wealth is minimum (Figure (3(a))). This implies that a safer portfolio (when bank rate is higher) is better.
[Figure (3(a))] should be placed here
3.1.2 With transaction cost
When transaction cost is taken into account the wealth equation (1) modifies to
[TABLE]
and using (7) in the optimisation exercise, the recursive optimal policy modifies to
[TABLE]
we can see that now there is a difference of order 2 when transaction cost is taken into account, where (Transaction cost rate for a change in optimal strategy) = 10%.111In this and all the subsequent cases, we have also studied the effect of a change in the transaction cost rate on our variables of interest. As we could detect no discernible difference, we have not reported those results here. Putting the values of obtained from (8) into (7) and simultaneously calculating the maximum reward for the transition from t to t-1 we calculate the value function defined in (3). When the optimal policy is plotted for the last 26 days of the historical data we can see that for transaction cost scenario the rate of interest doesn’t have any significant impact on the optimal strategy (Figure (1(b))). The optimal policy is almost constant when transaction cost is considered and is at a lower level as compared to no transaction cost scenario but when it reaches the terminal position the optimal strategy value increases more than the case of no transaction cost.
[Figure (1(b))] should be placed here
If we plot the value function for the last 26 days of the historical data denoted by red line (Figure (2(b))) we see that the value function in initial position does not change as much as compared to the no transaction cost scenario whereas as it closes towards the terminal position the fluctuations are more in the value function and there is no significant effect due to change in the rate of interest.
[Figure (2(b))] should be placed here
Finally when we plot the portfolio wealth over the last 26 days of the historical data it can be inferred that the initial wealth follows the same pattern as the no transaction cost scenario but as it approaches the terminal position the portfolio wealth decreases and then increases. There is no significant effect due to change in the rate of interest (Figure (3(b))). If we compare this with the case of no transaction cost, the fluctuations are more in case of transaction cost scenario.
[Figure (3(b))] should be placed here
3.2 Weibull Distribution
Weibull distribution is used next for modelling the financial return distribution, for the purpose of forecasting tail risk measures. The Weibull performs favourably for VaR forecasting compared to other distributions. The probability density function of a Weibull random variable is,
[TABLE]
where > 0 is the shape parameter and > 0 is the scale parameter of the distribution. We fit the Weibull distribution to the data-set and estimate the scale and shape parameters of the distribution using Maximum Likelihood Estimation in the following way:
The maximum likelihood estimator for the parameter given is
[TABLE]
The maximum likelihood estimator for is the solution for in the following equation
[TABLE]
This equation defines only implicitly, one must generally solve for by numerical means. When are the N largest observed samples from a data-set of more than N samples, then the maximum likelihood estimator for the parameter given is
[TABLE]
Also given that condition, the maximum likelihood estimator for is
[TABLE]
Again, this being an implicit function, one must generally solve for by numerical means. Once we get the parameters estimated, we calculate the quantile for the distribution. The constraint equation is again similar to (4). The scale and shape parameters are as follows (refer Table 2).
Table 2 should be here
Again, once the parameters are estimated using 700 return data points, we check whether the next 50 data points are coming from the same distribution using the one-sample Kolmogorov-Smirnov test. We obtain the which is less than the . Since , we cannot reject the null hypothesis and conclude that there is no significant difference between the data and data coming from an Weibull distribution (with = 1.2630 and = 0.0104).
The rate of interest (daily) is taken to be one of three alternative values as for Pareto. Putting the value of the quantile in equation (4) and putting the value of wealth from (1) into the same equation we get,
[TABLE]
Taking M = 1,
[TABLE]
For simplicity let us denote the upper and lower limits by the following constants
[TABLE]
Then the constraint becomes
[TABLE]
Solving (8) and putting the values of and we get the recursive relation between the optimal strategies for t to t-1,
[TABLE]
We have assumed the terminal optimal point be 0.00001. Plotting (10) for the last 26 days of the historical data in R-Studio we can see how the optimal policy varies (Figure (4(a))) and we notice that as the time increases from the initial point the change in is also linear with time and is same for all the rates of interest r.
[Figure (4(a))] should be placed here
Again, to check the association between the optimal strategy and transaction cost, we have implemented both the scenarios.
3.2.1 Without transaction cost
Putting the values of obtained from (10) into (1) and simultaneously calculating the maximum reward for the transition from t to t-1 we calculate the value function defined in (3). If we plot the value function for the last 26 days of the historical data (Figure (5(a))) we can interpret that initially as the time period moves the value function is constant but once it is close to terminal position the fluctuation decreases. Overall the value function doesn’t depend on the rate of interest r.
[Figure (5(a))] should be placed here
Finally when we plot the portfolio wealth over the last 26 days of the historical data it is observed that throughout the time period, portfolio wealth fluctuates very less but when the rate of interest r is maximum then the portfolio wealth is also maximum and when the rate of interest is least then the portfolio wealth is minimum (Figure (6(a))).
[Figure (6(a))] should be placed here
3.2.2 With transaction cost
When transaction cost is taken into account the wealth equation (1) modifies to
[TABLE]
and using (11) the recursive optimal policy modifies to
[TABLE]
The recursive equation is of order 2 and (Transaction cost rate for a change in optimal strategy) = 10%. Putting the values of obtained from (12) into (11) and simultaneously calculating the maximum reward for the transition from t to t-1 we calculate the value function defined in (3). When the optimal policy is plotted for the last 26 days of the historical data it is observed that the optimal policy remains constant at a particular value for a period of time but as it approaches the terminal position the asset needs to be liquidated. Here the rate of interest r has a clear effect on the strategy (Figure (4(b))). For different rate of interest the liquidation of the asset is different. In case of no transaction cost the rate of interest had no effect on the optimal strategy.
[Figure (4(b))] should be placed here
If we plot the value function over the last 26 days of the historical data (Figure (5(b))) we can perceive that the value function keeps fluctuating and there is no effect of the rate of interest.
[Figure (5(b))] should be placed here
Finally when we plot the portfolio wealth over the last 26 days of the historical data it can be interpreted that the wealth fluctuates throughout the time period and there is no significant difference while changing the rate of interest (Figure (6(b))). We obtain a different result during the no transaction cost scenario.
[Figure (6(b))] should be placed here
3.3 Inverse Gaussian Distribution
In probability theory, the inverse Gaussian (IG) distribution (also known as the Wald distribution) is a two-parameter family of continuous probability distributions with support on . Its probability density function is given by
[TABLE]
for x > 0, where is the mean and is the shape parameter. The parameters are calculated using Maximum Likelihood Estimator as follows:
[TABLE]
with all known, unknown and all independent has the following likelihood function
[TABLE]
Solving the maximum likelihood equation yields the following maximum likelihood estimates
[TABLE]
and are independent and
[TABLE]
Fitting an Inverse Gaussian distribution to the data-set and estimating the mean and shape parameters of the distribution, we calculate the quantile for the distribution. The constraint equation is again similar to (4) and comparison can be made of the optimal strategy that we have obtained for Pareto and Weibull distribution in the previous sections.
The mean and shape parameters obtained are as follows (refer Table 3).
Table 3 should be here
Once the parameters are estimated using 700 return data points, we check whether the next 50 data points are coming from the same distribution. We obtain the value for the Kolmogorov-Smirnov statistic which is less than the . Since , we cannot reject the null hypothesis and conclude that there is no significant difference between the data and the Inverse Gaussian distribution (with = 0.0097 and = 0.0044).
We again consider three alternative rates of interest. Putting the value of the quantile in equation (4) and putting the value of wealth from (1) in to the same equation we get,
[TABLE]
Taking M = 1,
[TABLE]
[TABLE]
For simplicity let us denote the following terms with these symbols
[TABLE]
Then, the desired condition can be rewritten as
[TABLE]
[TABLE]
We have assumed the terminal optimal strategy to be 0.1. Plotting (13) for the last 26 days of the historical data in R-Studio with the help of non linear equation solver we can note the variation of the optimal policy (Figure (7(a))) and observe that the change in is non-linear with time and is same for all the rates of interest r.
[Figure (7(a))] should be placed here
Again, to check the association between the optimal strategy and transaction cost, we have implemented both the scenarios.
3.3.1 Without transaction cost
Putting the values of obtained from (13) into (1) and simultaneously calculating the maximum reward for the transition from t to t-1, we calculate the value function defined in (3). If we plot the value function for the last 26 days of the historical data (Figure (8(a))) we can interpret that with gradual increase in time the value function fluctuates slowly but when it approaches the terminal position the fluctuation increases. Overall the value function depend slightly on the rate of interest r. When the rate of interest is least then the values obtained by the value function is minimum and for maximum rate of interest the value is maximum.
[Figure (8(a))] should be placed here
Finally when we plot the portfolio wealth over the last 26 days of the historical data it can be inferred that throughout the time period, portfolio wealth fluctuates as well as increases for all the values of r, but when the rate of interest r is maximum then the portfolio wealth is also maximum and when the rate of interest is least then the portfolio wealth is minimum (Figure (9(a))).
[Figure (9(a))] should be placed here
3.3.2 With transaction cost
When transaction cost is taken into account the wealth equation (1) modifies to
[TABLE]
Using (13) and taking the value of (Transaction cost rate for a change in optimal strategy) = 10%, we plot the optimal policy for the last 26 days of the historical data and can observe that the optimal policy increases non-linearly for the entire time period. The change in slope of the curve’s are different for different rates of interest. When the rate of interest is least the change in slope is minimum and when the rate of interest is maximum the change in slope is also maximum. (see Figure (7(b))). The situation is similar to the no transaction cost situation.
[Figure (7(b))] should be placed here
If we plot the value function over the last 26 days of the historical data (Figure (8(b))) we can perceive that the value function has a similar plot when we compare it to the no transaction cost scenario.
[Figure (8(b))] should be placed here
Finally when we plot the portfolio wealth over the last 26 days of the historical data it can be interpreted that the wealth fluctuates in a similar fashion to that of no transaction cost situation (Figure (9(b))).
[Figure (9(b))] should be placed here
3.4 Comparison of Model Accuracy
Before we move on to the discussion of our non-parametric analysis, we close this section with a short discussion of a comparison of modelling success between the three candidates we considered and a few other common candidates discussed in literature. We consider the mixture normal distribution (as in Kim et al., 2012) which can easily accommodate fat tails (with finite moments), excess skewness and Kurtosis. We also consider Variance Gamma to consider returns distribution with jumps (as in Perez et al., 2016). We follow the same strategy for testing out of sample goodness of fit as above (once the parameters are estimated using 700 return data points, we check whether the next 50 data points are coming from the same distribution).
The estimation results for these two distributions are provided below:
Mixture Normal: We consider two components and . Mixing proportion is , standardised distance between the two means is . Finally, BI is the bimodality index (refer Table 4).
Table 4 should be here
The KS Test statistic turns out to be which is less than the critical value of 0.12323. So the distribution fits the data well.
Variance Gamma We use the Nelder- Mead Optimization method to estimate the MLE here (refer Table 5).
Table 5 should be here
Here the KS Test statistic is which is greater than the critical value. So this candidate distribution fails to fit the out of sample data.
We summarise the comparison in the table below (refer Table 6).
Table 6 should be here
Thus we see that the mixture normal performs the best, although it is much more expensive in that it requires the estimation of six parameters compared to only two for all our candidates. Among them, Inverse Gaussian performs best, followed by Weibull. Variance Gamma fares the worst as it fails to describe the data at hand.
4 Example and numerical results for unknown distribution of return
4.1 Without transaction cost
When the distribution function is not known for the return of the stock prices we try to fit a distribution using kernel density estimator (KDE). It is a non parametric way to estimate the probability density function of a random variable. KDE is a fundamental data smoothing problem based on the finite data sample we choose. If we have as the independent univariate samples coming from an unknown distribution f then we can write
[TABLE]
where K is the kernel which is a non-negative function and h > 0 is a smoothing parameter called the bandwidth Markovich(2007). Using the KDE function we get the following estimate of the parameters (refer Table 7).
Table 7 should be here
Again we take three different values for r. Now, (4) can be used again for finding the conditional probability of the portfolio wealth. Expanding we get,
[TABLE]
[TABLE]
[TABLE]
Taking M = 1000 and now solving (15) with the help of standard numerical integration in R-Studio, we get the relationship as depicted in figure (10(a)). We can observe that the change in optimal policy is not so frequent for the last 26 days of the historical data. The investor would keep the asset at a particular state for some time. Also for the least rate of interest the peak value of the fluctuation is minimum and for the maximum rate of interest the peak value of the fluctuation is maximum.
[Figure (10(a))] should be placed here
If we consider a terminal value as 0.001 and plot the value function [Figure (11(a))], it fluctuates but as it approaches the terminal position it decreases and then increases. Changing the terminal value doesn’t create any difference in the behaviour of the value function. Also the rate of interest has little significance in the plot and the significance can be seen only when it is in the state of fluctuation.
[Figure (11(a))] should be placed here
Plotting the portfolio wealth [Figure (12(a))] we observe that wealth fluctuates but as it approaches the terminal position, the wealth increases in the same manner as the value function. Changing the terminal value doesn’t create any difference in the behaviour of the portfolio wealth. Also the change in the rate of interest has very little significance on the plot when the portfolio wealth is constant and during fluctuations it doesn’t have any effect.
[Figure (12(a))] should be placed here
4.2 With transaction cost
Plotting the effect of transaction cost on optimal strategy we can perceive that the fluctuations are similar to the no transaction cost scenario but there is no effect of the interest rate r. (see Figure (10(b))).
[Figure (10(b))] should be placed here
When we are considering transaction cost involved in the process and plot the value function (Figure (11(b))) we observe that it increases and decreases but as it move towards the terminal position the value function becomes constant. There is no effect of the rate of interest r on the fluctuations.
[Figure (11(b))] should be placed here
Similarly when we plot the portfolio wealth (Figure (12(b))) we observe that the portfolio wealth fluctuates but there is no effect of the interest rate r.
[Figure (12(b))] should be placed here
5 Conclusion
We proposed a new investor objective framework to deal with heavy tail distribution of the return of the stock prices and tried to optimize the portfolio based on managing Value at Risk (Var). As we know that moments do not exist for a heavy tailed distribution, quantiles are the only way to tackle the portfolio optimization problem. In our proposed approach for portfolio optimization we discretized the wealth equation and applied the general Markov Decision Problem formulation. So it has become a dynamic programming problem. We optimize considering two alternative scenarios where the distribution is known and when the distribution is not known.
In the known distribution case, we have fitted the Pareto, Weibull and Inverse Gaussian distributions to the return of the stock prices and found out the quantiles; and when the distribution is not known we used kernel density estimator to find out the quantiles. In both the cases we have shown the recursive relation between the optimal policy and maximized the total expected reward which is the value function at each of the time points. The optimal policy tells us when to build up on the risky asset and when to liquidate it. In the case where we have transaction cost, it will not allow the investor to change its policy very frequently and also not by a large margin. Hence, the value function and portfolio wealth are almost constant but as we approach the terminal position we build up more on the risky asset or liquidate it.
In the situation when the density functions are not known, the value function we obtained has wide range of fluctuations and the same is also true for the portfolio wealth. So the investor should come to know when to liquidate on the risky asset and build up on the risky asset. In this case we have used the standard numerical integration method to solve and get the recursive relation for the optimal policy. We have also shown how the transaction cost will affect the policy implementation. Here also the change in the optimal strategy won’t be as frequent when transaction cost is present. The results also align with the general intuition when the rate of interest increases or decreases, considering the presence or absence of transaction cost.
We have also made a comparative discussion of out of sample fit performance among our three candidate distributions and two other distributions commonly considered in the literature, viz. mixture normal and variance gamma. We see that the performance of our candidates are close to the mixture normal with less computational burden (only two parameters to be estimated instead of six). While variance gamma fails to fit the data at all. Thus the distributions functions we considered seem to be quite suitable for empirical work.
As heavy-tailness is quite common in financial returns data, we hope that our proposed methodology will be useful for practitioners. An additional issue that we have not addressed in this paper is how to incorporate multiple risky assets in this type of analysis. In the absence of moments, the challenge would be to replace the covariance function, which will be undefined, with an appropriate measure of association and carry out the portfolio optimization. We aim to take this issue up in our future work.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Campbell et al. [1997] John Y. Campbell, Andrew W. Lo, & A. Craig Mac Kinlay (1997), “The Econometrics of Financial Markets”, Princeton University Press.
- 2Puterman [2005] Martin L.Puterman (2005) Discrete stochastic dynamic programming, 79-82, Wiley.
- 3Mato [2005] Miguel Ángel Martín Mato (2005) Classic and modern measures of risk in fixed-income portfolio optimization, The Journal of Risk Finance, Vol. 6 Issue: 5, 416-423.
- 4Harmantzis et al. [2006] Fotios C. Harmantzis, Linyan Miao & Yifan Chien (2006) Empirical study of value-at-risk and expected shortfall models with heavy tails,The Journal of Risk Finance, Vol. 7 Issue: 2, pp.117-135.
- 5Kim et al. [2012] Young Shin Kim, Rosella Giacometti, Svetlozar T. Rachev, Frank J. Fabozzi & Domenico Mignacca (2012) Measuring financial risk and portfolio optimization with a non-Gaussian multivariate model, Annals of Operations Research, Volume 201, Issue 1, pp 325–343.
- 6Grossman and Zhou [1993] Grossman, S. J. & Zhou, Z. (1993) Optimal investment strategies for controlling drawdowns, Math. Finance 3(3), 241–276.
- 7Cvitanic and Karatzas [1995] Cvitanic, J. & Karatzas, I. (1995) On portfolio optimization under "drawdown" constraints, IMA Vol. Math.its Appl. 65, 35–35.
- 8Samuelson [1975] Paul A.Samuelson (1975) Lifetime portfolio selection by dynamic stochastic programming, Stochastic Optimization Models in Finance, Pages 517-524.
