Location and portfolio selection problems: A unified framework
Justo Puerto, Moises Rodr\'iguez-Madrena, Andrea Scozzari

TL;DR
This paper introduces a unified mixed-integer linear programming framework that integrates asset clustering and portfolio selection, leveraging location problems on networks to improve investment decision-making.
Contribution
It presents a novel integrated approach combining clustering and portfolio optimization in a single model, unlike traditional two-step methods.
Findings
Effective clustering impacts portfolio performance
Unified model improves decision-making efficiency
Preliminary experiments validate approach on real data
Abstract
Given a set of assets and an investment capital, the classical portfolio selection problem consists in determining the amount of capital to be invested in each asset in order to build the most profitable portfolio. The portfolio optimization problem is naturally modeled as a mean-risk bi-criteria optimization problem where the mean rate of return of the portfolio must be maximized whereas a given risk measure must be minimized. Several mathematical programming models and techniques have been presented in the literature in order to efficiently solve the portfolio problem. A relatively recent promising line of research is to exploit clustering information of an assets network in order to develop new portfolio optimization paradigms. In this paper we endow the assets network with a metric based on correlation coefficients between assets' returns, and show how classical location problems on…
| Model | Av-5 () | Sh-5 | Time-5 | Av-10 () | Sh-10 | Time-10 | Av-15 () | Sh-15 | Time-15 | Av-20 () | Sh-20 | Time-20 |
| =0.1 | 2.064 | 0.091 | 0.587 | 1.848 | 0.087 | 0.620 | 1.903 | 0.089 | 0.566 | 2.162 | 0.101 | 0.907 |
| =0.2 | 1.927 | 0.083 | 0.536 | 1.882 | 0.089 | 0.628 | 1.997 | 0.094 | 1.144 | 2.178 | 0.102 | 0.894 |
| =0.3 | 2.088 | 0.091 | 0.718 | 1.820 | 0.086 | 0.597 | 1.892 | 0.090 | 0.834 | 2.163 | 0.101 | 1.540 |
| =0.4 | 2.206 | 0.097 | 0.693 | 1.770 | 0.084 | 0.986 | 2.034 | 0.096 | 1.061 | 2.225 | 0.104 | 1.173 |
| =0.5 | 2.333 | 0.103 | 0.642 | 1.820 | 0.084 | 0.779 | 2.001 | 0.094 | 0.722 | 2.235 | 0.104 | 1.306 |
| =0.6 | 2.338 | 0.104 | 0.630 | 1.913 | 0.088 | 0.741 | 1.963 | 0.092 | 1.083 | 2.166 | 0.101 | 1.179 |
| =0.7 | 2.366 | 0.105 | 0.685 | 1.985 | 0.091 | 0.731 | 2.100 | 0.099 | 0.949 | 2.122 | 0.099 | 0.981 |
| =0.8 | 1.933 | 0.086 | 0.654 | 1.989 | 0.091 | 0.623 | 2.068 | 0.097 | 1.044 | 2.276 | 0.106 | 1.177 |
| =0.9 | 2.285 | 0.099 | 0.667 | 2.006 | 0.091 | 0.680 | 2.060 | 0.097 | 0.945 | 2.157 | 0.100 | 1.670 |
| =1 | 2.179 | 0.088 | 0.082 | 2.321 | 0.105 | 0.130 | 2.098 | 0.097 | 0.191 | 2.379 | 0.108 | 0.139 |
| CVaR-CC | 2.279 | 0.104 | 0.290 | 1.796 | 0.085 | 0.117 | 1.874 | 0.089 | 0.465 | 2.112 | 0.098 | 0.425 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 1.856 | 0.087 | 0.003 | 8.6 | ||||||||
| Index | 1.608 | 0.068 | - | 28 |
| Model | Av-5 () | Sh-5 | Time-5 | Av-10 () | Sh-10 | Time-10 | Av-15 () | Sh-15 | Time-15 | Av-20 () | Sh-20 | Time-20 |
| =0.1 | 2.008 | 0.075 | 1.239 | 1.946 | 0.074 | 1.528 | 1.908 | 0.074 | 1.383 | 2.020 | 0.077 | 2.723 |
| =0.2 | 2.051 | 0.076 | 1.108 | 1.977 | 0.075 | 2.134 | 1.936 | 0.075 | 1.888 | 1.973 | 0.076 | 2.203 |
| =0.3 | 1.979 | 0.073 | 1.361 | 1.974 | 0.074 | 2.336 | 1.971 | 0.076 | 3.167 | 1.993 | 0.076 | 2.720 |
| =0.4 | 1.972 | 0.073 | 1.670 | 1.950 | 0.074 | 1.596 | 1.925 | 0.074 | 1.579 | 1.981 | 0.075 | 2.915 |
| =0.5 | 1.771 | 0.063 | 1.845 | 2.047 | 0.077 | 1.610 | 2.037 | 0.078 | 2.699 | 2.039 | 0.078 | 3.446 |
| =0.6 | 1.874 | 0.069 | 1.977 | 1.910 | 0.072 | 1.582 | 1.944 | 0.074 | 2.512 | 2.054 | 0.079 | 3.067 |
| =0.7 | 1.990 | 0.073 | 2.137 | 2.005 | 0.076 | 2.145 | 2.006 | 0.076 | 3.593 | 2.051 | 0.078 | 4.445 |
| =0.8 | 1.766 | 0.064 | 2.151 | 2.090 | 0.078 | 2.810 | 2.059 | 0.078 | 2.770 | 2.063 | 0.078 | 3.749 |
| =0.9 | 1.714 | 0.062 | 2.068 | 1.979 | 0.074 | 2.134 | 2.045 | 0.076 | 3.350 | 2.111 | 0.080 | 5.329 |
| =1 | 0.868 | 0.030 | 0.274 | 1.967 | 0.065 | 0.241 | 1.803 | 0.067 | 0.153 | 1.887 | 0.070 | 0.285 |
| CVaR-CC | 2.103 | 0.078 | 0.327 | 1.955 | 0.073 | 0.077 | 1.939 | 0.075 | 0.102 | 2.008 | 0.077 | 0.150 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 1.922 | 0.072 | 0.002 | 7.2 | ||||||||
| Index | 0.509 | 0.017 | - | 49 |
| Model | Av-5 () | Sh-5 | Time-5 | Av-10 () | Sh-10 | Time-10 | Av-15 () | Sh-15 | Time-15 | Av-20 () | Sh-20 | Time-20 |
| =0.1 | 2.066 | 0.089 | 17.495 | 1.581 | 0.074 | 6.434 | 1.542 | 0.072 | 2.732 | 1.698 | 0.080 | 2.309 |
| =0.2 | 1.830 | 0.079 | 20.030 | 1.687 | 0.078 | 8.577 | 1.593 | 0.075 | 2.975 | 1.686 | 0.079 | 2.546 |
| =0.3 | 1.627 | 0.071 | 24.037 | 1.596 | 0.075 | 24.238 | 1.540 | 0.072 | 2.658 | 1.653 | 0.078 | 3.724 |
| =0.4 | 1.807 | 0.079 | 29.242 | 1.623 | 0.075 | 14.249 | 1.639 | 0.077 | 2.809 | 1.597 | 0.076 | 2.605 |
| =0.5 | 2.104 | 0.092 | 27.456 | 1.683 | 0.079 | 17.795 | 1.760 | 0.082 | 4.379 | 1.635 | 0.077 | 3.760 |
| =0.6 | 1.921 | 0.083 | 26.529 | 1.870 | 0.085 | 16.946 | 1.698 | 0.079 | 6.225 | 1.699 | 0.080 | 3.393 |
| =0.7 | 1.628 | 0.066 | 22.059 | 1.681 | 0.077 | 9.477 | 1.590 | 0.074 | 6.440 | 1.710 | 0.081 | 2.966 |
| =0.8 | 1.310 | 0.053 | 19.084 | 1.808 | 0.081 | 13.817 | 1.870 | 0.087 | 5.646 | 1.817 | 0.085 | 3.238 |
| =0.9 | 1.499 | 0.059 | 10.887 | 1.716 | 0.077 | 8.265 | 1.785 | 0.082 | 5.811 | 1.845 | 0.086 | 3.873 |
| =1 | 0.925 | 0.026 | 2.271 | 1.480 | 0.057 | 0.873 | 1.754 | 0.069 | 1.261 | 2.113 | 0.095 | 0.791 |
| CVaR-CC | 2.045 | 0.091 | 0.442 | 1.492 | 0.069 | 0.317 | 1.623 | 0.076 | 0.329 | 1.696 | 0.080 | 0.225 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 1.528 | 0.071 | 0.008 | 11.7 | ||||||||
| Index | 0.040 | 0.015 | - | 83 |
| Model | Av-5 () | Sh-5 | GAP-5 | Av-10 () | Sh-10 | GAP-10 | Av-15 () | Sh-15 | GAP-15 | Av-20 () | Sh-20 | GAP-20 |
| =0.1 | 1.970 | 0.080 | 23.634 | 2.452 | 0.103 | 4.371 | 2.266 | 0.098 | 0.302 | 2.165 | 0.095 | 0.058 |
| =0.2 | 1.877 | 0.075 | 24.092 | 2.466 | 0.104 | 4.814 | 2.255 | 0.098 | 0.304 | 2.170 | 0.095 | 0.063 |
| =0.3 | 2.028 | 0.082 | 25.405 | 2.478 | 0.104 | 5.243 | 2.186 | 0.094 | 0.404 | 2.170 | 0.095 | 0.065 |
| =0.4 | 1.881 | 0.075 | 31.787 | 2.402 | 0.103 | 6.853 | 2.174 | 0.094 | 0.540 | 2.181 | 0.095 | 0.058 |
| =0.5 | 1.861 | 0.073 | 34.658 | 2.271 | 0.097 | 8.286 | 2.182 | 0.095 | 0.629 | 2.193 | 0.096 | 0.037 |
| =0.6 | 1.980 | 0.077 | 35.634 | 2.024 | 0.083 | 9.302 | 2.077 | 0.089 | 1.293 | 2.183 | 0.095 | 0.097 |
| =0.7 | 2.386 | 0.098 | 43.565 | 2.182 | 0.089 | 14.402 | 2.109 | 0.090 | 2.527 | 2.216 | 0.096 | 0.088 |
| =0.8 | 2.052 | 0.084 | 48.679 | 1.796 | 0.076 | 22.566 | 2.073 | 0.087 | 4.250 | 2.116 | 0.092 | 0.463 |
| =0.9 | 2.239 | 0.098 | 54.367 | 1.906 | 0.076 | 27.719 | 2.113 | 0.088 | 11.997 | 2.284 | 0.098 | 2.948 |
| =1 | 2.010 | 0.067 | 0 | 1.364 | 0.046 | 0 | 1.784 | 0.067 | 0 | 1.291 | 0.056 | 0 |
| CVaR-CC | 2.224 | 0.094 | 0.059 | 2.515 | 0.108 | 0.059 | 2.187 | 0.095 | 0.044 | 2.130 | 0.093 | 0.007 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 2.168 | 0.094 | 0.033 | 15.3 | ||||||||
| Index | 1.137 | 0.040 | - | 442 |
| Model | Av-5 () | Sh-5 | Time-5 | Av-10 () | Sh-10 | Time-10 | Av-15 () | Sh-15 | Time-15 | Av-20 () | Sh-20 | Time-20 |
| =0.1 | 1.726 | 0.077 | 0.596 | 1.608 | 0.077 | 0.712 | 1.746 | 0.084 | 0.952 | 2.198 | 0.104 | 0.896 |
| =0.2 | 1.851 | 0.083 | 0.540 | 1.616 | 0.077 | 0.681 | 1.766 | 0.085 | 0.940 | 2.145 | 0.100 | 0.937 |
| =0.3 | 1.823 | 0.081 | 0.630 | 1.587 | 0.077 | 0.726 | 1.805 | 0.086 | 1.244 | 2.173 | 0.102 | 1.233 |
| =0.4 | 1.756 | 0.078 | 0.630 | 1.511 | 0.072 | 0.701 | 1.786 | 0.085 | 1.076 | 2.199 | 0.102 | 0.875 |
| =0.5 | 1.691 | 0.074 | 0.686 | 1.529 | 0.072 | 0.798 | 1.867 | 0.088 | 0.989 | 2.200 | 0.103 | 1.399 |
| =0.6 | 1.765 | 0.077 | 0.697 | 1.735 | 0.082 | 0.857 | 1.879 | 0.090 | 0.763 | 2.239 | 0.104 | 1.233 |
| =0.7 | 1.595 | 0.070 | 0.666 | 1.664 | 0.079 | 0.768 | 1.869 | 0.088 | 0.942 | 2.193 | 0.102 | 1.495 |
| =0.8 | 1.625 | 0.071 | 0.710 | 1.629 | 0.077 | 0.791 | 1.987 | 0.094 | 1.066 | 2.356 | 0.110 | 1.189 |
| =0.9 | 1.855 | 0.079 | 0.683 | 1.644 | 0.076 | 0.828 | 1.877 | 0.089 | 1.089 | 2.135 | 0.099 | 1.302 |
| =1 | 2.103 | 0.088 | 0.101 | 2.127 | 0.098 | 0.154 | 2.158 | 0.101 | 0.203 | 2.236 | 0.103 | 0.118 |
| CVaR-CC | 1.530 | 0.070 | 0.276 | 1.499 | 0.071 | 0.137 | 1.752 | 0.083 | 0.300 | 2.088 | 0.098 | 0.371 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 1.518 | 0.071 | 0.003 | 8.7 | ||||||||
| Index | 1.608 | 0.068 | - | 28 |
| Model | Av-5 () | Sh-5 | Time-5 | Av-10 () | Sh-10 | Time-10 | Av-15 () | Sh-15 | Time-15 | Av-20 () | Sh-20 | Time-20 |
| =0.1 | 1.975 | 0.081 | 1.314 | 1.816 | 0.075 | 1.418 | 1.851 | 0.077 | 2.031 | 1.928 | 0.080 | 2.022 |
| =0.2 | 2.004 | 0.084 | 1.490 | 1.808 | 0.075 | 1.297 | 1.844 | 0.077 | 1.529 | 1.971 | 0.082 | 2.510 |
| =0.3 | 2.102 | 0.086 | 1.675 | 1.821 | 0.075 | 1.712 | 1.917 | 0.080 | 2.247 | 1.941 | 0.080 | 2.066 |
| =0.4 | 2.117 | 0.087 | 1.652 | 1.760 | 0.073 | 1.655 | 1.899 | 0.079 | 2.105 | 1.929 | 0.080 | 2.654 |
| =0.5 | 2.224 | 0.092 | 1.736 | 1.779 | 0.073 | 1.601 | 1.887 | 0.078 | 2.295 | 2.045 | 0.085 | 3.068 |
| =0.6 | 2.061 | 0.085 | 1.930 | 1.866 | 0.077 | 1.792 | 1.896 | 0.078 | 2.758 | 2.074 | 0.086 | 2.693 |
| =0.7 | 1.921 | 0.080 | 2.065 | 1.794 | 0.074 | 2.407 | 1.951 | 0.080 | 3.260 | 2.060 | 0.085 | 3.665 |
| =0.8 | 2.069 | 0.084 | 1.930 | 1.931 | 0.079 | 2.414 | 2.021 | 0.083 | 3.295 | 2.061 | 0.085 | 4.081 |
| =0.9 | 1.826 | 0.072 | 1.933 | 1.983 | 0.082 | 2.320 | 2.106 | 0.086 | 4.195 | 2.061 | 0.084 | 4.184 |
| =1 | 2.228 | 0.077 | 0.368 | 2.207 | 0.090 | 0.272 | 1.993 | 0.079 | 0.211 | 2.069 | 0.083 | 0.216 |
| CVaR-CC | 1.863 | 0.076 | 0.162 | 1.796 | 0.074 | 0.104 | 1.804 | 0.075 | 0.148 | 1.964 | 0.082 | 0.202 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 1.745 | 0.071 | 0.003 | 6.9 | ||||||||
| Index | 0.712 | 0.024 | - | 49 |
| Model | Av-5 () | Sh-5 | Time-5 | Av-10 () | Sh-10 | Time-10 | Av-15 () | Sh-15 | Time-15 | Av-20 () | Sh-20 | Time-20 |
| =0.1 | 2.771 | 0.123 | 13.531 | 2.180 | 0.103 | 4.895 | 2.275 | 0.109 | 2.637 | 2.207 | 0.108 | 3.033 |
| =0.2 | 2.670 | 0.119 | 19.999 | 2.322 | 0.110 | 5.695 | 2.325 | 0.111 | 2.486 | 2.257 | 0.109 | 3.287 |
| =0.3 | 2.290 | 0.099 | 20.967 | 2.138 | 0.101 | 7.448 | 2.236 | 0.107 | 2.898 | 2.198 | 0.107 | 3.274 |
| =0.4 | 2.197 | 0.097 | 24.230 | 2.125 | 0.100 | 8.006 | 2.195 | 0.104 | 2.864 | 2.183 | 0.105 | 2.909 |
| =0.5 | 2.495 | 0.112 | 24.582 | 2.047 | 0.096 | 7.776 | 2.177 | 0.103 | 3.171 | 2.120 | 0.102 | 3.062 |
| =0.6 | 2.013 | 0.091 | 26.766 | 2.131 | 0.100 | 8.964 | 2.175 | 0.103 | 3.318 | 2.151 | 0.104 | 3.257 |
| =0.7 | 2.626 | 0.117 | 24.267 | 2.190 | 0.103 | 9.490 | 2.068 | 0.098 | 3.841 | 2.170 | 0.104 | 2.976 |
| =0.8 | 3.025 | 0.131 | 17.133 | 2.283 | 0.107 | 9.134 | 2.128 | 0.101 | 4.018 | 2.187 | 0.104 | 3.770 |
| =0.9 | 1.459 | 0.059 | 10.543 | 2.341 | 0.107 | 8.135 | 2.109 | 0.099 | 4.700 | 2.221 | 0.104 | 3.893 |
| =1 | 0.876 | 0.029 | 1.803 | 1.529 | 0.065 | 2.025 | 2.452 | 0.107 | 1.311 | 2.560 | 0.118 | 1.099 |
| CVaR-CC | 2.417 | 0.113 | 0.446 | 2.235 | 0.106 | 0.337 | 2.273 | 0.109 | 0.273 | 2.270 | 0.111 | 0.316 |
| Av () | Sh | Time | n. of assets | |||||||||
| CVaR | 2.315 | 0.109 | 0.009 | 11 | ||||||||
| Index | 0.567 | 0.021 | - | 83 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Optimization and Mathematical Programming · Economic theories and models
Location and portfolio selection problems: A unified framework
Justo Puerto1,2, Moisés Rodríguez-Madrena 1,2, Andrea Scozzari3
1 Institute of Mathematics University of Seville (IMUS), Seville, Spain
2 Department of Statistics and OR, Universidad de Sevilla, 41012 Seville, Spain {puerto, madrena}@us.es
3 Faculty of Economics, Università degli Studi Niccolò Cusano Roma, Italy [email protected]
Abstract
Given a set of assets and an investment capital, the classical portfolio selection problem consists in determining the amount of capital to be invested in each asset in order to build the most profitable portfolio. The portfolio optimization problem is naturally modeled as a mean-risk bi-criteria optimization problem where the mean rate of return of the portfolio must be maximized whereas a given risk measure must be minimized. Several mathematical programming models and techniques have been presented in the literature in order to efficiently solve the portfolio problem. A relatively recent promising line of research is to exploit clustering information of an assets network in order to develop new portfolio optimization paradigms. In this paper we endow the assets network with a metric based on correlation coefficients between assets’ returns, and show how classical location problems on networks can be used for clustering assets. In particular, by adding a new criterion to the portfolio selection problem based on an objective function of a classical location problem, we are able to measure the effect of clustering on the selected assets with respect to the non-selected ones. Most papers dealing with clustering and portfolio selection models solve these problems in two distinct steps: cluster first and then selection. The innovative contribution of this paper is that we propose a Mixed-Integer Linear Programming formulation for dealing with this problem in a unified phase. The effectiveness of our approach is validated reporting some preliminary computational experiments on some real financial dataset.
Keywords: Portfolio selection; Clustering; -median problem on networks; Conditional Value at Risk; Multicretiria optimization.
1 Introduction and motivation
Portfolio selection problem is among most the widely studied problems in financial literature since the seminal paper of Markowitz in 1952 [35]. He formalized the notion of diversification in investing and used the variance of asset prices as a proxy of risk. In spite of its success, the model proposed by Markowitz has received many criticisms, first and foremost due to the large estimation errors on the vector of expected returns and on the covariance matrix [14, 26, 36]. Hence, different directions have been proposed in the literature exploiting several strategies that consider improved estimates of correlation coefficients [19], alternative risk measures [1, 2], effective diversification approaches (see, e.g., [11, 18, 31] and the references therein). As a consequence, new mathematical programming models and techniques were proposed in the literature in order to efficiently solve the portfolio selection problems incorporating the above mentioned different strategies (the interested reader can refer to [33]).
At the early 2000s, some authors started in tackling the asset allocation problem via graph theory in which assets are identified by vertices of a graph and distances assigned to pair of assets and incorporate the dependency structure of returns. Mantegna [34] was one of the first to construct asset graphs based on stock price correlations in order to detect the hierarchical organization inside a stock market. He showed that a minimal spanning tree of provides an arrangement of stocks which selects the most relevant connections among them. Moreover, the minimal spanning tree provides, in a direct way, the subdominant ultrametric hierarchical organization of the assets in a market. Based on this idea, Onnela et al. [39] extended this approach for finding a subgraph of , and, to this aim, they present a simple greedy-like procedure based on edge distances. Finally, they study the properties of the resulting subgraph such as number and size of clusters and clustering coefficient. These properties, based on real world dataset, were then compared against those of random graphs.
A relatively recent promising line of research is to exploit clustering information of an asset network in order to provide new portfolio optimization techniques. These approaches construct a portfolio by solving the classical Markowitz model by substituting the original correlation matrix with a correlation based clustering ultrametric matrix. The aim is twofold: (i) correlation based clustering may be seen as a filtering procedure and (ii) the portfolios selected by clustering algorithms are quite robust with respect to measurement noise due to the finiteness of sample size [6, 15, 44]. In particular, in [44] the authors substitute the original correlation matrix formed by distinct elements with a smaller one formed by elements that represents a measure of distance (or similarity) between clusters of stocks. They apply the single linkage or the average linkage clustering procedure starting from initial distances between assets and . In [15] the authors construct the starting by using the Pearson Correlation measure, the Kendall rank correlation coefficient and the lower tail dependencies between each pair of assets.
To measure the level of interconnections of an asset with the whole system, in [15] the authors also use the clustering coefficient introduced by Watts and Strogatz in [45], which refers to the number of existing triangles around a vertex with respect to the number of potential ones. Finally, they obtain a positive definite matrix of interconnections between pair of assets that is used in Markowitz’s model in place of the original correlation matrix.
Other authors used graph-theoretic concepts to explore the global properties of stock markets by analyzing the structure of the underlying graph. In [7] the authors solve different classical NP-hard optimization problems to analyze the dependencies among stocks. A maximum clique problem was solved for detecting large clusters of similar and dissimilar stocks according to the natural criterion of pairwise correlation. Independent sets, which represent groups of vertices with no connections, are sought for in the stock graph to find well-diversified portfolios.
The common feature of the above strategies is that they are based on two different and independent phases: clustering first and then analyze the number and properties of the assets clustered this way. In fact, all these papers do not investigate how the clustering phase could be used as an effective tool in the portfolio selection process for assessing portfolios’ optimal weights.
The main contribution of this paper is to present a clustering and portfolio selection strategy in a unified framework that, in principle, could be used and adapted for finding portfolios minimizing different coherent risk measures usually considered in portfolio optimization. In this paper we model the clustering problem as a network -median problem. This is a classical network facility location problem in the Operations Research literature. It was introduced in [27] and, given an vertex graph , the problem consists in finding a set of exactly vertices (facilities) in order to minimize the sum of the distances (accessibility criterion) between the selected facilities and the other vertices of the graph. It is shown in [27] that the problem of finding a -median of a network is NP-hard even when the network has a simple structure.
The -median problem may be also interpreted in terms of cluster analysis, and the data clustering application is straightforward since it simply requires reinterpreting the medians as the median of a cluster instead of as a facility. In fact, in [24] the authors observed that when squared Euclidean distance is used, the problem becomes a discretized version of the well-known -means problem in cluster analysis, and the optimal solution to the -median problem results in a partition of the vertex set into homogeneous clusters. Ng and Han [38] showed that the -median model is useful for detecting patterns and mining data as well as clustering. Thus, the -median model can be a powerful tool for data mining applications. We also mention the research of Benati [3], Benati and García [4], and Benati et al. [5] on clustering based on -median models.
In this paper we provide a novel mathematical programming formulation for the problem of simultaneously locating a set of facilities in a network of stocks while selecting a subset of assets minimizing a given risk measure. In particular, here we consider the Conditional Value at Risk to measure the riskiness of an investment. We note that an asset selected as a facility can be considered as a representative of a cluster of stocks centered in it. We also show that our formulations are portable in the sense that there is a subset of constraints that remains unchanged regardless of the risk measure considered. We test our formulations on different real world financial dataset, and compare the results of the different formulations from a profitability point of view.
The paper is organized as follows. In Section 2 we introduce some necessary definitions and notation along with an analysis of the NP-completeness of our unified problem in some special classes of graphs. In section 3 we provide the mathematical programming formulation of our problem, in addition to a description on how to compute some parameters of the model. Section 4 is devoted to the computational experiments that show the effectiveness of our approach on some real financial datasets. Finally, in the concluding section some remarks and future research are depicted.
2 Notation, definitions and properties
Consider a finite, connected and undirected graph with no self-loops . We denote by and the vertex and edge set, respectively, with and . An edge is identified by a pair of vertices , with . Suppose that a nonnegative real length is assigned to each edge . Each edge is assumed to be rectifiable, so that is the continuum set of points on the edges of . We refer to interior points on an edge by their distances (along the edge) from the two end vertices of the edge. For any pair of points , we let denote the length of a shortest path in connecting and . We refer to as the metric space induced by and the edge lengths. Consider a closed subset of points of ; the distance between a vertex and is defined as the smallest of the distances from to the points in , that is: . Thus, we define:
[TABLE]
the sum of the distances of all the vertices in to the set . The -median problem on consists of finding a set such that
[TABLE]
The set is called a p-median of . It is well-known that there exists a -median whose points are vertices, that is, , with [27], so that a p-median of is a set such that
[TABLE]
The portfolio optimization method we propose here is a clustering approach based on the -median optimization problem to select portfolios by modeling the stock market as a network. In other words, we construct a portfolio by selecting each asset from the clusters obtained by partitioning the (possibly complete) asset network , where is the set of stocks and the distances , , are based on correlation coefficients between each pair of assets.
Following the seminal paper by Markowitz [35], the classical portfolio selection problem consists in determining the amount of capital to be invested in each asset of a given stock market. The problem is modeled as a bi-criteria mean-risk optimization problem as follows:
[TABLE]
where , is the proportion of capital invested in asset , , is the expected rate of return of a portfolio , and is a (coherent) risk measure.
In this paper we show how the objective function of the classical -median problem on networks can be used in order to filter the relevant information in a multivariate set of data. In particular, by means of an objective function of a classical location problem we are able to measure the clustering effect on the selected assets w.r.t. the non-selected ones. To this aim, given a network of assets , we propose a tri-criteria optimization problem of the form:
[TABLE]
being . We note that we impose that a -median of must be a subset of vertices (assets) of and a vertex can be considered as the representative asset of a given cluster. A vertex which is assigned to a vertex , is said to be represented by .
We provide a multi-objective Mixed-Integer Linear Programming (MILP) formulation for dealing with the above problem, that can be written as follows:
[TABLE]
where:
z_{jj}=\begin{cases}1\qquad\hbox{if asset j is selected in the portfolio and considered as a representative}\\ 0\qquad\hbox{otherwise}\end{cases}
z_{ij}=\begin{cases}1\qquad\hbox{if asset ij}\\ 0\qquad\hbox{otherwise.}\end{cases}
Constraint (1e) assures to select exactly representatives (facilities); constraints (1f) guarantee that each asset belongs to exactly one cluster, whereas constraints (1g) assure that an asset is represented by only if is selected as a representative of a cluster. Constraints (1h) tie together the location and portfolio selection problems. In fact, constraint (1h) states that if an asset is selected as a representative, then we invest in asset an amount of capital that must be at least equal to , to prevent holding of a position below a minimal allowable size, and at most , to prevent holding too large positions. In the following denotes the feasible region determined by (1d)-(1i).
There exist different approaches for handling the multi-objective optimization problem (1a)-(1i). Most of the methods convert the given multi-objective program to a single-objective one after a scalarization of the different objective functions. Among the most effective approaches one can consider the -constraint method. It consists in solving the problem with respect to one of its objective functions whereas the remaining objective functions are included as constraints parametrically varying their right-hand-sides with values [20]. Taking values , , for the risk measure () and the average return, respectively, our problem becomes:
[TABLE]
This approach can also handle problems with integrality constraints by adequately choosing the different values of the right-hand-sides. From a portfolio selection viewpoint, as also shown in the seminal paper of Chang et al. [13], the presence of integrality constraints (e.g., cardinality constraints) can make the efficient frontier discontinuous, where the discontinuities might imply that there are certain returns which no rational investor would consider.
2.1 Complexity issues
To analyze the complexity status of our location and portfolio selection problem, let us consider precisely the single-objective program (2a)-(2d). It clearly contains the classical -median problem on networks as a special case. Assume and , and , then constraints (2c) and (2d) can be ignored, i.e., a feasible portfolio can be always provided by setting , for all such that , and problem (2a)-(2d), in fact, reduces to finding facilities in order to minimize the sum of the distances between the opened facilities and the vertices of .
It is well known that the -median problem is NP-hard even in the case where the network is a planar graph of maximum vertex degree 3, all of whose edges and vertices have weight 1 [27]. The proof in [27] is based on the fact that the -median problem is polynomially equivalent to the NP-complete problem of deciding if there exists a dominating set of cardinality in a planar graph of maximum vertex degree 3 [23]. Later, in [16, 17] the authors show that the dominating set problem is NP-complete on some special classes of perfect graphs namely, comparability graphs, bipartite graphs, chordal graphs, split graphs, -trees (arbitrary ), and undirected path graphs (which are defined as the intersection graphs of a set of undirected paths in a tree). Hence, the -median problem remains NP-complete on the same classes of graphs. From the above discussion we have the following result.
Theorem 1
The location and portfolio selection problem (2a)-(2d) is NP-hard for:
- (i)
comparability graphs.
- (ii)
bipartite graphs.
- (iii)
chordal graphs.
- (iv)
split graphs.
- (v)
-trees (arbitrary ).
- (vi)
undirected path graphs.
Finally, we observe that in [17] the authors also report the relationships among some families of perfect graphs. Therefore, the NP-hardness results imply a number of other NP-hardness results because of the containment relationships among the various perfect graph families.
3 Solution method
In this section we present the solution technique used for solving the multi-objective optimization problem (1a)-(1i). We adopt the well-known -constraint method which is based on retaining only one of the three objectives and to turn all the others into constraints [20]. We retain the risk measure as objective, so that model (1a)-(1i) can be rewritten as follows:
[TABLE]
where and are, respectively, a lower bound for and an upper bound for .
Problem (3a)-(3d) is a general location and portfolio selection problem in the sense that any (coherent) risk measures can be considered as objective function. Nevertheless, in this paper we focus on some measures based only on the historical returns. In particular, we consider the Conditional Value at Risk (CVaR) as objective function [40]. We observe that other historical returns based measures could be used (Mean Absolute Deviation, Gini’s index, minimax objective function…). There are several reasons for this choice. From a computational viewpoint, risk measures based on past observation periods lead to integer linear programming problems instead of nonlinear (e.g., quadratic) integer programs that are more difficult to solve. Moreover, in the case of the variance (or a correlation based) risk measure, it is well-known that it is based on the assumption that investors are risk averse and that either the distribution of the rate of return is multivariate normal or the utility of the investors is a quadratic function of the rate of returns. Unfortunately, neither of the two above conditions hold in practice (among others, see, e.g., [14, 28, 32] and the references therein). Finally, when the portfolio objective function is based on the variance, it has been found that the composition of the optimal portfolio can be very sensitive to estimation errors in the expected returns of the underlying assets [14, 26, 36].
Despite the above reasons, it is not possible to completely ignore the correlation between the time series of returns of each asset that must be considered in order to quantify a measure of similarity between pairs of stocks. A widespread choice consists in quantifying the similarity between two assets with Pearson’s correlation. Thus, the asset graph represents the correlation structure between stocks where the weights/distances associated to each edge refer to the linear correlation between them [15]. Therefore, following a common practice in the literature about filtering procedures based on correlation clustering [44], we assign to each edge a similarity measure (distance) where is the correlation coefficient between assets’ returns. We remark here that any alternative measures like the Kendall rank correlation coefficient or the Tail coefficient could be used as distances between assets (see, e.g., [15]). Hence, with these choices, in our model the two main criteria (location and risk) are evaluated with two different measures in order to make the clustering and the portfolio selection phases well characterized and distinct.
In the light of the above discussion, in the following we introduce problem (3a)-(3d) with the CVaR as objective function. Let us consider different scenarios for the returns of the assets. Each scenario has associated a probability , , with . Let be the rate of return of asset at time , , . Let be the average rate of return of asset , i.e., , . The rate of return at time of a portfolio is
[TABLE]
and the corresponding expected rate of return is
[TABLE]
The CVaR of a portfolio with tolerance level is defined as
[TABLE]
In order to obtain a linear formulation for model (3a)-(3d), we consider the dual of problem , that is
[TABLE]
Hence our problem (3a)-(3d) can be reformulated as a Mixed-Integer Linear Programming (MILP) problem
[TABLE]
Remark 1
[22]** In a -median problem on networks, variable , , takes value 1 in an optimal solution for a representative such that .
Remark 1 implies that in model (4a)-(4g) we can consistently reduce the number of binary variables from to by substituting (1i) in with the following constraints
[TABLE]
3.1 Selection of
The selection of the lower bound on the portfolio expected return in model (4a)-(4g) is, evidently, of the taste of the investor. However, how to control the effect of clusterization by means of the upper bound is not obvious. In this section we describe how to compute in (4a)-(4g).
Consider the problem
[TABLE]
and let be its optimal value. Note that is the minimum value for that makes problem (4a)-(4g) feasible.
Let be the set of feasible solutions defined by (1d) together with (4b)-(4d). Consider now the problem
[TABLE]
and let be the set of the selected assets in an optimal solution. Then, is the tightest upper bound for , that is, for all the objective is negligible in (4a)-(4g).
Finally, given , the parameter can be computed as follows
[TABLE]
In this way, the selection of the parameter allows us to control the required clustering effect, from the highest one () to the lowest one ().
4 Experimental results
This section presents an empirical analysis with the aim of evaluating the performance of the portfolios selected by our location and selection model (4a)-(4g). In addition, we also compare our model with the pure CVaR approach [40], regarded as the benchmark program, and the CVaR model with cardinality constraints.
The pure CVaR model is [32, 40]
[TABLE]
and the CVaR model with a limited number of assets (CVaR-CC) is precisely the above model (6a)-(6f) (see, e.g., [12]).
4.1 Data sets
We test all the above portfolio selection strategies on some real-world dataset belonging to the major stock markets across the world. We consider the following datasets:
DJIA (Dow Jones Industrial Average, USA), containing 28 assets and 1353 price observations (period: 07/05/1990 - 04/04/2016);
- 2.
EUROSTOXX50 (Europe’s leading blue-chip index, EU), containing 49 assets and 729 price observations (period: 22/04/2002-04/04/2016);
- 3.
FTSE100 (Financial Times Stock Exchange, UK), containing 83 assets and 625 price observations (period: 19/04/2004-04/04/2016);
- 4.
SP500 (Standard Poor’s, USA), containing 442 assets and 573 observations (period: 18/04/2005-04/04/2016).
Each dataset consists of weekly prices data, and, as observed in [32], “the choice of weekly periodicity is consistent with the objective of reducing estimation errors [43]”. To evaluate the performance of our models in practice, we divide the observations in two sets, where the first one is regarded as the past (in-sample window), and so it is known, and the rest is regarded as the future (out-of-sample window), supposed unknown at the time of portfolio selection. The in-sample window is used for selecting the portfolio, while the out-of-sample one is used for testing the performance of the selected portfolio. Let be the observations in the in-sample window; for each dataset, to compute the matrix of the historical returns, we consider the time series of the prices of the stocks and denote by the price of the th asset at time , . The th asset return at time is computed as , with . As is standard practice in the literature, to model the correlation structure between stocks we refer to the series of the in-sample logarithmic returns of each asset computed as , with (see, e.g., [34, 39]). Then, we compute the distances between assets and , , as where is the Pearson correlation coefficient between assets’ logarithmic returns. We do not set any threshold for the correlation coefficients, thus the resulting asset graph is a complete weighted graph. In our experiments we use a rolling time window scheme allowing for the possibility of rebalancing the portfolio composition during the holding period, at fixed intervals. Following [25, 32], for each dataset we adopt a period of 104 weeks (two years) as in-sample window and, in a first experimental set-up, we consider 52 weeks (one year) as out-of-sample, with rebalancing allowed every 52 weeks. In this phase, for each value of , each model solves overall 55 problems (24 for DJIA, 12 for EUROSTOXX50, 10 for FTSE100 and 9 for SP500).
As in several other papers (see, e.g., [10] and the references therein) the values of for the number of representatives, that are also used for bounding the number of assets in a portfolio, are set to . We choose and as lower and upper bounds for the minimum and maximum allowable amount of capital invested in each asset , .
In portfolio selection problems, the out-of-sample performance of a portfolio is generally evaluated by using some performance measures. In our experiments we consider the following measures (see, e.g., [8])
Sharpe Ratio (Sh) ([41, 42]): it is defined as the ratio between the average of the out-of-sample return of a portfolio , , minus a constant risk free rate of return (that we set equal to 0), and its standard deviation, namely:
[TABLE]
The larger is the value of the index, the better is the portfolio performance.
- 2.
Average return (Av): it is defined as the average of the out-of-sample returns of a portfolio.
The models have been implemented in MATLAB R2017A and they make calls to XPRESS solver version 8.5 for solving the MILP programs. All experiments were run in a computer DellT5500 with a processor Intel(R) Xeon(R) with a CPU X5690 at 3.75 GHz and 48 GB of RAM memory. We report the out-of-sample performances in the following tables. In each table, the first column refers to the models implemented for the comparisons. In particular, with we refer to our MILP model (4a)-(4g) for the different values of used for finding the parameter which bounds the function in (4a)-(4g) (see Section 3.1). CVaR-CC and CVaR refer to the cardinality constraint CVaR model (6a)-(6f) and to the pure CVaR model (7a)-(7c), respectively. In all the models, we choose the average index return in each in-sample period as the lower bound on the expected rate of return. We also consider the out-of-sample performance of the market index (Index) as benchmark. For each value of , in the columns we report the out-of-sample average return (Av-) and the value of the Sharpe Ratio (Sh-). In addition, for the first three dataset (DJIA, EUROSTOXX50 and FTSE100) we also report the average in-sample solution times. Finally, for the pure CVaR model, we also report the average number of assets in the optimal in-sample portfolios.
For the SP500 dataset, due to the size of the corresponding MILP model, we were unable to find an optimal solution in reasonable times. Thus, we set a time limit of 7200 seconds, and in the corresponding table we report the average percentage GAP (GAP) in place of the solution times. Furthermore, we adopt a special methodology for solving the model in each in-sample period. Note that MILP model (4a)-(4g) for can be solved in two easy steps: first, solve problem (5a)-(5c) and then, after having fixed the binary variables according to the -median solution found, solve problem (6a)-(6c). Note also that the optimal solution of problem (4a)-(4g) for is a feasible solution of problem (4a)-(4g) for , thus it can be used as initial solution in the branching procedure in order to get better solutions in smaller times. The same applies for a feasible solution of problem (4a)-(4g) for and problem (4a)-(4g) for , and so on.
In the tables, in bold we provide the best values of the out-of-sample performance measures for each value of . In order to highlight the effectiveness of our MILP approach, for each and value of , we write the values of the two performance indexes in italic if they are both better than the corresponding values of the other models (CVaR-CC, CVaR, Index). In this way we can have a count of how many times our approach outperforms the other competing portfolio selection models.
[FIGURE:]
[FIGURE:]
[FIGURE:]
[FIGURE:]
From Tables 1-4, except for two cases (EUROSTOXX50 with 5 assets and SP500 with 10 assets) there always exists at least one value of for which our MILP model outperforms the other competing models, i.e., CVaR-CC, CVaR and the market index. We also note that, as expected, for the SP500 dataset, the GAP values, which are quite high for , tends to become very small when increases. In fact, for our MILP model produces in-sample solutions that we suppose are very close to the optimal ones, but XPRESS solver was unable to certify their optimality. This would suggest to develop an effective heuristic procedure that combines the good features of location and portfolio selection models in order to obtain optimal or near-optimal in-sample portfolios that will likely produce better out-of-sample performances, in particular, for large size financial dataset.
In order to highlight the effectiveness of our approach, we also decided to provide a second set of experiments. In this case, we adopt a period of 104 weeks as in-sample window and 12 weeks (three months) as out-of-sample, with rebalancing allowed every 12 weeks (see also [32]). For each value of , each model solves overall 204 problems (104 for DJIA, 55 for EUROSTOXX50, 45 for FTSE100). Note that, in this case, we did not consider the SP500 dataset, since we are now interested in comparing our approach with the other programs when our MILP model is able to find optimal (certified) in-sample solutions. In any case, we observed that some preliminary results with the SP500 dataset are in line with the ones reported in Table 4. Following the results in Tables 5-7, it is clear that, when the MILP model is able to find optimal in-sample solutions, our approach proves to be efficient both from a computational times of view, and always dominates the other methods. To conclude this section, we note that, quite surprisingly, in all the experimental analysis the pure CVaR model is always dominated by the other methods.
[FIGURE:]
[FIGURE:]
[FIGURE:]
5 Conclusion
In this paper we propose a novel framework for portfolio selection that combines the specific features of a clustering and a portfolio optimization techniques through the global solution of a hard Mixed-Integer Linear Programming problem. The idea of our approach is to overcome the classical two phases approach characterized by two distinct steps: cluster first and then selection. We show that our method is quite general since other risk measures as well as correlation measures can be adopted in our framework. The resulting MILP program is, in fact, portable in the sense that in model (4a)-(4g), only equations (4a)-(4d) depend on the specific risk measure adopted. Actually, constraints (4e)-(4g) are independent of such risk measure and they can be used in the formulation of MILP programs based on other measures (e.g., Mean Absolute Deviation, Gini’s index, minimax objective function etc…). Our model was tested on real financial dataset, compared to some benchmark models, and found to give good results in terms of realized profit. We also point out that the MILP program (4a)-(4g) can be efficiently solved at least for moderate sized problems.
To conclude, the results reported in this paper are encouraging and there is room for improving the optimization phase by providing an ad hoc heuristic approach for solving large size dataset. This is, in fact, one of our future lines of research.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Artzner, P., Delbaen, F., Eber, J. M., Heath, D.: Coherent Measures of Risk. Mathematical Finance 9, 203–228 (1999)
- 2[2] Bellini, F., Di Bernardino, E.: Risk management with expectiles. The European Journal of Finance 23, 487–506 (2017)
- 3[3] Benati, S.: Categorical data fuzzy clustering: an analysis of local search heuristics. Computers & \& Operations Research 35, 766–775 (2008)
- 4[4] Benati, S., García, S.: A mixed integer linear model for clustering with variable selection. Computers & \& Operations Research 43, 280–285 (2014)
- 5[5] Benati, S., García, S., Puerto, J.: Mixed integer linear programming and heuristic methods for feature selection in clustering. Journal of the Operational Research Society 69, 1379–1395 (2018)
- 6[6] Beraldi, P., Bruni, M.E.: A clustering approach for scenario tree reduction: an application to a stochastic programming portfolio optimization problem. TOP 22, 934–949 (2014)
- 7[7] Boginski, V., Butenko, S., Shirokikh, O., Trukhanov, S., Lafuente, J.G.: A network-based data mining approach to portfolio selection via weighted clique relaxations. Annals of Operations Research 216, 23–34 (2014)
- 8[8] Bruni, R., Cesarone, F., Scozzari, A., Tardella, A.: On exact and approximate stochastic dominance strategies for portfolio selection. European Journal of Operational Research 259, 322–329 (2017)
