Location and portfolio selection problems: A unified framework

Justo Puerto; Moises Rodr\'iguez-Madrena; Andrea Scozzari

arXiv:1907.07101·q-fin.PM·July 17, 2019

Location and portfolio selection problems: A unified framework

Justo Puerto, Moises Rodr\'iguez-Madrena, Andrea Scozzari

PDF

Open Access

TL;DR

This paper introduces a unified mixed-integer linear programming framework that integrates asset clustering and portfolio selection, leveraging location problems on networks to improve investment decision-making.

Contribution

It presents a novel integrated approach combining clustering and portfolio optimization in a single model, unlike traditional two-step methods.

Findings

01

Effective clustering impacts portfolio performance

02

Unified model improves decision-making efficiency

03

Preliminary experiments validate approach on real data

Abstract

Given a set of assets and an investment capital, the classical portfolio selection problem consists in determining the amount of capital to be invested in each asset in order to build the most profitable portfolio. The portfolio optimization problem is naturally modeled as a mean-risk bi-criteria optimization problem where the mean rate of return of the portfolio must be maximized whereas a given risk measure must be minimized. Several mathematical programming models and techniques have been presented in the literature in order to efficiently solve the portfolio problem. A relatively recent promising line of research is to exploit clustering information of an assets network in order to develop new portfolio optimization paradigms. In this paper we endow the assets network with a metric based on correlation coefficients between assets' returns, and show how classical location problems on…

Tables7

Table 1. Table 1: Out-of-sample performances for DJIA with a 52 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	Time-5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	Time-10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	Time-15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	Time-20
$γ$ =0.1	2.064	0.091	0.587	1.848	0.087	0.620	1.903	0.089	0.566	2.162	0.101	0.907
$γ$ =0.2	1.927	0.083	0.536	1.882	0.089	0.628	1.997	0.094	1.144	2.178	0.102	0.894
$γ$ =0.3	2.088	0.091	0.718	1.820	0.086	0.597	1.892	0.090	0.834	2.163	0.101	1.540
$γ$ =0.4	2.206	0.097	0.693	1.770	0.084	0.986	2.034	0.096	1.061	2.225	0.104	1.173
$γ$ =0.5	2.333	0.103	0.642	1.820	0.084	0.779	2.001	0.094	0.722	2.235	0.104	1.306
$γ$ =0.6	2.338	0.104	0.630	1.913	0.088	0.741	1.963	0.092	1.083	2.166	0.101	1.179
$γ$ =0.7	2.366	0.105	0.685	1.985	0.091	0.731	2.100	0.099	0.949	2.122	0.099	0.981
$γ$ =0.8	1.933	0.086	0.654	1.989	0.091	0.623	2.068	0.097	1.044	2.276	0.106	1.177
$γ$ =0.9	2.285	0.099	0.667	2.006	0.091	0.680	2.060	0.097	0.945	2.157	0.100	1.670
$γ$ =1	2.179	0.088	0.082	2.321	0.105	0.130	2.098	0.097	0.191	2.379	0.108	0.139
CVaR-CC	2.279	0.104	0.290	1.796	0.085	0.117	1.874	0.089	0.465	2.112	0.098	0.425
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	1.856	0.087	0.003	8.6
Index	1.608	0.068	-	28

Table 2. Table 2: Out-of-sample performances for EUROSTOXX50 with a 52 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	Time-5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	Time-10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	Time-15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	Time-20
$γ$ =0.1	2.008	0.075	1.239	1.946	0.074	1.528	1.908	0.074	1.383	2.020	0.077	2.723
$γ$ =0.2	2.051	0.076	1.108	1.977	0.075	2.134	1.936	0.075	1.888	1.973	0.076	2.203
$γ$ =0.3	1.979	0.073	1.361	1.974	0.074	2.336	1.971	0.076	3.167	1.993	0.076	2.720
$γ$ =0.4	1.972	0.073	1.670	1.950	0.074	1.596	1.925	0.074	1.579	1.981	0.075	2.915
$γ$ =0.5	1.771	0.063	1.845	2.047	0.077	1.610	2.037	0.078	2.699	2.039	0.078	3.446
$γ$ =0.6	1.874	0.069	1.977	1.910	0.072	1.582	1.944	0.074	2.512	2.054	0.079	3.067
$γ$ =0.7	1.990	0.073	2.137	2.005	0.076	2.145	2.006	0.076	3.593	2.051	0.078	4.445
$γ$ =0.8	1.766	0.064	2.151	2.090	0.078	2.810	2.059	0.078	2.770	2.063	0.078	3.749
$γ$ =0.9	1.714	0.062	2.068	1.979	0.074	2.134	2.045	0.076	3.350	2.111	0.080	5.329
$γ$ =1	0.868	0.030	0.274	1.967	0.065	0.241	1.803	0.067	0.153	1.887	0.070	0.285
CVaR-CC	2.103	0.078	0.327	1.955	0.073	0.077	1.939	0.075	0.102	2.008	0.077	0.150
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	1.922	0.072	0.002	7.2
Index	0.509	0.017	-	49

Table 3. Table 3: Out-of-sample performances for FTSE100 with a 52 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	Time-5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	Time-10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	Time-15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	Time-20
$γ$ =0.1	2.066	0.089	17.495	1.581	0.074	6.434	1.542	0.072	2.732	1.698	0.080	2.309
$γ$ =0.2	1.830	0.079	20.030	1.687	0.078	8.577	1.593	0.075	2.975	1.686	0.079	2.546
$γ$ =0.3	1.627	0.071	24.037	1.596	0.075	24.238	1.540	0.072	2.658	1.653	0.078	3.724
$γ$ =0.4	1.807	0.079	29.242	1.623	0.075	14.249	1.639	0.077	2.809	1.597	0.076	2.605
$γ$ =0.5	2.104	0.092	27.456	1.683	0.079	17.795	1.760	0.082	4.379	1.635	0.077	3.760
$γ$ =0.6	1.921	0.083	26.529	1.870	0.085	16.946	1.698	0.079	6.225	1.699	0.080	3.393
$γ$ =0.7	1.628	0.066	22.059	1.681	0.077	9.477	1.590	0.074	6.440	1.710	0.081	2.966
$γ$ =0.8	1.310	0.053	19.084	1.808	0.081	13.817	1.870	0.087	5.646	1.817	0.085	3.238
$γ$ =0.9	1.499	0.059	10.887	1.716	0.077	8.265	1.785	0.082	5.811	1.845	0.086	3.873
$γ$ =1	0.925	0.026	2.271	1.480	0.057	0.873	1.754	0.069	1.261	2.113	0.095	0.791
CVaR-CC	2.045	0.091	0.442	1.492	0.069	0.317	1.623	0.076	0.329	1.696	0.080	0.225
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	1.528	0.071	0.008	11.7
Index	0.040	0.015	-	83

Table 4. Table 4: Out-of-sample performances for SP500 with a 52 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	GAP $%$ -5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	GAP $%$ -10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	GAP $%$ -15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	GAP $%$ -20
$γ$ =0.1	1.970	0.080	23.634	2.452	0.103	4.371	2.266	0.098	0.302	2.165	0.095	0.058
$γ$ =0.2	1.877	0.075	24.092	2.466	0.104	4.814	2.255	0.098	0.304	2.170	0.095	0.063
$γ$ =0.3	2.028	0.082	25.405	2.478	0.104	5.243	2.186	0.094	0.404	2.170	0.095	0.065
$γ$ =0.4	1.881	0.075	31.787	2.402	0.103	6.853	2.174	0.094	0.540	2.181	0.095	0.058
$γ$ =0.5	1.861	0.073	34.658	2.271	0.097	8.286	2.182	0.095	0.629	2.193	0.096	0.037
$γ$ =0.6	1.980	0.077	35.634	2.024	0.083	9.302	2.077	0.089	1.293	2.183	0.095	0.097
$γ$ =0.7	2.386	0.098	43.565	2.182	0.089	14.402	2.109	0.090	2.527	2.216	0.096	0.088
$γ$ =0.8	2.052	0.084	48.679	1.796	0.076	22.566	2.073	0.087	4.250	2.116	0.092	0.463
$γ$ =0.9	2.239	0.098	54.367	1.906	0.076	27.719	2.113	0.088	11.997	2.284	0.098	2.948
$γ$ =1	2.010	0.067	0	1.364	0.046	0	1.784	0.067	0	1.291	0.056	0
CVaR-CC	2.224	0.094	0.059	2.515	0.108	0.059	2.187	0.095	0.044	2.130	0.093	0.007
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	2.168	0.094	0.033	15.3
Index	1.137	0.040	-	442

Table 5. Table 5: Out-of-sample performances for DJIA with a 12 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	Time-5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	Time-10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	Time-15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	Time-20
$γ$ =0.1	1.726	0.077	0.596	1.608	0.077	0.712	1.746	0.084	0.952	2.198	0.104	0.896
$γ$ =0.2	1.851	0.083	0.540	1.616	0.077	0.681	1.766	0.085	0.940	2.145	0.100	0.937
$γ$ =0.3	1.823	0.081	0.630	1.587	0.077	0.726	1.805	0.086	1.244	2.173	0.102	1.233
$γ$ =0.4	1.756	0.078	0.630	1.511	0.072	0.701	1.786	0.085	1.076	2.199	0.102	0.875
$γ$ =0.5	1.691	0.074	0.686	1.529	0.072	0.798	1.867	0.088	0.989	2.200	0.103	1.399
$γ$ =0.6	1.765	0.077	0.697	1.735	0.082	0.857	1.879	0.090	0.763	2.239	0.104	1.233
$γ$ =0.7	1.595	0.070	0.666	1.664	0.079	0.768	1.869	0.088	0.942	2.193	0.102	1.495
$γ$ =0.8	1.625	0.071	0.710	1.629	0.077	0.791	1.987	0.094	1.066	2.356	0.110	1.189
$γ$ =0.9	1.855	0.079	0.683	1.644	0.076	0.828	1.877	0.089	1.089	2.135	0.099	1.302
$γ$ =1	2.103	0.088	0.101	2.127	0.098	0.154	2.158	0.101	0.203	2.236	0.103	0.118
CVaR-CC	1.530	0.070	0.276	1.499	0.071	0.137	1.752	0.083	0.300	2.088	0.098	0.371
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	1.518	0.071	0.003	8.7
Index	1.608	0.068	-	28

Table 6. Table 6: Out-of-sample performances for EUROSTOXX50 with a 12 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	Time-5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	Time-10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	Time-15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	Time-20
$γ$ =0.1	1.975	0.081	1.314	1.816	0.075	1.418	1.851	0.077	2.031	1.928	0.080	2.022
$γ$ =0.2	2.004	0.084	1.490	1.808	0.075	1.297	1.844	0.077	1.529	1.971	0.082	2.510
$γ$ =0.3	2.102	0.086	1.675	1.821	0.075	1.712	1.917	0.080	2.247	1.941	0.080	2.066
$γ$ =0.4	2.117	0.087	1.652	1.760	0.073	1.655	1.899	0.079	2.105	1.929	0.080	2.654
$γ$ =0.5	2.224	0.092	1.736	1.779	0.073	1.601	1.887	0.078	2.295	2.045	0.085	3.068
$γ$ =0.6	2.061	0.085	1.930	1.866	0.077	1.792	1.896	0.078	2.758	2.074	0.086	2.693
$γ$ =0.7	1.921	0.080	2.065	1.794	0.074	2.407	1.951	0.080	3.260	2.060	0.085	3.665
$γ$ =0.8	2.069	0.084	1.930	1.931	0.079	2.414	2.021	0.083	3.295	2.061	0.085	4.081
$γ$ =0.9	1.826	0.072	1.933	1.983	0.082	2.320	2.106	0.086	4.195	2.061	0.084	4.184
$γ$ =1	2.228	0.077	0.368	2.207	0.090	0.272	1.993	0.079	0.211	2.069	0.083	0.216
CVaR-CC	1.863	0.076	0.162	1.796	0.074	0.104	1.804	0.075	0.148	1.964	0.082	0.202
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	1.745	0.071	0.003	6.9
Index	0.712	0.024	-	49

Table 7. Table 7: Out-of-sample performances for FTSE100 with a 12 weeks rebalancing

Model	Av-5 ( $\cdot 10^{- 3}$ )	Sh-5	Time-5	Av-10 ( $\cdot 10^{- 3}$ )	Sh-10	Time-10	Av-15 ( $\cdot 10^{- 3}$ )	Sh-15	Time-15	Av-20 ( $\cdot 10^{- 3}$ )	Sh-20	Time-20
$γ$ =0.1	2.771	0.123	13.531	2.180	0.103	4.895	2.275	0.109	2.637	2.207	0.108	3.033
$γ$ =0.2	2.670	0.119	19.999	2.322	0.110	5.695	2.325	0.111	2.486	2.257	0.109	3.287
$γ$ =0.3	2.290	0.099	20.967	2.138	0.101	7.448	2.236	0.107	2.898	2.198	0.107	3.274
$γ$ =0.4	2.197	0.097	24.230	2.125	0.100	8.006	2.195	0.104	2.864	2.183	0.105	2.909
$γ$ =0.5	2.495	0.112	24.582	2.047	0.096	7.776	2.177	0.103	3.171	2.120	0.102	3.062
$γ$ =0.6	2.013	0.091	26.766	2.131	0.100	8.964	2.175	0.103	3.318	2.151	0.104	3.257
$γ$ =0.7	2.626	0.117	24.267	2.190	0.103	9.490	2.068	0.098	3.841	2.170	0.104	2.976
$γ$ =0.8	3.025	0.131	17.133	2.283	0.107	9.134	2.128	0.101	4.018	2.187	0.104	3.770
$γ$ =0.9	1.459	0.059	10.543	2.341	0.107	8.135	2.109	0.099	4.700	2.221	0.104	3.893
$γ$ =1	0.876	0.029	1.803	1.529	0.065	2.025	2.452	0.107	1.311	2.560	0.118	1.099
CVaR-CC	2.417	0.113	0.446	2.235	0.106	0.337	2.273	0.109	0.273	2.270	0.111	0.316
	Av ( $\cdot 10^{- 3}$ )	Sh	Time	n. of assets
CVaR	2.315	0.109	0.009	11
Index	0.567	0.021	-	83

Equations68

F (X_{p}) = i \in V (G) \sum d_{i} (X_{p}),

F (X_{p}) = i \in V (G) \sum d_{i} (X_{p}),

F (X_{p}^{*}) = X_{p} \subset A (G) min {F (X_{p})} .

F (X_{p}^{*}) = X_{p} \subset A (G) min {F (X_{p})} .

F (V_{p}^{*}) = V_{p} \subset V (G) min {F (V_{p})} .

F (V_{p}^{*}) = V_{p} \subset V (G) min {F (V_{p})} .

max {[μ (x), - ϱ (x)] ∣ x \in Δ},

max {[μ (x), - ϱ (x)] ∣ x \in Δ},

max {[μ (x), - ϱ (x), - F_{p} (x)] ∣ x \in Δ, V_{p} \subset V (G)},

max {[μ (x), - ϱ (x), - F_{p} (x)] ∣ x \in Δ, V_{p} \subset V (G)},

max

max

min

min

x \in Δ \noindent

j = 1 \sum n z_{j j} = p \noindent

j = 1 \sum n z_{ij} = 1 i = 1, \dots, n \noindent

z_{ij} \leq z_{j j} i, j = 1, \dots, n \noindent

ℓ_{j} z_{j j} \leq x_{j} \leq u_{j} z_{j j} j = 1, \dots, n \noindent

z_{ij} \in {0, 1} i, j = 1, \dots, n \noindent

min

min

(x, z) \in Γ \noindent

ϱ (x) \leq ϱ_{0}, \noindent

μ (x) \geq μ_{0}, \noindent .

min

min

μ (x) \geq μ_{0} \noindent

F_{p} (x) \leq F_{p}^{0} \noindent

(x, z) \in Γ \noindent

y_{t} (x) = j = 1 \sum n r_{j t} x_{j},

y_{t} (x) = j = 1 \sum n r_{j t} x_{j},

μ (x) = j = 1 \sum n μ_{j} x_{j} .

μ (x) = j = 1 \sum n μ_{j} x_{j} .

M_{β} (x) = u_{t} min {\frac{1}{β} t = 1 \sum T y_{t} (x) u_{t} ∣ t = 1 \sum T u_{t} = β, 0 \leq u_{t} \leq p_{t} t = 1, \dots, T} .

M_{β} (x) = u_{t} min {\frac{1}{β} t = 1 \sum T y_{t} (x) u_{t} ∣ t = 1 \sum T u_{t} = β, 0 \leq u_{t} \leq p_{t} t = 1, \dots, T} .

M_{β} (x) = η, d_{t}^{-} max {η - \frac{1}{β} t = 1 \sum T p_{t} d_{t}^{-} : d_{t}^{-} \geq η - y_{t} (x), d_{t}^{-} \geq 0 t = 1, \dots, T, η \in R} .

M_{β} (x) = η, d_{t}^{-} max {η - \frac{1}{β} t = 1 \sum T p_{t} d_{t}^{-} : d_{t}^{-} \geq η - y_{t} (x), d_{t}^{-} \geq 0 t = 1, \dots, T, η \in R} .

max

max

d_{t}^{-} \geq η - y_{t} (x) t = 1, \dots, T \noindent

d_{t}^{-} \geq 0 t = 1, \dots, T \noindent

η \in R \noindent

μ (x) \geq μ_{0} \noindent

F_{p} (x) \leq F_{p}^{0} \noindent

(x, z) \in Γ. \noindent

z_{j j} \in {0, 1} j = 1, \dots, n \noindent

z_{j j} \in {0, 1} j = 1, \dots, n \noindent

z_{ij} \geq 01 \leq i, j \leq n, i \neq = j . \noindent

min

min

μ (x) \geq μ_{0}

(x, z) \in Γ

max

max

(x, d^{-}, η) \in S

μ (x) \geq μ_{0} \noindent

j = 1 \sum n z_{j} = p \noindent

ℓ_{j} z_{j} \leq x_{j} \leq u_{j} z_{j} j = 1, \dots, n \noindent

z_{j} \in {0, 1} j = 1, \dots, n \noindent

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRisk and Portfolio Optimization · Optimization and Mathematical Programming · Economic theories and models

Full text

Location and portfolio selection problems: A unified framework

Justo Puerto1,2, Moisés Rodríguez-Madrena 1,2, Andrea Scozzari3

1 Institute of Mathematics University of Seville (IMUS), Seville, Spain

2 Department of Statistics and OR, Universidad de Sevilla, 41012 Seville, Spain {puerto, madrena}@us.es

3 Faculty of Economics, Università degli Studi Niccolò Cusano Roma, Italy [email protected]

Abstract

Given a set of assets and an investment capital, the classical portfolio selection problem consists in determining the amount of capital to be invested in each asset in order to build the most profitable portfolio. The portfolio optimization problem is naturally modeled as a mean-risk bi-criteria optimization problem where the mean rate of return of the portfolio must be maximized whereas a given risk measure must be minimized. Several mathematical programming models and techniques have been presented in the literature in order to efficiently solve the portfolio problem. A relatively recent promising line of research is to exploit clustering information of an assets network in order to develop new portfolio optimization paradigms. In this paper we endow the assets network with a metric based on correlation coefficients between assets’ returns, and show how classical location problems on networks can be used for clustering assets. In particular, by adding a new criterion to the portfolio selection problem based on an objective function of a classical location problem, we are able to measure the effect of clustering on the selected assets with respect to the non-selected ones. Most papers dealing with clustering and portfolio selection models solve these problems in two distinct steps: cluster first and then selection. The innovative contribution of this paper is that we propose a Mixed-Integer Linear Programming formulation for dealing with this problem in a unified phase. The effectiveness of our approach is validated reporting some preliminary computational experiments on some real financial dataset.

Keywords: Portfolio selection; Clustering; $p$ -median problem on networks; Conditional Value at Risk; Multicretiria optimization.

1 Introduction and motivation

Portfolio selection problem is among most the widely studied problems in financial literature since the seminal paper of Markowitz in 1952 [35]. He formalized the notion of diversification in investing and used the variance of asset prices as a proxy of risk. In spite of its success, the model proposed by Markowitz has received many criticisms, first and foremost due to the large estimation errors on the vector of expected returns and on the covariance matrix [14, 26, 36]. Hence, different directions have been proposed in the literature exploiting several strategies that consider improved estimates of correlation coefficients [19], alternative risk measures [1, 2], effective diversification approaches (see, e.g., [11, 18, 31] and the references therein). As a consequence, new mathematical programming models and techniques were proposed in the literature in order to efficiently solve the portfolio selection problems incorporating the above mentioned different strategies (the interested reader can refer to [33]).

At the early 2000s, some authors started in tackling the asset allocation problem via graph theory in which assets are identified by vertices of a graph $G$ and distances $d_{ij}$ assigned to pair of assets $i$ and $j$ incorporate the dependency structure of returns. Mantegna [34] was one of the first to construct asset graphs based on stock price correlations in order to detect the hierarchical organization inside a stock market. He showed that a minimal spanning tree of $G$ provides an arrangement of stocks which selects the most relevant connections among them. Moreover, the minimal spanning tree provides, in a direct way, the subdominant ultrametric hierarchical organization of the assets in a market. Based on this idea, Onnela et al. [39] extended this approach for finding a subgraph of $G$ , and, to this aim, they present a simple greedy-like procedure based on edge distances. Finally, they study the properties of the resulting subgraph such as number and size of clusters and clustering coefficient. These properties, based on real world dataset, were then compared against those of random graphs.

A relatively recent promising line of research is to exploit clustering information of an asset network in order to provide new portfolio optimization techniques. These approaches construct a portfolio by solving the classical Markowitz model by substituting the original correlation matrix with a correlation based clustering ultrametric matrix. The aim is twofold: (i) correlation based clustering may be seen as a filtering procedure and (ii) the portfolios selected by clustering algorithms are quite robust with respect to measurement noise due to the finiteness of sample size [6, 15, 44]. In particular, in [44] the authors substitute the original correlation matrix formed by $\frac{n(n-1)}{2}$ distinct elements with a smaller one formed by $n-1$ elements that represents a measure of distance (or similarity) between clusters of stocks. They apply the single linkage or the average linkage clustering procedure starting from initial distances $d_{ij}$ between assets $i$ and $j$ . In [15] the authors construct the starting $d_{ij}$ by using the Pearson Correlation measure, the Kendall rank correlation coefficient and the lower tail dependencies between each pair of assets.

To measure the level of interconnections of an asset with the whole system, in [15] the authors also use the clustering coefficient introduced by Watts and Strogatz in [45], which refers to the number of existing triangles around a vertex with respect to the number of potential ones. Finally, they obtain a positive definite matrix $C$ of interconnections between pair of assets that is used in Markowitz’s model in place of the original correlation matrix.

Other authors used graph-theoretic concepts to explore the global properties of stock markets by analyzing the structure of the underlying graph. In [7] the authors solve different classical NP-hard optimization problems to analyze the dependencies among stocks. A maximum clique problem was solved for detecting large clusters of similar and dissimilar stocks according to the natural criterion of pairwise correlation. Independent sets, which represent groups of vertices with no connections, are sought for in the stock graph to find well-diversified portfolios.

The common feature of the above strategies is that they are based on two different and independent phases: clustering first and then analyze the number and properties of the assets clustered this way. In fact, all these papers do not investigate how the clustering phase could be used as an effective tool in the portfolio selection process for assessing portfolios’ optimal weights.

The main contribution of this paper is to present a clustering and portfolio selection strategy in a unified framework that, in principle, could be used and adapted for finding portfolios minimizing different coherent risk measures usually considered in portfolio optimization. In this paper we model the clustering problem as a network $p$ -median problem. This is a classical network facility location problem in the Operations Research literature. It was introduced in [27] and, given an $n$ vertex graph $G$ , the problem consists in finding a set of exactly $p$ vertices (facilities) in order to minimize the sum of the distances (accessibility criterion) between the selected facilities and the other vertices of the graph. It is shown in [27] that the problem of finding a $p$ -median of a network is NP-hard even when the network has a simple structure.

The $p$ -median problem may be also interpreted in terms of cluster analysis, and the data clustering application is straightforward since it simply requires reinterpreting the medians as the median of a cluster instead of as a facility. In fact, in [24] the authors observed that when squared Euclidean distance is used, the problem becomes a discretized version of the well-known $k$ -means problem in cluster analysis, and the optimal solution to the $p$ -median problem results in a partition of the vertex set into homogeneous clusters. Ng and Han [38] showed that the $p$ -median model is useful for detecting patterns and mining data as well as clustering. Thus, the $p$ -median model can be a powerful tool for data mining applications. We also mention the research of Benati [3], Benati and García [4], and Benati et al. [5] on clustering based on $p$ -median models.

In this paper we provide a novel mathematical programming formulation for the problem of simultaneously locating a set of $p$ facilities in a network of stocks while selecting a subset of assets minimizing a given risk measure. In particular, here we consider the Conditional Value at Risk to measure the riskiness of an investment. We note that an asset selected as a facility can be considered as a representative of a cluster of stocks centered in it. We also show that our formulations are portable in the sense that there is a subset of constraints that remains unchanged regardless of the risk measure considered. We test our formulations on different real world financial dataset, and compare the results of the different formulations from a profitability point of view.

The paper is organized as follows. In Section 2 we introduce some necessary definitions and notation along with an analysis of the NP-completeness of our unified problem in some special classes of graphs. In section 3 we provide the mathematical programming formulation of our problem, in addition to a description on how to compute some parameters of the model. Section 4 is devoted to the computational experiments that show the effectiveness of our approach on some real financial datasets. Finally, in the concluding section some remarks and future research are depicted.

2 Notation, definitions and properties

Consider a finite, connected and undirected graph with no self-loops $G=(V,E)$ . We denote by $V(G)$ and $E(G)$ the vertex and edge set, respectively, with $|V(G)|=n$ and $|E(G)|=m$ . An edge $e\in E(G)$ is identified by a pair of vertices $(i,j)$ , with $i,j\in V(G)$ . Suppose that a nonnegative real length $d(e)=d_{ij}$ is assigned to each edge $e\in E(G)$ . Each edge is assumed to be rectifiable, so that $A(G)$ is the continuum set of points on the edges of $G$ . We refer to interior points on an edge by their distances (along the edge) from the two end vertices of the edge. For any pair of points $x,y\in A(G)$ , we let $d_{xy}$ denote the length of a shortest path in $A(G)$ connecting $x$ and $y$ . We refer to $A(G)$ as the metric space induced by $G$ and the edge lengths. Consider a closed subset $X_{p}$ of $p$ points of $A(G)$ ; the distance $d_{i}(X_{p})$ between a vertex $i$ and $X_{p}$ is defined as the smallest of the distances from $i$ to the points in $X_{p}$ , that is: $d_{i}(X_{p})=\min\{d_{ix}|x\in X_{p}\}$ . Thus, we define:

[TABLE]

the sum of the distances of all the vertices in $G$ to the set $X_{p}$ . The $p$ -median problem on $G$ consists of finding a set $X^{*}_{p}$ such that

[TABLE]

The set $X^{*}_{p}$ is called a p-median of $G$ . It is well-known that there exists a $p$ -median whose points are vertices, that is, $V^{*}_{p}=X^{*}_{p}$ , with $V^{*}_{p}\subset V(G)$ [27], so that a p-median of $G$ is a set $V^{*}_{p}$ such that

[TABLE]

The portfolio optimization method we propose here is a clustering approach based on the $p$ -median optimization problem to select portfolios by modeling the stock market as a network. In other words, we construct a portfolio by selecting each asset from the $p$ clusters obtained by partitioning the (possibly complete) asset network $G=(V,E)$ , where $V(G)$ is the set of stocks and the distances $d_{ij}$ , $i,j\in V(G)$ , are based on correlation coefficients $\rho_{ij}$ between each pair of assets.

Following the seminal paper by Markowitz [35], the classical portfolio selection problem consists in determining the amount of capital to be invested in each asset of a given stock market. The problem is modeled as a bi-criteria mean-risk optimization problem as follows:

[TABLE]

where $\Delta=\{\mathbf{x}\in\mathbb{R}^{n}|\mathbf{x}\geq 0,\sum\limits_{i=1}^{n}x_{i}=1\}$ , $x_{i}$ is the proportion of capital invested in asset $i$ , $i=1,\ldots,n$ , $\mu(\mathbf{x})$ is the expected rate of return of a portfolio $\mathbf{x}$ , and $\varrho:\mathbb{R}^{n}\rightarrow\mathbb{R}$ is a (coherent) risk measure.

In this paper we show how the objective function of the classical $p$ -median problem on networks can be used in order to filter the relevant information in a multivariate set of data. In particular, by means of an objective function of a classical location problem we are able to measure the clustering effect on the selected assets w.r.t. the non-selected ones. To this aim, given a network of assets $G=(V,E)$ , we propose a tri-criteria optimization problem of the form:

[TABLE]

being ${F_{p}(\mathbf{x})=F(V_{p})}$ . We note that we impose that a $p$ -median of $G$ must be a subset $V_{p}$ of vertices (assets) of $G$ and a vertex $j\in V_{p}$ can be considered as the representative asset of a given cluster. A vertex $i\notin V_{p}$ which is assigned to a vertex $j\in V_{p}$ , is said to be represented by $j$ .

We provide a multi-objective Mixed-Integer Linear Programming (MILP) formulation for dealing with the above problem, that can be written as follows:

[TABLE]

where:

$z_{jj}=\begin{cases}1\qquad\hbox{if asset$ j $is selected in the portfolio and considered as a representative}\\ 0\qquad\hbox{otherwise}\end{cases}$

$z_{ij}=\begin{cases}1\qquad\hbox{if asset$ i $is not selected in the portfolio but it is represented by asset$ j $}\\ 0\qquad\hbox{otherwise.}\end{cases}$

Constraint (1e) assures to select exactly $p$ representatives (facilities); constraints (1f) guarantee that each asset belongs to exactly one cluster, whereas constraints (1g) assure that an asset $i$ is represented by $j$ only if $j$ is selected as a representative of a cluster. Constraints (1h) tie together the location and portfolio selection problems. In fact, constraint (1h) states that if an asset $j$ is selected as a representative, then we invest in asset $x_{j}$ an amount of capital that must be at least equal to $\ell_{j}\geq 0$ , to prevent holding of a position below a minimal allowable size, and at most $u_{j}\geq 0$ , to prevent holding too large positions. In the following $\Gamma$ denotes the feasible region determined by (1d)-(1i).

There exist different approaches for handling the multi-objective optimization problem (1a)-(1i). Most of the methods convert the given multi-objective program to a single-objective one after a scalarization of the different objective functions. Among the most effective approaches one can consider the $\epsilon$ -constraint method. It consists in solving the problem with respect to one of its objective functions whereas the remaining objective functions are included as constraints parametrically varying their right-hand-sides with values [20]. Taking values $\varrho_{0}$ , $\mu_{0}$ , for the risk measure ( $\varrho$ ) and the average return, respectively, our problem becomes:

[TABLE]

This approach can also handle problems with integrality constraints by adequately choosing the different values of the right-hand-sides. From a portfolio selection viewpoint, as also shown in the seminal paper of Chang et al. [13], the presence of integrality constraints (e.g., cardinality constraints) can make the efficient frontier discontinuous, where the discontinuities might imply that there are certain returns which no rational investor would consider.

2.1 Complexity issues

To analyze the complexity status of our location and portfolio selection problem, let us consider precisely the single-objective program (2a)-(2d). It clearly contains the classical $p$ -median problem on networks as a special case. Assume $\varrho_{0}=+\infty$ and $\mu_{0}=-\infty$ , $\ell_{j}=0$ and $u_{j}=1$ , then constraints (2c) and (2d) can be ignored, i.e., a feasible portfolio can be always provided by setting $x_{j}=\frac{1}{p}$ , for all $j$ such that $z_{jj}=1$ , and problem (2a)-(2d), in fact, reduces to finding $p$ facilities in order to minimize the sum of the distances between the opened facilities and the vertices of $G$ .

It is well known that the $p$ -median problem is NP-hard even in the case where the network is a planar graph of maximum vertex degree 3, all of whose edges and vertices have weight 1 [27]. The proof in [27] is based on the fact that the $p$ -median problem is polynomially equivalent to the NP-complete problem of deciding if there exists a dominating set of cardinality $p$ in a planar graph of maximum vertex degree 3 [23]. Later, in [16, 17] the authors show that the dominating set problem is NP-complete on some special classes of perfect graphs namely, comparability graphs, bipartite graphs, chordal graphs, split graphs, $k$ -trees (arbitrary $k$ ), and undirected path graphs (which are defined as the intersection graphs of a set of undirected paths in a tree). Hence, the $p$ -median problem remains NP-complete on the same classes of graphs. From the above discussion we have the following result.

Theorem 1

The location and portfolio selection problem (2a)-(2d) is NP-hard for:

(i)

comparability graphs.

(ii)

bipartite graphs.

(iii)

chordal graphs.

(iv)

split graphs.

(v)

$k$ -trees (arbitrary $k$ ).

(vi)

undirected path graphs.

Finally, we observe that in [17] the authors also report the relationships among some families of perfect graphs. Therefore, the NP-hardness results imply a number of other NP-hardness results because of the containment relationships among the various perfect graph families.

3 Solution method

In this section we present the solution technique used for solving the multi-objective optimization problem (1a)-(1i). We adopt the well-known $\epsilon$ -constraint method which is based on retaining only one of the three objectives and to turn all the others into constraints [20]. We retain the risk measure as objective, so that model (1a)-(1i) can be rewritten as follows:

[TABLE]

where $\mu_{0}$ and $F_{p}^{0}$ are, respectively, a lower bound for $\mu(\mathbf{x})$ and an upper bound for $F_{p}(\mathbf{x})$ .

Problem (3a)-(3d) is a general location and portfolio selection problem in the sense that any (coherent) risk measures can be considered as objective function. Nevertheless, in this paper we focus on some measures based only on the historical returns. In particular, we consider the Conditional Value at Risk (CVaR) as objective function [40]. We observe that other historical returns based measures could be used (Mean Absolute Deviation, Gini’s index, minimax objective function…). There are several reasons for this choice. From a computational viewpoint, risk measures based on past observation periods lead to integer linear programming problems instead of nonlinear (e.g., quadratic) integer programs that are more difficult to solve. Moreover, in the case of the variance (or a correlation based) risk measure, it is well-known that it is based on the assumption that investors are risk averse and that either the distribution of the rate of return is multivariate normal or the utility of the investors is a quadratic function of the rate of returns. Unfortunately, neither of the two above conditions hold in practice (among others, see, e.g., [14, 28, 32] and the references therein). Finally, when the portfolio objective function is based on the variance, it has been found that the composition of the optimal portfolio can be very sensitive to estimation errors in the expected returns of the underlying assets [14, 26, 36].

Despite the above reasons, it is not possible to completely ignore the correlation between the time series of returns of each asset that must be considered in order to quantify a measure of similarity between pairs of stocks. A widespread choice consists in quantifying the similarity between two assets with Pearson’s correlation. Thus, the asset graph $G$ represents the correlation structure between stocks where the weights/distances associated to each edge refer to the linear correlation between them [15]. Therefore, following a common practice in the literature about filtering procedures based on correlation clustering [44], we assign to each edge $(i,j)\in E(G)$ a similarity measure (distance) $d_{ij}=\sqrt{2(1-\rho_{ij})}$ where $\rho_{ij}=\frac{\sigma_{ij}}{\sqrt{\sigma_{ii}\sigma_{jj}}}$ is the correlation coefficient between assets’ returns. We remark here that any alternative measures like the Kendall rank correlation coefficient or the Tail coefficient could be used as distances between assets (see, e.g., [15]). Hence, with these choices, in our model the two main criteria (location and risk) are evaluated with two different measures in order to make the clustering and the portfolio selection phases well characterized and distinct.

In the light of the above discussion, in the following we introduce problem (3a)-(3d) with the CVaR as objective function. Let us consider $T$ different scenarios for the returns of the $n$ assets. Each scenario $t$ has associated a probability $p_{t}$ , $t=1,\ldots,T$ , with $\sum_{t=1}^{T}p_{t}=1$ . Let $r_{jt}$ be the rate of return of asset $j$ at time $t$ , $j=1,\ldots,n$ , $t=1,\ldots,T$ . Let $\mu_{j}$ be the average rate of return of asset $j$ , i.e., $\mu_{j}=\sum_{t=1}^{T}p_{t}r_{jt}$ , $j=1,\ldots,n$ . The rate of return at time $t$ of a portfolio $\mathbf{x}=(x_{1},\ldots,x_{n})\in\Delta$ is

[TABLE]

and the corresponding expected rate of return is

[TABLE]

The CVaR of a portfolio $\mathbf{x}=(x_{1},\ldots,x_{n})\in\Delta$ with tolerance level $\beta\in(0,1]$ is defined as

[TABLE]

In order to obtain a linear formulation for model (3a)-(3d), we consider the dual of problem $M_{\beta}(\mathbf{x})$ , that is

[TABLE]

Hence our problem (3a)-(3d) can be reformulated as a Mixed-Integer Linear Programming (MILP) problem

[TABLE]

Remark 1

[22]** In a $p$ -median problem on networks, variable $z_{ij}$ , $i\neq j$ , takes value 1 in an optimal solution for a representative $j$ such that $d_{ij}=\min\limits_{k:z_{kk}=1}\{d_{ik}\}$ .

Remark 1 implies that in model (4a)-(4g) we can consistently reduce the number of binary variables from $O(n^{2})$ to $O(n)$ by substituting (1i) in $\Gamma$ with the following constraints

[TABLE]

3.1 Selection of $F_{p}^{0}$

The selection of the lower bound on the portfolio expected return $\mu_{0}$ in model (4a)-(4g) is, evidently, of the taste of the investor. However, how to control the effect of clusterization by means of the upper bound $F_{p}^{0}$ is not obvious. In this section we describe how to compute $F_{p}^{0}$ in (4a)-(4g).

Consider the problem

[TABLE]

and let $F_{p}^{\ell}$ be its optimal value. Note that $F_{p}^{\ell}$ is the minimum value for $F_{p}^{0}$ that makes problem (4a)-(4g) feasible.

Let $\mathcal{S}$ be the set of feasible solutions defined by (1d) together with (4b)-(4d). Consider now the problem

[TABLE]

and let $X_{p}$ be the set of the $p$ selected assets in an optimal solution. Then, $F_{p}(X_{p})=F_{p}^{u}$ is the tightest upper bound for $F_{p}^{0}$ , that is, for all $F_{p}^{0}\geq F_{p}^{u}$ the objective $F_{p}(\mathbf{x})$ is negligible in (4a)-(4g).

Finally, given $\gamma\in[0,1]$ , the parameter $F_{p}^{0}$ can be computed as follows

[TABLE]

In this way, the selection of the parameter $\gamma$ allows us to control the required clustering effect, from the highest one ( $\gamma=1$ ) to the lowest one ( $\gamma=0$ ).

4 Experimental results

This section presents an empirical analysis with the aim of evaluating the performance of the portfolios selected by our location and selection model (4a)-(4g). In addition, we also compare our model with the pure CVaR approach [40], regarded as the benchmark program, and the CVaR model with cardinality constraints.

The pure CVaR model is [32, 40]

[TABLE]

and the CVaR model with a limited number of assets (CVaR-CC) is precisely the above model (6a)-(6f) (see, e.g., [12]).

4.1 Data sets

We test all the above portfolio selection strategies on some real-world dataset belonging to the major stock markets across the world. We consider the following datasets:

DJIA (Dow Jones Industrial Average, USA), containing 28 assets and 1353 price observations (period: 07/05/1990 - 04/04/2016);

2.

EUROSTOXX50 (Europe’s leading blue-chip index, EU), containing 49 assets and 729 price observations (period: 22/04/2002-04/04/2016);

3.

FTSE100 (Financial Times Stock Exchange, UK), containing 83 assets and 625 price observations (period: 19/04/2004-04/04/2016);

4.

SP500 (Standard $\&$ Poor’s, USA), containing 442 assets and 573 observations (period: 18/04/2005-04/04/2016).

Each dataset consists of weekly prices data, and, as observed in [32], “the choice of weekly periodicity is consistent with the objective of reducing estimation errors [43]”. To evaluate the performance of our models in practice, we divide the observations in two sets, where the first one is regarded as the past (in-sample window), and so it is known, and the rest is regarded as the future (out-of-sample window), supposed unknown at the time of portfolio selection. The in-sample window is used for selecting the portfolio, while the out-of-sample one is used for testing the performance of the selected portfolio. Let $0,1,\ldots,T$ be the observations in the in-sample window; for each dataset, to compute the $T\times n$ matrix of the historical returns, we consider the time series of the prices of the $n$ stocks and denote by $P_{i}(t)$ the price of the $i$ th asset at time $t$ , $t=0,\ldots,T$ . The $i$ th asset return at time $t$ is computed as $r_{it}=\frac{P_{i}(t)-P_{i}(t-1)}{P_{i}(t-1)}$ , with $t=1,\ldots,T$ . As is standard practice in the literature, to model the correlation structure between stocks we refer to the series of the in-sample logarithmic returns of each asset $i$ computed as $R_{it}=\ln P_{i}(t)-\ln P_{i}(t-1)$ , with $t=1,\ldots,T$ (see, e.g., [34, 39]). Then, we compute the distances $d_{ij}$ between assets $i$ and $j$ , $i\neq j=1,\ldots,n$ , as $d_{ij}=\sqrt{2(1-\rho_{ij})}$ where $\rho_{ij}$ is the Pearson correlation coefficient between assets’ logarithmic returns. We do not set any threshold for the correlation coefficients, thus the resulting asset graph $G$ is a complete weighted graph. In our experiments we use a rolling time window scheme allowing for the possibility of rebalancing the portfolio composition during the holding period, at fixed intervals. Following [25, 32], for each dataset we adopt a period of 104 weeks (two years) as in-sample window and, in a first experimental set-up, we consider 52 weeks (one year) as out-of-sample, with rebalancing allowed every 52 weeks. In this phase, for each value of $p$ , each model solves overall 55 problems (24 for DJIA, 12 for EUROSTOXX50, 10 for FTSE100 and 9 for SP500).

As in several other papers (see, e.g., [10] and the references therein) the values of $p$ for the number of representatives, that are also used for bounding the number of assets in a portfolio, are set to $p=5,10,15,20$ . We choose $\ell_{i}=\frac{1}{n}$ and $u_{i}=1$ as lower and upper bounds for the minimum and maximum allowable amount of capital invested in each asset $i$ , $i=1,\ldots,n$ .

In portfolio selection problems, the out-of-sample performance of a portfolio is generally evaluated by using some performance measures. In our experiments we consider the following measures (see, e.g., [8])

Sharpe Ratio (Sh) ([41, 42]): it is defined as the ratio between the average of the out-of-sample return of a portfolio $\mathbf{x}$ , $\mu^{\rm{out}}(\mathbf{x})$ , minus a constant risk free rate of return $r_{f}$ (that we set equal to 0), and its standard deviation, namely:

[TABLE]

The larger is the value of the index, the better is the portfolio performance.

2.

Average return (Av): it is defined as the average $E[\mu^{\rm{out}}(\mathbf{x})]$ of the out-of-sample returns of a portfolio.

The models have been implemented in MATLAB R2017A and they make calls to XPRESS solver version 8.5 for solving the MILP programs. All experiments were run in a computer DellT5500 with a processor Intel(R) Xeon(R) with a CPU X5690 at 3.75 GHz and 48 GB of RAM memory. We report the out-of-sample performances in the following tables. In each table, the first column refers to the models implemented for the comparisons. In particular, with $\gamma=0.1,\ldots,1$ we refer to our MILP model (4a)-(4g) for the different values of $\gamma$ used for finding the parameter $F_{p}^{0}$ which bounds the function $F_{p}(\mathbf{x})$ in (4a)-(4g) (see Section 3.1). CVaR-CC and CVaR refer to the cardinality constraint CVaR model (6a)-(6f) and to the pure CVaR model (7a)-(7c), respectively. In all the models, we choose the average index return in each in-sample period as the lower bound $\mu_{0}$ on the expected rate of return. We also consider the out-of-sample performance of the market index (Index) as benchmark. For each value of $p=5,10,15,20$ , in the columns we report the out-of-sample average return (Av- $p$ ) and the value of the Sharpe Ratio (Sh- $p$ ). In addition, for the first three dataset (DJIA, EUROSTOXX50 and FTSE100) we also report the average in-sample solution times. Finally, for the pure CVaR model, we also report the average number of assets in the optimal in-sample portfolios.

For the SP500 dataset, due to the size of the corresponding MILP model, we were unable to find an optimal solution in reasonable times. Thus, we set a time limit of 7200 seconds, and in the corresponding table we report the average percentage GAP (GAP $\%$ ) in place of the solution times. Furthermore, we adopt a special methodology for solving the model in each in-sample period. Note that MILP model (4a)-(4g) for $\gamma=1$ can be solved in two easy steps: first, solve problem (5a)-(5c) and then, after having fixed the binary variables $z_{j}$ according to the $p$ -median solution found, solve problem (6a)-(6c). Note also that the optimal solution of problem (4a)-(4g) for $\gamma=1$ is a feasible solution of problem (4a)-(4g) for $\gamma=0.9$ , thus it can be used as initial solution in the branching procedure in order to get better solutions in smaller times. The same applies for a feasible solution of problem (4a)-(4g) for $\gamma=0.9$ and problem (4a)-(4g) for $\gamma=0.8$ , and so on.

In the tables, in bold we provide the best values of the out-of-sample performance measures for each value of $p$ . In order to highlight the effectiveness of our MILP approach, for each $p$ and value of $\gamma$ , we write the values of the two performance indexes in italic if they are both better than the corresponding values of the other models (CVaR-CC, CVaR, Index). In this way we can have a count of how many times our approach outperforms the other competing portfolio selection models.

[FIGURE:]

From Tables 1-4, except for two cases (EUROSTOXX50 with 5 assets and SP500 with 10 assets) there always exists at least one value of $\gamma$ for which our MILP model outperforms the other competing models, i.e., CVaR-CC, CVaR and the market index. We also note that, as expected, for the SP500 dataset, the GAP values, which are quite high for $p=5,10$ , tends to become very small when $p$ increases. In fact, for $p=15,20$ our MILP model produces in-sample solutions that we suppose are very close to the optimal ones, but XPRESS solver was unable to certify their optimality. This would suggest to develop an effective heuristic procedure that combines the good features of location and portfolio selection models in order to obtain optimal or near-optimal in-sample portfolios that will likely produce better out-of-sample performances, in particular, for large size financial dataset.

In order to highlight the effectiveness of our approach, we also decided to provide a second set of experiments. In this case, we adopt a period of 104 weeks as in-sample window and 12 weeks (three months) as out-of-sample, with rebalancing allowed every 12 weeks (see also [32]). For each value of $p$ , each model solves overall 204 problems (104 for DJIA, 55 for EUROSTOXX50, 45 for FTSE100). Note that, in this case, we did not consider the SP500 dataset, since we are now interested in comparing our approach with the other programs when our MILP model is able to find optimal (certified) in-sample solutions. In any case, we observed that some preliminary results with the SP500 dataset are in line with the ones reported in Table 4. Following the results in Tables 5-7, it is clear that, when the MILP model is able to find optimal in-sample solutions, our approach proves to be efficient both from a computational times of view, and always dominates the other methods. To conclude this section, we note that, quite surprisingly, in all the experimental analysis the pure CVaR model is always dominated by the other methods.

[FIGURE:]

5 Conclusion

In this paper we propose a novel framework for portfolio selection that combines the specific features of a clustering and a portfolio optimization techniques through the global solution of a hard Mixed-Integer Linear Programming problem. The idea of our approach is to overcome the classical two phases approach characterized by two distinct steps: cluster first and then selection. We show that our method is quite general since other risk measures as well as correlation measures can be adopted in our framework. The resulting MILP program is, in fact, portable in the sense that in model (4a)-(4g), only equations (4a)-(4d) depend on the specific risk measure adopted. Actually, constraints (4e)-(4g) are independent of such risk measure and they can be used in the formulation of MILP programs based on other measures (e.g., Mean Absolute Deviation, Gini’s index, minimax objective function etc…). Our model was tested on real financial dataset, compared to some benchmark models, and found to give good results in terms of realized profit. We also point out that the MILP program (4a)-(4g) can be efficiently solved at least for moderate sized problems.

To conclude, the results reported in this paper are encouraging and there is room for improving the optimization phase by providing an ad hoc heuristic approach for solving large size dataset. This is, in fact, one of our future lines of research.

Bibliography45

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Artzner, P., Delbaen, F., Eber, J. M., Heath, D.: Coherent Measures of Risk. Mathematical Finance 9, 203–228 (1999)
2[2] Bellini, F., Di Bernardino, E.: Risk management with expectiles. The European Journal of Finance 23, 487–506 (2017)
3[3] Benati, S.: Categorical data fuzzy clustering: an analysis of local search heuristics. Computers & \& Operations Research 35, 766–775 (2008)
4[4] Benati, S., García, S.: A mixed integer linear model for clustering with variable selection. Computers & \& Operations Research 43, 280–285 (2014)
5[5] Benati, S., García, S., Puerto, J.: Mixed integer linear programming and heuristic methods for feature selection in clustering. Journal of the Operational Research Society 69, 1379–1395 (2018)
6[6] Beraldi, P., Bruni, M.E.: A clustering approach for scenario tree reduction: an application to a stochastic programming portfolio optimization problem. TOP 22, 934–949 (2014)
7[7] Boginski, V., Butenko, S., Shirokikh, O., Trukhanov, S., Lafuente, J.G.: A network-based data mining approach to portfolio selection via weighted clique relaxations. Annals of Operations Research 216, 23–34 (2014)
8[8] Bruni, R., Cesarone, F., Scozzari, A., Tardella, A.: On exact and approximate stochastic dominance strategies for portfolio selection. European Journal of Operational Research 259, 322–329 (2017)

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

Location and portfolio selection problems: A unified framework

Abstract

1 Introduction and motivation

2 Notation, definitions and properties

2.1 Complexity issues

Theorem 1

3 Solution method

Remark 1

3.1 Selection of Fp0F_{p}^{0}Fp0​

4 Experimental results

4.1 Data sets

5 Conclusion

3.1 Selection of $F_{p}^{0}$