Multivariate Geometric Expectiles

Klaus Herrmann; Marius Hofert; Melina Mailhot

arXiv:1704.01503·q-fin.RM·January 19, 2018


TL;DR

This paper introduces multivariate geometric expectiles as a new class of risk measures for d-dimensional distributions, demonstrating their theoretical properties and practical applications in multivariate risk assessment.

Contribution

It presents the first formulation of geometric expectiles for multivariate distributions, including their theoretical properties and consistency of the sample estimator.

Findings

01

Geometric expectiles are unique solutions to a convex risk minimization problem.

02

They are well-behaved under common data transformations.

03

Sample geometric expectiles are consistent estimators.

Abstract

A generalization of expectiles for d-dimensional multivariate distribution functions is introduced. The resulting geometric expectiles are unique solutions to a convex risk minimization problem and are given by d-dimensional vectors. They are well behaved under common data transformations and the corresponding sample version is shown to be a consistent estimator. We exemplify their usage as risk measures in a number of multivariate settings, highlighting the influence of varying margins and dependence structures.

Tables

Table 1: Specification of the random vectors $\bm{X}_{1},\ldots,\bm{X}_{4}$.

Vector	Copula	$X_{i 1} \sim$	$X_{i 2} \sim$
$𝑿_{1} = (X_{11}, X_{12})$	Independence	$𝒩 (0, 1)$	$𝒩 (0, 1)$
$𝑿_{2} = (X_{21}, X_{22})$	Independence	$𝒮 𝒩 (- 1, 1, 2)$	$t_{4}$
$𝑿_{3} = (X_{31}, X_{32})$	Gumbel, $θ = 2$	$𝒩 (0, 1)$	$𝒩 (0, 1)$
$𝑿_{4} = (X_{41}, X_{42})$	Gumbel, $θ = 2$	$𝒮 𝒩 (- 1, 1, 2)$	$t_{4}$

Equations

$\rho_{\alpha}\colon\mathbb{R}\to[0,\infty),\quad t\mapsto\left|\alpha-\mathbb{1}_{(-\infty,0]}(t)\right|\left|t\right|,$

$\lambda_{\alpha}\colon\mathbb{R}\to[0,\infty),\quad t\mapsto\left|\alpha-\mathbb{1}_{(-\infty,0]}(t)\right|\left|t\right|^{2},$

$\Phi_{\bm{u}}\colon\mathbb{R}^{d}\to[0,\infty),\quad\bm{t}\mapsto\Phi_{\bm{u}}(\bm{t})=\frac{1}{2}\left(\left\|\bm{t}\right\|_{2}+\left\langle\bm{u},\bm{t}\right\rangle\right).$

$\operatorname{VaR}_{\bm{\alpha}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[\Phi_{\bm{\alpha}}(\bm{X}-\bm{c})].$

$\lambda_{\alpha}(t)=\frac{1}{2}\left|t\right|\left(\left|t\right|+(2\alpha-1)t\right).$

$\Lambda_{\bm{u}}\colon\mathbb{R}^{d}\to[0,\infty),\quad\bm{t}\mapsto\Lambda_{\bm{u}}(\bm{t})=\frac{1}{2}\left\|\bm{t}\right\|_{2}\left(\left\|\bm{t}\right\|_{2}+\left\langle\bm{u},\bm{t}\right\rangle\right),$

$e_{\bm{\alpha}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})].$

$\frac{\partial}{\partial t_{k}}\Lambda_{\bm{u}}(\bm{t})=t_{k}+\frac{t_{k}}{2\left\|\bm{t}\right\|_{2}}\left\langle\bm{u},\bm{t}\right\rangle+\frac{1}{2}\left\|\bm{t}\right\|_{2}u_{k}.$

$\bm{t}_{n}=r_{n}\begin{pmatrix}\cos(\phi_{n,1})\\ \sin(\phi_{n,1})\cos(\phi_{n,2})\\ \sin(\phi_{n,1})\sin(\phi_{n,2})\cos(\phi_{n,3})\\ \vdots\\ \sin(\phi_{n,1})\cdots\sin(\phi_{n,d-2})\cos(\phi_{n,d-1})\\ \sin(\phi_{n,1})\cdots\sin(\phi_{n,d-2})\sin(\phi_{n,d-1})\end{pmatrix}=r_{n}\bm{\xi}(\phi_{n,1},\ldots,\phi_{n,d-1})$

$\frac{\partial}{\partial t_{k}}\Lambda_{\bm{u}}(\bm{t}_{n})=r_{n}\xi_{k}+\frac{r_{n}\xi_{k}}{2r_{n}}r_{n}\left\langle\bm{u},\bm{\xi}\right\rangle+\frac{1}{2}r_{n}\left\|\bm{\xi}\right\|_{2}u_{k},$

$0.5f(\bm{t}_{1})+0.5f(\bm{t}_{2})-f(0.5(\bm{t}_{1}+\bm{t}_{2}))\geqslant 0$

$-\left\|\bm{x}-\bm{y}\right\|_{2}^{2}\leqslant 2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle\leqslant\left\|\bm{x}-\bm{y}\right\|_{2}^{2}$

$f_{\bm{x},\bm{y}}(\bm{u})=\left\langle\bm{u},(2\left\|\bm{x}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{x}+(2\left\|\bm{y}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{y}\right\rangle.$

$-\left\|\bm{y}\right\|_{2}^{2}\leqslant f_{\bm{0},\bm{y}}(\bm{u})\leqslant\left\|\bm{y}\right\|_{2}^{2}$

$L(\bm{x},\bm{y})=\left\|(2\left\|\bm{x}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{x}+(2\left\|\bm{y}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{y}\right\|_{2}.$

$-L(\bm{x},\bm{y})\leqslant f_{\bm{x},\bm{y}}(\bm{u})\leqslant L(\bm{x},\bm{y}),$

$\left\|\bm{x}-\bm{y}\right\|_{2}^{4}-L(\bm{x},\bm{y})^{2}\geqslant 0.$

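$L(\sigma\bm{x},\sigma\bm{y})=\sigma^{2}L(\bm{x},\bm{y}),\quad\left\|\sigma\bm{x}-\sigma\bm{y}\right\|_{2}^{2}=\sigma^{2}\left\|\bm{x}-\bm{y}\right\|_{2}^{2}$

$\left\|\bm{x}+\bm{y}\right\|_{2}^{2}=\left\|\bm{x}\right\|_{2}^{2}+\left\|\bm{y}\right\|_{2}^{2}+2\left\langle\bm{x},\bm{y}\right\rangle=r^{2}+2\left\langle\bm{x},\bm{y}\right\rangle=1,$

$\left\|\bm{x}-\bm{y}\right\|_{2}^{2}=\left\|\bm{x}\right\|_{2}^{2}+\left\|\bm{y}\right\|_{2}^{2}-2\left\langle\bm{x},\bm{y}\right\rangle=r^{2}-(1-r^{2})=2r^{2}-1.$

$\alpha^{2}=(2r\cos(\theta)-1)^{2},\quad\beta^{2}=(2r\sin(\theta)-1)^{2},\quad\alpha\beta=(2r\cos(\theta)-1)(2r\sin(\theta)-1),$

$L(\bm{x},\bm{y})^{2}=\alpha^{2}r^{2}\cos^{2}(\theta)+\beta^{2}r^{2}\sin^{2}(\theta)+\alpha\beta(1-r^{2})$

$\left\|\bm{x}-\bm{y}\right\|_{2}^{4}-L(\bm{x},\bm{y})^{2}=2r\left(r\cos(\theta)+r\sin(\theta)-1\right)^{2}\left(2r\sin(\theta)\cos(\theta)+\sin(\theta)+\cos(\theta)\right)$

$h(\bm{x},\bm{y})=\frac{1}{2}\left(\left\|\bm{x}-\bm{y}\right\|_{2}^{2}+2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle\right),$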

$-\left\|\bm{x}-\bm{y}\right\|_{2}^{2}\leqslant 2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle,$

$\left|\mathbb{E}[\left\|\bm{X}\right\|_{2}\left\langle\bm{u},\bm{X}\right\rangle]\right|\leqslant\mathbb{E}[\left\|\bm{X}\right\|_{2}\left|\left\langle\bm{u},\bm{X}\right\rangle\right|]\leqslant\mathbb{E}[\left\|\bm{X}\right\|_{2}^{2}].$

$\mathbb{E}[\Lambda_{\bm{u}}(\bm{X}-\bm{c})]\leqslant 2\mathbb{E}[\left\|\bm{X}-\bm{c}\right\|_{2}^{2}]=2\sum_{j=1}^{d}\mathbb{E}[(X_{j}-c_{j})^{2}]<\infty,$

$\phi(\bm{c})=\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})]$


Taxonomy

Topics: Financial Risk and Volatility Modeling · Advanced Statistical Methods and Models · Statistical Methods and Inference

Full text

Multivariate Geometric Expectiles

Klaus Herrmann, Department of Mathematics and Statistics, Concordia University, 1400 de Maisonneuve Blvd. West, Montréal (Québec), Canada H3G 1M8; e-mail: [email protected]

Marius Hofert, Department of Statistics and Actuarial Science, University of Waterloo, 200 University Avenue West, Waterloo (Ontario), Canada N2L 3G1; e-mail: [email protected]

Mélina Mailhot, Department of Mathematics and Statistics, Concordia University, 1400 de Maisonneuve Blvd. West, Montréal (Québec), Canada H3G 1M8; e-mail: [email protected]

(March 14, 2024)

Abstract

A generalization of expectiles for $d$ -dimensional multivariate distribution functions is introduced. The resulting geometric expectiles are unique solutions to a convex risk minimization problem and are given by $d$ -dimensional vectors. They are well behaved under common data transformations and the corresponding sample version is shown to be a consistent estimator. We exemplify their usage as risk measures in a number of multivariate settings, highlighting the influence of varying margins and dependence structures.

Keywords: expectile, geometric quantile, elicitability, dependence, minimizing expected loss, multivariate risk measure

1 Introduction

A fundamental task in risk management and applied actuarial science is to quantify the risks associated with a given position. Prime examples of risky positions are portfolio holdings or (re-)insurance contracts. Quantifying risks is not only necessary for the internal decision making process of financial institutions, insurance companies or individual investors, but also mandatory from a regulatory perspective. For example, the regulatory frameworks for banks (OSFI, AMF, Basel II, 2.5, III) and insurance companies (CIA and, in Europe, Solvency II, Swiss Solvency Test) require not only internal risk modeling, but also demand that businesses quantify and report risks in a specific way, using risk measures. This task is intrinsically multivariate in nature: one of the key principles of the American Property and Casualty Minimum Capital Target Advisory Committee is that ‘Risks should be aggregated. No diversification between risk categories is permitted until evidence confirms diversification will hold in a stress situation’ (see Office of the Superintendent of Financial Institutions (2010)). The Office of the Superintendent of Financial Institutions of Canada states: ‘Gross, ceded and net provisions for claims liabilities must be provided by actuarial lines of business’ (see Office of the Superintendent of Financial Institutions (2014)).

Until recently, regulatory economic capital has been calculated based on univariate risk measures. In this case, i.e., when considering risks separately, the theory of risk measures is well established, see, e.g., McNeil et al. (2015) Chapter 2 for an overview. The two most popular risk measures in this setting are value-at-risk (VaR) and tail-value-at-risk (TVaR; sometimes also referred to as conditional-tail-expectation or expected shortfall).

However, capital allocation has to be re-examined for portfolios in which it is more appropriate to secure capital simultaneously for multiple business activities. In this paper, we introduce a method that allows users to allocate capital to each risk based on possibly different confidence levels, while taking into account the dependence between and among business lines.

In a real-world scenario, markets and assets are interconnected or prone to systemic risk. The same holds true when considering insurance contracts, where dependence can play an important role. It can thus be beneficial to consider risks in a joint framework rather than treating them as isolated entities, such as the top-down allocation rule. Many problems arising from this consideration have been pointed out in the last decade, notably by the Bank of Canada (see Gauthier et al. (2010)). To this end, a general theory for multivariate risk measures which specifically take the underlying dependence structure into account has recently emerged.

Based on multivariate risk measures, the trade-off between two stock indices has been studied by Cherubini et al. (2004) using bivariate inverse quantiles. Losses and allocated loss adjustment expenses (ALAE) have been studied by Di Bernardino et al. (2013), using multivariate value-at-risk and tail-value-at-risk. Guégan and Hassani (2014) allocate risk capital based on bivariate quantiles, where operational risk and other related risks are considered as separate dependent classes. Most of the techniques use an acceptance set, as presented in Jouini et al. (2004), and calculate a metric for each risk class, considering the dependence between those classes. Balbás et al. (2011) present several properties from a general representation of multivariate risk functions.

From an actuarial perspective, multivariate risk measures generalizing VaR are treated in Embrechts and Puccetti (2006), Cossette et al. (2012), Cousin and Di Bernardino (2013) and Torres et al. (2015). Multivariate versions of TVaR have been defined in Cousin and Di Bernardino (2014) and Cossette et al. (2015). Maume-Deschamps et al. (2017) also introduce a multivariate extension of expectiles. However, this approach differs from ours in the sense that it is non-geometrical. Likewise, the statistical community has generalized the notion of quantiles, i.e., VaR, to higher dimensions, e.g., via the notion of statistical depth, see, e.g., Mosler (2013) for an overview, and optimization-based definitions as in Abdous and Theodorescu (1992), Chaudhuri (1996) or Chakraborty (2001). Although the two approaches set out from different starting points, interconnections are possible in some cases, see for example Hallin and Paindaveine (2010). A thorough overview of different approaches can be found in Serfling (2002), while a connection between half-space depth and stress testing risk factors is established in McNeil and Smith (2012).

Our work is motivated by the fact that despite its good properties and popularity, the tail-value-at-risk is not elicitable in the univariate case, see Gneiting (2011). Elicitability is a property that has been investigated in Osband (1985) in order to score the estimation of risks. Therefore, using the same criterion to do forecasting-based model selection and risk assessment is not possible when using $\operatorname{TVaR}$ , or, as shown in Ziegel (2014), any other spectral risk measure other than the expectation. While univariate quantiles are elicitable, and can thus be utilized in forecasting-based model selection, they, however, do not adhere to the broadly accepted framework of coherence, see Artzner et al. (1999), which establishes preferable properties of risk measures in an axiomatic fashion. This is a serious drawback in actuarial applications.

As shown in Ziegel (2014), the only elicitable, law-invariant and coherent risk measures are expectiles, introduced by Newey and Powell (1987). Expectiles generalize the mean for a given probability distribution in much the same way as quantiles generalize the median. Furthermore, they have a natural interpretation when considering the gain-loss ratio connected to a given position, i.e., the ratio of the expected gains over the expected losses, which is a popular performance measure in portfolio management, see Bellini and Di Bernardino (2017). The amount of money that needs to be added to a given position in order to achieve a pre-specified, and in practical applications sufficiently high, gain-loss ratio is given by an expectile. In the univariate case, expectiles therefore combine favourable properties of risk measures and constitute an important addition to the well established VaR and TVaR.

Considering both the need for multivariate risk measures and the favourable properties of univariate expectiles, the main aim of the present study is to define a multivariate version of expectiles and to study its properties. Moreover, this paper introduces the novel concept of allocating a distinct confidence level to each risk, while considering the dependence structure between them.

The paper is structured as follows. We briefly summarize univariate quantiles and expectiles in Section 2, while the main ideas behind the multivariate framework are introduced in Section 3. Specifically, Section 3.1 reviews geometric quantiles, while Section 3.2 defines geometric expectiles as the main contribution of our paper. We discuss population and asymptotic properties of the newly introduced statistical functional in Section 4, while examples are discussed in Section 5. Finally, Section 6 concludes. Selected plots can be reproduced with the latest version of the R package qrmtools; see the vignette geometric_risk_measures.

2 Univariate Quantiles and Expectiles

It is a standard approach in statistics to express population characteristics in terms of minimizing the expected loss of a random variable under a given loss function. Considering the absolute value $\left|\cdot\right|$ , the median solves $\operatorname{med}X=\operatorname*{argmin}_{c\in\mathbb{R}}\mathbb{E}[\left|X-c\right|]$ , while the mean is obtained when considering the square loss $\mathbb{E}[X]=\operatorname*{argmin}_{c\in\mathbb{R}}\mathbb{E}[(X-c)^{2}]$ . In case of the absolute value loss function it is readily observed that, using an asymmetric generalization of $\left|\cdot\right|$ , quantiles other than the median can be obtained. For $\alpha\in(0,1)$ we define the check loss as

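$$\rho_{\alpha}\colon\mathbb{R}\to[0,\infty),\quad t\mapsto\left|\alpha-\mathbb{1}_{(-\infty,0]}(t)\right|\left|t\right|,\tag{1}$$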

where we see that the case $\alpha=0.5$ gives $\rho_{0.5}(t)=\left|t\right|/2$, i.e., the usual absolute value up to a constant factor. Similar to the median this leads to $F^{-1}(\alpha)=\operatorname*{argmin}_{c\in\mathbb{R}}\mathbb{E}[\rho_{\alpha}(X-c)]$ . In Koenker and Bassett (1978) this observation is the starting point to introduce the quantile regression framework. As an alternative, Newey and Powell (1987) introduced an asymmetric version of the square loss along the same lines. To this end we set

$$\lambda_{\alpha}\colon\mathbb{R}\to[0,\infty),\quad t\mapsto\left|\alpha-\mathbb{1}_{(-\infty,0]}(t)\right|\left|t\right|^{2},\tag{2}$$

where again $\alpha\in(0,1)$ . The minimizers $e(\alpha)=\operatorname*{argmin}_{c\in\mathbb{R}}\mathbb{E}[\lambda_{\alpha}(X-c)]$ are called expectiles, analogously to quantiles minimizing the check loss. Again the case $\alpha=0.5$ reduces to the well known motivating example, i.e., $e(0.5)=\mathbb{E}[X]$ . The generalized loss functions are asymmetric versions of their symmetric $\alpha=0.5$ counterparts. Compared to $\rho_{\alpha}$ the loss function $\lambda_{\alpha}$ , however, is continuously differentiable, leading to favorable analytic properties in a minimization context.
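To make the two minimization problems concrete, the following is a minimal Python sketch (not taken from the paper, whose plots rely on the R package qrmtools) that computes sample quantiles and expectiles by minimizing the empirical mean of the losses in (1) and (2). The function names and the use of SciPy's bounded scalar minimizer are illustrative assumptions.

```python
import numpy as np
from scipy.optimize import minimize_scalar


def check_loss(t, alpha):
    """Check loss rho_alpha(t) = |alpha - 1{t <= 0}| * |t| from (1)."""
    t = np.asarray(t, dtype=float)
    return np.abs(alpha - (t <= 0)) * np.abs(t)


def expectile_loss(t, alpha):
    """Asymmetric square loss lambda_alpha(t) = |alpha - 1{t <= 0}| * t**2 from (2)."""
    t = np.asarray(t, dtype=float)
    return np.abs(alpha - (t <= 0)) * t**2


def sample_minimizer(x, loss, alpha):
    """Minimize the empirical mean loss over c in [min(x), max(x)] (illustrative helper)."""
    x = np.asarray(x, dtype=float)
    res = minimize_scalar(
        lambda c: np.mean(loss(x - c, alpha)),
        bounds=(x.min(), x.max()),
        method="bounded",
    )
    return res.x


rng = np.random.default_rng(0)
x = rng.normal(size=2000)

e_half = sample_minimizer(x, expectile_loss, 0.5)  # should be close to the sample mean
e_high = sample_minimizer(x, expectile_loss, 0.9)  # upper expectile
q_high = sample_minimizer(x, check_loss, 0.9)      # should be close to the 90% sample quantile
```

For $\alpha=0.5$ the expectile objective is a multiple of the mean squared error, so `e_half` should agree with `x.mean()` up to the optimizer tolerance.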

3 Multivariate Geometric Risk Measures

In order to generalize univariate expectiles to the multivariate setting we first revisit in Section 3.1 the framework introduced by Chaudhuri (1996). This allows for a suitable generalization of the loss function in (1), leading to multivariate geometric quantiles. In Section 3.2 we then apply the underlying idea of Chaudhuri (1996) to give a multivariate generalization of (2) and to introduce multivariate geometric expectiles as the main contribution of this paper.

3.1. Multivariate Geometric Value-at-Risk

Chaudhuri (1996) provides a definition of multivariate quantiles by generalizing the approach outlined in Section 2. The resulting geometric quantiles are obtained by minimizing the expected loss based on a multivariate loss function generalizing $\rho_{\alpha}$ given in (1).

For $\bm{x},\bm{y}\in\mathbb{R}^{d}$ we denote by $\left\|\bm{x}\right\|_{2}=\sqrt{\bm{x}^{\operatorname{\top}}\bm{x}}$ and $\left\langle\bm{x},\bm{y}\right\rangle=\bm{x}^{\operatorname{\top}}\bm{y}$ the Euclidean norm and inner product respectively, and by $B^{d}=\{\bm{x}\in{\mathbb{R}^{d}}:\left\|\bm{x}\right\|_{2}<1\}\subset{\mathbb{R}^{d}}$ the open unit ball in $\mathbb{R}^{d}$ , where we neglect the superscript in unambiguous situations. For a fixed index $\bm{u}\in B$ Chaudhuri (1996) defines the loss function $\Phi_{\bm{u}}$ as

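$$\Phi_{\bm{u}}\colon\mathbb{R}^{d}\to[0,\infty),\quad\bm{t}\mapsto\Phi_{\bm{u}}(\bm{t})=\frac{1}{2}\left(\left\|\bm{t}\right\|_{2}+\left\langle\bm{u},\bm{t}\right\rangle\right).\tag{3}$$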

While it is immediately clear that $\Phi_{\bm{u}}(\bm{0})=0$ for all $\bm{u}\in B$ , we also have that $\Phi_{\bm{u}}(\bm{t})\geqslant 0$ for all $(\bm{u},\bm{t})\in B\times{\mathbb{R}^{d}}$ using the Cauchy-Schwarz inequality. Convexity of $\Phi_{\bm{u}}$ follows directly from properties of the norm and inner product.

Based on $\Phi_{\bm{u}}$ the (multivariate) geometric quantile, or geometric $\operatorname{VaR}$ , at level $\bm{\alpha}\in B$ for a random vector $\bm{X}$ is then defined as

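$$\operatorname{VaR}_{\bm{\alpha}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[\Phi_{\bm{\alpha}}(\bm{X}-\bm{c})].\tag{4}$$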

As shown in Chaudhuri (1996), the right hand side of (4) is always finite and the minimization is thus well posed. Furthermore, the resulting geometric $\operatorname{VaR}$ is the unique minimizer of (4).
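As an illustration of the definition above, here is a minimal Python sketch of a sample geometric quantile, obtained by replacing the expectation with an empirical mean. The function names, the simulated data and the choice of the derivative-free Nelder-Mead method are assumptions made for this example, not the estimation procedure of Chaudhuri (1996) or of this paper.

```python
import numpy as np
from scipy.optimize import minimize


def phi(u, t):
    """Geometric quantile loss Phi_u(t) = (||t||_2 + <u, t>) / 2, applied row-wise to t."""
    t = np.atleast_2d(t)
    return 0.5 * (np.linalg.norm(t, axis=1) + t @ u)


def sample_geometric_var(x, alpha):
    """Minimize the empirical mean of Phi_alpha(x_i - c) over c in R^d (illustrative helper)."""
    x = np.asarray(x, dtype=float)
    alpha = np.asarray(alpha, dtype=float)
    if np.linalg.norm(alpha) >= 1.0:
        raise ValueError("alpha must lie in the open unit ball")
    objective = lambda c: np.mean(phi(alpha, x - c))
    # Nelder-Mead is derivative-free; Phi_u is not differentiable at the origin.
    res = minimize(
        objective,
        x0=x.mean(axis=0),
        method="Nelder-Mead",
        options={"xatol": 1e-8, "fatol": 1e-12, "maxiter": 5000},
    )
    return res.x


rng = np.random.default_rng(0)
x = rng.standard_normal((4000, 2))

var_center = sample_geometric_var(x, [0.0, 0.0])  # index 0: the spatial median
var_right = sample_geometric_var(x, [0.6, 0.0])   # index pointing along the first axis
```

With the index pointing along the first coordinate axis, the minimizer moves in that direction, while the index $\bm{0}$ yields the spatial median, which is close to the origin for this symmetric sample.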

In (4) the vector $\bm{\alpha}\in B$ takes the role of the confidence level. However, due to the multivariate context, $\operatorname{VaR}$ is now indexed by a $d$ -dimensional vector instead of a scalar. This adds additional flexibility compared to other approaches such as Cousin and Di Bernardino (2013), Ben Tahar (2006) and Cossette et al. (2015), where only one scalar confidence level can be set for the multivariate risk $\bm{X}$ . It is also important to notice that the geometric quantile $\operatorname{VaR}_{\bm{\alpha}}(\bm{X})$ itself is represented by a vector in $\mathbb{R}^{d}$ . This makes the resulting risk measure easier to use for risk analysis than approaches such as Cousin and Di Bernardino (2014), Cousin and Di Bernardino (2013) and Mailhot et al. (2017) where the resulting multivariate quantiles are subsets in $\mathbb{R}^{d}$ .

When comparing traditional confidence levels in $(0,1)$ to the univariate case of our setting, care has to be taken to adjust the indices. Both settings are equivalent by simply re-indexing according to $f\colon[0,1]\to[-1,1],\quad x\mapsto f(x)=2x-1$ . An index of $99\%$ in the traditional setting is therefore comparable to an index of $98\%$ using the convention adopted in this paper.
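The re-indexing can be checked numerically. The short Python sketch below (function names are illustrative) maps a traditional level to the index used here and verifies on a grid that, for $d=1$, the loss $\Phi_{u}$ with $u=2\alpha-1$ coincides with the check loss $\rho_{\alpha}$.

```python
import numpy as np


def to_geometric_index(alpha):
    """Map a traditional level alpha in [0, 1] to the index 2 * alpha - 1 in [-1, 1]."""
    return 2.0 * alpha - 1.0


def check_loss(t, alpha):
    """Univariate check loss rho_alpha(t) = |alpha - 1{t <= 0}| * |t|."""
    t = np.asarray(t, dtype=float)
    return np.abs(alpha - (t <= 0)) * np.abs(t)


def phi_1d(t, u):
    """Univariate case of Phi_u(t) = (|t| + u * t) / 2."""
    t = np.asarray(t, dtype=float)
    return 0.5 * (np.abs(t) + u * t)


alpha = 0.99
u = to_geometric_index(alpha)  # approximately 0.98
t = np.linspace(-3.0, 3.0, 13)
same_loss = np.allclose(phi_1d(t, u), check_loss(t, alpha))
```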

The orientation of the contour lines of the objective function is influenced by the direction of the index $\bm{u}$ , while the magnitude of the index changes the shape of the contour lines. For smaller values of $\left\|\bm{u}\right\|_{2}$ the contour lines are more norm like, i.e., more circular, and in the limit $\left\|\bm{u}\right\|_{2}=0$ , i.e., if and only if $\bm{u}=(0,0)$ we are indeed left with the circular contour lines of the norm.

3.2. Multivariate Geometric Expectiles

Analogously to the approach of Chaudhuri (1996) we introduce our multivariate representation of expectiles via a multivariate generalization of $\lambda_{\alpha}$ . For this purpose it is more convenient to rewrite the original definition of $\lambda_{\alpha}$ given in (2) as

$$\lambda_{\alpha}(t)=\frac{1}{2}\left|t\right|\left(\left|t\right|+(2\alpha-1)t\right).$$

It can easily be verified that both definitions coincide for all $t\in\mathbb{R}$ . Similarly to (3) this motivates our definition of the loss function $\Lambda_{\bm{u}}$ as

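$$\Lambda_{\bm{u}}\colon\mathbb{R}^{d}\to[0,\infty),\quad\bm{t}\mapsto\Lambda_{\bm{u}}(\bm{t})=\frac{1}{2}\left\|\bm{t}\right\|_{2}\left(\left\|\bm{t}\right\|_{2}+\left\langle\bm{u},\bm{t}\right\rangle\right),\tag{5}$$

where $\bm{u}\in B$ is a fixed element of the open unit ball. Since $\Lambda_{\bm{u}}(\bm{t})=\left\|\bm{t}\right\|_{2}\Phi_{\bm{u}}(\bm{t})$ and $\Phi_{\bm{u}}(\bm{t})\geqslant 0$ for all $(\bm{u},\bm{t})\in B\times\mathbb{R}^{d}$ , we also have $\Lambda_{\bm{u}}(\bm{t})\geqslant 0$ for all $(\bm{u},\bm{t})\in B\times\mathbb{R}^{d}$ . As for $\Phi_{\bm{u}}$ we have $\Lambda_{\bm{u}}(\bm{0})=0$ for all $\bm{u}\in B$ .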

For a given confidence level $\bm{\alpha}\in B$ we now define the geometric expectile of a random vector $\bm{X}$ as the minimizer of the expected loss based on $\Lambda_{\bm{\alpha}}$ , i.e.

$$e_{\bm{\alpha}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})].\tag{6}$$

As in the case of the geometric $\operatorname{VaR}$ , the definition of geometric expectiles is based on an index $\bm{\alpha}\in B$ , which allows one to specify both a direction and a magnitude of the confidence level. Furthermore, geometric expectiles are vectors in $\mathbb{R}^{d}$ . This makes them easier to interpret than multivariate risk measures that are given as subsets of $\mathbb{R}^{d}$ . For example, for $\bm{\alpha}=\bm{0}$ the loss reduces to $\Lambda_{\bm{0}}(\bm{t})=\left\|\bm{t}\right\|_{2}^{2}/2$ , so that $e_{\bm{0}}(\bm{X})=\left(\mathbb{E}[X_{1}],\ldots,\mathbb{E}[X_{d}]\right)$ . The mean vector is therefore, analogously to the univariate case, a special case of the geometric expectiles defined in (6). In Section 4 we discuss the existence of a minimizer $e_{\bm{\alpha}}$ and its uniqueness together with further properties of $e_{\bm{\alpha}}$ .
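The definition can be illustrated with a sample version in which the expectation is replaced by an empirical mean. The following Python sketch is an assumption-laden illustration (function names, simulated data and the BFGS optimizer are choices made here, not the paper's implementation); it also checks that the index $\bm{0}$ recovers the sample mean vector.

```python
import numpy as np
from scipy.optimize import minimize


def big_lambda(u, t):
    """Loss Lambda_u(t) = ||t||_2 * (||t||_2 + <u, t>) / 2, applied row-wise to t."""
    t = np.atleast_2d(t)
    norm = np.linalg.norm(t, axis=1)
    return 0.5 * norm * (norm + t @ u)


def sample_geometric_expectile(x, alpha):
    """Minimize the empirical mean of Lambda_alpha(x_i - c) over c in R^d (illustrative helper)."""
    x = np.asarray(x, dtype=float)
    alpha = np.asarray(alpha, dtype=float)
    if np.linalg.norm(alpha) >= 1.0:
        raise ValueError("alpha must lie in the open unit ball")
    objective = lambda c: np.mean(big_lambda(alpha, x - c))
    # The loss is continuously differentiable, so a quasi-Newton method is a natural choice.
    res = minimize(objective, x0=x.mean(axis=0), method="BFGS")
    return res.x


rng = np.random.default_rng(0)
x = rng.standard_normal((3000, 2)) * np.array([1.0, 2.0]) + np.array([0.5, -1.0])

e_zero = sample_geometric_expectile(x, [0.0, 0.0])  # should match the sample mean vector
e_dir = sample_geometric_expectile(x, [0.8, 0.0])   # index pointing along the first axis
```

The second call shifts the minimizer along the first coordinate axis, mirroring the univariate behaviour where levels above $0.5$ move the expectile above the mean.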

Figure 1 displays the contour lines for a two dimensional example of $\Lambda_{\bm{u}}(\bm{t})$ for three indices $\bm{u}_{1}=0.9/\sqrt{2}(1,1)$ , $\bm{u}_{2}=0.9/\sqrt{2}(-1,1)$ and $\bm{u}_{3}=0.5/\sqrt{2}(-1,1)$ . The figure shows how the direction of the index, visualized by the arrow, changes the orientation of the contour lines (compare the left and middle plots). Also, the magnitude of the index $\left\|\bm{u}\right\|_{2}$ influences the shape of the contour lines (compare the middle and right plots), where smaller values of $\left\|\bm{u}\right\|_{2}$ lead to more norm like contours as already discussed in the case of quantiles.

The examples in Section 5 provide numerical illustrations of the resulting expectiles for a number of bivariate distributions, see Figures 4 and 11, as well as an analytic solution to (6) in the special case of a bivariate uniform distribution.

4 Properties of Geometric Expectiles

In this section we discuss properties of geometric expectiles $e_{\bm{\alpha}}$ defined in (6). Clearly, properties of the associated loss function $\Lambda_{\bm{u}}$ play a major role in this discussion which is why we discuss them first in Section 4.1. In Section 4.2 we then derive properties of $e_{\bm{\alpha}}$ . Finally, we discuss asymptotics in Section 4.3 when $e_{\bm{\alpha}}$ needs to be estimated from observed data or approximated when closed-form solutions to the minimization problem cannot be obtained.

4.1. Properties of $\Lambda_{\bm{u}}$

In the univariate setting an advantage of expectiles over quantiles is that the underlying loss function is differentiable at zero. This advantage carries over to the comparison of geometric expectiles and geometric quantiles when $d\geqslant 2$ . The following theorem shows that the geometric expectile loss function continues to be differentiable for $d\geqslant 2$ , while it is straightforward to see that this is not the case for the geometric quantile loss function $\Phi_{\bm{u}}$ defined in (3).

Theorem 4.1 (Differentiability of $\Lambda_{\bm{u}}$ ).

For $\Lambda_{\bm{u}}$ defined in (5) the gradient $\nabla\Lambda_{\bm{u}}(\bm{t})$ exists for all $(\bm{u},\bm{t})\in B\times\mathbb{R}^{d}$ with $\nabla\Lambda_{\bm{u}}(\bm{0})=\bm{0}$ .

Proof.

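For $\bm{t}\neq\bm{0}$ it is clear that the first-order partial derivatives with respect to each variable exist and are finite. To show the claim for $\bm{t}=\bm{0}$ we first consider the $k$-th element of the gradient given by

$$\frac{\partial}{\partial t_{k}}\Lambda_{\bm{u}}(\bm{t})=t_{k}+\frac{t_{k}}{2\left\|\bm{t}\right\|_{2}}\left\langle\bm{u},\bm{t}\right\rangle+\frac{1}{2}\left\|\bm{t}\right\|_{2}u_{k}.$$

Now consider a sequence $(\bm{t}_{n})_{n=1}^{\infty}$ such that $\lim_{n\to\infty}\bm{t}_{n}=\bm{0}$ , and represent each element $\bm{t}_{n}$ via $d$-dimensional polar coordinates, i.e., we consider a radius $r_{n}$ and angles $\phi_{n,1},\ldots,\phi_{n,d-1}$ such that $\bm{t}_{n}$ can be represented by

$$\bm{t}_{n}=r_{n}\begin{pmatrix}\cos(\phi_{n,1})\\ \sin(\phi_{n,1})\cos(\phi_{n,2})\\ \sin(\phi_{n,1})\sin(\phi_{n,2})\cos(\phi_{n,3})\\ \vdots\\ \sin(\phi_{n,1})\cdots\sin(\phi_{n,d-2})\cos(\phi_{n,d-1})\\ \sin(\phi_{n,1})\cdots\sin(\phi_{n,d-2})\sin(\phi_{n,d-1})\end{pmatrix}=r_{n}\bm{\xi}(\phi_{n,1},\ldots,\phi_{n,d-1}),$$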

where $r_{n}\to 0$ as $n\to\infty$ . Writing $\xi_{1},\ldots,\xi_{d}$ for the components of $\bm{\xi}=\bm{\xi}(\phi_{n,1},\ldots,\phi_{n,d-1})$ and noting that $\left\|\bm{\xi}\right\|_{2}=1$ we observe that

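$$\frac{\partial}{\partial t_{k}}\Lambda_{\bm{u}}(\bm{t}_{n})=r_{n}\xi_{k}+\frac{r_{n}\xi_{k}}{2r_{n}}r_{n}\left\langle\bm{u},\bm{\xi}\right\rangle+\frac{1}{2}r_{n}\left\|\bm{\xi}\right\|_{2}u_{k},$$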

which converges to zero for $n\to\infty$ for any sequence $(\bm{t}_{n})_{n=1}^{\infty}$ converging to zero. ∎
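The gradient formula used in the proof can be sanity-checked numerically. The Python sketch below compares the closed-form gradient with central finite differences at a point away from the origin and evaluates the gradient along a sequence $r\bm{\xi}$ with shrinking radius. The test points, the step size and the function names are arbitrary choices for illustration.

```python
import numpy as np


def big_lambda(u, t):
    """Loss Lambda_u(t) = ||t||_2 * (||t||_2 + <u, t>) / 2 for a single vector t."""
    norm = np.linalg.norm(t)
    return 0.5 * norm * (norm + u @ t)


def grad_big_lambda(u, t):
    """Closed-form gradient with k-th entry t_k + t_k <u, t> / (2 ||t||_2) + ||t||_2 u_k / 2."""
    norm = np.linalg.norm(t)
    if norm == 0.0:
        return np.zeros_like(t)  # value at the origin according to Theorem 4.1
    return t + t * (u @ t) / (2.0 * norm) + 0.5 * norm * u


def numerical_grad(u, t, h=1e-6):
    """Central finite differences, used only as an independent check."""
    g = np.zeros_like(t)
    for k in range(t.size):
        e = np.zeros_like(t)
        e[k] = h
        g[k] = (big_lambda(u, t + e) - big_lambda(u, t - e)) / (2.0 * h)
    return g


u = np.array([0.5, -0.3, 0.4])  # Euclidean norm below 1
t = np.array([0.7, -1.2, 0.4])
max_err = np.max(np.abs(grad_big_lambda(u, t) - numerical_grad(u, t)))

# Along t_n = r_n * xi with r_n -> 0, the gradient norm shrinks linearly in r_n.
xi = t / np.linalg.norm(t)
grad_norms = [np.linalg.norm(grad_big_lambda(u, r * xi)) for r in (1.0, 1e-2, 1e-4)]
grad_at_origin = numerical_grad(u, np.zeros(3))
```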

From the definition of $\Phi_{\bm{u}}$ in (3) it is straightforward to see that $\Phi_{\bm{u}}$ is a convex function. While this is also true for the loss function $\Lambda_{\bm{u}}$ tied to geometric expectiles, it is not immediately clear from (5). To simplify the discussion we first recall a well-known result from convex analysis, see for example Rudin (1976).

Lemma 4.1 (Midpoint convexity).

Denote by $f\colon\mathbb{R}^{d}\to\mathbb{R}$ a continuous function. Then $f$ is convex if and only if it is midpoint convex, i.e.

$$0.5f(\bm{t}_{1})+0.5f(\bm{t}_{2})-f(0.5(\bm{t}_{1}+\bm{t}_{2}))\geqslant 0$$

for all $\bm{t}_{1},\bm{t}_{2}\in\mathbb{R}^{d}$ .

To further prepare the result we first present a theorem generalizing the familiar parallelogram identity. While this is an essential component of our convexity proof in Theorem 4.3, the result is interesting in its own right.

Theorem 4.2 (Parallelogram Inequality).

Denote by $\overline{B}=\{\bm{u}\in\mathbb{R}^{d}:\left\|\bm{u}\right\|_{2}\leqslant 1\}$ the closed unit ball in $\mathbb{R}^{d}$ . For any fixed vectors $\bm{x},\bm{y}\in\mathbb{R}^{d}$ it holds that

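$$-\left\|\bm{x}-\bm{y}\right\|_{2}^{2}\leqslant 2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle\leqslant\left\|\bm{x}-\bm{y}\right\|_{2}^{2}\tag{7}$$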

for all $\bm{u}\in\overline{B}$ . For $\bm{u}$ such that $\left\|\bm{u}\right\|_{2}<1$ equality holds in (7) if and only if $\bm{x}=\bm{y}$ .

Proof.

We start by considering the bounded components of (7) as a function of $\bm{u}$ which can be rewritten as

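$$f_{\bm{x},\bm{y}}(\bm{u})=\left\langle\bm{u},(2\left\|\bm{x}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{x}+(2\left\|\bm{y}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{y}\right\rangle.$$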

First, we consider two special cases. For $\bm{x}=\bm{y}$ all terms on all sides in (7) vanish and equality holds for all $\bm{u}\in\overline{B}$ . For $\bm{x}=\bm{0}\neq\bm{y}$ we have $f_{\bm{0},\bm{y}}(\bm{u})=\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle$ . Therefore

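$$-\left\|\bm{y}\right\|_{2}^{2}\leqslant f_{\bm{0},\bm{y}}(\bm{u})\leqslant\left\|\bm{y}\right\|_{2}^{2}$$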

holds as a consequence of the Cauchy-Schwarz inequality. Equality can only hold if $\left\|\bm{u}\right\|_{2}=1$ . Then, for the general case, we consider $\bm{y}\neq\bm{x}\neq\bm{0}$ and define

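$$L(\bm{x},\bm{y})=\left\|(2\left\|\bm{x}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{x}+(2\left\|\bm{y}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2})\bm{y}\right\|_{2}.$$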

With the Cauchy-Schwarz inequality and $\left\|\bm{u}\right\|_{2}\leqslant 1$ we have that

$$-L(\bm{x},\bm{y})\leqslant f_{\bm{x},\bm{y}}(\bm{u})\leqslant L(\bm{x},\bm{y}),$$

where the equality only holds if $\left\|\bm{u}\right\|_{2}=1$ . Our claim is now equivalent to $L(\bm{x},\bm{y})\leqslant\left\|\bm{x}-\bm{y}\right\|_{2}^{2}$ , or equivalently to

$$\left\|\bm{x}-\bm{y}\right\|_{2}^{4}-L(\bm{x},\bm{y})^{2}\geqslant 0.\tag{8}$$

Due to the scale invariance of both terms

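$$L(\sigma\bm{x},\sigma\bm{y})=\sigma^{2}L(\bm{x},\bm{y})\quad\text{and}\quad\left\|\sigma\bm{x}-\sigma\bm{y}\right\|_{2}^{2}=\sigma^{2}\left\|\bm{x}-\bm{y}\right\|_{2}^{2}$$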

for any $\sigma>0$ , we consider without loss of generality $\bm{x}$ and $\bm{y}$ such that $\left\|\bm{x}+\bm{y}\right\|_{2}=1$ . Any other cases can be handled by rescaling with $\sigma=1/\left\|\bm{x}+\bm{y}\right\|_{2}$ . We continue by considering polar coordinates of $(\left\|\bm{x}\right\|_{2},\left\|\bm{y}\right\|_{2})\in\mathbb{R}^{2}$ leading to $\left\|\bm{x}\right\|_{2}=r\cos(\theta)$ and $\left\|\bm{y}\right\|_{2}=r\sin(\theta)$ , where $r>0$ and $0\leqslant\theta\leqslant\pi/2$ due to the strict component wise positivity of $(\left\|\bm{x}\right\|_{2},\left\|\bm{y}\right\|_{2})$ . This yields

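$$\left\|\bm{x}+\bm{y}\right\|_{2}^{2}=\left\|\bm{x}\right\|_{2}^{2}+\left\|\bm{y}\right\|_{2}^{2}+2\left\langle\bm{x},\bm{y}\right\rangle=r^{2}+2\left\langle\bm{x},\bm{y}\right\rangle=1,$$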

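or, equivalently, $2\left\langle\bm{x},\bm{y}\right\rangle=1-r^{2}$ . For $\left\|\bm{x}-\bm{y}\right\|_{2}^{2}$ we have

$$\left\|\bm{x}-\bm{y}\right\|_{2}^{2}=\left\|\bm{x}\right\|_{2}^{2}+\left\|\bm{y}\right\|_{2}^{2}-2\left\langle\bm{x},\bm{y}\right\rangle=r^{2}-(1-r^{2})=2r^{2}-1.$$

For the first term in (8) we therefore have $\left\|\bm{x}-\bm{y}\right\|_{2}^{4}=(2r^{2}-1)^{2}$ . Concerning $L(\bm{x},\bm{y})^{2}$ we have $L(\bm{x},\bm{y})^{2}=\alpha^{2}\left\|\bm{x}\right\|_{2}^{2}+\beta^{2}\left\|\bm{y}\right\|_{2}^{2}+2\alpha\beta\left\langle\bm{x},\bm{y}\right\rangle$ with $\alpha=2\left\|\bm{x}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2}$ and $\beta=2\left\|\bm{y}\right\|_{2}-\left\|\bm{x}+\bm{y}\right\|_{2}$ . In terms of $r$ and $\theta$ we then have

$$\alpha^{2}=(2r\cos(\theta)-1)^{2},\quad\beta^{2}=(2r\sin(\theta)-1)^{2},\quad\alpha\beta=(2r\cos(\theta)-1)(2r\sin(\theta)-1),$$

$$L(\bm{x},\bm{y})^{2}=\alpha^{2}r^{2}\cos^{2}(\theta)+\beta^{2}r^{2}\sin^{2}(\theta)+\alpha\beta(1-r^{2}).$$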

Reformulating (8) in terms of $r$ and $\theta$ then yields

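$$\left\|\bm{x}-\bm{y}\right\|_{2}^{4}-L(\bm{x},\bm{y})^{2}=2r\left(r\cos(\theta)+r\sin(\theta)-1\right)^{2}\left(2r\sin(\theta)\cos(\theta)+\sin(\theta)+\cos(\theta)\right)$$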

which is non-negative given the restrictions on $\theta$ . ∎
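A numerical spot check can complement the proof. The Python sketch below, with arbitrary dimension, seeds and sample sizes, tests inequality (7) on random triples $(\bm{x},\bm{y},\bm{u})$ with $\bm{u}$ in the closed unit ball, and compares both sides of the final identity in $r$ and $\theta$ after rescaling to $\left\|\bm{x}+\bm{y}\right\|_{2}=1$. It is a plausibility check under these assumptions, not a substitute for the argument above.

```python
import numpy as np

norm = np.linalg.norm


def middle_term(u, x, y):
    """2 ||x|| <u, x> + 2 ||y|| <u, y> - ||x + y|| <u, x + y>, the middle part of (7)."""
    return 2 * norm(x) * (u @ x) + 2 * norm(y) * (u @ y) - norm(x + y) * (u @ (x + y))


def big_l(x, y):
    """L(x, y) = ||(2 ||x|| - ||x + y||) x + (2 ||y|| - ||x + y||) y||_2."""
    return norm((2 * norm(x) - norm(x + y)) * x + (2 * norm(y) - norm(x + y)) * y)


def identity_gap(x, y):
    """Relative gap between both sides of the identity after rescaling to ||x + y|| = 1."""
    sigma = 1.0 / norm(x + y)
    x, y = sigma * x, sigma * y
    r = np.hypot(norm(x), norm(y))
    theta = np.arctan2(norm(y), norm(x))
    lhs = norm(x - y) ** 4 - big_l(x, y) ** 2
    rhs = (2 * r * (r * np.cos(theta) + r * np.sin(theta) - 1) ** 2
           * (2 * r * np.sin(theta) * np.cos(theta) + np.sin(theta) + np.cos(theta)))
    return abs(lhs - rhs) / max(1.0, norm(x - y) ** 4)


rng = np.random.default_rng(1)
d = 4
worst_slack = np.inf   # smallest value of ||x - y||^2 - |middle term| over all draws
max_gap = 0.0
for _ in range(2000):
    x, y = rng.normal(size=d), rng.normal(size=d)
    u = rng.normal(size=d)
    u *= rng.uniform() / norm(u)  # random point of the closed unit ball
    worst_slack = min(worst_slack, norm(x - y) ** 2 - abs(middle_term(u, x, y)))
    max_gap = max(max_gap, identity_gap(x, y))
```

A non-negative `worst_slack` (up to rounding) is consistent with (7), and a `max_gap` near machine precision is consistent with the displayed identity.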

Given Lemma 4.1 and Theorem 4.2 we can now establish the strict convexity of $\Lambda_{\bm{u}}$ .

Theorem 4.3 (Strict convexity of $\Lambda_{\bm{u}}$ ).

For every fixed $\bm{u}\in B$ the function $\Lambda_{\bm{u}}$ defined in (5) is strictly convex on $\mathbb{R}^{d}$ .

Proof.

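Due to continuity of $\Lambda_{\bm{u}}$ and Lemma 4.1 we focus on midpoint convexity. To this end, define $D(\bm{x},\bm{y})=0.5\Lambda_{\bm{u}}(\bm{x})+0.5\Lambda_{\bm{u}}(\bm{y})-\Lambda_{\bm{u}}(0.5(\bm{x}+\bm{y}))$ , $\bm{x},\bm{y}\in\mathbb{R}^{d}$ , where we have that $\Lambda_{\bm{u}}(0.5(\bm{x}+\bm{y}))=0.125\left\|\bm{x}+\bm{y}\right\|_{2}^{2}+0.125\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle=0.25\Lambda_{\bm{u}}(\bm{x}+\bm{y})$ . The function $\Lambda_{\bm{u}}$ is then convex if and only if $h\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to\mathbb{R},\quad(\bm{x},\bm{y})\mapsto h(\bm{x},\bm{y})=4D(\bm{x},\bm{y})=2\Lambda_{\bm{u}}(\bm{x})+2\Lambda_{\bm{u}}(\bm{y})-\Lambda_{\bm{u}}(\bm{x}+\bm{y})$ is non-negative. For $h$ we have that

$$\begin{aligned}h(\bm{x},\bm{y})&=\frac{1}{2}\left(2\left\|\bm{x}\right\|_{2}^{2}+2\left\|\bm{y}\right\|_{2}^{2}-\left\|\bm{x}+\bm{y}\right\|_{2}^{2}+2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle\right)\\&=\frac{1}{2}\left(\left\|\bm{x}-\bm{y}\right\|_{2}^{2}+2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle\right),\end{aligned}$$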

where we used the parallelogram identity $2\left\|\bm{x}\right\|_{2}^{2}+2\left\|\bm{y}\right\|_{2}^{2}=\left\|\bm{x}+\bm{y}\right\|_{2}^{2}+\left\|\bm{x}-\bm{y}\right\|_{2}^{2}$ to get the second equality. The condition $h(\bm{x},\bm{y})\geqslant 0$ is equivalent to

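$$-\left\|\bm{x}-\bm{y}\right\|_{2}^{2}\leqslant 2\left\|\bm{x}\right\|_{2}\left\langle\bm{u},\bm{x}\right\rangle+2\left\|\bm{y}\right\|_{2}\left\langle\bm{u},\bm{y}\right\rangle-\left\|\bm{x}+\bm{y}\right\|_{2}\left\langle\bm{u},\bm{x}+\bm{y}\right\rangle,$$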

which holds by Theorem 4.2. The fact that $\Lambda_{\bm{u}}$ is strictly convex follows, as the index $\bm{u}$ is assumed to lie in the open ball, i.e., $\left\|\bm{u}\right\|_{2}=1$ is not permitted. ∎
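The midpoint gap $D(\bm{x},\bm{y})$ from the proof can also be inspected numerically. The Python sketch below evaluates $D$ on random pairs for one fixed index in the open unit ball. The lower bound $(1-\left\|\bm{u}\right\|_{2})\left\|\bm{x}-\bm{y}\right\|_{2}^{2}/8$ used for comparison is not stated in the text; it is an assumption obtained here by combining $\left|f_{\bm{x},\bm{y}}(\bm{u})\right|\leqslant\left\|\bm{u}\right\|_{2}L(\bm{x},\bm{y})$ with (8), and is included only as an illustration.

```python
import numpy as np


def big_lambda(u, t):
    """Loss Lambda_u(t) = ||t||_2 * (||t||_2 + <u, t>) / 2, applied row-wise to t."""
    t = np.atleast_2d(t)
    norm = np.linalg.norm(t, axis=1)
    return 0.5 * norm * (norm + t @ u)


rng = np.random.default_rng(2)
u = np.array([0.6, -0.5, 0.3])  # index in the open unit ball
s = np.linalg.norm(u)

x = rng.normal(size=(5000, 3))
y = rng.normal(size=(5000, 3))

# Midpoint gap D(x, y) = 0.5 Lambda(x) + 0.5 Lambda(y) - Lambda(0.5 (x + y)).
D = 0.5 * big_lambda(u, x) + 0.5 * big_lambda(u, y) - big_lambda(u, 0.5 * (x + y))

# Assumed lower bound (see the lead-in): (1 - ||u||) * ||x - y||^2 / 8.
dist2 = np.sum((x - y) ** 2, axis=1)
lower = (1.0 - s) * dist2 / 8.0
```

Strictly positive values of `D` for all sampled pairs with $\bm{x}\neq\bm{y}$ are what strict midpoint convexity predicts.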

We have so far established that $\Lambda_{\bm{u}}$ is differentiable with a stationary point at $\bm{t}=\bm{0}$ . Furthermore, the strict convexity of $\Lambda_{\bm{u}}$ guarantees that there exists at most one global minimum for $\Lambda_{\bm{u}}$ . To finally ensure the existence of such a minimizer we establish coercivity of $\Lambda_{\bm{u}}$ .

Definition 4.1 (Coercive function on $\mathbb{R}^{d}$ ).

A real valued function $f\colon\mathbb{R}^{d}\to\mathbb{R}$ is said to be coercive if $\lim_{n\to\infty}f(\bm{x}_{n})=\infty$ for all sequences $(\bm{x}_{n})_{n=1}^{\infty}$ such that $\lim_{n\to\infty}\left\|\bm{x}_{n}\right\|_{2}=\infty$ .

Coercivity plays an important role in optimization theory as it ensures the existence of at least one minimizer for a large class of real valued functions. This fact is formalized in the following theorem.

Theorem 4.4.

Denote by $f\colon\mathbb{R}^{d}\to\mathbb{R}$ a coercive and convex function. Then there exists an element $\bm{x}_{0}\in\mathbb{R}^{d}$ such that $f(\bm{x}_{0})=\inf_{\bm{x}\in\mathbb{R}^{d}}f(\bm{x})$ .

Proof.

The proof follows from the more general Theorem $2.11$ and Remark $2.13$ in Barbu and Precupanu (2012) applicable to lower-semicontinuous functions on reflexive Banach spaces. The necessary continuity of $f$ is guaranteed by Proposition $2.3$ in Tuy (2016) stating that a proper convex function on $\mathbb{R}^{d}$ is continuous on every interior point of its effective domain. ∎

To finally tie all parts together we establish the coercivity of $\Lambda_{\bm{u}}$ in the following theorem. Given the strict convexity of $\Lambda_{\bm{u}}$ established in Theorem 4.3, an application of Theorem 4.4 ensures the existence of a unique and global minimizer. From our previous observations, especially Theorem 4.1, we know that this minimum is located at $\bm{0}$ for every given $\bm{u}\in B$ .

Theorem 4.5 (Coercivity of $\Lambda_{\bm{u}}$ ).

The function $\Lambda_{\bm{u}}$ is coercive on $\mathbb{R}^{d}$ .

Proof.

Given that $\left\|\bm{u}\right\|_{2}=s<1$ , the Cauchy-Schwarz inequality implies $\left\langle\bm{u},\bm{x}\right\rangle\geqslant-s\left\|\bm{x}\right\|_{2}$ . Therefore $\Lambda_{\bm{u}}(\bm{x})\geqslant 0.5\left\|\bm{x}\right\|_{2}^{2}(1-s)$ , and since $1-s>0$ the right-hand side tends to $\infty$ as $\left\|\bm{x}\right\|_{2}\to\infty$ , which proves the claim. ∎
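The lower bound used in this proof can be checked numerically. The following Python sketch assumes the normalization $\Lambda_{\bm{u}}(\bm{x})=0.5\left\|\bm{x}\right\|_{2}(\left\|\bm{x}\right\|_{2}+\left\langle\bm{u},\bm{x}\right\rangle)$ , which is consistent with the bound above but is not restated in this section. The function name `loss`, the index `u` and the sampling box are illustrative choices only.

```python
import math
import random


def loss(u, x):
    """Lambda_u(x) = 0.5 * ||x|| * (||x|| + <u, x>); the 0.5 factor is an assumed normalization."""
    norm = math.sqrt(sum(xi * xi for xi in x))
    inner = sum(ui * xi for ui, xi in zip(u, x))
    return 0.5 * norm * (norm + inner)


random.seed(0)
u = (0.6, -0.5)                                   # illustrative index with ||u||_2 < 1
s = math.sqrt(sum(ui * ui for ui in u))

# Smallest observed slack loss(u, x) - 0.5 * (1 - s) * ||x||^2 over random points.
min_slack = float("inf")
for _ in range(10_000):
    x = (random.uniform(-50.0, 50.0), random.uniform(-50.0, 50.0))
    bound = 0.5 * (1.0 - s) * sum(xi * xi for xi in x)
    min_slack = min(min_slack, loss(u, x) - bound)

# The bound is attained in the direction opposite to u, where <u, x> = -s * ||x||.
x_tight = tuple(-10.0 * ui / s for ui in u)
tight_slack = loss(u, x_tight) - 0.5 * (1.0 - s) * sum(xi * xi for xi in x_tight)
```

Up to rounding error, `min_slack` should be non-negative and `tight_slack` should vanish, which indicates that the quadratic lower bound cannot be improved in general.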

4.2. Properties of $e_{\bm{\alpha}}$

With the properties of $\Lambda_{\bm{u}}$ in place we can now tackle those of $e_{\bm{\alpha}}$ . In (6), it is necessary to ensure that the objective function, i.e., the expected loss, is finite. Similarly to the univariate case, we recover that a finite second moment condition for the marginal distributions is sufficient. We thus introduce the condition

(C) For a $d$ -dimensional random vector $\bm{X}$ assume $\mathbb{E}[X_{j}^{2}]<\infty$ for all $j\in\{1,\ldots,d\}$

for ease of reference. This leads to the following result.

Theorem 4.6.

If (C) holds for a $d$ -dimensional random vector $\bm{X}=(X_{1},\ldots,X_{d})$ then $0\leqslant\mathbb{E}[\Lambda_{\bm{u}}(\bm{X}-\bm{c})]<\infty$ for every $\bm{c}\in\mathbb{R}^{d}$ and $\bm{u}\in B$ .

Proof.

We use Jensen’s inequality and $\left\|\bm{u}\right\|_{2}<1$ to obtain that

[TABLE]

For a given $\bm{c}\in\mathbb{R}^{d}$ and $\bm{u}\in B$ , this leads to

[TABLE]

as $\Lambda_{\bm{u}}(\bm{t})\geqslant 0$ for all $(\bm{u},\bm{t})\in B\times\mathbb{R}^{d}$ and $\mathbb{E}[X_{j}^{2}]<\infty$ . ∎

Now that the finiteness of the expected loss is addressed, we turn to the existence and uniqueness of $e_{\bm{\alpha}}$ . To do so we adapt the proof of Theorem $6.8$ in Lehmann (1983) to our more general setting. To this end let

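$$\phi\colon\mathbb{R}^{d}\to[0,\infty),\quad\bm{c}\mapsto\phi(\bm{c})=\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})]\tag{9}$$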

denote the objective function used in (6), and recall the notion of convergence in probability to infinity.

Definition 4.2 (Convergence in probability to $\infty$ ).

A sequence of positive random variables $(Y_{n})_{n=1}^{\infty}$ converges in probability to $\infty$ , if, for every $K>0$ , $\lim_{n\to\infty}\mathbb{P}[Y_{n}>K]=1.$

In preparation for showing coercivity of $\phi$ , we first discuss the probabilistic behaviour of $\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})$ when $\left\|\bm{c}\right\|_{2}$ tends to $\infty$ .

Lemma 4.2.

For a sequence of vectors $(\bm{c}_{n})_{n=1}^{\infty}$ such that $\left\|\bm{c}_{n}\right\|_{2}\to\infty$ and for a fixed random vector $\bm{X}$ , the sequence $(\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c}_{n}))_{n=1}^{\infty}$ converges in probability to $\infty$ .

Proof.

From the proof of Theorem 4.5 we have that $\Lambda_{\bm{\alpha}}(\bm{t})\geqslant 0.5(1-s)\left\|\bm{t}\right\|_{2}^{2}$ , where $s=\left\|\bm{\alpha}\right\|_{2}$ . Therefore, almost surely,

[TABLE]

With the reverse triangle inequality, we also have that

[TABLE]

almost surely. For an arbitrary fixed $K>0$ , we define the sets

[TABLE]

leading to $A_{n}(K)\subset B_{n}(K)\subset C_{n}(K)$ on the basis of inequalities (10) and (11). For $A_{n}(K)$ and every $K>0$ , we have that

[TABLE]

as $n\to\infty$ . Since $\mathbb{P}[A_{n}(K)]\leqslant\mathbb{P}[B_{n}(K)]\leqslant\mathbb{P}[C_{n}(K)]$ , it follows that $\lim_{n\to\infty}\mathbb{P}[C_{n}(K)]=1$ for every $K>0$ . ∎

In a second step, we show that the strict convexity of $\Lambda_{\bm{u}}$ established in Theorem 4.3 carries over to $\phi$ .

Theorem 4.7 (Strict convexity and continuity of $\phi$ ).

If (C) holds for a random vector $\bm{X}$ then $\phi\colon\mathbb{R}^{d}\to[0,\infty),\quad\bm{c}\mapsto\phi(\bm{c})=\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})]$ is strictly convex and continuous on $\mathbb{R}^{d}$ for every fixed $\bm{\alpha}\in B$ .

Proof.

Given that the marginal second moments of $\bm{X}$ are finite, Theorem 4.6 guarantees that $\phi$ is well-defined for every $\bm{c}\in\mathbb{R}^{d}$ . With $\lambda\in(0,1)$ and $\bm{c}_{1},\bm{c}_{2}\in\mathbb{R}^{d}$ such that $\bm{c}_{1}\neq\bm{c}_{2}$ we have

[TABLE]

Continuity follows from the fact that every proper convex function on $\mathbb{R}^{d}$ is continuous on every interior point of its effective domain, see, for example, Proposition 2.3 of Tuy (2016). Having established that $\phi$ is strictly convex on all of $\mathbb{R}^{d}$ the claim follows. ∎

Combining Lemma 4.2 and Theorem 4.7 now allows us to ensure the existence and uniqueness of geometric expectiles.

Theorem 4.8 (Existence and uniqueness of $e_{\bm{\alpha}}$ ).

If (C) holds for a $d$ -dimensional random vector $\bm{X}$ then there exists a unique solution $e_{\bm{\alpha}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\phi(\bm{c})$ for every fixed $\bm{\alpha}\in B$ .

Proof.

First, we show that $\phi$ is coercive and fix a sequence $(\bm{c}_{n})_{n=1}^{\infty}$ such that $\left\|\bm{c}_{n}\right\|_{2}\to\infty$ . To show that $\phi(\bm{c}_{n})$ diverges, we fix an arbitrary $K>0$ and define

$$C_{n}(K)=\{\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c}_{n})>K\}.$$

Given that $\Lambda_{\bm{\alpha}}$ is non-negative, it follows that

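$$\phi(\bm{c}_{n})=\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c}_{n})]\geqslant\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c}_{n})\,\mathds{1}_{C_{n}(K)}]\geqslant K\,\mathbb{P}[C_{n}(K)].$$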

Lemma 4.2 yields $K\mathbb{P}[C_{n}(K)]\to K$ as $n\to\infty$ . Since $K>0$ was arbitrary, $\phi(\bm{c}_{n})$ diverges, i.e., $\phi$ is coercive. By Theorem 4.7, $\phi$ is also continuous and strictly convex. Theorem 4.4 then yields the existence of a minimizer, and strict convexity ensures that it is unique. ∎

Having established the basic properties of geometric expectiles, we now discuss their behaviour under data transformations. As in Chaudhuri (1996) for $\operatorname{VaR}_{\bm{\alpha}}(\bm{X})$ , it is straightforward to show how geometric expectiles behave for translation, rotation and rescaling of the underlying random vector $\bm{X}$ . Adding a deterministic amount to an uncertain position simply shifts the resulting risk measure, in line with translation invariance of coherent risk measures.

Proposition 4.1 (Translation invariance).

If (C) holds for a $d$ -dimensional random vector $\bm{X}$ then $e_{\bm{\alpha}}(\bm{X}+\bm{a})=e_{\bm{\alpha}}(\bm{X})+\bm{a}$ for all $\bm{a}\in\mathbb{R}^{d}$ .

Proof.

By definition, we have $e_{\bm{\alpha}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})]$ . Therefore, $\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-(\bm{c}-\bm{a}))]$ will be minimized by $e_{\bm{\alpha}}(\bm{X})+\bm{a}$ . ∎

Reasonable behaviour under scaling transformations ensures that a change in the underlying measurement units (for example, going from cents to dollars) is appropriately reflected in the behaviour of the risk measure. For geometric expectiles this is the case, as shown next. This property resembles the positive homogeneity of coherent risk measures.

Proposition 4.2 (Positive homogeneity).

If (C) holds for a $d$ -dimensional random vector $\bm{X}$ then $e_{\bm{\alpha}}(\sigma\bm{X})=\sigma e_{\bm{\alpha}}(\bm{X})$ for every positive scalar $\sigma>0$ .

Proof.

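For $\sigma>0$ and $\bm{c}\in\mathbb{R}^{d}$ we have

$$\mathbb{E}[\Lambda_{\bm{\alpha}}(\sigma\bm{X}-\bm{c})]=\mathbb{E}[\Lambda_{\bm{\alpha}}(\sigma(\bm{X}-\bm{c}/\sigma))]=\sigma^{2}\,\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c}/\sigma)].$$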

Given that $e_{\bm{\alpha}}(\bm{X})$ minimizes $\sigma^{2}\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})]$ , as the positive factor $\sigma^{2}$ only changes the value of the objective function but not the location of the optimum, we have that $\sigma e_{\bm{\alpha}}(\bm{X})$ minimizes $\mathbb{E}[\Lambda_{\bm{\alpha}}(\sigma\bm{X}-\bm{c})]$ , i.e., $e_{\bm{\alpha}}(\sigma\bm{X})=\sigma e_{\bm{\alpha}}(\bm{X})$ . ∎

It is reasonable to expect that a permutation of the components of $\bm{X}$ should likewise result in a permutation of the entries of the resulting risk measure. Geometric expectiles are not only well behaved under permutations, but under general orthogonal rotations.

Proposition 4.3 (Rotation with orthogonal matrix).

If (C) holds for a $d$ -dimensional random vector $\bm{X}$ then $e_{A\bm{\alpha}}(A\bm{X})=Ae_{\bm{\alpha}}(\bm{X})$ for every orthogonal matrix $A\in\mathbb{R}^{d\times d}$ .

Proof.

By orthogonality of $A$ , for every $\bm{x},\bm{y}\in\mathbb{R}^{d}$ , we have that $\left\langle A\bm{x},A\bm{y}\right\rangle=\left\langle\bm{x},\bm{y}\right\rangle$ and $\left\|A\bm{x}\right\|_{2}=\left\|\bm{x}\right\|_{2}$ . Denoting by $A^{\operatorname{\top}}$ the transpose of $A$ we therefore get

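$$\mathbb{E}[\Lambda_{A\bm{\alpha}}(A\bm{X}-\bm{c})]=\mathbb{E}[\Lambda_{A\bm{\alpha}}(A(\bm{X}-A^{\operatorname{\top}}\bm{c}))]=\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-A^{\operatorname{\top}}\bm{c})].$$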

Given that $e_{\bm{\alpha}}(\bm{X})$ minimizes $\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-\bm{c})]$ , the minimizer of $\mathbb{E}[\Lambda_{\bm{\alpha}}(\bm{X}-A^{\operatorname{\top}}\bm{c})]$ is given by $Ae_{\bm{\alpha}}(\bm{X})$ . ∎
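The proofs of Propositions 4.1 to 4.3 rest on pointwise identities of the loss, which can be checked directly. The sketch below is a minimal Python illustration and again assumes the normalization $\Lambda_{\bm{\alpha}}(\bm{t})=0.5\left\|\bm{t}\right\|_{2}(\left\|\bm{t}\right\|_{2}+\left\langle\bm{\alpha},\bm{t}\right\rangle)$ . All names and numerical values are hypothetical, and a planar rotation stands in for a general orthogonal matrix $A$ .

```python
import math


def loss(alpha, t):
    """Lambda_alpha(t) = 0.5 * ||t|| * (||t|| + <alpha, t>) (assumed normalization)."""
    norm = math.sqrt(sum(ti * ti for ti in t))
    inner = sum(ai * ti for ai, ti in zip(alpha, t))
    return 0.5 * norm * (norm + inner)


def add(x, y):
    return tuple(a + b for a, b in zip(x, y))


def sub(x, y):
    return tuple(a - b for a, b in zip(x, y))


def scale(sigma, x):
    return tuple(sigma * a for a in x)


def rotate(theta, v):
    """Apply the 2x2 rotation matrix with angle theta (an orthogonal matrix) to v."""
    c, s = math.cos(theta), math.sin(theta)
    return (c * v[0] - s * v[1], s * v[0] + c * v[1])


alpha = (0.4, -0.3)                 # index in the open unit ball
x, c = (1.5, -2.0), (0.2, 0.7)      # one observation and one candidate location
base = loss(alpha, sub(x, c))

# Translation: shifting x and c by the same vector leaves the loss unchanged.
a = (10.0, -3.0)
translated = loss(alpha, sub(add(x, a), add(c, a)))

# Scaling: multiplying x and c by sigma > 0 multiplies the loss by sigma^2.
sigma = 2.5
scaled = loss(alpha, sub(scale(sigma, x), scale(sigma, c)))

# Rotation: rotating x, c and the index alpha together leaves the loss unchanged.
theta = 0.8
rotated = loss(rotate(theta, alpha), sub(rotate(theta, x), rotate(theta, c)))
```

Taking expectations over $\bm{X}$ , each identity shows that a candidate $\bm{c}$ for $\bm{X}$ and the transformed candidate for the transformed vector have the same objective value up to a positive factor, so the minimizers transform accordingly.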

Taken together, Propositions 4.1–4.3 guarantee that $e_{\bm{\alpha}}(\bm{X})$ is well behaved for the most relevant data transformations. In this context it is natural to ask if there exists a suitable ordering $\prec$ for random vectors such that $\bm{X}\prec\bm{Y}$ implies $e_{\bm{\alpha}}(\bm{X})\sqsubset e_{\bm{\alpha}}(\bm{Y})$ for a possibly different ordering $\sqsubset$ . While this point is of great interest, it proved too difficult to establish a suitable result, and we thus leave it as an open question for further research.

In the following Corollary 4.1 and Proposition 4.4 we generalize well-known symmetry properties of univariate expectiles to the multivariate setting. We start by establishing a connection between the geometric expectiles of $\bm{X}$ and $-\bm{X}$ as a corollary of Proposition 4.3.

Corollary 4.1 (Vector sign symmetry).

If (C) holds for a $d$ -dimensional random vector $\bm{X}$ then $e_{\bm{\alpha}}(-\bm{X})=-e_{-\bm{\alpha}}(\bm{X})$ for all $\bm{\alpha}\in B$ .

Proof.

Apply Proposition 4.3 with $A=-I$ , where $I$ is the appropriate identity matrix, and $-\bm{\alpha}$ and re-arrange terms. ∎

For radially symmetric distributions, see, for example, McNeil et al. (2015, Chapter 7), the resulting expectiles also obey a symmetry relation when changing the sign of the underlying index $\bm{\alpha}$ .

Proposition 4.4 (Index sign symmetry).

If (C) holds for a $d$ -dimensional radially symmetric random vector $\bm{X}$ with mean vector $\bm{\mu}$ , then $\bm{\mu}=\tfrac{1}{2}\left(e_{\bm{\alpha}}(\bm{X})+e_{-\bm{\alpha}}(\bm{X})\right)$ for all $\bm{\alpha}\in B$ .

Proof.

For $\bm{\alpha}\in B$ we have

[TABLE]

for all $\bm{c}\in\mathbb{R}^{d}$ . By translation invariance (Proposition 4.1) and radial symmetry $\bm{X}-\bm{\mu}\overset{d}{=}-(\bm{X}-\bm{\mu})$ we therefore have

[TABLE]

where the right hand side is minimized implying that $e_{-\bm{\alpha}}(\bm{X})=2\bm{\mu}-e_{\bm{\alpha}}(\bm{X})$ . ∎
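The symmetry can be illustrated on a finite sample whose empirical distribution is exactly radially symmetric about $\bm{\mu}$ . The Python sketch below assumes the normalization $\Lambda_{\bm{\alpha}}(\bm{t})=0.5\left\|\bm{t}\right\|_{2}(\left\|\bm{t}\right\|_{2}+\left\langle\bm{\alpha},\bm{t}\right\rangle)$ ; the sample, the index and the candidate points are hypothetical. It compares the sample objective for the index $-\bm{\alpha}$ at $\bm{c}$ with the sample objective for $\bm{\alpha}$ at the reflected point $2\bm{\mu}-\bm{c}$ .

```python
import math


def loss(alpha, t):
    """Lambda_alpha(t) = 0.5 * ||t|| * (||t|| + <alpha, t>) (assumed normalization)."""
    norm = math.sqrt(sum(ti * ti for ti in t))
    inner = sum(ai * ti for ai, ti in zip(alpha, t))
    return 0.5 * norm * (norm + inner)


def phi_n(alpha, c, xs):
    """Sample objective: average loss of the residuals x_i - c."""
    return sum(loss(alpha, tuple(xj - cj for xj, cj in zip(x, c))) for x in xs) / len(xs)


mu = (1.0, -2.0)
offsets = [(0.3, 1.2), (-1.0, 0.4), (2.5, -0.7)]
# Including mu + o and mu - o for every offset o makes the sample radially symmetric about mu.
sample = [tuple(m + o for m, o in zip(mu, off)) for off in offsets]
sample += [tuple(m - o for m, o in zip(mu, off)) for off in offsets]

alpha = (0.5, 0.3)
neg_alpha = tuple(-a for a in alpha)
candidates = [(0.0, 0.0), (1.7, -2.4), (-3.0, 5.0)]

# Absolute differences between phi_n for -alpha at c and phi_n for alpha at 2*mu - c.
gaps = [
    abs(
        phi_n(neg_alpha, c, sample)
        - phi_n(alpha, tuple(2.0 * m - cj for m, cj in zip(mu, c)), sample)
    )
    for c in candidates
]
```

Since the two objectives agree at every candidate up to rounding error, their minimizers are reflections of each other through $\bm{\mu}$ , in line with Proposition 4.4.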

In the univariate setting, expectiles are an attractive choice among possible risk measures due to their elicitability. As discussed in Gneiting (2011), elicitability is a property of statistical functionals when considering point forecasts. Denoting by $\mathcal{F}$ the class of probability distributions on $\mathbb{R}^{d}$ with finite second marginal moments, we denote by $T$ a statistical functional, i.e.,

[TABLE]

Statistical functionals can, in general, be set-valued maps, as for example in the case of quantiles. However, we will here concentrate on the case where they take values in the Euclidean space, and we thus adjust the definition of elicitability given in Gneiting (2011) to this case.

Definition 4.3 (Elicitability).

A statistical functional $T$ is called elicitable relative to the class $\mathcal{F}$ , if

1)

there exists a scoring function $S\colon\mathbb{R}^{d}\times\mathbb{R}^{d}\to[0,\infty),\quad(\bm{x},\bm{y})\mapsto S(\bm{x},\bm{y})$ such that there is a representation

$$T(F)=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[S(\bm{c},\bm{X})]$$

for every $F\in\mathcal{F}$ where $\bm{X}\sim F$ , and

2)

$\mathbb{E}[S(T(F),\bm{X})]=\mathbb{E}[S(\bm{c},\bm{X})]$ implies $\bm{c}=T(F)$ .

A functional $T$ is therefore elicitable, if it can be represented as the unique minimizer of a Bayes rule for a suitable scoring function. For geometric expectiles, we can define, for $\bm{\alpha}\in B$ , an associated functional $T_{\bm{\alpha}}$ as

[TABLE]

where Theorem 4.8 guarantees that $T_{\bm{\alpha}}(F)$ is not set-valued. It is then clear from the definition of $e_{\bm{\alpha}}$ that the scoring function

[TABLE]

makes $T_{\bm{\alpha}}$ elicitable relative to the class $\mathcal{F}$ . Here again Theorem 4.8 plays a crucial role; it applies because every joint distribution in $\mathcal{F}$ has margins with finite second moments.

In the univariate case, elicitability allows one to assess and compare the forecasting performance of competing models; see Nolde and Ziegel (2017) for a discussion. In a practical setting this allows one to select a best model based on expectile point-forecasting performance and to implement meaningful expectile-based backtesting procedures against real data. Elicitability of geometric expectiles now potentially opens the door to implementing model selection and backtesting procedures for the underlying joint distribution, as opposed to the marginal distributions only. From a theoretical perspective, geometric expectiles also add to the understanding of multivariate elicitability by providing a scoring function that is not a linear combination of univariate scoring functions; see Fissler and Ziegel (2016) for an in-depth discussion.
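As a rough illustration of how such a comparison could look, the following Python sketch ranks two point forecasts by their average score over a set of observations. It assumes the scoring function takes the form $S_{\bm{\alpha}}(\bm{x},\bm{y})=\Lambda_{\bm{\alpha}}(\bm{y}-\bm{x})$ with forecast $\bm{x}$ and observation $\bm{y}$ , together with the normalization $\Lambda_{\bm{\alpha}}(\bm{t})=0.5\left\|\bm{t}\right\|_{2}(\left\|\bm{t}\right\|_{2}+\left\langle\bm{\alpha},\bm{t}\right\rangle)$ . The observations and both forecasts are made-up values, not data from the paper.

```python
import math


def score(alpha, forecast, observation):
    """Assumed form S_alpha(forecast, observation) = Lambda_alpha(observation - forecast)."""
    t = [y - x for x, y in zip(forecast, observation)]
    norm = math.sqrt(sum(ti * ti for ti in t))
    inner = sum(a * ti for a, ti in zip(alpha, t))
    return 0.5 * norm * (norm + inner)


def mean_score(alpha, forecast, observations):
    """Average score of one fixed point forecast over all observations."""
    return sum(score(alpha, forecast, y) for y in observations) / len(observations)


# Hypothetical realized observations and two competing point forecasts.
observations = [(0.1, 1.0), (-0.4, 0.2), (1.3, -0.5), (0.6, 0.9), (-1.1, -0.3)]
alpha = (0.5, 0.0)
forecast_a = (0.4, 0.3)   # hypothetical forecast of the geometric expectile from model A
forecast_b = (2.0, 2.0)   # hypothetical forecast of the geometric expectile from model B

score_a = mean_score(alpha, forecast_a, observations)
score_b = mean_score(alpha, forecast_b, observations)
preferred = "A" if score_a < score_b else "B"   # the lower average score is preferred
```

In an actual backtest the forecasts would typically change over time and the average would run over forecast and observation pairs; the fixed forecasts here only keep the sketch short.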

The scoring function $S_{\bm{\alpha}}$ tied to geometric expectiles is furthermore positively homogeneous of order two, as shown in Proposition 4.5 below. Efron (1991) highlights the necessity of positive homogeneity, or scale invariance, in an estimation context. Scale invariance and estimation of scale are also central to the theory of robust statistics; see, for example, Huber and Ronchetti (2009). Patton (2011) furthermore argues for homogeneity in the context of forecast rankings, as the rankings obtained from a homogeneous scoring function are invariant to a re-scaling of the underlying data. See also Gneiting (2011) and Nolde and Ziegel (2017) for a discussion in the context of univariate expectiles.

By establishing the positive homogeneity of $S_{\bm{\alpha}}$ we prepare the ground for similar applications in the multivariate case.

Proposition 4.5 (Positive homogeneity of $S_{\bm{\alpha}}$ of order $2$ ).

For $c>0$ and $(\bm{x},\bm{y})\in\mathbb{R}^{d}\times\mathbb{R}^{d}$ , $S_{\bm{\alpha}}(c\bm{x},c\bm{y})=c^{2}S_{\bm{\alpha}}(\bm{x},\bm{y})$ .

Proof.

Using basic properties of norms and inner products we have that

[TABLE]

Univariate expectiles are attractive risk measures due to their coherence, of which sub-additivity is a cornerstone. For any random variables $X$ and $Y$ , univariate expectiles are sub-additive, $e_{\alpha}(X+Y)\leqslant e_{\alpha}(X)+e_{\alpha}(Y)$ , when $\alpha\geqslant 0.5$ , and super-additive, $e_{\alpha}(X+Y)\geqslant e_{\alpha}(X)+e_{\alpha}(Y)$ , when $\alpha\leqslant 0.5$ . It is important to recognize that $e_{0.5}(X)=\mathbb{E}[X]$ , i.e., there is one point which separates the sub- and super-additive cases.
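The univariate behaviour can be reproduced on a paired sample. The Python sketch below computes sample expectiles by iteratively reweighted means, a standard technique chosen here for brevity and not taken from the paper, and evaluates the three cases $\alpha=0.9$ , $\alpha=0.1$ and $\alpha=0.5$ . The simulated data, the seed and the function name `expectile` are illustrative assumptions.

```python
import random


def expectile(xs, alpha, tol=1e-12, max_iter=200):
    """Sample expectile at level alpha in (0, 1) via iteratively reweighted means.

    Observations above the current value get weight alpha, all others 1 - alpha.
    """
    e = sum(xs) / len(xs)
    for _ in range(max_iter):
        weights = [alpha if x > e else 1.0 - alpha for x in xs]
        e_new = sum(w * x for w, x in zip(weights, xs)) / sum(weights)
        if abs(e_new - e) < tol:
            return e_new
        e = e_new
    return e


random.seed(1)
x = [random.gauss(0.0, 1.0) for _ in range(500)]
y = [xi + random.expovariate(1.0) for xi in x]      # skewed and dependent on x
total = [xi + yi for xi, yi in zip(x, y)]

# Sum of the individual expectiles minus the expectile of the sum.
gap_high = expectile(x, 0.9) + expectile(y, 0.9) - expectile(total, 0.9)   # sub-additive case
gap_low = expectile(x, 0.1) + expectile(y, 0.1) - expectile(total, 0.1)    # super-additive case
gap_mid = expectile(x, 0.5) + expectile(y, 0.5) - expectile(total, 0.5)    # mean: additive
```

Because the empirical distribution of the pairs is itself a probability distribution, the inequalities apply to the sample versions as well, so `gap_high` should be non-negative, `gap_low` non-positive and `gap_mid` zero up to rounding.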

While the univariate notions of sub- and superadditivity are based on the ordering in $\mathbb{R}$ , the multivariate case has no canonical ordering for $\mathbb{R}^{d}$ , $d\geqslant 2$ . To circumvent this issue we utilize set inclusions that continue to be valid in higher dimensions. Reconsidering the univariate case we can see that for any interval $I\subseteq(0,1)$ that includes $0.5$ we have $\{x\in\mathbb{R}:x=e_{\alpha}(X+Y),\alpha\in I\}\subseteq\{x\in\mathbb{R}:x=e_{\alpha}(X)+e_{\alpha}(Y),\alpha\in I\}$ . To propose a multivariate generalization based on this observation we replace the interval $I$ with a closed ball in $B$ .

Definition 4.4 (Multivariate subadditivity for geometric risk measures).

Denote by $\bm{X}$ and $\bm{Y}$ two $d$ -dimensional random vectors, and by $\rho_{\bm{\alpha}}$ a geometric risk measure based on an index $\bm{\alpha}\in B$ . For $0<r<1$ define the sets

[TABLE]

A multivariate geometric risk measure $\rho_{\bm{\alpha}}$ is multivariate sub-additive, if

[TABLE]

for all $0<r<1$ .

We use the numerical techniques discussed in Section 5 for a two-dimensional illustration. To this end we introduce a random vector $\bm{Z}=(Z_{1},\ldots,Z_{4})$ , where the first margin $Z_{1}$ follows a Gumbel distribution, $Z_{2}\sim t_{4}$ , $Z_{3}$ follows a standard logistic distribution and $Z_{4}\sim\mathcal{N}(0,1)$ . To introduce dependence between the components of $\bm{Z}$ we join the margins by a four-dimensional Clayton copula $C_{\theta}$ with parameter $\theta=5$ . The bivariate random vectors are then given as $\bm{X}=(Z_{1},Z_{2})$ and $\bm{Y}=(Z_{3},Z_{4})$ .

In Figure 2 (left) we show $A_{0.2}(\bm{X}+\bm{Y};\operatorname{VaR})$ and $A_{0.2}(\bm{X},\bm{Y};\operatorname{VaR})$ , where it is clearly visible that geometric $\operatorname{VaR}$ is not multivariate sub-additive, which is in line with the univariate case. This behaviour can be explained by focusing on the case $\bm{\alpha}=\bm{0}$ , in which geometric $\operatorname{VaR}$ is the minimizer of the expected Euclidean distance

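$$\operatorname{VaR}_{\bm{0}}(\bm{X})=\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\mathbb{E}[\left\|\bm{X}-\bm{c}\right\|_{2}].$$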

In general, there is no reason for the resulting optimum to be additive, i.e., for $\operatorname{VaR}_{\bm{0}}(\bm{X}+\bm{Y})=\operatorname{VaR}_{\bm{0}}(\bm{X})+\operatorname{VaR}_{\bm{0}}(\bm{Y})$ to hold. Given that the sets $A_{r}(\bm{X}+\bm{Y};\operatorname{VaR})$ and $A_{r}(\bm{X},\bm{Y};\operatorname{VaR})$ reduce to $\operatorname{VaR}_{\bm{0}}(\bm{X}+\bm{Y})$ and $\operatorname{VaR}_{\bm{0}}(\bm{X})+\operatorname{VaR}_{\bm{0}}(\bm{Y})$ when $\bm{\alpha}\to\bm{0}$ , the sets necessarily intersect for some $r$ whenever $\operatorname{VaR}_{\bm{0}}(\bm{X}+\bm{Y})\neq\operatorname{VaR}_{\bm{0}}(\bm{X})+\operatorname{VaR}_{\bm{0}}(\bm{Y})$ . This behaviour is shown on the left in Figure 2.

In Figure 2 (right) we show $A_{0.2}(\bm{X}+\bm{Y};e)$ and $A_{0.2}(\bm{X},\bm{Y};e)$ . In this case we observe $A_{0.2}(\bm{X}+\bm{Y};e)\subseteq A_{0.2}(\bm{X},\bm{Y};e)$ . Contrary to geometric $\operatorname{VaR}$ , we have $e_{\bm{0}}(\bm{X})=\mathbb{E}[\bm{X}]$ in the case of geometric expectiles and therefore the additivity $e_{\bm{0}}(\bm{X}+\bm{Y})=e_{\bm{0}}(\bm{X})+e_{\bm{0}}(\bm{Y})$ . Constructing a counterexample to multivariate sub-additivity along the same lines as for $\operatorname{VaR}$ is therefore ruled out. Although numerical checks for a number of different joint models and $r$ -levels suggest that geometric expectiles are multivariate sub-additive, a formal proof is not available at this point.

4.3. Asymptotics and Estimation

In Section 4 we have established that geometric expectiles defined in (6) are a well-defined functional for random vectors with finite marginal second moments. In terms of practical applications, this raises two questions. First, the computation of closed-form solutions of $e_{\bm{\alpha}}(\bm{X})$ might not be possible for a given random vector $\bm{X}$ and numerical approximation needs to be invoked instead. Second, in practical applications it is necessary to establish that a sample version of $e_{\bm{\alpha}}(\bm{X})$ is a consistent estimator of $e_{\bm{\alpha}}(\bm{X})$ . While the implicit definition of $e_{\bm{\alpha}}(\bm{X})$ might seem challenging at first, our functional falls into the well-established framework of M-estimators; see Huber and Ronchetti (2009) for an introduction.

To discuss consistency, we denote by $(\bm{X}_{i})_{i=1}^{\infty}$ a sequence of independent and identically distributed (iid) random vectors with the same distribution as $\bm{X}$ . While a generalization to ergodic and (weakly) stationary random vectors is straightforward, we focus on the iid case for ease of presentation. For a finite sample we replace the expectation in (6) by the sample average. This provides a finite sample version, or Monte Carlo estimator, of $\phi$ defined in (9) by

$$\phi_{n}(\bm{c})=\frac{1}{n}\sum_{i=1}^{n}\Lambda_{\bm{\alpha}}(\bm{X}_{i}-\bm{c}),$$

where we immediately get $\phi_{n}(\bm{c})\overset{a.s.}{\longrightarrow}\phi(\bm{c})$ (and thus $\phi_{n}(\bm{c})\overset{p}{\longrightarrow}\phi(\bm{c})$ ) from the strong law of large numbers. To also guarantee the convergence of the minimizers we invoke Proposition 7.4 of Hayashi (2000).

Corollary 4.2 (Consistency).

If (C) holds for a $d$ -dimensional random vector $\bm{X}$ then

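$$\operatorname*{argmin}_{\bm{c}\in\mathbb{R}^{d}}\phi_{n}(\bm{c})\overset{p}{\longrightarrow}e_{\bm{\alpha}}(\bm{X})\quad\text{as }n\to\infty.$$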

Proof.

We apply Proposition 7.4 of Hayashi (2000) which guarantees the consistency of M-estimators. By Theorem 4.8 $\phi(\bm{c})$ is uniquely minimized on $\mathbb{R}^{d}$ at $e_{\bm{\alpha}}(\bm{X})$ , and $\phi(\bm{c})$ exists and is finite for all $\bm{c}\in\mathbb{R}^{d}$ . Furthermore, $\Lambda_{\bm{\alpha}}$ is convex. While the existence of a minimizer in Proposition $7.4$ in Hayashi (2000) is only asymptotic, it is clear that a minimizer exists for every $n\in\mathbb{N}$ in our case. ∎

Corollary 4.2 also suggests a simple approach to compute $e_{\bm{\alpha}}(\bm{X})$ when a closed-form solution cannot be established. If a sampling method for $\bm{X}$ is available, then replacing the expectation by an empirical mean yields a valid approximation. Denoting by $(\bm{x}_{i})_{i=1}^{n}$ a realization of a sequence of random vectors $(\bm{X}_{i})_{i=1}^{n}$ , either obtained by simulation or from real data, it is also important to notice that $\phi_{n}$ is strictly convex.

Corollary 4.3 (Strict convexity of $\phi_{n}$ ).

Denote by $(\bm{x}_{i})_{i=1}^{n}$ a sequence of vectors in $\mathbb{R}^{d}$ . The function $\phi_{n}\colon\mathbb{R}^{d}\to\mathbb{R},\quad\bm{c}\mapsto\phi_{n}(\bm{c})=\frac{1}{n}\sum_{i=1}^{n}\Lambda_{\bm{\alpha}}(\bm{x}_{i}-\bm{c})$ is strictly convex.

Proof.

Given that $\phi_{n}$ is a convex combination of strictly convex functions the proof follows from basic properties of convex functions. ∎

The importance of Corollary 4.3 is that the minimization

$$\min_{\bm{c}\in\mathbb{R}^{d}}\frac{1}{n}\sum_{i=1}^{n}\Lambda_{\bm{\alpha}}(\bm{x}_{i}-\bm{c})$$

is well behaved also in the finite sample case and there exists a unique minimizer that is consistent for the functional according to Corollary 4.2. The minimizer, i.e., the finite sample version of $e_{\bm{\alpha}}(\bm{X})$ , can then be obtained by numerical minimization techniques.
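One possible way to carry out this minimization is plain gradient descent on $\phi_{n}$ , sketched below in Python. This is an illustrative choice and not necessarily the routine used for the figures in this paper. The sketch assumes the normalization $\Lambda_{\bm{\alpha}}(\bm{t})=0.5\left\|\bm{t}\right\|_{2}(\left\|\bm{t}\right\|_{2}+\left\langle\bm{\alpha},\bm{t}\right\rangle)$ , whose gradient for $\bm{t}\neq\bm{0}$ is $\bm{t}+0.5(\left\langle\bm{\alpha},\bm{t}\right\rangle\bm{t}/\left\|\bm{t}\right\|_{2}+\left\|\bm{t}\right\|_{2}\bm{\alpha})$ . The function names, the step size, the tolerance and the simulated sample are all assumptions made for the example.

```python
import math
import random


def grad_loss(alpha, t):
    """Gradient in t of Lambda_alpha(t) = 0.5 * ||t|| * (||t|| + <alpha, t>)."""
    norm = math.sqrt(sum(ti * ti for ti in t))
    if norm == 0.0:
        return [0.0] * len(t)                      # stationary point at the origin
    inner = sum(a * ti for a, ti in zip(alpha, t))
    return [ti + 0.5 * (inner * ti / norm + norm * a) for a, ti in zip(alpha, t)]


def grad_phi_n(alpha, c, xs):
    """Gradient in c of phi_n(c) = (1/n) * sum_i Lambda_alpha(x_i - c)."""
    g = [0.0] * len(c)
    for x in xs:
        gi = grad_loss(alpha, [xj - cj for xj, cj in zip(x, c)])
        for j in range(len(c)):
            g[j] -= gi[j] / len(xs)                # chain rule: d(x_i - c)/dc = -I
    return g


def sample_expectile(alpha, xs, step=0.4, tol=1e-10, max_iter=5000):
    """Fixed-step gradient descent on phi_n, started at the sample mean."""
    d = len(xs[0])
    c = [sum(x[j] for x in xs) / len(xs) for j in range(d)]
    for _ in range(max_iter):
        g = grad_phi_n(alpha, c, xs)
        if math.sqrt(sum(gj * gj for gj in g)) < tol:
            break
        c = [cj - step * gj for cj, gj in zip(c, g)]
    return c


random.seed(42)
xs = [(random.gauss(0.0, 1.0), random.gauss(0.0, 1.0)) for _ in range(300)]
e_zero = sample_expectile((0.0, 0.0), xs)    # alpha = 0 recovers the sample mean
e_alpha = sample_expectile((0.5, 0.3), xs)   # a moderately large index
```

Under the assumed normalization the Hessian of the loss has operator norm at most $1+1.5\left\|\bm{\alpha}\right\|_{2}<2.5$ , so a step size of $0.4$ is a conservative choice. For indices with norm close to one the convergence can become slow, and a second-order or quasi-Newton method may be preferable.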

5 Illustration

In this section, we discuss a special case for which it is possible to obtain a closed-form expression for multivariate geometric expectiles. Moreover, we provide numerical illustrations for a number of different random vectors in order to highlight the impact of changing margins and dependence structures.

5.1. Analytic Solution for the Uniform Distribution

We consider the case of a bivariate uniform distribution and denote by $\bm{U}=(U_{1},U_{2})$ a random vector with density $\frac{1}{(b_{1}-a_{1})(b_{2}-a_{2})}$ where $b_{j}>a_{j}$ and $b_{j},a_{j}\in\mathbb{R}$ for $j=1,2$ . We first compute the expectation of the squared norm in terms of $\bm{c}=(c_{1},c_{2})$ as

[TABLE]

Defining the real valued functions $h_{1}$ and $h_{2}$ as

[TABLE]

we further have that

[TABLE]

Therefore,

[TABLE]

This finally leads to

[TABLE]

where we define $g_{2}$ analogously as $g_{2}(c_{1},c_{2})=\mathbb{E}[\left\|\bm{U}-\bm{c}\right\|_{2}(U_{2}-c_{2})]$ . Taking the preceding results together, we have for $\bm{\alpha}=(\alpha_{1},\alpha_{2})$ that

[TABLE]

The geometric expectiles $e_{\bm{\alpha}}(\bm{U})$ are now found as

[TABLE]

This example highlights more than anything that finding a closed-form solution can be challenging even in the simplest of cases. In this sense, the numerical approximation introduced in Section 4.3 takes a more prominent role. Full-fledged examples utilizing this method can be found in the following sections.

5.2. Numerical Illustration

In this section we visualize geometric expectiles for selected bivariate random vectors. To this end, we define four random vectors $\bm{X}_{1},\ldots,\bm{X}_{4}$ with different margins and dependence structures; see Table 1. The dependence structure is formalized in terms of copulas, see, for example, Nelsen (2006) or Joe (2014) for textbook introductions. As a baseline for our comparison, $\bm{X}_{1}=(X_{11},X_{12})$ follows a bivariate normal distribution with independent standard normal margins. Considering $\bm{X}_{2}$ we keep the independence between the components, but we change the margins. $X_{21}$ now follows a skew normal distribution, see Azzalini (1985), with parameters $(\xi,\omega,\alpha)=(-1,1,2)$ and $X_{22}$ follows a Student $t$ distribution with $\nu=4$ degrees of freedom. In case of $\bm{X}_{3}$ we only change the dependence structure compared to $\bm{X}_{1}$ , that is $X_{31}$ and $X_{32}$ still follow a standard normal distribution each but the dependence structure is now given by a Gumbel copula with parameter $\theta=2$ . Finally $\bm{X}_{4}$ differs from $\bm{X}_{1}$ in terms of margins and dependence structure, where we employ the skew normal and Student $t$ margins of $\bm{X}_{2}$ with the Gumbel dependence structure of $\bm{X}_{3}$ .

To illustrate the impact of different indices we consider two parameterizations for $\bm{\alpha}$ . First, we choose $\bm{\alpha}$ according to $\bm{\alpha}_{1}(\varphi)=0.98(\cos(\varphi),\sin(\varphi))$ , $\varphi\in[0,2\pi)$ , which describes a circle of radius $0.98$ . The magnitude $0.98$ corresponds to a confidence level of $0.99$ in the univariate case. Second, we choose $\bm{\alpha}$ according to $\bm{\alpha}_{2}(\varphi)=(0.98\cos(\varphi),0.90\sin(\varphi))$ , $\varphi\in[0,2\pi)$ , which describes an ellipse in $B$ , where a magnitude of $0.90$ corresponds to a confidence level of $0.95$ in the univariate case. Both choices of indices are visualized in Figure 3. For further reference we indicate the resulting indices $\bm{\alpha}_{j}(\varphi_{k})$ , $j\in\{1,2\}$ , for $\varphi_{k}=k2\pi/8$ , $k\in\{0,\ldots,7\}$ , by the respective value of $k$ . As there are no closed-form solutions available to compute $e_{\bm{\alpha}_{j}(\varphi)}(\bm{X}_{\ell})$ , $\ell\in\{1,2,3,4\}$ , we instead draw an iid sample of size $10,000$ from the respective distribution of $\bm{X}_{\ell}$ and utilize the numerical procedure outlined in Section 4.3; i.e., we use Monte Carlo integration.
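The two index parameterizations and the marked angles $\varphi_{k}$ are easy to generate. The Python sketch below does so and converts an index magnitude $m$ into the corresponding univariate confidence level through $m=2\alpha_{1}-1$ , the adjustment between levels and index magnitudes that the paper uses in Section 5.3. The function names are hypothetical.

```python
import math


def alpha_circle(phi, radius=0.98):
    """First parameterization: a circle of the given radius inside the unit ball."""
    return (radius * math.cos(phi), radius * math.sin(phi))


def alpha_ellipse(phi, a=0.98, b=0.90):
    """Second parameterization: an ellipse with semi-axes a and b inside the unit ball."""
    return (a * math.cos(phi), b * math.sin(phi))


def level_from_magnitude(m):
    """Univariate confidence level matching index magnitude m, from m = 2 * level - 1."""
    return 0.5 * (m + 1.0)


# The eight marked angles phi_k = k * 2 * pi / 8, k = 0, ..., 7.
phis = [k * 2.0 * math.pi / 8 for k in range(8)]
circle = [alpha_circle(p) for p in phis]
ellipse = [alpha_ellipse(p) for p in phis]

# Largest Euclidean norm over all generated indices; it must stay below one.
max_norm = max(math.hypot(a1, a2) for a1, a2 in circle + ellipse)
```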

Figure 4 shows the resulting geometric expectiles and density contour lines for $\bm{X}_{1}$ (top left), $\bm{X}_{2}$ (top right), $\bm{X}_{3}$ (bottom left) and $\bm{X}_{4}$ (bottom right). The gray lines indicate the density contours of the underlying bivariate distribution function. To indicate the effects of different index choices, the solid orange line represents the resulting geometric expectiles $e_{\bm{\alpha}_{1}(\varphi)}(\bm{X}_{\ell})$ , $\ell\in\{1,\ldots,4\}$ , for $\varphi\in[0,2\pi)$ . Likewise, the solid green line indicates $e_{\bm{\alpha}_{2}(\varphi)}(\bm{X}_{\ell})$ , $\ell\in\{1,\ldots,4\}$ , for $\varphi\in[0,2\pi)$ . In concordance with Figure 3 we mark the resulting geometric expectiles $e_{\bm{\alpha}_{j}(\varphi_{k})}(\bm{X}_{\ell})$ for indices $\bm{\alpha}_{j}(\varphi_{k})$ based on $\varphi_{k}=k2\pi/8$ , $k\in\{0,\ldots,7\}$ , by the respective value of $k$ .

From Figure 4 it becomes apparent that geometric expectiles adapt to the underlying distribution. For the radially symmetric distribution of $\bm{X}_{1}$ (top left panel) the lines indicating $e_{\bm{\alpha}_{j}(\varphi)}(\bm{X}_{1})$ for all possible $\varphi\in[0,2\pi)$ resemble the shape of the index $\bm{\alpha}_{j}(\varphi)$ . Furthermore we visually observe the symmetry established in Proposition 4.4. However, for skewed and heavier tailed margins (top right panel) the geometric expectiles adapt by bulging out. This also slightly changes the orientation in that, for example, $e_{\bm{\alpha}_{1}(\varphi_{2})}(\bm{X}_{2})$ is not centered on the $y$ -axis anymore. Introducing dependence between the components of $\bm{X}_{3}$ (bottom left panel) forces the geometric expectiles to deform. The deformation, compared to the top left panel, is, however, not by bulging out as in the top right panel, but rather by compressing and rotating. Finally when combining both effects in $\bm{X}_{4}$ (bottom right panel) we see that geometric expectiles widen and deform according to a superposition of the previously observed effects.

5.3. Comparing Geometric Value-at-Risk and Expectiles

In continuation of the numerical examples in Section 5.2 we now discuss differences between geometric $\operatorname{VaR}$ and geometric expectiles, as well as their univariate counterparts. For a fixed $\alpha_{1}\in(0,1)$ we therefore consider the corresponding index $\bm{\alpha}=(2\alpha_{1}-1)(1,0)$ , where we make the necessary adjustment to the magnitude of the index discussed in Section 3.1. We then compute the univariate $\operatorname{VaR}_{\alpha_{1}}(X_{11})$ and expectile $e_{\alpha_{1}}(X_{11})$ at level $\alpha_{1}$ for the first component of $\bm{X}_{1}=(X_{11},X_{12})$ , see Table 1, and also the geometric $\operatorname{VaR}_{\bm{\alpha}}(\bm{X}_{1})$ and geometric expectile $e_{\bm{\alpha}}(\bm{X}_{1})$ based on $\bm{\alpha}$ . Comparing the univariate risk measures to the first component of their multivariate counterparts in Figure 5, we see that the multivariate risk measures are more conservative, i.e., higher in absolute value. In fact, the geometric $\operatorname{VaR}$ provides the most conservative reserve estimates for a given level $\alpha_{1}$ , while the univariate expectiles are the least conservative for the same level.

We further compare geometric $\operatorname{VaR}$ and geometric expectiles by computing the magnitude for a given direction that leads to equal values in each component of the resulting multivariate risk measure. We therefore fix an element $\bm{u}\in\partial B=\{\bm{x}\in\mathbb{R}^{2}:\left\|\bm{x}\right\|_{2}=1\}$ and $\theta\in[0,1)$ to obtain a starting index $\bm{\alpha}=\theta\bm{u}$ for which we compute $e_{\bm{\alpha}}(\bm{X}_{1})$ . For $m\in[0,1)$ we then aim to find an optimal $m^{*}$ that yields $\operatorname{VaR}_{m^{*}\bm{u}}(\bm{X}_{1})=e_{\bm{\alpha}}(\bm{X}_{1})$ in the least-squares sense, that is

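$$m^{*}=\operatorname*{argmin}_{m\in[0,1)}\left\|\operatorname{VaR}_{m\bm{u}}(\bm{X}_{1})-e_{\bm{\alpha}}(\bm{X}_{1})\right\|_{2}^{2}.$$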

In Figure 6 we show the resulting plot for values of $\theta$ in $[0,0.999]$ and $\bm{u}=(1,1)/\sqrt{2}$ . In line with Figure 5 we find that the associated magnitude $m^{*}$ for the geometric $\operatorname{VaR}$ is lower than the corresponding magnitude $\theta$ for the geometric expectiles. That is to say that the geometric $\operatorname{VaR}$ is more conservative than geometric expectiles in this example.

5.4. Higher Dimensional Marginalization

While Section 5.3 has compared bivariate geometric expectiles to their univariate counterparts, it is of interest to compare geometric expectiles applied to higher-dimensional margins to those applied to the full joint distribution. Denote by $\bm{X}=(X_{1},\ldots,X_{d})$ a random vector of dimension $d$ , and by $\bm{Y}$ a sub-vector of $\bm{X}$ of dimension $k<d$ . Without loss of generality we assume $\bm{Y}=(X_{1},\ldots,X_{k})$ . Comparing $e_{\bm{\alpha}}(\bm{X})$ to $e_{\bm{\beta}}(\bm{Y})$ is challenging since the dimensions of the respective indices $\bm{\alpha}$ and $\bm{\beta}$ as well as the resulting vectors differ. Disregarding the choice of indices for now, it seems sensible to compare the first $k$ entries of $e_{\bm{\alpha}}(\bm{X})$ to $e_{\bm{\beta}}(\bm{Y})$ . This comparison would then focus on differences introduced by the dependence of $(X_{1},\ldots,X_{k})$ on $(X_{k+1},\ldots,X_{d})$ , which is neglected in $e_{\bm{\beta}}(\bm{Y})$ . Concerning the choice of indices $\bm{\alpha}$ and $\bm{\beta}$ , different scenarios are possible. One choice is to first choose $\bm{\beta}\in B^{k}$ and then set $\bm{\alpha}=(\bm{\beta},0,\ldots,0)$ . In this case $\left\|\bm{\alpha}\right\|_{2}=\left\|\bm{\beta}\right\|_{2}$ and $\bm{\alpha}\in B^{d}$ . Alternatively, the vector $\bm{\alpha}$ can be filled up with a vector $\bm{z}$ of non-zero values, that is $\bm{\alpha}=(\bm{\beta},\bm{z})$ . In this case the condition $\left\|\bm{\alpha}\right\|_{2}<1$ must hold whatever non-zero values are chosen. Since $\left\|\bm{\alpha}\right\|_{2}^{2}=\left\|\bm{\beta}\right\|_{2}^{2}+\left\|\bm{z}\right\|_{2}^{2}$ , we have $\left\|\bm{\alpha}\right\|_{2}<1$ if and only if $\left\|\bm{z}\right\|_{2}<\sqrt{1-\left\|\bm{\beta}\right\|_{2}^{2}}$ .

To illustrate the effect of marginalization we consider the case $d=3$ with $\bm{X}=(X_{1},X_{2},X_{3})$ and $\bm{Y}=(X_{1},X_{2})$ . We further set $\bm{\beta}_{r}(t)=r(\cos(t),\sin(t))^{\operatorname{\top}}$ where $0<r<1$ and $t\in[0,2\pi)$ . For $\bm{\alpha}_{r}(t)=(\bm{\beta}_{r}(t),z(r))$ the possible values of $z(r)$ as a function of $r$ are then limited to the interval $(-\sqrt{1-r^{2}},\sqrt{1-r^{2}})$ to ensure $\left\|\bm{\alpha}\right\|_{2}<1$ .

For the illustration the first marginal distribution $X_{1}$ of $\bm{X}$ follows a Gumbel distribution, $X_{2}\sim t_{4}$ and $X_{3}$ follows a standard logistic distribution, while the dependence structure is given in terms of a Clayton copula with parameter $\theta=5$ . Consequently, $\bm{Y}=(X_{1},X_{2})$ has the same Gumbel and $t_{4}$ margins, also joined by a Clayton copula with parameter $\theta=5$ . In Figure 7 we show the resulting geometric expectiles $e_{\bm{\beta}_{r}(t)}(\bm{Y})$ and the first two components of $e_{\bm{\alpha}_{r}^{i}(t)}(\bm{X})$ , $i\in\{1,\ldots,7\}$ , where $\bm{\alpha}_{r}^{i}(t)=(\bm{\beta}_{r}(t),\ (-\tfrac{3}{4}+(i-1)\tfrac{1}{4})\sqrt{1-r^{2}})$ . Figure 7 shows the results for $r=0.1$ (top left), $r=0.2$ (top right), $r=0.5$ (bottom left) and $r=0.9$ (bottom right). From the figure we see that multiple intersections between the expectile curves $e_{\bm{\beta}_{r}(t)}(\bm{Y})$ and the first two components of $e_{\bm{\alpha}_{r}^{i}(t)}(\bm{X})$ , $i\in\{1,\ldots,7\}$ , are possible. There is, however, one exception: in the case of $\bm{\alpha}_{r}^{4}(t)$ , whose third component is zero, we see that $e_{\bm{\beta}_{r}(t)}(\bm{Y})$ (orange) is always contained in the respective expectile curve based on $\bm{\alpha}_{r}^{4}(t)$ (black). For this choice of $\bm{\alpha}$ the numerical result suggests that the geometric expectiles of the sub-vector $\bm{Y}$ are, as a set, contained in the respective components of the geometric expectiles of the full vector $\bm{X}$ . A partial explanation is that the components $(X_{k+1},\ldots,X_{d})$ and their dependence with $(X_{1},\ldots,X_{k})$ are not taken into consideration at all when computing $e_{\bm{\beta}_{r}(t)}(\bm{Y})$ . While setting the respective elements in $\bm{\alpha}^{4}$ to zero does eliminate the inner product terms associated with $(X_{k+1},\ldots,X_{d})$ in (6), see also (5), these components still contribute to the objective function via the norm term when computing $e_{\bm{\alpha}_{r}^{4}(t)}(\bm{X})$ . While this leads to comparatively wider spread contours, forcing $\left\|\bm{\alpha}^{4}(t)\right\|_{2}=\left\|\bm{\beta}(t)\right\|_{2}$ keeps the results comparable.
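The family of indices $\bm{\alpha}_{r}^{i}(t)$ used in this illustration can be written down in a few lines. The following is a minimal sketch of the index construction only (it does not compute any expectiles); the function names `padded_indices` and `norm2` are hypothetical helpers introduced here for illustration.

```python
import math


def norm2(v):
    """Euclidean norm of a tuple of floats."""
    return math.sqrt(sum(x * x for x in v))


def padded_indices(r, t):
    """Return the seven indices alpha_r^i(t) = (beta_r(t), z_i), i = 1, ..., 7.

    beta_r(t) = r * (cos t, sin t) and
    z_i = (-3/4 + (i - 1)/4) * sqrt(1 - r^2), as in the text.
    """
    if not 0.0 < r < 1.0:
        raise ValueError("r must lie in (0, 1)")
    beta = (r * math.cos(t), r * math.sin(t))
    bound = math.sqrt(1.0 - r * r)  # |z| has to stay strictly below this value
    return [beta + ((-0.75 + (i - 1) * 0.25) * bound,) for i in range(1, 8)]


# Every index lies in the open unit ball; the fourth one has third component zero,
# so its norm coincides with the norm r of beta_r(t).
for alpha in padded_indices(0.5, 1.0):
    print(alpha, norm2(alpha) < 1.0)
```

Since $|z_{i}|\leq\tfrac{3}{4}\sqrt{1-r^{2}}$ , all seven indices satisfy the norm constraint for every $0<r<1$ .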

In Figure 8 we also compute geometric $\operatorname{VaR}_{\bm{\beta}_{r}(t)}(\bm{Y})$ and $\operatorname{VaR}_{\bm{\alpha}_{r}^{i}(t)}(\bm{X})$ , $i\in\{1,\ldots,7\}$ , for the same joint model $\bm{X}$ with $r=0.1$ . From the figure it is clear that geometric value-at-risk does not exhibit the ordering for indices $\bm{\beta}(t)$ and $\bm{\alpha}_{r}^{4}(t)$ previously observed for geometric expectiles.

5.5. Bounded Random Vectors

In this section we study the effect of applying geometric expectiles to a bounded random vector. To this end we assume that $\bm{X}$ is distributed according to a Clayton copula $C_{\theta}$ with parameter $\theta=5$ , that is $\bm{X}\sim C_{5}$ , and compute $e_{\bm{\alpha}(t)}(\bm{X})$ for $\bm{\alpha}(t)=r(\cos(t),\sin(t))^{\operatorname{\top}}$ with $t\in[0,2\pi)$ and $r\in\{0.1,0.2,\ldots,0.9,0.95,0.99,0.9995,0.9999,0.99999\}$ . For extreme indices $\bm{\alpha}$ the geometric expectile contours can be outside the support of $\bm{X}$ as shown in Figure 9. Likewise, they can be outside of the convex hull of the data in an estimation setting. This is in line with geometric $\operatorname{VaR}$ , where $\left\|\operatorname{VaR}_{\bm{\alpha}}(\bm{X})\right\|_{2}\to\infty$ as $\left\|\bm{\alpha}\right\|_{2}\to 1$ , see Girard and Stupfler (2017). To further study the behaviour when the norm of the underlying index tends to one we (numerically) study the function

$$d(r)=\left\|e_{\bm{\alpha}(r)}(\bm{X})-\mathbb{E}[\bm{X}]\right\|_{2},$$

where $\bm{\alpha}(r)=r\bm{u}$ for a fixed $\bm{u}$ with $\left\|\bm{u}\right\|_{2}=1$ and $0<r<1$ . In Figure 10 we show an example of $d(r)$ for a four dimensional joint distribution when $\bm{u}=-(1,1,1,1)/\sqrt{4}$ . For the illustration the first marginal distribution $X_{1}$ follows a Gumbel distribution, $X_{2}\sim t_{4}$ , $X_{3}$ follows a standard logistic distribution and $X_{4}\sim\mathcal{N}(0,1^{2})$ . The dependence structure is given in terms of a Frank copula with parameter $\theta=3$ . Based on the numerical experiments it seems that $d(r)$ is monotonically increasing in $r$ with no limit in $\mathbb{R}$ . Aside from their related definitions this observation further supports the idea that geometric expectiles behave comparably to geometric $\operatorname{VaR}$ for extreme indices $\bm{\alpha}$ . This potentially opens an avenue for studying the behaviour of geometric expectiles in a multivariate extreme value theory framework along the lines of Girard and Stupfler (2015).

5.6. Example Application

To demonstrate how geometric expectiles can be used in a practical scenario we consider a data generating process that generalizes the well-known compound Poisson model. By $\bm{E}=(E_{1},E_{2})$ we denote a random vector with exponentially distributed margins $E_{\ell}\sim\operatorname{Exp}(\beta_{\ell})$ , $\ell=1,2$ . The dependence structure between the components of $\bm{E}$ is given by a Clayton copula $C_{\theta}$ with parameter $\theta>0$ . For a Poisson random variable $N\sim\operatorname{Pois}(\lambda)$ our final random vector $\bm{X}=(X_{1},X_{2})$ is then given by

$$\bm{X}=(X_{1},X_{2})=\sum_{k=1}^{N}\bm{E}_{k},$$

where $\bm{E}_{k}$ is an independent (of $N$ and $\bm{E}_{j}$ for $j\neq k$ ) copy of $\bm{E}$ . By construction we see that $X_{j}$ , $j\in\{1,2\}$ , is a compound Poisson model with exponentially distributed severities. All in all the model captures the situation where a random number of risks occur together, and the components of each incident are not independent. Our example is motivated by vehicle insurance that can cover medical payments for the insured party as well as physical damage to the insured vehicle. (Coverage of medical costs depends on the respective jurisdiction. Vehicle insurance policies that cover medical and physical damage are common in the USA. On the other hand, there are, for example, no such products in the Québec province of Canada since medical costs are in this case taken over by the province.) From the point of view of the insurance company there will be a random number of accidents, where it is reasonable to assume a positive dependence between both components of the policy.
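To make the data generating process concrete, the following sketch simulates from it using only the Python standard library. It reads the model as $\bm{X}=\sum_{k=1}^{N}\bm{E}_{k}$ with the empty sum equal to zero, and it makes two assumptions that are not stated in the text: $\operatorname{Exp}(\beta_{\ell})$ is parametrized by its rate (so the mean severity is $1/\beta_{\ell}$ ), and the Clayton copula is sampled with the standard Marshall-Olkin gamma frailty construction. All function names are hypothetical.

```python
import math
import random


def sample_clayton_pair(theta, rng):
    """One draw (U1, U2) from a bivariate Clayton copula via a gamma frailty."""
    v = rng.gammavariate(1.0 / theta, 1.0)
    return tuple((1.0 + rng.expovariate(1.0) / v) ** (-1.0 / theta) for _ in range(2))


def exp_quantile(u, rate):
    """Quantile function of the exponential distribution with the given rate."""
    u = min(u, 1.0 - 2.0 ** -53)  # guard against log(0) if u rounds to 1.0
    return -math.log1p(-u) / rate


def sample_poisson(lam, rng):
    """Number of arrivals of a rate-lam Poisson process in the interval [0, 1]."""
    n, t = 0, rng.expovariate(lam)
    while t <= 1.0:
        n += 1
        t += rng.expovariate(lam)
    return n


def simulate_losses(n, theta=0.9, rates=(1 / 10, 1 / 15), lam=1.0, seed=0):
    """Simulate n iid copies of X = (X1, X2) = sum_{k=1}^{N} E_k."""
    rng = random.Random(seed)
    sample = []
    for _ in range(n):
        x1 = x2 = 0.0
        for _ in range(sample_poisson(lam, rng)):
            u1, u2 = sample_clayton_pair(theta, rng)
            x1 += exp_quantile(u1, rates[0])
            x2 += exp_quantile(u2, rates[1])
        sample.append((x1, x2))
    return sample


losses = simulate_losses(1000)
print(losses[:3])
```

Under the rate parametrization the component means are $\lambda/\beta_{\ell}$ , which gives a quick sanity check on a simulated sample.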

For our example we consider the parameters $\theta=0.9$ , $\beta_{1}=1/10$ , $\beta_{2}=1/15$ and $\lambda=1$ . The computation of the geometric expectiles is now based on a simulated iid sample $(\bm{x}_{i})_{i=1}^{100}$ of $\bm{X}$ and therefore utilizes the Monte Carlo estimator according to Corollary 4.2 and the discussion therein. Figure 11 shows the resulting geometric expectiles, where we again consider the indices $\bm{\alpha}_{1}(\varphi)=0.98(\cos(\varphi),\sin(\varphi))$ and $\bm{\alpha}_{2}(\varphi)=(0.98\cos(\varphi),0.90\sin(\varphi))$ introduced in Section 5.2, see also Figure 3. Given that in this example the margins are a.s. non-negative, we confine ourselves to directions in the first quadrant only, i.e., $\varphi\in[0,\pi/2]$ . The numbers in the figure indicate the resulting geometric expectiles for the indices $\bm{\alpha}_{j}(\varphi_{k})$ , $j\in\{1,2\}$ , where $\varphi_{k}=k\pi/14$ , $k\in\{0,\ldots,7\}$ .
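A sample-based computation of this kind can be sketched as follows. The loss is not reproduced in this section, so the code assumes the form $\Lambda_{\bm{\alpha}}(\bm{t})=\tfrac{1}{2}\left\|\bm{t}\right\|_{2}(\left\|\bm{t}\right\|_{2}+\langle\bm{\alpha},\bm{t}\rangle)$ , which is our reading of the norm and inner product terms referred to in (5) and (6) and which reduces to the usual asymmetric squared loss in dimension one. Plain gradient descent with a fixed step is used purely for illustration and is not claimed to be the numerical method of the paper; the names `grad_loss` and `sample_geometric_expectile` are hypothetical.

```python
import math


def grad_loss(t, alpha):
    """Gradient in t of the ASSUMED loss ||t|| * (||t|| + <alpha, t>) / 2."""
    norm = math.sqrt(sum(x * x for x in t))
    if norm == 0.0:
        return [0.0] * len(t)  # the loss is O(||t||^2), so its gradient vanishes at 0
    inner = sum(a * x for a, x in zip(alpha, t))
    return [x + 0.5 * (inner * x / norm + norm * a) for x, a in zip(t, alpha)]


def sample_geometric_expectile(data, alpha, step=0.4, tol=1e-10, max_iter=100_000):
    """Minimize c -> mean_i loss(x_i - c) over c by fixed-step gradient descent."""
    d, n = len(alpha), len(data)
    if sum(a * a for a in alpha) >= 1.0:
        raise ValueError("alpha must lie in the open unit ball")
    c = [sum(x[j] for x in data) / n for j in range(d)]  # start at the sample mean
    for _ in range(max_iter):
        g = [0.0] * d
        for x in data:
            gl = grad_loss([xj - cj for xj, cj in zip(x, c)], alpha)
            for j in range(d):
                g[j] -= gl[j] / n  # chain rule: d/dc loss(x - c) = -grad loss(x - c)
        c = [cj - step * gj for cj, gj in zip(c, g)]
        if math.sqrt(sum(gj * gj for gj in g)) < tol:
            break
    return c


toy_sample = [(0.0, 1.0), (2.0, -1.0), (1.0, 4.0), (-3.0, 0.5), (5.0, 2.0)]
print(sample_geometric_expectile(toy_sample, (0.5, -0.3)))
```

For $\bm{\alpha}=\bm{0}$ the assumed loss is half the squared Euclidean distance, so the routine returns the sample mean. In dimension one with index $u$ the minimizer satisfies the first-order condition of a univariate sample expectile at level $(1+u)/2$ .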

Concerning the individual variables $X_{1}$ and $X_{2}$ , the insurer can now reserve for losses according to the indices $\bm{\alpha}_{j}(\varphi_{0})$ and $\bm{\alpha}_{j}(\varphi_{7})$ , respectively. Taking $j=1$ corresponds to a traditional confidence level of $0.99$ for both components, while $j=2$ corresponds to a traditional confidence level of $0.99$ for $X_{1}$ and $0.95$ for $X_{2}$ . More importantly, by extending the univariate forecast model validation theory outlined, for example, in Gneiting (2011) or Nolde and Ziegel (2017), it might be possible to validate the proposed model against real data by backtesting. Using geometric expectiles the backtest would then validate the full joint distribution function of $\bm{X}$ , and not just the individual marginal distributions of $X_{1}$ and $X_{2}$ .
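The correspondence between index components and traditional confidence levels can be checked numerically. The sketch below assumes the usual relation between an index component $u\in(-1,1)$ and a level in $(0,1)$ , namely $(1+u)/2$ ; this relation is not restated in this section but is consistent with the values $0.98\mapsto 0.99$ and $0.90\mapsto 0.95$ quoted above. The function names are hypothetical.

```python
import math


def index_grid(radii, n_steps=7):
    """Indices (a*cos(phi_k), b*sin(phi_k)) for phi_k = k*pi/(2*n_steps), k = 0..n_steps."""
    a, b = radii
    angles = [k * math.pi / (2 * n_steps) for k in range(n_steps + 1)]
    return [(a * math.cos(phi), b * math.sin(phi)) for phi in angles]


def univariate_level(u):
    """Map an index component u in (-1, 1) to a level in (0, 1), ASSUMING (1 + u) / 2."""
    return (1.0 + u) / 2.0


alpha_1 = index_grid((0.98, 0.98))  # alpha_1(phi_k), k = 0, ..., 7
alpha_2 = index_grid((0.98, 0.90))  # alpha_2(phi_k), k = 0, ..., 7

# phi_0 = 0 isolates the first component, phi_7 = pi/2 the second one.
print(univariate_level(alpha_1[0][0]), univariate_level(alpha_1[7][1]))
print(univariate_level(alpha_2[0][0]), univariate_level(alpha_2[7][1]))
```

With the default `n_steps=7` the grid reproduces $\varphi_{k}=k\pi/14$ , and both families of indices stay inside the open unit ball.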

6 Conclusion

In this paper we introduced geometric expectiles for multivariate distribution functions with finite second moments of the margins. This proposed functional naturally generalizes univariate expectiles introduced in Newey and Powell (1987) to the multivariate case for any fixed dimension $d$ . Instead of a single real number, geometric expectiles are represented by a $d$ -dimensional vector, which can be used for risk management purposes, for risk selection and comparison. This approach is in line with other recently introduced multivariate risk measures. Utilizing a framework comparable to the one introduced in Chaudhuri (1996) to generalize quantiles, the resulting geometric expectiles are indexed by an element of the open unit ball of $\mathbb{R}^{d}$ .

Seen as a statistical functional, geometric expectiles have a number of desirable properties. First, they are well-defined and unique for any multivariate distribution function whose margins have finite second moments. Second, multivariate geometric expectiles behave well under data transformations such as translating, re-scaling or re-ordering the data. More generally, they are also well behaved under multiplication with orthogonal matrices, of which re-ordering is a special case. Third, as in the univariate case, geometric expectiles are elicitable in a multivariate sense. Comparable to the univariate case, this may provide one with a mechanism to rank competing multivariate forecasting procedures, or to backtest a multivariate model against real data.

Aside from population characteristics, we also studied properties and asymptotics of the corresponding finite sample version. Here we find that the sample version is a consistent estimator of the population characteristics. A Monte Carlo estimator of geometric expectiles is readily available when a closed-form solution is not. Furthermore, quasi-Monte Carlo methods can be employed to reduce the variance of the Monte Carlo estimators of the expectations in (4) and (6). This simplifies the computation of the minimizer from a numerical point of view.

In the presented examples, we utilized these simulation-based approximations to contrast geometric expectiles to the geometric quantiles introduced in Chaudhuri (1996) as well as univariate expectiles and quantiles. Our results indicate that geometric value-at-risk is more conservative than geometric expectiles for a given index.

In cases where the second moment condition on the margins is too restrictive, tempering the margins may provide a possible remedy; it remains to be seen how such tempering interacts with geometric expectiles.

Despite the extent of the present study, we can identify the following open questions concerning multivariate geometric expectiles: It is unclear which stochastic order $\prec$ between random vectors is compatible with the corresponding geometric expectiles, so that $e_{\bm{\alpha}}(\bm{X})\sqsubset e_{\bm{\alpha}}(\bm{Y})$ if $\bm{X}\prec\bm{Y}$ in this order. Furthermore, while the multivariate generalization of subadditivity proposed in this paper can numerically be verified for a wide range of distributions, it remains unclear how this property can be shown analytically. The same holds true for the marginalization discussed in Section 5.4 where we observed numerically the ordering of geometric expectiles when applied to (higher dimensional) margins and the full distribution. Concerning the distance of $e_{\bm{\alpha}}(\bm{X})$ to $\mathbb{E}[\bm{X}]$ , our findings are in line with geometric $\operatorname{VaR}$ and thus it is reasonable to expect a monotonic divergence to $\infty$ . In the special case of bounded random vectors this may hamper a straightforward application of geometric expectiles as risk measures, and addressing this issue will be part of further research.

Finally, while it is known, see Koltchinskii (1997), that geometric $\operatorname{VaR}_{\bm{\alpha}}(\bm{X})$ fully characterizes the joint distribution of $\bm{X}$ , it is not clear if this also holds for geometric $e_{\bm{\alpha}}(\bm{X})$ .

Acknowledgements

This work was supported by NSERC under Grant RGPIN-5010-2015 and RGPIN-2015-05447. The authors would also like to thank Connor Jackman for communicating a vital step in the proof of Theorem 4.2.

Bibliography

[1] B. Abdous and R. Theodorescu. Note on the spatial quantile of a random vector. Statistics & Probability Letters, 13(4):333–336, 1992.
[2] P. Artzner, F. Delbaen, J. M. Eber, and D. Heath. Coherent measures of risk. Mathematical Finance, 9(3):203–228, 1999.
[3] A. Azzalini. A class of distributions which includes the normal ones. Scandinavian Journal of Statistics, 12:171–178, 1985.
[4] A. Balbás, R. Balbás, and P. Jiménez-Guerra. Vector Risk Functions. Mediterranean Journal of Mathematics, 6:139–150, 2011.
[5] V. Barbu and T. Precupanu. Convexity and Optimization in Banach Spaces. Springer, 4th edition, 2012.
[6] F. Bellini and E. Di Bernardino. Risk management with expectiles. The European Journal of Finance, 23(6):487–506, 2017.
[7] I. Ben Tahar. Tail conditional expectation for vector-valued risks. SFB 649 Discussion Papers, 2006.
[8] B. Chakraborty. On affine equivariant multivariate quantiles. Annals of the Institute of Statistical Mathematics, 53(2):380–403, 2001.
