Elicitability of Return Risk Measures
M\"ucahit Ayg\"un, Fabio Bellini, Roger J. A. Laeven

TL;DR
This paper investigates the elicitability of return risk measures, providing dual representations, axiomatic characterizations of Orlicz premia, and constructing scoring functions to evaluate their performance.
Contribution
It introduces new axiomatic characterizations of Orlicz premia as the only elicitable return risk measures and develops scoring functions for their assessment.
Findings
Orlicz premia are uniquely elicitable among return risk measures
Dual representation results for convex and geometrically convex measures
A family of scoring functions for evaluating Orlicz premia
Abstract
Informally, a risk measure is said to be elicitable if there exists a suitable scoring function such that minimizing its expected value recovers the risk measure. In this paper, we analyze the elicitability properties of the class of return risk measures (i.e., normalized, monotone and positively homogeneous risk measures). First, we provide dual representation results for convex and geometrically convex return risk measures. Next, we establish new axiomatic characterizations of Orlicz premia (i.e., Luxemburg norms). More specifically, we prove, under different sets of conditions, that Orlicz premia naturally arise as the only elicitable return risk measures. Finally, we provide a general family of strictly consistent scoring functions for Orlicz premia, a myriad of specific examples and a mixture representation suitable for constructing Murphy diagrams.
| Orlicz Function | Orlicz Premium | Scoring Function |
| , | ||
| , | ||
| , | ||
| , , | ||
| no explicit form | ||
| , | no explicit form | |
| , | ||
| , | ||
| no explicit form |
| Forecaster | Predictive distribution of | Point forecast of -norm |
|---|---|---|
| Perfect | ||
| Unconditional | ||
| Unfocused | ||
| Sign-reversed |
| Forecaster | Predictive distribution of | Point forecast of -expectile |
|---|---|---|
| Perfect | ||
| Unfocused | ||
| Mean-Reversed |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRisk and Portfolio Optimization · Multi-Criteria Decision Making · Fuzzy Systems and Optimization
Elicitability of Return Risk Measures††thanks: We are very grateful to seminar participants at the University of Vienna and the University of Amsterdam for their comments and suggestions.
This research was funded in part by the Netherlands Organization for Scientific Research under an NWO-Vici grant 2020–2025 (Aygün and Laeven).
Mücahit Aygün
Dept. of Quantitative Economics
University of Amsterdam
and Tinbergen Institute
Fabio Bellini
Dept. of Statistics and Quantitative Methods
University of Milano Bicocca
Roger J. A. Laeven
Dept. of Quantitative Economics
University of Amsterdam, CentER
and EURANDOM
Abstract
Informally, a risk measure is said to be elicitable if there exists a suitable scoring function such that minimizing its expected value recovers the risk measure. In this paper, we analyze the elicitability properties of the class of return risk measures (i.e., normalized, monotone and positively homogeneous risk measures). First, we provide dual representation results for convex and geometrically convex return risk measures. Next, we establish new axiomatic characterizations of Orlicz premia (i.e., Luxemburg norms). More specifically, we prove, under different sets of conditions, that Orlicz premia naturally arise as the only elicitable return risk measures. Finally, we provide a general family of strictly consistent scoring functions for Orlicz premia, a myriad of specific examples and a mixture representation suitable for constructing Murphy diagrams.
**Keywords ** Return risk measures, elicitability, Orlicz premia, consistent scoring functions, geometric convexity.
1 Introduction
Since the seminal work of Savage ([41]) and Osband ([37]), an expanding and increasingly sophisticated literature has studied elicitability properties of risk measures. Classes of risk measures may, or may not, admit families of strictly consistent scoring functions, and hence be elicitable, with important implications for evaluating model performance and competing forecasts (see e.g., [26], [38], [2], [45], [36], [12], and the references therein). For example, Average Value-at-Risk per se is not elicitable, but it is jointly elicitable with Value-at-Risk since it admits bivariate strictly consistent scoring functions ([26], [21]).
Recently, [4] introduced the class of return risk measures, consisting of normalized, monotone and positively homogeneous risk measures. Return risk measures provide relative (or geometric) assessments of risk. They evaluate how much additional riskless log-return makes a financial position acceptable—whence their name. They constitute the relative counterparts of the class of monetary risk measures ([23], [16]), reminiscent of how relative risk aversion relates to absolute risk aversion. Their dynamic extensions, dynamic return risk measures, have been studied in [5].111Return risk measures that allow for probability distortion were recently analyzed in [42], whereas applications of return risk measures to capital allocation can be found in [33] and [9].
Whereas elicitability properties of monetary risk measures are by now quite well understood, little is known about the elicitability properties of return risk measures. This paper aims to fill this gap by analyzing the elicitability properties of return risk measures, with a particular emphasis on Orlicz premia, also known as Luxemburg norms, which as we will see play a central role in the theory of elicitable return risk measures. Orlicz premia, and the links between risk measures and Orlicz space theory, have been extensively studied in the financial and actuarial mathematics literature (see e.g., [28], [8], [10], [11], [16], [32], [4], [5] and the references therein); however, their connection to statistical decision theory in general, and elicitability in particular, has not been uncovered to our best knowledge.
This paper makes three main contributions. We start by providing dual representation results for convex and geometrically convex return risk measures and clarify their precise relationship. In full generality, the dual representation takes the form of a supremum of discounted logarithmic certainty equivalents, where the discount factor can be interpreted as an index of model plausibility under ambiguity. We show that convex return risk measures occur as a special case in the richer class of geometrically convex return risk measures, and we also analyze their law-invariant representations. Furthermore, we introduce and analyze the class of optimized return risk measures and derive their dual representation.
Second, we establish new characterization results for Orlicz premia. We prove that Orlicz premia naturally arise as the only return risk measures that are elicitable. It has been shown in [37] that an elicitable risk measure must satisfy the convex level sets (CxLS) property. We establish that a law-invariant geometrically convex return risk measure with the CxLS property is necessarily an Orlicz premium. We also show that requiring identifiability for return risk measures singles out the class of Orlicz risk measures: under weak regularity conditions, they are the only identifiable, law-invariant, monotone and positively homogeneous measures of risk. These are our central results, the preparations and mathematical details of which are somewhat involved.
Third, we provide a general, rich family of scoring functions that we prove to be strictly consistent with Orlicz premia. A plethora of examples illustrates the generality of our new family of scoring functions. Special attention is devoted to scoring functions of the relative error form in view of their appealing properties in forecast evaluation. We also provide a mixture representation of the general family of scoring functions in terms of elementary scoring functions, depending on a low-dimensional parameter. This enables the use of so-called Murphy diagrams to compare competing forecasts simultaneously with respect to a full class of strictly consistent scoring functions, and we illustrate this in two examples.
Statistical decision theory demonstrates that some classes of functionals do not allow for meaningful point forecast evaluation by means of expected scores. Functionals that admit a strictly consistent scoring function, guaranteeing that accurate forecasts are rewarded more than inaccurate forecasts, are referred to as elicitable (see Definition 38 for a formal definition). Our characterization results reveal the important place of the class of Orlicz premia in the extensive literature on risk measures. This is graphically illustrated in Figure 1. We know from [43], [2] and [17] that convex shortfall risk measures occur as the subclass of monetary risk measures that are elicitable. Furthermore, the only elicitable law-invariant coherent risk measures are given by expectiles ([45], [17]). We establish in Theorems 32 and 36 that Orlicz premia naturally arise as elicitable return risk measures. Furthermore, in Theorem 26, we provide a direct proof of the result that the only convex Orlicz premia that are translation invariant (and, hence, coherent risk measures) are the expectiles.
The rest of this paper is organized as follows. In Section 2, we recall the general properties of return risk measures and derive some useful continuity properties. In Section 3, we provide dual representation results for geometrically convex and convex return risk measures, explicate their connection and analyze optimized return risk measures. In Section 4, we establish our characterization results for Orlicz premia. Section 5 presents our results on families of scoring functions strictly consistent with Orlicz premia including many examples.
2 Return risk measures
Let be a nonatomic probability space. In the present paper, random variables represent financial losses. We will consider finite-valued risk measures defined on or on its subsets and . Equalities and inequalities between random variables are meant to hold -almost surely without further explicit mentioning.
Definition 1
We say that a functional is:
- a)
translation invariant if 2. b)
monotone if 3. c)
monetary if is translation invariant, monotone and satisfies 4. d)
positively homogeneous if 5. e)
convex if 6. f)
coherent if it is monetary, convex and positively homogeneous 7. g)
law invariant if , where means that and have the same distribution.
A law-invariant functional on induces a functional on , the set of probability measures with compact support in , by means of
[TABLE]
where each probability measure is identified with its distribution function .
We recall from [4] the notions of return risk measure and of its associated multiplicative acceptance set.
Definition 2
A return risk measure is a positively homogeneous and monotone functional satisfying . Its corresponding multiplicative acceptance set (at the level of random variables) is
For return risk measures the notion of geometric convexity—also known as multiplicative convexity or GG-convexity for functions on the positive real line (see e.g., [35])—will be of interest in what follows.
Definition 3
A functional is geometrically convex if for each and it holds that
[TABLE]
We will show in Lemma 16 that convex return risk measures are also geometrically convex. The class of geometrically convex risk measures is strictly larger than the class of convex return risk measures, a nonconvex example of the former being the logarithmic certainty equivalent .
A one-to-one correspondence between return risk measures and monetary risk measures has been outlined in [4] as follows: given a monetary risk measure , the associated return risk measure is given by
[TABLE]
and, vice versa, given a return risk measure , the associated monetary risk measure is
[TABLE]
The main properties of this correspondence are recalled in the following lemma.
Lemma 4
Let and be as in (1) and (2). Then:
- a)
** 2. b)
* is translation invariant is positively homogeneous* 3. c)
* is monotone is monotone * 4. d)
* is convex is geometrically convex* 5. e)
* is law invariant is law invariant* 6. f)
if is law invariant, then, for ,
[TABLE]
In Section 3, we will establish dual representations for geometrically convex return risk measures. It will turn out that for return risk measures the definitions of Fatou and Lebesgue properties have to be slightly modified. We introduce both the usual and the modified versions in the definition below.
Definition 5
A risk measure has the Fatou property if
[TABLE]
whereas it has the Lebesgue property if
[TABLE]
A return risk measure has the lower-bounded Fatou property if
[TABLE]
whereas it has the lower-bounded Lebesgue property if
[TABLE]
Clearly the lower-bounded Lebesgue property implies the lower-bounded Fatou property. Both properties are weaker than the usual ones, requiring respectively lower semicontinuity and continuity under more restrictive assumptions.
Lemma 6
Let and be as in (1) and (2). Then:
- (i)
* has the lower-bounded Fatou property if and only if has the Fatou property* 2. (ii)
* has the lower-bounded Lebesgue property if and only if has the Lebesgue property.*
Proof. (i) Let satisfy , , . By the continuous mapping theorem, it follows that , and , so from the Fatou property of it follows that
[TABLE]
and exponentiating both sides we get . The proof of the ‘only if’ part and the proof of (ii) are similar.
Since a law-invariant, monetary and convex risk measure automatically satisfies the Fatou property (see [29] and [24] for recent developments on the automatic validity of the Fatou property on general spaces), it follows from Lemma 6 that a law-invariant geometrically convex return risk measure automatically has the lower-bounded Fatou property. As a consequence, we have the following mixture continuity result, in which, as usual, we denote by a probability measure supported at .
Proposition 7
Let be a law-invariant geometrically convex return risk measure. Let . Then the mapping
[TABLE]
is continuous at each .
Proof. From Lemma 4 it follows that the corresponding given by equation (2) is a convex law-invariant monetary risk measure. From Proposition 2.1 in [17] suitably adapted to our sign conventions it follows that
[TABLE]
is continuous at each , for fixed with . Therefore, from the representation (3), we obtain
[TABLE]
and the thesis follows by the continuity of compositions with and .
3 Dual representations
In this section, we will denote by the set of probability measures on that are absolutely continuous with respect to the reference measure . In the next theorem, we derive a dual representation of geometrically convex return risk measure as suprema of suitably weighted, or discounted, logarithmic certainty equivalents; the less plausible the probabilistic model, the lower is the corresponding discount factor.
Theorem 8
Let be a geometrically convex return risk measure satisfying the lower-bounded Fatou property. Then there exists a multiplicative weighting function satisfying such that
[TABLE]
Furthermore, if satisfies the lower-bounded Lebesgue property, the supremum in (4) is attained.
Proof. From Lemma 4 and Lemma 6, it follows that is a convex monetary risk measure with the Fatou property, so as is well-known (see e.g., [23]) it has the dual representation
[TABLE]
where . Since , it follows that
[TABLE]
where is given by . From , it follows that . By Theorem 4.22 and Exercise 4.2.2 in [23], the supremum in (5) is attained if has the Lebesgue property. In view of Lemma 4 and Lemma 6, it then follows that the supremum in (4) is attained provided that satisfies the lower-bounded Lebesgue property.
Remark 9
The logarithmic certainty equivalent arising in the dual representation (4) can already be viewed as an example of an Orlicz premium corresponding to the unbounded Orlicz function , since
[TABLE]
We refer to Definition 23 in Section 4 for details on terminology and notation. As a consequence, every geometrically convex return risk measure satisfying the lower-bounded Fatou property can be seen as the supremum of a suitable family of multiplicatively weighted Orlicz premia.
We now derive a Kusuoka representation for law-invariant geometrically convex return risk measures that parallels the usual one for law-invariant convex risk measures. Recall first the definition of Average Value-at-Risk.
Definition 10
Let . For , the Average Value-at-Risk of at level is given by
[TABLE]
where
[TABLE]
and for we set by definition .
Denote by the set of probability measures with support in .
Theorem 11
Let be a law-invariant geometrically convex return risk measure. Then there exists such that
[TABLE]
If has the lower-bounded Lebesgue property, then .
Proof. As in the proof of the previous theorem, if is also law invariant, then from Lemma 4 the associated convex risk measure given by (2) is also law invariant, and hence has the Kusuoka representation (see e.g., [23], [16])
[TABLE]
for a suitable , from which it follows that
[TABLE]
with . From Lemma 6, if has the lower-bounded Lebesgue property, then has the Lebesgue property, and Theorem 35 in [16] implies that , from which the thesis follows.
Remark 12
In the same spirit of Remark 9, the Kusuoka representation of a law-invariant geometrically convex return risk measure given in equation (6) can be written in terms of Orlicz premia as follows:
[TABLE]
where indicates that and have the same distribution.
As for convex risk measures, the validity of the lower-bounded Lebesgue property is linked to a suitable weak compactness property of the level sets of the weighting function and in the law-invariant case to the so-called -weak continuity. Before stating the main result we recall two basic definitions.
Definition 13
We say that a monetary convex risk measure with dual representation (5) has the WC property if for each the lower level sets are compact in the topology. Similarly, a geometrically convex return risk measure with dual representation (4) has the property if for each the upper level sets are compact in the topology.
Definition 14
Let be continuous. The -weak topology on is the weakest topology that makes all mappings continuous, for each continuous satisfying , with . It holds that
[TABLE]
A functional is -weakly continuous if
[TABLE]
Proposition 15
Let be a law-invariant geometrically convex return risk measure. The following are equivalent:
- a)
* has the property* 2. b)
* has the lower-bounded Lebesgue property* 3. c)
* is -weakly continuous for some * 4. d)
For each with and , the function
[TABLE]
is continuous.
Proof. If and are related by the correspondence given in (1) and (2), then the WC property of is equivalent to the property of , since
[TABLE]
So (a) holds if and only if the associated convex risk measure has the WC property. As is well-known (see e.g., [16]), for convex risk measures the WC property is equivalent to the Lebesgue property, so from Lemma 6 it follows that (a) is equivalent to (b). From Proposition 2.7 in [17] adapted to our sign conventions, it follows that the WC property of is equivalent to -weak continuity with respect to some gauge function . From Lemma 4 of [4], it holds that is -weakly continuous if and only if is -weakly continuous with , which shows the equivalence between (a) and (c). Further, Proposition 2.7 in [17] shows that the WC property of is equivalent to its mixture continuity for , which combined with Proposition 7 shows that (a) is equivalent to (d).
As we anticipated after Definition 3, geometrically convex return risk measures are a generalization of convex risk measures, as the following shows.
Lemma 16
Let be a convex return risk measure. Then is geometrically convex.
Proof. If or the thesis is trivial. Let and By using the AM-GM inequality and the monotonicity and convexity of , it follows that
[TABLE]
Next, from positive homogeneity, it follows that
[TABLE]
which completes the proof.
It is then interesting to compare the dual representation of geometrically convex return risk measures given in equation (4) with the following dual representation for convex return risk measures.
Proposition 17
Let be a convex return risk measure satisfying the Fatou property. Then there exists with such that
[TABLE]
Furthermore, if satisfies the Lebesgue property, the supremum in (7) is attained.
Proof. The first part of the statement is easily derived from Proposition 4.3 in [31]. For the second part, it follows from the proof of Proposition 4.3 in [31] that
[TABLE]
where . If we take , then for any , which gives the norm-boundedness of the set . Furthermore, is weakly closed, since it is an intersection of weakly closed sets. Let us take a decreasing sequence of which the intersection is the empty set. For any , we have for every . Therefore, by using the Lebesgue property of , we have
[TABLE]
which gives that is uniformly integrable. Because is bounded, weakly closed and uniformly integrable, it is weakly compact as a consequence of the Dunford-Pettis theorem (see, e.g., Theorem A.67 in [23]). Therefore, the supremum is attained as a result of the Weierstrass Theorem (see, e.g., Corollary 2.35 in [1]). Suppose the supremum is attained for . Then, the supremum is attained for such that .
Since from Lemma 16 a convex return risk measure is geometrically convex, it follows that the dual representation (4) is implied by the dual representation (7). This can be seen starting from the well-known dual representation of the exponential certainty equivalent (see e.g., [18]), given by
[TABLE]
where is the relative entropy or Kullback-Leibler divergence defined by ([13])
[TABLE]
Letting and exponentiating both sides of (8), we obtain
[TABLE]
where
[TABLE]
Now note that when is not absolutely continuous with respect to . Using this fact, we can rewrite expression (9) for , as follows:
[TABLE]
since we take a supremum of nonnegative numbers, when and , where denotes the set of probability measures absolutely continuous with respect to . Upon substituting the expression for derived in (10) in (7), we have clarified the connection between (4) and (7), as follows:
[TABLE]
where
[TABLE]
Following the construction outlined in [6, 7] and [3], return risk measures may be optimized and become translation invariant, hence monetary, as follows:
Definition 18
An optimized return risk measure (henceforth, OR risk measure) is defined as
[TABLE]
for a corresponding return risk measure .
Lemma 19
An OR risk measure satisfies the following properties:
- a)
monotonicity
- b)
positive homogeneity
- c)
translation invariance
- d)
if is convex, then is convex.
Proof. Take such that . For an arbitrary , we have , which implies due to the monotonicity of . Since this is valid for any , by taking the infimum on both sides, we obtain . For (b), by using the positive homogeneity of and of the positive part function, we have, for any ,
[TABLE]
For (c), we have, for any ,
[TABLE]
Finally, let us assume that is convex and take . Because is positively homogeneous, it is sufficient for (d) to prove that is subadditive. We have
[TABLE]
where we have used the convexity and positive homogeneity of and of the positive part function in the second line.
The class of OR risk measures encompasses as special cases the Rockafellar-Uryasev [40] construction of Average-Value-at-Risk as well as its generalization given by the (robust) HG risk measure ([4]). We now establish that the OR risk measure admits an inf-convolution and a dual representation.
Definition 20
The inf-convolution of two convex functionals and is defined as follows:
[TABLE]
Lemma 21
An OR risk measure can be written as
[TABLE]
where and
[TABLE]
when the corresponding return risk measure is convex.
Proof. Note that the functional is convex since is convex and monotone and the positive part function is convex, and is convex, too. Then, the inf-convolution of the functionals and agrees with the definition of in (11).
Recall that the dual space of can be identified with w.r.t. the -topology. The convex conjugate of a function is defined as:
[TABLE]
Proposition 22
An OR risk measure , with a corresponding convex return risk measure , admits the following dual representation:
[TABLE]
where . Furthermore, if satisfies the Lebesgue property, then the supremum in (12) is attained.
Proof. From, e.g., Theorem 2.3.1 in [44], it is known that
[TABLE]
Let us consider the conjugates of the functionals and in Lemma 21. The conjugate can be calculated as follows:
[TABLE]
by using the positive homogeneity of . The conjugate can be calculated as follows:
[TABLE]
By using (13) and Lemma 21, we have the following:
[TABLE]
Therefore, is the indicator function of the set
[TABLE]
Since the functional is the inf-convolution of the functionals and , as a consequence of the Fenchel-Moreau theorem, we have
[TABLE]
where . Hence,
[TABLE]
Following the same argument used in the proof of Proposition 17, it follows that if has the Lebesgue property, then the set is weakly compact, from which the attainment of the maximum follows. Indeed, the set is norm-bounded by definition. It is weakly closed, since it is the intersection of weakly closed sets. Now let us take a decreasing sequence of which the intersection is the empty set. For any , we have for every . Hence, by using the Lebesgue property of , we have
[TABLE]
which gives that is uniformly integrable. Therefore, the supremum is attained due to the Dunford-Pettis and Weierstrass theorems; cf. also Theorem 3.6 in [15] and Theorem 8 in [14].
4 Axiomatizations of Orlicz premia
In this section, we first define Orlicz premia and derive some properties that are relevant in this paper; and next we establish new axiomatizations of Orlicz premia and compare them with the one given in [4].
4.1 Orlicz premia: Definition and properties
The mathematical definition of the Orlicz premium corresponds to the Luxemburg norm on the Orlicz space
[TABLE]
given by
[TABLE]
where the Orlicz function satisfies , is nondecreasing, left-continuous, convex, and nontrivial in the sense that for some and for some . We refer to [19] for the basic properties of Luxemburg norms under these assumptions. Notice that in the actuarial and financial mathematics literature (e.g., [28], [10], [11], [4], [5]) there are small differences in the set of properties required to . In this paper, we are interested in possibly nonconvex Orlicz functions that may not satisfy ; conversely, we will limit the domain of to . (When the function is convex and satisfies several additional properties, it is often referred to as a Young function; as these conditions are not assumed in this paper, we refer to as an Orlicz function.) This leads to the following definition.
Definition 23
Let satisfy:
- a)
, 2. b)
* is nondecreasing* 3. c)
* is left-continuous*
For , the Orlicz premium is defined by
[TABLE]
We recall the relevant properties of Orlicz premia in the following proposition.
Proposition 24
Let and be as in Definition 23. Then,
- a)
* is monotone, positively homogeneous and satisfies * 2. b)
for each , it holds that 3. c)
if is increasing, then
[TABLE] 4. d)
* is convex if and only if is convex.*
Proof. (a) The proof is standard. (b) Let . If then , so from the monotone convergence theorem it follows that is right-continuous. Since , it follows that , that is, . (c) The ‘only if’ part of the first implication follows by strict monotonicity of . The ‘if’ part of the second implication is just the definition of , while the ‘only if’ part follows from (c). (d) The proof of the ‘if’ part is standard. To prove the ‘only if’ part, assume first by contradiction that is not midconvex, i.e., there exist such that . Then, there exists and such that
[TABLE]
Let be disjoint sets with , and let
[TABLE]
From (15), we have and , which contradicts the convexity of . As a consequence, is midconvex and since it is nondecreasing it is also convex.
A remarkable example in which and is not differentiable is the following.
Example 25** (Expectiles)**
Let and let
[TABLE]
Then, , and is convex if and concave if . The corresponding Orlicz premium satisfies
[TABLE]
which gives
[TABLE]
so coincides with the -expectile of ([34, 30]), denoted by and defined for by the condition
[TABLE]
The following theorem shows that expectiles are the most general translation invariant convex Orlicz premia.
Theorem 26
If is increasing and is translation invariant and convex, then
[TABLE]
with and .
Proof. Let . Then and , and
[TABLE]
Fix and . Let be a solution of the equation
[TABLE]
and let . It follows that , and from translation invariance it follows that for each , which implies
[TABLE]
Let and note that . By combining (17) with (16), we have
[TABLE]
which gives
[TABLE]
From the convexity of it follows that and , so from the last equality we get and for every and for each and , from which the thesis follows.
It is immediate to check that , with . Further, the same argument also shows that expectiles with are the only concave translation invariant Orlicz premia. It is interesting to compare the theorem above with [28] and [27], where it is shown that a translation invariant Orlicz premium must be equal to the mean, but in their result actually also the differentiability of the Orlicz function is assumed.
Definition 27
A function is called GA-convex if, for each and , it holds that
[TABLE]
It is not difficult to verify that a nondecreasing and convex function on is GA-convex. For completeness we report the proof in Lemma 49 in the Appendix. The converse does not hold, an example being that is increasing and GA-convex but not convex. We refer to [35] for further properties of GA-convex functions.
Proposition 28
Let and be as in Definition 23. Then is geometrically convex if and only if is GA-convex.
Proof. We first prove the ‘if’ part. Notice first that, for each and each , it holds by definition that . Since is nondecreasing and left-continuous an application of the monotone convergence theorem shows that . Let now and . From the GA-convexity of it follows that
[TABLE]
which implies
[TABLE]
which from positive homogeneity gives
[TABLE]
To prove the ‘only if’ part, we first assume by contradiction that is not GA-midconvex, i.e., there exist such that Then, there exist and such that
[TABLE]
Take disjoint sets with , and let
[TABLE]
From (18), we have and , which contradicts with the geometric convexity of . As a consequence, is GA-midconvex. Since is also nondecreasing the thesis follows. Indeed, this is seen as follows. By the induction hypothesis, is rationally GA-convex. Now let us take and . Without loss of generality, assume that . Take a such that . By monotonicity and rational GA-convexity of , we have
[TABLE]
Since this inequality is valid for any such that , we can take the infimum over the set and obtain
[TABLE]
which gives the GA-convexity of .
Since a nondecreasing and convex function is GA-convex, it follows that a convex Orlicz premium is also geometrically convex. The converse does not hold, an example being the logarithmic certainty equivalent, which is also the Orlicz premium corresponding to .
4.2 Axiomatization based on the properties of the multiplicative acceptance set
This is Theorem 2 in [4] that we report below for convenience.
Theorem 29
Let be a law-invariant return risk measure and let be the corresponding multiplicative acceptance set as in Definition 2 (now at the level of distributions). Assume that
- a)
* and are convex with respect to mixtures, i.e., for each , , and similarly for * 2. b)
* is -weakly closed for some gauge function * 3. c)
for each and , there exists such that
[TABLE]
*Then there exists a nondecreasing function that satisfies
such that .*
As we will see, the convexity with respect to mixtures (at the level of distributions) of the multiplicative acceptance set and its complement assumed in item (a) in the theorem above, is implied by the CxLS property.
4.3 Axiomatizations based on CxLS
Definition 30
A law-invariant functional has the CxLS property if
[TABLE]
for each , and .
Lemma 31
Let be a law-invariant return risk measure with CxLS. Then and are convex with respect to mixtures.
Proof. Let us take and . Let such that the distributions of and are and . Take such that and and are independent. Choosing such and is possible because we are working in an atomless probability space, see Lemma 3.1 in [17]. Without loss of generality, assume that for some . Define and denote its distribution by . By positive homogeneity, we have . Then, has distribution and has distribution . Since and , we have
[TABLE]
By using the monotonicity and the CxLS property, we have
[TABLE]
which gives the convexity of with respect to mixtures. Similarly, it can be proved that is convex with respect to mixtures.
Theorem 32
Let be a law-invariant geometrically convex return risk measure with CxLS. Then there exists a nondecreasing GA-convex Orlicz function such that .
Proof. From the hypotheses and Lemma 4, it follows that the corresponding given by (2) is a convex law-invariant monetary risk measure with CxLS. From Theorem 3.10 in [17], there exists a convex function such that if and only if . Letting , it follows that
[TABLE]
From the convexity of , it follows that for each and ,
[TABLE]
which shows the GA-convexity of .
Notice the consistency between Proposition 28 and Theorem 32. Notice also that under the assumptions of Theorem 32, the function is not necessarily convex as the example of logarithmic certainty equivalents shows.
Theorem 33
Let be a law-invariant convex return risk measure with CxLS. Then there exists a nondecreasing convex Orlicz function such that .
Proof. Since a positively homogeneous, monotone convex functional defined on is geometrically convex, it follows from Theorem 32 that there exists a nondecreasing GA-convex Orlicz function such that . Since is convex only if is convex, as has been shown in Proposition 24, the thesis follows.
4.4 Axiomatization based on identifiability
Definition 34
We say that a functional is identifiable if there exists at least one identification function that satisfies, for each ,
[TABLE]
Example 35
Let . Then is identifiable and two possible identification functions are and .
An identifiable functional has the CxLS property, since
[TABLE]
for each .
Theorem 36
Let be an identifiable and positively homogeneous functional satisfying . Then there exists satisfying and such that
[TABLE]
Furthermore, under these assumptions, if is monotone then is nondecreasing and if is monotone and convex then is nondecreasing and convex.
The proof of Theorem 36 is based on the following lemma, the proof of which is postponed to the Appendix.
Lemma 37
Let be positively homogeneous with and identifiable by the function . Then there exists with and with
[TABLE]
Proof of Theorem 36. By applying Lemma 37 and letting , equations (19) follow. To prove monotonicity of , take with . If , then so monotonicity is trivial. Assume that . Take any and set
[TABLE]
Take with and let and . By construction, and . From the monotonicity of it follows that , so , from which the thesis follows. The proof in the case is similar. To prove convexity of , we first show mid-convexity. Assume by contradiction that there exist such that . Then there exist and such that
[TABLE]
Take disjoint sets such that , and and let , , and
[TABLE]
It holds that
[TABLE]
which implies and . Similarly,
[TABLE]
which implies , contradicting convexity of . Therefore, is mid-convex. Since a nondecreasing mid-convex function is convex, the thesis follows.
5 Consistent scoring functions for Orlicz premia
In this section, we show that Orlicz premia are elicitable and we study general families as well as specific examples of strictly consistent scoring functions.
5.1 Elicitability and strict consistency
We start by recalling a few standard definitions adapted to the class of strictly positive return risk measures.
Definition 38** (Elicitability and strictly consistent scoring functions)**
A functional is elicitable if there exists a strictly consistent scoring function satisfying and if and only if such that, for each , it holds that
[TABLE]
The strictly consistent scoring function is said to be of the prediction error form if and of the relative error form if , where are functions of a single variable.
The following theorem provides a general, rich family of scoring functions that are strictly consistent with Orlicz premia.
Theorem 39
Let be as in Definition 23, with increasing. Let be any integrable function. Then,
[TABLE]
is a strictly consistent scoring function for the Orlicz premium .
Proof. Let . We compute
[TABLE]
where in the last line we have used Fubini’s Theorem and item (c) of Proposition 24. The same argument shows that if , then again
[TABLE]
so is a strictly consistent scoring function for the Orlicz premium.
Two particularly interesting cases arise by taking and . In the first case, with the change of variable , we obtain
[TABLE]
where
[TABLE]
In the second case, with the change of variable , we obtain
[TABLE]
We now present a collection of examples of strictly consistent scoring functions for Orlicz premia, using Orlicz functions that are commonly adopted in the literature. As we will see, in some cases the general approach based on Theorem 39 or the specific forms in equations (5.1) and (24) recover scoring functions that are already known in the literature, whereas in many other cases new families of scoring functions are obtained. In several cases the corresponding Orlicz premium admits an explicit expression, whereas in some other cases it has to be computed numerically. All our examples, including some not discussed below, are summarized in Table 1.
Example 40** (Mean)**
Let . Then, and, from (5.1) and (23), we obtain
[TABLE]
which is an alternative scoring function for the mean. From the classical result of [41], as recalled e.g., in Theorem 7 of [26], the most general class of strictly consistent scoring functions for the mean belongs to the family of Bregman functions, of the form
[TABLE]
where is a convex function with subgradient . The scoring function given in equation (25) arises if . This scoring function is known as the quasi-likelihood (QLIKE) scoring function in the econometrics literature, and it is of common use in assessing forecasts of nonnegative quantities such as volatility (see e.g., [38] and the references therein).
Example 41** (Expectiles)**
Let with , as in Example 25. The corresponding Orlicz premium is the -expectile and, from (5.1) and (23), the corresponding scoring function is given by
[TABLE]
The class of strictly consistent scoring functions for expectiles has been characterized in Theorem 10 of [26], as an asymmetric extension of Bregman functions, defined by
[TABLE]
and as in the case of the mean our scoring function (26) corresponds to the case .
Example 42** (-norms)**
Let with . Then, the Orlicz premium is given by
[TABLE]
and the corresponding scoring function from (5.1) and (23) takes the form
[TABLE]
In view of (25), which arises as a special case when , we will refer to this novel scoring function as the PQLIKE scoring function.
When , the resulting Orlicz premium is no longer a norm, but the scoring function is still valid. Taking a (suitably normalized and scaled) limit for , such that arises, the Orlicz premium is the logarithmic certainty equivalent given by
[TABLE]
The corresponding scoring function from (5.1) and (23) takes the form
[TABLE]
Example 43** (Mean-variance)**
Let with as in Section 5.1 of [28]. Then, the Orlicz premium is given by
[TABLE]
and the corresponding scoring function from (5.1) and (23) is
[TABLE]
Example 44
Let with as in Section 5.2 of [28]. The corresponding Orlicz premium does not in general admit an explicit expression. It is the solution of the equation where is the moment generating function of . For example, if has a Gamma distribution with shape parameter and rate parameter , we have
[TABLE]
The corresponding scoring function from (24) is given by
[TABLE]
5.2 Mixture representations
It is clear that a scoring function of the form (21) depends on the choice of the function . Hence, the ranking of competing forecasts may depend on this choice, in particular in finite samples and under model misspecification (see e.g., [39]). To remedy the dependence of the ranking on the specific choice of the strictly consistent scoring function, [20] develop a method to compare forecasts simultaneously with respect to a class of strictly consistent scoring functions by considering so-called Murphy diagrams. This method relies on the availability of a mixture representation of the strictly consistent scoring functions under consideration, in terms of elementary scoring functions depending on a low-dimensional parameter. Mixture representations for the class of strictly consistent scoring functions for quantiles and expectiles have been given in [20]. A mixture representation of strictly consistent scoring functions for the triplet of Range Value-at-Risk and its two associated Value-at-Risks has been given in [22]. In the following theorem, we provide such a mixture representation for our new family of scoring functions in (21).
Theorem 45
Any strictly consistent scoring function for the Orlicz premium of the form (21) admits a representation of the form
[TABLE]
for a positive measure , where . Conversely, for any choice of the positive measure , we obtain a strictly consistent scoring function of the form (21) for the Orlicz premium .
Proof. By using (21), we have
[TABLE]
where and . Note that, since the function in (21) is strictly positive, the Riemann integral is well-defined.
As a corollary, we provide a mixture representation for the scoring functions of -norms.
Corollary 46
Any strictly consistent scoring function of the form (21) with , , admits a representation of the form:
[TABLE]
for a positive measure , where .
Proof. From Theorem 45 with , we obtain
[TABLE]
where .
We conduct two simulation experiments to illustrate how one can use Theorem 45 to rank competing forecasts. In particular, we will generate Murphy diagrams for logarithmic certainty equivalents, -norms and expectiles by using the corresponding elementary scoring functions.
Example 47
We first suppose that the true distribution of the outcome variable is given by where . We take . We consider four different forecasters, who will be referred to as perfect, unconditional, unfocused and sign-reversed, similar to [25] and [20], suitably modified to the current setting. The perfect forecaster issues the true distribution of the outcome as predictive distribution. Therefore, his/her point forecasts of the logarithmic certainty equivalent (LCE) and -norm are and with . The unconditional forecaster does not have knowledge of and issues the unconditional distribution of as predictive distribution: . Therefore, his/her point forecasts of the LCE and -norm are and with . The remaining two forecasters, unfocused and sign-reversed, have knowledge of , but their predictive distributions fail to be ideal. The unfocused forecaster issues a mixture distribution as predictive distribution of , involving an independent random variable that takes the values and with probability , leading to yielding and with as the corresponding forecasts of the LCE and -norm. The sign-reversed forecaster issues a predictive distribution of with the sign of flipped: . Therefore, his/her point forecasts are and . The point forecasts generated by the four predictive distributions are summarized in Table 2. Using simulations of sample size each, we obtain the Murphy diagrams displayed in Figure 2 for the LCE and -norm with . As can be seen in Figure 2, the perfect forecaster dominates the other forecasters for the LCE and all -norms considered, as expected. Although not clearly visible, the expected scores for the other three forecasters intersect in all four cases, such that none of these forecasters dominates the other.
Example 48
For -expectiles, from Theorem 45 with , , we have
[TABLE]
where . Therefore, the elementary scoring function for expectiles can be expressed as
[TABLE]
Suppose now that the true distribution of the outcome variable is given by where . We take . We consider three different forecasters: perfect, unfocused and mean-reversed, similar to Example 47. The perfect forecaster issues the true distribution of as predictive distribution. The unfocused forecaster issues as predictive distribution, involving an independent random variable that takes the values and each with probability . The mean-reversed forecaster issues a predictive distribution with the mean reversed: . The point forecasts generated by the three predictive distributions are displayed in Table 3. Using simulations of sample size each, we obtain the Murphy diagrams displayed in Figure 3 for . As we see from the figure, the perfect forecaster dominates the other forecasters, as expected. There is no ordering relationship between the unfocused and mean-reversed forecasters, because their expected scores intersect.
6 Appendix
Lemma 49
Let be nondecreasing and convex. Then, is GA-convex.
Proof. For and from the AM-GM inequality it holds that . Since is nondecreasing and convex it follows that
[TABLE]
which gives the thesis.
Proof of Lemma 37. Since is law invariant and positively homogeneous with , it follows that , for each . From identifiability, it follows that
[TABLE]
For each , define
[TABLE]
Since
[TABLE]
it follows that . From the positive homogeneity of , it follows that for each ,
[TABLE]
and from identifiability
[TABLE]
which gives
[TABLE]
so we can conclude that
[TABLE]
and letting we find that
[TABLE]
for each .
We now want to prove that
[TABLE]
with , from which the thesis follows immediately. We consider two cases.
If , we set and in (30), obtaining
[TABLE]
which gives
[TABLE]
If instead , we set and in (30), obtaining
[TABLE]
which gives
[TABLE]
Notice also that from (30) it follows that
[TABLE]
so combining (32) and (33) it follows that (31) is satisfied with
[TABLE]
from which the thesis follows.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Aliprantis, C. D., Border, K. C. (2006). Infinite Dimensional Analysis. Third edition, Springer Verlag, Berlin.
- 2[2] Bellini, F., Bignozzi, V. (2015). On elicitable risk measures. Quantitative Finance 15(5), 725-733.
- 3[3] Bellini, F., Rosazza Gianin, E. (2008). On Haezendonck risk measures. Journal of Banking and Finance 32(6), 986–994.
- 4[4] Bellini, F., Laeven, R. J. A., Rosazza Gianin, E. (2018). Robust return risk measures. Mathematics and Financial Economics 12(1), 5-32.
- 5[5] Bellini, F., Laeven, R. J. A., Rosazza Gianin, E. (2021). Dynamic robust Orlicz premia and Haezendonck-Goovaerts risk measures. European Journal of Operational Research 291(2), 438-446.
- 6[6] Ben-Tal, A., Teboulle, M. (1986). Expected utility, penalty functions, and duality in stochastic nonlinear programming. Management Science 32(11), 1445-1466.
- 7[7] Ben-Tal, A., Teboulle, M. (2007). An old-new concept of convex risk measures: The optimized certainty equivalent. Mathematical Finance 17(3), 449-476.
- 8[8] Biagini, S., Frittelli, M. (2008). A unified framework for utility maximization problems: An Orlicz space approach. The Annals of Applied Probability 18(3), 929-966.
