Elicitability of Return Risk Measures

Mücahit Aygün; Fabio Bellini; Roger J. A. Laeven

arXiv:2302.13070 · q-fin.RM · March 20, 2023

TL;DR

This paper investigates the elicitability of return risk measures, providing dual representations, axiomatic characterizations of Orlicz premia, and a family of strictly consistent scoring functions for evaluating forecasts of Orlicz premia.

Contribution

It introduces new axiomatic characterizations of Orlicz premia as the only elicitable return risk measures and develops a general family of scoring functions that are strictly consistent with them.

Findings

01. Orlicz premia are the only elicitable return risk measures.

02. Dual representation results for convex and geometrically convex return risk measures.

03. A general family of strictly consistent scoring functions for Orlicz premia.

Abstract

Informally, a risk measure is said to be elicitable if there exists a suitable scoring function such that minimizing its expected value recovers the risk measure. In this paper, we analyze the elicitability properties of the class of return risk measures (i.e., normalized, monotone and positively homogeneous risk measures). First, we provide dual representation results for convex and geometrically convex return risk measures. Next, we establish new axiomatic characterizations of Orlicz premia (i.e., Luxemburg norms). More specifically, we prove, under different sets of conditions, that Orlicz premia naturally arise as the only elicitable return risk measures. Finally, we provide a general family of strictly consistent scoring functions for Orlicz premia, a myriad of specific examples and a mixture representation suitable for constructing Murphy diagrams.

Tables

Table 1: Examples of Orlicz functions with the corresponding Orlicz premia and the corresponding strictly consistent scoring functions. Eqn. (5.1) is used to obtain the scoring functions of the first eight examples. For the remaining examples, (24) is used. Although $\Phi(x)=\alpha+1_{\{x>1\}}$ does not satisfy the conditions of Theorem 39, it still gives rise to a strictly consistent scoring function. When $\Phi(x)=\frac{e^{\alpha x}-1}{e^{\alpha}-1}$ , the corresponding Orlicz premium does not in general admit an explicit expression. The expression given in the table corresponds to the case in which $Y$ follows a Gamma distribution with shape parameter $\theta>0$ and rate parameter $\gamma>0$ . When $\Phi(x)=xe^{\alpha(x^{2}-1)}$ , the corresponding Orlicz premium also does not in general admit an explicit expression. The given expression corresponds to the case in which $Y=\left\lvert Z\right\rvert$ , where $Z$ follows a normal distribution with mean $0$ and standard deviation $\sigma$ .

Orlicz Function $\Phi(x)$	Orlicz Premium $H_{\Phi}(Y)$	Scoring Function $S_{\Phi}(x,y)$
$x$	$\mathbb{E}[Y]$	$\frac{y}{x}-\log\left(\frac{y}{x}\right)-1$
$\alpha+1_{\{x>1\}}$ , $0<\alpha<1$	$q_{\alpha}(Y)$	$\left(1_{\{x\geq y\}}-\alpha\right)\log\left(\frac{x}{y}\right)$
$1+q(x-1)^{+}-(1-q)(x-1)^{-}$ , $0<q<1$	$e_{q}(Y)$	$q\left(\frac{y}{x}-\log\left(\frac{y}{x}\right)-1\right)^{+}+(1-q)\left(\frac{y}{x}-\log\left(\frac{y}{x}\right)-1\right)^{-}$
$1+\log(x)$	$\exp(\mathbb{E}[\log Y])$	$\left(\log\left(\frac{y}{x}\right)\right)^{2}$
$x^{p}$ , $p\geq 1$	$\|Y\|_{p}$	$\frac{1}{p}\left(\frac{y^{p}}{x^{p}}-1\right)-\log\left(\frac{y}{x}\right)$
$\lambda x^{p}+(1-\lambda)x^{2p}$ , $0\leq\lambda\leq 1$ , $p\geq 1$	$\left(\frac{1}{2}\left(\lambda\mathbb{E}[Y^{p}]+\sqrt{\lambda^{2}\mathbb{E}[Y^{p}]^{2}+4(1-\lambda)\mathbb{E}[Y^{2p}]}\right)\right)^{\frac{1}{p}}$	$\frac{1-\lambda}{2p}\left(\frac{y}{x}\right)^{2p}+\frac{\lambda}{p}\left(\frac{y}{x}\right)^{p}-\log\left(\frac{y}{x}\right)-\frac{\lambda+1}{2p}$
$x\log(e-1+x)$	no explicit form	$\left(\frac{y}{x}+e-1\right)\left(\log\left(\frac{y}{x}+e-1\right)-1\right)-\log\left(\frac{y}{x}\right)$
$x^{p}+1_{\{x>1\}}x^{p}\log(x)$ , $p\geq 1$	no explicit form	$-\log\left(\frac{y}{x}\right)+\frac{1}{p}\left(\frac{y^{p}}{x^{p}}-1\right)+1_{\{y>x\}}\left(\frac{1}{p}\frac{y^{p}}{x^{p}}\log\left(\frac{y}{x}\right)-\frac{1}{p^{2}}\left(\frac{y^{p}}{x^{p}}-1\right)\right)$
$\frac{e^{\alpha x}-1}{e^{\alpha}-1}$ , $\alpha>0$	$\frac{\alpha e^{\alpha/\theta}}{\theta(e^{\alpha/\theta}-1)}\mathbb{E}[Y]$	$\frac{e^{\alpha}}{(e^{\alpha}-1)}\left(\frac{1}{\alpha y}\left(e^{\alpha(\frac{y}{x}-1)}-1\right)+\frac{1}{y}-\frac{1}{x}\right)$
$xe^{\alpha(x^{2}-1)}$ , $\alpha>0$	$\sqrt{\frac{2}{\pi}}\left(\frac{1}{2}e^{-\alpha}+\sqrt{\frac{1}{4}e^{-2\alpha}+\pi\alpha}\right)\sigma$	$\frac{1}{2\alpha y}\left(e^{\alpha(\frac{y^{2}}{x^{2}}-1)}-1\right)+\frac{1}{y}-\frac{1}{x}$
$\frac{e^{x}-x-1}{e-2}$	no explicit form	$\frac{1}{e-2}\left(\frac{e^{\frac{y}{x}}-\frac{1}{2}}{y}+\frac{(2-2e)x-y}{2x^{2}}\right)$
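As a rough illustration of what strict consistency means for one row of Table 1, the following sketch takes the scoring function listed for $\Phi(x)=x^{p}$ and checks numerically that the average score over a small sample is minimized near the empirical $p$-norm. The sample values, the choice $p=2$ and the search grid are assumptions made only for this example; they do not come from the paper.

```python
import math

def score_p(x, y, p):
    """Table 1 score for Phi(x) = x**p: (1/p) * ((y/x)**p - 1) - log(y/x)."""
    r = y / x
    return (r ** p - 1.0) / p - math.log(r)

def mean_score(x, ys, p):
    """Average score of the point forecast x over the observations ys."""
    return sum(score_p(x, y, p) for y in ys) / len(ys)

def p_norm(ys, p):
    """Empirical p-norm (E[Y**p])**(1/p) under equal weights."""
    return (sum(y ** p for y in ys) / len(ys)) ** (1.0 / p)

ys = [0.5, 1.0, 1.5, 2.0, 4.0]   # hypothetical positive observations
p = 2.0                          # hypothetical exponent
target = p_norm(ys, p)

# Brute-force search over a grid of candidate forecasts (step 0.001).
grid = [0.5 + 0.001 * k for k in range(4000)]
best = min(grid, key=lambda x: mean_score(x, ys, p))

print("empirical p-norm:", target)
print("grid minimizer  :", best)
```

The grid search is deliberately naive; the point is only that the minimizer of the expected score and the Orlicz premium agree up to the grid resolution.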

Table 2: Predictive distributions and point forecasts. The point forecasts for the LCE arise by taking $p\equiv 0$ .

Forecaster	Predictive distribution of $\log (Y)$	Point forecast of $p$ -norm
Perfect	$𝒩 (μ, σ_{Y}^{2})$	$\exp (μ + σ_{Y}^{2} p / 2)$
Unconditional	$𝒩 (0, σ_{μ}^{2} + σ_{Y}^{2})$	$\exp ((σ_{μ}^{2} + σ_{Y}^{2}) p / 2)$
Unfocused	$\frac{1}{2} (𝒩 (μ, σ_{Y}^{2}) + 𝒩 (μ + τ, σ_{Y}^{2}))$	$\exp (μ + τ / 2 + σ_{Y}^{2} p / 4)$
Sign-reversed	$𝒩 (- μ, σ_{Y}^{2})$	$\exp (- μ + σ_{Y}^{2} p / 2)$

Table 3: Predictive distributions and point forecasts. $W$ denotes the Lambert function.

Forecaster	Predictive distribution of $Y$	Point forecast of $q$ -expectile
Perfect	$\exp (λ)$	$\frac{1}{λ} (1 + W (\frac{2 q - 1}{(1 - q) e}))$
Unfocused	$\exp (τ λ)$	$\frac{1}{τ λ} (1 + W (\frac{2 q - 1}{(1 - q) e}))$
Mean-Reversed	$\exp (1 / λ)$	$λ (1 + W (\frac{2 q - 1}{(1 - q) e}))$

Equations

$$\rho(F):=\rho(X),\quad\text{if }X\sim F,$$

$$\tilde{\rho}(X^{\alpha}Y^{1-\alpha})\leq\tilde{\rho}^{\alpha}(X)\,\tilde{\rho}^{1-\alpha}(Y).$$

$$\tilde{\rho}(X):=\exp(\rho(\log(X))),$$

$$\rho(Y):=\log(\tilde{\rho}(\exp(Y))).$$

$$\tilde{\rho}(F)=\exp(\rho(F(e^{t}))).$$

$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k\implies\rho(X)\leq\liminf_{n\to+\infty}\rho(X_{n}),$$

$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k\implies\rho(X_{n})\to\rho(X).$$

$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k,\ X_{n}\geq c>0\implies\tilde{\rho}(X)\leq\liminf_{n\to+\infty}\tilde{\rho}(X_{n}),$$

$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k,\ X_{n}\geq c>0\implies\tilde{\rho}(X_{n})\to\tilde{\rho}(X).$$

$$\rho(\log X)\leq\liminf_{n\to+\infty}\rho(\log(X_{n})),$$

$$\lambda\mapsto\tilde{\rho}(\lambda\delta_{x}+(1-\lambda)\delta_{y})$$

$$\lambda\mapsto\rho(\lambda\delta_{u}+(1-\lambda)\delta_{v})$$

$$\tilde{\rho}(\lambda\delta_{x}+(1-\lambda)\delta_{y})=\exp(\rho(\lambda\delta_{\log(x)}+(1-\lambda)\delta_{\log(y)})),$$

$$\tilde{\rho}(X)=\sup_{Q\in\mathbf{P}}\left\{\tilde{\alpha}(Q)\exp\left(\mathbb{E}_{Q}[\log X]\right)\right\}.$$

$$\rho(X)=\sup_{Q\in\mathbf{P}}\left\{\mathbb{E}_{Q}[X]-\alpha(Q)\right\},$$

$$\tilde{\rho}(X)=\sup_{Q\in\mathbf{P}}\left\{\tilde{\alpha}(Q)\exp\left(\mathbb{E}_{Q}[\log X]\right)\right\},$$

$$\exp\left(\mathbb{E}_{Q}[\log X]\right)=\inf\left\{k>0\;\Big|\;\mathbb{E}_{Q}\left[1+\log\left(\frac{X}{k}\right)\right]\leq 1\right\}=H_{1+\log x,Q}(X).$$

$$AV@R_{\lambda}(X)=\frac{1}{1-\lambda}\int_{\lambda}^{1}q_{\alpha}(X)\,\mathrm{d}\alpha,$$

$$q_{\alpha}(X)=\inf\{x\in\mathbb{R}\mid F(x)\geq\alpha\},$$

$$\tilde{\rho}(X)=\sup_{\mu\in\mathcal{M}_{1}([0,1])}\left\{\tilde{\beta}(\mu)\exp\left(\int_{[0,1]}AV@R_{\lambda}(\log X)\,\mu(\mathrm{d}\lambda)\right)\right\}.$$

$$\rho(X)=\sup_{\mu\in\mathcal{M}_{1}([0,1])}\left(\int_{[0,1]}AV@R_{\lambda}(X)\,\mu(\mathrm{d}\lambda)-\beta(\mu)\right),$$

$$\tilde{\rho}(X)=\sup_{\mu\in\mathcal{M}_{1}([0,1])}\tilde{\beta}(\mu)\exp\left(\int_{[0,1]}AV@R_{\lambda}(\log X)\,\mu(\mathrm{d}\lambda)\right),$$

$$\tilde{\rho}(X)=\sup_{Q\in\mathcal{M}_{1}(P)}\left\{\tilde{\alpha}(Q)\sup_{\tilde{Q}\sim Q}H_{1+\log x,\tilde{Q}}(X)\right\},$$

$$F_{n}\overset{\psi}{\to}F\ \text{ if }\ F_{n}\to F\text{ weakly and }\int\psi\,\mathrm{d}F_{n}\to\int\psi\,\mathrm{d}F.$$

$$F_{n}\overset{\psi}{\to}F\implies\rho(F_{n})\to\rho(F).$$

$$\lambda\mapsto\tilde{\rho}(\lambda\delta_{x}+(1-\lambda)\delta_{y})$$

$$\{Q\in\mathbf{P}\mid\tilde{\alpha}(Q)\geq m\}=\{Q\in\mathbf{P}\mid\exp(-\alpha(Q))\geq m\}=\{Q\in\mathbf{P}\mid\alpha(Q)\leq-\log(m)\}.$$

$$\tilde{\rho}\left(\left(\frac{X}{\tilde{\rho}(X)}\right)^{\lambda}\left(\frac{Y}{\tilde{\rho}(Y)}\right)^{1-\lambda}\right)\leq\lambda\tilde{\rho}\left(\frac{X}{\tilde{\rho}(X)}\right)+(1-\lambda)\tilde{\rho}\left(\frac{Y}{\tilde{\rho}(Y)}\right)=1.$$

$$\tilde{\rho}(X^{\lambda}Y^{1-\lambda})\leq\tilde{\rho}(X)^{\lambda}\tilde{\rho}(Y)^{1-\lambda},$$

Taxonomy

Topics: Risk and Portfolio Optimization · Multi-Criteria Decision Making · Fuzzy Systems and Optimization

Full text

Elicitability of Return Risk Measures

We are very grateful to seminar participants at the University of Vienna and the University of Amsterdam for their comments and suggestions.

This research was funded in part by the Netherlands Organization for Scientific Research under an NWO-Vici grant 2020–2025 (Aygün and Laeven).

Mücahit Aygün

Dept. of Quantitative Economics

University of Amsterdam

and Tinbergen Institute

Fabio Bellini

Dept. of Statistics and Quantitative Methods

University of Milano Bicocca

Roger J. A. Laeven

Dept. of Quantitative Economics

University of Amsterdam, CentER

and EURANDOM

**Keywords** Return risk measures, elicitability, Orlicz premia, consistent scoring functions, geometric convexity.

1 Introduction

Since the seminal work of Savage ([41]) and Osband ([37]), an expanding and increasingly sophisticated literature has studied elicitability properties of risk measures. Classes of risk measures may, or may not, admit families of strictly consistent scoring functions, and hence be elicitable, with important implications for evaluating model performance and competing forecasts (see e.g., [26], [38], [2], [45], [36], [12], and the references therein). For example, Average Value-at-Risk per se is not elicitable, but it is jointly elicitable with Value-at-Risk since it admits bivariate strictly consistent scoring functions ([26], [21]).

Recently, [4] introduced the class of return risk measures, consisting of normalized, monotone and positively homogeneous risk measures. Return risk measures provide relative (or geometric) assessments of risk. They evaluate how much additional riskless log-return makes a financial position acceptable, whence their name. They constitute the relative counterparts of the class of monetary risk measures ([23], [16]), reminiscent of how relative risk aversion relates to absolute risk aversion. Their dynamic extensions, dynamic return risk measures, have been studied in [5]. (Footnote 1: Return risk measures that allow for probability distortion were recently analyzed in [42], whereas applications of return risk measures to capital allocation can be found in [33] and [9].)

Whereas elicitability properties of monetary risk measures are by now quite well understood, little is known about the elicitability properties of return risk measures. This paper aims to fill this gap by analyzing the elicitability properties of return risk measures, with a particular emphasis on Orlicz premia, also known as Luxemburg norms, which, as we will see, play a central role in the theory of elicitable return risk measures. Orlicz premia, and the links between risk measures and Orlicz space theory, have been extensively studied in the financial and actuarial mathematics literature (see e.g., [28], [8], [10], [11], [16], [32], [4], [5] and the references therein); however, to the best of our knowledge, their connection to statistical decision theory in general, and elicitability in particular, has not been uncovered.

This paper makes three main contributions. We start by providing dual representation results for convex and geometrically convex return risk measures and clarify their precise relationship. In full generality, the dual representation takes the form of a supremum of discounted logarithmic certainty equivalents, where the discount factor can be interpreted as an index of model plausibility under ambiguity. We show that convex return risk measures occur as a special case in the richer class of geometrically convex return risk measures, and we also analyze their law-invariant representations. Furthermore, we introduce and analyze the class of optimized return risk measures and derive their dual representation.

Second, we establish new characterization results for Orlicz premia. We prove that Orlicz premia naturally arise as the only return risk measures that are elicitable. It has been shown in [37] that an elicitable risk measure must satisfy the convex level sets (CxLS) property. We establish that a law-invariant geometrically convex return risk measure with the CxLS property is necessarily an Orlicz premium. We also show that requiring identifiability for return risk measures singles out the class of Orlicz risk measures: under weak regularity conditions, they are the only identifiable, law-invariant, monotone and positively homogeneous measures of risk. These are our central results, the preparations and mathematical details of which are somewhat involved.

Third, we provide a general, rich family of scoring functions that we prove to be strictly consistent with Orlicz premia. A plethora of examples illustrates the generality of our new family of scoring functions. Special attention is devoted to scoring functions of the relative error form in view of their appealing properties in forecast evaluation. We also provide a mixture representation of the general family of scoring functions in terms of elementary scoring functions, depending on a low-dimensional parameter. This enables the use of so-called Murphy diagrams to compare competing forecasts simultaneously with respect to a full class of strictly consistent scoring functions, and we illustrate this in two examples.

Statistical decision theory demonstrates that some classes of functionals do not allow for meaningful point forecast evaluation by means of expected scores. Functionals that admit a strictly consistent scoring function, guaranteeing that accurate forecasts are rewarded more than inaccurate forecasts, are referred to as elicitable (see Definition 38 for a formal definition). Our characterization results reveal the important place of the class of Orlicz premia in the extensive literature on risk measures. This is graphically illustrated in Figure 1. We know from [43], [2] and [17] that convex shortfall risk measures occur as the subclass of monetary risk measures that are elicitable. Furthermore, the only elicitable law-invariant coherent risk measures are given by expectiles ([45], [17]). We establish in Theorems 32 and 36 that Orlicz premia naturally arise as elicitable return risk measures. Furthermore, in Theorem 26, we provide a direct proof of the result that the only convex Orlicz premia that are translation invariant (and, hence, coherent risk measures) are the expectiles.

The rest of this paper is organized as follows. In Section 2, we recall the general properties of return risk measures and derive some useful continuity properties. In Section 3, we provide dual representation results for geometrically convex and convex return risk measures, explicate their connection and analyze optimized return risk measures. In Section 4, we establish our characterization results for Orlicz premia. Section 5 presents our results on families of scoring functions strictly consistent with Orlicz premia including many examples.

2 Return risk measures

Let $(\Omega,\mathcal{F},P)$ be a nonatomic probability space. In the present paper, random variables $X\colon\Omega\to\mathbb{R}$ represent financial losses. We will consider finite-valued risk measures defined on $L^{\infty}(\Omega,\mathcal{F},P)$ or on its subsets $L^{\infty}_{+}(\Omega,\mathcal{F},P):=\{X\in L^{\infty}\mid X\geq 0\;P\mbox{-a.s.}\}$ and $L^{\infty}_{++}(\Omega,\mathcal{F},P):=\{X\in L^{\infty}\mid X>0\;P\mbox{-a.s.}\}$ . Equalities and inequalities between random variables are meant to hold $P$ -almost surely without further explicit mentioning.

Definition 1

We say that a functional $\rho\colon L^{\infty}(\Omega,\mathcal{F},P)\to\mathbb{R}$ is:

a) translation invariant if $\rho(X+h)=\rho(X)+h,\,\forall h\in\mathbb{R},\,\forall X\in L^{\infty}$ ;

b) monotone if $X\leq Y\Rightarrow\rho(X)\leq\rho(Y)$ ;

c) monetary if $\rho$ is translation invariant, monotone and satisfies $\rho(0)=0$ ;

d) positively homogeneous if $\rho(\lambda X)=\lambda\rho(X),\,\forall\lambda\geq 0,\,\forall X\in L^{\infty}$ ;

e) convex if $\rho(\alpha X+(1-\alpha)Y)\leq\alpha\rho(X)+(1-\alpha)\rho(Y),\,\forall X,Y\in L^{\infty},\,\forall\alpha\in(0,1)$ ;

f) coherent if it is monetary, convex and positively homogeneous;

g) law invariant if $X\sim Y\Rightarrow\rho(X)=\rho(Y)$ , where $X\sim Y$ means that $X$ and $Y$ have the same distribution.

A law-invariant functional on $L^{\infty}(\Omega,\mathcal{F},P)$ induces a functional on $\mathcal{M}_{1,c}(\mathbb{R})$ , the set of probability measures with compact support in $\mathbb{R}$ , by means of

$$\rho(F):=\rho(X),\quad\text{if }X\sim F,$$

where each probability measure $\mu\in\mathcal{M}_{1,c}(\mathbb{R})$ is identified with its distribution function $F(x):=\mu(-\infty,x]$ .

We recall from [4] the notions of return risk measure and of its associated multiplicative acceptance set.

Definition 2

A return risk measure $\tilde{\rho}\colon L^{\infty}_{+}\to[0,+\infty)$ is a positively homogeneous and monotone functional satisfying $\tilde{\rho}(1)=1$ . Its corresponding multiplicative acceptance set (at the level of random variables) is $B_{\tilde{\rho}}=\{X\in L^{\infty}_{+}\mid\tilde{\rho}(X)\leq 1\}.$

For return risk measures the notion of geometric convexity—also known as multiplicative convexity or GG-convexity for functions on the positive real line (see e.g., [35])—will be of interest in what follows.

Definition 3

A functional $\tilde{\rho}\colon L^{\infty}_{+}\to[0,+\infty)$ is geometrically convex if for each $X,Y\in L^{\infty}_{+}$ and $\alpha\in(0,1)$ it holds that

$$\tilde{\rho}(X^{\alpha}Y^{1-\alpha})\leq\tilde{\rho}^{\alpha}(X)\,\tilde{\rho}^{1-\alpha}(Y).$$

We will show in Lemma 16 that convex return risk measures are also geometrically convex. The class of geometrically convex return risk measures is strictly larger than the class of convex return risk measures, a nonconvex example of the former being the logarithmic certainty equivalent $\tilde{\rho}(X)=\exp\operatorname{\mathbb{E}}[\log X]$ .
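To make the last claim concrete, here is a minimal numerical sketch on a toy space with two equally likely states (an assumption for illustration; the paper works on a nonatomic space, on which such two-valued positions can still be realized). It evaluates the logarithmic certainty equivalent on a geometric mixture and on a linear mixture of two hypothetical positions `X` and `Y`.

```python
import math

def log_ce(xs):
    """Logarithmic certainty equivalent exp(E[log X]) with equal state weights."""
    return math.exp(sum(math.log(x) for x in xs) / len(xs))

# Hypothetical positions on two equally likely states.
X = [1.0, 4.0]
Y = [4.0, 1.0]
a = 0.5

geo_mix = [x ** a * y ** (1 - a) for x, y in zip(X, Y)]   # X^a * Y^(1-a)
lin_mix = [a * x + (1 - a) * y for x, y in zip(X, Y)]     # a*X + (1-a)*Y

# Geometric convexity: rho(X^a Y^(1-a)) <= rho(X)^a * rho(Y)^(1-a).
geo_lhs = log_ce(geo_mix)
geo_rhs = log_ce(X) ** a * log_ce(Y) ** (1 - a)

# Convexity would require rho(a*X + (1-a)*Y) <= a*rho(X) + (1-a)*rho(Y).
lin_lhs = log_ce(lin_mix)
lin_rhs = a * log_ce(X) + (1 - a) * log_ce(Y)

print("geometric mixture:", geo_lhs, "<=", geo_rhs)
print("linear mixture   :", lin_lhs, "vs", lin_rhs)
```

In this example the geometric-convexity inequality holds with equality, while the linear mixture has a strictly larger value than the average of the two individual values, so convexity fails.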

A one-to-one correspondence between return risk measures and monetary risk measures has been outlined in [4] as follows: given a monetary risk measure $\rho\colon L^{\infty}\to\mathbb{R}$ , the associated return risk measure $\tilde{\rho}\colon L^{\infty}_{++}\to(0,+\infty)$ is given by

$$\tilde{\rho}(X):=\exp(\rho(\log(X))),\tag{1}$$

and, vice versa, given a return risk measure $\tilde{\rho}\colon L^{\infty}_{++}\to(0,+\infty)$ , the associated monetary risk measure $\rho\colon L^{\infty}\to\mathbb{R}$ is

$$\rho(Y):=\log(\tilde{\rho}(\exp(Y))).\tag{2}$$

The main properties of this correspondence are recalled in the following lemma.

Lemma 4

Let $\rho\colon L^{\infty}\to\mathbb{R}$ and $\tilde{\rho}\colon L^{\infty}_{++}\to(0,+\infty)$ be as in (1) and (2). Then:

a) $\rho(0)=0\iff\tilde{\rho}(1)=1$ ;

b) $\rho$ is translation invariant $\iff$ $\tilde{\rho}$ is positively homogeneous;

c) $\rho$ is monotone $\iff$ $\tilde{\rho}$ is monotone;

d) $\rho$ is convex $\iff$ $\tilde{\rho}$ is geometrically convex;

e) $\rho$ is law invariant $\iff$ $\tilde{\rho}$ is law invariant;

f) if $\rho$ is law invariant, then, for $F\in\mathcal{M}_{1,c}(0,+\infty)$ ,

$$\tilde{\rho}(F)=\exp(\rho(F(e^{t}))).\tag{3}$$
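The correspondence between monetary and return risk measures can be sketched in a few lines. The example below assumes a finite sample with equal weights and uses the entropic risk measure as the monetary side; the parameter `GAMMA` and the helper names `to_return` and `to_monetary` are hypothetical choices for this illustration. Under these assumptions the associated return risk measure is the empirical norm of order `GAMMA`, which makes normalization and positive homogeneity easy to check.

```python
import math

GAMMA = 2.0  # hypothetical risk-aversion parameter, chosen for illustration

def entropic(ys, gamma=GAMMA):
    """Entropic risk measure (1/gamma) * log E[exp(gamma * Y)], equal weights."""
    return math.log(sum(math.exp(gamma * y) for y in ys) / len(ys)) / gamma

def to_return(rho):
    """Correspondence (1): rho_tilde(X) = exp(rho(log X)) for X > 0."""
    return lambda xs: math.exp(rho([math.log(x) for x in xs]))

def to_monetary(rho_tilde):
    """Correspondence (2): rho(Y) = log(rho_tilde(exp(Y)))."""
    return lambda ys: math.log(rho_tilde([math.exp(y) for y in ys]))

rho_tilde = to_return(entropic)
round_trip = to_monetary(rho_tilde)

X = [0.5, 1.0, 2.0, 3.0]     # hypothetical positive losses
Y = [-1.0, 0.0, 0.5]         # hypothetical real-valued losses
gamma_norm = (sum(x ** GAMMA for x in X) / len(X)) ** (1.0 / GAMMA)

print("rho_tilde(X)       :", rho_tilde(X))
print("norm of order GAMMA:", gamma_norm)
print("rho_tilde(1)       :", rho_tilde([1.0] * 4))
print("round trip on Y    :", round_trip(Y), "vs", entropic(Y))
```

Translation invariance of the entropic risk measure shows up on the return side as positive homogeneity, in line with item b) of the lemma.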

In Section 3, we will establish dual representations for geometrically convex return risk measures. It will turn out that for return risk measures the definitions of Fatou and Lebesgue properties have to be slightly modified. We introduce both the usual and the modified versions in the definition below.

Definition 5

A risk measure $\rho$ has the Fatou property if

$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k\implies\rho(X)\leq\liminf_{n\to+\infty}\rho(X_{n}),$$

whereas it has the Lebesgue property if

$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k\implies\rho(X_{n})\to\rho(X).$$

A return risk measure $\tilde{\rho}$ has the lower-bounded Fatou property if

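$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k,\ X_{n}\geq c>0\implies\tilde{\rho}(X)\leq\liminf_{n\to+\infty}\tilde{\rho}(X_{n}),$$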

whereas it has the lower-bounded Lebesgue property if

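$$X_{n}\overset{P}{\to}X,\ \|X_{n}\|_{\infty}\leq k,\ X_{n}\geq c>0\implies\tilde{\rho}(X_{n})\to\tilde{\rho}(X).$$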

Clearly the lower-bounded Lebesgue property implies the lower-bounded Fatou property. Both properties are weaker than the usual ones, requiring respectively lower semicontinuity and continuity under more restrictive assumptions.

Lemma 6

Let $\rho\colon L^{\infty}\to\mathbb{R}$ and $\tilde{\rho}\colon L^{\infty}_{++}\to(0,+\infty)$ be as in (1) and (2). Then:

(i) $\tilde{\rho}$ has the lower-bounded Fatou property if and only if $\rho$ has the Fatou property;

(ii) $\tilde{\rho}$ has the lower-bounded Lebesgue property if and only if $\rho$ has the Lebesgue property.

Proof. (i) Let $X_{n}\in L^{\infty}_{++}$ satisfy $X_{n}\overset{P}{\to}X$ , $\|X_{n}\|_{\infty}\leq k$ , $X_{n}\geq c>0$ . By the continuous mapping theorem, it follows that $\log(X_{n})\overset{P}{\to}\log(X)$ , and $\|\log(X_{n})\|_{\infty}\leq\max(\log k,-\log c)$ , so from the Fatou property of $\rho$ it follows that

$$\rho(\log X)\leq\liminf_{n\to+\infty}\rho(\log(X_{n})),$$

and exponentiating both sides we get $\tilde{\rho}(X)\leq\liminf_{n\to+\infty}\tilde{\rho}(X_{n})$ . The proof of the ‘only if’ part and the proof of (ii) are similar.

Since a law-invariant, monetary and convex risk measure automatically satisfies the Fatou property (see [29] and [24] for recent developments on the automatic validity of the Fatou property on general spaces), it follows from Lemma 6 that a law-invariant geometrically convex return risk measure automatically has the lower-bounded Fatou property. As a consequence, we have the following mixture continuity result, in which, as usual, we denote by $\delta_{x}$ a probability measure supported at $x$ .

Proposition 7

Let $\tilde{\rho}\colon\mathcal{M}_{1,c}(0,+\infty)\to(0,+\infty)$ be a law-invariant geometrically convex return risk measure. Let $0<x<y$ . Then the mapping

$$\lambda\mapsto\tilde{\rho}(\lambda\delta_{x}+(1-\lambda)\delta_{y})$$

is continuous at each $\lambda\in[0,1)$ .

Proof. From Lemma 4 it follows that the corresponding $\rho\colon L^{\infty}\to\mathbb{R}$ given by equation (2) is a convex law-invariant monetary risk measure. From Proposition 2.1 in [17] suitably adapted to our sign conventions it follows that

$$\lambda\mapsto\rho(\lambda\delta_{u}+(1-\lambda)\delta_{v})$$

is continuous at each $\lambda\in[0,1)$ , for fixed $u,v\in\mathbb{R}$ with $u<v$ . Therefore, from the representation (3), we obtain

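$$\tilde{\rho}(\lambda\delta_{x}+(1-\lambda)\delta_{y})=\exp(\rho(\lambda\delta_{\log(x)}+(1-\lambda)\delta_{\log(y)})),$$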

and the thesis follows by the continuity of compositions with $\exp$ and $\log$ .

3 Dual representations

In this section, we will denote by $\mathbf{P}$ the set of probability measures on $(\Omega,\mathcal{F},P)$ that are absolutely continuous with respect to the reference measure $P$ . In the next theorem, we derive a dual representation of geometrically convex return risk measures as suprema of suitably weighted, or discounted, logarithmic certainty equivalents; the less plausible the probabilistic model, the lower the corresponding discount factor.

Theorem 8

Let $\tilde{\rho}\colon L^{\infty}_{++}\to(0,+\infty)$ be a geometrically convex return risk measure satisfying the lower-bounded Fatou property. Then there exists a multiplicative weighting function $\tilde{\alpha}\colon\mathbf{P}\to[0,1]$ satisfying $\sup_{Q\in\mathbf{P}}\tilde{\alpha}(Q)=1$ such that

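$$\tilde{\rho}(X)=\sup_{Q\in\mathbf{P}}\left\{\tilde{\alpha}(Q)\exp\left(\operatorname{\mathbb{E}}_{Q}[\log X]\right)\right\}.\tag{4}$$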

Furthermore, if $\tilde{\rho}$ satisfies the lower-bounded Lebesgue property, the supremum in (4) is attained.

Proof. From Lemma 4 and Lemma 6, it follows that $\rho(X)=\log(\tilde{\rho}(\exp(X)))$ is a convex monetary risk measure with the Fatou property, so as is well-known (see e.g., [23]) it has the dual representation

$$\rho(X)=\sup_{Q\in\mathbf{P}}\left\{\operatorname{\mathbb{E}}_{Q}[X]-\alpha(Q)\right\},\tag{5}$$

where $\alpha\colon\mathbf{P}\to[0,+\infty]$ . Since $\tilde{\rho}(X)=\exp(\rho(\log(X)))$ , it follows that

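$$\tilde{\rho}(X)=\exp\left(\sup_{Q\in\mathbf{P}}\left\{\operatorname{\mathbb{E}}_{Q}[\log X]-\alpha(Q)\right\}\right)=\sup_{Q\in\mathbf{P}}\left\{\tilde{\alpha}(Q)\exp\left(\operatorname{\mathbb{E}}_{Q}[\log X]\right)\right\},$$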

where $\tilde{\alpha}\colon\mathbf{P}\to[0,1]$ is given by $\tilde{\alpha}(Q)=\exp(-\alpha(Q))$ . From $\tilde{\rho}(1)=1$ , it follows that $\sup_{Q\in\mathbf{P}}\tilde{\alpha}(Q)=1$ . By Theorem 4.22 and Exercise 4.2.2 in [23], the supremum in (5) is attained if $\rho$ has the Lebesgue property. In view of Lemma 4 and Lemma 6, it then follows that the supremum in (4) is attained provided that $\tilde{\rho}$ satisfies the lower-bounded Lebesgue property.
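The passage from the additive representation (5) to the multiplicative representation (4) can be checked on a toy example. The sketch below assumes a finite family of three hypothetical models on three states, each with a penalty $\alpha(Q)\geq 0$; none of these numbers come from the paper, and the finite state space is only a stand-in for the nonatomic setting.

```python
import math

# Hypothetical models (state probabilities, penalty alpha(Q) >= 0).
# The first model has zero penalty, so the largest discount factor is 1.
MODELS = [
    ([1 / 3, 1 / 3, 1 / 3], 0.0),
    ([0.2, 0.3, 0.5], 0.1),
    ([0.1, 0.1, 0.8], 0.4),
]

def rho(ys):
    """Additive form as in (5): max over models of E_Q[Y] - alpha(Q)."""
    return max(sum(q * y for q, y in zip(Q, ys)) - a for Q, a in MODELS)

def rho_tilde(xs):
    """Multiplicative form as in (4): max of exp(-alpha(Q)) * exp(E_Q[log X])."""
    return max(
        math.exp(-a) * math.exp(sum(q * math.log(x) for q, x in zip(Q, xs)))
        for Q, a in MODELS
    )

X = [0.5, 1.0, 3.0]  # hypothetical positive position
via_correspondence = math.exp(rho([math.log(x) for x in X]))

print("rho_tilde(X)       :", rho_tilde(X))
print("exp(rho(log X))    :", via_correspondence)
print("rho_tilde(1)       :", rho_tilde([1.0, 1.0, 1.0]))
```

Each model contributes its logarithmic certainty equivalent scaled by the discount factor $\exp(-\alpha(Q))$, so heavily penalized models only drive the supremum when they assign much more weight to bad states.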

Remark 9

The logarithmic certainty equivalent $\exp\left(\operatorname{\mathbb{E}}[\log X]\right)$ arising in the dual representation (4) can already be viewed as an example of an Orlicz premium corresponding to the unbounded Orlicz function $\Phi(x)=1+\log(x)$ , since

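$$\exp\left(\operatorname{\mathbb{E}}_{Q}\left[\log X\right]\right)=\inf\left\{k>0\;\Big|\;\operatorname{\mathbb{E}}_{Q}\left[1+\log\left(\frac{X}{k}\right)\right]\leq 1\right\}=H_{1+\log x,Q}(X).$$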

We refer to Definition 23 in Section 4 for details on terminology and notation. As a consequence, every geometrically convex return risk measure satisfying the lower-bounded Fatou property can be seen as the supremum of a suitable family of multiplicatively weighted Orlicz premia.
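The identity in this remark can be verified numerically. The following sketch computes $\inf\{k>0\mid\mathbb{E}_{Q}[\Phi(X/k)]\leq 1\}$ by bisection for a discrete distribution and compares it with the closed form for $\Phi(x)=1+\log(x)$. The function name `orlicz_premium`, the bracketing interval and the sample distribution are assumptions made for this illustration, and the routine presumes that $\Phi$ is nondecreasing so that the constraint is monotone in $k$.

```python
import math

def orlicz_premium(phi, xs, weights, lo=1e-9, hi=1e9, iters=200):
    """Approximate inf{k > 0 : E_Q[phi(X / k)] <= 1} by bisection on a log scale.

    Assumes phi is nondecreasing, so the expectation is nonincreasing in k,
    and that the answer lies between lo and hi.
    """
    def expected(k):
        return sum(w * phi(x / k) for x, w in zip(xs, weights))

    for _ in range(iters):
        mid = math.sqrt(lo * hi)      # geometric midpoint
        if expected(mid) <= 1.0:
            hi = mid                  # constraint holds: try a smaller k
        else:
            lo = mid                  # constraint fails: k must be larger
    return hi

# Hypothetical discrete distribution of X under Q.
xs = [0.5, 1.0, 2.0, 8.0]
weights = [0.1, 0.2, 0.3, 0.4]

premium_log = orlicz_premium(lambda t: 1.0 + math.log(t), xs, weights)
closed_form = math.exp(sum(w * math.log(x) for x, w in zip(xs, weights)))

premium_sq = orlicz_premium(lambda t: t ** 2, xs, weights)
two_norm = math.sqrt(sum(w * x ** 2 for x, w in zip(xs, weights)))

print("Phi = 1 + log x:", premium_log, "vs", closed_form)
print("Phi = x^2      :", premium_sq, "vs", two_norm)
```

The second comparison, with $\Phi(x)=x^{2}$, is included only as a sanity check that the same routine recovers a familiar norm.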

We now derive a Kusuoka representation for law-invariant geometrically convex return risk measures that parallels the usual one for law-invariant convex risk measures. Recall first the definition of Average Value-at-Risk.

Definition 10

Let $X\in L^{1}(\Omega,\mathcal{F},P)$ . For $\lambda\in[0,1)$ , the Average Value-at-Risk of $X$ at level $\lambda$ is given by

$$AV@R_{\lambda}(X)=\frac{1}{1-\lambda}\int_{\lambda}^{1}q_{\alpha}(X)\,\mathrm{d}\alpha,$$

where

$$q_{\alpha}(X)=\inf\{x\in\mathbb{R}\mid F(x)\geq\alpha\},$$

and for $\lambda=1$ we set by definition $AV@R_{1}(X)=\operatorname*{ess\,sup}(X)$ .
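For a finite sample with equal weights, the quantile function in this definition is a step function, so the integral can be computed exactly. The sketch below does this for a small hypothetical sample; the function name `avar` and the data are assumptions for illustration only.

```python
def avar(xs, lam):
    """Average Value-at-Risk at level lam for an equally weighted sample.

    With sorted values x_(1) <= ... <= x_(n), the lower quantile q_a equals
    x_(i) for a in ((i-1)/n, i/n], so the integral over (lam, 1] is a
    weighted sum of the sorted values.
    """
    xs = sorted(xs)
    n = len(xs)
    if lam >= 1.0:
        return xs[-1]                 # essential supremum by convention
    total = 0.0
    for i, x in enumerate(xs):
        left, right = i / n, (i + 1) / n
        overlap = max(0.0, right - max(left, lam))   # length of the part above lam
        total += x * overlap
    return total / (1.0 - lam)

sample = [4.0, 1.0, 3.0, 2.0]   # hypothetical losses

for level in (0.0, 0.5, 0.75, 1.0):
    print(level, avar(sample, level))
```

At level 0 the value is the sample mean, and it increases toward the largest observation as the level approaches 1.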

Denote by $\mathcal{M}_{1}([0,1])$ the set of probability measures with support in $[0,1]$ .

Theorem 11

Let $\tilde{\rho}\colon L^{\infty}_{++}\to(0,+\infty)$ be a law-invariant geometrically convex return risk measure. Then there exists $\tilde{\beta}\colon\mathcal{M}_{1}([0,1])\to[0,1]$ such that

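$$\tilde{\rho}(X)=\sup_{\mu\in\mathcal{M}_{1}([0,1])}\left\{\tilde{\beta}(\mu)\exp\left(\int_{[0,1]}AV@R_{\lambda}(\log X)\,\mu(\mathrm{d}\lambda)\right)\right\}.\tag{6}$$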

If $\tilde{\rho}$ has the lower-bounded Lebesgue property, then $\mu(1)>0\Rightarrow\tilde{\beta}(\mu)=0$ .

Proof. As in the proof of the previous theorem, if $\tilde{\rho}$ is also law invariant, then from Lemma 4 the associated convex risk measure $\rho$ given by (2) is also law invariant, and hence has the Kusuoka representation (see e.g., [23], [16])

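$$\rho(X)=\sup_{\mu\in\mathcal{M}_{1}([0,1])}\left(\int_{[0,1]}AV@R_{\lambda}(X)\,\mu(\mathrm{d}\lambda)-\beta(\mu)\right),$$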

for a suitable $\beta\colon\mathcal{M}_{1}([0,1])\to[0,+\infty]$ , from which it follows that

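$$\tilde{\rho}(X)=\sup_{\mu\in\mathcal{M}_{1}([0,1])}\tilde{\beta}(\mu)\exp\left(\int_{[0,1]}AV@R_{\lambda}(\log X)\,\mu(\mathrm{d}\lambda)\right),$$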

with $\tilde{\beta}(\mu)=\exp(-\beta(\mu))$ . From Lemma 6, if $\tilde{\rho}$ has the lower-bounded Lebesgue property, then $\rho$ has the Lebesgue property, and Theorem 35 in [16] implies that $\mu(1)>0\Rightarrow\beta(\mu)=+\infty$ , from which the thesis follows.

Remark 12

In the same spirit of Remark 9, the Kusuoka representation of a law-invariant geometrically convex return risk measure given in equation (6) can be written in terms of Orlicz premia as follows:

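$$\tilde{\rho}(X)=\sup_{Q\in\mathcal{M}_{1}(P)}\left\{\tilde{\alpha}(Q)\sup_{\tilde{Q}\sim Q}H_{1+\log x,\tilde{Q}}(X)\right\},$$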

where $\tilde{Q}\sim Q$ indicates that $\frac{\mathrm{d}Q}{\mathrm{d}P}$ and $\frac{\mathrm{d}\tilde{Q}}{\mathrm{d}P}$ have the same distribution.

As for convex risk measures, the validity of the lower-bounded Lebesgue property is linked to a suitable weak compactness property of the level sets of the weighting function $\tilde{\alpha}$ and in the law-invariant case to the so-called $\psi$ -weak continuity. Before stating the main result we recall two basic definitions.

Definition 13

We say that a monetary convex risk measure $\rho$ with dual representation (5) has the WC property if for each $m\in\mathbb{R}$ the lower level sets $\{Q\in\mathbf{P}\,|\,\alpha(Q)\leq m\}$ are compact in the $\sigma(L_{1},L_{\infty})$ topology. Similarly, a geometrically convex return risk measure $\tilde{\rho}$ with dual representation (4) has the $\widetilde{WC}$ property if for each $m>0$ the upper level sets $\{Q\in\mathbf{P}\,|\,\tilde{\alpha}(Q)\geq m\}$ are compact in the $\sigma(L_{1},L_{\infty})$ topology.

Definition 14

Let $\psi\colon\mathbb{R}\to[1,+\infty)$ be continuous. The $\psi$ -weak topology on $\mathcal{M}_{1,c}(\mathbb{R})$ is the weakest topology that makes all mappings $F\mapsto\int f\,\mathrm{d}F$ continuous, for each continuous $f$ satisfying $|f|\leq c\psi$ , with $c>0$ . It holds that

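$$F_{n}\overset{\psi}{\to}F\ \text{ if }\ F_{n}\to F\text{ weakly and }\int\psi\,\mathrm{d}F_{n}\to\int\psi\,\mathrm{d}F.$$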

A functional $\rho\colon\mathcal{M}_{1,c}\to\mathbb{R}$ is $\psi$ -weakly continuous if

$$F_{n}\overset{\psi}{\to}F\implies\rho(F_{n})\to\rho(F).$$

Proposition 15

Let $\tilde{\rho}:\mathcal{M}_{1,c}\to(0,+\infty)$ be a law-invariant geometrically convex return risk measure. The following are equivalent:

a) $\tilde{\rho}$ has the $\widetilde{WC}$ property;

b) $\tilde{\rho}$ has the lower-bounded Lebesgue property;

c) $\tilde{\rho}$ is $\tilde{\psi}$ -weakly continuous for some $\tilde{\psi}\colon(0,+\infty)\to\mathbb{R}$ ;

d) for each $x,y>0$ with $x<y$ and $\lambda\in[0,1]$ , the function

$$\lambda\mapsto\tilde{\rho}(\lambda\delta_{x}+(1-\lambda)\delta_{y})$$

is continuous.

Proof. If $\rho$ and $\tilde{\rho}$ are related by the correspondence given in (1) and (2), then the WC property of $\rho$ is equivalent to the $\widetilde{\mathrm{WC}}$ property of $\tilde{\rho}$ , since

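$$\{Q\in\mathbf{P}\mid\tilde{\alpha}(Q)\geq m\}=\{Q\in\mathbf{P}\mid\exp(-\alpha(Q))\geq m\}=\{Q\in\mathbf{P}\mid\alpha(Q)\leq-\log(m)\}.$$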

So (a) holds if and only if the associated convex risk measure $\rho$ has the WC property. As is well-known (see e.g., [16]), for convex risk measures the WC property is equivalent to the Lebesgue property, so from Lemma 6 it follows that (a) is equivalent to (b). From Proposition 2.7 in [17] adapted to our sign conventions, it follows that the WC property of $\rho$ is equivalent to $\psi$ -weak continuity with respect to some gauge function $\psi$ . From Lemma 4 of [4], it holds that $\rho$ is $\psi$ -weakly continuous if and only if $\tilde{\rho}$ is $\tilde{\psi}$ -weakly continuous with $\tilde{\psi}(t)=\psi(\log(t))$ , which shows the equivalence between (a) and (c). Further, Proposition 2.7 in [17] shows that the WC property of $\rho$ is equivalent to its mixture continuity for $\lambda\to 1^{-}$ , which combined with Proposition 7 shows that (a) is equivalent to (d).

As we anticipated after Definition 3, geometrically convex return risk measures are a generalization of convex return risk measures, as the following shows.

Lemma 16

Let $\tilde{\rho}\colon L^{\infty}_{+}\to[0,+\infty)$ be a convex return risk measure. Then $\tilde{\rho}$ is geometrically convex.

Proof. Let $X,Y\in L_{+}^{\infty}$ and $\lambda\in(0,1)$ . If $\tilde{\rho}(X)=0$ or $\tilde{\rho}(Y)=0$ , the thesis is trivial, so assume that both are strictly positive. By using the AM-GM inequality and the monotonicity and convexity of $\tilde{\rho}$ , it follows that

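$$\tilde{\rho}\left(X^{\lambda}Y^{1-\lambda}\right)\leq\tilde{\rho}\left(\lambda X+(1-\lambda)Y\right)\leq\lambda\tilde{\rho}(X)+(1-\lambda)\tilde{\rho}(Y).$$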

Next, from positive homogeneity, it follows that

[TABLE]

which completes the proof.

It is then interesting to compare the dual representation of geometrically convex return risk measures given in equation (4) with the following dual representation for convex return risk measures.

Proposition 17

Let $\tilde{\rho}\colon L^{\infty}_{+}\to[0,+\infty)$ be a convex return risk measure satisfying the Fatou property. Then there exists $\beta\colon\mathbf{P}\to[0,1]$ with $\sup_{Q\in\mathbf{P}}\beta(Q)=1$ such that

[TABLE]

Furthermore, if $\tilde{\rho}$ satisfies the Lebesgue property, the supremum in (7) is attained.

Proof. The first part of the statement is easily derived from Proposition 4.3 in [31]. For the second part, it follows from the proof of Proposition 4.3 in [31] that

[TABLE]

where $H=\left\{Z\in L_{+}^{1}:\operatorname{\mathbb{E}}[ZY]\leq\tilde{\rho}(Y)\text{ for any }Y\in L_{+}^{\infty}\right\}$ . Taking $Y=1$ and using the normalization $\tilde{\rho}(1)=1$ gives $\operatorname{\mathbb{E}}[Z]\leq 1$ for any $Z\in H$ , so the set $H$ is norm-bounded. Furthermore, $H$ is weakly closed, since it is an intersection of weakly closed sets. Let $\left(A_{n}\right)_{n}\subset\mathcal{F}$ be a decreasing sequence with empty intersection. For any $Z\in H$ , we have $\operatorname{\mathbb{E}}[Z1_{A_{n}}]\leq\tilde{\rho}(1_{A_{n}})$ for every $n$ . Therefore, by using the Lebesgue property of $\tilde{\rho}$ , we have

[TABLE]

which gives that $H$ is uniformly integrable. Because $H$ is bounded, weakly closed and uniformly integrable, it is weakly compact as a consequence of the Dunford-Pettis theorem (see, e.g., Theorem A.67 in [23]). Therefore, the supremum is attained as a result of the Weierstrass Theorem (see, e.g., Corollary 2.35 in [1]). Suppose the supremum is attained for $\tilde{Z}\in H$ . Then, the supremum is attained for $\tilde{Q}$ such that $\frac{\mathrm{d}\tilde{Q}}{\mathrm{d}P}=\frac{\tilde{Z}}{\operatorname{\mathbb{E}}[\tilde{Z}]}$ .

Since from Lemma 16 a convex return risk measure is geometrically convex, it follows that the dual representation (4) is implied by the dual representation (7). This can be seen starting from the well-known dual representation of the exponential certainty equivalent (see e.g., [18]), given by

[TABLE]

where $H(R,Q)$ is the relative entropy or Kullback-Leibler divergence defined by ([13])

[TABLE]

Letting $X=\exp(Y)$ and exponentiating both sides of (8), we obtain

[TABLE]

where

[TABLE]

Now note that $\tilde{\alpha}(R)=0$ when $R$ is not absolutely continuous with respect to $Q$ . Using this fact, we can rewrite expression (9) for $Q\in\mathbf{P}$ , as follows:

[TABLE]

since we take a supremum of nonnegative numbers, $\tilde{\alpha}(R)=0$ when $R\notin\mathbf{Q}$ and $\mathbf{Q}\subset\mathbf{P}$ , where $\mathbf{Q}$ denotes the set of probability measures absolutely continuous with respect to $Q$ . Upon substituting the expression for $\operatorname{\mathbb{E}}_{Q}[X]$ derived in (10) in (7), we have clarified the connection between (4) and (7), as follows:

[TABLE]

where

[TABLE]

Following the construction outlined in [6, 7] and [3], return risk measures may be optimized and become translation invariant, hence monetary, as follows:

Definition 18

An optimized return risk measure (henceforth, OR risk measure) $\rho:L_{+}^{\infty}\to\mathbb{R}$ is defined as

$$\rho(X):=\inf_{x\in\mathbb{R}}\left\{x+\tilde{\rho}\left((X-x)^{+}\right)\right\}\tag{11}$$

for a corresponding return risk measure $\tilde{\rho}:L^{\infty}_{+}\rightarrow[0,+\infty)$ .

Lemma 19

An OR risk measure satisfies the following properties:

a)

monotonicity

b)

positive homogeneity

c)

translation invariance

d)

if $\tilde{\rho}$ is convex, then $\rho$ is convex.

Proof. Take $X,Y\in L_{+}^{\infty}$ such that $X\leq Y$ . For an arbitrary $x\in\mathbb{R}$ , we have $(Y-x)^{+}\geq(X-x)^{+}$ , which implies $x+\tilde{\rho}((Y-x)^{+})\geq x+\tilde{\rho}((X-x)^{+})$ due to the monotonicity of $\tilde{\rho}$ . Since this is valid for any $x\in\mathbb{R}$ , by taking the infimum on both sides, we obtain $\rho(X)\leq\rho(Y)$ . For (b), by using the positive homogeneity of $\tilde{\rho}$ and of the positive part function, we have, for any $\lambda>0$ ,

[TABLE]

For (c), we have, for any $h\in\mathbb{R}$ ,

[TABLE]

Finally, let us assume that $\tilde{\rho}$ is convex and take $X,Y\in L_{+}^{\infty}$ . Because $\rho$ is positively homogeneous, it is sufficient for (d) to prove that $\rho$ is subadditive. We have

[TABLE]

where we have used the convexity and positive homogeneity of $\tilde{\rho}$ and of the positive part function in the second line.

The class of OR risk measures encompasses as special cases the Rockafellar-Uryasev [40] construction of Average-Value-at-Risk as well as its generalization given by the (robust) HG risk measure ([4]). We now establish that the OR risk measure admits an inf-convolution and a dual representation.

Definition 20

The inf-convolution $(f\ \Box\ g)$ of two convex functionals $f:L^{\infty}\to\overline{\mathbb{R}}$ and $g:L^{\infty}\to\overline{\mathbb{R}}$ is defined as follows:

[TABLE]

Lemma 21

An OR risk measure $\rho$ can be written as

[TABLE]

where $f(X)=\tilde{\rho}(X^{+})$ and

[TABLE]

when the corresponding return risk measure $\tilde{\rho}$ is convex.

Proof. Note that the functional $f$ is convex since $\tilde{\rho}$ is convex and monotone and the positive part function is convex, and $g$ is convex, too. Then, the inf-convolution of the functionals $f$ and $g$ agrees with the definition of $\rho$ in (11).

Recall that the dual space of $L^{\infty}$ can be identified with $L^{1}$ w.r.t. the $\sigma(L^{\infty},L^{1})$ -topology. The convex conjugate $h^{*}:L^{1}\to\overline{\mathbb{R}}$ of a function $h:L^{\infty}\to\overline{\mathbb{R}}$ is defined as:

[TABLE]

Proposition 22

An OR risk measure $\rho$ , with a corresponding convex return risk measure $\tilde{\rho}$ , admits the following dual representation:

[TABLE]

where $A_{P}=\left\{Q\in\mathbf{P}:\operatorname{\mathbb{E}}_{Q}[Y]\leq\tilde{\rho}(Y)\text{ for any }Y\in L_{+}^{\infty}\right\}$ . Furthermore, if $\tilde{\rho}$ satisfies the Lebesgue property, then the supremum in (12) is attained.

Proof. From, e.g., Theorem 2.3.1 in [44], it is known that

[TABLE]

Let us consider the conjugates of the functionals $f$ and $g$ in Lemma 21. The conjugate $f^{*}$ can be calculated as follows:

[TABLE]

by using the positive homogeneity of $\tilde{\rho}$ . The conjugate $g^{*}$ can be calculated as follows:

[TABLE]

By using (13) and Lemma 21, we have the following:

[TABLE]

Therefore, $\rho^{*}$ is the indicator function of the set

[TABLE]

Since the functional $\rho$ is the inf-convolution of the functionals $f$ and $g$ , as a consequence of the Fenchel-Moreau theorem, we have

[TABLE]

where $A_{P}=\left\{Q\in\mathbf{P}:\operatorname{\mathbb{E}}_{Q}[Y]\leq\tilde{\rho}(Y)\text{ for any }Y\in L_{+}^{\infty}\right\}$ . Hence,

[TABLE]

Following the same argument used in the proof of Proposition 17, it follows that if $\tilde{\rho}$ has the Lebesgue property, then the set $A_{P}$ is weakly compact, from which the attainment of the supremum follows. Indeed, the set $A_{P}$ is norm-bounded by definition. It is weakly closed, since it is the intersection of weakly closed sets. Now let $(A_{n})_{n}\subset\mathcal{F}$ be a decreasing sequence with empty intersection. For any $Q\in A_{P}$ , we have $\operatorname{\mathbb{E}}_{Q}[1_{A_{n}}]\leq\tilde{\rho}(1_{A_{n}})$ for every $n$ . Hence, by using the Lebesgue property of $\tilde{\rho}$ , we have

[TABLE]

which gives that $A_{P}$ is uniformly integrable. Therefore, the supremum is attained due to the Dunford-Pettis and Weierstrass theorems; cf. also Theorem 3.6 in [15] and Theorem 8 in [14].

4 Axiomatizations of Orlicz premia

In this section, we first define Orlicz premia and derive the properties that are relevant for this paper. We then establish new axiomatizations of Orlicz premia and compare them with the one given in [4].

4.1 Orlicz premia: Definition and properties

The mathematical definition of the Orlicz premium corresponds to the Luxemburg norm on the Orlicz space

[TABLE]

given by

[TABLE]

where the Orlicz function $\Phi\colon[0,+\infty)\to[0,+\infty]$ satisfies $\Phi(0)=0$ , is nondecreasing, left-continuous, convex, and nontrivial in the sense that $\Phi(x)>0$ for some $x>0$ and $\Phi(x)<+\infty$ for some $x>0$ . We refer to [19] for the basic properties of Luxemburg norms under these assumptions. Notice that in the actuarial and financial mathematics literature (e.g., [28], [10], [11], [4], [5]) there are small differences in the set of properties required of $\Phi$ . In this paper, we are interested in possibly nonconvex Orlicz functions that may not satisfy $\Phi(0)=0$ ; conversely, we will limit the domain of $H_{\Phi}$ to $L^{\infty}_{+}$ . (When the function $\Phi(\cdot)$ is convex and satisfies several additional properties, it is often referred to as a Young function; as these conditions are not assumed in this paper, we refer to $\Phi(\cdot)$ as an Orlicz function.) This leads to the following definition.

Definition 23

Let $\Phi\colon[0,+\infty)\to\mathbb{R}$ satisfy:

a)

$\Phi(1)=1$ and $\lim_{x\to+\infty}\Phi(x)=+\infty$ .

b)

$\Phi$ is nondecreasing.

c)

$\Phi$ is left-continuous.

For $X\in L^{\infty}_{+}$ , the Orlicz premium is defined by

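$$H_{\Phi}(X):=\inf\left\{k>0:\operatorname{\mathbb{E}}\left[\Phi\left(\frac{X}{k}\right)\right]\leq 1\right\}.$$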
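The defining infimum can be evaluated numerically on a finite sample. The following is a minimal sketch, not part of the paper: it assumes a strictly positive sample and a strictly increasing $\Phi$ with $\Phi(1)=1$ , so that the empirical premium lies between the sample minimum and maximum and can be located by bisection. The function name `orlicz_premium` and the tolerance are illustrative choices.

```python
import math


def orlicz_premium(sample, phi, tol=1e-12):
    """Empirical Orlicz premium inf{k > 0 : mean(phi(y / k)) <= 1}.

    Assumes a strictly positive sample and a strictly increasing phi with
    phi(1) = 1, so that the premium lies in [min(sample), max(sample)].
    """
    def g(k):
        return sum(phi(y / k) for y in sample) / len(sample)

    lo, hi = min(sample), max(sample)  # g(hi) <= 1 <= g(lo)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) <= 1.0:
            hi = mid  # mid is feasible: the infimum is at or below mid
        else:
            lo = mid  # mid is infeasible: the infimum is above mid
    return hi


sample = [1.0, 2.0, 3.0, 4.0]
mean_premium = orlicz_premium(sample, lambda t: t)      # Phi(x) = x
norm_premium = orlicz_premium(sample, lambda t: t * t)  # Phi(x) = x^2
```

With $\Phi(x)=x$ the routine returns the sample mean, and with $\Phi(x)=x^{2}$ it returns the empirical $2$ -norm, in line with the examples discussed in Section 5.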

We recall the relevant properties of Orlicz premia in the following proposition.

Proposition 24

Let $\Phi\colon[0,+\infty)\to\mathbb{R}$ and $H_{\Phi}(X)$ be as in Definition 23. Then,

a)

$H_{\Phi}$ is monotone, positively homogeneous and satisfies $H_{\Phi}(1)=1$ .

b)

for each $X\in L^{\infty}_{++}$ , it holds that $\operatorname{\mathbb{E}}\left[\Phi(X/H_{\Phi}(X))\right]=1$ .

c)

if $\Phi$ is increasing, then

[TABLE]

d)

$H_{\Phi}$ is convex if and only if $\Phi$ is convex.

Proof. (a) The proof is standard. (b) Let $g(k):=\operatorname{\mathbb{E}}\left[\Phi(X/k)\right]$ . If $k_{n}\downarrow k$ then, since $\Phi$ is nondecreasing and left-continuous, $\Phi(X/k_{n})\uparrow\Phi(X/k)$ , so from the monotone convergence theorem it follows that $g$ is right-continuous. Since $H_{\Phi}(X)=\inf\{k\mid g(k)\leq 1\}$ , it follows that $g\left(H_{\Phi}(X)\right)=1$ , that is, $\operatorname{\mathbb{E}}\left[\Phi(X/H_{\Phi}(X))\right]=1$ . (c) The ‘only if’ part of the first implication follows by strict monotonicity of $\Phi$ . The ‘if’ part of the second implication is just the definition of $H_{\Phi}$ , while the ‘only if’ part follows from (b). (d) The proof of the ‘if’ part is standard. To prove the ‘only if’ part, assume first by contradiction that $\Phi$ is not midconvex, i.e., there exist $x_{1},x_{2}\geq 0$ such that $\Phi\left((x_{1}+x_{2})/2\right)>(\Phi(x_{1})+\Phi(x_{2}))/2$ . Then, there exist $z\in[0,+\infty)$ and $\lambda\in(0,1)$ such that

[TABLE]

Let $A,B,C\in\mathcal{F}$ be disjoint sets with $P(A)=\lambda$ , $P(B)=P(C)=\frac{1-\lambda}{2}$ and let

[TABLE]

From (15), we have $\operatorname{\mathbb{E}}[\Phi(Z)]>1$ and $\operatorname{\mathbb{E}}[\Phi(X)]=\operatorname{\mathbb{E}}[\Phi(Y)]<1$ , which contradicts the convexity of $H_{\Phi}$ . As a consequence, $\Phi$ is midconvex and since it is nondecreasing it is also convex.

A remarkable example in which $\Phi(0)\neq 0$ and $\Phi$ is not differentiable is the following.

Example 25 (Expectiles)

Let $0<q<1$ and let

[TABLE]

Then, $\Phi_{q}(0)=q$ , $\Phi_{q}(1)=1$ and $\Phi_{q}$ is convex if $1/2\leq q<1$ and concave if $0<q\leq 1/2$ . The corresponding Orlicz premium $H_{\Phi_{q}}$ satisfies

[TABLE]

which gives

[TABLE]

so $H_{\Phi_{q}}(X)$ coincides with the $q$ -expectile of $X$ ([34, 30]), denoted by $e_{q}(X)$ and defined for $X\in L^{1}(\Omega,\mathcal{F},P)$ by the condition

[TABLE]
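The identification of $H_{\Phi_{q}}$ with the $q$ -expectile can be checked numerically. The sketch below is illustrative only. It assumes the form $\Phi_{q}(x)=1+q(x-1)^{+}-(1-q)(x-1)^{-}$ recalled in Example 41 and the usual first-order condition $q\operatorname{\mathbb{E}}[(X-e)^{+}]=(1-q)\operatorname{\mathbb{E}}[(X-e)^{-}]$ for the expectile. The helper names `orlicz_premium`, `phi_q` and `expectile_gap` are hypothetical.

```python
def orlicz_premium(sample, phi, tol=1e-12):
    """Empirical inf{k > 0 : mean(phi(y / k)) <= 1}, found by bisection.

    Assumes a positive sample and a strictly increasing phi with phi(1) = 1.
    """
    def g(k):
        return sum(phi(y / k) for y in sample) / len(sample)

    lo, hi = min(sample), max(sample)
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if g(mid) <= 1.0:
            hi = mid
        else:
            lo = mid
    return hi


def phi_q(q):
    """Piecewise-linear Orlicz function with slope q above 1 and 1 - q below 1."""
    def phi(t):
        return 1.0 + q * max(t - 1.0, 0.0) - (1.0 - q) * max(1.0 - t, 0.0)
    return phi


def expectile_gap(sample, e, q):
    """q * mean((y - e)^+) - (1 - q) * mean((y - e)^-); zero at the q-expectile."""
    n = len(sample)
    up = sum(max(y - e, 0.0) for y in sample) / n
    down = sum(max(e - y, 0.0) for y in sample) / n
    return q * up - (1.0 - q) * down


sample = [1.0, 2.0, 3.0, 4.0, 10.0]
e_08 = orlicz_premium(sample, phi_q(0.8))  # empirical 0.8-expectile
e_05 = orlicz_premium(sample, phi_q(0.5))  # q = 1/2 gives the sample mean
```

For $q=1/2$ the two slopes coincide and the premium reduces to the sample mean, which serves as a simple sanity check.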

The following theorem shows that expectiles are the most general translation invariant convex Orlicz premia.

Theorem 26

If $\Phi$ is increasing and $H_{\Phi}$ is translation invariant and convex, then

[TABLE]

with $a>b$ and $b<1$ .

Proof. Let $h(x):=\Phi(x+1)-1$ . Then $h(x)=0\iff x=0$ and $h(x)>0\iff x>0$ , and

[TABLE]

Fix $x<0$ and $z>0$ . Let $p$ be a solution of the equation

[TABLE]

and let $Y=p\delta_{x+1}+(1-p)\delta_{z+1}$ . It follows that $H_{\Phi}(Y)=1$ , and from translation invariance it follows that $H_{\Phi}(Y+c)=c+1$ for each $c\in\mathbb{R}_{+}$ , which implies

[TABLE]

Let $\lambda=\frac{1}{c+1}$ and note that $0<\lambda\leq 1$ . By combining (17) with (16), we have

[TABLE]

which gives

[TABLE]

From the convexity of $h$ it follows that $\lambda h(x)-h(\lambda x)\geq 0$ and $\lambda h(z)-h(\lambda z)\geq 0$ , so from the last equality we get $h(\lambda z)=\lambda h(z)$ and $h(\lambda x)=\lambda h(x)$ for every $0\leq\lambda\leq 1$ and for each $x<0$ and $z>0$ , from which the thesis follows.

It is straightforward to check that $H_{\Phi}(X)=e_{q}(X)$ , with $q=a/(a+b)$ . Further, the same argument also shows that expectiles with $0<q\leq 1/2$ are the only concave translation invariant Orlicz premia. It is interesting to compare the theorem above with [28] and [27], where it is shown that a translation invariant Orlicz premium must be equal to the mean; their result, however, additionally assumes that the Orlicz function $\Phi$ is differentiable.

Definition 27

A function $f\colon[0,+\infty)\to\mathbb{R}$ is called GA-convex if, for each $\lambda\in(0,1)$ and $x,y>0$ , it holds that

[TABLE]

It is not difficult to verify that a nondecreasing and convex function on $(0,+\infty)$ is GA-convex. For completeness we report the proof in Lemma 49 in the Appendix. The converse does not hold: for example, $f(x)=\log x$ is increasing and GA-convex but not convex. We refer to [35] for further properties of GA-convex functions.

Proposition 28

Let $\Phi\colon[0,+\infty)\to\mathbb{R}$ and $H_{\Phi}(X)$ be as in Definition 23. Then $H_{\Phi}$ is geometrically convex if and only if $\Phi$ is GA-convex.

Proof. We first prove the ‘if’ part. Notice first that, for each $X\in L^{\infty}_{+}$ and each $k>H_{\Phi}(X)$ , it holds by definition that $\operatorname{\mathbb{E}}[\Phi(X/k)]\leq 1$ . Since $\Phi$ is nondecreasing and left-continuous an application of the monotone convergence theorem shows that $\operatorname{\mathbb{E}}[\Phi(X/H_{\Phi}(X))]\leq 1$ . Let now $X,Y\in L_{+}^{\infty}$ and $\lambda\in(0,1)$ . From the GA-convexity of $\Phi$ it follows that

[TABLE]

which implies

[TABLE]

which from positive homogeneity gives

[TABLE]

To prove the ‘only if’ part, we first assume by contradiction that $\Phi$ is not GA-midconvex, i.e., there exist $x_{1},x_{2}\geq 0$ such that $\Phi\left(\sqrt{x_{1}x_{2}}\right)>(\Phi(x_{1})+\Phi(x_{2}))/2.$ Then, there exist $z\in[0,+\infty)$ and $\lambda\in(0,1)$ such that

[TABLE]

Take disjoint sets $A,B,C\in\mathcal{F}$ with $P(A)=\lambda$ , $P(B)=P(C)=\frac{1-\lambda}{2}$ and let

[TABLE]

From (18), we have $\operatorname{\mathbb{E}}[\Phi(Z)]>1$ and $\operatorname{\mathbb{E}}[\Phi(X)]=\operatorname{\mathbb{E}}[\Phi(Y)]<1$ , which contradicts the geometric convexity of $H_{\Phi}$ . As a consequence, $\Phi$ is GA-midconvex. Since $\Phi$ is also nondecreasing, the thesis follows; this is seen as follows. By induction, GA-midconvexity implies that $\Phi$ is rationally GA-convex. Now let us take $x,y\geq 0$ and $\lambda\in(0,1)$ . Without loss of generality, assume that $x>y$ . Take $q\in\mathbb{Q}\cap[0,1]$ such that $q>\lambda$ . By monotonicity and rational GA-convexity of $\Phi$ , we have

[TABLE]

Since this inequality is valid for any $q\in\mathbb{Q}\cap[0,1]$ such that $q>\lambda$ , we can take the infimum over the set $\Lambda_{Q}:=\{q\in\mathbb{Q}:1\geq q>\lambda\}$ and obtain

[TABLE]

which gives the GA-convexity of $\Phi$ .

Since a nondecreasing and convex function is GA-convex, it follows that a convex Orlicz premium is also geometrically convex. The converse does not hold, an example being the logarithmic certainty equivalent, which is also the Orlicz premium corresponding to $\Phi(x)=1+\log(x)$ .
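The gap between convexity and GA-convexity can be illustrated on a finite grid. The sketch below is not from the paper. It assumes the standard GA-convexity inequality $f(x^{\lambda}y^{1-\lambda})\leq\lambda f(x)+(1-\lambda)f(y)$ and tests it at finitely many points, which can refute but never prove the property. The function names and the grid are illustrative.

```python
import math


def is_ga_convex(f, points, lambdas, tol=1e-9):
    """Check f(x^l * y^(1-l)) <= l*f(x) + (1-l)*f(y) on a finite grid."""
    return all(
        f(x ** l * y ** (1.0 - l)) <= l * f(x) + (1.0 - l) * f(y) + tol
        for x in points for y in points for l in lambdas
    )


def is_convex(f, points, lambdas, tol=1e-9):
    """Check f(l*x + (1-l)*y) <= l*f(x) + (1-l)*f(y) on a finite grid."""
    return all(
        f(l * x + (1.0 - l) * y) <= l * f(x) + (1.0 - l) * f(y) + tol
        for x in points for y in points for l in lambdas
    )


points = [0.25 * i for i in range(1, 41)]  # 0.25, 0.50, ..., 10.0
lambdas = [0.1 * i for i in range(1, 10)]  # 0.1, ..., 0.9


def log_phi(t):
    """Phi(x) = 1 + log(x): logarithmic certainty equivalent."""
    return 1.0 + math.log(t)


def square_phi(t):
    """Phi(x) = x^2: nondecreasing and convex on (0, +inf)."""
    return t * t


log_is_ga = is_ga_convex(log_phi, points, lambdas)
log_is_cx = is_convex(log_phi, points, lambdas)
square_is_ga = is_ga_convex(square_phi, points, lambdas)
square_is_cx = is_convex(square_phi, points, lambdas)
```

On this grid $\Phi(x)=1+\log(x)$ passes the GA-convexity check (with equality up to rounding) and fails the convexity check, while $\Phi(x)=x^{2}$ passes both.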

4.2 Axiomatization based on the properties of the multiplicative acceptance set

The following result is Theorem 2 in [4], which we report here for convenience.

Theorem 29

Let $\tilde{\rho}\colon\mathcal{M}_{1,c}(0,+\infty)\to\mathbb{R}$ be a law-invariant return risk measure and let $\mathcal{B}_{\tilde{\rho}}$ be the corresponding multiplicative acceptance set as in Definition 2 (now at the level of distributions). Assume that

a)

$\mathcal{B}_{\tilde{\rho}}$ and $\mathcal{B}^{c}_{\tilde{\rho}}$ are convex with respect to mixtures, i.e., for each $\lambda\in(0,1)$ , $F,G\in\mathcal{B}_{\tilde{\rho}}$ $\Rightarrow\lambda F+(1-\lambda)G\in\mathcal{B}_{\tilde{\rho}}$ , and similarly for $\mathcal{B}^{c}_{\tilde{\rho}}$ .

b)

$\mathcal{B}_{\tilde{\rho}}$ is $\tilde{\psi}$ -weakly closed for some gauge function $\tilde{\psi}$ .

c)

for each $0<\tilde{x}<1$ and $\tilde{y}>1$ , there exists $\alpha\in(0,1)$ such that

[TABLE]

Then there exists a nondecreasing function $\Phi$ that satisfies $\Phi(0)<1<\Phi(+\infty)$ such that $\tilde{\rho}(F)=H_{\Phi}(F)$ .

As we will see, the convexity with respect to mixtures (at the level of distributions) of the multiplicative acceptance set and its complement assumed in item (a) in the theorem above, is implied by the CxLS property.

4.3 Axiomatizations based on CxLS

Definition 30

A law-invariant functional $\rho$ has the CxLS property if

[TABLE]

for each $\gamma\in\mathbb{R}$ , $F,G\in\mathcal{M}_{1,c}$ and $\lambda\in(0,1)$ .

Lemma 31

Let $\tilde{\rho}$ be a law-invariant return risk measure with CxLS. Then $\mathcal{B}_{\tilde{\rho}}$ and $\mathcal{B}_{\tilde{\rho}}^{c}$ are convex with respect to mixtures.

Proof. Let us take $F,G\in\mathcal{B}_{\tilde{\rho}}^{c}$ and $\lambda\in(0,1)$ . Let $X,Y\in L_{++}^{\infty}$ such that the distributions of $X$ and $Y$ are $F$ and $G$ . Take $A\in\mathcal{F}$ such that $P(A)=\lambda$ and $X,Y$ and $A$ are independent. Choosing such $X,Y$ and $A$ is possible because we are working in an atomless probability space, see Lemma 3.1 in [17]. Without loss of generality, assume that $\tilde{\rho}(X)=k\tilde{\rho}(Y)$ for some $k\geq 1$ . Define $X^{\prime}=X/k$ and denote its distribution by $F^{\prime}$ . By positive homogeneity, we have $\tilde{\rho}(X^{\prime})=\tilde{\rho}(Y)$ . Then, $X1_{A}+Y1_{A^{c}}$ has distribution $\lambda F+(1-\lambda)G$ and $X^{\prime}1_{A}+Y1_{A^{c}}$ has distribution $\lambda F^{\prime}+(1-\lambda)G$ . Since $k\geq 1$ and $X\in L_{++}^{\infty}$ , we have

[TABLE]

By using the monotonicity and the CxLS property, we have

[TABLE]

which gives the convexity of $\mathcal{B}_{\tilde{\rho}}^{c}$ with respect to mixtures. Similarly, it can be proved that $\mathcal{B}_{\tilde{\rho}}$ is convex with respect to mixtures.

Theorem 32

Let $\tilde{\rho}\colon L_{++}^{\infty}\to(0,+\infty)$ be a law-invariant geometrically convex return risk measure with CxLS. Then there exists a nondecreasing GA-convex Orlicz function $\Phi\colon[0,+\infty)\to\mathbb{R}\cup\{+\infty\}$ such that $\tilde{\rho}(X)=H_{\Phi}(X)$ .

Proof. From the hypotheses and Lemma 4, it follows that the corresponding $\rho$ given by (2) is a convex law-invariant monetary risk measure with CxLS. From Theorem 3.10 in [17], there exists a convex function $\varphi\colon\mathbb{R}\to\mathbb{R}\cup\{+\infty\}$ such that $\rho(X)\leq 0$ if and only if $\operatorname{\mathbb{E}}[\varphi(X)]\leq 0$ . Letting $\Phi(x):=1+\varphi(\log(x))$ , it follows that

[TABLE]

From the convexity of $\varphi$ , it follows that for each $x,y>0$ and $\lambda\in(0,1)$ ,

[TABLE]

which shows the GA-convexity of $\Phi$ .

Notice the consistency between Proposition 28 and Theorem 32. Notice also that under the assumptions of Theorem 32, the function $\Phi$ is not necessarily convex as the example of logarithmic certainty equivalents shows.

Theorem 33

Let $\tilde{\rho}\colon L_{++}^{\infty}\to(0,+\infty)$ be a law-invariant convex return risk measure with CxLS. Then there exists a nondecreasing convex Orlicz function $\Phi\colon[0,+\infty)\to\mathbb{R}\cup\{+\infty\}$ such that $\tilde{\rho}(X)=H_{\Phi}(X)$ .

Proof. Since a positively homogeneous, monotone convex functional defined on $L_{++}^{\infty}$ is geometrically convex, it follows from Theorem 32 that there exists a nondecreasing GA-convex Orlicz function $\Phi\colon[0,+\infty)\to\mathbb{R}\cup\{+\infty\}$ such that $\tilde{\rho}(X)=H_{\Phi}(X)$ . Since $H_{\Phi}$ is convex only if $\Phi$ is convex, as has been shown in Proposition 24, the thesis follows.

4.4 Axiomatization based on identifiability

Definition 34

We say that a functional $\rho\colon\mathcal{M}_{1,c}(0,+\infty)\to(0,+\infty)$ is identifiable if there exists at least one identification function $I\colon(0,+\infty)\times(0,+\infty)\to\mathbb{R}$ that satisfies, for each $F\in\mathcal{M}_{1,c}(0,+\infty)$ ,

[TABLE]

Example 35

Let $\rho(F)=\operatorname{\mathbb{E}}[F]$ . Then $\rho$ is identifiable and two possible identification functions are $I_{1}(x,y)=y-x$ and $I_{2}(x,y)=y/x-1$ .

An identifiable functional has the CxLS property, since

[TABLE]

for each $\lambda\in(0,1)$ .

Theorem 36

Let $\tilde{\rho}\colon\mathcal{M}_{1,c}(0,+\infty)\to(0,+\infty)$ be an identifiable and positively homogeneous functional satisfying $\tilde{\rho}(1)=1$ . Then there exists $\Phi\colon[0,+\infty)\to\mathbb{R}$ satisfying $\Phi(1)=1$ and $\Phi(x)>1\iff x>1$ such that

[TABLE]

Furthermore, under these assumptions, if $\tilde{\rho}$ is monotone then $\Phi$ is nondecreasing and if $\tilde{\rho}$ is monotone and convex then $\Phi$ is nondecreasing and convex.

The proof of Theorem 36 is based on the following lemma, the proof of which is postponed to the Appendix.

Lemma 37

Let $\tilde{\rho}\colon\mathcal{M}_{1,c}(0,+\infty)\to(0,+\infty)$ be positively homogeneous with $\tilde{\rho}(1)=1$ and identifiable by the function $I(x,y)\colon(0,+\infty)\times(0,+\infty)\to\mathbb{R}$ . Then there exists $g\colon(0,+\infty)\to\mathbb{R}$ with $g(1)=0$ and $g(t)>0\iff t>1$ with

[TABLE]

Proof of Theorem 36. By applying Lemma 37 and letting $\Phi(t)=1+g(t)$ , equations (19) follow. To prove monotonicity of $\Phi$ , take $x_{1},x_{2}\in[0,+\infty)$ with $x_{1}<x_{2}$ . If $x_{1}<1<x_{2}$ , then $\Phi(x_{1})<1<\Phi(x_{2})$ so monotonicity is trivial. Assume that $1<x_{1}<x_{2}$ . Take any $z<1$ and set

[TABLE]

Take $A\in\mathcal{F}$ with $P(A)=\lambda$ and let $X_{1}:=z1_{A}+x_{1}1_{A^{c}}$ and $X_{2}:=z1_{A}+x_{2}1_{A^{c}}$ . By construction, $X_{1}\leq X_{2}$ and $\operatorname{\mathbb{E}}[\Phi(X_{2})]=1$ . From the monotonicity of $\tilde{\rho}$ it follows that $\tilde{\rho}(X_{1})\leq\tilde{\rho}(X_{2})=1$ , so $\operatorname{\mathbb{E}}[\Phi(X_{1})]\leq 1=\operatorname{\mathbb{E}}[\Phi(X_{2})]$ , from which the thesis follows. The proof in the case $x_{1}<x_{2}<1$ is similar. To prove convexity of $\Phi$ , we first show mid-convexity. Assume by contradiction that there exist $x_{1},x_{2}\in[0,+\infty)$ such that $\Phi((x_{1}+x_{2})/2)>(\Phi(x_{1})+\Phi(x_{2}))/2$ . Then there exist $z\in[0,+\infty)$ and $\lambda\in(0,1)$ such that

[TABLE]

Take disjoint sets $A,B,C$ such that $P(A)=\lambda$ , $P(B)=\frac{1-\lambda}{2}$ and $P(C)=\frac{1-\lambda}{2}$ and let $X_{1}=z1_{A}+x_{1}1_{B}+x_{2}1_{C}$ , $X_{2}=z1_{A}+x_{1}1_{C}+x_{2}1_{B}$ , and

[TABLE]

It holds that

[TABLE]

which implies $\tilde{\rho}(X_{1})<1$ and $\tilde{\rho}(X_{2})<1$ . Similarly,

[TABLE]

which implies $\tilde{\rho}(X)>1$ , contradicting convexity of $\tilde{\rho}$ . Therefore, $\Phi$ is mid-convex. Since a nondecreasing mid-convex function is convex, the thesis follows.

5 Consistent scoring functions for Orlicz premia

In this section, we show that Orlicz premia are elicitable and we study general families as well as specific examples of strictly consistent scoring functions.

5.1 Elicitability and strict consistency

We start by recalling a few standard definitions adapted to the class of strictly positive return risk measures.

Definition 38 (Elicitability and strictly consistent scoring functions)

A functional $\rho\colon L^{\infty}_{++}\to(0,+\infty)$ is elicitable if there exists a strictly consistent scoring function $S\colon(0,+\infty)\times(0,+\infty)\to[0,+\infty)$ satisfying $S(x,y)\geq 0$ and $S(x,y)=0$ if and only if $x=y$ such that, for each $Y\in L^{\infty}_{++}$ , it holds that

[TABLE]

The strictly consistent scoring function $S$ is said to be of the prediction error form if $S(x,y)=f(x-y)$ and of the relative error form if $S(x,y)=g(y/x)$ , where $f,g$ are functions of a single variable.

The following theorem provides a general, rich family of scoring functions that are strictly consistent with Orlicz premia.

Theorem 39

Let $\Phi$ be as in Definition 23, with $\Phi$ increasing. Let $h\colon(0,+\infty)\to(0,+\infty)$ be any integrable function. Then,

[TABLE]

is a strictly consistent scoring function for the Orlicz premium $H_{\Phi}$ .

Proof. Let $x>H_{\Phi}(Y)$ . We compute

[TABLE]

where in the last line we have used Fubini’s Theorem and item (c) of Proposition 24. The same argument shows that if $x<H_{\Phi}(Y)$ , then again

[TABLE]

so $S_{\Phi}$ is a strictly consistent scoring function for the Orlicz premium.

Two particularly interesting cases arise by taking $h(z)=1/z$ and $h(z)=1/z^{2}$ . In the first case, with the change of variable $y/z=e^{t}$ , we obtain

[TABLE]

where

[TABLE]

In the second case, with the change of variable $y/z=t$ , we obtain

[TABLE]

We now present a collection of examples of strictly consistent scoring functions for Orlicz premia, using Orlicz functions that are commonly adopted in the literature. As we will see, in some cases the general approach based on Theorem 39 or the specific forms in equations (5.1) and (24) recover scoring functions that are already known in the literature, whereas in many other cases new families of scoring functions are obtained. In several cases the corresponding Orlicz premium $H_{\Phi}$ admits an explicit expression, whereas in some other cases it has to be computed numerically. All our examples, including some not discussed below, are summarized in Table 1.

Example 40 (Mean)

Let $\Phi(x)=x$ . Then, $H_{\Phi}(Y)=\operatorname{\mathbb{E}}[Y]$ and, from (5.1) and (23), we obtain

[TABLE]

which is an alternative scoring function for the mean. From the classical result of [41], as recalled e.g., in Theorem 7 of [26], the most general class of strictly consistent scoring functions for the mean belongs to the family of Bregman functions, of the form

[TABLE]

where $\phi$ is a convex function with subgradient $\phi^{\prime}$ . The scoring function given in equation (25) arises if $\phi(x)=-\log x$ . This scoring function is known as the quasi-likelihood (QLIKE) scoring function in the econometrics literature, and it is of common use in assessing forecasts of nonnegative quantities such as volatility (see e.g., [38] and the references therein).
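As a quick numerical illustration (a sketch, not taken from the paper), one can check that the average QLIKE score over a sample is minimized at the sample mean. The closed form used below, $S(x,y)=y/x-\log(y/x)-1$ , is what the standard Bregman form $\phi(y)-\phi(x)-\phi^{\prime}(x)(y-x)$ yields for $\phi(x)=-\log x$ ; this normalization is an assumption, and the one displayed in the paper may differ by constants. The grid search and the helper names are illustrative.

```python
import math


def qlike(x, y):
    """QLIKE score of forecast x against outcome y (both strictly positive)."""
    return y / x - math.log(y / x) - 1.0


def mean_score(x, sample, score):
    """Average score of the constant forecast x over the sample."""
    return sum(score(x, y) for y in sample) / len(sample)


sample = [0.5, 1.0, 1.5, 3.0]                # sample mean is 1.5
grid = [0.01 * i for i in range(50, 301)]    # candidate forecasts 0.50, ..., 3.00
best = min(grid, key=lambda x: mean_score(x, sample, qlike))
```

The minimizer on the grid coincides with the sample mean, and `qlike(x, x)` is zero, as required of a strictly consistent scoring function for the mean.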

Example 41 (Expectiles)

Let $\Phi_{q}(x)=1+q(x-1)^{+}-(1-q)(x-1)^{-}$ with $0<q<1$ , as in Example 25. The corresponding Orlicz premium is the $q$ -expectile and, from (5.1) and (23), the corresponding scoring function is given by

[TABLE]

The class of strictly consistent scoring functions for expectiles has been characterized in Theorem 10 of [26], as an asymmetric extension of Bregman functions, defined by

[TABLE]

and as in the case of the mean our scoring function (26) corresponds to the case $\phi(x)=-\log x$ .

Example 42 ( $p$ -norms)

Let $\Phi(x)=x^{p}$ with $p\geq 1$ . Then, the Orlicz premium $H_{\Phi}$ is given by

[TABLE]

and the corresponding scoring function from (5.1) and (23) takes the form

[TABLE]

In view of (25), which arises as a special case when $p=1$ , we will refer to this novel scoring function as the PQLIKE scoring function.

When $0<p<1$ , the resulting Orlicz premium is no longer a norm, but the scoring function is still valid. Taking a (suitably normalized and scaled) limit for $p\rightarrow 0$ , such that $\Phi(x)=1+\log(x)$ arises, the Orlicz premium $H_{\Phi}$ is the logarithmic certainty equivalent given by

[TABLE]

The corresponding scoring function from (5.1) and (23) takes the form

[TABLE]

Example 43 (Mean-variance)

Let $\Phi(x)=\lambda x+(1-\lambda)x^{2}$ with $0\leq\lambda\leq 1$ as in Section 5.1 of [28]. Then, the Orlicz premium $H_{\Phi}$ is given by

[TABLE]

and the corresponding scoring function from (5.1) and (23) is

[TABLE]

Example 44

Let $\Phi(x)=\frac{e^{\alpha x}-1}{e^{\alpha}-1}$ with $\alpha>0$ as in Section 5.2 of [28]. The corresponding Orlicz premium $H_{\Phi}$ does not in general admit an explicit expression. It is the solution of the equation $f_{Y}(\alpha/H_{\Phi})=e^{\alpha},$ where $f_{Y}(t)=\operatorname{\mathbb{E}}[\exp(tY)]$ is the moment generating function of $Y$ . For example, if $Y$ has a Gamma distribution with shape parameter $\theta>0$ and rate parameter $\gamma>0$ , we have

[TABLE]

The corresponding scoring function from (24) is given by

[TABLE]

5.2 Mixture representations

It is clear that a scoring function of the form (21) depends on the choice of the function $h$ . Hence, the ranking of competing forecasts may depend on this choice, in particular in finite samples and under model misspecification (see e.g., [39]). To remedy the dependence of the ranking on the specific choice of the strictly consistent scoring function, [20] develop a method to compare forecasts simultaneously with respect to a class of strictly consistent scoring functions by considering so-called Murphy diagrams. This method relies on the availability of a mixture representation of the strictly consistent scoring functions under consideration, in terms of elementary scoring functions depending on a low-dimensional parameter. Mixture representations for the class of strictly consistent scoring functions for quantiles and expectiles have been given in [20]. A mixture representation of strictly consistent scoring functions for the triplet of Range Value-at-Risk and its two associated Value-at-Risks has been given in [22]. In the following theorem, we provide such a mixture representation for our new family of scoring functions in (21).

Theorem 45

Any strictly consistent scoring function for the Orlicz premium $H_{\Phi}$ of the form (21) admits a representation of the form

[TABLE]

for a positive measure $H$ , where $S_{z}(x,y)=\left\lvert\Phi\left(\frac{y}{z}\right)-1\right\rvert 1_{\left\{x\leq z<y\right\}\cup\left\{y\leq z<x\right\}}$ . Conversely, for any choice of the positive measure $H$ , we obtain a strictly consistent scoring function of the form (21) for the Orlicz premium $H_{\Phi}$ .

Proof. By using (21), we have

[TABLE]

where $\mathrm{d}H(z)=h(z)\,\mathrm{d}z$ and $S_{z}(x,y)=\left\lvert\Phi\left(\frac{y}{z}\right)-1\right\rvert 1_{\left\{x\leq z<y\right\}\cup\left\{y\leq z<x\right\}}$ . Note that, since the function $h$ in (21) is strictly positive, the Riemann integral $\int_{0}^{+\infty}S_{z}(x,y)\,\mathrm{d}H(z)$ is well-defined.
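The mixture representation can be checked numerically by integrating the elementary scores $S_{z}$ against $h(z)\,\mathrm{d}z$ . The sketch below is illustrative and rests on stated assumptions: it takes $\Phi(t)=t^{p}$ and $h(z)=1/z$ , uses a midpoint rule, and compares the result with the closed form $\frac{1}{p}\left((y/x)^{p}-1\right)-\log(y/x)$ obtained here by integrating by hand. That closed form reduces to QLIKE for $p=1$ , but its normalization is not taken from the paper. The names `mixture_score` and `pqlike` are hypothetical.

```python
import math


def mixture_score(x, y, phi, h, n=20000):
    """Midpoint-rule value of the integral of S_z(x, y) * h(z) over z > 0.

    S_z(x, y) = |phi(y / z) - 1| on {x <= z < y} or {y <= z < x}, so only z
    between min(x, y) and max(x, y) contributes to the integral.
    """
    lo, hi = min(x, y), max(x, y)
    if lo == hi:
        return 0.0
    step = (hi - lo) / n
    total = 0.0
    for i in range(n):
        z = lo + (i + 0.5) * step
        total += abs(phi(y / z) - 1.0) * h(z)
    return total * step


def pqlike(x, y, p):
    """Hand-integrated closed form for Phi(t) = t^p and h(z) = 1/z."""
    r = y / x
    return (r ** p - 1.0) / p - math.log(r)


numeric_p1 = mixture_score(2.0, 1.0, lambda t: t, lambda z: 1.0 / z)
numeric_p2 = mixture_score(0.5, 1.5, lambda t: t * t, lambda z: 1.0 / z)
```

Both orderings of forecast and outcome are covered: the first call has $x>y$ and the second has $x<y$ .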

As a corollary, we provide a mixture representation for the scoring functions of $p$ -norms.

Corollary 46

Any strictly consistent scoring function of the form (21) with $\Phi(x)=x^{p}$ , $p\geq 1$ , admits a representation of the form:

[TABLE]

for a positive measure $H$ , where $S^{p}_{z}(x,y)=\left\lvert y^{p}-z^{p}\right\rvert 1_{\left\{x\leq z<y\right\}\cup\left\{y\leq z<x\right\}}$ .

Proof. From Theorem 45 with $\Phi(x)=x^{p}$ , we obtain

[TABLE]

where $\mathrm{d}\widetilde{H}(z):=\frac{1}{z^{p}}\,\mathrm{d}H(z)$ .

We conduct two simulation experiments to illustrate how one can use Theorem 45 to rank competing forecasts. In particular, we will generate Murphy diagrams for logarithmic certainty equivalents, $p$ -norms and expectiles by using the corresponding elementary scoring functions.

Example 47

We first suppose that the true distribution of the outcome variable $Y$ is given by $\log(Y)\,|\,\mu\sim\mathcal{N}(\mu,\sigma_{Y}^{2})$, where $\mu\sim\mathcal{N}(0,\sigma_{\mu}^{2})$, and we take $\sigma_{Y}=\sigma_{\mu}=0.2$. We consider four forecasters, referred to as perfect, unconditional, unfocused and sign-reversed, similar to [25] and [20] but suitably modified to the current setting.

The perfect forecaster issues the true distribution of the outcome $Y$ as predictive distribution. The corresponding point forecasts of the logarithmic certainty equivalent (LCE) and the $p$-norm are therefore $\exp(\mu)$ and $\exp(\mu+\sigma_{Y}^{2}p/2)$.

The unconditional forecaster has no knowledge of $\mu$ and issues the unconditional distribution of $\log(Y)$ as predictive distribution: $\mathcal{N}(0,\sigma_{\mu}^{2}+\sigma_{Y}^{2})$. The corresponding point forecasts of the LCE and the $p$-norm are therefore $\exp(0)=1$ and $\exp((\sigma_{\mu}^{2}+\sigma_{Y}^{2})p/2)$.

The remaining two forecasters, unfocused and sign-reversed, know $\mu$, but their predictive distributions fail to be ideal. The unfocused forecaster issues a mixture distribution as predictive distribution of $\log(Y)$, involving an independent random variable $\tau$ that takes the values $0.2$ and $-0.2$ each with probability $1/2$. This leads to $\tfrac{1}{2}(\mathcal{N}(\mu,\sigma_{Y}^{2})+\mathcal{N}(\mu+\tau,\sigma_{Y}^{2}))$, yielding $\exp(\mu+\tau/2)$ and $\exp(\mu+\tau/2+\sigma_{Y}^{2}p/4)$ as the corresponding forecasts of the LCE and the $p$-norm. The sign-reversed forecaster issues a predictive distribution of $\log(Y)$ with the sign of $\mu$ flipped: $\mathcal{N}(-\mu,\sigma_{Y}^{2})$. The corresponding point forecasts are therefore $\exp(-\mu)$ and $\exp(-\mu+\sigma_{Y}^{2}p/2)$. The point forecasts generated by the four predictive distributions are summarized in Table 2.

Using $10\mathord{,}000$ simulations of sample size $1\mathord{,}000$ each, we obtain the Murphy diagrams displayed in Figure 2 for the LCE and the $p$-norm with $p=1,2,3$. As can be seen in Figure 2, the perfect forecaster dominates the other forecasters for the LCE and all $p$-norms considered, as expected. Although not clearly visible, the expected scores of the other three forecasters intersect in all four cases, so that none of them dominates another.
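The computation behind such a Murphy diagram for the $p$-norm can be sketched as follows. This is an illustrative reconstruction under stated assumptions, not the authors' simulation code: the function names, the seed, the sample size and the grid of thresholds $z$ are hypothetical, a single sample is drawn instead of repeated simulations, and the point forecasts are taken verbatim from the formulas stated in this example. The elementary score is $S^{p}_{z}$ from Corollary 46.

```python
import numpy as np


def pnorm_elementary_score(z, x, y, p):
    """S^p_z(x, y) = |y^p - z^p| if x <= z < y or y <= z < x, else 0."""
    between = ((x <= z) & (z < y)) | ((y <= z) & (z < x))
    return np.where(between, np.abs(y**p - z**p), 0.0)


def simulate_forecasts(n, p, sigma_y=0.2, sigma_mu=0.2, seed=0):
    """Draw outcomes and the four p-norm point forecasts of the example."""
    rng = np.random.default_rng(seed)
    mu = rng.normal(0.0, sigma_mu, n)
    y = np.exp(rng.normal(mu, sigma_y))       # log(Y) | mu ~ N(mu, sigma_y^2)
    tau = rng.choice([0.2, -0.2], size=n)     # independent of mu and Y
    forecasts = {
        "perfect": np.exp(mu + sigma_y**2 * p / 2),
        "unconditional": np.full(n, np.exp((sigma_mu**2 + sigma_y**2) * p / 2)),
        "unfocused": np.exp(mu + tau / 2 + sigma_y**2 * p / 4),
        "sign-reversed": np.exp(-mu + sigma_y**2 * p / 2),
    }
    return y, forecasts


def murphy_curves(y, forecasts, p, z_grid):
    """Average elementary score of each forecaster at every threshold z."""
    z = np.asarray(z_grid, dtype=float)[:, None]   # shape (len(z_grid), 1)
    return {
        name: pnorm_elementary_score(z, x[None, :], y[None, :], p).mean(axis=1)
        for name, x in forecasts.items()
    }
```

Plotting each curve returned by `murphy_curves` against `z_grid` gives one panel of a Murphy diagram; a forecaster dominates another when its curve lies below the other's for every $z$.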

Example 48

For $q$ -expectiles, from Theorem 45 with $\Phi_{q}(x)=1+q(x-1)^{+}-(1-q)(x-1)^{-}$ , $0<q<1$ , we have

[TABLE]

where $\mathrm{d}\widetilde{H}(z):=\frac{1}{z}\mathrm{d}H(z)$ . Therefore, the elementary scoring function for expectiles can be expressed as

[TABLE]

Suppose now that the true distribution of the outcome variable $Y$ is exponential, $Y|\lambda\sim\exp(\lambda)$, where $\log(\lambda)\sim\mathcal{N}(0,\sigma^{2}_{\lambda})$, and we take $\sigma_{\lambda}=0.2$. We consider three forecasters: perfect, unfocused and mean-reversed, similar to Example 47. The perfect forecaster issues the true distribution of $Y$ as predictive distribution. The unfocused forecaster issues $\exp(\tau\lambda)$ as predictive distribution, involving an independent random variable $\tau$ that takes the values $5/4$ and $4/5$ each with probability $1/2$. The mean-reversed forecaster issues a predictive distribution with the mean reversed: $\exp(1/\lambda)$. The point forecasts generated by the three predictive distributions are displayed in Table 3.

Using $10\mathord{,}000$ simulations of sample size $1\mathord{,}000$ each, we obtain the Murphy diagrams displayed in Figure 3 for $q=0.5,0.7,0.9,0.95$. As Figure 3 shows, the perfect forecaster dominates the other forecasters, as expected. The unfocused and mean-reversed forecasters cannot be ordered, because their expected scores intersect.
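The two ingredients of this experiment, an expectile point forecast for an exponential distribution and an elementary score for expectiles, can be sketched as below. Several assumptions are made here and are not confirmed by the text: $\lambda$ is read as a rate parameter, the expectile is found by bisection on the first-order condition $q\,E[(Y-e)^{+}]=(1-q)\,E[(e-Y)^{+}]$, and the elementary score is taken to be $z\,S_{z}(x,y)$ with $S_{z}$ from Theorem 45 and $\Phi=\Phi_{q}$, which may differ from the paper's displayed expression by normalization. All function names are hypothetical.

```python
import math


def phi_q(t, q):
    """Phi_q(t) = 1 + q (t - 1)^+ - (1 - q) (t - 1)^-."""
    return 1.0 + q * max(t - 1.0, 0.0) - (1.0 - q) * max(1.0 - t, 0.0)


def expectile_elementary_score(z, x, y, q):
    """z * S_z(x, y) for Phi_q: asymmetric absolute deviation |y - z| on the indicator set."""
    if not ((x <= z < y) or (y <= z < x)):
        return 0.0
    return q * (y - z) if y > z else (1.0 - q) * (z - y)


def expectile_exponential(q, rate=1.0, tol=1e-12):
    """q-expectile of an exponential distribution with the given rate.

    For Y ~ Exp(1): E[(Y - e)^+] = exp(-e) and E[(e - Y)^+] = e - 1 + exp(-e).
    The result for Exp(1) is divided by the rate (positive homogeneity).
    """
    def g(e):  # increasing in e, zero at the q-expectile of Exp(1)
        return (1.0 - q) * (e - 1.0 + math.exp(-e)) - q * math.exp(-e)

    lo, hi = 0.0, 1.0
    while g(hi) < 0.0:          # expand the bracket until it contains the root
        hi *= 2.0
    while hi - lo > tol:        # plain bisection
        mid = 0.5 * (lo + hi)
        if g(mid) < 0.0:
            lo = mid
        else:
            hi = mid
    return 0.5 * (lo + hi) / rate
```

For $q=0.5$ the expectile coincides with the mean, so `expectile_exponential(0.5, rate)` should return approximately `1 / rate`.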

6 Appendix

Lemma 49

Let $f\colon(0,+\infty)\to\mathbb{R}$ be nondecreasing and convex. Then, $f$ is GA-convex.

Proof. For $x,y>0$ and $\lambda\in(0,1)$, the AM-GM inequality gives $\lambda x+(1-\lambda)y\geq x^{\lambda}y^{1-\lambda}$. Since $f$ is nondecreasing and convex, it follows that

$$f\left(x^{\lambda}y^{1-\lambda}\right)\leq f\left(\lambda x+(1-\lambda)y\right)\leq\lambda f(x)+(1-\lambda)f(y),$$

which gives the thesis.
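The inequality in Lemma 49 can be checked numerically on random inputs. The sketch below is only an illustration: it assumes GA-convexity means $f(x^{\lambda}y^{1-\lambda})\leq\lambda f(x)+(1-\lambda)f(y)$, and the function name, sampling ranges and tolerance are choices made here. A passing check is evidence on the sampled points, not a proof.

```python
import numpy as np


def is_ga_convex_on_samples(f, n=10000, seed=0, tol=1e-9):
    """Test f(x^lam * y^(1-lam)) <= lam f(x) + (1-lam) f(y) on random draws."""
    rng = np.random.default_rng(seed)
    x = rng.uniform(0.01, 10.0, n)
    y = rng.uniform(0.01, 10.0, n)
    lam = rng.uniform(0.0, 1.0, n)
    lhs = f(x**lam * y ** (1.0 - lam))   # f at the weighted geometric mean
    rhs = lam * f(x) + (1.0 - lam) * f(y)
    return bool(np.all(lhs <= rhs + tol))
```

For instance, $f(t)=t^{2}$ is nondecreasing and convex on $(0,+\infty)$ and passes the check, whereas $f(t)=-t$ is convex but decreasing and fails it, which illustrates why monotonicity is used in the proof.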

Proof of Lemma 37. Since $\tilde{\rho}$ is law invariant and positively homogeneous with $\tilde{\rho}(1)=1$ , it follows that $\tilde{\rho}(\delta_{y})=y$ , for each $y>0$ . From identifiability, it follows that

[TABLE]

For each $0<y_{1}<x<y_{2}$ , define

[TABLE]

Since

[TABLE]

it follows that $\tilde{\rho}(\bar{F})=x$ . From the positive homogeneity of $\tilde{\rho}$ , it follows that for each $\lambda>0$ ,

[TABLE]

and from identifiability

[TABLE]

which gives

[TABLE]

so we can conclude that

[TABLE]

and letting $\lambda=1/x$ we find that

[TABLE]

for each $0<y_{1}<x<y_{2}$ .

We now want to prove that

[TABLE]

with $h(x)>0$ , from which the thesis follows immediately. We consider two cases.

If $y>x$ , we set $y_{1}=x/2$ and $y_{2}=y$ in (30), obtaining

[TABLE]

which gives

[TABLE]

If instead $y<x$ , we set $y_{1}=y$ and $y_{2}=2x$ in (30), obtaining

[TABLE]

which gives

[TABLE]

Notice also that from (30) it follows that

[TABLE]

so combining (32) and (33) it follows that (31) is satisfied with

[TABLE]

from which the thesis follows.

Bibliography

[1] Aliprantis, C. D., Border, K. C. (2006). Infinite Dimensional Analysis. Third edition, Springer Verlag, Berlin.
[2] Bellini, F., Bignozzi, V. (2015). On elicitable risk measures. Quantitative Finance 15(5), 725-733.
[3] Bellini, F., Rosazza Gianin, E. (2008). On Haezendonck risk measures. Journal of Banking and Finance 32(6), 986-994.
[4] Bellini, F., Laeven, R. J. A., Rosazza Gianin, E. (2018). Robust return risk measures. Mathematics and Financial Economics 12(1), 5-32.
[5] Bellini, F., Laeven, R. J. A., Rosazza Gianin, E. (2021). Dynamic robust Orlicz premia and Haezendonck-Goovaerts risk measures. European Journal of Operational Research 291(2), 438-446.
[6] Ben-Tal, A., Teboulle, M. (1986). Expected utility, penalty functions, and duality in stochastic nonlinear programming. Management Science 32(11), 1445-1466.
[7] Ben-Tal, A., Teboulle, M. (2007). An old-new concept of convex risk measures: The optimized certainty equivalent. Mathematical Finance 17(3), 449-476.
[8] Biagini, S., Frittelli, M. (2008). A unified framework for utility maximization problems: An Orlicz space approach. The Annals of Applied Probability 18(3), 929-966.


We are very grateful to seminar participants at the University of Vienna and the University of Amsterdam for their comments and suggestions.
