Stochastic derivative estimation for max-stable random fields
Erwan Koch, Christian Y. Robert

TL;DR
This paper develops and validates stochastic derivative estimation methods for max-stable random fields, crucial in spatial extremes, with applications demonstrated in risk and dependence measures for insurance.
Contribution
It provides tractable conditions for applying likelihood ratio and infinitesimal perturbation analysis methods to max-stable fields like Brown–Resnick and Smith, which was previously complex.
Findings
Both LRM and IPA perform well in simulations.
Conditions ensure the validity of derivative estimations.
Application to insurance risk measures demonstrates practical utility.
Abstract
We consider expected performances based on max-stable random fields and we are interested in their derivatives with respect to the spatial dependence parameters of those fields. Max-stable fields, such as the Brown--Resnick and Smith fields, are very popular in spatial extremes. We focus on the two most popular unbiased stochastic derivative estimation approaches: the likelihood ratio method (LRM) and the infinitesimal perturbation analysis (IPA). LRM requires the multivariate density of the max-stable field to be explicit, and IPA necessitates the computation of the derivative with respect to the parameters for each simulated value. We propose convenient and tractable conditions ensuring the validity of LRM and IPA in the cases of the Brown--Resnick and Smith field, respectively. Obtaining such conditions is intricate owing to the very structure of max-stable fields. Then we focus on…
| 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 0.048 | 0.046 | 0.131 | 0.126 | 0.061 | 0.058 | 0.167 | 0.158 | 0.784 | 0.797 | |
| 0.074 | 0.074 | -0.044 | -0.044 | 0.122 | 0.117 | -0.072 | -0.070 | 0.610 | 0.626 | |
| 0.087 | 0.089 | -0.439 | -0.452 | 0.306 | 0.302 | -1.552 | -1.529 | 0.283 | 0.296 | |
| 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | 2 | 3 | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.174 | 0.170 | 0.06 | 0.058 | 0.020 | 0.020 | 0.242 | 0.232 | 0.083 | 0.080 | 0.029 | 0.027 | 0.717 | 0.732 | |
| 0.233 | 0.243 | 0.05 | 0.053 | 0.011 | 0.011 | 1.669 | 1.655 | 0.362 | 0.359 | 0.078 | 0.078 | 0.139 | 0.147 | |
| Brown–Resnick | CLIC | ||||||
|---|---|---|---|---|---|---|---|
| 6020169 | |||||||
| Smith | CLIC | ||||||
| 6084169 |
| 0.039 | 0.106 | 0.046 | 0.126 | 0.840 | |
| 0.068 | -0.041 | 0.100 | -0.059 | 0.685 | |
| 0.096 | -0.486 | 0.278 | -1.409 | 0.345 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInsurance, Mortality, Demography, Risk Management · Insurance and Financial Risk Management · Financial Risk and Volatility Modeling
Stochastic derivative estimation for max-stable random fields
Erwan Koch111EPFL, Institute of Mathematics, EPFL SB MATH MATH-GE, MA B1 457 (Bâtiment MA), Station 8, 1015 Lausanne, Switzerland.
Email: [email protected] Christian Y. Robert222Laboratory in Finance and Insurance - LFA CREST - Center for Research in Economics and Statistics, ENSAE, Paris, France.
Email: [email protected]
(November 3, 2020)
Abstract
We consider expected performances based on max-stable random fields and we are interested in their derivatives with respect to the spatial dependence parameters of those fields. Max-stable fields, such as the Brown–Resnick and Smith fields, are very popular in spatial extremes. We focus on the two most popular unbiased stochastic derivative estimation approaches: the likelihood ratio method (LRM) and the infinitesimal perturbation analysis (IPA). LRM requires the multivariate density of the max-stable field to be explicit, and IPA necessitates the computation of the derivative with respect to the parameters for each simulated value. We propose convenient and tractable conditions ensuring the validity of LRM and IPA in the cases of the Brown–Resnick and Smith field, respectively. Obtaining such conditions is intricate owing to the very structure of max-stable fields. Then we focus on risk and dependence measures, which constitute one of the several frameworks where our theoretical results can be useful. We perform a simulation study which shows that both LRM and IPA perform well in various configurations, and provide a real case study that is valuable for the insurance industry.
Key words: Infinitesimal Perturbation Analysis; Likelihood Ratio Method; Max-stable random fields; Monte Carlo computation, Risk assessment.
1 Introduction
Sensitivity analysis (SA) is essential in many fields and constitutes an active research area. It consists in the quantitative assessment of how changes in a specific model parameter impact the so-called expected performance. The expected performance is the expectation of a function of an underlying stochastic model, and SA aims at computing its derivatives with respect to the model parameters. If the expected performance is excessively sensitive to some critical parameters, it warns the decision maker not to be overly confident in the obtained value, especially if there is a huge uncertainty on these parameters. It also informs him/her that energy should be invested to find more reliable estimates of these parameters (e.g., by developing new inference methods). By the delta method, the knowledge of the derivative of the expected performance with respect to a parameter of interest allows one to translate confidence bounds for this parameter into confidence bounds for the expected performance. Finally, if the aim is to maximize the expected performance with respect to some parameter, knowing its derivative is often required as most optimization algorithms are based on gradient methods.
To the best of our knowledge, SA was never considered for expected performances based on extreme-value models, although this is of high necessity. Indeed, such models are by essence generally fitted using a small amount of data, resulting in a large uncertainty in the parameter estimates. For a practical introduction about extreme-value theory, see, e.g., Coles, (2001). Extreme-value models are valuable in many applications. Max-stable random fields (e.g., de Haan,, 1984; de Haan and Ferreira,, 2006), which constitute an extension of univariate extreme-value theory to the infinite-dimensional (e.g., spatial) setting, are, for instance, well suited for the modelling of the pointwise maxima of variables having a spatial extent such as environmental ones. Such a modelling is useful for quantifying the impact of natural disasters, which is essential for insurance companies and civil authorities, especially in a context of climate change. However, owing to the complex structure of max-stable fields, their multivariate density is in general not available for more than two or three sites, making estimation highly non-trivial. Commonly used estimators are not asymptotically efficient (in the sense of the Cramér-Rao bound), which, combined with the uncertainty stemming from the scarcity of data mentioned above, can lead to estimates that are far from the true value. It is therefore essential to assess the sensitivity of any expected performance based on max-stable fields with respect to the parameters. The purpose of this paper is precisely to provide tractable conditions enabling such an assessment and to illustrate this in a case where the expected performance is a risk or dependence measure.
We focus on expected performances that do not have any closed-form expression, entailing that assessment of their derivative cannot be achieved using differentiation or finite differences based on the analytical expression, but requires simulation-based approaches. There are essentially three classic stochastic derivative estimation methods. The first one is based on finite-differences. Corresponding estimators involve a bias-variance trade-off and require simulating at multiple parameter values, making this methodology less powerful than the two others. The second approach, referred to as likelihood ratio method (LRM), is based on the derivatives of the density function associated with the simulation model. The third one, called infinitesimal perturbation analysis (IPA), relies on computing the derivative with respect to the parameter of each simulated value (and then averaging them); IPA is thus also referred to as a sample path differentiation method. Contrary to IPA where the parameter is considered as purely structural, in LRM it is viewed as a parameter of the probability measure. LRM requires the density function to be explicit and differentiable and, on the other hand, IPA requires differentiability of the sample paths with respect to the parameter, which is a quite strong condition. It is argued in Glasserman, (2003, Section 7.4) that LRM estimators often have a larger variance than IPA estimators, explaining why IPA is generally considered as the best derivative estimator. A huge literature is dedicated to LRM and IPA; excellent general references are in particular Asmussen and Glynn, (2010), Chapter VII, and Glasserman, (2003), Chapter 7. Both approaches are widely applied to many fields, for instance to finance where many risk hedging strategies involve computing sensitivities of option prices to the underlying assets’ prices and other parameters (the so-called Greeks). Broadie and Glasserman, (1996) and Chen and Fu, (2001) use IPA for option pricing and mortgage-backed securities, respectively. Motivated by financial applications, Glasserman and Liu, (2010) investigate LRM in the case where the relevant densities are only known through their characteristic functions or Laplace transforms.
To our knowledge, SA, and thus LRM and IPA, have not yet been considered for expected performances based on extreme-value models, a fortiori max-stable fields. Among both methods, none can be used for all max-stable fields systematically. LRM is often ruled out when the random performance involves values of the field at more than two or three sites since the multivariate density is not explicitly available. However, there are notable exceptions such as the Brown–Resnick random field (Brown and Resnick,, 1977; Kabluchko et al.,, 2009) or the extremal t random field (Opitz,, 2013). The main technical challenge to apply LRM is to show that interchange between derivative (with respect to the parameter) and integration over the space of possible values of the field at the sites considered is feasible. On the other hand, IPA involves showing that the paths of the random field are differentiable with respect to the spatial dependence parameters, which is often arduous as max-stable fields arise as the pointwise maxima over an infinite number of latent random fields. Nevertheless, when the paths of these latent fields are sufficiently smooth, such as for the Smith random field (Smith,, 1990), it is possible. The Smith field belongs to the class of Brown–Resnick fields, but, being a border case, its properties in terms of availability of a closed-form multivariate density and smoothness of the paths are very different than for most Brown–Resnick fields.
Our main theoretical contribution consists in proposing convenient conditions ensuring the validity of LRM and IPA for estimating the derivatives of expected performances based on the Brown–Resnick field and the Smith field, respectively. The Brown–Resnick field is very flexible and is one of the most appropriate models for environmental data among currently known max-stable models (e.g., Davison et al.,, 2012). We focus on the derivatives with respect to the spatial dependence parameters, since showing the validity of LRM or IPA for estimating the derivatives with respect to the marginal parameters (those of the generalized extreme-value distribution) is easy (under mild assumptions). Our conditions are as tractable as possible; e.g., in the case of IPA for the Smith field, the condition involves the derivative of the expected performance with respect to the field values but not to the spatial dependence parameters. Obtaining such practical conditions is difficult owing to the complex structure of max-stable fields.
Our second contribution pertains to the context of risk assessment in insurance or finance. Our results are insightful for studying the sensitivity of risk or dependence measures based, e.g., on insured losses triggered by extreme events having a spatial extent (typically such as weather events), or the sensitivity of prices of event-linked securities such as catastrophe bonds. Indeed, many risk or dependence measures as well as prices can be written as expected performances. After providing a general discussion, we focus on a specific dependence measure for insured losses due to extreme wind speeds. This measure has analytical derivatives for both the Brown–Resnick and the Smith fields, and hence allows us to compare the LRM and IPA estimates with the true values. We implement LRM and IPA (adapting when necessary existing simulation algorithms) and perform a thorough numerical study which shows that both estimation methods are very accurate. Finally, we propose a concrete application which is valuable for actuarial practice. Using reanalysis wind speed data, we study the sensitivities of the aforementioned dependence measure in an area centred over the Ruhr region in Germany. This application highlights the importance of assessing the sensitivity of risk or dependence measures in real practice.
The remainder of the paper is organized as follows. Section 2 first recalls useful results about max-stable fields and presents the concept of expected performance. Then it introduces LRM and IPA with their associated examples of max-stable fields (Brown–Resnick and Smith, respectively) and states our conditions guaranteeing the validity of both methods in these cases. Section 3 is devoted to risk assessment applications. After a general discussion, we introduce the aforementioned specific dependence measure, we present the simulation study and we finally expose our real case study. Section 4 briefly summarizes our contribution and presents some perspectives. Throughout the paper, we shall use the following notations. Let ′ denote transposition. For , , for a matrix , such that and, for a positive definite symmetric matrix , . Furthermore, denotes the supremum when the latter is taken over a countable set. Finally, denotes equality in distribution. In the case of random fields, by distribution we mean the set of all finite-dimensional distributions. All proofs can be found in the appendix.
2 Stochastic derivative estimators
2.1 Max-stable random fields and associated expected performance
A random field on with non-degenerate marginals is called max-stable if there are continuous functions and on such that if are independent copies of then
[TABLE]
(pointwise maxima). A random field on with standard Fréchet margins (i.e., , , ) is max-stable if
[TABLE]
where are independent copies of ; such a field is said to be simple max-stable. It is well-known that there exist continuous functions , and on , called the location, scale and shape functions such that
[TABLE]
Any simple max-stable random field on can be written (e.g., de Haan,, 1984) as
[TABLE]
where the are the points of a Poisson point process on with intensity function and the , are independent copies of a non-negative random field such that, for all , . Conversely, any field of the form (2) is simple max-stable. We assume that the distribution of depends on a spatial dependence parameter , and we now denote by (as well as by ). The dependence of with respect to will be explicitly described below.
We consider sites , and let . It is known that the distribution function of is
[TABLE]
where is the so-called exponent measure function given by
[TABLE]
If the -th order partial derivatives of exist, then the density function of may be derived by the Faà di Bruno’s formula for multivariate functions. Let . We denote by the set of all partitions of and, for a partition , means that is one of the blocks of the partition . Moreover, for any set and , we let . The density function of is then given by
[TABLE]
where denotes the number of blocks of the partition , the cardinality of the set and the partial derivative with respect to .
The random performance (sometimes also called model output) is defined by , where is a function from to , and the expected performance is its expectation, i.e.,
[TABLE]
provided that . Below we give conditions ensuring the applicability of LRM and IPA to estimate at a specific , for built from the Brown–Resnick and the Smith fields, respectively. The quantity in (4) is pretty general and the results developed in this section can thus be applied for various purposes. One of them, particularly important in finance and insurance, is to estimate the sensitivities of risk and dependence measures; this is done in Section 3.
When fitted to data, a max-stable field is not simple but written as in (1) with location, scale and shape functions not uniformly equal to unity. Hence, denoting by such a field and letting , the expected performance should be written
[TABLE]
where is a function from to such that . However, we see by (1) that the representation (4) encompasses the case of (5) by letting depend on the location, scale and shape parameters of at the sites. As technical issues arise for the derivatives with respect to the spatial dependence parameters only, we do not consider differentiation with respect to the marginal parameters, (although this is of practical interest), and focus on (4) for convenience.
2.2 Likelihood ratio method
2.2.1 General methodology
We first introduce LRM to the context of expected performances based on max-stable fields. Let be a simple max-stable random vector, be a function from to such that , , and be a possible value of the parameter . LRM requires to have a density function that can be differentiated with respect to . In that case, the expected performance satisfies
[TABLE]
We assume that there exists some non-random neighbourhood of , , such that
- •
for each , ,
- •
for almost all , exists for all ,
- •
there is an integrable function such that for all and almost every .
Then, by the dominated convergence theorem, differentiation and integration can be interchanged, giving
[TABLE]
where, provided that, for all , exists for all ,
[TABLE]
Then LRM consists in computing by estimating the expectation in the right-hand side of (6) by Monte Carlo. Note that the above assumptions are quite usual in the theory of maximum likelihood estimation where is called the score function.
2.2.2 The case of the Brown–Resnick random field
We now focus on the case of the Brown–Resnick field. Let be a centred Gaussian random field on with stationary increments and with semivariogram , and let us define , , where denotes the variance. Then the field defined by (2) with that is referred to as the Brown–Resnick random field associated with the semivariogram (Brown and Resnick,, 1977; Kabluchko et al.,, 2009). It is stationary333Throughout the paper, stationarity refers to strict stationarity. and its distribution only depends on the variogram (Kabluchko et al.,, 2009, Theorem 2 and Proposition 11, respectively). The case where is a fractional Brownian motion leads to the commonly used semivariogram
[TABLE]
where and are the range and the smoothness parameters, respectively, and . Generally, is allowed in (7), but we exclude that value here as it makes the multivariate density unavailable under a closed-form expression for , and thus LRM inapplicable in this case.
For any semivariogram , let , , . Let also , , and be the matrix with -th entry , . We denote, for , by and the -dimensional Gaussian distribution and density functions with covariance matrix , respectively. Provided that the matrices , , are positive definite, then the exponent measure function of the Brown–Resnick random field is given by (e.g., Huser and Davison,, 2013)
[TABLE]
where
[TABLE]
Combined with (3), (8) shows the existence of a closed-form expression for the density .
In the following theorem, we provide convenient conditions ensuring the applicability of LRM for estimating .
Theorem 1**.**
For a non-random neighbourhood of , , let
[TABLE]
Assume that there exist a non-random neighbourhood and a constant such that
[TABLE]
where . Assume moreover that
[TABLE]
Then
[TABLE]
These technical but tractable conditions are pretty natural ones to guarantee the validity of the interchange between differentiation and integration.
Realizations of the term inside the expectation in (12) can be obtained since the Brown–Resnick field can be simulated at a finite number of sites and the score can be computed. The simulation methods for the Brown–Resnick field are either exact (e.g., Dombry et al.,, 2016; Oesting et al.,, 2018) or approximate (Schlather,, 2002, Theorem 4).
Nonetheless, the computation of the density (and thus of the score) may be challenging as the number of terms in the sum equals the -th Bell number, which grows super-exponentially in the dimension . To circumvent this issue, a solution is to approximate (3) by Monte Carlo simulation; i.e., for ,
[TABLE]
where the partitions form an ergodic sequence with stationary distribution given by
[TABLE]
Dombry et al., (2013) design a Gibbs sampler to generate approximate simulations without explicitly computing the constant factor in the denominator of (13). We refer to that paper for more details about the practical implementation. Theoretically, the ergodicity of the resulting Markov chain implies that the precision of the approximation is arbitrarily high as . In practice, the number of iterations of the Gibbs sampler, , is typically much smaller than the cardinality of , but the approximation is reasonably good even for moderate values of because generally only a few partitions are compatible with the data; see, e.g., Huser et al., (2019). For the Brown–Resnick field associated with the semivariogram (7), they conclude that the Gibbs sampler converges quickly and that about iterations are enough for the algorithm to converge for a large number of parameter configurations.
Using the Monte Carlo based idea, the approximation of the score function is then given by
[TABLE]
Analytical expressions of are known for the Brown–Resnick random field (e.g., Dombry et al.,, 2017, Section B.4).
2.3 Infinitesimal perturbation analysis
2.3.1 General methodology
We first introduce IPA to the context of expected performances based on max-stable fields. Let be a simple max-stable random vector, be a function from to such that , , and be a possible value of the parameter . IPA requires the random performance to be differentiable with respect to . However, the density does not need to be explicit, making IPA a potential solution when LRM is invalid. When both LRM and IPA can be applied, IPA may be preferable, although the optimal choice depends on the specific form of the random performance. For stochastic processes and random fields, IPA is also called pathwise derivative estimation because it uses differentiation of sample path functionals.
The essence of IPA is to assess the derivative of interest using
[TABLE]
provided that the derivative and the expectation can be interchanged. The derivative can then be computed through estimation of the right-hand side of (14) by Monte Carlo. A sufficient condition for the interchange to hold in the general case is given, e.g., in Asmussen and Glynn, (2010), Chapter VII, Proposition 2.3.
Proposition 1**.**
Assume that is an almost surely (a.s.) differentiable function at and that a.s. satisfies the Lipschitz condition
[TABLE]
for , in a non-random neighbourhood of , where . Then (14) holds.
If is differentiable on an open set , and satisfies for all in , then is Lipschitz continuous with Lipschitz constant at most over . Therefore, we immediately deduce that, if there exists a random variable satisfying and such that a.s.
[TABLE]
where is a non-random neighbourhood of , then (14) holds.
2.3.2 The case of the Smith random field
We illustrate IPA in the case where the random performance is based on the Smith random field (Smith,, 1990), for which a closed-form expression for the density is only known when , the number of stations in , is smaller or equal than (Genton et al.,, 2011). Letting be the points of a Poisson point process on with intensity function , the Smith random field with covariance matrix is defined by
[TABLE]
i.e., by taking in (2). It is stationary and simple max-stable, and corresponds to the Brown–Resnick field associated with the semivariogram , (e.g., Huser and Davison,, 2013). Such a semivariogram leads to the impossibility of characterizing the density for , as mentioned above. As was the case of for the Brown–Resnick random field, completely characterizes the dependence structure of the Smith field. For ease, the vector is replaced by the positive definite matrix in the following. The derivative of an expected performance written as in (4) with respect to at some positive definite matrix is defined by (e.g., Dwyer,, 1967)
[TABLE]
Note that results concerning differentiation with respect to a scalar or a vector also hold in the case of differentiation with respect to a matrix.
Assume now that is differentiable. The differentiability of the function in a neighbourhood of will be shown in the proof of Theorem 3 (Appendix A.2). Then the chain rule gives
[TABLE]
We shall prove (Theorem 4 in Appendix A.2) that there exists some non-random neighbourhood of , , such that, for any , there exists a random variable satisfying a.s.
[TABLE]
This technical outcome will allow us to derive our main result.
Theorem 2**.**
Assume that is a differentiable function and that there exists such that
[TABLE]
where is a non-random neighbourhood of . Then
[TABLE]
This non-trivial theorem provides a sufficient condition to use IPA to compute the derivatives of . This condition is much more tractable and easier to check than that in Proposition 1. The simplification stems from the fact that we take care in Theorems 3 and 4 of the intricate term of (17), , which involves the sample path properties with respect to differentiation of the Smith random field. Theorems 3 and 4 are delicate to establish precisely due to the inherent structure of max-stable fields.
In practice we have to simulate realizations of the random matrix inside the expectation in (19), which can be done by simulating the Smith field and using (17). The expression of appearing in (17) is given in (29) (Appendix A.2). As the Brown–Resnick field, the Smith field can be simulated exactly (e.g., Oesting et al.,, 2018; Dombry et al.,, 2016) or approximately (Schlather,, 2002, Theorem 4); the latter approach is very accurate.
3 Application to risk assessment
We now focus on one framework (among several others) where the results of Section 2 are useful, which is the context of risk and dependence measures. After detailing the link with (4), we consider a dependence measure which is particularly suited to insurance of damage triggered by extreme wind speeds. We show that the conditions of Theorems 1 and 2 hold in that case and confirm through a simulation study that both LRM and IPA perform very well. Finally, we consider concrete data and show that the sensitivity of our dependence measure can be very high, highlighting the practical importance of studying the sensitivity of functions of max-stable fields in the context of risk assessment.
3.1 Risk measures based on max-stable fields
Here we show that the quantity defined in (4) encompasses many risk and dependence measures, and our focus is mainly on actuarial applications. A univariate risk measure is a mapping from a set of random variables to the real numbers. A dependence measure summarises the strength of dependence between several elements of such a set of random variables. In finance, these random variables often represent portfolio returns, and in an insurance context, they might be the claims associated with insurance policies. When claims are triggered by environmental events, a possible model for the insured cost field is (Koch,, 2017, Section 2.3)
[TABLE]
where is the (deterministic) insured exposure (i.e., insured value) field, the damage function at site and the random field of the environmental variable generating risk. The application of the damage function to yields the insured cost ratio (i.e., the insured cost divided by the insured value) at site , which, multiplied by the insured exposure, gives the corresponding insured cost.
Among the risk measures that can be written as in (4), one can find many sophisticated examples, e.g., in insurance/reinsurance pricing or regulation. For , let denote the claim of an insurance company at and assume that can be written as a function of the max-stable field (e.g., as in the right-hand side of (20)). Premium loadings that are proportional to specific moments of the sum of the at two or more sites constitute excellent examples in insurance pricing. In reinsurance, the premium is sometimes based on order statistics of the claims, as in the case of the “excédent du coût moyen relatif” (ECOMOR) or largest claims reinsurance (LCR) treaties. Let us consider, e.g., sites and assume that each of those is associated with an insurance policy whose corresponding claim is as above. We consider the ordered values of those claims, . For instance, the risk measure would be involved in the pricing of an ECOMOR reinsurance treaty having the third largest claim as priority. These quantities can be written under the form (4) but behave in a non-linear way and do not have any analytical expression. A valuable example of dependence measure, which will be considered until the end, is presented in the next section.
3.1.1 A specific dependence measure for wind speed
We present in this section a dependence measure for the costs due to high wind speeds that can be written as in (4). We model the cost by (20) in the case , where the field of wind speed extremes, , is assumed to be max-stable. Moreover, for any site , we choose as damage function , , where and , which is utterly appropriate in the case of wind. Indeed, as the force exerted by the wind and the corresponding rate of work are proportional to the second and third powers of wind speed, respectively, the total cost for a specific structure is expected to increase as the square or the cube of the maximum wind speed. For studies supporting the use of the square, see, e.g., Simiu and Scanlan, (1996, Equations (4.7.1), (8.1.1) and (8.1.8)), and for the cube, see Lamb and Frydendahl, (1991, Chapter 2, p.7), Emanuel, (2005) and Kantha, (2008). However, in the case of insured costs, several authors have recently found power-laws with much higher exponents; e.g., Prahl et al., (2012) obtained exponents spanning from to for insured losses on residential buildings in Germany. Indeed, the presence of a deductible in the insurance contract increases the exponent from or to a larger value depending on the deductible (e.g., Prahl et al.,, 2015; Koch,, 2019). The level corresponds to that value of wind speed which triggers an insured cost ratio equal to unity, and is thus assumed to be larger than the wind speeds observed in practice. For a more detailed review of wind damage functions, see, e.g., Koch, (2019).
We consider two sites (), . Recall that the link between and the associated simple max-stable field is given by (1), and let and , , such that . The shape parameter for wind speed maxima is often slightly negative, implying that the event occurs with probably not exactly [math] (although extremely close owing to the values of the location and scale parameters). Anyway, this is not a problem as . Moreover, we take to be the Brown–Resnick or the Smith random field and we consider as dependence measure the correlation between the costs due to extreme winds at the two sites, i.e.,
[TABLE]
where must be replaced by when the Smith field is considered. The condition , , ensures the existence of the correlation in (21). Correlation is used a lot by practitioners in the finance/insurance industry and the measure is of practical interest for any insurance/reinsurance company handling the risk of damage caused by extreme wind speeds; among others, it provides insight about potential spatial diversification. For details, see Koch, (2019) where (21) is thoroughly studied.
We now explain why the measure in (21) is of the form (4) with . It is well-known that, if is a random variable following the standard Fréchet distribution, then for any , where denotes the gamma function. We now also assume that for . Consequently, using (1) and the binomial theorem, we obtain, for ,
[TABLE]
where
[TABLE]
Moreover, Corollary 1 in Koch, (2019) gives
[TABLE]
where
[TABLE]
with
[TABLE]
Therefore, (22) and (23) yield
[TABLE]
with
[TABLE]
where since , .
On top of being useful for actuarial applications, the measure (21) has a closed-form derivative with respect to the dependence parameters of the max-stable field, allowing us to compare the values of obtained using LRM or IPA with their true values. The analytical formulas of these derivatives are given in Appendix B.1.
3.1.2 Validity of the assumptions
We now show that the assumptions of Theorems 1 and 2 are valid in the case of (21), for being the Brown–Resnick and the Smith random field, respectively. Consequently, LRM and IPA can be used to assess the derivative of (21) with respect to and , respectively.
Proposition 2**.**
Let be defined as in (25).
i) Assume that is the Brown–Resnick random field with semivariogram given by (7). There exist a non-random neighbourhood of , , and a constant such that (10) and (11) hold true.
ii) Assume that is the Smith random field. Then there exist and a non-random neighbourhood of , , such that (18) holds true.
3.2 Simulation study
We numerically assess the accuracy of the LRM and IPA for computing the sensitivities of the dependence measure in (21), where is the Brown–Resnick field associated with the semi-variogram (7), and the Smith field, respectively. The number of simulations used to approach the expectations in (12) and (19) is denoted by . We display the boxplots of the relative errors of estimates in different configurations. We take with , , and we look at different combinations of sites . We recall that the relative errors can be calculated since the sensitivities of can be obtained analytically with an integral form. The integrals in (53) and (54) were computed using adaptive quadrature with a relative tolerance of to allow an accurate approximation.
3.2.1 LRM for the Brown–Resnick field
We examine three combinations of sites: we set and . These have been chosen in order to cover a wide range of sensitivities and relative sensitivities. Moreover, we take , , and , which are the estimates obtained on the data used in the application below (Section 3.3). We simulated the Brown–Resnick field using the rmaxstab function of the SpatialExtremes R package by Ribatet et al., (2018).
Table 1 shows that, for and , the theoretical values of , its sensitivity and its relative sensitivity do not evolve much when increasing from to ; the absolute values of the relative sensitivities slightly decrease whereas weakly increases. The absolute values of the sensitivities and of their relative counterparts are the highest for ; in that case, the relative sensitivities are very high (about for and more than for ), probably owing to the fairly low value of . Such large values typically explain the practical importance of properly assessing sensitivities, e.g., in an insurance context.
As , the density of the Brown–Resnick field is explicitly known and thus the score function has a closed-form expression (available upon request) with a small number of terms; it is therefore needless to apply the MCMC methodology through Gibbs sampler described above.
It is important to remark that and are proportional (Appendix B.2), which entails that the LRM estimates of and are also proportional. Moreover, by (54) and (55), the true values of those derivatives are proportional with the same factor, implying that the relative errors of the estimates of and are equal. Consequently, we only display the results for .
Figure 1 shows that the LRM estimator is unbiased, as expected. Moreover its variability substantially decreases when increasing from to and becomes very small (the relative error of most estimates is less than ). This variability is the lowest for , perhaps due to the fact that this configuration corresponds by far to the largest sensitivities and relative sensitivities. To some extent, we may expect the accuracy of the estimation to be an increasing function of the absolute value of the relative sensitivity. Nevertheless, this seems to be more complex: the variability is lower for than for and, for all site combinations, for than for . Overall, the LRM estimator is very satisfying in all configurations, whether the sensitivity (or relative sensitivity) is low or large.
3.2.2 IPA for the Smith field
We consider two combinations of sites: we set and . Additionally, we take
[TABLE]
, and , which are the estimates obtained in Section 3.3. Unlike in Section 3.2.1, we do not consider the case as is approximately equal to [math] in that configuration. Note that we do not display the results for as they are exactly the same as those for , consistently with the theory. The analytical computation of the terms , , which is necessary to implement IPA (see (17)), requires the coordinates of the centers of the “storms” (see Smith,, 1990, for the interpretation of the Smith field in terms of storms) realizing the maxima at the sites (see (29) in Section A.2). To the best of our knowledge, these quantities cannot be obtained from the simulation algorithms available on the Web (e.g., in R packages like SpatialExtremes by Ribatet et al., (2018) or in the code by Dombry et al., (2016) available on the Biometrika website). To overcome this impediment and for other technical reasons, we programmed the simulation algorithm of the Smith random field ourselves by adapting the approach of Schlather, (2002, Theorem 4). Regarding the quantity appearing in that approach, we took the value in order to ensure an accurate simulation. The corresponding code will be available.
Table 2 shows that, when increasing from to , the sensitivities and relative sensitivities slightly decrease whereas weakly increases. The relative sensitivities are the highest for ; they take very high values for and (about for and more than for ), probably due to the low value of .
Figures 2–4 indicate that the IPA estimator is unbiased (at least for large enough) and, as expected, the variability decreases when increasing from to . In most cases, it reaches a low level (with a relative error of most estimates less than ). The variability is slightly lower in the case of , which can be explained by the fact that the relative sensitivity is much larger in that case. It is however the converse for , coefficient for which the increase of relative sensitivity compared to the case is lower than for the other coefficients. Moreover, the variability is systematically higher for than for the other coefficients, and, whatever the coefficient and the combination of sites, it tends to be lower for than for . We intuitively expect the method to perform the best for high relative sensitivities, but we see that it seems more complex. Overall, IPA performs very well on this example, whether the sensitivity (or relative sensitivity) is low or high.
3.3 Application
This section is devoted to the computation of sensitivities of the measure (see (21)) in a real case study that is valuable for the insurance/reinsurance industry. We consider concrete wind speed data on a region where insurance/reinsurance plays a significant role and for which power-law type damage functions for losses stemming from extreme wind speeds are clearly documented in the literature.
3.3.1 Data
We consider publicly available data from the European Centre for Medium Range Weather Forecasting (ECMWF), more precisely a subset of the ERA5 (ECMWF Reanalysis 5th Generation) data set. More specifically the data we study consist in hourly m wind gust time series from 1 January 1979 at 07:00 to 1 June 2019 at 00:00. The region we consider is a rectangle from to longitude and to latitude, and the resolution is longitude and latitude; see Figure 5. Thus, the rectangle contains 176 grid points and is basically centred over the Ruhr region in Germany. This area exhibits a high total residential insured value per unit of surface, and, according to Prahl et al., (2012, Figure 2), a damage function of power-law type with an exponent around 8 seems appropriate for insured losses (due to high wind speeds) on residential buildings in that region. At each grid point, we compute the seasonal (from October to March) maxima, leading to the dataset we use to fit different max-stable models for . For the first season, the maximum is computed over January–March. Considering the period October–March allows us to remove seasonal non-stationarity in the wind speed time series and to focus mainly on winter storms (and not on highly localized extreme winds occurring during intense summer thunderstorms).
3.3.2 Results
The max-stable models we consider are both the Brown–Resnick random field with semi-variogram specified in (7) and the Smith field. Regarding the location, scale and shape parameters, it is reasonable to consider them as constant over the region (shown in a more detailed analysis of the same data that will be soon available in a subsequent paper). Modelling these parameters using trend surfaces (instead of considering specific parameters at each grid point) is common as it reduces the estimation time and allows prediction at sites where no observations are available.
Both models are fitted using maximum pairwise likelihood estimation (e.g., Padoan et al.,, 2010; Davison et al.,, 2012) implemented in the fitmaxstab function of SpatialExtremes (Ribatet et al.,, 2018). The marginal and dependence parameters are estimated in a single step. Then we perform model selection by minimization of the composite likelihood information criterion (CLIC); see Varin and Vidoni, (2005).
Table 3 shows that, according to the CLIC, the Brown–Resnick random field is more compatible with our data than the Smith random field is. We also fitted the Schlather random field (Schlather,, 2002) with various correlation functions (Whittle-Matérn, powered exponential and Cauchy) and the Brown–Resnick random field is more appropriate according to the CLIC. This is in agreement, e.g., with the results obtained by Davison et al., (2012) in the case of rainfall.
Figure 6 shows that the theoretical extremal coefficient function (e.g., Schlather and Tawn,, 2003) of the fitted Brown–Resnick model agrees reasonably well with the empirical extremal coefficients, though it is slightly above their binned estimates. This low underestimation of the spatial dependence may come from the choice of very parsimonious trend surfaces for the location, scale and shape parameters. The comparison done in Figure 6 constitutes a classical graphical goodness-of-fit diagnostic for max-stable models and it shows here that the proposed model fits the data sufficiently well for the purpose of this application.
The final dependence measure is
[TABLE]
where is a general Brown–Resnick field having as marginal parameters those in Table 3, and we aim at providing its derivative at . As the Brown–Resnick random field appears as a suitable model for wind speed maxima and the power-law with as exponent is an appropriate damage function, this dependence measure is well suited in practice for (re)insurance companies on the region considered. The values of the sensitivities obtained by LRM and their relative counterparts are given in Table 4 for the same combinations of sites as in Section 3.2.1. The combinations and correspond to pairs of sites within our region of interest if we put the origin on the left part and in the lower-left corner of the rectangle, respectively; by stationarity of the field, the origin can be chosen arbitrarily. The site is far to the East of our region even we take as origin the lower-left corner of the rectangle, but we chose it to emphasize that sensitivities can be very high for sites that are highly distant. The relative sensitivities are slightly lower in absolute value than in the cases and (see Table 1). However they can be quite substantial (e.g., for in the case and in the case ) or very high (for both and in the case ), which highlights that looking at sensitivities is strongly recommended in concrete risk assessment studies.
4 Discussion
Max-stable random fields are particularly suitable to model extreme spatial (e.g., environmental) events and, if applicable, LRM as well as IPA are powerful techniques to estimate the derivatives of an expected performance with respect to the model parameters. In this paper, we introduce these two methods to extreme-value theory by applying them to expected performances based on the Brown–Resnick and the Smith max-stable random fields. In a first part, we give convenient and tractable conditions on these fields that ensure the validity of LRM and IPA. Obtaining such tractable conditions is non-trivial due to the complex structure of max-stable fields. Then, we focus on one of the several frameworks where our theoretical results can be valuable, more precisely the context of risk assessment. We apply LRM and IPA to a specific example of dependence measure which is suited to assess the dependence between damage due to extreme wind speeds at two stations, and is thus worthwhile for actuarial practice. We show through a simulation study that both methods perform very well in various configurations. We finally present a concrete application involving reanalysis wind speed data; it emphasizes the importance of considering sensitivities in a risk assessment context when using max-stable fields.
It would be interesting to see how the LRM and IPA estimators behave in other configurations. In some situations (e.g., where the derivatives are very close to [math]), appropriate improvements of the classical Monte Carlo method (such as importance sampling) might be considered to reduce the variability of the estimators. Future appealing work would also consist in extending our conditions to other max-stable random fields and possibly even proposing other methods for max-stable fields (such as, e.g., the one by Peng et al.,, 2018) for which LRM and IPA are not applicable.
Acknowledgements
Both authors gratefully acknowledge Anthony C. Davison and anonymous referees for insightful comments. Erwan Koch would like to thank the Swiss National Science Foundation (project number ) for financial support.
Appendix A Proofs of the main results
A.1 Proof of Theorem 1
Proof.
From Section 4.4 in Dombry et al., (2017), we know that has an Hüsler-Reiss distribution with a strictly conditionally negative definite matrix given by . By Lemma B.4. in Dombry et al., (2017), we deduce that, for any neighbourhood of and any , there exists a constant such that, for all , with and ,
[TABLE]
where
[TABLE]
Equation (14) in Dombry et al., (2017) then gives, for all ,
[TABLE]
Now let us consider the ratio
[TABLE]
By (9), we have
[TABLE]
where is given in (9). From Equation (28) in Asadi et al., (2015), we know that for all and
[TABLE]
with
[TABLE]
Let . It follows that, for ,
[TABLE]
with
[TABLE]
Let
[TABLE]
We can note that are positive and uniformly bounded for any partition , any , and any . Then
[TABLE]
The numerator and the denominator are polynomials of order in . If , then and
[TABLE]
If , then and
[TABLE]
The ratio of both polynomials is bounded over any compact set included in and has limits as tends to [math] or . Moreover these limits are also bounded for any . We can deduce that there exists a constant such that
[TABLE]
It follows from (27) that
[TABLE]
Let us choose
[TABLE]
to get
[TABLE]
for all and almost every . The result follows by the dominated convergence theorem.
∎
A.2 Proof of Theorem 2
Recall that are the points of a Poisson point process on with intensity function . For , let be the function from to defined by .
Let us begin by noting that for each , the supremum is a.s. attained by a unique function at .
Proposition 3**.**
Let , and define for ,
[TABLE]
Then a.s., we have for all , where we recall that stands for the cardinality of a set.
Proof.
Note that for every , is a Poisson random measure on , which has no atoms. Therefore, the maxima will have no ties with probability one.
∎
Remark 1**.**
The proof is also a direct consequence of Proposition 2.5 in Dombry and Eyi-Minko, (2013).
By Proposition 3, we can define, for , the a.s. unique indexes satisfying
[TABLE]
We now fix and consider one realization of the point process , denoted by . For ease of exposition, we do not mention further below. Let us consider the set
[TABLE]
i.e., the set of sites for which the maximum is attained by at least two distinct functions and .
Remark 2**.**
If (identity matrix with dimension ), the set is the set of boundaries of the cells of a Poisson Laguerre Tessellation (e.g., Dombry and Kabluchko,, 2018).
A key result is that has a null Lebesgue measure.
Proposition 4**.**
The set has zero Lebesgue measure with probability one.
Proof.
A point is characterized as follows:
[TABLE]
Let such that is in a neighbourhood of but still belongs to , i.e., such that
[TABLE]
Using (28), we obtain that the previous equality is equivalent to
[TABLE]
Therefore, in a neighbourhood of implies that is orthogonal to the vector for the inner product induced by . Thus, only one direction is suitable for . is an union of segments in . Moreover there is no ball around belonging to , which implies that the interior of is empty.
Let
[TABLE]
It is easily seen that
[TABLE]
and hence that , where denotes the Lebesgue measure in . Therefore,
[TABLE]
∎
Let . We are now interested in the existence of a neighbourhood of over which is constant and thus becomes differentiable with respect to .
Theorem 3**.**
Let . There exists a neighbourhood of , , such that is constant over this neighbourhood. Moreover, the function is differentiable over and
[TABLE]
Proof.
It follows from Proposition 3 that
[TABLE]
and, for , we have, for all ,
[TABLE]
or equivalently
[TABLE]
i.e.,
[TABLE]
Let and define
[TABLE]
such that . Moreover it is easy to see, using the definition of and the form of the intensity function of the point process , that is finite, is finite, but is infinite.
Let be a positive-definite matrix of size . We have, for any ,
[TABLE]
In addition,
[TABLE]
Since and are equivalent norms, there exists a positive constant such that
[TABLE]
It follows that, for any , there exists a function such that
[TABLE]
and
[TABLE]
Using (31), we obtain
[TABLE]
Using (32), we see that there exists such that, for , we have, for all such that , that
[TABLE]
and, for all ,
[TABLE]
Hence, using (33), we obtain, for all ,
[TABLE]
Now, using (30), the continuity of and the fact that and are finite, there exists such that, for , we have, for all , that
[TABLE]
Combining (34) and (35), we obtain that, for all positive definite matrix satisfying , we have, for all such that ,
[TABLE]
Accordingly, . Hence we can choose the neighbourhood of
[TABLE]
to define the derivative of at .
We now compute the corresponding derivative. Using Proposition 3, we have
[TABLE]
and hence
[TABLE]
Formula (11.7) in Dwyer, (1967) gives, for any symmetric matrix , that
[TABLE]
Moreover, since is constant over , Equation (11.8) in Dwyer, (1967) provides, for any symmetric matrix ,
[TABLE]
Combining (36) and (37), we finally obtain
[TABLE]
∎
We now prove that the derivative of can be uniformly bounded by an integrable random variable over a neighbourhood of .
Theorem 4**.**
There exists a non-random neighbourhood of , , such that, for any , there exists a random variable satisfying a.s.
[TABLE]
and .
Proof.
First recall that a.s.
[TABLE]
which gives a.s.
[TABLE]
Consequently, using the well-known fact that, for all and , , we obtain
[TABLE]
Let , such that . Note that is a non-negative symmetric matrix of rank and thus it only has one positive eigenvalue given by
[TABLE]
(since we have that ). It follows that
[TABLE]
For a positive definite symmetric matrix , we denote by and respectively the maximum and the minimum of its positive eigenvalues. We have that
[TABLE]
and
[TABLE]
which yield
[TABLE]
Combining (38), (39) and (40), we have, for any , that a.s.
[TABLE]
where
[TABLE]
In order to control , it is sufficient to control . As the center of the “storm” realizing the maximum at point , is characterized by
[TABLE]
and therefore
[TABLE]
Additionally, we have, for all ,
[TABLE]
which yields, by equivalence of the norms and , that
[TABLE]
for some positive constant . Hence let us now choose such that
[TABLE]
Now, for two real-valued random variables and , implies that or , giving Thus, using (43) and (44), we obtain
[TABLE]
We first deal with the first term of the right-hand side of (46). Since the are the points of a Poisson process on with intensity function , we have
[TABLE]
where
[TABLE]
Making the change of variable and , we obtain
[TABLE]
The latter integral is finite and it follows from (47) that
[TABLE]
Let us now deal with the second term of the right-hand side of (46). Observe that
[TABLE]
The mapping applied to the points of the Poisson point process yields a new Poisson point process with intensity function . Furthermore, such a Poisson point process on is homogeneous and can be represented as the sum of independent standard exponential random variables. Thus, we can write . Moreover, , implying
[TABLE]
which gives for large
[TABLE]
Now, for any , we have
[TABLE]
We carry out the change of variable , which gives and hence . Accordingly, we obtain
[TABLE]
and therefore, using (46), (49) and (50),
[TABLE]
Finally, let us recall that
[TABLE]
Consequently, using (45), we finally deduce that any neighbourhood of , , satisfying
[TABLE]
leads to
[TABLE]
∎
We finally provide the proof of Theorem 2.
Proof.
Let us choose as in the proof of Theorem 4. It follows from (17) that
[TABLE]
which gives
[TABLE]
Hence we choose
[TABLE]
We have
[TABLE]
Let such that . By Hölder inequality, we have
[TABLE]
By Theorem 4 and the assumptions, the result follows. ∎
A.3 Proof of Proposition 2
A.3.1 LRM for the Brown–Resnick random field
Let us first recall that and that is given by (25) with , . Let us now note that . Therefore there exists a positive constant such that
[TABLE]
where, for , is a set of constants whose the maximal value is . Moreover,
[TABLE]
Since has a standard Fréchet distribution, the expectation is finite, if and . But it is possible to choose and to satisfy such constraints since and is defined by (9).
Finally note that the derivatives and are easily obtained and therefore it is straightforward to conclude that the condition in (11) also holds with such a choice of neighbourhood .
A.3.2 IPA for the Smith random field
From (25), there exist positive constants , , such that
[TABLE]
Therefore, we have for ,
[TABLE]
Next result will allow us to prove that Condition (18) holds.
Proposition 5**.**
There exists a non-random neighbourhood of , , such that, for any and ,
[TABLE]
Proof.
We have
[TABLE]
and thus
[TABLE]
It is well-know that has a standard Fréchet distribution and therefore since . We deduce that it suffices to choose such that and . This is possible since any symmetric invertible matrix admits a neighbourhood of matrices such that, for all , , where . ∎
By Proposition 5, and the Hölder inequality, there exists a non-random neighbourhood of , , such that for some satisfying and , we have
[TABLE]
Appendix B Analytical formulas
B.1 Expressions for the derivatives of
when is given by (25)
For , we introduce the function defined by
[TABLE]
where, for ,
[TABLE]
with and denoting the standard univariate Gaussian distribution and density functions, respectively.
Let be our sites of interest and be a Brown–Resnick random field with dependence parameters and gathered in , i.e., . Let be the parameter at which we wish to compute the derivative of . As before, let be the location, scale and shape parameters of , , and such that . Theorem 2 in Koch, (2019) yields
[TABLE]
where
[TABLE]
This yields
[TABLE]
We obtain from (52) that
[TABLE]
where . The term
[TABLE]
has a closed (although complicated) expression (available upon request). The corresponding integral can be computed using numerical methods such as adaptive quadrature. Finally, we obtain the true values of by plugging the appropiate values of (55) in (54).
Now, if is the Smith random field with covariance matrix and is a symmetric positive-definite matrix at which we want to compute the derivative of , exactly the same formulas and procedure can be applied upon replacement of with , with , and with .
B.2 Proportionality of the components of the score function for the
Brown–Resnick random field
The bivariate density of the (simple) Brown–Resnick random field (at and ) satisfies, for ,
[TABLE]
where
[TABLE]
This easily yields
[TABLE]
and it follows from (6) that the LRM estimates of and are proportional by the factor given in the right-hand side of (57).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Asadi et al., (2015) Asadi, P., Davison, A. C., and Engelke, S. (2015). Extremes on river networks. The Annals of Applied Statistics , 9(4):2023–2050. https://doi.org/10.1214/15-AOAS 863 . · doi ↗
- 2Asmussen and Glynn, (2010) Asmussen, S. and Glynn, P. W. (2010). Stochastic Simulation: Algorithms and Analysis . Springer-Verlag New York.
- 3Broadie and Glasserman, (1996) Broadie, M. and Glasserman, P. (1996). Estimating security price derivatives using simulation. Management Science , 42(2):269–285.
- 4Brown and Resnick, (1977) Brown, B. M. and Resnick, S. I. (1977). Extreme values of independent stochastic processes. Journal of Applied Probability , 14(4):732–739. https://doi.org/10.2307/3213346 . · doi ↗
- 5Chen and Fu, (2001) Chen, J. and Fu, M. C. (2001). Efficient sensitivity analysis of mortgage backed securities. In 12th Annual Derivatives Securities Conference, New York .
- 6Coles, (2001) Coles, S. (2001). An Introduction to Statistical Modeling of Extreme Values . Springer London.
- 7Davison et al., (2012) Davison, A. C., Padoan, S. A., and Ribatet, M. (2012). Statistical modeling of spatial extremes. Statistical Science , 27(2):161–186. https://doi.org/10.1214/11-STS 376 . · doi ↗
- 8de Haan, (1984) de Haan, L. (1984). A spectral representation for max-stable processes. The Annals of Probability , 12(4):1194–1204. https://doi.org/10.1214/aop/1176993148 . · doi ↗
