Extreme value statistics for censored data with heavy tails under   competing risks

Julien Worms (LM-Versailles); Rym Worms (LAMA)

arXiv:1701.05458·math.ST·January 20, 2017

Extreme value statistics for censored data with heavy tails under competing risks

Julien Worms (LM-Versailles), Rym Worms (LAMA)

PDF

Open Access

TL;DR

This paper introduces a novel estimator for the extreme value index in censored data with competing risks, demonstrating its asymptotic normality and finite-sample performance through simulations.

Contribution

It proposes the first estimator based on an Aalen-Johansen integral for extreme value index in this context, addressing heavy tails and censoring.

Findings

01

Estimator is asymptotically normal.

02

Performs well in finite-sample simulations.

03

Enables estimation of extreme quantiles in competing risks.

Abstract

This paper addresses the problem of estimating, in the presence of random censoring as well as competing risks, the extreme value index of the (sub)-distribution function associated to one particular cause, in the heavy-tail case. Asymptotic normality of the proposed estimator (which has the form of an Aalen-Johansen integral, and is the first estimator proposed in this context) is established. A small simulation study exhibits its performances for finite samples. Estimation of extreme quantiles of the cumulative incidence function is also addressed.

Equations377

Z_{i}=\min(X_{i},C_{i}),\hskip 8.5359pt\delta_{i}=\mathbb{I}_{X_{i}\leq C_{i}},\hskip 8.5359pt\xi_{i}=\left\{\begin{array}[]{ll}0&\mbox{if $\delta_{i}=0$,}\\ {\mathscr{C}}_{i}&\mbox{if $\delta_{i}=1$.}\end{array}\right.

Z_{i}=\min(X_{i},C_{i}),\hskip 8.5359pt\delta_{i}=\mathbb{I}_{X_{i}\leq C_{i}},\hskip 8.5359pt\xi_{i}=\left\{\begin{array}[]{ll}0&\mbox{if $\delta_{i}=0$,}\\ {\mathscr{C}}_{i}&\mbox{if $\delta_{i}=1$.}\end{array}\right.

X_{i} = min (X_{i, 1}, \dots, X_{i, K}),

X_{i} = min (X_{i, 1}, \dots, X_{i, K}),

\widebar F^{(k)} (t) = P [X > t, C = k],

\widebar F^{(k)} (t) = P [X > t, C = k],

F^{(k)} (t) = P [X \leq t, C = k] .

F^{(k)} (t) = P [X \leq t, C = k] .

F_{n}^{(k)} (t) = Z_{i} \leq t \sum \frac{δ _{i} I _{C_{i} = k}}{n \widebar G _{n} ( Z _{i}^{-} )},

F_{n}^{(k)} (t) = Z_{i} \leq t \sum \frac{δ _{i} I _{C_{i} = k}}{n \widebar G _{n} ( Z _{i}^{-} )},

\widebar F_{n}^{(k)} (t) = Z_{i} > t \sum \frac{δ _{i} I _{C_{i} = k}}{n \widebar G _{n} ( Z _{i}^{-} )} .

\widebar F_{n}^{(k)} (t) = Z_{i} > t \sum \frac{δ _{i} I _{C_{i} = k}}{n \widebar G _{n} ( Z _{i}^{-} )} .

\forall1 \leq k \leq K, \forall x > 0, t \to + \infty lim \widebar F^{(k)} (t x) / \widebar F^{(k)} (t) = x^{- 1/ γ_{k}} \makebox [42.67912 pt] [c] an d t \to + \infty lim \widebar G (t x) / \widebar G (t) = x^{- 1/ γ_{C}} .

\forall1 \leq k \leq K, \forall x > 0, t \to + \infty lim \widebar F^{(k)} (t x) / \widebar F^{(k)} (t) = x^{- 1/ γ_{k}} \makebox [42.67912 pt] [c] an d t \to + \infty lim \widebar G (t x) / \widebar G (t) = x^{- 1/ γ_{C}} .

t \to + \infty lim \frac{1}{\widebar F ^{(k)} ( t )} \int_{t}^{+ \infty} lo g (u / t) d F^{(k)} (t) = γ_{k} .

t \to + \infty lim \frac{1}{\widebar F ^{(k)} ( t )} \int_{t}^{+ \infty} lo g (u / t) d F^{(k)} (t) = γ_{k} .

γ_{n, k} = \int ϕ_{n} (u) d F_{n}^{(k)} (u) \makebox [51.21504 pt] [c] w h er e ϕ_{n} (u) = \frac{1}{\widebar F _{n}^{(k)} ( t _{n} )} lo g (\frac{u}{t _{n}}) I_{u > t_{n}},

γ_{n, k} = \int ϕ_{n} (u) d F_{n}^{(k)} (u) \makebox [51.21504 pt] [c] w h er e ϕ_{n} (u) = \frac{1}{\widebar F _{n}^{(k)} ( t _{n} )} lo g (\frac{u}{t _{n}}) I_{u > t_{n}},

γ_{n, k} = \frac{1}{n \widebar F _{n}^{(k)} ( t _{n} )} i = 1 \sum n \frac{lo g ( Z _{i} / t _{n} )}{\widebar G _{n} ( Z _{i}^{-} )} I_{ξ_{i} = k} I_{Z_{i} > t_{n}} = \frac{1}{n \widebar F _{n}^{(k)} ( t _{n} )} Z_{(i)} > t_{n} \sum \frac{lo g ( Z _{(i)} / t _{n} )}{\widebar G _{n} ( Z _{(i - 1)} )} δ_{(i)} I_{C_{(i)} = k},

γ_{n, k} = \frac{1}{n \widebar F _{n}^{(k)} ( t _{n} )} i = 1 \sum n \frac{lo g ( Z _{i} / t _{n} )}{\widebar G _{n} ( Z _{i}^{-} )} I_{ξ_{i} = k} I_{Z_{i} > t_{n}} = \frac{1}{n \widebar F _{n}^{(k)} ( t _{n} )} Z_{(i)} > t_{n} \sum \frac{lo g ( Z _{(i)} / t _{n} )}{\widebar G _{n} ( Z _{(i - 1)} )} δ_{(i)} I_{C_{(i)} = k},

v_{n} ⟶ n \to \infty + \infty \makebox [62.59596 pt] [c] s u c h t ha t n^{- η_{0}} v_{n} ⟶ n \to \infty + \infty \mbox f or so m e η_{0} > 0.

v_{n} ⟶ n \to \infty + \infty \makebox [62.59596 pt] [c] s u c h t ha t n^{- η_{0}} v_{n} ⟶ n \to \infty + \infty \mbox f or so m e η_{0} > 0.

\forall x > 0, \frac{l _{k} ( t x )}{l _{k} ( t )} - 1 \sim t \to \infty h_{ρ_{k}} (x) g (t) (\forall x > 1),

\forall x > 0, \frac{l _{k} ( t x )}{l _{k} ( t )} - 1 \sim t \to \infty h_{ρ_{k}} (x) g (t) (\forall x > 1),

\sqrt{v_{n}}(\widehat{\gamma}_{n,k}-\gamma_{k})\ \stackrel{{\scriptstyle d}}{{\longrightarrow}}\ {\cal N}(\lambda m,\sigma^{2})\hskip 14.22636pt\mbox{as $n\rightarrow\infty$}

\sqrt{v_{n}}(\widehat{\gamma}_{n,k}-\gamma_{k})\ \stackrel{{\scriptstyle d}}{{\longrightarrow}}\ {\cal N}(\lambda m,\sigma^{2})\hskip 14.22636pt\mbox{as $n\rightarrow\infty$}

m=\left\{\begin{array}[]{ll}\frac{\gamma_{k}^{2}}{1-\gamma_{k}\rho_{k}}&\mbox{ if }\rho_{k}<0,\vspace{0.2cm}\\ \gamma_{k}^{2}&\mbox{ if }\rho_{k}=0,\end{array}\right.\mbox{ and }\ \sigma^{2}=\frac{\gamma_{k}^{2}}{(1-r)^{3}}\left((1+r^{2})-2cr\right),

m=\left\{\begin{array}[]{ll}\frac{\gamma_{k}^{2}}{1-\gamma_{k}\rho_{k}}&\mbox{ if }\rho_{k}<0,\vspace{0.2cm}\\ \gamma_{k}^{2}&\mbox{ if }\rho_{k}=0,\end{array}\right.\mbox{ and }\ \sigma^{2}=\frac{\gamma_{k}^{2}}{(1-r)^{3}}\left((1+r^{2})-2cr\right),

\widehat{\gamma}_{n,k}\ \stackrel{{\scriptstyle\mathbb{P}}}{{\longrightarrow}}\ \gamma_{k}\hskip 14.22636pt\mbox{as $n\rightarrow\infty$}.

\widehat{\gamma}_{n,k}\ \stackrel{{\scriptstyle\mathbb{P}}}{{\longrightarrow}}\ \gamma_{k}\hskip 14.22636pt\mbox{as $n\rightarrow\infty$}.

\overset{x}{^}_{p_{n}, t_{n}}^{(k)} = t_{n} (\frac{\widebar F _{n}^{(k)} ( t _{n} )}{p _{n}})^{γ_{n, k}},

\overset{x}{^}_{p_{n}, t_{n}}^{(k)} = t_{n} (\frac{\widebar F _{n}^{(k)} ( t _{n} )}{p _{n}})^{γ_{n, k}},

v_{n} / lo g (d_{n}) ⟶ n \to \infty \infty,

v_{n} / lo g (d_{n}) ⟶ n \to \infty \infty,

\frac{\sqrt{v_{n}}}{\log(d_{n})}\left(\frac{\hat{x}^{(k)}_{p_{n},t_{n}}}{{x}^{(k)}_{p_{n}}}-1\right)\ \stackrel{{\scriptstyle d}}{{\longrightarrow}}\ {\cal N}(\lambda m,\sigma^{2})\hskip 14.22636pt\mbox{as $n\rightarrow\infty$}.

\frac{\sqrt{v_{n}}}{\log(d_{n})}\left(\frac{\hat{x}^{(k)}_{p_{n},t_{n}}}{{x}^{(k)}_{p_{n}}}-1\right)\ \stackrel{{\scriptstyle d}}{{\longrightarrow}}\ {\cal N}(\lambda m,\sigma^{2})\hskip 14.22636pt\mbox{as $n\rightarrow\infty$}.

\widebar G (t) = exp (- t^{- 1/ γ_{C}}) (t \geq 0) \makebox [39.83368 pt] [c] or \widebar G (t) = (1 + t^{τ_{C}} / β)^{- 1/ (γ_{C} τ_{C})} (t \geq 1) . \vspace 0.1 c m

\widebar G (t) = exp (- t^{- 1/ γ_{C}}) (t \geq 0) \makebox [39.83368 pt] [c] or \widebar G (t) = (1 + t^{τ_{C}} / β)^{- 1/ (γ_{C} τ_{C})} (t \geq 1) . \vspace 0.1 c m

\overset{γ}{^}_{1} = \frac{1}{n \widebar F _{n}^{(1)} ( Z _{(n - k_{n})} )} i = 1 \sum k_{n} \frac{lo g ( Z _{(n - i + 1)} / Z _{(n - k_{n})} )}{\widebar G _{n} ( Z _{(n - i, n)} )} δ_{(n - i + 1)} I_{C_{(n - i + 1)} = 1}

\overset{γ}{^}_{1} = \frac{1}{n \widebar F _{n}^{(1)} ( Z _{(n - k_{n})} )} i = 1 \sum k_{n} \frac{lo g ( Z _{(n - i + 1)} / Z _{(n - k_{n})} )}{\widebar G _{n} ( Z _{(n - i, n)} )} δ_{(n - i + 1)} I_{C_{(n - i + 1)} = 1}

\overset{γ}{^}_{1}^{(B D F G)}

\overset{γ}{^}_{1}^{(B D F G)}

\overset{γ}{^}_{1}^{(K M)}

M_{n}^{(α)} = \frac{1}{n \widebar F _{n}^{(k)} ( t _{n} )} i = 1 \sum n \frac{lo g ^{α} ( Z _{i} / t _{n} )}{\widebar G _{n} ( Z _{i}^{-} )} I_{ξ_{i} = k} I_{Z_{i} > t_{n}},

M_{n}^{(α)} = \frac{1}{n \widebar F _{n}^{(k)} ( t _{n} )} i = 1 \sum n \frac{lo g ^{α} ( Z _{i} / t _{n} )}{\widebar G _{n} ( Z _{i}^{-} )} I_{ξ_{i} = k} I_{Z_{i} > t_{n}},

ϕ_{n} (u)

ϕ_{n} (u)

ϕ_{n} (u)

γ_{n, k}

γ_{n, k}

γ_{n, k}

Δ_{n} = \widebar F_{n}^{(k)} (t_{n}) / \widebar F^{(k)} (t_{n}) = \int g_{n} (u) d F_{n}^{(k)} (u) \makebox [48.36958 pt] [c] an d g_{n} (u) = \frac{1}{\widebar F ^{(k)} ( t _{n} )} I_{u > t_{n}},

Δ_{n} = \widebar F_{n}^{(k)} (t_{n}) / \widebar F^{(k)} (t_{n}) = \int g_{n} (u) d F_{n}^{(k)} (u) \makebox [48.36958 pt] [c] an d g_{n} (u) = \frac{1}{\widebar F ^{(k)} ( t _{n} )} I_{u > t_{n}},

U_{i, n}^{(1)}

U_{i, n}^{(1)}

U_{i, n}^{(2)}

U_{i, n}^{(3)}

U_{i, n}

ψ (f, z) = \int_{z}^{+ \infty} f (t) d F^{(k)} (t) \makebox [45.52458 pt] [c] an d C (z) = \int_{0}^{z} \frac{d G ( t )}{\widebar H ( t ) \widebar G ( t )} .

ψ (f, z) = \int_{z}^{+ \infty} f (t) d F^{(k)} (t) \makebox [45.52458 pt] [c] an d C (z) = \int_{0}^{z} \frac{d G ( t )}{\widebar H ( t ) \widebar G ( t )} .

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsStatistical Distribution Estimation and Applications · Financial Risk and Volatility Modeling · Statistical Methods and Inference

Full text

Extreme value statistics for censored data with heavy tails under competing risks

Julien Worms (1) & Rym Worms111Corresponding author (2)

(1) Université Paris-Saclay / Université de Versailles-Saint-Quentin-En-Yvelines

Laboratoire de Mathématiques de Versailles (CNRS UMR 8100),

F-78035 Versailles Cedex, France,

e-mail : [email protected]

(2) Université Paris-Est

Laboratoire d’Analyse et de Mathématiques Appliquées

(CNRS UMR 8050),

UPEMLV, UPEC, F-94010, Créteil, France,

e-mail : [email protected]

Extreme value statistics for censored data with heavy tails under competing risks

**Abstract

This paper addresses the problem of estimating, in the presence of random censoring as well as competing risks, the extreme value index of the (sub)-distribution function associated to one particular cause, in the heavy-tail case. Asymptotic normality of the proposed estimator (which has the form of an Aalen-Johansen integral, and is the first estimator proposed in this context) is established. A small simulation study exhibits its performances for finite samples. Estimation of extreme quantiles of the cumulative incidence function is also addressed.

**

*AMS Classification. * Primary 62G32 ; Secondary 62N02

*Keywords and phrases. * Extreme value index. Tail inference. Random censoring. Competing Risks. Aalen-Johansen estimator.

1 Introduction

The study of duration data (lifetime, failure time, re-employment time…) subject to random censoring is a major topic of the domain of statistics, which finds applications in many areas (in the sequel we will, for convenience, talk about lifetimes to refer to these observed durations, but without restricting our scope to lifetime data analysis). In general, the interest lies in obtaining informations about the central characteristics of the underlying lifetime distribution (mean lifetime or survival probabilities for instance), often with the objective of comparing results between different conditions under which the lifetime data are acquired. In this work, we will address the problem of inferring about the (upper) tail of the lifetime distribution, for data subject both to random (right) censoring and competing risks.

Suppose indeed that we are interested in the lifetimes of $n$ individuals or items, which are subject to $K$ different causes of death or failure, and to random censorship (from the right) as well. We are particularly interested in one of these causes (this main cause will be considered as cause number $k$ thereafter, where $k\in\{1,\ldots,K\}$ ), and we suppose that all causes are exclusive and are likely to be dependent on the others. The censoring time is assumed to be independent of the different causes of death or failure and of the observed lifetime itself. However, since the other causes (different from the $k$ -th cause of interest) generally cannot be considered as independent of the main cause, in no way they can be included in the censoring mechanism. This prevents us from relying on the basic independent censoring statistical framework, and we are thus in the presence of what is called a competing risks framework (see Moeschberger and Klein (1995)).

For instance, if a patient is suffering from a very serious disease and starts some treatment, then the final outcome of the treatment can be death due to the main disease, or death due to other causes (nosocomial infection for instance). And censoring can occur due to loss of follow up or end of the clinical study. Another example, in a reliability experiment, is that the failure of some mechanical system can be due to the failure of a particular subpart, or component, of the system : since separating the different components for studying the reliability of only one of them is generally not possible, accounting for these different competing causes of failure is necessary. Another field where competing risks often arise are labor economics, for instance in re-employment studies (see Fermanian (2003) for practical examples).

One way of formalising this is to say that we observe a sample of $n$ independent couples $(Z_{i},\xi_{i})_{1\leq i\leq n}$ where

[TABLE]

The i.i.d. samples $(X_{i})_{i\leq n}$ and $(C_{i})_{i\leq n}$ , of respective continuous distribution functions $F$ and $G$ , represent the lifetimes and censoring times of the individuals, and are supposed to be independent. For convenience, we will suppose in this work that they are non-negative. The variables $({\mathscr{C}}_{i})_{i\leq n}$ form a discrete sample with values in $\{1,\ldots,K\}$ , and represent the causes of failure or death of the $n$ individuals or items. It is important to note that these causes are observed only when the data is uncensored (i.e. when $\delta_{i}=1$ ), therefore we only observe the $\xi_{i}$ ’s, not the complete ${\mathscr{C}}_{i}$ ’s.

One way of considering the failure times $X_{i}$ is to write

[TABLE]

where the variable $X_{i,k}$ is a (rather artificial) variable representing the imaginary latent lifetime of the $i$ -th individual when the latter is only affected by the $k$ -th cause (the other causes being absent). This viewpoint may be interesting in its own right, but we will not keep on considering it in the sequel, one reason being that such variables $X_{i,1},\ldots,X_{i,K}$ cannot be realistically considered as independent, and their respective distributions are of no practical use or interpretability (as explained and demonstrated in the competing risks literature, these distributions are in fact not statistically identifiable, see Tsiatis (1975) for example).

The object of interest is the probability that a subject dies or fails after some given time $t$ , due to the $k$ -th cause, for high values of $t$ . This quantity, denoted by

[TABLE]

is related to the so-called cumulative incidence function $F^{(k)}$ defined by

[TABLE]

Note that $\widebar{F}^{(k)}(t)$ is not equal to $1-F^{(k)}(t)$ , but to $\mathbb{P}({\mathscr{C}}=k)-F^{(k)}(t)$ , because $F^{(k)}$ is only a sub-distribution function. However we have $\widebar{F}^{(k)}(t)=\int_{t}^{\infty}dF^{(k)}(u)$ . In the sequel, the notation $\bar{S}(.)=S(\infty)-S(.)$ will be used, for any non-decreasing function $S$ .

In this paper, we are interested in investigating the behaviour of $\widebar{F}^{(k)}(t)$ for large values of $t$ . This amounts to statistically study extreme values in a context of censored data under competing risks, and will lead us to consider some extreme value index $\gamma_{k}$ related to $\widebar{F}^{(k)}$ , which will be defined in a few lines. Equivalently, the object of interest is the high quantile $x^{(k)}_{p}=(\widebar{F}^{(k)})^{-}(p)=\inf\{\,x\in\mathbb{R}\,;\,\widebar{F}^{(k)}(x)\geq p\,\}$ when $p$ is close to [math], which can be interpreted as follows (in the context of lifetimes of individuals or failure times of systems) : in the presence of the other competing causes, a given individual (or item) will die (or fail), due to cause $k$ after such a time $x^{(k)}_{p}$ , only with small probability $p$ . A nonparametric inference for quantiles of fixed (and therefore not extreme) order, in the competing risk setting, has been already proposed in Peng and Fine (2007).

One way of addressing this problem could be through a parametric point of view (see Crowder (2001) for further methods in the competing risk setting), however, the non-parametric approach is the most common choice of people faced with data presenting censorship or competing risks. Of course, the standard Kaplan-Meier method for survival analysis does not yield valid results for a particular risk if failures from other causes are treated as censoring times, because the other causes cannot always be considered independent of the particular cause of interest.

The commonly used nonparametric estimator of the cumulative incidence function $F^{(k)}$ is the so-called Aalen-Johansen estimator (see Aalen and Johansen (1978), or Geffray (2009) equation $(7)$ ) defined by

[TABLE]

where $\widebar{G}_{n}$ denotes the standard Kaplan-Meier estimator of $G$ (and $\widebar{G}_{n}(t^{-})$ denotes $\lim_{s\uparrow t}\widebar{G}_{n}(s)$ ), so that we can introduce the following estimator for $\widebar{F}^{(k)}$ :

[TABLE]

But if the value $t$ considered is so high that only very few (if any) observations $Z_{i}$ (such that ${\mathscr{C}}_{i}=k$ ) exceed $t$ , then this purely nonparametric approach will lead to very unstable estimations $\widebar{F}_{n}^{(k)}(t)$ of $\widebar{F}^{(k)}(t)$ . This is why a semiparametric approach is desirable, and the one we will consider here is the one inspired by classical extreme value theory.

First note that in this paper, we will only consider situations where the underlying distributions $F$ and $G$ of the variables $X$ and $C$ are supposed to present power-like tails (also commonly named heavy tails), and we will focus on the evaluation of the order of this tail. Our working hypothesis will be thus that the different functions $\widebar{F}^{(k)}$ (for $k=1,\ldots,K$ ) as well as $\widebar{G}=1-G$ belong to the Fréchet maximum domain of attraction. In other words, we assume that they are (see Definition 1 in the Appendix) regularly varying at infinity, with respective negative indices $-1/\gamma_{1},\ldots,-1/\gamma_{K}$ and $-1/\gamma_{C}$

[TABLE]

Consequently, $\widebar{F}=1-F=\sum_{k=1}^{K}\widebar{F}^{(k)}$ and $\widebar{H}=\widebar{F}\widebar{G}$ (the survival function of $Z$ ) are regularly varying (at $+\infty$ ) with respective indices $-1/\gamma_{F}$ and $-1/\gamma$ , where $\gamma_{F}=\max(\gamma_{1},\ldots,\gamma_{K})$ and $\gamma$ satisfies $\gamma^{-1}={\gamma_{F}}^{-1}+{\gamma_{C}}^{-1}$ (these relations are constantly used in this paper).

The estimation of $\gamma_{F}$ has been already studied in the literature, as it corresponds to the random (right) censoring framework, without competing risks. We can cite Beirlant et al. (2007) and Einmahl et al. (2008), where the authors propose to use consistent estimators of $\gamma$ divided by the proportion of non-censored observations in the tail, or Worms and Worms (2014), where two Hill-type estimators are proposed for $\gamma_{F}$ , based on survival analysis techniques. However, our target here is $\gamma_{k}$ (for a fixed $k=1,\ldots,K$ ) and the point is that there seems to be no way to deduce an estimator of $\gamma_{k}$ from an estimator of $\gamma_{F}$ . Note that the useful trick used in Beirlant et al. (2007) and Einmahl et al. (2008) to construct an estimator of $\gamma_{F}$ does not seem to be extendable to this competing risks setting. To the best of our knowledge, our present paper is the first one addressing the problem of estimating the cause-specific extreme value index $\gamma_{k}$ .

Considering assumption (1), it is simple to check that, for a given $k$ , we have

[TABLE]

It is therefore most natural to propose the following (Hill-type) estimator of $\gamma_{k}$ , for some given threshold value $t_{n}$ (assumptions on this threshold are detailed in the next section) :

[TABLE]

which can be also written as

[TABLE]

where $Z_{(1)}\leq\ldots\leq Z_{(n)}$ are the ordered random variables associated to $Z_{1},\ldots,Z_{n}$ , and $\delta_{(i)}$ and ${\mathscr{C}}_{(i)}$ are the censoring indicator and cause number which correspond to the order statistic $Z_{(i)}$ . It is clear that this estimator is a generalisation of one of the estimators proposed in Worms and Worms (2014), in which the situation $K=1$ (with only one cause of failure/death) was considered. The asymptotic result we prove in the present work is then valid in the situation studied in the latter, where only consistency was proved and a random threshold was used.

Our paper is organized as follows: in Section 2, we state the asymptotic normality result of the proposed estimator, and of a corresponding estimator of an extreme quantile of the cumulative incidence function. Section 5 is devoted to the proofs. In Section 3, we present some simulations in order to illustrate finite sample behaviour of our estimator. Some technical aspects of the proofs are postponed to the Appendix.

2 Assumptions and Statement of the results

The central limit theorem which is going to be proved has the rate $\sqrt{v_{n}}$ where $v_{n}=n\widebar{F}_{n}^{(k)}(t_{n})\widebar{G}(t_{n})$ and $t_{n}$ is a threshold tending to $\infty$ with the following constraint

[TABLE]

If we note $l_{k}$ the slowly varying function associated to $\widebar{F}^{(k)}$ (i.e. such that $\widebar{F}^{(k)}(x)=x^{-1/\gamma_{k}}l_{k}(x)$ in condition (1)), the second order condition we consider is the classical $SR2$ condition for $l_{k}$ (see Bingham, Goldie and Teugels (1987)),

[TABLE]

where $g$ is a positive measurable function, slowly varying with index $\rho_{k}\leq 0$ , and $h_{\rho_{k}}(x)=\frac{x^{\rho_{k}}-1}{\rho_{k}}$ when $\rho_{k}<0$ , or $h_{\rho_{k}}(x)=\log x$ when $\rho_{k}=0$ .

Theorem 1

Under assumptions $(\ref{Ordre1})$ , $(\ref{condvntn})$ and $(\ref{Ordre2})$ , if there exists $\lambda\geq 0$ such that $\sqrt{v_{n}}g(t_{n})\stackrel{{\scriptstyle n\rightarrow\infty}}{{\longrightarrow}}\lambda$ , and if $\gamma_{k}<\gamma_{C}$ then we have

[TABLE]

where

[TABLE]

with $c=\lim_{x\rightarrow\infty}\widebar{F}^{(k)}(x)/\widebar{F}(x)\in[0,1]$ and $r=\gamma_{k}/\gamma_{C}\in]0,1[$ .

Remark 1

Note that when $\gamma_{k}<\gamma_{F}$ , then $c=0$ , and, when $\gamma_{k}=\gamma_{F}$ and $c=1$ (for instance when there is only one cause of failure/death), then $\sigma^{2}$ reduces to $\gamma_{F}^{2}/(1-r)$ .

Proposition 1

Under assumptions $(\ref{Ordre1})$ and $(\ref{condvntn})$ , we have

[TABLE]

Remark 2

The condition $\gamma_{k}<\gamma_{C}$ (weak censoring) is not necessary for the consistency of $\widehat{\gamma}_{n,k}$ .

Now, concerning the estimation of an extreme quantile $x^{(k)}_{p_{n}}$ (of order $p_{n}$ tending to [math]) associated to $\widebar{F}^{(k)}$ , we propose the usual Weissman-type estimator (in this heavy tailed context), associated to the threshold $t_{n}$ used in the estimation of $\gamma_{k}$ ,

[TABLE]

where $p_{n}$ is assumed to satisfy the constraint $p_{n}=o\big{(}\widebar{F}^{(k)}(t_{n})\big{)}$ . Remind that by definition $\widebar{F}^{(k)}({x}^{(k)}_{p_{n}})=p_{n}$ , and thus the definition of this estimator is based on the fact that, by the assumed regular variation of $\widebar{F}^{(k)}$ , the ratio $\widebar{F}^{(k)}({x}^{(k)}_{p_{n}})/\widebar{F}^{(k)}(t_{n})$ is close to $({x}^{(k)}_{p_{n}}/t_{n})^{-1/\gamma_{k}}$ .

Corollary 1

Under the assumptions of Theorem 1, if in addition $\rho_{k}<0$ (in (3)) and $d_{n}=\widebar{F}^{(k)}(t_{n})/p_{n}\rightarrow\infty$ satisfies the condition

[TABLE]

then (with $\lambda$ , $m$ and $\sigma^{2}$ being defined in the statement of Theorem 1)

[TABLE]

3 Simulations

In this section, a small simulation study is conducted in order to illustrate the finite-sample behaviour of our new estimator in some simple cases, and discuss the main issues associated with the competing risks setting.

For simplicity, we focus on the situation with two competing risks ( $K=2$ ), also called causes below, and our aim is the extreme value index $\gamma_{1}$ associated to the first cause. Data are generated from one of the following two models : for $c_{1}$ , $c_{2}$ non-negative constants satisfying $c_{1}+c_{2}=1$ , we consider the following (sub-)distribution for each cause-specific function $\widebar{F}^{(k)}$ ( $k\in\{1,2\}$ ) :

$-$ Fréchet : $\widebar{F}^{(k)}(t)=c_{k}\ \exp(-t^{-1/\gamma_{k}})$ , for $t\geq 0$ ;

$-$ Burr : $\widebar{F}^{(k)}(t)=c_{k}\ (1+t^{\tau_{k}}/\beta)^{-1/(\gamma_{k}\tau_{k})}$ , for $t\geq 1$ , where $\tau_{k}>0$ , $\beta>0$ .

The lifetime $X$ , of survival function $\bar{F}=\bar{F}^{(1)}+\bar{F}^{(2)}$ , is generated by the inversion method (with numerical computation of $\widebar{F}^{-1}$ ). Censoring times are then generated from a Fréchet or a Burr distribution :

[TABLE]

In this section, we consider (as it is often done in simulation studies) that the threshold $t_{n}$ used in the definition of our new estimator $\hat{\gamma}_{n,1}$ is taken equal to $Z_{(n-k_{n})}$ (i.e. we consider it as random). One aim of this section is to show how our estimator (with random threshold)

[TABLE]

of $\gamma_{1}$ behaves when the proportion $c_{1}$ of cause $1$ events varies : we consider $c_{1}\in\{1,0.9,0.7,0.5\}$ , the case $c_{1}=1$ corresponding to the simple censoring framework, without competing risk.

Another aim is to illustrate the impact of dependency between the causes, when estimating the tail. The starting point is that, if cause $2$ could be considered independent of cause $1$ , then we could (and would) include it in the censoring mechanism and we would be in the simple random censoring setting, without competing risk. In this case, it would be possible to estimate $\gamma_{1}$ by one of the following two estimators, the first one being proposed in Beirlant et al. (2007) (a Hill estimator weighted with a constant weight), and the second one in Worms and Worms (2014) (a Hill estimator weighted with varying Kaplan-Meier weights):

[TABLE]

where, in Equation $(\ref{vieuxestim1})$ , $\hat{p}_{1}=\frac{1}{k_{n}}\sum_{i=1}^{k_{n}}\delta_{(n-i+1)}\mathbb{I}_{{\mathscr{C}}_{(n-i+1)}=1}$ , and in Equation $(\ref{vieuxestim2})$ , the Kaplan Meier estimators $\widebar{F}_{n,b}$ and $\widebar{G}_{n,b}$ are based on the $\tilde{\delta}_{i}=\delta_{i}\mathbb{I}_{{\mathscr{C}}_{i}=1}$ . These two estimators consider the uncensored lifetimes associated to cause 2 as independent censoring times. Comparing our new estimator with these latter two estimators, when $c_{1}<1$ , will empirically prove that considering cause $2$ as a competing risk independent of cause $1$ has a great (negative) impact on the estimation of $\gamma_{1}$ . Note that when $c_{1}=1$ , the new estimator $\hat{\gamma}_{1}$ and $\hat{\gamma}_{1}^{(KM)}$ are exactly the same (therefore the thick and dashed lines in sub-figures (a), (c) and (e) of Figures 2 and 3 are overlapping, identical).

We address these two aims for each set-up (Fréchet, or Burr), by generating $2000$ datasets of size $500$ , with three configurations of the triplet $(\gamma_{1},\gamma_{2},\gamma_{C})$ : $(0.1,0.25,0.3)$ ( $\gamma_{1}<\gamma_{2}$ , moderate censoring $\gamma_{C}>\gamma_{F}$ ), $(0.1,0.25,0.2)$ ( $\gamma_{1}<\gamma_{2}$ , heavy censoring $\gamma_{C}<\gamma_{F}$ ), or $(0.25,0.1,0.45)$ ( $\gamma_{1}>\gamma_{2}$ , moderate censoring $\gamma_{C}<\gamma_{F}$ ). Median bias and mean squared error (MSE) of the different estimators are plotted against different values of $k_{n}$ , the number of excesses used. When Burr distributions are simulated, the parameter $\beta$ is taken equal to $1$ , and the parameters $(\tau_{1},\tau_{2},\tau_{C})$ are taken equal to $(12,6,5)$ in configurations 1 and 2, and to $(6,12,5)$ in configuration 3.

Figure 1 illustrates the behaviour of our estimator when $c_{1}$ varies. In terms of bias and MSE, we can see that the first configuration is a little better than the second one, which is itself much better than the third one. We observed this phenomenon in many other cases, not reported here : our estimator behaves best when it is the smallest parameter $\gamma_{k}$ which is estimated, and when the censoring is not too strong. Our simulations also show that the quality of our estimator (especially in terms of the MSE) diminishes with $c_{1}$ .

Figures 2 and 3 present the comparison between our new estimator and the ones described in (5) and (6). A general conclusion (confirmed by other simulations not reported here) is that $\hat{\gamma}_{1}^{(BDFG)}$ and $\hat{\gamma}_{1}^{(KM)}$ behave worse in most cases, even for a value of $c_{1}$ of $0.9$ , which is only a slight modification of the situation without competing risk ( $c_{1}=1$ ). Therefore, a contamination of the cause $1$ distribution by another cause rapidly yield inadequate estimations of $\gamma_{1}$ if dependency between causes is ignored ; this conclusion is true for both $\hat{\gamma}_{1}^{(BDFG)}$ and $\hat{\gamma}_{1}^{(KM)}$ , but to a greater extent for $\hat{\gamma}_{1}^{(BDFG)}$ . In the third configuration $(\gamma_{1},\gamma_{2},\gamma_{C})=(0.25,0.1,0.45)$ , the improvement provided by $\hat{\gamma}_{1}$ (with respect to $\hat{\gamma}_{1}^{(KM)}$ ) becomes notable when $c_{1}$ drops below $0.7$ .

4 Conclusion

In this paper, we consider heavy tailed lifetime data subject to random censoring and competing risks, and use the Aalen-Johansen estimator of the cumulative incidence function to construct an estimator for the extreme value index associated to the main cause of interest. To the best of our knowledge, this is the first estimator proposed in this context. Its asymptotic normality is proved and a small simulation study exhibiting its finite-sample performance shows that accounting for the dependency of the different causes is important, but that the bias can be particularly high. Estimating second order tail parameters would then be interesting in order to reduce this bias. A first step towards this aim could be to study the following moments

[TABLE]

which asymptotic behaviour can be derived following the same lines as in the proof of Theorem 1.

5 Proofs

This section is essentially devoted to the proof of the main Theorem 1. Some hints about the proof of the consistency result contained in Proposition 1 are given in Subsection 5.3, and Corollary 1 is proved in Subsection 5.4.

We adopt a strategy developed by Stute in Stute (1995) in order to prove his Theorem 1.1, a well-known result which states that a Kaplan-Meier integral of the form $\int\phi\,dF_{n}$ can be approximated by a sum of independent terms. This idea is used in Suzukawa (2002) in the context of competing risks. We thus intend to approximate $\widehat{\gamma}_{n,k}$ by the integral $\widetilde{\gamma}_{n,k}=\int\phi_{n}\,dF_{n}^{(k)}$ of some deterministic function $\phi_{n}$ , with respect to the Aalen-Johansen estimator, and approximate this integral by the mean $\widecheck{\gamma}_{n,k}$ of independent variables $U_{i,n}$ (defined a few lines below). The passage from $\widehat{\gamma}_{n,k}$ to $\widetilde{\gamma}_{n,k}$ (which amounts to replacing $\widebar{F}_{n}^{(k)}(t_{n})$ by $\widebar{F}^{(k)}(t_{n})$ in the denominator of $\widehat{\gamma}_{n,k}$ ) will imply an additional sum of independent variables $V_{i,n}$ , which will participate to the asymptotic variance of our estimator.

However, a major difference with Stute (1995) or Suzukawa (2002) is that the function we integrate here, $\phi_{n}(u)=\frac{1}{\widebar{F}^{(k)}(t_{n})}\log(u/t_{n})\mathbb{I}_{u>t_{n}}$ , is not only an unbounded function, depending on $n$ , but it also has a ”sliding” support $[t_{n},+\infty[$ , which is therefore always close to the endpoint $+\infty$ of the distribution $H$ . In Stute (1995), a crucial point of the proof consists in temporarily considering that the integrated function $\phi$ has a support which is bounded away from the endpoint of $H$ (condition (2.3) there). Considering the kind of function $\phi_{n}$ we have to deal with here, we cannot follow the same strategy : dealing with the remainder terms will thus be a particularly challenging part of our work. Finally note that, in order to deal with the ratio $\widebar{F}_{n}^{(k)}(t_{n})/\widebar{F}^{(k)}(t_{n})$ (and somehow approximate $\widehat{\gamma}_{n,k}$ by $\widetilde{\gamma}_{n,k}$ ) we will have to consider simultaneously integrals (with respect to $F_{n}^{(k)}$ ) of $\phi_{n}$ and of another function $g_{n}$ , defined below, which basically shares the same flaws as $\phi_{n}$ .

Let us first recall or define the following objects :

[TABLE]

We thus have $\widehat{\gamma}_{n,k}=\Delta_{n}^{-1}\widetilde{\gamma}_{n,k}$ , where

[TABLE]

and we now introduce the following new quantities, related to the Stute-like decomposition of $\widetilde{\gamma}_{n,k}$ and $\Delta_{n}$ :

[TABLE]

where, for any function $f:\mathbb{R}_{+}\rightarrow\mathbb{R}$ , we note (for any given $z\geq 0$ )

[TABLE]

This enables us to finally define the important objects

[TABLE]

which are the triangular sums of independent terms which will respectively approximate $\widetilde{\gamma}_{n,k}$ and $\Delta_{n}$ . At the beginning of section 5.1, it will be proved that $\mathbb{E}(U^{(1)}_{i,n})=\gamma_{n,k}$ and $\mathbb{E}(V^{(1)}_{i,n})=1$ , while $\mathbb{E}(U^{(2)}_{i,n})=\mathbb{E}(U^{(3)}_{i,n})$ and $\mathbb{E}(V^{(2)}_{i,n})=\mathbb{E}(V^{(3)}_{i,n})$ , yielding $\mathbb{E}(\widecheck{\gamma}_{n,k})=\gamma_{n,k}$ and $\mathbb{E}(\widehat{\Delta}_{n})=1$ ; the terms $U^{(2)}_{i,n}$ , $U^{(3)}_{i,n}$ , $V^{(2)}_{i,n}$ and $V^{(3)}_{i,n}$ only participate to the variance component of the estimator. The relation between all these quantities is made clearer in the following Lemma :

Lemma 1

We have

[TABLE]

where

[TABLE]

The proof of Lemma 1 is simple :

[TABLE]

which leads to the desired relation (8).

The main theorem thus becomes an immediate consequence of the following four results, the second one being the most difficult to establish.

Proposition 2

Under condition $(\ref{Ordre1})$ and assuming that

[TABLE]

if $\gamma_{k}<\gamma_{C}$ , then

[TABLE]

where $\sigma^{2}$ is defined in the statement of Theorem 1.

Proposition 3

Under conditions $(\ref{Ordre1})$ and $(\ref{condvntn})$ , if $\gamma_{k}<\gamma_{C}$ , then

[TABLE]

where $R_{n,\phi}$ , $R_{n,g}$ (and consequently $R_{n}$ too) are $o_{\mathbb{P}}(v_{n}^{-1/2})$ .

Corollary 2

Under the conditions of Proposition 3, $\sqrt{v_{n}}(\Delta_{n}-1)$ converges in distribution to ${\cal N}(0,1/(1-r))$ where $r=\gamma_{k}/\gamma_{C}\in]0,1[$ .

Lemma 2

Under conditions $(\ref{Ordre1})$ , $(\ref{Ordre2})$ and $\sqrt{v_{n}}g(t_{n})\rightarrow\lambda\geq 0$ , the bias term $\sqrt{v_{n}}(\gamma_{n,k}-\gamma_{k})$ in (8) converges to $\lambda m$ as $n\rightarrow\infty$ , where $m$ is defined in Theorem 1.

Propositions 2 and 3 will be proved in Sections 5.1 and 5.2 respectively, sometimes with the help of other results stated and established in the Appendix. The proofs of Corollary 2 and Lemma 2 are short, we state them below.

Concerning Corollary 2, once the proof of Proposition 2 has been gone through, it will become clear to the reader that $\sqrt{v_{n}}(\hat{\Delta}_{n}-1)$ converges in distribution to the centred gaussian distribution of variance $1/(1-r)$ , because $\sqrt{v_{n}}(\hat{\Delta}_{n}-1)=\sum_{i=1}^{n}\tilde{W}_{i,n}$ where $\tilde{W}_{i,n}=\frac{\sqrt{v_{n}}}{n}\tilde{V}_{i,n}=\frac{\sqrt{v_{n}}}{n}(V_{i,n}-1)$ are centred, and ${\mathbb{V}}ar(\tilde{W}_{i,n})=\frac{1}{n}\frac{1}{1-r}+o(1/n)$ (this is proved similarly as (11) and (15)). Since Proposition 3 states that $\Delta_{n}=\hat{\Delta}_{n}+o_{\mathbb{P}}(v_{n}^{1/2})$ , the same central limit theorem holds for $\Delta_{n}$ and the corollary is proved.

Concerning now Lemma 2, remind that $\gamma_{n,k}=\int\phi_{n}(u)dF_{n}^{(k)}(u)$ . An integration by parts and the fact that $\widebar{F}^{(k)}(x)=x^{-1/\gamma_{k}}l_{k}(x)$ yield

[TABLE]

and, using assumption $(\ref{Ordre2})$ and Proposition 3.1 in de Haan and Ferreira (2006), we can write

[TABLE]

The result then follows from assumption $\sqrt{v_{n}}g(t_{n})\rightarrow\lambda\geq 0$ and the fact that $\int_{1}^{+\infty}y^{-1/\gamma_{k}-1}h_{\rho_{k}}(y)dy=m$ .

In the rest of the paper, we will very often handle the well-known sub-distributions functions $H^{(0)}$ and $H^{(1,k)}$ defined, for all $t\geq 0$ , by

[TABLE]

Note that we have

[TABLE]

5.1 Proof of Proposition 2

We first write

[TABLE]

where $W_{i,n}$ , $\tilde{U}_{i,n}$ and $\tilde{V}_{i,n}$ are centred, because the random variables $U_{i,n}$ and $V_{i,n}$ have expectations respectively equal to $\gamma_{n,k}$ and $1$ . Indeed, we have

[TABLE]

and

[TABLE]

as well as

[TABLE]

The proof for $\mathbb{E}(V_{i,n})=1$ is similar.

We will now prove the asymptotic normality of $Z_{n}$ by using the Lyapunov criteria.

Lemma 3

Under the conditions $(\ref{Ordre1})$ and $(\ref{condvntn1})$ , if $\gamma_{k}<\gamma_{C}$ :

$(i)$

we have

[TABLE] 2. $(ii)$

we have

[TABLE] 3. $(iii)$

we have, noting $r=\gamma_{k}/\gamma_{C}$ (which belongs to $]0,1[$ under our conditions) as well as $p=\gamma/\gamma_{F}=\gamma_{C}/(\gamma_{F}+\gamma_{C})\in]0,1[$ ,

[TABLE]

Lemma 4

Under the conditions $(\ref{Ordre1})$ and $(\ref{condvntn1})$ , if $\gamma_{k}<\gamma_{C}$ , then

$\sum_{i=1}^{n}\mathbb{E}\left|W_{i,n}\right|^{2+\delta}\rightarrow 0$ , as $n$ tends to infinity, for some $\delta>0$ .

We can then immediately prove Proposition 2. Indeed, since $Z_{n}=\sum_{i=1}^{n}W_{i,n}$ , Lemma 3 yields

[TABLE]

which, since $v_{n}=n\widebar{F}_{n}^{(k)}(t_{n})\widebar{G}(t_{n})$ , becomes

[TABLE]

Therefore, depending on the limit $c$ of the ratio $\widebar{F}^{(k)}(t_{n})/\widebar{F}(t_{n})$ when $n\rightarrow\infty$ (for instance, it converges to [math] when $\gamma_{k}<\gamma_{F}$ ), it is simple to check that the variance of $Z_{n}$ converges to the value $\sigma^{2}$ described in the statement of Theorem 1. Thanks to Lemma 4, the Lyapunov CLT applies and Proposition 2 is proved.

The two subsections 5.1.1 and 5.1.2 are now respectively devoted to the proofs of Lemmas 3 and 4.

5.1.1 Proof of Lemma 3

Part $(i)$ of the lemma is straightforward : since $\tilde{U}_{1,n}$ and $\tilde{V}_{1,n}$ are centred, we have indeed

[TABLE]

and the result comes by using the fact that $\gamma_{n,k}$ converges to $\gamma_{k}$ as $n\rightarrow\infty$ .

Now we proceed to the proof of part $(ii)$ , and will only prove (12) because, by definition of $\phi_{n}$ and $g_{n}$ , the proofs for (13) and (14) will be completely similar. First of all, we obviously have

[TABLE]

The first term in the right-hand side of (12) is equal to $\mathbb{E}((U^{(1)}_{1,n})^{2})$ , and the second one (without the minus sign) is equal to $\mathbb{E}((U^{(2)}_{1,n})^{2})$ and to $\mathbb{E}(U^{(1)}_{1,n}U^{(3)}_{1,n})$ because

[TABLE]

and

[TABLE]

The expectation $\mathbb{E}(U^{(1)}_{1,n}U^{(2)}_{1,n})$ equals [math] because $\delta_{1}(1-\delta_{1})$ is constantly [math], and we are now going to prove that $\mathbb{E}((U^{(3)}_{1,n})^{2})=2\mathbb{E}(U^{(2)}_{1,n}U^{(3)}_{1,n})$ , which ends the proof of (12) in view of (16). Indeed, noting $h(z)=\int_{0}^{z}\psi(\phi_{n},u)dC(u)$ and using the simple fact that $h(z)=h(y)+\int_{y}^{z}\psi(\phi_{n},u)dC(u)$ for every $y<z$ , we have

[TABLE]

and

[TABLE]

as announced.

We can now start proving part $(iii)$ of the lemma, in which the exact nature of the function $\phi_{n}$ matters. First remind that functions $\widebar{F}^{(k)}$ , $\widebar{G}$ and $C$ are regularly varying of respective orders $-1/\gamma_{k}$ , $-1/\gamma_{C}$ and $1/\gamma$ (for $C$ , this is proved in Lemma 8 with $\delta=1$ ). Let us define the constants $c_{j}$ and $d_{j}$ ( $j=0,1,2$ ) by

[TABLE]

Since $\gamma_{k}<\gamma_{C}$ was assumed, then according to Lemma 7 part $(ii)$ (applied first with $a+b=1/\gamma_{C}-1/\gamma_{k}<0$ for $c_{j}$ , and then with $a+b=-2/\gamma_{k}+1/\gamma=(1/\gamma_{C}-1/\gamma_{k})+(1/\gamma_{F}-1/\gamma_{k})<0$ for $d_{j}$ ) , we have

[TABLE]

Hence, by definition of $\phi_{n}$ , $g_{n}$ , the first terms of $\mathbb{E}(U_{1,n}^{2})$ , $\mathbb{E}(V_{1,n}^{2})$ and $\mathbb{E}(U_{1,n}V_{1,n})$ in relations (12), (13) and (14) are respectively equivalent (as $n\rightarrow\infty$ ) to $c_{2}D(t_{n})$ , $c_{0}D(t_{n})$ and $c_{1}D(t_{n})$ where $D(t_{n})$ denotes

[TABLE]

Since $c_{2}+\gamma_{k}^{2}c_{0}-2\gamma_{k}c_{1}$ is found to be equal to $\gamma_{k}^{2}(1+r^{2})/(1-r)^{3}$ , then in view of (11) this proves the first term in relation (15). We now need to obtain equivalent expressions for the quantities $\int(\psi(\phi_{n},u))^{2}\,dC(u)$ , $\int(\psi(g_{n},u))^{2}\,dC(u)$ and $\int\psi(\phi_{n},u)\psi(g_{n},u)\,dC(u)$ in order to prove the second part of relation (15) and therefore finish the proof of Lemma 3.

For saving space, we will use temporarily the following notations :

[TABLE]

According to the technical Lemma 9 of the Appendix and, after splitting the integral into $\int_{0}^{+\infty}$ and $\int_{t_{n}}^{+\infty}$ , we can write

[TABLE]

where $o(C(t_{n}))$ in $(\ref{JU})$ is due to part $(ii)$ of Lemma 7 and to the fact that $\epsilon_{n}(u)$ in Lemma 9 converges to [math] uniformly in $u$ . According to the second part of relation $(\ref{cjdj})$ , we thus have

[TABLE]

The other terms are treated similarly (using the fact that $\psi(g_{n},u)=1$ when $u\leq t_{n}$ , and $=\widebar{F}^{(k)}(u)/\widebar{F}^{(k)}(t_{n})$ when $u>t_{n}$ ) and we obtain

[TABLE]

In view of (11), combining $(\ref{equivintpsiphi})$ , $(\ref{equivintpsig})$ and $(\ref{equivintpsiphig})$ and using Remark 3 (following Lemma 8) to write that $C(t_{n})\sim(1-p)/\widebar{H}(t_{n})$ (as $n\rightarrow\infty$ ), this proves the second term in relation (15).

5.1.2 Proof of Lemma 4

We have to prove that, for some $\delta>0$ small enough, $n\mathbb{E}\left|W_{1,n}\right|^{2+\delta}$ tends to [math], as $n\rightarrow\infty$ . In the sequel, $cst$ denotes an unspecified absolute positive constant. According to the definition of $W_{1,n}$ , it is clear that

[TABLE]

First, we clearly have $n^{-1-\delta}\ v_{n}^{1+\delta/2}\left|\gamma_{n,k}-\gamma_{k}\right|^{2+\delta}\longrightarrow 0$ as $n\rightarrow\infty$ . Secondly, since $V^{(j)}_{1,n}$ has the same form as $U^{(j)}_{1,n}$ , with $g_{n}$ instead of $\phi_{n}$ (i.e. without the log factor), we will only prove that there exists some $\delta>0$ such that, as $n\rightarrow\infty$ ,

[TABLE]

For $j=1$ , we have

[TABLE]

Applying part $(ii)$ of Lemma 7 for $\alpha=2+\delta$ , $a=(1+\delta)/\gamma_{C}$ and $b=-1/\gamma_{k}$ (with $\delta$ sufficiently small so that $a+b=1/\gamma_{C}-1/\gamma_{k}+\delta/\gamma_{C}$ is kept $<0$ ), and using the fact that $v_{n}=n\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n})\rightarrow\infty$ , this ends the proof of (22) for $j=1$ .

For $j=2$ , we have

[TABLE]

By definition of $\psi$ , $\phi_{n}$ , and $\gamma_{n,k}$ , we have $\psi(\phi_{n},z)=\gamma_{n,k}$ when $z\leq t_{n}$ . Therefore, splitting the integral above into two integrals $\int_{0}^{t_{n}}$ and $\int_{t_{n}}^{+\infty}$ we obtain

[TABLE]

where, on one hand,

[TABLE]

and, on the other hand, using the technical Lemma 9, for some $\delta^{\prime}>0$ ,

[TABLE]

Applying Lemma 8 to $\delta=1$ , we have $C(t_{n})=O\left(1/\widebar{H}(t_{n})\right)$ , therefore $I_{1}(t_{n})=O\left((\bar{H}(t_{n}))^{-1-\delta}\right)$ . It is then easy to check that $n^{-\delta/2}\left(\widebar{F}^{(k)}(t_{n})\bar{G}(t_{n})\right)^{1+\delta/2}\ I_{1}(t_{n})$ tends to 0, because $\widebar{F}^{(k)}\leq\widebar{F}$ and $n\widebar{H}(t_{n})\rightarrow\infty$ , since $\widebar{H}(t_{n})\geq\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n})$ .

For $I_{2}(t_{n})$ , since by Lemma 8 the function $C$ is regularly varying with index $1/\gamma$ , the application of part $(ii)$ of Lemma 7 to $\alpha=0$ or $2+\delta$ and to various couples of values of $a$ and $b$ finally yields $I_{2}(t_{n})=O\left((\bar{H}(t_{n}))^{-1-\delta}\right)$ , and consequently $n^{-\delta/2}\left(\widebar{F}^{(k)}(t_{n})\bar{G}(t_{n})\right)^{1+\delta/2}\ I_{2}(t_{n})$ tends to 0.

We now come to the study of relation (22) for $j=3$ . We have

[TABLE]

Proceeding as above by splitting the integral into two integrals $\int_{0}^{t_{n}}$ and $\int_{t_{n}}^{+\infty}$ , we obtain

[TABLE]

where

[TABLE]

and

[TABLE]

where

[TABLE]

and, using the technical Lemma 9 as we did some lines above,

[TABLE]

Using Lemma 8 and part $(iii)$ of Lemma 7, we find that both $J_{1}(t_{n})$ and $J^{(1)}_{2}(t_{n})$ are $O\left((\bar{H}(t_{n}))^{-1-\delta}\right)$ and, though the term $J^{(2)}_{2}(t_{n})$ is more involved, we are also going to prove below that the same property holds for $J^{(2)}_{2}(t_{n})$ : this will finish the proof of Lemma 4 because $n^{-\delta/2}\left(\widebar{F}^{(k)}(t_{n})\bar{G}(t_{n})\right)^{1+\delta/2}(\bar{H}(t_{n}))^{-1-\delta}$ tends to [math], as already seen in the proof for $j=2$ .

We only treat the first integral in the right-hand side of $(\ref{majJ22})$ , since the two others are very similar, i.e. we need to prove that

[TABLE]

Now,

[TABLE]

Using Potter-bounds $(\ref{BornesPotter})$ for $\widebar{F}^{(k)}\in RV_{-1/\gamma_{k}}$ , integration by parts and then Potter-bounds $(\ref{BornesPotter})$ for $C\in RV_{1/\gamma}$ , it is easy to see that for $n$ sufficiently large and $\epsilon>0$ , there exists some positive constants $c$ , $c^{\prime}$ , $c^{\prime\prime}$ such that

[TABLE]

where $a=(2+\delta)(\frac{1}{\gamma}-\frac{1}{\gamma_{k}}+2\epsilon)$ . Consequently

[TABLE]

This yields $(\ref{integFkLiap})$ , by using part $(ii)$ of Lemma 7 to this value of $a$ , to $b=-1/\gamma$ (and to $\alpha=2+\delta$ or $\alpha=0$ ), as well as Lemma 8.

5.2 Proof of Proposition 3

Let us start with an important note. In Proposition 3, the main result is that the remainder terms $R_{n,\phi}$ and $R_{n,g}$ are $o_{\mathbb{P}}(v_{n}^{-1/2})$ . Proving this will be conducted in a similar way as proving that $R_{n}$ is $o_{\mathbb{P}}(n^{-1/2})$ in Theorem 1.1 of Stute (1995). But, recall that in our situation, the function that we integrate here is $\phi_{n}$ , which is depending on $n$ , with a ”sliding” support $[t_{n},+\infty[$ . We will need to be particularly cautious with integrability issues, especially when dealing with U-statistics for the terms $R_{n,2}$ and $R_{n,3}$ in the remainder $R_{n,C}$ , defined below.

Before we proceed with the proof, let us define the following empirical (sub)-distribution functions : for $t\geq 0$ ,

[TABLE]

First note that, since $g_{n}$ is the function $\phi_{n}$ without the log factor, it should be clear to the reader that proving that $\Delta_{n}=\hat{\Delta}_{n}+R_{n,g}$ and $\sqrt{v_{n}}R_{n,g}=o_{\mathbb{P}}(1)$ will be simpler than proving that $\widetilde{\gamma}_{n,k}=\widecheck{\gamma}_{n,k}+R_{n,\phi}$ and $\sqrt{v_{n}}R_{n,\phi}=o_{\mathbb{P}}(1)$ . We will thus only prove the latter two relations.

Let us start with the first one, in other words let us define the remainder term $R_{n,\phi}$ . Remind that the definitions of $\widetilde{\gamma}_{n,k}$ and $\widecheck{\gamma}_{n,k}$ are $\widetilde{\gamma}_{n,k}=\int\phi_{n}(u)dF_{n}^{(k)}(u)$ and $\widecheck{\gamma}_{n,k}=\widebar{U}_{n}^{(1)}+\widebar{U}_{n}^{(2)}-\widebar{U}_{n}^{(3)}$ , where $\widebar{U}_{n}^{(j)}$ denotes the mean of the $n$ variables $U^{(j)}_{i,n}$ . We need to decompose the integral of $\phi_{n}$ with respect to $F_{n}^{(k)}$ , which is a stepwise subdistribution function which jumps at the (ordered) observations $Z_{(i)}$ are equal to $\mathbb{I}_{\xi_{(i)}=k}/(n\widebar{G}_{n}(Z_{(i-1)}))$ . But it is known that (see Lemma 2.1 in Stute (1995))

[TABLE]

Therefore, using the fact that $\widebar{G}(z)=\exp\left(-\int_{0}^{z}\widebar{H}^{-1}dH^{(0)}\right)$ , we have

[TABLE]

Consequently, using the mean value theorem for $\exp$ , and introducing the important notations

[TABLE]

it is easy to see that

[TABLE]

where $\widebar{U}_{n}^{(1)}$ is the first term in the definition of $\widecheck{\gamma}_{n,k}$ , and $\Delta_{i,n}$ is a random quantity lying between $\int_{0}^{Z_{i}^{-}}\widebar{H}^{-1}dH^{(0)}$ and $n\int_{0}^{Z_{i}^{-}}\log(1+(n\widebar{H}_{n}(x))^{-1})\,dH_{n}^{(0)}(x)$ .

What we now need to do is to show that the term involving the quantity $C_{i,n}$ in relation (25) above can be written as $\widebar{U}_{n}^{(2)}-\widebar{U}_{n}^{(3)}$ plus a remainder term $R_{n,C}$ , and therefore we have $\widetilde{\gamma}_{n,k}=\widecheck{\gamma}_{n,k}+R_{n,\phi}$ , where

[TABLE]

The rest of the proof will, afterwards, be devoted to showing that each term of $R_{n,\phi}$ is $o_{\mathbb{P}}(v_{n}^{-1/2})$ .

Proceeding as in Stute (1995) or Suzukawa (2002), and using the fact that for any given function $f$ we have $\int f\,dH_{n}^{(1,k)}=\frac{1}{n}\sum_{i=1}^{n}f(Z_{i})\mathbb{I}_{\xi_{i}=k}$ , we can write

[TABLE]

where

[TABLE]

Note that $C_{n}^{(1)}$ and $C_{n}^{(2)}$ are a kind of $U$ -statistics, which need to be approximated by sums of independent variables called Hoeffding decompositions : more precisely, if we introduce the functions (important in the sequel)

[TABLE]

for $u\in\mathbb{R}$ , $v\in\mathbb{R}\cup\{+\infty\}$ and $w\in\mathbb{R}\cup\{+\infty\}$ , then these decompositions are defined by

[TABLE]

Therefore, if we introduce the remainder terms

[TABLE]

then (27) becomes

[TABLE]

We are thus left to prove that $-\widehat{C}_{n}^{(1)}+2\widehat{C}_{n}^{(2)}-C_{n}^{(3)}=\ \widebar{U}_{n}^{(2)}-\widebar{U}_{n}^{(3)}$ . This is indeed the case because, if we note

[TABLE]

then, by definition of $\underline{h}$ , the last (fourth) term in $\widehat{C}_{n}^{(1)}$ equals $-2\theta_{n}$ , the third one equals $C_{n}^{(3)}$ , the second one is (because $dH^{(1,k)}(w)=\widebar{G}(w^{-})dF^{(k)}(w)$ )

[TABLE]

and the first one is (because $dH^{(1,k)}(w)=\widebar{G}(w^{-})dF^{(k)}(w)$ and $dH^{(0)}(v)=\widebar{F}(v)dG(v)$ )

[TABLE]

Likewise, the first term of $\widehat{C}_{n}^{(2)}$ equals $\widebar{U}_{n}^{(2)}$ , the second one equals $C_{n}^{(3)}$ , and the last one equals $-\theta_{n}$ . After straightforward simplifications, we obtain the desired equality $-\widehat{C}_{n}^{(1)}+2\widehat{C}_{n}^{(2)}-C_{n}^{(3)}=\ \widebar{U}_{n}^{(2)}-\widebar{U}_{n}^{(3)}$ , and the proof of $\widetilde{\gamma}_{n,k}=\widecheck{\gamma}_{n,k}+R_{n,\phi}$ is over.

The proof of Proposition 3 is now based on the following two lemmas : Lemma 5 is proved in subsection 5.2.1, and Lemma 6 is the longest to establish, its proof will be split across subsections 5.2.2 to 5.2.5.

Lemma 5

If conditions $(\ref{Ordre1})$ and (2) hold with $\gamma_{k}<\gamma_{C}$ , then we have

[TABLE]

Lemma 6

If conditions $(\ref{Ordre1})$ and (2) hold with $\gamma_{k}<\gamma_{C}$ , then we have

[TABLE]

5.2.1 Proof of Lemma 5

$\bullet$ We start with the remainder term $R_{n,B}$ , which is defined as

[TABLE]

where $B_{i,n}=n\int_{0}^{Z_{i}^{-}}\log(1+(n\widebar{H}_{n}(x))^{-1})\,dH_{n}^{(0)}(x)\;-\;\int_{0}^{Z_{i}^{-}}\frac{dH_{n}^{(0)}}{\widebar{H}_{n}}$ . Since, for all $x\geq 0$ , $x-\frac{x^{2}}{2}\leq\log(1+x)\leq x$ , we obtain

[TABLE]

and then

[TABLE]

But

[TABLE]

so, if we define

[TABLE]

where

[TABLE]

then it remains to prove (thanks to part $(i)$ of Lemma 10) that $\sqrt{v_{n}}T_{n}^{(1)}=o_{\mathbb{P}}(1)$ and $\sqrt{v_{n}}T_{n}^{(2)}=o_{\mathbb{P}}(1)$ .

Concerning $T_{n}^{(1)}$ , since $\widebar{H}\geq\widebar{H}^{(0)}$ implies that $T_{i,n}^{(1)}\leq\frac{1}{n}\frac{\phi_{n}(Z_{i})}{\widebar{G}(Z_{i}^{-})\widebar{H}^{(0)}(Z_{i}^{-})}\mathbb{I}_{\xi_{i}=k}$ , then $\sqrt{v_{n}}T_{n}^{(1)}=o_{\mathbb{P}}(1)$ is a consequence of Lemma 11, used with $\alpha=0$ and $d=1$ .

Concerning $T_{n}^{(2)}$ , an integration by parts yields

[TABLE]

for any given $0<\alpha<\frac{1}{2}$ . Lemma 10 (applied with $a=1/2-\alpha<1/2$ ) and the fact $\widebar{H}^{(0)}\leq\widebar{H}$ thus imply that

[TABLE]

so that, by definition of $T_{i,n}^{(2)}$ , the desired statement $\sqrt{v_{n}}T_{n}^{(2)}=o_{\mathbb{P}}(1)$ is a consequence of Lemma 11, applied with $\alpha>0$ sufficiently small and $d=\frac{3}{2}$ , and of

[TABLE]

Indeed $\widebar{U}_{n}^{(1)}$ converges to $\gamma_{k}$ and $\widebar{H}_{n}^{(0)}(0)-\widebar{H}^{(0)}(0)$ equals $\frac{1}{n}\sum_{i=1}^{n}\mathbb{I}_{\delta_{i}=0}-\mathbb{P}(\delta=0)$ , which is $O_{\mathbb{P}}(n^{-1/2})$ by the standard central limit theorem.

$\bullet$ Let us now turn to the remainder term $R_{n,1}$ , which is defined as

[TABLE]

A simple calculation leads to

[TABLE]

for any $0<\alpha<\frac{1}{2}$ . Taking $\alpha$ sufficiently small, the rest of the proof is very similar to the one for $R_{n,B}$ (compare to $(\ref{inequRnB})$ ) and relies on Lemma 10 and Lemma 11.

$\bullet$ We can finally deal with the last remainder term $R_{n,\Delta}$ , defined as

[TABLE]

where $\Delta_{i,n}$ is a random quantity lying between $a_{n}:=\int_{0}^{Z_{i}^{-}}\widebar{H}^{-1}dH^{(0)}$ and $b_{n}:=\int_{0}^{Z_{i}^{-}}\log(1+(n\widebar{H}_{n}(x))^{-1})\,dH_{n}^{(0)}(x)$ . Since $e^{a}=1/\bar{G}(Z_{i}^{-})$ , we have

[TABLE]

Since $b_{n}-a_{n}=B_{i,n}+C_{i,n}$ , where $B_{i,n}<0$ and $C_{i,n}=\int_{0}^{Z_{i}^{-}}\frac{dH_{n}^{(0)}}{\widebar{H}_{n}}\;-\;\int_{0}^{Z_{i}^{-}}\frac{dH^{(0)}}{\widebar{H}}$ , we clearly have

[TABLE]

But $C_{i,n}=\hat{\Lambda}_{n,G}(Z_{i})-\Lambda_{G}(Z_{i})$ , where $\Lambda_{G}$ is the cumulative hazard function associated to $G$ , and $\hat{\Lambda}_{n,G}$ its Nelson-Alen estimator. Relying on Zhou (1991) Theorem 2.1, we can deduce that $\sup_{1\leq i\leq n}|C_{i,n}|=O_{\mathbb{P}}(1)$ . Hence, $e^{(\Delta_{i,n}-a)}=O_{\mathbb{P}}(1)$ .

Now,

[TABLE]

By writing

[TABLE]

we prove (using Lemma 10 and simple integrations as for the previous treatment of $T_{n}^{(2)}$ above) that $|C_{i,n}|\leq O_{\mathbb{P}}(1)\ 1/(\sqrt{n}(\widebar{H}(Z_{i}))^{1/2+\alpha})+|\widebar{H}_{n}^{(0)}(0)-\widebar{H}^{(0)}(0)|$ for $0<\alpha<1/2$ .

Hence, on one hand $(C_{i,n})^{2}\leq O_{\mathbb{P}}(1)(n(\widebar{H}(Z_{i}))^{1+2\alpha})^{-1}+O_{\mathbb{P}}(n^{-1})\leq O_{\mathbb{P}}(1)(n(\widebar{H}^{(0)}(Z_{i}))^{1+2\alpha})^{-1}+O_{\mathbb{P}}(n^{-1})$ , and on the other hand

[TABLE]

for any given $0<\alpha<1/2$ (where the $O_{\mathbb{P}}(n^{-1})$ comes from $|\widebar{H}_{n}^{(0)}(0)-\widebar{H}^{(0)}(0)|^{2}$ , which does not depend on $i$ ). Therefore, it is sufficient to prove that

[TABLE]

are $o_{\mathbb{P}}(1)$ , and that

[TABLE]

is $o_{\mathbb{P}}(1)$ as well. But the first two statements are consequences of Lemma 11 with $\alpha>0$ sufficiently close to [math] and, respectively, $d=1$ and $d=2$ . And for the third statement, the expectation of the expression turns out (thanks to Lemma 7 part $(ii)$ ) to be equivalent to a constant times $n^{-1/2}(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{1/2}$ , which tends to [math].

5.2.2 Preliminaries to the proof of Lemma 6

We start this section by introducing important objects, issued from an idea appearing (to the best of our knowledge) in Stute (1994). We define the improper variables $(V_{i})_{1\leq i\leq n}$ and $(W_{j})_{1\leq j\leq n}$ by

[TABLE]

which have $H^{(0)}$ and $H^{(1,k)}$ for respective subdistribution functions. We thus have $1-\delta_{i}=\mathbb{I}_{V_{i}<\infty}$ and $\mathbb{I}_{{\mathscr{C}}_{j}=k}=\mathbb{I}_{W_{j}<\infty}$ , which, according to the definitions of $C_{n}^{(1)}$ and $C_{n}^{(2)}$ on one hand, and of functions $h$ and $\underline{h}$ (in (28)) on the other hand, leads to

[TABLE]

and

[TABLE]

Since the latter triple sum is not convenient, we also define

[TABLE]

where $\widetilde{C}_{n}^{(1)}$ will be the quantity approximated by $\widehat{C}_{n}^{(1)}$ , and $\raisebox{7.68236pt}{$ \approx $}C_{n}^{(1)}$ will be a remainder. We can indeed rewrite (31) as

[TABLE]

The terms in parentheses in (34) and (35) turn out to be genuine U-statistics of 2 and 3 variables, denoted by

[TABLE]

where functions ${\cal H}$ and $\underline{\cal H}$ will be defined in a few lines (relation (37)) after some preliminaries, certainly well-known in the U-statistics literature, but which we include here to make our proof self-contained (and since we are dealing with improper variables).

If $V$ and $W$ denote independent improper random variables with subdistribution functions $H^{(0)}$ and $H^{(1,k)}$ (i.e. $V=Z\mathbb{I}_{\delta=0}+\infty\mathbb{I}_{\delta=1}$ and $W=Z^{\prime}\delta^{\prime}\mathbb{I}_{{\mathscr{C}}^{\prime}=k}+\infty(1-\delta^{\prime}+\mathbb{I}_{{\mathscr{C}}^{\prime}\neq k})$ where $(Z,\delta,{\mathscr{C}})$ and $(Z^{\prime},\delta^{\prime},{\mathscr{C}}^{\prime})$ are independent copies of $(Z_{1},\delta_{1},{\mathscr{C}}_{1})$ ), we introduce the following notations : for any function $g:[0,\infty]\times[0,\infty]\rightarrow\mathbb{R}$ ,

[TABLE]

as well as, for any function $g:[0,\infty[\times[0,\infty]\times[0,\infty]\rightarrow\mathbb{R}$ , with $Z$ (of distribution function $H$ ) independent of $V$ and $W$ ,

[TABLE]

Since $h(v,w)=0$ whenever $v$ or $w$ equals $\infty$ , we then have (the proof is simple)

[TABLE]

Therefore, setting (for $z$ in $[0,\infty[$ and $v$ and $w$ in $[0,\infty]$ )

[TABLE]

it is then not difficult to check (using (29) and (30)) that ${\cal U}_{n}$ and ${\cal V}_{n}$ in relation (36) are indeed equal to the differences in parentheses in relations (34) and (35), respectively. Lemma 6 thus becomes a consequence of the following facts : $\sqrt{v_{n}}{\cal U}_{n}=o_{\mathbb{P}}(1)$ , $\sqrt{v_{n}}{\cal V}_{n}=o_{\mathbb{P}}(1)$ , and

[TABLE]

We will prove these statements in the next 3 subsections.

5.2.3 Proof of $\sqrt{v_{n}}{\cal U}_{n}=o_{\mathbb{P}}(1)$

We note ${\cal I}=\{I=(i,j)\,;\,1\leq i<j\leq n\}$ , ${\cal H}_{I}={\cal H}(V_{i},W_{j})$ when $I=(i,j)\in{\cal I}$ , and $N=n(n-1)/2$ . It is clear that it suffices to prove that

[TABLE]

The good point is that $S_{N}$ turns out to be a sum of identically distributed centred and uncorrelated random variables ${\cal H}_{I}$ , but unfortunately these variables ${\cal H}_{I}$ are not square-integrable and potentially only have a moment of order slightly larger than $4/3$ when $\gamma_{k}<\gamma_{C}$ . In order to deal with this difficulty, since we cannot handle directly the $L^{p}$ norm of $S_{N}$ of order $p=4/3$ , we will follow a strategy similar to that found in Csorgo, Szyszkowicz and Wang (2008), based on truncation. We set

[TABLE]

The variables ${{\cal H}}^{*}_{I}\,=\,{{\cal H}}^{*}(V_{i},W_{j})$ ( $I\in{\cal I}$ ) are centred and bounded, but they lose the non-correlation property of the variables ${\cal H}_{I}$ . This is why we define now

[TABLE]

which are centred and bounded but are also uncorrelated (see part $(i)$ of Lemma 13), and we write

[TABLE]

We thus need to prove that $S_{N}^{(1)}$ and $S_{N}^{(2)}$ both converge to [math] in probability.

Concerning $S_{N}^{(1)}$ , since the ${{\cal H}}^{**}_{I}$ are centred and uncorrelated, we have

[TABLE]

where ${{\cal H}}_{1}$ was defined in (39) (the justification of the last inequality is postponed to part $(ii)$ of Lemma 13). Remind that ${{\cal H}}_{1}$ is not square-integrable and $M_{n}=n^{2}/\sqrt{v_{n}}=n^{3/2}/(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{1/2}$ , and introduce $m_{n}=n^{3/2}/(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{1/2-\epsilon}=o(M_{n})$ for some given small $\epsilon>0$ . We then write

[TABLE]

Thanks to Lemma 12 (parts $(i)$ and $(ii)$ ) and to the definition of $m_{n}$ , the term $A_{n}$ is bounded by a quantity which is equivalent (as $n\rightarrow\infty$ ) to $\frac{v_{n}}{n^{2}}m_{n}^{2/3}\left(\frac{v_{n}}{n}\right)^{-2/3}=(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{2\epsilon/3}=o(1)$ . We now rely on Hölder’s inequality for dealing with the term $B_{n}$ . Let $p>1$ and $q>1$ such that $1/p+1/q=1$ . Since $\theta_{n}=\mathbb{E}({{\cal H}}_{1})$ , again thanks to Lemma 12 ( $(i)$ , $(ii)$ and $(v)$ ), for $p$ sufficiently close to $1$ so that $4p/3<1+(1+2\gamma_{k}/\gamma_{C})^{-1}$ , we have

[TABLE]

which converges to [math] thanks to assumption (2), for $\epsilon>0$ small enough (we used part $(v)$ of Lemma 12 in the third upper bound).

We are thus left to prove that $S_{N}^{(2)}$ also converges to [math], but this time in $L^{1}$ . We start by writing that

[TABLE]

the last inequality being proved in the appendix (part $(iii)$ of Lemma 13). The follow-up is a bit similar to the treatment of $B_{n}$ above, relying on Lemma 12 (parts $(i)$ , $(ii)$ and $(v)$ ) and on Hölder’s inequality : for $p>1$ close to $1$ and a large $q$ such that $1/p+1/q=1$ , we can write

[TABLE]

which, for $\epsilon>0$ small enough, is $o(1)$ thanks to assumption (2).

5.2.4 Proof of $\sqrt{v_{n}}{\cal V}_{n}=o_{\mathbb{P}}(1)$

The proof is very similar to the one contained in the previous subsection. We nonetheless provide a few details to convince the reader of the validity of the result. We note now ${\cal I}=\{I=(i,j,l)\,;\,1\leq i<j<l\leq n\}$ and $\underline{\cal H}_{I}=\underline{\cal H}(Z_{l},V_{i},W_{j})$ when $I=(i,j,l)\in{\cal I}$ , with $N=n(n-1)(n-2)/6$ denoting the cardinal of the index set ${\cal I}$ . Since the observations $(Z_{i})_{i\leq n}$ are i.i.d., it should be clear to the reader that it suffices to prove that

[TABLE]

As previously, the problem lies with the moments of the centred and uncorrelated variables $\underline{\cal H}_{I}$ , and now we only have a guaranteed moment of order slightly more than $6/5$ instead of $4/3$ in the previous situation. Fortunately, the cardinal $N$ is now of order $n^{3}$ , which turns out to be the right compensation.

We thus define, for $(u,v,w)\in[0,\infty[\times[0,\infty]\times[0,\infty]$ ,

[TABLE]

as well as

[TABLE]

which are centred and bounded but are also uncorrelated (see part $(i)$ of Lemma 13 in the Appendix), and we write

[TABLE]

Introducing $m_{n}=M_{n}(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{\epsilon}$ and skipping details, we assess that

[TABLE]

and that this quantity converges to [math], as $n\rightarrow\infty$ , thanks to parts $(i)$ and $(iii)$ of Lemma 12. The same argument is used to prove that $\mathbb{E}(|S_{N}^{(2)}|)\stackrel{{\scriptstyle n\rightarrow\infty}}{{\longrightarrow}}0$ .

5.2.5 Proof of relation (38)

Let us first prove that, for some $d\in]4/5,1[$ , $\mathbb{E}(|\sqrt{v_{n}}\raisebox{7.68236pt}{$ \approx $}C_{n}^{(1)}|^{d})$ tends to 0, as $n$ tends to infinity. Recall that $\raisebox{7.68236pt}{$ \approx $}C_{n}^{(1)}=\frac{1}{n^{3}}\sum\sum_{i\neq j}h(V_{i},W_{j})/\widebar{H}(V_{i})$ . Since $d<1$ , we have

[TABLE]

According to part $(iv)$ of Lemma 12, the right-hand side of the inequality above is $O(1)\;v_{n}^{2-5d/2}$ , which tends to [math], since $d>4/5$ , and so we are done.

Let us now prove that $\mathbb{E}(|\sqrt{v_{n}}\widehat{C}_{n}^{(1)}/n|)$ tends to 0, as $n$ tends to infinity. $\widehat{C}_{n}^{(1)}$ is defined in $(\ref{def-Cchap1})$ , where the expectation of each of the four integrals is $\theta_{n}$ : therefore, we only need to prove that $\frac{\sqrt{v_{n}}}{n}\theta_{n}$ tends to [math]. This is straightforward using part $(v)$ of Lemma 12.

We can prove in a very similar way that $\mathbb{E}(|\sqrt{v_{n}}\widehat{C}_{n}^{(2)}/n|)$ tends to 0, as $n$ tends to infinity.

5.3 Proof of Proposition 1

Using the same notations as in the begining of Section 5, we have,

[TABLE]

The fact that $\frac{Z_{n}}{\sqrt{v_{n}}}\stackrel{{\scriptstyle\mathbb{P}}}{{\rightarrow}}0$ is due to the application of a triangular weak law of large numbers (see Chow and Teicher (1997) for example) to $\frac{1}{n}\sum\tilde{U}_{i,n}$ and to $\frac{1}{n}\sum\tilde{V}_{i,n}$ . By carrefully following the proof of proposition 3 in Section 5.2, we can see that $R_{n}=o_{\mathbb{P}}(1)$ . The condition $\gamma_{k}<\gamma_{C}$ is not used, neither in the treatment of $\frac{Z_{n}}{\sqrt{v_{n}}}$ nor in that of $R_{n}$ . Details are omited.

5.4 Proof of Corollary 1

The proof is very similar to the one of Theorem 2 in Worms and Worms (2016), with $\gamma_{k}$ and $\widebar{F}^{(k)}$ here replacing $\gamma_{1}$ and $\widebar{F}$ there. For completeness, we provide some details about it. Reminding the notations $d_{n}=\widebar{F}^{(k)}(t_{n})/p_{n}\rightarrow\infty$ and $\Delta_{n}=\frac{\widebar{F}_{n}^{(k)}(t_{n})}{\widebar{F}^{(k)}(t_{n})}$ , we easily write

[TABLE]

where $T_{n}^{1}:=d_{n}^{\widehat{\gamma}_{n,k}-\gamma_{k}}-1$ , $T_{n}^{2}:=\frac{t_{n}}{{x}^{(k)}_{p_{n}}}d_{n}^{\gamma_{k}}-1$ and $T_{n}^{3}:=1-\Delta_{n}^{-\widehat{\gamma}_{n,k}}$ . We are going to prove that both $T_{n}^{2}$ and $T_{n}^{3}$ are $o_{\mathbb{P}}(\log d_{n}/\sqrt{v_{n}})$ , and that $\frac{\sqrt{v_{n}}}{\log d_{n}}T_{n}^{1}\stackrel{{\scriptstyle\cal d}}{{\longrightarrow}}{\cal N}\left(\lambda m,\sigma^{2}\right)$ : this will conclude the proof, since both $\Delta_{n}$ (Corollary 2) and $\frac{t_{n}}{{x}^{(k)}_{p_{n}}}\ d_{n}^{\gamma_{k}}$ tend to $1$ .

Concerning $T_{n}^{1}$ , the mean value theorem yields

[TABLE]

where $|E_{n}|\leq|\widehat{\gamma}_{n,k}-\gamma_{k}|\log d_{n}$ and therefore $E_{n}$ tends to [math] in probability thanks to Theorem 1 and assumption $(\ref{conditiondn})$ . The desired result for $T_{n}^{1}$ is then implied by Theorem 1 again.

Concerning the fact that $T_{n}^{2}=o_{\mathbb{P}}(\log d_{n}/\sqrt{v_{n}})$ , the proof is completely similar to the evoked one in Worms and Worms (2016), so we omit it here (basically, this is based on some uniform regular variation implied by the assumed negativity of the second order parameter $\rho_{k}$ , and on the assumption that $\sqrt{v_{n}}g(t_{n})$ converges).

Finally, concerning $T_{n}^{3}$ we use the mean value theorem to write

[TABLE]

where $D_{n}$ lies between $\Delta_{n}$ and $1$ . But Corollary 2 (and the consistency of $\widehat{\gamma}_{n,k}$ ) implies that $\widehat{\gamma}_{n,k}D_{n}^{-\widehat{\gamma}_{n,k}-1}\stackrel{{\scriptstyle\mathbb{P}}}{{\longrightarrow}}\gamma_{k}$ on one hand, and $\sqrt{v_{n}}(\Delta_{n}-1)=O_{\mathbb{P}}(1)$ on the other hand ; therefore, $\frac{\sqrt{v_{n}}}{\log d_{n}}T_{n}^{3}=O(1/\log(d_{n}))=o(1)$ .

6 Appendix

This appendix contains various results : some of them are used repeatedly in the proof of the main result (in particular Proposition 4, Lemmas 7, and 10, and to a lesser extent Lemmas 9 and 8), the other ones concern parts of the main proof which are postponed to the appendix for better clarity of the main flow of the proof (Lemmas 11, 12 and 13).

Definition 1

An ultimately positive function $f$ : $\mathbb{R}^{+}\rightarrow\mathbb{R}$ is regularly varying (at infinity) with index $\alpha\in\mathbb{R}$ , if

[TABLE]

This is noted $f\in RV_{\alpha}$ . If $\alpha=0$ , $f$ is said to be slowly varying.

Proposition 4

*(See de Haan and Ferreira (2006) Proposition B.1.9)

Suppose $f\in RV_{\alpha}$ . If $x<1$ and $\epsilon>0$ , then there exists $t_{0}=t_{0}(\epsilon)$ such that for every $t\geq t_{0}$ ,*

[TABLE]

and if $x\geq 1$ ,

[TABLE]

Lemma 7

Let $x\in\mathbb{R}_{+}^{*}$ , $\alpha\in\mathbb{R}_{+}$ , $\beta>-1$ , and for $a$ and $b$ real numbers, $f$ and $g$ are two regular varying functions at infinity, with index, respectively, $a$ and $b$ . Then, as $t\rightarrow+\infty$ ,

$(i)$

$\displaystyle J_{\beta}(x)=\int_{1}^{+\infty}\log^{\beta}(y)\ y^{-x-1}dy=\frac{\Gamma(\beta+1)}{x^{\beta+1}}$ .

$(ii)$

$\displaystyle I_{\alpha,a,b}=\int_{1}^{+\infty}\log^{\alpha}(y)\ \frac{f(yt)}{f(t)}\ \frac{dg(yt)}{g(t)}\rightarrow\frac{b\Gamma(\alpha+1)}{(-a-b)^{\alpha+1}}$ , if $a+b<0$

$(iii)$

$\displaystyle J_{a,b}=\int_{0}^{1}\frac{f(yt)}{f(t)}\ \frac{dg(yt)}{g(t)}\rightarrow\frac{b}{a+b}$ , if $a+b>0$

Proof :

$(i)$

A simple change of variable and the definition of the $\Gamma$ function yields the result.

$(ii)$

For the sake of simplicity, we are going to treat the case $a<0$ and $b<0$ . The only difference for the other cases is the sign in front of the $\epsilon$ or $\epsilon^{\prime}$ appearing below (coming from the application of $(\ref{BornesPotter})$ several times), which can depend on the sign of $a$ , $b$ or another constant, but does not affect the result. Using Potter-bounds $(\ref{BornesPotter})$ for $f$ yields, for $n$ sufficiently large and $\epsilon>0$ ,

[TABLE]

Let us treat only the upper bound and the case $\alpha\neq 0$ (the other cases being similar). By integration by parts, with $a+b<0$ , we have

[TABLE]

Using Potter-bounds $(\ref{BornesPotter})$ for $g$ yields, for $n$ sufficiently large and $\epsilon^{\prime}>0$

[TABLE]

Doing the same with the lower bound and making $\epsilon$ and $\epsilon^{\prime}$ tend to [math], yields the result after simplifications.

$(iii)$

As in $(ii)$ , using Potter-bounds $(\ref{BornesPotter})$ for $f$ , integration by parts and then again $(\ref{BornesPotter})$ for $g$ yields the result.

Lemma 8

For any $\delta>0$ , let $C_{\delta}$ denote the function

[TABLE]

Under condition $(\ref{Ordre1})$ , this function is regularly varying of order $\delta/\gamma$ and we have $C_{\delta}(t)\sim(\gamma/\gamma_{C})/(\delta\widebar{H}^{\delta}(t))$ , as $t\rightarrow+\infty$ .

Proof : by writing $\widebar{H}^{\delta}(t)C_{\delta}(t)=-\int_{0}^{1}\frac{\widebar{H}^{\delta}(t)}{\widebar{H}^{\delta}(tu)}\frac{\widebar{G}(t)}{\widebar{G}(tu)}\frac{d\widebar{G}(tu)}{\widebar{G}(t)}$ , the lemma is an immediate consequence of part $(iii)$ of Lemma 7, with $a+b=(\delta/\gamma+1/\gamma_{C})+(-1/\gamma_{C})=\delta/\gamma>0$ and $-b/(a+b)=(\gamma/\gamma_{C})/\delta$ .

Remark 3

In the Lemma above, $C_{1}$ is the important function $C$ introduced at the beginning of Section 5, and thus $C(t)\sim(\gamma/\gamma_{C})/\widebar{H}(t)=(1-\gamma/\gamma_{F})/\widebar{H}(t)$ , as $t\rightarrow+\infty$ . Hence, $C$ is regularly varying at infinity with index $1/\gamma$ , a property which proves useful several times in the main proofs.

Lemma 9

Let $\psi(\phi_{n},u)=\int_{u}^{+\infty}\phi_{n}(s)dF^{(k)}(x)$ , for $u\geq 0$ and $\phi_{n}(u)=\frac{1}{\widebar{F}^{(k)}(t_{n})}\log(u/t_{n})\mathbb{I}_{u>t_{n}}$ . Under condition $(\ref{Ordre1})$ , we have

[TABLE]

where $\epsilon_{n}(u)$ is a sequence tending to [math] uniformly in $u$ , as $n\rightarrow\infty$ , and $\delta$ a positive real number such that $-\frac{1}{\gamma_{k}}+\delta<0$ .

Proof : We only consider the second situation where $u>t_{n}$ (the first one is straightforward) :

[TABLE]

An integration by part and the fact that $\widebar{F}^{(k)}$ is regularly varying at infinity with index $-1/\gamma_{k}$ , yields

[TABLE]

where

[TABLE]

Let $\delta$ be a positive real number. Then

[TABLE]

where the function $y\rightarrow y^{1/\gamma_{k}-\delta}\widebar{F}^{(k)}(y)$ is regularly varying with index $-\delta$ . Then since

[TABLE]

and, when $-\frac{1}{\gamma_{k}}+\delta<0$ , we have $\int_{\frac{u}{t_{n}}}^{+\infty}y^{-1/\gamma_{k}-1+\delta}dy=cst\left(u/t_{n}\right)^{-1/\gamma_{k}+\delta}$ , this concludes the proof.

Lemma 10

Recalling that $H$ is a distribution function with infinite right endpoint, we have :

$(i)$

$\sup_{0\leq x<Z^{(n)}}\widebar{H}(x)/\widebar{H}_{n}(x)=O_{\mathbb{P}}(1)$ **

$(ii)$

for any $a<1/2$ ,

[TABLE]

Proof : part $(i)$ is well known (see for instance section 3 of chapter 10 of Shorack and Wellner (1986)), while the two statements in $(ii)$ are proved by usual empirical processes techniques, showing that the family of functions $(f_{t})_{t<\infty}$ defined in one case by $f_{t}(z)=\mathbb{I}_{z>t}/(\widebar{H}(t))^{a}$ , and in the other case by $f_{t}(\delta,z)=(1-\delta)\mathbb{I}_{z>t}/(\widebar{H}^{(0)}(t))^{a}$ are Donsker whenever $a<1/2$ (using respective square integrable envelope functions $f^{*}(z)=1/(\widebar{H}(z))^{a}$ and $f^{*}(\delta,z)=(1-\delta)/(\widebar{H}^{(0)}(z))^{a}$ , which bound from above the functions $f_{t}$ uniformly in $t$ ) .

Lemma 11

Under conditions (1) and (2), suppose that $\alpha\geq 0$ and $d\geq 1$ are real numbers. If $\gamma_{k}<\gamma_{C}$ and

[TABLE]

then we have $\sum_{i=1}^{n}X_{i,n}\stackrel{{\scriptstyle\mathbb{P}}}{{\longrightarrow}}0$ , as $n$ tends to infinity, if $\alpha$ is [math] or sufficiently close to it.

Proof :

According to the LLN for triangular arrays, we need to prove the following three statements :

[TABLE]

But, $X_{i,n}$ being positive, $(iii)$ clearly implies $(ii)$ . We thus need to prove that $(i)$ and $(iii)$ hold.

Let us start with assertion $(i)$ . If $\epsilon>0$ is given, then

[TABLE]

Now, put $a=\frac{1}{\gamma_{C}}+\frac{d+\alpha}{\gamma}$ ( $>0$ ); since, for a given $\epsilon^{\prime}>0$ , there exists $c>0$ such that $\forall x\geq 1$ , $\log(x)\leq cx^{\epsilon^{\prime}}$ , and using Potter-bounds $(\ref{BornesPotter})$ for $\widebar{G}^{-1}(\widebar{H}^{(0)})^{-(d+\alpha)}\in RV_{-a}$ , we can write (using the definition of $v_{n}$ )

[TABLE]

where $w_{n}=\left(v_{n}^{1/2}n^{d}\left(\widebar{H}^{(0)}(t_{n})\right)^{d+\alpha}\right)^{1/(a+2\epsilon^{\prime})}$ and $c(\epsilon,\epsilon^{\prime})$ is a constant depending on $\epsilon$ and $\epsilon^{\prime}$ only. Consequently, if $w_{n}$ tends to infinity,

[TABLE]

where $\beta=\frac{1}{\gamma_{C}}+\frac{1}{\gamma_{k}}-\epsilon^{\prime}$ and the last inequality is due to Potter-bounds $(\ref{BornesPotter})$ applied to $\widebar{F}^{(k)}\widebar{G}\in RV_{-\frac{1}{\gamma_{C}}-\frac{1}{\gamma_{k}}}$ . Then, assertion $(i)$ above will be true as soon as we prove that $w_{n}\rightarrow\infty$ and $v_{n}w_{n}^{-\beta}\rightarrow 0$ , as $n\rightarrow\infty$ .

Since $\widebar{H}^{(0)}(t)$ is equivalent to a positive constant times $\widebar{H}(t)$ when $t\rightarrow+\infty$ , and $\widebar{H}(t_{n})\geq v_{v}/n$ , then $w_{n}^{a+2\epsilon^{\prime}}\geq cst\ (n^{-\eta}v_{n})^{r}$ , for $r=\frac{1}{2}+d+\alpha>0$ and $\eta=\frac{\alpha}{r}\geq 0$ . Assumption (2) finally yields that $w_{n}$ tends to $+\infty$ , since $0\leq\eta\leq\eta_{0}$ for $\alpha$ sufficiently close to [math].

Now, proving that $v_{n}w_{n}^{-\beta}$ tends to [math] is equivalent to proving that $v_{n}^{-(a+2\epsilon^{\prime})/\beta}v_{n}^{1/2}n^{d}\left(\widebar{H}^{(0)}(t_{n})\right)^{d+\alpha}$ tends to $+\infty$ . The same arguments as in the previous paragraph yield that it is sufficient to prove that $v_{n}^{A}n^{-\alpha}=\left(n^{-\eta}v_{n}\right)^{A}$ tends to $+\infty$ , for $A=-(a+2\epsilon^{\prime})/\beta+1/2+d+\alpha$ and $\eta=\frac{\alpha}{A}$ . This is a consequence of hypothesis $(\ref{condvntn})$ , since $A>0$ and $\alpha\leq\eta_{0}A$ , for $\alpha$ sufficiently close to [math]. This ends the proof of $(i)$ .

Let us now start the proof of assertion $(iii)$ . If $\epsilon>0$ is given, using Potter-Bounds (41) for $\widebar{G}^{-1}(\widebar{H}^{(0)})^{-(d+\alpha)}$ which belongs to $RV_{-a}$ , and introducing $h(x)=\log(x)x^{a-\epsilon}$ , we find that (for some positive constant $c$ )

[TABLE]

where we set $w_{n}=v_{n}^{1/2}n^{d}\left(\widebar{H}^{(0)}(t_{n})\right)^{d+\alpha}$ . Hence, denoting by $h^{-1}$ the inverse function of $h$ ,

[TABLE]

Consequently, using once again Potter-Bounds $(\ref{BornesPotter})$ and bounding the log with a constant times a power of $z/t_{n}$ , we get

[TABLE]

where $b=\frac{d+\alpha}{\gamma}$ and $\epsilon^{\prime}>0$ is some given positive value (the inequality $\log(s)\leq cst\,s^{\epsilon^{\prime}},\ \forall s\geq 1$ , was used). But, by integration by parts and $(\ref{BornesPotter})$ applied to $\widebar{F}^{(k)}$ , setting $h_{n}=h^{-1}(cw_{n})$ , we have

[TABLE]

Proceeding similarly as in the previous paragraphs, we find that $w_{n}/v_{n}\rightarrow\infty$ (and thus $w_{n}$ and $h_{n}$ as well) thanks to assumption (2), for $\alpha$ close to [math]. We are thus left to prove that $(v_{n}/w_{n})\times h_{n}^{b^{\prime}}$ tends to [math], where $b^{\prime}=b-1/\gamma_{k}+3\epsilon^{\prime}$ . If $b-1/\gamma_{k}$ is negative, this is immediate. We thus suppose that $b-1/\gamma_{k}\geq 0$ and, after some simple computations, we find out that $(v_{n}/w_{n})h_{n}^{b^{\prime}}$ tends to [math] if $v_{n}^{-a+\epsilon^{\prime}}w_{n}^{a-b^{\prime}-\epsilon^{\prime}}$ tends to $\infty$ , a property which holds true thanks to assumption (2), for $\alpha$ close to [math] (we omit the details).

Lemma 12

Suppose that $V_{1}$ and $W_{2}$ are independent improper random variables of respective subdistribution functions $H^{(0)}$ and $H^{(1,k)}$ , and $Z_{3}$ is independent of $V_{1}$ and $W_{2}$ and has distribution $H$ . Consider $h$ , $\underline{h}$ , ${\cal H}$ and $\underline{\cal H}$ the functions defined in (28) and (37).

$(i)$

For any $d\geq 1$ , there exist some positive constants $c$ and $c^{\prime}$ such that

[TABLE]

$(ii)$

For any $d\in]1,1+(1+2\gamma_{k}/\gamma_{C})^{-1}[$ , we have

[TABLE]

In particular, if $\gamma_{k}<\gamma_{C}$ , then $\mathbb{E}(h^{4/3}(V_{1},W_{2}))$ is of the order of $(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{-2/3}$ and $\mathbb{E}(h^{d}(V_{1},W_{2}))$ is finite whenever $d$ is (greater than but) sufficiently close to $4/3$ .

$(iii)$

For any $d\in]1,1+(1+3\gamma_{k}/\gamma_{C})^{-1}[$ , we have

[TABLE]

In particular, if $\gamma_{k}<\gamma_{C}$ , then $\mathbb{E}(\underline{h}^{6/5}(Z_{3},V_{1},W_{2}))$ is of the order of $(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{-3/5}$ and $\mathbb{E}(\underline{h}^{d}(Z_{3},V_{1},W_{2}))$ is finite whenever $d$ is (greater than but) sufficiently close to $6/5$ .

$(iv)$

For any $d\in]1/2,(2\gamma_{C}^{-1}+\gamma_{F}^{-1}+\gamma_{k}^{-1})/(3\gamma_{C}^{-1}+2\gamma_{F}^{-1})[$ , we have $\mathbb{E}\left(h^{d}(V_{1},W_{2})/\widebar{H}^{d}(V_{1})\right)=O\left((\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{2-3d}\right)$ . In particular, if $\gamma_{k}<\gamma_{C}$ then taking $\delta$ (greater than but) sufficiently close to $4/5$ is permitted, otherwise it is $2/3$ instead of $4/5$ .

$(v)$

The integral $\theta_{n}=\iint h(v,w)dH^{(0)}(v)dH^{(1,k)}(w)$ is equivalent, as $n\rightarrow\infty$ , to $\gamma_{k}(-\log\widebar{G}(t_{n}))$ .

Proof :

$(i)$

Let $d\geq 1$ , and remind that $h$ is a non-negative function. Using several times the inequality $|a+b|^{d}\leq 2^{d-1}(|a|^{d}+|b|^{d})$ , we can write

[TABLE]

But using the fact that the $L^{1}$ norm is bounded by the $L^{d}$ norm whenever $d\geq 1$ , we have $(\mathbb{E}(h(V_{1},W_{2})))^{d}\leq\mathbb{E}(h^{d}(V_{1},W_{2}))$ and it is quite simple to prove (by independency of $V_{1}$ and $W_{2}$ ) that it is also the case of $\mathbb{E}[(h_{1\bullet}(V_{1}))^{d}]=\mathbb{E}[(\mathbb{E}(h(V_{1},W_{2})|V_{1}))^{d}]\leq\mathbb{E}[\mathbb{E}(h^{d}(V_{1},W_{2})|V_{1})]=\mathbb{E}(h^{d}(V_{1},W_{2}))$ , as well as for $\mathbb{E}[(h_{{\bullet}1}(W_{2}))^{d}]$ . The inequality is thus proved. The other one (concerning $\underline{\cal H}$ and $\underline{h}$ ) is proved similarly.

$(ii)$

Let $d>1$ . Since $h(v,\infty)=h(\infty,w)=0$ ( $\forall v,w$ ), we have

[TABLE]

where the function $C_{d-1}$ was defined in the statement of Lemma 8. This lemma and Lemma 7, applied with $\alpha=d$ , $a=(d-1)/\gamma_{C}+(d-1)/\gamma$ and $b=-1/\gamma_{k}$ (the constraint specified on $d$ certifies that $a+b<0$ ), imply that the integral in the previous line converges to a constant. And Lemma 8 also implies that the ratio in front of this integral is equivalent, as $n\rightarrow\infty$ , to a positive constant times $\left(\widebar{H}(t_{n})\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n})\right)^{1-d}$ , which is itself lower than $\left(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n})\right)^{2(1-d)}$ , as desired.

$(iii)$

Let $d>1$ . By definition of $\underline{h}$ in (28), and proceeding as in the previous item, $\mathbb{E}(\underline{h}^{d}(Z_{3},V_{1},W_{2}))$ equals

[TABLE]

which is equivalent to $O\left((\widebar{H}(t_{n}))^{2-2d}(\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{1-d}\right)=O\left((\widebar{F}^{(k)}(t_{n})\widebar{G}(t_{n}))^{3(1-d)}\right)$ as soon as, thanks to Lemma 7, the sum $\left((d-1)/\gamma_{C}+(2d-2)/\gamma\right)-1/\gamma_{k}$ is negative, which turns out to be true whenever $d<1+(2+3\gamma_{k}/\gamma_{C})^{-1}$ , as specified.

$(iv)$

The proof is very similar to the previous ones, starting from

[TABLE]

so we omit the details.

$(v)$

Noting that $-\log\widebar{G}$ is slowly varying at infinity null at [math], we have

[TABLE]

which can be dealt with using part $(ii)$ of Lemma 7 with $\alpha=1$ , $a=0$ and $b=-1/\gamma_{k}$ : the obtained constant is indeed equal to $\gamma_{k}$ .

Lemma 13

In this Lemma, various notations defined in sections 5.2.2 to 5.2.4 are used.

$(i)$

The variables ${{\cal H}}^{**}_{I}$ for $I\in\{(i,j)\,;\,1\leq i<j\leq n\}$ are centred and uncorrelated . This is also true for the variables ${\underline{\cal H}}^{**}_{I}$ for $I\in\{(i,j,l)\,;\,1\leq i<j<l\leq n\}$ .

$(ii)$

We have $\mathbb{E}\left[({{\cal H}}^{**}(V_{1},W_{2}))^{2}\right]\leq 48\mathbb{E}[{{\cal H}}_{1}^{2}\mathbb{I}_{|{{\cal H}}_{1}|\leq M_{n}}]$ .

$(iii)$

We have $\mathbb{E}\left(\,|{{\cal H}}_{1}-{\cal H}^{*}(V_{1},W_{2})+{{\cal H}}^{*}_{1\bullet}(V_{1})+{{\cal H}}^{*}_{{\bullet}1}(W_{2})|\,\right)\leq 4\mathbb{E}\left(|{{\cal H}}_{1}|\mathbb{I}_{|{{\cal H}}_{1}|>M_{n}}\right)$

Proof :

$(i)$

Let us consider the first situation, where ${\cal I}=\{(i,j)\,;\,1\leq i<j\leq n\}$ . First, if $I=(i,j)\in{\cal I}$ , then $\mathbb{E}({{\cal H}}^{**}_{I})=0-\mathbb{E}({{\cal H}}^{*}_{1\bullet}(V_{i}))-\mathbb{E}({{\cal H}}^{*}_{{\bullet}1}(W_{j}))$ ; but, by definition of ${{\cal H}}^{*}_{1\bullet}$ and independency of $V_{i}$ and $W_{j}$ , we have $\mathbb{E}({{\cal H}}^{*}_{1\bullet}(V_{i}))=\mathbb{E}({{\cal H}}^{*}(V_{i},W_{j}))=0$ , and $\mathbb{E}({{\cal H}}^{*}_{{\bullet}1}(W_{j}))=0$ is obtained similarly, so we proved that $\mathbb{E}({{\cal H}}^{**}_{I})=0$ . Note that we can prove (with similar arguments) that ${{\cal H}}^{**}_{1\bullet}(v)={{\cal H}}^{**}_{{\bullet}1}(w)=0$ for every $v,w$ in $[0,\infty]$ , a property which is repeatedly used below . Let us now deal with the non-correlation of ${{\cal H}}^{**}_{I}$ and ${{\cal H}}^{**}_{I^{\prime}}$ , by considering the various cases where $I\neq I^{\prime}$ with $I=(i,j)$ and $I^{\prime}=(k,l)$ are in ${\cal I}$ .

If all four indices $i,j,k,l$ are distinct, then non-correlation of ${{\cal H}}^{**}_{I}$ and ${{\cal H}}^{**}_{I^{\prime}}$ is immediate by mutual independence of the variables $Z_{1},\ldots,Z_{n}$ .

If $i=k$ but $j\neq l$ , then $\mathbb{E}({{\cal H}}^{**}_{I}{{\cal H}}^{**}_{I^{\prime}})=\mathbb{E}(\psi(V_{i}))$ where $\psi(v)=\mathbb{E}({{\cal H}}^{**}(v,W_{j}){{\cal H}}^{**}(v,W_{l}))=({{\cal H}}^{**}_{1\bullet}(v))^{2}=0$ , by independence of $V_{i}$ with $(W_{j},W_{l})$ , and of $W_{j}$ and $W_{l}$ .

The case $i\neq k$ and $j=l$ is similar using ${{\cal H}}^{**}_{{\bullet}1}(\cdot)\equiv 0$ .

If $i=l$ but $j\neq k$ , then $\mathbb{E}({{\cal H}}^{**}_{I}{{\cal H}}^{**}_{I^{\prime}})=\mathbb{E}(\psi(V_{i},W_{i}))$ where $\psi(v,w)=\mathbb{E}({{\cal H}}^{**}(v,W_{j}){{\cal H}}^{**}(V_{k},w))={{\cal H}}^{**}_{1\bullet}(v){{\cal H}}^{**}_{{\bullet}1}(w)=0\times 0=0$ ; the case $j=k$ and $i\neq l$ is treated similarly.

Note that the case $i=l$ and $j=k$ (i.e. ${{\cal H}}^{**}_{I}={{\cal H}}^{**}(V_{i},W_{j})$ , ${\cal H}_{I^{\prime}}={{\cal H}}^{**}(V_{j},W_{i})$ ) is not permitted (it would lead to dependency) since we cannot have simultaneously $i<j$ and $j<i$ ; this is the reason why, in the beginning of section 5.2.3, we restricted the study of the sum ${\cal U}_{n}$ to that of the sum $S_{N}$ having terms ${\cal H}(V_{i},W_{j})$ satisfying $i<j$ .

The second situation, for ${\underline{\cal H}}^{**}_{I}$ and ${\underline{\cal H}}^{**}_{I^{\prime}}$ with $I\neq I^{\prime}$ in ${\cal I}=\{I=(i,j,l)\,;\,1\leq i<j<l\leq n\}$ , is a bit more tedious (with more cases to detail) but very similar, so we omit its proof.

$(ii)$

We start by the trivial bound

[TABLE]

Noting ${{\cal H}}_{1}^{-}={{\cal H}}_{1}\mathbb{I}_{|{{\cal H}}_{1}|\leq M_{n}}$ , we can write, on one hand, by definition of ${{\cal H}}^{*}$ , $\mathbb{E}[({{\cal H}}^{*}(V_{1},W_{2}))^{2}]\leq 2\left\{\mathbb{E}[({{\cal H}}_{1}^{-})^{2}]+(\mathbb{E}[{{\cal H}}_{1}^{-}])^{2}\right\}\leq 4\mathbb{E}[({{\cal H}}_{1}^{-})^{2}]$ . On the other hand, if $W$ is independent of $V_{1}$ , we have $\mathbb{E}[({{\cal H}}^{*}_{1\bullet}(V_{1}))^{2}]=\mathbb{E}[(\mathbb{E}[{{\cal H}}^{*}(V_{1},W)|V_{1}])^{2}]\leq\mathbb{E}[\mathbb{E}[({{\cal H}}^{*}(V_{1},W))^{2}|V_{1}]]=\mathbb{E}[({{\cal H}}^{*}(V_{1},W_{2}))^{2}]$ , which is the same term as the first one, and is thus lower than $4\mathbb{E}[({{\cal H}}_{1}^{-})^{2}]$ . The same is true of $\mathbb{E}[({{\cal H}}^{*}_{{\bullet}1}(W_{2}))^{2}]$ , so the desired inequality is proved.

$(iii)$

First recall that ${\cal H}_{1}$ denotes ${\cal H}(V_{1},W_{2})$ . Now, since ${\cal H}_{1}$ is centred and we trivially have ${\cal H}_{1}={\cal H}_{1}\mathbb{I}_{|{\cal H}_{1}|\leq M_{n}}+{\cal H}_{1}\mathbb{I}_{|{\cal H}_{1}|>M_{n}}$ , noting ${\cal H}_{1}^{+}={\cal H}_{1}\mathbb{I}_{|{\cal H}_{1}|>M_{n}}$ yields

[TABLE]

Secondly, using the fact that ${\cal H}_{1\bullet}(\cdot)\equiv 0$ (simple to prove), we can write

[TABLE]

where ${\cal H}_{1\bullet}^{+}(v)$ denotes $\mathbb{E}({\cal H}(v,W)\mathbb{I}_{|{\cal H}(v,W)|>M_{n}})$ and satisfies $\mathbb{E}({\cal H}_{1\bullet}^{+}(V_{1}))=\mathbb{E}({{\cal H}}_{1}^{+})$ , and similarly

[TABLE]

with ${\cal H}_{{\bullet}1}^{+}(w)=\mathbb{E}({\cal H}(V,w)\mathbb{I}_{|{\cal H}(V,w)|>M_{n}})$ and $\mathbb{E}({\cal H}_{{\bullet}1}^{+}(W_{2}))=\mathbb{E}({{\cal H}}_{1}^{+})$ . Summing these three terms finally leads to

[TABLE]

which is lower than $4\mathbb{E}(|{{\cal H}}_{1}^{+}|)$ , as announced.

References

Bibliography20

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Aalen and Johansen (1978) A. Aalen and S. Johansen . An empirical transition matrix for nonhomogeneous Markov chains based on censored observations. In Scand J Stat (5) pages 141-150 (1978)
2Beirlant et al. (2007) J. Beirlant, G. Dierckx, A. Guillou and A. Fils-Villetard . Estimation of the extreme value index and extreme quantiles under random censoring. In Extremes 10 , pages 151-174 (2007)
3Bingham, Goldie and Teugels (1987) N. H. Bingham, C.M. Goldie and I.L. Teugels. Regular variation. Cambridge University press (1987)
4Chow and Teicher (1997) Y.S. Chow and H. Teicher . Probability theory. Independence, interchangeability, martingales. Springer (1997)
5Crowder (2001) M. Crowder . Classical competing risks. Chapman and Hall, London (2001)
6Csorgo, Szyszkowicz and Wang (2008) M. Csorgo, B. Szyszkowicz and Q. Wang . Asymptotics of studentized U-type processes for change-point problems. In Acta Math. Hunga. 121 (4) , pages 333-357 (2008)
7de Haan and Ferreira (2006) L. de Haan and A. Ferreira . Extreme Value Theory : an Introduction. Springer Science + Business Media (2006)
8Einmahl et al. (2008) J. Einmahl, A. Fils-Villetard and A. Guillou . Statistics of extremes under random censoring. In Bernoulli 14 , pages 207-227 (2008)

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Taxonomy

1 Introduction

2 Assumptions and Statement of the results

Theorem 1

Remark 1

Proposition 1

Remark 2

Corollary 1

3 Simulations

4 Conclusion

5 Proofs

Lemma 1

Proposition 2

Proposition 3

Corollary 2

Lemma 2

5.1 Proof of Proposition 2

Lemma 3

Lemma 4

5.1.1 Proof of Lemma 3

5.1.2 Proof of Lemma 4

5.2 Proof of Proposition 3

Lemma 5

Lemma 6

5.2.1 Proof of Lemma 5

5.2.2 Preliminaries to the proof of Lemma 6

5.2.3 Proof of vnUn=oP(1)\sqrt{v_{n}}{\cal U}_{n}=o_{\mathbb{P}}(1)vn​​Un​=oP​(1)

5.2.4 Proof of vnVn=oP(1)\sqrt{v_{n}}{\cal V}_{n}=o_{\mathbb{P}}(1)vn​​Vn​=oP​(1)

5.2.5 Proof of relation (38)

5.3 Proof of Proposition 1

5.4 Proof of Corollary 1

6 Appendix

Definition 1

Proposition 4

Lemma 7

Lemma 8

Remark 3

Lemma 9

Lemma 10

Lemma 11

Lemma 12

Lemma 13

5.2.3 Proof of $\sqrt{v_{n}}{\cal U}_{n}=o_{\mathbb{P}}(1)$

5.2.4 Proof of $\sqrt{v_{n}}{\cal V}_{n}=o_{\mathbb{P}}(1)$