Bayesian nonparametric estimation of survival functions with multiple-samples information
Alan Riva Palacio, Fabrizio Leisen

TL;DR
This paper introduces a Bayesian nonparametric approach for estimating survival functions that accounts for dependence among multiple samples, extending previous models to higher dimensions with theoretical and practical validation.
Contribution
It develops a flexible, dependent vector of nonparametric priors for survival analysis, extending existing models to arbitrary dimensions with theoretical insights.
Findings
Model effectively captures dependence among groups
Theoretical results on posterior behavior are established
Performance validated on simulated Clayton copula data
Abstract
In many real problems, dependence structures more general than exchangeability are required. For instance, in some settings partial exchangeability is a more reasonable assumption. For this reason, vectors of dependent Bayesian nonparametric priors have recently gained popularity. They provide flexible models which are tractable from a computational and theoretical point of view. In this paper, we focus on their use for estimating survival functions with multiple-samples information. Our methodology allows to model the dependence among survival times of different groups of observations and extend previous work to an arbitrary dimension . Theoretical results about the posterior behaviour of the underlying dependent vector of completely random measures are provided. The performance of the model is tested on a simulated dataset arising from a distributional Clayton copula.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBayesian Methods and Mixture Models · Financial Risk and Volatility Modeling · Statistical Methods and Inference
Bayesian nonparametric estimation of survival functions with multiple-samples information
Alan Riva Palacio and Fabrizio Leisen
**School of Mathematics, Statistics and Actuarial Sciences, University of Kent
Sibson Building, Canterbury, Kent CT2 7FS**
Abstract
In many real problems, dependence structures more general than exchangeability are required. For instance, in some settings partial exchangeability is a more reasonable assumption. For this reason, vectors of dependent Bayesian nonparametric priors have recently gained popularity. They provide flexible models which are tractable from a computational and theoretical point of view. In this paper, we focus on their use for estimating multivariate survival functions. Our model extends the work of Epifani and Lijoi (2010) to an arbitrary dimension and allows to model the dependence among survival times of different groups of observations. Theoretical results about the posterior behaviour of the underlying dependent vector of completely random measures are provided. The performance of the model is tested on a simulated dataset arising from a distributional Clayton copula.
1 Introduction
Bayesian nonparametric modelling in survival analysis problems often relies on the assumption that the times observed are exchangeable, see for example [5] and [10]. Such assumption fails to hold when we consider events that are pooled from different dependent scenarios. For example, consider patients under the same treatment but in different hospitals. The survival times of patients from the same hospital could be assumed exchangeable. On the other hand, this is not a reasonable assumption when we consider patients from different hospitals since factors specific to each hospital might exert significant influence. In general, we can consider that the data is originated from different but related studies. Formally, we have sets of observations where the exchangeability assumption is assumed only within each set. In the above cases, it would be more appropriate to assume a form of dependence called partial exchangeability (see Section 2 for a formal account on exchangeability and partial exchangeability). This motivates the extension of Bayesian nonparametric models into a partially exchangeable setting where multiple-samples information could be used.
Applications of Bayesian nonparametrics in survival analysis go back, for example, to [5] and [8], who used non-decreasing independent increment processes to construct random survival functions. [6] and [16] focused on random hazard rates. More recently, [10] used a general class of random hazard rate-based models, and [19] used a general short-term and long-term hazard ratios model. There is an ongoing effort in Bayesian nonparametrics to propose flexible dependent random probability measures as set forth with the seminal work of [17]. In survival analysis, for example, [4] introduced a model based on a dependent Dirichlet process. In a partial exchangeable setting, survival analysis models have been used, for example, in [7] where a dependent two-dimensional extension of the neutral to the right (NTR) model was introduced and in [15] where a dependent vector of hazard rates was constructed. [9] introduced a new class of vectors of dependent completely random measures, called Compound Random Measures, where the dependence contribution is modelled with a parametric distribution.
In the seminal work of [5], the NTR model for survival functions was introduced. The NTR model can be expressed in terms of a Completely Random Measure (CRM) . This means that when is evaluated at pairwise disjoint sets it gives rise to mutually independent nonnegative random variables. We say that a positive random variable has a NTR distribution given by a CRM , denoted , if
[TABLE]
where is such that
[TABLE]
NTR distributions have several appealing properties, including the independence of normalized increments and posterior conjugacy for censored to the right data. An extension of the NTR model into a partially exchangeable setting was given by [7] for the dimensional case. In the present work, we follow the approach of [7] and focus on models based on a -dimensional * vector of completely random measures* (VCRM). More precisely, we consider collections of survival times such that, for ,
[TABLE]
with arbitrary . This model is convenient for modeling data where the dependence among the entries of the VCRM accounts for dependence among the multiple-samples in a partially exchangeable setting. Furthermore, marginally we recover the NTR model, namely
[TABLE]
with , . In (2) we want to model the dependence of the VCRM in a way that allows us to fix a marginal behavior so to exploit the fact that marginally we recover a NTR model; Lévy copulas are a natural framework to model the dependence structure of VCRM’s in such way.
In this paper we provide a posterior characterization for the above model, see Theorem 1. Similarly to [7] for 2-dimensional setting, we show that the posterior distribution corresponds to a survival function of the type as in (1) leading to a conjugacy property. Extensions of some results in [7] are also provided. We would like to stress that the derivation of such results are not trivial when considering an arbitrary dimension. In particular, Proposition 1 gives a general expression for the Laplace exponent when a Lévy copula is considered to set the dependence of the VCRM underlying the dimensional NTR model; Proposition 3 gives an alternative characterization of the multivariate NTR. Furthermore, other theoretical results are proved in order to facilitate the calculation of posterior means when the inferential exercise is implemented. Finally, we illustrate the methodology on a synthetic dataset.
The paper is organized as follows: Section 2 presents the preliminary notions which are needed in this work. In Section 3 we extend some results in [7] to the multivariate setting. In particular, we state the posterior characterization of the model and provide some useful corollaries for implementing the posterior inference. In Section 4, an application with synthetic data is illustrated. All the proofs can be found in the appendix.
2 Preliminaries
In this section, we provide some preliminaries about exchangeability, partial exchangeability and vectors of completely random measures which are the building blocks of our Bayesian nonparametric proposal. Furthermore, we will illustrate the concept of a positive Lévy copula which is useful to model the dependence structure between the components of a vector of completely random measures.
2.1 Exchangeability and Partial exchangeability
Let be a complete and separable metric space, with corresponding Borel -algebra
Definition 1**.**
A collection of random variables in is exchangeable if for any permutation of we have that
[TABLE]
As highlighted in the Introduction, in several problems the exchangeability assumption appears far too restrictive. In particular, we considered groups of observations where the order in which they are collected within each group is irrelevant. To describe this setting we resorted to the notion of partial exchangeability, as set forth by [3], that formalizes the idea of partitioning the entire set of observations into a certain number of classes, say d, in such a way that exchangeability may be reasonably assumed within each class. For ease of exposition, we confine ourselves to consider the case where d = 2.
Definition 2**.**
The collection of random vectors
[TABLE]
in is partially exchangeable if, for any and for all permutations and of and respectively, we have that
[TABLE]
2.2 Vectors of completely random measures
Given a complete and separable metric space , with corresponding Borel -algebra , we call a measure on boundedly finite if for any bounded set . A random measure is a measurable function from a probability space onto which is the measure space formed by , the space of boundedly finite measures on , and its corresponding Borel -algebra . In particular we will focus on the class of completely random measures as introduced in [12].
Definition 3**.**
A random measure on a complete and separable metric space with corresponding Borel -algebra is called a completely random measure (CRM) if for any collection of disjoint sets the random variables are mutually independent.
A CRM has the following representation [12],
[TABLE]
where is a deterministic measure, is a measure that consists on jumps with possibly random jump heights but fixed jump locations, and
[TABLE]
where for are random jump locations and are random jump heights. The measures , and are mutually independent. In particular, is again a CRM and is characterized by the following Laplace transform
[TABLE]
where and is a measure on such that
[TABLE]
for any bounded set . The measure is usually called the Lévy intensity of . In the remainder of this work we only consider CRM’s without fixed jump locations nor deterministic part so we take to be solely determined by (3). In particular we focus on Lévy intensities which are homogeneous, i.e.
[TABLE]
where is a non-atomic measure on referring to the jump locations and is a measure on referring to the jump heights. A popular example of an homogeneous CRM is the -stable process given by
[TABLE]
As an illustration, we plot in Figure 1 the associated process for the -stable process (4) with .
We extend this framework to the multivariate setting by considering vectors where each is a homogeneous CRM on with respective Lévy intensities . Moreover we take the intensity to be smooth in the sense that with a non-decreasing and differentiable function such that and ; this last conditions on the limit behaviour will enable us to get, marginally, the associated NTR cumulative distributions in our models. We have that for any in , with for any , the random vectors and are mutually independent; furthermore, one has a multivariate analogue of the Laplace transform (3)
[TABLE]
where and is a measure on . In particular, we introduce the notation for the multivariate Laplace transform
[TABLE]
Henceforth, is called the Laplace exponent of ; in the case at hand, where and is the usual inner product in . Marginalizing, we have that
[TABLE]
In Section 3, we use this particular kind of homogeneous and additive vector of CRM’s to construct priors for survival analysis models.
2.3 Positive Lévy copulas
Although in this work we consider vectors of CRM’s with fixed marginal behaviour, it remains to establish the dependence structure. [11] introduced the concept of positive Lévy copulas which allows to construct vectors of CRM’s with fixed marginals.
Definition 4**.**
A function is a positive Lèvy copula if
- (i)
such that we have that
[TABLE]
with
[TABLE] 2. (ii)
If is such that for some then . 3. (iii)
Let and
for then .
For example, a vector of independent CRM’s is obtained with
[TABLE]
A vector of completely dependent CRM’s, in the sense that the jumps of the stochastic vector are in a set such that whenever then either or for all , is obtained with
[TABLE]
An interesting example of positive Lévy copulas is the Clayton Lévy copula
[TABLE]
The parameter is positive and regulates the level of dependence. The above copulas are special cases of the Clayton Lévy copula, i.e.
[TABLE]
We define the tail integral of an univariate Lévy intensity to be . In the setting of Section we use a Lévy copula and the marginal tail integrals associated to to specify an absolutely continuous via
[TABLE]
Therefore, under suitable regularity conditions, we can recover the multivariate Lévy intensity from the copula and marginal intensities in the following way
[TABLE]
For example, consider the Clayton Lévy copula with -stable margins, given by (4), and . Figure 2 shows the dependence behaviour when a -dimensional Clayton Lévy copula with parameter and is employed; we plot the associated stochastic processes with similarly to Figure 1. As expected, when , at each jumping time, the processes have one jump weight big and one small since we are close to the independence case (where the processes almost surely share no jumping times). On the other hand, when is increased to , we can appreciate the higher dependence induced by a larger value of the copula parameter. We simulated the trajectories in Figure 2 by using Algorithm 6.15 in [2], where a full treatment of the dependence structure of Lévy intensities is also given. [13], [14] and [20] used a Lévy copula approach for building vectors of dependent completely random measures.
2.3.1 Working example
If we consider the Lévy intensity arising from (8) when considering the -dimensional Clayton Lévy copula, (7), with parameter and -stable marginals, (4), with parameters , we obtain
[TABLE]
Furthermore, if we take we obtain the simplified Lévy intensity
[TABLE]
Such intensity corresponds to a particular family of vectors of completely random measures known as Compound Random Measures (CoRM’s) and introduced in [9]; the previous Lévy intensity arises when taking in equation (4.4) of the aforementioned paper. A convenient feature of this Lévy intensity is that, as shown in Proposition 3.1 of [20], we can explicitly get the corresponding Laplace exponent
[TABLE]
where we take the appropriate limits when is such that for distinct . As indicated in the remark at the end of section 3, evaluation of the Laplace exponent is necessary for the explicit calculation of the posterior mean of the survival function given censored data.
3 Main results
Let , and suppose we have collections of random variables
[TABLE]
We characterize the probability distribution of these random variables in terms of a vector of CRM’s . For , let
[TABLE]
We observe that under such model the random variables (12) are partially exchangeable and marginally follow a process. The dependence structure in this model can be given through the Lévy copula associated to the CRM . This model extends the one in [7] to an arbitrary dimension .
The family of Clayton Lévy copulas is of interest because it has both the independence and complete dependence cases as limit behaviour. In the next result, we work towards finding expressions for the Laplace exponent associated to the Clayton family in such a way that the dependence structure is decoupled across dimensions. This result could be useful since, as we will see, an explicit calculation of is of key importance to implement the Bayesian inference in our survival analysis model.
Let be the Lévy intensity associated via (8) to the Clayton Lévy copula and fixed marginal Lévy intensities with corresponding Laplace transforms . We denote the vector of tail integrals corresponding to the marginal Lévy intensities as and fix the notation
[TABLE]
where , , , and is such that .
Proposition 1**.**
Suppose that and
[TABLE]
then
[TABLE]
where .
We refer to the Appendix A.1 for the proof. We incorporate the Lévy exponent in the multivariate survival analysis setting of (12), in the next result. We introduce the notation
[TABLE]
for and distinct ; and denote for the respective Laplace exponents.
Proposition 2**.**
In the context of (12), let . For and such that then
[TABLE]
We refer to the Appendix A.2 for the proof. This result showcases the importance of the Laplace exponent for calculating probabilities in the model and the impact of the function , related to the time depending part of the Laplace exponent, in the survival function. In Section 4, we will show that the availability of the Laplace exponent is also of main importance to implement the Bayesian inference for the model. The model we are working on generalizes to arbitrary dimension the classic model of [5]. We present a multivariate extension of Theorem 3.1 in [5], which relates our model with the notion of neutrality to the right. Let be a -variate random distribution function on and, for a -variate vector of CRM’s , denote with . Then, we have the following multivariate extension to Theorem 3.1 in [5] and Proposition 4 in [7].
Proposition 3**.**
* has the same distribution as*
[TABLE]
for some -variate CRM if and only if for and vectors with and , there exists independent random vectors such that
[TABLE]
where with and .
We refer to the Appendix A.3 for the proof. We now establish some notation in order to address the posterior distribution arising from (12) when some survival data is available. Let , , be groups of observations that come from the distribution given by
[TABLE]
where and the event corresponds to the event . Let be their respective censoring times; therefore, the set of censored data is the following
[TABLE]
where and . The number of exact observations is and the number of censored observations is . Taking into account the possible repetition of values among the observations, we consider the order statistics of the distinct observations where is the number of distinct observed times among all groups.
Let define the set functions
[TABLE]
for , which denote the number of, respectively, exact and censored marginal observations in , with respect to group . We define , , for and , , for ; and the corresponding vectors , , for and , .
The next theorem determines the calculation of the posterior distribution for a vector of CRM’s given some censored data and it applies to general vectors of CRM’s. In particular, the assumption that the respective Lèvy intensity is homogeneous has been dropped.
Theorem 1**.**
Let be a -variate CRM such that its corresponding Lèvy intensity is differentiable with respect to on in the sense that for the partial derivative \eta^{\prime}_{t_{0}}(\boldsymbol{s})=\partial{\left.\kern-1.2pt\eta_{t}(\boldsymbol{s})/\partial t\vphantom{\big{|}}\right|_{t=t_{0}}} exists. Moreover we assume that the entries of are not independent. Then the posterior distribution of given data is the distribution of the random measure
[TABLE]
where
- i)
* is a -variate CRM with Lévy intensity such that*
[TABLE]
for . 2. ii)
The vectors of jumps , with , are mutually independent and the vector of jumps corresponding to the exact observation has density
[TABLE] 3. iii)
*The random measure is independent of , with *
.
We refer to the Appendix A.4 for the proof. The previous result showcases that the posterior distribution arising from (12) can be modeled in the same framework via a vector of CRM’s by updating the prior vector of CRM’s to as above.
This result is enough to provide a scheme for posterior inference. In particular, in the setting of (12) and Theorem 1, we want to estimate the corresponding survival function when multiple samples information is available.
A natural approach in Bayesian nonparametrics is to marginalize over the infinite dimensional random element which characterizes the probability model. In our case, given censored data , we calculate the mean of the survival function given the data by marginalizing over the vector of CRM’s . As a result of Theorem 1, we can calculate such quantity. The next results allow us to implement the necessary inferential scheme for performing the estimation of the survival function as a posterior mean. We denote for the canonical basis of , and for , . In view of the independent increments of the CRM’s, calculation of the posterior mean of is all that is needed for the evaluation of the posterior mean of . The next corollary shows how to evaluate the posterior mean of .
Corollary 1**.**
Let be a vector of CRM’s with corresponding Lèvy intensity such that with a differentiable function satisfying for . Moreover we assume that the entries of are not independent. Let and set
[TABLE]
where . Then,
[TABLE]
where and for
[TABLE]
We see that we can estimate for arbitrary in terms of the estimates defined in the previous corollary. Indeed, let and be a permutation of such that . We define, for , the following sets
[TABLE]
From the independence of increments of CRM’s, it follows that the posterior mean of the survival function given censored data is
[TABLE]
Usually, we deal with Lévy intensities which exhibit some dependences in a vector of hyper-parameters . On the proof of Theorem 1, it is outlined how, given censored data as before, we could derive the likelihood of the hyper-parameters in the Lévy intensity. This likelihood is necessary for implementing the inferential procedure and it is displayed in the next corollary.
Corollary 2**.**
Let be a vector of CRM’s with corresponding Lèvy intensity such that with a differentiable function satisfying for , and a vector of hyper-parameters. Given censored data we get the likelihood on .
[TABLE]
where is the Laplace exponent associated to .
The next lemma provides a useful identity for the computation of the integrals in Corollary 1 and Corollary 2.
Lemma 1**.**
For and
[TABLE]
We omit the proof as it is just an application of the binomial theorem in the same line as the proof of Lemma 3 in the appendix.
Remark**.**
The previous results highlights that the implementation of the inferential procedure depends on whether we can perform evaluations of the Laplace exponent or not.
4 Applications
In this section we perform the fitting of a multivariate survival function given censored to the right data in the framework of (12). As mentioned in the previous remark, the evaluation of the Laplace exponent of in (12) is necessary to evaluate the posterior mean in Corollary 1 and the likelihood in Corollary 2; with this in mind, we choose the random measure given by the Lévy intensity showcased in (9), so that the corresponding Laplace exponent is readily given by (10). For illustration purposes, we use 4-dimensional data arising from a distributional copula with fixed marginal distributions, see [18] for an overview of distributional copulas. More precisely, we generate simulated data with probability distribution given by a distributional Clayton copula with parameter and exponential marginals with parameter . Then, we perform right-censoring by considering censoring time variables consisting of independent exponential random variables with parameter , and define
[TABLE]
For fitting the data, we use the 4-dimensional Lévy intensity given by (9) and assign priors for the hyper-parameters in (9), and . We choose a log-normal prior for the parameter A and a Beta prior for the parameter . We use the Metropolis within Gibbs algorithm to draw samples from the posterior distributions of and by making use of the likelihood showed in Corollary 2. We present a Monte Carlo approximation of the estimator (16), where we have averaged over the posterior draws of and . A more in depth description of the simulation algorithm is given in Appendix A.5. In Figures 3 and 4 we show the fit for 150 possibly right censored observations as in (17). The simulated synthetic observations are such that
[TABLE]
We chose so we have at least of exact observations for in each dimension. The construction of through a distributional Clayton allows us to calculate explicitly the associated survival function as showcased in Appendix A.6. We use the true survival function for comparison with the fitted survival functions. The estimated survival function are given by the posterior mean
[TABLE]
as in (16). The prior distributions of the hyperparameters are
[TABLE]
We ran iterations for the associated Metropolis within Gibbs sampler. Figure 3 and Figure 4 show that the estimated survival functions approximate well the true functions. For comparison purposes, we presented a Kaplan-Meier estimator for the true survival function, see for example [1]. As there is no multivariate Kaplan-Meier, we use the next estimator for a multivariate survival function:
[TABLE]
where each estimator is treated as a univariate Kaplan-Meier estimator restricted to the corresponding set of observations. In Figure 3 and Figure 4, we could appreciate that in the last subplots of each column the Kaplan-Meier can fit poorly as there are less observations on the conditioned Kaplan-Meier functions, as presented in the formula above.
Appendix
A.1 Proof of Proposition 1
Given , we use the notation and for . Furthermore we define integrals
[TABLE]
and
[TABLE]
where , and such that ; we also define , and denote for the vector without its -th entry.
An integration by parts shows that
[TABLE]
and in general for we get the recursion formula
[TABLE]
We prove the next technical lemma
Lemma 2**.**
If then the next identities hold
[TABLE]
Proof.
We proceed by mathematical induction over the dimension . We observe that from the definition of we always have
[TABLE]
For the case we have from Proposition 1 in [7] that
[TABLE]
And integrating by parts we obtain
[TABLE]
Therefore, we get the validity of the equations in (A.19) for the case . Now, suppose that (A.19) is true for , we must show the validity for . From the recursion formula (A.18) we get for
[TABLE]
The validity of (A.19) for follows from the validity for and a combinatorial argument. ∎
Proposition 1 follows by considering the first equation in the Lemma statement and the definition of .
A.2 Proof of Proposition 2
Proof.
Using the independent increments property of CRM’s we get that
[TABLE]
∎
A.3 Proof of Proposition 3
For notation purposes, in this proof we use the shorthand for a measure and positive real number .
Proof.
For the only if part we define for and so by supposing we have
[TABLE]
We observe that for and
[TABLE]
So for
[TABLE]
Concluding the only if part.
For the if part we define for and suppose for , with and the existence of independent random vectors such that we have (15).
Marginalizing in , we can apply Theorem 3.1 of [5] to each so we obtain that for some CRM that is stochastically continuous, almost surely non-decreasing and has the appropriate limit behaviour.
We observe that
[TABLE]
Hence defines a vector of CRM’s. ∎
A.4 Proof of Theorem 1
This proof is not only restricted to the homogeneous Lévy intensity case; in this general setting, we recall that the Laplace exponent has the form (6). In order to prove the theorem we use the next technical lemma.
Lemma 3**.**
Let be a -variate CRM such that are not independent and let the Lévy intensity of be such that is differentiable with respect to at some and denote \eta^{\prime}_{t_{0}}(\boldsymbol{s})=\partial{\left.\kern-1.2pt\eta_{t}(\boldsymbol{s})/\partial t\vphantom{\big{|}}\right|_{t=t_{0}}}. If are such that and are such that , then
[TABLE]
as , with for some .
Proof.
We denote for a function where and . We use the binomial theorem and apply expectation to write the left hand side in the equation above as
[TABLE]
We note that for , , , a Taylor expansion yields
[TABLE]
Furthermore by the binomial theorem we get the next identities
- (1)
- (2)
- ⋮
- (d-1)
\displaystyle\sum_{\stackrel{{\scriptstyle i_{1},\dots,i_{d-1}\in\{1,\dots,d\}}}{{i_{1}<\cdots<i_{d-1}}}}\sum_{j_{1}=1}^{q_{i_{1}}}\cdots\sum_{j_{d-1}=1}^{q_{i_{d-1}}}\binom{q_{i_{1}}}{j_{1}}\cdots\binom{q_{i_{d-1}}}{j_{d-1}}(-1)^{j_{1}+\cdots+j_{d-1}}(1-\mathbf{e}^{-j_{1}s_{i_{1}}-\cdots-j_{d-1}s_{i_{d-1}}})\\ =\sum_{\stackrel{{\scriptstyle i_{1},\dots,i_{d-1}\in\{1,\dots,d\}}}{{i_{1}<\cdots<i_{d-1}}}}\Bigg{\{}(-1)^{d-1}\sum_{j=1}^{d-1}(1-\mathbf{e}^{-s_{i_{j}}})^{q_{i_{j}}}+\\ (-1)^{d-2}\sum_{\stackrel{{\scriptstyle j_{1},j_{2}\in\{i_{1},\dots,i_{d-1}\}}}{{j_{1}<j_{2}}}}(1-\mathbf{e}^{-s_{j_{1}}})^{q_{j_{1}}}(1-\mathbf{e}^{-s_{j_{2}}})^{q_{j_{2}}}+\cdots-(1-\mathbf{e}^{-s_{i_{1}}})^{q_{i_{1}}}\cdots(1-\mathbf{e}^{-s_{i_{d-1}}})^{q_{i_{d-1}}}\Bigg{\}}\\
- (d)
So we have that (D.20) becomes
[TABLE]
[TABLE]
∎
Define
[TABLE]
so that
[TABLE]
We observe that defining , for and selecting sufficiently small such that for all
[TABLE]
So defining
[TABLE]
We get from the independence property of CRM’s that
[TABLE]
We observe that for , we have that and for such that is an exact observation we have that so lemma 2 can be applied yielding
[TABLE]
On the other hand, for we have so by the continuity of in we have
[TABLE]
From (D.23), (D.24) and the independence property of CRM’s we obtain
[TABLE]
Also by continuity and independence, defining , we get
[TABLE]
So by (D.22), (D.24) and (D.23) we get that
[TABLE]
And similarly
[TABLE]
We set so we conclude
[TABLE]
A.5 Simulation Algorithm
We use a Metropolis within Gibbs sampler to draw simulations from and as in Section 4. We recall that Corollary 2 gives the likelihood and we denote , for the prior distributions of and as in Section 4. Given initial values , , the algorithm is as follows
- (1)
Draw from a Metropolis-Hastings sampler with proposal distribution and target distribution
[TABLE] 2. (2)
Draw from a Metropolis-Hastings sampler with Uniform proposal distribution and target distribution
[TABLE]
For the fits in Section 4 we used 100 iterations for each inner Metropolis-Hasting sampler and 1000 iterations for the overall Gibbs sampler.
A.6 Survival function of .
Let be a -dimensional distributional Clayton copula and , , a collection of marginal cumulative distribution functions; then the survival function associated to the Clayton distributional copula and marginals is given by
[TABLE]
see Section 2.6 in [18].
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Aalen et al. [2008] Aalen, O., Borgan, O., and Gjessing, H. (2008). Survival and event history analysis: a process point of view. Springer Science & Business Media.
- 2Cont and Tankov [2004] Cont, R. and Tankov, P. (2004). Financial modelling with jump processes . Chapman & Hall.
- 3de Finetti [1938] De Finetti, B. (1938). ’Sur la condition de ”equivalence partielle”’, Colloque consacré à la théorie des probabilités, Vol. VI, Université de Genève, Hermann et C. ie, Paris.
- 4De Iorio et al. [2009] De Iorio, M., Johnson, W. O., Müller, P., & Rosner, G. L. (2009). Bayesian nonparametric nonproportional hazards survival modeling. Biometrics , 65 , 762-771.
- 5Doksum [1974] Doksum, K. (1974). Tailfree and neutral random probabilities and their posterior distributions. The Annals of Probability , 2 , 183-201.
- 6Dykstra and Laud [1981] Dykstra, R. L. and Laud, P. (1981). A Bayesian nonparametric approach to reliability. The Annals of Statistics , 9 , 356-367.
- 7Epifani and Lijoi [2010] Epifani, I. and Lijoi. A. (2010). Nonparametric priors for vectors of survival functions. Statistica Sinica , 20 , 1455–1484.
- 8Ferguson and Phadia [1979] Ferguson, T. S., and Phadia, E. G. (1979). Bayesian nonparametric estimation based on censored data. The Annals of Statistics , 7 , 163-186.
