Dependence Properties of B-Spline Copulas
Xiaoling Dou, Satoshi Kuriki, Gwo Dong Lin, Donald Richards

TL;DR
This paper introduces a new class of copulas constructed with B-spline functions, extending classical Bernstein copulas, and explores their correlation properties, bounds, and moments with explicit formulas.
Contribution
It develops B-spline copulas, generalizing Bernstein copulas, and proves their correlation bounds and total positivity properties, including explicit moment formulas using Stirling numbers.
Findings
Range of correlation is characterized.
Frechet--Hoeffding upper bound is attained with infinite B-spline functions.
Explicit moments are derived using Stirling numbers.
Abstract
We construct by using B-spline functions a class of copulas that includes the Bernstein copulas arising in Baker's distributions. The range of correlation of the B-spline copulas is examined, and the Frechet--Hoeffding upper bound is proved to be attained when the number of B-spline functions goes to infinity. As the B-spline functions are well-known to be an order-complete weak Tchebycheff system from which the property of total positivity of any order follows for the maximum correlation case, the results given here extend classical results for the Bernstein copulas. In addition, we derive in terms of the Stirling numbers of the second kind an explicit formula for the moments of the related B-spline functions for nonnegative real numbers.
| Bernstein∗ | |||||
| 0.333 | 0.75 | 0.333 | NA | NA | |
| 0.5 | 0.889 | 0.667 | 0.5∗ | NA | |
| 0.6 | 0.938 | 0.827 | 0.688 | 0.6∗ | |
| 0.667 | 0.96 | 0.896 | 0.796 | 0.72∗∗ | |
| 0.714 | 0.972 | 0.931 | 0.867 | 0.796 | |
| 0.75 | 0.980 | 0.951 | 0.908 | 0.851 | |
| 0.778 | 0.984 | 0.963 | 0.933 | 0.892 | |
| 0.8 | 0.988 | 0.971 | 0.949 | 0.919 | |
| 0.818 | 0.99 | 0.977 | 0.960 | 0.937 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStochastic processes and financial applications · Financial Risk and Volatility Modeling · Mathematical functions and polynomials
∎
11institutetext: X. Dou 22institutetext: Waseda University, 3-4-1 Ohkubo, Shinjuku, Tokyo 169-8555, Japan
Tel.: +81-3-5286-2358
22email: [email protected] 33institutetext: S. Kuriki 44institutetext: The Institute of Statistical Mathematics, 10-3 Midoricho, Tachikawa, Tokyo 190-8562, Japan 55institutetext: G. D. Lin 66institutetext: Institute of Statistical Science, Academia Sinica, Taipei 11529, Taiwan, R.O.C. 77institutetext: D. Richards 88institutetext: Department of Statistics, Pennsylvania State University, University Park, PA 16802, U.S.A.
Dependence Properties of B-Spline Copulas
††thanks: This work was supported by JSPS KAKENHI Grants, Numbers 16K00060 and 16H02792.
Dependence properties of B-spline copulas
Xiaoling Dou
Satoshi Kuriki
Gwo Dong Lin
Donald Richards
(Received: date / Accepted: date)
Abstract
We construct by using B-spline functions a class of copulas that includes the Bernstein copulas arising in Baker’s distributions. The range of correlation of the B-spline copulas is examined, and the Fréchet–Hoeffding upper bound is proved to be attained when the number of B-spline functions goes to infinity. As the B-spline functions are well-known to be an order-complete weak Tchebycheff system from which the property of total positivity of any order (TP∞) follows for the maximum correlation case, the results given here extend classical results for the Bernstein copulas. In addition, we derive in terms of the Stirling numbers of the second kind an explicit formula for the moments of the related B-spline functions on .
Keywords:
Bernstein copula Fréchet–Hoeffding upper bound Order-complete weak Tchebycheff system Schur function Stirling number of the second kind Total positivity of order
1 Introduction: A review of the Bernstein copulas
A novel method that applied the theory of order statistics to construct multivariate distributions with given marginal distributions was developed by Baker Baker08 . We refer to Lin, et al. Lin-etal14 for a recent survey of this topic.
Baker’s idea, applied to univariate cumulative distribution functions and , can be described as follows: Let and be independent random samples from the distributions and , respectively. Let be the th smallest order statistic of , and denote by the distribution of ; we write this as . Similarly, we denote by the th smallest order statistic of and we let be its corresponding distribution, written . (Note that and can be discrete distributions.)
Let be a parameter matrix whose matrix entries satisfy the conditions
[TABLE]
Now choose the pair with probability , . Then follows Baker’s bivariate distribution: For , the joint cumulative distribution function satisfies
[TABLE]
where “” denotes transpose. It is immediately evident that has marginal distributions and .
Let be the distribution function of the th smallest order statistic of a random sample of size from the uniform distribution on . It is well-known that
[TABLE]
where
[TABLE]
. Furthermore, equals the composition (see, e.g., Hwang and Lin Hwang-Lin84 ) and Baker’s bivariate distribution (1.2) can be rewritten as
[TABLE]
, where, for ,
[TABLE]
is a copula function with parameter matrix satisfying (1.1). Conversely, if the marginals and are equal to then Baker’s bivariate distribution (1.2) reduces to the copula (1.4).
The copula in (1.4) is called the Bernstein copula with parameter matrix because is a Bernstein polynomial (see Dou, et al. Dou-etal14 ). By differentiating (1.4) with respect to and , we obtain the Bernstein copula density:
[TABLE]
.
Within the parameter space (1.1) of , the maximum correlation is attained when , i.e., :
[TABLE]
with corresponding density
[TABLE]
.
Dou, et al. Dou-etal13 proved that the maximum correlation copula and its density both are totally positive of order (TP2) in Karlin-Studden66 ; Karlin68 ; Pinkus10 . One of the main purposes of the present paper is to show further that both and are TP*∞, i.e., TPr* for all .
In Section 2, we introduce first the general order-complete weak Tchebycheff (OCWT) systems and then a class of copulas, based on B-spline functions, that includes the Bernstein copulas in (1.4). The maximum correlation copula and its total positivity properties are investigated in Sections 3 and 4, respectively. Finally, in Section 5 we calculate the moments of the related B-spline functions on and make the connection with the Stirling numbers of the second kind.
2 B-spline copulas
We consider first a general setting based on OCWT systems, and then we define a class of B-spline copulas that includes the Bernstein copulas as special cases. In other words, we will show that a larger class of candidate copulas can play the same roles as and while still retaining the desired properties of the copula .
Let , , , and let be probability densities on such that
[TABLE]
. We assume further that is an order-complete weak Tchebycheff system (OCWT-system), i.e.,
- (i)
are linearly independent, and
- (ii)
is totally positive of order (TPn) in , i.e., for each ,
[TABLE]
for all and .
See Karlin and Studden (Karlin-Studden66, , Chapter 1) or Schumaker (Schumaker07, , Chapter 2) for examples of OCWT systems.
Let , , such that . Also, let , , such that . Letting
[TABLE]
, we define the B-spline copula, a generalization of the Bernstein copula (1.4), by
[TABLE]
, with parameter matrix
[TABLE]
The copula (2.3) is a bona fide copula since, for any ,
[TABLE]
and similarly, , .
Throughout the paper, we restrict our attention to the case in which and ; further, we use the notation for the diagonal matrix with diagonal entries .
Theorem 2.1
For the copula (2.3) with the parameter space (2.4), the maximum correlation is attained when , equivalently, .
In the maximum correlation case, becomes
[TABLE]
.
To prove Theorem 2.1, we need the following crucial lemma. This result is a generalization of the weak majorization inequality on the closed simplicial cone
[TABLE]
for doubly stochastic matrices (Marshall-Olkin-Arnold11, , p. 639).
Lemma 1
Let and be given. Let satisfy . Then,
[TABLE]
Proof. Let and . Define
[TABLE]
For given , it is straightforward to see that is a doubly stochastic matrix. Hence, by the famous characterization of majorization due to Hardy, Littlewood, and Pólya HLW29 , the vector is majorized by , denoted .
We now rearrange the components of in decreasing order, listing them as , and let . Then we have also and hence because (see (Marshall-Olkin-Arnold11, , p. 133)). On the other hand, note that because . These results together imply that .
Consequently, , which we can write alternatively as
[TABLE]
Equivalently, by cancelling the common term on both sides above. This is exactly (2.6), so the proof now is complete. ∎
Proof of Theorem 2.1. Since is an OCWT-system then, for all and ,
[TABLE]
Integrating this inequality with respect to over and , we obtain
[TABLE]
. Therefore, we obtain the stochastic order,
[TABLE]
for all . Combining this result with Lemma 1, we obtain the inequality
[TABLE]
for all and satisfying (2.4). The theorem now follows from Hoeffding’s covariance formula,
[TABLE]
(see, e.g., Lin-etal14 ). The proof is complete. ∎
Functions satisfying (2.1) and (2.2) can be constructed by B-spline functions as we now show. Let be a B-spline function on of degree defined as a non-zero B-spline basis with knots:
[TABLE]
Then, is generated by the recursion formula,
[TABLE]
, for , with initial conditions
[TABLE]
(see deBoor72 ; deBoor01 ; Nurnberger89 ). The number of non-zero bases is
[TABLE]
The B-spline is known to satisfy
- (i)
,
- (ii)
The support is given by
[TABLE]
, and
- (iii)
The “partition of unity” property:
[TABLE]
For given and , let
[TABLE]
where and . Then, (2.1) holds, and we have the following result (see deBoor76 , or (Schumaker07, , Theorems 4.18 and 4.65)).
Theorem 2.2
Under the hypotheses (2.7) and (2.8), the set of B-spline functions, and hence also the B-spline system , forms an OCWT-system satisfying (2.2).
To illustrate the use of B-spline systems, we now provide some examples.
Theorem 2.3
Let and the degree . Then the B-splines (2.8) reduce to the Bernstein system (1.3). Specifically, for and ,
[TABLE]
Proof. We prove the result by induction on . Note that for (i.e., ), , , and hence
[TABLE]
. For (i.e., ), we have the required , , and as follows:
[TABLE]
[TABLE]
[TABLE]
[TABLE]
Assume that the theorem holds true for , then we want to prove (2.9) for . In this case, the B-spline functions are of the form
[TABLE]
It can be shown that for ,
[TABLE]
and
[TABLE]
. This completes the proof. ∎
From now on, for simplicity, we consider only the B-spline with equally-spaced knots, i.e., the B-spline functions on of order having knots given in (2.7) with , .
Example 1
Suppose , i.e., ; then the B-spline system becomes a “histogram”. Namely, for ,
[TABLE]
, where denotes the indicator function of the set .
Example 2
For , i.e., , we have
[TABLE]
and the B-spline system is
[TABLE]
and, for ,
[TABLE]
.
We remark that Shen, et al. Shen-etal08 earlier proposed the “linear B-spline copula”, which corresponds to the case .
Example 3
For and , i.e., , we have
[TABLE]
and the B-spline system is
[TABLE]
. The means of the densities are , , , , and , respectively. We use these values in the computation of Table 1 of maximum correlations for .
3 The maximum correlation copula: Range of correlation
For copula functions, the range of the correlation is of particular importance. In particular, great attention is paid to the maximum achievable correlation (see, e.g., Lin and Huang Lin-Huang10 ). By Theorem 2.1, the maximum is attained when the copula density is
[TABLE]
Suppose that is from the copula density (3.1). Then,
[TABLE]
Noting that and , it follows that
[TABLE]
In the Bernstein case (), it follows from Theorem 2.3 that and hence
[TABLE]
In the case of the B-spline of order zero, given in (2.10),
[TABLE]
In particular, when and , on , and hence . This is the independent case, so .
In order to calculate the maximum correlation for general , we present first a lemma in which it is understood that the vectors and reduce to the central parts when .
Lemma 2
Suppose that , i.e., . Let , , be the B-spline functions on of order having knots (2.7) with , . In addition, denote the integral and the first moment of by
[TABLE]
. Then,
[TABLE]
Proof. For , we have and , where and are given below in (5.8) and (5.9), respectively. Also, for , we have the relations
[TABLE]
because , . Solving the equations (3.4) in a successive manner, we obtain the stated results. ∎
Theorem 3.1
Under the assumptions of Lemma 2, suppose that have the copula density in (3.1) with defined through the B-spline functions (2.8) having knots given in Lemma 2. Then the correlation of is
[TABLE]
Proof. Using (3.2), (3.4), and the notations in Lemma 2, write first
[TABLE]
The final result is obtained by substituting (3) in (3.3) and carry out the calculations to obtain (3.5) with the help of Lemma 2. ∎
The maximum correlation in (3.5) remains valid for all cases . Further, the maximum correlation converges to as , so we obtain , or in probability. Therefore, as , the random variable converges in law to , a bivariate random variable whose joint distribution, remarkably, happens to provide the Fréchet–Hoeffding upper bound, . Thus, we have the following result.
Theorem 3.2
Let be the maximum correlation copula function (2.5) that is constructed by the B-spline
[TABLE]
on of degree , having equally-spaced knots (2.7) with , , where . As , for all , the Fréchet–Hoeffding upper bound.
Table 1 shows the maximum correlations when the number of basis functions is . In view of Table 1, the range of correlation for the B-spline copulas of small order is wider than that of the Bernstein copula. Indeed,
[TABLE]
On the other hand, determines the smoothness of the copula density. Consequently, some criterion is needed to evaluate data fitness so as to balance the accuracy of the approximation with the smoothness of the density; this problem will be studied in future work.
We conjecture that Theorem 3.2 holds in more general settings.
Conjecture 1
Let be distributed as the maximum correlation distribution (2.5) constructed by the B-spline
[TABLE]
on , of degree , with the knots (2.7). As with , converges to ; hence, for all , converges to , the Fréchet–Hoeffding upper bound.
4 The maximum correlation copula: Total positivity
The next two results improve significantly the previous ones about the Bernstein copulas.
Theorem 4.1
The copula in (2.5) is TP∞, i.e., for any ,
[TABLE]
for all and .
Proof. All determinants arising in the proof are of order , unless otherwise specified. Further, we consider two cases: (I) , and (II) .
Case I: . In this case, the matrix (C^{*}(u_{i},v_{j})\bigr{)}_{1\leq i,j\leq r} satisfies
[TABLE]
Consequently, the rank of this matrix is at most , and hence is degenerate. Therefore, it follows obviously that \det\big{(}C^{*}(u_{i},v_{j})\big{)}=0.
Case II: . We will show that \det\big{(}C^{*}(u_{i},v_{j})\big{)}\geq 0. By the Binet–Cauchy formula,
[TABLE]
Writing
[TABLE]
it follows by the continuous version of the Binet–Cauchy formula Gross-Richards98 ; Karlin68 that
[TABLE]
By Theorem 2.2, is an OCWT-system, hence
[TABLE]
for all and . Also, it is well-known from Karlin68 ; Karlin-Studden66 that
[TABLE]
for all and .
Therefore, we deduce from (4.2) and (4.3) that \det\big{(}\Phi_{k_{i}}(u_{j})\big{)}\geq 0 for all and . Similarly, we obtain \det\big{(}\Phi_{k_{i}}(v_{j})\big{)}\geq 0 for and . Hence, it follows from (4) that \det\big{(}C^{*}(u_{i},v_{j})\big{)}\geq 0 for and . The proof is complete. ∎
Remark 1
For the case of the Bernstein copula, we note that (4.4) is proved as follows. Consider
[TABLE]
. Then,
[TABLE]
and
[TABLE]
For , set , , and define the partition , i.e., are nonnegative integers and . Also, let , , and let . Recall from Macdonald15 that the Schur function corresponding to the partition is defined as
[TABLE]
Then we obtain
[TABLE]
It is well-known that for and Gross-Richards98 ; Macdonald15 , and hence
[TABLE]
for all and . This completes the proof of (4.4). ∎
As a consequence of Theorem 4.1, we obtain a new proof of the total positivity of the function ; see (Karlin68, , Chapter 2).
Corollary 1
The Fréchet–Hoeffding upper bound, , is TP∞.
Proof. Recall that for the Bernstein copula, increases to as the number of basis functions goes to infinity (see Huang, et al. Huang-etal13 ). Moreover, it follows from Theorem 3.2 that for the equally-spaced knot B-spline copula, converges to . In either case, by taking the limit, as , of the nonnegative determinant, \det\big{(}C^{*}(u_{i},v_{j})\big{)}, we obtain
[TABLE]
which proves that the function is TPr. Finally, since is arbitrary then it follows that the function is TP∞. ∎
By mimicking the proof of Theorem 4.1, we actually have the following stronger result. The proof is omitted.
Theorem 4.2
The copula density in (3.1) is TP.
Theorem 4.1 is in fact a consequence of Theorem 4.2 by using Lemma 3 below, but we provide a direct proof there.
Let with marginal distributions and , and copula function . Using the language of reliability theory, define the survival functions
[TABLE]
and
[TABLE]
. It follows from the definition of the copula function that and for all . Recently, Lin et al. Lin-etal18 proved the following result.
Lemma 3
If the bivariate distribution has a TPr density with , then both and are TP. Consequently, if has a TP∞ density, both and are TP∞.
An immediate consequence of the last two theorems is the following result. In part (ii) of this result, we apply the fact that both and are non-decreasing, while both are non-increasing (see Marshall, et al. (Marshall-Olkin-Arnold11, , p. 758)).
Corollary 2
*Let be the copula defined in (2.5).
(i) The survival function is TP.
(ii) If with copula , then both and are TP.*
We next discuss some implications of the total positivity. By the results of Gross and Richards (Gross-Richards98, , Section 3, Example 3.7) we have the following inequalities.
Corollary 3
Let with marginals and copula in (2.5) and . Then the matrix
[TABLE]
is TPr, provided the expectations exist.
Let and . The matrix
[TABLE]
is TPr.
In particular, when , it follows from (4.5) that
[TABLE]
an inequality that is equivalent to
[TABLE]
Let and . Note that and . By (4.6), the matrix
[TABLE]
is totally positive of order . By calculating the principal minor of this matrix, we find that ; equivalently, , , i.e., the distribution function is positively quadrant dependent. Further, by calculating the determinant of this matrix, we obtain
[TABLE]
for , .
We remark that more general inequalities can be deduced from (Gross-Richards98, , Example 3.11).
5 Moments of the B-spline functions with initial boundary
In this section, we provide the moment formula for the B-spline functions with initial boundary at defined on . The expressions for and in Lemma 2 are obtained, in Corollary 4 below, as a consequence of the moment formula.
Let be a B-spline function of degree on with knots:
[TABLE]
(compare with the previously studied B-spline function defined in (2.7)). Here, we have and, as before, is generated by the following recursion formula:
[TABLE]
, with initial conditions
[TABLE]
For each , is a non-zero function with support . The recurrence (5.2) can be written more concretely as
[TABLE]
For , denote the -moment of ,
[TABLE]
this quantity was used in the proof of Lemma 2 above. Then, we have the following recurrence relation for these moments.
[TABLE]
with boundary condition
[TABLE]
The next result, which is interesting in its own right, presents the solution of the recurrence system in terms of the Stirling numbers of the second kind:
[TABLE]
Here, , for , and whenever it arises. Note also that and . The Stirling numbers of the second kind satisfy the recurrence formula
[TABLE]
and the identity
[TABLE]
which will be used later. For the identity (5.6), see Wagner Wagner96 and the end of Remark 2 below.
Theorem 5.1
For , the -moment of the B-spline function in (5.1) is of the form
[TABLE]
Corollary 4
For , we have
[TABLE]
and
[TABLE]
The formulas (5.8) and (5.9) can be applied to obtain the formula for the maximum correlation (3.5).
Corollary 5
For ,
[TABLE]
Proof. For the case ,
[TABLE]
by the identity (5.6). ∎
Proof of Theorem 5.1. We prove the statement by mathematical induction on . Note first that (5.7) with coincides with the boundary conditions (5.4) for all and .
Suppose that (5.7) is true for the case and for all and , then we wish to prove that it also holds true for the case and for all and .
(i) For , by the assumption of induction,
[TABLE]
Then, we have
[TABLE]
and by expanding using the binomial theorem, we obtain
[TABLE]
Interchanging the order of summation and using the identity,
[TABLE]
we find that
[TABLE]
Replacing by , we have
[TABLE]
and using the identity,
[TABLE]
we deduce that
[TABLE]
where the last equality follows from the identity (5.6). Moreover, using the identity,
[TABLE]
we obtain
[TABLE]
Since then
[TABLE]
and substituting this result into (5), we obtain
[TABLE]
Similarly, it follows from (5.11) that
[TABLE]
Hence, by substituting (5.13) and (5) into (5.3), we find that
[TABLE]
where the last equality follows from the recurrence formula (5.5).
(ii) When , by the assumption of induction,
[TABLE]
It then follows from (5.3) that
[TABLE]
Here we used the recurrence formula (5.5) with and , viz.,
[TABLE]
(iii) When , by the inductive hypothesis,
[TABLE]
Then, by (5.3), we have
[TABLE]
which coincides with (5.7) with .
The proof is completed by induction on . ∎
Remark 2
Recall the generalized (higher-order) Bernoulli polynomial defined by the generating function
[TABLE]
, , where is a polynomial of degree in with rational coefficients. Neuman (Neuman81, , Proposition 3.5) showed that , from which (5.10) for in Corollary 5 also follows by the relationship between the Stirling number and the generalized Bernoulli polynomial:
[TABLE]
The latter can be verified by (5.16) and the exponential generating function,
[TABLE]
, . See Carlitz60 and Branson00 for (5.17) and (5.18), respectively. The generating function (5.18) can also be established by induction on and this is equivalent to verifying the above useful identity (5.6).
Remark 3
By iteration, we have
[TABLE]
and hence its -moment is equal to
[TABLE]
as shown in (5.15).
Remark 4
It can be shown that for ,
[TABLE]
Therefore, the -moment of can be calculated as
[TABLE]
This is equivalent to the first formula of (5.7) (with ). Indeed, for , it follows from (5.7) that
[TABLE]
and if we now change variables from to , where , then we obtain
[TABLE]
Acknowledgments
This work was supported by JSPS KAKENHI Grants, Numbers 16K00060 and 16H02792.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Baker, R. An order-statistics-based method for constructing multivariate distributions with fixed marginals. Journal of Multivariate Analysis 99 , 2312–2327 (2008)
- 2(2) Branson, D. Stirling numbers and Bell numbers: their role in combinatorics and probability. Mathematical Scientist 25 , 1–31 (2000)
- 3(3) Carlitz, L. Note on Nörlund’s Polynomial B n ( z ) superscript subscript 𝐵 𝑛 𝑧 B_{n}^{(z)} . Proceedings of the American Mathematical Society 11 , 452–455 (1960)
- 4(4) de Boor, C. On calculating with B-splines. Journal of Approximation Theory 6 , 50–62 (1972)
- 5(5) de Boor, C. Total positivity of the spline collocation matrix. Indiana University Mathematics Journal 25 , 541–551 (1976)
- 6(6) de Boor, C. A Practical Guide to Splines , Revised edn. Springer, New York (2001)
- 7(7) Dou, X., Kuriki, S., and Lin, G. D. Dependence structures and asymptotic properties of Baker’s distributions with fixed marginals. Journal of Statistical Planning and Inference 143 , 1343–1354 (2013)
- 8(8) Dou, X., Kuriki, S., Lin, G. D., and Richards, D. EM algorithms for estimating the Bernstein copula. Computational Statistics & \& Data Analysis 93 , 228–245 (2014)
