TL;DR
This paper introduces new statistical data depth functions for collections of Hermitian positive definite matrices, leveraging Riemannian geometry to enable robust inference and characterization of centrality in such matrix data.
Contribution
It develops the first intrinsic data depth functions for Hermitian positive definite matrices, with computationally efficient methods and practical applications in confidence region construction.
Findings
Proposed two fast data depth functions satisfying key properties.
Demonstrated robustness and efficiency of the depth functions.
Applied depth-based methods to analyze covariance matrices from a clinical trial.
Abstract
Nondegenerate covariance, correlation and spectral density matrices are necessarily symmetric or Hermitian and positive definite. The main contribution of this paper is the development of statistical data depths for collections of Hermitian positive definite matrices by exploiting the geometric structure of the space as a Riemannian manifold. The depth functions allow one to naturally characterize most central or outlying matrices, but also provide a practical framework for inference in the context of samples of positive definite matrices. First, the desired properties of an intrinsic data depth function acting on the space of Hermitian positive definite matrices are presented. Second, we propose two computationally fast pointwise and integrated data depth functions that satisfy each of these requirements and investigate several robustness and efficiency aspects. As an application, we…
| Intrinsic zonoid depth, | Geodesic distance depth, | ||||||
|---|---|---|---|---|---|---|---|
| 5-GND | Ave.- | Ave.-Size (SE) | Coverage | Ave.- | Ave.-Size (SE) | Coverage | |
| -CR | 0.0181 | 0.171 (0.78) | 0.760 | 0.810 | 0.171 (0.21) | 0.805 | |
| -CR | 0.0064 | 0.196 (0.95) | 0.889 | 0.794 | 0.195 (0.25) | 0.901 | |
| -CR | 0.0023 | 0.214 (1.20) | 0.935 | 0.780 | 0.216 (0.30) | 0.957 | |
| 2-GND | Ave.- | Ave.-Size (SE) | Coverage | Ave.- | Ave.-Size (SE) | Coverage | |
| -CR | 0.0181 | 0.205 (1.71) | 0.796 | 0.775 | 0.207 (0.61) | 0.825 | |
| -CR | 0.0064 | 0.236 (2.19) | 0.897 | 0.756 | 0.237 (0.60) | 0.898 | |
| -CR | 0.0023 | 0.260 (2.80) | 0.947 | 0.740 | 0.264 (0.70) | 0.950 | |
| 1.5-GND | Ave.- | Ave.-Size (SE) | Coverage | Ave.- | Ave.-Size (SE) | Coverage | |
| -CR | 0.0181 | 0.228 (2.64) | 0.798 | 0.755 | 0.230 (0.91) | 0.828 | |
| -CR | 0.0065 | 0.262 (3.55) | 0.892 | 0.734 | 0.263 (0.88) | 0.914 | |
| -CR | 0.0023 | 0.284 (4.73) | 0.925 | 0.716 | 0.294 (0.92) | 0.952 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Intrinsic data depth for Hermitian positive definite matrices
Joris Chau111 Corresponding author, [email protected], Institute of Statistics, Biostatistics, and Actuarial Sciences, Université catholique de Louvain, Voie du Roman Pays 20, B-1348, Louvain-la-Neuve, Belgium. , Hernando Ombao222 Department of Statistics, University of California at Irvine, Bren Hall 2206, Irvine, CA, 92697, United States. Department of Applied Mathematics and Computational Science, King Abdullah University of Science and Technology, Thuwal 23955-6900, Saudi Arabia. and Rainer von Sachs333 Institute of Statistics, Biostatistics, and Actuarial Sciences, Université catholique de Louvain, Voie du Roman Pays 20, B-1348, Louvain-la-Neuve, Belgium.
Abstract
Nondegenerate covariance, correlation and spectral density matrices are necessarily symmetric or Hermitian and positive definite. The main contribution of this paper is the development of statistical data depths for collections of Hermitian positive definite matrices by exploiting the geometric structure of the space as a Riemannian manifold. The depth functions allow one to naturally characterize most central or outlying matrices, but also provide a practical framework for inference in the context of samples of positive definite matrices. First, the desired properties of an intrinsic data depth function acting on the space of Hermitian positive definite matrices are presented. Second, we propose two computationally fast pointwise and integrated data depth functions that satisfy each of these requirements and investigate several robustness and efficiency aspects. As an application, we construct depth-based confidence regions for the intrinsic mean of a sample of positive definite matrices, which is applied to the exploratory analysis of a collection of covariance matrices associated to a multicenter research trial.
Keywords: Data depth, Hermitian positive definite matrices, Riemannian manifold, Confidence regions, Affine-invariant metric, Covariance matrices.
1 Introduction
In numerous applications in multivariate statistics, we are interested not only in the first-order behavior (mean) of a sample of random vectors, but also in the second-order behavior or variability of the sample. In fact, our primary interest is often precisely the analysis of covariance or correlation structures between components of the random vectors. In many areas of statistical research, such as neuroscience, biomedical science, environmental science, demographics or finance, it is increasingly common to encounter covariance or correlation matrices across a large number of temporal or spatial locations, or across a large number of replicated subjects or trials in an experiment. In this work, our aim is to develop data exploration and inference tools for large collections or samples of such matrices.
The data objects of interest, nondegenerate covariance or correlation matrices, are necessarily elements of the space of Hermitian positive definite (HPD) matrices, which includes the space of symmetric positive definite (SPD) matrices in the real-valued case. The space of HPD matrices, although very well-structured, is inherently non-Euclidean and standard Euclidean-based statistical procedures (e.g., regression, clustering or inference procedures) may be unstable or break down due to the geometric constraints of the space. For this reason, it is necessary to generalize statistical procedures for data in the space of symmetric or Hermitian PD matrices, taking into account the non-Euclidean geometry of the space. Several recent works addressing this issue include: Smith (2000), Pennec et al. (2006), Fletcher et al. (2009), Zhu et al. (2009), Dryden et al. (2009), Fletcher et al. (2011), Yuan et al. (2012) Said et al. (2015), Holbrook et al. (2016) and Chau and von Sachs (2017) among others. The main contribution of this paper is the generalization of notions of data depth for samples of HPD matrices to provide a center-to-outward ordering of positive definite matrix-valued objects.
Data depth is a useful tool for data exploration to identify most central or outlying data observations (as in Liu et al. (1999) or Sun and Genton (2012) in a Euclidean context); or as a means of inference, by way of rank-based hypothesis testing (as in Liu and Singh (1993), Chenouri and Small (2012), or (Mosler, 2002, Chapter 5)), classification (see e.g., Li et al. (2012)) or the construction of confidence regions (see e.g., Yeh and Singh (1997)) among other applications. Although many different depth functions have been proposed and studied in the literature over the years, most data depth functions are constructed in the first place for vector-valued observations in the Euclidean space . Exceptions include Liu and Singh (1992), where the authors consider depth functions for directional data on circles or spheres; Hu et al. (2011), on projection depth for tensor objects; or the recent work, Paindaveine and Van Bever (2017), on halfspace depths for scatter, concentration and shape matrices. For an overview of various Euclidean data depth functions and their specific properties, we refer the reader to e.g., Liu et al. (1999), Zuo and Serfling (2000), or Mosler (2002).
The space of -dimensional Hermitian (not necessarily PD) matrices together with matrix addition and matrix scalar multiplication is a real vector space, and each Hermitian matrix bijectively maps to a vector in by expanding the matrix with respect to some basis. To calculate data depth values for a sample of Hermitian matrices, it suffices to apply any ordinary Euclidean data depth function to the basis component vectors of the Hermitian matrices, given that the computed depth values do not depend on the chosen basis. In contrast, due to the nonlinear positive definite constraints, the space of HPD matrices is not a vector space. Moreover, the cone of HPD matrices embedded in a Euclidean space endowed with the Eulidean metric is not a complete metric space. As a consequence, Euclidean data depth applied to a sample of HPD matrices violates the basic properties of a proper depth function. To illustrate, according to Zuo and Serfling (2000), a proper depth function should be monotonicallly non-increasing moving outwards from a well-defined center. Moving away from a central point along a straight line is not always well-defined in the cone of HPD matrices, as the boundary of the space lies at a finite distance. Also, pointwise or uniform continuity properties of the data depth functions fail to hold due to the incompleteness of the metric space.
Instead of embedding the space of HPD matrices in an ambient Euclidean space, we exploit the geometric structure of the space of HPD matrices as a curved Riemannian manifold equipped with the affine-invariant (Pennec et al. (2006)) –also natural invariant (Smith (2000)), canonical (Holbrook et al. (2016)), trace (Yuan et al. (2012)), Rao-Fisher (Said et al. (2015))– Riemannian metric, or simply the Riemannian metric (Bhatia (2009), Dryden et al. (2009)). The affine-invariant metric plays an important role in estimation problems in the space of symmetric or Hermitian PD matrices for several reasons: (i) the space of HPD matrices equipped with the affine-invariant metric is a complete metric space, (ii) the affine-invariant metric is invariant under congruence transformation by any invertible matrix, see Section 2, and (iii) there is no swelling effect as with the Euclidean metric, where interpolating two HPD matrices may yield a matrix with a determinant larger than either of the original matrices, which may lead to computational instability, (Pasternak et al. (2010)). The first property allows us to construct proper data depth functions in the space of HPD matrices satisfying all of the intrinsic versions of the axiomatic properties in Zuo and Serfling (2000). The second property is important to ensure that the depth functions are general linear congruence invariant, which in practice means that the depth values do not non-trivially depend on the chosen coordinate system of the data. In Dryden et al. (2009), the authors list several additional metrics for estimation problems in the space of HPD matrices, such as the Log-Euclidean metric, also studied in Arsigny et al. (2006). The Log-Euclidean metric transforms the space of HPD matrices into a complete metric space and is invariant under congruence transformations by the unitary group, but not by the general linear group, as is true for the affine-invariant metric.
In the preliminary Section 2, we introduce the necessary geometric tools to develop data depths acting directly on the space of HPD matrices as a geodesically complete manifold. In Section 3, we present the desired properties an intrinsic depth function should satisfy, and we propose two data depth functions that satisfy each of these requirements. In addition, we consider integrated depth functions that act on curves of HPD matrices, such as spectral density matrices. In Section 4, we compare the two depth functions in terms of robustness and efficiency aspects. In Section 5, as an application of the depth functions, we construct depth-based confidence regions for the intrinsic mean of a sample of HPD matrices, and in Section 6 we apply the intrinsic depth functions to explore a collection of covariance matrices from a multicenter clinical trial. The technical proofs and additional figures can be found in the supplementary material. The accompanying R-code, containing the necessary tools to compute the intrinsic data depths and to perform rank-based hypothesis testing for samples of HPD matrices, is publicly available in the R-package pdSpecEst on CRAN, (Chau (2017)).
2 Preliminaries
2.1 Geometry of HPD matrices
In order to develop data depths for observations in the space of HPD matrices, we study the space as a Riemannian manifold as in Pennec et al. (2006), (Bhatia, 2009, Chapter 6), or Smith (2000) among others. Denote for the space of HPD matrices. is an open subset of the space Hermitian matrices , and as such a smooth manifold. The tangent space at a point (i.e., a matrix) can be identified by the Hermitian space , and the Frobenius inner product on induces the affine-invariant Riemannian metric on the manifold given by the smooth family of inner products:
[TABLE]
with . Here and throughout this paper, always denotes the Hermitian square root matrix of , and we write for matrix congruence transformation, where ∗ denotes the conjugate transpose of a matrix. The Riemannian distance on derived from the affine-invariant Riemannian metric is given by:
[TABLE]
where denotes the matrix Frobenius norm and is the matrix logarithm. Denote the general linear group by . The mapping is an isometry for each invertible matrix , i.e., it is distance-preserving:
[TABLE]
By (Bhatia, 2009, Theorem 6.1.6 and Prop. 6.2.2), the Riemannian manifold is geodesically complete. By the Hopf-Rinow Theorem this implies that there exists a unique geodesic segment joining any two points and every geodesic can be extended indefinitely. The Hopf-Rinow Theorem also implies that for every the exponential map and the logarithmic (i.e., inverse exponential) map are global diffeomorphisms with domains and respectively. By (Pennec et al. (2006)), the exponential and logarithmic maps are given by,
[TABLE]
where denotes the matrix exponential. The Riemannian distance may now also be expressed in terms of the logarithmic map as:
[TABLE]
where throughout this paper denotes the norm of induced by the affine-invariant metric.
As there exists a unique geodesic curve connecting any two points , geodesically convex sets are well-defined. A subset is said to be convex or geodesically convex if for each pair of points , the geodesic segment is contained entirely in . If , then the convex hull of , denoted by , is the smallest convex set containing . This set is conveniently expressed as,
[TABLE]
where is the Lebesgue measure on the finite-dimensional metric space and is a measurable function. For more details on the construction of (approximate) convex hulls on the manifold , we refer to Fletcher et al. (2011).
2.2 Probability distributions and random variables
A random variable on the Riemannian manifold is a measurable function from some probability space to the measurable space , where is the Borel algebra, i.e., the smallest -algebra containing all open sets in . In the following, we always work directly with the induced probability on , . By , we denote the set of all probability measures on and denotes the subset of probability measures in that have finite moments of order with respect to the Riemannian distance, i.e., the -Wasserstein space (Villani, 2009, Definition 6.4):
[TABLE]
Note that if for some and , this is true for any . This follows by the triangle inequality and the fact that for any , as . For a sequence of probability measures in , denotes weak convergence to the probability measure in the usual sense, i.e., for every continuous and bounded function , and a sequence is said to be uniformly integrable if for some . Note that if is uniformly integrable for some , then the sequence is uniformly integrable for any . Finally, we use the notation for the convex hull of the support of the measure on , and and for its relative interior and relative boundary.
2.3 Measures of centrality
Intrinsic mean.
To characterize the center of a random variable with probability measure , one important measure of centrality is the Karcher or Fréchet mean, which is also referred to as the intrinsic mean as it is intrinsic to the Riemannian distance measure on the manifold. The intrinsic mean turns out to be the point of maximum depth in the intrinsic zonoid depth introduced in Section 3.2. The set of intrinsic means consists of the points that minimize the second moment with respect to the Riemannian distance,
[TABLE]
If , then at least one intrinsic mean exists as the above expectation is finite for . Moreover, since the manifold is a geodesically complete manifold of non-positive curvature (see Pennec et al. (2006) or Skovgaard (1984)), by (Le, 1995, Proposition 1) the intrinsic mean is unique for any distribution . By (Pennec, 2006, Corollary 1), the intrinsic mean is also represented by the point that satisfies,
[TABLE]
where is the zero matrix. The sample intrinsic mean of a set of manifold-valued observations minimizes a sum of squared Riemannian distances and can be computed efficiently through a gradient descent algorithm as in Pennec (2006).
Intrinsic median.
A second measure of centrality of primary interest is the intrinsic median as in Fletcher et al. (2009), which is the point of maximum depth in the geodesic distance depth defined in Section 3.4. The set of intrinsic medians minimizes the first moment with respect to the Riemannian distance,
[TABLE]
On , a geodesically complete manifold with non-positive curvature, the intrinsic median exists and is unique for any distribution . This follows by the proof of (Fletcher et al., 2009, Theorem 1) combined with an application of Leibniz’s integral rule. Furthermore, the intrinsic median is uniquely characterized by the point that satisfies,
[TABLE]
- Remark.
If the distribution of a random variable is centrally symmetric around in the sense that , then the intrinsic mean and median coincide and are equal to . Here, equality in distribution () is read as equality in terms of the joint distribution of all matrix components. The claim for the intrinsic mean follows by the fact that , which implies that is the intrinsic mean of the random variable . For the intrinsic median, if is centrally symmetric around , then is also angularly symmetric around in the sense that . Substituting , we observe that , which implies that is also the intrinsic median of the random variable .
3 Data depth for HPD matrices
Before introducing the manifold data depth functions, we present the desired properties a proper intrinsic data depth function –acting directly on the space of HPD matrices– should satisfy. These requirements are the natural intrinsic generalizations of the properties in Zuo and Serfling (2000) for depth functions acting on vectors in a Euclidean space . We also consider integrated analogs for depth functions acting on curves of HPD matrices with , such as spectral density matrices in the Fourier domain.
3.1 Depth properties
Below, we denote for the depth of a matrix with respect to a distribution ; or for the integrated depth of a matrix curve with respect to a curve of marginal measures , such that for each . If a nonnegative bounded function or satisfies the pointwise (resp. integrated) properties P.1 to P.4, we say that it is a proper data depth function on the Riemannian manifold .
P.1 (Congruence invariance) The depth function should be invariant under matrix congruence transformation of the form , with . That is, for each ,
[TABLE]
where is the distribution of the transformed random variable , such that is distributed according to . Generalizing this property for an integrated depth function , we require that the same property holds pointwise for each . In this case, is a curve of invertible matrices, with for each .
In a standard Euclidean context, for a depth function acting on vectors in the Euclidean space , it is desirable that the depth is affine-invariant for each , where is the distribution of the random vector , with , and distributed according to . In the current setup, we are concerned with covariance or correlation matrices, corresponding to the second-order behavior of a random vector. For a random vector with covariance matrix , the covariance matrix of the affine transformation is given by . A natural requirement for the depth functions acting on symmetric or Hermitian PD matrices is therefore invariance under congruence transformations of the data. Another way to view this is that a depth function acting on the covariance matrix of a data vector should be invariant under a change of basis in the data space of .
P.2 (Maximality at center) The depth function should attain its maximum value, i.e., deepest point, at a well-defined unique center of the distribution, such as the intrinsic mean or median, which are characterized as the points of central and angular symmetry respectively. Let be a unique central point of the distribution , then,
[TABLE]
Similarly, for an integrated depth function, the maximum value should be attained at a well-defined unique central curve with , such as the curve of intrinsic means or medians.
P.3 (Monotonicity relative to center) As moves away from the deepest point along a geodesic curve emanating from , the depth of the point with respect to the distribution should be monotonically non-increasing. Let , , be the geodesic emanating from with unit tangent vector . Then,
[TABLE]
For an integrated depth function, let be real-valued curves over , such that for each . Denote and , where is a curve of unit tangent vectors. Then,
[TABLE]
P.4 (Vanishing at infinity) The depth of a point should approach zero as the point converges to a singular matrix, i.e., a matrix with zero or infinite eigenvalues,
[TABLE]
Similarly, for an integrated depth function, if the curve converges to a curve of singular matrices for each , then the integrated depth should approach zero.
Below, we give two additional continuity properties, which although not strictly required are nonetheless useful to derive asymptotic results in subsequent applications, such as rank-based hypothesis testing or the construction of depth-based confidence sets as in Section 5.
(P.5) (Continuity in ) Let be a convergent sequence with for each , such that . Then the depth function is continuous in in the sense that,
[TABLE]
(P.6) (Uniform continuity in ) The depth function is uniformly continuous in terms of the probability measure in the sense that if is a uniformly integrable sequence of probability measures, such that . Then,
[TABLE]
3.2 Intrinsic zonoid depth
As geodesic convex hulls are well-defined on the Riemannian manifold , there exist natural manifold generalizations of the simplicial depth or convex hull peeling depth (Liu et al. (1999)) for Euclidean vectors. However, the simplicial depth requires the computation of possibly many convex hulls, which quickly becomes computationally infeasible, especially for higher-dimensional matrices. Instead, we propose a straightforward manifold generalization of another depth measure based on trimmed convex depth regions, the zonoid depth (e.g., Mosler (2002)). The intrinsic manifold zonoid depth can be computed with the same tools as the standard zonoid depth for Euclidean vectors and its computation remains efficient, also for higher-dimensional HPD matrices.
In a Euclidean context, let be a probability measure on with finite first moment, then the zonoid -trimmed region, with , is defined as the set,
[TABLE]
If , we set . By (Mosler, 2002, Chapter 3), is convex and monotone decreasing in , creating a nested sequence of convex sets for decreasing values . If , consists of the single point , the Euclidean mean of the distribution . The Euclidean zonoid depth of a point with respect to a distribution is characterized by the smallest -trimmed region still containing ,
[TABLE]
The zonoid data depth is extended to the Riemannian manifold as follows.
Definition 3.1**.**
(Intrinsic zonoid depth) Let and let be the probability measure on of the random variable as a -dimensional random real basis component vector, where has probability measure . The intrinsic zonoid depth of a point with respect to the distribution is defined as:
[TABLE]
where is a -dimensional zero vector, and is the Euclidean zonoid -trimmed region of the distribution of the normal coordinate vector on . Equivalently, the intrinsic zonoid depth can be written as,
[TABLE]
where is the intrinsic zonoid -trimmed region defined as,
[TABLE]
with a measurable function.
- Remark.
Computation of the intrinsic zonoid depth is straightforward via the definition and can be calculated directly by the Euclidean zonoid depth as in (Mosler, 2002, Chapter 4). Note that if is an orthonormal basis of the vector space , then an orthonormal basis of is simply . In fact, the basis components of can be computed directly using only an orthonormal basis of , since .
Theorem 3.1**.**
The intrinsic zonoid depth is a proper data depth function in the sense of Section 3.1, satisfying properties P.1–P.4 for distributions in . The unique point of maximum depth coincides with the intrinsic mean of the distribution.
In order to show that the continuity properties P.5 and P.6 also hold for the intrinsic zonoid depth, we need the following lemma.
Lemma 3.2**.**
Let . Then, . In particular, for each , by definition of the intrinsic zonoid depth.
Theorem 3.3**.**
The intrinsic zonoid depth is continuous in as in P.5 for and , i.e., if with for all , then,
[TABLE]
The intrinsic zonoid depth is uniformly continuous in as in P.6 for and in uniformly integrable. If , then,
[TABLE]
Example 3.1**.**
In Figure 1, we display several central intrinsic zonoid depth regions for generated i.i.d. samples of -dimensional SPD matrices from a distribution with intrinsic mean . Denoting for the empirical distribution of , the central depth-region is given by the set of SPD matrices:
[TABLE]
In the left-hand image, data matrices are sampled from a Riemannian log-normal distribution as in e.g., Yuan et al. (2012), with intrinsic mean equal to the identity matrix. That is, , with , where is an orthonormal basis of (). In the right-hand image, is a rescaled Wishart distribution with intrinsic mean , such that , with a complex Wishart distribution with degrees of freedom and the intrinsic bias-correction in (Chau and von Sachs, 2017, Theorem 5.1). The -axes in Figure 1 correspond to the three independent components in the symmetric matrix .
3.3 Integrated intrinsic zonoid depth
A straightforward generalization of the pointwise intrinsic zonoid depth in Definition 3.1 to compute the depth of a curve with respect to a collection of marginal measures for is to consider the integrated intrinsic zonoid depth given by,
[TABLE]
where is the probability of the components of the random variable , such that has probability measure . This is similar to the construction of the modified band depth (MBD) in a functional data context, where the pointwise Euclidean simplicial depths are integrated over a functional domain , (López-Pintado and Romo (2009) or Sun and Genton (2012)). The integrated versions of the properties P.1 to P.6 continue to hold for the integrated intrinsic zonoid depth and are straightforward generalizations of their pointwise analogs.
Theorem 3.4**.**
The integrated intrinsic zonoid depth is a proper integrated depth function in the sense of Section 3.1, satisfying the integrated versions of properties P.1–P.4 for collections of marginal distributions for . The unique curve of maximum depth coincides with the curve of pointwise intrinsic means of the marginal distributions.
Proposition 3.5**.**
Let , and for each , such that uniformly in , i.e., . Then the integrated manifold zonoid depth is continuous in as in P.5 in the sense that,
[TABLE]
If , in is a uniformly integrable sequence of measures uniform in , and uniformly in . Then,
[TABLE]
Here, means that for each , and the uniform weak convergence is read as for every continuous and bounded function .
3.4 Geodesic distance depth
As a second notion of data depth on the geodesically complete manifold , we consider the geodesic distance depth, the natural analog on the metric space of the arc distance depth in Liu and Singh (1992) for data observations on circles and spheres. The geodesic distance depth is straightforward to calculate, also for high-dimensional matrices, as the only required operation is the computation of Riemannian distances between HPD matrices.
Definition 3.2**.**
(Geodesic distance depth) Let , then the geodesic distance depth of a point with respect to the distribution is defined as:
[TABLE]
Theorem 3.6**.**
The geodesic distance depth is a proper data depth function in the sense of Section 3.1, satisfying P.1–P.4 for distributions in . The unique point of maximum depth coincides with the intrinsic median of the distribution.
Theorem 3.7**.**
The geodesic distance depth is continuous in as in P.5 for , the closure of , and . That is, if with for all , then,
[TABLE]
The geodesic distance depth is uniformly continuous in as in P.6 for and uniformly integrable. If , then,
[TABLE]
- Remark.
In order to compute the empirical depth of each observation in a sample with respect to the empirical distribution of the sample , it suffices to compute the -dimensional distance matrix with -th entry . This matrix is fully determined by components, as the diagonal entries are zero and . In particular, in online applications where the depths need to be updated each time a new observation enters the database, we simply add one extra column and row to the distance matrix and update the depth values.
- Remark.
A third notion of data depth on the Riemannian manifold , closely related to the geodesic distance depth, is the intrinsic spatial depth. This is the natural manifold generalization of the spatial depth in Vardi and Zhang (2000) or Serfling (2002). For a distribution and a point , the intrinsic spatial depth is given by:
[TABLE]
The intrinsic spatial depth attains its maximum value at the intrinsic median, since by definition of the intrinsic median, and the depth is lower bounded by zero, which is a direct consequence of the triangle inequality combined with the fact that . The intrinsic spatial depth is closely associated to the geodesic distance depth in the sense that it is based on the gradient of the distance function, i.e., the gradient of for fixed is given by , see Fletcher et al. (2009).
3.5 Integrated geodesic distance depth
In order to generalize the pointwise geodesic distance depth to the depth of a curve , with respect to a collection of marginal measures for , we replace the pointwise expected distance in Definition 3.2 by an integrated expected distance as:
[TABLE]
The integrated versions of the properties P.1 to P.6 continue to hold for the integrated geodesic distance depth and are straightforward generalizations of their pointwise analogs as in the case of the integrated intrinsic zonoid depth.
Theorem 3.8**.**
The integrated geodesic distance depth is a proper function depth function in the sense of Section 3.1, satisfying the integrated versions of properties P.1–P.4 for collections of marginal distribution for . The unique curve of maximum depth coincides with the curve of pointwise intrinsic medians of the marginal distributions.
Proposition 3.9**.**
Let and for each , such that uniformly in , i.e., . Then the integrated geodesic distance depth is continuous in as in P.5 in the sense that,
[TABLE]
If , in is a uniformly integrable sequence of measures uniform in , and uniformly in . Then,
[TABLE]
where is read as for each .
4 Aspects of robustness and efficiency
Depth-median breakdown.
An intuitive measure of robustness of the intrinsic depth functions is given by their breakdown points according to Hampel et al. (1986). In order to assess the robustness of the depth functions, a first step is to compute the breakdown point of the location estimator that maximizes the depth, i.e., the depth-median, as in Donoho and Gasko (1992) or Liu and Singh (1992), which we explain as follows. Let be an initial set of HPD observations and let be a set of contaminating HPD observations. Denote and consider the –not necessarily in-sample– depth-median , with the empirical distribution of . The breakdown point of the depth-median is the smallest fraction of arbitrarily large contaminating observations that breaks down the estimator:
[TABLE]
Note that , such that for all , and if is a singular matrix lying on the boundary of the metric space . The breakdown point of the depth-median for the intrinsic zonoid depth is as the depth-median coincides with the sample intrinsic mean and it requires only a single large contaminating observation to make the sample intrinsic mean arbitrarily large. The intrinsic zonoid depth-median is therefore not robust against outlying observations in terms of the depth-median breakdown point, analogous to the Euclidean case, as discussed in Mosler (2002). For the geodesic distance depth, the depth-median coincides with the intrinsic median and the intrinsic median in a geodesically complete manifold is known to have maximum breakdown point , as shown in (Fletcher et al., 2011, Theorem 2).
Simultaneous depth-rank breakdown.
The above definition of the breakdown point gives us an intuitive measure of robustness for the depth-median. However, it does not tell us how robust the depth function is with respect to the depth-ranked observations in the sample itself. As a more informative robustness measure, we study the breakdown point simultaneously over all the depth-ranked observations in an initial sample of size . Let us write for the -th center-to-outward order statistic (or -th depth-ranked observation) with respect to a given depth measure. The simultaneous breakdown point is the smallest fraction of arbitrarily large contaminating observations that breaks down at least one of the first depth-ranked observations:
[TABLE]
For the intrinsic zonoid depth, if we break ties by assigning the same rank to each observation with equal depth, the simultaneous breakdown point is . If we break ties by assigning increasing ranks based on increasing Frobenius norms , then the simultaneous breakdown point is . This is illustrated as follows. Let be a first contaminating observation with , such that , where denotes the intrinsic mean of the contaminated sample . Assuming without loss of generality that for each , the contaminating observation will be assigned depth-rank and the first depth-ranked observations do not break down. Let be a second contaminating observation, then has maximum depth by Theorem 3.1, and thus . Since we can choose , such that for any , it follows that .
Proposition 4.1**.**
For the geodesic distance depth, the depth-ranked observations have maximum simultaneous breakdown point equal to the median breakdown point .
The above proposition asserts that if we observe a number of (large) contaminating observations smaller than the initial sample size , the geodesic distance depth will assign the contaminating observations to the ranks . The depth-rankings with respect to the geodesic distance depth are therefore highly robust against arbitrarily large contaminating observations, in contrast to the intrinsic zonoid depth-rankings, also illustrated in Figure 4.
Example 4.1**.**
The above depth measures share the same robustness properties in terms of their depth-median and simultaneous depth-rank breakdown point. In general, this does not have to be the case. For instance, consider the simplicial or convex hull peeling depth on the real line (e.g., Liu et al. (1999)), which are highly robust in terms of their depth-median breakdown point , as argued in Chen (1995) for the simplicial depth. In contrast, both data depths have simultaneous breakdown points as two well-placed large contaminating observations can ensure that for any .
- Remark.
The definitions of the depth-median and simultaneous breakdown points for the integrated depth functions are straightforward generalizations of the pointwise definitions above and it is easily verified that the breakdown points for the integrated depth functions coincide with their pointwise analogs.
Depth-median efficiency.
The robustness of the depth functions may result in a loss of efficiency of the depth-median as an intrinsic location estimator on the Riemannian manifold . Figure 5 displays the relative efficiency of the geodesic distance depth-median , (i.e., the intrinsic median), relative to the intrinsic zonoid depth-median , (i.e., the intrinsic mean), in terms of the Riemannian mean squared error. That is,
[TABLE]
The depth-medians are computed from simulated samples , where is a centrally symmetric distribution, such that the intrinsic mean and median coincide and are equal to the identity matrix. In particular, , where is an orthonormal basis of (), and are i.i.d. random variables from a -generalized normal distribution (Sinz et al. (2009)), with mean zero and standard deviation , such that . The family of -generalized normal distributions (-GNDs) allows us to generate tail behavior that is either heavier or lighter than the normal distribution. For , the -GND coincides with the normal distribution. As shown in Figure 5, for random variables generated from a light-tailed -GND ( and and in particular small dimensions ), the intrinsic zonoid depth regions are better centered around the population mean of the generating distributions than the geodesic distance depth regions; for a heavier-tailed -GND (), the efficiency gain of the intrinsic zonoid depth-median relative to the geodesic distance depth-median diminishes.
4.1 Computational effort
To demonstrate the computational effort of the depth calculations in practice, Figure 6 displays median computation times in milliseconds (single-core Intel Xeon E5-2650, 2.40Ghz) of the intrinsic depths of a single -dimensional HPD matrix with respect to a sample of HPD matrices calculated with the function pdDepth() in the accompanying R-package pdSpecEst, (including the intrinsic spatial depth computation times). On the left, the sample size is fixed at , and on the right the matrix-dimensions are fixed at . The displayed times are the median computation times of depth calculations for simulated samples, i.e., a total of depth calculations per scenario. The intrinsic zonoid depth requires that and for this reason there are several missing values in the left-hand image. Changing the default affine-invariant metric in the intrinsic depth computations to e.g., the Log-Euclidean, Cholesky, root-Euclidean or Euclidean metric –all are available in the function pdDepth()– the depth computation times are either similar or faster than the times displayed in Figure 6.
5 Application: Confidence sets for HPD matrices
As an illustrating application of the intrinsic depth functions, we construct intrinsic matrix confidence regions in the space of HPD or SPD matrices, such as confidence regions for estimated covariance or spectral density matrices. In the context of spectral density matrix estimation, a common approach is to construct asymptotic or bootstrapped confidence regions individually for each element of the spectral matrix, as demonstrated in Dai and Guo (2004) or Fiecas and von Sachs (2014) among others. Although this is a suitable approach to assess the variability of the estimator in each of the individual matrix components, this does not allow for the construction of simultaneous confidence regions across matrix elements, as the combined elementwise confidence intervals do not take the positive definite constraints of the full matrix object into account. In contrast, the intrinsic depth regions provide a natural way to construct joint matrix confidence regions taking into account the geometric constraints of the target space. This is illustrated by the construction of depth-based confidence regions for the intrinsic mean of a sample of i.i.d. HPD random matrices.
Consider , with centered around a population intrinsic mean . Denote for the sample intrinsic mean, i.e., , then the intrinsic central limit theorem in (Said et al., 2015, Proposition 11) tells us that,
[TABLE]
where is a random Hermitian matrix, such that , with and an orthonormal basis of equipped with the associated metric .
To cast this into a standard Euclidean framework, the Euclidean logarithmic map is given by . If for some fixed matrices , then , and in the random setting, the construction of asymptotic confidence sets for is straightforward based on an estimate and knowledge of the limiting distribution of . In a curved Riemannian manifold, if , with fixed, then in general \mu\neq\textnormal{Exp}_{\bar{m}}\big{(}-\frac{1}{\sqrt{n}}Z\big{)}. Instead, \mu=\textnormal{Exp}_{\bar{m}}\big{(}-\frac{1}{\sqrt{n}}\widetilde{Z}_{\mu}\big{)}, where is the parallel transport of the matrix from the tangent space at to the tangent space at . In the Euclidean setting , as the parallel transport in a Euclidean or flat space equals the identity map, but on the Riemannian manifold the parallel transport is nontrivial due to the nonzero curvature of the space and it depends on the unknown population mean . One working solution is to approximate the parallel transport using a plug-in estimator for , such as , in which case the parallel transport is approximated by the identity map. Another approach that is considered here, is to construct approximate confidence sets for the intrinsic mean through resampling, which does not require knowledge of the population mean . That is, (i) generate bootstrap intrinsic sample means by resampling with replacement from , (ii) define a percentile confidence region for in the same fashion as Yeh and Singh (1997) or Wei and Lee (2012) through the trimmed depth-region:
[TABLE]
where is the empirical distribution of . First-order convergence of the percentile confidence regions to the asymptotically correct confidence regions, as and tend to infinity, follows in the same fashion as in Yeh and Singh (1997). The proof relies on the uniform continuity property P.6, satisfied by both the intrinsic zonoid and geodesic distance depth.
- Remark.
Note that the depth-based confidence regions are equivariant under matrix congruence transformations of the sample , with , in the sense that . This is an immediate consequence of property P.1 and the fact that the intrinsic mean is general linear congruence equivariant, i.e., .
Table 1 displays the empirical coverage of the percentile bootstrap confidence regions for simulated samples , with a centrally symmetric distribution around the identity matrix simulated from a -generalized normal distribution (-GND) equivalent to the data generating processes in Figure 5. The column Ave.- displays the average lower depth confidence bounds, using the notation for as in eq.(LABEL:eq:5.1). The column Ave.-Size displays the distance of the center of the confidence ball to the furthest boundary, i.e., , averaged across simulations, and the coverage is the proportion of times the identity matrix has a depth value larger or equal to the lower depth bound .
6 Analysis of multicenter clinical trial data
The intrinsic data depth functions provide a fast and intuitive procedure to explore samples of covariance matrices by identifying most central or most outlying covariance matrices, based on the Riemannian geometry of the space. This is illustrated by the exploratory analysis of a collection of sample covariance matrices obtained from 246 clinical centers (C1-C246), which have been anonymized for reasons of confidentiality. For each clinical center, medical analysts have recorded the height (ht), weight (wt), systolic blood pressure (systol) and diastolic blood pressure (diastol) for a number of clinical center patients. As part of a broader analysis, we explore the variability among clinical centers in terms of the second-order behavior, i.e., the variance-covariance structure, of the measured variables. On the one hand, we wish to identify outlying clinical centers to be flagged for further inspection or removal in subsequent data analysis. On the other hand, we are interested in the average or mean behavior of the sample covariance matrices across clinical centers.
Addressing the first objective, the left image in Figure 7 displays the 15 most central depth-ranked clinical centers (from left to right, with most central clinic C107) based on the geodesic distance depth applied to the collection of 246 -dimensional symmetric positive definite covariance matrices. The bottom rows display the six symmetric cross-correlations ht-wt, systol-wt, diastol-ht, systol-wt, diastol-wt and diastol-systol. In addition, the top rows display the four variances ht-ht, wt-wt, systol-systol and diastol-diastol, providing information about the scale of the covariance matrices. The right image in Figure 7 displays the 15 most outlying depth-ranked clinical centers (from right to left, with most outlying clinic C224) based on the geodesic distance depth in the same fashion. The center-to-outward orderings obtained via the intrinsic zonoid depth are comparable and can be found in the supplementary material. We point out that the data depth functions capture clinical centers that are outlying primarily in terms of the correlation- or covariance-structure, (e.g., center C191), primarily in terms of the variance-structure, (e.g., center C170), or both, (e.g., center C71). Regarding the second objective, to assess the average behavior across covariance matrices, we display in Figure 8 the intrinsic sample mean of the set of 246 sample covariance matrices across clinical centers, including a 95- intrinsic geodesic distance depth percentile bootstrap confidence region. Here, the left-hand image displays the four variances and the right-hand image displays the six cross-correlations analogous to the decomposition in Figure 7. The grey confidence region displays the bootstrapped sample means contained in the confidence region . In particular, a covariance matrix is included in the confidence region if and only if , where is the empirical distribution of the bootstrapped sample means and denotes the lower depth-bound as in Section 5.
7 Concluding remarks
In this paper, we studied intrinsic data depth measures acting on the Riemannian manifold of symmetric or Hermitian PD matrices. The primary focus of this work is on the Riemannian manifold equipped with the affine-invariant metric, as this is the only metric that is invariant under congruence transformation of the data as described in property P.1 in Section 3.1. However, the construction of the depth functions does not fundamentally rely on the affine-invariant metric and the equivalent notions of properties P.2 to P.6 are expected to hold for any Riemannian metric that constitutes a geodesically complete manifold, such as the Log-Euclidean metric as discussed in Arsigny et al. (2006) among others. For each of the proposed intrinsic depth functions, (including the intrinsic spatial depth), the sample data depth values are straightforward to compute and remain computationally efficient also for relatively high-dimensional matrices, with implementations directly available in the R-package pdSpecEst, Chau (2017). As such, the data depths serve as an easy-to-use data exploration tool, but also provide a practical framework for inference in the context of random samples of HPD matrices, as illustrated in Section 5 through the construction of depth-based confidence regions.
Additional material available in the package pdSpecEst includes implementations of several intrinsic rank-based hypothesis tests, replacing the ordinary ranks by the depth-induced ranks analogous to Liu and Singh (1993), Chenouri and Small (2012), or (Mosler, 2002, Chapter 5) for samples of Euclidean vectors. Another interesting application of the intrinsic data depth is depth-based classification or clustering for groups or samples of covariance matrices analogous to e.g., Li et al. (2012). To conclude, Hermitian or symmetric positive definite matrices play an important role in many different fields of statistical research, see Pennec et al. (2006), and it is of interest to apply the intrinsic data depths in other contexts than demonstrated in this paper. For instance, applied to diffusion tensor imaging, the depth functions show potential for fast detection of anomalies or artifacts in large collections of SPD diffusion tensors.
Acknowledgements
The authors gratefully acknowledge the financial support from the following agencies and projects: the Belgian Fund for Scientific Research FRIA/FRS-FNRS (J. Chau), the contract ‘Projet d’Actions de Recherche Concertées’ (ARC) No. 12/17-045 of the ‘Communauté française de Belgique’ (J. Chau and R. von Sachs), IAP research network P7/06 of the Belgian government (R. von Sachs), the US National Science Foundation and KAUST (H. Ombao). We thank Lieven Desmet and the SMCS/UCL for providing access to the anonymized clinical trial data. Computational resources have been provided by the supercomputing facilities of the CISM/UCL and the CÉCI funded by the FRS-FNRS under convention 2.5020.11.
8 Appendix I: Proofs
8.1 Proof of Theorem 3.1
Proof.
P.1 This is a direct consequence of the claim that the following two events are equivalent:
[TABLE]
with the probability measure of and the probability measure of , where has probability measure . Here, the Euclidean zonoid trimmed region is represented as a set of -dimensional Hermitian matrices, instead of an equivalent set of -dimensional real basis component vectors, as in Section 3.2, and is the zero matrix. For , the equivalence in eq.(8.1) is true by definition, since .
Suppose that for some . Noting that can be identified by the real vector space of Hermitian matrices for each , by definition of the zonoid -trimmed region, there exists a measurable function , such that,
[TABLE]
It is straightforward to verify that for each and , . Define , then is a measurable function such that,
[TABLE]
and,
[TABLE]
Therefore . The other direction follows by a similar argument, using that .
P.2 The zonoid trimmed region contains the single point by construction. The deepest point is therefore characterized by the point that satisfies . By eq.(2.4) in the main document, on the Riemannian manifold with , this point is the uniquely existing geometric expectation of the distribution .
P.3 Using the equivalent definition , by construction is a geodesically convex set that contains the geometric mean for each . Also, for each . Combining the above arguments, it follows that a geodesic curve , with increasing, has monotone non-increasing depth as it moves further away from the center .
P.4 With the same notation as above, for we claim that the sets are closed and bounded, and therefore also compact by the Hopf-Rinow theorem. The fact that the sets are closed follows directly from the definition of . The fact that they are bounded is seen as follows; for , by construction . Therefore, if , necessarily , which follows by the fact that both Id and are elements of , combined with (Bhatia, 2009, Theorem 6.1.6). Let be an unbounded sequence, such that as . The divergence implies in particular also that , which violates the boundedness (or compactness) of for , and therefore we must have . ∎
8.2 Proof of Lemma 3.2
Proof.
By definition of the intrinsic zonoid trimmed regions with as in eq.(8.1). The distribution has finite first moment with respect to the Riemannian metric in , since
[TABLE]
using eq.(2.3) in the main document and the fact that . By (Mosler, 2002, Theorem 3.13) for a probability measure defined on with finite first moments,
[TABLE]
where denotes the convex hull of the support of in , based on the Riemannian metric on , i.e., a rescaled Euclidean metric. Using the above result, we write out:
[TABLE]
where the last step follows by definition as the geodesic convex hull of the support of on the manifold. ∎
8.3 Proof of Theorem 3.3
8.3.1 Continuity in (P.5)
Proof.
We argue that the map is both upper- and lower-semicontinuous for .
Upper-semicontinuity: the map is upper-semincontinuous if and only if for each the sets are open in or equivalently the sets are closed in . If , , and is closed in itself. If , note that we can rewrite , since on the one hand, if , then , and on the other hand, if , then by nestedness of the intrinsic zonoid trimmed regions. For each , by construction is closed, therefore is also closed.
Lower-semicontinuity: the map is lower-semicontinuous if and only if for each the sets are closed in or equivalently the sets are open in . If , , and the empty set is open in . If , by Lemma 3.2, and is open in itself. If , note that we can rewrite , where,
[TABLE]
with measurable. To see that the set-equivalence is true: on the one hand, if , then , since . On the other hand, if , take sufficiently small such that , then . As a consequence, by nestedness of the intrinsic zonoid trimmed regions. For , distinguish between two cases: (i) , then the set is open as is open in itself, (ii) . In this case, writing for the relative boundary of the geodesic convex set in , we note that . Here, the relative boundary of is characterized by those points in for which the weighting function attains the maximum value . Since , it follows that is open. By combining the above arguments, we conclude that is open for each .
Since the map is both upper- and lower-semicontinuous on it is also continuous on . ∎
8.3.2 Uniform continuity in (P.6)
Proof.
Pointwise convergence of depths: first, we show pointwise convergence of to for each , where denotes the relative interior of the geodesic convex set . We note that if and only if , where is the convex hull of the support of in as in the proof of Lemma 3.2. This is seen as follows: by Lemma 3.2, if and only if , such that , but this is equivalent to which holds if and only if by (Mosler, 2002, Theorem 3.13). Since the sets and are equivalent their relative interiors are equivalent as well. By Definition 3.1, , where is the distribution of as a -dimensional real basis component vector, with , such that . Similarly, . By the same argument as in the proof of Lemma 3.2, we know that for each , where denotes the set of probability measures on with finite first moment, i.e., if then . Furthermore, the sequence of measures is uniformly integrable with respect to the Riemannian metric in , since for any ,
[TABLE]
By (Mosler, 2002, Theorem 4.6), under these conditions, for or equivalently , it follows that,
[TABLE]
Uniform convergence of depths: uniform depth convergence now follows from the pointwise depth convergence above by a generalized version of the proof of (Dyckerhoff, 2016, Theorem 4.8) for the complete metric space , using Lemma 3.2 and the fact that is a normed geodesically convex depth, continuous in by the first part of Theorem 3.3. Since the proof is completely analogous to the proof of (Dyckerhoff, 2016, Theorem 4.8), we omit the details here. Note that the only required modification is to replace the Euclidean metric space by the complete metric space . In particular, Euclidean open balls, convex sets and convergence are replaced by geodesic open balls, geodesic convex sets and convergence in the Riemannian distance function respectively.
By the generalized proof of (Dyckerhoff, 2016, Theorem 4.8), the depths are continuously convergent for . That is, for in the sense that , also . By (Dyckerhoff, 2016, Proposition A.1), since is a metric space, continuous convergence of the depths implies compact convergence, i.e., for every compact set ,
[TABLE]
Consequently, by (Dyckerhoff, 2016, Theorem 4.4), compact convergence implies uniform convergence, since the arguments in the proof of (Dyckerhoff, 2016, Theorem 4.4) continue to hold for the intrinsic zonoid depth defined on the complete metric space , where closed and bounded subsets are compact. ∎
8.4 Proof of Theorem 3.4 and Proposition 3.5
Proof.
Properties P.1–P.4 follow directly by Theorem 3.1, using the definition of the depth as the integrated pointwise zonoid depth (integrated over ).
For the first part (P.5) of Proposition 3.5: using that , by the first part of Theorem 3.3, uniformly over . By definition of the integrated intrinsic zonoid depth also,
[TABLE]
by the pointwise convergence and the fact that the depth function is bounded.
For the second part (P.6) in Proposition 3.5: under the given assumptions, by the second part of Theorem 3.3,
[TABLE]
and similarly as above,
[TABLE]
using the pointwise convergence and the fact that the depth function is bounded. ∎
8.5 Proof of Theorem 3.6
Proof.
P.1 This follows directly from the definition of the depth by the fact that the map with is distance preserving, i.e., for each .
P.2 Since and is strictly decreasing in , the point of maximum depth is attained at . By eq.(2.5) in the main document, on the Riemannian manifold with , this point is the uniquely existing geometric median of the distribution .
P.3 By the proof of (Fletcher et al., 2009, Theorem 1) and an application of Leibniz’s integral rule, is a (strictly) convex function, and by P.2 it attains its unique minimum at . This implies that is a nondecreasing function for , where is a geodesic curve emanating from with unit tangent vector . As a consequence is monotone non-increasing for .
P.4 Let be an unbounded sequence such that as , then also for each , and as a consequence . ∎
8.6 Proof of Theorem 3.7
8.6.1 Continuity in (P.5)
Proof.
First, suppose that is an unbounded sequence as , i.e., , where is a singular matrix. Since , by P.4 in Theorem 3.6, . Second, suppose that is a bounded sequence, i.e., . Since , there exists an such that . By the triangle inequality,
[TABLE]
using that as both Id and are elements of , (see (Bhatia, 2009, Theorem 6.1.6)). We show continuity directly from the definition of the geodesic distance depth. The function is continuous in , also the function is continuous in , since is a composition of continuous functions. Furthermore, by the dominated convergence theorem, , since . Combining these arguments, . ∎
8.6.2 Uniform continuity in (P.6)
Proof.
We start by noting that the uniform integrability condition implies in particular that for each . Also, since is continuous in , by the continuous mapping theorem , with and , and by Vitali’s convergence theorem for any . Note that the convergence implies in particular also that . For two measures define their -Wasserstein distance as:
[TABLE]
where denotes the collection of all probability measures on with marginal measures and . Substituting , the point measure in , it follows that . Therefore, a sufficient condition to ensure uniform convergence in of to , is , since
[TABLE]
where the last step follows by the reverse triangle inequality for the -Wasserstein distance. The manifold is a complete separable metric space, and therefore by (Villani, 2009, Theorem 6.9) a necessary and sufficient condition for is that the sequence of probability measures converges weakly in to , i.e., (i) and (ii) for any . Condition (i) holds by assumption, and condition (ii) has already been shown above.
The function is uniformly continuous for , therefore the uniform convergence of the geodesic distance depth follows as well since,
[TABLE]
∎
8.7 Proof of Theorem 3.8 and Proposition 3.9
Proof.
Properties P.1–P.4 follow directly by the pointwise depth properties in Theorem 3.6, using the definition of the depth in terms of the integrated Riemannian distance (integrated over ).
For the first part (P.5) of Proposition 3.9: using that , by the first part of the proof in Theorem 3.7 also,
[TABLE]
and as a direct consequence . Using again that is continuous in , the composition converges as well and we conclude that .
For the second part (P.6) of Proposition 3.9. Denote by and respectively the distributions of and , such that and . Let be a continuous and bounded function and write for a curve with for each . Then for any curve ,
[TABLE]
where the last step follows by the fact that for each the composition is again a continuous and bounded function, and the fact that uniformly in . Thus, for any curve , the weak convergence holds as well uniformly in . By the uniform integrability of uniformly in , combined with Vitali’s convergence theorem, it follows that for each curve ,
[TABLE]
By the same argument as in the second part of the proof of Theorem 3.7, a sufficient condition for uniform convergence in of to is the condition . Again by (Villani, 2009, Theorem 6.9), the convergence is implied by the conditions (i) uniformly in , which holds by assumption and (ii) the convergence in eq.(8.5) pointwise in .
The function is uniformly continuous for , therefore the uniform convergence of the integrated geodesic distance depth follows as well,
[TABLE]
∎
8.8 Proof of Proposition 4.1
Proof.
First, we verify that .
Let be contaminating observations, such that for some . Denote for the empirical distribution of the contaminated sample . For each ,
[TABLE]
using the triangle inequality for each . Since , for each . Therefore, , with the -th depth ranked observation in the sample . As we can choose , such that for arbitrarily large, with can be made arbitrarily large by adding contaminating observations. This implies that .
Second, we verify that .
Consider the contaminated sample , with . If we can show that for each and each . Then , , such that and consequently , denoting . The latter implies that it takes at least contaminating observations to make arbitrarily large for , i.e., . It remains to show that for each and each .
Let and arbitrary, then:
[TABLE]
Let us denote , and .
First, by the triangle inequality . Therefore, by the reverse triangle inequality, ,
[TABLE]
Also, by definition of and , ,
[TABLE]
Without loss of generality, assume that , where . Denoting Id for the identity matrix, it follows that,
[TABLE]
Here, we used two triangle inequalities and the fact that by definition of the Riemannian distance. Combining eq.(8.7-8.9) above yields:
[TABLE]
where we used that by the fact that and by eq.(8.9). Returning to eq.(8.8), it follows that . As this result holds for any and , we conclude that . Since also , it follows that , which finishes the proof. ∎
9 Appendix II: Additional figures
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Arsigny et al. (2006) Arsigny, V., P. Fillard, X. Pennec, and N. Ayache (2006). Log-Euclidean metrics for fast and simple calculus on diffusion tensors. Magnetic Resonance in Medicine 56 (2), 411–421.
- 2Bhatia (2009) Bhatia, R. (2009). Positive Definite Matrices . Princeton University Press.
- 3Chau (2017) Chau, J. (2017). pd Spec Est: An Analysis Toolbox for Hermitian Positive Definite Matrices (v 1.2.1) .
- 4Chau and von Sachs (2017) Chau, J. and R. von Sachs (2017). Intrinsic wavelet regression for curves of Hermitian positive definite matrices. Ar Xiv preprint 1701.03314 .
- 5Chen (1995) Chen, Z. (1995). Bounds for the breakdown point of the simplicial median. Journal of Multivariate Analysis 55 (1), 1–13.
- 6Chenouri and Small (2012) Chenouri, S. and C. Small (2012). A nonparametric multivariate multisample test based on data depth. Electronic Journal of Statistics 6 , 760–782.
- 7Dai and Guo (2004) Dai, M. and W. Guo (2004). Multivariate spectral analysis using Cholesky decomposition. Biometrika 91 (3), 629–643.
- 8Donoho and Gasko (1992) Donoho, D. and M. Gasko (1992). Breakdown properties of location estimates based on halfspace depth and projected outlyingness. The Annals of Statistics , 1803–1827.
