Geodesic analysis in Kendall's shape space with epidemiological applications
Esfandiar Nava-Yazdani, Hans-Christian Hege, T. J. Sullivan and, Christoph von Tycowicz

TL;DR
This paper develops analytical tools for geodesic analysis in Kendall's shape space, enabling efficient Riemannian optimization and applying it to longitudinal epidemiological shape data for early osteoarthritis prediction.
Contribution
It introduces analytical expressions for Jacobi fields and parallel transports in Kendall's shape space, significantly reducing computational costs for geodesic regression.
Findings
Identified shape differences between osteoarthritis and control groups
Demonstrated efficient geodesic regression in shape analysis
Enabled early prediction of knee osteoarthritis using shape data
Abstract
We analytically determine Jacobi fields and parallel transports and compute geodesic regression in Kendall's shape space. Using the derived expressions, we can fully leverage the geometry via Riemannian optimization and thereby reduce the computational expense by several orders of magnitude over common, nonlinear constrained approaches. The methodology is demonstrated by performing a longitudinal statistical analysis of epidemiological shape data. As an example application we have chosen 3D shapes of knee bones, reconstructed from image data of the Osteoarthritis Initiative (OAI). Comparing subject groups with incident and developing osteoarthritis versus normal controls, we find clear differences in the temporal development of femur shapes. This paves the way for early prediction of incident knee osteoarthritis, using geometry data alone.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\automark
[section]section \setkomafontpageheadfoot \setkomafontpagenumber \clearpairofpagestyles
\cohead\xrfill[0.525ex]0.6pt \theshorttitle \xrfill[0.525ex]0.6pt \cehead\xrfill[0.525ex]0.6pt \theshortauthor \xrfill[0.525ex]0.6pt \cfoot*\xrfill[0.525ex]0.6pt \pagemark \xrfill[0.525ex]0.6pt
Geodesic analysis in Kendall’s shape space
with epidemiological applications111This is a preprint version of an article to appear in the Journal of Mathematical Imaging and Vision © Springer with the DOI 10.1007/s10851-020-00945-w and differs from the final published version in layout and typographical detail.
Esfandiar Nava-YazdaniLABEL:affiliationZIB
Hans-Christian HegeLABEL:affiliationZIB
T. J. SullivanLABEL:affiliationZIB,LABEL:affiliationFUB
Christoph von TycowiczLABEL:affiliationZIB
1
2
(a; for fanning (Louis et al., 2018); Utilizing closed form expressions of the pre-shape sphere, we reduce parallel transport to the solution of a homogeneous first-order differential equation that allows for highly efficient computations. Moreover, we reduce the important case of parallel transport along a geodesic path to the solution of a low-dimensional equation that only depends on the dimension of the ambient space and not on the spatial resolution of the discrete representation.; The paper is organized as follows. In Section 2, after a short overview of Kendall’s shape space, we provide a computationally efficient approach (via the so-called Sylvester equation) for the canonical decomposition of tangent vectors into horizontal and vertical components, which is essential for the geometry and analysis of shapes and trajectories. Moreover, we determine parallel transport and Jacobi fields, which will be employed for geodesic regression. Parallel transport is essential for statistical normalization, alignment of trajectories and also computation of Jacobi fields. The latter describes the variability of trajectories that will be modeled as best-fitting geodesics in Section 3, where we also present our algorithm for the computation of geodesic regression. In Section 4 we apply this algorithm to yield longitudinal statistical analysis of femur data from an epidemiological study dealing with osteoarthritis and discuss the numerical results. ; under rotations; Denoting the columns of by and their Euclidean mean by , in order to remove translations, we replace by .; λm-1≥; with respect to the metric induced by the ambient Euclidean space; ; We recall that, for a Riemannian submersion and , is a submanifold of . For any , denoting the kernel of by , the tangent space to at admits an orthogonal decomposition where and the are the so-called horizontal and vertical subspaces.; Furthermore, a smooth curve is called horizontal if and only if its tangent field is horizontal. Geodesics in the shape space are equivalence classes of horizontal geodesics.; Moreover, the above equation has a unique skew-symmetric solution if ; and ; We observe that both equations and are uniquely solvable, since is invertible. Furthermore, the solution of the first equation is skew-symmetric, since its right-hand side is skew-symmetric. It follows that
[TABLE]
with arbitrary, is skew-symmetric and solves the Sylvester equation (2) which also implies that is symmetric. Hence is the vertical component of . If , then . If has full rank, then ; ; and ; U; derive formulas for; (4); t; The last equation follows from the fact that is skew-symmetric and is symmetric, hence their product is trace-free.; Thus is also symmetric.; ; due to skew-symmetry of ; Next, we derive the differential equation for Jacobi fields and provide a constructive approach to its solution utilizing parallel transport.; We recall that a smooth horizontal curve in is a geodesic if and only if is a geodesic in .;
Theorem 2.4**.**
Let be a normal vector field along and denote
[TABLE]
- (a)
* is a Jacobi field if and only if*
[TABLE] 2. (b)
A normal Jacobi field of the pre-shape sphere projects to a Jacobi field if and only if .
Proof. (a) Obviously, solutions of the equations are, due to -equivariance of horizontal and vertical projection, invariant under action. Let and be vector fields on . Following O’Neill (1966), the and -tensor fields are defined as
[TABLE]
Due to (O’Neill, 1967, Theorem 2), is a Jacobi field if and only if
[TABLE]
where , stands for and the vector field is given by
[TABLE]
Note that as is normal, its covariant and Euclidean derivative coincide. In our setting, the fibers are totally geodesic, hence . Therefore . Moreover, we may suppose . Thus and we arrive at (9) and (10). (b) It follows immediately from . In the following, we give a geometric construction for normal Jacobi fields which will be employed for geodesic regression. For , we set , where denotes the solution (cf. Lemma 2.1) of the Sylvester equation
In the sequel , , , and .
Theorem 2.5**.**
Suppose that has unit speed . Let be the solution of the differential equations (9), (10) with , and \frac{{\leavevmode\color[rgb]{0,0,0}\mathrm{D}}J^{h}}{dt}(0)=\xi_{2}. Then and the following hold.
- (a)
Let and denote the parallel extensions of resp. along . Then
[TABLE] 2. (b)
Suppose that . Let denote the orthogonal projection of tangent to and its orthogonal complement. Then
[TABLE]
where and are parallel extensions of and .
Proof. We may write with , and . The variation
[TABLE]
is a variation of through horizontal geodesics since is symmetric, and defines a vertical Jacobi field with . (a) As any Jacobi field is a linear combination of parallel vector fields, and those vector fields conserve orthogonality and length, we may assume {\leavevmode\color[rgb]{0,0,0}\|}\xi_{i}\|=1. Furthermore, due to and , and . Now, let denote the h-parallel transport of along the geodesic given by . Due to (5), we have . Now, consider the variations of given by
[TABLE]
and
[TABLE]
A straightforward computation shows that is symmetric, i.e., is a variation of through horizontal geodesics. Hence is a Jacobi field, where . Therefore, is a Jacobi field. Moreover, and . Hence the solution with and \frac{{\leavevmode\color[rgb]{0,0,0}\mathrm{D}}J^{h}}{dt}(0)=\xi_{2} is given by , where . It follows that the solution with and is given by . Furthermore, and . The fact that the space of horizontal vector fields along is linear, completes the proof. (b) Let . Then enjoys the orthogonal decomposition , where , and . Moreover, and for some . A straightforward computation shows (note that ). Hence . Now, the vector is horizontal (cf. Remark 1) and normal. Hence the vector is also horizontal and normal. Therefore its parallel extension is given by . Using , we arrive at , i.e. . Moreover, utilizing the fact that is minus indentity, we have . Similarily, for the constant vector field , we have . Implying in the expression of from part a), we arrive at the desired formula. ; , the so-called slerp (spherical linear interpolation),; Having in mind that (cf. Pennec (2006) and Jost (2017))
[TABLE]
where , the gradient of the cost function can be computed using Jacobi fields, since they express the derivatives of the exponential map and therefore those of . Now, for fixed and , let denote the gradient of with respect to where . Then for any , where is the horizontal Jacobi field along with and . In the following, and denote tangent resp. orthogonal components of vectors. Now, let denote the unit speed horizontal geodesic from to , i.e., with , and . Denoting the horizontal component of the parallel extension of along by , and , due to Theorem 2.5, is a Jacobi field with and . Reparametrization only changes the tangent component of the Jacobi field. Moreover, horizontal projection does not depend on the parametrization. Due to the fact that parametrizes the reverse geodesic by arc length (), we arrive at . Hence
[TABLE]
Let denote the h-parallel transport to along . In view of (12), is the adjoint of the mapping . As the latter is self-adjoint it follows that
[TABLE]
where . To get the gradient of with respect to , we simply replace by (another advantage of employing the parametrization (1)) and arrive at
[TABLE]
where . Now, our procedure for geodesic regression can be summarized as presented in Algorithm 1.
; An overview of OA-related dysmorphisms is shown in Figure 1.; (cf. Absil et al. (2007)). In particular, for the intrinsic treatment of the optimization problem underlying the geodesic regression we use the open-source Matlab toolbox manopt (Boumal et al., 2014). In our experiments; routine for nonlinear regression (viz. fitnlm); The geodesic representation provides a less cluttered visualization of the trajectory population making it easier to identify trends within as well as across groups. For 2-dimensional visualization we perform dimension reduction for the trajectories with , i.e. we apply tangent PCA to at the mean of all baseline shapes in HH. In the remainder, the latter is referred to as reference shape Ref. ; Principal components; Principal components; group-wise medians (95% confidence intervals) of 0.40 (0.33–0.46), 0.55 (0.48–0.63), and 0.51 (0.40–0.72); variability is better caputured by a single variable (time) than the physiological trends in HH;
; In order to perform group-wise analysis of longitudinal shape changes we compare the estimated geodesic trends of the femoral trajectories. This requires the consistent integration of intra- and inter-subject variability in order to obtain statistically significant localization of changes at the population level. In fact, the comparison of longitudinal shape changes is usually performed after normalizing (i.e. transporting) them into a common system of coordinates (see Lorenzi and Pennec (2014) and the references therein). Such a normalization can be realized by adapting parallel transport as presented in Algorithm 2. In particular, for geodesic trends this scheme reduces to parallel transport of the subject-specific velocity along the baseline-to-reference shape geodesic. The group-wise longitudinal progression was modeled as the mean of the transported velocities. The areas of significant differences between longitudinal changes were investigated by two-sample Hotelling’s tests on the vertex-wise displacements corresponding to the transported velocity-fields. While the displacements differ significantly (, after Benjamini-Hochberg false discovery correction) between normal controls and the OA groups (for 55% and 19% of the vertices for HH vs. HD and HH vs. DD, respectively), there are no differences in the longitudinal changes in-between both OA groups. Figure 3 shows a qualitative visualization of the group tests in terms of the magnitude of the difference between the group-wise means. Visible are changes along the ridge of the cartilage plate (characteristic regions for osteophytic growth, cf. Figure 1) in comparison of both HH vs. HD as well as HH vs. DD, albeit the latter are less pronounced suggesting a saturation of morphological developments. Additionally, the changes are more developed on the medial compartment, which is in line with previous findings (Vincent et al., 2012). ; Group-wise analysis of femoral geodesic trends: Magnitude of differences between the group-average trends for HH vs. HD (left column), HH vs. DD (middle column), and HD vs. DD (right column) after transport to common reference shape. Only significantly different displacements () are shown (2.0e-4 4.2e-4). ; Group-wise analysis of femoral geodesic trends: Magnitude of differences between the group-average trends for HH vs. HD (left column), HH vs. DD (middle column), and HD vs. DD (right column) after transport to common reference shape. Only significantly different displacements () are shown (2.0e-4 4.2e-4). ; While velocities are constant for subject-specific geodesics, their parallel transport depends on the path (an effect called holonomy). To investigate this path dependency, we repeated the above experiment using different paths for the HD group. In particular, we chose the shape at the onset time (transition time to severe OA, viz. Kellgren–Lawrence score 3) as the initial point for the transport path. In line with previous work (Lorenzi and Pennec, 2013), we found that the results are not sensitive to the path. More precisely, the results of the group tests agreed for 99.70% and 100.00% of the vertices for HH vs. HD and HD vs. DD, respectively. ; First, we would like to use the presented methodology within the mixed-effect framework (see e.g. Bône et al. (2018)), which provides a joined analysis of longitudinal and cross-sectional variability.; On the application side, based on the results found, it can be said in summary that the shape trajectories of the healthy subjects expose significantly different temporal changes than those found in groups with incident and developing OA. Our analysis delivered detailed insights into the complex morphological changes that fit medical knowledge.)
Abstract
Abstract:
We analytically determine Jacobi fields and parallel transports and compute geodesic regression in Kendall’s shape space. Using the derived expressions, we can fully leverage the geometry via Riemannian optimization and thereby reduce the computational expense by several orders of magnitude over common, nonlinear constrained approaches. The methodology is demonstrated by performing a longitudinal statistical analysis of epidemiological shape data. As an example application we have chosen 3D shapes of knee bones, reconstructed from image data of the Osteoarthritis Initiative (OAI). Comparing subject groups with incident and developing osteoarthritis versus normal controls, we find clear differences in the temporal development of femur shapes. This paves the way for early prediction of incident knee osteoarthritis, using geometry data alone.
Keywords:
Longitudinal modeling Shape trajectory Riemannian metric Principal geodesic analysis Geodesic regression Parallel transport Jacobi fields
ZIBZuse Institute Berlin, Takustraße 7, 14195 Berlin, Germany. (, , ,
) FUBFreie Universität Berlin, Arnimallee 6, 14195 Berlin, Germany. ()
1 Introduction
In recent years, there has been an increased interest in statistical analysis of geometric shapes. Such analyses are especially often performed in the field of morphometry, but mostly for static forms. A frequently-encountered situation, however, is that instead of a set of discrete shapes, series of shapes are given, often together with co-varying parameters. For example, longitudinal imaging studies track biological shape changes over time within and across individuals to gain insight into dynamical processes such as ageing or disease progression. Statistical modeling and analysis of shapes is of critical importance for better understanding of such temporal shape data.
The main challenge is that shape variability is inherently nonlinear and high-dimensional, so that classical statistical approaches are not always appropriate. One way to address this is linearization. The quality of the resulting statistical model, however, then depends strongly on the validity of the linearity assumption, i.e. that the observed data points lie to a good approximation in a flat Euclidean subspace. Since the natural variability in populations often leads to a large spread in shape space and the observed data may lie in highly-curved regions (see Huckemann and Hotz (2014)), linearity often cannot be assumed in practical applications.
In the context of longitudinal studies, an important task is to estimate continuous trajectories from sparse and potentially noisy samples. For smooth individual biological changes, subject-specific spatiotemporal regression models are adequate. They also provide a way to describe the data at unobserved times (i.e. shape changes between observation times and — within certain limits — also at future times) and to compare trends across subjects in the presence of unbalanced data (e.g. due to drop-outs). One approach in use is to approximate the observed temporal shape data by geodesics in shape space and, based on these, to estimate overall trends within groups. Geodesic models are attractive as they feature a compact representation (similar to the slope and intercept term in linear regression) and therefore allow for computationally efficient inference.
The intrinsic theory of least squares and geodesic regression in shape spaces has been introduced in Fletcher (2013). For the derivation of the corresponding Euler–Lagrange equations for some important manifolds, we refer to Machado and Silva Leite (2007). An extension to intrinsic Riemannian polynomials has been considered in Hinkle et al. (2014). Earlier related results in the framework of large deformation diffeomorphic metric mapping (LDDMM) can be found in Qiu et al. (2008) and Qiu et al. (2009). In Banerjee et al. (2016), the authors present a kernel-based generalization of geodesic regression to manifold-valued longitudinal parameters. For an overview of statistical analysis on Riemannian manifolds see Huckemann and Hotz (2014) and Pennec (2006).
An additional challenge in the analysis of shape trajectories is to distinguish between morphological differences due to (i) temporal shape evolutions of a single individual and (ii) the geometric variability in a population of an object class under study. To obtain a statistically significant localization of structural changes at the population level (group-wise statistics), the subject-specific trajectories need to be transferred in a standard reference frame. Among the different techniques proposed for normalizing longitudinal deformations (Rao et al., 2004; Bossa et al., 2010), constructions based on parallel transport provide the most natural approach and have shown superior sensitivity and stability in the context of diffeomorphic registration (Lorenzi et al., 2011). Note also that, for general trajectories, the simple transport of each shape is not suitable because the distances between the shapes are not preserved. However, if the shapes belong to the same geodesic, this problem does not arise, which is another advantage of geodesic regression.
As parallel transport in curved shape spaces is rarely given in closed form, in general it has to be approximated numerically, e.g. employing Schild’s ladder (Lorenzi et al., 2011) . For shapes in 2D, Kendall’s shape space is isomorphic to the projective space, which is a symmetric space, so that the essential geometric quantities are well known (cf. Huckemann et al. (2010) and Fletcher (2013)). However, for three and more dimensions, because of less restrictive structure, many questions remain open.
2 Geodesic Analysis in Shape Space
A pre-shape is a -ad of landmarks (i.e. particular points) in after removing translations and similarity transformations. A shape is a pre-shape with rotations removed. For a comprehensive introduction to Kendall’s shape space and details on the subjects of this section, we refer to Kendall et al. (1999). For the relevant tools from Riemannian geometry, we refer to Gallot et al. (2005).
2.1 Shape Space
In the following we present a brief overview of Kendall’s shape space, provide a computationally efficient method to determine horizontal and vertical components of tangent vectors of the pre-shape space, and also prove the corresponding equivariance .
Let , where denotes the space of real matrices. The result , identified with , will be endowed with its canonical scalar product given by . Denoting the Frobenius norm by \|\hbox to5.71527pt{\hss\cdot\hss}\|, we call the sphere pre-shape space and endow it with the spherical Procrustes metric . Now, the left action of on given by defines an equivalence relation given by if and only if for some . Kendall’s shape space is defined as . Provided that , the dimension of is . Now, denoting the canonical projection of by , the induced distance between any two shapes and is given by
[TABLE]
where denote the pseudo-singular values of . Denoting , it turns out that inherits a differential structure that is compatible with its quotient topology. Following Kendall et al. (1999), we refer to as the singular part of . In particular, is a strata of manifolds with varying dimensions and is open and dense in . Away from the singular part, the quotient map is a Riemannian submersion . Moreover, for , the shape space (resp. ) is isometric to the sphere (resp. projective space). We call well positioned, and write , if and only if is symmetric and . For each , there exists an optimal rotation such that . Note that does not need to be unique. Let denote a neighborhood in with radius smaller then (the diameter of is ) such that
[TABLE]
For the optimal rotation is unique and the function
[TABLE]
is well-defined.
Due to Kendall et al. (1999) the vertical space at is given by
[TABLE]
and the horizontal space is given by
[TABLE]
We denote the vector space of skew-symmetric real matrices by . Thus .
Now, let and denote the exponential and logarithm map of the pre-shape space. For the geodesic from to given by
[TABLE]
with , is horizontal. Hence realizes the minimizing geodesic from to . The following result concerns determination and -equivariance for horizontal and vertical projection.
Lemma 2.1**.**
Fix and . Let resp. denote the restriction of vertical resp. horizontal projection to .
- (a)
* if and only if solves the Sylvester equation*
[TABLE]
. 2. (b)
Fix . Then .
Proof. For (a), let , i.e., with symmetric and . A straightforward computation eliminating implies that (2) holds. To prove the converse, let . Suppose without loss of generality that and write with
[TABLE]
where is . .
For (b) note that , i.e., implies . Now, where is the solution of . Hence , which implies that .
Henceforth the superscript (resp. ) denotes the vertical (resp. horizontal) component, i.e., for any we have the orthogonal decomposition . Due to the explicit computation above, and , i.e., horizontal and vertical projections are -equivariant. Note that this property holds even if belongs to the singular part of the shape space. As appropriate for our applications and for brevity, unless otherwise specified, we restrict our data to the open and dense set on which is a Riemannian submersion, thus the geometry of the shape space is mainly described by its horizontal lift in the pre-shape space. In particular, for the Sylvester equation (2) has a unique solution determining horizontal and vertical projections and the restriction of to is an isometry of Euclidean vector spaces and . Denoting the covariant derivatives in the pre-shape and shape space by resp. , for horizontal vector fields and we have
[TABLE]
In the following [\hbox to5.71527pt{\hss\cdot\hss},\hbox to5.71527pt{\hss\cdot\hss}] denotes the Lie bracket in , i.e., ( Euclidean). For the Euclidean derivative of a vector field along a curve in we use and also for simplicity of notation a dot, i.e., if , and , etc. We set222Note that the Riemannian exponential map of the shape space denoted by satisfies .
[TABLE]
For the computation of the Fréchet mean (cf. Huckemann et al. (2010) and Pennec (2006)) of the shapes with , i.e.,
[TABLE]
we apply Newton’s method to Karcher’s equation as follows. We search for the unique zero of the function defined by
[TABLE]
and set
[TABLE]
A suitable initial value is the normalized Euclidean mean
[TABLE]
The total variance of reads
[TABLE]
2.2 Parallel Transport
Next, we parallel transport in the shape space and its relation to parallel transport in the pre-shape space.333Essentially, part (a) of Proposition 2.2 was recently also obtained by Kim et al. (2018).
We call a vector field along a horizontal curve horizontally parallel (for brevity h-parallel) if and only if is horizontal and is parallel along . In the following, we derive the differential equation for the h-parallelism of and a corresponding constructive approach using a Sylvester equation in certain cases.
Proposition 2.2**.**
Let be a smooth horizontal curve with initial velocity , a horizontal vector at and a vector field along with .
- (a)
The vector field is h-parallel transport of along if and only if where is the unique solution of
[TABLE] 2. (b)
Suppose that is a unit-speed geodesic. Then equation (4) reduces to
[TABLE] 3. (c)
Let denote the orthogonal projection of on , i.e. . Suppose that is horizontal. If is a unit-speed geodesic, then the h-parallel transport of is given by
[TABLE]
where denotes the Euclidean parallel extension of along , i.e., f.a. . If with , then the h-parallel transport of along to reads
[TABLE]
Proof. (a) is parallel along if and only if , i.e., infinitesimal variation of must be vertical. Hence , which due to Lemma 2.1 equals with . Moreover, -equivariance of vertical projection implies the well-definedness, i.e., if is parallel, then is parallel for all . Note that existence and uniqueness of the solution for with is immediate from the existence and uniqueness of parallel transport and vertical projection. Now, is horizontal if and only if where , since . If equation (4) holds, then
[TABLE]
and
[TABLE]
Now, we arrive at , i.e., remains horizontal. To prove the converse, note that if is horizontal, then and therefore vanishes. Hence and and the Sylvester equation for the vertical component of reads . Thus (4) follows.
(b) Note that and are symmetric and . Now, (4) implies
[TABLE]
(c) Obviously given by (6) satisfies the initial condition . Moreover, it satisfies , i.e., (5) holds with . To prove (7), insert and into (6).
Note that, , the differential equation for the h-parallel transport can also be written as
[TABLE]
Hence, a vector field along a curve in is parallel if and only if it has a horizontal lift satisfying the above equation.
Remark 2.3**.**
We mention two cases such that (6) and (7) apply. First, coincides with the spherical parallel transport of if and only if or, equivalently, . Secondly, for planar shapes. To see this, let and denote the rows of a shape and a horizontal vector at . Fix and let . Then is symmetric since . Hence, is horizontal for arbitrary .
2.3 Jacobi Fields
Hence, for a horizontal geodesic , any geodesic variation of in the latter space reads with a variation of through horizontal geodesic. Thus the variation field \frac{\mathrm{d}}{\mathrm{d}s}(\pi\circ\Gamma(s,\hbox to5.71527pt{\hss\cdot\hss}))|_{s=0}=\mathrm{d}\pi(\frac{\mathrm{d}}{\mathrm{d}s}\Gamma(s,\hbox to5.71527pt{\hss\cdot\hss})|_{s=0}) is a Jacobi field of the shape space. Recall that a vector field along is called normal if and only if and the tangential component of any Jacobi field is just given by with , which is obviously horizontal. Thus the challenge is to find those normal vector fields that project to a Jacobi field in the shape space.
We recall that for , the shape space is isometric to the complex projective space endowed with its standard (Fubini–Study) metric. The given formula for in this case is well-known (cf. Fletcher (2013) and Jost (2017)).
3 Geodesic Regression
In the following, we employ the results of the previous section to derive an efficient and robust approach for finding the relation between an independent scalar variable, i.e. time, and a dependent shape-valued random variable.
Regression analysis is a fundamental tool for the spatiotemporal modeling of longitudinal observations. Given scalars and distinct pre-shapes , the goal of geodesic regression is to find a geodesic curve in shape space that best fits the data in a least-squares sense. In particular for a horizontal geodesic from to with , we define the misfit between the data and the geodesic as a sum of squared distances with respect to , i.e.
[TABLE]
We can assume that and . While the authors of Fletcher (2013) and Machado and Silva Leite (2007) identify geodesics by their initial point and velocity — and hence they consider — we use for the identification their endpoints, i.e., we consider
[TABLE]
The reason is that geodesic computations in terms of the function defined in equation (1) are more efficient. Model estimation is then formulated as the least-squares problem
[TABLE]
In the absence of an analytic solution, the regression problem has to be solved numerically. To this end, we employ a Riemannian trust-regions solver (Boumal et al., 2014) with a Hessian approximation based on finite differences and use as initial guess.
4 Application to Epidemiological Data
In this section, we analyze the morphological variability in longitudinal data of human distal femora in order to quantify shape changes that are associated with femoral osteoarthritis.
4.1 Data Description
We apply the derived scheme to the analysis of group differences in longitudinal femur shapes of subjects with incident and developing osteoarthritis (OA) versus normal controls.
The dataset is derived from the Osteoarthritis Initiative (OAI), which is a longitudinal study of knee osteoarthritis maintaining (among others) clinical evaluation data and radiological images from 4,796 men and women of age 45–79. The data are available for public access at http://www.oai.ucsf.edu/.
From the OAI database, we determined three groups of shapes trajectories: HH (healthy, i.e. no OA), HD (healthy to diseased, i.e. onset and progression to severe OA), and DD (diseased, i.e. OA at baseline) according to the Kellgren–Lawrence score (Kellgren and Lawrence, 1957) of grade 0 for all visits, an increase of at least 3 grades over the course of the study, and grade 3 or 4 for all visits, respectively. We extracted surfaces of the distal femora from the respective 3D weDESS MR images (0.370.37 mm matrix, 0.7 mm slice thickness) using a state-of-the-art automatic segmentation approach (Ambellan et al., 2018). For each group, we collected 22 trajectories (all available data for group DD minus a record that exhibited inconsistencies, and the same number for groups HD and HH, randomly selected), each of which comprises shapes of all acquired MR images, i.e. at baseline, the 12-, 24-, 36-, 48- and 72-month visits. In a supervised post-process, the quality of segmentations as well as the correspondence of the resulting meshes (8,988 vertices) were ensured.
4.2 Geodesic Modeling of Femoral Trajectories
We apply the geodesic regression approach detailed in Section 3 to the femoral shape trajectories described above and represented in Kendall’s shape space. Due to the expressions derived for the parallel transport and Jacobi fields, we can fully leverage the geometry using Riemannian optimization procedures , we observed a superlinear convergence of the intrinsic trust-region solver for most of the shape trajectories. Solving the high-dimensional (54k degrees of freedom) regression problem on a laptop computer with Intel Core i7-7500U (GHz) CPU took about 0.3s on average. In contrast, the generic Matlab required about 25s to determine a solution, thus being two orders of magnitude slower.
The resulting estimated geodesics along with the original trajectories are visualized in Figure 2.
Next we would like to answer the question of how well the observed data is replicated by the estimated geodesic trends. A common approach to test this is to compute the coefficient of determination, denoted as , that is the proportion of the total variance in the data explained by the model. Following Fletcher (2013), a generalization to manifolds is defined as
[TABLE]
with and as defined in equations (11) and (3), respectively. As the unexplained variance cannot exceed the total variance (since the Fréchet mean lies in the search space of the regression problem) and both variances are nonnegative, must lie in the interval (with larger values indicating a higher proportion of the variance being explained by the model).
The coefficients of determination were computed for all estimated trends amounting to for group HH, DD, and HD, respectively. While for all groups the geodesic model is able to describe a relatively large portion of the shape variability, there is a clear difference between the control group HH and the groups DD and HD associated to osteoarthritis. In particular, pairwise Mann–Whitney U tests confirm that the differences are highly unlikely due to random chance (with -values of , and for HH vs. DD, and HH vs. HD, respectively). These findings indicate that the OA-related shape . Based on the coefficient of determination we also test for the significance of the estimated trends employing permutation tests as suggested in Fletcher (2013). For each of the trajectories we performed 1,000 permutations and considered the results as statistically significant for -values less than . In almost all cases (63 out of 66) the trends were significant, such that we can expect them to be highly unlikely due to random chance.
4.3 Group-wise Analysis of Longitudinal Trends
5 Concluding Remarks
This work presented characterizations of and computationally efficient methods for the determination of parallel transport, Jacobi fields and geodesic regression of data represented as shapes in Kendall’s space. Furthermore, an application to longitudinal statistical analysis of epidemiological data (femur data for analysis of knee osteoarthritis) has been shown. An advantage of modeling trajectories by geodesics is the following: A main task in longitudinal analysis is to translate trajectories to start at a reference shape. The intermediate distances between the shapes of a geodesic are preserved by parallel transport, which is not the case for general transports. Moreover, data inconsistencies are minimized by considering the best-fitting geodesics, and Jacobi fields can be employed to analyze the variability of the geodesics, hence providing a canonical descriptor of trends and differences for the trajectories.
There are many potential avenues for future work.
In particular, group-wise means of the geodesics can be computed with respect to a natural metric in the tangent bundle (e.g. the Sasaki metric) to determine the group parameters as described in Muralidharan and Fletcher (2012). Second, an extension of the method to higher-dimensional longitudinal parameters instead of just time can be examined, to achieve even more differentiated results. Third, spline regression poses a natural generalization providing more degrees of freedom.
It seems possible to make a correct assignment to one of the three groups based on just two measurements. The aim of further investigations must be to substantiate this statement, by determining with what reliability a prediction can be made about the onset of knee osteoarthritis depending on the baseline shape and trend as well as the sensitivity of the latter with respect to the number of observations made and the time intervals between them.
Acknowledgements
This research was carried out in the framework of Matheon supported by the Einstein Foundation Berlin (E. Nava-Yazdani), DFG project In-vivo soft tissue kinematics from dynamic MRI (C. von Tycowicz), and the Freie Universität Berlin through the Excellence Initiative of the Deutsche Forschungsgemeinschaft (T. J. Sullivan). Furthermore we are grateful for the open-access dataset provided by the OAI444The Osteoarthritis Initiative is a public-private partnership comprised of five contracts (N01-AR-2-2258; N01-AR-2-2259; N01-AR-2-2260; N01-AR-2-2261; N01-AR-2-2262) funded by the National Institutes of Health, a branch of the Department of Health and Human Services, and conducted by the OAI Study Investigators. Private funding partners include Merck Research Laboratories; Novartis Pharmaceuticals Corporation, GlaxoSmithKline; and Pfizer, Inc. Private sector funding for the OAI is managed by the Foundation for the National Institutes of Health. This manuscript was prepared using an OAI public use data set and does not necessarily reflect the opinions or views of the OAI investigators, the NIH, or the private funding partners. as well as for the open-source software Manopt (Boumal et al., 2014).
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Absil et al. [2007] P.-A. Absil, R. Mahony, and R. Sepulchre. Optimization Algorithms on Matrix Manifolds . Princeton University Press, Princeton, NJ, USA, 2007. URL http://www.manopt.org .
- 2Ambellan et al. [2018] F. Ambellan, A. Tack, M. Ehlke, and S. Zachow. Automated segmentation of knee bone and cartilage combining statistical shape knowledge and convolutional neural networks: Data from the Osteoarthritis Initiative. In Medical Imaging with Deep Learning , 2018. URL https://doi.org/10.1016/j.media.2018.11.009 . · doi ↗
- 3Banerjee et al. [2016] M. Banerjee, R. Chakraborty, E. Ofori, M. S. Okun, D. E. Viallancourt, and B. C. Vemuri. A nonlinear regression technique for manifold valued data with applications to medical image analysis. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 4424–4432, 2016. URL https://doi.org/10.1109/CVPR.2016.479 . · doi ↗
- 4Bône et al. [2018] A. Bône, O. Colliot, and S. Durrleman. Learning distributions of shape trajectories from longitudinal datasets: a hierarchical model on a manifold of diffeomorphisms. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition , pages 9271–9280, 2018.
- 5Bossa et al. [2010] M. N. Bossa, E. Zacur, and S. Olmos. On changing coordinate systems for longitudinal tensor-based morphometry. In Spatiotemporal Image Analysis for Longitudinal and Time-Series Image Data , 2010. CD publication.
- 6Boumal et al. [2014] N. Boumal, B. Mishra, P.-A. Absil, and R. Sepulchre. Manopt, a Matlab toolbox for optimization on manifolds. J. Machine. Learn. Res. , 15:1455–1459, 2014. URL http://www.manopt.org .
- 7Fletcher [2013] P. T. Fletcher. Geodesic regression and the theory of least squares on Riemannian manifolds. Int. J. Comp. Vis. , 105(2):171–185, 2013. URL https://doi.org/10.1007/s 11263-012-0591-y . · doi ↗
- 8Gallot et al. [2005] S. Gallot, D. Hulin, and J. Lafontaine. Riemannian Geometry . Springer, third edition, 2005. URL https://doi.org/10.1007/978-3-642-18855-8 . · doi ↗
