The Coherent Point Drift for Clustered Point Sets
Dmitry Lachinov, Vadim Turlapov

TL;DR
This paper introduces an extension to the Coherent Point Drift algorithm that incorporates prior clustering information to improve non-rigid point set registration accuracy, particularly in medical imaging applications.
Contribution
It extends the probabilistic framework of CPD to include clustering priors via a Gaussian mixture model, enhancing registration accuracy without significant performance loss.
Findings
Improved registration accuracy with clustering priors.
Effective for medical data, especially heart model personalization.
Comparable or better than existing methods.
Abstract
The problem of non-rigid point set registration is a key problem for many computer vision tasks. In many cases the nature of the data or capabilities of the point detection algorithms can give us some prior information on point sets distribution. In non-rigid case this information is able to drastically improve registration results by limiting number of possible solutions. In this paper we explore use of prior information about point sets clustering, such information can be obtained with preliminary segmentation. We extend existing probabilistic framework for fitting two level Gaussian mixture model and derive closed form solution for maximization step of the EM algorithm. This enables us to improve method accuracy with almost no performance loss. We evaluate our approach and compare the Cluster Coherent Point Drift with other existing non-rigid point set registration methods and show…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5| Metric | ccpd | cpd | ecpd |
|---|---|---|---|
| Hausdorff | 8.88 | 9.29 | 9.07 |
| Cluster Hausdorff | 8.80 | 23.09 | 19.13 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Numerical Analysis Techniques · Computer Graphics and Visualization Techniques · Digital Image Processing Techniques
11institutetext: Lobachevsky State University, Gagarina ave. 23, 603950 Nizhny Novgorod, Russian Federation 11email:
{dlachinov, vadim.turlapov}@gmail.com
The Coherent Point Drift for Clustered Point Sets
Dmitry Lachinov 11 0000-0002-2880-2887
Vadim Turlapov 11 0000-0001-8484-0565
Abstract
The problem of non-rigid point set registration is a key problem for many computer vision tasks. In many cases the nature of the data or capabilities of the point detection algorithms can give us some prior information on point sets distribution. In non-rigid case this information is able to drastically improve registration results by limiting number of possible solutions. In this paper we explore use of prior information about point sets clustering, such information can be obtained with preliminary segmentation. We extend existing probabilistic framework for fitting two level Gaussian mixture model and derive closed form solution for maximization step of the EM algorithm. This enables us to improve method accuracy with almost no performance loss. We evaluate our approach and compare the Cluster Coherent Point Drift with other existing non-rigid point set registration methods and show it’s advantages for digital medicine tasks, especially for heart template model personalization using patient’s medical data.
Keywords:
Template Personalization Point Set Registration CPD CCPD
1 Introduction
Points set registration task often arises in many computer vision problems. Statistical shape reconstruction, medical image registration, template personalization and various number of other tasks incorporate point set registration. This task consists of computing point correspondences and spatial transformation between two given point sets. These sets could be artificially generated, obtained with scanner or extracted from an image using various point detection approaches. In practice point set registration is primarily used for obtaining transformation between pre-defined template and points extracted from some data. In this paper we consider a special case of point set registration - where information on point sets clustering is already known. As an example, points extracted from CT scan of the person’s heart can have such property. Heart ventricles and atriums are considered as clusters in this case.
The task of non-rigid point set registration is ill-posed by itself. So additional constrains are used to achieve a solution. In general, the type of the transformation between template set and extracted set is unknown, so the solution of the problem results in finding optimal function with respect to problem’s constrains in non-rigid transformations class. These constrains are motivated by assumptions on point sets distribution and target transformation type. Desired solution should describe original transformation the best, be robust to outliers, noise and missing data, be able to process high dimensional data and have reasonable computational complexity. The Coherent Point Drift [8] stands out of other methods[12] and is able to provide accurate results in reasonable time. Still, it fails to perform in some complex cases. In this paper we extend it’s capabilities by taking the idea of including prior information described by V. Golyanik et al. [5] and expand it further to include information on point sets clustering. This information often comes naturally with properties of the point extraction algorithm or nature of the data. For example, many biomedical tasks include segmentation as pre-processing tool but no information on clusters is actually used. In this paper we present CPD modification for clustered point sets that extends existing approach to adopt prior information. In the experiments section we evaluate proposed method using 20 CT samples from MMWHS-2017 [16, 15, 14] challenge and compare results with CPD[8] and ECPD [5]. For the results evaluation we are using Hausdorff distance [13] and it’s variant for clustered point sets. This shows advantages of the proposed method over existing ones for the given task.
2 Related Work
The Coherent Point drift algorithm as a working idea was formulated at 2007 by Myronenko et al. [9] for the first time. The idea behind the method is to formulate point set registration as a maximum likelihood estimation problem. The transformation (model parameters) aligning point set Y with point set X that maximizes likelihood with subject to constrains is chosen as the result. Only non-rigid registration case is considered in that work. The authors expand this approach further at [8] for both rigid and non-rigid cases. Techniques reducing algorithm’s complexity and significantly improving performance were proposed. Authors demonstrate method’s advantages over existing approaches like LM-ICP [4] and TPS-RPM [3] in presence of noise and outliers. Review of the method’s modern applications is given in [6].
Golyanik et al. extended existing approach to take prior knowledge into account [5]. In their work they adopted coarse-to-fine strategy for processing large point set in reasonable time. Authors considered a special case of the non-rigid registration problem, when a sparse set of correspondence priors is known in advance. A modification for Coherent Point Drift was proposed that allows to embed correspondence priors in a closed-form. ECPD employs correspondence preserving subsampling counterbalancing the polynomial complexity by splitting the problem into two subproblems of smaller size and reducing the number of operations by a linear factor.
In [10], two novel elements were proposed: (1) generalization of the CPD (gCPD) that used a procedure of group-wise registration (alignment) for statistical shape reconstruction and (2) more complex point sets, in which each point that belonged to the surface was associated with a normal; moreover, instead of the Gaussian mixture model (GMM), the hybrid mixture model (HMM) was employed that combined the Student t-distribution and the von Mises–Fisher distribution for a point and a normal, respectively. Experiments used clinical data on 27 brain ventricles (Neuro) and 15 hearts (Cardiac). The HMM was shown to provide a significant increase in accuracy compared to the methods (including the gCPD) that used conventional point sets. As a metric, the mean surface distance (MSD) was employed.
2.1 Coherent Point Drift
The Coherent point drift algorithm was introduced in [9] and improved later in [8]. This method is designed to align two point sets and where one point set represents GMM centroids and another represents data. points set representing the data is considered to be fixed, is moving. In order to make points move coherently regularization is used.
Authors are using the following notation we will also stick to.
- •
- dimension of the point sets;
- •
- number of points in X and Y respectively;
- •
- the first point set representing the data;
- •
- the second point set representing the centroids;
- •
- transformation applied to , where is the transformation parameters;
- •
- identity matrix;
- •
- column vector of all ones;
- •
- diagonal matrix formed from the vector a.
The set is GMM centroids with probability density written as
[TABLE]
where . To account presence of the noise and outliers was added to mixture model. Equal isotropic covariances and equal membership probabilities are used. Resulting mixture model density can be written as
[TABLE]
where is the fraction of the noise in the model.
The model parameters are estimated by maximizing likelihood function. This task is equivalent to minimization of the negative log-likelihood that can be written as
[TABLE]
For parameters estimation EM algorithm [1] is used. It consists of two steps, first, posterior probabilities are calculated using old parameters, then new parameters values are estimated during second step. The upper bound of the negative log-likelihood can be written as
[TABLE]
rewriting it we can get the following:
[TABLE]
where and - posterior probabilities of mixture components calculated with old parameters values. can be written as
[TABLE]
where . Minimizing the function , we necessarily decrease the negative log-likelihood function , unless it is already at a local minimum. In the paper [8] both rigid and non-rigid cases are considered. From biomedical applications perspective non-rigid case is more valuable, so we will focus on non-rigid CPD version. In order to represent non-rigid transformation the displacement field is introduced.
[TABLE]
Adding regularization term to negative log-likelihood we obtain
[TABLE]
where is negative log-likelihood and is regularization term. Authors show that regularization term can be rewritten as
[TABLE]
where operator extracts the high frequency content [8, 2, 11]. Authors show that displacement function can be written as
[TABLE]
where and . Matrix can be evaluated by solving following equation
[TABLE]
where and
[TABLE]
2.2 Extended Coherent Point Drift
Extended Coherent Point Drift [5] integrates prior knowledge into registration algorithm. It requires sparse set of points correspondences between point sets to be defined.
These correspondences are defined as pairs where . Correspondence priors are modeled as a product of particular independent density functions
[TABLE]
where and is priors’ degree of reliability. Correspondence priors are incorporated into GMM
[TABLE]
Modified negative log-likelihood will be written as
[TABLE]
Upper bound will be defined as
[TABLE]
where defined as
[TABLE]
matrix can be pre-calculated. For non-rigid case upper-bound function will be written as
[TABLE]
The following linear system is solved during maximization step of the EM algorithm
[TABLE]
3 Proposed method
In this paper we consider a special case for point set registration, where point sets are clustered and correspondences between clusters of the different sets are known. These points can be extracted form heart CT scans so clusters will represent different heart cavities. In general, such information can be extracted for various number of biomedical tasks using different segmentation techniques like neural nets [7]. In the paper we will be using following notation:
- •
- number of the cluster, clusters in total;
- •
- probability of and both to be inside cluster ;
- •
- probability of to be inside cluster ;
- •
- probability of to be inside cluster ;
- •
- probability of the cluster ;
Let us define additional cluster so that with . This cluster includes all pairs that don’t share any cluster. We will rewrite probability density function of the mixture model as
[TABLE]
The negative log-likelihood is defined as
[TABLE]
And it’s upper bound can be written as
[TABLE]
[TABLE]
where . The posterior probabilities can be calculated as follows
[TABLE]
After adding regularization and removing the term that corresponds to we can write upper bound as
[TABLE]
After rewriting equations in matrix form and setting derivative to zero we will get
[TABLE]
So during maximization step we iteratively update matrix using following equation:
[TABLE]
where . Setting derivatives to zero we can get update rule
[TABLE]
where
Strictly speaking is unknown, but it has to be initialized. From it’s definition - probability of given and to be in the same cluster ; we can estimate it with product , where equals to 1 if belongs to cluster and equals to 0 otherwise.
As a result, Cluster Coherent Point drift can be described as
- •
inputs: point sets and , probabilities and ;
- •
outputs: aligned point set T(Y);
- •
algorithm: CCPD;
- •
Initialize:
[TABLE]
- •
Repeat until convergence;
- –
E-step: compute
[TABLE]
- –
M-step: solve
[TABLE]
- –
update
- •
Aligned set is
The key difference between ECPD and proposed method is the assumption made about prior information.
4 Evaluation
We implemented CPD, ECPD and CCPD in C++ and tested them on Xeon E5-2660 with K40. Efficient GPU version was implemented with CUDA C++. All experiments were conducted using GPU version. We tested methods listed above with MMWHS-2017 dataset [16, 15, 14]. This dataset consists of multiple CT and MRI labeled heart scans (figure 1). Among other labels, heart ventricles and atriums are landmarked. For the experiment we will be using only these four labels. Experiments were conducted on 20 training CT volumes with corresponding ground-truth segmentation with goal to align STL template with blood cavities surfaces.
After aligning template’s ventricles and atriums, whole heart structure can be personalized using resulting transformation. Unfortunately we cannot measure quality of the alignment for neighborhood structures. An example of template personalization shown at figure 2.
The results of template registration are shown at figure 3. CPD doesn’t incorporate any prior information and ECPD uses modified matrix where
[TABLE]
. Proposed methods shows the best result among others.
For quantitative evaluation two metrics were used: Hausdorff distance and it’s cluster variant. Let us write Hausdorff distance as follows:
[TABLE]
where , and it’s cluster variant as a mean of :
[TABLE]
where so that corresponds to cluster .
Table 1 shows experimental results. For ECPD column the best result is present. First metric shows quality of alignment with no known information during evaluation stage. The second metric computes average between ground truth clusters, that is more relevant in our case. For both cases proposed method shows better results.
5 Conclusion & Discussion
The problem of non-rigid point set registration is a key problem for many computer vision tasks. In the paper we discussed point set registration problem for clustered point clouds. As an example, points extracted from CT scan of the person’s heart can have such property. Heart ventricles and atriums are considered as clusters in this case.
We presented modified version of Coherent Pint Drift for special type of point sets - sets that can be clustered. We defined loss function and showed the way to optimize it using EM algorithm. In the experiments section we compared three methods: proposed one, Coherent Point Drift and Extended Coherent Point Drift that is able to embed prior information on sparse correspondences. We evaluated them in the same environment using MMWHS-2017 data. Proposed method shows more then two times better accuracy than its competitor.
Presented approach is robust to outliers and noise presence and able to produce precise enough transformation even for heart computational models personalization.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bishop, C.M.: Neural Networks for Pattern Recognition. Oxford University Press, Inc., New York, NY, USA (1995)
- 2[2] Chen, Z., Haykin, S.: On different facets of regularization theory. Neural Computation 14 (12), 2791–2846 (2002)
- 3[3] Chui, H., Rangarajan, A.: A new point matching algorithm for non-rigid registration. Computer Vision and Image Understanding 89 (2), 114 – 141 (2003). https://doi.org/https://doi.org/10.1016/S 1077-3142(03)00009-2, http://www.sciencedirect.com/science/article/pii/S 1077314203000092 , nonrigid Image Registration
- 4[4] Fitzgibbon, A.W.: Robust registration of 2d and 3d point sets. Image and Vision Computing 21 , 1145–1153 (2003)
- 5[5] Golyanik, V., Taetz, B., Reis, G., Stricker, D.: Extended coherent point drift algorithm with correspondence priors and optimal subsampling. In: 2016 IEEE Winter Conference on Applications of Computer Vision (WACV). pp. 1–9 (March 2016). https://doi.org/10.1109/WACV.2016.7477719
- 6[6] Lachinov, D., Getmanskaya, A., Turlapov, V.: Refinement of the coherent point drift registration results by the example of cephalometry problems. Programming and Computer Software 44 (4), 248 – 257 (2018). https://doi.org/https://doi.org/10.1134/S 0361768818040084
- 7[7] Litjens, G.J.S., Kooi, T., Bejnordi, B.E., Setio, A.A.A., Ciompi, F., Ghafoorian, M., van der Laak, J.A.W.M., van Ginneken, B., Sánchez, C.I.: A survey on deep learning in medical image analysis. Co RR abs/1702.05747 (2017), http://arxiv.org/abs/1702.05747
- 8[8] Myronenko, A., Song, X.: Point set registration: Coherent point drift. IEEE Transactions on Pattern Analysis and Machine Intelligence 32 (12), 2262–2275 (Dec 2010). https://doi.org/10.1109/TPAMI.2010.46
