Variational Registration of Multiple Images with the SVD based SqN Distance Measure
Kai Brehmer, Hari Om Aggrawal, Stefan Heldmann, and Jan Modersitzki

TL;DR
This paper introduces a novel image registration method for multiple images using the SVD-based SqN distance measure, demonstrating its superiority over existing approaches in various applications.
Contribution
It proposes and evaluates the use of the Schatten q-norm based SqN distance for aligning multiple images, extending image registration techniques beyond two-image scenarios.
Findings
SqN distance is effective for multiple image registration
SqN outperforms competing methods in experiments
Applicable to dynamic sequences and histological stacks
Abstract
Image registration, especially the quantification of image similarity, is an important task in image processing. Various approaches for the comparison of two images are discussed in the literature. However, although most of these approaches perform very well in a two image scenario, an extension to a multiple images scenario deserves attention. In this article, we discuss and compare registration methods for multiple images. Our key assumption is, that information about the singular values of a feature matrix of images can be used for alignment. We introduce, discuss and relate three recent approaches from the literature: the Schatten q-norm based SqN distance measure, a rank based approach, and a feature volume based approach. We also present results for typical applications such as dynamic image sequences or stacks of histological sections. Our results indicate that the SqN approach…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
11institutetext: Institute of Mathematics and Image Computing, University of Lübeck, Germany 11email: [email protected] 22institutetext: Fraunhofer Institute for Digital Medicine MEVIS, Lübeck, Germany
Variational registration of multiple images with the SVD based distance measure
Kai Brehmer 11
Hari Om Aggrawal 11
Stefan Heldmann 22
Jan Modersitzki 1122
Abstract
Image registration, especially the quantification of image similarity, is an important task in image processing. Various approaches for the comparison of two images are discussed in the literature. However, although most of these approaches perform very well in a two image scenario, an extension to a multiple images scenario deserves attention. In this article, we discuss and compare registration methods for multiple images. Our key assumption is, that information about the singular values of a feature matrix of images can be used for alignment. We introduce, discuss and relate three recent approaches from the literature: the Schatten -norm based distance measure, a rank based approach, and a feature volume based approach. We also present results for typical applications such as dynamic image sequences or stacks of histological sections. Our results indicate that the approach is in fact a suitable distance measure for image registration. Moreover, our examples also indicate that the results obtained by are superior to those obtained by its competitors.
Keywords:
Groupwise registration, Dynamic Imaging, 3D reconstruction
1 Introduction
Typical applications in medical imaging are to analyze spatio-temporal variations of bio-medical images. A prerequisite for such analysis is that images are aligned and in many cases joint registration of multiple images is required. Examples are, e.g., analysis of images from different time points and/or different complimentary modalities, atlas registration, longitudinal normalization, motion correction or image reconstruction [1, 4, 7, 8, 12, 13, 18, 19, 21]
A number of registration models are already available to register a pair of two images [15, 20, 22], but their simple extension to register a group of images might suffer from various problems. Generally, these pair-wise methods assume one of the images as a reference image, and therefore registrations are implicitly biased towards the reference image. Moreover, the selection of a reference image from the given image sequence is not always a very straight forward process. Most importantly, these registration models are primarily influenced by features shared by the image pair and less affected by the features other images have in the image sequence. Therefore, this approach does not account the global information available in the image sequence. It has also been shown that these methods have slow convergence rate compared to the groupwise methods [2, 3].
To avoid the selection of a reference image and the related bias, Joshi et al. [13] proposed the registration of each image from the image sequence with respect to the group mean of the registered image sequence. This approach does not need to define the reference image explicitly, moreover accounts the global information through the group mean. This approach inherits the assumption that every image in the image sequence is almost similar to the group mean.
Recently, Guyader [8] and Brehmer [2, 3] proposed groupwise registration methods for a sequence of images. The underlying assumption is that images are linearly dependent if they are aligned. The linear dependency idea completely circumvents the need of defining a group mean image. Both of these methods construct an image matrix where each column is corresponding to an image from the sequence. Brehmer [2, 3] estimates transformation fields by minimizing the rank of the matrix and implicitly forcing columns of the matrix to become linear dependent to each other. Guyader [8] utilizes the multivariate version of mutual information, called total correlation, to define a groupwise registration model.
The paper is structured as follows: In Section 2, we discuss mathematical formulations of SVD based image registration approaches. More precise, we discuss a general framework for groupwise registration models based on correlation maximization. In Section 3 we briefly discuss the used numerical setting. After that, in Section 4, we demonstrate the performance of some of the proposed methods on two datasets and compare them with other state-of-the-art methods.
2 Registration approaches for multiple images
In this section, we describe our Schatten -norm based distance measure for multiple images. We start by briefly outlining a standard variational registration framework for two images [15]. We then present a straightforward extension for multiple images and discuss the drawbacks of the naive approach drawbacks. The main drawbacks are its sequential and thus ordering dependent assessment of the image frames and the weak coupling of image information over the frames.
We then present the setting of the distance measure. The main idea is to make use of the singular values of an image feature array. Finally, we relate the Schatten -norm based distance measure to work of Friedman et al. [6] and Guyader et. al. [8].
2.1 Variational registration approach for two images
We start the discussion with a standard approach to image registration; see e.g. [15] for details. To simplify discussion, an image is assumed to be a real valued intensity function with compact support in a domain . Given two images , the goal of image registration is to find a transformation such that ideally , where . To achieve this goal, we choose a variational framework where a joined functional
[TABLE]
is to be minimized over an admissible set of transformations. Various choices for distance measures and regularizers are discussed in the literature; see e.g. [15] and references therein. A thorough discussion is beyond the scope of this paper. Here, we only briefly recall the -norm (sum of squared distances, ), the normalized gradient field () [10], and the elastic potential [5]:
[TABLE]
with , and strain where is the identity matrix.
Derivations of image intensities are also commonly used to quantify image similarity. For a unified conceptual framework, we introduce a feature map that maps an image to a Hilbert space of features. Any metrics on the feature space can then be used for registration: . Examples of such feature maps are e.g. intensity normalization or the normalized gradient field, , to name a few. Note that the NGF distance measure is based on whereas the feature map is based on .
2.2 Sequential registration approach for multiple images
Our goal is to extend the standard registration to sequences of images . Note that the images might be given as a time series such as our DCE-MRI example, a structured process such as the HISTO application, or even an unstructured ensemble of images such as an atlas generation.
The first approach is to simply apply the above framework sequentially. With transformations the corresponding energy to be minimized with respect to reads
[TABLE]
Note that typically, one of the deformations is fixed, e.g., for well-posedness. However, as the problem is usually too big to be solved straightforwardly, a non-linear Gauss-Seidel type iteration is usually applied. Here, one assumes that is a good starting guess and sequentially improves component by component for by determining optimizers
[TABLE]
setting and iterates until convergence. This process is generally rather expensive and therefore slow. A problem is that the coupling of the different components of is weak. An update of has impact only every -th step in the procedure. Therefore, potentially a high number of iterations is required.
2.3 Global registration approach for multiple images
Here, we propose a registration approach that provides a full coupling of all image frames. Our objective is to find a minimizer of the energy ,
[TABLE]
where we use the suggestive abbreviation and for sake of simplicity let be with any of the regularizers discussed in Sec. 2.1. Clearly, one could debate for a more general or even stronger regularization of . However, this is not in the scope of the paper and we leave the discussion for future work. The essential contribution is thus the global distance measure that is based on the feature array which comprises the features of the image sequence and its symmetric, positive semi-definite correlation matrix where assembles the correlations of and . Note that we assumed maps into a Hilbert space such that the correlation is well defined according to the corresponding inner product. Our key assumption is that the rank of the feature array is minimal if the image frames are aligned. Note that we actually aim to exclude the trivial situation as this implies that all features are zero. We also note that the assumption may not hold for multi-modal images, if the feature map does not compensate intensity variations. Therefore, a plain image intensity based feature map may not be successful. If we expect that intensity changes will occur at similar positions in space, e.g., the NGF feature map is a valid choice.
2.4 Schatten -norm based image similarity measure
The above considerations suggest to choose as a distance measure. In [2, 3], Brehmer et al. proposed to reformulate the rank minimization problem in terms of a relaxation of the rank function based on a so-called Schatten -norm. Roughly speaking, the Schatten -norm of an operator is the -norm of the vector of its singular values. Thus
[TABLE]
where , , denote the non-zero singular values of . Before we discuss numerical details, we relate this measure to other rank based similarity measures for image stacks. Particularly we address volume minimization of the feature parallelotope and correlation maximization of normalized features.
2.5 Volume minimization of the feature parallelotope
The above approach can be linked to work of Guyader et. al. [8]. To this end, we consider the minimization of the volume of the parallelotope spanned by the columns of . Equivalently, we can consider the determinant of or, exploring the monotonicity of the logarithm, set
[TABLE]
This expression is related to the volume of a normalized covariance matrix which is the total correlation in [8] and used as a similarity measure for group-wise registration.
However, a volume based approach has a severe drawback; see also the discussion in [11]. To illustrate this, we consider two feature vectors and with angle . Hence, . This value is minimal if the vectors are linearly dependent. Unfortunately, this also happens if . In a registration context, this implies that a translation of one of the images, say, about the diameter of yields a global optimizer. In [11] it is therefore suggested to replace the minimization of volume by a maximization of correlation . This value is maximal iff and only iff and is in fact minimal if . This subtle difference is very important in a registration context.
2.6 Correlation maximization of normalized features
In this section we focus on correlation maximization and do not discuss the corresponding minimization formulation. We also assume that feature vectors are normalized, i.e. . For the correlation matrix holds
[TABLE]
where denotes the angle between the -th and -th feature. In the two image setting it is therefore natural to maximize if we account both, for positive and negative correlation. This is the underlying idea of normalized cross correlation. Note that the NGF approach is still different as the correlation is computed point wise and finally averaged.
For the multiple image setting, the best scenario is . If only non-negative correlation is considered, the ideal case is . On the opposite, the worst case scenario for registration is that meaning all features are fully uncorrelated. Therefore, a suitable distance measure is to maximize the difference
[TABLE]
where denotes a suitable matrix norm.
2.7 Correlation maximization and Schatten -norms
Specifically, choosing a Schatten -norm in (11) we obtain
[TABLE]
We investigate the special cases and . Note that
[TABLE]
Thus, choosing the Schatten -norm yields maximizing . This is equivalent to maximizing the largest singular value of , see also [6]:
[TABLE]
For the Schatten -norm we have which shows that the distance is quadratic mean of the correlation among the image features. Furthermore, a direct computation shows
[TABLE]
Here, we exploit the special structure of correlation matrix , i.e., .
To this end, we define the two distance measures for NGF features as follows:
[TABLE]
3 Numerical methods
For the optimization of the functional (cf. (5)) we use the discretize-then-optimize framework introduced in [9]. The basic concept is to use a sequence of discretized finite dimensional optimization problems. A smooth approximation of the problem is represented with few degrees of freedom. It is expected that the optimization is fast as the problem is low dimensional and smooth. Its numerical solution is prolongated and then serves as a starting guess for the finer resolved problem. It is expected that a numerical solution can be computed fast, as the starting point is expected to be close to the solution. The process is generally terminated when reaching the resolution of the given data. Note that the images are only smoothed in the spatial domain.
To solve the discrete problem on a fixed resolution we use a quasi-Newton type approach. More precisely, we use L-BFGS with the Hessian of the regularizer as an initial approximation of the metric and a Wolfe linesearch; see, e.g. [16] for optimization and [15] for details.
For the optimization of ,
[TABLE]
we use similar concepts as above for the regularization term.
For the distance, we remark that the distance is a rather simple algebraic expression of the singular values of the feature matrix. The challenging part is thus the derivative of the singular values. Here, we follow [17]. A singular value decomposition of the feature matrix is denoted by , where the matrices and are orthogonal and is a non-negative diagonal matrix with the singular values as diagonal entries. From [17] we have the surprisingly simple relation
that is used in our implementation.
4 Results
We now present results for the registration of histological serial sectioning of a marmoset monkey brain as well as for DCE-MRI sequences of a human kidney. For the given datasets, we will compare the registration results of , in comparison to a total correlation based approach like in [8] and sequential . We start with registrations of a serial sectioning of a marmoset monkey brain; data courtesy of Harald Möller, Max Planck Institute for Human Cognitive and Brain Sciences, Leipzig, Germany [14]. The dataset consists of every 4th slice of the original serial sectioning of the brain, in total 69 slices of sizes from pixels up to pixels. For proof of concept we reduced the number of pixels per slice to reduce computation time to a reasonable level. The objective of the registration of histological slices is to align them in order to reconstruct the volume of the tissue.
Fig. 1 shows three representative axial slices of the data set. The main difficulties of registering this particular dataset are the different sizes of the slices on the one hand and the translation of whole parts of the imagestack within the domain on the other hand. Furthermore we didn’t use a pre-segmentation of the dataset to show robustness of the registration approaches against artifacts in the background region. The background region of the slices contains several markings of the examiners like white rectangles as well as dust and dirt from the object slide captured during the high resolution scanning process; see Fig. 1.
Fig. 2 shows two sagittal slices (top and bottom row) through the image stack from the reduced, unregistered monkey brain dataset besides the registration results to illustrate the alignment of the slices. As expected the results of are quite similar to the results of . The computation for the groupwise approaches using as well as the total correlation approach from [8] took about 45 to 50 minutes for a resolution of pixels for each of the 69 slices. Compared to this, the sequential NGF approach with just one sweep needed about 2.2 times the computation time (ca. 110 minutes). However, from visual comparison it is obvious that many more sweeps are needed to achieve results comparable to those of the groupwise approaches; see Fig. 2. Everything was implemented in Python using Numpy and Scipy for optimization.
Moreover, we used a random permutation of the stack of histological serial sections to demonstrate invariance to the order of images of the singular value based groupwise registration approaches. We randomly permuted the order of images, registered the stack in random order using and reordered it afterwards; see Fig. 3, center column. As expected, the results are the same as for registration using without random permutation; cf. Fig. 2 and Fig. 3 for comparison.
Next we present registration results for a DCE-MRI sequence of a human kidney; data courtesy of Jarle Rørvik, Haukeland University Hospital Bergen, Norway. Here, 3D images are taken at 45 time points. For ease of presentation and to have a reasonable level of computation time we show results for a 2D slice over time. More precisely, we use 178-by-95 coronal slices of a 178-by-95-by-30-by-45 volume for z-slice 18; see Fig. 4 for representative slices. All time points are used for registration. The objective here is to register the slices while maintaining the dynamics. Fig. 5 illustrates the stack of slices for the different registration approaches using a sagittal cut through the stack, analog to the results for the histological serial sections shown in Fig. 2. The illustrated results were achieved using three different levels of spatial resolution up to half the original resolution in about 8 minutes per groupwise approach. The result of the sequential approach was achieved in about twice the time using just one sweep. For the alignment using the approach from [8], we couldn’t find a parameter setting to achieve results comparable to the - approaches.
5 Discussion and Conclusions
The registration of multiple images is an important task in image processing. Conventional approaches often use an extension of a pairwise approach for two images. In this paper, we demonstrate that this approach may come with numerous disadvantages and may be time consuming. We also describe and analyze a recently proposed alternative. The Schatten -norm based [2, 3] distance measure is a reference for our investigations on different singular value based measures such as the maximization of correlation between different images as well as minimization of spanned volumes. For this purpose we have introduced a general formulation using feature maps that map images into Hilbert spaces. This opens a door for even further investigation on image registration methods for multiple images. With our numerical results we demonstrate that based motion compensation is applicable in dynamic imaging as well as for the alignment of histological serial sections. Moreover, the results clearly show that performs at least as good as standard approaches from the literature. In our experiments both the alignment and the computation time of the groupwise approaches were closer to a desirable solution than the sequential approach using pairwise .
Furthermore, we outlined that a singular value based approach exploits the global information of a dataset, which cannot be achieved by using two-neighbourhoods in registration. In some specific applications, such as dynamic imaging or reconstruction of histological volumes from serial sections, this can avoid unwanted effects like the so-called banana-effect. Future work will address the optimal choice of the parameter and investigations of different variants of feature maps. Finally, different regularization strategies will be investigated.
Acknowledgement
The authors acknowledge the financial support by the Federal Ministry of Education and Research of Germany in the framework of MED4D (project number 05M16FLA)
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Bhatia, K.K., Hajnal, J.V., Puri, B.K., Edwards, A.D., Rueckert, D.: Consistent groupwise non-rigid registration for atlas construction. In: 2004 2nd IEEE International Symposium on Biomedical Imaging: Nano to Macro (IEEE Cat No. 04EX 821). pp. 908–911 Vol. 1 (2004)
- 2[2] Brehmer, K., Wacker, B., Modersitzki, J.: A novel similarity measure for image sequences. In: International Workshop on Biomedical Image Registration. pp. 47–56. Springer (2018)
- 3[3] Brehmer, K., Wacker, B., Modersitzki, J.: Simultaneous registration of image sequences - a novel singular value based images similarity measure. PAMM 18 (1), e 201800370 (2018)
- 4[4] Cootes, T.F., Marsland, S., Twining, C.J., Smith, K., Taylor, C.J.: Groupwise diffeomorphic non-rigid registration for automatic model building. In: Pajdla, T., Matas, J. (eds.) Computer Vision - ECCV 2004. pp. 316–327. Springer Berlin Heidelberg, Berlin, Heidelberg (2004)
- 5[5] Fischler, M.A., Elschlager, R.A.: The representation and matching of pictorial structures. IEEE Transactions on Computers 22 (1), 67–92 (1973)
- 6[6] Friedman, S., Weisberg, H.F.: Interpreting the first eigenvalue of a correlation matrix. Educational and Psychological Measurement 41 (1), 11–21 (1981)
- 7[7] Geng, X., Christensen, G.E., Gu, H., Ross, T.J., Yang, Y.: Implicit reference-based group-wise image registration and its application to structural and functional MRI. Neuro Image 47 (4), 1341–1351 (2009)
- 8[8] Guyader, J.M., Huizinga, W., Poot, D.H.J., van Kranenburg, M., Uitterdijk, A., Niessen, W.J., Klein, S.: Groupwise image registration based on a total correlation dissimilarity measure for quantitative MRI and dynamic imaging data. Scientific Reports 8 (1) (2018)
