Unbiased Shape Compactness for Segmentation
Jose Dolz, Ismail Ben Ayed, Christian Desrosiers

TL;DR
This paper introduces an unbiased, dimensionless shape compactness prior for segmentation, optimized efficiently with ADMM, and demonstrates its effectiveness in medical image segmentation tasks like abdominal aorta MRI segmentation.
Contribution
The paper presents a novel shape compactness prior that is unbiased and position-independent, along with an efficient ADMM-based optimization method for high-order segmentation problems.
Findings
Competitive performance on abdominal aorta MRI segmentation
Efficient optimization of high-order energy with dense graphs
Effective integration with CNN-based classifiers
Abstract
We propose to constrain segmentation functionals with a dimensionless, unbiased and position-independent shape compactness prior, which we solve efficiently with an alternating direction method of multipliers (ADMM). Involving a squared sum of pairwise potentials, our prior results in a challenging high-order optimization problem, which involves dense (fully connected) graphs. We split the problem into a sequence of easier sub-problems, each performed efficiently at each iteration: (i) a sparse-matrix inversion based on Woodbury identity, (ii) a closed-form solution of a cubic equation and (iii) a graph-cut update of a sub-modular pairwise sub-problem with a sparse graph. We deploy our prior in an energy minimization, in conjunction with a supervised classifier term based on CNNs and standard regularization constraints. We demonstrate the usefulness of our energy in several medical…
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16
Figure 17
Figure 18
Figure 19
Figure 20
Figure 21Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques · Advanced Neural Network Applications
11institutetext: Laboratory for Imagery, Vision and Artificial Intelligence
École de Technologie Supérieure, Montreal, Canada
Unbiased Shape Compactness for Segmentation
Jose Dolz 11
Ismail Ben Ayed 11
Christian Desrosiers 11
Abstract
We propose to constrain segmentation functionals with a dimensionless, unbiased and position-independent shape compactness prior, which we solve efficiently with an alternating direction method of multipliers (ADMM). Involving a squared sum of pairwise potentials, our prior results in a challenging high-order optimization problem, which involves dense (fully connected) graphs. We split the problem into a sequence of easier sub-problems, each performed efficiently at each iteration: (i) a sparse-matrix inversion based on Woodbury identity, (ii) a closed-form solution of a cubic equation and (iii) a graph-cut update of a sub-modular pairwise sub-problem with a sparse graph. We deploy our prior in an energy minimization, in conjunction with a supervised classifier term based on CNNs and standard regularization constraints. We demonstrate the usefulness of our energy in several medical applications. In particular, we report comprehensive evaluations of our fully automated algorithm over subjects, showing a competitive performance for the challenging task of abdominal aorta segmentation in MRI.
1 Introduction
Several recent studies have shown that generic shape constraints such as convexity [10, 17], compactness [1, 6], axial symmetry [15], tubularity [13], skeleton consistency [12] and inter-region topology [2] can be very powerful in medical image segmentation. Such constraints can boost substantially the performances of state-of-the-art segmentation algorithms, including powerful supervised learning methods such as convolutional neural networks (CNN) [2]. Imposing these constraints can be beneficial in a breadth of medical applications, particularly when training data is limited or when the target segments undergo strong noise and poor contrasts/resolutions (see Fig. 1 for examples where target boundaries have varied shapes and low contrast). Unfortunately, these shape constraints are typically high-order functionals, which yield challenging optimization problems. For instance, the recent convexity term in [10] involves a large number of non-submodular triple-cliques, and the compactness in [1] is a high-order ratio, both requiring computationally expensive approximations and iterative schemes to reach a local minimum.
This study focuses on constraining segmentation functionals with shape compactness, a problem investigated previously in several works [1, 6, 14, 11]. In segmentation, the most common compactness functional is the ratio of boundary length to area (or surface to volume in 3D) [14, 11], which is related to the well-known isoperimetric graph partitioning problem (also referred to as the Cheeger problem). Unlike common length regularizers [4], this prior does not suffer from strong shrinking bias (i.e., bias towards small regions). Despite its simplicity, optimizing a discrete form of this ratio functional is an NP-hard problem [14]. However, it is possible to transform this problem into a family of parameterized sub-modular pairwise costs, which can be explored efficiently with parametric max-flow optimizers [14]. Unfortunately, length-to-area ratio varies significantly with region size111This is due to the fact that, in image segmentation, region size might be several orders of magnitude larger then boundary length.. As pointed out (and observed experimentally) in several studies [11, 14], the isoperimetric partitioning problem has a strong bias towards large regions, which is a serious limitation in practice.
The recent segmentation study in [1] proposed an alternative compactness measure based on a ratio of regional moment statistics, thereby ensuring size (or scale) invariance. This algorithm showed promising performances in the case of a vascular structure, and can handle arbitrary variations in region size. Unfortunately, this compactness term is pose-dependent, requiring the user to specify the center of mass of the target region in each 2D slice of a subject data set. This precludes its use for fully automated segmentation, particularly in the case of large 3D volumes. More importantly, it is strongly biased towards circular shapes, significantly limiting its applicability (see the examples in Fig. 1). Another related work is the semi-automatic segmentation algorithm in [6], whose compactness prior also favors large regions and requires user seeds.
The contributions of this study can be summarized as follows.
- •
General-purpose segmentation: We propose to constrain segmentation functionals with a dimensionless, unbiased and position-independent shape compactness prior, which we solve efficiently with an alternating direction method of multipliers (ADMM). To the best of our knowledge, the ratio of length-squared to area has not been used before in segmentation functionals, although it is widely accepted as a compactness measure in the area of shape metrics [16]. This is probably due to the fact that the ensuing high-order optimization problem is challenging. Involving a squared sum of pairwise potentials, our problem makes the use of powerful sub-modular optimization techniques (e.g., parametric max-flow) impractical for image segmentation problems, as it would result in dense (fully connected) graphs. We propose an efficient alternative by splitting the problem into a sequence of easier sub-problems. At each iteration, our ADMM solution alternates three steps, each can be performed efficiently: (i) a sparse-matrix inversion based on Woodbury identity, (ii) a closed-form solution of a cubic equation and (iii) a graph-cut update of a sub-modular pairwise sub-problem involving a sparse graph.
- •
Clinical applications: We first report a comprehensive evaluation of our algorithm on the challenging task of abdominal aorta segmentation in 3D MRI. Results obtained on a dataset of 40 subjects demonstrate a state-of-the-art performance of the proposed method, with an average Dice metric of 0.81. Except for the recent semi-automatic algorithms described in [8, 1], most existing aorta segmentation techniques have focused on CTA. Unlike these techniques, our algorithm allows the development of fully automated tools, which could be used retrospectively on large-scale MRI datasets to identify potentially fatal abnormalities like abdominal aortic aneurysms (AAA) [9]. Our experiments further show the usefulness of our method in three important and challenging segmentation problems: the delineation of the esophagus in CT, the right ventricle in MR, and the aorta in CT.
2 Formulation
Let denote the image domain and the input-feature vector of pixel (or voxel) . Segmentation assigns to each a label ( is the set of possible labels). While our formulation could be easily extended to multi-class segmentation, we focus on a binary (two-region) statement for a clearer presentation. We minimize the following functional with respect to a discrete binary vector :
[TABLE]
where encodes unary-potential priors, and is our shape compactness. As is typical in segmentation, can be expressed in term of log-likelihoods: , where . Although any (semi-) supervised technique can be used to learn , in this study, we employed a fully-convolutional neural network (FCNN) [7].
Our dimensionless, unbiased and position-independent shape compactness prior is based on a measure that is well accepted in the context of shape metrics [16], the ratio of length-squared to area, which we can write in discrete form (i.e., as a function of our segmentation variable): . The area can be expressed as , and the length is proportional to the number of neighboring pixels with different labels, i.e., , where pairwise potential if and are neighbors, otherwise . Note that this compactness measure can be easily extended so as to attract the solution towards strong edges in the image: , with controlling the relative importance of feature on the weight. With these definitions of and , and re-writing length as , where is the Laplacian matrix corresponding to weights , our compactness model becomes:
[TABLE]
where is a vector with the value one for each element.
2.1 ADMM optimization
The general principle of ADMM is to decompose a hard problem into easier-to-solve sub-problems, which are coupled together via equality constraints. In our case, we introduce auxiliary variable and , and reformulate problem (2) as
[TABLE]
Note that since we impose , we can relax the binarity constraints on . This strategy allows an efficient update of this variable.
In the next step, we move the constraints in the cost-function via a augmented Lagrange formulation:
[TABLE]
In this new formulation, and are dual variables corresponding to the two constraints, and , parameters controlling the trade-off between the primal objective and constraint satisfaction. The benefit of this formulation is that solving for each variable, considering all other variables fixed, can be carried out efficiently and to optimality. We thus adopt an iterative optimization method, where each variable is updated in turn until convergence.
Updating :
Let , the task of updating can be expressed from (4) as
[TABLE]
Minimizing this convex quadratic problem yields
[TABLE]
Note that computing requires solving a large and dense linear system. Let , following the Woodbury identity and using the fact that , we can reformulate the matrix inversion as:
[TABLE]
Since is very sparse, the resulting system can be solved efficiently via standard techniques like the preconditioned conjugate gradients method.
Updating :
Let , updating amount to solving
[TABLE]
Deriving this equation w.r.t. and setting the result to zero gives the following cubic equation:
[TABLE]
It can be shown that this equation has a real valued root . In practice, is positive and converges toward zero, and will be positive.
Updating :
Let and . We update by considering the following problem:
[TABLE]
Since is binary, we have that and . Hence, we can reformulate (10) as
[TABLE]
This corresponds to a simple graph-cut problem, which can be solved efficiently using the Boykov-Kolmogorov algorithm [5].
Finally, we update dual variables following the standard ADMM algorithm: and . This iterative updating process is repeated until is less than a small epsilon.
3 Experiments
We first present a quantitative evaluation of our shape compactness on the task of segmenting the abdominal aorta in a MR-T1 data set of 40 subjects. The usefulness of the proposed method is then shown qualitatively for three additional segmentation scenarios: CT esophagus, MRI right ventricle and CT aorta segmentation. Our method’s accuracy is measured in terms of Dice coefficient, and compared to that of graph cuts (GC) [5] and TRIC222Note that TRIC needs a reference point to define the shortest-path distance. This point was defined as the centroid of the CNN-based segmentation in this work. compactness [1]. To measure the contribution of the pixelwise (i.e., unary potential) prior, we also report results obtained via a simple thresholding.
While any technique can be used to obtain the unary-potentials in Eq. (2), we considered the 3D fully-CNN (FCNN) architecture presented in [7]. This architecture is comprised of 9 convolutional layers with non-linear activation units, 3 fully-connected layers (converted into standard convolution operations), and a soft-max layer. The output of this model is a map indicating the probability of each pixel to belong to a given structure. The same unary potential was used for all three tested approaches. For all segmentation applications, the FCNN was trained using a k-fold validation strategy, with .
For our method’s ADMM parameters, we used and =50 for all experiments. As mentioned in [3], ADMM algorithms are not overly sensitive to these parameters. To facilitate convergence, we increase the value of these parameters by 1% (i.e., 1.01 multiplication factor) at each iteration. In contrast, the compactness regularization was tuned for each segmentation problem: = 5000 for MR-T1 abdominal aorta, = 1000 for CT esophagus, = 15000 for MR right ventricle, and = 3000 for CT abdominal aorta. The code has been made publicly available at https://github.com/josedolz/UnbiasedShapeCompactness.
MR-T1 abdominal aorta segmentation: As shown in the first row of Fig. 1, segmenting the aorta from MR-T1 images is a very challenging problem due to the noise and lack of visible boundaries. In this example, thresholding the FCNN probabilities yields parts of the background along with irregular contours. Because it regularizes the segmentation, GC provides a smoother contour. However, this contour follows the probability map of FCNN and does not reflect the compact shape of the target region. Increasing regularization weight in GC yields a compact region but decreases performance significantly due to the well-known shrinkage bias of standard pairwise length regularization (bias to small regions); note that regularization weight was tuned for an optimal GC performance. The next experimental examples will further highlight shrinkage bias. Unlike GC, TRIC finds a more compact region, due to its shape circularity prior. Nonetheless, this example illustrates this approach’s two main drawbacks: 1) it is biased to near-circular regions, and 2) it is not translation invariant and finds regions centered on the probability map’s center of mass. As seen in the figure, our pose-independent method did not suffer from a circularity/shrinkage bias, yielding a much more accurate segmentation, highly similar to the reference contours.
The bar plot of Fig. 2 gives the accuracy of tested methods obtained for each subject333Two subjects were excluded from the quantitative analysis, as the FCNN failed to generate a usable probability map.. The mean Dice and standard deviation computed across all subjects, is reported in Table 1. These results confirm that the proposed compactness term provides a considerable improvement in accuracy. In terms of runtime, our method takes about 80 seconds on average to segment a single subject, 60 seconds of time spend computing the unary potentials via the FCNN. In contrast, TRIC requires nearly 20 minutes per subject. Hence, our method is more suitable for the fully automated segmentation of large-scale datasets.
CT esophagus segmentation: Fig. 1 (second row) shows delineations of the esophagus in CT, also a challenging problem due to its complex shape and non-homogeneous appearance. In this particular example, we can observe how the well-known shrinkage bias problem of GC leads to an under-segmentation of the esophagus. Notice that GC yielded a compact region but did not improve FCNN’s performance. As in the previous case, we also observe how TRIC over-enforces shape circularity on the segmentation. On the other hand, our compactness term is able to preserve the target region’s shape.
MR Right Ventricle Segmentation: This qualitative result (Fig. 1, third row) shows how our compactness term can accommodate a more general class of shapes that differs significantly from tubular structures. Although TRIC [1] can also handle shapes different than a circle, multiple reference points, which form a skeleton, are required from the user in this case, a prohibitively time-consuming effort for 3D data. However, to keep the process fully automatic, only the centroid of the FCNN segmentation was provided. Having only a single reference point, and particularly in this scenario, led to failure of TRIC to achieve a satisfactory segmentation.
CT Aorta Segmentation: The application of the proposed method to CT aorta segmentation is illustrated in the last row of Fig 1. We see that our method can handle multi-region scenarios, i.e. bifurcations, in contrast to other compactness terms [1]. Note that, for this example, we applied TRIC separately on each of the two regions to avoid having a single contour centered in between these regions. This means that we fed TRIC with additional supervision information, unlike our method which is fully automatic. Notice that, to obtain compact and smooth regions, GC worsened FCNN performance due to shrinkage bias.
4 Conclusion
We presented an unbiased, fully-invariant and multi-region prior for the segmentation of compact shapes, based on the ratio of length-squared to area. An efficient ADMM strategy was proposed to solve the high-order energy minimization problem resulting from this formulation. Using an FCNN to obtain unary probabilities, the proposed method achieved high accuracy in four challenging segmentation problems.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Ben Ayed, I., Wang, M., Miles, B., Garvin, G.J.: TRIC: Trust region for invariant compactness and its application to abdominal aorta segmentation. In: MICCAI (1). pp. 381–388. Springer (2014)
- 2[2] Ben Taieb, A., Hamarneh, G.: Topology aware fully convolutional networks for histology gland segmentation. In: MICCAI (2). pp. 460–468. Springer (2016)
- 3[3] Boyd, S., Parikh, N., Chu, E., Peleato, B., Eckstein, J.: Distributed optimization and statistical learning via the alternating direction method of multipliers. Foundations and Trends® in Machine Learning 3(1), 1–122 (2011)
- 4[4] Boykov, Y., Funka-Lea, G.: Graph cuts and efficient nd image segmentation. International Journal of Computer Vision 70(2), 109–131 (2006)
- 5[5] Boykov, Y., Kolmogorov, V.: An experimental comparison of min-cut/max-flow algorithms for energy minimization in vision. IEEE Transactions on Pattern Analysis and Machine Intelligence 26(9), 1124–1137 (2004)
- 6[6] Das, P., Veksler, O., Zavadsky, V., Boykov, Y.: Semi-automatic segmentation with compact shape prior. Image and Vision Computing 27(1), 206–219 (2009)
- 7[7] Dolz, J., Desrosiers, C., Ben Ayed, I.: 3D fully convolutional networks for subcortical segmentation in mri: A large-scale study. ar Xiv:1612.03925 (2016)
- 8[8] Duquette, A.A., Jodoin, P.M., Bouchot, O., Lalande, A.: 3D segmentation of abdominal aorta from CT-scan and MR images. Computerized Medical Imaging and Graphics 36(4), 294–303 (2012)
