A Finite Element Nonoverlapping Domain Decomposition Method with Lagrange Multipliers for the Dual Total Variation Minimizations
Chang-Ock Lee, Jongho Park

TL;DR
This paper introduces a primal-dual domain decomposition method using Lagrange multipliers for total variation regularized image processing, demonstrating superior performance in denoising, inpainting, and segmentation tasks.
Contribution
It develops a novel finite element domain decomposition approach with Lagrange multipliers for dual total variation minimization, improving computational efficiency and accuracy.
Findings
Outperforms existing state-of-the-art methods in image denoising, inpainting, and segmentation.
Effective handling of interface constraints via Lagrange multipliers.
Applicable to problems with $L^2$- and $L^1$-fidelity.
Abstract
In this paper, we consider a primal-dual domain decomposition method for total variation regularized problems appearing in mathematical image processing. The model problem is transformed into an equivalent constrained minimization problem by tearing-and-interconnecting domain decomposition. Then, the continuity constraints on the subdomain interfaces are treated by introducing Lagrange multipliers. The resulting saddle point problem is solved by the first order primal-dual algorithm. We apply the proposed method to image denoising, inpainting, and segmentation problems with either -fidelity or -fidelity. Numerical results show that the proposed method outperforms the existing state-of-the-art methods.
| 1 1 Test image | 1 1 | ALG1 | LNP | ||||||||||||||
| PSNR | iter |
|
|
PSNR | iter |
|
|
||||||||||
| 1 Cameraman | 1 | 47.68 | - | 546 | 86.30 | ||||||||||||
| 48.26 | 39 | 266 | 155.85 | 46.75 | 33 | 1761 | 587.86 | ||||||||||
| 48.26 | 45 | 277 | 50.40 | 46.75 | 61 | 1679 | 141.31 | ||||||||||
| 48.25 | 49 | 274 | 14.79 | 46.78 | 134 | 1796 | 42.37 | ||||||||||
| 48.24 | 54 | 279 | 2.19 | 46.79 | 175 | 1200 | 7.82 | ||||||||||
| 1 Boat | 1 | 33.82 | - | 2093 | 476.02 | ||||||||||||
| 33.92 | 61 | 283 | 531.97 | 33.60 | 43 | 1815 | 970.12 | ||||||||||
| 33.92 | 63 | 276 | 165.45 | 33.62 | 71 | 1823 | 255.70 | ||||||||||
| 33.92 | 66 | 258 | 53.68 | 33.62 | 114 | 1413 | 87.23 | ||||||||||
| 33.92 | 71 | 246 | 8.18 | 33.63 | 161 | 844 | 14.07 | ||||||||||
| Test image | PSNR | iter |
|
|
||||||
|---|---|---|---|---|---|---|---|---|---|---|
| 1 Cameraman | 1 | 24.96 | - | 2324 | 389.88 | |||||
| 25.00 | 438 | 613 | 805.88 | |||||||
| 25.00 | 440 | 713 | 260.06 | |||||||
| 25.00 | 445 | 99 | 88.96 | |||||||
| 25.00 | 460 | 99 | 13.40 | |||||||
| 1 Boat | 1 | 24.32 | - | 1861 | 436.32 | |||||
| 24.33 | 285 | 817 | 1085.96 | |||||||
| 24.33 | 286 | 588 | 361.04 | |||||||
| 24.33 | 289 | 111 | 132.14 | |||||||
| 24.33 | 298 | 117 | 25.63 |
| 1 1 Test image | 1 1 | ALG1 | LNP | ||||||||||||||
| PSNR | iter |
|
|
PSNR | iter |
|
|
||||||||||
| 1 Cameraman | 1 | 34.70 | - | 1693 | 304.05 | ||||||||||||
| 35.56 | 78 | 921 | 615.5 | 34.18 | 173 | 1530 | 1132.71 | ||||||||||
| 35.53 | 86 | 958 | 244.21 | 34.24 | 190 | 1672 | 323.98 | ||||||||||
| 35.53 | 90 | 274 | 68.44 | 34.26 | 205 | 1796 | 97.65 | ||||||||||
| 35.53 | 101 | 279 | 16.10 | 34.37 | 239 | 1200 | 24.67 | ||||||||||
| 1 Boat | 1 | 30.83 | - | 2262 | 604.34 | ||||||||||||
| 31.08 | 80 | 860 | 1283.93 | 30.53 | 368 | 1703 | 2134.45 | ||||||||||
| 31.08 | 84 | 765 | 432.68 | 30.55 | 374 | 1888 | 635.88 | ||||||||||
| 31.08 | 88 | 258 | 128.09 | 30.56 | 384 | 1413 | 217.33 | ||||||||||
| 31.07 | 97 | 246 | 31.95 | 30.58 | 402 | 844 | 60.68 | ||||||||||
| 1 1 Test image | 1 1 | ALG1 | DCT | ||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| iter |
|
|
iter |
|
|
||||||||||
| 1 Cameraman | 1 | - | 1070 | 172.47 | |||||||||||
| 16 | 258 | 153.72 | 14 | 290 | 85.67 | ||||||||||
| 16 | 63 | 37.58 | 15 | 52 | 18.21 | ||||||||||
| 16 | 55 | 6.65 | 15 | 52 | 5.31 | ||||||||||
| 16 | 56 | 1.10 | 16 | 55 | 1.43 | ||||||||||
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopology Optimization in Engineering · Composite Structure Analysis and Optimization · Advanced Numerical Methods in Computational Mathematics
∎
11institutetext: Chang-Ock Lee 22institutetext: Department of Mathematical Sciences, KAIST, Daejeon 34141, Korea
22email: [email protected] 33institutetext: Jongho Park 44institutetext: Department of Mathematical Sciences, KAIST, Daejeon 34141, Korea
Tel.: +82-42-350-2790
44email: [email protected]
A Finite Element Nonoverlapping Domain Decomposition Method with Lagrange Multipliers for the Dual Total Variation Minimizations††thanks: The first author’s work was supported by NRF grant funded by MSIT (NRF-2017R1A2B4011627)
and the second author’s work was supported by NRF grant funded by the Korean Government (NRF-2015-Global Ph.D. Fellowship Program).
Chang-Ock Lee
Jongho Park
(Received: date / Accepted: date)
Abstract
In this paper, we consider a primal-dual domain decomposition method for total variation regularized problems appearing in mathematical image processing. The model problem is transformed into an equivalent constrained minimization problem by tearing-and-interconnecting domain decomposition. Then, the continuity constraints on the subdomain interfaces are treated by introducing Lagrange multipliers. The resulting saddle point problem is solved by the first order primal-dual algorithm. We apply the proposed method to image denoising, inpainting, and segmentation problems with either -fidelity or -fidelity. Numerical results show that the proposed method outperforms the existing state-of-the-art methods.
Keywords:
Total Variation Lagrange Multipliers Domain Decomposition Parallel Computation Image Processing
MSC:
65N30 65N55 65Y05 68U10
††journal: arXiv
1 Introduction
After a pioneering work of Rudin et al. ROF:1992 , total variation minimization has been widely used in image processing. In their work, authors proposed an image denoising model with the total variation regularizer, which is called Rudin–Osher–Fatemi (ROF) model, as follows:
[TABLE]
where is an image domain, is a corrupted image, is a positive denoising parameter, and is the total variation of defined as
[TABLE]
where denotes the Euclidean norm in . The solution space is the collection of functions with finite total variation. Thanks to the anisotropic diffusion property of the total variation term, the model (1.1) effectively removes Gaussian noise and preserves edges and discontinuities of the image SC:2003 .
Total variation minimization can be applied to not only image denoising, but also other interesting problems in image processing. In CS:2002 , an image inpainting model with the total variation regularizer was proposed. The model in CS:2002 is a simple modification of (1.1), which excludes the inpainting domain from the domain of integration of the fidelity term in (1.1). In addition, (1.1) can be generalized to the image deconvolution problem by replacing in the fidelity term by , where is either a specific convolution operator CW:1998 ; YK:1996 . In order to treat impulse noise, the - model, which uses fidelity instead of fidelity, was introduced in CE:2005 ; Nikolova:2004 . It is well-known that the - model preserves contrast of the image, while the conventional ROF model does not. We also note that variational image segmentation problem can be represented as the total variation minimization by appropriate change of variables CEN:2006 . One can refer CP:2016 for further results of total variation minimization.
This paper is concerned with domain decomposition methods (DDMs) for such total variation regularized minimization problems. DDMs are suitable for parallel computation since they solve a large scale problem by dividing it into smaller problems and treating them in parallel. While DDMs for elliptic partial differential equations have been successfully developed over past decades, there have been relatively modest achievements in total variation minimization problems due to their own difficulties. At first, the total variation term is nonsmooth and nonseparable. Thus, the energy functional cannot be expressed as the sum of the local energy functionals in the subdomains in general. Even more, the solution space allows discontinuities of a solution on the subdomain interfaces, so that it is difficult to impose appropriate boundary conditions to the local problems in the subdomains.
There have been several researches to overcome such difficulties and develop efficient DDMs for the total variation minimization CTWY:2015 ; DCT:2016 ; Fornasier:2007 ; FLS:2010 ; FS:2009 ; HL:2013 ; HL:2015 ; LLWY:2016 ; LN:2017 ; LNP:2018 ; LPP:2019 . The Gauss-Seidel and Jacobi type subspace correction methods for the ROF model were proposed in Fornasier:2007 ; FLS:2010 ; FS:2009 ; LLWY:2016 , and they were extended to the case of mixed fidelity in HL:2013 . However, in LN:2017 , a counterexample was provided for convergence of subspace correction methods. In DCT:2016 , a convergent overlapping DDM for the convex Chan–Vese model CEN:2006 was proposed. Recently, a domain decomposition framework for the case of fidelity was introduced in LNP:2018 . One of the effective ideas in DDMs for the total variation minimization is to consider the Fenchel–Rockafellar dual formulation of the model:
[TABLE]
instead of the original one. Here, denotes the space of vector fields : such that and on . Consideration of the dual formulation resolves some difficulties mentioned above; the dual energy functional is separable, and the solution space requires some regularity on the subdomain interfaces. Even if there arises another difficulty of treating the inequality constraint , several successful researches have been done CTWY:2015 ; HL:2015 ; LN:2017 ; LPP:2019 .
In this paper, we generalize the primal-dual DDM proposed in LPP:2019 to a wider class of the total variation minimizations. We consider a general model problem
[TABLE]
where : is a proper, convex, and lower semicontinuous functional satisfying additional properties such that is separable and simple in the sense that (1.2) can be efficiently solved by the first order primal-dual algorithm CP:2011 . Such a class of total variation minimizations contains the ROF model, the - model, their inpainting variants, and the convex Chan–Vese model for image segmentation. As we noted above, we treat the Fenchel–Rockafellar dual problem of (1.2):
[TABLE]
where is the Legendre–Fenchel conjugate of . The image domain is decomposed into a number of disjoint subdomains , and the continuity along the subdomain interfaces is enforced. By treating the continuity constraints on the subdomain interfaces by the method of Lagrange multipliers, we obtain an equivalent saddle point problem. Application of the Chambolle–Pock primal-dual algorithm CP:2011 to the resulting saddle point problem yields our proposed method, that shows good performance for various total variation minimization problems such as image denoising, inpainting, and segmentation.
The rest of the paper is organized as follows. We present the basic settings for design of DDM in Sect. 2. In Sect. 3, we state the abstract model problem which generalizes various problems in image processing such as image denoising, inpainting, and segmentation. A convergent nonoverlapping DDM for the model problem is proposed in Sect. 4. We apply the proposed method to several image processing problems and compare with the existing state-of-the-art methods in Sect. 5. We conclude the paper with remarks in Sect. 6.
2 Preliminaries
In this section, we present the basic setting for design of DDM. At first, we introduce notations that will be used throughout the paper. Then, the discrete setting for the dual total variation minimization based on the finite element framework is provided.
2.1 Notations
Let be the generic -dimensional Hilbert space. For and , the -norm of is denoted by
[TABLE]
and the Euclidean inner product of and is denoted by
[TABLE]
We may drop the subscript if there is no ambiguity.
In this paper, we use the symbol ∗ as superscripts with two different meanings. At first, for a convex functional : , : denotes the Legendre–Fenchel conjugate of , which is defined as
[TABLE]
for every . On the other hand, when : is a linear operator on , denotes the adjoint of , that is,
[TABLE]
for every . It is well-known that the matrix representation of is the transpose of the matrix representation of .
For a convex functional : , the effective domain of , denoted by , is defined as
[TABLE]
Also, for a convex subset of , we denote the relative interior of by , which is defined as the interior of when is regarded as a subset of its affine hull. One may refer Sect. 6 of Rockafellar:2015 for detailed topological properties of .
For a subset of , we define the characteristic functional : of by
[TABLE]
It is clear that the functional is convex if and only if is a convex subset of .
2.2 Fenchel–Rockafellar duality
Throughout this paper, we use the notion of Fenchel–Rockafellar duality frequently. For the sake of completeness, we provide its key features. For more rigorous texts, readers may refer CP:2016 ; Rockafellar:2015 , for instance.
Let and be finite-dimensional Hilbert spaces. Consider the minimization problem of the form
[TABLE]
where : is a linear operator and : , : are proper, convex, lower semicontinuous functionals. We use the assumptions on and presented in Corollary 31.2.1 of Rockafellar:2015 .
Assumption 2.1
There exists such that and .
Under Assumption 2.1, the following relations hold Rockafellar:2015 :
[TABLE]
The saddle point problem in the right hand side of (2.2a) is called the primal-dual formulation of (2.1), and the minimization problem in (2.2b) is called the dual formulation of (2.1). Hence, it is enough to solve (2.2a) or (2.2b) instead of (2.1) in many cases.
2.3 Discrete setting
Let be a rectangular image domain consisting of a number of rows and columns of pixels. Since each pixel holds a value representing the intensity at a point, we may regard the image as a pixelwise constant function on . We write the collection of all pixels in as and define the space for an image as
[TABLE]
We note that . We regard each pixel in as a square element so that becomes a piecewise constant square finite element space whose side length equals to 1. It is clear that the functions
[TABLE]
form a basis for . For and , let denote the degree of freedom (dof) of associated with the basis function . Then, is represented as
[TABLE]
To obtain a finite element discretization of the dual problem, the space for the dual variables is defined by the lowest order Raviart–Thomas elements:
[TABLE]
where is the collection of the vector functions : of the form
[TABLE]
We notice that the divergence operator in the continuous setting is well-defined on and . Each dof of is the value of the normal component on a pixel edge. Let be the set of indices of the basis functions for and be the basis. For and , we denote the dof of associated with the basis function as ; we have
[TABLE]
Throughout the paper, the -norm of is defined as the -norm of the vector of dofs of for . In this case, one may regard as a lumped -norm with proper scaling; see Remark 2.2 of LPP:2019 .
In order to treat the inequality constraints appearing in the dual formulation (1.3), we define the convex subset of by
[TABLE]
The projection of onto can be easily computed by
[TABLE]
2.4 Domain decomposition setting
We decompose the image domain into disjoint rectangular subdomains . For two subdomains and () sharing a subdomain edge (interface), let be the shared subdomain edge between them. Also, we define the union of the subdomain interfaces by . We assume that does not cut through any elements in .
Now, we define the local function spaces in the subdomains. For , let be the collection of all pixels in . The local primal function space is defined by
[TABLE]
Obviously, we have . Furthermore, we define the natural restriction operator : by
[TABLE]
Then, the adjoint : of becomes the extension-by-zero operator
[TABLE]
The local dual function spaces are defined in the tearing-and-interconnectingsense FR:1991 . More precisely, we define the local dual function space as
[TABLE]
Note that the boundary condition is not imposed on for . Thus, has dofs on the pixel edges contained in . Let be the set of indices of the basis functions for . Similarly to (2.3), the inequality-constrained subset of is defined by
[TABLE]
The orthogonal projection onto is computed as
[TABLE]
Let and be the direct sums of the sets ’s and ’s, respectively. By definition, functions in may have discontinuities on . Let be the collection of dofs of on . The jump operator : measures the magnitude of such discontinuities, that is, is defined as
[TABLE]
where . Clearly, there is a natural isomorphism between and . For later use, we provide an upper bound of the operator norm of LPP:2019 .
Proposition 2.2
The operator norm of :* has a bound .*
Proposition 2.2 will be used for the estimation of the range of the parameters in the proposed method.
3 The model problem
The model problem we consider in this paper is the total variation regularized convex minimization problem defined on the image domain :
[TABLE]
where : is a proper, convex, lower semicontinuous functional, is the discrete total variation of given by
[TABLE]
and is a positive parameter. In addition, we assume that satisfies the following three assumptions.
Assumption 3.1
is separable in the sense that there exist proper, convex, lower semicontinuous local energy functionals : such that
[TABLE]
where is the restriction operator defined in (2.5).
Assumption 3.2
For any and , the proximity operator defined by
[TABLE]
has a closed-form formula.
Assumption 3.3
.
Assumption 3.1 makes (3.1) more suitable for designing DDMs. It will be explained in Sect. 4. Also, by Assumption 3.2, we are able to adopt the primal-dual algorithm CP:2011 to solve (3.1). Indeed, if Assumption 3.2 holds, we can solve an equivalent primal-dual form of (3.1),
[TABLE]
by the primal-dual algorithm. For the model problem (3.1), Assumption 3.3 is equivalent to Assumption 2.1 when since . That is, Assumption 3.3 is essential for the primal-dual equivalence. It is trivial that all the problems we will deal with in this paper satisfy Assumption 3.3.
Next, we consider the dual formulation of (3.1):
[TABLE]
From (3.3), it is possible to deduce a relationship between solutions of the primal problem (3.1) and the dual problem (3.4):
[TABLE]
As the standard discretization for the total variation minimization is the finite difference method, we provide a relation between the finite difference discretization and our finite element discretization. The following proposition means that a solution of the finite difference discretization of (1.2) can be recovered from a solution of (3.4).
Proposition 3.4
Assume that the image domain consists of pixels. Let be a solution of (3.4). If and satisfy the primal-dual relation (3.5), then is a solution of the minimization problem
[TABLE]
where is the forward finite difference operator
[TABLE]
and .
Proof
It is straightforward by the same argument as Theorem 2.3 of LPP:2019 . ∎
Remark 3.5
In Proposition 3.4, equals to the finite difference anisotropic total variation due to the structure of the constraint set in (2.3). On the other hand, one can make equal to the finite difference isotropic total variation by replacing by an appropriate convex subset of ; see LPP:2019 for details.
Remark 3.6
We give a remark on the relation between the continuous problem (1.2) and the discrete problem (3.1). For simplicity, we assume that the resolution of the image is . We fix the side length of by 1 and introduce a side length parameter . The space is defined in the same manner as in Sect. 2.3. Then the question we have is whether a solution of the minimization problem
[TABLE]
accumulates at a solution of (1.2) in as . However, since does not -converge to the -seminorm in as (see Example 4.1 of Bartels:2012 ), this is not true in general. It means that the space may not be considered as a correct space for solving (1.2). To avoid this uncomfortable situation, one may use higher order Raviart–Thomas elements to discretize the dual formulation Bartels:2012 ; HHSNW:2018 .
Nevertheless, thanks to Proposition 3.4, one can construct a sequence accumulating at in the -topology from . For any , we have
[TABLE]
where is the dofs of and : is the forward finite difference operator defined similarly to (3.6). As it can be shown without much difficulty the right hand side of (3.7) -converges to the -seminorm in as CLL:2011 ; WL:2011 , there exists an interpolation operator : such that accumulates at in as . We note that for , in general.
The model problem (3.1) occurs in various areas of mathematical image processing. One of the typical examples is the image denoising problem. In ROF:1992 , authors proposed the well-known ROF model which consists of the -fidelity term and the total variation regularizer. In the ROF model, is given by
[TABLE]
and Assumptions 3.1 and 3.2 are satisfied with
[TABLE]
In order to preserve the contrast of an image, the - model which uses the -fidelity term was introduced in CE:2005 ; Nikolova:2004 :
[TABLE]
The local functionals and the proximity operator are readily obtained as follows:
[TABLE]
where the elementwise shrinkage operator : on is defined as follows WYYZ:2008 :
[TABLE]
with the convention . For simplicity, is assumed to be a union of elements of in (3.8).
The models for the image denoising problem are easily extended to the image inpainting problem CS:2002 . Let be the inpainting domain and be the known part of an image. We assume that does not cut through any elements in . We set on for simplicity. Also, let : be the restriction operator onto , that is, on for all . Then, is given by
[TABLE]
Note that Assumptions 3.1 and 3.2 are ensured with
[TABLE]
where for . One can easily check that . Similarly to the denoising model, we may use the fidelity term instead of the fidelity term as follows:
[TABLE]
In this case, and are given by
[TABLE]
Another typical example is the image segmentation problem. In CEN:2006 , authors proposed a convex image segmentation model with the total variation regularizer as follows:
[TABLE]
where is a given image, and are predetermined intensity values. Writing in (3.9) yields the following simpler form:
[TABLE]
Then, (3.10) is of the form (3.1) with
[TABLE]
and satisfies Assumptions 3.1 and 3.2 with
[TABLE]
Here, can be computed pointwise like (2.4).
We close this section by mentioning that the image deconvolution problem is not a case of (3.1) in general. For an operator : defined by the matrix convolution with some matrix kernel, since the computation of needs values of in as well as adjacent subdomains. Consequently, Assumption 3.1 cannot be satisfied for the deconvolution problem.
4 Proposed method
In this section, we extend the primal-dual DDM for the ROF model introduced in LPP:2019 to the more general model problem (3.1). The continuity of a solution on the subdomain interfaces is imposed in the dual sense, that is, it is imposed by the method of Lagrange multipliers. As a result, we obtain an equivalent saddle point problem, which is solved by the first order primal-dual algorithm CP:2011 .
We start the section by stating the following simple proposition, which means that the Legendre–Fenchel conjugate is separable if the original functional is separable.
Proposition 4.1
Let : be a proper, convex, lower semicontinuous functional satisfying Assumption 3.1. Then, its Legendre–Fenchel conjugate also satisfies Assumption 3.1.
Proof
For , we have
[TABLE]
Letting for completes the proof. ∎
Thanks to Proposition 4.1, we can transform the dual model problem (3.4) to an equivalent constrained minimization problem
[TABLE]
In order to treat the continuity constraint , the method of Lagrange multipliers for (4.1) yields the saddle point formulation
[TABLE]
where
[TABLE]
The following proposition summarizes the equivalence between the dual model problem (3.4) and the resulting saddle point problem (4.2).
Proposition 4.2
If is a solution of (3.4), then is a primal solution of (4.2). Conversely, if is a primal solution of (4.2), then . Hence and is a solution of (3.4).
Now, we are ready to propose the main algorithm of this paper. In the case of ROF model, recovering a primal solution from the computed dual solution can be easily done by the primal-dual relation . However, in the general case, the primal solution may not be obtained as in the ROF case since the primal-dual relation (3.5) does not always give an explicit formula for . Instead, we consider an algorithm to obtain a primal solution and a dual solution simultaneously. We begin with the primal-dual algorithm CP:2011 applied to (4.2). In each iteration of the primal-dual algorithm, we need to solve the local problems of the following form:
[TABLE]
for . The term appears because we compute proximal descent/ascent in each step of the primal-dual algorithm. To obtain a primal solution and a dual solution simultaneously, we replace (4.3) by the following primal-dual formulation of (4.3):
[TABLE]
There is another advantage to solve (4.4) instead of (4.3). Differently from the ROF model, it is sometimes cumbersome to get an explicit formula for , which makes the design of a local solver difficult. We note that an explicit formula for in the case of with nonsingular is given in DHN:2009 , but it is somewhat complicated. However, considering (4.4) does not require an explicit formula for .
Similarly to the ROF case, the solution pair can be constructed by assembling the local solution pairs in the subdomains, i.e., and . The local solution pair in the subdomain is obtained by solving the local problem
[TABLE]
and each local problem can be solved in parallel. We will address how to solve (4.5) in Sect. 5 in detail.
In summary, the proposed algorithm is presented in Algorithm 1. As in the ROF case, the range comes from Proposition 2.2.
Next, we analyze convergence of the proposed method. Since the sequence generated by Algorithm 1 agrees with the one generated by the standard primal-dual algorithm, ergodic convergence is guaranteed.
Theorem 4.3
Let be the sequence generated by Algorithm 1. Then, converges to a saddle point of (4.2) and it satisfies
[TABLE]
for any and , where and .
Proof
Since the term
[TABLE]
in (4.2) is convex, by Theorem 5.1 in CP:2016 , we get the desired result. ∎
By Theorem 4.3, we ensure the convergence of the sequences and . On the other hand, since the sequence is a kind of byproduct of the primal-dual algorithm, the convergence theory for the primal-dual algorithm developed in CP:2011 ; CP:2016 does not ensure global convergence of . Thus, we will prove that it tends to a solution of (3.1). For the sake of convenience, we rewrite (4.4) as the following more compact form:
[TABLE]
where : is defined as . Note that if , then so that . Furthermore, since has dofs on , we have . Then, we readily see that is a solution of the minimization problem
[TABLE]
where is defined as
[TABLE]
the Legendre–Fenchel conjugate of
[TABLE]
At first, we verify the boundedness of .
Lemma 4.4
There exist a finite functional :* and a coercive functional : such that*
[TABLE]
Proof
We first see that for any , we have , where is the dof-wise absolute value of and the division is done dof-wise with convention . An upper bound of is obtained as follows:
[TABLE]
Take , which is clearly finite.
Now, we find a lower bound of . Note that in Algorithm 1. Since and are convergent by Theorem 4.3, is also convergent. Hence, is bounded, i.e. there exists a constant such that
[TABLE]
Then, we have
[TABLE]
Note that denotes the set of indices of the basis functions for and the value of depends on the image size and the number of subdomains . Take
[TABLE]
which is coercive due to the term . ∎
Next lemma provides a criterion for equi-coercivity of a collection of functionals.
Lemma 4.5
Let , , be a collection of functionals from into . If there exists a coercive functional :* such that for all , then is equi-coercive, that is, for every , there exists a compact subset of such that*
[TABLE]
Proof
Let . Since is coercive, there exists a compact subset of such that
[TABLE]
On the other hand, for every , implies that
[TABLE]
Therefore, is equi-coercive. ∎
Now, with the help of the lemmas above, we state the main theorem, which ensures that the sequence generated by Algorithm 1 approaches to solutions of (3.1).
Theorem 4.6
Let be the sequence generated by Algorithm 1. Then, is bounded and every limit point of is a solution of (3.1).
Proof
Recall that is a solution of the minimization problem
[TABLE]
Since is proper, we may choose with . By Lemma 4.4, we have
[TABLE]
Thanks to the minimization property of , we get
[TABLE]
By Lemmas 4.4 and 4.5, is equi-coercive, that is, there exists a compact subset of independent of such that
[TABLE]
Thus, implies that . Since , the map is a continuous isomorphism between and . Therefore, we can deduce that
[TABLE]
which is a compact subset of independent of . This implies that is bounded.
Now, we may refine so that it converges to its limit point . Since is a solution of a saddle point problem (4.6), it satisfies
[TABLE]
By Theorem 4.3, converges to . As and , by closedness of the graph of (See Theorem 24.4 in Rockafellar:2015 ), we get
[TABLE]
By Proposition 4.2, , and hence . We obtain the relation
[TABLE]
From the facts that is a solution of (3.4) (See Proposition 4.2) and satisfies the relation (3.5), we conclude that is a solution of (3.1). ∎
As a direct consequence of of Theorem 4.6, we get the following result.
Corollary 4.7
Let be the sequence generated by Algorithm 1. Then, converges to , where is a solution of (3.1).
5 Applications
In this section, we apply our proposed DDM to various total variation based image processing problems mentioned in Sect. 3. The proposed method was implemented in ANSI C with OpenMPI and compiled by Intel Parallel Studio XE. All the computations were done on a cluster composed of seven machines, where each machine has two Intel Xeon SP-6148 CPUs (2.4GHz, 20C), 192GB memory, and the operating system CentOS 7.4 64bit.
To emphasize efficiency of the proposed method as a parallel algorithm, we compare the wall-clock time of the proposed method with the primal-dual algorithm CP:2011 for the full dimension problem (3.4). The wall-clock time measures the total elapsed time including the communication time.
At each iteration of Algorithm 1, we solve the local saddle point problems of the form
[TABLE]
where is given in Assumption 3.1 and
[TABLE]
The primal-dual algorithm for (5.1) consists of computation of the proximity operators of and , that is,
[TABLE]
for some . For more details, we refer readers to see CP:2011 . The proximity operator of is computed easily as follows:
[TABLE]
Computation of the proximity operator of depends on the problem to solve. Thus, we will give details in each subsection.
Since is uniformly convex with parameter , we are able to adopt the convergent primal-dual algorithm (CP:2011, , Algorithm 2) for the local problems. Such acceleration of the local solvers for DDMs was discussed in LNP:2018 ; LPP:2019 .
Next, we provide the setting of the used parameters. We set the parameters for the outer iterations by and for the image denoising and inpainting problems, and for the image segmentation problem. For the convergent algorithms for the local problems, we set , , and . (The same notations for the parameters are used as in CP:2011 ). The full dimension problems () are solved by the primal-dual algorithm with the same parameters as the corresponding local problems.
Finally, we note that the local solutions from the previous outer iteration were chosen as initial guesses for the local problems to reduce the number of inner iterations.
5.1 Image denoising
We present the results of numerical experiments for the - model for image denoising:
[TABLE]
We note that numerical results of the proposed method for ROF model were given in LPP:2019 . In (5.2), is given by
[TABLE]
and its proximity operator can be computed as
[TABLE]
where the shrinkage operator on is defined by replacing and by and in (3.8), respectively.
We use two test images “Cameraman ” and “Boat ” with salt-and-pepper noise (See Fig. 1). PSNR denotes peak signal-to-noise ratio. We set .
Thanks to Proposition 3.4, it is able to compare the proposed method with the existing methods for (5.2) based on the finite difference discretization. The following algorithms are used for our performance evaluation:
- •
ALG1: Proposed method described in Algorithm 1, .
- •
LNP: DDM proposed by Lee, Nam, and Park LNP:2018 with the anisotropic total variation, , , .
- •
CP: Primal-dual algorithm proposed by Chambolle and Pock CP:2011 with the anisotropic total variation, , .
In the numerical experiments, the following stopping criterion is used for outer iterations:
[TABLE]
and the following ones are used for inner iterations of ALG1 and LNP, respectively:
[TABLE]
Fig. 1 shows the denoised images obtained by ALG1 with and CP (). In order to highlight the efficiency of the proposed method as a parallel solver, Table 1 shows the performance of the methods with the varying number of subdomains. In addition, to compare the convergence rate of the proposed method with existing algorithms, we present Fig. 2 which shows decay of for 1000 iterations of three algorithms ALG1, LNP, and CP, where the minimum primal energy is computed approximately by iterations of the primal-dual algorithm.
5.2 Image inpainting
We first consider the inpainting model with the fidelity term:
[TABLE]
Here, : is the restriction operator onto , so that its matrix representation is a diagonal matrix whose diagonal entries are either [math] or . Thus, computation of the proximity operator of is as easy as the case of the denoising problem. Indeed, we have
[TABLE]
Two test images “Cameraman ” and “Boat ” corrupted by additive Gaussian noise with mean [math] and variance and a text mask are used for numerical experiments (See Fig. 3). The model parameter is set as . Table 2 shows the computation results of the proposed method for the varying number of subdomains. Stop conditions (5.3) and (5.4a) are used for outer iterations and inner iterations of ALG1, respectively. Fig. 3 shows the resulting images of ALG1 with and CP. Fig. 4 shows decay of for two algorithms ALG1 and CP. Here, is computed by iterations of the primal-dual algorithm.
Now, we consider the following inpainting model:
[TABLE]
Similarly to the inpainting problem, the proximity operator of is given by
[TABLE]
Test images are corrupted by salt-and-pepper noise and the same text mask as the inpainting problem (See Fig. 5). We use as the model parameter. Numerical results of ALG1, LNP, and CP for (5.6) are given in Table 3. Both ALG1 and LNP uses (5.3) as the stopping criterion for outer iterations. Stopping criteria for local problems are given in (5.4a) and (5.4b) for ALG1 and LNP, respectively. The recovered images obtained by ALG1 and CP are given in Fig. 5. Fig. 6 shows decay of the value of for three algorithms ALG1, LNP, and CP.
5.3 Image segmentation
As we mentioned in Sect. 3, the convex Chan–Vese model for the image segmentation model is represented as
[TABLE]
where . The proximity operator of is computed as
[TABLE]
As shown in Fig. 7, we use a test image “Cameraman .” We set the model parameters , , and heuristically. Also, to convert the results to binary functions, we use a threshold parameter . Numerical results of ALG1, DCT, and CP for (5.7) are presented in Table 4, where DCT denotes the following algorithm:
- •
DCT: DDM proposed by Duan, Chang, and Tai DCT:2016 with the anisotropic total variation, , , .
Both ALG1 and DCT uses (5.3) as a stop condition for outer iterations. For stop conditions for local problems, ALG1 uses (5.4a) while DCT uses (5.4b). The segmentation results obtained by ALG1 and CP are provided in Fig. 7. Fig. 8 presents the energy decay of several algorithms containing the proposed method for (5.7). Since the minimum primal energy is negative in this case, we plot the value of , where is computed by iterations of the primal-dual algorithm.
5.4 Discussion
As shown in Figs. 1, 3, 5, and 7, the results of the full dimension problem and the proposed method are not visually distinguishable. Note that the resulting images of the proposed method show no trace of the subdomain interfaces. In Tables 1–4, one can observe that solutions for different values of have different PSNRs. It means that the algorithms converge to different solutions. This is because (1.2) admits nonunique solutions in general. However, as the stopping criterion (5.3) indicates, their primal energies tend to the minimum.
In Tables 1–4, one can see that the number of outer iterations monotonically increases as the number of subdomains increases. We also observe that the numbers of maximum inner iterations of the proposed method are much smaller than the numbers of iterations of the full dimension problem. The reason is that we utilize more accelerated solvers for the local problems than standard ones. Reduction of the numbers of inner iterations makes the proposed method faster.
For every problem, we see that the wall-clock time is decreasing as the number of subdomains grows. In particular, large scale images such as “Cameraman ” and “Boat ” can be processed in a minute with sufficiently many subdomains, while it takes quite a long time with a single domain. This shows efficiency of the proposed method as a parallel solver for image processing.
Finally, as shown in Figs. 2, 4, 6, and 8, the proposed method shows good performance in terms of the decay rate of the primal energy . To be more precise, the primal energy of the proposed method decreases much faster than the one of CP for the full-dimension problem. The proposed method also outperforms LNP which is a recently developed DDM for - problems. Even more, for the segmentation problem, the convergence rate of the proposed method seems to be faster than . However, a theoretical evidence for such fast convergence is still missing. It is observed in Figs. 2, 4, 6, and 8 that the primal energy of the proposed method becomes stagnant when the relative error is sufficiently small. This is due to that local problems are solved inexactly by iterative methods in each iteration. Such stagnation of the primal energy is not problematic in practice because the quality of the recovered image becomes acceptable enough before the stagnation starts. Meanwhile, in terms of wall-clock time, the proposed method outperforms LNP for both denoising and inpainting problems while it shows the similar performance to DCT for the segmentation problem.
6 Conclusion
In this paper, we generalized the primal-dual DDM for the ROF model proposed in LPP:2019 to more general total variation minimization problem. The Fenchel–Rockafellar dual of the model problem was considered. We constructed the constrained minimization problem which has domain decomposition structure and is equivalent to the dual model problem. The constrained minimization problem was converted to the equivalent saddle point problem by the method of Lagrange multipliers. The resulting saddle point problem was solved by the first order primal-dual algorithm and convergence of the dual solution was guaranteed. We also proved convergence of the primal solution. Numerical results showed that the proposed method is superior to the existing methods in the sense of convergence rate, and is much faster with sufficiently many subdomains than the full dimension problem.
Even though the proposed DDM is applicable for various total variation regularized problems, we point out that the proposed method is not appropriate for the image deconvolution problem. We will develop a DDM with similar strategy which is applicable for the image deconvolution problem later.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1(1) Bartels, S.: Total variation minimization with finite elements: convergence and iterative solution. SIAM J. Numer. Anal. 50 (3), 1162–1180 (2012)
- 2(2) Chambolle, A., Levine, S.E., Lucier, B.J.: An upwind finite-difference method for total variation-based image smoothing. SIAM J. Imaging Sci. 4 (1), 277–299 (2011)
- 3(3) Chambolle, A., Pock, T.: A first-order primal-dual algorithm for convex problems with applications to imaging. J. Math. Imaging Vis. 40 (1), 120–145 (2011)
- 4(4) Chambolle, A., Pock, T.: An introduction to continuous optimization for imaging. Acta Numer. 25 , 161–319 (2016)
- 5(5) Chan, T.F., Esedoglu, S.: Aspects of total variation regularized L 1 superscript 𝐿 1 {L}^{1} function approximation. SIAM J. Appl. Math. 65 (5), 1817–1837 (2005)
- 6(6) Chan, T.F., Esedoglu, S., Nikolova, M.:Algorithms for finding global minimizers of image segmentation and denoising models. SIAM J. Appl. Math. 66 (5), 1632–1648 (2006)
- 7(7) Chan, T.F., Wong, C.-K.: Total variation blind deconvolution. IEEE Trans. Image Process. 7 (3), 370–375 (1998)
- 8(8) Chang, H., Tai, X.-C., Wang, L.-L., Yang, D.: Convergence rate of overlapping domain decomposition methods for the Rudin-Osher-Fatemi model based on a dual formulation. SIAM J. Imaging Sci. 8 (1), 564–591 (2015)
