Low-Rank Regularized Convex-Non-Convex Problems for Image Segmentation or Completion
Mohamed El Guide, Anas El Hachimi, Khalide Jbilou, Lothar Reichel

TL;DR
This paper introduces a new convex-non-convex formulation for image segmentation and completion that combines low-rank and smoothness regularizations, solved efficiently with ADMM and validated through numerical experiments.
Contribution
It presents a novel formulation integrating low-rank and smoothness regularizations for image tasks, with convergence analysis and empirical validation.
Findings
Effective in image segmentation and completion tasks.
Convergence of the ADMM algorithm is established.
Numerical experiments demonstrate superior performance.
Abstract
This work proposes a novel convex-non-convex formulation of the image segmentation and the image completion problems. The proposed approach is based on the minimization of a functional involving two distinct regularization terms: one promotes low-rank structure in the solution, while the other one enforces smoothness. To solve the resulting optimization problem, we employ the alternating direction method of multipliers (ADMM). A detailed convergence analysis of the algorithm is provided, and the performance of the methods is demonstrated through a series of numerical experiments.
| Data | Method | CNC | ATCG-TV | LR-CNC | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SR | PSNR | CPU-time | Iter | PSNR | CPU-time | Iter | PSNR | CPU-time | Iter | |
| MRI | 0.1 | 21.22 | 20.22 | 730 | 18.44 | 8.20 | 968 | 21.36 | 4.54 | 174 |
| 0.2 | 23.14 | 5.68 | 355 | 21.96 | 7.13 | 782 | 23.37 | 2.69 | 98 | |
| 0.3 | 25.01 | 4.00 | 262 | 24.37 | 8.56 | 889 | 25.32 | 1.65 | 60 | |
| Cameraman | 0.1 | 21.34 | 11.53 | 757 | 19.31 | 6.58 | 770 | 21.52 | 5.09 | 184 |
| 0.2 | 23.11 | 5.52 | 361 | 22.11 | 7.31 | 785 | 23.27 | 2.64 | 91 | |
| 0.3 | 24.33 | 3.35 | 220 | 24.08 | 6.90 | 755 | 24.70 | 1.66 | 58 | |
| Data | Method | CNC | ATCG-TV | LR-CNC | ||||||
|---|---|---|---|---|---|---|---|---|---|---|
| SR | PSNR | CPU-time | Iter | PSNR | CPU-time | Iter | PSNR | CPU-time | Iter | |
| Airplane | 0.1 | 22.26 | 56.64 | 714 | 18.64 | 19.24 | 736 | 22.47 | 4.14 | 78 |
| 0.2 | 24.39 | 13.48 | 355 | 23.04 | 20.90 | 736 | 24.53 | 2.11 | 38 | |
| 0.3 | 25.86 | 9.56 | 212 | 25.21 | 22.42 | 753 | 26.13 | 2.05 | 37 | |
| Barbara | 0.1 | 23.11 | 24.30 | 722 | 20.73 | 20.78 | 849 | 23.24 | 3.63 | 71 |
| 0.2 | 25.19 | 13.14 | 346 | 24.42 | 24.59 | 814 | 25.37 | 2.39 | 41 | |
| 0.3 | 26.76 | 8.47 | 217 | 26.63 | 22.32 | 769 | 26.89 | 2.00 | 36 | |
| method | CNC | ATCG-TV | Chan Vese [18] | LR-CNC | |||||
|---|---|---|---|---|---|---|---|---|---|
| PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | ||
| MRI | 0.01 | 24.94 | 0.42 | 24.37 | 0.49 | 24.17 | 0.64 | 25.12 | 0.49 |
| 0.1 | 18.73 | 0.33 | 18.11 | 0.37 | 18.71 | 0.37 | 20.50 | 0.38 | |
| 0.2 | 13.78 | 0.29 | 13.80 | 0.31 | 13.71 | 0.32 | 15.43 | 0.35 | |
| 0.3 | 10.62 | 0.27 | 10.63 | 0.26 | 10.54 | 0.27 | 12.03 | 0.31 | |
| Millennium | 0.01 | 22.74 | 0.71 | 22.40 | 0.69 | 18.94 | 0.37 | 22.40 | 0.69 |
| 0.1 | 18.14 | 0.62 | 16.20 | 0.30 | 16.66 | 0.30 | 20.30 | 0.65 | |
| 0.2 | 13.34 | 0.49 | 12.21 | 0.26 | 13.09 | 0.24 | 18.29 | 0.59 | |
| 0.3 | 10.10 | 0.40 | 9.15 | 0.22 | 10.11 | 0.20 | 13.60 | 0.51 | |
| method | CNC | ATCG-TV | Chan Vese [18] | LR-CNC | |||||
|---|---|---|---|---|---|---|---|---|---|
| PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | PSNR | SSIM | ||
| Samson | 0.01 | 26.87 | 0.51 | 29.04 | 0.79 | 29.61 | 0.70 | 28.02 | 0.75 |
| 0.1 | 18.81 | 0.30 | 19.54 | 0.60 | 19.66 | 0.52 | 21.38 | 0.63 | |
| 0.2 | 13.48 | 0.18 | 13.91 | 0.47 | 13.93 | 0.41 | 18.22 | 0.53 | |
| 0.3 | 10.16 | 0.13 | 10.45 | 0.39 | 10.46 | 0.34 | 15.66 | 0.51 | |
| Jasper | 0.01 | 24.88 | 0.51 | 25.28 | 0.60 | 26.16 | 0.62 | 27.01 | 0.68 |
| 0.1 | 18.40 | 0.39 | 18.96 | 0.50 | 19.30 | 0.49 | 22.54 | 0.57 | |
| 0.2 | 13.44 | 0.32 | 13.74 | 0.42 | 13.87 | 0.40 | 20.54 | 0.48 | |
| 0.3 | 10.17 | 0.26 | 10.35 | 0.36 | 10.44 | 0.35 | 18.62 | 0.43 | |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSparse and Compressive Sensing Techniques · Medical Image Segmentation Techniques · Numerical methods in inverse problems
Low-Rank Regularized Convex-Non-Convex Problems for Image Segmentation or
Completion
M. El Guide Mohammed VI Polytechnic University, Rabat, Morocco. [email protected]
A. El Hachimi33footnotemark: 3 The UM6P Vanguard center, Mohammed VI Polytechnic University, Rabat, Morocco. [email protected]
K. Jbilou Université du Littoral, Côte d’Opale, batiment H. Poincarré, 50 rue F. Buisson, F-62280 Calais Cedex, France. [email protected]
L. Reichel Department of Mathematical Sciences, Kent State University, Kent 44242, Ohio USA. [email protected]
Abstract
This work proposes a novel convex-non-convex formulation of the image segmentation and the image completion problems. The proposed approach is based on the minimization of a functional involving two distinct regularization terms: one promotes low-rank structure in the solution, while the other one enforces smoothness. To solve the resulting optimization problem, we employ the alternating direction method of multipliers (ADMM). A detailed convergence analysis of the algorithm is provided, and the performance of the methods is demonstrated through a series of numerical experiments.
keywords:
Convex-non-convex problem, image completion, image segmentation, low-rank approximation, matrix nuclear normregularization.
1 Introduction
Image segmentation and completion are fundamental computational tasks in computer vision and data analysis with many applications in medical and hyperspectral imaging, as well as in remote sensing. These tasks arise when reconstructing or segmenting images with missing or occluded pixels. Existing approaches, including variational methods [38] and deep learning techniques [33] have been applied to solve these tasks under specific circumstances. However, these approaches have limitations, such as sensitivity of the computed results to noise in the given data and high computational cost.
Recovery of meaningful information from corrupted or incomplete data is a long-standing problem in image processing. For example, in medical imaging, modalities such as Magnetic Resonance Imaging (MRI) and Computerized Tomography (CT) frequently yield data that suffer from noise contamination or incomplete acquisition due to hardware limitations or patient constraints [21, 28]. Similarly, hyperspectral imaging, known for its ability to capture detailed spectral information, often suffers from missing data values because of sensor malfunction [19, 40]. These difficulties make image recovery and segmentation indispensable for accurate data analysis, diagnosis, and decision-making. Traditional approaches to deal with these issues include the use of i) regularization, e.g., Total Variation (TV) regularization [2, 4, 5], ii) low-rank matrix or tensor factorization [3, 19], and iii) variational formulations such as the Mumford-Shah model [35]. The latter has inspired many variations, including the two-stage Mumford-Shah model by Cai et al. [15] and the Convex-Non-Convex (CNC) model proposed by Chan et al. [17]. Despite their effectiveness, these approaches face limitations, such as high computational complexity, sensitivity to parameter settings, and difficulties in preserving fine details in high-dimensional data.
Image segmentation involves partitioning an image into meaningful regions based on certain properties of the image such as intensity, texture, or spectral information. Classical techniques, including active contour methods [18] and region-growing algorithms [27], have been studied extensively. Variational approaches, such as the Mumford-Shah model [35] and its convex variants [15], have advanced the field by providing robust mathematical formulations. More recently, the hybrid convex-non-convex framework has gained prominence due to its ability to balance computational feasibility with modeling flexibility; see Chan et al. [17]. We remark that traditional convex formulations, such as those based on the nuclear norm, are robust but can be overly restrictive and may give suboptimal solutions. Conversely, purely non-convex approaches, while offering greater flexibility, often lack stability and scalability; see, e.g., Lu et al. [34].
To overcome these limitations, this paper introduces a hybrid convex-non-convex optimization framework for image segmentation and completion. By combining the robustness of convex optimization and the flexibility of non-convex regularization, the proposed methods strike a balance between accuracy and computational efficiency. The methods use regularization that promotes matrix solutions of low rank. This has emerged as a powerful technique for image recovery as well as for image segmentation. Low-rank matrix solutions can be represented with reduced degrees of freedom; see Bell et al. [3]. We seek to determine low-rank matrix solutions by regularization with the nuclear matrix norm, which is a convex surrogate of the matrix rank function [16]. However, while the use of the nuclear norm ensures mathematical tractability, the solution of minimization problems that involve the nuclear matrix norm can be very demanding computationally for large-scale problems. To address this issue, the Alternating Direction Method of Multipliers (ADMM) is frequently employed; see, e.g., [12, 29, 32, 36]. The ability of ADMM to decompose complex problems into simple subproblems has made it a popular method for solving a variety of optimization problems in image processing. Computed examples presented in Section 4 show the proposed methods to be competitive with available methods both in terms of accuracy and computing time.
We formulate image completion and image segmentation within a unified framework. Let represent an observed image. For image completion, the matrix has zero or blank entries at locations for which no data is available, and for image segmentation the matrix represents a fully observed noisy image. The desired solution of many image processing problems is a piecewise smooth image. Therefore, many solution methods use regularization that promotes piecewise smoothness of the computed solution. For instance, this can be achieved with Total Variation (TV) regularization; see [4, 6, 7]. We obtain a minimization problem of the form
[TABLE]
where denotes a total variation regularization functional, stands for the Frobenius norm, and is a regularization parameter that balances the influence of the terms in (1) on the solution. Chan et al. [17] proposed the use of the total variation functional
[TABLE]
where
[TABLE]
The parameters and have to be chosen by a user: is used to tune the degree of non-convexity of the regularization functional as discussed below, while is chosen to emphasize interesting features of the image. In image segmentation, is used to indicate which entries of the image should not be considered boundary points of the segmented regions of the images. The chosen form behaves like a quadratic smoothing term for small gradients but transitions to a flat response for large gradients, thereby preserving edges, as proposed by Chan et al. [17]. We comment on the choices of , , and below; see also Chan et al. [17] for a discussion on how to choose these parameters.
Following Chan et al. [17], we define a gradient-type operator by
[TABLE]
where the superscript T stands for transposition,
[TABLE]
and and are the circulant matrices
[TABLE]
The function in (2) satisfies:
is continuously differentiable for . 2. 2.
is twice continuously differentiable for . 3. 3.
is convex and monotonically increasing for . 4. 4.
concave and monotonically non-decreasing for . 5. 5.
is constant for . 6. 6.
.
We would like to determine an approximate solution of (1) of low rank. Recovery of data so that the computed solution or correction of an available approximate solution are of low rank has received considerable attention; see, e.g., [19, 21, 31, 38, 40]. Reasons for this include that the imposition of a low-rank constraint may be meaningful in the model considered, may yield a computed solution of higher quality, or may be easier to interpret. We therefore modify the minimization problem (1) to obtain
[TABLE]
where is user-specified positive integer.
However, the solution of the minimization problem (3) is NP-hard. It therefore is impractical to solve (3), except when all matrices involved are very small. To circumvent this difficulty, we replace the rank function by its convexification, which is the matrix nuclear norm; see [3, 13, 16] for discussions on this replacement. We solve the minimization so obtained by ADMM.
In summary, the contributions of this work are: (i) a new unified model for image completion and segmentation that promotes both low-rank structure and piecewise smoothness, (ii) an efficient ADMM-based algorithm with proven convergence properties, and (iii) a comprehensive experimental evaluation for a variety of images (grayscale, color, hyperspectral) that demonstrate improved accuracy and speed compared to existing methods.
This paper is organized as follows. Section 2 discusses the solution of the optimization problem (3) with the rank function replaced by the nuclear norm, and Section 3 is concerned with the convergence properties of the proposed solution method. Numerical examples that illustrate the performance of the solution methods are reported in Section 4. Concluding remarks can be found in Section 5.
It is a pleasure to dedicate this paper to Åke Björck and Lars Eldén, who made profound contributions to Numerical Linear Algebra and pioneered the development of numerical methods for the solution linear discrete ill-posed problems; see, e.g., [8, 9, 10, 11, 23, 24, 25, 26].
2 The solution method
This section describes the proposed solution method. We convexify the rank-regularized problem (3) by using the nuclear norm surrogate, and derive an efficient algorithm based on ADMM to solve the resulting problem.
Let the matrix have the singular values . The nuclear norm of is defined as
[TABLE]
The matrix nuclear norm is continuous, convex, and coercive. Its subdifferential is given by
[TABLE]
Here and throughout this paper denotes the spectral matrix norm or the Euclidean vector norm.
Replacing the rank constraint in the optimization problem (3) by the matrix nuclear norm gives the minimization problem
[TABLE]
We introduce some auxiliary variables to obtain an equivalent minimization problem that we will solve:
[TABLE]
such that and . Here, captures the discrete gradient of (so that the regularizer can be written in terms of ), and is introduced to facilitate the nuclear norm term. These substitutions allow splitting the terms of the objective function in the ADMM framework and enable more efficient optimization. In detail, we define the differential operator , and express the auxiliary variable , where each matrix entry captures the local gradient information. We propose to solve the resulting optimization problem using the ADMM; see, e.g., [12, 29, 32, 36] for discussions of this method. The augmented Lagrangian is given by
[TABLE]
where and are Lagrangian parameters and and are Lagrangian multipliers. The inner product of and is defined as
[TABLE]
In particular, .
At the st iteration of ADMM, we solve the following subproblems
[TABLE]
The remainder of this section discusses the solution of these subproblems.
2.1 Solution of subproblem (7)
We obtain from (7) that
[TABLE]
which is equivalent to
[TABLE]
The solution satisfies
[TABLE]
where is the adjoint of the gradient operator, i.e., the divergence operator. Let
[TABLE]
We then can write equation (12) as
[TABLE]
which is equivalent to
[TABLE]
Since the matrices and are circulants, they are diagonalizable by Discrete Fourier Transform (DFT) matrices, i.e.,
[TABLE]
where and denote DFT matrices of sizes and , respectively, and the superscript H stands for transposition and complex conjugation. The diagonal entries of the diagonal matrices and are the eigenvalues of and , respectively. Let denote the Kronecker product and let the operator vec stack all entries of a matrix column by column to give a vector. We obtain
[TABLE]
The parameters and are positive and the matrices and are invertible. Therefore, the matrix is invertible. It follows that can be expressed as
[TABLE]
2.2 Solving subproblem (8)
The matrix satisfies
[TABLE]
which can be written as
[TABLE]
The right-hand side defines a proximal operator of the matrix nuclear norm. It can be shown that since , the iterates converge to a unique solution as ; see [14]. The iterate can be expressed with the aid of Singular Value Thresholding (SVT) of the matrix
[TABLE]
Let denotes the singular value decomposition of the matrix (14). Thus, and are orthogonal matrices and is a diagonal matrix, whose nontrivial entries are the singular values. Then
[TABLE]
where, elementwise, .
2.3 Solving subproblem (9)
Let . Then this subproblem can be written as
[TABLE]
This expression can can be simplified to
[TABLE]
Following the discussion by Chan et al. [17], we find that the minimizer is given by
[TABLE]
where
[TABLE]
and
[TABLE]
with
[TABLE]
Chan et al. [17] show that the cost function for the minimization problem (16) is strongly convex (convex) if and only if ().
We are in a position to discuss the solution of the minimization problem (4). The following algorithm outlines the solution process.
ADMM is guaranteed to converge to a global minimum of the convex problem (5); in our non-convex formulation, we show in Section 3 that every limit point of the sequence generated by Algorithm 1 is a stationary point of (5).
A numerically practical way to compute the solution is to first apply the MATLAB function psf2otf to the matrix
[TABLE]
This MATLAB function is used to compute the fast Fourier transform of the matrix (considered as a Point-Spread Function (PSF) array) and creates the Optical Transfer Function (OTF) array. Using MATLAB notation, can be computed as
[TABLE]
Generally, the matrix is easy to compute. The main computational work required by Algorithm 1 is the computation of the matrix as this requires the evaluation of a singular value decomposition. The matrices in the computed examples of Section 4 are small enough to make this feasible. For very large matrices, we can compute a partial singular value decomposition that is made up of all singular triplets with singular values larger than . These singular triples can be computed, e.g., with the MATLAB function svds, which implements the method described in [1].
3 Convergence analysis
This section investigates the convergence of Algorithm 1. We establish that the sequence of the generated iterates has limit points and any accumulation point is a solution (or stationary point) of the minimization problem. Let
[TABLE]
where
[TABLE]
The functionals , , and are referred to as the fidelity, low-rank, and regularization functionals, respectively. Consider the saddle point problem of determining a quintuple such that
[TABLE]
Chan et al. [17, Lemma 3.1] show the following result.
Lemma 1**.**
The functional defined by
[TABLE]
is strictly convex if and only if the function given by
[TABLE]
is strictly convex.
Lemma 2**.**
The functional is strictly convex if and only if the function is strictly convex.
Proof.
The functional can be written as
[TABLE]
The nuclear norm function is convex. Therefore, it does not affect the strict convexity. This shows the lemma. ∎
Corollary 3**.**
The functional is strictly convex if and only if is strictly convex.
The next theorem furnishes conditions that secure strict convexity of .
Theorem 4**.**
A sufficient condition for the functional to be strictly convex is that the pair satisfies
[TABLE]
Proof.
The theorem can be shown similarly as [17, Theorem 3.5] by using Corollary 3. ∎
A function is said to be -strongly convex if there is a constant such that is convex.
Proposition 5**.**
The functional is proper, continuous, bounded from below by zero, coercive, and -strongly convex, where
[TABLE]
Proof.
Chan et al. [17, Proposition 3.7] show that the functional is proper, continuous, bounded from below by zero, coercive, and -strongly convex, with given by (22). The nuclear norm function satisfies
[TABLE]
The representations (20) and (23) show that the functional is continuous, bounded below by zero, and strongly -convex. ∎
Lemma 6**.**
Let the pair of parameters satisfy (21). Then the functional , the regularization term , the quadratic fidelity term , and the functional in (18) are locally Lipschitz continuous functions.
Proof.
Chan et al. [17] show that the functions and are locally Lipschitz continuous. The lemma follows from the observation that the function also is locally Lipschitz continuous. ∎
Proposition 7**.**
For any pair of parameters that satisfies (21), the functional has a unique (global) minimizer that satisfies
[TABLE]
where [math] denotes the null matrix of size and stands for the subdifferential of . Moreover,
[TABLE]
where denotes the Clarke generalized gradient [20] of the (non-convex and non-smooth) regularization function , and is the subdifferential of the nuclear norm function at .
Proof.
The functional is strongly convex when (21) holds. Therefore, has a unique minimizer and (24) follows from the generalized Fermat’s rule [37].
The concept of a generalized gradient extends the concept of a subdifferential for non-smooth convex functions to non-smooth non-convex functions that are locally Lipschitz. According to Lemma 6, the functionals , , , and are locally Lipschitz. Therefore, the generalized gradient is defined.
For non-smooth and convex functions, the Clarke generalized gradient equals the subdifferential, i.e.,
[TABLE]
Hence,
[TABLE]
which gives
[TABLE]
because in the case of a continuously differentiable function, the generalized gradient reduces to the gradient. ∎
Proposition 8**.**
Let be the fidelity function, and let , , , , , and . Then the function
[TABLE]
is convex if
[TABLE]
This inequality also is necessary if the inequality is to hold for all .
Proof.
In order for the function to be convex, its Hessian has to be positive semidefinite. Let . The Hessian is given by
[TABLE]
where denotes the identity matrix of order . The matrix can be written as
[TABLE]
where . It can be shown by a simple calculation that . Let . Then the Hessian can be expressed as
[TABLE]
Since the matrix is symmetric and positive semidefinite, it has the spectral factorization
[TABLE]
with eigenvalues and orthogonal eigenvector matrix . In order for
[TABLE]
to be positive semidefinite, , , and must satisfy
[TABLE]
Since
[TABLE]
we obtain by using Gershgorin discs that for all . It follows that convexity is secured if (25) holds. The largest eigenvalue converges to as increases. Therefore (25) is necessary if we would like to determine independently of and . ∎
Proposition 9**.**
For any Lagrangian multipliers and , we have that the augmented Lagrangian functional is proper, continuous, and coercive jointly in the primal variables . Moreover, is jointly convex in if the penalty parameters and satisfy
[TABLE]
where
[TABLE]
The relations (26) are meaningful if and .
Proof.
It is clear that the function is proper and continuous. To show that is coercive jointly in , we note that
[TABLE]
Since is bounded and the terms and are independent of , it follows that and these terms do not affect the coercivity. The expression
[TABLE]
is coercive with respect to , the expression
[TABLE]
is coercive with respect to , and the expression
[TABLE]
is coercive with respect to . Therefore, is jointly coercive with respect to .
To show convexity, we express as
[TABLE]
Define the functions
[TABLE]
and note that can be written as
[TABLE]
The functional is convex if
[TABLE]
and is convex if the inequality
[TABLE]
holds. Since the nuclear norm is convex, the functional is convex when . In view of that the functional is affine with respect to , , and , it does not affect the convexity of . Finally, the functional is jointly convex if it can be reduced to the form
[TABLE]
Thus, we require the coefficients of , , , and be positive, that the product of the square roots of and equal the coefficient of , and that the product of the square roots of and equal the coefficient of . This implies that
[TABLE]
Thus, and should satisfy
[TABLE]
and the quadruple should satisfy (27). Moreover, the set is non-empty. ∎
Proposition 9 shows that we can determine parameters and such that the functional is jointly convex in . Since
[TABLE]
it follows from (26) and (27) that there are parameters such that
[TABLE]
Assume that . Then condition (28) yields
[TABLE]
and
[TABLE]
Thus,
[TABLE]
Therefore, and satisfy the inequalities
[TABLE]
Lemma 10**.**
Assume that , where and are lower semi-continuous convex functions from to , and is Gateau differentiable with derivative . If , then the following conditions are equivalent:
, 2. 2.
, .
Moreover, when the function has a separable structure of the type
[TABLE]
where are independent variables, conditions and are equivalent to
[TABLE]
Proof.
The proposition can be shown similarly as in [17]. ∎
Theorem 11**.**
For any pair of parameters that satisfies condition (21), any penalty parameters and that satisfy (26), and any Lagrange multipliers and , the augmented Lagrangian functional satisfies
[TABLE]
The saddle point problem (19) has at least one solution, and all solutions are of the form , where is the unique global minimizer of the functional .
Proof.
It follows from the proof of Proposition 9 that there is a quadruple such that the functional
[TABLE]
is proper, continuous, coercive, and convex jointly in the variables . We can express as
[TABLE]
where
[TABLE]
with
[TABLE]
The functions and are semi-continuous and convex. Moreover, is Gatteau differentiable. Therefore, Lemma 10 yields the desired result. ∎
Theorem 12**.**
Let the parameters satisfy (21) and let the Lagrangian parameters satisfy (26). Then the saddle point problem (19) admits at least one solution, and all the solutions have the form , where denotes the unique minimizer of (3). Furthermore, and .
Proof.
The proof is similar to the proof presented in [17]. ∎
Let solve the saddle point problem (19). Then
[TABLE]
where , , and are restrictions of the functional to , , and , respectively. These functionals can be written as
[TABLE]
In order to apply Lemma 10 to the functionals , , and , we have to verify that the first and second parts of each functional are semi-continuous and convex, and that the second part is Gateau differentiable. For this to hold, the parameters have to satisfy the conditions
[TABLE]
We obtain
[TABLE]
In the following, let be a solution of the saddle point problem (19). Introduce the variables
[TABLE]
where , , is the sequence generated by Algorithm 1. The following propositions are important for showing convergence of this sequence to the solution of the saddle point of (19).
Proposition 13**.**
For any parameter pairs that satisfies (21), and parameter pairs such that
[TABLE]
we have
[TABLE]
for certain values of and , and for specific values of to be determined.
See Appendix A for a proof.
Proposition 14**.**
Let satisfy (35) and let and satisfy (48) and (47), respectively. Assume that the coefficients satisfy (49). Then
[TABLE]
A proof is provided in Appendix B.
The following theorem gives a condition on and so that the sequence generated by Algorithm (1) converges to the solution of the saddle point problem (19).
Theorem 15**.**
*Assume that is a solution of a saddle point problem and let the parameter pair satisfy (21). Then for any parameters such that (35) holds, the sequence
generated by Algorithm 1 satisfies*
[TABLE]
See Appendix C for a proof.
Theorem 15 implies that the iterates generated by Algorithm 1 converge to a stationary solution of (5) when increases. In other words, the proposed method is guaranteed to converge under the stated conditions. The following section illustrates the effectiveness of this algorithm when applied to various image completion and segmentation tasks.
4 Numerical examples
We compare the LR-CNC algorithm (Algorithm 1) to several available methods for image completion and segmentation, including the Convex-Non-Convex (CNC) segmentation method by Chan et al. [17], the ATCG-TV image restoration algorithm proposed by Benchettou et al. [4] (a total variation-based image reconstruction method using an alternating conditional gradient scheme), and the classical Chan-Vese segmentation method [18]. We also apply a standard K-means clustering [30] to determine segmentations of reconstructed images.
The iterations with Algorithm 1 are terminated as soon as two consecutive approximations are sufficiently close. Specifically, we terminate the iterations when
[TABLE]
where tol is a user-chosen tolerance. In all the experiments, we set . We report the Peak Signal-to-Noise Ratio (PSNR)
[TABLE]
where and denote arrays that represent the original and the recovered data, respectively, and is the maximum squared entry (pixel-value) of the array .
When segmenting images, we also compute the Structural Similarity Index Measure (SSIM) to determine the closeness of the recovered and the original image. The definition of this index is somewhat involved, see [39], and we do not provide it here. We just recall that a larger SSIM-value corresponds to a more accurate reconstruction and the maximum value is 1. All computations were carried out on a laptop computer equipped with a 2.3 GHz Intel Core i5 processor and 8 GB of memory using MATLAB 2023.
For the numerical experiments, we need to fix several parameters: , , , , and . Based on the equations (21), we let , and (35) shows that
[TABLE]
with and . Thus, we only need to set the parameters , , , , , , and .
4.1 Image completion
This subsection compares the performance of Algorithm 1 to that of the recently proposed algorithms ATCG-TV by Benchettou et al. [4] and CNC by Chan et al. [17]. This subsection is divided into two parts: the first part is concerned with the gray scale images, while the second part considers color images.
In this subsection we use the sampling rate, which is defined as
[TABLE]
where is the number of the pixels of the data, and is the number of missed entries. For all the experiments in this subsection, we set , , , , , , and .
4.1.1 Gray scale images
We illustrate the performance of Algorithm 1 when applied to the well-known MRI and cameraman images. Figure 1 shows results of completion for the MRI image determined by Algorithm 1, the CNC algorithm [17], and the ATCG-TV algorithm [4] for missing data (SR). Table 1 reports PSNR-values, CPU-times (in seconds), and the number of iterations (Iter) required by each algorithm to satisfy the stopping criteria. The stopping criteria for the ATCG-TV and CNC algorithms were the default choices provided in [4] and [17], respectively.
Figure 1 and Table 1 show that for small SR-values, the LR-CNC algorithm outperforms the ATCG-TV algorithm. However, for larger SR-values, the difference in performance between the two algorithms is less significant. The LR-CNC algorithm demonstrates a clear advantage in terms of CPU time and the number of iterations required in comparison with both the CNC and ATCG-TV algorithms. Figure 2 displays the graphs of the logarithm of the mean square error versus the number of iterations for the LR-CNC, CNC, and ATCG-TV algorithms, as well as the values of
[TABLE]
and the evolution of the PSNR-values as a function of the number of iterations when these algorithms are applied to the MRI image for SR.
Figure 2 shows the relative differences (37) obtained with the LR-CNC algorithm to decrease faster than the corresponding differences for the CNC and ATCG-TV algorithms. Moreover, the PSNR-values of the restorations determined by the LR-CNC algorithm increase quickly and smoothly with the iteration number. In fact, they increase much faster and in a smoother manner with the iteration number than for the CNC and ATCG-TV algorithms.
4.1.2 Color images
We apply the LR-CNC, CNC, and ATCG-TV algorithms to the restoration of color images. In our experiments, we use the well-known Airplane and Barbara images, both of which are represented by arrays of size . Thus, the red, green, and blue channels are each represented by a tensor slice. These images are reshaped into matrices of size by using the MATLAB command .
Figure 3 displays results for the color images Airplane and Barbara with missing data when reconstructed by the CNC, ATCG-TV, and LR-CNC algorithms.
Table 2 displays the PSNR-values, CPU-time, and the number of iterations required by the CNC, ATCG-TV, and LR-CNC algorithms to satisfy the stopping criterion. The computations are carried out for the Airplane and Barbara images for several SR-values. Figure 4 depicts the relative differences (38) and the PSNR-values as functions of the iteration number for the CNC, ATCG-TV, and LR-CNC algorithms.
The above results show the LR-CNC algorithm to consistently produce images with higher PSNR/SSIM values than the other algorithms in our comparison, especially in challenging scenarios (very low sampling ratio or high noise levels). Qualitatively, the LR-CNC algorithm gives reconstructions that retain details and segments images into regions more distinctly than the other algorithms; see Figures 3 and 4. An advantage of the LR-CNC algorithm is its fast convergence: the algorithm achieves accurate results in far fewer iterations (2) than the other algorithms due to the efficiency of low-rank regularization and our ADMM scheme.
4.2 Image segmentation
This subsection applies the LR-CNC algorithm (Algorithm 1) to image segmentation. We compare the performance of this algorithm to the CNC and ATCG-TV algorithms, as well as to an algorithm proposed by Chan and Vese [18]. In all experiments, we apply these algorithms to images that have been contaminated by Gaussian noise and then use the K-means algorithm [30] to segment the resulting image, with equal to the number of regions in the ground truth. Given a noise-free image ’Im’, we generate a noise-contaminated image with the MATLAB command
[TABLE]
where the parameter specifies the mean of the Gaussian noise; is the variance of the noise. The variance is the default value. We will refer to the value of the parameter as the Gaussian noise level. For all segmentation experiments, we let . If is relatively large, then we set , otherwise we set . Moreover, we let , , , , , and . Code for the algorithm by Chan & Vese [18] is available at 111https://fr.mathworks.com/matlabcentral/fileexchange/34548-active-contour-without-edge.
Figure 5 displays results obtained with the CNC, ATCG-TV, Chan & Vese, and LR-CNC algorithms for Gaussian noise level . These tests are applied to the MRI image with the cluster number set to , and to a Millennium simulation 222https://wwwmpa.mpa-garching.mpg.de/galform/virgo/millennium/ with cluster number .
Table 3 shows PSNR and SSIM values for segmented images determined by the CNC, ATCG-TV, Chan & Vese, and LR-CNC algorithms for several Gaussian noise levels when applied to the MRI and Millennium images. Figures 6 and 7 display the relative differences (38) and PSNR-values of images determined at each iteration of the CNC, ATCG-TV, and LR-CNC algorithms as functions of the iteration number when these algorithms are applied to the MRI and Millennium images. The Gaussian noise level is and we seek to determine and clusters in the MRI and Millenium images, respectively.
Figure 5 shows the segmented images produced by the LR-CNC algorithm to retain details with high precision for both images, also when the amount of noise is large. Furthermore, the PSNR and SSIM values reported in Table 3 indicate that the LR-CNC algorithm consistently exhibits greater robustness than the other algorithm, with particularly strong performance for larger noise levels. Figures 6 and 7 illustrate the smooth and rapid convergence of the iterates determined by the LR-CNC algorithm.
Our last example is concerned with segmentation of hyperspectral images. Hyperspectral images contain more detailed information than RGB images. However, material separation based on hyperspectral images remains a challenging task. It involves isolating materials from one another. Various approaches have been developed to carry out material separation, including non-negative matrix factorization [22].
We consider two hyperspectral images: Samson and Jasper Ridge. Both images are captured by the Airborne Visible/Infrared Imaging Spectrometer AVIRIS. Initially, these images contained 224 spectral bands. After removing the water absorption bands, the Samson image has 156 bands and the Jasper Ridge image has 198 bands. The Samson dataset is a tensor of size , and the Jasper Ridge dataset is a tensor of size . The Samson dataset contains three materials: water, soil, and trees, whereas the Jasper Ridge dataset includes four materials: water, soil, trees, and roads. Figures 8 and 9 show the different materials of each dataset. By using the non-negative tensor factorization described in [22], it can be shown that the abundance of each material is presented by the color between red and yellow.
Figure 10 displays the results of the CNC, Chan & Vese, ATCG-TV, and LR-CNC algorithms applied to the hyperspectral images Samson and Jasper Ridge. Both images are corrupted by Gaussian noise of level . For the Samson image, we represent water in black, trees in gray with an intensity of , and soil in white. For the Jasper Ridge image, water is represented in black, trees in gray with an intensity of , soil in gray with an intensity of , and roads are shown in white. Given the presence of three materials in the Samson test, we use , while for the Jasper Ridge image, we use to represent the four materials.
Figure 9 shows that the LR-CNC method identifies each material more distinctly than the other algorithms. Table 4 displays PSNR- and SSIM-values for the recovered data for the Samson and Jasper Ridge datasets for various Gaussian noise levels. Figures 10 and 11 depict relative differences (38) and PSNR-values at each iteration of the algorithms for the Gaussian noise level for the Samson and Jasper Ridge data sets, respectively.
Table 4 illustrates that, at a low Gaussian noise level of , the performance of all algorithms is comparable, with only slight differences in achieved PSNR- and SSIM-values. However, for noise levels and larger, the LR-CNC algorithm yields segmentations with significantly larger PSNR- and SSIM-values. Furthermore, the graphs of Figures 10 and 11 demonstrate the convergence of the LR-CNC algorithm to be more regular and faster.
5 Conclusion
In this work, we proposed a novel approach to consider convex-non-convex optimization problems with focus on promoting low-rank solutions. By combining low-rank regularization with a non-convex penalty, the method achieves reconstructions that are both accurate and computationally efficient. This is supported by measured quality metrics. The optimization problems are solved with the Alternating Direction Method of Multipliers (ADMM), and we provide an analysis that establishes the convergence properties of the LR-CNC algorithm. Extensive numerical experiments illustrate the effectiveness of this algorithm. The LR-CNC framework balances the robustness of convex models and the flexibility of non-convex formulations. This makes the LR-CNC algorithm a promising tool for image segmentation and completion tasks.
The authors declare that there is no conflict of interest.
Appendix A Proof of Proposition 13
. From Theorem 12, we have and . Therefore,
[TABLE]
Using (6) we obtain the equations
[TABLE]
Adding these equations, we get
[TABLE]
At the th step with , Algorithm 1 yields relations that are analogous to (32), (33), and (34), namely,
[TABLE]
where , and satisfy the relations (31).
Substituting into (32) and into (A), and adding the equations so obtained, we get
[TABLE]
Similarly, we substitute into (33), into (A), into (33), and into (A), and obtain
[TABLE]
[TABLE]
Moreover, addition of (42), (44), and (A) yields
[TABLE]
where .
We would like to show that the third and the fourth terms of the above inequality are equal to, respectively,
[TABLE]
for some nonnegative coefficients . This requires that the coefficients of , , , and are nonnegative, i.e., that
[TABLE]
By also considering the conditions (31) on , and , we obtain
[TABLE]
It follows from the first and the second inequalities that
[TABLE]
We also have the relations
[TABLE]
which give
[TABLE]
We obtain
[TABLE]
Since and , it follows that we can have a solution only if and . Imposing the conditions
[TABLE]
we get
[TABLE]
We now investigate if and admit some solutions. Considering the above two equations and the inequalities (29), we have
[TABLE]
Since and are positive, accepts solutions for every and accepts solutions only if
[TABLE]
Letting
[TABLE]
we find that the third and the fourth terms of (A) can be written as
[TABLE]
with
[TABLE]
It follows that and . We will use these inequalities below.
Inequality (A) can be written as
[TABLE]
which is equivalent to
[TABLE]
This implies
[TABLE]
which gives
[TABLE]
This concludes the proof.
Appendix B Proof of Proposition 14
.
According Proposition 13, we have
[TABLE]
We now seek to determine lower bounds for and . We have
[TABLE]
and
[TABLE]
Moreover,
[TABLE]
From the construction of , we get
[TABLE]
and from the construction of , we obtain
[TABLE]
Replacing with in (B), with in (A), summing the two inequalities, and proceeding similarly when replacing with in (B) and with in (A), yields
[TABLE]
and
[TABLE]
Since
[TABLE]
it follows that
[TABLE]
Therefore,
[TABLE]
and
[TABLE]
Consequently,
[TABLE]
This completes the proof.
Appendix C Proof of Theorem 15
It follows from Proposition 14 that
[TABLE]
with , , and satisfying. respectively, (47), (48), and (49). Thus, we have
[TABLE]
Introduce the sequence
[TABLE]
This sequence is bounded and decreasing, and therefore it converges. This implies that
[TABLE]
converges to [math]. This implies that the sequences
[TABLE]
are bounded and, therefore, the sequences
[TABLE]
also are bounded. Moreover, the sequences
[TABLE]
are bounded. We conclude that
[TABLE]
As we have discussed, , with and . Furthermore, and . We can see that
[TABLE]
and
[TABLE]
Moreover, we have
[TABLE]
Consequently, from the inequalities in (60), (61), and (62), and by using (56) and (58), as well as from (57) and (59), we obtain
[TABLE]
This concludes the proof.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] J. Baglama, L. Reichel, Augmented implicitly restarted Lanczos bidiagonalization methods, SIAM Journal on Scientific Computing, 27, 19–42 (2005).
- 2[2] A. Beck, M. Teboulle, Fast gradient-based algorithms for constrained total variation image denoising and deblurring problems, IEEE Transactions on Image Processing, 18(11) 2419–2434 (2009).
- 3[3] R. Bell, Y. Koren, C. Volinsky, Matrix factorization techniques for recommender systems, Computer, 42(8), 30-37 (2009).
- 4[4] O. Benchettou, A. H. Bentbib, A. Bouhamidi, K. Kreit, Constrained tensorial total variation problem based on an alternating conditional gradient algorithm, Journal of Computational and Applied Mathematics, 451, Art. 116018 (2024).
- 5[5] A. H. Bentbib, M. El Guide, K. Jbilou, E. Onunwor, L. Reichel, Solution methods for linear discrete ill-posed problems for color image restoration, BIT Numerical Mathematics 58, 555–576 (2018).
- 6[6] A. H. Bentbib, A. El Hachimi, K. Jbilou, A. Ratnani, Fast multidimensional completion and principal component analysis methods via the cosine product, Calcolo, 59(3), Art. 26 (2022).
- 7[7] A. H. Bentbib, A. El Hachimi, K. Jbilou, A. Ratnani, A tensor regularized nuclear norm method for image and video completion, Journal of Optimization Theory and Applications, 192(2), 401–425 (2022).
- 8[8] Å. Björck, Solving linear least squares problems by Gram-Schmidt orthogonalization, BIT Numerical Mathematics, 7, 1–210 (1967).
