Non-Convex Weighted Lp Minimization based Group Sparse Representation Framework for Image Denoising
Qiong Wang, Xinggan Zhang, Yu Wu, Lan Tang, Zhiyuan Zha

TL;DR
This paper introduces a novel non-convex weighted Lp minimization framework for image denoising, utilizing group sparsity and adaptive patch search to outperform existing methods in accuracy and speed.
Contribution
It proposes a new non-convex weighted Lp minimization approach with a generalized soft-thresholding algorithm and adaptive patch search for improved image denoising.
Findings
Outperforms state-of-the-art methods like BM3D and WNNM.
Achieves better denoising quality with competitive speed.
Effectively handles practical image inverse problems.
Abstract
Nonlocal image representation or group sparsity has attracted considerable interest in various low-level vision tasks and has led to several state-of-the-art image denoising techniques, such as BM3D, LSSC. In the past, convex optimization with sparsity-promoting convex regularization was usually regarded as a standard scheme for estimating sparse signals in noise. However, using convex regularization can not still obtain the correct sparsity solution under some practical problems including image inverse problems. In this paper we propose a non-convex weighted minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, the generalized soft-thresholding (GST) algorithm is adopted to solve the non-convex minimization problem. In addition, to improve the accuracy of the nonlocal similar patches…
| Algorithm 1: Generalized Soft-Thresholding (GST) [25]. |
| Input: . |
| 1. ; |
| 2. If |
| 3. ; |
| 4. else |
| 5. ; |
| 6. Iterate on |
| 7. ; |
| 8. ; |
| 9. ; |
| 10. End |
| Input:: . |
| Algorithm 2: The Proposed Denoising Algorithm. |
| Input: Noisy image Y. |
| ; |
| For do |
| Iterative regularization ; |
| If |
| Similar patch selection based on . |
| Else |
| If |
| Similar patches index selection based on . |
| Else |
| Similar patches index selection based on . |
| End if |
| End if |
| For each patch do |
| Find a group via NN. |
| Constructing dictionary by by PCA operator. |
| Generating the group sparse coefficient by . |
| Update computing by . |
| Update computing by Algorithm 1. |
| Get the estimation =. |
| End for |
| Aggregate to form the recovered image . |
| End for |
| Output: . |
| Images | BM3D | LINC | AST-NLS | MSEPLL | WNNM | Proposed | BM3D | LINC | AST-NLS | MSEPLL | WNNM | Proposed |
| House | 33.77 | 33.82 | 33.87 | 33.27 | 34.04 | 34.08 | 32.09 | 32.26 | 32.26 | 31.71 | 32.52 | 32.65 |
| lin | 32.83 | 33.04 | 33.84 | 32.80 | 33.00 | 33.08 | 30.95 | 31.03 | 30.83 | 30.96 | 31.07 | 31.14 |
| flower | 30.01 | 30.30 | 30.28 | 30.10 | 33.34 | 30.48 | 27.97 | 28.13 | 28.20 | 28.05 | 28.26 | 28.36 |
| foreman | 34.54 | 34.76 | 34.55 | 34.09 | 34.72 | 34.86 | 32.75 | 32.93 | 32.79 | 32.34 | 33.00 | 33.31 |
| plants | 32.68 | 32.83 | 32.75 | 32.58 | 33.04 | 33.09 | 30.70 | 30.67 | 30.65 | 30.66 | 30.94 | 31.05 |
| Miss | 33.71 | 33.64 | 33.64 | 33.68 | 33.70 | 33.80 | 31.89 | 31.75 | 31.72 | 31.92 | 31.93 | 32.04 |
| Average | 32.92 | 33.07 | 32.99 | 32.80 | 33.14 | 33.23 | 31.06 | 31.13 | 31.08 | 30.93 | 31.29 | 31.42 |
| Images | BM3D | LINC | AST-NLS | MSEPLL | WNNM | Proposed | BM3D | LINC | AST-NLS | MSEPLL | WNNM | Proposed |
| House | 30.65 | 31.00 | 30.91 | 30.47 | 31.31 | 31.49 | 29.69 | 29.87 | 30.13 | 29.47 | 30.32 | 30.52 |
| lin | 29.52 | 29.94 | 29.39 | 29.68 | 29.80 | 29.89 | 28.71 | 28.85 | 28.50 | 28.69 | 28.83 | 28.90 |
| flower | 26.48 | 26.79 | 26.75 | 26.64 | 26.85 | 26.90 | 25.49 | 25.47 | 25.77 | 25.56 | 25.80 | 25.88 |
| foreman | 31.29 | 31.31 | 31.29 | 31.05 | 31.54 | 32.08 | 30.36 | 30.33 | 30.46 | 30.04 | 30.75 | 31.03 |
| plants | 29.14 | 29.09 | 29.05 | 29.25 | 29.28 | 29.70 | 28.11 | 27.96 | 28.04 | 28.09 | 28.23 | 28.60 |
| Miss | 30.50 | 30.29 | 30.19 | 30.56 | 30.53 | 30.78 | 29.48 | 29.22 | 29.26 | 29.55 | 29.34 | 29.70 |
| Average | 29.59 | 29.74 | 29.60 | 29.61 | 29.88 | 30.14 | 28.62 | 28.59 | 28.69 | 28.57 | 28.88 | 29.10 |
| 20 | 30 | 40 | 50 | |
|---|---|---|---|---|
| No-APS | 33.10 | 31.23 | 29.94 | 28.80 |
| APS | 33.23 | 31.42 | 30.14 | 29.10 |
| Methods | LINC | AST-NLS | MSEPLL | WNNM | Ours |
|---|---|---|---|---|---|
| Average Time (s) | 263 | 300 | 182 | 172 | 82 |
| BM3D | LINC | AST-NLS | MSEPLL | WNNM | Ours | |
| 20 | 29.86 | 29.92 | 29.98 | 29.95 | 30.11 | 30.14 |
| 30 | 27.93 | 27.94 | 28.02 | 28.02 | 28.17 | 28.15 |
| 40 | 26.58 | 26.61 | 26.68 | 26.73 | 26.88 | 26.89 |
| 50 | 25.71 | 25.64 | 25.80 | 25.84 | 25.96 | 25.97 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Non-Convex Weighted Minimization based Group Sparse Representation Framework for Image Denoising
Qiong Wang, Xinggan Zhang, Yu Wu, Lan Tang and Zhiyuan Zha Q. Wang, X. Zhang, Y. Wu and Z. Zha are with the department of Electronic Science and Engineering, Nanjing University, Nanjing 210023, China. E-mail: [email protected]. Tang is the department of Electronic Science and Engineering, Nanjing University, and National Mobile Commun. Research Lab., Southeast University, Nanjing 210023, China.This work was supported by the NSFC (61571220, 61462052, 61502226) and the open research fund of National Mobile Commune. Research Lab., Southeast University (No.2015D08).
Abstract
Nonlocal image representation or group sparsity has attracted considerable interest in various low-level vision tasks and has led to several state-of-the-art image denoising techniques, such as BM3D, LSSC. In the past, convex optimization with sparsity-promoting convex regularization was usually regarded as a standard scheme for estimating sparse signals in noise. However, using convex regularization cannot still obtain the correct sparsity solution under some practical problems including image inverse problems. In this paper we propose a non-convex weighted minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, the generalized soft-thresholding (GST) algorithm is adopted to solve the non-convex minimization problem. In addition, to improve the accuracy of the nonlocal similar patch selection, an adaptive patch search (APS) scheme is proposed. Experimental results demonstrate that the proposed approach not only outperforms many state-of-the-art denoising methods such as BM3D and WNNM, but also results in a competitive speed.
Index Terms:
Image denoising, group sparsity, weighted minimization, generalized soft-thresholding algorithm, adaptive patch search.
I Introduction
The goal of image denoising is to restore the clean image X from its noisy observation Y as accurately as possible, while preserving significant detail features such as edges and textures. The degradation model for the denoising problem can be represented as: , where V is usually assumed to be additive white Gaussian noise. Image denoising problem is mathematically ill-posed and image priors are exploited to adjust it such that meaningful solutions exist. Over the past few decades, numerous image denoising methods have been developed, including total variation based [1, 2], sparse representation based [3, 4], nonlocal self-similarity based [5, 6, 7, 8] and deep learning based ones [9, 10, 38], etc.
Early models mainly consider the priors on level of pixel, such as total variation (TV) regularization methods [1, 2]. These methods actually assume that natural image gradients exhibit heavy-tailed distributions, which can be fitted by Laplacian or hyper-Laplacian models [11]. Since the TV model favors the piecewise constant image structures, it often damages the image details and tends to over-smooth the images.
As an alternative, another significant property of natural images is to model the prior on patches. The most representative work is sparse representation based scheme [3, 4], which encodes an image patch as a sparse linear combination of the atoms in an over-complete redundant dictionary. The dictionary is usually learned from natural images [12]. The seminal of KSVD dictionary [4] has not only confirmed promising denoising performance, but also extended and successfully exploited it in various image processing and computer vision tasks [13, 14]. However, patch-based sparse representation model usually suffers from some limits, such as dictionary learning with great computational complexity and neglecting the relationships among similar patches [7, 15, 16].
Motivated by the observation that nonlocal similar patches in a natural image are linearly correlated with each other, this so-called nonlocal self-similarity (NSS) prior was initially employed in the work of nonlocal means denoising [5], which has become the most effective priors for the task of image restoration [17, 18]. Due to its favorable reconstruction performance, a large amount of further developments have been proposed [6, 7, 8, 15, 16, 19, 41]. For instance, a very popular scheme is BM3D [6], which groups similar patches into 3D array and disposes these arrays by sparse collaborative filtering. Marial [7] proposed the learned simultaneous sparse coding (LSSC) to improve the denoising performance of K-SVD [4] via group sparse coding. Gu [19, 20] proposed the weighted nuclear norm minimization (WNNM) model, which turned the image denoising into the problem of low rank matrix approximation of noisy nonlocal similar patches. Lately, deep learning based techniques for image denoising have been attracting considerable attentions due to its impressive denoising performance [9, 10, 38].
Traditional sparse representation based image denoising methods exploit the -norm based sparsity of an image and the resulting convex optimization problems can be efficiently solved by the class of surrogate-function based methods [21, 22]. However, using convex regularization cannot still obtain the correct sparsity solution under some practical problems including image inverse problems [39].
Inspired by the success of () sparse optimization [23, 24, 25, 40] and our previous work [39], this paper proposes a non-convex weighted minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, the generalized soft-thresholding (GST) algorithm is adopted to solve the non-convex minimization problem. Moreover, we propose an adaptive patch search (APS) scheme to improve the accuracy of the nonlocal similar patch selection. Experimental results show that the proposed approach not only outperforms many state-of-the-art denoising methods such as BM3D and WNNM, but also results in a competitive speed.
II Group-based Sparse Representation
Recent advances have suggested that structured or group sparsity can offer powerful performance for image restoration [7, 8, 16]. Since the unit of our proposed sparse representation model is group, this section will give briefs to introduce how to construct the groups. More specifically, image X with size N is divided into n overlapped patches of size . Then for each exemplar patch , its most similar patches are selected from an sized searching window to form a set . Since then, all the patches in are stacked into a matrix , which contains every element of as its column, i.e., . The matrix consisting of all the patches with similar structures is called as a group, where denotes the -th similar patch (column form) of the -th group. Finally, similar to patch-based sparse representation [3, 4], given a dictionary , which is often learned from each group, such as DCT, PCA-based dictionary [32], each group can be sparsely represented as and solved by the following -norm minimization problem,
[TABLE]
where denotes the Frobenious norm and is the regularization parameter. is -norm, counting the nonzero entries of .
In image denoising, each noise patch is extracted from the noisy image Y. We search for its similar patches to generate a group , i.e., . Thus, image denoising is translated into how to reconstruct from by using group sparse representation,
[TABLE]
Once all group sparse codes are obtained, the latent clean image X can be reconstructed as , where the group sparse code includes the set of .
However, since the minimization is discontinuous optimization and NP-hard, solving Eq. (2) is a difficult combinatorial optimization problem. For this reason, it has been suggested that minimization can be replaced by its convex counterpart,
[TABLE]
However, minimization is hard to achieve the desired sparsity solution in some practical problems, such as image denoising, image compressive sensing [26, 27], etc.
III Non-convex Weighted minimization based Group Sparse Representation Framework for Image Denoising
Conventional convex optimization with sparsity-promoting convex regularization is usually regarded as a standard scheme for estimating sparse signals in noise. However, using convex regularization cannot still obtain the correct sparsity solution under some practical problems including image inverse problems [39]. This section introduces a non-convex weighted minimization based group sparse representation framework for image denoising. To make the optimization tractable, the generalized soft-thresholding (GST) algorithm [25] is adopted to solve the non-convex minimization problem. To improve the accuracy of the nonlocal similar patch selection, an adaptive patch search scheme is proposed.
III-A Modeling of Non-convex Weighted Minimization
Inspired by the success of () sparse optimization [23, 24, 25, 40] and our previous work [39], to obtain sparsity solution more accurately, we extend the non-convex weighted () penalty function on group sparse coefficients of the data matrix to substitute the convex norm. Specifically, instead of Eq. (3), a non-convex weighted minimization based group sparse representation framework for image denoising is proposed by solving the following minimization,
[TABLE]
where is a weight assigned to each group . Each weight matrix will enhance the representation capability of each group sparse coefficient . In addition, one important issue of the proposed denoising approach is the selection of the dictionary. To adapt to the local image structures, instead of learning an over-complete dictionary for each group as in [7], we learn the principle component analysis (PCA) based dictionary [32] for each group . Due to orthogonality of each dictionary , and thus, based on the orthogonal invariance, Eq. (4) can be rewritten as
[TABLE]
where . , and denote the vectorization of the matrix , and , respectively.
III-B *Solving the Non-convex Weighted Minimization by the Generalized Soft-thresholding Algorithm *
To achieve the solution of Eq. (5) effectively, in this subsection, the generalized soft-thresholding (GST) algorithm [25] is used to solve Eq. (5). Specifically, given , and , there exists a specific threshold,
[TABLE]
where , and are the -th element of , and , respectively. Here if , is the global minimum. Otherwise, the optimum will be obtained at non-zero point. According to [25], for any , Eq. (5) has one unique minimum , which can be obtained by solving the following equation,
[TABLE]
The complete description of the GST algorithm is exhibited in Algorithm 1. For more details about the GST algorithm, please refer to [25].
III-C Adaptive Patch Search
Nearest Neighbors (NN) method [28] has been widely used to nonlocal similar patch selection. Given a noisy reference patch and a target dataset, the aim of NN is to find the most similar patches. However, since the given reference patch is noisy, NN has a drawback that some of the selected patches may not be truly similar to given reference patch. Therefore, to obtain an effective similar patches index via NN, an adaptive patch search scheme is proposed. We define the following formula,
[TABLE]
where SSIM represents structural similarity [29], is pre-filtering 111This paper BM3D is chosen as a pre-filtering. denoised image and represents the -th iteration denoising result. We empirically define that if , is regarded as target image to fetch the similar patch indexes of each group, otherwise is regarded as target image. is a small constant.
For the weight of each group sparse coefficient , large values of each usually represent major edge and texture information. Therefore, we should shrink large values less, while shrinking smaller ones more [30]. Inspired by [31], the weight of each group is set as , where , denotes the estimated variance of , and is a small constant.
In addition, we could execute the above denoising procedure for better results after several iterations. In the -th iteration, the iterative regularization strategy [33] is used to update the estimation of noise variance. Then the standard divation of noise in -th iteration is adjusted as , where is a constant. The proposed denoising procedure is summarized in Algorithm 2.
IV Experimental Results
To demonstrate the efficacy of the proposed denoising algorithm, in this section, we compare it with recently proposed state-of-the-art denoising methods, including BM3D [6], LINC [34], AST-NLS [35], MSEPLL [36] and WNNM [20]. The experimental images are shown in Fig. 1. The Matlab code can be downloaded at: https://drive.google.com/open?id=0B0wKhHwcknCjM0doVFhlRElXWjg.
The parameter setting of proposed approach is as follows: the searching window for similar patches is set to be . The searching matched patches is set to be 60. The size of each patch is set to be and for and , respectively. are set to (1, 0.3, 0.1, 0.5, 2e-4, 2), (0.85, 0.3, 0.2, 0.8, 2e-4, 2), (0.8, 1.2, 0.1, 0.4, 6e-4, 2) and (0.75, 1.6, 0.1, 0.4, 2e-4, 2) for and , respectively.
We first evaluate the proposed approach and the competing algorithms on 6 test images. Table I shows the PSNR results. It can be seen that the proposed approach performs competitively compared to other methods. The proposed approach achieves 0.42dB, 0.34dB, 0.39dB, 0.51dB and 0.18dB improvement on average over the BM3D, LINC, AST-NLS, MSEPLL and WNNM, respectively. Fig. 2 shows the denoised image of plants by the competing methods. It can be seen that BM3D, LINC, AST-NLS, MSEPLL and WNNM still generate some undesirable artifacts and some details are lost. In contrast, the proposed approach not only preserves the sharp edges, but also suppresses undesirable artifacts more effectively than other competing methods.
Second, to verify the proposed adaptive patch selection (APS) scheme effectively, we compare it with No-APS scheme. The average PSNR results of APS and No-APS schemes on 6 test images are shown in Table II. One can observe that the PSNR results of APS scheme are better than No-APS. Thus, under the task of image denoising, the proposed APS scheme can enhance the accuracy of nonlocal similar patch selection.
Third, to evaluate the computational cost of the competing algorithm, we compare the running time on 6 test images with different noise levels. All experiments are conducted under the Matlab 2012b environment on a machine with Intel (R) Core (TM) i3-4150 with 3.56Hz CPU and 4GB memory. The average run time (s) of the competing methods is shown in Table III. It can be seen that the proposed approach clearly requires less computation time than other methods. Note that the run time of the proposed approach includes the pre-filtering process.
Finally, We also comprehensively evaluate the proposed method on 200 test images from the BSD dataset [37]. Table IV shows qualitative comparisons of the competing denosing methods on four noise levels (). It can be seen that the proposed approach achieves very competitive denoising performance compared to WNNM.
V Conclusion
Different from the conventional convex optimization, this paper proposed a non-convex weighted minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, we adopted the generalized soft-thresholding (GST) algorithm to solve the non-convex minimization problem. Moreover, we proposed an adaptive patch search (APS) scheme to boost the accuracy of the nonlocal similar patch selection. Experimental results have verified that the proposed approach outperforms many state-of-the-art denoising methods such as BM3D and WNNM, and results in a competitive speed.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] Rudin L I, Osher S, Fatemi E. Nonlinear total variation based noise removal algorithms[J]. Physica D: Nonlinear Phenomena, 1992, 60(1-4): 259-268.
- 2[2] Chambolle A. An algorithm for total variation minimization and applications[J]. Journal of Mathematical imaging and vision, 2004, 20(1): 89-97.
- 3[3] Elad M, Aharon M. Image denoising via sparse and redundant representations over learned dictionaries[J]. IEEE Transactions on Image processing, 2006, 15(12): 3736-3745.
- 4[4] Aharon M, Elad M, Bruckstein A. k 𝑘 k -SVD: An algorithm for designing overcomplete dictionaries for sparse representation[J]. IEEE Transactions on signal processing, 2006, 54(11): 4311-4322.
- 5[5] Buades A, Coll B, Morel J M. A non-local algorithm for image denoising[C]//Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. IEEE, 2005, 2: 60-65.
- 6[6] Dabov K, Foi A, Katkovnik V, et al. Image denoising by sparse 3-D transform-domain collaborative filtering[J]. IEEE Transactions on image processing, 2007, 16(8): 2080-2095.
- 7[7] Mairal J, Bach F, Ponce J, et al. Non-local sparse models for image restoration[C]//Computer Vision, 2009 IEEE 12th International Conference on. IEEE, 2009: 2272-2279.
- 8[8] Zuo C, Jovanov L, Goossens B, et al. Image Denoising Using Quadtree-Based Nonlocal Means With Locally Adaptive Principal Component Analysis[J]. IEEE Signal Processing Letters, 2016, 23(4): 434-438.
