Non-Convex Weighted Lp Minimization based Group Sparse Representation   Framework for Image Denoising

Qiong Wang; Xinggan Zhang; Yu Wu; Lan Tang; Zhiyuan Zha

arXiv:1704.01429·cs.CV·November 22, 2017

Non-Convex Weighted Lp Minimization based Group Sparse Representation Framework for Image Denoising

Qiong Wang, Xinggan Zhang, Yu Wu, Lan Tang, Zhiyuan Zha

PDF

TL;DR

This paper introduces a novel non-convex weighted Lp minimization framework for image denoising, utilizing group sparsity and adaptive patch search to outperform existing methods in accuracy and speed.

Contribution

It proposes a new non-convex weighted Lp minimization approach with a generalized soft-thresholding algorithm and adaptive patch search for improved image denoising.

Findings

01

Outperforms state-of-the-art methods like BM3D and WNNM.

02

Achieves better denoising quality with competitive speed.

03

Effectively handles practical image inverse problems.

Abstract

Nonlocal image representation or group sparsity has attracted considerable interest in various low-level vision tasks and has led to several state-of-the-art image denoising techniques, such as BM3D, LSSC. In the past, convex optimization with sparsity-promoting convex regularization was usually regarded as a standard scheme for estimating sparse signals in noise. However, using convex regularization can not still obtain the correct sparsity solution under some practical problems including image inverse problems. In this paper we propose a non-convex weighted $ℓ_{p}$ minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, the generalized soft-thresholding (GST) algorithm is adopted to solve the non-convex $ℓ_{p}$ minimization problem. In addition, to improve the accuracy of the nonlocal similar patches…

Tables6

Algorithm 1: Generalized Soft-Thresholding (GST) [25].

Input:

{\tilde{γ}}_{i, j}, {\tilde{w}}_{i, j}, p, J

.

1.

τ_{p}^{GST} ​ ({\tilde{w}}_{i, j}) = {(2 ​ {\tilde{w}}_{i, j} ​ (1 - p))}^{\frac{1}{2 - p}} + {\tilde{w}}_{i, j} ​ p ​ {(2 ​ {\tilde{w}}_{i, j} ​ (1 - p))}^{\frac{p - 1}{2 - p}}

;

2. If

| {\tilde{γ}}_{i, j} | \leq τ_{p}^{GST} ​ ({\tilde{w}}_{i, j})

3.

T_{p}^{GST} ​ ({\tilde{γ}}_{i, j}; {\tilde{w}}_{i, j}) = 0

;

4. else

5.

k = 0, {\tilde{α}}_{i, j}^{(k)} = | {\tilde{γ}}_{i, j} |

;

6. Iterate on

k = 0, 1, \dots, J

7.

{\tilde{α}}_{i, j}^{(k + 1)} = | {\tilde{γ}}_{i, j} | - {\tilde{w}}_{i, j} ​ p ​ {({\tilde{α}}_{i, j}^{(k)})}^{p - 1}

;

8.

k \leftarrow k + 1

;

9.

T_{p}^{GST} ​ ({\tilde{γ}}_{i, j}; {\tilde{w}}_{i, j}) = sgn ​ ({\tilde{γ}}_{i, j}) ​ {\tilde{α}}_{i, j}^{k}

;

10. End

Input::

T_{p}^{GST} ​ ({\tilde{γ}}_{i, j}; {\tilde{w}}_{i, j})

.

Algorithm 2: The Proposed Denoising Algorithm.

Input: Noisy image Y.

Initialization: ​ \hat{X} = Y, 𝜽, c, d, m, L, J, σ, ρ, δ, λ

;

For

t = 1, 2, \dots, K

do

Iterative regularization

Y^{t + 1} = {\hat{X}}^{t} + λ ​ (Y - {\hat{X}}^{t})

;

If

t = 1

Similar patch selection based on

𝜽

.

Else

If

SSIM ​ (Y^{t + 1}, 𝜽) - SSIM ​ (Y^{t}, 𝜽) < ρ

Similar patches index selection based on

Y^{t + 1}

.

Else

Similar patches index selection based on

𝜽

.

End if

For each patch

y_{i}

do

Find a group

Y_{i}^{t + 1}

via

k

NN.

Constructing dictionary

D_{i}^{t + 1}

by

Y_{i}

by PCA operator.

Generating the group sparse coefficient

𝜸_{i}^{t + 1}

by

D_{i}^{- 1} ​ Y_{i}

.

Update

W_{i}^{t + 1}

computing by

{\tilde{w}}_{i, j} = c * 2 ​ \sqrt{2} ​ σ^{2} / 𝝈_{i}

.

Update

𝜶_{i}^{t + 1}

computing by Algorithm 1.

Get the estimation

X_{i}^{t + 1}

=

D_{i}^{t + 1}

𝜶_{i}^{t + 1}

.

End for

Aggregate

X_{i}^{t + 1}

to form the recovered image

{\hat{X}}^{t + 1}

.

End for

Output:

{\hat{X}}^{t + 1}

.

Table 3. TABLE I: Denoising PSNR ( d B) results by different denoising methods.

	$σ = 20$						$σ = 30$
Images	BM3D	LINC	AST-NLS	MSEPLL	WNNM	Proposed	BM3D	LINC	AST-NLS	MSEPLL	WNNM	Proposed
House	33.77	33.82	33.87	33.27	34.04	34.08	32.09	32.26	32.26	31.71	32.52	32.65
lin	32.83	33.04	33.84	32.80	33.00	33.08	30.95	31.03	30.83	30.96	31.07	31.14
flower	30.01	30.30	30.28	30.10	33.34	30.48	27.97	28.13	28.20	28.05	28.26	28.36
foreman	34.54	34.76	34.55	34.09	34.72	34.86	32.75	32.93	32.79	32.34	33.00	33.31
plants	32.68	32.83	32.75	32.58	33.04	33.09	30.70	30.67	30.65	30.66	30.94	31.05
Miss	33.71	33.64	33.64	33.68	33.70	33.80	31.89	31.75	31.72	31.92	31.93	32.04
Average	32.92	33.07	32.99	32.80	33.14	33.23	31.06	31.13	31.08	30.93	31.29	31.42
	$σ = 40$						$σ = 50$
Images	BM3D	LINC	AST-NLS	MSEPLL	WNNM	Proposed	BM3D	LINC	AST-NLS	MSEPLL	WNNM	Proposed
House	30.65	31.00	30.91	30.47	31.31	31.49	29.69	29.87	30.13	29.47	30.32	30.52
lin	29.52	29.94	29.39	29.68	29.80	29.89	28.71	28.85	28.50	28.69	28.83	28.90
flower	26.48	26.79	26.75	26.64	26.85	26.90	25.49	25.47	25.77	25.56	25.80	25.88
foreman	31.29	31.31	31.29	31.05	31.54	32.08	30.36	30.33	30.46	30.04	30.75	31.03
plants	29.14	29.09	29.05	29.25	29.28	29.70	28.11	27.96	28.04	28.09	28.23	28.60
Miss	30.50	30.29	30.19	30.56	30.53	30.78	29.48	29.22	29.26	29.55	29.34	29.70
Average	29.59	29.74	29.60	29.61	29.88	30.14	28.62	28.59	28.69	28.57	28.88	29.10

Table 4. TABLE II: Average PSNR ( d B) results of ADS and No-ADS on 6 test images.

$σ$	20	30	40	50
No-APS	33.10	31.23	29.94	28.80
APS	33.23	31.42	30.14	29.10

Table 5. TABLE III: Average run time ( s ) with different methods on the 6 test images (size: 256 × 256 256 256 256\times 256 ).

Methods	LINC	AST-NLS	MSEPLL	WNNM	Ours
Average Time (s)	263	300	182	172	82

Table 6. TABLE IV: Average PSNR ( dB ) results with different methods on BSD200 dataset [ 37 ] .

$σ$	BM3D	LINC	AST-NLS	MSEPLL	WNNM	Ours
20	29.86	29.92	29.98	29.95	30.11	30.14
30	27.93	27.94	28.02	28.02	28.17	28.15
40	26.58	26.61	26.68	26.73	26.88	26.89
50	25.71	25.64	25.80	25.84	25.96	25.97

Equations17

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ X_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + λ_{i} ∣∣ α_{i} ∣ ∣_{0})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ X_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + λ_{i} ∣∣ α_{i} ∣ ∣_{0})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ Y_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + λ_{i} ∣∣ α_{i} ∣ ∣_{0})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ Y_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + λ_{i} ∣∣ α_{i} ∣ ∣_{0})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ Y_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + λ_{i} ∣∣ α_{i} ∣ ∣_{1})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ Y_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + λ_{i} ∣∣ α_{i} ∣ ∣_{1})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ Y_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + ∣∣ W_{i} α_{i} ∣ ∣_{p})

α_{i} = ar g α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ Y_{i} - D_{i} α_{i} ∣ ∣_{F}^{2} + ∣∣ W_{i} α_{i} ∣ ∣_{p})

α_{i} = α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ γ_{i} - α_{i} ∣ ∣_{F}^{2} + ∣∣ W_{i} α_{i} ∣ ∣_{p})

α_{i} = α_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ γ_{i} - α_{i} ∣ ∣_{F}^{2} + ∣∣ W_{i} α_{i} ∣ ∣_{p})

= \tilde{α}_{i} min \sum_{i = 1}^{n} (\frac{1}{2} ∣∣ \tilde{γ}_{i} - \tilde{α}_{i} ∣ ∣_{2}^{2} + ∣∣ \tilde{w}_{i} \tilde{α}_{i} ∣ ∣_{p})

τ_{p}^{GST} (\tilde{w}_{i, j}) = (2 \tilde{w}_{i, j} (1 - p))^{\frac{1}{2 - p}} + \tilde{w}_{i, j} p (2 \tilde{w}_{i, j} (1 - p))^{\frac{p - 1}{2 - p}}

τ_{p}^{GST} (\tilde{w}_{i, j}) = (2 \tilde{w}_{i, j} (1 - p))^{\frac{1}{2 - p}} + \tilde{w}_{i, j} p (2 \tilde{w}_{i, j} (1 - p))^{\frac{p - 1}{2 - p}}

T_{p}^{GST} (\tilde{γ}_{i, j}; \tilde{w}_{i, j}) - \tilde{γ}_{i, j} + \tilde{w}_{i, j} p (T_{p}^{GST} (\tilde{γ}_{i, j}; \tilde{w}_{i, j}))^{p - 1} = 0

T_{p}^{GST} (\tilde{γ}_{i, j}; \tilde{w}_{i, j}) - \tilde{γ}_{i, j} + \tilde{w}_{i, j} p (T_{p}^{GST} (\tilde{γ}_{i, j}; \tilde{w}_{i, j}))^{p - 1} = 0

φ = SSIM (θ, \hat{X}^{t + 1}) - SSIM (θ, \hat{X}^{t})

φ = SSIM (θ, \hat{X}^{t + 1}) - SSIM (θ, \hat{X}^{t})

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Full text

Non-Convex Weighted $\ell_{p}$ Minimization based Group Sparse Representation Framework for Image Denoising

Qiong Wang, Xinggan Zhang, Yu Wu, Lan Tang and Zhiyuan Zha Q. Wang, X. Zhang, Y. Wu and Z. Zha are with the department of Electronic Science and Engineering, Nanjing University, Nanjing 210023, China. E-mail: [email protected]. Tang is the department of Electronic Science and Engineering, Nanjing University, and National Mobile Commun. Research Lab., Southeast University, Nanjing 210023, China.This work was supported by the NSFC (61571220, 61462052, 61502226) and the open research fund of National Mobile Commune. Research Lab., Southeast University (No.2015D08).

Abstract

Nonlocal image representation or group sparsity has attracted considerable interest in various low-level vision tasks and has led to several state-of-the-art image denoising techniques, such as BM3D, LSSC. In the past, convex optimization with sparsity-promoting convex regularization was usually regarded as a standard scheme for estimating sparse signals in noise. However, using convex regularization cannot still obtain the correct sparsity solution under some practical problems including image inverse problems. In this paper we propose a non-convex weighted $\ell_{p}$ minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, the generalized soft-thresholding (GST) algorithm is adopted to solve the non-convex $\ell_{p}$ minimization problem. In addition, to improve the accuracy of the nonlocal similar patch selection, an adaptive patch search (APS) scheme is proposed. Experimental results demonstrate that the proposed approach not only outperforms many state-of-the-art denoising methods such as BM3D and WNNM, but also results in a competitive speed.

Index Terms:

Image denoising, group sparsity, weighted $\ell_{p}$ minimization, generalized soft-thresholding algorithm, adaptive patch search.

I Introduction

The goal of image denoising is to restore the clean image X from its noisy observation Y as accurately as possible, while preserving significant detail features such as edges and textures. The degradation model for the denoising problem can be represented as: $\textbf{\emph{Y}}=\textbf{\emph{X}}+\textbf{\emph{V}}$ , where V is usually assumed to be additive white Gaussian noise. Image denoising problem is mathematically ill-posed and image priors are exploited to adjust it such that meaningful solutions exist. Over the past few decades, numerous image denoising methods have been developed, including total variation based [1, 2], sparse representation based [3, 4], nonlocal self-similarity based [5, 6, 7, 8] and deep learning based ones [9, 10, 38], etc.

Early models mainly consider the priors on level of pixel, such as total variation (TV) regularization methods [1, 2]. These methods actually assume that natural image gradients exhibit heavy-tailed distributions, which can be fitted by Laplacian or hyper-Laplacian models [11]. Since the TV model favors the piecewise constant image structures, it often damages the image details and tends to over-smooth the images.

As an alternative, another significant property of natural images is to model the prior on patches. The most representative work is sparse representation based scheme [3, 4], which encodes an image patch as a sparse linear combination of the atoms in an over-complete redundant dictionary. The dictionary is usually learned from natural images [12]. The seminal of KSVD dictionary [4] has not only confirmed promising denoising performance, but also extended and successfully exploited it in various image processing and computer vision tasks [13, 14]. However, patch-based sparse representation model usually suffers from some limits, such as dictionary learning with great computational complexity and neglecting the relationships among similar patches [7, 15, 16].

Motivated by the observation that nonlocal similar patches in a natural image are linearly correlated with each other, this so-called nonlocal self-similarity (NSS) prior was initially employed in the work of nonlocal means denoising [5], which has become the most effective priors for the task of image restoration [17, 18]. Due to its favorable reconstruction performance, a large amount of further developments have been proposed [6, 7, 8, 15, 16, 19, 41]. For instance, a very popular scheme is BM3D [6], which groups similar patches into 3D array and disposes these arrays by sparse collaborative filtering. Marial $\emph{et al}.$ [7] proposed the learned simultaneous sparse coding (LSSC) to improve the denoising performance of K-SVD [4] via group sparse coding. Gu $\emph{et al}.$ [19, 20] proposed the weighted nuclear norm minimization (WNNM) model, which turned the image denoising into the problem of low rank matrix approximation of noisy nonlocal similar patches. Lately, deep learning based techniques for image denoising have been attracting considerable attentions due to its impressive denoising performance [9, 10, 38].

Traditional sparse representation based image denoising methods exploit the $\ell_{1}$ -norm based sparsity of an image and the resulting convex optimization problems can be efficiently solved by the class of surrogate-function based methods [21, 22]. However, using convex regularization cannot still obtain the correct sparsity solution under some practical problems including image inverse problems [39].

Inspired by the success of $\ell_{p}$ ( $0<p<1$ ) sparse optimization [23, 24, 25, 40] and our previous work [39], this paper proposes a non-convex weighted $\ell_{p}$ minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, the generalized soft-thresholding (GST) algorithm is adopted to solve the non-convex $\ell_{p}$ minimization problem. Moreover, we propose an adaptive patch search (APS) scheme to improve the accuracy of the nonlocal similar patch selection. Experimental results show that the proposed approach not only outperforms many state-of-the-art denoising methods such as BM3D and WNNM, but also results in a competitive speed.

II Group-based Sparse Representation

Recent advances have suggested that structured or group sparsity can offer powerful performance for image restoration [7, 8, 16]. Since the unit of our proposed sparse representation model is group, this section will give briefs to introduce how to construct the groups. More specifically, image X with size N is divided into n overlapped patches $\textbf{\emph{x}}_{i}$ of size $\sqrt{d}\times\sqrt{d},i=1,2,...,n$ . Then for each exemplar patch $\textbf{\emph{x}}_{i}$ , its most similar $m$ patches are selected from an $L\times L$ sized searching window to form a set ${\textbf{\emph{S}}}_{i}$ . Since then, all the patches in ${\textbf{\emph{S}}}_{i}$ are stacked into a matrix ${\textbf{\emph{X}}}_{i}\in\Re^{{d}\times{m}}$ , which contains every element of ${\textbf{\emph{S}}}_{i}$ as its column, i.e., ${\textbf{\emph{X}}}_{i}=\{{\textbf{\emph{x}}}_{i,1},{\textbf{\emph{x}}}_{i,2},...,{\textbf{\emph{x}}}_{i,m}\}$ . The matrix ${\textbf{\emph{X}}}_{i}$ consisting of all the patches with similar structures is called as a group, where ${\textbf{\emph{x}}_{i,m}}$ denotes the $m$ -th similar patch (column form) of the $i$ -th group. Finally, similar to patch-based sparse representation [3, 4], given a dictionary ${\textbf{\emph{D}}}_{i}$ , which is often learned from each group, such as DCT, PCA-based dictionary [32], each group ${\textbf{\emph{X}}}_{i}$ can be sparsely represented as $\boldsymbol{\alpha}_{i}={{\textbf{\emph{D}}}_{i}}^{-1}\textbf{\emph{X}}_{i}$ and solved by the following $\ell_{0}$ -norm minimization problem,

[TABLE]

where $||\bullet||_{F}^{2}$ denotes the Frobenious norm and $\lambda_{i}$ is the regularization parameter. $||\bullet||_{0}$ is $\ell_{0}$ -norm, counting the nonzero entries of $\boldsymbol{\alpha}_{i}$ .

In image denoising, each noise patch $\textbf{\emph{y}}_{i}$ is extracted from the noisy image Y. We search for its similar patches to generate a group ${\textbf{\emph{Y}}}_{i}$ , i.e., ${\textbf{\emph{Y}}}_{i}=\{{\textbf{\emph{y}}}_{i,1},{\textbf{\emph{y}}}_{i,2},...,{\textbf{\emph{y}}}_{i,m}\}$ . Thus, image denoising is translated into how to reconstruct ${\textbf{\emph{X}}}_{i}$ from ${\textbf{\emph{Y}}}_{i}$ by using group sparse representation,

[TABLE]

Once all group sparse codes $\{\boldsymbol{\alpha}_{i}\}$ are obtained, the latent clean image X can be reconstructed as $\hat{\textbf{\emph{X}}}={\textbf{\emph{D}}}\boldsymbol{\alpha}$ , where the group sparse code $\boldsymbol{\alpha}$ includes the set of $\{\boldsymbol{\alpha}_{i}\}$ .

However, since the $\ell_{0}$ minimization is discontinuous optimization and NP-hard, solving Eq. (2) is a difficult combinatorial optimization problem. For this reason, it has been suggested that $\ell_{0}$ minimization can be replaced by its convex $\ell_{1}$ counterpart,

[TABLE]

However, $\ell_{1}$ minimization is hard to achieve the desired sparsity solution in some practical problems, such as image denoising, image compressive sensing [26, 27], etc.

III Non-convex Weighted $\ell_{p}$ minimization based Group Sparse Representation Framework for Image Denoising

Conventional convex optimization with sparsity-promoting convex regularization is usually regarded as a standard scheme for estimating sparse signals in noise. However, using convex regularization cannot still obtain the correct sparsity solution under some practical problems including image inverse problems [39]. This section introduces a non-convex weighted $\ell_{p}$ minimization based group sparse representation framework for image denoising. To make the optimization tractable, the generalized soft-thresholding (GST) algorithm [25] is adopted to solve the non-convex $\ell_{p}$ minimization problem. To improve the accuracy of the nonlocal similar patch selection, an adaptive patch search scheme is proposed.

III-A Modeling of Non-convex Weighted $\ell_{p}$ Minimization

Inspired by the success of $\ell_{p}$ ( $0<p<1$ ) sparse optimization [23, 24, 25, 40] and our previous work [39], to obtain sparsity solution more accurately, we extend the non-convex weighted $\ell_{p}$ ( $0<p<1$ ) penalty function on group sparse coefficients of the data matrix to substitute the convex $\ell_{1}$ norm. Specifically, instead of Eq. (3), a non-convex weighted $\ell_{p}$ minimization based group sparse representation framework for image denoising is proposed by solving the following minimization,

[TABLE]

where ${{\textbf{\emph{W}}}_{i}}$ is a weight assigned to each group ${\textbf{\emph{Y}}}_{i}$ . Each weight matrix ${{\textbf{\emph{W}}}_{i}}$ will enhance the representation capability of each group sparse coefficient $\boldsymbol{\alpha}_{i}$ . In addition, one important issue of the proposed denoising approach is the selection of the dictionary. To adapt to the local image structures, instead of learning an over-complete dictionary for each group ${{\textbf{\emph{Y}}}}_{i}$ as in [7], we learn the principle component analysis (PCA) based dictionary [32] for each group ${{\textbf{\emph{Y}}}}_{i}$ . Due to orthogonality of each dictionary $\textbf{\emph{D}}_{i}$ , and thus, based on the orthogonal invariance, Eq. (4) can be rewritten as

[TABLE]

where ${\textbf{\emph{Y}}}_{i}={{\textbf{\emph{D}}}_{i}{{{\boldsymbol{\gamma}}}}_{i}}$ . ${\tilde{{{\boldsymbol{\alpha}}}}_{i}}$ , ${\tilde{{{\boldsymbol{\gamma}}}}_{i}}$ and ${\tilde{\textbf{\emph{w}}}}_{i}$ denote the vectorization of the matrix ${{{{\boldsymbol{\alpha}}}}_{i}}$ , ${{{{\boldsymbol{\gamma}}}}_{i}}$ and ${\textbf{\emph{W}}}_{i}$ , respectively.

III-B Solving the Non-convex Weighted $\ell_{p}$ Minimization by the Generalized Soft-thresholding Algorithm

To achieve the solution of Eq. (5) effectively, in this subsection, the generalized soft-thresholding (GST) algorithm [25] is used to solve Eq. (5). Specifically, given $p$ , ${\tilde{{{\boldsymbol{\gamma}}}}_{i}}$ and ${\tilde{\textbf{\emph{w}}}}_{i}$ , there exists a specific threshold,

[TABLE]

where ${\tilde{\gamma}_{i,j}}$ , ${\tilde{\alpha}_{i,j}}$ and ${\tilde{{\emph{w}}}_{i,j}}$ are the $j$ -th element of ${\tilde{{{\boldsymbol{\gamma}}}}_{i}}$ , ${\tilde{{{\boldsymbol{\alpha}}}}_{i}}$ and ${\tilde{\textbf{\emph{w}}}}_{i}$ , respectively. Here if ${\tilde{\gamma}_{i,j}}<\tau_{p}^{\emph{GST}}({\tilde{{\emph{w}}}_{i,j}})$ , ${\tilde{\alpha}_{i,j}}=0$ is the global minimum. Otherwise, the optimum will be obtained at non-zero point. According to [25], for any ${\tilde{\gamma}_{i,j}}\in(\tau_{p}^{\emph{GST}}({\tilde{{\emph{w}}}_{i,j}}),+\infty)$ , Eq. (5) has one unique minimum ${\textbf{\emph{T}}}_{p}^{\emph{GST}}({\tilde{\gamma}_{i,j}};{\tilde{{\emph{w}}}_{i,j}})$ , which can be obtained by solving the following equation,

[TABLE]

The complete description of the GST algorithm is exhibited in Algorithm 1. For more details about the GST algorithm, please refer to [25].

III-C Adaptive Patch Search

$k$ Nearest Neighbors ( $k$ NN) method [28] has been widely used to nonlocal similar patch selection. Given a noisy reference patch and a target dataset, the aim of $k$ NN is to find the $k$ most similar patches. However, since the given reference patch is noisy, $k$ NN has a drawback that some of the $k$ selected patches may not be truly similar to given reference patch. Therefore, to obtain an effective similar patches index via $k$ NN, an adaptive patch search scheme is proposed. We define the following formula,

[TABLE]

where SSIM represents structural similarity [29], $\boldsymbol{\theta}$ is pre-filtering 111This paper BM3D is chosen as a pre-filtering. denoised image and ${\hat{{\textbf{\emph{X}}}}}^{t}$ represents the $t$ -th iteration denoising result. We empirically define that if $\varphi<\rho$ , ${\hat{{\textbf{\emph{X}}}}}^{t+1}$ is regarded as target image to fetch the $k$ similar patch indexes of each group, otherwise $\boldsymbol{\theta}$ is regarded as target image. $\rho$ is a small constant.

For the weight $\textbf{\emph{W}}_{i}$ of each group sparse coefficient $\boldsymbol{\alpha}_{i}$ , large values of each $\boldsymbol{\alpha}_{i}$ usually represent major edge and texture information. Therefore, we should shrink large values less, while shrinking smaller ones more [30]. Inspired by [31], the weight $\textbf{\emph{W}}_{i}$ of each group $\textbf{\emph{Y}}_{i}$ is set as ${\tilde{\textbf{\emph{w}}}}_{i}=[{\tilde{{\emph{w}}}}_{i,1},{\tilde{{\emph{w}}}}_{i,2},...,{\tilde{{\emph{w}}}}_{i,j}]$ , where ${\tilde{{\emph{w}}}}_{i,j}=c*2\sqrt{2}\sigma^{2}/\boldsymbol{\sigma}_{i}$ , $\boldsymbol{\sigma}_{i}$ denotes the estimated variance of ${\tilde{\boldsymbol{\alpha}}_{i}}$ , and $c$ is a small constant.

In addition, we could execute the above denoising procedure for better results after several iterations. In the $t$ -th iteration, the iterative regularization strategy [33] is used to update the estimation of noise variance. Then the standard divation of noise in $t$ -th iteration is adjusted as ${(\sigma^{t})}=\delta*\sqrt{({\sigma^{2}-||{{\textbf{\emph{Y}}}}-{\hat{{\textbf{\emph{X}}}}}^{t}||_{2}^{2}})}$ , where $\delta$ is a constant. The proposed denoising procedure is summarized in Algorithm 2.

IV Experimental Results

To demonstrate the efficacy of the proposed denoising algorithm, in this section, we compare it with recently proposed state-of-the-art denoising methods, including BM3D [6], LINC [34], AST-NLS [35], MSEPLL [36] and WNNM [20]. The experimental images are shown in Fig. 1. The Matlab code can be downloaded at: https://drive.google.com/open?id=0B0wKhHwcknCjM0doVFhlRElXWjg.

The parameter setting of proposed approach is as follows: the searching window $L\times L$ for similar patches is set to be $30\times 30$ . The searching matched patches $m$ is set to be 60. The size of each patch $\sqrt{d}\times\sqrt{d}$ is set to be $6\times 6$ and $7\times 7$ for $\sigma\leq 20$ and $20<\sigma\leq 50$ , respectively. $(p,c,\lambda,\delta,\rho,J)$ are set to (1, 0.3, 0.1, 0.5, 2e-4, 2), (0.85, 0.3, 0.2, 0.8, 2e-4, 2), (0.8, 1.2, 0.1, 0.4, 6e-4, 2) and (0.75, 1.6, 0.1, 0.4, 2e-4, 2) for $\sigma\leq 20,20<\sigma\leq 30,30<\sigma\leq 40$ and $40<\sigma\leq 50$ , respectively.

We first evaluate the proposed approach and the competing algorithms on 6 test images. Table I shows the PSNR results. It can be seen that the proposed approach performs competitively compared to other methods. The proposed approach achieves 0.42dB, 0.34dB, 0.39dB, 0.51dB and 0.18dB improvement on average over the BM3D, LINC, AST-NLS, MSEPLL and WNNM, respectively. Fig. 2 shows the denoised image of plants by the competing methods. It can be seen that BM3D, LINC, AST-NLS, MSEPLL and WNNM still generate some undesirable artifacts and some details are lost. In contrast, the proposed approach not only preserves the sharp edges, but also suppresses undesirable artifacts more effectively than other competing methods.

Second, to verify the proposed adaptive patch selection (APS) scheme effectively, we compare it with No-APS scheme. The average PSNR results of APS and No-APS schemes on 6 test images are shown in Table II. One can observe that the PSNR results of APS scheme are better than No-APS. Thus, under the task of image denoising, the proposed APS scheme can enhance the accuracy of nonlocal similar patch selection.

Third, to evaluate the computational cost of the competing algorithm, we compare the running time on 6 test images with different noise levels. All experiments are conducted under the Matlab 2012b environment on a machine with Intel (R) Core (TM) i3-4150 with 3.56Hz CPU and 4GB memory. The average run time (s) of the competing methods is shown in Table III. It can be seen that the proposed approach clearly requires less computation time than other methods. Note that the run time of the proposed approach includes the pre-filtering process.

Finally, We also comprehensively evaluate the proposed method on 200 test images from the BSD dataset [37]. Table IV shows qualitative comparisons of the competing denosing methods on four noise levels ( $\sigma=20,30,40,50$ ). It can be seen that the proposed approach achieves very competitive denoising performance compared to WNNM.

V Conclusion

Different from the conventional convex optimization, this paper proposed a non-convex weighted $\ell_{p}$ minimization based group sparse representation (GSR) framework for image denoising. To make the proposed scheme tractable and robust, we adopted the generalized soft-thresholding (GST) algorithm to solve the non-convex $\ell_{p}$ minimization problem. Moreover, we proposed an adaptive patch search (APS) scheme to boost the accuracy of the nonlocal similar patch selection. Experimental results have verified that the proposed approach outperforms many state-of-the-art denoising methods such as BM3D and WNNM, and results in a competitive speed.

Bibliography41

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] Rudin L I, Osher S, Fatemi E. Nonlinear total variation based noise removal algorithms[J]. Physica D: Nonlinear Phenomena, 1992, 60(1-4): 259-268.
2[2] Chambolle A. An algorithm for total variation minimization and applications[J]. Journal of Mathematical imaging and vision, 2004, 20(1): 89-97.
3[3] Elad M, Aharon M. Image denoising via sparse and redundant representations over learned dictionaries[J]. IEEE Transactions on Image processing, 2006, 15(12): 3736-3745.
4[4] Aharon M, Elad M, Bruckstein A. k 𝑘 k -SVD: An algorithm for designing overcomplete dictionaries for sparse representation[J]. IEEE Transactions on signal processing, 2006, 54(11): 4311-4322.
5[5] Buades A, Coll B, Morel J M. A non-local algorithm for image denoising[C]//Computer Vision and Pattern Recognition, 2005. CVPR 2005. IEEE Computer Society Conference on. IEEE, 2005, 2: 60-65.
6[6] Dabov K, Foi A, Katkovnik V, et al. Image denoising by sparse 3-D transform-domain collaborative filtering[J]. IEEE Transactions on image processing, 2007, 16(8): 2080-2095.
7[7] Mairal J, Bach F, Ponce J, et al. Non-local sparse models for image restoration[C]//Computer Vision, 2009 IEEE 12th International Conference on. IEEE, 2009: 2272-2279.
8[8] Zuo C, Jovanov L, Goossens B, et al. Image Denoising Using Quadtree-Based Nonlocal Means With Locally Adaptive Principal Component Analysis[J]. IEEE Signal Processing Letters, 2016, 23(4): 434-438.

TL;DR

Contribution

Findings

Abstract

Peer Reviews

Videos

Non-Convex Weighted ℓp\ell_{p}ℓp​ Minimization based Group Sparse Representation Framework for Image Denoising

Abstract

Index Terms:

I Introduction

II Group-based Sparse Representation

III Non-convex Weighted ℓp\ell_{p}ℓp​ minimization based Group Sparse Representation Framework for Image Denoising

III-A Modeling of Non-convex Weighted ℓp\ell_{p}ℓp​ Minimization

III-B *Solving the Non-convex Weighted ℓp\ell_{p}ℓp​ Minimization by the Generalized Soft-thresholding Algorithm *

III-C Adaptive Patch Search

IV Experimental Results

V Conclusion

Non-Convex Weighted $\ell_{p}$ Minimization based Group Sparse Representation Framework for Image Denoising

III Non-convex Weighted $\ell_{p}$ minimization based Group Sparse Representation Framework for Image Denoising

III-A Modeling of Non-convex Weighted $\ell_{p}$ Minimization

III-B Solving the Non-convex Weighted $\ell_{p}$ Minimization by the Generalized Soft-thresholding Algorithm