Super-Resolution of Brain MRI Images using Overcomplete Dictionaries and   Nonlocal Similarity

Yinghua Li; Bin Song; Jie Guo; Xiaojiang Du; Mohsen Guizani

arXiv:1902.04902·cs.CV·February 14, 2019

Super-Resolution of Brain MRI Images using Overcomplete Dictionaries and Nonlocal Similarity

Yinghua Li, Bin Song, Jie Guo, Xiaojiang Du, Mohsen Guizani

PDF

Open Access

TL;DR

This paper introduces a super-resolution method for brain MRI images that leverages overcomplete dictionaries, nonlocal similarity, and compressive sensing to enhance resolution more accurately than traditional interpolation techniques.

Contribution

The paper presents a novel super-resolution approach combining dictionary classification, nonlocal similarity, and compressive sensing for improved MRI image quality.

Findings

01

Outperforms existing super-resolution methods visually and quantitatively.

02

Effectively classifies image blocks into smooth, texture, and edge categories.

03

Utilizes joint reconstruction with sparsity and similarity constraints.

Abstract

Recently, the Magnetic Resonance Imaging (MRI) images have limited and unsatisfactory resolutions due to various constraints such as physical, technological and economic considerations. Super-resolution techniques can obtain high-resolution MRI images. The traditional methods obtained the resolution enhancement of brain MRI by interpolations, affecting the accuracy of the following diagnose process. The requirement for brain image quality is fast increasing. In this paper, we propose an image super-resolution (SR) method based on overcomplete dictionaries and inherent similarity of an image to recover the high-resolution (HR) image from a single low-resolution (LR) image. We explore the nonlocal similarity of the image to tentatively search for similar blocks in the whole image and present a joint reconstruction method based on compressive sensing (CS) and similarity constraints. The…

Tables3

Table 1. TABLE I: PSNR and SSIM of the recovered HR images with upscaling factor 2.

		Bicubic	SROD	BSRCNN	Proposed
Brainweb	PSNR	21.51	28.56	32.99	35.36
Brainweb	SSIM	0.827	0.912	0.923	0.961
MRT	PSNR	29.18	30.12	34.21	35.1
MRT	SSIM	0.872	0.91	0.924	0.941
MIDAS	PSNR	27.69	30.23	31.1	33.21
MIDAS	SSIM	0.801	0.909	0.914	0.956

Table 2. TABLE II: PSNR and SSIM of the recovered HR images with upscaling factor 3.

		Bicubic	SROD	BSRCNN	Proposed
Brainweb	PSNR	18.12	22.12	24.21	25.47
Brainweb	SSIM	0.712	0.745	0.785	0.842
MRT	PSNR	21.34	24.61	27.49	29.31
MRT	SSIM	0.671	0.841	0.891	0.905
MIDAS	PSNR	21.41	23.48	26.75	29.12
MIDAS	SSIM	0.715	0.756	0.814	0.816

Table 3. TABLE III: PSNR and SSIM of the recovered HR images with upscaling factor 4.

		Bicubic	SROD	BSRCNN	Proposed
Brainweb	PSNR	16.39	20.41	21.31	23.15
Brainweb	SSIM	0.541	0.645	0.778	0.801
MRT	PSNR	20.12	23.41	26.12	27.1
MRT	SSIM	0.563	0.674	0.741	0.756
MIDAS	PSNR	19.54	21.41	24.56	26.45
MIDAS	SSIM	0.689	0.701	0.731	0.751

Equations44

y = Φ x .

y = Φ x .

∥ x ∥_{0} := ∣ {ℓ : x_{ℓ} \neq = 0} ∣ = # {ℓ : x_{ℓ} \neq = 0} \leq s .

∥ x ∥_{0} := ∣ {ℓ : x_{ℓ} \neq = 0} ∣ = # {ℓ : x_{ℓ} \neq = 0} \leq s .

∥ x ∥_{p} = (i = 1 \sum N ∣ x_{i} ∣^{p})^{1/ p}, 1 \leq p < \infty.

∥ x ∥_{p} = (i = 1 \sum N ∣ x_{i} ∣^{p})^{1/ p}, 1 \leq p < \infty.

y = Φx .

y = Φx .

min ∥ z ∥_{0} subject to Φz = y,

min ∥ z ∥_{0} subject to Φz = y,

min ∥ z ∥_{1} subject to Φz = y,

min ∥ z ∥_{1} subject to Φz = y,

∥ z ∥_{1} = ∣ z_{1} ∣ + ∣ z_{2} ∣ + \dots + ∣ z_{N} ∣ for z = (z_{1}, z_{2}, ..., z_{N}) \in C^{N} .

∥ z ∥_{1} = ∣ z_{1} ∣ + ∣ z_{2} ∣ + \dots + ∣ z_{N} ∣ for z = (z_{1}, z_{2}, ..., z_{N}) \in C^{N} .

(1 - δ_{s}) ∥ x ∥_{2}^{2} \leq ∥ Φx ∥_{2}^{2} \leq (1 + δ_{s}) ∥ x ∥_{2}^{2} for all x \in C^{N} with ∥ x ∥_{0} \leq s .

(1 - δ_{s}) ∥ x ∥_{2}^{2} \leq ∥ Φx ∥_{2}^{2} \leq (1 + δ_{s}) ∥ x ∥_{2}^{2} for all x \in C^{N} with ∥ x ∥_{0} \leq s .

δ_{κ s} < δ^{⋆}

δ_{κ s} < δ^{⋆}

y = Φx + e, ∥ e ∥_{2} \leq α,

y = Φx + e, ∥ e ∥_{2} \leq α,

∥ x - \tilde{x} ∥_{2} \leq C_{1} \frac{1}{s} σ_{s} (x)_{1} + C_{2} α,

∥ x - \tilde{x} ∥_{2} \leq C_{1} \frac{1}{s} σ_{s} (x)_{1} + C_{2} α,

σ_{s} (x)_{1} = ∥ z ∥_{0} \leq s in f ∥ x - z ∥_{1}

σ_{s} (x)_{1} = ∥ z ∥_{0} \leq s in f ∥ x - z ∥_{1}

\begin{array}[]{l}\alpha{\rm{=argmin}}{\left\|\alpha\right\|_{0}}\\ s.t.{Y_{k}}={M_{k}}X={M_{k}}D\alpha\end{array}

\begin{array}[]{l}\alpha{\rm{=argmin}}{\left\|\alpha\right\|_{0}}\\ s.t.{Y_{k}}={M_{k}}X={M_{k}}D\alpha\end{array}

C_{y} \approx \frac{n}{m} C_{q},

C_{y} \approx \frac{n}{m} C_{q},

N L (I) (i_{0}, j_{0}) = (i, j) \in I \sum w (i, j) I (i, j)

N L (I) (i_{0}, j_{0}) = (i, j) \in I \sum w (i, j) I (i, j)

w (i, j) = \frac{1}{Z ( i , j )} exp (- ∥ z (N_{i_{0}, j_{0}}) - z (N_{i, j}) ∥_{2}^{2} / h^{2})

w (i, j) = \frac{1}{Z ( i , j )} exp (- ∥ z (N_{i_{0}, j_{0}}) - z (N_{i, j}) ∥_{2}^{2} / h^{2})

i = 1, 2, .. n α, α^{i} min ∥ F y - F D_{l} α ∥_{2}^{2} + y^{i} \in S \sum F y^{i} - F D_{l} α^{i}_{2}^{2} + λ (∥ α ∥_{1} + i = 1 \sum n α^{i}_{1}) + i = 1 \sum n γ_{i} D_{h} α - D_{h} α^{i}_{2}^{2}

i = 1, 2, .. n α, α^{i} min ∥ F y - F D_{l} α ∥_{2}^{2} + y^{i} \in S \sum F y^{i} - F D_{l} α^{i}_{2}^{2} + λ (∥ α ∥_{1} + i = 1 \sum n α^{i}_{1}) + i = 1 \sum n γ_{i} D_{h} α - D_{h} α^{i}_{2}^{2}

γ_{i} = \frac{1}{Z} exp {- \frac{y - y ^{i} _{2}^{2}}{h ^{2}}}

γ_{i} = \frac{1}{Z} exp {- \frac{y - y ^{i} _{2}^{2}}{h ^{2}}}

x = D_{h} α

x = D_{h} α

M S E = \frac{\sum _{i = 1}^{M} \sum _{j = 1}^{N} ( X _{ij} - Y _{ij} ) ^{2}}{M \times N}

M S E = \frac{\sum _{i = 1}^{M} \sum _{j = 1}^{N} ( X _{ij} - Y _{ij} ) ^{2}}{M \times N}

P S N R = 10 lo g_{10} \frac{255 \times 255}{M S E}

P S N R = 10 lo g_{10} \frac{255 \times 255}{M S E}

S S I M (X, Y) = \frac{( 2 μ _{X} μ _{Y} + C _{1} ) ( 2 δ _{X Y} + C _{2} )}{( μ _{X}^{2} + μ _{Y}^{2} + C _{1} ) ( δ _{X}^{2} + δ _{Y}^{2} + C _{2} )}

S S I M (X, Y) = \frac{( 2 μ _{X} μ _{Y} + C _{1} ) ( 2 δ _{X Y} + C _{2} )}{( μ _{X}^{2} + μ _{Y}^{2} + C _{1} ) ( δ _{X}^{2} + δ _{Y}^{2} + C _{2} )}

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsAdvanced Image Processing Techniques · Sparse and Compressive Sensing Techniques · Image Processing Techniques and Applications

Full text

Super-Resolution of Brain MRI Images using Overcomplete Dictionaries and Nonlocal Similarity

Yinghua Li, Bin Song∗, Jie Guo, Xiaojiang Du, Mohsen Guizani This work was supported by the National Natural Science Foundation of China under Grant (Nos. 61772387 and 61802296), the Fundamental Research Funds for the Central Universities (JB180101), China Postdoctoral Science Foundation Grant (No. 2017M620438), Fundamental Research Funds of Ministry of Education and China Mobile (MCM20170202), and also supported by the ISN State Key Laboratory.Y. Li, B. Song and J. Guo are with the State Key Laboratory of Integrated Services Networks, Xidian University, 710071, China. Bin Song is the corresponding author. Emails: [email protected](Y. Li), [email protected](B. Song), [email protected](J. Guo). X. Du is with Dept. of Computer and Information Sciences, Temple University, Philadelphia PA, 19122, USA (email: [email protected]) M. Guizani is with Dept. of College of Engineering, Qatar University, Qatar (email: [email protected])

Abstract

Recently, the Magnetic Resonance Imaging (MRI) images have limited and unsatisfactory resolutions due to various constraints such as physical, technological and economic considerations. Super-resolution techniques can obtain high-resolution MRI images. The traditional methods obtained the resolution enhancement of brain MRI by interpolations, affecting the accuracy of the following diagnose process. The requirement for brain image quality is fast increasing. In this paper, we propose an image super-resolution (SR) method based on overcomplete dictionaries and inherent similarity of an image to recover the high-resolution (HR) image from a single low-resolution (LR) image. We use the linear relationship among images in the measurement domain and frequency domain to classify image blocks into smooth, texture and edge feature blocks in the measurement domain. The dictionaries for different blocks are trained using different categories. Consequently, an LR image block of interest may be reconstructed using the most appropriate dictionary. Additionally, we explore the nonlocal similarity of the image to tentatively search for similar blocks in the whole image and present a joint reconstruction method based on compressive sensing (CS) and similarity constraints. The sparsity and self-similarity of the image blocks are taken as the constraints. The proposed method is summarized in the following steps. First, a dictionary classification method based on the measurement domain is presented. The image blocks are classified into smooth, texture and edge parts by analyzing their features in the measurement domain. Then, the corresponding dictionaries are trained using the classified image blocks. Equally important, in the reconstruction part, we use the CS reconstruction method to recover the HR brain MRI image, considering both nonlocal similarity and the sparsity of an image as the constraints. This method performs better both visually and quantitatively than some existing methods.

Index Terms:

brain MRI, super-resolution, dictionary, sparse representation, compressed sensing, self-similarity.

I Introduction

Over the past decade, the brain Magnetic Resonance Imaging (MRI) has become one of the most important methods to diagnose the ailing brains. High-resolution (HR) images with sufficient details have found significant applications in medical imaging. Therefore, the requirement for image quality is fast increasing. However, due to the limitations of the physical resolution of the terminal devices or the bandwidth in the transmission process, it is difficult to obtain the high-resolution brain MR images that satisfy the basic requirement for applications. Attempts to resolve this dilemma have resulted in the development of an emerging research topic in image signal processing, known as super-resolution (SR) image reconstruction, which has been extensively studied in recent years. SR is an inverse problem that tackles the recovery of a high-resolution image from a single image or multiple low-resolution images of the same scene based on either specific a priori knowledge or reasonable assumptions about the imaging model that degrades the high-resolution image to the low-resolution ones.

SR image recovery is a terrible ill-posed problem because there are no sufficient low-resolution images, the blurring operators are unknown, and the solution from the recovery constraint is not unique. Many regularization methods have been presented to further improve the inversion of this underdetermined problem, such as [1, 2, 3]. However, these reconstruction-based SR algorithms often lead to poor robustness and unsatisfied performance when the magnification factor is large. Thus, the reconstructed images may be overly smooth and absent of critical high-frequency details [4]. The interpolation-based SR approach is another type of SR method. Takeda et al. presented an interpolation algorithm based on the controllable kernel regression, which constructs the direction-controllable interpolation kernel function through a covariance matrix [5]. Li et al. applied different interpolation strategies for image blocks with various features. That is, in the bilinear interpolation for smooth regions and particular edge regions, the local covariance is used to adjust the interpolation coefficients [6]. Recently, some structural adaptive interpolation methods have achieved good results. Yeon et al. proposed an edge-oriented local RBF interpolation algorithm [7]. Romano et al. combined the interpolation with the nonlocal self-similarity and sparse representation of images and explored a new adaptive interpolation method [8]. However, high-resolution images recovered by these interpolation-based methods are prone to be overly smooth and have ringing and jagged artifacts.

Another category of SR methods is based on machine learning techniques, which seek to obtain the co-occurrence prior between low-resolution (LR) and high-resolution (HR) image patches. Freeman et al. first put forward using learning techniques to improve the image resolution. The authors used the Markov random field (MRF) to establish the corresponding relationship between the HR image block and the LR image block. The initial value of the HR image was obtained by interpolation. The lost high-frequency details of the HR image were recovered by learning and added with the initial value; then, the HR image was obtained [9]. Sun et al. further improved this approach by applying the primal sketch priors to improve blurred edges, ridges, and corners. The SR methods using the convolutional neural network are presented in [10, 11], which performed single- and multi-contrast super-resolution reconstructions simultaneously. Unfortunately, the aforementioned approaches generally require databases, which contain millions of HR and LR patch pairs and are therefore computationally intensive. In addition, there exist untrue high-frequency details, which are recovered by learning from external training databases.

The emergence of compressed sensing (CS) offers a new different perspective to address large underdetermined problems. CS can reconstruct sparse or compressible signals using fewer measurements than conventional methods without prior knowledge about the support of the signals. CS claims the inaccuracy of the conventional wisdom that the acquisition and reconstruction must follow Nyquist sampling theory [13, 14, 15, 16, 17, 18]. This favorable and promising tool has proven to be applicable for various fields, including machine learning [19, 20], wireless communication [21, 22], and medical imaging [23, 24]. Fortunately, due to its favorable property, CS can be applied to solve the SR problem. The application of CS and sparse representation in the field of SR recovery has captured the interest and attention of an enormous number of researchers in the past decade. The pioneer works can be traced to [25, 26, 27, 28, 29, 30]. Sen et al. proposed a new algorithm to generate a super-resolution image from a single, low-resolution input without using a training data set [25]. The CS theory was used to recover the HR image in magnetic resonance imaging [27]. Then, these methods were extended in [28, 29, 30]. The authors presented new approaches to the single-image SR problem based on the sparse representation. In [31], Rueda et al. proposed a sparse-based super-resolution method coupling up high and low frequency information to reconstruct a high-resolution brain MR image. Several papers (e.g., [32, 33, 34, 35, 36]) have studied related sensing issues. However, these previous work failed to consider the combination of the sparse representation and nonlocal self-similarity. Although much effort has been spent on improving the performance of SR recovery, an efficient and effective method has not been developed.

The purpose of this paper is to apply CS, the sparse representation and inherent similarity of an image to recover an HR image from a single LR image. It is of great interest and significance to address the questions in CS for ill-posed problems such as SR. In this paper, we have extended the previous work by paying attention to the nonlocal self-similarity of an LR image. We propose an image SR algorithm based on compressed sensing and self-similarity constraint. Because the difference of image blocks is not considered when training dictionaries, a dictionary classification method based on the measurement domain is proposed in the dictionary training part. Specifically, we use the linear relationship between images in the measurement domain and frequency domain to classify the image blocks into smooth, texture and edge feature blocks in the measurement domain. The dictionaries for different blocks are trained by using different categories. Consequently, an LR image block of interest may be reconstructed using the most appropriate dictionary. If one merely learns the prior knowledge from the external image database, it tends to generate false details of the reconstructed HR image.

In our proposed method, we use the nonlocal similarity of the image to tentatively search for similar blocks in the whole image and present a joint reconstruction method based on CS and similarity constraints. The sparsity and self-similarity of the image blocks are used as the constraints. The proposed method is summarized in the following steps. First, a dictionary classification method based on the measurement domain is presented. The image blocks are classified into smooth, texture and edge parts by analyzing their features in the measurement domain. Then, the corresponding dictionaries are trained using the classified image blocks. Equally important, in the reconstruction part, we use the CS reconstruction method to recover the HR image considering both the nonlocal similarity and sparsity of an image as constraints. This approach results in visually and quantitatively better performance than some existing methods.

The remainder of this paper is organized as follows. In Sec.II, we briefly introduce the correlative theoretical basis, including CS, followed by the discussion of image SR using CS. Then, the proposed SR method based on CS and self-similarity is described in detail in Sec. III. The explanation, illustration, and analysis of the experimental results are demonstrated in Sec. IV. Finally, the summary of this paper is presented in Sec. V.

II Image Super-Resolution Using CS

II-A Compressed Sensing

For completeness, we briefly introduce the fundamental background of CS. CS can reconstruct sparse or compressible signals using fewer measurements than the traditional approach uses. The advent of CS has tremendously affected signal acquisition and signal recovery [13, 14, 15] because the compressibility or sparsity is of great significance. Suppose that $x$ is a discrete signal with size $n$ ; if it has no more than $r$ nonzero values, then $x$ is called “ $r$ -sparse”. A signal may have no sparsity in some domains. Fortunately, we can always find a certain domain where signal $x$ can be considered sparse with an appropriate basis.

Considering the natural images, it is beneficial that there are sufficient bases and dictionaries so that the natural images in these bases become sparse or approximately sparse. A signal is considered “approximately sparse” if its amplitude exponentially decays. A signal is referred to as “compressible” if it has an approximately sparse representation on a certain basis. Concerning a sparse signal, there is much less valuable “information” than unimportant data. CS can reconstruct sparse or compressible signals with much fewer samples than traditional methods.

Let $x$ ( $x\in R^{N}$ ) be a discrete signal; $\theta$ represents its coefficients in a certain orthonormal basis

[TABLE]

Then, $x$ is $K$ -sparse if only $K$ coefficients are nonzero. The procedure can be formulated as follows.

[TABLE]

where ${\left\|{\mathbf{x}}\right\|_{0}}$ represents the $\ell_{0}$ -norm of $\bf x$ , which denotes the number of nonzero elements of $\bf x$ . The $\ell_{p}$ -norm is defined as

[TABLE]

We call a matrix ${\mathbf{\Phi}}\in{\mathbb{C}^{n\times N}}$ the measurement matrix; then, the recovery process is to reconstruct ${\bf x}\in{\mathbb{C}}^{N}$ from the measurements

[TABLE]

If $n\ll N$ , this problem is underdetermined and has no solution. Fortunately, CS theory finds that the solution can be obtained with extra information that ${\bf x}$ is $s$ -sparse.

The original recovery method adopts $\ell_{0}$ -minimization:

[TABLE]

but this is an NP-hard problem. Then, tractable substitutions are used, e.g., $\ell_{1}$ -minimization:

[TABLE]

where

[TABLE]

Assuring the recovering ability of $x$ in Eq.(LABEL:eqn:01) via $\ell_{1}$ -minimization and greedy algorithms is a sufficient condition to establish the RIP (restricted isometry property) of measurement matrix ${\mathbf{\Phi}}$ : Given ${\mathbf{\Phi}}\in{\mathbb{C}^{n\times N}}$ and $s<N$ , the RIC (restricted isometry constant) $\delta_{s}$ is defined as the smallest positive number such that

[TABLE]

Eq. (3) demands that at most $s$ columns of ${\mathbf{\Phi}}$ are well-conditioned. ${\mathbf{\Phi}}$ is said to satisfy the RIP with order $s$ when $\delta_{s}$ is small.

Many recovery methods are effective if the measurement matrix $\bf\Phi$ satisfies the RIP. More accurately, if the measurement matrix $\bf\Phi$ follows Eq. (3) with

[TABLE]

for appropriate constants $\kappa\geq 1$ and $\delta^{\star}$ , then several algorithms can precisely reconstruct any $s$ -sparse signals $\bf x$ from ${\mathbf{y}}={\mathbf{\Phi x}}$ . Furthermore, if $\bf x$ can be approximated by an $s$ sparse vector, then for noisy observations,

[TABLE]

these algorithms can acquire the recovery ${{\mathbf{\tilde{x}}}}$ that satisfy an error bound as

[TABLE]

where

[TABLE]

represents the error of the best $s$ -term approximation in $\ell_{1}$ , and $C_{1},C_{2}>0$ are constants.

II-B Super-Resolution based on Compressed Sensing

The CS theory aims at solving the underdetermined problems and reconstructing a high-dimensional signal from fewer measurements than the traditional approach. For the SR problem, its goal is to recover a high-resolution image from a low-resolution one in the same scene. These two problems share a high similarity, so CS theory may be applied to solve the SR reconstruction problem. An SR problem may be viewed as the recovery process in the CS frame, where $Y$ can be considered the low-resolution image acquired as a measurement of the original high-resolution image $X$ . Generally, matrix $M$ , which degrades the HR image to an LR image in the SR problem, is considered the projection matrix in CS theory. The sparse basis is taken from the overcomplete dictionary $D$ . In this work we consider only the case of a single image. Then, the process of solving the SR problem using CS theory is as follows:

[TABLE]

However, many factors must be considered, including the estimation of the degradation matrix, the method of training overcomplete dictionary $D$ , and the specific reconstruction algorithm. The essence of applying the CS theory to SR is to make full use of the sparsity and fully excavate the intrinsic structural features of an image. SR based on CS theory has also made significant progress in recent years. The feasibility of applying CS theory to single-image SR has been proven in [17]. The mapping relationship between the HR dictionary and the LR dictionary has been established in [26].

This paper mainly studies how to reconstruct the HR image by using the sparsity of an image and the nonlocal similarity information inside the image. The principle of the SR algorithm based on the sparse representation is to regularize the image sparsity as a priori information. LR images are degraded, while the degradation model of HR to LR images is uncertain. The algorithm assumes that HR and LR images have similar geometric structures. Their sparse representations are approximate under a certain transform basis or redundant dictionary. We ensure the corresponding relationship between LR dictionary $D_{l}$ and HR dictionary $D_{h}$ atoms while training the dictionaries. Then, the relationship obtained by learning is applied to the current input image so that an HR image is reconstructed. This algorithm mainly includes the dictionary training process and reconstruction process, which are introduced in detail in Sec. III.

III Proposed Method

This paper presents an image SR method based on the CS and nonlocal similarity. Because the difference of image blocks is not considered when training dictionaries, a dictionary classification method based on the measurement domain is proposed in the dictionary training part. Specifically, we use the linear relationship between images in the measurement domain and frequency domain to classify image blocks into smooth, texture and edge feature blocks in the measurement domain. The dictionaries for different blocks are trained using different categories. Consequently, an LR image block of interest may be reconstructed using the most appropriate dictionary. If one merely learns the prior knowledge from the external image database, it tends to generate untrue details of the reconstructed HR image. In our proposed method, we use the nonlocal similarity of the image itself to tentatively search for similar blocks in the whole image and present a joint reconstruction method based on CS and similarity constraints. The sparsity and self-similarity of the image blocks are taken as the constraints.

III-A Classified Dictionary Training

The existing SR methods based on the sparse representation failed to consider the differences among sample blocks in the training dictionary. Remarkable differences between the input LR image and the sample database may lead to the poor quality of the reconstructed HR image. To overcome this problem, we propose a dictionary classification method based on the measurement domain. In our past work [37, 38, 39, 40, 41], we have proposed an adaptive ADMM algorithm with support and a maximum-likelihood dictionary to improve the ability of the dictionary to represent the signal sparsely. First, we classify the images in the sample database in the measurement domain; then, we use them to train different categories of dictionaries and reconstruct the input image block using the closest dictionary to improve the definition of the HR image. In our previous work [42, 43], we theoretically proved the approximately linear relationship between the cross-covariance matrixes in the measurement domain and frequency domain, which can be formulated as follows:

[TABLE]

where $m$ and $n$ represent the sample numbers in the measurement domain and frequency domain, respectively. The images in the frequency domain and pixel domain are also closely related. Generally, an edge texture block is more sparse than a smooth block. We propose a classification method in the measurement domain using covariance matrixes to classify the image blocks in the training set. Different types of dictionaries are trained using different kinds of image blocks. The overall block diagram of the dictionary classification method based on the measurement domain is shown in Fig. 1.

We select the brain tissue MRI image as sample for the experiment to show the performance of classifying image blocks in the measurements. The images are divided into $8\times 8$ blocks using the Gaussian matrix as the measurement matrix. Here, we use $T_{1}=3\times 10^{6}$ and $T_{2}=3\times 10^{7}$ . The result is shown in Fig. 2.

The experimental result shows that the proposed classification method in this paper can better classify the image blocks into smooth blocks, texture blocks, and edge blocks. The sampling rate determines the amount of data to be sorted and processed. The lower the sampling rate, the fewer data there are to calculate. However, when the sampling rate is extraordinarily low, the measured value vector will be reduced accordingly. This process fails to contain all information of the original image, which leads to a large deviation in the classification results. The experimental experience value shows that if the sampling rate is not less than 0.4, better results can be guaranteed.

III-B Nonlocal Similarity of an Image

Natural images should preferably be rich in content and have certain repeatability in structural features. The repetitive information of an image has been widely used in image recovery, image denoising, and other issues. The fundamental principle of a nonlocal algorithm is to give different weight coefficients to the similar points of the current pixel using their linear combination to represent the current pixel. Therefore, the internal structure of the pixels can be maintained. Of course, the value of the coefficients dramatically depends on the similarity of the two pixels. The local phase theory holds that the similarity points of pixels exist in their adjacent local regions and that the neighborhood points have a high degree of approximation with the current point. However, the nonlocal similarity theory considers the repeatability of the image structure and holds that two pixels may have a higher degree of approximation even in the case of a longer spatial distance. Inspired by the nonlocal features of the image, this paper applies it to the SR algorithm to improve the quality of the HR image reconstruction.

Suppose that an image $I=\{I(i,j)\}|(i,j\in\Omega)$ has definition in $\Omega\subset N^{2}$ , we use the linear combination of other similar pixel points with different weight coefficients to represent the current pixel $(i_{0},j_{0})$ ; its weighting value is:

[TABLE]

where the value of ${w(i,j)}_{(i_{0},j_{0})}$ is determined by the approximation degree of $(i,j)$ and $(i_{0},j_{0})$ , which obeys $0\leq w(i,j)\leq 1$ and $\Sigma{w(i,j)}=1$ . Taking Fig. 3 as an example, $q_{1}$ is similar to $p$ in terms of gray value, whereas $q_{2}$ is significantly different from $p$ . Therefore, the value of $w(q_{1})$ is far greater than that of $w(q_{1})$ .

We define the pixel-centered window as the subset of $\Omega$ : $N={\{N_{i,j}\}_{(i,j)\in\Omega}}$ . We define the similarity between two central pixels by comparing the similarity of two window regions. Thus, the weight coefficient is proportional to the similarity of the two window areas and can be computed as:

[TABLE]

where $Z(i,j)=\sum\limits_{i,j}{\exp(-\left\|{z({N_{{i_{0}},{j_{0}}}})-z({N_{i,j}})}\right\|_{2}^{2}/{h^{2}})}$ is the normalization factor, and $h$ is the decline rate of function.

III-C CS and Nonlocal Similarity-based Reconstruction

The previous work has shown that the image blocks may frequently appear to be more similar in the interior of the image than in the exterior training database [44].

Compared with the learning of the exterior library, more useful information can be obtained from the relevant information extracted from the interior of the image. However, for some image blocks, the information learned by themselves is limited and is not sufficient to reconstruct high-quality HR image blocks. Therefore, it is also necessary to obtain prior information through external learning to guide the current image block reconstruction. In this paper, we combine the nonlocal self-similar information of the image with the external dictionary and propose an SR method based on CS and self-similarity.

There are many similar blocks in the image and among different scales. A larger search area yields more similar blocks. To obtain more information contained within a single image, a tentative nonlocal search strategy is proposed in this paper. The adjacent regions of the current image block are helically squared matching to find similar blocks; for remote blocks, variable step-size searching is used according to the effect of similar blocks that have been found. This approach which may fully mine the similar information in the image and can be quickly completed.

The reconstruction process in this paper is shown in Fig. 4. For any image block $y$ of an input LR image, a dictionary pair $(D_{h},D_{l})$ of the corresponding category is selected according to its variance. All of its similar blocks $S=\{y^{1},y^{2},...,y^{n}\}$ are found in the whole image. We add the self-similarity as the constraint, which requires coefficient $\alpha$ to be of high sparsity, and the HR image block represented by it has high similarity with its similar block $S$ . The joint solution process using $s$ and $(D_{h},D_{l})$ can be expressed as:

[TABLE]

where $\alpha$ is the sparsity degree of current image block $y$ , and ${\alpha}^{i}$ is the representation coefficient of $y^{i}$ on $D_{l}$ . The first two items in the equation are used to guarantee the fidelity of the input LR image blocks, the two middle $l_{1}$ regularization items guarantee the sparsity of representation of the LR blocks on $D_{l}$ , and the last item ensures the degree of approximation between the recovered HR image block and the similar block. The degree of approximation is controlled by ${\gamma}_{i}$ :

[TABLE]

where $Z$ is the normalization parameter.

The second, fourth and fifth items in Equation 9 represent the nonlocal similarity information of the image blocks. We obtain coefficient $\alpha$ by solving Equation 9. Then, the HR image block can be obtained by

[TABLE]

By processing all LR blocks according to these steps, we recover the HR image $X$ . Then, the IBP algorithm is used to more consistently guide $X$ to adjust along the direction with the image degradation model so that the final reconstructed HR image is consistent with the input LR image based on the image degradation model.

IV Experimental Results

The experiments are performed on both synthetic and real brain MRI images with the magnification factors of 2 and 4. We compare the results with the existing work [7, 10]. Their methods are denoted as Bicubic, and BSRCNN, respectively, for convenience.

We adopt the synthetic brain MRI images selected from Brainweb dataset [45] 111http://brainweb.bic.mni.mcgill.ca/brainweb/, MRT dataset 222https://www.mr-tip.com/, and the real MRI data from MIDAS dataset333http://insight-journal.org/midas/collection/view/190 which acquired with a 3T GE scanner at Brigham and Women’s Hospital in Boston, MA and contains 10 normal and 10 schizophrenic patients.

IV-A Evaluation Criterion

Generally, the performance of the SR algorithm is evaluated from the following two perspectives:

•

Subjective evaluation. This method is mainly based on the visual perception of the human eyes to evaluate the quality of the image. Because individuals have different perceptions of the same image, this evaluation method is more influenced by subjective factors, which leads to the existence of individual differences.

•

Objective evaluation. Since the LR test image in the SR algorithm is usually simulated by the degradation model of the HR image, there exists an original HR image, which is compared with the reconstructed image. The objective evaluation method is to determine the similarity between the recovered image and the original image using a calculation method. In this paper, two important criteria to evaluate the objective quality of SR methods are the PSNR (peak signal-to-noise ratio) and SSIM (structural similarity image measurement).

[TABLE]

where $X$ is the original HR image, $Y$ is the recovered HR image, and $M$ and $N$ represent the size of the image.

IV-B Visual Results

Because the human eye system is sensitive to the luminance component, we only focus on the luminance $Y$ channel in the SR reconstruction of color images. The values of the chroma $C_{b}$ and $R_{c}$ channels are directly obtained using Bicubic upsampling. In the experiments, the size of the image block is $5\times 5$ , the overlap part is 4 pixels, and the number of dictionary atoms is 512. The HR images in the test sets are downsampled by using the fuzzy downsampling matrix, and the corresponding LR images are generated by simulating the image degradation model. We use the proposed method and other reference algorithms to perform $2\times$ and $4\times$ SR reconstructions, respectively.

The visual results obtained using Bicubic, BSRCNN and the presented method are illustrated in Figs. 5, 6, 7, and 8.

Therefore, we conclude that the reconstructed images using our proposed method are rich in texture areas, have more natural outlines, and have no apparent zigzag effect.

IV-C Objective evaluation

In terms of objective quality, our proposal is compared with Bicubic [7], SROD [31] and BSRCNN [10]. The performance is measured regarding PSNR and SSIM. We average the results of ten test images as the PSNR/SSIM value shown in the following tables. The results in Tabs. I, II and III show that the proposed method has better objective quality than other algorithms. Both PSNR and SSIM are improved: the PSNR value is increased by approximately 0.9-5.9dB, and the SSIM value is increased by approximately 0.02-0.14. Compared with the result using the magnification factor of 2, the improvement of 4 times magnification is more remarkable. Thus, when the magnification factor increases, we can obtain more significant improvement in HR image quality.

V Conclusion

In this paper, we have extended the previous work by paying attention to the nonlocal self-similarity and the block classification of an LR image. We propose an image SR algorithm based on compressed sensing and self-similarity constraint. This proposed method is applied to solve the brain MRI super-resolution problem, and the satisfactory results may be acquired. Because the difference of image blocks are not considered when training dictionaries, a dictionary classification method based on the measurement domain is proposed in the dictionary training part. Specifically, we use the linear relationship between images in the measurement domain and frequency domain to classify the image blocks into smooth, texture and edge feature blocks in the measurement domain. The dictionaries for different blocks are trained using different categories. Consequently, an LR image block of interest may be reconstructed using the most appropriate dictionary. If one merely learns the prior knowledge from the external image database, it tends to generate untrue details of the reconstructed HR image. In our proposed method, we use the nonlocal similarity of the image to tentatively search for similar blocks in the whole image and present a joint reconstruction method based on the classified dictionaries and similarity constraints. The sparsity and self-similarity of the image blocks are taken as the constraints.

In summary, a dictionary classification method based on the measurement domain is presented. Then, the corresponding dictionaries are trained using the classified image blocks. Equally important, in the reconstruction part, we use the CS reconstruction method to recover the HR image, considering both nonlocal similarity and sparsity of an image as the constraints. This method visually and quantitatively performs better than some existing methods. To verify the performance of the proposed method, many experiments have been accomplished on both the synthetic and real brain MRI images. The experimental results indicate that the proposal enhances the quality of the recovered HR brain MRI image, and our method results in visually and quantitatively superior performance.

Bibliography45

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1[1] R. C. Hardie, K. J. Barnard, and E. A. Armstrong, “Joint map registration and high-resolution image estimation using a sequence of undersampled images,” IEEE Transactions on Image Process, vol. 6, no. 12, pp.1621-1633, Dec. 1997.
2[2] S. Farsiu, M. D. Robinson, M. Elad, and P. Milanfar, “Fast and robust multiframe super-resolution,” IEEE Trans. Image Process., vol. 13, no. 10, pp. 1327-1344, Oct. 2004.
3[3] M. E. Tipping and C. M. Bishop, “Bayesian image super-resolution,” in Proc. Adv. Neural Inf. Process. Syst. 16 2003, pp. 1303-1310.
4[4] S. Baker and T. Kanade, “Limits on super-resolution and how to break them,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 24, no. 9, pp.1167-1183, Sep. 2002.
5[5] H. Takeda, S. Farsiu, and P. Milanfar, “Kernel regression for image processing and reconstruction,” IEEE Transactions on image processing, vol. 16, no. 2, pp.349-366, 2007.
6[6] X. Li and M. Orchard, “New edge-directed interpolation,” IEEE Transactions on image processing, vol. 10, no. 10, pp. 1521-1527, 2001.
7[7] Y. Lee and J. Yoon, “Nonlinear image upsampling method based on radial basis function interpolation,” IEEE Transactions on image processing, vol. 19, no. 10, pp. 2682-2692, 2010.
8[8] Y. Romano, M. Protter, and M. Elad, “Single image interpolation via adaptive nonlocal sparsity-based modeling,” IEEE Transactions on image processing, vol. 23, no. 7, pp.3085-3098, 2014.