TL;DR
This paper introduces a novel low-rank matrix estimation method for hyperspectral super-resolution, leveraging global and local low-rank structures to improve image reconstruction from multispectral and hyperspectral data.
Contribution
It proposes a global-local low-rank regularization framework and an efficient optimization algorithm for hyperspectral super-resolution, accounting for local spectral variations.
Findings
Outperforms benchmark algorithms in recovery accuracy
Effective in synthetic, semi-real, and real data scenarios
Leverages recent non-convex optimization advances
Abstract
Hyperspectral super-resolution (HSR) is a problem that aims to estimate an image of high spectral and spatial resolutions from a pair of co-registered multispectral (MS) and hyperspectral (HS) images, which have coarser spectral and spatial resolutions, respectively. In this paper we pursue a low-rank matrix estimation approach for HSR. We assume that the spectral-spatial matrices associated with the whole image and the local areas of the image have low-rank structures. The local low-rank assumption, in particular, has the aim of providing a more flexible model for accounting for local variation effects due to endmember variability. We formulate the HSR problem as a global-local rank-regularized least-squares problem. By leveraging on the recent advances in non-convex large-scale optimization, namely, the smooth Schatten-p approximation and the accelerated majorization-minimization…
| Chikusei | Cuprite | |||
| ( pixels, bands) | ( pixels, bands) | |||
| patch size | approx. rank | patch size | approx. rank | |
| 10 | 9 | |||
| 8.750.96 | 8.251.26 | |||
| 8.331.73 | 7.441.33 | |||
| 8.191.83 | 7.131.20 | |||
| 7.921.89 | 6.761.13 | |||
| 7.811.98 | 6.641.13 | |||
| 7.382.13 | 6.381.06 | |||
| 7.082.29 | 6.121.09 | |||
| 6.892.34 | 6.031.02 | |||
| 6.522.35 | 5.841.00 | |||
| 6.472.36 | 5.791.01 | |||
| Indian Pine | University of Pavia | |||
| ( pixels, bands) | ( pixels, bands) | |||
| patch size | approx. rank | patch size | approx. rank | |
| 22 | 22 | |||
| 19.751.71 | 20.751.89 | |||
| 17.564.10 | 20.442.07 | |||
| 16.633.77 | 20.442.66 | |||
| 15.203.94 | 20.282.84 | |||
| 14.694.44 | 20.033.08 | |||
| 13.364.47 | 19.803.43 | |||
| Washington DC Mall | Moffett Field | |||
| ( pixels, bands) | ( pixels, bands) | |||
| patch size | approx. rank | patch size | approx. rank | |
| 6 | 13 | |||
| 5.500.58 | 12.002.71 | |||
| 5.220.67 | 11.562.65 | |||
| 4.501.03 | 11.002.94 | |||
| 4.521.26 | 10.682.87 | |||
| 4.391.23 | 10.562.92 | |||
| 4.201.17 | 10.082.89 | |||
| Chikusei | Indian Pines | Washinton DC Mall | University of Pavia | Moffett Field | |
| MS response | IKONOS | Landsat 4 TM | Landsat 4 TM | IKONOS | Landsat 4 TM |
| Image size () | 128230,400 | 17814,400 | 19157,600 | 10357,600 | 18757,600 |
| Patch size () | 1283,600 | 178900 | 1903,600 | 1033,600 | 1873,600 |
| Patch number | 64 | 16 | 16 | 16 | 16 |
| Method | Time (sec.) | PSNR | SAM | ERGAS | UIQI |
| Ideal value | 0 | 0 | 0 | 1 | |
| Dataset - Chikusei | |||||
| GSA | 1.440.15 | 33.110.01 | 4.850.01 | 6.590.01 | 0.7870.000 |
| GLP | 43.448.84 | 29.130.00 | 5.360.01 | 7.920.01 | 0.7330.000 |
| CNMF | 147.5464.93 | 34.130.11 | 4.210.08 | 5.240.11 | 0.8110.004 |
| FUMI | 293.3124.18 | 35.180.04 | 2.550.03 | 4.030.03 | 0.8820.001 |
| HySure | 130.8811.80 | 36.150.15 | 2.840.09 | 4.310.08 | 0.8630.003 |
| LRSR | 279.2643.04 | 35.840.09 | 2.880.03 | 4.410.06 | 0.8800.002 |
| NNM | 91.9512.48 | 32.630.01 | 4.960.00 | 7.490.01 | 0.7640.000 |
| GLORIA | 119.9921.78 | 37.730.01 | 2.310.00 | 3.590.00 | 0.8940.000 |
| Dataset - Indian Pines | |||||
| GSA | 0.440.08 | 19.072.24 | 7.199.30 | 4.092.78 | 0.5120.067 |
| GLP | 2.380.51 | 18.060.48 | 4.840.09 | 3.660.20 | 0.3970.034 |
| CNMF | 16.574.04 | 22.640.22 | 4.290.10 | 2.430.07 | 0.5230.009 |
| FUMI | 11.400.30 | 24.850.15 | 2.690.05 | 1.760.04 | 0.7620.007 |
| HySure | 10.960.37 | 26.700.16 | 2.740.04 | 1.410.03 | 0.6980.008 |
| LRSR | 18.800.28 | 27.980.14 | 2.530.03 | 1.180.02 | 0.7960.004 |
| NNM | 6.810.89 | 26.430.04 | 2.930.01 | 1.710.01 | 0.6790.002 |
| GLORIA | 14.560.92 | 29.090.03 | 2.280.00 | 1.040.00 | 0.8040.001 |
| Dataset - Washington DC Mall | |||||
| GSA | 0.970.09 | 20.090.10 | 6.510.03 | 20.350.38 | 0.6540.004 |
| GLP | 7.681.83 | 18.200.09 | 7.000.09 | 14.850.06 | 0.5820.005 |
| CNMF | 31.7416.95 | 25.710.19 | 3.840.08 | 6.940.22 | 0.7720.007 |
| FUMI | 44.330.69 | 29.170.10 | 2.490.02 | 2.910.06 | 0.9150.004 |
| HySure | 31.288.81 | 28.820.33 | 3.000.07 | 3.740.25 | 0.8720.009 |
| LRSR | 71.921.55 | 27.910.16 | 3.850.10 | 3.590.15 | 0.8690.007 |
| NNM | 27.631.71 | 27.570.12 | 3.280.02 | 8.050.03 | 0.8130.005 |
| GLORIA | 57.068.14 | 29.360.23 | 2.850.02 | 3.180.20 | 0.8880.002 |
| Dataset - University of Pavia | |||||
| GSA | 0.490.05 | 30.500.02 | 6.790.03 | 3.050.01 | 0.9000.001 |
| GLP | 4.451.14 | 25.710.07 | 7.880.11 | 5.540.06 | 0.7640.005 |
| CNMF | 27.0821.05 | 35.860.18 | 4.090.09 | 1.840.04 | 0.9460.002 |
| FUMI | 36.720.81 | 35.290.04 | 3.120.01 | 1.760.01 | 0.9620.000 |
| HySure | 27.469.73 | 36.710.14 | 3.340.02 | 1.550.01 | 0.9610.001 |
| LRSR | 58.291.20 | 36.430.09 | 3.450.05 | 1.650.03 | 0.9590.001 |
| NNM | 10.550.41 | 36.810.01 | 3.480.00 | 1.550.00 | 0.9570.000 |
| GLORIA | 16.680.50 | 37.510.01 | 3.160.00 | 1.460.00 | 0.9640.000 |
| Dataset - Moffett Field | |||||
| GSA | 0.870.06 | 30.560.02 | 5.840.01 | 2.790.01 | 0.8430.000 |
| GLP | 6.500.70 | 28.180.19 | 5.300.08 | 3.530.07 | 0.7750.013 |
| CNMF | 17.389.60 | 36.080.12 | 3.190.06 | 1.570.03 | 0.9190.002 |
| FUMI | 35.071.31 | 34.650.02 | 2.480.02 | 1.680.01 | 0.9460.000 |
| HySure | 26.727.39 | 36.900.19 | 2.730.05 | 1.410.02 | 0.9390.002 |
| LRSR | 55.780.69 | 36.970.12 | 2.580.04 | 1.360.02 | 0.9420.001 |
| NNM | 22.793.52 | 37.490.10 | 2.510.03 | 1.340.01 | 0.9400.001 |
| GLORIA | 44.385.46 | 38.230.01 | 2.300.00 | 1.220.00 | 0.9490.000 |
| / - 15dB | |||||
| Method | PSNR | SAM | ERGAS | UIQI | |
| Ideal value | 0 | 0 | 1 | ||
| GSA | 10.981.27 | 19.294.71 | 9.491.83 | 0.2160.055 | |
| GLP | 14.760.32 | 11.090.18 | 5.900.17 | 0.2730.048 | |
| CNMF | 15.730.41 | 10.520.52 | 5.430.22 | 0.2990.050 | |
| FUMI | 17.630.16 | 7.690.42 | 4.320.17 | 0.4360.055 | |
| HySure | 17.020.51 | 9.210.32 | 4.770.17 | 0.3630.064 | |
| LRSR | 18.820.72 | 6.980.54 | 3.920.26 | 0.4340.074 | |
| NNM | 17.290.50 | 8.880.20 | 4.430.10 | 0.3720.065 | |
| GLORIA | 18.460.23 | 7.900.10 | 4.000.06 | 0.4320.060 | |
| 21.460.12 | 5.030.13 | 3.000.07 | 0.5830.051 | ||
| 21.760.14 | 4.610.10 | 2.830.06 | 0.5980.053 | ||
| / - 25dB | |||||
| GSA | 18.642.05 | 9.439.14 | 4.913.69 | 0.6080.102 | |
| GLP | 19.410.47 | 4.010.14 | 3.400.21 | 0.6040.068 | |
| CNMF | 24.710.35 | 3.890.20 | 1.970.10 | 0.7290.048 | |
| FUMI | 22.540.34 | 2.470.14 | 2.510.08 | 0.8530.034 | |
| HySure | 31.330.26 | 1.260.09 | 0.900.04 | 0.9230.021 | |
| LRSR | 30.630.22 | 1.510.12 | 1.010.05 | 0.9140.021 | |
| NNM | 27.300.44 | 2.780.05 | 1.420.03 | 0.8200.045 | |
| GLORIA | 28.990.21 | 2.210.08 | 1.250.03 | 0.8700.029 | |
| 30.700.22 | 1.620.08 | 1.000.03 | 0.9090.023 | ||
| 31.320.21 | 1.420.10 | 0.900.04 | 0.9240.020 | ||
| / - 35dB | |||||
| GSA | 21.312.55 | 7.8511.01 | 4.384.35 | 0.7960.109 | |
| GLP | 21.070.53 | 1.680.09 | 2.790.19 | 0.7820.070 | |
| CNMF | 32.250.35 | 1.760.12 | 0.890.06 | 0.9250.020 | |
| FUMI | 23.450.30 | 1.470.11 | 2.320.09 | 0.9490.010 | |
| HySure | 35.930.42 | 1.020.10 | 0.550.04 | 0.9770.006 | |
| LRSR | 34.890.40 | 1.030.11 | 0.630.04 | 0.9710.007 | |
| NNM | 34.870.32 | 1.190.02 | 0.590.01 | 0.9590.014 | |
| GLORIA | 35.480.51 | 1.090.08 | 0.580.05 | 0.9660.012 | |
| 36.180.22 | 1.010.03 | 0.520.01 | 0.9700.011 | ||
| 36.730.49 | 0.960.09 | 0.530.04 | 0.9740.008 | ||
| Exact MM | Inexact MM via nominal PG | Inexact MM via APG (GLORIA) | |
|---|---|---|---|
| Time (sec.) | 214.16114.83 | 102.482.44 | 24.250.35 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
\fail
Hyperspectral Super-Resolution via
Global-Local Low-Rank Matrix Estimation
Ruiyuan Wu*†, Wing-Kin Ma†, Xiao Fu‡*, and Qiang Li⋆
*†*Department of Electronic Engineering, The Chinese University of Hong Kong,
Hong Kong SAR of China
*‡*School of Electrical Engineering and Computer Science,
Oregon State University, Corvallis, USA
⋆School of Information and Communications Engineering,
University of Electronic Science and Technology of China, China
E-mails: [email protected], [email protected],
[email protected], [email protected]
Abstract
Hyperspectral super-resolution (HSR) is a problem that aims to estimate an image of high spectral and spatial resolutions from a pair of co-registered multispectral (MS) and hyperspectral (HS) images, which have coarser spectral and spatial resolutions, respectively. In this paper we pursue a low-rank matrix estimation approach for HSR. We assume that the spectral-spatial matrices associated with the whole image and the local areas of the image have low-rank structures. The local low-rank assumption, in particular, has the aim of providing a more flexible model for accounting for local variation effects due to endmember variability. We formulate the HSR problem as a global-local rank-regularized least-squares problem. By leveraging on the recent advances in non-convex large-scale optimization, namely, the smooth Schatten- approximation and the accelerated majorization-minimization method, we develop an efficient algorithm for the global-local low-rank problem. Numerical experiments on synthetic, semi-real and real data show that the proposed algorithm outperforms a number of benchmark algorithms in terms of recovery performance.
1 Introduction
Hyperspectral (HS) sensors have limited spatial resolution as a tradeoff for achieving high spectral resolution. Such a tradeoff is due to hardware limitations and the measurement mechanism. In a nutshell, a certain amount of light energy reflected by the scene is required for each spectral band of an HS pixel to achieve sufficiently high SNRs. For a sensor with coarse spectral resolution, enough energy can be acquired from a small area by accumulating energy of a wide range of spectral bands. When the spectral resolution increases, the area sensed by a pixel needs to be enlarged to acquire the same amount of energy for each spectral band, which leads to a lower spatial resolution. How to enhance the spatial resolution of HS images has been a subject of great interest. Recently, the idea of using an additional multispectral (MS) image—which has finer spatial resolution than the HS but possesses only several coarse spectral bands—for HS spatial resolution enhancement has shed new light on the subject. This MS-aided enhancement problem is called hyperspectral super-resolution (HSR) or HS-MS data fusion. One approach for HSR is to adapt pansharpening techniques in fusion of panchromatic and HS images [1]. Another approach, which is currently more popular, is based on low-dimensional data models. The low-dimensional model approach assumes that the spectral pixels of the target high-spatial-resolution HS image, or the super-resolution (SR) image for short, lie in a low-dimensional subspace. This assumption is particularly reasonable in the linear spectral mixture scenario: Since the aforesaid scenario has every spectral pixel described as a linear combination of the spectral signatures of the underlying endmembers, the spectral pixels lie in a subspace spanned by the endmember spectral signatures. Also, since the number of endmembers is often much smaller than the number of HS spectral bands, the subspace dimension is low. The low-dimensional model may also be applicable to some classes of nonlinear spectral mixtures such as the bilinear mixture model [2, 3]. The low-dimensional model approach has strong connections to hyperspectral unmixing (HU). To be specific, insights and methods in HU are quite often used in the low-dimensional model approach. A comparative review has shown that the low-dimensional model approach can lead to better recovery than those from the pansharpening approach, assuming no or negligible HS-MS co-registration error [1].
To facilitate our discussion later, in this paper we taxonomize the existing low-dimensional model-based HSR methods into two types.
Matrix factorization: This type of methods models the spectral-spatial matrix of the SR image as a product of two matrix factors—one being the spectral dictionary, and another the coefficients for low-dimensional representation, and it seeks to jointly estimate the two matrix factors from the observed HS-MS pair. Coupled non-negative matrix factorization (CNMF) [4], a pioneering HSR method, falls into this type. As its name suggests, CNMF exploits the non-negativity of the matrix factors. Subsequent research follows the same route and exploits other problem structures—sparsity [5], the sum-to-one abundance condition from the linear spectral mixture model [6, 7], non-local pixel similarity [8], and many more—to attempt to improve recovery quality. 2. 2.
Dictionary-based regression: This type of methods also assumes that the spectral-spatial matrix of the SR image is the product of a spectral dictionary and the associated coefficient matrix. The difference is that it does not seek to jointly estimate the spectral dictionary and the coefficients. It first determines the spectral dictionary via some easy way, and then uses that spectral dictionary to perform regression to recover the coefficients. A typical example is HySure [9], which extracts the spectral dictionary by applying vertex component analysis (VCA) [10] to the HS image, and then recovers the coefficients by applying spatial total variation-regularized linear regression to the HS-MS image pair. Other methods include [11, 12], which exploit the local low-rank structure; this will be further discussed later. The dictionary-based regression methods are easy to implement compared to the matrix factorization methods.
Research on these two types of methods is mostly focused on the practical aspects, and it is worthwhile to note that some specific methods have recently been shown to exhibit theoretical recovery guarantees as well [13, 14]—which supports the soundness of the low-dimensional model approach via a theoretical lens.
Matrix factorization and dictionary-based regression are considered most representative in HSR methods, although there are others. For example, tensor factorization has recently been studied for HSR [15, 16, 17, 18, 19, 20]. The tensor model is also a low-dimensional model, and it exploits not only the spectral-spatial structure but also the two-dimensional spatial structure. Tensor factorization is shown to exhibit favorable sufficient conditions on exact recovery guarantees [16, 17]. In addition, deep learning for HSR has most recently gained growing interest [21, 22, 23].
Under the low-dimensional model, HSR can be seen as a problem of recovering a low-rank matrix from incomplete observations; this will be elaborated upon in Sections 3 and 4. From this perspective, the problem is nearly the same as the matrix completion problem which has drawn widespread interest in fields such as recommender systems, machine learning and mathematical optimization [24, 25, 26]. The problem in matrix completion is that we have a matrix with many missing entries, and we aim to recover the missing entries from the available entries. The main assumption in matrix completion is that the matrix to be recovered has low rank structure. This assumption is the same as the low-dimensional model assumption in HSR. In matrix completion we see two main types of methods. One is matrix factorization, which shares the same rationale as matrix factorization for HSR. Another is low-rank matrix estimation. This approach does not pre-determine the target dimension of the low-dimensional subspace, or the target rank of the matrix to be recovered, which is the case in matrix factorization. Instead of using the matrix factorization model, it seeks the minimum rank matrix for accomplishing the task. A well-known method in low-rank matrix estimation is the nuclear norm minimization method [24]. It is a convex solution, and the idea is to approximate the hard-to-handle rank function by the nuclear norm which is convex. Non-convex rank approximation, such as the Schatten- approximation, was also considered for approximating the low-rank problem better [27]. Back to HSR, while we have seen numerous studies on matrix factorization and dictionary-based regression, we see far less on low-rank matrix estimation.
In addition, and as a different issue, the existing low-dimensional model-based HSR methods are usually not designed to account for the endmember variability (EV) effects due, for instance, to illumination conditions and intrinsic spectral variability of the materials [28]. In low-dimensional models, EV means that the spectral dictionary can vary in space. Taking a step back to HU, we have seen studies that use the matrix factorization method to deal with endmember variability [29, 30, 31]. In that regard, a possibility one can consider is to adapt such matrix factorization methods to the HSR application. We are however unaware of such development as of the writing of this paper.
In this paper, our objective is to explore the potential of low-rank matrix estimation in HSR. Our study is not a direct application of the existing low-rank matrix estimation methods, such as the nuclear norm minimization method. Our formulation takes the possibility of EV into consideration. We posit a global-local low-rank structure with the SR image, in which not only the spectral-spatial matrix of the whole SR image has low-rank structure, but that of each local area also has another low-rank structure. This assumption means that each local area can have its low-dimensional representation. The local low-rank assumption provides the model with the flexibility to account for EV. Moreover, since the low-dimensional subspaces of the local areas should be related, particularly the neighboring ones, we also assume that the whole spectral-spatial matrix has low rank and utilize it to tie the local subspaces together. As we will see, our global-local low-rank matrix estimation leads to a fairly clean formulation. In comparison, if one applies the EV-present matrix factorization methods in HU to HSR, the resulting formulation would be more complicated.
The arising challenge and our proposed solution should be described. The global-local low-rank matrix estimation problem is a non-convex large-scale problem. For example, to recover an SR image of spectral bands and of size , our problem requires us to handle optimization variables. An efficient optimization strategy is clearly needed. We attack the problem by leveraging on the recent advances in non-convex large-scale optimization, namely, the smooth Schatten- approximation [27] and an accelerated version of the majorization-minimization (MM) method [32]. As mentioned, the smooth Schatten- approximation, albeit non-convex, approximates rank better in comparison with the convex nuclear norm. Also, its smooth nature enables us to access powerful machinery in smooth optimization. The accelerated MM method is a combination of inexact MM and the accelerated projected gradient (PG) method. Our recent research in another context [32]—which also deals with large problem sizes—has suggested that this type of accelerated methods runs very fast in practice. Using the above two techniques, we develop a fast algorithm called Global-Local lOw-Rank promotIng Algorithm (GLORIA). As will be shown by numerical results, GLORIA gives competitive recovery performance compared to the state of the arts and related methods. We conducted semi-real experiments on five different datasets, and GLORIA consistently ranks first or second in performance indicators such as peak SNR and spectral angle mappers. We also provide results on synthetic and real data experiments, in which GLORIA also exhibits promising performance.
Before we proceed to the description of our method, we should mention related works. First, the dictionary-based regression method in [11], HSR-LDL-EIA, also utilizes some kind of local low-rank structures. HSR-LDL-EIA considers the linear spectral mixture model and assumes that the number of endmembers in each local area is very small and no greater than the number of MS bands. With that assumption, the HSR problem can be easily solved in a local-area-by-local-area fashion—which is what HSR-LDL-EIA does. The local low-rank assumption used in HSR-LDL-EIA is not the same as the one used by us. As discussed earlier, our local low-rank assumption is to cater for EV. Second, the recent work in [12] takes insight from the local low-rank assumption in HSR-LDL-EIA and proposes a dictionary-based regression method using local nuclear norm-regularized linear regression. Again, the local low-rank assumption in [12] is founded on the argument that the number of endmembers in each local area is small. We should also mention the works [33, 34] which follow similar rationales as those in [11, 12].
Let us summarize our contributions.
We consider low-rank matrix estimation for HSR, which has not been studied in prior works. We propose a global-local low-rank approach which aims to account for EV effects. 2. 2.
The global-local low-rank approach requires us to tackle a large-scale non-convex optimization problem. We custom-develop an efficient algorithm for such purpose, using recent advances in large-scale non-convex optimization. As will be shown in numerical experiments, our algorithm has competitive recovery performance.
Readers who are interested in trying our algorithm can find the source codes at https://github.com/REIYANG/GLORIA.
2 Background
2.1 The Measurement Model
Let us begin by providing a concise review of the background. Fig. 1 depicts the scenario. We have a scene observed by an HS sensor and an MS sensor. The MS sensor has a lower spectral resolution than the HS sensor, while the HS sensor has a lower spatial resolution than the MS sensor. The goal of HSR is to use the observed MS and HS images to construct a higher resolution image whose spectral resolution is identical to that of the HS sensor, and spatial resolution the MS. For convenience, the image we seek to construct will be called the super-resolution (SR) image. As a common assumption (see, e.g., [4]), the HS image is modeled as a spatially degraded version of the SR image by means of spatial blurring and down-sampling. Also, the MS image is modeled as spectrally degraded version of the SR image by means of spectral bandpass averaging.
The HSR data model is as follows. Assuming that the HS and MS images are co-registered, we model the HS and MS images as
[TABLE]
where and are the spectral-spatial matrices of the observed MS and HS images, respectively (resp.); and are the numbers of spectral bands of the MS and HS images, resp., with ; and are the numbers of pixels of the MS and HS images, resp., with ; is the spectral-spatial matrix of the SR image; and describe the measurement responses that lead to the MS and HS images, resp.; and are noise. Note that designates the relative spectral bandpass responses from the SR image to the MS image, while specifies the spatial blurring and down-sampling responses that result in the HS image. The measurement response matrices and are assumed to be known, and in practice and can be acquired either by calibration [4] or by estimation from the HS and MS images [35, 9]. Furthermore, the MS and HS images are measured by means of reflectance, with values lying between [math] and . As such, we may assume that , for all .
2.2 The Matrix Factorization Model
Next, we describe the matrix factorization model which is the core assumption for matrix factorization and dictionary-based regression methods in HSR. In the matrix factorization model we assume that the SR image can be factored as
[TABLE]
where is the spectral dictionary; is the coefficient matrix; is the target rank or model order, which is pre-fixed and is often chosen to be much less than and . In many existing studies, the model (2) is seen as the linear spectral mixture model in which the columns ’s of are interpreted as spectral signatures of the endmembers of the scene, and the columns ’s of the associated abundances of the pixels. Under the model (2), the matrix factorization methods seek to find from by minimizing a data-fitting loss function, and thereby reconstruct by . For example, in CNMF [4], the idea is to solve
[TABLE]
where means that is element-wise non-negative; denotes the Frobenius norm. CNMF, as well as other matrix factorization formulations, considers structured factors to better exploit the underlying problem structure. CNMF utilizes the fact that, under the linear spectral mixture model, and are non-negative. Moreover, in the dictionary-based regression methods, one first determines from by methods such as principal component analysis (PCA) or VCA. Then, is estimated by solving the data-fitting problem like the one in (3), but with fixed. In estimating , regularization such as total variation may be added for problem structure exploitation [9]. A common, and key, concept behind the various matrix factorization and dictionary-based regression methods is that although is a high-dimensional matrix, we hold the belief that every spectral pixel of should lie in a low-dimensional subspace spanned by .
As an alternative view, matrix factorization and dictionary-based regression may be regarded as methods for estimating a low-rank matrix from the incomplete HS-MS observations. Specifically, the low-dimensional subspace assumption with implies .
3 Global-Local Low-Rank Formulation
This section describes the main development of this paper, global-local low-rank matrix estimation.
3.1 A Brief Review of Low-Rank Matrix Estimation
The low-rank matrix estimation methods have recently become popular in the context of matrix completion [24, 26]. Let us first describe how the de facto standard in low-rank matrix estimation, namely, the nuclear norm approximation, can be applied to HSR. In low-rank matrix estimation, we assume to be a low-rank matrix. This assumption can be interpreted as requiring the columns to lie in a low-dimensional subspace. The matrix factorization model (2) can also be seen as constraining to lie in a low-dimensional subspace, with the subspace dimension no greater than . Hence, both the low-rank matrix estimation and matrix factorization methods exploit low-dimensional data structures. The idea with low-rank matrix estimation is to find a low-rank whose data fitting loss is small. A common low-rank matrix estimation formulation is as follows
[TABLE]
where is given and is called a regularization parameter; denotes the data-fitting loss function and is given by
[TABLE]
Let us give a brief comparison of low-rank matrix estimation and matrix factorization. Matrix factorization methods, such as the one in (3), require one to pre-determine the target rank . There is no such rank constraint in low-rank matrix estimation. The rank of itself serves as the regularization for the low-rank matrix recovery endeavor, and the parameter controls the balance between low rankness and goodness of data fitting.
The challenge with solving problem (4) is that the rank function in (4) is hard to handle; it is non-convex and non-differentiable. The state of the art handles this issue by applying the nuclear norm approximation. The reader is referred to the literature [26] for detailed description of the concept, and here we concisely explain the idea. The rank of is identical to the number of nonzero singular values of , and hence we can express as
[TABLE]
where denotes the th largest singular value of ; if , and if . The idea with nuclear norm approximation is to approximate by removing from (6), which leads to the following approximate function:
[TABLE]
The above function is called the nuclear norm and is known to be convex [24, 26]. Applying this nuclear norm approximation of rank to problem (4) gives rise to the following problem:
[TABLE]
The advantage of the above approximation is that it is a convex problem. Moreover, problem (7) can be efficiently solved by methods such as the accelerated proximal gradient method [36] and ADMM [37].
3.2 The Global-Local Low-Rank Model
We consider a global-local low-rank assumption for the HSR problem. To explain the idea, consider the illustration in Fig. 2. We segment the SR image into a number of local patches. Our belief is that each local patch exhibits its own local rank structure. Or, the low-dimensional subspace of one patch does not need to be the same as that of another. This assumption appears to make sense since real images may have local variation effects due to EV. Moreover, we still keep the old low-rank assumption with . This is because the low-dimensional subspace of one patch should be related to those of its neighboring patches, and such correlations may result in a low-rank in the global sense (with a higher global rank than the local ranks). Alternatively speaking, the low-dimensional subspace of the whole plays the role of tying together the low-dimensional subspaces of the local patches.
To write down the global-local low-rank assumption, we assume that the pixel indices of the image are arranged such that can be conveniently expressed as
[TABLE]
where each is the spectral-spatial matrix of a local area, or local patch, of the image; is the number of patches; is the number of pixels of patch . For example, as illustrated in Fig. 2, we can divide the image into equal-space rectangular blocks as our local patches. Other ways to form the local patches, e.g., via segmentation [11], may also be considered. We assume that every is a low-rank matrix, and is also a low-rank matrix.
The global-local low-rank assumption stated above is fairly general and does not restrict itself to specific mixture models such as the linear spectral mixture model. On the other hand, we can better understand the assumption by a more concrete example, in which the linear spectral mixture model is used, as follows. Suppose we model each to follow the linear spectral mixture model
[TABLE]
where and are the endmember and abundance matrices of patch ; is the total number of endmembers in the whole SR image . In this model, we assume the presence of EV by allowing the endmember matrix to be different for each patch. Note that our model assumes that EV appears at the patch level, not at the pixel level, and this can be justified if the local region of each patch is small enough such that the endmember spectral signatures experience little or no variation within the patch. We also want to impose an assumption that are correlated, since, in reality, they should be variations of one another. Such correlation would mean that can be linearly represented by a “global” basis , whose dimension is no less than , but not significantly greater than owing to the correlations. This further means that lives in a low-dimensional subspace with as its basis. Our global low-rank assumption is to exploit the global low-dimensional structure.
Additionally, in the above motivating model example, at first sight one would be tempted to say that the rank of is universally given by (under the slightly technical premise that and have full column rank and full row rank, resp.). In fact, it is reasonable to assume non-identical . In practice, it is likely that each local region is composed of a small number of endmembers, rather than all of the endmembers. Thus we can assume that among all the rows of , only of them are nonzero (or active). Consequently we have (again, under the technical premise that has full column rank and that the nonzero ’s are linearly independent), which is our local low-rank assumption.
3.3 An Experiment
To support our argument that the global-local low-rank structure would be a reasonable assumption, we perform the following numerical experiment. We take real HS images and numerically evaluate their global and low rank values. The images come from six different datasets, namely, Chikusei, Cuprite, Indian Pines, University of Pavia, Washington DC Mall and Moffett Field. They are shown in Fig. 3 in color composite forms. For each image, we obtain the local patches ’s by the equal-space rectangular segmentation in Fig. 2. For each local patch we evaluate its rank in an approximate manner, specifically, by finding the smallest integer such that
[TABLE]
Table 1 shows the approximate ranks of the tested images under different patch sizes. One can clearly see that all the tested images exhibit global-local low-rank characteristics. For example, for the Chikusei image, the global rank is while the average local rank for is around .
3.4 The Global-Local Low-Rank Matrix Estimation Formulation
Under the global-local low-rank assumption in the preceding subsection, it is natural to formulate the HSR problem as the following global-local low-rank matrix estimation problem:
[TABLE]
where are given regularization parameters; we denote for notational convenience; has been defined in (5); is given by .
As reviewed previously, the standard approach to handle problem (4) is to approximate each by the nuclear norm . Here we pursue a different option, namely, the the smooth Schatten- approximation [27]. To put it into context, let us first define a notation. Given a symmetric positive definite matrix and a number , we define
[TABLE]
where and constitute the eigen-decomposition ; note that is orthogonal, with , and . The smooth Schatten- function of an matrix , with , is defined as
[TABLE]
where are given. This function has the following properties:
- (i)
is smooth (or has derivatives of all orders); 2. (ii)
is convex for , and non-convex for ; 3. (iii)
as , ; 4. (iv)
as , .
As can be seen in the above properties, the smooth Schatten- function is a smooth approximation of rank. Ideally we would like to choose very small and so that closely approximates , but using very small and will also make poorly behaved (e.g., very large Lipschitz constant of the gradient of ).
By replacing in problem (9) with the smooth Schatten- function, we obtain the Schatten- approximation of the global-local low-rank matrix estimation formulation (9) as follows:
[TABLE]
where and are given. We will be interested in the case of . The corresponding problem (11) is non-convex, but we found that, empirically, using results in better recovery performance than using (the smooth nuclear norm case).
4 Global-Local Low-Rank Algorithm
In this section, we develop an algorithm for the global-local low-rank matrix estimation formulation (9). Problem (9) is a large-scale optimization problem with optimization variables, and computational efficiency is a main concern in algorithm design. The algorithm to be presented is custom-designed for problem (9), where we exploit the problem structure for computational efficiency. The main optimization concepts used in our algorithm design are majorization-minimization (MM) and the accelerated projected gradient method.
4.1 Majorization-Minimization
Firstly, we describe the MM method. For notational convenience, let
[TABLE]
In MM we seek a function , called a majorant of , that satisfies
[TABLE]
We also require that, for any given , is convex and continuously differentiable. Given a starting point , the MM method handles problem (9) by iteratively solving
[TABLE]
where the problem at each iteration in (13) is a convex problem. The MM iteration (13) is known to guarantee convergence to a stationary solution to problem (9) [38]. We identify a majorant for problem (9) by resorting to the following result.
Fact 1
[27, Sec. 3.1 and Appendix A]** For , the smooth Schatten- function admits an alternative characterization
[TABLE]
where denotes the set of all symmetric and positive definite matrices;
[TABLE]
Also, the minimum in (14) is uniquely attained at
[TABLE]
By applying Fact 1 to (12), and noting for any , we obtain the following majorant
[TABLE]
where
[TABLE]
Note that computing requires computing the eigendecomposition of , which takes floating point operations; cf. (10). It is easy to show that this is convex and continuously differentiable. Also, solving the MM iteration in (13) is the same as solving
[TABLE]
The above problem is a quadratically regularized least-squares, with an iteratively reweighted quadratic regularizer. Hence, the MM method developed above can be interpreted as an iteratively regularizer-reweighted least-squares method. In fact, if we remove the bound constraints we can solve problem (16) in closed form. However, we would like to keep the bound constraints, and this leads to the development in the next subsection.
4.2 Accelerated Projected Gradient for Solving the MM Iteration
Secondly, we describe an iterative method for solving the MM iteration in (16). We employ the accelerated projected gradient (APG) method [39, 40, 41], which is a fast first-order algorithm. The APG method for solving problem (16) is as follows. Let
[TABLE]
for convenience. Given a starting point , we perform the recursion
[TABLE]
Here, is the step size; denotes the gradient of at ; denotes the projection onto ; is the extrapolated point of the ’s and is given by
[TABLE]
where ; the sequence , called the extrapolation sequence, is given by
[TABLE]
with . The APG iteration (18) is efficient to implement. The gradient is given by
[TABLE]
Also, we have
[TABLE]
It can be verified that, given , computing (20)–(21) takes floating-point operations, where denotes the number of nonzero elements of . We should note that a large number of elements of are zero. Recall that describes the spatial degradation process of local spatial blurring and downsampling. One can show that the number of nonzero elements of each column of depends on the size of the blurring kernel, and in practice the blurring kernel size is often small. Also, the projection operation for the bound set is merely a clipping function, i.e.,
[TABLE]
where and denote all-zero and all-one matrices, resp., and and are taken in the element-wise manner.
We complete the APG development by specifying our step-size rule. The APG method guarantees convergence to the optimal solution if is chosen to be a Lipschitz constant of [40, 41]. We have the following result.
Fact 2
A Lipschitz constant of in (17) is
[TABLE]
where denotes the largest eigenvalue of the argument.
The proof of Fact 2 is shown in Appendix A. The computational cost of (2) is mainly with computing the largest eigenvalues of symmetric matrices, and there are such eigenvalues to compute. The eigenvalue can be computed offline, and the eigenvalues are byproducts of computing in (15) (again, cf. (10)). It suffices to calculate at every MM iteration. Hence, computing (2) takes floating-point operations.
4.3 Algorithm Speedup via Inexact MM
Let us summarize the MM method developed in the last two subsections: We perform the MM iteration (13). Every iteration requires solving a regularized least-squares (with bound constraints) exactly, and we do that by applying the APG method in (18)–(22). The setback with this exact MM approach is that while the APG method is considered fast, it still takes time to solve each MM problem exactly.
The algorithm we finally adopt is an inexact MM method. At each MM iteration, we apply the APG method with one iteration only. Specifically, we replace the exact MM iteration (13) by
[TABLE]
where is given by (19); is chosen as , with given by (2). By using this inexact MM update we hope that the total number of iterations (i.e., the sum of APG iterations incurred by all the MM iterations) may be reduced, and the runtime improved. Based on our numerical experience, the inexact MM method runs much faster than the exact MM. Also, it is shown in [32] that, under some technical assumptions, the above inexact MM method guarantees convergence to a stationary solution.
We summarize the inexact MM algorithm in pseudo-form in Algorithm 1. We call this algorithm Global-Local lOw-Rank promotIng Algorithm (GLORIA). It can be verified that the complexity of GLORIA is per iteration.
5 Numerical Results
We performed extensive numerical experiments to benchmark GLORIA against a number of existing algorithms. The benchmarked algorithms we choose are considered most representative in the context or are related to our method. Namely, they are GSA [42], GLP [43], CNMF [4], FUMI [6], HySure [9], the nuclear norm minimization (NNM) method in (7), and LRSR [12]. GSA and GLP are pansharpening-based methods; CNMF and FUMI are representative methods in matrix factorization-based HSR; HySure is a representative method in dictionary-based regression, and it employs spatial total variation regularization in its regression; LRSR is another dictionary-based regression method which uses local nuclear-norm regularization; NNM is regarded as the baseline method for low-rank matrix estimation in the context of matrix completion, and thus we include it in our experiments. All the algorithms are implemented on a desktop computer with Intel Core i7-5760X@3GHz CPU and 32GB memory. Codes are written in MATLAB R2015a.
The parameter settings of GLORIA are as follows, unless specified otherwise. The parameters of the smooth Schatten- function are , . The local patches are obtained by dividing the image into equal-spaced rectangular blocks, as in Fig. 2. The regularization parameters ’s are chosen to be identical; i.e., . We initialize the algorithm by using a -uniform i.i.d. generated . We stop the algorithm when the relative change of the objective function is below or when the number of iterations exceeds .
The parameter settings of the benchmarked algorithms basically follow the recommended settings in [44, 12]. In addition, for matrix factorization and dictionary-based regression methods, we fix the target rank as . We use VCA to initialize the matrix factorization algorithms, and we stop the algorithms by the same stopping rule as that for GLORIA. Also, the NNM method is implemented by applying the APG method in [45] to problem (7).
The performance measures employed for evaluating the recovery performance are the peak SNR (PSNR), spectral angle mapper (SAM), Erreur Relative Globale Adimensionnelle de Sythes̀e (ERGAS) and universal image quality index (UIQI). They have been extensively used in the HSR literature, and we refer the reader to [1] for their definitions.
5.1 Semi-Real Data Experiments
First, we consider semi-real data experiments. The experiments were based on the widely-used Wald’s protocol [46], where we take a real HS image as the ground-truth SR image and use it to generate the observed MS and HS images, and , through the model (1). We consider the following real HS datasets.
Hyperspec-VNIR-C Chikusei: This dataset was acquired by the Headwall Hyperspec-VNIR-C imaging sensor [47]. It covers 128 spectral bands whose wavelength range is from 363nm to 1,018nm. We take a 480480 subimage from this dataset as our SR image. 2. 2.
AVIRIS Indian Pines: This dataset was captured by the Airborne Visible/Infrared Imaging Spectrometer (AVIRIS) HS sensor [48]. The wavelength range is from 400nm to 2,500nm. It has 178 spectral bands after dropping bands that are corrupted by water absorption. In the experiment, a 120120 subimage is used. 3. 3.
HYDICE Washington DC Mall: This dataset was taken by the Hyperspectral Digital Imagery Collection Experiment (HYDICE) HS sensor [49]. We take a subimage of this dataset, which has 240240 pixels and 191 clean spectral bands. The wavelength range is from 400nm to 2,500nm. 4. 4.
ROSIS University of Pavia: This dataset was measured by the Reflective Optics System Imaging Spectrometer (ROSIS) HS sensor. This dataset has 103 spectral bands whose wavelength range is from 430nm to 850nm. We take a 240240 subimage from this dataset as the SR image. 5. 5.
AVIRIS Moffett Field: This dataset, recorded by AVIRIS HS sensor, has 187 uncorrupted spectral bands. The wavelength range is from 400nm to 2,500nm. We take a 240240 subimage from this dataset.
These five images have been displayed in Fig. 3. The settings with the spectral and spatial measurement response matrices and should also be described. The matrix is chosen such that it is equivalent to the spectral response of either the Landsat 4 TM sensor [50] (6 bands, with spectral coverage from 400nm to 2,500nm) or the IKONOS sensor [51] (4 bands, with spectral coverage from 450nm to 900nm). We choose the Landsat 4 TM sensor response for Indian Pines, Washington DC Mall and Moffett Field, and the IKONOS sensor for Chikusei and University of Pavia; such choosing is to match the spectral coverage of the HS images. As discussed previously, corresponds to the process of spatial blurring and downsampling. The blurring function is a 1111 Gaussian point spread function, with variance . The downsampling is done every pixels, both horizontally and vertically. Furthermore, the noise terms and are randomly generated following an i.i.d. mean-zero Gaussian distribution. We fix the SNRs at .
Table 2 summarizes some of the settings with the experiments. There, we also show the settings with the patch number of GLORIA. The regularization parameter of GLORIA is fixed as . We ran independent trials in each image. The obtained performance is shown in Table 3, where, for each performance measure, we use blue, brown and red boldfaced letters to mark the best, second best and third best algorithms. To give the reader an additional reference on the performance, we show the SAM maps of the various algorithms in Figs. 5–8; note that the SAM maps shown are from one realization.
From Table 3 we see that, except for runtimes, GLORIA ranks best or second best in all of the performance measures. From Figs. 5–8, we also see that GLORIA yields good results compared to the other algorithms—and this is particularly so for the Indian Pine image in Fig. 5. In fact, we observe that even the NNM method, which is the baseline low-rank matrix estimation method and can be regarded as the precursor of our global-local low-rank pursuit, works reasonably. The above reported results suggest that the exploitation of low-rank spectral-spatial data structures in HSR is a working idea. We speculate that the good performance of GLORIA compared to the other algorithms is because GLORIA exploits the local low-rank data structure, which may provide better robustness to the EV effects. We will use synthetic experiments to examine the EV effects in the next subsection.
We should also discuss the runtime performance. The best algorithms are GSA and GLP, which do not perform very well in recovery performance. Let us compare GLORIA to the representative CNMF and HySure methods. GLORIA runs faster than CNMF and HySure for the cases of Chikusei and University of Pavia, and slower for the cases of Washington DC Mall and Moffett Field. On this issue, we should note that GLORIA deals with optimization variables, e.g., variables in Chikusei. In comparison, CNMF and HySure require and optimization variables, resp., which amount to and variables, resp., in Chikusei. In terms of runtime per variable, GLORIA is considered efficient.
5.2 Synthetic Data Experiments
Second, we consider synthetic data experiments. The way we prepare the experiment is similar to the one in [9, 11], with the difference that EV is also involved. The procedure is described as follows. We use the local-patch-wise, and EV-present, linear spectral mixture model (8) to generate the SR image. The number of endmembers is . The generations of the local endmember matrix and abundance matrix will be described shortly. Each local patch is rectangular, but its horizontal and vertical lengths are, in each trial, random. Fig. 9(b) shows one such arrangement. We obtain such patches, and it is important to note that none of the algorithms, including GLORIA, has knowledge of such patch arrangement. In GLORIA we will apply the same equi-spaced rectangular segmentation as before, and there will be mismatches between the actual patches and the patches presumed by GLORIA. Doing so is to provide a more realistic simulation as, in reality, it is impossible to exactly know how EV changes in space.
The endmember matrix is chosen as a collection of the spectral signatures of five materials, specifically, Actinolite, Albite, Muscovite, Olivine and Topaz. To simulate the EV effect, for each patch and for each material we randomly pick one variation of that material from the U.S. geological survey (USGS) spectral library [52]. The abundance matrix is chosen as the abundance maps extracted from a real HS image, namely, the AVIRIS Cuprite dataset; the extraction is done by applying an HU method called SVMAX [53]. In each trial, we randomly cropped a 120120 submap from AVIRIS Cuprite; see the illustration in Fig. 9(a) where the submap is marked as a red rectangle. The abundance maps are then extracted from that submap.
Some other simulation settings are as follows. The spectral measurement response matrix corresponds to the spectral response of the Landsat 4 TM sensor. The regularization parameter of GLORIA is . We ran independent trials. Table 4 shows the results, where again the best three algorithms are marked in blue, brown and red. As can be seen, GLORIA generally gives the best HSR recovery performance. This suggests that GLORIA has the flexibility to accommodate the EV effect. Also, GLORIA works better when the patch size is smaller, or when the number of patches increases. Another observation is that at the lower SNR, i.e., dB, GLORIA works considerably better than the state-of-art methods.
Previously we mentioned that GLORIA, an inexact MM scheme using accelerated projected gradient, runs very fast. To give the reader some idea, we conduct the following runtime test. We benchmark GLORIA against the exact MM scheme and the inexact MM scheme via the nominal projected gradient (PG). The exact MM scheme solves each MM iteration exactly via the accelerated PG method (see Section 4.2). The inexact MM scheme via the nominal PG removes the extrapolation by setting in the inexact MM iteration (24). The previously described synthetic experiment is used to test the three MM schemes, and we consider SNR= dB and . The runtime results, shown in Table 5, indicate significant runtime advantages of GLORIA over the other two candidates.
5.3 Real Data Experiments
Finally, we test the algorithms on real data. The dataset is the one used in [54]. The HS image was acquired by the Hyperion HS sensor. It covers a spectral range from 400nm to 2,500nm, and has 89 spectral bands after removing 131 noisy bands. The MS image was captured by the MS sensor mounted on the Sentinel-2A satellite. It has 13 bands, and we adopt 4 bands whose central wavelengths are 490nm, 560nm, 665nm and 842nm, resp. Readers are referred to [54] for further details. After pre-processing such as co-registration and cropping, we obtain the HS-MS image pair. The image pair is illustrated in Fig. 10. The setting is .
We employ the algorithm in [9] to estimate the spectral and spatial measurement responses and . Empirically we found that, for the tested HS-MS pair, this algorithm happens to yield a poor estimate of . To mend this issue, we consider a second-stage estimation: we fix the blurring kernel to be a Gaussian kernel and estimate its standard deviation by the nonlinear least-squares , where denotes the Gaussian kernel with variance , corresponds to the 2-D convolution operation, and and are the first row of and the estimated , resp.
We follow the same simulation settings as those in semi-real experiments, except that we use for GLORIA. Figs. 12-12 illustrate the 5th and 20th bands of the original MS image and recovered images. From the figures, we note that GLP does not work well compared to the other algorithms. Moreover, if we zoom in the recovered images, we can see that the images recovered by HySure and LRSR have strip noise, while the images recovered by CNMF and NNM have pepper noise. In comparison, the images recovered by FUMI and GLORIA appear smoother.
6 Conclusion
In this paper we explored the route of low-rank matrix estimation for HSR. By positing a low-rank spectral-spatial data structure, both globally and locally, we built an algorithmic solution, called GLORIA, that exploits such structure for HSR. Our extensive numerical studies, which include semi-real data experiments on five different datasets, one synthetic experiment and one real experiment, show that exploiting global-local low-rank structure not only is a working idea but also provides satisfactory reconstruction results. Our global-local low-rank exploitation is made possible by customizing an efficient first-order strategy in large-scale structured optimization. We close this paper by naming a future direction. It would be interesting to study how low-rank matrix estimation would be useful in other hyperspectral problems such as multisource and multitemporal data fusion.
Appendix A
By definition, a constant is said to be a Lipschitz constant of on if for any . From (20) we have
[TABLE]
for any , where (25b) is due to the triangle inequality; (25c) is due to i) the inequality in which denotes the spectral norm of , and ii) the identity for positive semidefinite . The proof of Fact 2 is complete.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1[1] L. Loncan, L. B. De Almeida, J. M. Bioucas-Dias, X. Briottet, J. Chanussot, N. Dobigeon, S. Fabre, W. Liao, G. A. Licciardi, M. Simoes et al. , “Hyperspectral pansharpening: A review,” IEEE Geosci. Remote Sens. Mag. , vol. 3, no. 3, pp. 27–46, 2015.
- 2[2] N. Dobigeon, J.-Y. Tourneret, C. Richard, J. C. M. Bermudez, S. Mc Laughlin, and A. O. Hero, “Nonlinear unmixing of hyperspectral images: Models and algorithms,” IEEE Signal Process. Mag. , vol. 31, no. 1, pp. 82–94, 2013.
- 3[3] R. Heylen, M. Parente, and P. Gader, “A review of nonlinear hyperspectral unmixing methods,” IEEE J. Sel. Topics Appl. Earth Observ. Remote Sens. , vol. 7, no. 6, pp. 1844–1868, 2014.
- 4[4] N. Yokoya, T. Yairi, and A. Iwasaki, “Coupled nonnegative matrix factorization unmixing for hyperspectral and multispectral data fusion,” IEEE Trans. Geosci. Remote Sens. , vol. 50, no. 2, pp. 528–537, 2012.
- 5[5] E. Wycoff, T.-H. Chan, K. Jia, W.-K. Ma, and Y. Ma, “A non-negative sparse promoting algorithm for high resolution hyperspectral imaging,” in Proc. IEEE Int. Conf. Acoustics, Speech, Signal Process. (ICASSP) , 2013, pp. 1409–1413.
- 6[6] Q. Wei, J. Bioucas-Dias, N. Dobigeon, J.-Y. Tourneret, M. Chen, and S. Godsill, “Multiband image fusion based on spectral unmixing,” IEEE Trans. Geosci. Remote Sens. , vol. 54, no. 12, pp. 7236–7249, 2016.
- 7[7] R. Wu, H.-T. Wai, and W.-K. Ma, “Hybrid inexact BCD for coupled structured matrix factorization in hyperspectral super-resolution,” to appear in IEEE Trans. Signal Process. , 2020.
- 8[8] R. Dian, S. Li, L. Fang, and Q. Wei, “Multispectral and hyperspectral image fusion with spatial-spectral sparse representation,” Inf. Fusion , vol. 49, pp. 262–270, 2019.
