# Unsupervised and Unregistered Hyperspectral Image Super-Resolution with   Mutual Dirichlet-Net

**Authors:** Ying Qu, Hairong Qi, Chiman Kwan, Naoto Yokoya, Jocelyn, Chanussot

arXiv: 1904.12175 · 2021-08-03

## TL;DR

This paper introduces u^2-MDN, an unsupervised method for hyperspectral image super-resolution that effectively fuses unregistered low-resolution hyperspectral images with high-resolution multispectral images by maximizing mutual information.

## Contribution

It proposes a novel unsupervised framework that does not require registration between modalities, using mutual information to link unregistered images for improved super-resolution.

## Key findings

- Outperforms state-of-the-art methods on benchmark datasets.
- Effectively captures spatial and spectral correlations without registration.
- Reduces spectral distortion while enhancing spatial resolution.

## Abstract

Hyperspectral images (HSI) provide rich spectral information that contributed to the successful performance improvement of numerous computer vision tasks. However, it can only be achieved at the expense of images' spatial resolution. Hyperspectral image super-resolution (HSI-SR) addresses this problem by fusing low resolution (LR) HSI with multispectral image (MSI) carrying much higher spatial resolution (HR). All existing HSI-SR approaches require the LR HSI and HR MSI to be well registered and the reconstruction accuracy of the HR HSI relies heavily on the registration accuracy of different modalities. This paper exploits the uncharted problem domain of HSI-SR without the requirement of multi-modality registration. Given the unregistered LR HSI and HR MSI with overlapped regions, we design a unique unsupervised learning structure linking the two unregistered modalities by projecting them into the same statistical space through the same encoder. The mutual information (MI) is further adopted to capture the non-linear statistical dependencies between the representations from two modalities (carrying spatial information) and their raw inputs. By maximizing the MI, spatial correlations between different modalities can be well characterized to further reduce the spectral distortion. A collaborative $l_{2,1}$ norm is employed as the reconstruction error instead of the more common $l_2$ norm, so that individual pixels can be recovered as accurately as possible. With this design, the network allows to extract correlated spectral and spatial information from unregistered images that better preserves the spectral information. The proposed method is referred to as unregistered and unsupervised mutual Dirichlet Net ($u^2$-MDN). Extensive experimental results using benchmark HSI datasets demonstrate the superior performance of $u^2$-MDN as compared to the state-of-the-art.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.12175/full.md

## Figures

128 figures with captions in the complete paper: https://tomesphere.com/paper/1904.12175/full.md

## References

79 references — full list in the complete paper: https://tomesphere.com/paper/1904.12175/full.md

---
Source: https://tomesphere.com/paper/1904.12175