Harnessing Multi-resolution and Multi-scale Attention for Underwater   Image Restoration

Alik Pramanick; Arijit Sur; V. Vijaya Saradhi

arXiv:2408.09912·cs.CV·August 20, 2024

Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration

Alik Pramanick, Arijit Sur, V. Vijaya Saradhi

PDF

Open Access 1 Repo

TL;DR

This paper introduces Lit-Net, a lightweight multi-stage neural network that leverages multi-resolution and multi-scale attention mechanisms to effectively restore underwater images, improving color, detail, and spatial accuracy over existing methods.

Contribution

Lit-Net is a novel multi-stage network utilizing multi-resolution and multi-scale analysis with a new encoder block and color-specific loss for superior underwater image restoration.

Findings

01

Achieves 29.477 dB PSNR, 1.92% improvement over state-of-the-art.

02

Attains 0.851 SSIM, 2.87% higher than previous methods.

03

Demonstrates robustness and effectiveness on public datasets.

Abstract

Underwater imagery is often compromised by factors such as color distortion and low contrast, posing challenges for high-level vision tasks. Recent underwater image restoration (UIR) methods either analyze the input image at full resolution, resulting in spatial richness but contextual weakness, or progressively from high to low resolution, yielding reliable semantic information but reduced spatial accuracy. Here, we propose a lightweight multi-stage network called Lit-Net that focuses on multi-resolution and multi-scale image analysis for restoring underwater images while retaining original resolution during the first stage, refining features in the second, and focusing on reconstruction in the final stage. Our novel encoder block utilizes parallel $1 \times 1$ convolution layers to capture local information and speed up operations. Further, we incorporate a modified weighted color…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

alik033/lit-net
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsImage and Signal Denoising Methods · Image Enhancement Techniques · Advanced Image Processing Techniques

MethodsConvolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings