Harnessing Multi-resolution and Multi-scale Attention for Underwater Image Restoration
Alik Pramanick, Arijit Sur, V. Vijaya Saradhi

TL;DR
This paper introduces Lit-Net, a lightweight multi-stage neural network that leverages multi-resolution and multi-scale attention mechanisms to effectively restore underwater images, improving color, detail, and spatial accuracy over existing methods.
Contribution
Lit-Net is a novel multi-stage network utilizing multi-resolution and multi-scale analysis with a new encoder block and color-specific loss for superior underwater image restoration.
Findings
Achieves 29.477 dB PSNR, 1.92% improvement over state-of-the-art.
Attains 0.851 SSIM, 2.87% higher than previous methods.
Demonstrates robustness and effectiveness on public datasets.
Abstract
Underwater imagery is often compromised by factors such as color distortion and low contrast, posing challenges for high-level vision tasks. Recent underwater image restoration (UIR) methods either analyze the input image at full resolution, resulting in spatial richness but contextual weakness, or progressively from high to low resolution, yielding reliable semantic information but reduced spatial accuracy. Here, we propose a lightweight multi-stage network called Lit-Net that focuses on multi-resolution and multi-scale image analysis for restoring underwater images while retaining original resolution during the first stage, refining features in the second, and focusing on reconstruction in the final stage. Our novel encoder block utilizes parallel convolution layers to capture local information and speed up operations. Further, we incorporate a modified weighted color…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage and Signal Denoising Methods · Image Enhancement Techniques · Advanced Image Processing Techniques
MethodsConvolution · SPEED: Separable Pyramidal Pooling EncodEr-Decoder for Real-Time Monocular Depth Estimation on Low-Resource Settings
