Unveiling the Underwater World: CLIP Perception Model-Guided Underwater Image Enhancement

Jiangzhong Cao; Zekai Zeng; Xu Zhang; Huan Zhang; Chunling Fan; Gangyi Jiang; Weisi Lin

arXiv:2507.06234·cs.CV·July 10, 2025

Unveiling the Underwater World: CLIP Perception Model-Guided Underwater Image Enhancement

Jiangzhong Cao, Zekai Zeng, Xu Zhang, Huan Zhang, Chunling Fan, Gangyi Jiang, Weisi Lin

PDF

TL;DR

This paper introduces a novel underwater image enhancement method guided by a CLIP perception model, improving perceptual quality by aligning enhancement with human visual perception and employing curriculum contrastive regularization.

Contribution

The work integrates CLIP-based perception modeling into underwater image enhancement, introducing a perception loss and regularization that better align with human perception and improve image quality.

Findings

01

Outperforms state-of-the-art methods in visual quality

02

Enhances generalization ability across diverse underwater images

03

Effectively balances enhancement levels to avoid under- or over-enhancement

Abstract

High-quality underwater images are essential for both machine vision tasks and viewers with their aesthetic appeal.However, the quality of underwater images is severely affected by light absorption and scattering. Deep learning-based methods for Underwater Image Enhancement (UIE) have achieved good performance. However, these methods often overlook considering human perception and lack sufficient constraints within the solution space. Consequently, the enhanced images often suffer from diminished perceptual quality or poor content restoration.To address these issues, we propose a UIE method with a Contrastive Language-Image Pre-Training (CLIP) perception loss module and curriculum contrastive regularization. Above all, to develop a perception model for underwater images that more aligns with human visual perception, the visual semantic feature extraction capability of the CLIP model is…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.