Retinex Meets Language: A Physics-Semantics-Guided Underwater Image Enhancement Network

Shixuan Xu; Yabo Liu; Chao Huang; Junyu Dong; Xinghui Dong

arXiv:2603.07076·cs.CV·April 23, 2026

Retinex Meets Language: A Physics-Semantics-Guided Underwater Image Enhancement Network

Shixuan Xu, Yabo Liu, Chao Huang, Junyu Dong, Xinghui Dong

PDF

TL;DR

This paper introduces PSG-UIENet, a novel underwater image enhancement network that combines physics-based illumination correction with language-informed semantic guidance, utilizing a new multimodal dataset and achieving superior results.

Contribution

The study presents the first integration of textual guidance and multimodal datasets into underwater image enhancement, improving perceptual quality and semantic consistency.

Findings

01

PSG-UIENet outperforms fifteen state-of-the-art methods on multiple datasets.

02

Constructed a large-scale image-text UIE dataset, LUIQD-TD, with 6,418 triplets.

03

Designed an Image-Text Semantic Similarity loss to enhance semantic consistency.

Abstract

Underwater images often suffer from severe degradation caused by light absorption and scattering, leading to color distortion, low contrast and reduced visibility. Existing Underwater Image Enhancement (UIE) methods can be divided into two categories, i.e., prior-based and learning-based methods. The former rely on rigid physical assumptions that limit the adaptability, while the latter often face data scarcity and weak generalization. To address these issues, we propose a Physics-Semantics-Guided Underwater Image Enhancement Network (PSG-UIENet), which couples the Retinex-grounded illumination correction with the language-informed guidance. This network comprises a Prior-Free Illumination Estimator and a Semantics-Guided Image Restorer. In particular, the restorer leverages the textual descriptions generated by the Contrastive Language-Image Pre-training (CLIP) model to inject…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.