Natural Language Supervision for Low-light Image Enhancement
Jiahui Tang, Kaihua Zhou, Zhijian Luo, Yueen Hou

TL;DR
This paper introduces NaLSuper, a novel low-light image enhancement method using natural language supervision and cross-modal feature alignment, improving visual quality by leveraging textual descriptions and advanced attention mechanisms.
Contribution
The paper proposes a new framework that integrates natural language supervision with image enhancement, utilizing a textual guidance mechanism and information fusion attention to improve results.
Findings
NaLSuper outperforms existing LLIE methods in robustness and effectiveness.
The proposed TCM and IFA modules enhance feature extraction and fusion.
Extensive experiments validate the superiority of the approach.
Abstract
With the development of deep learning, numerous methods for low-light image enhancement (LLIE) have demonstrated remarkable performance. Mainstream LLIE methods typically learn an end-to-end mapping based on pairs of low-light and normal-light images. However, normal-light images under varying illumination conditions serve as reference images, making it difficult to define a ``perfect'' reference image This leads to the challenge of reconciling metric-oriented and visual-friendly results. Recently, many cross-modal studies have found that side information from other related modalities can guide visual representation learning. Based on this, we introduce a Natural Language Supervision (NLS) strategy, which learns feature maps from text corresponding to images, offering a general and flexible interface for describing an image under different illumination. However, image distributions…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
MethodsSoftmax · Attention Is All You Need
