CLIP-Guided Unsupervised Semantic-Aware Exposure Correction
Puzhen Wu, Han Weng, Quan Zheng, Yi Zhan, Hewei Wang, Yiming Li, Jiahui Han, Rui Xu

TL;DR
This paper introduces an unsupervised, semantic-aware exposure correction method that leverages CLIP and FastSAM models to improve image quality without manual labels, effectively restoring details and colors in real-world images.
Contribution
It proposes a novel unsupervised exposure correction network integrating semantic information from FastSAM and CLIP, with a pseudo-ground truth generator guided by CLIP for automatic correction.
Findings
Outperforms state-of-the-art unsupervised methods in quantitative metrics.
Effectively restores details and colors in real-world exposure images.
Maintains semantic consistency during correction.
Abstract
Improper exposure often leads to severe loss of details, color distortion, and reduced contrast. Exposure correction still faces two critical challenges: (1) the ignorance of object-wise regional semantic information causes the color shift artifacts; (2) real-world exposure images generally have no ground-truth labels, and its labeling entails massive manual editing. To tackle the challenges, we propose a new unsupervised semantic-aware exposure correction network. It contains an adaptive semantic-aware fusion module, which effectively fuses the semantic information extracted from a pre-trained Fast Segment Anything Model into a shared image feature space. Then the fused features are used by our multi-scale residual spatial mamba group to restore the details and adjust the exposure. To avoid manual editing, we propose a pseudo-ground truth generator guided by CLIP, which is fine-tuned…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsImage Enhancement Techniques · Visual Attention and Saliency Detection · Generative Adversarial Networks and Image Synthesis
