RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement
Tatiana Gaintseva, Martin Benning, Gregory Slabaugh

TL;DR
This paper introduces a novel residual vector embedding method for CLIP-guided backlit image enhancement, significantly reducing training time and improving image quality without artifacts, applicable in both supervised and unsupervised settings.
Contribution
The paper proposes a new residual vector approach that bypasses prompt tuning, enabling faster training and bias analysis in CLIP-guided image enhancement.
Findings
Residual vectors effectively guide image enhancement without prompt tuning.
Training time is significantly reduced with the residual vector method.
High-quality enhanced images are produced without artifacts.
Abstract
In this paper we propose a novel modification of Contrastive Language-Image Pre-Training (CLIP) guidance for the task of unsupervised backlit image enhancement. Our work builds on the state-of-the-art CLIP-LIT approach, which learns a prompt pair by constraining the text-image similarity between a prompt (negative/positive sample) and a corresponding image (backlit image/well-lit image) in the CLIP embedding space. Learned prompts then guide an image enhancement network. Based on the CLIP-LIT framework, we propose two novel methods for CLIP guidance. First, we show that instead of tuning prompts in the space of text embeddings, it is possible to directly tune their embeddings in the latent space without any loss in quality. This accelerates training and potentially enables the use of additional encoders that do not have a text encoder. Second, we propose a novel approach that does not…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection
MethodsContrastive Language-Image Pre-training
