RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image   Enhancement

Tatiana Gaintseva; Martin Benning; Gregory Slabaugh

arXiv:2404.01889·cs.CV·July 23, 2024·1 cites

RAVE: Residual Vector Embedding for CLIP-Guided Backlit Image Enhancement

Tatiana Gaintseva, Martin Benning, Gregory Slabaugh

PDF

Open Access 1 Repo

TL;DR

This paper introduces a novel residual vector embedding method for CLIP-guided backlit image enhancement, significantly reducing training time and improving image quality without artifacts, applicable in both supervised and unsupervised settings.

Contribution

The paper proposes a new residual vector approach that bypasses prompt tuning, enabling faster training and bias analysis in CLIP-guided image enhancement.

Findings

01

Residual vectors effectively guide image enhancement without prompt tuning.

02

Training time is significantly reduced with the residual vector method.

03

High-quality enhanced images are produced without artifacts.

Abstract

In this paper we propose a novel modification of Contrastive Language-Image Pre-Training (CLIP) guidance for the task of unsupervised backlit image enhancement. Our work builds on the state-of-the-art CLIP-LIT approach, which learns a prompt pair by constraining the text-image similarity between a prompt (negative/positive sample) and a corresponding image (backlit image/well-lit image) in the CLIP embedding space. Learned prompts then guide an image enhancement network. Based on the CLIP-LIT framework, we propose two novel methods for CLIP guidance. First, we show that instead of tuning prompts in the space of text embeddings, it is possible to directly tune their embeddings in the latent space without any loss in quality. This accelerates training and potentially enables the use of additional encoders that do not have a text encoder. Second, we propose a novel approach that does not…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

atmyre/rave
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Industrial Vision Systems and Defect Detection

MethodsContrastive Language-Image Pre-training