Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot   Generalization of Vision-Language Models

Raza Imam; Hanan Gani; Muhammad Huzaifa; Karthik Nandakumar

arXiv:2407.15913·cs.CV·July 24, 2024

Test-Time Low Rank Adaptation via Confidence Maximization for Zero-Shot Generalization of Vision-Language Models

Raza Imam, Hanan Gani, Muhammad Huzaifa, Karthik Nandakumar

PDF

Open Access 1 Repo

TL;DR

This paper proposes TTL, a test-time low-rank adaptation method that fine-tunes attention weights in vision-language models to improve zero-shot generalization without modifying prompts or the backbone.

Contribution

Introducing TTL, a novel, parameter-efficient test-time adaptation method that maximizes prediction confidence by updating attention weights in large vision-language models.

Findings

01

TTL outperforms prompt tuning baselines in zero-shot tasks.

02

Significant improvements in average performance across diverse tasks.

03

Efficient adaptation with minimal trainable parameters.

Abstract

The conventional modus operandi for adapting pre-trained vision-language models (VLMs) during test-time involves tuning learnable prompts, ie, test-time prompt tuning. This paper introduces Test-Time Low-rank adaptation (TTL) as an alternative to prompt tuning for zero-shot generalization of large-scale VLMs. Taking inspiration from recent advancements in efficiently fine-tuning large language models, TTL offers a test-time parameter-efficient adaptation approach that updates the attention weights of the transformer encoder by maximizing prediction confidence. The self-supervised confidence maximization objective is specified using a weighted entropy loss that enforces consistency among predictions of augmented samples. TTL introduces only a small amount of trainable parameters for low-rank adapters in the model space while keeping the prompts and backbone frozen. Extensive experiments…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

razaimam45/ttl-test-time-low-rank-adaptation
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDomain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications

MethodsSoftmax · Attention Is All You Need