Prompt Distribution Learning

Yuning Lu; Jianzhuang Liu; Yonggang Zhang; Yajing Liu; Xinmei Tian

arXiv:2205.03340 · cs.CV · May 9, 2022

TL;DR

This paper introduces prompt distribution learning, a method that models the output embeddings of prompts with a Gaussian distribution to adapt a pre-trained vision-language model to downstream recognition tasks, especially when only a few samples per category are available.

Contribution

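The paper proposes learning a distribution over the output embeddings of prompts rather than their input embeddings. This makes it possible to model the prompts with a Gaussian distribution and to derive a surrogate loss for efficient training, which improves adaptation from limited data.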

Findings

01. Outperforms existing methods on 12 datasets

02. Achieves a 9.1% relative improvement in the average result with one sample per category

03. Models the diversity of prompts with a Gaussian distribution

Abstract

We present prompt distribution learning for effectively adapting a pre-trained vision-language model to address downstream recognition tasks. Our method not only learns low-bias prompts from a few samples but also captures the distribution of diverse prompts to handle the varying visual representations. In this way, we provide high-quality task-related content for facilitating recognition. This prompt distribution learning is realized by an efficient approach that learns the output embeddings of prompts instead of the input embeddings. Thus, we can employ a Gaussian distribution to model them effectively and derive a surrogate loss for efficient training. Extensive experiments on 12 datasets demonstrate that our method consistently and significantly outperforms existing methods. For example, with 1 sample per category, it relatively improves the average result by 9.1% compared to…
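The idea of modeling prompt output embeddings with a Gaussian can be illustrated with a minimal NumPy sketch. It is not the authors' implementation: it assumes the output embeddings of K prompts per class are already available as an array (here any array of shape (C, K, D) stands in for text-encoder outputs), and it replaces the paper's surrogate loss with plain Monte Carlo averaging over sampled classifier weights. The function names, the `eps` regularizer, and the `temperature` value are hypothetical choices made for illustration.

```python
import numpy as np


def fit_class_gaussians(prompt_embeddings, eps=1e-4):
    """Fit one Gaussian per class over prompt output embeddings.

    prompt_embeddings: array of shape (C, K, D) holding K prompt
    embeddings of dimension D for each of C classes (assumed input).
    Returns per-class means (C, D) and covariances (C, D, D).
    """
    num_prompts = prompt_embeddings.shape[1]
    dim = prompt_embeddings.shape[2]
    mean = prompt_embeddings.mean(axis=1)
    centered = prompt_embeddings - mean[:, None, :]
    # Unbiased sample covariance for each class.
    cov = np.einsum("ckd,cke->cde", centered, centered) / max(num_prompts - 1, 1)
    # Small diagonal term (hypothetical choice) keeps the covariance well conditioned.
    cov = cov + eps * np.eye(dim)
    return mean, cov


def predict_proba(image_feature, mean, cov, num_samples=64, temperature=0.1, rng=None):
    """Average softmax predictions over classifier weights drawn from the Gaussians.

    This Monte Carlo estimate stands in for the paper's surrogate loss,
    which is not reproduced here.
    """
    rng = np.random.default_rng() if rng is None else rng
    num_classes = mean.shape[0]
    # samples has shape (num_samples, C, D): one weight vector per class per draw.
    samples = np.stack(
        [rng.multivariate_normal(mean[c], cov[c], size=num_samples)
         for c in range(num_classes)],
        axis=1,
    )
    logits = samples @ image_feature / temperature  # (num_samples, C)
    logits = logits - logits.max(axis=1, keepdims=True)  # numerical stability
    probs = np.exp(logits)
    probs = probs / probs.sum(axis=1, keepdims=True)
    return probs.mean(axis=0)
```

Calling `fit_class_gaussians` once on the prompt embeddings and then `predict_proba` for each image feature gives class probabilities that reflect the spread of the prompts rather than a single prompt per class. Sampling is the simplest way to show the effect; the abstract indicates that the paper instead derives a surrogate loss so that training stays efficient.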

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning · Multimodal Machine Learning Applications · Human Pose and Action Recognition