DynaPrompt: Dynamic Test-Time Prompt Tuning

Zehao Xiao; Shilin Yan; Jack Hong; Jiayin Cai; Xiaolong Jiang; Yao Hu,; Jiayi Shen; Qi Wang; Cees G. M. Snoek

arXiv:2501.16404·cs.LG·January 29, 2025

DynaPrompt: Dynamic Test-Time Prompt Tuning

Zehao Xiao, Shilin Yan, Jack Hong, Jiayin Cai, Xiaolong Jiang, Yao Hu,, Jiayi Shen, Qi Wang, Cees G. M. Snoek

PDF

Open Access 3 Reviews

TL;DR

DynaPrompt is a dynamic test-time prompt tuning method that adaptively selects and updates prompts based on test sample relevance, improving zero-shot generalization of vision-language models while mitigating error accumulation.

Contribution

It introduces a novel dynamic prompt selection and appending strategy that leverages test data distribution information during inference.

Findings

01

Effective on fourteen datasets

02

Reduces prompt collapse and error accumulation

03

Enhances zero-shot generalization

Abstract

Test-time prompt tuning enhances zero-shot generalization of vision-language models but tends to ignore the relatedness among test samples during inference. Online test-time prompt tuning provides a simple way to leverage the information in previous test samples, albeit with the risk of prompt collapse due to error accumulation. To enhance test-time prompt tuning, we propose DynaPrompt, short for dynamic test-time prompt tuning, exploiting relevant data distribution information while reducing error accumulation. Built on an online prompt buffer, DynaPrompt adaptively selects and optimizes the relevant prompts for each test sample during tuning. Specifically, we introduce a dynamic prompt selection strategy based on two metrics: prediction entropy and probability difference. For unseen test data information, we develop dynamic prompt appending, which allows the buffer to append new…

Peer Reviews

Decision·ICLR 2025 Poster

Reviewer 01Rating 6Confidence 4

Strengths

1. The paper is well-organized and easy to follow. 2. The proposed DynaPrompt effectively mitigates error accumulation, a prevalent challenge in online test-time tuning，leading to more stable performance across sequential test samples while exploiting beneficial information from prior online test samples. 3. Despite the increased time costs associated with larger prompt buffer sizes, the experimental outcomes confirm the effectiveness of the proposed method.

Weaknesses

1. In prompt learning, the initial prompts might affect the final performance. I wonder whether a similar situation can occur with the proposed method. The authors are encouraged to conduct related experiments. 2. Could the proposed method be extended to incorporate visual prompts, thereby evolving into a multimodal approach? Additionally, when integrated with MaPLe, is the method only applied to the textual branch?

Reviewer 02Rating 5Confidence 5

Strengths

1. The paper is well-structured and easy to follow, with the three technical components clearly and accessibly presented. 2. The proposed method is well-reasoned, employing dynamic prompt selection and updating mechanisms that are both effective and distinct from prior studies, which primarily focus on data manipulation. 3. The experiments are thorough, and the results convincingly demonstrate the effectiveness of the proposed method.

Weaknesses

1. The range of comparison methods could be expanded, as the paper overlooks one relevant comparison method [1]. 2. Both the prediction entropy metric and probability difference metric provide insights into model prediction confidence, though from different perspectives. It is unclear why entropy is specifically used to measure relevance while difference is used to maintain diversity. Why does the direct combination of these two types of prompts yield effective results? Would a two-step prompt s

Reviewer 03Rating 6Confidence 3

Strengths

1. This paper propose a dynamic prompting method that utilizes useful information from online test samples while mitigating the problem of error accumulation. 2. DynaPrompt improves the adaptive capability of the model by introducing a dynamic prompt selection strategy that adaptively selects and optimizes relevant prompts for each test sample based on two metrics: predictive entropy and probability difference. 3. During the dynamic prompt selection process, if no suitable prompts can be found,

Weaknesses

1. The computational complexity increases, and the authors' approach requires dynamic updating and selection of cues at each stage, whether it introduces more computational time. 2. The authors' approach demonstrates the advantages of dynamic prompting, however did the authors consider whether comparable performance could also be achieved if online testing was performed from scratch using a prompt length comparable to that of the final model.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsVLSI and Analog Circuit Testing · Engineering and Test Systems · Software Testing and Debugging Techniques