Improving Calibration in Test-Time Prompt Tuning for Vision-Language Models via Data-Free Flatness-Aware Prompt Pretraining

Hyeonseo Jang; Jaebyeong Jeon; Joong-Won Hwang; Kibok Lee

arXiv:2604.27715 · cs.CV · May 1, 2026

TL;DR

This paper proposes Flatness-aware Prompt Pretraining (FPP), a data-free method that improves the calibration and performance of test-time prompt tuning in vision-language models by initializing prompts in flatter regions of the loss landscape.

Contribution

Introduces FPP, a simple prompt-pretraining framework that improves both the calibration and the performance of TPT without extra data or additional computational cost.

Findings

01 FPP improves the calibration of vision-language models during TPT.

02 FPP enhances the performance of test-time prompt tuning.

03 FPP is data-free and adds no extra computational cost.
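Calibration, as referenced in the findings above, is commonly quantified with the Expected Calibration Error (ECE): predictions are grouped into confidence bins, and the gap between mean confidence and accuracy is averaged across bins, weighted by bin size. The sketch below is a generic illustration of that metric, not code from the paper. The listing does not state which calibration metric the authors report, so the choice of ECE, the function name, the bin count, and the toy inputs are all assumptions.

```python
import numpy as np

def expected_calibration_error(confidences, correct, n_bins=10):
    """Generic ECE sketch (assumed metric, not taken from the paper).

    confidences: predicted probability of the chosen class, one per sample.
    correct:     1 if the prediction was right, 0 otherwise.
    """
    confidences = np.asarray(confidences, dtype=float)
    correct = np.asarray(correct, dtype=float)
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    ece = 0.0
    for lo, hi in zip(edges[:-1], edges[1:]):
        in_bin = (confidences > lo) & (confidences <= hi)
        if in_bin.any():
            # Gap between accuracy and mean confidence inside this bin.
            gap = abs(correct[in_bin].mean() - confidences[in_bin].mean())
            # Weight the gap by the fraction of samples in the bin.
            ece += in_bin.mean() * gap
    return float(ece)

# Hypothetical toy data: an overconfident model and a well-calibrated one.
ece_overconfident = expected_calibration_error([0.95] * 4, [1, 1, 0, 0])
ece_calibrated = expected_calibration_error([0.75] * 4, [1, 1, 1, 0])
print(ece_overconfident, ece_calibrated)
```

A lower value means confidence tracks accuracy more closely. In the toy data, the first model claims 95% confidence while being right half the time, so its ECE is large; the second is right 75% of the time at 75% confidence, so its ECE is zero.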

Abstract

Test-time prompt tuning (TPT) has emerged as a promising technique for enhancing the adaptability of vision-language models by optimizing textual prompts using unlabeled test data. However, prior studies have observed that TPT often produces poorly calibrated models, raising concerns about the reliability of their predictions. Recent works address this issue by incorporating additional regularization terms that constrain model outputs, which improve calibration but often degrade performance. In this work, we reveal that these regularization strategies implicitly encourage optimization toward flatter minima, and that the sharpness of the loss landscape around adapted prompts is a key factor governing calibration quality. Motivated by this observation, we introduce Flatness-aware Prompt Pretraining (FPP), a simple yet effective pretraining framework for TPT that initializes prompts within…
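To make the notion of "sharpness of the loss landscape" concrete, the following sketch estimates how much a loss rises when parameters are nudged by a small random perturbation of fixed norm. This is one common proxy for sharpness, shown on toy quadratic losses. It is not the paper's definition or implementation, and the function name, `radius`, `n_samples`, and both toy losses are hypothetical.

```python
import numpy as np

def sharpness(loss_fn, params, radius=0.05, n_samples=64, seed=0):
    """Average loss increase over random perturbations with L2 norm `radius`.

    A generic sharpness proxy (assumption), not the measure used in the paper.
    """
    rng = np.random.default_rng(seed)
    params = np.asarray(params, dtype=float)
    base = loss_fn(params)
    increases = []
    for _ in range(n_samples):
        direction = rng.standard_normal(params.shape)
        # Rescale so every perturbation lies on a sphere of the given radius.
        direction *= radius / np.linalg.norm(direction)
        increases.append(loss_fn(params + direction) - base)
    return float(np.mean(increases))

# Hypothetical toy losses sharing a minimum at the origin.
flat_loss = lambda p: 0.5 * np.sum(p ** 2)    # low curvature
sharp_loss = lambda p: 50.0 * np.sum(p ** 2)  # high curvature

minimum = np.zeros(4)
flat_value = sharpness(flat_loss, minimum)
sharp_value = sharpness(sharp_loss, minimum)
print(flat_value, sharp_value)
```

Both toy losses reach the same minimum value, but the high-curvature one rises far more under the same perturbation. In the abstract's terms, a prompt sitting in the flatter region would be the one whose loss changes little when the prompt is slightly perturbed.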

Code & Models

Repositories

YonseiML/fpp (GitHub)