Technical report on label-informed logit redistribution for better domain generalization in low-shot classification with foundation models

Behraj Khan; Tahir Syed

arXiv:2501.17595 · cs.CV · September 26, 2025

TL;DR

This paper introduces a confidence misalignment penalty (CMP) for fine-tuning foundation models, significantly improving calibration in low-shot vision classification and domain generalization tasks.

Contribution

It proposes a novel penalty method that enhances confidence calibration by adjusting logit scores during fine-tuning of foundation models.

Findings

01. CMP improves Expected Calibration Error (ECE) by up to 9.72%.

02. The method outperforms existing prompt learning techniques.

03. Experiments on 12 vision and 5 domain datasets validate effectiveness.
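
For readers unfamiliar with the metric named in the first finding, here is a minimal sketch of Expected Calibration Error using the common equal-width binning estimate. The function name, the default of 10 bins, and the half-open binning convention are assumptions made for illustration; the page does not say which ECE variant or bin count the paper uses.

```python
def expected_calibration_error(confidences, correct, n_bins=10):
    """Equal-width-bin ECE estimate (illustrative, not the paper's code).

    confidences: top-class probabilities in [0, 1], one per sample.
    correct:     booleans, True where the top-class prediction was right.
    Returns the bin-weighted average of |accuracy - mean confidence|.
    """
    if len(confidences) != len(correct):
        raise ValueError("confidences and correct must have the same length")
    n = len(confidences)
    if n == 0:
        return 0.0

    # Assumed convention: bin i covers [i/n_bins, (i+1)/n_bins);
    # a confidence of exactly 1.0 is placed in the last bin.
    bins = [[] for _ in range(n_bins)]
    for conf, hit in zip(confidences, correct):
        idx = min(int(conf * n_bins), n_bins - 1)
        bins[idx].append((conf, 1.0 if hit else 0.0))

    ece = 0.0
    for members in bins:
        if not members:
            continue
        mean_conf = sum(c for c, _ in members) / len(members)
        accuracy = sum(h for _, h in members) / len(members)
        ece += (len(members) / n) * abs(accuracy - mean_conf)
    return ece
```

A lower value means predicted confidence tracks empirical accuracy more closely; a model that is always right with confidence 1.0 scores 0 under this estimate.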

Abstract

Confidence calibration is an emerging challenge in real-world decision systems based on foundation models when used for downstream vision classification tasks. For various reasons, logit scores on the CLIP head remain large irrespective of whether the image-language pairs reconcile. This is difficult to address in data space, given the few-shot regime. We propose a penalty incorporated into the loss objective that penalizes incorrect classifications whenever one is made during finetuning, by moving an amount of log-likelihood to the true class commensurate with the relative amplitudes of the two likelihoods. We refer to it as the confidence misalignment penalty (CMP). Extensive experiments on 12 vision datasets and 5 domain generalization datasets support the calibration performance of our method against the state of the art. CMP outperforms the benchmarked prompt learning…
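
The abstract describes the penalty only in words, so the following is a hedged sketch of one possible reading and not the authors' formulation. It assumes the penalty is zero for a correct prediction and otherwise equals the log-likelihood gap between the wrongly predicted class and the true class, added to cross-entropy with a hypothetical weight. The names `misalignment_penalty`, `penalized_loss`, and `weight` are invented for illustration.

```python
import math


def log_softmax(logits):
    """Numerically stable log-softmax over a list of logits."""
    m = max(logits)
    log_z = m + math.log(sum(math.exp(v - m) for v in logits))
    return [v - log_z for v in logits]


def misalignment_penalty(logits, true_class):
    """Hypothetical penalty: 0 if the prediction is correct, otherwise the
    log-likelihood gap between the predicted class and the true class.
    This is an assumed form, not the formula from the paper."""
    log_probs = log_softmax(logits)
    predicted = max(range(len(logits)), key=lambda k: logits[k])
    if predicted == true_class:
        return 0.0
    return log_probs[predicted] - log_probs[true_class]


def penalized_loss(logits, true_class, weight=1.0):
    """Cross-entropy plus the weighted hypothetical penalty (assumed combination)."""
    cross_entropy = -log_softmax(logits)[true_class]
    return cross_entropy + weight * misalignment_penalty(logits, true_class)
```

Under this assumed form, a confidently wrong prediction is penalized more than a marginally wrong one, which matches the abstract's description of a correction commensurate with the relative amplitudes of the two likelihoods. In practice the same idea would be written with a differentiable tensor library so that gradients flow during fine-tuning.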

Taxonomy

Topics: Domain Adaptation and Few-Shot Learning

Methods: Contrastive Language-Image Pre-training