Contrastive Residual Energy Test-time Adaptation

Yewon Han; Seoyun Yang; Taesup Kim

arXiv:2505.19607·cs.LG·May 5, 2026

Contrastive Residual Energy Test-time Adaptation

Yewon Han, Seoyun Yang, Taesup Kim

PDF

TL;DR

CreTTA introduces a contrastive residual energy approach for scalable, reliable test-time adaptation that avoids costly sampling and overfitting, improving model calibration in real-world scenarios.

Contribution

It reformulates marginal distribution adaptation as learning a residual energy function with a contrastive objective, eliminating sampling and reducing overfitting.

Findings

01

Achieves scalable and well-calibrated adaptation in real-world settings.

02

Removes the need for costly sampling by canceling the partition function.

03

Prevents overfitting through adaptive gradient reweighting.

Abstract

Test-time adaptation (TTA) enhances model robustness by enabling adaptation to target distributions that differ from training distributions, improving real-world generalizability. However, most existing TTA approaches focus on adjusting the conditional distribution and therefore exhibit poor calibration, as they rely on uncertain predictions in the absence of labels. Energy-based TTA frameworks provide an alternative by modeling the marginal distribution of target data without depending on label predictions, but their reliance on costly sampling hinders scalability in real-world scenarios where decisions must be made without latency. In this work, we propose Contrastive Residual Energy Test-time Adaptation (CreTTA), a practical solution for reliable adaptation. We theoretically reformulate the marginal distribution adaptation as learning a residual energy function. This formulation…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.