Latte: Collaborative Test-Time Adaptation of Vision-Language Models in Federated Learning

Wenxuan Bao; Ruxi Deng; Ruizhong Qiu; Tianxin Wei; Hanghang Tong; Jingrui He

arXiv:2507.21494·cs.LG·July 30, 2025

Latte: Collaborative Test-Time Adaptation of Vision-Language Models in Federated Learning

Wenxuan Bao, Ruxi Deng, Ruizhong Qiu, Tianxin Wei, Hanghang Tong, Jingrui He

PDF

TL;DR

Latte is a federated learning framework that enables vision-language models to adapt to diverse data distributions by maintaining local and external memories, improving performance in decentralized test-time adaptation scenarios.

Contribution

Latte introduces a novel memory-based test-time adaptation framework for federated learning, allowing personalized and robust adaptation across clients with limited data.

Findings

01

Outperforms existing methods on domain adaptation benchmarks

02

Maintains high performance with minimal communication overhead

03

Effectively leverages client similarities for personalized adaptation

Abstract

Test-time adaptation with pre-trained vision-language models has gained increasing attention for addressing distribution shifts during testing. Among these approaches, memory-based algorithms stand out due to their training-free nature and ability to leverage historical test data. However, existing test-time adaptation methods are typically designed for a single domain with abundant data. In decentralized settings such as federated learning, applying these methods individually to each client suffers from limited test data, while directly sharing a single global memory via the server prevents proper personalization to each client's unique distribution. To address this, we propose Latte, a novel framework where each client maintains a local memory to store embeddings from its own historical test data and an external memory to store class prototypes from other relevant clients. During…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.