FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models
Hui Lin, Chao Zhang, Danfeng Hong, Kexin Dong, and Congcong Wen

TL;DR
FedRSCLIP introduces a federated learning framework for remote sensing image classification using CLIP, employing prompt learning and alignment constraints to address data heterogeneity and communication challenges.
Contribution
This work is the first to adapt Vision-Language Models for federated remote sensing classification with a novel dual-prompt mechanism and alignment constraints.
Findings
FedRSCLIP outperforms baseline methods in classification accuracy.
The dual-prompt mechanism effectively balances global and local model adaptation.
The proposed framework reduces communication costs while maintaining high performance.
Abstract
Remote sensing data is often distributed across multiple institutions, and due to privacy concerns and data-sharing restrictions, leveraging large-scale datasets in a centralized training framework is challenging. Federated learning offers a promising solution by enabling collaborative model training across distributed data sources without requiring data centralization. However, current Vision-Language Models (VLMs), which typically contain billions of parameters, pose significant communication challenges for traditional federated learning approaches based on model parameter updates, as they would incur substantial communication costs. In this paper, we propose FedRSCLIP, the first federated learning framework designed for remote sensing image classification based on a VLM, specifically CLIP. FedRSCLIP addresses the challenges of data heterogeneity and large-scale model transmission in…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote-Sensing Image Classification
MethodsSparse Evolutionary Training · ALIGN · Contrastive Language-Image Pre-training
