FedRSClip: Federated Learning for Remote Sensing Scene Classification   Using Vision-Language Models

Hui Lin; Chao Zhang; Danfeng Hong; Kexin Dong; and Congcong Wen

arXiv:2501.02461·cs.CV·January 7, 2025

FedRSClip: Federated Learning for Remote Sensing Scene Classification Using Vision-Language Models

Hui Lin, Chao Zhang, Danfeng Hong, Kexin Dong, and Congcong Wen

PDF

Open Access

TL;DR

FedRSCLIP introduces a federated learning framework for remote sensing image classification using CLIP, employing prompt learning and alignment constraints to address data heterogeneity and communication challenges.

Contribution

This work is the first to adapt Vision-Language Models for federated remote sensing classification with a novel dual-prompt mechanism and alignment constraints.

Findings

01

FedRSCLIP outperforms baseline methods in classification accuracy.

02

The dual-prompt mechanism effectively balances global and local model adaptation.

03

The proposed framework reduces communication costs while maintaining high performance.

Abstract

Remote sensing data is often distributed across multiple institutions, and due to privacy concerns and data-sharing restrictions, leveraging large-scale datasets in a centralized training framework is challenging. Federated learning offers a promising solution by enabling collaborative model training across distributed data sources without requiring data centralization. However, current Vision-Language Models (VLMs), which typically contain billions of parameters, pose significant communication challenges for traditional federated learning approaches based on model parameter updates, as they would incur substantial communication costs. In this paper, we propose FedRSCLIP, the first federated learning framework designed for remote sensing image classification based on a VLM, specifically CLIP. FedRSCLIP addresses the challenges of data heterogeneity and large-scale model transmission in…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsRemote-Sensing Image Classification

MethodsSparse Evolutionary Training · ALIGN · Contrastive Language-Image Pre-training