MVP: Meta Visual Prompt Tuning for Few-Shot Remote Sensing Image Scene Classification
Junjie Zhu, Yiying Li, Chunping Qiu, Ke Yang, Naiyang Guan, Xiaodong, Yi

TL;DR
This paper introduces MVP, a parameter-efficient meta visual prompt tuning method for few-shot remote sensing image scene classification, combining prompt tuning with meta-learning and a novel data augmentation to improve performance and reduce overfitting.
Contribution
The paper proposes MVP, integrating VPT into meta-learning for remote sensing, and introduces a patch embedding recombination augmentation, addressing overfitting and efficiency issues.
Findings
MVP outperforms existing methods on FS-RSSC benchmark.
Effective in various few-shot and cross-domain scenarios.
Reduces overfitting and storage compared to full fine-tuning.
Abstract
Vision Transformer (ViT) models have recently emerged as powerful and versatile models for various visual tasks. Recently, a work called PMF has achieved promising results in few-shot image classification by utilizing pre-trained vision transformer models. However, PMF employs full fine-tuning for learning the downstream tasks, leading to significant overfitting and storage issues, especially in the remote sensing domain. In order to tackle these issues, we turn to the recently proposed parameter-efficient tuning methods, such as VPT, which updates only the newly added prompt parameters while keeping the pre-trained backbone frozen. Inspired by VPT, we propose the Meta Visual Prompt Tuning (MVP) method. Specifically, we integrate the VPT method into the meta-learning framework and tailor it to the remote sensing domain, resulting in an efficient framework for Few-Shot Remote Sensing…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Remote-Sensing Image Classification · interferon and immune responses
MethodsAttention Is All You Need · Softmax · Dense Connections · Absolute Position Encodings · Position-Wise Feed-Forward Layer · Linear Layer · Residual Connection · Adam · Multi-Head Attention · Layer Normalization
