Take a Peek: Efficient Encoder Adaptation for Few-Shot Semantic Segmentation via LoRA
Pasquale De Marinis, Gennaro Vessio, Giovanna Castellano

TL;DR
The paper introduces TaP, a LoRA-based encoder adaptation method that significantly improves few-shot semantic segmentation performance across various datasets with minimal computational cost.
Contribution
TaP is a novel, model-agnostic approach that enhances encoder adaptability in FSS using Low-Rank Adaptation, enabling efficient and effective generalization to unseen classes.
Findings
Consistently improves segmentation accuracy across multiple benchmarks.
Effective in multi-class and cross-domain scenarios.
Achieves strong performance with low-rank adaptations.
Abstract
Few-shot semantic segmentation (FSS) aims to segment novel classes in query images using only a small annotated support set. While prior research has mainly focused on improving decoders, the encoder's limited ability to extract meaningful features for unseen classes remains a key bottleneck. In this work, we introduce \textit{Take a Peek} (TaP), a simple yet effective method that enhances encoder adaptability for both FSS and cross-domain FSS (CD-FSS). TaP leverages Low-Rank Adaptation (LoRA) to fine-tune the encoder on the support set with minimal computational overhead, enabling fast adaptation to novel classes while mitigating catastrophic forgetting. Our method is model-agnostic and can be seamlessly integrated into existing FSS pipelines. Extensive experiments across multiple benchmarks--including COCO , Pascal , and cross-domain datasets such as DeepGlobe, ISIC, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDomain Adaptation and Few-Shot Learning · Advanced Neural Network Applications · COVID-19 diagnosis using AI
