Textual and Visual Guided Task Adaptation for Source-Free Cross-Domain Few-Shot Segmentation
Jianming Liu, Wenlong Qiu, Haitao Wei

TL;DR
This paper introduces a source-free cross-domain few-shot segmentation method that uses textual and visual information to adapt models to new domains without source data, improving accuracy across multiple datasets.
Contribution
The work proposes a novel source-free CD-FSS approach leveraging multi-modal alignment and task-specific adapters, advancing privacy-preserving domain adaptation in segmentation.
Findings
Achieves 2.18% and 4.11% accuracy improvements in 1-shot and 5-shot settings.
Outperforms state-of-the-art methods on four cross-domain datasets.
Utilizes CLIP-based textual priors for effective cross-modal adaptation.
Abstract
Few-Shot Segmentation(FSS) aims to efficient segmentation of new objects with few labeled samples. However, its performance significantly degrades when domain discrepancies exist between training and deployment. Cross-Domain Few-Shot Segmentation(CD-FSS) is proposed to mitigate such performance degradation. Current CD-FSS methods primarily sought to develop segmentation models on a source domain capable of cross-domain generalization. However, driven by escalating concerns over data privacy and the imperative to minimize data transfer and training expenses, the development of source-free CD-FSS approaches has become essential. In this work, we propose a source-free CD-FSS method that leverages both textual and visual information to facilitate target domain task adaptation without requiring source domain data. Specifically, we first append Task-Specific Attention Adapters (TSAA) to the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
