Textual and Visual Guided Task Adaptation for Source-Free Cross-Domain Few-Shot Segmentation

Jianming Liu; Wenlong Qiu; Haitao Wei

arXiv:2508.05213·cs.CV·August 8, 2025

TL;DR

This paper introduces a source-free cross-domain few-shot segmentation (CD-FSS) method that uses textual and visual information to adapt a model to a new target domain without access to source data, improving accuracy on four cross-domain datasets.

Contribution

The work proposes a novel source-free CD-FSS approach leveraging multi-modal alignment and task-specific adapters, advancing privacy-preserving domain adaptation in segmentation.
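To make the adapter idea concrete, here is a minimal sketch, not the authors' TSAA. It assumes a frozen two-dimensional "backbone" and a frozen linear head (both hypothetical stand-ins with made-up weights) and trains only a zero-initialized residual adapter on a handful of labeled target samples, so no source data is touched. All names (`backbone`, `predict`, `adapt`, `SUPPORT`) are illustrative.

```python
import math

# Hypothetical frozen weights standing in for a pretrained backbone and head.
BACKBONE_W = [[1.0, -0.5], [0.3, 0.8]]
HEAD_W = [1.0, -1.0]

# Toy target-domain support set: (input, foreground label).
SUPPORT = [([1.0, 0.0], 0), ([0.0, 1.0], 1), ([1.0, 1.0], 1), ([0.5, 0.0], 0)]


def backbone(x):
    """Frozen feature extractor: a fixed linear map, never updated."""
    return [sum(w * v for w, v in zip(row, x)) for row in BACKBONE_W]


def predict(feat, adapter):
    """Residual adapter (feat + A @ feat), then the frozen head and a sigmoid."""
    adapted = [f + sum(a * g for a, g in zip(row, feat))
               for f, row in zip(feat, adapter)]
    logit = sum(h * z for h, z in zip(HEAD_W, adapted))
    return 1.0 / (1.0 + math.exp(-logit))


def loss(support, adapter):
    """Mean binary cross-entropy over the support set."""
    total = 0.0
    for x, y in support:
        p = predict(backbone(x), adapter)
        total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
    return total / len(support)


def adapt(support, steps=200, lr=0.5):
    """Gradient descent on the adapter only; backbone and head stay frozen."""
    dim = len(HEAD_W)
    # Zero init: the adapted model starts out identical to the frozen one.
    adapter = [[0.0] * dim for _ in range(dim)]
    feats = [(backbone(x), y) for x, y in support]
    for _ in range(steps):
        grad = [[0.0] * dim for _ in range(dim)]
        for feat, y in feats:
            err = predict(feat, adapter) - y  # d(BCE)/d(logit)
            for i in range(dim):
                for j in range(dim):
                    # d(logit)/d(A[i][j]) = HEAD_W[i] * feat[j]
                    grad[i][j] += err * HEAD_W[i] * feat[j] / len(feats)
        for i in range(dim):
            for j in range(dim):
                adapter[i][j] -= lr * grad[i][j]
    return adapter


zero_adapter = [[0.0, 0.0], [0.0, 0.0]]
trained_adapter = adapt(SUPPORT)
print("loss before:", loss(SUPPORT, zero_adapter))
print("loss after: ", loss(SUPPORT, trained_adapter))
```

The zero initialization is a common design choice for residual adapters: adaptation begins from the pretrained behavior and departs from it only as far as the target samples require.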

Findings

01

Reports accuracy improvements of 2.18% and 4.11% in the 1-shot and 5-shot settings, respectively.

02

Outperforms state-of-the-art methods on four cross-domain datasets.

03

Utilizes CLIP-based textual priors for effective cross-modal adaptation.

Abstract

Few-Shot Segmentation (FSS) aims to segment new objects efficiently from only a few labeled samples. However, its performance degrades significantly when domain discrepancies exist between training and deployment. Cross-Domain Few-Shot Segmentation (CD-FSS) has been proposed to mitigate this degradation. Current CD-FSS methods primarily seek to develop segmentation models on a source domain that generalize across domains. However, driven by growing concerns over data privacy and the need to minimize data transfer and training costs, developing source-free CD-FSS approaches has become essential. In this work, we propose a source-free CD-FSS method that leverages both textual and visual information to facilitate target-domain task adaptation without requiring source-domain data. Specifically, we first append Task-Specific Attention Adapters (TSAA) to the…
