TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks

Yang Yu; Chen Xu; Kai Wang

arXiv:2408.01835·cs.CV·August 6, 2024

TS-SAM: Fine-Tuning Segment-Anything Model for Downstream Tasks

Yang Yu, Chen Xu, Kai Wang

PDF

Open Access 1 Repo

TL;DR

TS-SAM introduces a novel two-stream fine-tuning approach for the Segment-Anything Model, significantly enhancing its performance on downstream segmentation tasks by integrating feature fusion and multi-scale refinement.

Contribution

The paper proposes TS-SAM, a new fine-tuning framework with a lightweight convolutional side adapter and multi-scale modules, bridging the performance gap with domain-specific models.

Findings

01

TS-SAM outperforms recent SAM-Adapter and SSOM methods.

02

Achieves competitive results with state-of-the-art domain-specific models.

03

Validated on ten public datasets across three segmentation tasks.

Abstract

Adapter based fine-tuning has been studied for improving the performance of SAM on downstream tasks. However, there is still a significant performance gap between fine-tuned SAMs and domain-specific models. To reduce the gap, we propose Two-Stream SAM (TS-SAM). On the one hand, inspired by the side network in Parameter-Efficient Fine-Tuning (PEFT), we designed a lightweight Convolutional Side Adapter (CSA), which integrates the powerful features from SAM into side network training for comprehensive feature fusion. On the other hand, in line with the characteristics of segmentation tasks, we designed Multi-scale Refinement Module (MRM) and Feature Fusion Decoder (FFD) to keep both the detailed and semantic features. Extensive experiments on ten public datasets from three tasks demonstrate that TS-SAM not only significantly outperforms the recently proposed SAM-Adapter and SSOM, but…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

maoyangou147/ts-sam
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsHuman-Automation Interaction and Safety

MethodsAdapter · Segment Anything Model