Prompt-Free and Efficient SAM2 Adaptation for Biomedical Semantic Segmentation via Dual Adapters
Hinako Mitsuoka, Kazuhiro Hotta

TL;DR
This paper introduces a prompt-free, efficient adaptation framework for biomedical segmentation using SAM2, combining dual adapters and a convolutional positional encoding generator to improve accuracy and reduce computational costs.
Contribution
It presents a novel prompt-free, parameter-efficient fine-tuning method with dual adapters and a positional encoding generator for biomedical segmentation, outperforming existing baselines.
Findings
Improved segmentation accuracy by up to 19.66% over vanilla SAM2.
Reduced computational costs by approximately 87%.
Effective handling of arbitrary aspect ratios with convolutional positional encoding.
Abstract
Segment Anything Model 2 (SAM2) demonstrated impressive zero-shot capabilities on natural images but faces challenges in biomedical segmentation due to significant domain shifts and prompt dependency. To address these limitations, we propose a prompt-free, parameter-efficient fine-tuning framework designed for multi-class segmentation on variable-sized inputs. We introduce a convolutional Positional Encoding Generator to adapt effectively to arbitrary aspect ratios and present a dual-adapter strategy: High-Performance Adapter utilizing deformable convolutions for precise boundary modeling and Lightweight Adapter employing structural re-parameterization to minimize inference latency. Experiments on ISBI 2012, Kvasir-SEG, Synapse, and ACDC datasets demonstrate that our approach significantly outperforms strong adaptation baselines. Specifically, our method improved segmentation accuracy…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
