SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation
Wenxi Yue, Jing Zhang, Kun Hu, Qiuxia Wu, Zongyuan Ge, Yong Xia, Jiebo, Luo, Zhiyong Wang

TL;DR
SurgicalPart-SAM introduces a novel prompt-based approach that integrates instrument part knowledge with SAM for detailed and accurate surgical instrument segmentation, outperforming existing methods.
Contribution
It proposes a collaborative prompting and cross-modal encoding framework that explicitly models instrument parts and structures for improved segmentation.
Findings
Achieves state-of-the-art results on EndoVis datasets.
Requires minimal parameter tuning compared to prior methods.
Effectively captures both overall instrument structure and fine-grained details.
Abstract
The Segment Anything Model (SAM) exhibits promise in generic object segmentation and offers potential for various applications. Existing methods have applied SAM to surgical instrument segmentation (SIS) by tuning SAM-based frameworks with surgical data. However, they fall short in two crucial aspects: (1) Straightforward model tuning with instrument masks treats each instrument as a single entity, neglecting their complex structures and fine-grained details; and (2) Instrument category-based prompts are not flexible and informative enough to describe instrument structures. To address these problems, in this paper, we investigate text promptable SIS and propose SurgicalPart-SAM (SP-SAM), a novel SAM efficient-tuning approach that explicitly integrates instrument structure knowledge with SAM's generic knowledge, guided by expert knowledge on instrument part compositions. Specifically, we…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Imaging in Medicine · Surgical Simulation and Training
MethodsSegment Anything Model
