SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical   Instrument Segmentation

Wenxi Yue; Jing Zhang; Kun Hu; Qiuxia Wu; Zongyuan Ge; Yong Xia; Jiebo; Luo; Zhiyong Wang

arXiv:2312.14481·cs.CV·March 26, 2024·2 cites

SurgicalPart-SAM: Part-to-Whole Collaborative Prompting for Surgical Instrument Segmentation

Wenxi Yue, Jing Zhang, Kun Hu, Qiuxia Wu, Zongyuan Ge, Yong Xia, Jiebo, Luo, Zhiyong Wang

PDF

Open Access 2 Repos

TL;DR

SurgicalPart-SAM introduces a novel prompt-based approach that integrates instrument part knowledge with SAM for detailed and accurate surgical instrument segmentation, outperforming existing methods.

Contribution

It proposes a collaborative prompting and cross-modal encoding framework that explicitly models instrument parts and structures for improved segmentation.

Findings

01

Achieves state-of-the-art results on EndoVis datasets.

02

Requires minimal parameter tuning compared to prior methods.

03

Effectively captures both overall instrument structure and fine-grained details.

Abstract

The Segment Anything Model (SAM) exhibits promise in generic object segmentation and offers potential for various applications. Existing methods have applied SAM to surgical instrument segmentation (SIS) by tuning SAM-based frameworks with surgical data. However, they fall short in two crucial aspects: (1) Straightforward model tuning with instrument masks treats each instrument as a single entity, neglecting their complex structures and fine-grained details; and (2) Instrument category-based prompts are not flexible and informative enough to describe instrument structures. To address these problems, in this paper, we investigate text promptable SIS and propose SurgicalPart-SAM (SP-SAM), a novel SAM efficient-tuning approach that explicitly integrates instrument structure knowledge with SAM's generic knowledge, guided by expert knowledge on instrument part compositions. Specifically, we…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Imaging in Medicine · Surgical Simulation and Training

MethodsSegment Anything Model