SCISSR: Scribble-Conditioned Interactive Surgical Segmentation and Refinement
Haonan Ping, Jian Jiang, Cheng Yuan, Qizhen Sun, Lv Wu, Yutong Ban

TL;DR
SCISSR is an interactive surgical segmentation framework that uses scribble prompts and lightweight modules to improve accuracy and robustness across different surgical scenes, enabling iterative refinement without retraining the backbone.
Contribution
The paper introduces a scribble-based promptable segmentation framework with lightweight modules compatible with existing models like SAM 2, enhancing surgical scene segmentation accuracy and robustness.
Findings
Achieves 95.41% Dice on EndoVis 2018 with five interactions.
Achieves 96.30% Dice on CholecSeg8k with three interactions.
Outperforms iterative point prompting on both benchmarks.
Abstract
Accurate segmentation of tissues and instruments in surgical scenes is annotation-intensive due to irregular shapes, thin structures, specularities, and frequent occlusions. While SAM models support point, box, and mask prompts, points are often too sparse and boxes too coarse to localize such challenging targets. We present SCISSR, a scribble-promptable framework for interactive surgical scene segmentation. It introduces a lightweight Scribble Encoder that converts freehand scribbles into dense prompt embeddings compatible with the mask decoder, enabling iterative refinement for a target object by drawing corrective strokes on error regions. Because all added modules (the Scribble Encoder, Spatial Gated Fusion, and LoRA adapters) interact with the backbone only through its standard embedding interfaces, the framework is not tied to a single model: we build on SAM 2 in this work, yet…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Advanced Neural Network Applications · Multimodal Machine Learning Applications
