Video-Instrument Synergistic Network for Referring Video Instrument Segmentation in Robotic Surgery
Hongqiu Wang, Lei Zhu, Guang Yang, Yike Guo, Shichen Zhang, Bo Xu,, Yueming Jin

TL;DR
This paper introduces a novel network for referring surgical video instrument segmentation, leveraging video and language cues to improve accuracy and enable interactive, targeted segmentation in robotic surgery.
Contribution
It proposes the VIS-Net model that learns both video-level and instrument-level features, and introduces a graph-based module for multi-modal correlation, along with two new datasets for the task.
Findings
VIS-Net outperforms existing methods on new datasets
The model effectively integrates video and language information
Experimental results show significant accuracy improvements
Abstract
Robot-assisted surgery has made significant progress, with instrument segmentation being a critical factor in surgical intervention quality. It serves as the building block to facilitate surgical robot navigation and surgical education for the next generation of operating intelligence. Although existing methods have achieved accurate instrument segmentation results, they simultaneously generate segmentation masks for all instruments, without the capability to specify a target object and allow an interactive experience. This work explores a new task of Referring Surgical Video Instrument Segmentation (RSVIS), which aims to automatically identify and segment the corresponding surgical instruments based on the given language expression. To achieve this, we devise a novel Video-Instrument Synergistic Network (VIS-Net) to learn both video-level and instrument-level knowledge to boost…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMultimodal Machine Learning Applications · Surgical Simulation and Training · Lung Cancer Diagnosis and Treatment
