A Nested U-Structure for Instrument Segmentation in Robotic Surgery
Yanjie Xia, Shaochen Wang, and Zhen Kan

TL;DR
This paper introduces a nested U-structure model for pixel-wise segmentation of robotic surgical instruments, improving context capture and segmentation quality in complex surgical environments.
Contribution
It proposes a novel nested U-structure encoder-decoder architecture with skip-connections for enhanced instrument segmentation in robotic surgery.
Findings
Effective in binary, parts, and type segmentation tasks
Outperforms existing methods in qualitative and quantitative evaluations
Captures multi-scale context for high-quality segmentation
Abstract
Robot-assisted surgery has made great progress with the development of medical imaging and robotics technology. Medical scene understanding can greatly improve surgical performance while the semantic segmentation of the robotic instrument is a key enabling technology for robot-assisted surgery. However, how to locate an instrument's position and estimate their pose in complex surgical environments is still a challenging fundamental problem. In this paper, pixel-wise instrument segmentation is investigated. The contributions of the paper are twofold: 1) We proposed a two-level nested U-structure model, which is an encoder-decoder architecture with skip-connections and each layer of the network structure adopts a U-structure instead of a simple superposition of convolutional layers. The model can capture more context information from multiple scales and better fuse the local and global…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMedical Image Segmentation Techniques · Advanced Neural Network Applications · Robotics and Sensor-Based Localization
