World Models for General Surgical Grasping
Hongbin Lin, Bin Li, Chun Wai Wong, Juan Rojas, Xiangyu Chu, and Kwok, Wai Samuel Au

TL;DR
This paper introduces GAS, a world-model-based deep reinforcement learning framework for surgical grasping that achieves high success and robustness in unstructured, real-world surgical environments by handling diverse objects and disturbances.
Contribution
The paper presents a novel pixel-level visuomotor policy for surgical grasping using a unified world-model approach, enabling generality and robustness in complex surgical scenes.
Findings
Achieved 69% success rate in real surgical grasping tasks.
Demonstrated robustness across 6 different environmental conditions.
Handled diverse objects and disturbances effectively in real-world experiments.
Abstract
Intelligent vision control systems for surgical robots should adapt to unknown and diverse objects while being robust to system disturbances. Previous methods did not meet these requirements due to mainly relying on pose estimation and feature tracking. We propose a world-model-based deep reinforcement learning framework "Grasp Anything for Surgery" (GAS), that learns a pixel-level visuomotor policy for surgical grasping, enhancing both generality and robustness. In particular, a novel method is proposed to estimate the values and uncertainties of depth pixels for a rigid-link object's inaccurate region based on the empirical prior of the object's size; both depth and mask images of task objects are encoded to a single compact 3-channel image (size: 64x64x3) by dynamically zooming in the mask regions, minimizing the information loss. The learned controller's effectiveness is extensively…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsHistory of Medical Practice · Surgical Simulation and Training · Anatomy and Medical Technology
