Toward Force Estimation in Robot-Assisted Surgery using Deep Learning   with Vision and Robot State

Zonghe Chua; Anthony M. Jarc; Allison M. Okamura

arXiv:2011.02112·cs.RO·July 8, 2022

Toward Force Estimation in Robot-Assisted Surgery using Deep Learning with Vision and Robot State

Zonghe Chua, Anthony M. Jarc, Allison M. Okamura

PDF

TL;DR

This paper develops a deep learning model that estimates interaction forces in robot-assisted surgery using vision and robot state data, aiming to improve force feedback and skill evaluation without direct force sensors.

Contribution

It introduces a neural network combining RGB images and robot state for force estimation, demonstrating improved accuracy and robustness over single-input models and physics-based baselines.

Findings

01

Vision-based networks are sensitive to viewpoint shifts.

02

State-only networks are robust to workspace changes.

03

Combined vision and state inputs yield the best accuracy and real-time performance.

Abstract

Knowledge of interaction forces during teleoperated robot-assisted surgery could be used to enable force feedback to human operators and evaluate tissue handling skill. However, direct force sensing at the end-effector is challenging because it requires biocompatible, sterilizable, and cost-effective sensors. Vision-based deep learning using convolutional neural networks is a promising approach for providing useful force estimates, though questions remain about generalization to new scenarios and real-time inference. We present a force estimation neural network that uses RGB images and robot state as inputs. Using a self-collected dataset, we compared the network to variants that included only a single input type, and evaluated how they generalized to new viewpoints, workspace positions, materials, and tools. We found that vision-based networks were sensitive to shifts in viewpoints,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.