TL;DR
This paper introduces a novel 4D spatio-temporal deep learning approach for force estimation in OCT-based minimally-invasive surgery, demonstrating improved accuracy and the potential for short-term force prediction.
Contribution
It extends deep learning-based force estimation to 4D OCT data, compares data representations, and analyzes temporal effects and force prediction capabilities.
Findings
4D approach outperforms lower-dimensional methods with 10.7mN MAE
Temporal information improves force estimation accuracy
Force prediction for safety features is feasible
Abstract
Estimating the forces acting between instruments and tissue is a challenging problem for robot-assisted minimally-invasive surgery. Recently, numerous vision-based methods have been proposed to replace electro-mechanical approaches. Moreover, optical coherence tomography (OCT) and deep learning have been used for estimating forces based on deformation observed in volumetric image data. The method demonstrated the advantage of deep learning with 3D volumetric data over 2D depth images for force estimation. In this work, we extend the problem of deep learning-based force estimation to 4D spatio-temporal data with streams of 3D OCT volumes. For this purpose, we design and evaluate several methods extending spatio-temporal deep learning to 4D which is largely unexplored so far. Furthermore, we provide an in-depth analysis of multi-dimensional image data representations for force estimation,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
