TL;DR
SurgeonAssist-Net is a lightweight, efficient AI framework that enables real-time surgical workflow recognition on head-mounted displays, improving accuracy and speed while reducing computational requirements for augmented reality surgical guidance.
Contribution
It introduces a novel, resource-efficient neural network architecture for surgical task recognition compatible with commercial AR headsets, achieving near real-time performance.
Findings
Competitive accuracy on laparoscopic workflow dataset
Significantly fewer parameters and FLOPS compared to state-of-the-art
Capable of near real-time inference on HoloLens 2
Abstract
We present SurgeonAssist-Net: a lightweight framework making action-and-workflow-driven virtual assistance, for a set of predefined surgical tasks, accessible to commercially available optical see-through head-mounted displays (OST-HMDs). On a widely used benchmark dataset for laparoscopic surgical workflow, our implementation competes with state-of-the-art approaches in prediction accuracy for automated task recognition, and yet requires 7.4x fewer parameters, 10.2x fewer floating point operations per second (FLOPS), is 7.0x faster for inference on a CPU, and is capable of near real-time performance on the Microsoft HoloLens 2 OST-HMD. To achieve this, we make use of an efficient convolutional neural network (CNN) backbone to extract discriminative features from image data, and a low-parameter recurrent neural network (RNN) architecture to learn long-term temporal dependencies. To…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
