Surgical Gesture Recognition Based on Bidirectional Multi-Layer Independently RNN with Explainable Spatial Feature Extraction
Dandan Zhang, Ruoxi Wang, Benny Lo

TL;DR
This paper introduces a novel bidirectional multi-layer RNN with explainable spatial feature extraction for surgical gesture recognition, achieving high accuracy and interpretability in minimally invasive surgery tasks.
Contribution
It proposes a new BML-indRNN model combined with explainable DCNN features using Grad-CAM for surgical gesture recognition, enhancing accuracy and interpretability.
Findings
Achieved 87.13% accuracy on suturing task
Outperformed most state-of-the-art algorithms
Provided explainable visualizations of features
Abstract
Minimally invasive surgery mainly consists of a series of sub-tasks, which can be decomposed into basic gestures or contexts. As a prerequisite of autonomic operation, surgical gesture recognition can assist motion planning and decision-making, and build up context-aware knowledge to improve the surgical robot control quality. In this work, we aim to develop an effective surgical gesture recognition approach with an explainable feature extraction process. A Bidirectional Multi-Layer independently RNN (BML-indRNN) model is proposed in this paper, while spatial feature extraction is implemented via fine-tuning of a Deep Convolutional Neural Network(DCNN) model constructed based on the VGG architecture. To eliminate the black-box effects of DCNN, Gradient-weighted Class Activation Mapping (Grad-CAM) is employed. It can provide explainable results by showing the regions of the surgical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Soft Robotics and Applications · Augmented Reality Applications
MethodsDiffusion-Convolutional Neural Networks · Dropout · Dense Connections · Max Pooling · Convolution · Softmax
