Evaluating the Task Generalization of Temporal Convolutional Networks for Surgical Gesture and Motion Recognition using Kinematic Data
Kay Hutchinson, Ian Reyes, Zongyu Li, and Homa Alemzadeh

TL;DR
This study evaluates how well Temporal Convolutional Networks generalize for surgical gesture and motion recognition using kinematic data, focusing on unseen users and tasks in robot-assisted surgery.
Contribution
It introduces a comprehensive evaluation of TCNs on a multi-task surgical dataset with novel cross-validation methods for assessing generalization to new users and tasks.
Findings
Gesture models outperform motion primitive models in accuracy.
Training separate models for each robotic arm improves recognition.
MP models generalize better to unseen tasks when trained on similar data.
Abstract
Fine-grained activity recognition enables explainable analysis of procedures for skill assessment, autonomy, and error detection in robot-assisted surgery. However, existing recognition models suffer from the limited availability of annotated datasets with both kinematic and video data and an inability to generalize to unseen subjects and tasks. Kinematic data from the surgical robot is particularly critical for safety monitoring and autonomy, as it is unaffected by common camera issues such as occlusions and lens contamination. We leverage an aggregated dataset of six dry-lab surgical tasks from a total of 28 subjects to train activity recognition models at the gesture and motion primitive (MP) levels and for separate robotic arms using only kinematic data. The models are evaluated using the LOUO (Leave-One-User-Out) and our proposed LOTO (Leave-One-Task-Out) cross validation methods…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsStroke Rehabilitation and Recovery · Surgical Simulation and Training · Artificial Intelligence in Healthcare and Education
