Using Hand Pose Estimation To Automate Open Surgery Training Feedback
Eddie Bkheet, Anne-Lise D'Angelo, Adam Goldbraikh, Shlomi Laufer

TL;DR
This paper demonstrates that 2D hand pose estimation can effectively segment surgical gestures and assess surgeon skill, enabling automated, remote, and markerless surgical training feedback.
Contribution
It introduces a novel framework combining 2D hand pose estimation with domain-specific proxies for surgical skill assessment and gesture segmentation.
Findings
Achieved 88.35% gesture segmentation accuracy.
Identified significant differences between novices and experts.
Proposed actionable surgical skill proxies.
Abstract
Purpose: This research aims to facilitate the use of state-of-the-art computer vision algorithms for the automated training of surgeons and the analysis of surgical footage. By estimating 2D hand poses, we model the movement of the practitioner's hands, and their interaction with surgical instruments, to study their potential benefit for surgical training. Methods: We leverage pre-trained models on a publicly-available hands dataset to create our own in-house dataset of 100 open surgery simulation videos with 2D hand poses. We also assess the ability of pose estimations to segment surgical videos into gestures and tool-usage segments and compare them to kinematic sensors and I3D features. Furthermore, we introduce 6 novel surgical dexterity proxies stemming from domain experts' training advice, all of which our framework can automatically detect given raw video footage. Results:…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Anatomy and Medical Technology
