Temporally Guided Articulated Hand Pose Tracking in Surgical Videos
Nathan Louis, Luowei Zhou, Steven J. Yule, Roger D. Dias, Milisa, Manojlovich, Francis D. Pagani, Donald S. Likosky, Jason J. Corso

TL;DR
This paper introduces CondPose, a temporally guided articulated hand pose tracking model for surgical videos, which improves accuracy over state-of-the-art methods by leveraging past predictions and a new dataset, Surgical Hands.
Contribution
The paper presents a novel temporally guided hand pose estimation model, CondPose, and introduces the Surgical Hands dataset with multi-instance annotations for surgical videos.
Findings
CondPose outperforms state-of-the-art methods in pose estimation and tracking.
Temporal guidance significantly improves localization accuracy.
The Surgical Hands dataset enables robust evaluation of hand pose tracking in surgical videos.
Abstract
Articulated hand pose tracking is an under-explored problem that carries the potential for use in an extensive number of applications, especially in the medical domain. With a robust and accurate tracking system on surgical videos, the motion dynamics and movement patterns of the hands can be captured and analyzed for many rich tasks. In this work, we propose a novel hand pose estimation model, CondPose, which improves detection and tracking accuracy by incorporating a pose prior into its prediction. We show improvements over state-of-the-art methods which provide frame-wise independent predictions, by following a temporally guided approach that effectively leverages past predictions. We collect Surgical Hands, the first dataset that provides multi-instance articulated hand pose annotations for videos. Our dataset provides over 8.1k annotated hand poses from publicly available surgical…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSurgical Simulation and Training · Human Pose and Action Recognition · Stroke Rehabilitation and Recovery
