Preterm infants' pose estimation with spatio-temporal features
Sara Moccia, Lucia Migliorelli, Virgilio Carnielli, Emanuele, Frontoni

TL;DR
This paper introduces a novel deep-learning framework utilizing spatio-temporal features from depth videos to accurately estimate limb poses of preterm infants in NICUs, improving reliability over spatial-only methods.
Contribution
The study presents the first use of depth videos from clinical settings for infant pose estimation and introduces a new annotated dataset, babyPose, enhancing the state-of-the-art in neonatal health assessment.
Findings
Spatio-temporal features significantly improve pose estimation accuracy.
Median root mean squared distance was 9.06 pixels, better than spatial-only methods.
The approach is robust in challenging conditions like homogeneous image intensity.
Abstract
Objective: Preterm infants' limb monitoring in neonatal intensive care units (NICUs) is of primary importance for assessing infants' health status and motor/cognitive development. Herein, we propose a new approach to preterm infants' limb pose estimation that features spatio-temporal information to detect and track limb joints from depth videos with high reliability. Methods: Limb-pose estimation is performed using a deep-learning framework consisting of a detection and a regression convolutional neural network (CNN) for rough and precise joint localization, respectively. The CNNs are implemented to encode connectivity in the temporal direction through 3D convolution. Assessment of the proposed framework is performed through a comprehensive study with sixteen depth videos acquired in the actual clinical practice from sixteen preterm infants (the babyPose dataset). Results: When applied…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
