Comparison of marker-less 2D image-based methods for infant pose estimation
Lennart Jahn, Sarah Fl\"ugge, Dajie Zhang, Luise Poustka, Sven, B\"olte, Florentin W\"org\"otter, Peter B Marschik, Tomas Kulvicius

TL;DR
This study compares generic and infant-specific 2D pose estimation methods for infants, finding that generic models perform well and that top-down camera views improve accuracy, informing better recording practices for infant movement analysis.
Contribution
It demonstrates that generic adult-trained pose estimators perform effectively on infants and highlights the benefits of top-down views for improved pose estimation accuracy.
Findings
ViTPose performs best among generic models on infants.
Retraining generic models improves infant pose estimation accuracy.
Top-down view significantly enhances pose detection, especially for hips.
Abstract
In this study we compare the performance of available generic- and infant-pose estimators for a video-based automated general movement assessment (GMA), and the choice of viewing angle for optimal recordings, i.e., conventional diagonal view used in GMA vs. top-down view. We used 4500 annotated video-frames from 75 recordings of infant spontaneous motor functions from 4 to 26 weeks. To determine which pose estimation method and camera angle yield the best pose estimation accuracy on infants in a GMA related setting, the distance to human annotations and the percentage of correct key-points (PCK) were computed and compared. The results show that the best performing generic model trained on adults, ViTPose, also performs best on infants. We see no improvement from using infant-pose estimators over the generic pose estimators on our infant dataset. However, when retraining a generic model…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSocial Robot Interaction and HRI
