A Spatio-temporal Attention-based Model for Infant Movement Assessment   from Videos

Binh Nguyen-Thai; Vuong Le; Catherine Morgan; Nadia Badawi; Truyen; Tran; and Svetha Venkatesh

arXiv:2105.09783·cs.CV·May 21, 2021

A Spatio-temporal Attention-based Model for Infant Movement Assessment from Videos

Binh Nguyen-Thai, Vuong Le, Catherine Morgan, Nadia Badawi, Truyen, Tran, and Svetha Venkatesh

PDF

1 Repo

TL;DR

This paper introduces a spatio-temporal attention-based model that uses human pose estimation and graph convolutional networks to assess infant fidgety movements from videos, improving accuracy and interpretability for cerebral palsy screening.

Contribution

The study presents a novel approach combining pose-based features, graph neural networks, and attention mechanisms for infant movement assessment, addressing limitations of appearance-based methods.

Findings

01

Achieved ROC-AUC of 81.87% on real-life videos.

02

Outperformed existing methods in accuracy.

03

Enhanced interpretability of movement assessment.

Abstract

The absence or abnormality of fidgety movements of joints or limbs is strongly indicative of cerebral palsy in infants. Developing computer-based methods for assessing infant movements in videos is pivotal for improved cerebral palsy screening. Most existing methods use appearance-based features and are thus sensitive to strong but irrelevant signals caused by background clutter or a moving camera. Moreover, these features are computed over the whole frame, thus they measure gross whole body movements rather than specific joint/limb motion. Addressing these challenges, we develop and validate a new method for fidgety movement assessment from consumer-grade videos using human poses extracted from short clips. Human poses capture only relevant motion profiles of joints and limbs and are thus free from irrelevant appearance artifacts. The dynamics and coordination between joints are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

nguyenthaibinh/stam
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

MethodsHow do I get a human at Expedia immediately? (2025-2026) · Communication--Guide||How Do I Communicate to Expedia? · Graph Convolutional Networks