# Benchmark Study of 2D and 3D Gait Analysis for AI-Powered Mobile Fall Risk Screeners

**Authors:** Zengyan Wang, Chen Chen, Chitra Banarjee, Rui Xie, Ladda Thiamwong

PMC · DOI: 10.1093/geroni/igaf122.4076 · 2025-12-31

## TL;DR

This paper compares 2D and 3D gait analysis models for fall risk screening in mobile health units, finding that 2D models like HRNet are more efficient and effective for resource-limited settings.

## Contribution

The study introduces a benchmark comparison of 2D and 3D gait analysis models for mobile fall risk screening, highlighting the practicality of lightweight 2D models.

## Key findings

- 2D models, especially HRNet, outperformed 3D models in gait recognition accuracy and inference speed.
- Temporal metrics from video analysis were comparable to those from IMUs, but spatial alignment with IMU-derived pelvis trajectories failed across all models.
- A 2-camera setup provided stable gait analysis but limited visibility of side-plane movements, reducing confidence in spatial parameters.

## Abstract

Mobile health clinics are commonly used to assess health but rarely equipped to assess fall risk indicators such as gait impairment. To address this gap, we propose AI-driven mobile units integrating video-based gait analysis and inertial measurement units (IMUs) for rapid, low-cost fall risk screening. This study benchmarked eight 2D and 3D gait extraction models (OpenPose, AlphaPose, HRNet, MediaPipe, ViTPose, videopose, MotionBERT, MotionAGFormer) using publicly available videos of older adults completing the Timed Up and Go test (N = 28), which includes one front-facing camera view and another with a higher, top-down viewing angle. Models were compared in their computational, temporal, and spatial gait metrics (including gait speed, cadence, stride time and length). Results show that 2D methods, particularly HRNet, consistently outperformed other models in both successful recognition rate (HRNet 99% vs 88.3% averaged across all other models). HRNet delivered the most reliable ankle tracking and the fastest inference time, key advantages for resource-constrained environments like mobile units. By contrast, 3D models required more computational resources without improving gait recognition. Temporal metrics from the videos were comparable to IMU-derived metrics. However, Spatial alignment with IMU-derived pelvis trajectories failed across all models. While the 2-camera setup provides stable perspectives for gait analyses, it limits visibility of certain side-plane movements, reducing confidence in spatial parameters such as ankle flexion and knee angle. The superior performance of 2D models confirms the feasibility of deploying lightweight, AI-powered mobile systems for fall risk screening, especially in underserved areas with limited access to specialized geriatric care.

---
Source: https://tomesphere.com/paper/PMC12762478