# A wearable monitoring system for running gait analysis by diffusion transformer

**Authors:** Xiaoxue Hu, Guoyu Wang, Guodong Ma

PMC · DOI: 10.1371/journal.pone.0341066 · PLOS One · 2026-01-23

## TL;DR

This paper introduces a wearable system using a Diffusion Transformer to improve running gait analysis with high accuracy and real-time monitoring.

## Contribution

The novel integration of a Diffusion Transformer with LSTM for high-precision running posture recognition in wearable systems.

## Key findings

- The DiT-LSTM model achieved over 97% accuracy on the Human3.6M dataset, outperforming other algorithms.
- The system demonstrated real-time performance with a 9.4 ms inference latency and high positioning accuracy via UWB.
- Position drift correction showed low mean errors and RMSE values, confirming stability and precision.

## Abstract

Conventional wearable monitoring devices often suffer from insufficient data accuracy and low posture recognition rates, making them inadequate for the demands of professional sports health monitoring. To address these issues, this study proposes a wearable monitoring system for running gait analysis based on the Diffusion Transformer (DiT). The system aims to achieve high-precision running posture recognition and real-time motion monitoring through multi-sensor data fusion and advanced deep learning architecture. First, a wearable system was developed using a nine-axis Micro-Electro-Mechanical System (MEMS) inertial sensor and an UltraWide Band (UWB) positioning module. Data quality was enhanced through sensor calibration, noise compensation, and an adaptive filtering algorithm. Then, a DiT-LSTM running posture recognition model was constructed by integrating the DiT with a Long Short-Term Memory (LSTM) network to perform posture recognition within the wearable system. Experimental results show that on the Human3.6M dataset, the DiT-LSTM model achieved an accuracy of 97.54%, a precision of 97.61%, a recall of 97.73%, an F1-score of 97.58%, and an Area Under the Curve (AUC) of 98.61%. On the HumanEva dataset, the model attained an accuracy of 96.39%, a precision of 96.47%, a recall of 96.8%, an F1-score of 96.92%, and an AUC of 97.9%, all outperforming other algorithms. The complexity assessment showed that DiT-LSTM reached 14.7 GFLOPs on the Human3.6M dataset, with a per-epoch training time of 62.3 seconds, while its per-sample inference latency was only 9.4 ms, meeting real-time monitoring requirements. In the UWB-based position drift correction experiment, Sample 1 achieved mean errors of 0.637 m, 0.581 m, and 0.349 m on the X/Y/Z axes, with corresponding RMSE values of 0.041 m, 0.023 m, and 0.025 m, demonstrating high positioning accuracy and stability. By combining multimodal sensors with the DiT-LSTM model, the study offers reliable technical support for running gait analysis, injury prevention, and personalized training guidance.

## Full-text entities

- **Genes:** SDS (serine dehydratase) [NCBI Gene 10993] {aka SDH, hSDH}
- **Diseases:** discoordination (MESH:C562757), CoM (MESH:C536030), arm-swing discoordination (MESH:D001134), foot (MESH:D005530), injury (MESH:D014947)
- **Chemicals:** DiT (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12829845/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12829845/full.md

## References

30 references — full list in the complete paper: https://tomesphere.com/paper/PMC12829845/full.md

---
Source: https://tomesphere.com/paper/PMC12829845