Test–Retest Reliability and Validity of a Sums-of-Gaussians-Based Markerless Motion Capture System for Human Lower-Limb Gait Kinematics
Yifei Shou, Chuang Gao, Chenbin Xi, Junqi Jia, Jiaojiao Lü, Yufei Fang, Chengte Lin, Zhiqiang Liang

TL;DR
This study evaluates a markerless motion capture system's reliability and accuracy for measuring lower-limb gait movements compared to a traditional system.
Contribution
The study provides a detailed validation of a markerless motion capture system's performance in measuring gait kinematics.
Findings
MocapGS showed acceptable reliability for sagittal-plane ROM at the hip and knee but had larger measurement errors than Vicon.
Systematic biases and large limits of agreement were observed in several joint axes during walking and running.
MocapGS overestimated ROM compared to Vicon, especially in non-sagittal planes and at distal joints.
Abstract
Background and aim: Traditional marker-based optical motion capture systems are costly, time-consuming to operate, and constrained by laboratory environments, limiting their broader adoption in clinical practice and naturalistic settings. Markerless motion capture based on a sums-of-Gaussians (SoG) body model is a potential alternative; however, its metrological properties for kinematic assessment during walking and slow running remain insufficiently validated. Using a conventional marker-based Vicon system as the reference, this study evaluated the reliability and concurrent validity of an SoG-based markerless system (MocapGS) for bilateral lower-limb joint range of motion (ROM) during gait. Methods: Thirty-six healthy adults completed self-selected-pace speed walking and slow running tasks while both systems synchronously acquired bilateral lower-limb kinematics. The intraclass…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15
Figure 16- —China Postdoctoral Science Foundation
- —Zhejiang Province Rehabilitation Medical Society Research Special Fund
- —Student Research and Innovation Program of Ningbo University
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsBalance, Gait, and Falls Prevention · Lower Extremity Biomechanics and Pathologies · Sports Performance and Training
1. Introduction
Precise quantification of human gait is a central topic in biomechanics, rehabilitation medicine, and motor control, and it is essential for assessing lower-limb functional status, identifying movement disorders, and optimizing training prescriptions [1,2,3]. High-quality three-dimensional gait kinematic acquisition for humans depends substantially on the performance of motion capture systems and their applicability across different settings [2,3]. Marker-based optical motion capture systems have well-recognized advantages in measurement accuracy, temporal resolution, and test–retest reliability under controlled laboratory conditions and are still widely regarded as one of the gold standards for human movement analysis [3,4,5]. Nevertheless, their susceptibility to soft-tissue artifacts, strict requirements for operator expertise, clothing and environment constraints, and relatively complex testing procedures have limited their wider use in naturalistic environments and large-sample studies [4,6,7].
With advances in computer vision, machine learning, and multi-view 3D reconstruction, markerless motion capture has increasingly become an important complement to marker-based systems [2,3,4,5,8,9]. Markerless systems typically integrate multi-camera video/depth information with human body models and pose estimation algorithms to automatically identify skeletal pose and reconstruct kinematics [8,9]. Compared with marker-based systems, markerless approaches offer the clear advantages of simplified workflows, reduced dependence on clothing and lab conditions, and better preservation of natural movement, making them particularly suitable for multi-speed, multi-task movement analyses and long-term monitoring [2,8,9]. However, because markerless systems differ substantially in their modeling strategies, algorithm robustness, and quantitative accuracy, systematic comparative studies are needed to establish whether their kinematic outputs are sufficiently consistent with gold-standard systems [3,5,8,9].
The sums-of-Gaussians body model (SoG) is a recently developed representative modeling framework for markerless motion capture [10,11,12,13]. This approach parameterizes body segment shape and appearance using a set of Gaussian distributions and performs continuous pose tracking via optimization under multi-camera image constraints [10,11,12]. Compared with approaches that rely heavily on large-scale training data or complex silhouette extraction, SoG-based modeling may require fewer prior data and achieve 3D pose estimation at relatively low computational cost, providing a favorable balance between reconstruction accuracy and real-time performance [10,11,12]. Existing studies suggest that SoG models can achieve a degree of concurrent validity and reliability in joint kinematics reconstruction in algorithmic and simulation contexts [10,11,12,13]. However, systematic quantitative validation of real human periodic movements, particularly walking and running across different speeds, remains limited, which hinders standardized clinical gait analysis and broader methodological adoption for performance assessment.
Therefore, this study used a conventional marker-based optical motion capture system as a reference under unified experimental conditions and compared SoG-based markerless motion capture with marker-based motion capture during two typical periodic lower-limb tasks (walking and running). We aimed to (i) compare bilateral lower-limb joint kinematics outputs and (ii) quantify agreement between the systems across tasks, thereby systematically evaluating the feasibility and applicability of SoG-based markerless motion capture for walking and running gait analysis. This work is intended to provide methodological evidence and empirical support for the standardized use of SoG-based markerless motion capture in clinical gait evaluation, rehabilitation monitoring, and biomechanical research. In addition to supporting clinical and laboratory gait assessment, establishing the metrological boundaries of markerless kinematics is directly relevant to field-ready applications such as footwear biomechanics evaluation and sport injury prevention, where rapid, scalable gait screening is often needed.
2. Materials and Methods
2.1. Participants
Thirty-six healthy adult university students (height: 171.20 ± 8.04 cm; mass: 59.90 ± 7.28 kg) were recruited. Inclusion criteria were as follows: (1) age ≥ 18 years; (2) right leg dominance; (3) regular physical activity (≥3 sessions/week; ≥30 min/session); and (4) no lower-limb injury within the past six months and no other diseases that could affect exercise capacity. Exclusion criteria were as follows: (1) minors; (2) sedentary individuals; and (3) individuals with habitual late-night routines or other high-risk characteristics for exercise, which aimed to minimize the potential confounding effects of circadian rhythm disruption on gait. The study was approved by the Ethics Committee of Ningbo University (Approval No. TY2025010).
2.2. Experimental Procedure
A randomized crossover experimental design was used. All participants visited the laboratory twice. The first visit was used to collect basic information and familiarize participants with the experimental procedure and testing tasks. During the second visit, participants performed walking and slow running tasks in a randomized order. All walking and slow running trials were performed on a level laboratory floor. For each task, participants completed two sets of three trials at a self-selected speed. During all trials, bilateral lower-limb joint kinematics were synchronously recorded using the MocapGS 3D markerless system (China) based on SoG and the Vicon 3D infrared optical system (UK). ROM over the gait cycle was calculated for subsequent statistical analyses.
2.3. Marker-Based Infrared Optical Motion Capture System
An eight-camera Vicon motion capture system was used to collect 3D kinematics of the hip, knee, and ankle joints bilaterally during walking and slow running. The built-in Plug-in Gait model was adopted, and the sampling frequency was 200 Hz. Before data acquisition, 16 reflective markers were attached to anatomical landmarks according to Plug-in Gait requirements: left anterior superior iliac spine, right anterior superior iliac spine, left posterior superior iliac spine, right posterior superior iliac spine, left thigh, right thigh, left knee, right knee, left tibia, right tibia, left ankle, right ankle, left toe, right toe, left heel, and right heel.
2.4. Markerless Motion Capture System
A four-camera MocapGS (China) markerless motion capture system was used to capture 3D kinematics of the hip, knee, and ankle joints bilaterally during walking and running at 50 Hz. This system employs the SoG algorithm to automatically track the participant and build a skeletal model; its software provides real-time visual feedback on the skeletal model overlay. During gait measurement, the four cameras were positioned at the four corners of the capture volume on vertical mounts, with their lenses angled inward at 65° relative to the mount. A dynamic calibration procedure using a predefined wand was performed before each testing session to define the capture space (5 m in length and 3 m in width) and synchronize the cameras. Consistent with the Vicon protocol, participants wore tight-fitting, single-color athletic clothing, and the MocapGS system recorded the bilateral lower-limb kinematics during walking and slow running without requiring participants to wear any markers. After data collection, kinematics were computed via automatic matching to a static skeletal model and exported in C3D format.
2.5. Ground Reaction Force Measurement
Vertical ground reaction force (GRF) data were collected using two force plates (AMTI, Watertown, MA, USA) embedded in the walkway. The force plates were sampled at 1000 Hz. The raw GRF signals were filtered using a fourth-order zero-lag Butterworth low-pass filter with a cutoff frequency of 50 Hz. Gait events (initial contact and toe-off) were identified automatically for each step by the force plate. Initial contact was defined as the instant the vertical GRF exceeded 20 N, and toe-off was defined as the instant it fell below 20 N. The force plate system was synchronized with both the Vicon and MocapGS motion capture systems via an external analog trigger signal, ensuring the temporal alignment of all data streams for subsequent analysis.
2.6. Data Processing and Statistical Analysis
2.6.1. Data Processing
After preliminary processing using the proprietary software of each system, 3D joint kinematics were imported into Visual3D (v3.21.0; C-Motion, Inc., Germantown, MD, USA). A participant-specific lower-limb skeletal model was built based on the static trial. Data of the lower-limb joint kinematics were filtered using a fourth-order zero-lag Butterworth low-pass filter with a cutoff frequency of 10 Hz. Gait events were automatically identified using vertical ground reaction force. Each walking and slow running bout was segmented into independent gait cycles to compute the 3D ROM for each joint.
2.6.2. Statistical Analysis
All ROM data were reported as mean ± standard deviation (M ± SD). Normality was assessed using the Shapiro–Wilk test; nonparametric tests were applied if normality assumptions were violated.
Relative reliability was assessed using the ICCs for the ROM of the hip, knee, and ankle along the X-, Y-, and Z-axes bilaterally. ICC values were computed by comparing the first three trials versus the last three trials. The ICC was calculated using a two-way mixed-effects model for absolute agreement, where each measurement was based on the mean of three trials (k = 3). Reliability thresholds were ICC > 0.75 (high), 0.40 < ICC ≤ −0.75 (moderate), and ICC < 0.40 (low).
Absolute reliability was quantified using the SEM, SEM%, MDC, and MDC%. The SEM was also derived from the ICC model to assess measurement precision. SEM% and MDC% were calculated as SEM% = (SEM/Mean ROM) × 100% and MDC% = (MDC/Mean ROM) × 100%, respectively. The mean ROM was the average from the corresponding system for each joint. Descriptive statistics were used to compare the above parameters to evaluate the measurement precision and sensitivity between systems.
Concurrent validity was evaluated using Pearson’s correlations and paired-sample t-tests. The RMSE was calculated to quantify differences between the measurements from the two systems. The CCC was calculated to assess the agreement between the two measurement systems. Bland–Altman analysis was performed to quantify the mean systematic bias (Vicon minus MocapGS) and the 95% LOA, with the SD of differences and LOA width used to assess random error magnitude across joints, axes, and lower-limb sides. Correlation strength was interpreted as 0.8 < r ≤ 1.0 (very strong), 0.6 < r ≤ 0.8 (strong), 0.4 < r ≤ 0.6 (moderate), and r ≤ 0.4 (weak). The CCC threshold was consistent with the ICC threshold. All analyses were conducted in SPSS 21.0 (IBM Inc., Armonk, NY, USA), with α = 0.05.
3. Results
3.1. Relative Reliability of Lower-Limb Joint Kinematics Measured by the Two Systems During Walking
Figure 1 and Table 1 summarize the ICCs for the ROM of the bilateral hip, knee, and ankle joints along the X-, Y-, and Z-axes during walking. Vicon produced statistically significant reliability for the ROM of the three lower-limb joints (p < 0.05); except for the comparatively lower ICCs for the left ankle and left knee along the X-axis, most joint axes achieved at least moderate reliability. In contrast, the MocapGS showed statistically significant reliability (p < 0.05) only for selected joint axes. Specifically, the ICC for the left hip ROM along the X-axis was high, whereas the left knee ROM along the X-, Y-, and Z-axes showed moderate ICCs, with the knee X-axis ICC exceeding that of Vicon. On the right side, the knee ROM along the Z-axis approached a high ICC, while the hip ROM along the X-axis and the ankle ROM along the Z-axis showed moderate ICCs.
3.2. Absolute Reliability of Lower-Limb Joint Kinematics Measured by the Two Systems During Walking
Absolute reliability of the bilateral hip, knee, and ankle ROM during walking was quantified using the SEM, SEM%, MDC, and MDC%. Figure 2 and Figure 3 and Table 2 and Table 3 report the SEM, SEM%, MDC, and MDC% for ROM of the bilateral hip, knee, and ankle joints during walking.
3.2.1. Comparison of Measurement Precision During Walking
Descriptive comparisons of measurement precision (Figure 2 and Table 2) indicated that, compared to Vicon, MocapGS produced a smaller SEM for the left ankle ROM along the Y- and Z-axes, the left knee ROM along the X-axis, and the right ankle ROM along the Z-axis. When normalized to ROM, MocapGS showed a lower SEM% for the left knee X-axis than Vicon. For the remaining joint axes, neither the SEM nor SEM% from MocapGS surpassed those obtained with Vicon.
3.2.2. Comparison of Measurement Sensitivity During Walking
Regarding measurement sensitivity (Figure 3 and Table 3), MocapGS produced a smaller MDC than Vicon for the left ankle ROM along the Y- and Z-axes and for the left knee ROM along the X-axis; the MDC% for the left knee X-axis ROM was lower for MocapGS. For the right limb, MocapGS produced a smaller MDC for the ankle Z-axis ROM than Vicon, whereas there was no MDC% advantage for MocapGS over Vicon.
3.3. Concurrent Validity of Lower-Limb Joint Kinematics Measured by the Two Systems During Walking
3.3.1. Correlation Analysis of ROM Between Systems During Walking
The Pearson correlation analysis did not show any significant correlations in the joint ROM between MocapGS and Vicon during walking (all p > 0.05, Figure 4).
3.3.2. Comparisons of ROM Between Systems During Walking
Paired-sample t-tests (Figure 5 and Table 4) showed significant between-system differences for the bilateral ankle ROM along the X-, Y-, and Z-axes; the left knee ROM along the X- and Z-axes; the right knee ROM along the X-, Y-, and Z-axes; and the left hip ROM along the Y- and Z-axes (p < 0.05). MocapGS showed larger ankle ROM along the X-axis bilaterally and greater knee ROM along the X-axis bilaterally than Vicon. MocapGS showed smaller ankle ROM along the Y- and Z-axes, smaller right knee ROM along the Y-axis, smaller bilateral knee ROM along the Z-axis, and smaller left hip ROM along the Y- and Z-axes than Vicon.
RMSE analyses further indicated systematic between-system discrepancies in ROM during walking (Figure 6 and Table 4). The mean RMSE across the three joints of the left lower limb was 11.09°, with joint-specific mean RMSEs of 6.19° at the hip, 12.08° at the knee, and 15.01° at the ankle. For the right lower limb, the mean RMSE across the three joints was 10.63°, with joint-specific mean RMSEs of 5.49° at the hip, 11.14° at the knee, and 15.24° at the ankle. The smallest discrepancies were observed in the Y-axis ROM at the knee and hip, both below 5°.
3.3.3. Agreement Analysis of ROM Between Systems During Walking
The CCC and Bland–Altman analyses showed the patterns of measurement agreement between the MocapGS and Vicon systems across all lower-limb joints and planes (Figure 7 and Figure 8, Table 5). The CCC results indicated no significant consistency between the two systems (p > 0.05).
The Bland–Altman analysis found axis-dependent systematic bias and spatial heterogeneity in random errors between the Vicon and MocapGS for ROM. The axis-dependent systematic bias, as reported by the mean difference (MD), showed that MocapGS overestimated the left lower-limb ROM at the ankle X-axis and knee X- and Y-axes and underestimated ROM at the ankle Y- and Z-axes, knee Z-axis, and all hip axes, compared to Vicon. MocapGS overestimated the right lower-limb ROM at the ankle X-axis, knee X-axis, and hip X- and Y-axes and underestimated ROM at the remaining joint axes, compared to Vicon. The random errors, as represented by the SD and 95% LOA, showed spatial heterogeneity. The knee X- and Z-axes and the ankle Y-axis of the left lower limb showed larger random errors. For the right lower limb, the ankle X- and Y-axes and the hip Z-axis showed larger random errors.
3.4. Relative Reliability of Lower-Limb Joint Kinematics Measured by the Two Systems During Slow Running
Table 6 summarizes the ICCs for the ROM of the bilateral hip, knee, and ankle joints along the X-, Y-, and Z-axes during slow running. The Vicon system produced 15 statistically significant ICCs (p < 0.05), including 8 for the left lower limb and 7 for the right lower limb, with ICCs overall falling within the moderate-to-high reliability range (Figure 9). By contrast, the MocapGS system produced seven statistically significant ICCs (p < 0.05), including four for the left lower limb and three for the right lower limb. On the left lower limb, the ankle and hip X-axis ROM showed high reliability, and the knee X-axis ROM and ankle Y-axis ROM showed moderate reliability. On the right lower limb, the hip X-axis ROM showed high reliability, and the knee X-axis ROM and ankle Y-axis ROM showed moderate reliability.
3.5. Absolute Reliability of Lower-Limb Joint Kinematics Measured by the Two Systems During Slow Running
3.5.1. Measurement Precision Between Systems During Slow Running
Figure 10 summarizes the SEM and SEM% measurements across the joints and planes during slow running. Descriptive comparisons of measurement precision during slow running (Figure 10 and Table 7) found that, compared to Vicon, MocapGS produced a smaller SEM for the knee ROM along the Z-axis bilaterally. However, the SEM% did not favor MocapGS for these variables, and an SEM% advantage was observed only for the left knee ROM along the X-axis. For the remaining joint axes, MocapGS did not show a lower SEM or SEM% than Vicon.
3.5.2. Comparisons of Measurement Sensitivity Between Systems During Slow Runing
Figure 11 summarizes the MDC and MDC% across the joints and planes during slow running. Descriptive comparisons of measurement sensitivity (Figure 11 and Table 7) found that MocapGS produced a smaller MDC for the ankle ROM along the Z-axis bilaterally compared with Vicon. Nevertheless, MocapGS did not outperform Vicon in the MDC% for any lower-limb ROM.
3.6. Concurrent Validity of Lower-Limb Joint Kinematics Measured by the Two Systems During Slow Running
3.6.1. Correlation Analysis of ROM Between Systems During Slow Runing
The Pearson correlation analysis showed no significant correlations between MocapGS- and Vicon-derived joint ROM during slow running (all p > 0.05; Figure 12).
3.6.2. Comparisons of ROM Between Systems During Slow Runing
Paired-sample t-tests comparing the joint ROM between the two systems during slow running (Figure 13 and Table 8) showed no significant between-system differences for the left hip ROM along the Z-axis or for the right hip ROM along the Y-axis (p > 0.05). In contrast, ROM about all remaining joint axes for both limbs differed significantly between systems (p < 0.05). Specifically, MocapGS reported significantly greater ankle and knee ROM along the X-axis bilaterally than Vicon, whereas the ankle and knee ROMs along the Y- and Z-axes were significantly smaller than those measured by Vicon (p < 0.05). For the hip, MocapGS-derived ROM along the X-axis was significantly greater than Vicon bilaterally, while the hip ROM along the Y-axis on the left and along the Z-axis on the right was significantly smaller than Vicon (p < 0.05).
RMSE analysis further quantified these discrepancies (Table 8 and Figure 14). The overall RMSE difference across the left lower-limb joint ROM reached 13.96°, with mean joint-specific RMSEs of 7.84° at the hip, 17.74° at the knee, and 16.29° at the ankle. For the right lower limb, the overall RMSE difference across the joint ROM reached 13.40°, with mean RMSEs of 8.31°at the hip, 17.48° at the knee, and 14.40°at the ankle.
3.6.3. Agreement Analysis of ROM Between Systems During Slow Runing
The agreement analysis of the slow-running ROM showed systematic measurement discrepancies between the MocapGS and Vicon systems across all joints (Figure 15 and Figure 16, Table 9). The CCC results indicated no significant consistency between the two systems (p > 0.05).
Bland–Altman analysis found axis-dependent systematic bias and spatial heterogeneity in random errors between the Vicon and MocapGS systems for measuring lower limb joint ROM. MocapGS overestimated the left lower-limb ROM at the ankle, knee, and hip X-axes and underestimated the left lower-limb ROM at all joint Y- and Z-axes, compared to Vicon. MocapGS overestimated the right lower-limb ROM at the ankle, knee, and hip X-axes and hip Y-axis and underestimated the right lower-limb ROM at all remaining joint axes, compared to Vicon.
The random errors, as represented by the SD and 95% LOA width, showed spatial heterogeneity. The hip X-axis, ankle Y-axis, and knee Z-axis of the left lower limb showed larger random errors. The hip X- and Z-axes and knee Y-axis of the right lower limb showed larger random errors.
4. Discussion
This study evaluated the feasibility of MocapGS, an SoG-based markerless motion capture system, for human gait analysis by comparing bilateral lower-limb joint ROM with a Vicon system during walking and slow running. The results indicated that MocapGS exhibited acceptable reliability primarily in the sagittal plane for hip and knee ROM. However, its concurrent validity and consistency remained significantly distinct from those of the Vicon system, especially in the frontal and transverse planes. These findings suggest that the current implementation of MocapGS does not achieve a level of measurement reliability and concurrent validity comparable to that of the Vicon system for human gait analysis. There is still much room for improvement in quantitative gait assessment.
4.1. Reliability and Concurrent Validity
The reliability performance observed for the Vicon is consistent with prior work showing that marker-based optical systems provide relatively stable joint ROM estimates under controlled conditions, particularly in the sagittal plane [3,14]. In contrast, the reduced reliability of MocapGS for frontal and transverse plane ROM aligns with the broader literature reporting that markerless systems are more sensitive to occlusion, pose ambiguity, and model-fitting instability for rotations that are comparatively small in amplitude and more susceptible to noise [3,9,15,16,17,18]. These limitations may be amplified during running because of higher segment velocities, larger inter-frame motion, and increased limb self-occlusion, which can degrade multi-view correspondence and tracking stability.
Beyond reliability, the concurrent validity analyses revealed a fundamental limitation in agreement between the systems. This finding is compounded by the lack of measurement consistency and the Bland–Altman analysis, which quantified substantial systematic biases [19,20,21,22]. These results indicate that the discrepancies captured by the elevated RMSE are not merely random but include consistent directional errors. Consequently, the MocapGS, in its current implementation, cannot be considered metrologically interchangeable with the marker-based Vicon system for quantitative gait assessment where precise multi-planar kinematics are required.
4.2. Joint-Specific Performance and Methodological Considerations
MocapGS exhibited only moderate-to-high ICCs for sagittal ROM at the hip and knee during walking and slow running, with knee sagittal ROM during walking approaching the reliability of Vicon. This limited agreement indicates that the SoG model merely captures gross flexion–extension patterns at proximal joints, rather than providing high-fidelity kinematic data. While this observation superficially aligns with reports that markerless methods perform adequately for high signal-to-noise movements [9,14,17], it also underscores a fundamental constraint: even under favorable conditions, MocapGS yields a larger SEM% and MDC% than Vicon, reflecting poor measurement precision and an inadequate sensitivity to detect clinically meaningful changes.
In contrast to the proximal joints, more pronounced limitations were observed at the ankle. MocapGS showed low reliability for ankle sagittal ROM during walking and for ankle frontal ROM during slow running. Moreover, no significant correlations between systems were found for the ankle frontal ROM in either task. These discrepancies are consistent with the known challenges of image-based tracking for distal segments, including motion blur, occlusion by the contralateral limb, and limited visual features. Small rotational ROMs at the ankle in the frontal and transverse planes are particularly susceptible to being obscured by reconstruction error [9,16,17,18]. While marker-based systems are also subject to errors such as soft-tissue artifact and marker placement variability at the ankle, the nature and magnitude of the errors differ between the approaches [23,24]. This distinction reinforces the conclusion that the two systems are not directly interchangeable.
Several methodological factors likely account for the observed error patterns. First, the disparity in sampling frequency (Vicon: 200 Hz; MocapGS: 50 Hz) may be critical. The lower rate of MocapGS could be insufficient to accurately capture rapid distal-segment movements during slow running, potentially introducing aliasing or loss of kinematic detail [25]. Second, although the four-camera setup provided a calibrated volume, it presented challenges for dynamic tasks. Occlusion of the ankle and lower leg during slow running could destabilize the SoG model fitting, an effect particularly detrimental for estimating smaller-magnitude rotations in non-sagittal planes. Beyond measurement constraints, fundamental modeling differences must be considered. Inherent variations in coordinate system definitions and joint center estimation methods between the Plug-in Gait model and the proprietary SoG-based algorithm can lead to systematic offsets in the reported ROM [3,14,26], which further explain the lack of agreement between systems.
4.3. Practical Implications for Clinical and Research Applications
Our findings provide concrete guidance for practitioners considering the adoption of MocapGS. This system demonstrates acceptable reliability for monitoring the sagittal-plane range of motion at the hip and knee during gait. This suggests its potential utility in applied settings where tracking gross movement patterns is sufficient, such as in longitudinal rehabilitation monitoring or large-scale biomechanical screening. However, the outputs require careful interpretation for ankle kinematics and for motions in the frontal and transverse planes at all joints, where measurement errors were larger and reliability was often poor.
The RMSE between systems frequently exceeded 5 degrees for many joint-plane combinations, a threshold often referenced for clinically meaningful differences in gait kinematics [27,28]. This underscores that changes smaller than this margin may be indistinguishable from system noise. Consequently, the current MocapGS system is not a substitute for marker-based motion capture in applications demanding high precision, such as clinical decision-making or the detection of subtle gait alterations [29,30].
Building on these performance boundaries, the system’s suitability varies by application: (1) For sports monitoring and large-scale screening, it may be appropriate for identifying gross deviations in lower-limb movement patterns, especially in the sagittal plane, within scalable workflows. (2) In rehabilitation settings, it could support the longitudinal tracking of broad kinematic trends during recovery. (3) For clinical gait laboratories, the evidence does not support its use as a replacement for marker-based systems in diagnostic decision-making or the detection of subtle pathological changes; its role would currently be limited to supplemental screening or monitoring where exact agreement with gold-standard kinematics is not critical [31,32,33].
4.4. Future Directions
Future work should aim to improve the robustness and accuracy of SoG-based markerless systems for gait analysis through multi-faceted approaches. This includes optimizing hardware configuration, such as employing more cameras, enhancing the algorithmic core by improving model fitting under occlusion, and exploring advanced data augmentation and balancing strategies, such as generative AI techniques to synthesize training data for challenging motions [34]. Methodological harmonization and standardization should also be promoted [35,36]. Additional validation should include larger and more diverse samples, multiple gait speeds, and clinical populations and examine not only ROM but also time-series kinematics and other relevant parameters [3,36,37,38].
5. Conclusions
This study evaluated the reliability and concurrent validity of an SoG-based markerless motion capture system (MocapGS) for estimating bilateral lower-limb joint ROM during walking and slow running, using a marker-based Vicon system as the reference. The MocapGS system demonstrated acceptable test–retest reliability primarily for the sagittal-plane ROM at the hip and knee, whereas reliability and agreement were poor for the non-sagittal-plane ROM and ankle kinematics. Therefore, the current MocapGS system may be suitable for coarse monitoring of large-magnitude sagittal-plane kinematic changes in gait, but it is not yet a substitute for marker-based motion capture in applications requiring precise multi-planar kinematics or detection of subtle gait changes. From a translational standpoint, these findings provide an evidence base for using markerless gait assessment in footwear biomechanics and sports injury prevention scenarios that prioritize scalable screening and longitudinal monitoring over fine-grained clinical decision thresholds.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Akhtaruzzaman M. Shafie A.A. Khan M.R. Gait analysis: Systems, technologies, and importance J. Mech. Med. Biol.201616163000310.1142/S 0219519416300039 · doi ↗
- 2Muro-de-la-Herran A. Garcia-Zapirain B. Mendez-Zorrilla A. Gait Analysis Methods: An Overview of Wearable and Non-Wearable Systems, Highlighting Clinical Applications Sensors 2014143362339410.3390/s 14020336224556672 PMC 3958266 · doi ↗ · pubmed ↗
- 3Kanko R.M. Laende E.K. Davis E.M. Selbie W.S. Deluzio K.J. Concurrent assessment of gait kinematics using marker-based and markerless motion capture J. Biomech.202112711066510.1016/j.jbiomech.2021.11066534380101 · doi ↗ · pubmed ↗
- 4A Mathis A. Mamidanna P. Abe T. Cury K.M. Murthy V.N. Mathis M.W. Bethge M. Markerless tracking of user-defined features with deep learningar Xiv 201810.48550/ar Xiv.1804.031421804.0314230127430 · doi ↗ · pubmed ↗
- 5Liang S.G. Chung H.Y. Chu K.W. Gao Y.H. Lau F.Y. Lai W.I. Fong G.C.-H. Kwong P.W.-H. Lam F.M.H. Validity and Reliability of a Smartphone-Based Gait Assessment in Measuring Temporal Gait Parameters: Challenges and Recommendations Biosensors 20251539710.3390/bios 1507039740710047 PMC 12294008 · doi ↗ · pubmed ↗
- 6Cronin N.J. Rantalainen T. Ahtiainen J.P. Hynynen E. Waller B. Markerless 2D kinematic analysis of underwater running: A deep learning approach J. Biomech.201987758210.1016/j.jbiomech.2019.02.02130850178 · doi ↗ · pubmed ↗
- 7Clark R.A. Pua Y.-H. Bryant A.L. Hunt M.A. Validity of the Microsoft Kinect for providing lateral trunk lean feedback during gait retraining Gait Posture 2013381064106610.1016/j.gaitpost.2013.03.02923643880 · doi ↗ · pubmed ↗
- 8Nakano N. Sakura T. Ueda K. Omura L. Kimura A. Iino Y. Fukashiro S. Yoshioka S. Evaluation of 3D Markerless Motion Capture Accuracy Using Open Pose With Multiple Video Cameras Front. Sports Act. Living 202025010.3389/fspor.2020.0005033345042 PMC 7739760 · doi ↗ · pubmed ↗
