References values and standardized testing protocols for performance-based and patient-reported outcome measures among individuals with lower limb amputation
Matthew McGuire, Sara Nataletti, Rachel Maronati, Shenan Hoppe-Ludwig, Anushua Banerjee, Amber Wacek, Juan Cave, John M. Looft, Christopher L. Dearth, Brad D. Hendershot, Arun Jayaraman

TL;DR
This study provides standardized testing protocols and reference values for measuring outcomes in people with lower limb amputation to improve rehabilitation and prosthetic care.
Contribution
The study introduces standardized administration procedures and population-specific reference values for outcome measures in lower limb amputation patients.
Findings
Reference values for 8 performance-based and 11 patient-reported outcome measures were reported by K-level and amputation type.
10 Meter Walk Test speeds were up to 31% faster than prior studies, possibly due to methodological differences.
Standardized guidelines promote transparency and comparability in prosthetic care and rehabilitation research.
Abstract
Reliable reference data and standardized administration procedures for outcome measures are critical to guide rehabilitation, justify prosthetic prescription, and evaluate treatment outcomes in individuals with lower limb amputation (LLA). Although numerous performance-based and patient-reported measures are used in this population, few studies provide both population-specific reference values and detailed testing protocols. This study describes standardized administration procedure and presents reference values for commonly used outcome measures categorized by functional K-level and amputation level to support consistency in clinical and research applications. Data was collected from the initial visit of a larger longitudinal study across three medical research centers. Participants included 58 adults (aged 18–89) with unilateral or bilateral transtibial or transfemoral amputation,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
| Amputation Type | Mean (SD) [range] or count (%) | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Unilateral TT | Unilateral TF* | All Unilateral (TT and TF) | |||||||||
|
|
|
|
|
|
|
|
|
| |||
| Age (years) | 63 | 59.6 | 34.0 | 56.5 | 73 | 54.7 | 41.5 | 54.4 | 57.6 | 36.5 | 55.6 |
| Weight (kg)§ | 118.0 | 94.0 | 73.4 | 92.1 | 90.7 | 94.1 | 99.8 | 94.4 | 94.0 | 82.2 | 93.1 |
| Height (cm) | 190.5 | 175.2 | 174.9 | 175.6 | 163.8 | 177.5 | 193.0 | 178.3 | 176.1 | 181.0 | 176.7 |
| Time since amputation (years) | 2.1 | 14.2 | 16.2 | 14.1 | 7.4 | 25.8 | 1.2 | 22.7 | 19.0 | 11.2 | 17.6 |
|
| |||||||||||
| Female | 0 (0%) | 5 (19%) | 1 (25%) | 6 (19%) | 0 | 1 (5%) | 0 | 1 (5%) | 6 (13%) |
| 47 (87%) |
| Male | 1 (100%) | 22 (81%) | 3 (75%) | 26 (81%) | 1 (100%) | 18 (95%) | 2 (100%) | 21 (95%) | 40 (87%) |
| 7 (13%) |
|
| |||||||||||
| Trauma | 0 | 14 (52%) | 4 (100%) | 18 (56%) | 0 | 11 (58%) | 2 (100%) | 13 (59%) | 25 (51%) | 6 (100%) | 31 (57%) |
| Congenital | 0 | 0 | 0 | 0 | 0 | 1 (5%) | 0 | 1 (5%) | 1 (2%) | 0 | 1 (2%) |
| Vascular disease | 1 (100%) | 5 (19%) | 0 | 6 (19%) | 1 (100%) | 2 (11%) | 0 | 3 (14%) | 7 (14%) | 0 | 9 (17%) |
| Infection | 0 | 5 (19%) | 0 | 5 (16%) | 0 | 1 (5%) | 0 | 1 (5%) | 6 (12%) | 0 | 6 (11%) |
| Cancer | 0 | 3 (11%) | 0 | 3 (9%) | 0 | 4 (21%) | 0 | 4 (18%) | 7 (14%) | 0 | 7 (13%) |
|
| |||||||||||
| K2 | 1 (3%) | 1 (5%) | 2 (4%) | ||||||||
| K3 | 27 (84%) | 19 (86%) | 46 (85%) | ||||||||
| K4 | 4 (13%) | 2 (9%) | 6 (11%) | ||||||||
|
| |||||||||||
| Unilateral TT | 27 (59%) | 4 (67%) | 32 (59%) | ||||||||
| Unilateral TF | 19 (41%) | 2 (33%) | 22 (41%) | ||||||||
| Bilateral TT/TT | N/A | N/A | N/A | ||||||||
| Bilateral TT/TF | N/A | N/A | N/A | ||||||||
| Bilateral TF/TF | N/A | N/A | N/A | ||||||||
|
| |||||||||||
| Civilian | 0 | 14 (52%) | 4 (100%) | 18 (56%) | 0 | 12 (63%) | 1 (50%) | 13 (59%) | 26 (57%) | 5 (83%) | 31 (57%) |
| Active Duty | 0 | 1 (4%) | 0 | 1 (3%) | 0 | 1 (5%) | 1 (50%) | 2 (9%) | 2 (4%) | 01 (17%) | 3 (6%) |
| Veteran | 1 (100%) | 12 (44%) | 0 | 13 (41%) | 1 (100%) | 6 (32%) | 0 | 7 (32%) | 18 (39%) | 0 | 20 (37%) |
|
| |||||||||||
| Employed Full-Time | 0 | 5 (19%) | 2 (50%) | 7 (22%) | 0 | 8 (42%) | 1 (50%) | 9 (41%) | 13 (28%) | 3 (50%) | 16 (30%) |
| Employed Part-Time | 0 | 3 (11%) | 0 | 3 (9%) | 0 | 3 (16%) | 0 | 3 (14%) | 6 (13%) | 0 | 6 (11%) |
| Retired | 0 | 9 (33%) | 0 | 9 (28%) | 1 (100%) | 5 (26%) | 0 | 6 (27%) | 14 (30%) | 0 | 15 (28%) |
| On Disability Leave | 1 (100%) | 4 (15%) | 0 | 5 (16%) | 1 (5%) | 1 (50%) | 2 (9%) | 5 (11%) | 1 (17%) | 7 (13%) | |
| Other | 0 | 5 (19%) | 1 (25%) | 6 (19%) | 0 | 1 (5%) | 0 | 1 (5%) | 6 (13%) | 1 (17%) | 7 (13%) |
| No Response | 0 | 1 (4%) | 1 (25%) | 2 (6%) | 0 | 1 (5%) | 0 | 1 (5%) | 2 (4%) | 1 (17%) | 3 (6%) |
|
| |||||||||||
| | N/A | N/A | N/A | N/A | |||||||
| MPK/Powered | 1 (100%) | 14 (74%) | 2 (100%) | 17 (77%) | See Unilateral TF (left) | See Unilateral TF (left) | See Unilateral TF (left) | ||||
| Mechanical (hydraulic) | 0 | 5 (26%) | 0 | 5 (23%) | |||||||
| | |||||||||||
| MADR/ESR‡ | 1 (100%) | 22 (81%) | 4 (100%) | 27 (84%) | 1 (100%) | 19 (100%) | 2 (100%) | 22 (100%) | 41 (89%) | 6 (100%) | 49 (91%) |
| Hydraulic | 0 | 3 (11%) | 0 | 3 (9%) | 0 | 0 | 0 | 0 | 3 (7%) | 0 | 3 (6%) |
| MPA | 0 | 1 (4%) | 0 | 1 (3%) | 0 | 0 | 0 | 0 | 1 (2%) | 0 | 1 (2%) |
| SACH | 0 | 1 (4%) | 0 | 1 (3%) | 0 | 0 | 0 | 0 | 1 (2%) | 0 | 1 (2%) |
| Mean (SD) [range] | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Amputation Level | Unilateral TT | Unilateral TF | All Unilateral (TT and TF) | ||||||||
|
| |||||||||||
| 10MWT – SSV (m/s) | 0.98 | 1.20 | 1.25 | 1.20 | 0.61 | 1.06 | 1.25 | 1.06 | 1.14 | 1.25 | 1.14 |
| 10MWT – FV (m/s) | 1.42 | 1.61 | 1.81 | 1.63 | 0.51 | 1.38 | 1.67 | 1.37 | 1.52 | 1.76 | 1.52 |
| 6MWT (m) | 371.5 | 430.0 | 566.6 | 445.8 | 59.5 | 382.0 | 499.0 | 372.2 | 409.8 | 553.1 | 416.1 |
| 5XSTS (sec) | 30 | 13.8 | 10.7 | 13.9 | 25.02 | 18.9 | 14.5 | 18.5 | 15.4 | 12.0 | 15.4 |
| N (%) Successful† | 25 (93%) | 4 (100%) | 30 (94%) | 12 (63%) | 2 (100%) | 15 (68%) | 37 (80%) | 6 (100%) | 45 (83%) | ||
| FSST (sec) | 15 | 11.7 | 9.2 | 11.5 | 52.39 | 14.6 | 11.2 | 16.1 | 12.9 | 9.8 | 13.3 |
| N (%) Successful† | 26 (96%) | 4 (100%) | 31 (97%) | 18 (95%) | 2 (100%) | 21 (95%) | 44 (96%) | 6 (100%) | 52 (96%) | ||
| BBS score | 50 | 49.0 | 54.0 | 49.6 | 38 | 46.3 | 55.5 | 46.7 | 47.9 | 54.5 | 48.4 |
| FGA score | 19 | 22.9 | 29.3 | 23.6 | N/A | 18.7 | 24.0 | 19.2 | 21.2 | 27.5 | 21.9 |
| AMP-Pro score | 40 | 41.5 | 45.8 | 42.0 | 37 | 37.8 | 43.5 | 38.3 | 40.0 | 45 | 40.5 |
| CHAMP score | N/A | 16.2 | 25.1 | 18.0 | N/A | 13.8 | 13.5 | 13.8 | 15.2 | 21.3 | 16.3 |
| N (%) Attempted† | 16 (59%) | 4 (100%) | 20 (63%) | 11 (58%) | 2 (100%) | 13 (59%) | 27 (59%) | 6 (100%) | 33 (61%) | ||
| Mean (SD) [range] | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Amputation Level | Unilateral TT | Unilateral TF | All Unilateral (TT and TF) | ||||||||
|
| |||||||||||
|
| |||||||||||
| Health Quality of Life Index | 56.1 | 60.3 | 62.0 | 60.4 | 57.1 | 58.0 | 58.0 | 59.3 | 61.2 | 59.4 | |
| Lower Extremity Functional Status | 58.3 | 55.2 | 72.3 | 57.5 | 44.1 | 54.7 | 54.6 | 55.0 | 70.4 | 56.3 | |
| Satisfaction with Device | 45.7 | 50.7 | 48.4 | 50.3 | 43.9 | 49.7 | 49.7 | 50.3 | 49.7 | 50.0 | |
|
| |||||||||||
| Ambulation | 78.3 | 68.3 | 86.7 | 70.9 | 37.6 | 69.5 | 68.9 | 68.8 | 86.9 | 70.1 | |
| Appearance | 97.0 | 76.2 | 69.0 | 76.0 | 88.3 | 70.8 | 72.3 | 74.0 | 72.2 | 74.5 | |
| Frustration | 69.1 | 73.0 | 75.5 | 73.2 | 99.0 | 69.1 | 71.5 | 71.4 | 78.2 | 72.5 | |
| Perceived Response | 90.6 | 88.5 | 91.0 | 88.7 | 96.6 | 91.8 | 92.0 | 89.9 | 91.1 | 90.1 | |
| Residual Limb Health | 20.4 | 73.4 | 73.3 | 71.7 | 87.5 | 79.0 | 80.0 | 75.7 | 76.7 | 75.0 | |
| Social Burdens | 92.6 | 81.1 | 83.8 | 82.0 | 56.0 | 83.8 | 82.4 | 78.1 | 85.2 | 82.2 | |
| Sounds | 25.0 | 70.5 | 60.4 | 67.8 | 92.0 | 76.2 | 78.1 | 72.8 | 68.1 | 71.9 | |
| Utility | 29.4 | 74.1 | 81.3 | 73.6 | 38.9 | 72.4 | 71.5 | 73.4 | 82.7 | 72.8 | |
| Well Being | 94.1 | 77.0 | 89.3 | 79.1 | 63.0 | 74.0 | 72.4 | 75.8 | 81.4 | 76.4 | |
| Mobility (Ambulation + Transfers) | 65.4 | 71.4 | 89.4 | 73.4 | 43.3 | 70.8 | 70.5 | 71.1 | 89.9 | 72.2 | |
|
| |||||||||||
| Score | 9.6 | 8.7 | 9.2 | 8.8 | 7.1 | 8.4 | 8.4 | 8.6 | 9.4 | 8.7 | |
|
| |||||||||||
| T-score | 52.7 | 53.3 | 58.6 | 53.9 | 41.5 | 52.3 | 52.1 | 52.9 | 58.8 | 53.2 | |
|
| |||||||||||
| Physical Function for Mobility Aid Users | 39.4 | 43.1 | 51.5 | 44.0 | 32.3 | 42.9 | 42.6 | 43.0 | 50.7 | 43.5 | |
| Mobility | 43.4 | 43.1 | 51.4 | 44.2 | 35.0 | 41.8 | 41.8 | 42.6 | 50.7 | 43.2 | |
| Depression | 56.8 | 46.4 | 51.4 | 47.4 | 54.2 | 49.3 | 49.7 | 47.6 | 51.6 | 48.3 | |
| Satisfaction with SRA | 35.2 | 52.6 | 56.5 | 52.5 | 40.3 | 53.0 | 52.0 | 52.7 | 54.3 | 52.3 | |
| Ability to Participate in SRA | 49.9 | 50.3 | 51.6 | 50.5 | 37.6 | 50.7 | 49.8 | 50.5 | 50.0 | 50.2 | |
|
| |||||||||||
| Score | 8 | 4.3 | 5.0 | 4.5 | N/A | 4 | 4.1 | 4.2 | 5.2 | 4.3 | |
| Depression Severity | None/Minimal | 16 (59.3%) | 2 (50%) | 18 (56.3%) | 13 (68.4%) | 13 (65%) | 29 (63%) | 2 (40%) | 31 (59.6%) | ||
| Mild | 8 (29.6%) | 2 (50%) | 11 (34.4%) | 5 (26.3%) | 6 (30%) | 13 (28.3%) | 3 (60%) | 17 (32.7%) | |||
| Moderate | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | 0 (0%) | |||
| Moderately-Severe | 2 (7.4%) | 0 (0%) | 2 (6.3%) | 1 (5.3%) | 1 (5%) | 3 (6.5%) | 0 (0%) | 3 (5.8%) | |||
| Severe | 1 (3.7%) | 0 (0%) | 1 (3.1%) | 0 (0%) | 0 (0%) | 1 (2.2%) | 0 (0%) | 1 (1.9%) | |||
|
| |||||||||||
| Important Activities Done Often Enough (%) | 75.0 | 78.1 | 53.9 | 75.0 | 25.0 | 65.6 | 64.9 | 73.0 | 61.3 | 71.0 | |
| Involvement in Life Situations | 58.2 | 53.7 | 56.9 | 54.3 | 51.5 | 57.1 | 56.7 | 55.1 | 56.6 | 55.2 | |
| Control over Participation | 63.6 | 65.9 | 62.3 | 65.4 | 65.0 | 66.8 | 66.7 | 66.3 | 63.1 | 65.9 | |
- —U.S. Department of Defense10.13039/100000005
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProsthetics and Rehabilitation Robotics · Diabetic Foot Ulcer Assessment and Management · Total Knee Arthroplasty Outcomes
Introduction
Lower limb amputation (LLA) often adversely impacts mobility, demanding a multidisciplinary rehabilitation approach with reliable performance-based and patient-reported outcome benchmarks to guide clinical care and track improvements. Establishing normative values for LLA outcomes is crucial for advancing rehabilitative medicine and enhancing clinical care in prosthetics (1, 2) These benchmarks help healthcare professionals set realistic rehabilitation goals and expectations for improved functioning and quality of life. By identifying a standard range for individuals in similar conditions, clinicians can tailor rehabilitation programs to be both achievable and challenging.
Normative values are essential for clinicians to justify prosthetic prescriptions and advocate for reimbursement. They provide the empirical data to demonstrate the impact of prosthetic use and support the case for investment based on anticipated health outcomes. In clinical research, normative values offer a reference for assessing the efficacy of new treatments, prosthetic components, and therapeutic interventions/technologies, facilitating the advancement of evidence-based practices (3).
Despite their importance, LLA-specific normative values, typically derived from large, representative samples, are limited or unavailable for many commonly used outcome measures. Tools such as the Amputee Mobility Predictor Assessment Tool (AMP), the Comprehensive High Level Mobility Predictor (CHAMP), and the Prosthetic Limb User Survey for Mobility (PLUS-M) have been specifically designed and validated for the LLA population and have established normative values (4–7) However, widely used measures such as the Functional Gait Assessment (FGA) and the Patient-Reported Outcomes Measurement Information System (PROMIS) Mobility scale, were originally validated in older adult or neurologic populations, and although they are frequently used with prosthesis users, robust LLA-specific normative data are still lacking. This often leads to inappropriate comparison with the general population or other diagnoses, potentially adversely influencing prosthetic prescriptions, insurance reimbursements, and rehabilitation outcomes.
Finally, for normative values to be useful, both standardization and methodological transparency are vital. Standardized administration and scoring allow researchers and clinicians to report, share, and compare data across studies and clinical practices. Standardized assessments also enable data aggregation, meta-analyses, and systematic reviews enhancing conclusions about the effectiveness of therapeutic interventions and prosthetic components. However, when full standardization is not feasible or has not yet been universally adopted, clear and detailed reporting becomes essential. Transparent reporting helps contextualize the findings, ensures that observed differences across studies can be attributed to patient factors rather than methodological inconsistencies and allows others to assess the degree to which normative values are comparable across studies or clinical environments.
Efforts to address these gaps are ongoing, but significant data shortages remain, particularly regarding standardized procedures for administering and interpreting these measures. This study takes a first step toward addressing this need by providing population-specific reference values for commonly used performance-based and patient-reported outcome measures, together with clear documentation of the administration procedures followed at each site. Although subgroup sample sizes do not allow for the establishment of formal normative standards across all amputation and functional categories, the resulting reference values provide a practical starting point for interpreting clinical outcomes, supporting goal setting and facilitating comparison across research settings without implying normality. Ultimately, by reporting scores stratified by amputation and K level and by detailing how each measure was administered, this work supports more informed clinical decision-making and promotes greater methodological consistency in both research and clinical practice.
Methods
Participants
Sixty-five participants were recruited from 2020 to 2022 through research registries and prosthetics clinics of three medical research facilities within the scope of a larger clinical trial (ClinicalTrial.gov NCT03930199). The study was approved by the Institutional Review Board of each site, and all participants provided written informed consent.
Inclusion criteria were individuals aged 18–89 years with unilateral or bilateral LLA at the transtibial or transfemoral level, including knee disarticulations, who had been using at least one definitive prosthesis (i.e., a stable, non-preparatory prosthesis with a finished socket and componentry not anticipated to change) for at least six months. Participants were included if they had a designated Medicare Functional Classification Level, or equivalent, (K-Level) of K2-4 (i.e., at least a household ambulator). Exclusion criteria included co-morbidities that would limit prosthesis use, such as a cerebral vascular accident or severe traumatic brain injury, and unwillingness to use a study-related smartphone app.
Data collection
In the broader study, participants engaged in up to 4 testing sessions, spaced three months apart, during each of which the same battery of 8 performance-based and 11 patient-reported assessments was administered. To avoid the effects of practice and exposure, the current report includes only data collected at the first session. To ensure measurement consistency within this study and in future comparisons, established protocols were followed for each measure as outlined in Appendix 1. Relevant pdf forms and scoring guides are provided in Supplementary Material 1.
Performance-based outcome measures
The performance-based assessments were all completed in the research lab or clinic and included the Ten-Meter Walk Test (10MWT), Six-Minute Walk Test (6MWT), Five-Time Sit to Stand Test (5XSTS), Four Square Step Test (FSST), Berg Balance Scale (BBS), FGA, AMP, and the CHAMP. Data for performance-based measures were recorded on paper, later digitalized and stored in a REDCap database hosted at Northwestern University (8).
The 10MWT measures walking speed (9) by timing participants over the middle 10 meters of a 14-meter walk, with trials recorded at both a self-selected (SSV) and fast velocity (FV). The 6MWT assesses walking endurance (10) by measuring the distance participants cover within a 6-minute timeframe. The 5XSST provides a measure of lower extremity strength and transfer skill where the score is the time to stand and sit 5 times (11). The FSST measures functional mobility by timing participants as they step through a 2 × 2 grid on the floor, requiring changing step direction and increased foot clearance (12). The BBS quantifies static balance control and risk of falling through 14 balance tasks (13). The FGA assesses postural stability and balance during 10 walking tasks (14). The AMP measures the mobility potential of a person with LLA, and is often used to derive an appropriate K-Level (4). Participants perform static and dynamic sitting and standing activities, as well as transfer and gait skills of progressive difficulty. The CHAMP was designed to mitigate the ceiling effect of traditional outcome measures used for highly active individuals with LLA (15).
Patient-reported outcome measures
The patient-reported outcome measures included the Orthotics and Prosthetics Users’ Survey (OPUS), Modified Falls Efficacy Scale (mFES), Prosthesis Evaluation Questionnaire (PEQ), Prosthetic Limb User Survey of Mobility (PLUS-M), five different PROMIS tests, Patient Health Questionnaire – 9 (PHQ-9), and Community Participation Indicators (CPI). Participants entered responses directly into validated digitalized forms in REDCap (16). Participants had the option to complete the electronic forms at home up to one week before the performance-based assessments. Those unable to fill out electronic forms were provided paper versions during the in-person visit, with researchers subsequently entering the data into REDCap.
The OPUS questionnaire assesses the effectiveness of orthotic and prosthetic devices in meeting user needs and their impact on quality of life (17).The mFES is a 14-item questionnaire that evaluates fear of falling by measuring an individual's perceived balance during daily activities (18). The PEQ evaluates a person's satisfaction with their prosthesis over various functional and social domains, such as ambulation and appearance (19, 20). The PLUS-M is a questionnaire that assesses prosthetic users’ perceived ability to perform indoor and outdoor activities that require use of both lower limbs (21). PROMIS, created by the National Institute of Health, are an expansive system of non-population-specific, person-centered measures for assessing physical, mental, and social health (22). The PHQ-9 is a 9-item questionnaire used to assess the presence and severity of depressive symptoms (23). The CPI is a measure of a person's satisfaction with their participation in the community (24).
For all outcome measures except 5XSTS, FSST, PROMIS Depression, and PHQ-9, a higher score is considered a better outcome.
Data analysis
The primary objective of our analysis is to summarize reference data across various subgroups within the LLA population. Descriptive analyses detailed participant demographic characteristics and outcome measures, stratified by both amputation level and K-level. Participants with knee disarticulations were included with the TF group for the purposes of analysis.
Results
Sixty-five participants enrolled in the larger clinical trial. Prior to the first assessment session, four participants withdrew from the study for health-related reasons and three due to conflicting time commitments. The remaining fifty-eight were included in this analysis (demographic information is summarized in Table 1). Overall, participants had a mean [range] age of 56 [24–76] years. Most were male (88%), had a K-Level of K3 (84%), and had unilateral amputations (93%).
Scores from the performance-based outcome measures are summarized in Table 2. Of note, 11 participants (19%) were not able to complete the 5XSTS test as they were unable to perform the test without using their arms. Two participants (3.4%) were unable to complete the FSST without touching a cane on the floor. Thirty-four participants (59%), all either K-level 3 or 4, performed the CHAMP. Scores from patient-reported outcome measures are summarized in Table 3. One participant did not complete any patient-reported measures due to time constraints.
Complete data for each participant and outcome measure are available in Supplementary Material 2, which also provides detailed information on assistive device use, level of assistance, prosthetic foot type, and when applicable knee component type, as well as other specific testing conditions.
Discussion
This study enriches existing literature by providing population-specific reference values for outcome measures used in the LLA population, together with clear documentation of the administration procedures applied across sites. By reporting scores stratified by K-level for each amputation level and in aggregated groups, this study allows clinicians and researchers to interpret outcomes within the context of comparable patient profiles, supporting goal setting, progress monitoring, and communication with payers, while avoiding inappropriate comparisons with general population norms. Reporting individual-level scores and demographic characteristics further promotes transparency and facilitates future cross-study comparisons.
To the best of our knowledge, this study is the first to report scores for the LLA population on the PROMIS Physical Function for Mobility Aid Users and Mobility scales, and the CPI Percentage of Important Activities Done Often Enough. Apart from Erbes et al. (25) (N = 235, Veterans, mean age 64 years), who had a notably different cohort and scored differently on all common measures, we are also the first to report scores for the PROMIS Ability to Participate in SRA and CPI Involvement in Life Situations and Control over Participation scales for an LLA population.
The average scores found are similar to those of existing literature for the following outcome measures used with an LLA population: FGA (26); AMP (4, 5, 27); PEQ: Ambulation, Appearance, Frustration, Residual Limb Health, and Utility (27); PHQ-9 (28, 29); and again excluding Erbes et al., PROMIS Depression and Satisfaction with SRA scales (30).
For the remaining outcomes, deviations from previous reports can likely be explained by differences in cohort composition or testing protocol, as outlined below. In all but a few cases, scores exhibited expected trends for functional capacity of various groups (e.g., those designated K4 outperformed K3; the TT group outperformed the TF group).
10MWT
For each K-level, 10MWT speeds, both SSV and FV, are faster than those previously reported by Sions et al. (N = 55) and Beisheim et al. (N = 41) (31, 32). A critical methodological difference could account for the lower speeds of the other studies, across all groups: calculating the speed by first averaging the 3 trial times then dividing by the distance—rather than calculating the speed of each trial then averaging—would underestimate the actual average speed. Additionally, both of these studies used a shorter track: a 6 m timed track segment compared to our 10 m. Though both versions excluded the acceleration and deceleration phases of the walk, the difference in track length might still affect the calculated speed. Finally, in both prior studies consisted of a substantially larger proportion of participants with vascular-disease etiology (58% and 40%) compared to ours (17%). Individuals with LLA from vascular-disease etiologies have consistently been shown to walk slower than those with other etiologies (33, 34). On the other hand, our walking speeds align closely with the self-selected speeds reported by Akarsu et al. (35) for individuals with both unilateral and bilateral LLA, and with those calculated for all unilateral sub-groups from Sawers et al. (36). However it is unreported whether Akarsu et al. timed the acceleration and deceleration stages of the walk. This is the first study to report scores categorized by K-level for both amputation levels (TT and TF) at both assessed speeds and with a 10 m timed length.
6MWT
In our study, the mean 6MWT scores are markedly higher than those reported by Sions et al. (N = 35 K3 and N = 20 K4) (31). As their participants were also slower walkers than ours on the 10MWT, we expect this is a difference of the makeup of the cohorts. Similarly, our scores were appreciably higher than those reported by Resnik et al. (N = 44, unilateral) (27). A plausible explanation is the 10-year higher mean age of their participants. Gailey et al. report 6MWT scores by K-level for 167 LLA but used a different protocol: where other studies instructed participants to “cover as much ground as possible”, Gailey et al. instructed participants to walk at a comfortable speed (4). In fact, their participants walked a noticeably shorter distance on average across all groups. Our study is the first to report 6MWT scores by both amputation level and K-level, with the directive to cover the greatest distance.
5XSTS and FSST
Two previous studies report 5XSTS and FSST scores for an LLA cohort (37, 38). Beisheim et al. (N = 67) observed considerably better scores (lower completion times) for the overall group compared to those in our study (37). For the 5XSTS, this can be attributed to a shorter portion of the test being timed. Specifically, they stopped timing at the peak of the 5th stand while we stopped when the participant returned to a seated position. For the FSST, Beisheim et al. used tape on the floor instead of canes which likely facilitated the task by requiring less effort and time to lift feet over obstacles. Wilken et al. (N = 19) have significantly better scores on both tests, likely because their participants were all active-duty military personnel, who are presumably more functional on average than those in our study (38). Considering differences in protocol and cohort composition, the observed trends in scores are reasonable, and align with expected functional differences among various groups. The high failure rate in the 5XSTS should be considered before selecting it as an evaluation tool for LLA.
BBS
Previous research has often aggregated BBS scores for individuals with unilateral TT and TF LLA (39), or for unilateral and bilateral LLA (40), stratified them by various balance abilities (41, 42), or included only one amputation level (41–43). In our study, we observed notable differences between unilateral K3 and K4 participants and between unilateral and bilateral participants, indicating the utility in reporting scores by these group (though no distinct difference was found between TT and TF participants).
CHAMP
Our CHAMP results were markedly lower than those reported by Gailey et al. for each TT, TF, and combined unilateral amputations (6). This difference is likely again in the respective demographics of the cohorts: their participants were all active duty or retired service men with a mean age of 29 ± 6 years, where our participants were primarily non-military with a mean age of 56 ± 16 years. Of note, Gailey et al. required participants to have minimum AMP and 6MWT scores in order to participate in the CHAMP, while we only required that participants be able to safely stand from prone on the ground. However, we expect the difference in protocol to have had little effect, as scores in our study were comparable to those of two other studies (7, 44) who used similar protocols to Gailey, but with cohorts more similar to ours (non-military, average age >38 years).
OPUS
Resnik et al. reported scores for the same components of the OPUS without distinguishing between K-level or type of amputation (27). Their scores for the Health-related Quality of Life and for Lower Extremity Functional Status components were notably lower compared to those in the current study, while their Satisfaction with Device scores were not. A plausible explanation for this discrepancy is the update to the OPUS scoring guide, which included minor modifications to the Rasch code tables. For this reason, raw scores are provided in Supplementary Material 2.
mFES
Barnett et al. represents the only other paper, to our knowledge, that explored the mFES with the LLA population (45). Their research reported mFES scores exclusively for 7 unilateral TT males 1 to 6 months post-discharge from rehabilitation. Considering our TT participants have substantially more experience with prostheses (13 years more, on average), it is understandable that participants in our study reported markedly higher confidence in balance than those in Barnett's study.
PEQ
PEQ scores for Perceived Response, Social Burden, and Well-Being, were notably different than those of Resnik et al. (27). As with the 6MWT, the difference with scores from Resnik et al. is likely explained by the difference in average age of the two study groups. Our scores for the Mobility subscale were markedly higher than those reported by Franchignoni et al. (46). First, our participants had a longer mean time since amputation (18.3 years), allowing for better long-term adaptation to prosthetic use, compared to the shorter adaptation period in Franchignoni's (up to 5 years) study. Second, Hafner's study had a higher percentage of participants with vascular-disease etiologies (43.7%) compared to ours (17%), who may struggle with the mobility tasks assessed more than those with other etiologies. These factors might explain why our cohort demonstrated higher mobility scores.
PLUS-M
While our participants’ PLUS-M T-scores are higher on average than those of Hafner et al. (5) and Erbes et al. (25), they are similar to those of the development sample (21). Additionally, Erbes's cohort had a mean age 10 years greater than ours, aligning with the inverse relationship between age and score exhibited in the development sample.
Study limitations
While this report provides score summaries by K-level, the majority of participants enrolled in this study had unilateral amputation classified at a K3 level (only 3 K2-level, 6 K4-level, and 4 bilateral participants were enrolled), which limit the utility of this dataset for those specific sub-categories. In addition, the sample included a predominance of male participants, reflecting typical enrollment patterns in prosthetic rehabilitation studies, particularly that include military and veteran populations. This imbalance limits the ability to explore potential sex-specific differences in performance and patient-reported outcomes, despite evidence that such differences exist in body composition, strength, gait biomechanics, and health and quality-of-life measures between males and females (47–49) As such, reference values for less-represented groups (e.g., K2, K4, bilateral amputees and female) should be interpreted cautiously, used in conjunction with individual patient factor and considered as one component of a broader clinical assessment rather than as standalone indicators of performance or recovery. Future work should aim to recruit a more balanced cohort across sex and functional levels to confirm reference ranges, explore potential sex-specific patterns, and establish meaningful clinically important difference metrics for these outcome measures.
Conclusions
This study provides clear documentation of the administration procedures used for each outcome measure and presents population-specific reference values stratified by amputation levels and K-levels, aiming to clearly articulate where our findings align or diverge from existing literature. We explore potential reasons for any discrepancies, including variations in test execution, scoring, and participant characteristics. This work also highlights areas where LLA-specific normative data remain limited and, while not intended to establish formal norms, the present findings offer preliminary reference values that can aid interpretation, support clinical benchmarking, and inform future research and rehabilitation planning. Building on this foundation, future research should incorporate longitudinal study designs to characterize changes in outcomes and to establish empirically based criteria for clinically meaningful change over time.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Heinemann AW Connelly L Ehrlich-Jones L Fatone S. Outcome instruments for prosthetics. Phys Med Rehabil Clin N Am. (2014) 25(1):179–98. 10.1016/j.pmr.2013.09.00224287247 · doi ↗ · pubmed ↗
- 2Porter ME. What is value in health care? N Engl J Med. (2010) 363(26):2477–81. 10.1056/NEJ Mp 101102421142528 · doi ↗ · pubmed ↗
- 3Kelly MP Heath I Howick J Greenhalgh T. The importance of values in evidence-based medicine. BMC Med Ethics. (2015) 16(1):69. 10.1186/s 12910-015-0063-326459219 PMC 4603687 · doi ↗ · pubmed ↗
- 4Gailey RS Roach KE Applegate EB Cho B Cunniffe B Licht S The amputee mobility predictor: an instrument to assess determinants of the lower-limb amputee's ability to ambulate. Arch Phys Med Rehabil. (2002) 83(5):613–27. 10.1053/apmr.2002.3230911994800 · doi ↗ · pubmed ↗
- 5Hafner BJ Gaunaurd IA Morgan SJ Amtmann D Salem R Gailey RS. Construct validity of the prosthetic limb users survey of mobility (PLUS-M) in adults with lower limb amputation. Arch Phys Med Rehabil. (2017) 98(2):277–85. 10.1016/j.apmr.2016.07.02627590443 PMC 5276724 · doi ↗ · pubmed ↗
- 6Gailey RS Gaunaurd IA Raya MA Roach KE Linberg AA Campbell SM Development and reliability testing of the comprehensive high-level activity mobility predictor (CHAMP) in male servicemembers with traumatic lower-limb loss. J Rehabil Res Dev. (2013) 50(7):905–18. 10.1682/JRRD.2012.05.009924301428 · doi ↗ · pubmed ↗
- 7Anton A Legault Z Dudek N. Validity of the comprehensive high-level activity mobility predictor in a heterogeneous population with lower extremity amputations. Prosthet Orthot Int. (2020) 44(2):60–5. 10.1177/030936461988755931942810 · doi ↗ · pubmed ↗
- 8Harris PA Taylor R Minor BL Elliott V Fernandez M O'Neal L The RED Cap consortium: building an international community of software platform partners. J Biomed Inform. (2019) 95:103208. 10.1016/j.jbi.2019.10320831078660 PMC 7254481 · doi ↗ · pubmed ↗
