# A Comparison of the Psychometric Properties of PROMIS Computer Adaptive Tests and Short Forms Vs Legacy Patient-Reported Outcome Measures in Total Knee Arthroplasty Patients

**Authors:** Christel Braaksma, Nienke Wolterbeek, Maurits Remmelt Veen, Rudolf Wilhelm Poolman, Yvette Pronk, Ariena Jorien Rasker, Raymond Willem Jozef Gerardus Ostelo, Caroline Barbara Terwee

PMC · DOI: 10.1016/j.artd.2026.101964 · Arthroplasty Today · 2026-03-06

## TL;DR

This study compares PROMIS measures with traditional tools for evaluating outcomes in knee replacement patients, finding PROMIS more efficient and accurate.

## Contribution

The study provides empirical evidence that PROMIS measures outperform legacy PROMs in psychometric properties for TKA patients.

## Key findings

- PROMIS CATs and SFs showed lower measurement error and fewer extreme scores compared to legacy PROMs.
- PROMIS PF measures required fewer items while maintaining reliability and validity.
- PROMIS PI-8a was most effective for pain assessment without extreme scores.

## Abstract

Traditionally used PROMs evaluating total knee arthroplasty (TKA) have large measurement error and limited measurement range. Patient-Reported Outcomes Measurement Information System (PROMIS) has theoretical possibilities to overcome these limitations. This study evaluates the psychometric properties of PROMIS vs legacy PROMs in TKA patients.

210 patients were included from 3 orthopaedic departments. Patients completed a questionnaire twice in a 2-week interval, including 2 PROMIS Computer Adaptive Tests (CATs), assessing Physical Function (PF) and Pain Interference (PI), 5 PROMIS short forms (SF) (PF-SF8b, PF-SF10a, PF-SF-20a, PI-SF8a, PI-1a) and 6 legacy PROMs (KOOS, KOOS-PS, WOMAC, OKS, 2 numeric rating scales). Reliability, measurement precision, smallest detectable change (SDC), construct validity, burden and extreme scores were investigated.

All PROMIS CATs, SFs, and legacy PROMs showed adequate test–retest reliability. PROMIS CATs and SFs showed sufficient construct validity. Regarding PF, the SDC varied between 2.8-6.1 for PROMIS and 7.3-19.9 for legacy PROMs. PROMIS CAT and SFs showed no extreme scores using 5-20 items, legacy PROMs showed 1.2%-3% extreme scores using 7-17 items. Concerning pain, the SDC varied between 2.8-6.1 for PROMIS and 22.5-35.4 for legacy PROMs. PROMIS showed 0%-9.5% extreme scores using 5-8 items and legacy PROMs 8.7%-19.1% using 1-9 items.

PROMIS CAT and SFs seem more efficient assessing PF in TKA compared to legacy PROMs by offering reduced burden and measurement error, and minimizing extreme scores. PROMIS PI-8a seems most suitable in measuring pain without extreme scores. This study supports a shift from legacy PROMs toward PROMIS in TKA patients.

## Full-text entities

- **Diseases:** Pain (MESH:D010146)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12993152/full.md

## Figures

1 figure with captions in the complete paper: https://tomesphere.com/paper/PMC12993152/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12993152/full.md

---
Source: https://tomesphere.com/paper/PMC12993152