# Interobserver reliability of the pivot shift test: A modified classification improves agreement

**Authors:** Juan P. Martinez‐Cano, Sebastian Mejia‐Barreto, Jacobo Triviño‐Arias, Maria C. Gomez‐Ayala, Alejandro Mejia, Juan F. Londoño, Ruben Guzman

PMC · DOI: 10.1002/jeo2.70354 · 2025-08-05

## TL;DR

This study shows that a modified classification of the pivot shift test improves agreement among doctors when assessing knee instability in ACL injuries.

## Contribution

The novel contribution is a modified pivot shift test classification that increases interobserver reliability by combining grades II and III into a single high-grade category.

## Key findings

- The modified classification achieved substantial agreement (kappa 0.73) compared to fair agreement (kappa 0.39) with the classic classification.
- Combining grades II and III into a high-grade category improved agreement from poor/fair to substantial.
- The modified classification is recommended for clinical and research use due to better reliability.

## Abstract

The pivot shift test evaluates the anterolateral rotational instability of the knee in patients with anterior cruciate ligament (ACL) injuries. The aim of this study was to evaluate the interobserver reliability of the classic pivot shift test and a modified classification.

An interobserver reliability study involving 4 observers and 17 patients with high suspicion of ACL injury. Observers were blind to diagnostic images and independently evaluated each patient. Kappa–Fleiss was used to assess the pivot shift test agreement between observers. Interobserver reliability was calculated for the classic classification (grades I, II and III), as well as for a modified classification in low‐grade (I) and high‐grade pivot shift (II and III). Kappa agreement was categorised as poor (<0.00), slight (0.00–0.20), fair (0.21–0.40), moderate (0.41–0.60), substantial (0.61–0.80) and perfect (0.81–1.00).

Patients had mean age of 29 years (range: 24–32) with similar sex distribution (male: 53%). The global interobserver reliability assessment favoured the modified (0.73) over the classic classification (0.39). Agreement for the classic categories (negative: 0.87, grade I: 0.61, grade II: 0.06 and grade III: 0.34), was surpassed by the modified approach (negative: 0.87, low‐grade: 0.61 and high grade: 0.73).

Our study demonstrated that the modified pivot shift test grading improves interobserver reliability compared to the classic classification. By combining the grade II and III categories into a single high‐grade category, we transformed poor and fair agreement into substantial agreement. The modified pivot shift classification can be utilised in both clinical practice and research, particularly due to its clinical implications for decision‐making in patients with grade II and III pivot shifts, which may be similar.

Level I.

## Full-text entities

- **Genes:** PHF1 (PHD finger protein 1) [NCBI Gene 5252] {aka MTF2L2, PCL1, TDRD19C, hPHF1}
- **Diseases:** falls (MESH:C537863), LET (MESH:D011111), anterolateral (MESH:D056988), pain (MESH:D010146), valgus (MESH:D060906), re-tear (MESH:D000084063), II (MESH:C537730), pivot shift (MESH:D020178), rupture (MESH:D012421), hyperextension (MESH:C563315), knee osteoarthritis (MESH:D020370), trauma (MESH:D014947), knee laxity (MESH:D007593), ACL rupture (MESH:D000070598), rotational instability (MESH:D009759), glide of the knee (MESH:D007718), sports injuries (MESH:D001265), traffic accidents (MESH:D000081084)
- **Chemicals:** LET (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

---
Source: https://tomesphere.com/paper/PMC12322695