# Inter- and intra-rater agreement among novices and comparison to an experienced consensus using the Gächter scale for the evaluation of septic joints

**Authors:** Tiffany I. Stockman, Michael P. Kowaleski, Jacqueline M. Hicks, Chanel N. Berns, W Brian Saunders, Robert J. McCarthy

PMC · DOI: 10.3389/fvets.2025.1577046 · Frontiers in Veterinary Science · 2025-07-15

## TL;DR

This study found that novice raters had low agreement when using the Gächter scale to evaluate septic joints in dogs, compared to experienced experts.

## Contribution

The study highlights the unreliability of the Gächter grading scale when applied by novices to canine septic joints.

## Key findings

- Intra-rater agreement among novice raters was consistently low.
- Inter-rater agreement between novices and an expert consensus was unreliable.
- Training did not significantly improve agreement over time.

## Abstract

The purpose of this retrospective study was to determine the inter-and intra-rater agreement among novice raters, as well as agreement between novice raters and an experienced consensus using the Gächter grading scale for the evaluation of the severity of septic joints in dogs.

Three surgical residents served as novice raters, and two American College of Veterinary Surgery (ACVS) diplomates, experienced with arthroscopic evaluation of canine joints, served as the experienced consensus. Arthroscopy images were first evaluated by the experienced consensus and scored using the Gächter scale. After two supervised training sessions, novices applied the scale twice to the same images, 2 weeks apart.

The application of the Gächter grading scale was unreliable in dogs when utilized by novice raters.

Both the intra-rater agreement measured among the three novice raters and inter-rater reliability comparing the three novice raters to an expert consensus showed a consistently low concurrence among the individuals when tested at two separate time intervals. Lack of skill with arthroscopy, awareness of the anatomy and potential anatomic variations, and inadequate training in the application of the Gächter grading scheme could play a large part in a novice’s ability to apply the grading scale to a septic joint. Inter-rater agreement, while initially moderate, had a decreasing concurrence between the two-time intervals.

## Full-text entities

- **Diseases:** inflammatory (MESH:D007249), Infection (MESH:D007239), Septic joints (MESH:D001170), joint disease (MESH:D007592), joint effusion (MESH:D000080324), dental disease (MESH:D009057), fever (MESH:D005334), TS (MESH:D005879), hyperemia (MESH:D006940), cartilage erosion (MESH:D002357), pain (MESH:D010146), rheumatoid arthritis (MESH:D001172), lameness (MESH:D007794), sepsis (MESH:D018805), osteoarthritis (MESH:D010003)
- **Species:** Staphylococcus pseudintermedius (species) [taxon 283734], Homo sapiens (human, species) [taxon 9606], Oryctolagus cuniculus (domestic rabbit, species) [taxon 9986], Canis lupus familiaris (dog, subspecies) [taxon 9615], Actinomyces (genus) [taxon 1654], Pasteurella (genus) [taxon 745], Staphylococcus schleiferi (species) [taxon 1295]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12306484/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12306484/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12306484/full.md

---
Source: https://tomesphere.com/paper/PMC12306484