# Temporal consistency of behavior trait measurements in guide dogs

**Authors:** Emma K. Hilby, Aaron Rendahl, Jane Russenberger, Madeline Zimmermann, James R. Mickelson, Molly E. McCue

PMC · DOI: 10.3389/fvets.2025.1549360 · Frontiers in Veterinary Science · 2025-07-04

## TL;DR

This study shows that behavior scores for guide dogs are consistent over time and can predict future performance, with early scores indicating likely outcomes.

## Contribution

The study demonstrates the BCL's predictive validity and temporal consistency in assessing guide dog behavior traits.

## Key findings

- Dogs with high early BCL scores tend to maintain high scores over time.
- Poorly scoring dogs either improve with training or are removed from training programs.
- BCL evaluations within 3-6 months show the highest agreement.

## Abstract

Guide dog organizations have strict criteria to breed, raise, and select dogs to assist people with visual impairments. In collaboration with Dr. James Serpell, several guide dog training organizations developed a scoring tool called the Behavior Checklist (BCL) to evaluate candidate guide dogs. The tool’s use has expanded to the entire assistance dog industry and is rapidly emerging as the standard behavior assessment. Since 2003, Guiding Eyes for the Blind (GEB) has used the BCL to measure individual dogs’ behaviors up to 8 times between puppyhood and final placement. Here, we evaluate the consistency of the BCL over multiple evaluations in a population of 3,969 Labrador Retrievers raised by Guiding Eyes. We grouped BCL evaluations by two methods, factor analysis, and trainer-defined groups, and summarized groupings of behavior in two ways, using mean and lowest scores. We then determined the agreement between pairs of evaluations using kappa statistics and the predictive capacity of early BCL scores to predict later scores using positive and negative predictive values. Evaluations that are similar in nature and those that are scored within 3 to 6 months of one another agree the most. When a dog scores well early in life, they are likely to consistently score well and the dog’s behavior is unlikely to regress over time. We also found that dogs who score poorly early in life either improve their scores on later evaluations with training intervention or are removed from training. One limitation of this data is that dogs who score poorly at early time points are often removed from training and the data from later BCL evaluations is biased toward higher-scoring dogs. Regardless, these data show that the BCL is an effective way to evaluate assistance dog behavior and has some predictive capacity.

## Full-text entities

- **Diseases:** visual impairments (MESH:D014786)
- **Species:** Canis lupus familiaris (dog, subspecies) [taxon 9615], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12272753/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12272753/full.md

## References

23 references — full list in the complete paper: https://tomesphere.com/paper/PMC12272753/full.md

---
Source: https://tomesphere.com/paper/PMC12272753