# Do evaluative statements in facial identification overstate the strength of the evidence?

**Authors:** Nada Aggadi, Reuben Moreton, Thomas Busey

PMC · DOI: 10.1111/1556-4029.70265 · Journal of Forensic Sciences · 2026-01-26

## TL;DR

Reanalyzing black box and proficiency test data, this study finds that the verbal scales used by facial identification examiners may overstate the strength of their evidence by several orders of magnitude.

## Contribution

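The paper reanalyzes a comprehensive facial identification black box study and multiple proficiency tests, fitting ordered probit models to the responses of individual examiners and examiner teams to derive likelihood ratios that quantify the strength of the evidence for each comparison.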

## Key findings

- Likelihood ratios derived from the reanalyzed data can be lower than the values implied by current evaluative statements.
- The analyses suggest that examiners' language overstates the strength of the evidence by several orders of magnitude.
- Ordered probit models summarize the distribution of examiner responses and yield a set of likelihood ratios for each group and test.
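To make "orders of magnitude" concrete, the gap between the likelihood ratio implied by a verbal statement and the one supported by the data can be expressed as a base-10 log difference. The sketch below is illustrative only: the function name `overstatement_orders` and both numeric values are assumptions chosen for the example, not figures from the paper.

```python
import math


def overstatement_orders(implied_lr: float, empirical_lr: float) -> float:
    """Return log10(implied_lr / empirical_lr).

    A positive result means the stated likelihood ratio exceeds the
    empirically supported one by that many orders of magnitude.
    """
    if implied_lr <= 0 or empirical_lr <= 0:
        raise ValueError("likelihood ratios must be positive")
    return math.log10(implied_lr) - math.log10(empirical_lr)


# Hypothetical values for illustration only (NOT taken from the paper):
# a verbal category whose numeric equivalent is 10,000, compared with an
# empirically derived likelihood ratio of 10.
gap = overstatement_orders(implied_lr=10_000.0, empirical_lr=10.0)
print(f"Overstatement: {gap:.1f} orders of magnitude")
```

Working on the log scale keeps the comparison symmetric: a negative value would indicate that the verbal statement understates the evidence.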

## Abstract

Facial identification examiners assess whether two facial images—such as an image of an unknown person from surveillance footage and a controlled image of a known individual—depict the same person or different people. To communicate their observations, they rely on predefined verbal articulation scales that sometimes have associated numeric equivalents. However, these terms have not been calibrated against the actual strength of the evidence except indirectly through proficiency tests and black box studies. The present research reanalyzes the findings of face comparisons from the most comprehensive facial identification black box study to date, as well as multiple facial examination proficiency tests, to generate a quantitative measure of the strength of the evidence for each comparison. We used an ordered probit model to summarize the distribution of responses of both individual examiners and examiner teams to produce a set of likelihood ratios for each group and test. The likelihood ratios can be lower than values implied by the evaluative statements, which do not seem to justify the strengths of evidence implied by current articulation scales used in facial comparisons. Our analyses suggest that examiners are using language that overstates the strength of the evidence by several orders of magnitude.
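As a rough illustration of how an ordered probit model can turn categorical examiner responses into likelihood ratios, the sketch below treats each response as a thresholded latent normal variable and divides the probability of a response category under same-source comparisons by its probability under different-source comparisons. This is a minimal sketch under stated assumptions, not the authors' implementation: the five-category scale, the thresholds, and the latent means `mu_same` and `mu_diff` are all hypothetical, and in practice these parameters would be estimated from the response data.

```python
from statistics import NormalDist


def category_probs(mu: float, thresholds: list[float]) -> list[float]:
    """Ordered probit category probabilities.

    The latent variable is Normal(mu, 1); category k is chosen when the
    latent value falls between thresholds k-1 and k.
    """
    std = NormalDist()  # standard normal
    cdf = [0.0] + [std.cdf(c - mu) for c in thresholds] + [1.0]
    return [hi - lo for lo, hi in zip(cdf, cdf[1:])]


def likelihood_ratios(mu_same: float, mu_diff: float,
                      thresholds: list[float]) -> list[float]:
    """LR per category: P(response | same source) / P(response | different source)."""
    p_same = category_probs(mu_same, thresholds)
    p_diff = category_probs(mu_diff, thresholds)
    return [s / d for s, d in zip(p_same, p_diff)]


# Hypothetical parameters (assumptions for illustration, not fitted values):
# four thresholds define five ordered response categories, from strong
# support for "different people" to strong support for "same person".
thresholds = [-1.5, -0.5, 0.5, 1.5]
lrs = likelihood_ratios(mu_same=1.0, mu_diff=-1.0, thresholds=thresholds)

for k, lr in enumerate(lrs, start=1):
    print(f"category {k}: LR = {lr:.3g}")
```

Each resulting likelihood ratio can then be compared with the numeric equivalent attached to the corresponding verbal category on an articulation scale.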

## Full-text entities

- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12967692/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12967692/full.md

## References

33 references — full list in the complete paper: https://tomesphere.com/paper/PMC12967692/full.md

---
Source: https://tomesphere.com/paper/PMC12967692