Pitfalls and Limits in Automatic Dementia Assessment

Franziska Braun; Christopher Witzl; Andreas Erzigkeit; Hartmut Lehfeld; Thomas Hillemacher; Tobias Bocklet; Korbinian Riedhammer

arXiv:2508.04512·eess.AS·August 7, 2025·Interspeech

TL;DR

This paper critically examines speech-based dementia assessment and shows that biases and artifacts affect accuracy differently across impairment levels, which calls for detailed error analysis beyond numerical performance.

Contribution

The paper provides an in-depth analysis of an automated standardized dementia assessment, the Syndrom-Kurz-Test, and highlights biases and pitfalls that affect the reliability of speech-based evaluation tools.

Findings

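01 Correlation with human scores is high for severely impaired individuals but weaker for healthy or mildly impaired ones

02 Speech production decreases with cognitive decline, which inflates correlations when test scoring relies on word naming

03 Depending on the test design, fallback handling introduces biases that favor certain groups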

Abstract

Current work on speech-based dementia assessment focuses either on feature extraction to predict assessment scales or on the automation of existing test procedures. Most research uses public data unquestioningly and rarely performs a detailed error analysis, focusing primarily on numerical performance. We perform an in-depth analysis of an automated standardized dementia assessment, the Syndrom-Kurz-Test. We find that the overall correlation with human annotators is high, but that this is partly due to certain artifacts: correlations are high for severely impaired individuals and lower for healthy or mildly impaired ones. Speech production decreases with cognitive decline, leading to overoptimistic correlations when test scoring relies on word naming. Depending on the test design, fallback handling introduces further biases that favor certain groups. These pitfalls remain…
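The gap between overall and per-group correlation described in the abstract can be illustrated with a small sketch. The scores below are invented for illustration only and are not data from the paper; the group names, the `pearson` helper, and the `scores` dictionary are all hypothetical. The point is only that a pooled correlation can look excellent while agreement within the healthy and mildly impaired groups stays weak.

```python
from math import sqrt


def pearson(x, y):
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x = sum(x) / n
    mean_y = sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / sqrt(var_x * var_y)


# Hypothetical (human, automatic) total scores per group.
# These numbers are made up; they are NOT results from the paper.
scores = {
    "healthy": ([0, 1, 0, 1, 2], [1, 1, 0, 2, 1]),
    "mild": ([3, 4, 3, 5, 4], [3, 5, 4, 4, 3]),
    "severe": ([9, 11, 13, 15, 17], [9, 12, 13, 14, 17]),
}

# Correlation computed separately within each group.
per_group = {name: pearson(h, a) for name, (h, a) in scores.items()}

# Correlation computed over all participants pooled together.
pooled_human = [v for h, _ in scores.values() for v in h]
pooled_auto = [v for _, a in scores.values() for v in a]
pooled = pearson(pooled_human, pooled_auto)

if __name__ == "__main__":
    print(f"pooled: {pooled:.2f}")
    for name, r in per_group.items():
        print(f"{name}: {r:.2f}")
```

In this toy setup the pooled value is driven mostly by the wide score range between groups, so reporting correlations per impairment level alongside the pooled figure gives a more honest picture of where the automatic scoring agrees with human annotators.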
