Zero-Shot Cognitive Impairment Detection from Speech Using AudioLLM

Mostafa Shahin; Beena Ahmed; Julien Epps

arXiv:2506.17351·cs.SD·June 24, 2025

Zero-Shot Cognitive Impairment Detection from Speech Using AudioLLM

Mostafa Shahin, Beena Ahmed, Julien Epps

PDF

TL;DR

This paper introduces a zero-shot speech-based method for detecting cognitive impairment using AudioLLM, which performs comparably to supervised models and generalizes well across languages and datasets.

Contribution

It is the first to apply a zero-shot approach with AudioLLM for cognitive impairment detection from speech, eliminating the need for manual annotation.

Findings

01

Achieves performance comparable to supervised methods

02

Demonstrates strong cross-lingual and cross-dataset generalization

03

Works effectively on multilingual and multi-task datasets

Abstract

Cognitive impairment (CI) is of growing public health concern, and early detection is vital for effective intervention. Speech has gained attention as a non-invasive and easily collectible biomarker for assessing cognitive decline. Traditional CI detection methods typically rely on supervised models trained on acoustic and linguistic features extracted from speech, which often require manual annotation and may not generalise well across datasets and languages. In this work, we propose the first zero-shot speech-based CI detection method using the Qwen2- Audio AudioLLM, a model capable of processing both audio and text inputs. By designing prompt-based instructions, we guide the model in classifying speech samples as indicative of normal cognition or cognitive impairment. We evaluate our approach on two datasets: one in English and another multilingual, spanning different cognitive…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.