Integrating Pause Information with Word Embeddings in Language Models for Alzheimer's Disease Detection from Spontaneous Speech
Yu Pu, Wei-Qiang Zhang

TL;DR
This paper introduces a novel method that incorporates pause information into transformer-based language models to improve early detection of Alzheimer's disease from spontaneous speech, achieving high accuracy on relevant datasets.
Contribution
The study presents a new approach that encodes pause information into embeddings and integrates them into language models for better AD detection from speech data.
Findings
Achieved 83.1% accuracy on ADReSSo dataset.
Pause information significantly improves AD detection performance.
Demonstrated potential of speech pauses as indicators for Alzheimer's disease.
Abstract
Alzheimer's disease (AD) is a progressive neurodegenerative disorder characterized by cognitive decline and memory loss. Early detection of AD is crucial for effective intervention and treatment. In this paper, we propose a novel approach to AD detection from spontaneous speech, which incorporates pause information into language models. Our method involves encoding pause information into embeddings and integrating them into the typical transformer-based language model, enabling it to capture both semantic and temporal features of speech data. We conduct experiments on the Alzheimer's Dementia Recognition through Spontaneous Speech (ADReSS) dataset and its extension, the ADReSSo dataset, comparing our method with existing approaches. Our method achieves an accuracy of 83.1% in the ADReSSo test set. The results demonstrate the effectiveness of our approach in discriminating between AD…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
