Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease   Detection

Yin-Long Liu; Rui Feng; Jia-Hong Yuan; Zhen-Hua Ling

arXiv:2412.06259·eess.AS·December 10, 2024

Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection

Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

PDF

Open Access

TL;DR

This study enhances Alzheimer's disease detection from speech transcripts by applying prompt learning, incorporating pause information, and using ensemble methods, achieving state-of-the-art accuracy with manual transcripts.

Contribution

It introduces prompt-based fine-tuning of pre-trained language models and explores pause encoding, improving AD detection accuracy over traditional methods.

Findings

01

Maximum detection accuracy of 95.8% with manual transcripts

02

Prompt-based fine-tuning outperforms traditional fine-tuning

03

Ensemble techniques improve robustness and accuracy

Abstract

Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Previous studies have demonstrated the efficacy of fine-tuning pre-trained language models (PLMs) for AD detection. However, the objective of this traditional fine-tuning method, which involves inputting only transcripts, is inconsistent with the masked language modeling (MLM) task used during the pre-training phase of PLMs. In this paper, we investigate prompt-based fine-tuning of PLMs, converting the classification task into a MLM task by inserting prompt templates into the transcript inputs. We also explore the impact of incorporating pause information from forced alignment into manual transcripts. Additionally, we compare the performance of various automatic speech recognition…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMachine Learning in Healthcare

MethodsSpatial-Channel Token Distillation