Leveraging Prompt Learning and Pause Encoding for Alzheimer's Disease Detection
Yin-Long Liu, Rui Feng, Jia-Hong Yuan, Zhen-Hua Ling

TL;DR
This study enhances Alzheimer's disease detection from speech transcripts by applying prompt learning, incorporating pause information, and using ensemble methods, achieving state-of-the-art accuracy with manual transcripts.
Contribution
It introduces prompt-based fine-tuning of pre-trained language models and explores pause encoding, improving AD detection accuracy over traditional methods.
Findings
Maximum detection accuracy of 95.8% with manual transcripts
Prompt-based fine-tuning outperforms traditional fine-tuning
Ensemble techniques improve robustness and accuracy
Abstract
Compared to other clinical screening techniques, speech-and-language-based automated Alzheimer's disease (AD) detection methods are characterized by their non-invasiveness, cost-effectiveness, and convenience. Previous studies have demonstrated the efficacy of fine-tuning pre-trained language models (PLMs) for AD detection. However, the objective of this traditional fine-tuning method, which involves inputting only transcripts, is inconsistent with the masked language modeling (MLM) task used during the pre-training phase of PLMs. In this paper, we investigate prompt-based fine-tuning of PLMs, converting the classification task into a MLM task by inserting prompt templates into the transcript inputs. We also explore the impact of incorporating pause information from forced alignment into manual transcripts. Additionally, we compare the performance of various automatic speech recognition…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Healthcare
MethodsSpatial-Channel Token Distillation
