Exploiting prompt learning with pre-trained language models for Alzheimer's Disease detection
Yi Wang, Jiajun Deng, Tianzi Wang, Bo Zheng, Shoukang Hu, Xunying Liu,, Helen Meng

TL;DR
This paper explores prompt-based fine-tuning of pre-trained language models for Alzheimer's Disease detection using speech transcripts, incorporating disfluency features and system voting to improve accuracy.
Contribution
It introduces a prompt-based fine-tuning approach aligned with AD classification, integrating disfluency features and combining multiple models for enhanced detection performance.
Findings
Achieved 84.20% accuracy with manual transcripts.
Achieved 82.64% accuracy with ASR transcripts.
System ensemble improved robustness across experiments.
Abstract
Early diagnosis of Alzheimer's disease (AD) is crucial in facilitating preventive care and to delay further progression. Speech based automatic AD screening systems provide a non-intrusive and more scalable alternative to other clinical screening techniques. Textual embedding features produced by pre-trained language models (PLMs) such as BERT are widely used in such systems. However, PLM domain fine-tuning is commonly based on the masked word or sentence prediction costs that are inconsistent with the back-end AD detection task. To this end, this paper investigates the use of prompt-based fine-tuning of PLMs that consistently uses AD classification errors as the training objective function. Disfluency features based on hesitation or pause filler token frequencies are further incorporated into prompt phrases during PLM fine-tuning. The decision voting based combination among systems…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsInterpreting and Communication in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · Spatial-Channel Token Distillation · Test · Linear Warmup With Linear Decay · Attention Dropout · Weight Decay · Dense Connections · Linear Layer · Layer Normalization
