GPT-D: Inducing Dementia-related Linguistic Anomalies by Deliberate Degradation of Artificial Neural Language Models
Changye Li, David Knopman, Weizhe Xu, Trevor Cohen, Serguei, Pakhomov

TL;DR
This paper introduces GPT-D, a novel method pairing a pre-trained language model with an artificially degraded version to detect and induce dementia-related linguistic anomalies, improving generalization and understanding of language deterioration in Alzheimer's disease.
Contribution
The paper proposes a new approach using model degradation and perplexity ratios to detect dementia-related language changes, enhancing generalization and interpretability over existing methods.
Findings
Achieves state-of-the-art performance on the 'Cookie Theft' task
Generalizes well to spontaneous conversations
Induces dementia-like linguistic anomalies in generated text
Abstract
Deep learning (DL) techniques involving fine-tuning large numbers of model parameters have delivered impressive performance on the task of discriminating between language produced by cognitively healthy individuals, and those with Alzheimer's disease (AD). However, questions remain about their ability to generalize beyond the small reference sets that are publicly available for research. As an alternative to fitting model parameters directly, we propose a novel method by which a Transformer DL model (GPT-2) pre-trained on general English text is paired with an artificially degraded version of itself (GPT-D), to compute the ratio between these two models' \textit{perplexities} on language from cognitively healthy and impaired individuals. This technique approaches state-of-the-art performance on text data from a widely used "Cookie Theft" picture description task, and unlike established…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsTopic Modeling · Text Readability and Simplification · Interpreting and Communication in Healthcare
MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dense Connections · Byte Pair Encoding · Label Smoothing · Absolute Position Encodings · Layer Normalization · Residual Connection
