GPT-D: Inducing Dementia-related Linguistic Anomalies by Deliberate   Degradation of Artificial Neural Language Models

Changye Li; David Knopman; Weizhe Xu; Trevor Cohen; Serguei; Pakhomov

arXiv:2203.13397·cs.CL·March 28, 2022

GPT-D: Inducing Dementia-related Linguistic Anomalies by Deliberate Degradation of Artificial Neural Language Models

Changye Li, David Knopman, Weizhe Xu, Trevor Cohen, Serguei, Pakhomov

PDF

Open Access 2 Repos

TL;DR

This paper introduces GPT-D, a novel method pairing a pre-trained language model with an artificially degraded version to detect and induce dementia-related linguistic anomalies, improving generalization and understanding of language deterioration in Alzheimer's disease.

Contribution

The paper proposes a new approach using model degradation and perplexity ratios to detect dementia-related language changes, enhancing generalization and interpretability over existing methods.

Findings

01

Achieves state-of-the-art performance on the 'Cookie Theft' task

02

Generalizes well to spontaneous conversations

03

Induces dementia-like linguistic anomalies in generated text

Abstract

Deep learning (DL) techniques involving fine-tuning large numbers of model parameters have delivered impressive performance on the task of discriminating between language produced by cognitively healthy individuals, and those with Alzheimer's disease (AD). However, questions remain about their ability to generalize beyond the small reference sets that are publicly available for research. As an alternative to fitting model parameters directly, we propose a novel method by which a Transformer DL model (GPT-2) pre-trained on general English text is paired with an artificially degraded version of itself (GPT-D), to compute the ratio between these two models' \textit{perplexities} on language from cognitively healthy and impaired individuals. This technique approaches state-of-the-art performance on text data from a widely used "Cookie Theft" picture description task, and unlike established…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsTopic Modeling · Text Readability and Simplification · Interpreting and Communication in Healthcare

MethodsMulti-Head Attention · Attention Is All You Need · Linear Layer · Position-Wise Feed-Forward Layer · Dense Connections · Byte Pair Encoding · Label Smoothing · Absolute Position Encodings · Layer Normalization · Residual Connection