Semantic Coherence Markers for the Early Diagnosis of the Alzheimer Disease

Davide Colla; Matteo Delsanto; Marco Agosto; Benedetto Vitiello; Daniele Paolo Radicioni

arXiv:2302.01025·cs.CL·February 3, 2023

TL;DR

This study demonstrates that language model perplexity scores can accurately distinguish between healthy individuals and those with Alzheimer’s disease, offering a promising tool for early diagnosis based on language analysis.

Contribution

The paper introduces the use of perplexity scores from various language models as a novel method for early Alzheimer’s detection through language transcript analysis.
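For reference, the textbook definition of perplexity for a token sequence $w_1, \dots, w_N$ under a language model $p$ is sketched below. This is the standard formulation, not a formula quoted from the paper; the exact normalization the authors use (for example, how sentence boundaries or subword tokens are counted in $N$) is an assumption here.

```latex
\mathrm{PPL}(w_1, \dots, w_N)
  = \exp\!\left( -\frac{1}{N} \sum_{i=1}^{N} \log p\left(w_i \mid w_1, \dots, w_{i-1}\right) \right)
```

Lower values mean the model finds the sequence more predictable. For an n-gram model, the conditioning context is truncated to the previous $n-1$ tokens.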

Findings

01

The best-performing models reached full accuracy and F-score (1.00 in both precision/specificity and recall/sensitivity).

02

Transformer-based GPT-2 outperformed n-gram models.

03

Language models can effectively discriminate between healthy and Alzheimer’s-affected speech.

Abstract

In this work we explore how language models can be employed to analyze language and discriminate between mentally impaired and healthy subjects through the perplexity metric. Perplexity was originally conceived as an information-theoretic measure to assess how well a given language model is suited to predict a text sequence or, equivalently, how well a word sequence fits a specific language model. We carried out extensive experiments with publicly available data, and employed language models as diverse as N-grams, from 2-grams to 5-grams, and GPT-2, a transformer-based language model. We investigated whether perplexity scores may be used to discriminate between the transcripts of healthy subjects and subjects suffering from Alzheimer Disease (AD). Our best performing models achieved full accuracy and F-score (1.00 in both precision/specificity and recall/sensitivity) in…
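To make the perplexity idea concrete, here is a minimal sketch of scoring a tokenized transcript with a bigram model. It is not the authors' implementation: the add-one smoothing, the toy corpus, and the function names `train_bigram` and `perplexity` are all assumptions chosen for illustration.

```python
import math
from collections import Counter


def train_bigram(sentences):
    """Count bigram contexts and bigrams over tokenized sentences (hypothetical helper)."""
    contexts, bigrams = Counter(), Counter()
    for tokens in sentences:
        padded = ["<s>"] + tokens + ["</s>"]
        contexts.update(padded[:-1])                 # every token that precedes another
        bigrams.update(zip(padded[:-1], padded[1:]))
    # Tokens the model may predict: training words, end marker, unknown-word marker.
    vocab = {tok for s in sentences for tok in s} | {"</s>", "<unk>"}
    return contexts, bigrams, vocab


def perplexity(tokens, model):
    """Perplexity of one token sequence under an add-one smoothed bigram model."""
    contexts, bigrams, vocab = model
    mapped = [t if t in vocab else "<unk>" for t in tokens]
    padded = ["<s>"] + mapped + ["</s>"]
    log_prob = 0.0
    for prev, cur in zip(padded[:-1], padded[1:]):
        # Add-one (Laplace) smoothing is an assumption, not taken from the paper.
        p = (bigrams[(prev, cur)] + 1) / (contexts[prev] + len(vocab))
        log_prob += math.log(p)
    n_predictions = len(padded) - 1                  # includes the end-of-sentence marker
    return math.exp(-log_prob / n_predictions)


# Toy data, invented for illustration only.
corpus = [
    ["the", "boy", "takes", "a", "cookie"],
    ["the", "girl", "takes", "a", "cookie"],
]
model = train_bigram(corpus)
fluent = ["the", "boy", "takes", "a", "cookie"]
scrambled = ["cookie", "a", "the", "takes", "boy"]
print(perplexity(fluent, model), perplexity(scrambled, model))
```

On this toy data the scrambled sequence scores a higher perplexity than the fluent one, which is the kind of signal the abstract describes using to separate transcripts. A real experiment would need a much larger training corpus and a held-out evaluation protocol.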

Code & Models

Repositories

davidecolla/semantic_coherence_markers (Official)

Taxonomy

Topics: Machine Learning in Healthcare · Topic Modeling · Text Readability and Simplification

Methods: Attention Is All You Need · Adam · Cosine Annealing · Linear Warmup With Cosine Annealing · Linear Layer · Residual Connection · Weight Decay · Attention Dropout · Dense Connections · Multi-Head Attention