Finding clues for Alzheimer's disease in online language
Shira Darchi, Yuval Pinter, Talya Sadeh

TL;DR
This study shows that language patterns on Reddit can reveal early signs of Alzheimer's disease, offering a non-invasive way to detect cognitive decline.
Contribution
The study introduces a method to detect Alzheimer's using Reddit data and linguistic analysis, showing early cognitive decline markers.
Findings
AD patients used more past tense, emotional, and social words compared to controls.
Controls used significantly more internal details and analytic language than AD patients.
No significant differences were found in the use of external details between groups.
Abstract
Natural Language Processing (NLP) has shown promise in identifying linguistic markers of MCI and Alzheimer's Disease (AD). Prior studies have achieved classification accuracy of 80%‐85% but were often limited by cross‐sectional designs and reliance on curated texts like novels and speeches. Social media data offer a more ecologically valid, continuous assessment of linguistic changes. This study explores whether cognitive decline can be detected through online activity on Reddit, contributing to developing a more complex model capable of detecting early changes indicative of cognitive deterioration from online activity. We applied a Self‐Disclosure approach, commonly used in research for identifying self‐reports of various mental illnesses. We identified self‐reported AD cases on Reddit and extracted their posts. Data from 42 Reddit users (20 diagnosed with AD, 22 matched controls)…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Health Literacy and Information Accessibility · Neurobiology of Language and Bilingualism
