Augmenting word2vec with latent Dirichlet allocation within a clinical application

Akshay Budhkar; Frank Rudzicz

arXiv:1808.03967·cs.CL·August 14, 2018·1 cites

TL;DR

This paper introduces three hybrid models that combine LDA and word2vec to distinguish speakers with and without Alzheimer's disease from transcripts of picture descriptions. Two of the three exceed the current state-of-the-art F-scores for automatic methods on the DementiaBank dataset.

Contribution

The paper proposes hybrid models that integrate LDA and word2vec for clinical speech analysis and reports improved F-scores for classifying speakers with and without Alzheimer's disease.

Findings

01. Two models outperform current state-of-the-art F-scores

02. Models effectively distinguish Alzheimer's from non-Alzheimer's speech

03. Hybrid approach enhances clinical language analysis

Abstract

This paper presents three hybrid models that directly combine latent Dirichlet allocation and word embedding for distinguishing between speakers with and without Alzheimer's disease from transcripts of picture descriptions. Two of our models get F-scores over the current state-of-the-art using automatic methods on the DementiaBank dataset.

Taxonomy

Topics: Natural Language Processing Techniques · Topic Modeling · Text Readability and Simplification