Evaluating Large Language Models to Support Dementia Caregivers: Identifying Opportunities for Improvement

Kyeung Mi Oh; Sungsoo Hong; Ziwei Zhu; Huayu Zhou; Jung Ah Lee

PMC · DOI:10.1093/geroni/igaf122.4109·December 31, 2025

Evaluating Large Language Models to Support Dementia Caregivers: Identifying Opportunities for Improvement

Kyeung Mi Oh, Sungsoo Hong, Ziwei Zhu, Huayu Zhou, Jung Ah Lee

PDF

Open Access

TL;DR

This study explores how large language models can help dementia caregivers by evaluating different versions of ChatGPT and identifying ways to improve their usefulness.

Contribution

The study introduces an enhanced ChatGPT model refined with health science and gerontology knowledge to better support dementia caregivers.

Findings

01

The enhanced ChatGPT-4o model scored higher in actionability, relevance, and satisfaction compared to the baseline version.

02

Interview data highlighted themes like empathy, accuracy, and bias as important for preferred responses.

03

Both models were seen as overly verbose, but the enhanced model provided more comprehensive and caregiver-centered information.

Abstract

Awareness and access to the dementia caregiving resources is crucial for informal caregivers of people with early-stage dementia. Large language models (LLMs) offer easy access to caregiving information, but the risks, challenges, and ways to improve LLM-generated responses remain understudied. This mixed methods study evaluated LLMs, including the baseline ChatGPT-4o model and an enhanced version refined through prompt engineering grounded in health science and gerontology literature, to support informal dementia caregivers. This study aimed to assess key factors influencing preferred responses from LLMs and to identify related risks and challenges, thereby informing opportunities for improvement. Surveys and interviews with 12 stakeholders, including 10 healthcare professionals and 2 caregivers, were conducted to assess model responses to questions commonly asked by caregivers. The…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases1

dementia

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsArtificial Intelligence in Healthcare and Education · Mental Health via Writing · Dementia and Cognitive Impairment Research