Evaluating Large Language Models to Support Dementia Caregivers: Identifying Opportunities for Improvement
Kyeung Mi Oh, Sungsoo Hong, Ziwei Zhu, Huayu Zhou, Jung Ah Lee

TL;DR
This study explores how large language models can help dementia caregivers by evaluating different versions of ChatGPT and identifying ways to improve their usefulness.
Contribution
The study introduces an enhanced ChatGPT model refined with health science and gerontology knowledge to better support dementia caregivers.
Findings
The enhanced ChatGPT-4o model scored higher in actionability, relevance, and satisfaction compared to the baseline version.
Interview data highlighted themes like empathy, accuracy, and bias as important for preferred responses.
Both models were seen as overly verbose, but the enhanced model provided more comprehensive and caregiver-centered information.
Abstract
Awareness and access to the dementia caregiving resources is crucial for informal caregivers of people with early-stage dementia. Large language models (LLMs) offer easy access to caregiving information, but the risks, challenges, and ways to improve LLM-generated responses remain understudied. This mixed methods study evaluated LLMs, including the baseline ChatGPT-4o model and an enhanced version refined through prompt engineering grounded in health science and gerontology literature, to support informal dementia caregivers. This study aimed to assess key factors influencing preferred responses from LLMs and to identify related risks and challenges, thereby informing opportunities for improvement. Surveys and interviews with 12 stakeholders, including 10 healthcare professionals and 2 caregivers, were conducted to assess model responses to questions commonly asked by caregivers. The…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsArtificial Intelligence in Healthcare and Education · Mental Health via Writing · Dementia and Cognitive Impairment Research
