A Framework for Evaluating AI-Powered Virtual Assistants to Support Older Adults’ Information-Seeking Needs
Walter Boot, Emily Langston, Varitnan Hattakitjamroen, Mario Hernandez, Hye Soo Lee, Hannah Mason, Willencia Louis-Charles

TL;DR
This paper introduces a framework to evaluate AI-powered virtual assistants for helping older adults find health and financial information.
Contribution
The study presents a novel framework and case example for evaluating AI assistants' accuracy and usability for older adults.
Findings
LLM-based assistants like Bard and ChatGPT-4 were more accurate than non-LLM systems like Alexa.
Bard provided additional information in 79% of responses, compared to 37% for ChatGPT-4.
Response variability over time highlights the need for refinement and user training.
Abstract
Older adults often face the challenge of searching for critical health, financial, and resource-related information to make complex decisions, a process further complicated by age-related cognitive changes that impact information processing and decision-making. Artificial intelligence (AI)-powered virtual assistants may help by providing concise, easy-to-understand information, yet their accuracy and effectiveness remain unclear. This presentation will introduce a general framework for evaluating AI’s potential to support important decisions of older adults and provide a case example illustrating this approach. To examine the accuracy and utility of AI-powered virtual assistants, we assessed the responses of Alexa, Google Assistant, Bard, and ChatGPT-4 to queries related to Medicare, long-term care insurance, and resource access. Findings showed that Large Language Model (LLM)-based…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAI in Service Interactions · Artificial Intelligence in Healthcare and Education · Technology Use by Older Adults
