A Framework for Evaluating Appropriateness, Trustworthiness, and Safety   in Mental Wellness AI Chatbots

Lucia Chen; David A. Preece; Pilleriin Sikka; James J. Gross; Ben; Krause

arXiv:2407.11387·cs.HC·July 17, 2024·3 cites

A Framework for Evaluating Appropriateness, Trustworthiness, and Safety in Mental Wellness AI Chatbots

Lucia Chen, David A. Preece, Pilleriin Sikka, James J. Gross, Ben, Krause

PDF

Open Access

TL;DR

This paper introduces the MHealth-EVAL framework for evaluating mental wellness chatbots and demonstrates its application through Psyfy, a new LLM-based CBT chatbot, showing improved appropriateness and safety but with some limitations.

Contribution

It presents the MHealth-EVAL evaluation framework and introduces Psyfy, a novel LLM-based mental wellness chatbot, demonstrating their effectiveness in real-world assessments.

Findings

01

Psyfy outperformed baseline chatbots in response appropriateness and user engagement.

02

Psyfy effectively identified unsafe situations and avoided unsafe responses.

03

Both Psyfy and baseline chatbots showed US-centric resource bias.

Abstract

Large language model (LLM) chatbots are susceptible to biases and hallucinations, but current evaluations of mental wellness technologies lack comprehensive case studies to evaluate their practical applications. Here, we address this gap by introducing the MHealth-EVAL framework, a new role-play based interactive evaluation method designed specifically for evaluating the appropriateness, trustworthiness, and safety of mental wellness chatbots. We also introduce Psyfy, a new chatbot leveraging LLMs to facilitate transdiagnostic Cognitive Behavioral Therapy (CBT). We demonstrate the MHealth-EVAL framework's utility through a comparative study of two versions of Psyfy against standard baseline chatbots. Our results showed that Psyfy chatbots outperformed the baseline chatbots in delivering appropriate responses, engaging users, and avoiding untrustworthy responses. However, both Psyfy and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health via Writing · Digital Mental Health Interventions · Artificial Intelligence in Healthcare and Education