A Framework for Evaluating Appropriateness, Trustworthiness, and Safety in Mental Wellness AI Chatbots
Lucia Chen, David A. Preece, Pilleriin Sikka, James J. Gross, Ben, Krause

TL;DR
This paper introduces the MHealth-EVAL framework for evaluating mental wellness chatbots and demonstrates its application through Psyfy, a new LLM-based CBT chatbot, showing improved appropriateness and safety but with some limitations.
Contribution
It presents the MHealth-EVAL evaluation framework and introduces Psyfy, a novel LLM-based mental wellness chatbot, demonstrating their effectiveness in real-world assessments.
Findings
Psyfy outperformed baseline chatbots in response appropriateness and user engagement.
Psyfy effectively identified unsafe situations and avoided unsafe responses.
Both Psyfy and baseline chatbots showed US-centric resource bias.
Abstract
Large language model (LLM) chatbots are susceptible to biases and hallucinations, but current evaluations of mental wellness technologies lack comprehensive case studies to evaluate their practical applications. Here, we address this gap by introducing the MHealth-EVAL framework, a new role-play based interactive evaluation method designed specifically for evaluating the appropriateness, trustworthiness, and safety of mental wellness chatbots. We also introduce Psyfy, a new chatbot leveraging LLMs to facilitate transdiagnostic Cognitive Behavioral Therapy (CBT). We demonstrate the MHealth-EVAL framework's utility through a comparative study of two versions of Psyfy against standard baseline chatbots. Our results showed that Psyfy chatbots outperformed the baseline chatbots in delivering appropriate responses, engaging users, and avoiding untrustworthy responses. However, both Psyfy and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing · Digital Mental Health Interventions · Artificial Intelligence in Healthcare and Education
