VERA-MH: Validation of Ethical and Responsible AI in Mental Health
Luca Belli, Kate H. Bentley, Josh Gieringer, Emily Van Ark, Nilu Zhao, Pradip Thachile, Matt Hawrilenko, Millard Brown, Adam M. Chekroud

TL;DR
This paper introduces VERA-MH, a clinically-validated evaluation framework for assessing the safety of chatbots in mental health support, focusing on suicidal ideation risks.
Contribution
It presents a novel multi-step evaluation method involving conversation simulation, clinical judgment, and model rating for mental health chatbot safety.
Findings
Evaluated four leading LLM providers using VERA-MH.
Developed clinically-guided user personas for simulation.
Structured rubric improves consistency in safety assessments.
Abstract
Chatbot usage has increased, including in fields for which they were never developed for--notably mental health support. To that end, we introduce Validations of Ethical and Responsible AI in Mental Health (VERA-MH), a novel clinically-validated evaluation for safety of chatbots in the context of mental health support. The first iteration of VERA-MH focuses on Suicidal Ideation (SI) risks, by assessing how well chatbots can responds to users that might be in crisis. VERA-MH is comprised of three steps: conversation simulation, conversation judging and model rating. First, to simulate conversations with the chatbot under evaluation, another chatbot is tasked with role-playing users based on specific personas. Such user personas have been developed under clinical guidance, to make sure that, among others, multiple risk factors, demographic characteristics and disclosure factors were…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
