VERA-MH: Validation of Ethical and Responsible AI in Mental Health

Luca Belli; Kate H. Bentley; Josh Gieringer; Emily Van Ark; Nilu Zhao; Pradip Thachile; Matt Hawrilenko; Millard Brown; Adam M. Chekroud

arXiv:2605.13318·cs.AI·May 20, 2026

VERA-MH: Validation of Ethical and Responsible AI in Mental Health

Luca Belli, Kate H. Bentley, Josh Gieringer, Emily Van Ark, Nilu Zhao, Pradip Thachile, Matt Hawrilenko, Millard Brown, Adam M. Chekroud

PDF

TL;DR

This paper introduces VERA-MH, a clinically-validated evaluation framework for assessing the safety of chatbots in mental health support, focusing on suicidal ideation risks.

Contribution

It presents a novel multi-step evaluation method involving conversation simulation, clinical judgment, and model rating for mental health chatbot safety.

Findings

01

Evaluated four leading LLM providers using VERA-MH.

02

Developed clinically-guided user personas for simulation.

03

Structured rubric improves consistency in safety assessments.

Abstract

Chatbot usage has increased, including in fields for which they were never developed for--notably mental health support. To that end, we introduce Validations of Ethical and Responsible AI in Mental Health (VERA-MH), a novel clinically-validated evaluation for safety of chatbots in the context of mental health support. The first iteration of VERA-MH focuses on Suicidal Ideation (SI) risks, by assessing how well chatbots can responds to users that might be in crisis. VERA-MH is comprised of three steps: conversation simulation, conversation judging and model rating. First, to simulate conversations with the chatbot under evaluation, another chatbot is tasked with role-playing users based on specific personas. Such user personas have been developed under clinical guidance, to make sure that, among others, multiple risk factors, demographic characteristics and disclosure factors were…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.