User experience and safety of generative AI-based mental health chatbots: Scoping review protocol

Lotenna Olisaeloka; Chris Richardson; Daniel Vigo

PMC · DOI:10.1371/journal.pone.0341631·January 23, 2026

User experience and safety of generative AI-based mental health chatbots: Scoping review protocol

Lotenna Olisaeloka, Chris Richardson, Daniel Vigo

PDF

Open Access

TL;DR

This paper outlines a scoping review protocol to examine user experience and safety of generative AI-based mental health chatbots.

Contribution

The study introduces a systematic review protocol focusing on user experience and safety in GenAI-based mental health chatbots.

Findings

01

The review will identify current GenAI-based mental health chatbots and their features.

02

It will highlight strategies for mitigating risks and enhancing user safety in these chatbots.

Abstract

Mental health problems constitute a significant global health challenge due to their rising prevalence and substantial treatment gap. Digital Mental Health Interventions (DMHIs) including mental health chatbots have emerged as promising solutions due to their effectiveness and scalability. Recent advances in Generative Artificial Intelligence (GenAI) have improved the conversational abilities of these chatbots, further amplifying their potential. However, despite instances of inadvertent harm stemming from the unpredictable nature of GenAI, little attention has been paid to user experience and safety of these chatbots. This proposed review will explore existing research on GenAI-based mental health chatbots. Specifically, it aims to identify and describe current chatbots, focusing on user experience, safety and risk mitigation strategies. The review will follow the Joanna Briggs…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Diseases1

Mental health problems

Figures2

Click any figure to enlarge with its caption.

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsDigital Mental Health Interventions · AI in Service Interactions · Artificial Intelligence in Healthcare and Education

Full text

Background

Mental health and substance use disorders affect over one billion people globally, contributing substantially to disability, premature mortality, and economic burden worldwide [1–3]. Despite effective therapeutic approaches, a significant treatment gap persists with a majority of affected individuals remaining untreated [3,4]. This gap is driven by persistent barriers to care including treatment costs, shortages of trained personnel, geographical inaccessibility, and stigma [3].

Digital Mental Health Interventions (DMHI) have emerged as a promising strategy to address some of these challenges and expand access to care. DMHI encompass a range of technology-based tools such as online platforms, mobile apps, chatbots and virtual reality (VR), designed to deliver mental health services and support [5,6]. Asynchronous and self-guided DMHI, like chatbots are especially promising due to their accessibility and scalability [5,7].

Conversational Agents (CA), commonly called chatbots, are software applications that mimic human conversation through text or voice interactions. These agents have been used to deliver mental health interventions for a variety of conditions including depression, anxiety, eating disorders, and substance use [8–13]. However, user engagement with these tools is often limited, attributable to the lack of personalization and dynamic interaction [14–16].

Personalization—the tailoring of an intervention to a user’s unique context—has been shown to enhance engagement, user experience, and effectiveness [12]. However, traditional mental health chatbots are primarily rule-based, relying on pre-programmed conversational flow which limit their flexibility and ability to personalize responses. While retrieval-based chatbots offer more adaptability, they still depend on pre-scripted responses, which restrict their ability to meet complex user needs [17,18]. In contrast, GenAI mental health chatbots powered by large language models (LLMs) can produce more interactive and contextually relevant responses. This allows for natural and tailored empathetic conversations, which emerging evidence suggest may improve engagement and therapeutic outcomes [19–21].

The emergence of LLMs marked a turning point in the development of sophisticated mental health chatbots capable of human-like support [22,23]. However, the same flexibility and sophistication that enhance personalization and user engagement also introduce novel risks and safety challenges. GenAI chatbots may generate misinformation, produce inappropriate or harmful responses, and exhibit algorithmic bias. Further, their “black box” nature make them unpredictable and less reliable especially in crisis situations [24–26]. These concerns have triggered broader debates about the ethical and safe deployment of GenAI in mental healthcare [27,28].

Despite increasing discourse on the ethical application of GenAI for mental health, there remains limited research on how to design and deploy GenAI-based mental health chatbots in effective and safe ways. Existing reviews in this area largely focus on traditional rule- and retrieval-based models [8,10–12]. Notwithstanding, a recent meta-analysis highlighted the superior efficacy of AI-based mental health chatbots compared to traditional ones, due to their ability to simulate empathetic conversations and personalize interactions [21]. Still, there remains a lack of systematic synthesis examining their characteristics, user experience and safety profiles.

In this review, User Experience (UX) is conceptualized according to the International Organization for Standardization (ISO 9241−210), which defines UX as an individual’s “perceptions and responses resulting from the use and/or anticipated use of a product, system or service.” The ISO notes that UX “includes all the users’ emotions, beliefs, preferences, perceptions, physical and psychological responses, behaviours and accomplishments that occur before, during and after use.” [29]. In the context of DMHIs, this encompasses measures of acceptability, usability, perceived impact, and engagement [9,30]. Emerging research highlights both the appeal and pitfalls of GenAI mental health chatbots: users appreciate the engaging on-demand and non-judgmental support, but also express concerns about unreliable or potentially harmful content, as well as the risk of overdependence [31,32].

User safety is another critical consideration for DMHI and refers to how digital tools minimize harm, uphold data protection and privacy, and promote psychological well-being throughout intervention design and delivery [33]. The World Health Organization in its guidelines for digital interventions calls for safety considerations including assessing benefits and harms, ensuring data privacy, and using evidence to guide implementation [34]. The safety of traditional mental health chatbots previously received limited attention, mainly because the rule- and retrieval-based systems were perceived as low-risk [35]. In contrast, GenAI’s unpredictability poses challenges in ensuring reliable and safe responses which could result in serious consequences [36]. For instance, there have been reports of GenAI chatbots offering harmful advice [37], promoting substance use, and soliciting explicit content from minors [31]. In some cases, persistent interactions with GenAI chatbots have been linked to tragic outcomes, including suicide [38,39]. These incidents underscore the urgent need for robust safety protocols. The American College of Physicians has called for transparency, rigorous testing, and focused research to understand and mitigate AI-related risks in healthcare [40].

This scoping review fills a critical research gap relating to user experience and safety of GenAI-based mental health interventions. While, existing reviews have examined broader applications of AI and large language models (LLMs), none have systematically mapped Generative AI-based chatbots or explored how user experience and safety are conceptualized and operationalized within this emerging domain [12,24,41]. The proposed review therefore fills this gap by focusing specifically on LLM-powered chatbots and by integrating both user-centered and safety-oriented perspectives. A preliminary search of MEDLINE, the Cochrane Database of Systematic Reviews, and JBI Evidence Synthesis revealed no published or registered scoping or systematic reviews on this specific topic as of August 2024, when this review was registered.

Review questions

The proposed review seeks to:

Identify and describe Generative AI-based chatbots developed specifically to deliver mental health interventions.Assess how user experience (e.g., acceptability, usability, engagement) are reported in studies of these chatbot interventions.Examine the safety mechanisms and risk mitigation strategies integrated during the development and deployment of these chatbot interventions.

Methods

The proposed scoping review will follow guidelines outlined in the Joanna Briggs Institute (JBI) manual for scoping reviews [42] and adhere to the Preferred Reporting Items for Systematic Reviews and Meta-Analyses extension for Scoping Reviews (PRISMA-ScR) [43]. The review has been registered in Open Science Framework Registries (doi.org/10.17605/OSF.IO/HSNXA).

Eligibility criteria

The PCC (population, concept, context) framework recommended by JBI was used to develop the scope and eligibility criteria of the review to ensure a clear and effective search strategy [42]. Table 1 presents the inclusion and exclusion criteria.

Table 1: Inclusion and exclusion criteria for the scoping review.

Search strategy

A preliminary search of MEDLINE was conducted to identify articles on the topic. Keywords and index (MeSH) terms identified from relevant articles were used to develop the full search strategy for MEDLINE (OVID) (S1 Appendix). This search strategy will be adapted to other selected databases: Scopus, PsycINFO, ACM Digital Library and IEEE Xplore. The databases were chosen to capture a broad range of sources from different disciplines related to the review objectives. PubMed was selected for its extensive coverage of publications in medicine and health sciences, while Scopus was chosen to include studies in relevant multidisciplinary areas such as science and technology, medicine, and social sciences. PsycINFO specializes in psychiatry and psychology related articles, making it essential for this review. The ACM Digital Library and IEEE Xplore were included because they index publications relating to AI and NLP application in mental health. The database search will be complemented by research-based search engines (Google Scholar and Consensus) to capture other relevant grey literature. The reference lists of all included sources of evidence will also be screened for additional studies.

Selection of evidence sources

Following the search, all identified citations will be imported into Covidence with duplicates automatically removed [44]. Following a pilot test, titles and abstracts will be screened by two independent reviewers against the eligibility criteria. Potentially relevant sources will be retrieved in full, and the full text of selected citations assessed in detail against the eligibility criteria by the same independent reviewers. At this stage, reasons for exclusion of sources of evidence will be recorded and reported in the scoping review. Any disagreements that arise between the reviewers at each stage of the selection process will be resolved through discussion, or with an additional reviewer. The results of the search and the study inclusion process will be reported in full in the final scoping review and presented in a PRISMA-ScR flow diagram [42].

Data extraction

Data will be extracted by two independent reviewers using a data extraction tool developed by the reviewers. The data extracted will include specific details about the participants, concept, context, study methods and key findings relevant to the review questions. Table 2 lists important data that will be extracted from included studies. These will be used to develop a draft extraction form in Covidence which will be modified and revised as necessary during the data extraction process. As recommended by JBI, any disagreements that arise between the reviewers will be resolved through discussion, or with an additional reviewer, to achieve consensus [42]. Where appropriate, authors of papers will be contacted to request missing or additional data, where required.

Table 2: Proposed Data Extraction.

Data analysis and presentation

Data will be analysed and presented using descriptive statistics and narrative synthesis, focusing on the review objectives. Data visualization including summary tables, graphs and figures will be used to concisely present findings. Data extraction will be conducted within Covidence, which will also facilitate version control and audit trails. Extracted datasets will be exported to R (v4.2.3) for descriptive analysis and NVivo (v14) for qualitative synthesis, ensuring transparent data management and reproducibility [44,45]. Quantitative data will be summarized using descriptive statistics, including means, standard deviations, and frequency distributions, where applicable. No inferential or meta-analytic procedures will be performed, consistent with the scoping-review design. Qualitative data (e.g., user feedback and narrative findings) will undergo inductive thematic analysis following Braun and Clarke’s six-phase framework [46]. Coding will be performed independently by at least two reviewers, who will iteratively compare and refine themes through reflexive discussion until consensus is reached. An audit trail of coding decisions will be maintained to enhance transparency and trustworthiness [42].

Study characteristics (e.g., author, country, design, population, and context) will be summarized in structured tables and figures accompanied by a narrative overview. To map the GenAI-based chatbot interventions, a dedicated table will outline identified chatbots, their key features (e.g., deployment platform, interaction mode), and mental health problems they target.

User experience measures will be categorized and presented according to common themes such as acceptability, usability, engagement, and personalization, while safety mechanisms will be grouped into pre-deployment and delivery-focused strategies. To enhance conceptual clarity, pre-deployment (development-phase) safety mechanisms will be analyzed separately from deployment-phase risk mitigation strategies. The former includes model-training safeguards, bias and accuracy testing, and data-protection measures applied before user interaction. The latter captures real-world implementation safeguards such as human-in-the-loop oversight, user-support features, crisis-response protocols, and reporting of adverse events. This distinction will guide both data extraction and thematic synthesis. Overall, narrative synthesis will integrate findings across themes, highlighting innovative practices, safety considerations, and implications for future research, policy, and practice. The findings will inform the ethical design, evaluation, and regulation of GenAI tools for mental health care, offering timely guidance for developers, researchers, and policymakers seeking to ensure human-centered and safe deployment.

Ethical considerations

This review involves analysis of publicly available literature and does not require ethical approval. Nonetheless, the broader ethical dimensions of GenAI mental health tools are recognized. Particular attention will be paid to how included studies address informed consent, data privacy, transparency, and the mitigation of potential psychological harm arising from chatbot use. These aspects will be highlighted in the synthesis to inform ethical best practices for future AI-enabled mental health interventions.

Limitations

We acknowledge few anticipated limitations in this scoping review. First, the exclusion of non-English language publications may introduce language bias and limit the global comprehensiveness of the findings. Second, the expected variability across included studies in terms of study designs, reporting quality, and the varied operationalizations of key constructs such as user experience and safety may constrain the ability to directly compare results or synthesize findings quantitatively. Finally, given the rapidly evolving field of generative AI, relevant interventions may exist outside the academic literature, such as unpublished, proprietary, or inadequately described tools. This may affect the completeness of our review and highlights the need for ongoing updates as new evidence emerges. Despite these limitations, the review’s findings are expected to support the development of evidence-informed frameworks for responsible and equitable integration of generative AI in mental health interventions.

Supporting information

S1 AppendixInitial Search for OVID MEDLINE (3/07/2024).(DOCX)

S2 AppendixPRISMA-P (Preferred Reporting Items for Systematic review and Meta-Analysis Protocols) 2015 checklist*.(PDF)

Bibliography46

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Dattani S, Rodés-Guirao L, Ritchie H, Roser M. Mental Health. https://ourworldindata.org/mental-health. 2023.
2Walker ER, Mc Gee RE, Druss BG. Mortality in mental disorders and global disease burden implications: a systematic review and meta-analysis. JAMA Psychiatry. 2015;72(4):334–41. doi: 10.1001/jamapsychiatry.2014.2502 25671328 PMC 4461039 · doi ↗ · pubmed ↗
3WHO. World mental health report: Transforming mental health for all. World Health Organization. https://www.who.int/publications-detail-redirect/9789240049338
4Mental health atlas 2017. Geneva: World Health Organization. https://www.who.int/publications-detail-redirect/9789241514019
5Kuhn E, Saleem M, Klein T, Köhler C, Fuhr DC, Lahutina S, et al. Interdisciplinary perspectives on digital technologies for global mental health. PLOS Glob Public Health. 2024;4(2):e 0002867. doi: 10.1371/journal.pgph.0002867 38315676 PMC 10843075 · doi ↗ · pubmed ↗
6Schueller SM, Torous J. Scaling evidence-based treatments through digital mental health. Am Psychol. 2020;75(8):1093–104. doi: 10.1037/amp 0000654 33252947 PMC 7709142 · doi ↗ · pubmed ↗
7Naslund JA, Aschbrenner KA, Araya R, Marsch LA, Unützer J, Patel V, et al. Digital technology for treating and preventing mental disorders in low-income and middle-income countries: a narrative review of the literature. Lancet Psychiatry. 2017;4(6):486–500. doi: 10.1016/S 2215-0366(17)30096-2 28433615 PMC 5523650 · doi ↗ · pubmed ↗
8Vaidyam AN, Wisniewski H, Halamka JD, Kashavan MS, Torous JB. Chatbots and Conversational Agents in Mental Health: A Review of the Psychiatric Landscape. Can J Psychiatry. 2019;64(7):456–64. doi: 10.1177/0706743719828977 30897957 PMC 6610568 · doi ↗ · pubmed ↗