Editorial: Mental health in the age of artificial intelligence
Jasmine M. Noble, Kyooseob Ha, Andrew James Greenshaw

Abstract
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Mental Health Interventions · Mental Health via Writing · Artificial Intelligence in Healthcare and Education
Introduction
Artificial intelligence (AI) is rapidly maturing, with applications spanning the full spectrum of healthcare, from foundational research to clinical implementation. Its use has extended beyond lab-based proofs of concept into real-world settings (1), including but not limited to clinical documentation (2, 3), administrative triage (4, 5), diagnostic support (6), and conversational agents (7, 8). These innovations hold the potential to expand access to care (9, 10), personalize treatment (7, 11–13), predict risk (14, 15), and ease the administrative burdens (2, 4) on strained systems.
Despite the global prevalence and economic burden of mental health conditions (16), the clinical adoption of AI in the field of psychiatry remains limited (13). This lag reflects the intrinsic complexity of the field, which challenges conventional algorithmic approaches and underscores the need for AI solutions that are ethically grounded, reproducible, contextually adaptive, and attuned and supportive to the nuances of human experience.
The Research Topic, “Mental Health in the Age of Artificial Intelligence”, explores this rapidly shifting terrain through five timely contributions. Each publication offers a unique lens, spanning methodological, empirical, cultural, and existential dimensions, with ethical reflection as a common, unifying thread. Together they converge on a single, urgent imperative: for AI to be more widely trusted and in turn adopted in the mental health ecosystem, it must be designed, deployed, and evaluated in ways that are accountable, reproducible, fair, and deeply human-centered.
These themes surface immediately in the first contribution. The question of reproducibility and interpretability has become a cornerstone of responsible AI (13, 17), and Celeste et al. tackle it head-on in their contribution, “A software pipeline for systematizing machine learning of speech data”. Here they offer a suite of configurable software pipelines built within Python Luigi which are then used to test the reproducibility of three machine learning studies, involving depression, mild cognitive impairment, and aphasia, as a proof of concept. Authors then warn of the reproducibility crisis and argue that the ability to reproduce machine learning experiments, including model configurations, optimal hyperparameters, validation predictions, and performance metrics, is not merely a methodological ideal, but a moral and professional responsibility.
The ethical conversation then shifts to raise the spotlight on the human experience. Lee et al in “Artificial intelligence conversational agents in mental health: Patients see potential, but prefer humans in the loop,” gathers feedback from patients with self-reported mild to moderate anxiety in terms of their experiences, perceptions, and acceptability of mental health AI conversational agents (CAs). This timely study poses a clinically meaningful question at a critical juncture in the adoption of digital mental health tools, highlighting the importance of amplifying the patient’s voice within a values-based framework that is increasingly recognized as best practice (13, 18). The authors findings remind us while there are perceptions of utility and benefit in terms of potential increases in access to care through use of AI chatbots, the perceived risk of lack of empathy, concerns over privacy and other technical limitations of these models remain leading concerns, and participants signalled a consistent preference for “human-in-the-loop” models wherein AI serves as an extender of care, not a replacement for it. This notion is reinforced in a recent review, where while AI-based CAs were found to be more effective for clinical and subclinical populations, the need persists to “untangle the complex interplay” between a variety of factors, including when “human support is indispensable.” (19).
The ethical conversation continues in, Denecke and Gabarron’s “The ethical aspects of integrating sentiment and emotion analysis in chatbots for depression intervention,” where authors explore the ethical dimensions of sentiment analysis in chatbots seeking to support individuals with the provision of depression specific interventions. Reflecting the cautions of other authors in other works included in the special topic, the importance of balance is reiterated, as misclassification of emotion/harms increase risk of inappropriate or missed system responses, risk detection, and risk escalation. Authors emphasize the importance of thoughtfully integrating chatbots into care settings under the supervision of qualified health professionals. They stress that emotion should be treated as a complex, clinically significant, and ethically sensitive signal that demands careful and responsible handling. This reflects findings from other recent meta-analysis of AI-based CAs, where while a large effect size for AI-based CA is observed for the mitigation of psychological distress (especially when multi-modal), they can still generate “unnatural or repetitive interactions, potentially reducing clinical effectiveness.” (19).
Questions about acceptability and safety are magnified when AI systems are designed to generate text, make inferences, or interact dynamically with users. In “Applications of large language models in psychiatry: a systematic review,” Omar et al. synthesize a growing body of evidence on the use of large language models (LLMs) in mental health contexts. While they identify promising applications in clinical reasoning, educational tools, and even therapeutic support, the review also highlights critical issues in the underestimation of suicide risk, inconsistency in complex scenarios, and the lack of rigorous safety evaluation. Their findings align with emerging international concerns and caution in the field, such as declining medical safety messaging in generative AI models which are estimated to have reduced from ~26% to 1% over the last 2–3 years (20). While LLMs offer flexibility and scalability, their implementation should proceed cautiously and with significant oversight.
Finally, Alkhalifah et al. remind us to reflect on the human experience. In “Existential anxiety about artificial intelligence (AI): is it the end of humanity era or a new chapter in the human revolution?,” The authors explore public perceptions of AI’s role in society, and in particular, its psychological and existential consequences. Drawing on survey data from a public population in Saudi Arabia, they find significant levels of AI-related anxiety, including fears of human obsolescence, concerns of unpredictability, and sense of emptiness. The authors acknowledge this underlying unease as a sentiment that warrants attention and consideration, positioning it as pertinent to broader discussions about AI adoption, particularly its influence on social systems and its deeper implications for our understanding of what it means to be human.
Taken together, these five articles provide a cross-section of where the field stands, while offering an ethical and reflective pathway forward. Each surface that while the technological promise of AI is real and growing, its ethical, clinical, and human foundations require careful consideration. We must ensure systems are reproducible in their development, calibrated and fair in their outputs, interpretable in their logic and iterations, accountable in their consequences, and deeply and deliberately human-centered.
We must do all this while juggling the need to fully realize the potential of these innovations, while ensuring the appropriate safeguards are in place to protect end users. In a world arguably gripped by what Günther Anders’ described as “Promethean Shame,” a sense of human inadequacy in the face of our own technological creations, we find ourselves drawn to this promise to transcend our biological limits, even as we fail to grasp the full intended and unintended repercussions of these innovations, and the ways it unsettles deeply held values, purposes, and understandings of meaning (21).
At the same time, the accelerating momentum of commercial AI development, often obscured by proprietary opacity, is outpacing our existing systems for evaluation, governance, and ethical oversight. As this monopolized and monetized structure threatens to consolidate power further, we must confront a pressing question: on whose terms is mental health care evolving? Again, we see that the future of AI in this space is not solely a technical matter. It is also a clinical, philosophical, and political one, requiring sustained dialogue, shared standards, and a commitment to human-centered care.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Han R Acosta J Shakeri Z Ioannidis J Topol E Rajpurkar P . Randomised controlled trials evaluating artificial intelligence in clinical practice: a scoping review. Lancet Digital Health. (2024) 6:e 367–73. doi: 10.1016/S 2589-7500(24)00047-5, PMID: 38670745 PMC 11068159 · doi ↗ · pubmed ↗
- 2Olson K Meeker D Troup M Barker T Nguyen V Manders J . Use of ambient AI scribes to reduce administrative burden and professional burnout. JAMA Netw Open. (2025) 8:e 2534976. doi: 10.1001/jamanetworkopen.2025.34976, PMID: 41037268 PMC 12492056 · doi ↗ · pubmed ↗
- 3Bracken A Reilly C Feeley A Sheehan E Merghani K Feeley I . Artificial intelligence (AI) – powered documentation systems in healthcare: A systematic review. J Med Syst. (2025) 49:28. doi: 10.1007/s 10916-025-02157-4, PMID: 39966286 PMC 11835907 · doi ↗ · pubmed ↗
- 4Garcia P Ma S Shah S Smith M Jeong Y Devon-Sand A . Artificial intelligence–generated draft replies to patient inbox messages. JAMA Netw Open. (2024) 7:e 243201. doi: 10.1001/jamanetworkopen.2024.3201, PMID: 38506805 PMC 10955355 · doi ↗ · pubmed ↗
- 5Hu D Guo Y Zhou Y Flores L Zheng K . A systematic review of early evidence on generative AI for drafting responses to patient messages. NPJ Health Syst. (2025) 2:27. doi: 10.1038/s 44401-025-00032-5 · doi ↗
- 6Martinez-Gutierrez J Kim Y Salazar-Marioni S Tariq M Abdelkhaleq R Niktabe A . Automated large vessel occlusion detection software and thrombectomy treatment times: A cluster randomized clinical trial. JAMA Neurol. (2023) 80:1182–90. doi: 10.1001/jamaneurol.2023.3206, PMID: 37721738 PMC 10507590 · doi ↗ · pubmed ↗
- 7Nayak A Vakili S Nayak K Nikolov M Chiu M Sosseinheimer P . Use of voice-based conversational artificial intelligence for basal insulin prescription management among patients with type 2 diabetes: A randomized clinical trial. JAMA Netw Open. (2023) 6:e 2340232. doi: 10.1001/jamanetworkopen.2023.40232, PMID: 38039007 PMC 10692866 · doi ↗ · pubmed ↗
- 8Noble JM Zamani A Gharaat M Merrick D Maeda N Lambe Foster A . Developing, Implementing, and Evaluating an Artificial Intelligence–Guided Mental Health Resource Navigation Chatbot for Health Care Workers and Their Families During and Following the COVID-19 Pandemic: Protocol for a Cross-sectional Study JMIR. Res Protoc. (2022) 11:e 33717. doi: 10.2196/33717, PMID: 35877158 PMC 9361145 · doi ↗ · pubmed ↗
