Applying and Evaluating Large Language Models in Mental Health Care: A Scoping Review of Human-Assessed Generative Tasks
Yining Hua, Hongbin Na, Zehan Li, Fenglin Liu, Xiao Fang, David, Clifton, John Torous

TL;DR
This scoping review assesses the current use of large language models in mental health care, highlighting their potential and the need for standardized evaluation, ethical considerations, and transparency for clinical integration.
Contribution
It provides a comprehensive overview of human-assessed generative tasks involving LLMs in mental health, emphasizing gaps in evaluation and ethical issues.
Findings
Limited standardized evaluation methods used in studies
Concerns about privacy, safety, and fairness in applications
Reliance on proprietary models raises transparency issues
Abstract
Large language models (LLMs) are emerging as promising tools for mental health care, offering scalable support through their ability to generate human-like responses. However, the effectiveness of these models in clinical settings remains unclear. This scoping review aimed to assess the current generative applications of LLMs in mental health care, focusing on studies where these models were tested with human participants in real-world scenarios. A systematic search across APA PsycNet, Scopus, PubMed, and Web of Science identified 726 unique articles, of which 17 met the inclusion criteria. These studies encompassed applications such as clinical assistance, counseling, therapy, and emotional support. However, the evaluation methods were often non-standardized, with most studies relying on ad hoc scales that limit comparability and robustness. Privacy, safety, and fairness were also…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health via Writing
MethodsRefunds@Expedia|||How do I get a full refund from Expedia? · Attention Is All You Need · Cosine Annealing · Linear Layer · Dropout · Residual Connection · Linear Warmup With Cosine Annealing · Discriminative Fine-Tuning · Multi-Head Attention · Attention Dropout
