A Comprehensive Review of Datasets for Clinical Mental Health AI Systems

Aishik Mandal; Prottay Kumar Adhikary; Hiba Arnaout; Iryna Gurevych; Tanmoy Chakraborty

arXiv:2508.09809·cs.CL·August 19, 2025

A Comprehensive Review of Datasets for Clinical Mental Health AI Systems

Aishik Mandal, Prottay Kumar Adhikary, Hiba Arnaout, Iryna Gurevych, Tanmoy Chakraborty

PDF

Open Access

TL;DR

This paper surveys existing clinical mental health datasets for AI, highlighting gaps in data diversity, standardization, and accessibility, and offers recommendations to improve dataset quality for better AI system development.

Contribution

It provides the first comprehensive categorization and analysis of clinical mental health datasets, identifying critical gaps and proposing standards for future dataset curation.

Findings

01

Datasets are scattered and under-documented.

02

Limited cultural, linguistic, and modality diversity.

03

Significant gaps in longitudinal and synthetic data.

Abstract

Mental health disorders are rising worldwide. However, the availability of trained clinicians has not scaled proportionally, leaving many people without adequate or timely support. To bridge this gap, recent studies have shown the promise of Artificial Intelligence (AI) to assist mental health diagnosis, monitoring, and intervention. However, the development of efficient, reliable, and ethical AI to assist clinicians is heavily dependent on high-quality clinical training datasets. Despite growing interest in data curation for training clinical AI assistants, existing datasets largely remain scattered, under-documented, and often inaccessible, hindering the reproducibility, comparability, and generalizability of AI models developed for clinical mental health care. In this paper, we present the first comprehensive survey of clinical mental health datasets relevant to the training and…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMental Health Research Topics