Balancing Privacy and Utility in Child and Adolescent Mental Health Services Research: Retrospective Cohort Study on Synthetic Data Generation
Mounir Haizoune, Bennett L Leventhal, Dipendra Pant, Øystein Nytrø, Kaban Koochakpour, Roman A Koposov, Lars Ravn Øhlckers, Norbert Skokauskas

TL;DR
This study shows that synthetic data can protect privacy while maintaining usefulness for child and adolescent mental health research.
Contribution
A hierarchical synthetic data generator is shown to preserve data utility and privacy in CAMHS data.
Findings
Synthetic data achieved high statistical similarity with real data across multiple metrics.
Privacy risks were minimal under simulated reidentification attacks.
Models trained on synthetic data performed nearly as well as those on real data.
Abstract
Electronic health records are essential for advancing research aimed at improving clinical outcomes. However, stringent data protection and privacy concerns severely limit the accessibility and use of real clinical data, particularly within Child and Adolescent Mental Health Services (CAMHS) involving vulnerable young individuals. This challenge can be effectively addressed through synthetic data generation, which safeguards individual privacy while facilitating comprehensive analyses of clinical information. This study aims to investigate whether hierarchical synthetic data generators (SDGs) can effectively replicate the statistical properties, preserve the utility, and maintain the privacy of real CAMHS clinical data, thereby enabling data sharing and broader access to research-ready datasets. This retrospective cohort study used electronic medical record data from 6924 distinct…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDigital Mental Health Interventions · Mobile Health and mHealth Applications · Ethics in Clinical Research
