Latent Space Data Fusion Outperforms Early Fusion in Multimodal Mental Health Digital Phenotyping Data

Youcef Barkat; Dylan Hamitouche; Deven Parekh; Ivy Guo; David Benrimoh

arXiv:2507.14175·cs.LG·July 22, 2025

Latent Space Data Fusion Outperforms Early Fusion in Multimodal Mental Health Digital Phenotyping Data

Youcef Barkat, Dylan Hamitouche, Deven Parekh, Ivy Guo, David Benrimoh

PDF

TL;DR

This study demonstrates that latent space data fusion significantly improves the accuracy and robustness of predicting depressive symptoms from multimodal mental health data compared to early fusion methods.

Contribution

It introduces a latent space fusion approach using autoencoders and neural networks, outperforming traditional early fusion models in psychiatric data prediction tasks.

Findings

01

Latent space fusion achieved lower mean squared error and higher R2 scores.

02

The combined model maintained consistent generalization across data splits.

03

Early fusion models showed signs of overfitting and less robustness.

Abstract

Background: Mental illnesses such as depression and anxiety require improved methods for early detection and personalized intervention. Traditional predictive models often rely on unimodal data or early fusion strategies that fail to capture the complex, multimodal nature of psychiatric data. Advanced integration techniques, such as intermediate (latent space) fusion, may offer better accuracy and clinical utility. Methods: Using data from the BRIGHTEN clinical trial, we evaluated intermediate (latent space) fusion for predicting daily depressive symptoms (PHQ-2 scores). We compared early fusion implemented with a Random Forest (RF) model and intermediate fusion implemented via a Combined Model (CM) using autoencoders and a neural network. The dataset included behavioral (smartphone-based), demographic, and clinical features. Experiments were conducted across multiple temporal splits…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.