How and why does deep ensemble coupled with transfer learning increase performance in bipolar disorder and schizophrenia classification?

Sara Petiton (NeuroSpin/GAIA); Antoine Grigis (NeuroSpin/GAIA); Benoit Dufumier (EPFL; NeuroSpin/GAIA); Edouard Duchesnay (NeuroSpin/GAIA)

arXiv:2604.02002·cs.AI·April 3, 2026

How and why does deep ensemble coupled with transfer learning increase performance in bipolar disorder and schizophrenia classification?

Sara Petiton (NeuroSpin/GAIA), Antoine Grigis (NeuroSpin/GAIA), Benoit Dufumier (EPFL, NeuroSpin/GAIA), Edouard Duchesnay (NeuroSpin/GAIA)

PDF

TL;DR

This study investigates how deep ensemble learning combined with transfer learning enhances classification accuracy and stability in bipolar disorder and schizophrenia, revealing optimal ensemble sizes and the impact of pre-training on generalization.

Contribution

It provides insights into the mechanisms by which DE and TL improve model stability and performance, including ensemble size effects and the influence of pre-training on model convergence.

Findings

01

Deep ensemble reaches performance plateau at 10 models.

02

Transfer learning constrains models to similar loss basins, improving generalization.

03

Using TL with DE significantly enhances classifier stability and accuracy.

Abstract

Transfer learning (TL) and deep ensemble learning (DE) have recently been shown to outperform simple machine learning in classifying psychiatric disorders. However, there is still a lack of understanding as to why that is. This paper aims to understand how and why DE and TL reduce the variability of single-subject classification models in bipolar disorder (BD) and schizophrenia (SCZ). To this end, we investigated the training stability of TL and DE models. For the two classification tasks under consideration, we compared the results of multiple trainings with the same backbone but with different initializations. In this way, we take into account the epistemic uncertainty associated with the uncertainty in the estimation of the model parameters. It has been shown that the performance of classifiers can be significantly improved by using TL with DE. Based on these results, we investigate…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.