Decoding brain structure-function dynamics in health and in psychosis via an autoencoder

Qing Cai; Hannah Thomas; Vanessa Hyde; Pedro Luque Laguna; Carolyn B. McNabb; Krish D. Singh; Derek K. Jones; Eirini Messaritaki

PMC · DOI:10.1038/s41598-025-24232-z·November 14, 2025

Decoding brain structure-function dynamics in health and in psychosis via an autoencoder

Qing Cai, Hannah Thomas, Vanessa Hyde, Pedro Luque Laguna, Carolyn B. McNabb, Krish D. Singh, Derek K. Jones, Eirini Messaritaki

PDF

Open Access

TL;DR

This paper uses a deep-learning model to better understand how brain structure relates to function in health and psychosis, showing stronger connections in certain brain activity bands.

Contribution

A deep-learning framework using a Graph Multi-Head Attention AutoEncoder outperforms analytical models in predicting functional connectivity from structural brain data.

Findings

01

The deep-learning model achieved mean correlation coefficients above 0.8 in alpha and beta bands for healthy participants.

02

Structure-function coupling in psychosis differs significantly from healthy individuals, with pronounced changes in the model's predictions.

03

Analytical models better capture delta and theta band dynamics in psychosis, while deep-learning models are needed for alpha and beta bands.

Abstract

Understanding the intricate relationship between brain structure and function is a cornerstone challenge in neuroscience, critical for deciphering the mechanisms that underlie healthy and pathological brain function. In this work, we present a comprehensive framework for mapping structural connectivity measured via diffusion-MRI to resting-state functional connectivity measured via magnetoencephalography, utilizing a deep-learning model based on a Graph Multi-Head Attention AutoEncoder. We compare the results to those from an analytical model that utilizes shortest-path-length and search-information communication mechanisms. The deep-learning model outperformed the analytical model in predicting functional connectivity in healthy participants at the individual level, achieving mean correlation coefficients higher than 0.8 in the alpha and beta frequency bands, in comparison to 0.45 for…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases1

psychosis

Figures14

Click any figure to enlarge with its caption.

Scatter plots of the oFC-pFC correlation coefficients versus age, when the FC is predicted via the analytical model. The 5 columns refer to the 5 SC edge weights, from left to right: NS, NS/v, FA, iRD and FRt. The best-fit line is shown. The correlation coefficients and p-values are also shown at the top of each plot. Colored font / line indicate that the related *p*-value survived FDR correction, while grey indicates that it did not.

The oFC-pFC correlations of the psychosis participants (green markers) are overlaid on the equivalent correlation distributions of the healthy participants (boxplots and clouds of points), when the analytical model is used to predict the FC. Each marker shape indicates a different psychosis participant. The asterisks indicate the frequency bands and edge weights for which the two distributions are statistically significantly different after multiple comparison correction.

Correlations between oFC and pFC for the 5 psychosis participants for the GMHA-AE model (yellow markers) and the analytical model (green markers) across FC frequency bands and SC edge weights. Each marker shape denotes a different psychosis participant. The asterisks denote the cases for which the differences in the distributions are statistically significantly different after FDR correction.

Edge strength variability in the healthy participants for the 4 frequency bands. Notably, the variability is higher in the delta band.

Analysis pipeline. From left to right: *Data collection* comprises MRI data collected on a 3T Connectom scanner and MEG data collected on a 275-channel gradiometer. *Processing* includes cleaning of the data, tractography, cortical parcellation, MEG source localization, and results in tractograms, cortical parcellations and activation time series for each participant. *SC matrices* are generated from the tractograms by mapping them onto the cortical parcellations. *FC matrices* are generated from the MEG time series via amplitude-amplitude correlations. The matrices are then fed into two disti

Histograms of the fraction of edge weights of the normalized SC matrices for the 5 choices of edge weight. The histograms are over all connections and all healthy participants.Fig. 3Histograms of the fraction of edge weights of the normalized FC matrices for the 4 frequency bands. The histograms are over all connections and all healthy participants.

The oFC-pFC correlations of the psychosis participants (green markers) overlaid on the corresponding correlation distributions of the healthy participants (boxplots and clouds of points), for the case in which the FC is predicted with the GMHA-AE model. Each marker shape indicates a different psychosis participant. Asterisks indicate statistically significant differences (after FDR correction) between the distributions for the healthy participants and those with psychosis. Table 3*p*-values for the Mann-Whitney U test comparing the correlation distributions for the healthy participants and the

Distributions of correlations between SC (left panel) / FC (right panel) matrices of people with psychosis to those of the healthy participants, for each edge weight / frequency band. The dashed lines show the mean of the correlations between the matrices of the healthy participants for comparison. The labels ’pat 1-5’ indicate the 5 different psychosis participants..

Distributions of the connection strength of the healthy participants (boxplots) in comparison to the connection strength of the psychosis participants (green markers). Each marker indicates one of the 5 participants with psychosis. The *p*-values of the the Mann-Whitney U test for the total edge strength distributions between healthy participants and those with psychosis are given in the legend. None of the values survived FDR multiple-comparison correction.

Left panel: Distributions (over the 126 participants) of the correlation coefficients between oFC and pFC derived with the GMHA-AE model, when no permutations of the SC matrices have been done. Right panel: Distribution (over the 126 participants) of the mean (over the 500 permutations) of the oFC-pFC correlations derived with the GMHA-AE model.Table 2*p*-values for the t-test between the correlation distributions for oFC and pFC for pairs of SC edge weights for each frequency band. Left 4 columns: prediction with the autoencoder model; right 4 columns: predictions with the analytical model. B

Scatter plots of the oFC-pFC correlation coefficients versus age. The 5 columns refer to the 5 SC edge weights, from left to right: NS, NS/v, FA, iRD and FRt. The best-fit line is shown. The correlation coefficients and *p*-values are also shown at the top of each plot. None of the *p*-values were statistically significant after FDR correction.

Left panel: Distributions (over the 126 participants) of the correlation coefficients between oFC and pFC derived with the analytical model when no permutations of the SC matrices have been done. Right panel: Distributions (over the 126 participants) of the mean values (over permutations) of the oFC-pFC correlations derived with the analytical model.

Funding2

—https://doi.org/10.13039/100010269Wellcome Trust
—https://doi.org/10.13039/100004440Wellcome

Keywords

Neural circuitsNeuroscienceComputational neuroscienceDynamical systemsNetwork models

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFunctional Brain Connectivity Studies · Advanced Neuroimaging Techniques and Applications · Schizophrenia research and treatment

Full text

Introduction

Quantifying the mapping between structural connectivity (SC) and functional connectivity (FC) in the human brain is a major goal of neuroscience research^1^. Studying the SC-FC relationship in healthy and patient populations can reveal the origins and evolution of neurological disorders. It can enhance our understanding of the structural underpinnings of functional disorders, shedding light on whether such conditions primarily arise from structural damage or functional impairments. Current neuroimaging techniques, such as magnetic resonance imaging (MRI), functional MRI (fMRI), electroencephalography (EEG) and magnetoencephalography (MEG), allow for the non-invasive measurement of SC and FC.

Various methodologies have been used to relate SC to fMRI-measured FC^2–5^, including forward-generating models that link the brain’s structure to FC^6–9^, and analytical models that assume specific communication mechanisms between brain areas (such as shortest path, search information, navigation, etc.)^10^. These studies confirm that interactions among brain areas are influenced by the broad network of white matter connections in addition to shortest paths. SC has also been linked to electrophysiologically-measured FC^11–16^, including via the use of analytical models^17^. This is of particular interest, given that electrophysiological FC reflects brain connectivity at the milli-second time scale, i.e., the actual time scale of functional interactions in the human brain.

Even though analytical models capture a significant fraction of the SC-FC relationship and elucidate the mechanisms via which communication happens between brain areas, they cannot be guaranteed to capture all aspects of their complex association. Machine-learning approaches, on the other hand, are well-suited to this kind of complexity, as recently shown for fMRI-measured FC^18^. To effectively link the complex nature of SC and FC patterns, we require a model that can naturally express these connectivity relationships. Graph Neural Networks (GNNs) are specifically designed for this purpose. GNNs are neural networks tailored for graph-structured data, where nodes represent entities and connections (edges) denote the relationships between them^19^. GNNs excel at handling these graph structures by iteratively aggregating information from neighboring nodes, thereby creating enriched representations of each node. This process captures the complex interactions within the graph and enhances the model’s understanding of the underlying relationships.

In this work, we investigate for the first time how well a GNN model can predict electrophysiological resting-state FC from SC. We leverage node attributes and local topological information derived from SC matrices to generate high-dimensional representations that capture the properties of the underlying connectome. To achieve this, we employ a Graph Multi-Head Attention Autoencoder (GMHA-AE) based on multi-head attention mechanisms. Autoencoders, first introduced in 1986^20^, are neural networks that consist of an encoder and a decoder. The encoder compresses the input data into a latent representation, while the decoder reconstructs the original data from this representation. Attention mechanisms allow the model to selectively focus on the most relevant parts of the input data during predictions, much like humans concentrate on the most relevant details when processing complex information. Multi-head attention extends this concept by enabling the network to focus on multiple parts of the input simultaneously, capturing diverse patterns in the data. We also compare the results derived via the GMHA-AE to those derived via an analytical model that combines the shortest-path-length and search-information algorithms^10,21^, incorporating communication via both direct and indirect structural connections.

Electrophysiological FC contains rich information about the frequency characteristics of the signals in the brain, and reflects connectivity at the milli-second time scale, i.e., the actual time scale of functional interactions in the human brain. Recent work showed that the ability to predict MEG-measured FC via analytical models depends on both the frequency band of the FC and the structural measure used to assign strength to the connections of the SC matrices^17^. In this work, we test whether that is also the case when a GMHA-AE model is used to relate FC to SC.

Due to the similarities of connectivity patterns in healthy brains, several studies have used group-averaged structural and functional connectomes when calculating the SC-FC relationship. Although undoubtedly informative, those analyses cannot capture individualized aspects of the SC-FC relationship such as age dependence. A question that naturally arises is how specific to a given participant the group-derived relationship is. This is also of interest when considering populations that span a wide age range (because the SC-FC relation changes across the lifespan^22,23^) or when studying patient populations^24^.

With the above-mentioned considerations in mind, here we answer the following questions:

Is the GMHA-AE or the analytical model better at predicting electrophysiological resting-state FC from SC?
Does the predictive ability of each model depend on the MEG frequency band for which the resting-state FC is calculated?
Does the predictive ability of each model depend on the structural measure that is used to quantify connection strength in the SC matrices?
Is there an effect of participant age in the predictive ability of each model?
Is there an effect of disease in the predictive ability of each model? Specifically for the last question, our expectation is that a given model’s performance in elucidating the human brain’s SC-FC relationship will be different when that is applied to healthy participants compared to when that is applied to patients, as has been demonstrated in studies employing fMRI-measured FC^24^.

Methods

The study was approved by the Cardiff University School of Psychology Ethics Committee (EC.18.08.14.5332RA3). All participants gave written informed consent. All methods were performed in accordance with the relevant guidelines and regulations.

A schematic of the analysis pipeline is shown in Fig. 1.Fig. 1. Analysis pipeline. From left to right: Data collection comprises MRI data collected on a 3T Connectom scanner and MEG data collected on a 275-channel gradiometer. Processing includes cleaning of the data, tractography, cortical parcellation, MEG source localization, and results in tractograms, cortical parcellations and activation time series for each participant. SC matrices are generated from the tractograms by mapping them onto the cortical parcellations. FC matrices are generated from the MEG time series via amplitude-amplitude correlations. The matrices are then fed into two distinct algorithms for FC prediction: the Graph-Attention Mechanism Autoencoder and the Analytical Algorithm. The fainter arrows linking oFC to the algorithms indicate that the oFC is needed for model assessment.

Data acquisition and processing

Sample

MRI and MEG data were collected from 126 healthy participants between the ages of 18 and 50 (73 female), 101 of whom were between the ages of 18 and 35 (55 female) via the Welsh Advanced Neuroimaging Database (WAND) study^25,26^. The age distribution of the healthy participants is given in Figure S1 of the Supplementary Material.

Data were also collected from 5 participants with psychosis, between the ages of 18 and 35 (3 female). Four of the psychosis participants had been diagnosed between 9 and 12 months before taking part in the study, while one was diagnosed 6 years before taking part. Importantly for the purposes of this study, the psychosis participants underwent the Positive and Negative Syndrome Scale (PANSS) test^27^ to assess symptom severity, and were on anti-psychotic medication. Table 1 shows the PANSS scores^27^, as a total score and individually for each category, reflecting positive, negative, and general psychopathology.Table 1PANSS scores of the psychosis participants.PsychosisTotalPositiveNegativeGeneral13798202591118303491413224419923556161129

The details of the acquisition^28^ and pre-processing are described in the Supplementary material. The pre-processing was performed using FreeSurfer^29–41^, FSL^42–44^ and MRtrix^43,43,45–58^ for the MRI data, and Fieldtrip^59^ and in-house pipelines as previously described^60^ for the MEG data.

Connectivity matrices

The result of this part of the analysis are connectivity matrices for each participant. The matrices are derived with the nodes being the cortical and subcortical brain areas defined via the Desikan-Killiany atlas^61^.

There are 5 SC matrices; in each the edges are weighted with a different structural measure: number of reconstructed streamlines (NS), volume-normalized NS (NS/v), fractional anisotropy (FA), inverse radial diffusivity (iRD), and total restricted signal fraction (FRt). Our choice of edge weights is motivated by the fact that SC matrices weighted with those are good at predicting FC from both fMRI^2,10^ and MEG^17^, and provide good reproducibility for the SC matrices and their graph theoretical properties when used as edge weights^62,63^. Additionally, the FA is related to the myelination and axonal characteristics of white matter tracts and therefore is a good proxy for SC^10,17,64–66^, the iRD is related to the myelination of the white matter tracts^67–69^ and has been used in studies in health and disease^17,70^, and the FRt^71^ is attributed to water within the intra-axonal space and therefore related to axonal characteristics of the white matter tracts.

There are 4 resting-state FC matrices, one for each of the 4 MEG frequency bands: delta (1-4 Hz), theta (4-8 Hz), alpha (8-13 Hz) and beta (13-30 Hz). These 4 matrices reflect the observed FC (oFC).

Finally, there is the Euclidean distance (ED) matrix of each participant, in which each edge is the distance between the centers of the Desikan-Killiany brain areas^61^.

Prediction of FC

Our aim was to achieve a mapping from a participant’s SC to that participant’s FC. The resting-state FC from each frequency band was predicted for each participant individually via each of the two models (GMHA-AE and analytical), using that participant’s structural connectome, separately for each of the 5 edge weightings. Gõni et al^10^ showed that, when an analytical model is used, the prediction of FC from SC is improved by including the ED matrix as a predictor. For that reason, we include the ED matrices in the analytical model. In order to keep the GMHA-AE model comparable to the analytical one, we included the ED matrices alongside the SC matrices when predicting the FC via the GMHA-AE model, thus incorporating both connection strength (via the SC matrices) and anatomical proximity (via the ED matrices) in our models. Each of the two models resulted in 20 predicted FC (pFC) matrices for each participant, one for each frequency-band/edge-weight combination.

GMHA-AE model

We constructed a model based on a GMHA-AE, the structure of which is depicted in Fig. 1. The encoder, equipped with multi-head attention, processes the input data SC (which include the ED matrices) to generate a latent representation that captures both direct and indirect connections within the brain. Specifically, the SC matrix serves as the weighted adjacency matrix, defining the graph’s topology and the strength of connections between the brain regions (nodes). The ED matrix, on the other hand, is used to construct the initial node feature matrix. The features include statistics such as the mean distance from that node to all other nodes, the standard deviation of its distances, and the distance to its nearest anatomical neighbour. These features are then concatenated with node attributes to form a feature vector for each node which encodes both anatomical proximity and connectivity properties, and is used as the initial input to the first layer of the GMHA-AE, alongside the SC adjacency matrix. This approach allows the model to leverage both white-matter pathways (from SC) and the physical layout of the brain (from ED) from the very first layer of processing.

The latent representation is then used by the decoder to reconstruct the FC matrix. By reconstructing the FC matrix, the decoder implicitly models the mapping from SC to FC, completing the autoencoder’s function. Throughout this process, the multi-head attention mechanism provides a flexible approach that captures the complex relationships inherent in brain connectivity data. More details on the autoencoder are provided in the Supplementary material, including detailed descriptions of the handling of the data in the encoder and the decoder.

We employed a rigorous five-fold cross-validation protocol to evaluate the model’s generalization performance on the healthy participant dataset. To ensure a complete separation between training and evaluation data and prevent any form of data leakage, the dataset was split at the participant level. In each of the five iterations, one fold (approximately 20% of participants) was designated as the held-out test set. The remaining four folds (approximately 80% of participants) were used for model development, which included splitting them further into training and validation sets for parameter training and hyperparameter tuning. This protocol guarantees that the data used to report the model’s performance was never seen by the model during any stage of its training or selection process, providing an unbiased estimate of its performance on unseen data. The final performance metrics reported in our results are aggregated from the predictions across all five test folds.

Parameters were optimized and selected using grid search based solely on the training data. The total number of training epochs was set to 200, with a batch size of 32, a learning rate of $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$1 \times 10^{-3}$$\end{document}$ , and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\gamma$$\end{document}$ in the objective function set to 0.01.

The GMHA-AE model was first trained on the 126 healthy participants and used to predict their FC. For the analysis that pertains to the participants with psychosis, it was first trained on the 101 healthy participants that were 18 - 35 years old (for a detailed explanation of this choice, please see Section ”Psychosis” below). Once trained and validated, the data of the psychosis participants were passed through it, and their FC was predicted.

Analytical model

A number of algorithms that model communication between brain areas were described in an earlier publication^10^ and implemented in the Brain Connectivity Toolbox^72^. These algorithms calculate the potential predictors of FC based on a SC matrix, employing different methods to identify the optimal structural links between brain areas, and then use regression to generate a pFC that best matches the oFC. Many algorithms assume that stronger structural connections are more likely to be utilized, and several of them take into account both direct and indirect structural connections between brain areas.

In this work, we used the shortest-path-length (SPL) and search-information (SI) algorithms^10^ to derive two predictors for the analytical model. This was used for the healthy participants and the participants with psychosis. The ED matrices of the participants were used in the analytical model as an additional predictor because, as mentioned earlier, they can increase the model’s predictive ability^10,17^.

Statistical analysis

Characteristics of the connectivity matrices

The 5 SC matrices derived for each participant, each weighted with a different structural measure, are intended to represent the structural connectome of that participant. To quantify how different these matrices are and evaluate whether they would result in differences when predicting FC from SC, we histogramed the connection strengths of the SC for each metric across the 126 healthy participants.

Prediction accuracy

To evaluate the accuracy of the predictions, we calculated the correlation coefficients between the oFC for each frequency band and the pFC generated for that frequency band with each of the 5 SC matrices, for the two models. To quantify how the choice of edge weight impacted the ability of the structural connectome to predict FC we used a paired t-test to compare the distributions of the oFC-pFC correlations for the 5 SC edge weights, for each frequency band, for each of the two models.

Individuality of the SC-FC relationship

To test how individual (i.e., specific to each participant) the relationship between FC and SC is, we shuffled the order of the SC matrices of the 126 healthy participants while keeping the order of the FC matrices the same, and repeated the FC prediction with the two models. Thus, the FC of each healthy participant was predicted using the SC of another healthy participant. We then calculated the correlation coefficient between pFC and oFC. We repeated this process 500 times for each of the two models, and derived the mean of the correlation coefficients for each healthy participant.

Age dependence

To evaluate the impact of participant age on the relationship between SC and FC, we calculated the correlation coefficients between the participant age and the oFC-pFC correlation coefficients, for both the GMHA-AE and the analytical model, for the 126 healthy participants.

Psychosis

We tested the hypothesis that the SC-FC relationship is altered as a result of psychosis in comparison to healthy participants. People with psychosis exhibit brain alterations that mimic aging, such as cognitive dysfunction^73^, dendritic spine loss^74,75^ and cortical atrophy^76^. To prevent the results of our comparison being diluted by those aging-like effects, we compared the participants with psychosis to the healthy participants that matched the ages of the people with psychosis, i.e., in the age range of 18 to 35 years. This left 101 healthy participants to perform the comparison on.

We performed a sensitivity analysis to evaluate the effect size that we can detect with our sample of 101 healthy participants and 5 psychosis participants^77,78^. The analysis showed that with those samples, we have power 0.8 to detect an effect with d=1.61.

To compare the connectomes of participants with psychosis to those of the healthy participants we calculated correlations between the respective connectivity matrices, for each frequency band for the FC matrices and each edge weight for the SC matrices. These correlations were then compared with the correlations of the SC matrices and the FC matrices for the healthy participants.

We used a Mann-Whitney U test to compare the distributions of the total connection strength (sum of all connections) of the healthy participants to those of the participants with psychosis, for the 5 edge weights for the SC matrices and the 4 frequency bands for the FC matrices.

To quantify any differences in the SC-FC relationship between healthy participants and participants with psychosis, we used a Mann-Whitney U test to compare the distributions of the oFC-pFC correlation coefficients between participants with psychosis and healthy participants, for each of the FC-frequency-band/SC-edge-weight combinations.

In order to assess whether the accuracy of prediction of each model was related to the symptom severity of the participants with psychosis, we calculated the Spearman correlations between the SC-FC prediction error (i.e., 1 - (oFC-pFC correlation)) and the PANSS scores.

To compare the performance of the two models in predicting FC for the participants with psychosis, we performed permutation testing.

In our analyses, all p-values were corrected for multiple comparisons using the false-discovery-rate (FDR) algorithm^79,80^.Fig. 2. Histograms of the fraction of edge weights of the normalized SC matrices for the 5 choices of edge weight. The histograms are over all connections and all healthy participants.Fig. 3. Histograms of the fraction of edge weights of the normalized FC matrices for the 4 frequency bands. The histograms are over all connections and all healthy participants.

Results

Characteristics of the connectivity matrices

The distributions of edge weights for the SC matrices weighted with different structural metrics (Fig. 2) indicate the distinct nature of the SC networks they represent. The distributions of edge weights for the FC matrices in the 4 frequency bands are shown in (Fig. 3).

Comparison of connectivity matrices of healthy participants to psychosis participants

The SC and FC matrices of the participants with psychosis were compared to the matrices of the 101 healthy participants who were between the ages of 18 and 35 years.

The NS-weighted and NS/v-weighted SC matrices of the psychosis participants exhibited high correlations with those of the healthy participants, while the ones for the other edge weights did not (left panel of Fig. 4). However, all those correlations were comparable to the correlations between SC matrices of the healthy participants (the means of the correlations for healthy participants are shown in the same figure as dashed lines for comparison).Fig. 4. Distributions of correlations between SC (left panel) / FC (right panel) matrices of people with psychosis to those of the healthy participants, for each edge weight / frequency band. The dashed lines show the mean of the correlations between the matrices of the healthy participants for comparison. The labels ’pat 1-5’ indicate the 5 different psychosis participants..

The FC matrices of the people with psychosis exhibited moderate correlations with the FC matrices of the 101 healthy participants aged 18 - 35 years (right panel of Fig. 4) but those correlations were also comparable with those between FC matrices of healthy participants.Fig. 5. Distributions of the connection strength of the healthy participants (boxplots) in comparison to the connection strength of the psychosis participants (green markers). Each marker indicates one of the 5 participants with psychosis. The p-values of the the Mann-Whitney U test for the total edge strength distributions between healthy participants and those with psychosis are given in the legend. None of the values survived FDR multiple-comparison correction.

We compared the total connectivity strength of the SC and FC matrices of the participants with psychosis to those of the healthy participants (Fig. 5). The p-values resulting from the Mann-Whitney U tests between the distributions of the connectivity strength for the healthy participants and those with psychosis are given in that figure. The only p-value that was under 0.05 is the one for the beta band FC, but it did not survive FDR multiple-comparison correction.

Our results for the SC matrices are in agreement with those of earlier work^81^, which demonstrated no differences in the NS-based total SC strength and a small reduction in the FA-based total SC strength (in our cohort, the latter showing a trend towards smaller values for the psychosis participants in comparison to the healthy participants; respective means: 2300 and 2370). They are also in agreement with the results reported in a different study^82^, considering the fact that most of our psychosis participants had been recently diagnosed.

Our results for the FC matrices are in agreement with the results presented previously^83^, where no statistically significant differences were found between the EEG amplitude-amplitude connectivity of schizophrenia patients and healthy participants. As pointed out in the review article^84^, there is heterogeneity in the results coming from MEG- and EEG-based connectivity studies of psychosis.

To conclude, in agreement with the results of existing literature, the SC and FC matrices of the psychosis participants exhibit some small differences to those of the healthy participants, when the summary statistics shown in Figs. 4 and 5 are taken into consideration.

Prediction of FC with the GMHA-AE model

Maintaining the participant link between SC and FC

The GMHA-AE model was good at predicting the oFC (left panel of Fig. 6) with the mean of the correlations being above 0.8 for the alpha and beta frequency bands and above 0.4 for the delta and theta bands. The only exception was the prediction of the oFC with the NS-based SC matrices for the delta band, for which the correlation coefficients between oFC and pFC were low.Fig. 6. Left panel: Distributions (over the 126 participants) of the correlation coefficients between oFC and pFC derived with the GMHA-AE model, when no permutations of the SC matrices have been done. Right panel: Distribution (over the 126 participants) of the mean (over the 500 permutations) of the oFC-pFC correlations derived with the GMHA-AE model.Table 2p-values for the t-test between the correlation distributions for oFC and pFC for pairs of SC edge weights for each frequency band. Left 4 columns: prediction with the autoencoder model; right 4 columns: predictions with the analytical model. Bold-faced p-values are not statistically significant after FDR correction. The p-value threshold resulting from the application of the FDR correction was 0.003.Autoencoder modelAnalytical modeldeltathetaalphabetadeltathetaalphabetaNS - NS/v $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$2 \times 10^{-4}$$\end{document}$ 0.12 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$8 \times 10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ NS - FA $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$5 \times 10^{-5}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ 0.73NS - iRD $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$6 \times 10^{-5}$$\end{document}$ 0.71 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-8}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-6}$$\end{document}$ NS - FRt $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4 \times 10^{-9}$$\end{document}$ 0.02 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-6}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4 \times 10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-7}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ NS/v - FA $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-3}$$\end{document}$ NS/v - iRD $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4 \times 10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ NS/v - FRt $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4 \times 10^{-9}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ FA - iRD0.004 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$6 \times 10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-8}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ FA - FRt $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$6 \times 10^{-8}$$\end{document}$ 0.07 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ 0.47 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$8 \times 10^{-4}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$ iRD - FRt $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-9}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$4 \times 10^{-6}$$\end{document}$ 0.02 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-3}$$\end{document}$ 0.05 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$3 \times 10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-3}$$\end{document}$ $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$< 10^{-10}$$\end{document}$

Breaking the participant link between SC and FC

The pFC calculated via the GMHA-AE model after shuffling the SC matrices of the healthy participants exhibited a level of correlation with the oFC similar to that when the SC matrices of the participants had not been shuffled (right panel of Fig. 6). We note that the figures on the two panels of Fig. 6 are not directly comparable, because the left panel shows the distribution of correlations over the 126 participants when the SC matrices have not been shuffled, while the right panel shows the distribution over the 126 participants of the mean over the 500 shufflings of the oFC-pFC correlations.

Age dependence

For the pFC calculated via the GMHA-AE model, we found no statistically significant correlations between participant age and the oFC-pFC correlation coefficients after FDR correction (Fig. 7). This indicates that the predictive ability of the GMHA-AE model is stable across ages and not altered with participant age, at least for healthy adults in the age range of 18 - 50 years.Fig. 7. Scatter plots of the oFC-pFC correlation coefficients versus age. The 5 columns refer to the 5 SC edge weights, from left to right: NS, NS/v, FA, iRD and FRt. The best-fit line is shown. The correlation coefficients and p-values are also shown at the top of each plot. None of the p-values were statistically significant after FDR correction.

Psychosis

Fig. 8. The oFC-pFC correlations of the psychosis participants (green markers) overlaid on the corresponding correlation distributions of the healthy participants (boxplots and clouds of points), for the case in which the FC is predicted with the GMHA-AE model. Each marker shape indicates a different psychosis participant. Asterisks indicate statistically significant differences (after FDR correction) between the distributions for the healthy participants and those with psychosis. Table 3p-values for the Mann-Whitney U test comparing the correlation distributions for the healthy participants and the psychosis participants, when FC is predicted via the autoencoder model (left 4 columns) and the analytical model (right 4 columns). All p-values shown in the table that are lower than 0.05 remained statistically significant after FDR correction. The p-value threshold that resulted from the application of the FDR correction was 0.014. Autoencoder model

Analytical model deltathetaalphabetadeltathetaalphabetaNS0.000180.000170.000180.000170.00020.600.180.22NS/v0.480.000180.000170.00020.00020.540.520.26FA0.000170.100.000170.000170.00140.600.00270.0035iRD0.000170.000170.000180.000170.00270.490.0140.0046FRt0.000170.000170.000170.000170.00020.860.100.0091

When the GMHA-AE model that had been trained on healthy participants was used to calculate the pFC of the psychosis participants, the correlations between oFC and pFC were lower for all frequency bands and all SC edge weights compared to the healthy participants (Fig. 8). The p-values of the Mann-Whitney U test comparing the distributions of the healthy participants to those with psychosis were all less than 2 $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\times 10^{-4}$$\end{document}$ (Table 3) and survived FDR correction, with the exception of the NS/delta and FA/theta pairs (p = 0.48 and 0.10 respectively). The Cohen’s d for the comparison between the distributions for the healthy and psychosis participants are given in Table S2 of the Supplementary Material. All values are larger than 0.8, indicating a large effect.

The Spearman correlation coefficients between the PANSS scores and the prediction error of the GMHA-AE model are given in Table 4. All the related p-values were greater than 0.05, indicating no significant relationship between symptom severity and the prediction accuracy of the model.Table 4. Spearman correlations between the PANSS scores of the psychosis participants and the prediction error of the GMHA-AE model.NSNS/vFAiRDFRtdelta0.70.30.3−0.10.3theta−0.2−0.2−0.1−0.2−0.2alpha−0.1−0.1−0.1−0.1−0.1beta−0.1−0.1−0.1−0.1−0.1

To conclude, the GMHA-AE model is very good at predicting the FC of healthy participants across SC edge weights and frequency bands, except when the delta band FC is predicted with NS-based weights. The performance of the model is stable across ages, but it does show a drop when psychosis participant FC data is passed through it.

Prediction of FC with the analytical model

Maintaining the participant link between SC and FC

Fig. 9. Left panel: Distributions (over the 126 participants) of the correlation coefficients between oFC and pFC derived with the analytical model when no permutations of the SC matrices have been done. Right panel: Distributions (over the 126 participants) of the mean values (over permutations) of the oFC-pFC correlations derived with the analytical model.

When the analytical model that combines the SPL and SI algorithms was used to predict the healthy participants’ FC, the correlations between the oFC and pFC depended on the frequency band of the observed FC and on the edge weight used in the SC matrices, as shown on the left panel of Fig. 9. The higher frequency bands resulted in higher correlations between observed and predicted FC, indicating that a combination of the SPL and SI algorithms is better at predicting FC in higher rather than lower frequencies. Additionally, NS-based structural metrics (NS and NS/v) were worse at predicting FC in all 4 frequency bands considered as opposed to the other 3 structural metrics.

The paired t-test between the correlation distributions for each pair of SC edge weights showed that the distributions are statistically significantly different after FDR correction with very few exceptions (Table 2). The NS-based SCs were generally worse at predicting the oFC than the other 4 edge weights (Fig. 9).

Breaking the participant link between SC and FC

When the SC matrices of the participants were shuffled, so that the SC matrix used in the analytical model to predict the FC no longer corresponded to the same participant, small differences were observed in the correlation coefficients between oFC and pFC (Fig. 9). As previously, we note that the figures on the two panels of Fig. 9 are not directly comparable, because the left panel shows the distribution of correlations over the 126 participants when the SC matrices have not been shuffled, while the right panel shows the distribution over the 126 participants of the mean over the 500 shufflings of the oFC-pFC correlations.

Age dependence

Fig. 10. Scatter plots of the oFC-pFC correlation coefficients versus age, when the FC is predicted via the analytical model. The 5 columns refer to the 5 SC edge weights, from left to right: NS, NS/v, FA, iRD and FRt. The best-fit line is shown. The correlation coefficients and p-values are also shown at the top of each plot. Colored font / line indicate that the related p-value survived FDR correction, while grey indicates that it did not.

We found statistically significant correlations (after FDR multiple comparison correction) between the participant age and the oFC-pFC correlation coefficients derived with the analytical model for the iRD SCs (all frequency bands), for the FA SC matrices (theta, alpha and beta frequency bands), and the FRt SC matrices (alpha and beta frequency bands) as shown in Fig. 10. The correlations were negative, indicating a weakening of the predictive power of the analytical model as participants age.

Psychosis

A Mann-Whitney U test between homologous distributions of oFC-pFC correlations showed that the psychosis participants exhibited statistically significant differences in the correlations between oFC and pFC for the delta band (all edge weights), the alpha band (for the FA- and iRD-weighted SC matrices) and the beta band (for SC matrices weighted with FA and FRt) in comparison to the healthy participants. The correlations for the psychosis participants were higher in the delta band and lower in the alpha and beta bands (Fig. 11). This indicates that the analytical model is a better representation of the communication mechanisms between brain areas in psychosis participants than it is in healthy participants in the delta band, while the opposite is true in the alpha and beta bands.Fig. 11. The oFC-pFC correlations of the psychosis participants (green markers) are overlaid on the equivalent correlation distributions of the healthy participants (boxplots and clouds of points), when the analytical model is used to predict the FC. Each marker shape indicates a different psychosis participant. The asterisks indicate the frequency bands and edge weights for which the two distributions are statistically significantly different after multiple comparison correction.

Table 3 gives the p-values for the Mann-Whitney U test between the healthy-participant and the psychosis-participant distributions of the correlations, for each frequency band and edge weight. The Cohen’s d for the comparison between the distributions for the healthy and psychosis participants are given in Table S3 of the Supplementary Material. Most absolute values are larger than 0.7, indicating a large effect.

The Spearman correlation coefficients between the PANSS scores and the prediction error of the analytical model are given in Table 5. All the related p-values were greater than 0.05, indicating no significant relationship between symptom severity and the prediction accuracy of the model.Table 5. Spearman correlations between the PANSS scores of the psychosis participants and the prediction error of the GMHA-AE model.NSNS/vFAiRDFRtdelta0.60.3-0.50.50.5theta0.10.10.10.10.2alpha−0.1−0.1−0.100beta0.30.30.30.30.1Table 6The p-values resulting from the permutation testing to compare the correlation distributions for the psychosis participants between the GMHA-AE and the analytical model. p-values that remain statistically significant after FDR correction are bold-faced. The p-value threshold that resulted from the application of the FDR procedure was 0.012..deltathetaalphabetaNS0.0040.0120.016****0.012NS/v0.0080.0600.0400.016FA0.0040.0400.0960.004iRD0.004****0.0120.0560.012FRt0.004****0.0040.1270.060

To conclude, we observe a dependence of the predictive ability of the analytical model on the frequency band of the FC, and on the SC edge weight used. The performance of the model decreases with age. Notably, it is better at predicting the delta band FC of psychosis participants, but shows a stable, somewhat decreased performance for psychosis participants in the alpha and beta frequency bands.

Comparison of results of the two models for the psychosis participants

Fig. 12. Correlations between oFC and pFC for the 5 psychosis participants for the GMHA-AE model (yellow markers) and the analytical model (green markers) across FC frequency bands and SC edge weights. Each marker shape denotes a different psychosis participant. The asterisks denote the cases for which the differences in the distributions are statistically significantly different after FDR correction.

Figure 12 shows the oFC-pFC correlations for the psychosis participants for the GMHA-AE and the analytical model. The distributions of the correlations are statistically significantly different for all FC frequency bands and SC edge weights. The p-values resulting from the permutation testing are shown in Table 6. Notably, the GMHA-AE model performs better than the analytical one for the alpha and beta frequency bands, while the opposite is true for the delta and theta bands. The Cohen’s d for the comparison between the distributions for the two models are given in Table S4 of the Supplementary Material. All absolute values are larger than 1.0, indicating a large effect.

Discussion

To the best of our knowledge, this is the first study to investigate the relationship between SC and electrophysiological resting-state FC with an attention-based deep-learning model, in both health and in psychosis.

The GMHA-AE model resulted in pFC that better matches the oFC compared to the analytical model, across the 4 frequency bands and 5 edge weights, with the exception of the delta band/NS-based edge weights combination. Additionally, in the alpha and beta frequency bands the mean of the correlation distributions was above 0.8 across SC edge weights, which is higher than the 0.55 for the distributions of the individual-level correlations reported in the fMRI-based study described in an earlier study by Sarwar et al^18^ (in which the NS was used to assign SC strength between regions). This difference in performance could be due to differences in our deep-learning model compared to that in their study^18^, the fact that we use electrophysiological FC which is not confounded by the haemodynamic response delay, or the fact that in that study^18^ they did not use information on the Euclidean distance between nodes of the networks, which can improve the FC prediction^10^. An exact comparison between our MEG-based study and their fMRI-based one is not feasible due to the different aspects of FC that the two modalities measure. When comparing our results to those of that study^18^, we do so using our results for the alpha and beta bands because those give FC that is the most comparable to fMRI-measured FC^12^. However, in accordance with the conclusion reached in that study^18^ about fMRI-measured FC, our results imply that FC measured via electrophysiological recordings is tightly linked to SC derived via tractography, and that relationship is present at individual-participant level. Therefore, existing models of communication between brain areas can be improved to incorporate sources of that interdependence^1^ that could increase the correlations.

Both the GMHA-AE and the analytical model were better at predicting the FC in the alpha and beta frequency bands in comparison to the delta and theta bands. This is in agreement with the results presented in earlier work^17^, which examined a cohort of 90 healthy participants and investigated different microstructural measures as edge weights for the SC matrices (NS, FA, iRD). That study found lower oFC-pFC correlations compared to the correlations we observed here, across functional frequency bands and SC edge weights, which could be attributed to differences in the tractography algorithm used, and the quality of the diffusion-MRI data acquired (the MRI data for this study were collected on a Connectom scanner that has high diffusion gradients as opposed to the MRI scanner used for the data collected in^17^; MEG data were collected in the same system for both studies).

The structural measure that was used as edge weight in the SC matrices impacted how well the oFC was predicted, mainly for the analytical model. Edge weights that are related to myelin or axonal characteristics (FA, iRD and FRt) resulted in better FC prediction than those related to NS when the analytical model was used, with the differences in the respective distributions being highly statistically significant (Table 2). This indicates that, if the analytical model is representative of communication between brain areas, resting-state FC is supported by myelin and axonal characteristics more than it is by the relative number of axonal projections between brain areas. When the GMHA-AE model was used, SC matrices with edge weights related to myelin and axonal measures gave better prediction for the delta and theta frequency bands (except when theta-band FC was predicted using the iRD-weighted SC matrices). That model gave a much more stable performance across SC edge weights for the alpha and beta bands, even though the distributions were statistically significantly different. This likely points to an ability of the GMHA-AE model to capture patterns in the SC matrices that pertain to communication between brain areas, regardless of the edge weights used. We now elaborate on the possible reasons for this.

The model’s ability to capture consistent communication patterns across different SC edge weightings stems primarily from two key features of its architecture: the multi-head attention mechanism and the learning of high-dimensional node representations. Unlike the analytical model, which operates on rules determined by the edge weights (e.g., the path length is the inverse of the edge weight), in our GMHA-AE model the multi-head attention mechanism allows the model to dynamically learn the importance of each connection in a context-dependent manner. Each head can learn a different set of attention weights, effectively focusing on different types of structural information. For instance, one head might learn that high FA is crucial for certain connections, while another might learn to prioritize the existence of a path, regardless of its weight. By thusly aggregating information, the model can build a holistic understanding of brain communication that is not dependent on a single structural metric. Furthermore, the model does not operate on the raw graph structure alone. Through its layers, it computes high-dimensional embeddings for each brain region (node). These embeddings capture a node’s direct connections but also its role within its local neighborhood and the broader network topology. The GNN is adept at learning this topological information, which is more stable across weighting schemes than the raw edge values themselves. In essence, the model learns a representation of the brain’s communication architecture that includes and goes beyond the values of the edge weights. This is why its predictive performance remains high and stable across edge weights.

We also note that for the GMHA-AE model, when the delta band FC was predicted with the NS-weighted or the NS/v-weighted SC matrices, there was a subset of the sample for which the oFC-pFC correlations were high, and another subset for which the correlations were low (Fig. 6). The poor performance for the latter subset points to the fact that some of the test sets may contain individuals whose brain connectivity differs significantly from those in the training set, or that the model is learning specific characteristics from the training data that do not apply to the test data. This can make it difficult for the model to generalize properly and thus it does not perform well across all folds. This issue is present in the delta band - we observed that there is higher variability in the edge strengths in participants in the delta band compared to the other 3 bands (Fig. 13). Specifically, the median values of the variability were 0.21, 0.19, 0.18 and 0.17 for the delta, theta, alpha and beta bands respectively. The distribution of the delta band was statistically significantly different from the other 3, with p-values of the related t-test less than $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$10^{-10}$$\end{document}$ in all cases. This implies that there are more varied edge strengths that the model is trying to predict in the delta band, leading to the difference in performance between the delta and the other bands. Measures of performance of the GMHA-AE model are given in Table S1 of the Material.Fig. 13. Edge strength variability in the healthy participants for the 4 frequency bands. Notably, the variability is higher in the delta band.

We observed no significant change in the predictive ability of the two models when the SC matrices of the healthy participants were shuffled (i.e., the FC of a participant was predicted by the SC of a different participant). This was true across structural edge weights and FC frequency bands. It is an indication of the similarity of the connectomes and the communication mechanisms in healthy participants, and is in contrast to the results obtained when the structural and functional matrices of the psychosis patients were used.

In order to investigate the interpretability of our GMHA-AE model and understand what features it prioritizes, we conducted an analysis on its learned latent space. We performed t-distributed Stochastic Neighbor Embedding (t-SNE) to visualize the high-dimensional latent representations of all test samples in a 2-dimensional space. The results of the analysis are shown in Fig. 14. Each point in the plot corresponds to a participant, and it is colored by the model’s prediction error (MSE) for that participant. The resulting visualization provides compelling evidence of a well-structured latent space, where the model has learned to organize the high-dimensional representations of participants in a meaningful way, such that their geometric arrangement reflects an important underlying property: the predictability of their FC from their SC. A distinct, tight cluster of participants with high prediction error is clearly located in the bottom-right corner. This group represents less typical or harder to predict participants. The separation itself is evidence that the model has learned to identify individuals with ’atypical’ or hard-to-predict SC-FC mappings and groups them together in a specific region of its internal latent space. On the other hand, the majority of participants, for which the model achieved low-to-moderate prediction error, form another large, distinct group. Their congregation in a single, continuous region suggests that the model has identified a common set of SC-FC relationships. The model effectively learned to map the complex neurobiological characteristics of most participants onto a coherent geometric structure. This structural separation demonstrates that our model has successfully learned to distinguish between typical and atypical patterns in the SC-to-FC mapping. It prioritizes features that are indicative of this predictability, effectively translating the difficulty of the prediction task into geometric separation in the latent space. Overall, the visualization does more than just show a distribution of errors. It demonstrates that the model has translated the abstract concept of ’predictability’ into a concrete geometric organization. The distance and relationship between points in the latent space correspond to the similarity in their SC-FC relationships, demonstrating the model’s ability to learn salient features from the data.

We note here that a more detailed interpretability investigation could include other methods, for example an ablation analysis or other. This falls beyond the scope of this work, which was to compare the GMHA-AE model to an analytical one and to explore the impact of psychosis participants on them, and will be explored in future work.Fig. 14t-SNE analysis of the GMHA-AE model.

We observed some interesting relationships with age. The oFC-pFC correlation coefficients derived via the analytical model correlated negatively with healthy participant age for the non-NS weighted SC matrices (surviving FDR correction). The correlations were strongest for the alpha frequency band and the iRD-weighted SC matrices. This indicates that the analytical model is a better model of communication between brain areas for younger healthy participants than for older ones. It also points to the fact that alternative analytical models should be explored for describing communication between brain areas at different times across the life span. This result is in agreement with the results of an earlier study^22^, which reported decreases of the SC-FC relationship with age for fMRI-based FC. We also note that a similar trend was observed for a cohort of adolescent participants^23^. The fact that there is no correlation between participant age and oFC-pFC correlation coefficients when pFC is derived via the GMHA-AE model indicates that that model is more robust when it comes to detecting such relationships than the analytical model, and can capture intricate relationships even as those change over time.

We now discuss our results that pertain to psychosis. The GMHA-AE model that was trained on the healthy participant data, and which gave high prediction accuracy for those participants, did poorly at predicting the oFC of the psychosis participants, with the differences between the distributions of healthy and psychosis participants being highly statistically significant after FDR correction (Table 3). Despite the fact that the GMHA-AE model does not provide direct information about the communication mechanisms that underlie the SC-FC relationship, this finding clearly points to the fact that the features that the GMHA-AE model captures in the healthy-participant SC and FC matrices, and which result to it predicting oFC very well, are likely not present, or reduced, in the SC and/or FC connectomes of the psychosis participants.

In contrast to the poorer performance of the GMHA-AE model for the psychosis participants across frequency bands, the analytical model gave much better predictions for the FC of psychosis participants than that of healthy participants for the delta band, across SC edge weights. As the p-values for the differences between the healthy versus the psychosis participant distributions of the correlation coefficients indicate (Table 3), the difference is highly statistically significant. This indicates that the SPL-SI algorithm employed in our analytical model is a better representation of the communication mechanisms that are responsible for the evocation of delta band resting-state FC in psychosis than it is for healthy participants. When looking at the beta frequency band, however, the prediction accuracy of the same analytical model was lower for psychosis participants compared to healthy participants for the FA-, and FRt-weighted SC matrices; the same trend was present for the iRD-weighted SC matrices (p = 0.0307, but that p-value did not survive FDR correction). No statistically significant differences were observed in the correlation distributions between healthy participants and psychosis participants for the theta and alpha bands, with the exception of the $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{$$\end{document}$ alpha band / FA $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\}$$\end{document}$ and $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\{$$\end{document}$ alpha band / iRD $\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\}$$\end{document}$ combinations, where a reduction in the prediction accuracy was observed. These results are in agreement with the results reported in a different study^24^, where healthy participants exhibited larger correlation between the average FC (measured via fMRI) and white-matter microstructure in comparison to schizophrenia patients.

The performance of the two models on the psychosis participants depends strongly on the FC frequency band (Fig. 12). Specifically, the analytical model performs much better in the delta and theta frequency bands, while the GMHA-AE model performs better in the alpha and beta bands. Keeping in mind that the GMHA-AE model was trained on healthy participants, this implies that the healthy-participant communication mechanisms are worse at describing the SC-FC relationship in psychosis participants compared to the simpler analytical model based on the SPL-SI algorithms. For the alpha and beta frequency bands, on the other hand, the complex communication patterns that the GMHA-AE model captures from the healthy participants are better at describing the SC-FC relationship than the analytical model.

It is important to stress that, above and beyond the small FC-specific and SC-specific alterations that we observed in the psychosis participants (Figs. 4 and 5), we observed alterations in the relationship between SC and FC which were much more pronounced, in particular when that relationship was predicted via the autoencoder model. Our study shows that investigating the SC-FC relationship goes further than the summary statistics presented in Figs. 4 and 5, and identifies differences in the relationships between the structural and functional connections.

One way to improve the GMHA-AE model for the participants with psychosis would be to employ fine-tuning or domain adaptation techniques. This is not possible in our study: due to the small number of participants with psychosis, there is a risk of overfitting, where the model would learn participant-specific features rather than a generalizable disease pattern. We also note that such techniques, even though able to improve the model for the psychosis participants, would have a detrimental impact on its predictive ability for the healthy participants. They would shift the model from a normative benchmark that can help understand the divergence of the psychosis SC-FC relationship from that in healthy participants.

Our study has some limitations. Firstly, the GMHA-AE model does not provide insight into the communication mechanisms between brain areas. This lack of insight is, however, compensated by the fact that the model can capture complex relationships that analytical algorithms of brain communication cannot fully incorporate. Future work will target the interpretability and explainability of the model. Secondly, even though the healthy participant sample we used was large (126 participants) the number of psychosis participants was low (5). This is an issue with many similar studies - in particular some psychosis participants who volunteered for our study were not eligible to be scanned in our state-of-the-art Connectom MRI scanner, resulting in a smaller sample than a study using more conventional scanners would have. However, the statistical tests we used when comparing healthy to psychosis participants are appropriate to use when there is such a difference in the two samples, and therefore the differences we detected between healthy and psychosis participants are robust. The high statistical significance of our results indicates the need for followup studies of larger cohorts.

To conclude, in this work, we proposed a brain SC-FC mapping framework based on the Graph Multi-Head Attention AutoEncoder. By employing an autoencoder, our GMHA-AE model is capable of efficiently handling the distributed and heterogeneous patterns of the brain and of learning complex relationships between brain structure and function by utilizing attention mechanisms. We compared the GMHA-AE performance to that of an analytical model. We also used both models to investigate the SC-FC relationship in psychosis participants and compare it to that of healthy participants, and identified crucial differences in that relationship between the two groups. In future, the use of deep-learning models could be explored to quantify the SC-FC relationship for younger age groups, for task-based electrophysiological recordings, as well as for neurological disorders other than psychosis.

Supplementary Information

Supplementary Information.

Bibliography7

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Rumelhart, D., Hinton, G. & Williams, R. ”learning internal representations by error propagation”. Parallel Distributed Processing, Volume I: Foundations. MIT Press, Cambridge, MA (1986).
2Mc Nabb, C. et al. The welsh advanced neuroimaging database: an open-source state-of-the-art resource for brain research. In Proce. Ann. Meet. ISMRM 241 (2024).
3Dhollander, T., Raffelt, D. & Connelly, A. Unsupervised 3-tissue response function estimation from single-shell or multi-shell diffusion mr data without a co-registered t 1 image. ISMRM Workshop on Breaking the Barriers of Diffusion MRI 5 (2016).
4Jenkinson, M., Beckmann, C. F., Behrens, T. E., Woolrich, M. W. & Smith, M. A. Fsl. Neuro Image 62, 782–790 (2012).10.1016/j.neuroimage.2011.09.01521979382 · doi ↗ · pubmed ↗
5Grissom, R.J. & Kim, J.J. Effect sizes for research: Univariate and multivariate applications (2nd ed.). Routledge (2012).
6Cohen, J. Statistical power analysis for the behavioral sciences (2nd ed.). Routledge (1988).
7Groppe, D. fdr-bh. MATLAB Central File Exchange (2019).