Brain causality alterations in major depressive disorder treatment

Madhurima Bhattacharjee; Ioannis Vlachos; Aditi Kathpalia; Jaroslav Hlinka; Martin Brunovský; Martin Bareš; Milan Paluš

PMC · DOI:10.3389/fpsyt.2025.1718216·January 27, 2026

Brain causality alterations in major depressive disorder treatment

Madhurima Bhattacharjee, Ioannis Vlachos, Aditi Kathpalia, Jaroslav Hlinka, Martin Brunovský, Martin Bareš, Milan Paluš

PDF

Open Access

TL;DR

This study shows that early brain connectivity changes can predict treatment response in depression, helping choose better therapies faster.

Contribution

The study identifies early frequency-specific brain causality changes that distinguish treatment responders and nonresponders in MDD.

Findings

01

Pharmacological treatment increases β2 band information flow, especially in the right hemisphere.

02

Neurostimulation enhances δ band connectivity from the left hemisphere to the whole brain.

03

Baseline α band right-dominant connectivity predicts poor response to pharmacological treatment.

Abstract

Major depressive disorder (MDD) remains a leading cause of disability worldwide, with limited objective biomarkers to guide treatment selection and monitor early therapeutic effects. This study examines whether pharmacological and neurostimulation treatments produce distinct alterations in directional brain connectivity patterns within the first week of treatment and whether baseline connectivity patterns differ between treatment responders and nonresponders assessed at 4–6 weeks. We perform an information theory-based causality analysis on instantaneous phase time series derived from electroencephalography recordings of 176 MDD patients. Patients received either pharmacological treatment or neurostimulation and were recorded at baseline (visit 1) and one week post-treatment initiation (visit 2). We quantified directional information flow across 19 EEG channels in five frequency bands…

Linked entities

Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.

Species1

Homo sapiens(human · species)

Diseases2

Major depressive disorder MDD

Figures9

Click any figure to enlarge with its caption.

The adjacency matrices for the three cases of coupled Rössler oscillators with N = 3, 5, and 19 subsystems in (a–c), respectively, and the corresponding pPMIME causality matrices (RP values) for the coupling strength where the average of all real connections was maximum in (d–f). Average RP values ± standard error (shaded area above and below the average - barely visible in some cases) as a function of coupling strength for the three networks in (g–i). RP values for real connections are plotted in solid blue lines and those for non-connections in orange dotted lines. For clarity, in panel (i), instead of the individual lines, their averages are plotted.

Causality matrix of a subject, along with the regions that are used to define FLOW metrics for hemisphere-to-hemisphere interaction (a), per-hemisphere inflow (b) and per-hemisphere outflow (c).

The colormap displays p-values from the repeated measures ANOVA. All p-values above 0.05 are highlighted in yellow. R, Response; T, Treatment; V, Visit. Each row represents the results of a particular band. There are seven blocks corresponding to each factor, separated by a horizontal red line. The metrics corresponding to FLOW, INFLOW and OUTFLOW per channel are separated by a black vertical line.

Change in metric values between visit 1 (v1) and visit 2 (v2) in the β2 band for the pharmacological treatment. The y-axis of all panels (a–i) displays the metric values averaged over all subjects along with the standard errors.

Descriptions of all panels (a–i) are similar to Figure 4, but for the neurostimulation treatments and the δ band. The dashed lines in panels (e–g) indicate that there was no statistically significant difference in these cases.

Brain topographical plots for the comparison of inflow per channel between the two visits for the pharmacological group. The first row of panels is the p-values for the comparison. The 2nd and 3rd rows present the average values (across subjects) of the inflow for visit 1 (v1) and visit 2 (v2), while the 4th row presents the difference between the visits. Columns correspond to frequency bands, as indicated by the text above the top panels. Channels with p-values< 0.05 are indicated in green-colored text in the first row.

Similar to Figure 6, but for the neurostimulation group. Rows of the average inflow per channel for the two visits (2nd and 3rd rows in Figure 6) are omitted (given in Supplementary Materials, Figure 7).

Bar plots of the significantly different FLOW metrics between respondents (R) and nonrespondents (NR) to pharmacological treatment at the time of visit 1 in the α band.

Brain topographical plots for the comparison of inflow per channel between respondents (R) and nonrespondents (NR). The first row of panels is the p-values for the pharmacological (Ph) group, and the second row is the difference in inflow between R and NR averaged over all subjects. The 3rd and 4th rows are the same as the first two but for the neurostimulation (St) group. Columns correspond to frequency bands, as indicated by the text above the top panels. Channels with p-values< 0.05 are indicated in green-colored text in the 1st and 3rd rows.

Tables4

Table 1. Patient demographics and information. Respondents (R), nonrespondents (NR), number of patients (n).

Characteristics	All Subjects (n=176)	Pharmacological treatment subjects (n=129)		Stimulation treatment subjects (n=47)
Characteristics	All Subjects (n=176)	R (n=68)	NR (n=61)	R (n=16)	NR (n=31)
AGE	45.78 ± 11.31	47.58 ± 10.24	43.93 ± 11.83	48.06 ± 11.42	44.29 ± 12.20
SEX (F:M)	128:48	54:14	40:21	12:4	22:9
MADRS Score
Baseline	27.56 ± 3.83	27.23± 3.41	27.90 ± 4.39	25.18 ± 2.04	28.87± 3.68
Final	16.23 ± 8.29	9.04 ± 3.74	22.32 ± 5.33	9.68 ± 2.77	23.41 ± 6.33
% Change	41.52 ± 27.85	66.78 ± 13.00	19.42 ± 17.32	61.50 ± 10.62	19.28 ± 18.12
Illness Duration (Weeks)	58.77 ± 73.73	51.72 ± 67.52	66.15 ± 75.94	89.56 ± 107.46	44.10 ± 57.29

Table 2. Total number of metrics/band combinations that provide significant p-values for different factors and their interactions.

Factors	FLOW	INFLOW	OUTFLOW	Total
Response	4	4	4	12
Treatment	10	16	10	36
Visit	6	5	6	17
Response : Treatment	8	5	3	16
Response : Visit	1	3	3	7
Treatment : Visit	8	16	2	26
Response : Treatment : Visit	0	5	0	5

Table 3. P-values for the comparison of the FLOW metrics between visits 1 and 2 for the two treatment groups. p-values< 0.05 are in bold font.

EEG Band	TF	LL	LR	out_L	RR	RL	out_R	in_L	in_R
Pharmacological: visit 1 vs visit 2
δ	0.931	0.931	0.931	0.931	0.931	0.931	0.931	0.931	0.931
θ	0.064	0.312	0.082	0.136	0.500	0.136	0.251	0.064	0.064
α	0.718	0.718	0.718	0.718	0.960	0.772	0.895	0.718	0.718
β ₁	0.837	0.765	0.765	0.765	0.302	0.302	0.302	0.769	0.967
β ₂	<0.001	0.049	0.029	0.031	0.023	0.0497	0.031	<0.001	<0.001
Neurostimulation: visit 1 vs visit 2
δ	0.009	0.002	0.006	0.002	0.297	0.551	0.510	0.010	0.010
θ	0.916	0.218	0.218	0.218	0.773	0.773	0.773	0.916	0.920
α	0.603	0.386	0.386	0.386	0.507	0.507	0.507	0.691	0.507
β ₁	0.862	0.761	0.761	0.761	0.761	0.761	0.761	0.862	0.994
β ₂	0.229	0.229	0.291	0.229	0.291	0.291	0.291	0.229	0.229

Table 4. P-values for the comparison of the FLOW metrics at visit 1 between respondents and nonrespondents for the two treatment groups. Significant p-values less than 0.05 are in bold font.

EEG band	TF	LL	LR	out_L	RR	RL	out_R	in_L	in_R
Pharmacological: respondents vs nonrespondents
δ	0.790	0.790	0.790	0.790	0.790	0.790	0.953	0.790	0.790
θ	0.516	0.131	0.131	0.131	0.876	0.876	0.876	0.516	0.516
α	0.040	0.818	0.688	0.760	0.040	0.040	0.040	0.040	0.040
β ₁	0.292	0.292	0.292	0.292	0.292	0.292	0.292	0.292	0.292
β ₂	0.914	0.974	0.963	0.968	0.914	0.914	0.914	0.914	0.914
Neurostimulation: respondents vs nonrespondents
δ	0.123	0.531	0.468	0.523	0.468	0.468	0.443	0.123	0.123
θ	0.391	0.391	0.391	0.391	0.944	0.944	0.944	0.391	0.391
α	0.765	0.765	0.765	0.765	0.765	0.765	0.765	0.777	0.765
β ₁	0.851	0.212	0.212	0.212	0.316	0.398	0.367	0.883	0.851
β ₂	0.146	0.146	0.119	0.142	0.119	0.119	0.119	0.119	0.244

Equations3

Keywords

braincausalityelectroencephalographymajor depressive disordermutual information

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsFunctional Brain Connectivity Studies · Mental Health Research Topics · Neural dynamics and brain function

Full text

Introduction

1

Major depressive disorder (MDD) is a widespread mental health condition affecting millions of people worldwide (1). It can contribute to various physiological illnesses characterized by enduring unhappiness, loss of interest in activities, and a range of physical and cognitive symptoms. Despite the development of novel treatment strategies for MDD, a critical clinical challenge remains: treatment selection is largely trial-and-error, with patients often enduring weeks of ineffective therapy before alternative approaches are considered. This prolonged period of treatment uncertainty stems from the absence of reliable early biomarkers to predict which patients will respond to specific treatment modalities and to monitor early therapeutic effects before clinical symptom changes manifest (2, 3). The diagnostic criteria for MDD predominantly focus on behavioral aspects and depend on self-reported symptoms, providing limited insight into underlying neurophysiological mechanisms that might guide treatment decisions. There is an urgent necessity for objective biomarkers of MDD to facilitate personalized treatment selection and enable early detection of therapeutic response. Determining efficient, non-invasive diagnostic and therapeutic approaches for MDD poses a significant challenge in clinical neuroscience. Herein two critical clinical questions are addressed: 1) Do pharmacological and neurostimulation treatments produce distinct, detectable alterations in brain connectivity patterns within the first week of treatment before clinical response manifests? 2) Can baseline directional connectivity patterns distinguish patients who will ultimately respond versus not respond to specific treatment modalities assessed after 4–6 weeks through patient self-reporting?

Electroencephalography (EEG) is a widely used, non-invasive neuroimaging technique that measures cerebral electrical activity with a high temporal resolution in the millisecond range, providing benefits over alternative brain imaging modalities. Numerous studies have examined changes in brain functional connectivity, i.e., the pattern of statistical dependencies between remote neurophysiological processes (4), in individuals with MDD, utilizing EEG data (5). However, most EEG analyses typically involves extraction of functional connectivity measures such as correlation, coherence or cordance derived from cerebral signals (6–11). These measures quantify the strength of relationships between brain regions but cannot determine the directionality of information flow. This limitation is significant because understanding which brain regions influence others may be critical for differentiating treatment mechanisms and identifying patients likely to respond to specific interventions. The data obtained from multiple scalp electrodes is analyzed across different frequency bands since these bands are linked to diverse neurophysiological processes and cognitive states. Connectivity patterns within and across these bands provide a more comprehensive and functionally relevant picture of brain dynamics.

Causality analysis addresses this gap by going beyond correlation or coherence to identify cause-and-effect relationships and directional influence patterns in brain networks. In MDD, several studies have already applied directional connectivity methods, including time- and frequency-domain Granger causality, to characterize abnormal information flow in cortico-limbic and fronto-parietal circuits. These approaches have mainly relied on models (typically linear or pre-specified non-linear) and have only rarely been extended to early treatment-related changes in large clinical cohorts. Comparatively little work has used information theory-based causality estimators (like transfer entropy) or estimators operating on phase dynamics to study MDD with EEG, particularly in multivariate set-ups and in the context of treatment prediction and early neurophysiological changes (12–15). The information-theoretic generalization of the Granger causality concept (16) provides a solid mathematical basis for several reliable causality inference methods (17, 18), including the phase dynamics adaptation used in this study (19). Analyzing causality in multi-channel EEG signals might enhance treatment stratification by revealing how different therapies alter the direction and intensity of causal influences among brain regions, thereby illuminating fundamental therapeutic mechanisms.

In this study, we perform a multivariate causality analysis on EEG data recorded from participants diagnosed with MDD and treated with medication or neurostimulation (specifically, low-frequency repetitive transcranial magnetic stimulation [rTMS] or transcranial direct current stimulation [tDCS] targeting the dorsolateral prefrontal cortex). While prior literature has established biomarkers based on non-directional measures, it remains unclear whether directional phase-based causality exhibits similar patterns. To mitigate this gap without bias, we adopt a data-driven approach to address the two aforementioned clinical questions, i.e., of detecting treatment-specific alterations in brain connectivity and of treatment response stratification from baseline connectivity patterns. Early identification of treatment-specific neurophysiological signatures and response-predictive patterns could be particularly advantageous in the initial phase of treatment, as it may facilitate prompt identification of patients unlikely to benefit from their current therapy, enabling timely transition to alternative interventions. Answering these questions could enable earlier, more informed treatment decisions and reduce the trial-and-error period that prolongs patient suffering.

To estimate causality across brain regions, we employ an information-theoretic method called partial mutual information from mixed embedding (PMIME) (19, 20), and examine alterations in brain connectivity and the directional influence among these regions following the initiation of antidepressant treatment. Traditional causality analysis methods such as Granger causality, transfer entropy or partial transfer entropy fail to capture nonlinear relationships in high-dimensional real-world systems. Furthermore, they either assume linear dependencies, are difficult to extrapolate to higher dimensions, are data demanding or are computationally expensive. PMIME addresses these challenges by leveraging mutual information and optimized embedding strategies, and is the only method capable of “general” causality analysis (i.e., no assumption of a model form) in high dimensions. PMIME is a model-free method that is widely applicable for identifying direct causal links between two components in multivariate systems, such as multi-channel EEG signals. This study employs pPMIME, which is a version of PMIME that operates on instantaneous phase time series of EEG data and whose performance has been extensively validated on artificial and real-world data. Phase-based causality analysis effectively reveals genuine causal relationships in weakly linked oscillatory systems, mitigating the effect of noise potentially introduced by amplitude variations. It is a well-known fact that weakly coupled oscillators can alter the phases of each other while their amplitudes remain unchanged (21). We leverage the oscillatory nature of EEG signals and employ phase-based causality analysis to achieve more reliable results. In Section 2 we present the methodology and data used in the analysis. In Section 3 we present the obtained results and then discuss them in Section 4.

Methods and materials

2

Phase-based causality analysis (pPMIME)

2.1

PMIME/pPMIME (19) is an iterative method that uses forward selection to estimate inherent lagged dependence in coupled multivariate nonlinear dynamical systems. It generates mixed embedding vectors from multiple observed time series of connected subsystems to reveal the causal relationships. The method employs mutual information/conditional mutual information (MI/CMI) $[eqn]$ / $[eqn]$ as a selection criterion and uses a statistical test as a stopping criterion of the forward selection procedure. MI measures the average shared information between random variables $[eqn]$ and $[eqn]$ , whereas CMI measures the same but conditioned by variable $[eqn]$ . Consider a coupled dynamical system consisting of $[eqn]$ subsystems. The time series data for these K variables are denoted by $[eqn]$ , where $[eqn]$ is the time index and $[eqn]$ is the index of the variables. PMIME aims to predict the $[eqn]$ -steps ahead future values of a target variable $[eqn]$ by finding a mixed embedding vector $[eqn]$ with an undetermined dimension that maximizes the MI term $[eqn]$ . The elements of $[eqn]$ belong to the superset $[eqn]$ , with the sets $[eqn]$ containing the lagged version of each variable, i.e. $[eqn]$ , $[eqn]$ , where $[eqn]$ is the maximum lag, which can be arbitrarily chosen to be appropriately large. Starting from an empty vector $[eqn]$ Ø, the embedding vector is populated sequentially using a forward selection procedure based on the maximization of MI/CMI. The CMI terms for step $[eqn]$ of the procedure are of the form $[eqn]$ , where $[eqn]$ is the embedding vector at the previous step ( $[eqn]$ ) and $[eqn]$ is any element of $[eqn]$ . The $[eqn]$ that maximizes the CMI term is selected to be added in $[eqn]$ in order to create the new $[eqn]$ . The whole procedure is terminated using a bootstrap resampling test as a stopping criterion, i.e., testing the null hypothesis $[eqn]$ . Detailed information regarding the entire procedure can be found in (19, 22). If for a given $[eqn]$ , any element of $[eqn]$ is contained in $[eqn]$ , then we can interpret that $[eqn]$ causes $[eqn]$ . A causality measure can be derived from the following normalized CMI term, bounded between $[eqn]$ and $[eqn]$ :

[eqn]

In Equation 1, $[eqn]$ is split into elements that belong to $[eqn]$ and all other elements, i.e., $[eqn]$ and $[eqn]$ corresponds to the proportion of information in $[eqn]$ which can be explained by the past values of $[eqn]$ when taking into account the values of the rest of $[eqn]$ . PMIME essentially tries to detect which of the present and past values (lags) of $[eqn]$ , $[eqn]$ , i.e., $[eqn]$ can be used to optimally predict the future values $[eqn]$ , where T can be any value of $[eqn]$ .

Similarly, when applied to phase time series ( $[eqn]$ ), the algorithm (pPMIME in this case) tries to detect which of the present and past lags of $[eqn]$ , i.e., $[eqn]$ for all $[eqn]$ , can be used to predict the values of the phase increments $[eqn]$ . Paluš and Stefanovska demonstrated that, in the bivariate case, $[eqn]$ yields more sensitive causality tests than $[eqn]$ itself (23). Equation 1 is modified accordingly using the phase variables.

[eqn]

The phase time series $[eqn]$ can be extracted from the original time series $[eqn]$ using various methodologies, but herein we employ the commonly used Hilbert transform (HT) method (24, 25). Recently, it was shown that for systems with well-behaving phases or narrowly bandpass filtered data (like the EEG filtered in EEG bands), values of $[eqn]$ and $[eqn]$ are sufficient to capture causality with reduced computational time (22), so this is adopted herein. With respect to the original pPMIME, we make another simple but intuitive logical change that is critical for mitigating volume conduction effects. Instead of starting from an empty embedding vector, we start with one that includes the $[eqn]$ lag of the target variable; i.e., if we are creating embedding vectors for $[eqn]$ then the embedding vector is initialized as $[eqn]$ . This initialization effectively substantially reduces the effect of zero-lag (instantaneous) interactions common in EEG due to volume conduction. Since the CMI criterion only selects source components that provide novel information about the future beyond what is known from the target’s own current state, strong instantaneous correlations (where source and target carry identical information) are recognized as redundant and excluded. This ensures that the derived metrics reflect genuine lagged information flow.

Artificial data and method benchmark

2.2

We benchmark the pPMIME method for three cases of basic coupled nonlinear oscillators. We study coupled Rössler systems (26, 27) given by sets of the following equations (one set for each oscillator):

[eqn]

where $[eqn]$ , and $[eqn]$ is the total number of subsystems or nodes. The parameters $[eqn]$ , $[eqn]$ , and $[eqn]$ were kept constant for all $[eqn]$ and were chosen so that all subsystems maintain a stable periodic orbit. The elements of the binary ( $[eqn]$ ) adjacency matrix $[eqn]$ are denoted by $[eqn]$ and define the connectivity structure of the system, with a value of $[eqn]$ for $[eqn]$ indicating a causal connection from subsystem $[eqn]$ to $[eqn]$ . The parameters $[eqn]$ determine the main frequency of oscillation of the $[eqn]$ subsystem and are set to different values for the subsystems. Finally $[eqn]$ is the coupling strength, which we consider to change globally (i.e., it is the same for all subsystems). We studied three cases of $[eqn]$ , $[eqn]$ , and $[eqn]$ coupled subsystems to validate our method on multi-coupled systems with known directions of influence to imitate real-life scenarios where there are large coupled systems. The values of $[eqn]$ were chosen as $[eqn]$ for $[eqn]$ (step of $[eqn]$ for $[eqn]$ (step of $[eqn]$ and $[eqn]$ for $[eqn]$ (step of $[eqn]$ For $[eqn]$ and $[eqn]$ we chose the elements of the adjacency matrix such that each node was unidirectionally coupled, forming a serial connection for $[eqn]$ and a “ring” of connections for $[eqn]$ . For $[eqn]$ we generated an adjacency matrix randomly such that only $[eqn]$ of the connections were present. The adjacency matrices for the three cases are displayed in Figures 1a–c.

The adjacency matrices for the three cases of coupled Rössler oscillators with N = 3, 5, and 19 subsystems in (a–c), respectively, and the corresponding pPMIME causality matrices (RP values) for the coupling strength where the average of all real connections was maximum in (d–f). Average RP values ± standard error (shaded area above and below the average - barely visible in some cases) as a function of coupling strength for the three networks in (g–i). RP values for real connections are plotted in solid blue lines and those for non-connections in orange dotted lines. For clarity, in panel (i), instead of the individual lines, their averages are plotted.

We studied 100 different values of coupling strength $[eqn]$ (step size of $[eqn]$ ), and for each one we generated $[eqn]$ data points from Equation 3 using the Runge-Kutta (4, 5) method (ode45 in MATLAB ver. R2021a). We chose a sampling time of 0.314 time units, and the first 1000 transient points were discarded. The data were Hilbert transformed, and the first and last 2032 time points were discarded to eliminate any distortion of the data owing to the edge effect of the transform. The remaining data were then divided into 31 segments, with each segment containing 4064 time points. pPMIME and Equation 2 was then applied to each of these segments, which resulted in $[eqn]$ ( $[eqn]$ ) causality matrices that contained the $[eqn]$ values for all pairs of $[eqn]$ . The matrices were then averaged over all 31 segments. Each element in the matrix quantifies the causal influence of a particular node on another node, with the diagonal elements corresponding to the self-influences. For example, the $[eqn]$ component of the matrix provides the causal influence of node $[eqn]$ on node $[eqn]$ . Figures 1d–f show causality matrices for the three cases of $[eqn]$ , respectively, each for a selected coupling strength for which the average of all real connections was maximum. The specific values of epsilon for the three cases ( $[eqn]$ for $[eqn]$ , $[eqn]$ , and $[eqn]$ , respectively) are small enough to not have widespread synchronization but large enough so that the obtained $[eqn]$ values are substantial and illustrate the results sufficiently well. As expected, the causality matrices correlate well with the adjacency matrices. Figures 1g, h, respectively, show the $[eqn]$ values for the individual connections for $[eqn]$ and $[eqn]$ , as a function of coupling strength. For the case of $[eqn]$ , in Figure 1i, we show the average of the $[eqn]$ values corresponding to real connections and of those that correspond to no connections, again as a function of coupling, for reasons of visual clarity. All $[eqn]$ values plotted are averages across all $[eqn]$ segments, with error bars showing the standard error of the mean. The real connections are plotted as solid blue lines, and the rest are indicated by orange dotted lines. For all three cases and for low coupling strength values (i.e., prior to the synchronization of the subsystems), the blue lines exhibit monotonically increasing trends and exceed the values of the orange lines. This indicates that the method can accurately capture the direction of causality between the subsystems. Upon synchronization of subsystems, causality cannot be uniquely inferred, and we can see that the orange lines increase and the blue lines decrease. In addition, as the number of subsystems increases, the strength of the detected connections decreases.

EEG data

2.3

The study included 176 adult inpatients (age range 18–65 years) with a primary diagnosis of MDD, single or recurrent episode, without psychotic features. The diagnosis of MDD was established according to DSM-IV criteria and confirmed in all participants using the Mini-International Neuropsychiatric Interview (M.I.N.I., Czech version 5.0.0). To ensure at least moderate depressive symptom severity at baseline, participants were required to have a total score on the Montgomery–Åsberg Depression Rating Scale (MADRS) (28, 29) of ≥ 20 and, in the majority of cohorts, a Clinical Global Impression (CGI) score of ≥ 4. The sample predominantly comprised patients with treatment-resistant depression. All participants were hospitalized due to an unsatisfactory response to previous outpatient treatment and fulfilled at least Stage I criteria for treatment-resistant depression according to Thase and Rush (i.e. nonresponse to ≥ 1 adequate antidepressant trial in the current episode). The adequacy of prior antidepressant treatment was verified using the Antidepressant Treatment History Form (ATHF); an ATHF score of ≥ 3 was required to confirm at least 4 weeks of treatment at an adequate dose. A psychotropic medication washout period (median duration 4 days) was implemented for all participants prior to initiation of the new antidepressant treatment. Stringent exclusion criteria were applied to reduce potential confounding, particularly related to concurrent substance use and additional psychopathology. Participants were excluded if they had a current or past history of alcohol or illicit drug abuse or dependence, or any additional Axis I or Axis II psychiatric comorbidity. To minimize pharmacological confounding, individuals with prior exposure to fluoxetine were excluded because of its prolonged biological half-life, and patients who had received electroconvulsive therapy (ECT) within the 3 months preceding study enrollment were not included, given the substantial impact of ECT on EEG measures. More details regarding the sample, recruitment criteria and treatment can be found in previous clinical analysis reports (30–36). Details regarding sex, age, MADRS score, illness duration and response to treatments are shown in Table 1. The patients were subjected to either pharmacological or neurostimulation treatments for four weeks. Individuals who showed a ≥ 50% reduction in the MADRS score at the end of the treatment were considered as responders to the treatment. Pharmacological treatment included the following classes of antidepressants: serotonin norepinephrine reuptake inhibitors (SNRI), selective serotonin reuptake inhibitors (SSRI), norepinephrine-dopamine reuptake inhibitors (NDRI), noradrenergic and specific serotonergic antidepressants (NaSSA), and tricyclic antidepressants (TCA). Neurostimulation treatment included low-frequency repetitive transcranial magnetic stimulation (LF-rTMS) at 1 Hz over the right dorsolateral prefrontal cortex (DLPFC) and transcranial direct current stimulation (tDCS) consisted of anodal stimulation over the left DLPFC and cathodal placement over the right supraorbital region, delivered at 2 mA. There were two EEG recording sessions for each patient, one before treatment initiation (visit 1) and a second after one week of treatment (visit 2). Data were recorded with the participants lying in a semirecumbent position with their eyes closed in a maximally alert state. Approximately 10 minutes of data were sampled at 250 Hz or 1000 Hz from 19 electrode channels (Fp1, Fp2, F3, F4, C3, C4, P3, P4, O1, O2, F7, F8, T3, T4, T5, T6, Fz, Cz, and Pz) placed according to the International 10–20 system (37).

This study was conducted in accordance with the principles outlined in the Declaration of Helsinki and adhered to applicable guidelines for Good Clinical Practice. Ethical approval was obtained from the Ethics Committee of the Prague Psychiatric Centre/National Institute of Mental Health for all study protocols contributing data to this research, with the current retrospective analysis falling under the existing institutional ethics approval for the EEG registry. While a portion of the dataset was derived from clinical trials registered in the EUDRACT database (EUDRACT Nos. 2005-000826–22 and 2015-001639-19, registered via www.clinicaltrialsregister.eu), additional data were obtained from ethically approved studies that were not formally registered in public trial registries. In all cases, adult participants were thoroughly informed about the study procedures and objectives, and they provided written informed consent prior to participation that explicitly authorized the secondary use of their clinical and neurophysiological data for future research purposes.

Preprocessing and causality estimation

2.3.1

The data have been used in previous studies and were preprocessed as described in Ref. (38). In short, the preprocessing included removing extra channels, resampling records to a common 250 Hz sampling frequency, discarding the first and last 30 seconds of each record, re-referencing to the average reference, bandpass filtering the data between 1 and 40 Hz, and removal of data segments with high-power artefacts. The overall aim is to increase the signal-to-noise ratio and make the data more interpretable. More details on the specifics of the preprocessing steps and their individual effects are provided in (38). We further filtered the data into six different frequency bands to focus on causality related to the specific brain activity in each of these bands. The bands were divided as follows: δ : (1.5Hz - 3.5Hz), θ : (4Hz - 8Hz), α : (8Hz - 12Hz), β1: (12.5Hz - 17.5Hz), β2: (18Hz -25.5Hz) and γ : (26Hz - 40Hz). We implement pPMIME on data segments of 8s duration (2000 time points), as the estimation of CMI has large data requirements. Thus, for each data segment, we estimated six 19 × 19 matrices, one for each band. For each subject, a varying number of segments were available, ranging from 14 to 92. The findings of the γ band are not used herein because of R_P_ estimation problems due to improper phase extraction at high frequencies with HT (attributed to the extensive range of the γ band).

Flow metrics

2.4

The pPMIME method generates causality matrices of dimensions ( $[eqn]$ ) for the $[eqn]$ EEG channels. Each element in the matrix quantifies the causal influence of one channel on another. As detailed in Section 2.1, each entry $[eqn]$ is a normalized conditional mutual information term bounded between $[eqn]$ and $[eqn]$ that expresses the proportion of phase increment information in channel $[eqn]$ that is uniquely explained by the past of channel $[eqn]$ , after accounting for the past of $[eqn]$ and of all other channels. Because all entries share this common, dimensionless scale, averages of $[eqn]$ over anatomically defined sets of connections can be interpreted as expected directional information flow within or between regions. The causality matrices are structured such that the initial eight nodes correspond to channels in the left hemisphere (Fp1, F3, C3, P3, O1, F7, T3, and T5), the subsequent eight nodes correspond to channels in the right hemisphere (Fp2, F4, C4, P4, O2, F8, T4, and T6), and the final three nodes correspond to channels located along the midline of the head (Fz, Cz, and Pz). We define FLOW metrics based on the average of specific parts from a standard causality matrix, as illustrated in Figure 2. This allows for the quantification of information flow between brain regions derived from the causal relationships identified in the EEG channel data. We define nine FLOW metrics: The Total Flow ( $[eqn]$ ) metric is the aggregate information flow across the $[eqn]$ channels, calculated by averaging all elements of the matrix. We define $[eqn]$ metrics for the left hemisphere, $[eqn]$ , $[eqn]$ , $[eqn]$ , and $[eqn]$ , which respectively represent the information flow from the left hemisphere to itself, to the right hemisphere, to all the regions of the brain, and the information flow from all regions of the brain to it. Similarly, we define metrics for the right hemisphere, which we denote as $[eqn]$ , $[eqn]$ , $[eqn]$ , and $[eqn]$ . The averages are calculated for the regions (excluding the diagonal elements corresponding to the self influences) indicated by the red boxes labeled LL, LR, out L, in L, RR, RL, out R, and in R in Figure 2. Additionally, we study the inflow and outflow for each channel separately, which are defined as $[eqn]$ , $[eqn]$ . So in total, we have $[eqn]$ metrics. The first $[eqn]$ are the FLOW metrics, which are global and quantify the information flow of the brain as a whole or per brain hemisphere. The other $[eqn]$ are the INFLOW and OUTFLOW per channel, which are local metrics and characterize small brain regions that correspond to the positions of the electrodes.

Causality matrix of a subject, along with the regions that are used to define FLOW metrics for hemisphere-to-hemisphere interaction (a), per-hemisphere inflow (b) and per-hemisphere outflow (c).

Statistical analysis

2.5

After obtaining the causality matrices for each subject over all time segments, visits, and frequency bands, we compute the metrics defined in Section 2.4 for every time segment of each participant. Subsequently, we calculate the mean of the metrics across all the time segments. Thus, for each band and each of the 47 metrics, we have two sets of 176 values (one set for each visit). Since there are multiple factors (response, treatment, and visit) and one of them is a repeated factor (the EEG recording is performed twice on the same patient), we analyzed the data with repeated measures analysis of variance (ANOVA) (39). The repeated measures ANOVA design includes two independent between-subject factors (response and treatment) and one repeated (dependent) within-subject factor (the visit), each with two levels. The response factor has two levels: respondents and nonrespondents, while the levels for the treatment factor are pharmacological and neurostimulation. For the visit, the two levels are visit 1 and visit 2. The setup of the repeated measures ANOVA model includes the individual factors along with all their interactions (3 pairwise and 1 with all three). The analysis for each EEG frequency band is performed separately. Because of the multiple factors and the “rigidity” of the ANOVA design, we do not actually derive statistical conclusions from these p-values; rather, we utilize these results as preliminary information to guide subsequent analysis. Informed by the outcomes derived from the repeated measures ANOVA study, further post-hoc analysis for the comparison of individual patient subgroups is performed with Student’s t-tests (independent or paired), and the effect size of any change/difference is estimated by Cohen’s $[eqn]$ (40). Non-parametric tests (41) gave similar results to the t-tests, with p-values generally slightly higher with certain differences in statistical significance for some metrics, but no substantial change in the overall conclusions. The ultimate goal of this analysis is to reveal which, if any, of these metrics has the potential to serve as biomarkers.

Our study is exploratory and hypothesis-generating in nature. Given that pPMIME captures non-linear, directional dependencies distinct from the linear correlations found in most prior literature, we opted against restricting our analysis to pre-defined regions of interest. There are no specific hypotheses tested where there is an expected difference based on some prior knowledge. We are conducting a “blind” investigation on which metrics show a difference, and to do this, we employ a comprehensive set of metrics on all possible EEG bands, expecting that most of them will probably not provide statistically significant results. As a matter of fact, we expect our significant results to be constrained to one or, at most, two frequency bands. Furthermore, certain metrics are, by construction, correlated. Because of the high correlation and large number of metrics, global corrections for multiple comparisons are very likely to lead to most p-values being “over-corrected” and rendered insignificant. Since the study is exploratory, over-correcting the p-values is antithetical to the main objective, and in accordance to recent recommendations (42–44), herein we treat our analysis as three separate experiments based on the three metric types (i.e., FLOW, INFLOW, OUTFLOW). Each experiment has its corresponding experiment-wise hypothesis $[eqn]$ : There is no change in the values of any metric of the specific type, along with a “family” of statistical hypothesis tests for the individual metrics of each type. Given that the hypotheses tests inside each family are interchangeable for the corresponding experiment-wise hypothesis (but not interchangeable for the other two experiments) we employ the Benjamini–Yekutieli FDR controlling procedure (45) to obtain adjusted p-values per metric family and EEG band. All p-values in the following section are FDR adjusted and assessed at the α = 0.05 significance level. The original, unadjusted p-values from the t-tests and the p-values from the non-parametric permutation tests are provided in the Supplementary Materials, Tables 4–7. Controlling for the effects of age and sex using linear models instead of Student’s t-tests did not substantially change our statistical results, neither quantitatively nor qualitatively.

Results

3

Results from repeated measures ANOVA

3.1

The results of the repeated measures ANOVA are graphically presented in Figure 3 and summarized in Table 2. The figure displays a colormap highlighting the p-values for the factors and interactions with p< 0.05 for each metric. The colormap is organized vertically in seven blocks separated by red horizontal lines. Each block corresponds to a factor or interaction of the factors, and within the block are the five frequency bands. The horizontal axis displays the metrics. The three families of metrics are separated by black vertical lines. The p-values ≥ 0.05 are all set to yellow, while the colder colors correspond to lower p-values. The text around the colormap denotes factors, bands, and metrics. For example, for the TF metric, the treatment factor is significant in the $[eqn]$ band ( $[eqn]$ ) and is presented in the first column, the first value of the second block. For the same metric, the only other p-values that are below $[eqn]$ are the response-treatment and treatment-visit interactions in the $[eqn]$ ( $[eqn]$ , $[eqn]$ , respectively). Table 2 summarizes the repeated measures ANOVA results by presenting the number of metrics/bands that provide a significant p-value for each factor/interaction.

The colormap displays p-values from the repeated measures ANOVA. All p-values above 0.05 are highlighted in yellow. R, Response; T, Treatment; V, Visit. Each row represents the results of a particular band. There are seven blocks corresponding to each factor, separated by a horizontal red line. The metrics corresponding to FLOW, INFLOW and OUTFLOW per channel are separated by a black vertical line.

In Figure 3, we observe that the majority of p-values that are less than 0.05 are mostly for the $[eqn]$ and the $[eqn]$ bands, and that INFLOW metrics show more cases of statistical significance than the OUTFLOW metrics. The summarized repeated measures ANOVA results from Table 2 show that the three-way interaction factor is not significant for the majority of the metrics and nearly all bands. Only the INFLOW exhibits significant p-values for this interaction for just five metrics spread across all bands. The pairwise interactions between the treatment factor with response and the treatment factor with visit yield the highest number of significant p-values (16 and 26, respectively). On its own, the treatment factor yields 36 statistically significant metric/band combinations. These results indicate that the treatment has the most significant effect on the values of the metrics. Thus, the results suggest analyzing the data from the two treatments independently. The single factors of response and visit give significant values for 12 and 17 cases, but their interaction gives 7, and most of them have relatively high p-values (5 out of the 7 are around 0.04 and the 2 around 0.015). This indicates that even though there is a change in the metric values between visits and there exists a difference between those that responded to treatment and those that did not, these two factors are relatively independent. That is, changes in metric values between visits 1 and 2 are not much related to treatment outcome (response). Combining this with the result for the treatment and visit interaction, we can deduce that treatment may be a strong confounding factor when studying the relationship between response to treatment and metric values. We note that since visit 2 was one week after visit 1, but the respondent/nonrespondent assessment occurred at week 4, it is possible that the week 2 metrics (i.e., brain connectivity) may not have changed enough to “capture” the treatment outcome.

Based on the results of the repeated measures ANOVA, we chose to examine the subjects who received pharmacological treatment and neurostimulation independently. Additionally, we perform two distinct analyses. First, we compare the metrics of the two visits. Here we are not concerned with the treatment outcome but with what effect, if any, the two treatments have on the metrics and consequently on the causal relation in the brain. The second analysis compares the metrics between respondents and nonrespondents to treatment. In this case we compare the metrics of visit 1 for the two response groups. We exclude the visit 2 data for two reasons. First, according to the repeated measures ANOVA results, there is no significant interaction between the response and visit interactions. Second, we cannot be sure about the therapeutic state of the patients at visit 2. It is possible that for some subjects the treatment has produced a change in brain connectivity related to therapy response, while in others it may not have. As a matter of fact, analysis of the visit 2 data alone produced results similar to that of visit 1, but a bit worse, further strengthening the idea that the week 2 brain connectivity changes may not be indicative of the outcome for all subjects.

In the rest of the manuscript we only present results for the FLOW and INFLOW metrics. The ANOVA results for the OUTFLOW metrics show that the statistically significant results for the cases of interest are either much fewer than the INFLOW metrics (2 versus 16 for the treatment/visit interaction) or spread across bands (e.g., compare Figure 3 results for INFLOW and OUTFLOW for the treatment factor). For OUTFLOW, a minimum of one and a maximum of four channels were sporadically significant in a particular band, whereas all other channels had high p-values. All numerical values of the repeated measures ANOVA p-values are presented in the Supplementary Materials (Tables 1–3) for the sake of completeness.

Effect of treatment on brain causality

3.1.1

This section presents the results of the change in metrics between visits 1 and 2 for the two therapy groups independently. We perform Student’s t-test (paired) to compare the metric values from two visits. The p-values for all FLOW metrics and all five bands are listed in Table 3. The $[eqn]$ and the δ bands are primarily significant for the FLOW metrics in the pharmacological and neurostimulation groups, respectively. In the first group, the p-values in the $[eqn]$ band indicate a substantial difference between visits across all FLOW metrics. In the second treatment group TF, LL, LR, out_L, in_L, and in_R are significantly different in the δ band. Figure 4 illustrates the variation in all the FLOW metrics between the two visits for the pharmacological group in the $[eqn]$ band, and Figure 5 presents the same for the neurostimulation group in the $[eqn]$ . No metric achieved statistical significance after FDR correction in the other three frequency bands; therefore, we omit the presentation of these results graphically (figures corresponding to all metrics and all bands are presented in the Supplementary Materials, Figures 1–5). In both figures, the y-axis of the panels displays the metric values averaged over all subjects along with the standard errors of the mean.

Table 3: P-values for the comparison of the FLOW metrics between visits 1 and 2 for the two treatment groups. p-values< 0.05 are in bold font.

Change in metric values between visit 1 (v1) and visit 2 (v2) in the β2 band for the pharmacological treatment. The y-axis of all panels (a–i) displays the metric values averaged over all subjects along with the standard errors.

Descriptions of all panels (a–i) are similar to Figure 4, but for the neurostimulation treatments and the δ band. The dashed lines in panels (e–g) indicate that there was no statistically significant difference in these cases.

For the pharmacological group, in the $[eqn]$ band, all the FLOW metrics (Figures 4a–i) show a statistically significant increase from visit 1 to visit 2 with p-values ranging between < 0.001 and 0.049. The smallest p-values are observed for metrics TF, in_L, and in_R, all less than 0.001. The effect size for these three metrics based on Cohen’s d is moderate, or close to moderate (d = 0.3885, 0.3166, 0.4138, respectively). For the remaining metrics, the effect size is small (d in the range of 0.1900 − 0.2429).

In the case of the neurostimulation group and the $[eqn]$ band, most metrics, but not all, show significant changes. The FLOW metrics $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , and $[eqn]$ exhibit a statistically significant increase between the two visits with very low p-values, at a maximum of $[eqn]$ (Figures 5a–d, h, i). We note that the smallest p-values ( $[eqn]$ ) correspond to flows originating from the left hemisphere ( $[eqn]$ and $[eqn]$ ), for which the effect size is again moderate ( $[eqn]$ , respectively) but larger than the effect sizes observed in the pharmacological group. Metrics $[eqn]$ , $[eqn]$ , $[eqn]$ , and $[eqn]$ also exhibit a moderate effect size change ( $[eqn]$ in the range of $[eqn]$ ). Contrary to what we observe for the left hemisphere, the metrics corresponding to outflow from the right hemisphere ( $[eqn]$ , $[eqn]$ and $[eqn]$ ) do not exhibit any statistically significant changes (Figures 5e–g), with very small effect size ( $[eqn]$ in the range of $[eqn]$ ).

Figures 6 and 7 illustrate the results of the change in inflow per channel between the two visits. The results are presented graphically in brain topographical maps and are given numerically in the Supplementary Materials, Table 5. In both figures, the first row of panels shows the p-values for the 19 channels in all five bands. The channels that show statistically significant changes in inflow from visit 1 to visit 2 are highlighted in green text. The results for the pharmacological group are presented in Figure 6. The second and third rows show topographical maps of the actual values of information flow per channel averaged over all subjects in the five frequency bands for visits 1 and 2, respectively. The fourth row shows the difference in the inflow as visit 2 minus visit 1. The $[eqn]$ and the $[eqn]$ bands are the only ones in which the channels had a statistically significant change in inflow. The overall inflow per channel increases from visit 1 to visit 2 in both bands. The channels that are significant for both $[eqn]$ and $[eqn]$ bands correspond to the right frontal (F4), central (Cz, Pz), left parietal (P3), posterior temporal (T5, T6), and occipital (O1, O2) regions. Fp2 and P4 are significant only in the $[eqn]$ band but still have p-values quite low for the $[eqn]$ band ( $[eqn]$ both, marginally above $[eqn]$ ). Comparing the topographic maps of the two visits for both the bands, it can be observed that the $[eqn]$ band shows a larger increase in the inflow compared to the $[eqn]$ band. The effect size of the inflow metrics for the significant channels is around moderate ( $[eqn]$ in the range of $[eqn]$ ) for the $[eqn]$ band and small for the $[eqn]$ band ( $[eqn]$ in the range of $[eqn]$ ). With respect to localization and the change in inflow, for the $[eqn]$ we observe that the statistically significant changes are quite widespread, excluding mainly left prefrontal and mid-temporal regions of the brain. The largest changes (smallest p-values) are observed for the midline frontal (Fz, $[eqn]$ ), parietal (P3, P4, $[eqn]$ ), occipital (O1, O2, $[eqn]$ ), and right posterior temporal (T6, $[eqn]$ ) regions. Comparing the left and right hemisphere p-values for the channels with statistically significant change (F3, C3, P3, O1, F7, T5 vs F4, C4, P4, O2, F8, T6), we see that the average p-value in the left is 0.012, while in the right it is 0.002 (almost an order of magnitude difference), indicating that there is more significant change (increase) of inflow in the right hemisphere. Average effect sizes of the significant channels in the left and right hemisphere are 0.3076 and 0.374, respectively.

Brain topographical plots for the comparison of inflow per channel between the two visits for the pharmacological group. The first row of panels is the p-values for the comparison. The 2nd and 3rd rows present the average values (across subjects) of the inflow for visit 1 (v1) and visit 2 (v2), while the 4th row presents the difference between the visits. Columns correspond to frequency bands, as indicated by the text above the top panels. Channels with p-values< 0.05 are indicated in green-colored text in the first row.

Similar to Figure 6, but for the neurostimulation group. Rows of the average inflow per channel for the two visits (2nd and 3rd rows in Figure 6) are omitted (given in Supplementary Materials, Figure 7).

Figure 7 presents the results for the neurostimulation group. In this case, the p-values are displayed in the first row of the panels and the difference in the average INLFOW per channel in the second row of panels. The δ band is the only band where a significant change in inflow is observed, with p-values of significant channels ranging between 0.013 and 0.040 (Cohen’s d in the range 0.2905 − 0.5348). The positive values of the difference in inflow for the δ band show that there is a significant increase from visit 1 to visit 2. For the δ band, the changes are relatively widespread, including the frontal (Fp1, F3, F4), central (Fz, Cz, Pz), parietal (P3, P4), occipital(O2), and temporal (T3, T5, T6) regions, but the most significant changes, i.e., the lowest p-values, are observed in the central and posterior temporal regions (Cz, T5, and T6 with p = 0.013 for all three and d = 0.5348, 0.5048, and 0.5007, respectively). Similar to pharmacological treatment, we observed that the average increase in inflow in the significant channels is greater in the right hemisphere (average d = 0.4203) than the left (average d = 0.3769).

Baseline brain causality difference between response groups

3.1.2

This section presents the results of the metric analysis with respect to the response to treatment, i.e., the comparison of respondents with nonrespondents. We perform Student’s t-test (independent) to compare the metric values of visit 1 for the two response groups. The p-values for all FLOW metrics and for all five bands are listed in Table 4. Any significant difference in the metrics between the response groups will inform us about a difference in brain connectivity patterns of individuals who are going to potentially respond or not to the treatment. From Table 4, we observe that only the $[eqn]$ band in the pharmacological group shows significant differences for the FLOW metrics $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , $[eqn]$ , and $[eqn]$ with p-values $[eqn]$ for all six metrics (the equality in the p-values is due to the FDR adjustment). For the neurostimulation group, there is no metric that achieves statistical significance. Prior to p-value adjustment there were three metrics in the $[eqn]$ band that showed significant change ( $[eqn]$ , $[eqn]$ , and $[eqn]$ with $[eqn]$ , $[eqn]$ , and $[eqn]$ , respectively), but they did not survive the FDR procedure. The difference of the significant metric values for the pharmacological group in the $[eqn]$ band is depicted in Figure 8. The y-axis of the figure displays the average over all subjects of the metrics for each response group along with their standard errors. The bar plots indicate that the nonrespondents have higher values for all these metrics than the respondents. The effect size for all these metrics is again at or near the moderate level ( $[eqn]$ in the range of $[eqn]$ ). While statistically significant, these effect sizes indicate a degree of distributional overlap between responders and nonresponders, suggesting that these metrics should be interpreted as group-level mechanistic differences rather than standalone individual diagnostic tools. The bar plots corresponding to all metrics and bands are presented in the Supplementary Materials (Figures 8–12).

Bar plots of the significantly different FLOW metrics between respondents (R) and nonrespondents (NR) to pharmacological treatment at the time of visit 1 in the α band.

Figure 9 depicts the results of the comparison of inflow per channel between respondents and nonrespondents at visit 1 in all five frequency bands. The topographic plots depict the p-values for the comparisons, along with the difference in inflow averaged over all subjects. The first two rows correspond to the pharmacological group, and the last two correspond to the neurostimulation group. The $[eqn]$ band is the most significant band for the pharmacological group. The most significant differences between the two response groups can be localized to the frontal (Fp1, F4), left anterior temporal (F7), right parietal (P4), right occipital (O2), and central (Pz) regions (all six having p-values $[eqn]$ and Cohen’s $[eqn]$ to $[eqn]$ , indicating moderate effect size). The channels Fp2, Fz, F8, C3, C4, and P3 had p-values $[eqn]$ , which are marginally above the significance threshold. With respect to the nature of the difference, for all the significant channels, the nonrespondents have a higher inflow per channel than the respondents. The $[eqn]$ band also had two channels (Fp1 and T5) with $[eqn]$ before FDR adjustment, but they lost significance after adjustment, and the other three bands had no significant channels.

Brain topographical plots for the comparison of inflow per channel between respondents (R) and nonrespondents (NR). The first row of panels is the p-values for the pharmacological (Ph) group, and the second row is the difference in inflow between R and NR averaged over all subjects. The 3rd and 4th rows are the same as the first two but for the neurostimulation (St) group. Columns correspond to frequency bands, as indicated by the text above the top panels. Channels with p-values< 0.05 are indicated in green-colored text in the 1st and 3rd rows.

For the comparison of the response groups in the case of neurostimulation treatment, the results are not as positive. No bands show any statistically significant difference between the response groups for any channel. Prior to the FDR procedure, channels Fp1, T3, T5, C4, and O2 in the $[eqn]$ band had $[eqn]$ , but after adjustment, all of them lost significance with the adjusted p-values of C4 to be $[eqn]$ and the rest higher. The $[eqn]$ band also had three channels (F7, T3, and F4), with p-values $[eqn]$ before adjustment, which lost significance afterwards. In both bands, contrary to the pharmacological group, the respondents have a (non-significantly) higher inflow per channel compared to the nonrespondents.

Discussion

4

In this study, we conducted an information theory-based causality analysis on EEG data of patients diagnosed with MDD who received pharmacological or neurostimulation treatments. We introduced a slightly adjusted version of a previously developed method (pPMIME) and first tested it on artificial data, where it performed very well in capturing causal relationships for weak coupling. We then used pPMIME to estimate the pairwise strength of the causal relationships between brain locations. To study information flow in the brain, we defined sets of global (FLOW) and local (INFLOW and OUTFLOW per channel) causality metrics. We used repeated measures ANOVA to determine which factors (visit, treatment, response, and all their possible interactions) were the most important. The ANOVA results indicated overall differences between the two treatment groups and that there was no interaction between the response and visit factors. Therefore, we decided to analyze separately the data from the two treatments. We focused on two independent studies: 1) analyzing the impact of each treatment modality on brain causality between visits and 2) comparing baseline/visit 1 metrics for the two response groups.

The FLOW metrics performed well in differentiating the two visits as well as the two response groups, the latter only for the pharmacological treatment. With respect to the local metrics, the OUTFLOW per channel performed much worse than INFLOW. OUTFLOW provided fewer in number and magnitude significant p-values that did not provide any strong localization information. On the other hand, INFLOW provided insight about brain region-specific information flow and indicated brain regions whose “causal behavior” is altered by treatment and may potentially serve as indicators of response to treatment.

A significant consideration in our analysis was the heterogeneity of treatment types within the pharmacological and neurostimulation groups. The pharmacological group comprised five subgroups (SNRI, SSRI, NDRI, TCA, and NaSSA), while neurostimulation included rTMS and tDCS. We performed a sensitivity analysis to test for interaction effects between treatment subgroups and the outcome metrics. Repeated measures ANOVA on the two groups separately followed by FDR adjustment over each family and subsequent post-hoc rank based comparison tests revealed that there was no statistically significant difference between the subgroups with respect to causality. While we acknowledge that the absence of statistical significance does not constitute definitive proof of physiological equivalence—given the potential for Type II errors in smaller subgroups—the current analysis did not detect a substantial effect of the heterogeneity. This lack of detectable divergence between the modalities statistically justifies their pooling into a single cohort, consistent with the view that these interventions share convergent downstream effects on neural circuitry. Theoretically, all the different subgroups in pharmacological treatment are known to increase synaptic neurotransmitter availability, such as serotonin, norepinephrine, and dopamine by different mechanisms to prevent the reuptake of the corresponding neurotransmitter into the presynaptic neuron (46). All three neurotransmitters work together to address neurochemical imbalances or neural circuit dysfunction to improve mood, reduce anxiety, and restore emotional equilibrium, thus converging on common downstream circuit-level effects. Similarly, both rTMS and tDCS aim to restore balance in prefrontal–limbic circuitry, albeit through different neurophysiological mechanisms—modulating neurotransmitter release, metabolism, or gene expression but achieving convergent network-level outcomes (47, 48). rTMS (1 Hz right DLPFC): provides inhibitory modulation of overactive right prefrontal regions, indirectly facilitating left prefrontal activity and rebalancing hemispheric asymmetry associated with depression. tDCS (anodal left DLPFC): enhances excitability and connectivity of hypoactive left prefrontal cortex, also aiming to restore network balance. Thus, the first part of this study focused on the changes of information flow metrics due to the overall effect of the pharmacological versus neurostimulation treatment irrespective of subgroups variations.

After one week of pharmacological treatment, we observed a relatively global increase in information flow in $[eqn]$ band without strong hemispheric bias. This finding complements prior findings of elevated $[eqn]$ power and phase synchronization post-treatment (49, 50), which enhance signal-to-noise ratio and aligns windows of excitability across distant regions. Together, these changes create stronger and more precisely timed carrier signals that could indicate the observed increase in information flow—quantified by causality analyses. One plausible neurobiological mechanism likely involves restoration of neurotransmitter dynamics in circuits affected by depression. Certain brain regions may exhibit insufficient neurotransmitter activation in MDD, leading to impaired synaptic communication. Pharmacological treatments, such as SSRIs, limit the action of corresponding serotonin reuptake transporters, thereby prolonging the presence of neurotransmitters like serotonin at the synapse. This boosts neuronal signaling and facilitates symptom relief. The resulting enhancement in circuit-level communication may manifest as increased information flow, as observed from visit 1 to 2 in the $[eqn]$ band for the metrics corresponding to all brain regions. Regionally, the largest inflow increases occurred in the $[eqn]$ band, with maximal effects at the midline frontal, parietal, and occipital sites. Since MDD is known to disrupt large-scale cortical and subcortical networks spanning multiple lobes of the brain (51, 52), these results suggest treatment exerts its strongest influence on these interconnected systems. Notably, at visit 2, inflow to local sites in the $[eqn]$ band increased more in right hemisphere regions compared to the left. This pattern supports the view that the right hemisphere is affected more in MDD (53) and suggests that treatment may enhance left hemisphere’s capacity to send regulatory signals to the right.

After one week of neurostimulation treatment, the $[eqn]$ band distinguished the two visits by showing a statistically significant increase in global information flow from the left hemisphere to the rest of the brain. This pattern aligns with prior findings that focused on frontal channels and reported stimulation-induced changes in $[eqn]$ band connectivity (54). Regionally, the effects were widespread, with the largest differences observed in the central and posterior temporal areas, alongside significant alterations in the frontal sites—including the left frontopolar, bilateral dorsolateral prefrontal, and midline frontal sites. Physiologically, rTMS and tDCS modulate synaptic plasticity and neural excitability, thereby reshaping functional connectivity by strengthening beneficial neural pathways and weakening maladaptive ones (47, 48). Both approaches excite the hypoactive left prefrontal cortex, which is reflected in the marked increase in left-to-whole-brain information flow in the δ band from visit 1 to 2. Notably, FLOW metrics for both hemispheric outflow showed opposite trends from visit 1 to 2 (Supplementary Materials, Figures 1–5) across most frequency bands, except for $[eqn]$ , where right-hemisphere increases were present but substantially smaller in effect size compared to the left. This asymmetry aligns with the therapeutic mechanisms of stimulation, which act by selectively enhancing or suppressing neural connections to restore balanced communication patterns, reflected as an increase and decrease in information flow. Unlike the pharmacological group, where nearly all metrics across bands trended upward, neurostimulation produced a more differentiated profile of increases and decreases across regions and frequencies.

The frequency-specific nature of our findings ( $[eqn]$ for pharmacological, $[eqn]$ for neurostimulation) suggests that different treatments engage distinct neural oscillatory mechanisms. $[eqn]$ oscillations are associated with top-down cortical control and GABAergic inhibition (55, 56), while delta oscillations reflect large-scale cortical-subcortical interactions and homeostatic sleep processes (57, 58). This frequency dissociation may explain why some patients respond better to one treatment modality over another, and supports the development of frequency-targeted therapeutic interventions (59, 60).

The baseline analysis at visit 1 revealed that in the α band, patients unlikely to respond to traditional antidepressants exhibited greater total information flow—especially right-to-left outflow and bilateral inflow—than potential responders. At the local level, the significant difference were widespread, the frontal, right parietal, left anterior temporal and right occipital regions showed the largest difference, where the nonresponders showed higher inflow. Our agnostic approach revealed patterns that align with, yet extend, established theories. For example, these findings are consistent with prior reports of right hemisphere hyperactivity (53) and elevated α band activity in depression, often spanning large portions of the cortex (2, 61). This confirms that our data-driven causality analysis captures known pathological asymmetries while providing novel detail regarding the directionality of these disparate flows. Physiologically, the elevated information flow corresponding to the right hemisphere before treatment might indicate that the brain is attempting to compensate by increasing overall connectivity and information transfer. However, this hyperconnectivity appears to be maladaptive rather than beneficial which causes the brain to work harder but with reduced efficiency in supporting functions such as processing negative emotions, engaging in repetitive thought patterns, and managing automatic bodily functions that the right hemisphere struggles with.

Across individual subjects, all six baseline significant metrics correlated negatively with proportional symptom changes (i.e. percentage change in depression severity score from baseline : $[eqn]$ with Pearson correlation $[eqn]$ whose $[eqn]$ percentile bootstrap confidence intervals (CIs) (bootstrap resamples: $[eqn]$ ) excluded zero, two-sided permutation p-values were small ( $[eqn]$ , $[eqn]$ permutations), and FDR adjusted p-values across metrics ranged between $[eqn]$ , while left-to-right flow metrics showed near-zero effects and non-significant p values. The narrow, bootstrap-stable CIs and low permutation p-values (Supplementary Materials, Table 8) suggest that these associations are robust to sampling variability and therefore plausibly reproducible in cohorts with similar characteristics, although independent validation is still required. These results are consistent with group-level findings, where nonresponders exhibited higher baseline connectivity than responders. Together these results suggest that elevated $[eqn]$ band flow particularly right dominant global measures may indicate a less plastic, maladaptive network configuration that limits pharmacological responsiveness. However, the observed correlations are small-to-moderate in magnitude likely too small to serve as standalone clinical decision tools; therefore as a future work we plan to build a prospective predictive modelling that combines these metrics with other clinical and neurophysiological features, evaluated with proper cross−validation and external validation is required to assess clinical utility and generalizability.

Baseline comparisons between responders $[eqn]$ and nonresponders $[eqn]$ to neurostimulation treatment lacked strong statistical significance due to smaller sample size. However, several consistent patterns emerged: responders exhibited higher $[eqn]$ band total flow and bilateral inflow, higher local inflow across channels, and moderate-to-large effect sizes (Cohen’s $[eqn]$ , $[eqn]$ , and $[eqn]$ , respectively) for the three global metrics and locally ( $[eqn]$ in the range of 0.643 - 0.910 for Fp1, T3, T5, C4, O2). These effect sizes exceed those in the pharmacological group and mirror prior reports of elevated baseline $[eqn]$ band activity in stimulation responders (54), suggesting these differences would likely reach significance in larger samples.

The findings of our study complement the existing results in the literature, which have established that several brain regions are affected in MDD, which is reflected in different frequency bands (2). Our frequency-specific findings align with recent MEG evidence showing that beta-band connectivity within inhibitory control networks is particularly relevant for treatment response prediction in MDD (62). The study reported that non-responders exhibited decreased β band connectivity in a left-lateralized frontoparietal network centered on superior parietal gyrus and orbitofrontal cortex—regions we also identified as showing altered causality patterns. This convergence across different methodological approaches (MEG coherence vs. EEG phase-based causality) strengthens the evidence that frequency-specific network disruptions are fundamental to treatment resistance, arguing for multiband integration in future analyses.

The larger effect sizes for right hemisphere inflow metrics between visits for both treatments suggest that therapies restore hyperactive right-sided circuits by channeling healthier signals from the rest of the brain. The hemispheric specificity of our findings gains additional support from recent findings (62), that while all MDD patients showed right-lateralized network disruptions (potentially reflecting general disease pathology), only non-responders showed additional left-lateralized frontoparietal hypoconnectivity. This pattern of bilateral dysfunction in non-responders versus primarily right-sided alterations in responders suggests that preserved left hemisphere compensatory mechanisms may be crucial for treatment response. The convergent evidence across studies suggests a hierarchical model of network dysfunction in treatment resistance. Primary right-hemisphere disruptions (observed in all MDD patients) may represent core pathophysiology, while secondary left-hemisphere decompensation (seen specifically in non-responders) indicates a failure of compensatory mechanisms (63, 64). While we did not find statistically significant differences in left hemisphere outflow between responders and non-responders, our finding that non-responders exhibited significantly greater total flow, right-to-left flow, and bilateral inflow is consistent with this pattern of dysregulated interhemispheric communication. This bilateral breakdown of top-down control (65, 66) may explain why standard antidepressants, which primarily target monoaminergic systems, fail to restore network function in these patients (67, 68).

These findings have important clinical implications for personalized medicine in MDD. The distinct α band baseline signatures in pharmacological nonresponders suggest that EEG-based causality metrics could be incorporated into clinical decision-making algorithms. While single-metric prediction is limited by moderate effect sizes, patients showing high baseline α band information flow, particularly from the right hemisphere, display a physiological profile that suggests they might be better candidates for neurostimulation or combined treatment approaches rather than pharmacotherapy alone. This could reduce the trial-and-error period often associated with antidepressant selection, potentially improving patient outcomes and reducing healthcare costs. An important consideration is the temporal dynamics of treatment response. While clinical response was assessed at 4–6 weeks, our brain connectivity changes were measured at one week. This early timepoint may capture initial neuroplastic changes that precede clinical improvement. The observation that visit 2 metrics did not improve prediction over visit 1 metrics suggests that baseline characteristics may be more informative than early treatment-induced changes for predicting ultimate treatment response. Future studies should include multiple intermediate timepoints to map the trajectory of causality changes throughout treatment.

Study limitations include the absence of explicit statistical tests for hemispheric asymmetry metrics and limited statistical power for baseline responder/nonresponder comparisons in the neurostimulation cohort due to small sample size. Future studies should use larger cohorts, include intermediate timepoints to map causality trajectories, and develop prospective predictive models that combine these metrics with clinical and other neurophysiological features.

This study demonstrates that pharmacological and neurostimulation treatments affect brain information flow differently, with frequency-specific signatures ( $[eqn]$ for pharmacological, $[eqn]$ for neurostimulation). We observed significant imbalances in brain connection patterns between responders and nonresponders, with distinct profiles for each treatment modality. The α band baseline differences in pharmacological treatment indicate pretreatment EEG profiles carry predictive value for treatment response. These information flow metrics can serve as valuable mechanistic indicators augmenting personalized therapeutic approaches in MDD, though further validation in larger, independent cohorts is essential for clinical translation.

Bibliography68

The reference list from the paper itself. Each links out to its DOI / PubMed record.

1Vos T Lim SS Abbafati C Abbas KM Abbasi M Abbasifard M . Global burden of 369 diseases and injuries in 204 countries and territories, 1990-2019: a systematic analysis for the global burden of disease study 2019. Lancet. (2020) 396:1204–22. doi: 10.1016/S 0140-6736(20)30925-9, PMID: 33069326 PMC 7567026 · doi ↗ · pubmed ↗
2Mumtaz W Malik AS Yasin MAM Xia L . Review on EEG and ERP predictive biomarkers for major depressive disorder. Biomed Signal Process Control. (2015) 22:85–98. doi: 10.1016/j.bspc.2015.07.003 · doi ↗
3Kennis M Gerritsen L van Dalen M Williams A Cuijpers P Bockting C . Prospective biomarkers of major depressive disorder: a systematic review and meta-analysis. Mol Psychiatry. (2020) 25:321–38. doi: 10.1038/s 41380-019-0585-z, PMID: 31745238 PMC 6974432 · doi ↗ · pubmed ↗
4Friston KJ . Functional and effective connectivity in neuroimaging: A synthesis. Hum Brain Mapp. (1994) 2:56–78. doi: 10.1002/hbm.460020107 · doi ↗
5Simmatis L Russo EE Geraci J Harmsen IE Samuel N . Technical and clinical considerations for electroencephalography-based biomarkers for major depressive disorder. NPJ Ment Health Res. (2023) 2:18. doi: 10.1038/s 44184-023-00038-7, PMID: 38609518 PMC 10955915 · doi ↗ · pubmed ↗
6PalušM Kathpalia A BrunovskýM . (2023). EEG connectivity in treatment of major depressive disorder: Tackling the conductivity effects, in: 2023 14th International Conference on Measurement. Bratislava: Institute of Measurement Science, Slovak Academy of Sciences. pp. 76–9. doi: 10.23919/MEASUREMENT 59122.2023.10164615 · doi ↗
7Khan DM Masroor K Jailani MFM Yahya N Yusoff MZ Khan SM . Development of wavelet coherence EEG as a biomarker for diagnosis of major depressive disorder. IEEE Sensors J. (2022) 22:4315–25. doi: 10.1109/JSEN.2022.3143176 · doi ↗
8Ho SI Lin IM Hsieh JC Yen CF . EEG coherences of the default mode network among patients comorbid with major depressive disorder and anxiety symptoms. J Affect Disord. (2024) 361:728–38. doi: 10.1016/j.jad.2024.06.041, PMID: 38889861 · doi ↗ · pubmed ↗