Symptom Network Dynamics during Antipsychotic Treatment in First-Episode Psychosis
Melissa G Zandstra, Floortje E Scheepers, Gabriela Lunansky, Silvana Galderisi, Birte Y Glenthøj, Inge Winter-van Rossum, Metten Somers, Edwin van Dellen

TL;DR
The study found that changes in symptom relationships over time, rather than initial symptoms, better predict treatment outcomes in psychosis patients.
Contribution
The study introduces temporal symptom network analysis to differentiate treatment response in psychosis.
Findings
Baseline symptom networks did not differ between remitters and non-remitters.
Temporal networks showed minimal overlap and no correlation in connection strengths between remitters and non-remitters.
Key symptoms differed between groups, but no specific medication effects were found.
Abstract
Background and Hypothesis: Treatment response in first-episode psychosis varies substantially, yet underlying factors remain poorly understood. Symptom network theory suggests that inter-symptom relationships may influence treatment response. We hypothesized that symptom networks at baseline, as well as dynamic interactions over time, would differ between remitters and non-remitters, and that specific antipsychotics would show differential symptom-targeting effects. Study Design: We compared baseline and temporal symptom networks between remitters (n = 250) and non-remitters (n = 196) from the OPTiMiSE trial using 21-item Positive and Negative Syndrome Scale (PANSS) data. Baseline networks were estimated using Gaussian graphical models and compared with the Network Comparison Test. Temporal networks across baseline, week 2, and week 4 were modeled using Cross-Lagged Panel Network…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4|
|
| |||||
|---|---|---|---|---|---|---|
|
|
|
|
|
|
| |
| Age, M (SD) | 25.5 (6.0) | 26.3 (6.3) | 24.4 (5.3) | 25.0 (5.5) | 25.1 (5.5) | 24.8 (5.6) |
| Sex, male, | 312 (70.0%) | 174 (69.6%) | 138 (70.4%) | 66 (77.6%) | 35 (81.4%) | 31 (73.8%) |
| Race, | 386 (86.5%) | 215 (86.0%) | 171 (87.2%) | 79 (92.9%) | 40 (93.0%) | 39 (92.9%) |
| Disease type, | 229 (51.3%) | 117 (46.8%) | 112 (57.2%) | 58 (68.2%) | 31 (72.1%) | 27 (64.3%) |
| Duration of current episode, months, M (SD) | 6.3 (6.2), | 6.0 (6.0), | 6.6 (6.6), | 12.0 (2.7), | 12.7 (2.8), | 11.3 (2.4), |
| Education, years, M (SD) | 12.3 (3.0), | 12.5 (3.1), | 12.0 (2.7), | 8.4 (7.4), | 9.8 (7.9), | 7.0 (6.7), |
| PANSS total score | 54.7 (14.8) | 51.5 (14.6) | 58.9 (14.1) | 52.4 (13.5) | 53.7 (12.5) | 51.2 (14.4) |
- —European Commission Seventh Framework Program
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMental Health Research Topics · Functional Brain Connectivity Studies · Schizophrenia research and treatment
Introduction
Initial treatment for first-episode psychosis (FEP) fails to achieve remission in a substantial proportion of patients, contributing to an elevated suicide risk and long-term functional impairment.1^,^2 Despite decades of research, the factors underlying divergent treatment outcomes remain poorly understood. Traditional models cast psychotic disorders as latent diseases and evaluate efficacy by overall symptom reduction or diagnostic change. In practice, however, individual symptoms interact dynamically—delusional ideas often provoke social withdrawal, while hallucinations might fuel anxiety—and these interconnections may influence response trajectories.
Symptom network theory reframes mental disorders as systems of causally linked symptoms rather than passive indicators of an underlying pathology.3 Within this framework, autonomous symptoms activate one another in feedback loops that sustain and amplify the disorder. This approach has been discussed for its potential to optimize treatment planning and evaluation,4 and may also aid in understanding outcome variations. If symptoms form interconnected networks, treatment response might depend on network organization and its capacity for change. Comparing symptom network characteristics between treatment responders and non-responders could offer insights into why some patients achieve remission while others do not.
Research in depression has begun to demonstrate the clinical relevance of this network approach. Patients who respond to treatment show different baseline symptom network patterns than non-responders,5–7 suggesting prognostic value. However, psychosis research remains limited. Baseline centrality differences between treatment responders and non-responders have been described in patients with psychosis,8 but the single study specifically in FEP found no differences between outcome groups.9 However, this study constructed networks using composite domain scores from multiple questionnaires rather than individual symptom items, which may have obscured important symptom-level network differences. It is therefore unclear if baseline symptom network configurations can differentiate between eventual remitters and non-remitters in FEP.
Beyond baseline differences, it has been investigated how treatment affects symptom networks over time. A cross-sectional study with repeated measures demonstrated that antipsychotic treatment reduces overall network strength (ie, symptoms become less strongly connected to each other).10 Crucially, other work demonstrates that remitters' networks display greater flexibility in network structure and nodal strength compared to non-remitters.9 These findings highlight the potential of longitudinal network analysis to differentiate varying treatment responses.
However, these longitudinal approaches do not model how symptoms influence each other over time. Temporal network analysis addresses this limitation by modeling directed relationships between symptoms across time points, testing whether changes in 1 symptom predict subsequent changes in other symptoms. Zhang et al.11 used Cross-Lagged Panel Network (CLPN) analysis in FEP patients over 6 months, finding certain negative symptoms preceded changes in other symptoms. While providing insights into symptom dynamics, the relatively long measurement intervals make it difficult to isolate medication effects, and analyzing the total sample together obscures potential differences in temporal symptom dynamics between remitters and non-remitters. Understanding how temporal symptom interactions differ between these groups could provide important insights into treatment response or resistance.
Different treatments, particularly those with distinct pharmacological profiles, may influence symptom networks uniquely. Previous research has addressed this by examining medication-specific effects on symptom centrality through the construction of separate networks for different antipsychotics.10 Network intervention analysis (NIA) advances this research by embedding treatment directly into the network, allowing identification of which symptoms are directly targeted by specific medications.12 This enables more targeted treatment by focusing on medications that directly address patients' most prominent symptoms.
This study uses data from the Optimization of Treatment and Management of Schizophrenia in Europe (OPTiMiSE) trial13^,^14 to examine symptom networks before and during controlled antipsychotic treatment: a 4-week open-label amisulpride phase for all patients, followed by double-blind randomization of non-remitters to continue amisulpride or switch to olanzapine. We aimed to compare both baseline and temporal networks between remitters and non-remitters, and examine differential effects of continuing amisulpride versus switching to olanzapine on specific symptoms in initial non-remitters over the course of treatment.
Methods
Participants
In the OPTiMiSE trial, participants with FEP were recruited from 27 hospitals and psychiatric clinics across 14 European countries (Austria, Belgium, Bulgaria, Czech Republic, Denmark, France, Germany, Italy, the Netherlands, Poland, Romania, Spain, Switzerland, and the UK), as well as Israel. Eligible individuals were aged 18-40 and met the criteria for schizophrenia, schizophreniform disorder, or schizoaffective disorder according to the Diagnostic and Statistical Manual of Mental Disorders, 4th edition (DSM-IV). Diagnoses were verified using the Mini-International Neuropsychiatric Interview-Plus.15
Exclusion criteria were as follows: (1) more than 2 years between the onset of psychosis and study enrollment; (2) prior antipsychotic use exceeding 2 weeks in the previous year or a lifetime total of 6 weeks; (3) known intolerance to the medications used in the trial; (4) any contraindications for the study medications; (5) being under compulsory treatment, legal guardianship, or custody; and (6) pregnancy or breastfeeding during the trial.
For the current study, data from phases I and II of the original study were utilized. In phase I, participants received open-label treatment with amisulpride (200-800 mg per day) for 4 weeks. Out of the 481 participants who were assessed and provided informed consent, 446 were enrolled in this phase. Those who did not achieve symptomatic remission after the initial 4 weeks proceeded to phase II, a 6-week randomized, double-blind trial, in which participants either continued amisulpride (200-800 mg) or switched to olanzapine (5-20 mg). The transition from phase I to phase II occurred between day 1 and day 7 of phase II, depending on the dosage received in phase I. A total of 92 participants were enrolled in phase II. The study adhered to the ethical principles of the Declaration of Helsinki, with ethics approval obtained in all participating countries. All participants were fully informed about the study and provided written consent prior to participating.
Measures
The Positive and Negative Syndrome Scale (PANSS)16 was used to characterize psychotic symptoms. The PANSS exists of 30 items, of which 7 measure positive symptoms, 7 negative symptoms, and 16 general symptoms. It has a minimum score of 30 and maximum score of 210. Symptomatic remission was measured according to the Andreasen criteria17: 8 specific symptoms (items P1, P2, P3, N1, N4, N6, G5, and G9) are at most only mildly present (maximum rating of “3”). In our study, we utilized a recently proposed 21-item version of the PANSS.18 This version excludes 9 items that showed low stability in network analyses. Stability here refers to how consistently items cluster with the same symptom group when the analysis is repeated on multiple bootstrap samples. In the development of the PANSS-21, Dal Santo et al. applied a conservative stability cutoff of 0.75, such that items appearing in their original symptom community in fewer than 75% of resampled networks were excluded, informed by Monte Carlo simulation guidelines indicating reduced stability below approximately 0.65–0.75.19 We used this 21-item version for 3 reasons. First, as demonstrated by Dal Santo et al., restricting analyses to structurally stable items is essential for network modeling, as items with low stability undermine the reliability of estimated symptom relationships. Second, focusing on stable items yields a more interpretable and reproducible network structure, as unstable items that shift between symptom dimensions obscure the meaning of network edges. Third, the reduced number of nodes improves the participant-to-variable ratio, enhancing the robustness of network estimation given the available sample size. The remaining 21 items are distributed across 4 symptom domains: Positive, Cognitive/Disorganized, Excited/Aggressive, and Negative (see Table S1 for details). PANSS assessments were conducted at baseline, week 1, 2, and 4 during phase I, and at week 6, 8, and 10 during phase II.
Network Analysis
Cross-Sectional Networks
We estimated regularized partial correlation networks to compare the symptom structures of remitters and non-remitters at baseline. These networks consist of nodes (representing individual PANSS items) and edges (representing the statistical associations between symptoms). Due to the ordinal nature of the PANSS items, we constructed the networks using Spearman correlations.
To obtain sparse networks with reduced false-positive connections, we applied the Least Absolute Shrinkage and Selection Operator (LASSO) regularization technique.20 This approach implements an ℓ1 penalty that shrinks small edges toward zero, resulting in a more conservative network estimation. The degree of penalization was controlled by the hyperparameter γ, set to the default value of 0.5.21
To enable a direct visual comparison of the remitter and non-remitter networks, the node layout was standardized using the "averageLayout" function, ensuring consistent positioning. The maximum edge value was capped at 0.54 (based on the strongest edge of the 2 networks), ensuring that edge weights were comparable across networks.
Empirical comparisons of symptom networks may be influenced by differences in symptom severity and sample size.5 Therefore, to ensure that group-level differences were not driven by baseline severity scores and to derive equally sized groups, we performed a sensitivity analysis comparing networks of subgroups matched on baseline severity scores. Matching was conducted using the 'nearest neighbor' method.22
Network Centrality
To examine node centrality within the network, we followed prior research in psychosis and estimated both strength and one-step expected influence (EI) centrality.11^,^23 Node strength is defined here as the sum of the absolute values of all edge weights and provides a measure of overall connectivity irrespective of edge sign. Expected influence extends this measure by accounting for both the direction and magnitude of connections.24 Specifically, one-step EI is calculated as the sum of all edge weights connected to a given node, with positive edges increasing and negative edges decreasing the node's EI.
Network and Centrality Stability
In accordance with recommendations of Epskamp et al.,25 the robustness of the network model was assessed by estimating the accuracy of edge weights and the stability of centrality indices. To estimate the accuracy of the edge estimates, non-parametric bootstrapping with 1000 samples was done. The confidence intervals (CIs) obtained from these analyses can be used to determine the accuracy of edge weights, with narrower CIs indicating higher accuracy of estimated edges.25 We also evaluated the stability of the centrality indices by using a case-dropping bootstrap procedure. This approach examines whether centrality rankings remain consistent when portions of the data are randomly removed.25 For the EI and strength centrality, we calculated a correlation stability coefficient (CS-coefficient), which represents the maximum proportion of cases that can be dropped while maintaining a correlation of 0.70 or higher with the original centrality values (based on 1000 bootstrap samples). Following established guidelines, we used a CS-coefficient threshold of 0.25 to define network stability.25 Networks with CS-coefficients above this threshold were considered sufficiently stable for interpretation.
Comparison of Network Characteristics
To compare the remitter and non-remitter networks, we used the Network Comparison Test (NCT), which is a permutation test that assesses the difference between 2 networks.26 The NCT was performed with 1000 permutations to test for differences in both global strength (absolute sum of all edge weights), network structure (distribution of edge weights), and EI and strength centrality. When significant differences in network structure were detected, we performed post-hoc testing on individual edges using Holm-Bonferroni corrections for multiple comparisons.
All analyses were conducted using R statistical software version 4.3.2.27 For cross-sectional networks, we used bootnet (1.6)25 (estimateNetwork with EBICglasso) for estimation, qgraph (1.9.8)21 for visualization and centrality analysis (centralityPlot), and MatchIt (4.7.0)22 for matching. Stability was assessed with bootnet (using the same LASSO parameters as in the original estimation), and group differences were tested with NetworkComparisonTest (2.2.2).28
Temporal Networks
To model the temporal relationships between PANSS items during the first 4 weeks of treatment, we used the CLPN approach.29 This method captures directed relationships between symptoms across discrete time points, allowing us to examine how symptoms influence each other over the course of treatment. These directed paths represent the shared variation between a symptom at time t and another symptom (either the same or different) at time t + 1, while accounting for all other symptoms at time t. Although phase I consisted of measurements at baseline, week 1, week 2 and week 4, we only included baseline (T0), week 2 (T1) and week 4 (T2) in our analyses, as CLPN analysis requires equal time intervals between measurements.
To estimate the CLPN, we computed autoregressive and cross-lagged coefficients using regularized regressions with a penalized maximum likelihood and LASSO penalty,20 with penalty selection via 10-fold cross-validation (CV). While LASSO effectively identifies true zero paths, it biases non-zero edge weights toward zero29 and the CV approach often retains false positive connections.30 To address the bias in non-zero edge estimation, we implemented a two-step procedure following Rhemtulla et al.29 where paths estimated as zero in the first step were fixed to zero, and the remaining non-zero edges were re-estimated using standard, non-regularized regression within a structural equation modeling (SEM) framework. This approach maintains the sparse structure identified by LASSO while providing unbiased estimates for the non-zero paths.
The use of SEM also allowed us to examine whether the predictive relations across successive occasions were equal (eg, if the paths from T0 to T1 are equivalent to those from T1 to T2). We fitted two nested models: an unconstrained model that allows different values for paths across time points, and a constrained model where both the presence and values of paths from T0 to T1 are constrained to be equal to those from T1 to T2.29 Once both models have been fit to the data, they can be compared using a nested chi-square difference test and approximate fit measures (eg, Root Mean Square Error of Approximation Difference [RMSEAD]).31 If imposing constraints did not significantly worsen model fit, we selected the constrained model for parsimony. If the constraints significantly degraded model fit, we retained the unconstrained model to better capture temporal variations in symptom dynamics.
Network Centrality
To quantify symptom centrality in the temporal networks, two indices were calculated: in-prediction and out-prediction.29 In-prediction reflects the extent to which each symptom at a given timepoint is predicted by all other symptoms measured at the preceding timepoint. Out-prediction captures the extent to which each symptom predicts other symptoms at the subsequent timepoint. Both indices are expressed as proportions of explained variance (R^2^), ranging from 0 (no predictive value) to 1 (perfect prediction). Compared to traditional directed centrality measures, such as in-strength and out-strength—which quantify only the summed strength of directed associations—in-prediction and out-prediction explicitly capture the predictive relationships between symptoms across consecutive measurement points. As such, these prediction-based indices offer greater clinical insight by directly quantifying how strongly individual symptoms forecast and respond to changes in the broader symptom network over the course of treatment.
Network Stability
To assess network stability, we conducted bootstrap analyses (1000 iterations) using non-parametric bootstrap resampling. Bootstrap CIs were calculated for all edge weights to evaluate the precision and robustness of parameter estimates.
Comparison of Network Characteristics
Since the NCT has been developed specifically for cross-sectional data and has not been validated for longitudinal data analysis, we needed alternative methods to test differences between remitter and non-remitter temporal networks. Following previous CLPN literature, we used two complementary measures: the correlation between non-zero edge weights and the Jaccard Index (JI).32–35 Matrix correlations quantified the similarity of existing edge strengths, and the JI36^,^37 quantified overlap in edge presence (range 0-1). Details are provided in the Supplementary Materials.
For temporal networks, we applied glmnet (4.1.8)38 for regularized regressions, lavaan (0.6.17)39 for SEM to re-estimate the remaining non-zero edges, qgraph for visualization, and bootstrapLavaan (lavaan) for stability analysis.
Network Intervention Analysis
For Phase II, we applied NIA12 to assess the differential effects of olanzapine and amisulpride at weeks 6, 8, and 10. A Mixed Graphical Model (MGM)40 including all PANSS items (continuous) and the treatment variable (binary) was used to estimate the symptom network. Since the networks were estimated on full datasets, we included all available data at each assessment point, which led to slight variations in sample sizes.
Consistent with our previous analyses, we estimated the network using LASSO regularization with CV to select the optimal tuning parameter. For the estimated network, we computed the predictability of each symptom, representing the proportion of variance explained by all the other symptoms in the network.41 In case of a differential effect of treatment, we performed non-parametric bootstrapping with 1000 samples.25 Besides the CIs of the estimated edge weights, the plot also shows the proportion of bootstrap samples that included the edge in the network. As the focus was on the differential effect of treatment, we concentrated only on links involving treatment.
Given that we were contrasting two active treatment conditions, any edge between the treatment variable and a specific symptom reflects a treatment-specific effect. Importantly, these direct effects highlight differences between the two treatments. If both treatments affect a symptom similarly, this effect will not be displayed in the network. Therefore, the absence of an edge should not be interpreted as the absence of a treatment effect but rather as the absence of a differential treatment effect between amisulpride and olanzapine.
We used the mgm package (version 1.2.14)40 for estimating the networks, the qgraph package for visualization, and the bootnet package for stability analyses.
Results
Sample Characteristics
The baseline sample of phase I included 446 participants, with 250 classified as remitters and 196 as non-remitters by the end of the phase. The mean age of the participants was 25.5 years (SD = 6.0), and 70% of the sample were male. A total of 386 participants completed the assessment at week 2, and 371 completed the final assessment. The baseline sample of phase II included 85 participants, with 43 randomized to receive amisulpride and 42 to receive olanzapine. Detailed baseline characteristics for both Phase I and Phase II are presented in Table 1. Sample characteristics for subsequent time points (Phase I: Table S2; Phase II: Table S3) and individual PANSS scores (Phase I: Table S4; Phase II: Table S5) are available in the Supplementary Materials.
Cross-Sectional Networks
Comparison of Baseline Networks in Remitters and Nonremitters
Figure 1A and B show the baseline symptom networks for patients who responded to treatment by the end of Phase I (remitters) and those who did not (non-remitters). The remitter network contained 90 non-zero edges, while the non-remitter network contained 87 non-zero edges. There was no significant difference in the overall network structures between the two groups (M = 0.15, P = .94). Global strength also did not significantly differ between remitters (8.56) and non-remitters (7.94, S = 0.62, P = .48). Figure 1C shows the standardized EI and strength centrality for remitters and non-remitters. For remitters, disturbance of volition (G13) showed the highest EI and strength centrality, followed by emotional withdrawal (N2) and poor rapport (N3). In non-remitters, emotional withdrawal (N2) showed the highest EI, followed by preoccupation (G15) and poor rapport (N3), whereas strength centrality was highest for delusions (P1), followed by emotional withdrawal (N2) and preoccupation (G15). Despite these descriptive differences, no nodes showed statistically significant differences in EI or strength centrality between remitters and non-remitters. Sensitivity analyses using a matched sample yielded similar results, with no significant differences in overall network structure (M = 0.16, P = .79) or global strength (S = 0.57, P = .82) between remitters and non-remitters. Node-specific EI or strength values also did not differ significantly between groups. See Figure S1 for details. Bootstrapped 95% CIs of edge weights suggest that edges are fairly stable (Figure S2). EI and strength centrality estimates were also fairly stable (Figure S3), with a CS-coefficient of 0.44 for the remitters and 0.29 for the non-remitters. Complete edge weight matrices are provided in Tables S6 and S7.
Baseline networks, expected influence (EI), and strength centrality for remitters and non-remitters. A, network of remitters (n = 250). B, network of non-remitters (n = 196). C, Z-scored EI and strength centrality for remitters and non-remitters. In the networks, blue solid lines represent positive associations, and red dashed lines represent negative associations. Edge thickness indicates the strength of the association, with thicker edges reflecting stronger connections; all edges were standardized to a maximum value of 0.54, corresponding to the strongest observed association.
Cross-lagged panel network model plots for T0 → T1 network and T1 → T2 network of remitters and non-remitters. A, T0 → T1 remitters. B, T1 → T2 remitters. C, T0 → T1 non-remitters. D, T1 → T2 non-remitters. The direction of the edges indicates the direction of the cross-lagged coefficients between nodes. Dashed red edges signify negative associations, while blue edges represent positive associations. The saturation and thickness of the edges indicate the strength of the associations. All edges were standardized to a maximum value of 0.40, corresponding to the strongest observed association. Only significant (P < .05) edges are visualized.
Temporal Symptom Dynamics
The primary analyses focused on comparing temporal symptom networks between remitters and non-remitters during antipsychotic treatment. For completeness, additional analyses of the full sample can be found in the Supplementary Materials.
Symptom Severity over Time
Item-level symptom severity over time for remitters and non-remitters showed distinct patterns (Figure S4, standard deviations in Table S4). Remitters showed substantial symptom improvement between T0 and T1, particularly for positive symptoms (P1 and P6), followed by a more gradual decline from T1 to T2**.**
In contrast, non-remitters exhibited a more heterogeneous and dynamic symptom trajectory. Between T0 and T1, most symptoms decreased moderately, but several symptoms increased in severity, including blunted affect (N1), lack of spontaneity and flow of conversation (N6), mannerisms and posturing (G5), and motor retardation (G7). From T1 to T2, although a few symptoms showed slight reductions, the majority showed renewed increases. These included delusions (P1), hostility (P7), blunted affect (N1), emotional withdrawal (N2), poor rapport (N3), social withdrawal (N4), lack of spontaneity and flow of conversation (N6), mannerisms and posturing (G5), motor retardation (G7), unusual thought content (G9), disturbance of volition (G13), and poor impulse control (G14). The most notable increase was observed in N6 (lack of spontaneity and flow of conversation), with the mean score rising from 2.79 at T1 to 3.07 at T2.
Testing Temporal Stability of Psychotic Symptom Relationships
As a first step, we examined whether symptom dynamics remained consistent across time intervals during antipsychotic treatment by testing the imposition of cross-time constraints. For remitters, the unconstrained model fitted the data well; χ^2^(df = 859) = 904.7, P = .136, RMSEA = .015 (.000,.024); CFI = .994, TLI = .988 (robust statistics reported). The constrained model showed acceptable fit; χ^2^ (df = 1091) = 1344.38, P < .001; RMSEA = .016 (.000,.025); CFI = .990, TLI = .984. The chi-square difference test showed a significant difference, χ^2^diff (df = 232) = 439.7, P < .001. This suggests that adding the cross-time constraints significantly reduces model fit, indicating that symptom dynamics changed significantly between the first-time interval (T0 to T1) and the second time interval (T1 to T2), a pattern also observed in non-remitters (χ^2^diff (df = 213) = 437.1, P < .001). Therefore, we used the unconstrained model for all subsequent analyses to accurately capture the temporal variations in symptom dynamics during antipsychotic treatment.
Comparison of Cross-Lagged Panel Networks in Remitters and Nonremitters
Figures 2 and 3 present the directed symptom networks for remitters and non-remitters across both time intervals (T0 → T1 and T1 → T2), along with the in-and-out prediction values. The directed edges represent temporal associations between symptoms after controlling for all other symptoms at baseline (for T0 → T1) and week 2 (for T1 → T2). To enhance interpretability, autoregressive edges were excluded from the plots (see Figure S5 for complete networks including autoregressive edges). The mean weight of the autoregressive edges was 0.48 for the remitter T0 → T1 network, 0.51 for the non-remitter T0 → T1 network, 0.51 for the remitter T1 → T2 network, and 0.54 for the non-remitter T1 → T2 network (Figure S6).
Cross-lagged in-and-out prediction plots for T0 → T1 and T1 → T2 for remitters and non-remitters. A, In-prediction plot T0 → T1. B, In-prediction plot for T1 → T2. C, out-prediction plot T0 → T1. D, out-prediction plot T1 → T2. The x-axis represents the explained variance.
Network intervention analysis comparing amisulpride and olanzapine effects. A, week 6. B, week 8. C, week 10. The networks include the PANSS items (circles) and the treatment (square). Treatment includes amisulpride (scored as 0) and olanzapine (scored as 1). A green line indicates an effect in favor of olanzapine, and a yellow line indicates an effect in favor of amisulpride. For each node, the proportion of explained variance by all other nodes, that is, the predictability, is visualized by a ring around the node: A completely filled ring indicates that 100% of the variance is explained, a completely empty ring corresponds to an explained variance of 0%. Abbreviations: PANSS, Positive and Negative Syndrome Scale.
Network similarity analyses revealed substantial differences between remitters and non-remitters. There was minimal overlap in which symptom connections were present between groups (T0 → T1: JI = 0.014; T1 → T2: JI = 0.055, indicating that only 1.4% and 5.5% of all unique connections were shared), and the strength of non-zero edge weights showed virtually no correlation between groups (T0 → T1: r = -0.09, P = .45; T1 → T2: r = 0.005, P = .97). These findings indicate largely distinct patterns of symptom-to-symptom associations between remitters and non-remitters over time.
Examining these differences in more detail, remitters initially (T0 → T1) showed delusions (P1) as the most predicted symptom (R^2^ = 0.18), influenced by a combination of positive (P6), negative (N3), excited/aggressive (G14) and cognitive/disorganized symptoms (N5, G13, G15). Additionally, the excited/aggressive symptom, poor impulse control (G14), emerged as the strongest predictor (R^2^ = 0.012), influencing several positive (P1, P6, G9) and excited/aggressive (G8, P7) symptoms. These predictive patterns shifted in the T1 → T2 network, where the previously central nodes showed markedly lower in-or-out prediction values (P1: R^2^ = 0.05; G14: R^2^ = 0.001). Instead, the negative symptom poor rapport (N3) emerged as the strongest predictor (R^2^ = 0.016), influencing cognitive/disorganized (N7), positive (P3), and excited/aggressive (G8) symptoms. Uncooperativeness (G8) became the most strongly predicted symptom (R^2^ = 0.24), though its only predictor was N3.
For non-remitters, symptom dynamics followed a different trajectory. In the T0 → T1 network, the cognitive/disorganized symptom G10 (disorientation) was the most strongly predicted node (R^2^ = 0.20), influenced by negative (N4) and cognitive/disorganized (G13) symptoms. G13 (disturbance of volition) was also the strongest predictor (R^2^ = 0.006), although it only influenced G10. This shifted in the T1 → T2 network. The cognitive/disorganized symptom motor retardation (G7) emerged as the most predicted symptom (R^2^ = 0.28), driven by negative (N1, N2, N3), positive (P6, G9) symptoms. At the same time, difficulty in abstract thinking (N5) became the strongest predictor (R^2^ = 0.012), influencing P6 and G10. Though cross-lagged effects appear small, they remain meaningful given high symptom autocorrelations that constrain available variance for cross-symptom prediction.
Notably, in the non-remitter T1 → T2 network, all outgoing edges from positive symptoms were negative, indicating that lower levels of positive symptoms at week 2 were associated with higher levels of other symptoms at week 4, particularly in the negative and cognitive/disorganized domains. This pattern is reflected in the mean scores (Figure S4), which generally shows a continued decline in positive symptoms alongside increasing negative symptoms among non-remitters.
Complete edge weight matrices for all networks are provided in Tables S8-S11, detailing all directed associations between PANSS items for each group and time interval. Bootstrapped CIs around edge weights were moderate (see Figure S7).
Differential Effects of Amisulpride and Olanzapine
Detailed item-level severity scores for both treatment groups across all assessment timepoints of phase II are presented in Table S5. Figure 4 shows the network models at week 6, week 8, and week 10. While there were no symptom-specific differences between treatments at weeks 6 and 10, olanzapine demonstrated a symptom-specific effect on poor rapport (N3) at week 8. However, stability analysis revealed that the link between treatment and poor rapport was retrieved in only 31% of the samples, indicating low stability (Figure S8).
Discussion
This study provides novel insights into the symptom dynamics before and during antipsychotic treatment in FEP, comparing remitters and non-remitters and examining differential symptom-level effects of amisulpride versus olanzapine. Cross-sectional analyses revealed no significant differences between outcome groups. In contrast, temporal network analyses uncovered distinct patterns of symptom interactions at both intervals. Among remitters, early dynamics were characterized by delusions as the most strongly predicted symptom and poor impulse control as the dominant predictor; at the later interval, the emphasis shifted toward uncooperativeness and poor rapport. Non-remitters followed a different trajectory: disorientation was initially the most predicted symptom, and disturbance of volition the strongest predictor, later shifting to motor retardation and difficulty in abstract thinking. While olanzapine showed a symptom-specific effect on poor rapport at week 8, this effect demonstrated low stability. Overall, no robust symptom-specific medication effects emerged.
Although this is the first item-level examination of baseline network differences, our findings align with previous research showing no baseline network differences between remitters and non-remitters using composite PANSS domains, depressive symptoms, and cognitive schemas.9 Importantly, while these cross-sectional networks could not distinguish outcome groups, our longitudinal CLPN analyses revealed marked differences in symptom dynamics during treatment. This suggests that temporal symptom interactions may provide more meaningful insights into treatment response mechanisms than static baseline relationships alone.
At the global level, remitters showed a dense, highly interconnected network in the initial phase (baseline→week 2), where baseline symptoms relatively strongly predicted symptom improvement. This structured predictability suggests symptom evolution in remitters follows a systematic trajectory responsive to pharmacological intervention. Conversely, non-remitters exhibited a fragmented network with fewer and weaker predictive connections, indicating their baseline profiles were less informative for anticipating subsequent change. This reduced predictability may reflect a more heterogeneous or unstable illness course influenced by factors beyond direct antipsychotic effects.
These patterns diverged further during the subsequent treatment phase (week 2 → week 4). Among remitters, the symptom network became less densely connected, mirroring their clinical course of substantial early improvement followed by more gradual change. This aligns with prior research showing that the most pronounced therapeutic effects typically occur within the first two weeks of antipsychotic treatment.42 The observed reduction in network density may reflect a relative decoupling of symptom interactions as symptom severity declines, with individual symptoms becoming more independent.
In contrast, non-remitters showed a shift from an initially fragmented network to a more densely interconnected 1 in the week 2 → week 4 interval, reflecting a more complex and increasingly interdependent symptom landscape. Notably, all outgoing edges from positive symptoms in this network carried negative weights. Combined with the observed symptom trajectories—where positive symptoms generally continued to decline while negative and cognitive/disorganized symptoms increased—this pattern suggests an inverse relationship: reductions in positive symptoms at week 2 predicted increases in other symptom domains by week 4. This dynamic may reflect a trade-off during treatment, where improvement in positive symptoms is accompanied by a worsening of negative or cognitive symptoms. Such dynamics could point to emerging secondary negative symptoms, neurobiological compensation, or unintended effects of treatment itself.
Some specific findings warrant detailed discussion. During baseline→week 2 in remitters, poor impulse control showed the highest out-prediction value, suggesting that baseline behavioral regulation may facilitate early antipsychotic response, consistent with evidence linking impulse control to treatment engagement and adherence.43 While previous research demonstrated that poor impulse control predicts worse functional outcomes in early-onset psychosis,44 our network analysis extends this by revealing the temporal influence of impulse control on subsequent symptom severity. Delusions’ high in-prediction value in remitters suggests that this symptom serves as a central marker of early treatment response, with suspiciousness as its primary predictor. This aligns with research showing suspiciousness often precedes delusional thinking as part of an evolving "delusional mood,"45 suggesting paranoid symptoms may influence delusion improvement rates during treatment. Poor rapport's highest out-prediction value at week 2 → week 4 suggests that after initial improvement (baseline→week 2), improvement in social functioning facilitates positive changes in other symptoms, consistent with research showing lack of social contact predicted more severe negative symptoms.46 In previous CLPN research, poor rapport was also found to predict changes in other symptoms.11
Notable findings for non-remitters included minimal predictability for delusions at baseline→week 2, suggesting changes in delusions were not systematically related to prior symptom states. Combined with high in-prediction for disorientation and high out-prediction for disturbance of volition, this pattern may indicate a symptom profile that is less responsive to conventional dopamine blockade. During week 2 → week 4, difficulty in abstract thinking emerged as the strongest predictor in the network, indicating that persistent cognitive/disorganized impairments may shape broader symptom dynamics and potentially impact recovery trajectories. This aligns with perspectives positioning cognitive dysfunction as the core feature of schizophrenia47^,^48 and is consistent with research showing that disorganized symptoms—including abstract thinking deficits—predict treatment resistance.49 Taken together, these findings raise the possibility that non-remitters comprise a subgroup in which cognitive dysfunction contributes to sustained symptom interconnectedness that standard antipsychotics do not fully address. These findings underscore the need to investigate whether adjunctive interventions targeting cognitive domains might benefit patients who show limited early response to antipsychotic medication.
Our finding of a few differences between continuing amisulpride and switching to olanzapine on a symptom level aligns with the primary findings from the OPTiMiSE trial,14 which demonstrated no significant difference in overall remission rates between the 2 treatments or in total PANSS score reductions. Our item-level analysis revealed only one difference: at week 8, olanzapine demonstrated a larger effect than amisulpride on poor rapport, suggesting a potential targeted benefit in this symptom domain despite overall comparable efficacy profiles. However, it is important to emphasize that the link between treatment and poor rapport lacked statistical robustness in our stability analysis, indicating this finding should be interpreted with caution rather than as definitive evidence of a treatment-specific effect.
Collectively, our findings challenge how treatment efficacy is currently evaluated in psychotic disorders. Clinical trials continue to rely predominantly on static severity measures such as total PANSS scores, treating psychosis as a problem of overall symptom burden rather than acknowledging its dynamic, multidimensional structure. Our temporal network analyses demonstrate that symptom interactions evolve distinctly across treatment trajectories, dynamics missed by conventional sum scores. This suggests that future trials could benefit from incorporating network-based endpoints that capture evolving symptom interdependencies during treatment. However, the present findings are hypothesis-generating: whether remission is reliably associated with reduced density requires replication. Moreover, our analyses reflect group-averaged patterns; individual patients may show heterogeneous network responses. Person-specific network approaches, for example, using ecological momentary assessment, will be needed to inform personalized treatment targets.50 Together, these findings motivate future work to define, validate, and prespecify network-based markers of treatment response.
Strengths and Limitations
A major strength of our study is its multi-layered network approach, which integrates cross-sectional and temporal analyses to better capture treatment response and resistance. CLPN enabled the analysis of non-stationary temporal symptom dynamics, a limitation of many traditional time-series models. Additionally, integrating NIA offered a novel way to assess differential medication effects at the symptom level, highlighting treatment targets often missed by conventional methods.
Limitations of this work include its hypothesis-generating rather than causality-confirming design. Network analyses identify temporal patterns between symptoms but cannot establish causal relationships without experimental manipulation. Moreover, CLPN models operate at the group level and do not disaggregate within-person dynamics from between-person differences, precluding individual-level inference. Data quality considerations include potential measurement noise in PANSS ratings. Although stability analyses were conducted, some of the greater heterogeneity observed in non-remitters may partially reflect measurement error rather than true symptom dynamics.51 Future work could further disentangle these effects by comparing observed networks to null models with predefined levels of measurement noise. In addition, no validated method currently exists to statistically compare CLPN networks, limiting formal inference. The relatively small phase II sample size may also have limited power to detect differential medication effects with NIA and likely contributed to the low stability of the olanzapine–poor rapport association. Finally, the scope of the present study was restricted to FEP and active antipsychotic treatment. Future research is therefore needed to determine whether similar temporal network dynamics characterize later illness stages, where longer illness duration and sustained treatment exposure may give rise to more stable or entrenched symptom interdependencies. Likewise, studies incorporating placebo or non-pharmacological control conditions would help clarify the relative contribution of specific medication effects versus non-specific processes such as expectancy or natural symptom fluctuation. Comparing network-level changes between pharmacological and psychosocial interventions could help determine whether distinct patterns of symptom reorganization emerge or replicate across treatment modalities.
Conclusions
In conclusion, temporal symptom-network analysis reveals dynamic patterns in early FEP treatment that are not evident from cross-sectional baseline assessments. The distinct temporal network trajectories in remitters versus non-remitters suggest that monitoring of dynamic symptom interactions could help identify patients who may benefit from alternative or adjunctive strategies. The absence of robust medication-specific effects between amisulpride and olanzapine suggests these treatments may have more general therapeutic effects, though this finding is exploratory given the small sample size.
Supplementary Material
Supplementary_Material_sbag016
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Ventriglio A, Gentile A, Bonfitto I, et al. Suicide in the early stage of schizophrenia. Front Psychiatry. 2016;7:116.27445872 10.3389/fpsyt.2016.00116 PMC 4921745 · doi ↗ · pubmed ↗
- 2Jordan G, Lutgens D, Joober R, Lepage M, Iyer SN, Malla A. The relative contribution of cognition and symptomatic remission to functional outcome following treatment of a first episode of psychosis. J Clin Psychiatry. 2014;75:e 566-e 572. 10.4088/JCP.13m 0860625004197 · doi ↗ · pubmed ↗
- 3Borsboom D . A network theory of mental disorders. World Psychiatry. 2017;16:5-13. 10.1002/wps.2037528127906 PMC 5269502 · doi ↗ · pubmed ↗
- 4Blanchard MA, Heeren A. Ongoing and future challenges of the network approach to psychopathology: from theoretical conjectures to clinical translations. Compr Clin Psychol. 2022;11:32-46. 10.1016/B 978-0-12-818697-8.00044-3 · doi ↗
- 5van Borkulo C, Boschloo L, Borsboom D, Penninx BW, Waldorp LJ, Schoevers RA. Association of symptom network structure with the course of depression. JAMA Psychiatry. 2015;72:1219-1226. 10.1001/jamapsychiatry.2015.207926561400 · doi ↗ · pubmed ↗
- 6Zhou J, Liu S, Mayes TL, et al. The network analysis of depressive symptoms before and after two weeks of antidepressant treatment. J Affect Disord. 2022;299:126-134. 10.1016/j.jad.2021.11.05934838606 · doi ↗ · pubmed ↗
- 7Mc Elroy E, Napoleone E, Wolpert M, Patalay P. Structure and connectivity of depressive symptom networks corresponding to early treatment response. E Clinical Medicine. 2019;8:29-36. 10.1016/j.eclinm.2019.02.00931193604 PMC 6537518 · doi ↗ · pubmed ↗
- 8Esfahlani FZ, Sayama H, Visser KF, Strauss GP. Sensitivity of the positive and negative syndrome scale (PANSS) in detecting treatment effects via network analysis. Innov Clin Neurosci. 2017;14:59-67.29410938 PMC 5788252 · pubmed ↗
