Multi-Site Classification of Autism Spectrum Disorder Using Spatially Constrained ICA on Resting-State fMRI Networks
Talha Imtiaz Baig, Junlin Jing, Peng Hu, Bochao Niu, Zhenzhen Yang, Bharat B. Biswal, Benjamin Klugah-Brown

TL;DR
This study uses brain imaging data from multiple sites to classify autism using machine learning, achieving high accuracy by analyzing specific brain networks.
Contribution
The study introduces a novel method combining spatially constrained ICA and ComBat harmonization for multi-site ASD classification.
Findings
The Visual Sensory Network achieved the highest classification accuracy (83.23%) and AUC (87.90%).
ComBat harmonization improved consistency across multi-site datasets, enhancing classification performance.
Network-based approaches show potential for identifying reproducible features in ASD diagnosis.
Abstract
Background/Objectives: Autism Spectrum Disorder (ASD) is a complex neurodevelopmental condition characterized by differences in social communications and restricted, repetitive patterns of behaviors and interests, affecting approximately 1% of children globally. While functional magnetic resonance imaging (fMRI) has provided insights into altered brain connectivity patterns in ASD, classification based on neuroimaging remains a challenging due to the heterogeneity of the disorder and variability in imaging data across sites. This study employs a network-based approach using large-scale, multi-site rs-fMRI dataset from the Autism Brain Imaging Data Exchange (ABIDE I and II) to classify ASD and healthy controls using machine learning. Methods: A semi-blind Independent Component Analysis method, specifically the spatial constraint reference ICA, is applied to identify functional brain…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8- —China MOST2030 Brain Project
- —National Natural Science Foundation of China
- —Natural Science Foundation of Sichuan Province
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFunctional Brain Connectivity Studies · Autism Spectrum Disorder Research · EEG and Brain-Computer Interfaces
1. Introduction
Autism Spectrum Disorder (ASD) is a persistent neurodevelopmental disability characterized by social interaction difficulties and repetitive behaviors [1]. The significant variability in neurocognitive features across autism from childhood to adulthood point out the complexity of identifying consistent neurobiological markers [2]. According to the World Health Organization (WHO), approximately 1 in 100 children globally is affected by ASD [3]. Recently, ASD diagnosis has relied on behavioral assessments [4,5,6] which are accurate but time-consuming and resource-intensive, and must be carried out by an expert.
Neuroimaging techniques offer promising paths for identifying effective physiological features associated with ASD by examining brain connectivity patterns. Functional magnetic resonance imaging (fMRI) is widely known for its ability to investigate these dynamics [7,8]. fMRI studies have shed light on the neurological basis of ASD, revealing altered functional connectivity in key brain regions such as the cerebellum, amygdala, and prefrontal cortex [9,10]. Given the complexity and heterogeneity of ASD, data-driven methods like independent component analysis (ICA) [11,12] enable brain signals to be decomposed into distinct, independent sources, allowing researchers to identify functional networks. This approach is particularly valuable in ASD research, providing a more comprehensive understanding of the disorder’s neural mechanisms.
Machine learning in healthcare has also emerged as a powerful data-driven tool in neuroimaging research [13,14]. Supervised methods including Support Vector Machines (SVMs) are frequently employed as diagnostic tools [15]. SVMs are particularly effective due to their robustness against overfitting and their ability to handle high-dimensional data efficiently [16,17]. In ASD classification, when processing data that is nonlinear and is also nonlinear in the original feature space, SVM can employ the kernel function to project the dataset into a larger space, where it becomes discrete [18,19].
First, although large multi-site repositories such as ABIDE provide increased statistical power, many existing studies have either focused on single-site analyses [20,21] or pooled multi-site data without explicit harmonization procedures [22]. This practice has been shown to reduce model generalizability due to scanner- and site-related variability [23,24,25,26]. Second, a large proportion of ICA-based ASD studies rely on unconstrained ICA approaches or fully data-driven component estimation, which can produce subject-specific or dataset-specific components. Such variability undermines the reproducibility and cross-site comparability of functional networks, particularly in large heterogeneous datasets [27]. Third, although modern machine learning and deep learning models have achieved moderate classification performance using ABIDE data [22], many of these approaches provide limited insight into network-level contributions, restricting their interpretability in the context of large-scale functional brain organization relevant to ASD.
To address these limitations, the present study integrates three complementary methodological components: (i) spatially constrained ICA using validated network templates to ensure consistent component extraction across subjects and sites; (ii) ComBat harmonization to reduce site-specific variability while preserving biologically meaningful signals; and (iii) a network-specific machine learning framework evaluated across 11 independent sites. The current investigation focuses on the default mode, sensorimotor, and visual networks, which have been repeatedly implicated in ASD-related alterations in social cognition, sensory processing, and motor function [28,29,30].
The primary objective of this research is to develop a reproducible, interpretable, and multi-site harmonization-driven analytical framework that advances beyond conventional ICA- or SVM-based approaches to Autism Spectrum Disorder (ASD) classification [31]. Specifically, this study aims to (1) mitigate multi-site variability through the application of a robust data harmonization strategy; (2) evaluate the discriminative contribution of major large-scale functional brain networks to ASD classification; and (3) assess the reliability and generalizability of machine learning-based classification performance across independent sites. All technical aspects, including functional network selection, ICA methodology, and harmonization procedures, are detailed in the Section 2.
2. Materials and Methods
Five key steps were incorporated to develop a methodology based on network analysis: (1) acquisition of data; (2) preprocessing; (3) spatial constraint ICA analysis; (4) harmonization using ComBat; (5) classifying ASD and HC into two categories.
2.1. Data Acquisition
Our study utilized publicly available datasets from the Autism Brain Imaging Data Exchange (ABIDE-I and ABIDE-II) repository (http://fcon_1000.projects.nitrc.org/indi/abide/ (accessed on 15 August 2025)) [32,33]. This dataset includes contributions from 38 different sites worldwide, encompassing a total of 2264 participants, with 1083 subjects diagnosed with ASD and 1181 healthy controls. Each contributing site confirmed ASD diagnoses using standardized clinical assessments [16].
To enhance the dataset quality, the study implemented data cleaning and preprocessing steps. Initially, sites with limited data on ASD subjects, such as CMU from ABIDE-I, were excluded. Data from individuals with excessive head motion (greater than 2 mm) or rotations exceeding 2 degrees were also removed [34]. Data from two ASD-only sites (KUL and NYU2 from ABIDE-II) were excluded. Approximately 300 participants were removed due to head motion, noise, or missing features data. Subjects with fewer time points were also excluded.
Following standard preprocessing procedures, data from 35 sites, comprising 1750 participants, were initially retained. To ensure reliable ICA decomposition and effective site-specific harmonization, only sites with at least 50 participants (including both individuals with ASD and healthy controls) were included in the final analysis. Prior research [35] has demonstrated that small site-level sample sizes in multi-site studies can lead to unstable ICA component estimation, inflated variance, and compromised model generalizability. Adequate per-site sample sizes are also critical for high-dimensional machine learning approaches, as they help mitigate overfitting and support robust within-site cross-validation. Applying this criterion yielded a final dataset of 996 participants from 11 sites, including 451 individuals with ASD and 545 typically developing controls. Participant ages ranged from 5 to 64 years, and the sample exhibited a male predominance, consistent with the sex distribution commonly observed in ASD research. This curated dataset was used to validate and quantitatively evaluate the performance of the proposed method.
2.2. fMRI Data Preprocessing
Resting-state fMRI data were preprocessed using the DPARSFA toolbox (version V7.0; http://www.rfmri.org; State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China) [36,37] in MATLAB (version R2022b; The MathWorks Inc., Natick, MA, USA). The preprocessing was performed according to the following procedure: (1) removing the initial 10 volumes to avoid the impact of uncertainty of system scanning; (2) performing slice time correction; (3) performing three-dimensional realignment for head movement correction; (4) removing non-brain tissues using BET; (5) performing T1 weighted co-registration; (6) using DARTEL for segmentation; (7) global signal regression was skipped because of apprehension about negative correlations [38,39,40]; (8) data was normalized with [3 × 3 × 3] mm voxel size to MNI space [41]; (9) spatial smoothing was performed. So, 996 subjects from 11 different sites including 451 ASD patients and 545 HC controls were preprocessed.
2.3. Spatial Constraint ICA
This study employed spatially constrained ICA using predefined templates [11,42], which substantially reduced sensitivity to initialization and model-order variability compared to unconstrained ICA. The analysis was performed independently across 11 sites and on the harmonized dataset, yielding consistent extraction of the same 28 components. While multiple random initializations were not explicitly tested, prior work has demonstrated the improved stability of this approach.
Following preprocessing, a group-level average brain mask was generated across all sites to ensure a consistent feature space for subsequent analyses. Semi-blind spatially constrained independent component analysis (sICA) [43] was then performed using the GIFT toolbox (Group ICA of fMRI Toolbox, version 4.0.4.11; www.nitrc.org/projects/gift; The Mind Research Network, Albuquerque, NM, USA) (accessed on 30 August 2025)). Prior methodological work indicates that a minimum of approximately 20 components is required to obtain meaningful intrinsic connectivity networks, whereas model orders exceeding 100 components can substantially reduce ICA reproducibility [35,44]. Based on these considerations, the present study extracted 28 predefined intrinsic connectivity network (ICN) spatial maps.
Spatial constraints were derived from well-established resting-state network templates provided by the NeuroMark atlas within the GIFT framework. These templates were originally estimated from large, independent healthy control datasets and have been validated to represent stable and reproducible functional brain networks. Incorporating these spatial priors into the ICA framework ensures consistent extraction of homologous components across subjects and sites, thereby improving cross-site comparability and reproducibility.
For each ICN, normalized spatial maps were obtained by applying voxel-wise z-score normalization. Within each component mask, mean voxel intensity values were computed to generate a numerical feature vector for each component per subject. Prior to network-level filtering, this procedure yielded 28 component-level feature values per subject. Spatially constrained ICA is particularly advantageous for large multi-site datasets such as ABIDE, as it enforces spatial consistency while retaining the ability to identify spatially independent functional components.
The sICA procedure followed the standard multi-stage pipeline described previously [45,46]: (1) subject-level principal component analysis (PCA) was first applied individually to each participant’s data; (2) group-level PCA was then performed after temporal concatenation of subject-level data; (3) the spatially constrained ICA algorithm was applied to estimate ICNs; and (4) artifactual group-level components were identified and removed using established criteria [47].
This study aimed to identify network-level features associated with ASD that could facilitate network-based classification. The application of spatial constraint ICA enabled us to comprehensively identify brain functional connectivity signals, addressing the heterogeneity of ASD across diverse populations.
2.4. Network Selection and Component Pairing
Following preprocessing, spatially constrained ICA (scICA) was applied using the NeuroMark functional network template to decompose the brain into seven canonical resting-state networks: the Default Mode Network (DMN), Sensorimotor Network (SMN), Basal Ganglia Network (BGN), Auditory Network (AN), Attention Network (ATN), Visual Sensory Network (VSN), and Frontal Network (FN). These seven networks constitute a well-established, widely accepted functional parcellation in the neuroimaging literature [48]. The template further subdivides these networks into 28 predefined, anatomically and functionally validated components. Specifically, the DMN comprises four components (07, 17, 19, 26); the SMN includes six components (01, 05, 06, 08, 10, 21); the BGN and AN each consist of a single component (04 and 02, respectively); the ATN contains six components (09, 18, 20, 23, 27, 28); the VSN also includes six components (11, 13, 15, 22, 24, 25); and the FN is composed of four components (03, 12, 14, 16). Guided by neurofunctional relevance and prior evidence linking these networks to Autism Spectrum Disorder [49,50], the present study examined all 28 components across the seven standard resting-state networks to ensure that the network-level analyses were biologically grounded and reproducible.
Network selection was guided by three quantitative and neurobiologically grounded criteria. First, component quality and spatial correspondence were assessed by computing spatial correlations with the NeuroMark atlas templates (requiring r > 0.5) and applying standard GIFT toolbox quality metrics, including peak z-scores of spatial maps exceeding 3.0 and acceptable temporal signal-to-noise ratios. Second, neurobiological relevance to Autism Spectrum Disorder (ASD) was considered, with priority given to networks consistently implicated in prior ASD research, particularly the Default Mode Network (DMN), sensorimotor network (SMN), and visual sensory network (VSN), which are associated with social cognition, sensory integration, and motor processing, respectively. Third, to ensure robustness in multi-site analyses, only networks represented by at least four components consistently identified across all sites were retained.
This selection procedure is standard in ICA-based neuroimaging studies, where the interpretability of components is important. After selecting the networks and components, identical components from these networks were harmonized using the ComBat model across all sites.
2.5. Harmonization of fMRI Datasets
Combining neuroimaging data from multiple sites is often necessary to achieve suitable sample sizes and ensure consistency, providing reliable results [51,52]. However, prior studies have highlighted challenges in merging fMRI datasets due to variations in imaging techniques [53,54]. Despite efforts to standardize imaging procedures, inconsistencies persist due to factors such as scanner brand, magnetic field strength, coil type, stabilization mechanisms, channel numbers, image resolution, and pulse sequence settings (e.g., TR, TE, and flip angle) [55]. Specifically, ComBat harmonization [56] was performed using site as the batch variable, while age and sex were included as covariates to preserve biologically meaningful variance. This approach ensures that site-related effects are minimized while controlling for demographic variability. The harmonized features were subsequently used in the classification pipeline. The whole dataset (n = 996) was subjected to ComBat harmonization following ICA feature extraction but before cross-validation. This approach is suitable for ComBat because it needs sufficient within-site sample sizes to precisely calculate site specific mean and variance parameters. Importantly, site effects represent technical artifacts, which are constant for all subjects. ComBat removes all effects while maintaining biological signals. The parameters obtained from all the sites were used to harmonize the reserved test site for leave-one-site-out cross-validation. Although the test site is included in parameter estimations, this does not constitute information leakage in the traditional sense because the harmonization is only informed by site-level technical factors instead of individual subject data. This approach is useful for harmonizing test sites using only training site parameters, which would apply corrections obtained from different scanner types. Additionally, every machine learning step that followed LDA and PCA hyperparameters adjustment and SVM training was caried out only inside training folds with strict test isolation. This ensures that only training data is used for test prediction [57], while harmonization simply takes care of technical variance. The ComBat harmonization model and mathematical formulations were explained in Section S1 of the Supplementary Materials [58].
2.6. Feature Extraction and Selection
Spatial features were derived by extracting voxel-wise z-score-normalized intensities from each component’s spatial map. After applying a group-level brain mask, each component contained approximately 214 voxels. Across 28 components, voxel intensities were vectorized, yielding approximately 6000 spatial features per subject across the selected networks.
Selecting the right features is important, mainly when using large feature vectors. A high feature-space can decrease accuracy, increase training and testing times, and increase model complexity. Features selection decreases feature size, improves model generalizability, and increases classification performance [59,60,61]. A two-step dimensionality reduction approach was used to handle high feature dimensionality and reduce redundancy. Principal Component Analysis (PCA) and Linear Discriminant Analysis (LDA) were used to minimize high dimensionality. PCA keeps the most important features, whereas LDA improves the selection to maximize class separation [62]. PCA was only applied to training folds (not to test data) to avoid data leakage. PCA-transformed data was used as an input to LDA. PCA and LDA were explained in Section S2 of the Supplementary Materials [58].
2.7. SVM as a Classification Model
The classifier uses the information given to it to build a hyperplane between distinct groups, such as ASD and HC, in a space with several characteristics. The SVM finds the best separating hyperplane between two classes while maintaining the maximum distance between them. SVM classification was performed using the radial basis function (RBF) kernel. Model parameters were optimized through grid-search tuning and the best parameter pairs (b*, y*) were selected based on the average cross-validation area under the curve (AUC). In all experiments, cross-validation was implemented using site-specific data splits to ensure balanced representation of sites in both training and testing sets, thereby mitigating site-related bias. Classification analyses were conducted at two levels: (1) within individual sites to assess site-specific performance, and (2) across combined sites to evaluate model generalizability under diverse acquisition settings. This dual-level approach enhances the robustness and external validity of the classification results. Dimensionality reduction steps were repeated independently for each site and network, confirming that no data leakage occurred. Mathematically, it can be expressed in Section S3 of the Supplementary Materials [58].
2.8. Evaluation
To validate the efficiency of the proposed ASD/HC classification approach, several performance indices were calculated, namely accuracy (ACC), area under the curve (AUC), sensitivity (SEN), and specificity (SPE) [63,64]. The performance metrics can be defined mathematically and are provided in Section S4 of the Supplementary Materials [58].
2.9. Implementation
All the analyses were performed in MATLAB (version R2022b; The MathWorks Inc., Natick, MA, USA) using GIFT toolbox (Group ICA of fMRI Toolbox, version 4.0.4.11; www.nitrc.org/projects/gift; The Mind Research Network, Albuquerque, NM, USA), DPARSFA toolbox (version V7.0; http://www.rfmri.org; State Key Laboratory of Cognitive Neuroscience and Learning, Beijing Normal University, Beijing, China). Computation was performed on a workstation intel i9-13900k CPU (Intel Corporation, Santa Clara, CA, USA), 64 GB RAM, RTX 4090 GPU, Windows 11.
3. Results
The demographic information of the cumulative records of 996 subjects is shown in Table 1. The detailed information of global diagnostic imaging sites for Autism Spectrum Disorder from the ABIDE library are shown in Supplementary Material Table S1 [58]. Table S2 of the Supplementary Material [58] contains details of the institutions and the scanner selection parameters and all the experimental settings of all 11 sites. The detailed research methodology employed in this study is outlined in Figure 1, which presents a unique workflow.
3.1. Classification Results
The DMN, SMN, and VSN satisfied all criteria and were therefore selected for downstream analyses. Other networks were excluded for objective reasons: the basal ganglia network (BGN) and auditory network (AN) contained only single components; the attention network (ATN) showed a substantially higher mean artifact component ratio (0.23 vs. 0.08 in selected networks); and the frontal network (FN) demonstrated lower mean spatial correlation across sites (0.52 compared to 0.71 in selected networks), indicating reduced cross-site consistency. These clarifications are explicitly stated in the Section 2 prior to harmonization.
In the classification results, individual-site and group-level site analysis shows outstanding results for ASD and HC classification, with individual-site analysis achieving up to 80.40% pointing out spatial features. Multi-site analysis showed accuracy of up to 83.23%.
3.1.1. Individual-Site Classification Results
To determine the versatility of the proposed networks analysis approach, we present the results of the top three brain networks associated with ASD and HC classification. The highest accuracies for DMN, SMN, and VSN are 80.40%, 79.72%, and 80.40%, with specificities of 87.87%, 88.54%, and 90.32% and sensitivities of 65.30%, 63.46%, and 63.63%. The AUCs were 79.54%, 82.97%, and 83.17%. See Table 2 for detailed results across sites and networks.
Table 2 presents the classification results of three main networks (DMN, SMN, and VSM) across multiple sites’ components, calculating accuracy (ACC), area under the curve (AUC), specificity (SP), and sensitivity (SN). The VSM frequently achieves higher accuracy and specificity, representing a stronger performance. The DMN and SMN show a little variability across sites, with some occurrences of higher sensitivity. This shows site-dependent differences in network effectiveness for precise classification in this context.
For the DMN, the individual sites that show high accuracies are as follows: BNI with component 17 (70.91%), GU with component 26 (67.47%), KKI with component 26 (80.41%), LEU with component 07 (68.97%), NYU with component 07 (63.95%), NYU1 with component 26 (75.68%), OHSU with component 17 (71.59%), SDSU with component 17 (71.15%), UCL with component 07 (67.86%), UOM with component 07 (66.35%), and USM with component 17 (71.79%). A visual representation of the DMN component results is shown in Figure 2 and Figure 3.
The accuracy results for the SMN are as follows: BNI with component 06 (67.27%), GU with component 08 (68.67%), KKI with component 06 (79.73%), LEU with component 01 (68.97%), NYU with component 21 (66.28%), NYU1 with component 08 (70.27%), OHSU with component 06 (69.32%), SDSU with component 21 (69.23%), UCL with component 06 (66.67%), UOM with component 05 (71.15%), and USM with component 08 (73.08%). A visual representation of SMN component results is shown in Figure 4 and Figure 5.
For the VSN, the components showing high accuracy for all the sites are as follows: BNI with component 24 (72.73%), GU with component 25 (75.90%), KKI with component 11 (80.41%), LEU with component 13 (65.51%), NYU with component 13 (66.86%), NYU1 with component 25 (74.32%), OHSU with component 24 (67.04%), SDSU with component 24 (73.07%), UCL with component 25 (73.80%), UOM with component 13 (68.26%), and USM with component 24 (70.51%). A visual representation of SMN component results is shown in Figure 6 and Figure 7.
Spatial features were used for individual sites (see Supplementary Tables S3.1–S3.3 in the Supplementary Materials and Supplementary Figures S1–S10) for full results of the individual components [58].
3.1.2. Multi-Site Combined Results
This investigation applied the current proposed network analysis approach to the multi-site ASD data from 11 sites of ABIDE-I and II for classification. Spatial features were used as an input. Due to known inter-site heterogeneity factors like age and gender, the ComBat harmonization technique was applied as described above. Table 3 presents all three network results of accuracy (ACC), sensitivity (SN), specificity (SP), and area under the curve (AUC).
Higher sensitivity was observed in the combined multi-site analyses compared with individual-site results, reflecting the effects of increased sample size, harmonization, and improved classifier stability, rather than simply aggregating site-level performance metrics. The performance measures reported in Table 3 differ from those in Table 2 due to differences in the feature representation and preprocessing strategy. In the combined multi-site analysis, features were harmonized using ComBat and represented as mean voxel intensities within component masks, whereas individual-site analyses (Table 2) relied on voxel-level spatial features without harmonization. Harmonized features are less sensitive to scanner- and site-specific variability and reduce the risk of overfitting when pooling heterogeneous multi-site data. In contrast, voxel-level features preserve fine-grained spatial information, which is advantageous for single-site analyses with uniform acquisition parameters but may limit generalizability across sites.
Additionally, pooling data across sites substantially increases the effective sample size and the diversity of training examples, enabling the classifier to learn more stable, generalized decision boundaries. The combination of harmonization increased subject diversity and reduced technical variance, which explains why the sensitivity in the combined analysis exceeded that observed in individual-site analyses, rather than reflecting an inconsistent model performance.
Table 3 shows that for DMN, SMN and VSN, the model achieved 81.43%, 80.52%, and 83.23% highest accuracies, respectively. The specificities were as follows: 82.14%, 83.43%, and 84.74%, with sensitivities of 80.50%, 77.28%, and 81.42%. The AUC of all the networks was 84.53%, 84.96%, and 87.90%. The current approach addresses inter-site heterogeneity using harmonization and minimizes feature variations between sites, whereas other approaches neglect heterogeneity when merging multi-site data. These outcomes support the importance of standardizing rs-fMRI data in multi-site research.
Figure 8 shows all these results. Table 3 and Figure 8 show that the proposed network analysis approach delivers the most favorable results across all 11 sites (see Supplementary Figures S11–S13 for full results) [58]. For combined sites, the VSN achieved the highest performance (ACC: 83.23%, AUC: 87.90%, SP: 84.74%, SN: 81.42%).
This proposed model’s performance is compared with prior ASD classification studies to investigate these results. The obtained classification metrics demonstrate the effectiveness of the current model across selected network components. A detailed comparative analysis with existing studies is discussed in Table 4.
Table 4 presents a qualitative comparison based on previous studies for context. However, because of significant methodological variability, direct performance comparisons need to be interpreted with care, as different ABIDE datasets may include multiple sites and subjects, distinct preprocessing pipelines, multiple validation approaches (some using single-site validation), and different feature extraction methods. As a result, the current findings present well-established methods that provide significant advantages in multi-site consistency. This research not only provides classification accuracy but also supports a multi-site harmonization approach, network-specific interpretability, and a reproducible analysis framework.
4. Discussion
Past studies have investigated different aspects of brain connectivity in ASD. Unlike those that usually relied on small datasets, our research used a large dataset of 996 individuals from 11 independent sites and achieved higher classification accuracy using SVM. The novelty of this research lies not only in its use of multi-site datasets but in its creation of a harmonization-validated, network-interpretable, computationally explainable framework. Furthermore, instead of only addressing the performance metrics, this study conducts a difficult heterogeneity and constancy analysis, recognizing the visual networks as a reproducible biological signature.
Current findings align with and expand upon previous studies that highlight the DMN, SMN, and VSN as critical networks in understanding the neural underpinnings of ASD. There is strong theoretical and empirical support for the selection of these networks and components. The DMN is fundamentally involved in self-referential thinking, social cognition, and introspection—processes that are frequently impaired in individuals with ASD [81,82]. The altered connectivity within the DMN may contribute to the social alterations observed in ASD, as individuals often struggle with understanding social cues, theory of mind, and engaging in typical social interactions [83,84]. Also, these alterations in the DMN are thought to reflect difficulties in shifting attention between internal and external stimuli, which may underlie the social and communication challenges characteristic of ASD.
The SMN, on the other hand, plays a pivotal role in motor regulation and sensory processing, both of which are often compromised in people with ASD. The motor and sensory processing differences common in people with ASD suggest a direct link between the SMN function [82,85]. For instance, motor challenges may affect nonverbal communication, such as gestures and facial expressions, while sensory sensitivities may lead to avoidance or overreaction to environmental stimuli.
In this study, the VSN emerged as the most important feature for classification accuracy when it came to distinguishing individuals with ASD from controls. This finding highlights the VSN’s critical role in the pathophysiology of ASD, particularly in relation to sensory processing. Previous research has established that individuals with ASD often exhibit atypical sensory processing, which aligns with the significant contributions of the VSN to sensory integration and altered brain connectivity [86,87]. The VSN’s involvement in integrating visual inputs and connecting sensory information is essential for understanding how individuals with ASD perceive and interact with their environment. For example, altered VSN connectivity may contribute to the sensory overload commonly addressed in ASD, where individuals experience heightened sensitivity to visual stimuli, leading to difficulties filtering out irrelevant information and focusing on socially relevant cues.
The importance of the DMN, SMN, and VSN suggests that network-level interconnections are key ASD symptoms. For instance, the DMN’s role in self-referential thought processes can influence how sensory information is interpreted and integrated by the SMN and VSN [82,88]. The VSN’s role in processing visual information is particularly relevant, as it has been implicated in the sensory overload that many individuals with ASD experience. This sensory overload can exacerbate social communication difficulties, as individuals may become preoccupied with sensory inputs rather than engaging with social cues. Outside of these networks, components often showed lower stability or were affected by noise and physiological artifacts, which would reduce interpretability and compromise classifier robustness.
Furthermore, to address the heterogeneity and variability in neuroimaging data, this study demonstrated that by combining spatial constraint ICA with ComBat harmonization provides a robust and reproducible technique for handling multi-site resting state data by using spatial priors. Spatial constraint ICA ensured component-level correspondence across subjects and sites by standardizing the extracted networks before harmonization. The authors of [43] discuss how spatial constraints can enhance the identification of commonly occurring networks, such as the DMN, across different subjects and tasks. The ComBat model further reduced the scanner variability and stabilized the feature distribution across all 11 sites, thereby enhancing the generalizability and reliability and confirming the effectiveness of this pipeline. The combination of these techniques enabled us to achieve high classification accuracy by justifying inter-site differences and providing a positive answer to the first research question.
All the results presented in this paper show that specific functional brain networks provided meaningful discriminative information for classifying ASD from healthy controls. These findings suggests that interventions targeting the DMN, SMN, and VSN may hold promise for supporting individuals with core features of ASD. For example, neurofeedback or brain-stimulation techniques aimed at modulating connectivity within these networks could potentially improve social cognition, sensory processing, and motor coordination in individuals with ASD. Additionally, behavioral interventions that focus on sensory integration and social skills training may benefit from incorporating strategies that address the specific neural mechanisms underlying these challenges. All these findings show that network-level features extracted from spatial constraint ICA not only capture meaningful neurobiological differences but also preserve discriminative control across heterogeneous imaging sites, addressing our second research question.
Dimensionality reduction of features using LDA/PCA with the SVM classifier resulted in a robust performance across networks and sites. The multi-site SVM classifier was trained on harmonized features, highlighting the importance of combining large-scale multi-site data. In particular, the VSN yielded an accuracy of 83.23%, emphasizing the critical role of harmonizing data from diverse sources. A network-specific comparison also revealed statistically meaningful differences in performance. The proposed approach performed significantly better in ASD classification than other existing methods and the findings directly support third research question by showing the harmonized network features and effective machine learning-based classification in a multi-site context. However, there are specific constraints to consider.
This study emphasizes that the results are derived from ABIDE data with standardized diagnostic procedures and quality control, and that external validation on independent cohorts, including different institutions, scanner manufacturers, demographic compositions, and geographic regions, is required before any translational claims can be considered. This research also explicitly acknowledges persistent challenges in rs-fMRI-based ASD classification, including inter-site variability, limited reproducibility, and developmental effects. Consistent with prior work [89], it is stated that substantial methodological and validation hurdles must be tackled before neuroimaging-based classification can demonstrate added value beyond established behavioral assessments. Accordingly, the revised manuscript positions the present work as a methodological framework for identifying network-level patterns relevant to ASD neurobiology rather than as an immediate clinical diagnostic tool. Though we used data from 11 different institutions, the absence of individual data gathering is a significant limitation. The results of the present study are dependent solely on the ABIDE I and II datasets, which could be insufficient. High-dimensional features may still result in bias even when SVM reduces overfitting through normalization. However, this research employed cross-validation to avoid leakage and PCA and LDA to reduce dimensionality.
Future research should integrate additional ASD clinical datasets from various sources, preferably across different age groups, cultural contexts, and MRI acquisition protocols, to strengthen the assessment of the classification method and generalizability beyond ABIDE and help us to evaluate the model’s robustness in a range of context rather than performing evaluations based solely on past research datasets. While this study identified which large-scale brain networks and associated spatial regions contribute to group classification, it did not examine the relationship between these network-level features and core clinical dimensions of Autism Spectrum Disorder (ASD), such as social communication deficits and restricted, repetitive behaviors, as measured by standardized instruments like the ADOS and ADI-R. Furthermore, the potential of these network markers to track longitudinal symptom trajectories remains unexplored. Addressing these gaps in future research is essential to bridge the divide between neuroimaging-based classification and clinically meaningful outcomes, ultimately advancing the translational utility of functional connectivity biomarkers in ASD. The observed differences in classification performance across functional networks were relatively modest and were reported descriptively. No formal statistical comparison, such as parametric or non-parametric hypothesis testing or confidence interval estimation, was performed to assess the significance of these differences. Future studies should incorporate rigorous statistical frameworks to quantitatively evaluate and compare the discriminative power of individual networks, thereby strengthening the interpretability and reliability of network-based biomarkers in ASD classification. Furthermore, existing research excluded two additional MRI modalities (structural MRI and diffusion tensor imaging). The use of various modalities could enhance classification by offering more information. Also, investigating the predictive ability of the current machine learning framework could uncover the important functional correlations connected to ASD processes. Although these drawbacks were not taken into account in this investigation, we want to consider them in future ASD identification research. In future work, we will collaborate with neurologists and radiologists to qualitatively evaluate components’ relevance and assess the potential of the proposed framework.
5. Conclusions
The proposed brain network analysis approach produced promising results across all 11 single and combined sites. These findings point out the critical roles of the DMN, SMN, and VSN in the functional connectivity architecture of ASD, with the VSN evolving as the most important network for classification. The use of spatially constrained ICA and ComBat shows the importance of strong signal processing and data harmonization approaches in neuroimaging research, particularly when it comes to addressing the site-specific heterogeneity and data variability that are common in multi-site rs-fMRI studies. These findings not only develop our understanding of the neural foundations of ASD but also provide a basis for targeting the brain networks involved in ASD disorder. Future research should continue to explore the dynamic interactions between these networks and their potential as biomarkers for identification and behavior response in ASD.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Elsabbagh M. Divan G. Koh Y.J. Kim Y.S. Kauchali S. Marcin C. Montiel-Nava C. Patel V. Paula C.S. Wang C. Global prevalence of autism and other pervasive developmental disorders Autism Res.2012516017910.1002/aur.23922495912 PMC 3763210 · doi ↗ · pubmed ↗
- 2Megari K. Frantzezou C.K. Polyzopoulou Z.A. Tzouni S.K. Neurocognitive features in childhood & adulthood in autism spectrum disorder: A neurodiversity approach Int. J. Dev. Neurosci.20248447149910.1002/jdn.1035638953464 · doi ↗ · pubmed ↗
- 3World Health Organization Autism World Health Organization Geneva, Switzerland 20231
- 4Le Couteur A. Rutter M. Lord C. Rios P. Robertson S. Holdgrafer M. Mc Lennan J. Autism diagnostic interview: A standardized investigator-based instrument J. Autism Dev. Disord.19891936338710.1007/BF 022129362793783 · doi ↗ · pubmed ↗
- 5Lord C. Rutter M. Le Couteur A. Autism Diagnostic Interview-Revised: A revised version of a diagnostic interview for caregivers of individuals with possible pervasive developmental disorders J. Autism Dev. Disord.19942465968510.1007/BF 021721457814313 · doi ↗ · pubmed ↗
- 6Alharthi A.G. Alzahrani S.M. Multi-Slice Generation s MRI and f MRI for Autism Spectrum Disorder Diagnosis Using 3D-CNN and Vision Transformers Brain Sci.202313157810.3390/brainsci 1311157838002538 PMC 10670036 · doi ↗ · pubmed ↗
- 7Yang X. Zhang N. Schrader P. A study of brain networks for autism spectrum disorder classification using resting-state functional connectivity Mach. Learn. Appl.2022810029010.1016/j.mlwa.2022.100290 · doi ↗
- 8Qian S. Yang Q. Cai C. Dong J. Cai S. Spatial-Temporal Characteristics of Brain Activity in Autism Spectrum Disorder Based on Hidden Markov Model and Dynamic Graph Theory: A Resting-State f MRI Study Brain Sci.20241450710.3390/brainsci 1405050738790485 PMC 11118919 · doi ↗ · pubmed ↗
