Cognitive Profile of Autism and Intellectual Disorder in Wechsler’s Scales: Meta-Analysis
Gustavo Mortari Ferreira, Calliandra Maria de Souza Silva, Alexandre Sampaio Rodrigues Pereira, Larissa Sousa Silva Bonasser, Maria Gabriela do Nascimento Araújo, Marcelly de Oliveira Barros, Roniel Sousa Damasceno, Fauston Negreiros, Izabel Cristina Rodrigues da Silva

TL;DR
This study compares cognitive profiles of autism, intellectual disability, and their co-occurrence using Wechsler scales, finding distinct patterns for each group.
Contribution
The study provides a meta-analysis of cognitive profiles in ASD, ID, and ASD+ID using Wechsler scales, revealing distinct impairment patterns.
Findings
ASD shows greater impairments in Processing Speed and Working Memory indices but near-normative scores in others.
ID and ASD+ID exhibit generalized cognitive deficits with lowest scores in Full-Scale IQ.
Wechsler scales are consistent across versions for identifying cognitive profiles in these groups.
Abstract
Autism spectrum disorder (ASD) and intellectual disability (ID) frequently coexist and share heterogeneous cognitive manifestations, yet their specific performance patterns on Wechsler scales remain poorly systematized. This meta-analysis synthesized data from 31 studies using the WISC-IV, WISC-V, WAIS-III, and WAIS-IV to compare cognitive index profiles in individuals with ASD, ID and ASD+ID. Standardized mean differences (Hedges’ g) were calculated using random-effects models, adopting a normative reference of mean 100 and SD 15. Results showed a distinct profile for ASD, with greater impairments in the Processing Speed Index (PSI) and Working Memory Index (WMI), while the Vocabulary Comprehension Index (VCI), Perceptual/Fluid Reasoning Index (PRI/FRI), and Visual Processing Index (VPI) remained close to normative scores. In contrast, ID and ASD+ID exhibited generalized deficits…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4- —CAPES
- —FAPDF
- —National Council for Scientific and Technological Development
- —Ministry of Health/Department of Science and Technology
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAutism Spectrum Disorder Research · Down syndrome and intellectual disability research · Genetics and Neurodevelopmental Disorders
1. Introduction
Autism spectrum disorder (ASD) is a neurodevelopmental condition characterized by persistent deficits in cognitive and social functioning, accompanied by repetitive and restricted patterns of interests and behaviors. Intellectual disability (ID), in turn, is defined by significant limitations in intellectual functioning (IQ), as well as impairments in adaptive behavior encompassing the pragmatic, social, and conceptual domains (Crippa et al., 2023). Notably, approximately 26 to 30% of individuals on the autism spectrum present comorbidity with ID or exhibit IQ levels comparable to this group (Bougeard et al., 2021).
Recent studies have demonstrated the existence of multiple ASD subtypes, encompassing distinct groups that may or may not exhibit intellectual impairment and other comorbidities, in addition to presenting genotypic and phenotypic variability (Litman et al., 2025). Moreover, both disorders display heterogeneous clinical profiles and considerable variability in symptomatology, developmental trajectories, and cognitive performance, extending beyond their core diagnostic criteria (Masi et al., 2017).
Given the complexity of these presentations, neuropsychological assessment remains the most appropriate method for delineating the individual’s cognitive strengths and weaknesses. Specifically, in cases involving comorbidities between ASD and ID, the evaluation of intellectual functioning through standardized intelligence testing is particularly relevant (Braconnier & Siper, 2021; Yu et al., 2024).
1.1. The Role of Wechsler Scales in ASD and ID Psychometric Evaluation
Wechsler Scales (WS) are the most extensively utilized instruments for assessing intelligence worldwide. The theoretical model proposed by Wechsler conceptualizes intelligence as a composite of cognitive processes that enable individuals to interact effectively with their environment, acquire knowledge, and engage in purposeful, goal-directed behavior. Based on these premises, WS are structured into specific indices that encompass various cognitive domains, which, when integrated, yield a comprehensive profile of an individual’s intellectual functioning. The index currently employed globally includes Verbal Comprehension (VCI), Perceptual Reasoning (PRI), which in the WISC-V edition was replaced by Fluid Reasoning (FRI) and Visual Processing (VPI), Working Memory (WMI), and Processing Speed (PSI). In addition, the scales provide estimates of general intellectual ability, such as the Full-Scale Intelligence Quotient (FSIQ), Verbal IQ, and Performance IQ (Barnard, 2023; Dale et al., 2022; Wechsler & do Nascimento, 2004).
The Wechsler Scales are further subdivided according to age group, comprising the Wechsler Intelligence Scale for Children (WISC) for the pediatric population and the Wechsler Adult Intelligence Scale (WAIS) for adults. Each edition of these instruments includes distinct subtests and tasks, along with specific standardizations that reflect linguistic, cultural, and national adaptations. Although strong or moderate correlations are generally observed among different versions of the scales, variability in performance outcomes has been documented in populations with autism spectrum disorder (ASD) or intellectual disability (ID). To date, there remains no definitive consensus regarding whether these instruments tend to underestimate, overestimate, or accurately capture the intellectual abilities of these groups (A. M. Nader et al., 2015; A.-M. Nader et al., 2016).
1.2. ASD and ID Cognitive Profile
A lack of consensus remains regarding the specific cognitive and diagnostic profiles derived from the Wechsler Scales (WS). Nonetheless, these instruments provide valuable information commonly associated with relevant clinical outcomes. Consequently, contemporary research employing WS aims to establish associations between their results and specific cognitive profiles (such as intellectual disability—ID), clinical impairments (such as deficits in adaptive behavior), and long-term functional outcomes (Sánchez-Izquierdo et al., 2023; Zemach et al., 2023). Within the spectrum of neurodevelopmental conditions, Full-Scale Intelligence Quotient (FSIQ) has been shown to correlate more strongly with reduced adaptive behavior and diminished performance in activities of daily living than in typically developing (TD) individuals (McQuaid et al., 2021). More specifically, individuals with autism spectrum disorder (ASD) who exhibit below-average IQ scores tend to demonstrate poorer adaptive behavior outcomes, as well as a marked discrepancy between intellectual performance and the ability to perform daily living activities (Tamm et al., 2022; Tassé & Kim, 2023).
From a complementary perspective, ID—characterized by reduced FSIQ and impaired adaptive functioning as defining diagnostic markers—consistently yields significantly lower scores across these metrics, along with notable deficits in executive functions (Erostarbe-Pérez et al., 2022; Spaniol & Danielsson, 2022). Furthermore, evidence indicates that individuals with ID exhibit considerable variability in FSIQ and index scores across subgroups, including those with mild or moderate ID. As previously mentioned, this disorder is notably heterogeneous, with studies often stratifying participants by FSIQ performance (Santegoeds et al., 2022).
Regarding the specific WS indices, ASD has been associated with greater impairments in Working Memory (WMI) and Processing Speed (PSI), typically resulting in approximately one standard deviation below the mean. Comparative studies between ASD and Attention-Deficit/Hyperactivity Disorder (ADHD) have demonstrated a more consistent intellectual profile in ASD, whereas ADHD does not exhibit a distinct or stable cognitive pattern (Kim & Song, 2020; Wilson, 2024). Additional research has identified subgroups within the ASD population that display comparatively higher Verbal Comprehension Index (VCI) scores and enhanced verbal intelligence (Calero et al., 2015; Chua & Xie, 2022; Litman et al., 2025).
On the other hand, individuals with ASD and intellectual impairment (ASD+ID) and those with ID appear to show a generalized pattern of deficits across cognitive indices, with no consensus regarding a comparative profile either among the indices themselves or between these groups (Mungkhetklang et al., 2016).
1.3. Meta-Analytic Framework for Cognitive Profiles in ASD, ID, and ASD+ID
Even with relevant data on both conditions, there is a lack of meta-analyses focused on the specific profile of non-syndromic intellectual disability (ID), particularly considering its Index variations rather than just Full-Scale IQ (FSIQ). Additionally, few studies attempt to compare autism spectrum disorder (ASD) and ID simultaneously, or to cross-analyze Wechsler Scale (WS) tests to identify this profile (Barnard, 2023; Dale et al., 2021).
Therefore, the present study proposes a meta-analysis that simultaneously examines ASD, ID, and ASD+ID across different versions of the main Wechsler scales (WAIS-III, WAIS-IV, WISC-IC, and WISC-V) to better understand the current literature on intelligence assessment in neurodevelopmental disorders. Specifically, this meta-analysis aims to:
- Examine cognitive differences among individuals with ASD, ID, and ASD+ID, considering performance across indices and test types.
- Compare the results obtained for these diagnoses against expected normative performance on the tests.
- Compare the performance across different indices within each disorder.
- Investigate moderators that may influence changes in cognitive profiles across distinct conditions.
- Assess heterogeneity to better delineate the expressive variations found in ASD and ID.
2. Materials and Methods
2.1. Study Selection and Search Strategy
PsycNet, PubMed, Embase, and ScienceDirect databases were used for data collection. The strategy uses Boolean-based key terms, including: “ASD,” “IDD,” “ID,” “pervasive development,” “intellectual disability,” “asperger,” “autism spectrum disorder,” “WISC,” “WAIS,” “Wechsler Intellectual Profile,” adapting them to the specifications of each database. For a comprehensive overview of the search strategies and specific formulas, refer to the Supplementary Material, Figure S1. This research is registered with PROSPERO (2025 CRD420251038045) and used a 10-year time frame for the searches.
The “Patient Population, Interventions, Comparisons, and Outcomes” (PICO) model was selected to guide the selection and differentiation of data as follows (Eckstein et al., 2001):
- Participants (P): The study included individuals diagnosed with autism spectrum disorder (ASD) or Intellectual and Developmental Disabilities (IDD), including variations such as intellectual disability (ID) or Asperger’s Syndrome, as defined by the DSM-IV, DSM-V, or DSM-V-TR criteria. Articles involving ASD with FSIQ < 70 or those focusing on individuals with ASD and intellectual impairment were categorized as ASD+ID.
- Interventions (I): The focused intervention was the application of Wechsler’s scales, specifically WISC-IV, WISC-V, WAIS-III, or WAIS-IV. The studies were required to report FSIQ and cognitive index data explicitly.
- Comparison (C): Eligible studies included those that compared results using the instrument’s own normative data, a control group, or another mental health condition.
- Outcome (O): The primary outcomes were the cognitive indices derived from the tests. Inclusion in this phase required the presentation of all index scores from the respective test (FSIQ, VCI, PRI, WMI, and PSI, or FSIQ, VCI, FRI, VPI, and WMI in WISC-V).
Articles were excluded if they did not employ any of the selected tests, failed to report cognitive index data, or did not investigate ASD or ID. Additionally, only original empirical studies were accepted, and publications such as book chapters, conference presentations, brief reports, or review articles were not included. Studies for which full-text access or exact variable values were unavailable were also excluded from the analysis.
2.2. Data Extraction, Synthesis and Bias Assessment
The article screening and data extraction were conducted using Rayyan, a platform developed by the Qatar Computing Research Institute (QCRI) for systematic review management. Data collection was conducted on 23 May 2025, following a double-blind screening model involving five independent reviewers (GM, FN, MA, MO, and RS). The selection process was organized in two phases. In the first phase, MGA, MO, and RS independently screened one-third of the articles, focusing solely on the title, identification, and abstract to assess eligibility according to the PICO strategy. Meanwhile, GM reviewed all selected articles, serving as the double-blind reviewer responsible for cross-validation of initial decisions. In the second phase, full texts meeting the inclusion criteria were then reviewed in detail by GM, MA, MO, and RS according to predefined criteria. At each selection stage, at least two reviewers independently assessed the articles, and in cases of disagreement, FN provided additional analysis.
The following variables were extracted and organized in Microsoft Excel from the selected article: Author, Year, Country, Diagnosis, Intelligence Test Used, Number of Participants, Mean Age, Biological Sex (% Male), Diagnostic Criteria, Research Groups, Type of Study, and Index Scores (FSIQ, VCI, PRI, WMI, PSI, VPI, FRI). All included studies were then analyzed using the Strengthening the Reporting of Observational Studies in Epidemiology (STROBE) checklist, which comprised 22 items with multiple sub-items (Cuschieri, 2019). We considered studies to be at low risk of bias when at least 75% of the criteria are met. Results of the STROBE assessment are provided in Supplementary Material S1.
2.3. Data Synthesis
From each article, the following data were extracted: author, year, country, diagnosis, intelligence test used, number of participants, mean age, biological sex (% male), population diagnostic criteria, research groups, type of study, and cognitive indices (FSIQ, VCI, PRI, WMI, PSI, VPI, FRI). The primary variables included clinical diagnosis (ASD, ASD+ID, ID), assessment instrument (WISC-IV, WISC-V, WAIS-III, WAIS-IV), cognitive outcomes (FSIQ, VCI, PRI, WMI, PSI, VPI, FRI), and descriptive statistics (mean, standard deviation, and sample size).
2.4. Data Analysis
This meta-analysis was conducted using R (version 4.04) and the packages metafor, meta, tidyverse, openxlsx, and haven for data analysis and visualization. A continuous data model with random effects was applied to account for variations in test adaptations, populations, and standardizations. Effect sizes were converted into standardized mean differences (Hedges’ g), using a standardized control with a mean of 100 and a standard deviation of 15 to represent expected normative performance, as theorized by Cohen (1988). Effect magnitude was interpreted according to Cohen (1992), with g = 0.20 considered small, g = 0.50 medium, and g = 0.80 large.
Descriptive statistics (mean, standard deviation, and sample size) were collected and used to calculate standard error (SE = sd/√n) and variance (Var = SE^2^) for the sample groups in each study. Data with missing or zero variance were excluded. To assess heterogeneity, I^2^ statistical analyses were used and interpreted based on Cochrane guidelines: 0–40% may be unimportant, 30–60% may represent moderate heterogeneity, 50–90% substantial, and 75–100% considerable (J. Higgins et al., 2002; J. P. T. Higgins & Li, 2022). Heterogeneity was evaluated both for combined analyses across different tests, indices, and diagnoses, and more specifically, for diagnostic comparisons by test type and cognitive outcome (J. P. T. Higgins et al., 2020). Tau^2^ was calculated using the Restricted Maximum Likelihood (REML) method, with an unstructured covariance matrix and standard error adjustment via the Hartung-Knapp method (Röver et al., 2015). Finally, meta-regressions were conducted using categorical moderators (test type, diagnosis, and index by diagnosis), and subgroup analyses were performed by test type, diagnosis, and cognitive index. Finally, Sensitivity and Certainty analyses were performed and are available in Supplementary Material S4.
3. Results
Initially, 1090 files were collected from the four databases (899 after duplicates were removed). Selection based on titles and abstracts yielded 140 articles; after full-text screening, 31 eligible studies were retained. Notably, several tests included more than one distinct sample group (e.g., cases stratified by FSIQ ranges or other distinguishing characteristics). Figure 1 presents the PRISMA flowchart detailing the screening process.
3.1. Population Characteristics
Table 1 presents the sociodemographic data for the selected articles, including country, diagnosis, intelligence test used, number of participants, average age, gender distribution, research group, study type, and exclusion criteria.
Most of the study samples consisted predominantly of males: 16 studies with ≥80% male participants and 15 with 50–80%. The cognitive assessments performed were the WISC-IV (n = 24), followed by the WAIS-IV (n = 7), WAIS-III (n = 4), and, lastly, the WISC-V (n = 3). Remarkably, a significantly higher number of research samples included ASD (k = 28) than ID (k = 08) or ASD+ID (k = 3); the repercussions of this will be addressed in the Section 4.
Most studies originated from Europe (Italy, France, Spain, the UK, Sweden, and Germany), North America (the USA and Canada), and Asia (Japan, South Korea, Hong Kong, and China), with additional research conducted in Brazil, Israel, and Iran. Most research followed observational or cross-sectional designs and commonly applied test-based inclusion criteria. Among these, the autismautism Diagnostic Observation Schedule (ADOS) was most frequently used for autism spectrum disorder (ASD) diagnosis, followed by the autismautism Diagnostic Interview-Revised (ADI-R) and Social Responsiveness Scale (SRS); for intellectual disability (ID), the Wechsler scales were employed, using FSIQ < 70 as the threshold.
For database construction, studies involving High-Functioning autismautism spectrum disorder (HFASD), Asperger’s Syndrome (AS), and ASD were grouped under the ASD category, considering that it includes autism spectrum conditions without intellectual impairment. Conversely, studies on ID and intellectual disabilityintellectual development disorder (IDD) were categorized together, excluding those focused predominantly or specifically on syndromic samples.
3.2. Descriptive Data
Table 2 presents the basic descriptive results of the research, including the mean (with its standard deviation), the number of sample groups (k), the standardized mean difference, and the number of participants per category.
The results in Table 2 revealed distinct cognitive profiles between ASD and ID individuals. In the ASD group (aggregated sample: 553–2024 participants), the mean FSIQ was 95.00 (SD = 9.94; k = 28). Specific indices showed varied patterns: PRI had the highest mean (100.2; SD = 9.94), followed by VCI (98.66; SD = 12.27). In contrast, PSI (88.14; SD = 8.57) and WMI (92.29; SD = 8.76) yielded the lowest scores. Additionally, VPI (101.4; SD = 4.02) and FRI (100.7; SD = 4.29) were close to the normative value of 100, although based on fewer studies (k = 3).
For the ID group (n = 335; k = 7), the results were lower than those for ASD, consistent with the diagnostic classification. The FSIQ averaged 55.35 (SD = 1.93), which is the lowest score. WMI (59.81; SD = 3.39) and VCI (61.18; SD = 0.64) showed the lowest performance, while PRI (66.98; SD = 5.33) and PSI (64.71; SD = 1.85) were comparatively higher.
In the ASD+ID subgroup (n = 102; k = 3), FSIQ (53.35; SD = 1.93) and WMI (59.81; SD = 3.39) were below 60. VCI (61.18; SD = 0.64), PSI (64.71; SD = 1.85), and PRI (66.98; SD = 5.33) exceeded 60 but remained within the intellectual impairment range (<70).
3.3. Comparisons Between Tests and Diagnosis
The standardized mean difference test, Hedge’s g, was performed, considering a standardized control sample with a mean of 100 and a standard deviation of 15 for the test, as theorized by Cohen (1988). Figure 2 evaluated performance based on diagnosis and cognitive index using the standardized mean deviation strategy (Hedges’ g), accounting for random effects and providing 95% CIs.
In the ASD group, effect sizes were predominantly small to moderate in magnitude. The largest was observed in PSI-ASD (g = −0.75), classified as large, followed by a moderate effect in WMI-ASD (g = −0.48). Other indices showed small or negligible effects: FSIQ-ASD (g = −0.34), VCI-ASD (g = −0.07), FRI-ASD (g = 0.03), PRI-ASD (g = 0.01), and VPI-ASD (g = 0.07).
In contrast, all ID and ASD+ID values had large and negative effects. The smallest reduction was in PRI-ASD+ID (g = −2.03), while the most pronounced effects were in FSIQ-ASD+ID (g = −3.34) and FSIQ-ID (g = −3.87), indicating the most significant divergence from the control groups. ID indices were consistently high: PRI-ID (g = −2.86), PSI-ID (g = −2.98), VCI-ID (g = −2.78), and WMI-ID (g = −2.92), all within overlapping confidence intervals, reinforcing a homogeneous deficit profile.
The ASD+ID group mirrored the ID profile but showed wider confidence intervals. The ascending order of effects was: PRI (g = −2.03), PSI (g = −2.62), VCI (g = −2.86), and WMI (g = −2.84). Additionally, standardized mean differences (Hedges’ g) were calculated across subgroups by test type, cognitive index, and diagnosis (Figure 3).
Stratified results by test type revealed small to moderate effects in the ASD group. In the WAIS-III, effects were predominantly small—FSIQ (g = 0.14), PRI (g = 0.14), WMI (g = −0.13), VCI (g = 0.57)—with moderate reduction in PSI (g = −0.62). In the WAIS-IV, effects ranged from small, PRI (g = −0.09), VCI (g = −0.15), WMI (g = 0.00), to moderate in PSI (g = −0.62). The WISC-IV showed a similar pattern, with small effects in FRI (g = 0.05), VCI (g = 0.07), VPI (g = 0.09), and PRI (g = 0.06), but more pronounced reductions in PSI (g = −0.72) and WMI (g = −0.45).
In the ASD+ID group, all effects were large and negative. WAIS-IV scores showed negative substantial magnitudes: FSIQ (g = −3.34), PRI (g = −1.93), PSI (g = −2.93), VCI (g = −2.96), and WMI (g = −2.89). WISC-IV results followed a similar profile, with even stronger effects: FSIQ (g = −3.53), VCI (g = −2.88), WMI (g = −2.38), PRI (g = −2.38), and PSI (g = −2.84).
For the ID group, consistently large effects were observed across all test versions. In the WAIS-III, PSI (g = −2.13) and WMI (g = −2.12) stood out. WAIS-IV showed even greater deficits: FSIQ (g = −4.06), PRI (g = −3.43), PSI (g = −3.37), VCI (g = −3.17), and WMI (g = −2.27). WISC-IV results remained elevated: FSIQ (g = −4.00), PRI (g = −2.71), VCI (g = −2.88), and WMI (g = −3.27).
3.4. Meta-Regressions
Two meta-regressions were performed using a categorical model to examine the effects of diagnosis or test type on cognitive outcomes. Table 3 presents the meta-regression results, which used diagnosis type as a moderator.
ID (β = 67.50; SE = 2.29; Z = 29.40; p < 0.001; 95% CI = 63.00–72.00) was selected as the reference intercept, as intellectual impairment is a diagnostic criterion in multiple studies. Compared to ID, the ASD group showed a significant positive difference in the coefficient (+26.01; SE = 2.55; Z = 10.20; p < 0.001). In contrast, the ASD+ID group did not differ significantly from ID (β = −4.67; SE = 2.66; Z = −1.75; p = 0.080; 95% CI = −9.89–0.55).
Additionally, a meta-regression model was applied using test type as a moderating variable (Table 4).
The WAIS-III was used as the moderator in the meta-regression model, yielding a coefficient of β = 82.88 (SE = 4.33; Z = −0.27; p = 0.70; 95% CI = 76.28–89.47). No other test showed a statistically significant difference when compared to WAIS, regardless of version or age range. WAIS-III was selected as the intercept because it demonstrated the highest mean performance among all tests. Finally, a regression analysis was conducted that combined diagnosis and cognitive index simultaneously (as shown in Table 5).
In this model, the ASD group showed a significant difference only in FSIQ, PSI, and WMI compared to FRI (moderator and index values close to the norm). However, analyzing the ASD+ID group, only PRI showed a significant difference with FSIQ (moderator and with the lowest result). Finally, in the ID group, all indexes (PSI, PRI, VCI, and WMI) showed a statistically significant difference from FSIQ (the lowest result).
3.5. Heterogeneity
An I^2^ heterogeneity test (Huggies and Thompson) was also performed to assess the degree of heterogeneity in the study sample. Figure 4 displays the I^2^ values in a heatmap from the iteration covering diagnosis and outcomes.
In the ASD group, all heterogeneity values were classified as substantial (I^2^ > 75%). In ascending order, the heterogeneity rates were: PSI (83.2%), VPI (81.6%), FRI (84.8%), PRI (85.9%), WMI (88.5%), FSIQ (90.7%), and VCI (93.3%). In the ID group, PSI (73.6%) was classified as moderate (<75%), while FSIQ (88.2%), PRI (85.8%), VCI (82.3%), and WMI (93.2%) presented high to very high values. In ascending order: PSI (73.6%), VCI (82.3%), PRI (85.8%), FSIQ (88.2%), and WMI (93.2%).
In the ASD+ID group, heterogeneity was null (I^2^ = 0) for nearly all outcomes, except for PRI (39.4%), which was classified as low or moderate depending on the analysis. This lack of heterogeneity is attributed to the limited number of available studies (reduced k). Finally, there were no results for IPV or FRI in the ASD+ID and ID groups, which were treated as non-existent data and represented in gray in the graph.
4. Discussion
The study identified distinct profiles for autism spectrum disorder (ASD) and intellectual disability (ID), with exploratory data for ASD+ID. While ASD and ID profiles showed significant differences, no differences were found between ID and ASD+ID. No significant variation emerged across versions of the Wechsler scales, and heterogeneity levels were moderate to high for ASD and ID.
4.1. Categorization and Description
A key observation was the predominance of data collection in countries with higher Human Development Index (HDI) or socioeconomic status, with limited research from Latin America, Africa, the Middle East, and Oceania. Other data also indicate that countries in the Global North have better access to evidence-based treatments and cognitive testing for neurodevelopmental conditions. Furthermore, there are signs of underrepresentation of ethnically and politically marginalized groups, underscoring a gap in the current literature. Notably, some studies failed to report racial demographics of their samples (Huang et al., 2020; Rosen et al., 2021).
Geographically, most studies were conducted in university hospitals, specialized centers, or mental health clinics. In contrast, research from lower-income countries was less frequent, possibly reflecting limited resources and reduced availability of specialized services. This discrepancy highlights a gap in the literature regarding ASD and ID data from less developed regions (Lakens, 2022). This lack of knowledge from countries in LAMIC may limit analysis due to genotypic differences and regional peculiarities such as reduced access to inclusive educational and health practices (especially in poorer regions), along with cultural and regional differences (e.g., varying levels of linguistic complexity and approaches in social comunication), or even divergent public health policy strategies (Girianelli et al., 2023; Sukiennik et al., 2022).
Another challenge is the difficulty in isolating ASD from ID due to frequent neurodevelopmental comorbidities. Few studies excluded other conditions beyond the target diagnosis, hindering the development of more accurate profiles, as many studies repeatedly analyzed mixed populations of Attention Deficit Hyperactivity Disorder (ADHD), ASD, and/or ID (Bougeard et al., 2021; Wilson, 2024).
Finally, regarding sample characterization, although a male predominance in ASD and ID groups was expected, some studies reported samples composed almost entirely of males (90–100%). This skewed gender discrepancy raises concerns about the underrepresentation and potential invisibility of pervasive disorders in females, who—despite genotypic and statistical differences—can still present the condition with distinct symptomatic and cognitive profiles, including on Wechsler scales (Calderoni, 2023; McQuaid et al., 2021). Previous research has demonstrated significant sex-based differences in intellectual functioning, with women often achieving higher scores on the VCI and PSI but lower scores on the PRI compared to men (Giofrè et al., 2024; McQuaid et al., 2021). The predominance of male samples in the present review—drawn mainly from Europe, Asia, and North America, and with a greater proportion of children—further limits the generalizability of findings (Di Salvo, 2024).
4.2. Analysis Between Diagnoses and Tests
4.2.1. Analysis Between Diagnoses
Clear cognitive profiles were identified relative to the normative sample, across all diagnoses and within each cognitive index. ASD consistently demonstrated the lowest performance in the Processing Speed Index (PSI; g = −0.76; 95% CI: −0.92 to −0.60), followed by the Working Memory Index (WMI; g = −0.48; 95% CI: −0.67 to −0.29). These findings align with prior autism literature, which frequently reports deficits in processing speed and working memory. Notably, this pattern emerged across all versions of the Wechsler scales, particularly for PSI, consistent with other studies of ASD profiles (Kim & Song, 2020; Wilson, 2024).
In this study, individuals with ASD consistently performed worse in processing speed compared to typically developing (TD) individuals. This slower processing speed appears to be a stable impairment pattern across different cognitive tasks in ASD (Möde et al., 2023; Wu et al., 2018), indicating difficulties in perceiving, processing, and responding to stimuli. However, PSI tasks also involve other cognitive processes such as attention, inhibitory control, and motor coordination (Zapparrata et al., 2023), which may contribute to the observed impairments. Several studies have found longer reaction and performance times in distinct cognitive tasks, even in contexts where the success rate and quality of the task are not impaired, which corroborates the hypothesis of stability of WMI and PSI impairments in ASD across different assessments (Rabiee et al., 2019; Wu et al., 2018; Zapata-Fonseca et al., 2018; Zapparrata et al., 2023). Some authors hypothesize that this difficulty may be associated with the weak central coherence hypothesis in ASD, leading to slower responses due to difficulties integrating information into a coherent Gestalt (Bojda et al., 2021).
From a neurobiological perspective, consistent findings of reduced corpus callosum (CC) volume and altered structural connectivity, together with increased overall brain volume, have been strongly linked to deficits in attention, executive functions, and working memory (WMI) (Bojda et al., 2021; Sacco et al., 2015; Valenti et al., 2020). These alterations may partly explain the cognitive profiles observed in individuals with ASD. Additional studies have reported differences in connectivity in regions such as the lateral occipital cortex (LOTC), the fusiform gyrus (FG), the temporal lobe, and the insula, which are associated with ASD-related symptoms and cognitive impairments (Eyler, 2018; Delbruck et al., 2019). Taken together, these findings reinforce the neurobiological basis of ASD while underscoring the heterogeneity of structural and functional brain differences that may contribute to variability in cognitive outcomes.
ID data revealed a significantly more pronounced and consistent reduction in impairment across the Wechsler scales (Figure 2). These generalized deficits are historically associated with ID, though performance differences across indices were relatively narrow—except for FSIQ, which showed significantly lower scores. Meta-regression and plot-intercept analyses confirmed this discrepancy between FSIQ and other indices. It is hypothesized that intelligence distribution tends to be more variable at the extremes compared to individual cognitive indices, as supported by prior studies (Toffalini et al., 2019; Tuon et al., 2023; Wechsler & do Nascimento, 2004). From a neurophysiological perspective, intellectual impairment is associated with disproportionate alterations in Gray Matter Volume (GMV) and White Matter Volume (WMV), thereby impacting functional connectivity across various brain regions. Specifically, the dorsal medial prefrontal cortex (dmPFC), bilateral orbital part of the inferior frontal gyrus (orb_IFG.L, orb_IFG.R), right cuneus (cuneus.R), and bilateral middle frontal gyrus (MFG.L, MFG.R) show significant effects. Negative correlations between GMV malformation and FSIQ further reinforce the structural basis of intellectual deficits in ID (Ma et al., 2021)
Regarding the co-occurrence of ASD and ID, results revealed a distinct impairment profile on Wechsler scales, more closely resembling ID than ASD. Research addressing all WS indices in ASD+ID is scarce; however, the available data indicate significant intellectual deficits in this population (Santambrogio et al., 2023). Figure 2 reinforces this pattern, with most ID indices falling within the range of their ASD+ID counterparts. Meta-regression also failed to differentiate between ASD+ID and ID profiles, strongly reinforcing their similarity. This similarity suggests that ID presence in comorbid cases exerts a more substantial influence on intelligence profile (cognitive performance), further supporting models that include ID and ASD+ID within the same response pattern on Wechsler scales. Two hypotheses emerge: first, ID to mask the cognitive presence of ASD, and, second, the ASD+ID profile is effectively similar to ID, reinforcing the theory of genotypic-phenotypic differences between ASD+ID and ASD (Genovese & Butler, 2023; Litman et al., 2025). Complementary eye-tracking data have identified distinct response profiles in ID and ASD+ID, reinforcing the thesis that these are separate profiles but with similar intellectual characteristics in standardized testing (Lian et al., 2023). These findings highlight the need for comprehensive assessments—including multiple tests and analytic strategies—to differentiate ASD, ID, and ASD+ID outcomes. Notably, when the diagnosis type was moderated (with ID as the intercept), no significant differences were found between ID and ASD+ID. This finding represents a novel contribution of the present review, as no prior studies have conducted this type of analysis using Wechsler scales.
4.2.2. Analysis Between Tests
Figure 3 presents the profile based on the subgroup combination of outcome, diagnosis, and test type. No significant differences were found between test, a finding corroborated by the forest plot, which showed that the outcome+diagnosis+test combinations fell within the confidence intervals of their counterparts across different test categories (e.g., ASD–WAIS-III–FSIQ: g = 0.14; −0.19 to 0.48; ASD–WISC-IV–FSIQ: g = −0.43; −0.97 to 0.12). This consistency suggests stability and reliability across current versions of the Wechsler scales when dealing with identical constructs, likely due to the maintenance of core subtests such as matrix reasoning, cubes, and vocabulary during updates (Wilson, 2024).
Table 3 confirmed this lack of differentiation through meta-regression using WAIS-III as the moderator. However, these results contrast with prior literature reporting differences between WISC-IV and WISC-V, and even between WAIS-IV and WISC-V—particularly in indices such as VCI (Kuehnel et al., 2019; Stephenson et al., 2021; Wilson, 2024). For example, Kuehnel et al. (2019) observed differences in VCI between WISC-IV and WISC-V, with a slight difference in the Composite score (5.08) in a longitudinal study. In contrast, A. M. Nader et al. (2015) found more pronounced differences between WISC-III and WISC-IV, particularly in the fourth version’s PRI and VCI. Nonetheless, findings remain inconsistent, with no consensus across studies (Stephenson et al., 2021; Wilson, 2024).
Some possibilities may explain the profile measured in this review. The meta-regression method may have limited sensitivity in groups with high heterogeneity, significant standard errors, or deviations, making it difficult to detect subtle differences. Moreover, comparing Figure 3 with Supplementary Figure S9, aggregated mean values indicated more pronounced differences, whereas standardized mean differences (g) showed no significant effect. These contrasting results indicate that, even when raw scores vary, the effect sizes of ASD, ID, or ASD+ID across test versions do not differ sufficiently from TD performance to overcome sample heterogeneity (Cohen, 1992, 1988).
Finally, unlike PSI, WMI scores were closer to the normative range, with WAIS-IV and WAIS-III showing no significant deviations from the mean. This disparity raises two hypotheses: first, that WAIS tests may have lower sensitivity for detecting working memory deficits; second, that such deficits in ASD are more detectable in childhood and diminish in adulthood. Supporting this, Demetriou et al. (2018) found, in their meta-analysis, developmental differences in working memory among individuals with ASD, with more potent effects in children under 12, reduced effects in adults, and no significant differences in adolescents (Ceruti et al., 2024).
4.3. Heterogeneity and Bias
Regarding heterogeneity (Figure 4), ASD presented considerable variability across all outcomes. While this heterogeneity may influence the results, it was anticipated—especially in comparison to other meta-analyses focused on cognitive processes in autism, where IQ differences between samples are not controlled (as in the present analysis, which explicitly analyzes these differences) (Ceruti et al., 2024; Demetriou et al., 2018). For ID, heterogeneity was also substantial, reflecting the broad variability in cognitive and symptomatic profiles within the intellectual disability spectrum, which includes subcategories stratified by FSIQ levels (e.g., moderate vs. mild intellectual disability). Elevated I^2^ values confirmed this heterogeneity for both ASD and ID.
Even when test type was included as a moderator, ASD and ID continued to show high heterogeneity (as detailed in the Supplementary Material). This result suggests that test type variation alone cannot fully account for the observed heterogeneity (see Supplementary Table S1). Contributing factors likely include the diversity of subtypes, variations in these neurodevelopmental conditions, regional differences in test administration (e.g., language differences and statistical differences in validation), and difficulties controlling for comorbidities in the included studies. Similar findings have been reported in previous reviews (J. P. T. Higgins & Li, 2022; Huang et al., 2020; Wilson, 2024).
In contrast, ASD+ID results indicated low or no heterogeneity, consistent with the minimal variation in mean differences and confidence intervals across the studies (Supplementary Figures S1–S5). However, these results are strongly influenced by the limited number of studies (k = 3), reducing the reliability of generalizations. Thus, ASD+ID data should be interpreted as exploratory, while ASD and ID findings carry greater relative weight (J. P. T. Higgins & Li, 2022).
Final point: due to the limited sample size, only ASD (excluding VPI and FRI) was included in funnel plot analysis, with results detailed in the Supplementary Material S3. Overall, these findings highlight the need for caution when generalizing across disorders, given their high variability and individual differences, and emphasize that diagnosis and intervention require individualized, context-sensitive clinical analyses.
4.4. Clinical Relevance
Impairments in Working Memory Index (WMI) and Processing Speed Index (PSI) in individuals with autism have significant implications for outcomes such as academic achievement, workplace functioning, daily living activities, and adaptive behavior. These deficits are consistently associated with challenges in ASD, even when overall IQ does not differ from that of typically developing (TD) individuals (McQuaid et al., 2021; Rommelse et al., 2020).
In intellectual disability (ID), deficits in Full Scale IQ (FSIQ) and other cognitive indices predict moderate to poor outcomes in education, adaptive behavior, and quality of life—even within ID severity gradations, where the degree of impairment varies according to FSIQ levels. While increases in IQ are associated with improvements in adaptive behavior, the reverse is not necessarily true: gains in adaptive functioning do not correspond to increases in FSIQ in most cases (Dakopolos et al., 2024; Tassé & Kim, 2023).
This study confirmed the hypothesized cognitive profiles for ASD and ID, demonstrating their consistency across diverse samples and test versions. These findings support the use of the Wechsler scales for identifying such deficits regardless of the applied version. However, the data also revealed that the Wechsler scales have limited sensitivity in distinguishing autism with intellectual deficit within ID samples. Therefore, while valuable for assessing both conditions, the Wechsler scales should be complemented by instruments that more precisely capture the autistic profile within ID, such as caregiver-report measures and ASD symptom-based assessments (e.g., SRS-2, VABS-III, ADOS) (Yu et al., 2024).
It is important to emphasize that both ASD and ID remain clinical diagnoses. ASD benefits more from index-based assessment given its specific functional profile, whereas ID groups often show similar scores across indices, except for FSIQ. Consequently, neither intelligence scores nor behavioral rating scales alone are sufficient for comprehensive diagnostic conclusions. In addition to symptom scales and cognitive tests, clinical interviews with caregivers, family members, or close associates, as well as direct clinical observation, are essential. The main differentiating factor for ID with or without ASD comorbidities is the presence of social symptoms and behavioral restrictions (e.g., echolalia, stereotypies). For ID, assessment of comprehension—particularly in pragmatic and conceptual domains—is critical, given the shared social impairments across both conditions (Braconnier & Siper, 2021; Carreiro et al., 2017; Yu et al., 2024).
4.5. Limitations
Despite consistent findings, several limitations of this review warrant clarification. First, notable sampling biases were identified in the selected studies. There was a predominance of male participants and a concentration of research conducted in North America, Western Europe, and Asia, with limited representation from Latin America, Africa, and the Middle East—highlighting a gap in the current state of the literature.
Another relevant point is that many studies were conducted in specialized clinics, often relying on convenience sampling without formal sample size calculations. This sampling method raises concerns about the representativeness of the samples relative to the general population, which should be considered when interpreting the results. Moreover, many studies did not control for comorbidities, including various neurodevelopmental disorders within ASD or ID participant groups.
Few studies included ASD+ID samples, and it was not possible to meta-analyze subtests due to a methodological preference for studies reporting all index scores, rather than all the subtests that comprise them (Lakens, 2022; Noordzij et al., 2011). These characteristics of literature, along with the present meta-analysis, should be carefully considered when generalizing the findings.
5. Conclusions
This review identified consistent patterns of cognitive performance, with relative weaknesses in Working Memory Index (WMI) and Processing Speed Index (PSI) among individuals with ASD, and more pronounced impairments—particularly in Full Scale IQ (FSIQ)—in both ID and ASD+ID groups. These patterns persisted regardless of the Wechsler scale version used, with no statistically significant differences in performance across test editions. Notably, the ASD+ID group did not differ significantly from the ID group but showed marked differences compared to the ASD group. However, few studies specifically addressed the ASD+ID comorbidity, so further studies are necessary to confirm this finding.
Future research should expand the investigation of ASD+ID and ID using the Wechsler scales to enhance understanding of their cognitive profiles. Greater precision in sample size estimation is also strongly recommended, along with increased research efforts in underrepresented regions of the Global South—particularly Latin America, Africa, and the Middle East. Additionally, whenever possible, improved control of comorbidities within samples is encouraged to strengthen variable control and the interpretability of findings. Finally, research analyzing other key variables in understanding ASD and ID, such as adaptive behavior, social cognition, and Executive Functions, is also strongly recommended.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Audras-Torrent L. Miniarikova E. Couty F. Dellapiazza F. Berard M. Michelon C. Picot M. C. Baghdadli A. WISC-V profiles and their correlates in children with autism spectrum disorder without intellectual developmental disorder: Report from the ELENA cohort Autism Research 2021145997100610.1002/aur.244433247546 · doi ↗ · pubmed ↗
- 2Barnard H. A systematic review of literature and meta-analysis of the cognitive performance of children with autism spectrum disorder using the wechsler intelligence scale for children Master’s thesis University of the Witwatersrand 2023
- 3Baum K. T. Shear P. K. Howe S. R. Bishop S. L. A comparison of WISC-IV and SB-5 intelligence scores in adolescents with autism spectrum disorder Autism 201519673674510.1177/136236131455492025398894 · doi ↗ · pubmed ↗
- 4Bojda A. Srebnicki T. KonowałekŁ. Bryńska A. Weak central coherence–construct conception, development, research methods Psychiatria Polska 20215561373138610.12740/PP/Online First/12093135472233 · doi ↗ · pubmed ↗
- 5Bougeard C. Picarel-Blanchot F. Schmid R. Campbell R. Buitelaar J. Prevalence of autism spectrum disorder and co-morbidities in children and adolescents: A systematic literature review Front Psychiatry 20211274470910.3389/fpsyt.2021.74470934777048 PMC 8579007 · doi ↗ · pubmed ↗
- 6Braconnier M. L. Siper P. M. Neuropsychological assessment in autism spectrum disorder Current Psychiatry Reports 202123101910.1007/s 11920-021-01277-1PMC 832444234331144 · doi ↗ · pubmed ↗
- 7Bucaille A. Grandgeorge M. Degrez C. Mallégol C. Cam P. Botbol M. Planche P. Cognitive profile in adults with Asperger syndrome using WAIS-IV: Comparison to typical adults Research in Autism Spectrum Disorders 2016211910.1016/j.rasd.2015.09.001 · doi ↗
- 8Calderoni S. Sex/gender differences in children with autism spectrum disorder: A brief overview on epidemiology, symptom profile, and neuroanatomy Journal of Neuroscience Research 2023101573975010.1002/jnr.2500035043482 · doi ↗ · pubmed ↗
