# Maximizing Diagnostic Yield in Intellectual Disability Through Exome Sequencing: Genotype–Phenotype Insights in a Vietnamese Cohort

**Authors:** Thu Lan Hoang, Thi Kim Phuong Doan, Thi Ngoc Lan Hoang, Cam Tu Ho, Thi Ha Vu, Thi Trang Nguyen, Thi Huyen Vu, Thi Trang Dao, Thi Minh Ngoc Nguyen, Phuong Mai Nguyen, Huu Duc Anh Nguyen, Chi Dung Vu, Phuong Thao Do, Quang Phuc Pham, Quang Trung Nguyen, Thi Phuong Mai Nguyen, Thi Thuy Ninh To, Hoa Giang, Thi Lan Anh Luong

PMC · DOI: 10.3390/diagnostics15222821 · 2025-11-07

## TL;DR

This study shows how whole and clinical exome sequencing improve diagnosis of intellectual disability in Vietnamese children by linking genetic variants to specific clinical features.

## Contribution

A multidimensional phenotypic scoring system and clustering method to enhance genotype–phenotype integration in diagnosing intellectual disability.

## Key findings

- Three major biological patient groups were identified with distinct genetic and phenotypic profiles.
- Higher Z-scores correlated with earlier disease onset and greater neurological severity.
- WES and CES showed superior diagnostic resolution compared to traditional methods.

## Abstract

Background: Intellectual disability (ID) is a heterogeneous condition caused by diverse genetic factors, including single-nucleotide variants (SNVs) and copy number variants (CNVs). Whole-exome sequencing (WES) and clinical exome sequencing (CES) have become essential tools for identifying pathogenic variants; however, their relative diagnostic performance in ID has not been fully characterized. Methods: Children diagnosed with ID or related neurodevelopmental disorders underwent WES or CES. Identified variants were classified according to ACMG/AMP and ClinGen guidelines, with segregation analysis performed when parental samples were available. Diagnostic yields were compared across demographic, prenatal, and phenotypic subgroups. A multidimensional semi-quantitative scoring system encompassing 15 clinical domains (e.g., age at onset, neuro-motor function, seizures, MRI findings, vision, and dysmorphic features) was developed. Z-scores were calculated for each parameter, followed by hierarchical cluster analysis (HCA) and correlation modeling to define genotype–phenotype associations and pathway-level clustering. Results: A broad spectrum of pathogenic and likely pathogenic variants across multiple genes and biological pathways was identified in our study. CNV-associated cases frequently exhibited prenatal anomalies or multisystem phenotypes associated with large chromosomal rearrangements. Monogenic variants and their corresponding phenotypic profiles were identified through clinical exome sequencing (CES) and whole-exome sequencing (WES). Phenotypic HCA based on Z-scores revealed three major biological groups of patients with coherent genotype–phenotype relationships: Group 1, severe multisystem neurodevelopmental disorders dominated by transcriptional and RNA-processing genes (POLR1C, TCF4, HNRNPU, NIPBL, ACTG1); Group 2, intermediate epileptic and metabolic forms associated with ion-channel and excitability-related genes (SCN2A, PAH, IQSEC2, GNPAT); and Group 3, milder or focal neurodevelopmental phenotypes involving myelination and signaling-related genes (NKX6-2, PLP1, PGAP3, SMAD6, ATP1A3). Gene distribution significantly differed among these biological categories (χ2 = 54.566, df = 34, p = 0.0141), confirming non-random, biologically consistent grouping. Higher Z-scores correlated with earlier onset and greater neurological severity, underscoring the clinical relevance of the multidimensional analytical framework. Conclusions: This study highlights the genetic complexity and clinical heterogeneity of intellectual disability and demonstrates the superior diagnostic resolution of WES and CES. Integrating multidimensional phenotypic profiling with genomic analysis enhances genotype–phenotype integration and enables data-driven phenotype stratification and pathway-based re-analysis. This combined diagnostic and analytical framework offers a more comprehensive approach to diagnosing monogenic ID and provides a foundation for future predictive and functional studies.

## Linked entities

- **Genes:** POLR1C (RNA polymerase I and III subunit C) [NCBI Gene 9533], TCF4 (transcription factor 4) [NCBI Gene 6925], HNRNPU (heterogeneous nuclear ribonucleoprotein U) [NCBI Gene 3192], NIPBL (NIPBL cohesin loading factor) [NCBI Gene 25836], ACTG1 (actin gamma 1) [NCBI Gene 71], SCN2A (sodium voltage-gated channel alpha subunit 2) [NCBI Gene 6326], PAH (phenylalanine hydroxylase) [NCBI Gene 5053], IQSEC2 (IQ motif and Sec7 domain ArfGEF 2) [NCBI Gene 23096], GNPAT (glyceronephosphate O-acyltransferase) [NCBI Gene 8443], NKX6-2 (NK6 homeobox 2) [NCBI Gene 84504], PLP1 (proteolipid protein 1) [NCBI Gene 5354], PGAP3 (post-GPI attachment to proteins phospholipase 3) [NCBI Gene 93210], SMAD6 (SMAD family member 6) [NCBI Gene 4091], ATP1A3 (ATPase Na+/K+ transporting subunit alpha 3) [NCBI Gene 478]
- **Diseases:** intellectual disability (MONDO:0001071)

## Full-text entities

- **Genes:** PLP1 (proteolipid protein 1) [NCBI Gene 5354] {aka GPM6C, HLD1, MMPL, PLP, PLP/DM20, PMD}, GNPAT (glyceronephosphate O-acyltransferase) [NCBI Gene 8443] {aka DAP-AT, DAPAT, DHAPAT, RCDP2}, SMAD6 (SMAD family member 6) [NCBI Gene 4091] {aka AOVD2, HsT17432, MADH6, MADH7}, POLR1C (RNA polymerase I and III subunit C) [NCBI Gene 9533] {aka AC40, HLD11, RPA39, RPA40, RPA5, RPAC1}, ACTG1 (actin gamma 1) [NCBI Gene 71] {aka ACT, ACTG, DFNA20, DFNA26, HEL-176}, NIPBL (NIPBL cohesin loading factor) [NCBI Gene 25836] {aka CDLS, CDLS1, IDN3, IDN3-B, Scc2}, HNRNPU (heterogeneous nuclear ribonucleoprotein U) [NCBI Gene 3192] {aka DEE54, EIEE54, GRIP120, HNRNPU-AS1, HNRPU, SAF-A}, NKX6-2 (NK6 homeobox 2) [NCBI Gene 84504] {aka GTX, NKX6.2, NKX6B, SPAX8}, IQSEC2 (IQ motif and Sec7 domain ArfGEF 2) [NCBI Gene 23096] {aka BRAG1, IQ-ArfGEF, MRX1, MRX18, MRX78, NEDXSB}, SCN2A (sodium voltage-gated channel alpha subunit 2) [NCBI Gene 6326] {aka BFIC3, BFIS3, BFNIS, DEE11, EA9, EIEE11}, TCF4 (transcription factor 4) [NCBI Gene 6925] {aka CDG2T, E2-2, FCD2, FECD3, ITF-2, ITF2}, PGAP3 (post-GPI attachment to proteins phospholipase 3) [NCBI Gene 93210] {aka AGLA546, CAB2, PERLD1, PP1498, hCOS16}, ATP1A3 (ATPase Na+/K+ transporting subunit alpha 3) [NCBI Gene 478] {aka AHC2, CAPOS, DEE99, DYT12, RDP}, PAH (phenylalanine hydroxylase) [NCBI Gene 5053] {aka PH, PKU, PKU1}
- **Diseases:** dysmorphic features (MESH:D000013), neurodevelopmental disorders (MESH:D002658), epileptic and metabolic (MESH:D024821), seizures (MESH:D012640), neurodevelopmental phenotypes (MESH:C537393), ID (MESH:D008607)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12651281/full.md

---
Source: https://tomesphere.com/paper/PMC12651281