# Multi-System Genetic Architecture of Hypermobile Ehlers–Danlos Syndrome: Integrating Machine Learning with Subject-Level Genomic Analysis

**Authors:** Arash Shirvani, Purusha Shirvani, Michael F. Holick

PMC · DOI: 10.3390/genes17020211 · Genes · 2026-02-09

## TL;DR

This study uses machine learning and genomic analysis to uncover multiple biological systems linked to hypermobile Ehlers–Danlos syndrome, a condition with previously unknown genetic causes.

## Contribution

The study introduces an integrated approach combining machine learning and subject-level statistical analysis to reveal multi-system genetic associations in hEDS.

## Key findings

- Variants in collagen biosynthesis, HLA/immune, and mitochondrial systems were significantly enriched in hEDS patients.
- Seven genes encoding structural/cytoskeletal proteins achieved genome-wide significance.
- Machine learning models achieved 80% accuracy in classifying hEDS cases.

## Abstract

Background/Objectives: Hypermobile Ehlers–Danlos syndrome (hEDS) remains genetically unexplained despite decades of clinical investigation, with the molecular basis undefined for the vast majority of cases. This study employs integrated machine learning approaches with rigorous subject-level statistical methods to decode the genetic architecture underlying hEDS. Methods: We analyzed 35,923 rare genetic variants (gnomAD MAF < 0.2) across 116 subjects from 43 families (86 hEDS patients diagnosed per 2017 international criteria; 30 unaffected intrafamilial controls) using whole-exome sequencing. Machine learning analysis employed Random Forest feature selection, deep neural networks, and ensemble methods with subject-stratified cross-validation to prevent data leakage. Statistical association testing used subject-level Fisher’s exact tests with Bonferroni correction (α = 3.77 × 10−6 for 13,281 genes). Sensitivity analyses assessed robustness to family structure. Results: Subject-level analysis identified statistically significant enrichment in variants associated with three major biological systems: (1) collagen biosynthesis pathway variants (present in 63% of hEDS subjects vs. 17% of controls, Fisher’s p = 1.06 × 10−5, OR = 8.4), predominantly affecting COL5A1, COL18A1, COL17A1, and post-translational modification enzymes; (2) HLA/adaptive immune axis variants (74% of hEDS vs. 30% of controls, p = 2.23 × 10−5, OR = 6.8), involving HLA-B, HLA-A, HLA-C, and TAP transporters; (3) mitochondrial respiratory chain variants (34% of hEDS vs. 7% of controls, p = 2.29 × 10−3, OR = 7.1), with striking 4.2-fold enrichment in pediatric fracture cases (52% vs. 21%, p = 0.021, 95% CI: 1.2–14.6). These associations require independent validation and functional studies to determine their mechanistic relevance. Genome-wide analysis identified seven genes achieving Bonferroni significance (p < 3.77 × 10−6), all encoding structural/cytoskeletal proteins. Machine learning models with proper subject-stratified cross-validation achieved 80% accuracy (95% CI: 73–86%, sensitivity = 82%, specificity = 77%). Conclusions: Our findings suggest that hEDS may involve genetic variation across multiple biological systems beyond classical collagen pathways. These hypothesis-generating associations require validation in independent cohorts and functional studies before mechanistic or clinical conclusions can be drawn.

## Linked entities

- **Genes:** COL5A1 (collagen type V alpha 1 chain) [NCBI Gene 1289], COL18A1 (collagen type XVIII alpha 1 chain) [NCBI Gene 80781], COL17A1 (collagen type XVII alpha 1 chain) [NCBI Gene 1308], HLA-B (major histocompatibility complex, class I, B) [NCBI Gene 3106], HLA-A (major histocompatibility complex, class I, A) [NCBI Gene 3105], HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107]
- **Diseases:** hypermobile Ehlers–Danlos syndrome (MONDO:0007523), hEDS (MONDO:0007523)

## Full-text entities

- **Genes:** ND4 (NADH dehydrogenase subunit 4) [NCBI Gene 4538] {aka MTND4}, ND4L (NADH dehydrogenase subunit 4L) [NCBI Gene 4539] {aka MTND4L}, COL18A1 (collagen type XVIII alpha 1 chain) [NCBI Gene 80781] {aka GLCC, KNO, KNO1, KS}, ATP6 (ATP synthase F0 subunit 6) [NCBI Gene 4508] {aka ATPase6, MTATP6}, KRT5 (keratin 5) [NCBI Gene 3852] {aka CK5, DDD, DDD1, EBS1, EBS2, EBS2A}, COL5A2 (collagen type V alpha 2 chain) [NCBI Gene 1290] {aka EDSC, EDSCL2}, P3H1 (prolyl 3-hydroxylase 1) [NCBI Gene 64175] {aka GROS1, LEPRE1, OI8}, ATP8 (ATP synthase F0 subunit 8) [NCBI Gene 4509] {aka ATPase8, MTATP8}, HSPG2 (heparan sulfate proteoglycan 2) [NCBI Gene 3339] {aka HSPG, PLC, PRCAN, SJA, SJS, SJS1}, PLOD1 (procollagen-lysine,2-oxoglutarate 5-dioxygenase 1) [NCBI Gene 5351] {aka EDS6, EDSKCL1, LH, LH1, LLH, PLOD}, RELN (reelin) [NCBI Gene 5649] {aka ETL7, LIS2, PRO1598, RL}, LAMA2 (laminin subunit alpha 2) [NCBI Gene 3908] {aka LAMM, MDC1A}, DMD (dystrophin) [NCBI Gene 1756] {aka BMD, CMD3B, DXS142, DXS164, DXS206, DXS230}, FLG (filaggrin) [NCBI Gene 2312] {aka ATOD2, FLG-1, FLG1}, COX2 (cytochrome c oxidase subunit II) [NCBI Gene 4513] {aka COII, MTCO2}, HLA-C (major histocompatibility complex, class I, C) [NCBI Gene 3107] {aka D6S204, HLA-JY3, HLAC, HLC-C, MHC, PSORS1}, HLA-DQA1 (major histocompatibility complex, class II, DQ alpha 1) [NCBI Gene 3117] {aka CELIAC1, DQ-A1, DQA1, HLA-DQA, HLA-DQA1*}, TAP1 (transporter 1, ATP binding cassette subfamily B member) [NCBI Gene 6890] {aka ABC17, ABCB2, APT1, D6S114E, MHC1D1, PSF-1}, FLNC (filamin C) [NCBI Gene 2318] {aka ABP-280, ABP280A, ABPA, ABPL, ARVC15, CMD1PP}, ND1 (NADH dehydrogenase subunit 1) [NCBI Gene 4535] {aka MTND1}, HLA-DRB1 (major histocompatibility complex, class II, DR beta 1) [NCBI Gene 3123] {aka DRB1, HLA-DR1B, HLA-DRB, SS1}, COL11A2 (collagen type XI alpha 2 chain) [NCBI Gene 1302] {aka DFNA13, DFNB53, FBCG2, HKE5, OSMEDA, OSMEDB}, ND5 (NADH dehydrogenase subunit 5) [NCBI Gene 4540] {aka MTND5}, HLA-B (major histocompatibility complex, class I, B) [NCBI Gene 3106] {aka AS, B-4901, HLAB}, PLOD3 (procollagen-lysine,2-oxoglutarate 5-dioxygenase 3) [NCBI Gene 8985] {aka BCARD, LH3}, COL1A2 (collagen type I alpha 2 chain) [NCBI Gene 1278] {aka EDSARTH2, EDSCV, OI4}, HLA-DQB1 (major histocompatibility complex, class II, DQ beta 1) [NCBI Gene 3119] {aka CELIAC1, HLA-DQB, IDDM1}, PCDHGA1 (protocadherin gamma subfamily A, 1) [NCBI Gene 56114] {aka PCDH-GAMMA-A1}, ND3 (NADH dehydrogenase subunit 3) [NCBI Gene 4537] {aka MTND3}, SERPINH1 (serpin family H member 1) [NCBI Gene 871] {aka AsTP3, CBP1, CBP2, HSP47, OI10, PIG14}, COL11A1 (collagen type XI alpha 1 chain) [NCBI Gene 1301] {aka CO11A1, COLL6, DFNA37, STL2}, COL5A1 (collagen type V alpha 1 chain) [NCBI Gene 1289] {aka EDSC, EDSCL1, FMDMF}, NEB (nebulin) [NCBI Gene 4703] {aka AMC6, NEB177D, NEM2}, SYNE2 (spectrin repeat containing nuclear envelope protein 2) [NCBI Gene 23224] {aka EDMD5, KASH2, NUA, NUANCE, Nesp2, Nesprin-2}, TTN (titin) [NCBI Gene 7273] {aka CMD1G, CMH9, CMPD4, CMYO5, CMYP5, EOMFC}, PLOD2 (procollagen-lysine,2-oxoglutarate 5-dioxygenase 2) [NCBI Gene 5352] {aka BRKS2, LH2, TLH}, SYNE1 (spectrin repeat containing nuclear envelope protein 1) [NCBI Gene 23345] {aka 8B, AMC3, AMCM, ARCA1, C6orf98, CPG2}, COL17A1 (collagen type XVII alpha 1 chain) [NCBI Gene 1308] {aka BA16H23.2, BP180, BPA-2, BPAG2, ERED, JEB4}, COL1A1 (collagen type I alpha 1 chain) [NCBI Gene 1277] {aka CAFYD, EDSARTH1, EDSC, OI1, OI2, OI3}, KRT74 (keratin 74) [NCBI Gene 121391] {aka ADWH, ECTD7, HTSS2, HYPT3, K6IRS4, KRT5C}, COX3 (cytochrome c oxidase subunit III) [NCBI Gene 4514] {aka COIII, MTCO3}, TAP2 (transporter 2, ATP binding cassette subfamily B member) [NCBI Gene 6891] {aka ABC18, ABCB3, APT2, D6S217E, MHC1D2, PSF-2}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, COL3A1 (collagen type III alpha 1 chain) [NCBI Gene 1281] {aka EDS4A, EDSVASC, PMGEDSV}, CCDST (cervical cancer associated DHX9 suppressive transcript) [NCBI Gene 339400] {aka FLG-AS1, LINC02962, lnc-CCDST}, OBSCN (obscurin, cytoskeletal calmodulin and titin-interacting RhoGEF) [NCBI Gene 84033] {aka ARHGEF30, RHABDO1, UNC89}, COX1 (cytochrome c oxidase subunit I) [NCBI Gene 4512] {aka COI, MTCO1}, HLA-A (major histocompatibility complex, class I, A) [NCBI Gene 3105] {aka HLAA}, HLA-DPB1 (major histocompatibility complex, class II, DP beta 1) [NCBI Gene 3115] {aka DPB1, HLA-DP, HLA-DP1B, HLA-DPB}, ND2 (NADH dehydrogenase subunit 2) [NCBI Gene 4536] {aka MTND2}, NXF1 (nuclear RNA export factor 1) [NCBI Gene 10482] {aka MEX67, TAP}, ND6 (NADH dehydrogenase subunit 6) [NCBI Gene 4541] {aka MTND6}, CYTB (cytochrome b) [NCBI Gene 4519] {aka MTCYB}
- **Diseases:** capillary (OMIM:163000), skin hyperextensibility (MESH:D012871), ID (MESH:C537985), Mitochondrial Dysfunction (MESH:D028361), Fracture (MESH:D050723), child abuse (MESH:C535569), injury to (MESH:D014947), orthostatic intolerance (MESH:D054971), Ehlers-Danlos Syndrome (MESH:D004535), inflammation (MESH:D007249), Alzheimer's disease (MESH:D000544), vitamin D deficiency (MESH:D014808), gastrointestinal dysmotility (MESH:D015154), atrophic scarring (MESH:D002921), ML (MESH:D007859), autoimmune comorbidities (MESH:D001327), EDS (MESH:C536196), autonomic dysfunction (MESH:D001342), dysregulation (MESH:D021081), genetic disorder (MESH:D030342), fragility fractures (MESH:D005600), rare diseases (MESH:D035583), connective tissue disorder (MESH:D003240), mast cell activation disorder (MESH:D000090362), cardiovascular disease (MESH:D002318), chronic pain syndromes (MESH:D059350), type 2 diabetes (MESH:D003924), bruising (MESH:D003288), collagen dysfunction (MESH:D003095), joint hypermobility (MESH:D007593), chronic (MESH:D002908), immune dysregulation (OMIM:614878)
- **Chemicals:** vitamin D (MESH:D014807), CoQ10 (MESH:C024989), vitamin C (MESH:D001205), riboflavin (MESH:D012256), ATP (MESH:D000255), calcium (MESH:D002118)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12940503/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12940503/full.md

## References

35 references — full list in the complete paper: https://tomesphere.com/paper/PMC12940503/full.md

---
Source: https://tomesphere.com/paper/PMC12940503