# Systematic Review of Non-Coding Genomic Variants in Globin and Non-Globin Clusters and Their Impact on Phenotypic Severity in Thalassemia and Sickle Cell Disease

**Authors:** Abeer M. Al-Subaie, J. Francis Borgio

PMC · DOI: 10.3390/jcm15041345 · Journal of Clinical Medicine · 2026-02-09

## TL;DR

This study reviews how non-coding genetic variants influence the severity of hemoglobin disorders like thalassemia and sickle cell disease, beyond the known coding mutations.

## Contribution

The paper systematically identifies non-coding variants that modulate clinical severity in hemoglobinopathies, offering new insights into 'missing heritability'.

## Key findings

- Cis-acting variants in globin gene clusters and trans-acting QTLs like BCL11A and HBS1L-MYB influence clinical severity in hemoglobinopathies.
- Non-globin NCVs in UGT1A1 and promoter regions of TNF-α and VCAM1 are linked to disease severity and complications in β-thal and SCD.
- XmnI polymorphism in HBG2 is a key modifier of β-thal intermedia severity.

## Abstract

Background: Haemoglobinopathies such as beta-thalassemia (β-thal), alpha-thalassemia (α-thal) and sickle cell disease (SCD) are characterised by pathogenic gene variations (mutations) in the globin genes. Patients with haemoglobinopathies have the same disease-causing coding variations with very different disease phenotypes, from requiring blood transfusions to being non-symptomatic. The gap between the expected clinical outcomes based on primary coding mutations (the genotype) and the actual observed symptoms (the phenotype) often remains unexplained. We refer to the contribution of secondary genetic modifiers—specifically, non-coding variants of the genome that alter globin gene expression and pathophysiology—as the “missing heritability” of the clinical presentation [Primary Mutation + Missing Heritability (Non-Coding Variants) = Actual Clinical Phenotype]. Objectives: This systematic review aims to find evidence connecting genetic differences outside of the protein-coding region, as in promoters, enhancers or untranslated regions (UTRs), to the clinical severity (phenotype) of beta-thalassemia, alpha-thalassaemia and SCD. We summarise the molecular basis of phenotypic variation among haemoglobinopathy patients with identical variations to reveal their missing heritability and to enhance our understanding of prognostic strategies. Methods: This systematic review was performed in accordance with the PRISMA 2020 guidelines. We used search terms related to haemoglobinopathies, non-coding variation, SNP, promoters, enhancers and clinical severity to search major databases (PubMed and Google Scholar) as of October 2025. A total of 527 (out of 572) abstracts were fit for initial screening to identify the eligible reports. Due to heterogeneity in study designs and reported outcomes, findings were synthesised descriptively and grouped by variant mechanism (cis-acting and trans-acting). The final analysis included 89 articles that demonstrated a direct association between a non-coding genomic variant and a quantitative measure of clinical severity. Results: Two main groups of non-coding variants (NCVs) that modulate foetal haemoglobin (HbF) induction were identified. The first major group comprises cis-acting variants within globin gene clusters (HBG2 promoter XmnI polymorphism, HBB promoter mutations and α-globin enhancer variants), while the second major group comprises trans-acting quantitative trait loci (QTLs) (BCL11A and HBS1L-MYB loci). Non-globin NCVs in the UGT1A1 promoter were also found to influence the severity measures in β-thal and SCD. NCVs primarily alter the binding of transcription factors and the looping dynamics of chromatin, modulating the α/β chain balance ratio and γ-globin repression. The XmnI polymorphism is the most prominent cis-acting modifier associated with β-thal intermedia. The promoter polymorphisms in TNF-α and VCAM1 are associated with vascular complications in SCD. Conclusions: NCVs are fundamental when determining the clinical measures of haemoglobinopathies, in addition to coding variants. NCV screening should be integrated for clinical prognosis for the accurate prediction of haemoglobinopathy severity and associated high-risk complications. NCVs may represent promising targets for next-generation gene editing and therapeutic intervention strategies aimed at modifying the severity of β-thal, α-thal and SCD.

## Linked entities

- **Genes:** HBG2 (hemoglobin subunit gamma 2) [NCBI Gene 3048], HBB (hemoglobin subunit beta) [NCBI Gene 3043], UGT1A1 (UDP glucuronosyltransferase family 1 member A1) [NCBI Gene 54658], BCL11A (BCL11 transcription factor A) [NCBI Gene 53335], TNF (tumor necrosis factor) [NCBI Gene 7124], VCAM1 (vascular cell adhesion molecule 1) [NCBI Gene 7412]
- **Diseases:** beta-thalassemia (MONDO:0019402), alpha-thalassemia (MONDO:0011399), sickle cell disease (MONDO:0011382)

## Full-text entities

- **Genes:** HBA1 (hemoglobin subunit alpha 1) [NCBI Gene 3039] {aka ECYT7, HBA-T3, HBH, METHBA}, GATA1 (GATA binding protein 1) [NCBI Gene 2623] {aka CNSHA9, ERYF1, GATA-1, GF-1, GF1, HAEADA}, HBS1L (HBS1 like translational GTPase) [NCBI Gene 10767] {aka EF-1a, ERFS, HBS1, HSPC276, eRF3c}, HBB (hemoglobin subunit beta) [NCBI Gene 3043] {aka CD113t-C, ECYT6, beta-globin}, KRT90P (keratin 90, pseudogene) [NCBI Gene 85340] {aka HBA, KRT124P, KRTHBP1}, HMOX1 (heme oxygenase 1) [NCBI Gene 3162] {aka HMOX1D, HO-1, HSP32, bK286B10}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, NR2F2 (nuclear receptor subfamily 2 group F member 2) [NCBI Gene 7026] {aka ARP-1, ARP1, CHTD4, COUPTF2, COUPTFB, COUPTFII}, HBD (hypophosphatemic bone disease) [NCBI Gene 100187828], HBA2 (hemoglobin subunit alpha 2) [NCBI Gene 3040] {aka ECYT7, HBA-T2, HBH}, VCAM1 (vascular cell adhesion molecule 1) [NCBI Gene 7412] {aka CD106, INCAM-100}, KLF1 (KLF transcription factor 1) [NCBI Gene 10661] {aka CDAN4A, CDAN4B, EKLF, EKLF/KLF1}, SUPT5H (SPT5 homolog, DSIF elongation factor subunit) [NCBI Gene 6829] {aka SPT5, SPT5H, Tat-CT1}, BCL11A (BCL11 transcription factor A) [NCBI Gene 53335] {aka CTIP1, DILOS, EVI9, HBFQTL5, SMARCM1, ZNF856}, ZBTB7A (zinc finger and BTB domain containing 7A) [NCBI Gene 51341] {aka FBI-1, FBI1, LRF, MNDLFH, TIP21, ZBTB7}, UGT1A1 (UDP glucuronosyltransferase family 1 member A1) [NCBI Gene 54658] {aka BILIQTL1, GNT1, HUG-BR1, UDPGT, UDPGT 1-1, UGT1}, HBG1 (hemoglobin subunit gamma 1) [NCBI Gene 3047] {aka HBG-T2, HBGA, HBGR, HSGGL1, PRO2979}, HBG2 (hemoglobin subunit gamma 2) [NCBI Gene 3048] {aka HBG-T1, TNCY}, APP (amyloid beta precursor protein) [NCBI Gene 351] {aka AAA, ABETA, ABPP, AD1, APPI, CTFgamma}, MYB (MYB proto-oncogene, transcription factor) [NCBI Gene 4602] {aka Cmyb, c-myb, c-myb_CDS, efg}, TNF (tumor necrosis factor) [NCBI Gene 7124] {aka DIF, IMD127, TNF-alpha, TNFA, TNFSF2, TNLG1F}
- **Diseases:** Mendelian inherited disorders (MESH:D030342), cholelithiasis (MESH:D002769), chronic hyperbilirubinemia (MESH:D006932), monogenic disorders (MESH:D009358), Thalassemia (MESH:D013789), damage (MESH:D020263), Chronic haemolysis (MESH:D006461), NCVs (MESH:C580335), organ damage (MESH:D000092124), Gilbert's syndrome (MESH:D005878), Beta-thalassemia (MESH:D017086), alpha-thal (MESH:D017085), stroke (MESH:D020521), SCD (MESH:D000755), coagulation (MESH:D001778), hypochromic microcytosis (OMIM:616959), large-vessel stroke (MESH:C536223), gallstone (MESH:D042882), ACS (MESH:D000168), pigment (MESH:D010859), alpha-thalassaemia (MESH:D000795), Haemolysis and Hepatobiliary Complications (MESH:D004066), inflammation (MESH:D007249), injury to (MESH:D014947), acute chest syndrome (MESH:D056586), VOCs (MESH:D013224), HPFH (MESH:D009386), cerebral vasculopathy (MESH:C566007), globin gene defect (MESH:C564194)
- **Chemicals:** hydroxyurea (MESH:D006918), bilirubin (MESH:D001663), heme (MESH:D006418)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** rs11865131, C33G, rs7482144, +20 (C>T), c.307+1G>A, -136C > G, 506 (A>C), -86 (C > G), (-308)G/A, rs3024735, c.*+108A>G, +90 del 13 bp, rs3024718, -29 (A to G, c.*+132C>T, -75G>T, G   A, A   T, A/rs3024719G/rs3024731T/, C   T, G/rs3024719G/rs3024731T/, -71 C>T, c.-76A>C, rs66650371, -92(C-->T), g.-109G>T, G   T, -86 C, IVS1-116 (A-->G), rs11886868, rs9402686, rs9399137, IVS-II-745 (C>G), -26 (A>C), -73(A-->T), rs4671393

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12941110/full.md

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12941110/full.md

## References

101 references — full list in the complete paper: https://tomesphere.com/paper/PMC12941110/full.md

---
Source: https://tomesphere.com/paper/PMC12941110