# Global Evaluation of Congenital Heart Disease-Associated Non-Coding Variants

**Authors:** Edwin G. Peña-Martínez, Shreya Sharma, Joshua G. Medina-Feliciano, Elise Root, Lois G. Parks, Marissa Granitto, Diego A. Pomales-Matos, Jean L. Messon-Bird, Adriana C. Barreiro-Rosario, Leandro Sanabria-Alberto, Alejandro Rivera-Madera, Jessica M. Rodríguez-Ríos, Rosalba Velázquez-Roig, Juan A. Figueroa-Rosado, Mackenzie Noon, Omer A. Donmez, Carmy Forney, Hayley K. Hesse, Katelyn A. Dunn, Xiaoting Chen, Matthew R. Hass, Lucinda P. Lawson, Matthew T. Weirauch, Leah C. Kottyan, Steven K. Reilly, Devesh Bhimsaria, José A. Rodríguez-Martínez

PMC · DOI: 10.21203/rs.3.rs-8429365/v1 · Research Square · 2026-01-07

## TL;DR

This study evaluates thousands of non-coding genetic variants linked to heart defects to understand how they affect gene regulation and disease risk.

## Contribution

The study introduces a high-throughput method combining SNP Bind-n-Seq and MPRA to functionally validate CHD-risk variants.

## Key findings

- 170 variants show allelic transcription factor binding.
- 187 variants modulate gene expression in cardiac cells.
- Three high-confidence variants exhibit multiple regulatory effects.

## Abstract

Genome-wide association studies (GWAS) have mapped thousands of congenital heart disease (CHD)-associated variants within non-coding regions of the genome. Noncoding variants can alter regulatory mechanisms, such as transcription factor (TF) binding control of gene expression, potentially contributing human diseases. However, with the increasing number of disease-associated variants, comprehensive functional validation remains a significant challenge. In this work, we developed a novel method called SNP Bind-n-Seq to evaluate >3,000 CHD-risk variants for allelic binding for the cardiac TFs NKX2-5, GATA4, and TBX5 in a high-throughput manner. These binding affinity data sets were coupled with a massively parallel reporter assay (MPRA) to screen CHD-risk variant genotype-dependent regulatory activity. We identified 170 variants that exhibit allelic TF binding and 187 that modulate gene expression. Combining both approaches revealed three high-confidence variants with genotype-dependent TF binding, genotype-dependent transcriptional activity, and eQTL behavior in cardiac cells. Collectively, this study provides the first combined high-throughput biochemical and functional genomic evaluation of thousands of CHD-risk variants.

## Linked entities

- **Proteins:** NKX2-5 (NK2 homeobox 5), GATA4 (GATA binding protein 4), TBX5 (T-box transcription factor 5)
- **Diseases:** congenital heart disease (MONDO:0005453)

## Full-text entities

- **Genes:** KLF10 (KLF transcription factor 10) [NCBI Gene 7071] {aka EGR-alpha, EGRA, TIEG, TIEG1}, TBX21 (T-box transcription factor 21) [NCBI Gene 30009] {aka IMD88, T-PET, T-bet, TBET, TBLYM}, GATA4 (GATA binding protein 4) [NCBI Gene 2626] {aka ASD2, TACHD, TOF, VSD1}, NKX2-1 (NK2 homeobox 1) [NCBI Gene 7080] {aka BCH, BHC, NK-2, NKX2.1, NKX2A, NMTC1}, GATA6 (GATA binding protein 6) [NCBI Gene 2627], TBX5 (T-box transcription factor 5) [NCBI Gene 6910] {aka HOS}, CCR4 (C-C motif chemokine receptor 4) [NCBI Gene 1233] {aka CC-CKR-4, CD194, CKR4, CMKBR4, ChemR13, HGCN:14099}, TCF21 (transcription factor 21) [NCBI Gene 6943] {aka POD1, bHLHa23}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, CREB1 (cAMP responsive element binding protein 1) [NCBI Gene 1385] {aka CREB, CREB-1}, SMAD2 (SMAD family member 2) [NCBI Gene 4087] {aka CHTD8, JV18, JV18-1, LDS6, MADH2, MADR2}, NKX2-2 (NK2 homeobox 2) [NCBI Gene 4821] {aka NKX2.2, NKX2B}, NKX3-1 (NK3 homeobox 1) [NCBI Gene 4824] {aka BAPX2, NKX3, NKX3.1, NKX3A}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, CNOT6L (CCR4-NOT transcription complex subunit 6 like) [NCBI Gene 246175] {aka CCR4b}, DDR1 (discoidin domain receptor tyrosine kinase 1) [NCBI Gene 780] {aka CAK, CD167, DDR, EDDR1, HGK2, MCK10}, GALC (galactosylceramidase) [NCBI Gene 2581], MECP2 (methyl-CpG binding protein 2) [NCBI Gene 4204] {aka AUTSX3, MRX16, MRX79, MRXS13, MRXSL, PPMX}, GATA1 (GATA binding protein 1) [NCBI Gene 2623] {aka CNSHA9, ERYF1, GATA-1, GF-1, GF1, HAEADA}, QRSL1 (glutaminyl-tRNA amidotransferase subunit QRSL1) [NCBI Gene 55278] {aka COXPD40, GatA}, VARS2 (valyl-tRNA synthetase 2, mitochondrial) [NCBI Gene 57176] {aka COXPD20, VALRS, VARS2L, VARSL}, GATA3 (GATA binding protein 3) [NCBI Gene 2625] {aka HDR, HDRS}, IRF3 (interferon regulatory factor 3) [NCBI Gene 3661] {aka IIAE7}, MGAT4C (MGAT4 family member C) [NCBI Gene 25834] {aka GNTIVH, HGNT-IV-H}, NKX2-5 (NK2 homeobox 5) [NCBI Gene 1482] {aka CHNG5, CSX, CSX1, HLHS2, NKX2.5, NKX2E}, SRF (serum response factor) [NCBI Gene 6722] {aka MCM1}, MEF2A (myocyte enhancer factor 2A) [NCBI Gene 4205] {aka ADCAD1, RSRFC4, RSRFC9, mef2}, TBX6 (T-box transcription factor 6) [NCBI Gene 6911] {aka SCDO5}, IRF8 (interferon regulatory factor 8) [NCBI Gene 3394] {aka H-ICSBP, ICSBP, ICSBP1, IMD32A, IMD32B, IRF-8}, POU3F1 (POU class 3 homeobox 1) [NCBI Gene 5453] {aka OCT6, OTF6, SCIP}, BACH2 (BACH transcriptional regulator 2) [NCBI Gene 60468] {aka BTBD25, IMD60}, FOXH1 (forkhead box H1) [NCBI Gene 8928] {aka FAST-1, FAST1}, MAFK (MAF bZIP transcription factor K) [NCBI Gene 7975] {aka NFE2U, P18}, RAB2A (RAB2A, member RAS oncogene family) [NCBI Gene 5862] {aka LHX, RAB2}, MYOM1 (myomesin 1) [NCBI Gene 8736] {aka SKELEMIN}, RAD21 (RAD21 cohesin complex component) [NCBI Gene 5885] {aka CDLS4, HR21, HRAD21, MCD1, MGS, NXP1}, ASCL2 (achaete-scute family bHLH transcription factor 2) [NCBI Gene 430] {aka ASH2, HASH2, MASH2, bHLHa45}
- **Diseases:** genetic (MESH:D030342), conotruncal heart defects (MESH:C535464), neurological and immune diseases (MESH:D020274), cardiomyopathies (MESH:D009202), cardiac diseases (MESH:D006331), CHD (MESH:D006330), DGF (MESH:D042822), respiratory and neurological diseases (MESH:D012140), abnormalities (MESH:D000014), CVDs (MESH:D002318), deaths (MESH:D003643), heart failures (MESH:D006333)
- **Chemicals:** hygromycin (MESH:C026273), galactose (MESH:D005690), Ni (MESH:D009532), boric acid (MESH:C032688), EB (MESH:C478160), LB Broth (-), SDS (MESH:D012967), nitrogen (MESH:D009584), HEPES (MESH:D006531), HI (MESH:D006639), NaCl (MESH:D012965), CO2 (MESH:D002245), Lipofectamine (MESH:C086724), DTT (MESH:D004229), polyacrylamide (MESH:C016679), carbenicillin (MESH:D002228), Glycerol (MESH:D005990), penicillin (MESH:D010406), Tween-20 (MESH:D011136), phosphate (MESH:D010710), DMSO (MESH:D004121), streptomycin (MESH:D013307), EDTA (MESH:D004492)
- **Species:** Homo sapiens (human, species) [taxon 9606], Escherichia coli (E. coli, species) [taxon 562]
- **Mutations:** rs2896074, M0544S, R0193S, C for 12-16, rs13353548, rs3911240, E2611L, E0554S, rs77931854, R063L, rs28394479, rs863392, C>T, rs839154, M0345S, C3019H, rs2137643, rs7303642, rs559405101, R0145S, rs2465147, rs57527611, rs7032149, M0492L
- **Cell lines:** 293 — Homo sapiens (Human), Transformed cell line (CVCL_0045), -28a — Oryctolagus cuniculus (Rabbit), Transformed cell line (CVCL_6E94), Flp — Cricetulus griseus (Chinese hamster), Spontaneously immortalized cell line (CVCL_U424), Flp-In 293 — Homo sapiens (Human), Transformed cell line (CVCL_U006)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12803342/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12803342/full.md

## References

168 references — full list in the complete paper: https://tomesphere.com/paper/PMC12803342/full.md

---
Source: https://tomesphere.com/paper/PMC12803342