# Computational identification of rare pathogenic genomic variants in esophageal cancer markers: Transcript-level analysis, sequence-based insights, and structural-functional impacts of non-synonymous SNPs

**Authors:** Muhammad Bilal Azmi, Sajida Qureshi, Rafia Wasi, Saad Khalid Niaz, Syed Danish Haseen Ahmed

PMC · DOI: 10.1016/j.bbrep.2026.102503 · Biochemistry and Biophysics Reports · 2026-02-23

## TL;DR

This paper identifies rare harmful genetic changes in esophageal cancer genes and explores their impact on protein function and cancer progression.

## Contribution

The study computationally identifies and validates rare pathogenic nsSNPs in 28 EC-related genes, highlighting their structural and functional impacts.

## Key findings

- Rare nsSNPs in genes like GRB7, SLCO1A2, and HIF1AN disrupt protein stability and regulatory functions.
- Conserved residue substitutions are linked to structural and signaling pathway disruptions in EC-related proteins.

## Abstract

Esophageal cancer (EC) is a rapidly progressing malignancy that significantly contributes to cancer-related mortality. The genetic causes of EC, particularly rare coding pathogenic variants, remain incompletely defined. This study focuses on non-synonymous single nucleotide polymorphisms (nsSNPs) because they can impact the functions of critical proteins implicated in carcinogenesis.

We employed an extended computational framework for the identification and analysis of rare-coding nsSNPs within 28 EC-associated genes and identified 126 protein isoforms from public databases such as NCBI and ENSEMBL, focusing on those with MAF <1%. Functional predictions were evaluated using different bioinformatics tools to ascertain pathogenicity. Furthermore, ten rare-coding nsSNPs were mapped to 23 transcript-level variants within genes such as GRB7, SLCO1A2, HIF1AN, KCNQ3, and DLL1.

Computational analysis showed that these variants have a prominent impact on protein stability and function because substitutions occurred at conserved residues, thereby perturbing the stability and important regulatory attributes of the proteins. Structural modeling supported our findings: some variants may compromise domain integrity and influence the signal transduction pathways relevant to the progression of EC. The oncogenic potential of the conserved variants was also computationally validated using the Cscape tool, with candidates including rs1591837395 (G68R) in SLCO1A2 and rs758624092 (D201H) in HIF1AN. Collectively, this study identified rare and potentially pathogenic coding variants in EC-related genes and elaborated on their effects at the transcript, sequence, and structural levels.

Integrating these insights into EC provides an extended view of its molecular mechanisms and may be used in the near future as a basis for precision medicine for early diagnosis and targeted therapy.

•Rare deleterious nsSNPs in 28 esophageal cancer genes were screened computationally.•GRB7, SLCO1A2, HIF1AN, KCNQ3 and DLL1 showed high-impact variants.•Most variants affected conserved residues linked to structure and regulation.•Stability, flexibility and RMSF analyses indicated domain or signaling disruption.•Prioritized variants are proposed for experimental validation as biomarkers.

Rare deleterious nsSNPs in 28 esophageal cancer genes were screened computationally.

GRB7, SLCO1A2, HIF1AN, KCNQ3 and DLL1 showed high-impact variants.

Most variants affected conserved residues linked to structure and regulation.

Stability, flexibility and RMSF analyses indicated domain or signaling disruption.

Prioritized variants are proposed for experimental validation as biomarkers.

## Linked entities

- **Genes:** GRB7 (growth factor receptor bound protein 7) [NCBI Gene 2886], SLCO1A2 (solute carrier organic anion transporter family member 1A2) [NCBI Gene 6579], HIF1AN (hypoxia inducible factor 1 subunit alpha inhibitor) [NCBI Gene 55662], KCNQ3 (potassium voltage-gated channel subfamily Q member 3) [NCBI Gene 3786], DLL1 (delta like canonical Notch ligand 1) [NCBI Gene 28514]
- **Diseases:** esophageal cancer (MONDO:0007576)

## Full-text entities

- **Genes:** IFI6 (interferon alpha inducible protein 6) [NCBI Gene 2537] {aka 6-16, FAM14C, G1P3, IFI-6-16, IFI616}, RBBP6 (RB binding protein 6, ubiquitin ligase) [NCBI Gene 5930] {aka MY038, P2P-R, PACT, RBQ-1, SNAMA}, XRCC2 (X-ray repair cross complementing 2) [NCBI Gene 7516] {aka FANCU, POF17, SPGF50}, RHOU (ras homolog family member U) [NCBI Gene 58480] {aka ARHU, CDC42L1, G28K, WRCH1, hG28K}, SLC10A2 (solute carrier family 10 member 2) [NCBI Gene 6555] {aka ASBT, IBAT, ISBT, NTCP2, PBAM, PBAM1}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, ELMO2 (engulfment and cell motility 2) [NCBI Gene 63916] {aka CED-12, CED12, Ced-12A, ELMO-2, VMPI}, CARD6 (caspase recruitment domain family member 6) [NCBI Gene 84674] {aka CINCIN1}, GPR176 (G protein-coupled receptor 176) [NCBI Gene 11245] {aka HB-954}, PHLDA3 (pleckstrin homology like domain family A member 3) [NCBI Gene 23612] {aka TIH1}, ERCC1 (ERCC excision repair 1, endonuclease non-catalytic subunit) [NCBI Gene 2067] {aka COFS4, RAD10, UV20}, SLCO1A2 (solute carrier organic anion transporter family member 1A2) [NCBI Gene 6579] {aka OATP, OATP-A, OATP1A2, SLC21A3}, XPC (XPC complex subunit, DNA damage recognition and repair factor) [NCBI Gene 7508] {aka RAD4, XP3, XPCC, p125}, GNG7 (G protein subunit gamma 7) [NCBI Gene 2788] {aka HG3B}, ADH1B (alcohol dehydrogenase 1B (class I), beta polypeptide) [NCBI Gene 125] {aka ADH2, HEL-S-117}, ALDH2 (aldehyde dehydrogenase 2 family member) [NCBI Gene 217] {aka ALDH-E2, ALDHI, ALDM}, EVPL (envoplakin) [NCBI Gene 2125] {aka EVPK}, ZNF382 (zinc finger protein 382) [NCBI Gene 84911] {aka KS1}, HIF1A (hypoxia inducible factor 1 subunit alpha) [NCBI Gene 3091] {aka HIF-1-alpha, HIF-1A, HIF-1alpha, HIF1, HIF1-ALPHA, MOP1}, XRCC3 (X-ray repair cross complementing 3) [NCBI Gene 7517] {aka CMM6}, RNF187 (ring finger protein 187) [NCBI Gene 149603] {aka RACO-1, RACO1}, HIF1AN (hypoxia inducible factor 1 subunit alpha inhibitor) [NCBI Gene 55662] {aka FIH1}, QKI (QKI, KH domain containing RNA binding) [NCBI Gene 9444] {aka Hqk, QK, QK1, QK3, hqkI}, DMBT1 (deleted in malignant brain tumors 1) [NCBI Gene 1755] {aka GP340, SAG, SALSA, muclin}, GSTP1 (glutathione S-transferase pi 1) [NCBI Gene 2950] {aka DFN7, FAEES3, GST3, GSTP, GSTP1-1, HEL-S-22}, XRCC1 (X-ray repair cross complementing 1) [NCBI Gene 7515] {aka RCC, SCAR26}, RHCG (Rh family C glycoprotein) [NCBI Gene 51458] {aka C15orf6, PDRC2, RHGK, SLC42A3}, NELFA (negative elongation factor complex member A) [NCBI Gene 7469] {aka NELF-A, P/OKcl.15, WHSC2}, GRB7 (growth factor receptor bound protein 7) [NCBI Gene 2886], PRDM5 (PR/SET domain 5) [NCBI Gene 11107] {aka BCS2, PFM2}, KCNQ3 (potassium voltage-gated channel subfamily Q member 3) [NCBI Gene 3786] {aka BFNC2, EBN2, KV7.3}, CRNN (cornulin) [NCBI Gene 49860] {aka C1orf10, PDRC1, SEP53}, DLL1 (delta like canonical Notch ligand 1) [NCBI Gene 28514] {aka DELTA1, DL1, Delta, NEDBAS}, INPP5J (inositol polyphosphate-5-phosphatase J) [NCBI Gene 27124] {aka INPP5, PIB5PA, PIPP}
- **Diseases:** carcinogenesis (MESH:D063646), obesity (MESH:D009765), hypoxic (MESH:D002534), hypoxia (MESH:D000860), Barrett's esophagus (MESH:D001471), GERD (MESH:D005764), ESCC (MESH:D000077277), epithelial malignancies (MESH:D002277), EAC (MESH:D000230), Oesophageal cancer (MESH:D009369), CADD (MESH:D019966), EC (MESH:D004938), carcinogenic (MESH:D011230), carcinogenic transformation (MESH:D002472)
- **Chemicals:** glycine (MESH:D005998), polycyclic aromatic hydrocarbons (MESH:D011084), nitrosamine (MESH:D009602), ethanol (MESH:D000431), glutamate (MESH:D018698), oxygen (MESH:D010100), acids (MESH:D000143), phosphoinositide (MESH:D010716), histidine (MESH:D006639), CABS (MESH:C055322), lipids (MESH:D008055), alcohol (MESH:D000438), TPM (MESH:D000077236), hydrogen (MESH:D006859), acetaldehyde (MESH:D000079), ASA (-)
- **Species:** Nicotiana tabacum (American tobacco, species) [taxon 4097], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** E170G, rs1799793, D94H, G68R, W454C, G237R, R461 H/L, rs772287804, rs778186811, D440Y, rs3212986, rs1298634519

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12952785/full.md

## Figures

12 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12952785/full.md

## References

100 references — full list in the complete paper: https://tomesphere.com/paper/PMC12952785/full.md

---
Source: https://tomesphere.com/paper/PMC12952785