# Synonymous and non-synonymous variants at splice junctions can disrupt splicing and are frequently linked to disease associated loss of function genes

**Authors:** Sruthi Srinivasan, Swetha Subramanian, Hui Zhou, Anselm Hoppmann, Patrick Metzger, Melanie Boerries, Senthilkumar Ramamoorthy

PMC · DOI: 10.1186/s12864-025-12466-0 · BMC Genomics · 2025-12-23

## TL;DR

This study shows that genetic variants near RNA splice junctions can disrupt splicing and are often found in genes linked to disease through loss of function.

## Contribution

The study identifies that synonymous and missense variants near splice junctions are frequently associated with disease-related loss-of-function genes.

## Key findings

- Variants at splice donor and acceptor sites are enriched in genes intolerant to protein truncation.
- G > A and G > T mutations at splice junctions are common and often missense.
- Splice junction variants are rarely observed in population data, suggesting disease relevance.

## Abstract

RNA splicing facilitated by the spliceosome complex, relies on specific motifs at exon-intron junctions of pre-mRNAs to generate mature mRNAs. Mutations in splice junctions can disrupt splicing, potentially leading to premature protein truncation. Nucleotides within the exonic component of the junction are also essential for splicing. Evaluation of silent and missense variants in the exonic splice junction on RNA splicing is essential to investigate the significance of such variants in disease pathogenesis.

We analyzed cancer-associated silent and missense variants reported in the COSMIC database that are located within three nucleotides of splice donor and acceptor sites. We examined the prevalence of these variants in genes for which loss of function is a known mechanism of disease development. We also studied the performance of splicing impact prediction tools and evaluated their clinical relevance as well as their alignment with experimentally validated splicing outcomes.

Nucleotide composition analysis revealed a high preference for the nucleotide G at the donor 1 (d1) and acceptor (a1) positions, 87% and 69%, respectively. We observed a high prevalence of G > A and G > T variants at d1 and a1 positions. Interestingly, 66% to 86% of the identified variants at these positions are missense mutations, with G > T variants being specific for this type of mutation. Evolutionary conservation analysis indicates high nucleotide conservation for these positions at donor and acceptor sites, highlighting their importance at the nucleotide level. The frequently occurring variants are associated with tumor suppressor genes, and 58 of the top 100 genes have LOEUF scores below 1, indicating low tolerance to protein truncation. In contrast, such genes are rarely observed among population variants.

Our data driven computational study emphasizes the importance of evaluating silent and missense variants at splice junctions to understand their impact on RNA splicing. These variants may have a neutral effect on protein function. However, evaluating their effect at the RNA level is essential to understanding the significance of these variants in disease pathogenesis. This is particularly important for genes in which loss of function is the mechanism of disease development.

The online version contains supplementary material available at 10.1186/s12864-025-12466-0.

## Full-text entities

- **Genes:** BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, RB1 (RB transcriptional corepressor 1) [NCBI Gene 5925] {aka OSRC, PPP1R130, RB, p105-Rb, p110-RB1, pRb}, SF3B1 (splicing factor 3b subunit 1) [NCBI Gene 23451] {aka Hsh155, MDS, PRP10, PRPF10, SAP155, SF3b155}, PTEN (phosphatase and tensin homolog) [NCBI Gene 5728] {aka 10q23del, BZS, CWS1, DEC, GLM2, MHAM}, TWSG1 (twisted gastrulation BMP signaling modulator 1) [NCBI Gene 57045] {aka TSG}, GGH (gamma-glutamyl hydrolase) [NCBI Gene 8836] {aka GATD10, GH}, VWF (von Willebrand factor) [NCBI Gene 7450] {aka F8VWF, VWD}, MAF (MAF bZIP transcription factor) [NCBI Gene 4094] {aka AYGRP, CCA4, CTRCT21, c-MAF}, SPINK1 (serine peptidase inhibitor Kazal type 1) [NCBI Gene 6690] {aka PCTT, PSTI, Spink3, TATI, TCP}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, NF1 (neurofibromin 1) [NCBI Gene 4763] {aka NFNS, VRNF, WSS}, VHL (von Hippel-Lindau tumor suppressor) [NCBI Gene 7428] {aka HRCA1, RCA1, VHL1, pVHL}, SCN1A (sodium voltage-gated channel alpha subunit 1) [NCBI Gene 6323] {aka DEE6, DEE6A, DEE6B, DRVT, EIEE6, FEB3}, CDKN2A (cyclin dependent kinase inhibitor 2A) [NCBI Gene 1029] {aka ARF, CAI2, CDK4I, CDKN2, CMM2, INK4}, GH1 (growth hormone 1) [NCBI Gene 2688] {aka GH, GH-N, GHB5, GHN, IGHD1A, IGHD1B}, RUNX1 (RUNX family transcription factor 1) [NCBI Gene 861] {aka AML1, AML1-EVI-1, AMLCR1, CBF2alpha, CBFA2, EVI-1}, COL4A5 (collagen type IV alpha 5 chain) [NCBI Gene 1287] {aka ASLN, ATS, ATS1, CA54}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, ALK (ALK receptor tyrosine kinase) [NCBI Gene 238] {aka ALK1, CD246, NBLST3}, APC (APC regulator of Wnt signaling pathway) [NCBI Gene 324] {aka BTPS2, DESMD, DP2, DP2.5, DP3, GS}, POU1F1 (POU class 1 homeobox 1) [NCBI Gene 5449] {aka CPHD1, GHF-1, PIT1, POU1F1a, Pit-1}
- **Diseases:** LOEUF (MESH:D006315), COSMIC (MESH:D009369), pituitary hormone deficiency (MESH:C580003), genetic disorders (MESH:D030342), epilepsy (MESH:D004827), VEP (MESH:D065606), neurofibromatosis type 1 (MESH:D009456), oncogenes (MESH:D000074723), rare diseases (MESH:D035583), gnomAD (MESH:D042822), CRC (MESH:D015179), retinitis pigmentosa (MESH:D012174), adrenocortical tumor (MESH:D018268), tumorigenesis (MESH:D063646)
- **Chemicals:** guanine (MESH:D006147), ICGC (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** A > N, G > C, p.K1423R, p.T125M, p.Q449H, p.S2479S, T > N, A > G, C > T, A373G, C > G, G > T, p.C370Y, G > N, c.A373C, c.G672T, p.T125T, A > T, c.G375A, G > A
- **Cell lines:** COSMIC — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_IY73)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12838422/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12838422/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12838422/full.md

---
Source: https://tomesphere.com/paper/PMC12838422