# Alu-mediated RNA duplexes are associated with widespread exon skipping across primate transcriptomes

**Authors:** Hyunbeen Lee, Xinang Cao, Guillermo E. Parada, Jack Daiyang Li, Ira A. Iosub, Ulrich Braunschweig, Kenny Rebelo, Jernej Ule, Yue Wan, Benjamin J. Blencowe

PMC · DOI: 10.1186/s13059-026-04029-7 · 2026-03-26

## TL;DR

This paper shows that RNA structures formed by Alu repeats are linked to exon skipping in primates and are regulated by specific RNA-binding proteins.

## Contribution

The study reveals that Alu-mediated RNA duplexes are a major driver of exon skipping divergence in primate transcriptomes.

## Key findings

- Most long-range intronic RNA duplexes are mediated by inverted Alu-repeat elements.
- Alu-derived RNA duplexes are associated with divergent exon skipping patterns in primates.
- RNA binding proteins HNRNPC, ILF2, and ILF3 regulate Alu duplex-associated splicing.

## Abstract

Alternative splicing patterns have diverged rapidly during vertebrate evolution. By integrating genome-wide predictions of stable RNA duplexes, alternative splicing profiles, and proximity ligation-detection of RNA-RNA interactions, we observe that the majority of long-range intronic RNA duplexes are mediated by inverted Alu-repeat elements, and that these structures are associated with divergent exon skipping patterns in primates. We further observe evidence that the RNA binding proteins HNRNPC, ILF2 and ILF3 directly control Alu duplex-associated alternative splicing levels. Collectively, our results provide evidence that Alu-derived RNA duplexes modulated by RNA binding proteins have contributed to the remarkable divergence in alternative splicing patterns during mammalian evolution.

The online version contains supplementary material available at 10.1186/s13059-026-04029-7.

## Linked entities

- **Proteins:** HNRNPC (heterogeneous nuclear ribonucleoprotein C), ILF2 (interleukin enhancer binding factor 2), ILF3 (interleukin enhancer binding factor 3)

## Full-text entities

- **Genes:** RBP4 (retinol binding protein 4) [NCBI Gene 5950] {aka MCOPCB10, RDCCAS}, ACSM1 (acyl-CoA synthetase medium chain family member 1) [NCBI Gene 116285] {aka BUCS1, MACS1}, FASTKD1 (FAST kinase domains 1) [NCBI Gene 79675], DONSON (DNA replication fork stabilization factor DONSON) [NCBI Gene 29980] {aka B17, C21orf60, MGORS10, MIMIS, MISSLA}, PAMR1 (peptidase domain containing associated with muscle regeneration 1) [NCBI Gene 25891] {aka DKFZP586H2123, FP938, RAMP}, PLA2G2D (phospholipase A2 group IID) [NCBI Gene 26279] {aka PLA2IID, SPLASH, sPLA2-IID, sPLA2S}, EEF1A2 (eukaryotic translation elongation factor 1 alpha 2) [NCBI Gene 1917] {aka DEE33, EEF1AL, EF-1-alpha-2, EF1A, EIEE33, HS1}, ILF2 (interleukin enhancer binding factor 2) [NCBI Gene 3608] {aka NF45, PRO3063}, NQO2 (N-ribosyldihydronicotinamide:quinone dehydrogenase 2) [NCBI Gene 4835] {aka DHQV, DIA6, NMOR2, QR2}, HNRNPC (heterogeneous nuclear ribonucleoprotein C) [NCBI Gene 3183] {aka HNRNP, HNRPC, MRD74, SNRPC}, ILF3 (interleukin enhancer binding factor 3) [NCBI Gene 3609] {aka CBTF, DRBF, DRBP76, MMP4, MPHOSPH4, MPP4}, GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, PLSCR1 (phospholipid scramblase 1) [NCBI Gene 5359] {aka MMTRA1B}, LGR5 (leucine rich repeat containing G protein-coupled receptor 5) [NCBI Gene 8549] {aka FEX, GPR49, GPR67, GRP49, HG38}, DDIAS (DNA damage induced apoptosis suppressor) [NCBI Gene 220042] {aka C11orf82, noxin}
- **Diseases:** H-I (MESH:D000848), brain disorders (MESH:D001927), cancer (MESH:D009369), microcephalic dwarfism (MESH:C537533)
- **Chemicals:** L-glutamine (MESH:D005973), Lipofectamine (MESH:C086724), KCl (MESH:D011189), streptomycin (MESH:D013307), agarose (MESH:D012685), ethidium bromide (MESH:D004996), Biotin PEG4 alkyne (-), biotin (MESH:D001710), TRIzol (MESH:C411644), EDTA (MESH:D004492), MgCl2 (MESH:D015636), penicillin (MESH:D010406), PBS (MESH:D007854), amino acids (MESH:D000596), water (MESH:D014867), NaCl (MESH:D012965), urea (MESH:D014508), glucose (MESH:D005947), copper (MESH:D003300), HCl (MESH:D006851), CO2 (MESH:D002245)
- **Species:** Macaca (macaque, genus) [taxon 9539], Mus musculus (house mouse, species) [taxon 10090], Pan troglodytes (chimpanzee, species) [taxon 9598], Homo sapiens (human, species) [taxon 9606], Mycoplasma (genus) [taxon 2093]
- **Cell lines:** HEK293T — Homo sapiens (Human), Transformed cell line (CVCL_0063), K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004), HepG2 — Homo sapiens (Human), Hepatoblastoma, Cancer cell line (CVCL_0027), S2H — Homo sapiens (Human), Soft tissue sarcoma, Cancer cell line (CVCL_JB75)

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13019944/full.md

---
Source: https://tomesphere.com/paper/PMC13019944