# Integrative Analysis Reveals Conserved R-Loop Features in Mouse Embryonic Stem Cells

**Authors:** Ohbeom Kwon, Hyeonwoo La, Seonho Yoo, Hyeonji Lee, Heeji Lee, Hoseong Lim, Chanhyeok Park, Dong Wook Han, Jeong-Tae Do, Hyuk Song, Youngsok Choi, Kwonho Hong

PMC · DOI: 10.3390/epigenomes10010016 · 2026-03-02

## TL;DR

This study identifies conserved R-loop regions in mouse embryonic stem cells and shows they are linked to RNA metabolism and regulatory functions.

## Contribution

The first integrated analysis of conserved R-loop regions in mouse embryonic stem cells using multiple experimental platforms.

## Key findings

- Common R-loop regions are enriched in promoter-proximal and genic regions with CpG islands.
- Genes with Common R-loops are highly expressed and involved in RNA metabolic processes.
- Common R-loops are associated with specific transcription factor motifs like OCT4-SOX2 and CTCF.

## Abstract

R-loops, three-stranded nucleic acid structures formed by an RNA-DNA hybrid, have emerged as important regulators of transcription and genome stability. Although advances in high-throughput sequencing have revealed widespread R-loop landscapes, platform-specific biases hinder the identification of conserved R-loops in specific cell types. Mouse embryonic stem cells, which are transcriptionally active, provide an ideal system for investigating the potential roles of stable R-loops in RNA biology. Here, we integrated 13 independent R-loop profiling datasets from four experimental platforms to define 27,950 Common R-loop regions in mouse embryonic stem cells and characterized their chromatin environment and associated biological functions. Common R-loop regions were reproducibly detected across methods and were preferentially localized to promoter-proximal and genic regions enriched in CpG islands. Genes associated with Common R-loops were highly and stably expressed, showing strong functional enrichment in RNA metabolic processes such as mRNA processing, RNA splicing, and ribonucleoprotein complex biogenesis. Chromatin state analysis revealed that Common R-loops are enriched in transcriptionally active and regulatory contexts. Sequence feature analysis further identified GC skew as a prominent signature of Common R-loops, particularly within transcribed chromatin states. Transcription factor motif analyses have identified distinct regulatory environments in Common R-loop regions, including pluripotency-associated OCT4-SOX2-TCF-NANOG motifs in enhancers, CTCF motifs in open chromatin, and YY1 motifs in promoters. Together, this study provides the first integrated analysis of conserved R-loop regions in mouse embryonic stem cells, revealing their preferential localization at regulatory loci linked to RNA metabolism and highlighting R-loops as structural and functional nodes in RNA biology.

## Linked entities

- **Genes:** POU5F1 (POU class 5 homeobox 1) [NCBI Gene 5460], SOX2 (SRY-box transcription factor 2) [NCBI Gene 6657], HNF4A (hepatocyte nuclear factor 4 alpha) [NCBI Gene 3172], NANOG (Nanog homeobox) [NCBI Gene 79923], CTCF (CCCTC-binding factor) [NCBI Gene 10664], YY1 (YY1 transcription factor) [NCBI Gene 7528]
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** Gapdh (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 14433] {aka Gapd}, Ctcf (CCCTC-binding factor) [NCBI Gene 13018], Yy1 (YY1 transcription factor) [NCBI Gene 22632] {aka NF-E1, YY-1}, Nanog (Nanog homeobox) [NCBI Gene 71950] {aka 2410002E02Rik, ENK, Stm1, ecat4}, Pou5f1 (POU domain, class 5, transcription factor 1) [NCBI Gene 18999] {aka NF-A3, Oct-3, Oct-3/4, Oct-4, Oct3, Oct3/4}, Sox2 (SRY (sex determining region Y)-box 2) [NCBI Gene 20674] {aka Sox-2, lcc, ysb}, Actb (actin, beta) [NCBI Gene 11461] {aka Actx, E430023M04Rik, beta-actin}, Ostn (osteocrin) [NCBI Gene 239790] {aka Ostc}, Xab2 (XPA binding protein 2) [NCBI Gene 67439] {aka 0610041O14Rik}
- **Diseases:** cancer (MESH:D009369), leukemia (MESH:D007938), tumorigenesis (MESH:D063646), injury to (MESH:D014947), RNA metabolism disorders (MESH:D012327), neurodegenerative disorders (MESH:D019636), immunodeficiencies (MESH:D007153), -associated (MESH:D018886)
- **Chemicals:** MNase (-)
- **Species:** Musculus (genus) [taxon 112137], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13025856/full.md

---
Source: https://tomesphere.com/paper/PMC13025856