# Transcriptional competence defines the heterochromatin nucleating potential of isolated MSR units

**Authors:** Yi-Hsuan Lo, Nicholas Shukeir, Galina Erikson, Raghu Ram Edupuganti, Reagan Ching, Deepika Puri, Lisa Jerabek, Ramin Shiekhattar, Thomas Jenuwein

PMC · DOI: 10.1038/s41467-026-70991-2 · Nature Communications · 2026-03-20

## TL;DR

This study shows that transcriptionally active mouse satellite repeats can form heterochromatin, revealing a DNA/RNA-based mechanism for heterochromatin formation.

## Contribution

The study identifies transcriptional competence as a key factor in MSR units nucleating de novo heterochromatin.

## Key findings

- Transcriptionally competent MSR units induce heterochromatin marks and recruit HP1.
- Multi-copy intact MSR units create an unwound DNA template that aids RNA polymerase engagement.
- MSR-originating transcripts are regulated by the Integrator complex and are bidirectional.

## Abstract

In mouse cells, constitutive heterochromatin is associated with underlying arrays of A/T-rich DNA repeat elements, called the major satellite repeats (MaSat or MSR). We examine >18,000 MSR copies in mouse ES cells and identify that heterochromatin forms only at transcriptionally competent MSR units. To directly dissect the function of MSR DNA, we insert isolated MSR units into an inert genomic region that is repeat- and gene-free. Insertion of three or more intact MSR units induces heterochromatic histone marks, recruitment of HP1 and incorporation of histone H1. Only transcriptionally competent MSR units, but not permutated MSR variants or LINE1 5’UTR elements, nucleate de novo heterochromatin. MSR-derived transcription is bi-directional and MSR-originating transcripts are attenuated by the RNAPII-associated Integrator complex. Instructively, multi-copy intact MSR units impart an unwound DNA template that facilitates RNAPII engagement. Together, this study uncovers a DNA/RNA-based logic and transcription-coupled mechanism for the nucleation of heterochromatin.

Multiple copies of intact MSR nucleate de novo heterochromatin and reveal that the establishment of mouse pericentric heterochromatin is associated with bidirectional repeat RNA, Integrator complex engagement, and topological alterations.

## Linked entities

- **Proteins:** DEFA1 (defensin alpha 1), Polr2A (RNA polymerase II subunit A)
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Genes:** F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, CBX5 (chromobox 5) [NCBI Gene 23468] {aka HEL25, HP1, HP1A, HP1alpha}, MALAT1 (metastasis associated lung adenocarcinoma transcript 1) [NCBI Gene 378938] {aka HCN, LINC00047, NCRNA00047, NEAT2, PRO2853, miPEP-52}, H1f2 (H1.2 linker histone, cluster member) [NCBI Gene 50708] {aka 0610008C09Rik, H1-2, H1.2, H1c, H1var1, His1a}, Lif (leukemia inhibitory factor) [NCBI Gene 16878], Msr1 (macrophage scavenger receptor 1) [NCBI Gene 20288] {aka MRS-A, MSR, MSR-A, SR-A, SR-AI, SR-AII}, Polr2A (RNA polymerase II subunit A) [NCBI Gene 32100] {aka 5, 8WG16, CG1554, CTD, DmCTD, Dmel\CG1554}, Ints9 (integrator complex subunit 9) [NCBI Gene 210925] {aka D14Ertd231e}, Exoc6 (exocyst complex component 6) [NCBI Gene 107371] {aka 4833405E05Rik, C430002C19, Sec15, Sec15l1, hbd, msec15}, MSR1 (macrophage scavenger receptor 1) [NCBI Gene 4481] {aka CD204, SCARA1, SR-A, SR-AI, SR-AII, SR-AIII}, Cbx5 (chromobox 5) [NCBI Gene 12419] {aka 2610029O15Rik, HP1, Hp1a, Hp1alpha}, Rpa1 (replication protein A1) [NCBI Gene 68275] {aka 5031405K23Rik, 70kDa, RF-A, RP-A, Rpa}, TRIM28 (tripartite motif containing 28) [NCBI Gene 10155] {aka KAP1, PPP1R157, RNF96, TF1B, TIF1B, TIF1beta}, Polr2a (polymerase (RNA) II (DNA directed) polypeptide A) [NCBI Gene 20020] {aka 220kDa, Rpb1, Rpo2-1}, Tmco5a (transmembrane and coiled-coil domains 5A) [NCBI Gene 67356] {aka 1700095F04Rik, Tmco5}, HBD (hypophosphatemic bone disease) [NCBI Gene 100187828], MAGEC1 (MAGE family member C1) [NCBI Gene 9947] {aka CT7, CT7.1}, Top2a (topoisomerase (DNA) II alpha) [NCBI Gene 21973] {aka Top-2}, SUV39H1 (SUV39H1 histone lysine methyltransferase) [NCBI Gene 6839] {aka H3-K9-HMTase 1, KMT1A, MG44, SUV39H}, Cmv3 (cytomegalovirus resistance 3) [NCBI Gene 100035620], Ints12 (integrator complex subunit 12) [NCBI Gene 71793] {aka 1110020M19Rik, 2810027J24Rik, 4930529N21Rik, A230056J18Rik, Phf22, int12}, Rnase1 (ribonuclease, RNase A family, 1 (pancreatic)) [NCBI Gene 19752] {aka Rib-1, Rib1}, RPA1 (replication protein A1) [NCBI Gene 6117] {aka HSSB, MST075, PFBMFT6, REPA1, RF-A, RP-A}, Dhfr (dihydrofolate reductase) [NCBI Gene 13361] {aka 8430436I03Rik}, Hprt1 (hypoxanthine phosphoribosyltransferase 1) [NCBI Gene 15452] {aka HPGRT, Hprt}, MTRR (5-methyltetrahydrofolate-homocysteine methyltransferase reductase) [NCBI Gene 4552] {aka MSR, cblE}, Meis2 (Meis homeobox 2) [NCBI Gene 17536] {aka A430109D20Rik, Mrg1, Stra10}, Gapdh (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 14433] {aka Gapd}, H1-0 (H1.0 linker histone) [NCBI Gene 3005] {aka H1.0, H10, H1F0, H1FV}, SETDB1 (SET domain bifurcated histone lysine methyltransferase 1) [NCBI Gene 9869] {aka ESET, H3-K9-HMTase4, KG1T, KMT1E, TDRD21}
- **Diseases:** ES (MESH:D012512), cancer (MESH:D009369), CMV (MESH:D003586), mycoplasma (MESH:D009175)
- **Chemicals:** NaHCO3 (MESH:D017693), CO2 (MESH:D002245), Trimethoprim (MESH:D014295), Etoposide (MESH:D005047), 2-mercaptoethanol (MESH:D008623), agarose (MESH:D012685), ChemScene (-), Sucrose (MESH:D013395), streptomycin (MESH:D013307), formaldehyde (MESH:D005557), DSG (MESH:C037258), oligonucleotides (MESH:D009841), MgCl2 (MESH:D015636), SDS (MESH:D012967), KCL (MESH:D011189), nitrogen (MESH:D009584), essential amino acids (MESH:D000601), Glucose (MESH:D005947), NaCl (MESH:D012965), Puromycin (MESH:D011691), Lipofectamine (MESH:C086724), PBS (MESH:D007854), L-glutamine (MESH:D005973), ethanol (MESH:D000431), DMSO (MESH:D004121), TRIzol (MESH:C411644), penicillin (MESH:D010406), water (MESH:D014867), Trypan Blue (MESH:D014343), Laemmli buffer (MESH:C088816), NP-40 (MESH:C010615), CaCl2 (MESH:D002122), DTT (MESH:D004229), T (MESH:D014316), EB (MESH:C478160), glycine (MESH:D005998)
- **Species:** Schizosaccharomyces pombe (fission yeast, species) [taxon 4896], Mus musculus (house mouse, species) [taxon 10090], Escherichia coli (E. coli, species) [taxon 562], Homo sapiens (human, species) [taxon 9606], Human betaherpesvirus 5 (no rank) [taxon 10359], Erysiphe sp. RV (species) [taxon 662690], Drosophila melanogaster (fruit fly, species) [taxon 7227]
- **Mutations:** 208 bp G, 234 bp A, M0658S, N0466S
- **Cell lines:** WT26 — Homo sapiens (Human), Kidney Wilms tumor, Cancer cell line (CVCL_6D82), COS-7 — Chlorocebus aethiops (Green monkey), Transformed cell line (CVCL_0224), mESC — Homo sapiens (Human), Embryonic stem cell (CVCL_UI95), E1383 — Homo sapiens (Human), Fibrosarcoma, Cancer cell line (CVCL_ZZ50), ES — Homo sapiens (Human), Embryonic stem cell (CVCL_C769), DN57 — Homo sapiens (Human), Melanoma, Cancer cell line (CVCL_6226)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13005003/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13005003/full.md

## References

4 references — full list in the complete paper: https://tomesphere.com/paper/PMC13005003/full.md

---
Source: https://tomesphere.com/paper/PMC13005003