# N-terminal tagging of RNA Polymerase II shapes transcriptomes more than C-terminal alterations

**Authors:** Adam Callan-Sidat, Emmanuel Zewdu, Massimo Cavallaro, Juntai Liu, Daniel Hebenstreit

PMC · DOI: 10.1016/j.isci.2024.109914 · 2024-05-07

## TL;DR

Modifying the N-terminus of RNA Polymerase II has a bigger impact on gene expression than changing its C-terminal domain.

## Contribution

N-terminal tagging of Pol II, not CTD alterations, shapes transcriptomes and affects non-coding RNA and LLPS-related factors.

## Key findings

- Transcriptional bursting remains largely unchanged with CTD modifications.
- N-terminal tags significantly alter transcriptome-wide gene expression patterns.
- N-terminal tags correlate with changes in non-coding RNA and LLPS-related factor expression.

## Abstract

RNA polymerase II (Pol II) has a C-terminal domain (CTD) that is unstructured, consisting of a large number of heptad repeats, and whose precise function remains unclear. Here, we investigate how altering the CTD’s length and fusing it with protein tags affects transcriptional output on a genome-wide scale in mammalian cells at single-cell resolution. While transcription generally appears to occur in burst-like fashion, where RNA is predominantly made during short bursts of activity that are interspersed with periods of transcriptional silence, the CTD’s role in shaping these dynamics seems gene-dependent; global patterns of bursting appear mostly robust to CTD alterations. Introducing protein tags with defined structures to the N terminus causes transcriptome-wide effects, however. We find the type of tag to dominate characteristics of the resulting transcriptomes. This is possibly due to Pol II-interacting factors, including non-coding RNAs, whose expression correlates with the tags. Proteins involved in liquid-liquid phase separation appear prominently.

- ScRNA-seq datasets for cells expressing CTD-mutant RNA polymerase II (Pol II)
- Transcriptional bursting is robust to Pol II CTD alterations
- N-terminal rather than CTD modifications of Pol II shape transcriptomes
- Altered expression of ncRNAs and LLPS-related factors due to N-terminal Pol II tags

**Subject areas:** Molecular biology; Biophysics; Transcriptomics
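The abstract describes transcription as burst-like, with RNA made in short active periods separated by silence. As a rough illustration of how such bursting is often summarised from single-cell counts, the sketch below uses a standard method-of-moments estimate. It assumes the bursty limit of the two-state (telegraph) model, in which steady-state mRNA counts follow a negative binomial distribution. This is not necessarily the inference procedure used in the paper, and the function name `burst_moments` and the example counts are hypothetical.

```python
from statistics import mean, pvariance


def burst_moments(counts):
    """Method-of-moments burst estimates for one gene across cells.

    Assumption (not taken from the paper): counts are negative-binomial,
    i.e. mean = a * b and variance = a * b * (1 + b), where a is the burst
    frequency (bursts per mRNA lifetime) and b is the mean burst size.

    Returns (burst_frequency, burst_size), or None when the counts are not
    over-dispersed (variance <= mean), where the estimates are undefined.
    """
    m = mean(counts)
    v = pvariance(counts)  # population variance across cells
    if m <= 0 or v <= m:
        return None
    burst_size = v / m - 1.0          # Fano factor minus one
    burst_frequency = m / burst_size  # in units of the mRNA degradation rate
    return burst_frequency, burst_size


# Hypothetical counts for one gene in ten cells: mostly silent, two bursts.
example_counts = [0, 0, 0, 0, 10, 0, 0, 0, 0, 10]
print(burst_moments(example_counts))
```

Under this model, a change in burst size shows up as a change in the Fano factor (variance over mean), while a change in burst frequency shifts the mean at a roughly constant Fano factor, which is one common way to compare bursting between conditions.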

## Linked entities

- **Proteins:** RNA polymerase II (DNA-directed RNA polymerase II subunit RPB7), Polr2A (RNA polymerase II subunit A)

## Full-text entities

- **Genes:** ATP8A2 (ATPase phospholipid transporting 8A2) [NCBI Gene 51761] {aka ATP, ATPIB, CAMRQ4, IB, ML-1}, DST (dystonin) [NCBI Gene 667] {aka BP240, BPA, BPAG1, CATX-15, CATX15, CMYO29}, HFM1 (helicase for meiosis 1) [NCBI Gene 164045] {aka MER3, POF9, SEC63D1, Si-11, Si-11-6, helicase}, POLR2A (RNA polymerase II subunit A) [NCBI Gene 5430] {aka NEDHIB, POLR2, POLRA, RPB1, RPBh1, RPO2}, Polr2a (polymerase (RNA) II (DNA directed) polypeptide A) [NCBI Gene 20020] {aka 220kDa, Rpb1, Rpo2-1}, GOLGB1 (golgin B1) [NCBI Gene 2804] {aka GCP, GCP372, GOLIM1}, MIF (macrophage migration inhibitory factor) [NCBI Gene 4282] {aka GIF, GLIF, MMIF}, MALAT1 (metastasis associated lung adenocarcinoma transcript 1) [NCBI Gene 378938] {aka HCN, LINC00047, NCRNA00047, NEAT2, PRO2853, miPEP-52}, HNRNPU (heterogeneous nuclear ribonucleoprotein U) [NCBI Gene 3192] {aka DEE54, EIEE54, GRIP120, HNRNPU-AS1, HNRPU, SAF-A}, CRYAB (crystallin alpha B) [NCBI Gene 1410] {aka CMD1II, CRYA2, CTPP2, CTRCT16, HEL-S-101, HSPB5}, GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, NEAT1 (nuclear paraspeckle assembly transcript 1) [NCBI Gene 283131] {aka LINC00084, NCRNA00084, TP53LC15, TncRNA, VINC}, CTSL (cathepsin L) [NCBI Gene 1514] {aka CATL, CTSL1, MEP}, DDX5 (DEAD-box helicase 5) [NCBI Gene 1655] {aka G17P1, HLR1, HUMP68, p68}, GOLGA4 (golgin A4) [NCBI Gene 2803] {aka CRPF46, GCP2, GOLG, MU-RMS-40.18, p230}
- **Diseases:** osteosarcoma (MESH:D012516), CTD (OMIM:211750), LLPS (MESH:D000210), inflammatory (MESH:D007249)
- **Chemicals:** EDTA (MESH:D004492), alpha-amanitin (MESH:D053959), Halo (-), CO2 (MESH:D002245), DTT (MESH:D004229), Laemmli buffer (MESH:C088816), Hoechst 33342 (MESH:C017807), Oil (MESH:D009821), Chromium (MESH:D002857), PBS (MESH:D007854), camptothecin (MESH:D002166), Phenol Red (MESH:D010637)
- **Species:** Homo sapiens (human, species) [taxon 9606], Drosophila melanogaster (fruit fly, species) [taxon 7227], Saccharomyces cerevisiae (baker's yeast, species) [taxon 4932], Schizosaccharomyces pombe (fission yeast, species) [taxon 4896], Mus musculus (house mouse, species) [taxon 10090]
- **Mutations:** S17A, N792D, S17, R120G, S17C
- **Cell lines:** S3 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z233), H25 — Homo sapiens (Human), Transformed cell line (CVCL_KS50), D25 — Canis lupus familiaris (Dog), Canine mammary carcinoma, Cancer cell line (CVCL_C1IG), S20 — Mus musculus (Mouse), Mouse neuroblastoma, Cancer cell line (CVCL_VU14), S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232), A549 — Homo sapiens (Human), Lung adenocarcinoma, Cancer cell line (CVCL_0023), U2OS — Homo sapiens (Human), Osteosarcoma, Cancer cell line (CVCL_0042), Dendra2 — Homo sapiens (Human), Colon carcinoma, Cancer cell line (CVCL_A628), D52 — Mus musculus (Mouse), Hybridoma (CVCL_A7AF)

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11126984/full.md

---
Source: https://tomesphere.com/paper/PMC11126984