# The role of cytosine modification symmetry in mammalian epigenome regulation

**Authors:** Zeyneb Vildan Cakil, Lena Engelhard, Daniel Summerer

PMC · DOI: 10.1039/d5sc09022a · 2025-12-29

## TL;DR

This paper explores how different combinations of cytosine modifications in DNA affect genome regulation and chromatin interactions.

## Contribution

The paper reviews recent advances in understanding the functional roles of symmetric and asymmetric cytosine modifications in CpG dyads.

## Key findings

- Cytosine modifications in CpG dyads create unique DNA signatures influencing chromatin regulation.
- Technical progress allows better sequencing and mapping of individual CpG dyad states.
- Understanding these modifications could improve insights into development and disease mechanisms.

## Abstract

5-Methylcytosine (mC) is a key regulatory element of mammalian genomes, and plays important roles in development and disease. mC is predominantly written onto CpG dyads by DNA methyltransferases, and can be further oxidized by ten-eleven translocation dioxygenases (TETs) to 5-hydroxymethyl-, 5-formyl-, and 5-carboxylcytosine. This process results in different symmetric and asymmetric combinations of cytosine forms across the two strands of CpGs, each of which represents a unique physicochemical signature in the major groove of DNA. A comprehensive understanding of the individual functions of oxidized mC modifications can therefore only be achieved by considering both strands of CpG dyads. Here, we provide a brief overview of the current state of knowledge on the sequencing and mapping of individual CpG dyad states, their influence on the intrinsic properties of DNA, and their interactions with chromatin proteins.

Cytosine 5-modifications are central to human chromatin regulation. They occur in different combinations in the two DNA strands of CpG dyads, but the functions of individual combinations are poorly understood. We review recent technical developments and biological insights in this area.

## Linked entities

- **Proteins:** tet(S) (tetracycline resistance ribosomal protection protein Tet(S))
- **Chemicals:** 5-Methylcytosine (PubChem CID 65040), 5-hydroxymethylcytosine (PubChem CID 70751), 5-formylcytosine (PubChem CID 10986305), 5-carboxylcytosine (PubChem CID 77213)

## Full-text entities

- **Genes:** WT1 (WT1 transcription factor) [NCBI Gene 7490] {aka AWT1, GUD, NPHS4, WAGR, WIT-2, WT-1}, UHRF2 (ubiquitin like with PHD and ring finger domains 2) [NCBI Gene 115426] {aka NIRF, RNF107, TDRD23, URF2}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, ETS1 (ETS proto-oncogene 1, transcription factor) [NCBI Gene 2113] {aka ETS-1, EWSR2, c-ets-1, p54}, MBD1 (methyl-CpG binding domain protein 1) [NCBI Gene 4152] {aka CXXC3, PCM1, RFT}, TET3 (tet methylcytosine dioxygenase 3) [NCBI Gene 200424] {aka BEFAHRS, hCG_40738}, MBD5 (methyl-CpG binding domain protein 5) [NCBI Gene 55777] {aka C2DELq23.1, DEL2Q23.1, MRD1}, DNMT1 (DNA methyltransferase 1) [NCBI Gene 1786] {aka ADCADN, AIM, CXXC9, DNMT, HSN1E, MCMT}, MBD3 (methyl-CpG binding domain protein 3) [NCBI Gene 53615], TDG (thymine DNA glycosylase) [NCBI Gene 6996] {aka hTDG}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, MECP2 (methyl-CpG binding protein 2) [NCBI Gene 4204] {aka AUTSX3, MRX16, MRX79, MRXS13, MRXSL, PPMX}, PKNOX2 (PBX/knotted 1 homeobox 2) [NCBI Gene 63876] {aka PREP2}, RFX5 (regulatory factor X5) [NCBI Gene 5993] {aka MHC2D3, MHC2D5}, TCF4 (transcription factor 4) [NCBI Gene 6925] {aka CDG2T, E2-2, FCD2, FECD3, ITF-2, ITF2}, MBD6 (methyl-CpG binding domain protein 6) [NCBI Gene 114785], MBD4 (methyl-CpG binding domain 4, DNA glycosylase) [NCBI Gene 8930] {aka MED1, TPDS2, UVM1}, CA2 (carbonic anhydrase 2) [NCBI Gene 760] {aka CA-II, CAC, CAII, Car2, HEL-76, HEL-S-282}, MLX (MAX dimerization protein MLX) [NCBI Gene 6945] {aka MAD7, MXD7, TCFL4, TF4, bHLHd13}, MBD2 (methyl-CpG binding domain protein 2) [NCBI Gene 8932] {aka DMTase, NY-CO-41}
- **Diseases:** cancer (MESH:D009369)
- **Chemicals:** 5-carboxylcytosine (MESH:C560974), CpG (MESH:C015772), alpha-ketoglutarate (MESH:D007656), malononitrile (MESH:C000945), 5-hydroxymethyl-, 5-formyl-, and 5-carboxylcytosine (-), Cytosine (MESH:D003596), oxi (MESH:D000068759), GSH (MESH:D005978), S-adenosylmethionine (MESH:D012436), Cs (MESH:D002586), imine (MESH:D007097), Hydrogen (MESH:D006859), borane (MESH:D001880), water (MESH:D014867), 5-formylcytosine (MESH:C560973), 5-Methylcytosine (MESH:D044503), succinate (MESH:D019802), 5-hydroxymethylcytosine (MESH:C011865), C (MESH:D002244), phosphate (MESH:D010710)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Mutations:** A3A, lysine/arginine
- **Cell lines:** mESC — Mus musculus (Mouse), Embryonic stem cell (CVCL_4378)

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12796950/full.md

---
Source: https://tomesphere.com/paper/PMC12796950