# Interconnected Codons: Unravelling the Epigenetic Significance of Flanking Sequences in CpG Dyads

**Authors:** Leo Douglas Creasey, Eran Tauber

PMC · DOI: 10.1007/s00239-024-10172-1 · Journal of Molecular Evolution · 2024-04-18

## TL;DR

This paper explores how pairs of codons containing CpG sequences influence gene function and regulation, especially during embryonic development.

## Contribution

The study reveals a novel connection between CpG codon dyads and gene function, linking them to specific biological processes and developmental stages.

## Key findings

- High CpG codon dyad density is associated with homeobox domains and RNA polymerase II transcription factors.
- Low CpG codon dyad density is linked to DNA damage repair and mitosis.
- CpG-containing genes are more expressed during embryonic development, suggesting a role in gene regulation.

## Abstract

Hypothesizing that CpG codon dyads, formed by consecutive codons containing a cytosine-guanine pair (NNC-GNN), may play a crucial role in gene function, we conducted an extensive analysis to investigate their distribution and conservation within mammalian genes. Our findings reveal that genes characterized by a high density of CpG codon dyads are notably associated with homeobox domains and RNA polymerase II transcription factors. Conversely, genes exhibiting low CpG codon dyad density have links to DNA damage repair and mitosis. Importantly, our study identifies a re-markable increase in expressed genes that harbor CpG during embryonic development, suggesting their potential involvement in gene regulation at these developmental stages. These results under-score the functional significance of CpG codon dyads in DNA methylation and gene expression, fur-ther demonstrating the coevolution of consecutive codons and their contribution to codon usage bias.

The online version contains supplementary material available at 10.1007/s00239-024-10172-1.

## Full-text entities

- **Genes:** PAX6 (paired box 6) [NCBI Gene 5080] {aka AN, AN1, AN2, ASGD5, D11S812E, FVH1}, STAR (steroidogenic acute regulatory protein) [NCBI Gene 6770] {aka STARD1}, SLC12A3 (solute carrier family 12 member 3) [NCBI Gene 6559] {aka NCC, NCCT, TSC}, DNMT1 (DNA methyltransferase 1) [NCBI Gene 1786] {aka ADCADN, AIM, CXXC9, DNMT, HSN1E, MCMT}, IL6 (interleukin 6) [NCBI Gene 3569] {aka BSF-2, BSF2, CDF, HGF, HSF, IFN-beta-2}, TTC41P (tetratricopeptide repeat domain 41, pseudogene) [NCBI Gene 253724] {aka GNN, GNNP}, TSPAN6 (tetraspanin 6) [NCBI Gene 7105] {aka T245, TM4SF6, TSPAN-6}, PXMP4 (peroxisomal membrane protein 4) [NCBI Gene 11264] {aka PMP24}, FOXD2 (forkhead box D2) [NCBI Gene 2306] {aka FKHL17, FREAC-9, FREAC9}
- **Diseases:** CDS (MESH:D010855), cancer (MESH:D009369)
- **Chemicals:** Bisulfite (MESH:C042345), -acid (MESH:D000143), 5-methylcytosine (MESH:D044503), cytosine (MESH:D003596)
- **Species:** Caenorhabditis elegans (species) [taxon 6239], Mus musculus (house mouse, species) [taxon 10090], Drosophila melanogaster (fruit fly, species) [taxon 7227], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** HeLa — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_0030), S2 — Drosophila melanogaster (Fruit fly), Spontaneously immortalized cell line (CVCL_Z232)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11169003/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11169003/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/PMC11169003/full.md

---
Source: https://tomesphere.com/paper/PMC11169003