# Systematic analysis of specificities and flanking sequence preferences of bacterial DNA-(cytosine C5)-methyltransferases reveals mechanisms of enzyme- and sequence-specific DNA readout

**Authors:** Greta Sogl, Sabrina Pilling, Lukas F J Fischer, Jan Ludwig, Nahom Mihretu, Pavel Bashtrykov, Albert Jeltsch

PMC · DOI: 10.1093/nar/gkaf126 · Nucleic Acids Research · 2025-03-04

## TL;DR

This study explores how bacterial enzymes that methylate DNA recognize specific sequences and how their activity is influenced by surrounding DNA sequences.

## Contribution

The study reveals how DNA methylation specificity and flanking sequence preferences are modulated by enzyme-DNA interactions and DNA conformation.

## Key findings

- Bacterial MTases show high (>1000-fold) target sequence specificity and strong flanking sequence preferences (∼100-fold).
- Mutations in DNA-contacting amino acids alter both local and global enzyme specificity and flanking sequence preferences.
- Flanking sequence preferences are likely due to sequence-dependent modulation of DNA conformation.

## Abstract

DNA-(cytosine C5)-methyltransferases (MTases) represent a large group of evolutionary related enzymes with specific DNA interaction. We systematically investigated the specificity and flanking sequence preferences of six bacterial enzymes of this class and many MTase mutants. We observed high (>1000-fold) target sequence specificity reflecting strong evolutionary pressure against unspecific DNA methylation. Strong flanking sequence preferences (∼100-fold) were observed which changed for methylation of near-cognate sites suggesting that the DNA structures in the transition states of the methylation of these sites differ. Mutation of amino acids involved in DNA contacts led to local changes of specificity and flanking sequence preferences, but also global effects indicating that larger conformational changes occur upon transition state formation. Based on these findings, we conclude that the transition state of the DNA methylation reaction precedes the covalent enzyme–DNA complex conformations with flipped target base that are resolved in structural studies. Moreover, our data suggest that alternative catalytically active conformations exist whose occupancy is modulated by enzyme–DNA contacts. Sequence dependent DNA shape analyses suggest that MTase flanking sequence preferences are caused by flanking sequence dependent modulation of the DNA conformation. Likely, many of these findings are transferable to other DNA MTases and DNA interacting proteins.

Graphical Abstract

## Full-text entities

- **Genes:** DNMT3A (DNA methyltransferase 3 alpha) [NCBI Gene 1788] {aka DNMT3A2, HESJAS, M.HsaIIIA, TBRS}, CPA1 (carboxypeptidase A1) [NCBI Gene 1357] {aka CPA}, DNMT3B (DNA methyltransferase 3 beta) [NCBI Gene 1789] {aka FSHD4, ICF, ICF1, M.HsaIIIB}, DNMT1 (DNA methyltransferase 1) [NCBI Gene 1786] {aka ADCADN, AIM, CXXC9, DNMT, HSN1E, MCMT}, CHPT1 (choline phosphotransferase 1) [NCBI Gene 56994] {aka CPT, CPT1}
- **Chemicals:** Lysogeny Broth (-), D-(+)-glucose (MESH:D005947), agarose (MESH:D012685), DTT (MESH:D004229), bisulfite (MESH:C042345), U (MESH:D014501), glycerol (MESH:D005990), cytosine (MESH:D003596), nitrogen (MESH:D009584), Ni (MESH:D009532), KCl (MESH:D011189), EDTA (MESH:D004492), E-64 (MESH:C024974), sodium dodecyl sulphate (MESH:D012967), L-(+)-arabinose (MESH:D001089), Cys (MESH:D003545), NNN (MESH:C008655), hydrogen (MESH:D006859), AdoMet (MESH:D012436), imidazole (MESH:C029899), nucleotide (MESH:D009711), bestatin (MESH:C012211), leupeptin (MESH:C032854), oligodeoxynucleotide (MESH:D009838), NaCl (MESH:D012965), pepstatin A (MESH:C031375), C (MESH:D002244), PMSF (MESH:D010664), His (MESH:D006639), acrylamide (MESH:D020106), A (MESH:D001151), Tween  20 (MESH:D011136)
- **Species:** Escherichia coli (E. coli, species) [taxon 562], Serratia marcescens (species) [taxon 615], Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** H127A, Q237A, A > G, A > T, S294, Y242, R209A, T250A, T250, S294A, S224, A-C, T226A, S22S, R240, K89A, T132A, R87A, Q244A, G > T, T > C, S224A, S296A, S219, R81A, G > A, R97A, R228A, S219A, Y242A, R209, Q244, K162A, R243A, R97, R225A, S252A, 8A-C, C to G, C at the -2, cytosine-C5, T226, C > T, R240A, T at the +5, K234A, Q297A, S126, C5-cytosine, T > GC, S79A, S126A, R225, S296, Q237, C > A
- **Cell lines:** N4XN4 — Mus musculus (Mouse), Embryonic stem cell (CVCL_QF09)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11879396/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11879396/full.md

## References

59 references — full list in the complete paper: https://tomesphere.com/paper/PMC11879396/full.md

---
Source: https://tomesphere.com/paper/PMC11879396