# Precise mapping of single-stranded DNA breaks by sequence-templated erroneous DNA polymerase end-labelling

**Authors:** Leonie Wenson, Johan Heldin, Marcel Martin, Yücel Erbilgin, Barış Salman, Anders Sundqvist, Wesley Schaal, Friederike A. Sandbaumhüter, Erik T. Jansson, Xingqi Chen, Anton Davidsson, Bo Stenerlöw, Jaime A. Espinoza, Mikael Lindström, Johan Lennartsson, Ola Spjuth, Ola Söderberg

PMC · DOI: 10.1038/s41467-025-62512-4 · Nature Communications · 2025-08-04

## TL;DR

This paper introduces STEEL-seq, a new method to precisely map single-stranded DNA breaks using an engineered error-prone polymerase.

## Contribution

The novel STEEL-seq method uses a custom polymerase to detect and map single-stranded DNA breaks with high precision.

## Key findings

- STEEL-seq identifies SSBs by introducing mismatches downstream of break sites using a nucleotide omission strategy.
- The method is compatible with multiple sequencing technologies including Sanger, Illumina, PacBio, and Nanopore.
- SSB frequency in the human genome is 0.7 to 3.8 × 10−6, with enrichment in active promoter regions.

## Abstract

The ability to analyze whether DNA contains lesions is essential in identifying mutagenic substances. Currently, the detection of single-stranded DNA breaks (SSBs) lacks precision. To address this limitation, we develop a method for sequence-templated erroneous end-labelling sequencing (STEEL-seq), which enables the mapping of SSBs. The method requires a highly error-prone DNA polymerase, so we engineer a chimeric DNA polymerase, Sloppymerase, capable of replicating DNA in the absence of one nucleotide. Following the omission of a specific nucleotide (e.g., dATP) from the reaction mixture, Sloppymerase introduces mismatches directly downstream of SSBs at positions where deoxyadenosine should occur. This mismatch pattern, coupled with the retention of sequence information flanking these sites, ensures that the identified hits are bona fide SSBs. STEEL-seq is compatible with a variety of sequencing technologies, as demonstrated using Sanger, Illumina, PacBio, and Nanopore systems. Using STEEL-seq, we determine the SSB/base pair frequency in the human genome to range between 0.7 and 3.8 × 10−6 with an enrichment in active promoter regions.

Identifying DNA lesions is key to assessing mutagenic activity. Here, the authors develop STEEL-seq, a method that maps single-stranded DNA breaks (SSBs) using a custom error-prone DNA polymerase. This approach reveals SSB enrichment near active promoters in the human genome.

## Linked entities

- **Chemicals:** dATP (PubChem CID 15993)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Genes:** GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, TOP3A (DNA topoisomerase III alpha) [NCBI Gene 7156] {aka MGRISCE2, PEOB5, TOP3, ZGRF7}, IL11 (interleukin 11) [NCBI Gene 3589] {aka AGIF, IL-11}, ERCC5 (ERCC excision repair 5, endonuclease) [NCBI Gene 2073] {aka COFS3, ERCC5-201, ERCM2, UVDR, XPG, XPGC}, TBP (TATA-box binding protein) [NCBI Gene 6908] {aka GTF2D, GTF2D1, HDL4, SCA17, TBP1, TFIID}, AICDA (activation induced cytidine deaminase) [NCBI Gene 57379] {aka AID, ARP2, CDA2, HEL-S-284, HIGM2}, ERCC4 (ERCC excision repair 4, endonuclease catalytic subunit) [NCBI Gene 2072] {aka ERCC11, FANCQ, RAD1, XFEPS, XPF}, UNG (uracil DNA glycosylase) [NCBI Gene 7374] {aka DGU, HIGM4, HIGM5, UDG, UNG1, UNG15}, DNA Polymerase [NCBI Gene 13897297], JUNB (JunB proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3726] {aka AP-1}, ERCC2 (ERCC excision repair 2, TFIIH core complex helicase subunit) [NCBI Gene 2068] {aka COFS2, CXPD, EM9, TFIIH, TTD, TTD1}, SSB [NCBI Gene 20466802], SSB (small RNA binding exonuclease protection factor La) [NCBI Gene 6741] {aka LARP3, La, La/SSB, SSB/La}, DNTT (DNA nucleotidylexotransferase) [NCBI Gene 1791] {aka TDT}, CHD4 (chromodomain helicase DNA binding protein 4) [NCBI Gene 1108] {aka CHD-4, Mi-2b, Mi2-BETA, SIHIWES}, SMAD7 (SMAD family member 7) [NCBI Gene 4092] {aka CRCS3, MADH7, MADH8}, ERCC3 (ERCC excision repair 3, TFIIH core complex helicase subunit) [NCBI Gene 2071] {aka BTF2, GTF2H, RAD25, Ssl2, TFIIH, TTD2}, XPC (XPC complex subunit, DNA damage recognition and repair factor) [NCBI Gene 7508] {aka RAD4, XP3, XPCC, p125}, TOP1 (DNA topoisomerase I) [NCBI Gene 7150] {aka TOPI}, SERPINE1 (serpin family E member 1) [NCBI Gene 5054] {aka PAI, PAI-1, PAI1, PLANH1}, APEX1 (apurinic/apyrimidinic endodeoxyribonuclease 1) [NCBI Gene 328] {aka APE, APE1, APEN, APEX, APX, HAP1}
- **Diseases:** SSBs (MESH:D012640), cancer (MESH:D009369)
- **Chemicals:** formic acid (MESH:C030544), deoxyadenosine (MESH:C058118), magnesium (MESH:D008274), MgCl2 (MESH:D015636), Bis-Tris (MESH:C026272), glycerol (MESH:D005990), dTTP (MESH:C024157), adenine (MESH:D000225), dGTP (MESH:C029603), thymine (MESH:D013941), Biotin-11-dUTP (MESH:C045931), Trifluoroacetic acid (MESH:D014269), dCTP (MESH:C024107), HF (MESH:D006195), MnCl2 (MESH:C025340), LI-COR (-), biotin (MESH:D001710), TD (MESH:C076628), acetonitrile (MESH:C032159), deoxyuridine (MESH:D003857), 7,8-dihydro-8-oxoguanine (MESH:C453560), EDTA (MESH:D004492), NaCl (MESH:D012965), ATP (MESH:D000255), Peptide (MESH:D010455), Glutamax (MESH:C054122), EB (MESH:C478160), ampicillin (MESH:D000667), deoxyguanosine (MESH:D003849), L-arabinose (MESH:D001089), His (MESH:D006639), oligo (MESH:D009841), guanine (MESH:D006147), agarose (MESH:D012685), deoxythymidine (MESH:D013936), BMH-21 (MESH:C000627087), reactive oxygen species (MESH:D017382), dUTP (MESH:C027078), Urea (MESH:D014508), deoxycytidine (MESH:D003841), polyethylene glycol-8000 (MESH:C000595216), dATP (MESH:C026600), CO2 (MESH:D002245), TritonX-100 (MESH:D017830), sodium phosphate (MESH:C018279), imidazole (MESH:C029899), deoxyadenosines (MESH:D003839), water (MESH:D014867), DMSO (MESH:D004121), polyacrylamide (MESH:C016679)
- **Species:** Escherichia coli (E. coli, species) [taxon 562], Homo sapiens (human, species) [taxon 9606]
- **Mutations:** C with 1, F566S, C with 225, R3136L, C with 0
- **Cell lines:** BL21 E. coli — Homo sapiens (Human), EBV-related Burkitt lymphoma, Cancer cell line (CVCL_M639), BMH-21 — Mus musculus (Mouse), Hybridoma (CVCL_C5HW), HaCaT — Homo sapiens (Human), Spontaneously immortalized cell line (CVCL_0038), HeLa — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_0030), TK-6 — Homo sapiens (Human), Hereditary spherocytosis, Transformed cell line (CVCL_0561)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12322144/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12322144/full.md

## References

5 references — full list in the complete paper: https://tomesphere.com/paper/PMC12322144/full.md

---
Source: https://tomesphere.com/paper/PMC12322144