# Indirect identification of genomic G-quadruplexes via a small protein probe that specifically recognizes C-rich single-stranded DNA

**Authors:** Juan-nan Chen, Mei-lin Xie, Jiang-yu Yan, Ting-ting Cai, Yong-wen Ding, Tian-xiang He, Jiankang Wang, Jing Huang, Ke-wei Zheng

PMC · DOI: 10.1093/nar/gkag068 · 2026-01-30

## TL;DR

This paper introduces a new protein probe that detects genomic G-quadruplexes by recognizing complementary C-rich DNA, offering a novel way to map G4 structures in the genome.

## Contribution

The study introduces CK13, a novel protein probe that detects G4s indirectly by targeting C-rich single-stranded DNA released during G4 formation.

## Key findings

- CK13 identified tens of thousands of C-rich ssDNA sites in the human genome, most of which overlap with known G4 sites.
- CK13 detects G4s even when they are occupied by G4-binding proteins, offering broader detection capabilities than traditional probes.

## Abstract

Detecting intracellular genomic G-quadruplexes (G4s) is crucial for understanding their biological functions. Although various G4 recognition probes have been developed, there remains a need for new G4 detection technologies to create detailed and reliable genomic G4 maps. In this study, we developed a small protein (CK13) that specifically recognizes the complementary C-rich single-stranded DNA (ssDNA) released during the formation of G4. Based on CK13 and CUT&Tag technology, we identified tens of thousands of C-rich ssDNA sites within human genomic DNA. These sites contain the vast majority of G4 sites detected by G4 probes, indicating that CK13 can well confirm the results of traditional G4 probes. Since CK13’s binding to C-rich ssDNA is minimally influenced by G4-binding proteins, it produces strong signals at the sites where intracellular G4-binding proteins are present. This indicates that, beyond free G4 structures, CK13 can also detect G4s occupied by G4-binding proteins within cells. Our findings demonstrate that C-rich ssDNA complementary to G4 can serve as an indirect marker for G4 formation, offering a promising approach to further explore the regulatory roles of G4s and their interacting proteins.

Graphical Abstract

## Linked entities

- **Proteins:** KRT13 (keratin 13)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Genes:** NUCLEOLIN (nucleolin multifunctional protein) [NCBI Gene 4691] {aka C23, NCL, Nsr1}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, KRT13 (keratin 13) [NCBI Gene 3860] {aka CK13, K13, WSN2}, EGFR (epidermal growth factor receptor) [NCBI Gene 1956] {aka ERBB, ERBB1, ERRP, HER1, NISBD2, NNCIS}, CSTB (cystatin B) [NCBI Gene 1476] {aka CPI-B, CST6, EPM1, EPM1A, PME, STFB}, CST12P (cystatin 12, pseudogene) [NCBI Gene 106478911] {aka Cst, Ctes4, E2}, MBP (myelin basic protein) [NCBI Gene 4155], DNAI1 (dynein axonemal intermediate chain 1) [NCBI Gene 27019] {aka CILD1, DIC1, ICS1, PCD, oda6}, PCBP1 (poly(rC) binding protein 1) [NCBI Gene 5093] {aka HEL-S-85, HNRPE1, HNRPX, hnRNP-E1, hnRNP-X}, C9orf72 (C9orf72-SMCR8 complex subunit) [NCBI Gene 203228] {aka ALSFTD, DENND9, DENNL72, FTDALS, FTDALS1}, TOP1 (DNA topoisomerase I) [NCBI Gene 7150] {aka TOPI}, F10 (coagulation factor X) [NCBI Gene 2159] {aka FX, FXA}, HNRNPK (heterogeneous nuclear ribonucleoprotein K) [NCBI Gene 3190] {aka AUKS, CSBP, HNRPK, TUNP}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, ETV6 (ETS variant transcription factor 6) [NCBI Gene 2120] {aka TEL, TEL/ABL, THC5}, BCL2 (BCL2 apoptosis regulator) [NCBI Gene 596] {aka Bcl-2, PPP1R50}, TXK (TXK tyrosine kinase) [NCBI Gene 7294] {aka BTKL, PSCTK5, PTK4, RLK, TKL}, KIT (KIT proto-oncogene, receptor tyrosine kinase) [NCBI Gene 3815] {aka C-Kit, CD117, MASTC, PBT, SCFR}, C4A (complement C4A (Chido/Rodgers blood group)) [NCBI Gene 720] {aka C4, C4A2, C4A3, C4A4, C4A6, C4AD}, ABTB2 (ankyrin repeat and BTB domain containing 2) [NCBI Gene 25841] {aka ABTB2A, BTBD22, CCA3}
- **Diseases:** cancers (MESH:D009369)
- **Chemicals:** DTT (MESH:D004229), polyacrylamide (MESH:C016679), AEBSF (MESH:C002010), oligonucleotide (MESH:D009841), PBS (MESH:D007854), NaCl (MESH:D012965), KCl (MESH:D011189), SYBR green I. (MESH:C098022), HEPES (MESH:D006531), formamide (MESH:C031066), Alexa Fluor 555 (MESH:C000608607), G4 (MESH:D004003), streptomycin (MESH:D013307), EDTA (MESH:D004492), K+ (MESH:D011188), penicillin (MESH:D010406), PEG200 (MESH:C000619859), C (MESH:D002244), paraformaldehyde (MESH:C003043), spermidine (MESH:D013095), Triton X-100 (MESH:D017830), poly(dA) (MESH:C015465), CY5 (MESH:C085321), FAM (MESH:C031179), glycerol (MESH:D005990), digitonin (MESH:D004072), MgCl2 (MESH:D015636), cytosine (MESH:D003596), Cy3 (-), (dT (MESH:D013936), SDS (MESH:D012967), Alexa Fluor 488 (MESH:C000711379), ampicillin (MESH:D000667), LiCl (MESH:D018021), poly(C) (MESH:D011066), HCl (MESH:D006851), KOH (MESH:C029943), DAPI (MESH:C007293)
- **Species:** Homo sapiens (human, species) [taxon 9606], Escherichia coli (E. coli, species) [taxon 562], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** BL21 — Homo sapiens (Human), EBV-related Burkitt lymphoma, Cancer cell line (CVCL_M639), HeLa — Homo sapiens (Human), Human papillomavirus-related endocervical adenocarcinoma, Cancer cell line (CVCL_0030), BG4 — Homo sapiens (Human), Chondrosarcoma, grade 3, Cancer cell line (CVCL_M614), HCT116 — Homo sapiens (Human), Colon carcinoma, Cancer cell line (CVCL_0291), S13A — Homo sapiens (Human), Conditionally immortalized cell line (CVCL_LF75), HEK293T — Homo sapiens (Human), Transformed cell line (CVCL_0063)

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12856213/full.md

---
Source: https://tomesphere.com/paper/PMC12856213