# Paradoxical G-quadruplex distribution in coronavirus genomes reveals functional constraints and antiviral therapeutic opportunities

**Authors:** Masato Tanigawa, Takafumi Iwaki

PMC · DOI: 10.1016/j.virusres.2026.199692 · 2026-01-20

## TL;DR

This paper finds that coronavirus genomes have a unique pattern of G-quadruplex structures, which could be used to develop new antiviral drugs.

## Contribution

The study identifies conserved and stable G-quadruplex structures in critical viral proteins as potential broad-spectrum antiviral targets.

## Key findings

- Coronavirus genomes show genome-wide depletion of G4s but strong regional enrichment in Spike and Nucleocapsid proteins.
- 38 stable G4 candidates were identified, with 52.6% located in Nucleocapsid regions.
- A primary G4 target (GGCTGGCAATGGCGG) is 54.8% conserved and highly stable (ΔG = −7.35 kcal/mol).

## Abstract

•Coronavirus G4 pattern: genome-wide depletion coupled with regional enrichment•Spike (IRR=17.9) and Nucleocapsid (IRR=15.2) show strong G4 enrichment•38 stable G4 candidates (ΔG < −5 kcal/mol) identified as therapeutic targets•Primary target GGCTGGCAATGGCGG: 54.8% conservation, ΔG = −7.35 kcal/mol•Betacoronavirus G4 conservation supports broad-spectrum antiviral strategies

Coronavirus G4 pattern: genome-wide depletion coupled with regional enrichment

Spike (IRR=17.9) and Nucleocapsid (IRR=15.2) show strong G4 enrichment

38 stable G4 candidates (ΔG < −5 kcal/mol) identified as therapeutic targets

Primary target GGCTGGCAATGGCGG: 54.8% conservation, ΔG = −7.35 kcal/mol

Betacoronavirus G4 conservation supports broad-spectrum antiviral strategies

We computationally characterized G-quadruplex (G4) distributions across 31 coronavirus genomes to identify conserved structural features as potential antiviral therapeutic targets. Through an integrated approach combining consensus G4 detection, dinucleotide-preserving null models, and pooled Poisson rate ratios, we identified a paradoxical G4 distribution pattern: genome-wide depletion (mean fold change = 0.56) coupled with strong regional enrichment in Spike (S) protein (incidence rate ratio [IRR] = 17.9; 95% CI: 11.7-27.6) and Nucleocapsid (N) protein (IRR = 15.2; 95% CI: 8.7-26.6), while untranslated regions (UTRs) showed complete G4 absence. Thermodynamic stability assessment identified 38 stable G4 candidates (ΔG < -5 kcal/mol), with 52.6% concentrated in Nucleocapsid protein regions, suggesting candidates for structure-based antiviral drug development awaiting experimental confirmation.

This paradoxical distribution pattern—genome-wide depletion coupled with strong regional enrichment in functionally critical proteins (Spike and Nucleocapsid)—provides a mechanistic framework for developing betacoronavirus-targeted therapeutics based on conserved G4 structures.

## Linked entities

- **Proteins:** CHMP5 (charged multivesicular body protein 5)

## Full-text entities

- **Genes:** RIGI (RNA sensor RIG-I) [NCBI Gene 23586] {aka DDX58, RIG-I, RIG1, RLR-1, SGMRT2}, ORF3a (ORF3a protein) [NCBI Gene 43740569], N (nucleocapsid phosphoprotein) [NCBI Gene 43740575], E (envelope protein) [NCBI Gene 43740570], ORF1ab (ORF1a polyprotein;ORF1ab polyprotein) [NCBI Gene 43740578], DDX21 (DExD-box helicase 21) [NCBI Gene 9188] {aka GUA, GURDB, II/Gu, RH, RH II/Gu, RH-II/GU}, M (membrane glycoprotein) [NCBI Gene 43740571], IFIH1 (interferon induced with helicase C domain 1) [NCBI Gene 64135] {aka AGS7, Hlcd, IDDM19, IMD95, MDA-5, MDA5}, ERVK-6 (endogenous retrovirus group K member 6, envelope) [NCBI Gene 64006] {aka ERVK6, HERV-K(C7), HERV-K108, K-Rev, c-orf, cORF}, DHX36 (DEAH-box helicase 36) [NCBI Gene 170506] {aka DDX36, G4R1, MLEL1, RHAU}, PPP1R3A (protein phosphatase 1 regulatory subunit 3A) [NCBI Gene 5506] {aka GM, PP1G, PPP1R3}, S (surface glycoprotein) [NCBI Gene 43740568] {aka spike glycoprotein}
- **Diseases:** influenza (MESH:D007251), respiratory infections (MESH:D012141), COVID-19 (MESH:D000086382), deaths (MESH:D003643), cancer (MESH:D009369), cytotoxicity (MESH:D064420)
- **Chemicals:** pyridostatin (MESH:C567962), remdesivir (MESH:C000606551), oligonucleotide (MESH:D009841), S/N (MESH:D014001), nirmatrelvir (MESH:C000718217), G4 (MESH:D004003), BRACO-19 (MESH:C454064), PhenDC3 (MESH:C000710336), S (MESH:D013455), N (MESH:D009584), quarfloxin (MESH:C545184), TMPyP4 (MESH:C021096), dinucleotide (MESH:D015226)
- **Species:** Homo sapiens (human, species) [taxon 9606], Human coronavirus HKU1 (no rank) [taxon 290028], Monkeypox virus (no rank) [taxon 10244], Human coronavirus NL63 (no rank) [taxon 277944], Betacoronavirus (genus) [taxon 694002], Nipah virus [taxon 121791], Human coronavirus OC43 (no rank) [taxon 31631], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Gammacoronavirus (genus) [taxon 694013], Ebola virus [taxon 186536], Alphacoronavirus (genus) [taxon 693996], Severe acute respiratory syndrome-related coronavirus (no rank) [taxon 694009], Orthocoronavirinae (subfamily) [taxon 2501931], Influenza A virus (no rank) [taxon 11320], Human coronavirus 229E (no rank) [taxon 11137], Middle East respiratory syndrome-related coronavirus (no rank) [taxon 1335626], Cricetus cricetus (black-bellied hamster, species) [taxon 10034], Human immunodeficiency virus 1 (no rank) [taxon 11676]

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12860367/full.md

---
Source: https://tomesphere.com/paper/PMC12860367