# Investigating the evolutionary dynamics and mutational pattern of SARS-CoV-2 spike gene on selected SARS-CoV-2 variants

**Authors:** Bachir Balech, Alessandra Lo Presti, Claudia Telegrafo, Lucia Maisto, Emanuela Giombini, Angela Di Martino, Luigina Ambrosio, Apollonia Tullo, Paola Stefanelli

PMC · DOI: 10.1371/journal.pone.0333093 · PLOS One · 2025-10-21

## TL;DR

This study analyzes mutations in the SARS-CoV-2 spike gene to track viral evolution and understand how new variants emerge.

## Contribution

The study introduces a combined evolutionary and mutational analysis framework for tracking SARS-CoV-2 variants.

## Key findings

- Phylogenetic analysis showed clear separation and expected order of viral variants.
- Selection pressure and mutation frequency trends were linear and parallel in many cases.
- Low-frequency mutations may be important for predicting new virus variants.

## Abstract

The continuous evolution of SARS-CoV-2 has led to the emergence of several variants representing significant challenges for public health. Many studies highlight the relevance of phylogenetic inference or mutational pattern analysis to understand the evolutionary relatedness of viral variants and to estimate the potential effect of new mutations on viral transmission, virulence and antigenicity. Here we describe an evolutionary investigation approach combined with mutational analyses of SARS-CoV-2 Spike gene to annotate and potentially track important amino acid site variation of specific functional domain relevant for viral survival. This approach was applied on XBB*, EG* and BA* and their sub-lineages (see materials and methods) available from GISAID. In addition, we considered the major variants of concern (Alpha, Delta, Omicron) and Wuhan-Hu-1 strain as references. Maximum likelihood phylogenetic tree was constructed from the complete dataset while selection pressure and mutational analyses were conducted on single variants separately. The obtained phylogenetic tree of Spike amino acid gene sequence showed a clear separation of viral variants as well as their expected appearance order. This result supported the significance of selection pressure analyses outcomes combined with amino acid mutational frequencies where in many cases they showed a linear and parallel trend. This allowed also to hypothesize the potential importance of low-frequency mutations in new potential virus variants. This study constitutes an asset of important insights to be considered in regular monitoring programs. In addition, the analysis framework described here introduces a starting point for further standardization, optimization and application on different data types and in large-scale studies.

## Linked entities

- **Genes:** CHMP5 (charged multivesicular body protein 5) [NCBI Gene 51510]
- **Diseases:** SARS-CoV-2 (MONDO:0100096)

## Full-text entities

- **Genes:** S (surface glycoprotein) [NCBI Gene 43740568] {aka spike glycoprotein}
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12539718/full.md

## Figures

2 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12539718/full.md

## References

67 references — full list in the complete paper: https://tomesphere.com/paper/PMC12539718/full.md

---
Source: https://tomesphere.com/paper/PMC12539718