# Genetic Diversity of SARS-CoV-2 in Kazakhstan from 2020 to 2022

**Authors:** Altynay Gabiden, Andrey Komissarov, Aknur Mutaliyeva, Aidar Usserbayev, Kobey Karamendin, Alexander Perederiy, Artem Fadeev, Ainagul Kuatbaeva, Dariya Jussupova, Askar Abdaliyev, Manar Smagul, Yelizaveta Khan, Marat Kumar, Temirlan Sabyrzhan, Aigerim Abdimadiyeva, Aidyn Kydyrmanov

PMC · DOI: 10.3390/v18010138 · Viruses · 2026-01-21

## TL;DR

This study tracks the genetic changes of SARS-CoV-2 in Kazakhstan from 2020 to 2022, showing how different virus variants spread and evolved over time.

## Contribution

The paper provides a detailed national genomic overview of SARS-CoV-2 dynamics in Kazakhstan using harmonized data from multiple sources.

## Key findings

- Multiple SARS-CoV-2 lineages were introduced and locally transmitted in Kazakhstan, including a unique B.4.1 lineage.
- Sequencing intensity varied over time, potentially underestimating lineage diversity and persistence.
- Alpha and Omicron waves showed repeated introductions, while Delta was dominated by specific clusters.

## Abstract

Coronavirus disease 2019 (COVID-19), caused by SARS-CoV-2, has had major social and economic consequences worldwide. Whole genome sequencing (WGS) is essential for genomic monitoring, enabling tracking of viral evolution, detection of emerging variants, and identification of introductions and transmission chains to inform timely public health responses. Here, we compile and harmonize SARS-CoV-2 genomic data generated by multiple laboratories across Kazakhstan together with publicly available sequences to provide a national overview of genomic dynamics across successive epidemic waves from 2020 to 2022. We analyzed 4462 genomes deposited in GISAID (including 340 generated in this study), of which 3299 passed Nextclade quality filters, and summarized lineage turnover across major phases (pre-VOC, Alpha, Delta, Omicron BA.1/BA.2, Omicron BA.4/BA.5, and a later recombinant-dominant period). Sequencing intensity varied markedly over time (0.60‰ of confirmed cases during Delta vs. 11.57‰ during the Omicron BA.5 wave), suggesting that lineage diversity and persistence may be underestimated. Pre-VOC circulation included ≥12 Pango lineages with evidence of multiple introductions and sustained local transmission, including a Kazakhstan-restricted B.4.1 lineage that emerged in Nur-Sultan/Astana and disappeared after April 2020. The Tengizchevroil oilfield outbreak comprised B.1.1 viruses with phylogenetic support for ≥three independent introductions. Alpha and Omicron waves were characterized by repeated introductions and heterogeneous origins, whereas Delta was dominated by AY.122 with an additional distinct AY.122 cluster; a notable BF.7 local transmission event was observed during BA.5. We also highlight locally enriched non-lineage-defining mutations. Overall, recurrent importations and variable local amplification shaped SARS-CoV-2 dynamics in Kazakhstan, while interpretation is constrained by strongly time-skewed sequencing.

## Linked entities

- **Diseases:** Coronavirus disease 2019 (MONDO:0100096), SARS-CoV-2 (MONDO:0100096)
- **Species:** Homo sapiens (taxon 9606)

## Full-text entities

- **Diseases:** COVID-19 (MESH:D000086382)
- **Species:** Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12846377/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12846377/full.md

## References

29 references — full list in the complete paper: https://tomesphere.com/paper/PMC12846377/full.md

---
Source: https://tomesphere.com/paper/PMC12846377