Unveiling Intra-Clonal Diversity of Monkeypox Virus from Brazil’s First Outbreak Wave
Amanda Stéphanie Arantes Witt, João Victor Rodrigues Pessoa Carvalho, Izabela Mamede, Talita Emile Ribeiro Adelino, Felipe Campos de Melo Iani, Maurício Teixeira Lima, Thalita Souza Arantes, Denilson Eduardo Silva Cunha, Rodrigo Araújo Lima Rodrigues, Giliane de Souza Trindade

TL;DR
This study examines the genetic diversity of monkeypox virus within a single skin lesion from Brazil's first outbreak, revealing subtle differences among viral clones.
Contribution
The study provides new insights into intra-clonal diversity of MPXV from a single patient sample during Brazil's first outbreak wave.
Findings
Three MPXV clones were isolated from a single skin lesion sample, showing distinct plaque characteristics.
MPXV clones lacked comet-like structures, indicating reduced EEV release compared to VACV-WR.
Genomic analysis revealed intra-clonal differences but no significant protein-level variations.
Abstract
The monkeypox virus (MPXV) is an emerging zoonotic pathogen responsible for mpox, a disease characterized by some smallpox-like symptoms, typically mild but occasionally fatal. The largest mpox recorded global outbreak began in May 2022, with over 162,000 cases across 140 countries. Herein, we have analyzed the intra-clonal diversity of MPXV obtained from a single skin lesion sample from a male patient (June 2022). Three viral clones were obtained following phenotypic evaluation of MPXV lysis plaque characteristics over a three-course infection in BSC-40 cells. Unlike the vaccinia virus Western Reserve (VACV-WR) strain, MPXV clones did not produce comet-like structures, suggesting reduced extracellular enveloped virus (EEV) morphotype release, which is associated with viral dissemination. Whole-genome sequencing and assembly identified subtle differences among clones. Comparative…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5| MPXV ORF a | Position | Ref | Alt | Description | Original Sample b | Clone 1 | Clone 2 | Clone 3 |
|---|---|---|---|---|---|---|---|---|
| OPG019 | 7771 | C | T | EGF-like domain containing-protein | SY | SY | SY | SY |
| OPG025 | 14000 | G | T | Ankyrin repeat domain containing-protein | SY | SY | SY | NA |
| OPG031 | 17961 | A | G | C4L/C10L-like family protein | NA | SY | SY | SY |
| OPG031 | 18769 | A | G | C4L/C10L-like family protein | SY | SY | SY | NA |
| OPG037 | 21723 | G | A | Ankyrin repeat domain containing-protein | SY | SY | SY | SY |
| OPG037 | 23564 | C | T | Ankyrin repeat domain containing-protein | SY | SY | SY | SY |
| OPG105 | 81275 | G | A | DNA-dependent RNA polymerase | SY | SY | SY | SY |
| OPG105 | 81977 | A | G | DNA-dependent RNA polymerase | SY | SY | SY | SY |
| OPG105 | 82373 | C | T | DNA-dependent RNA polymerase | SY | SY | SY | SY |
| OPG105 | 82451 | G | A | DNA-dependent RNA polymerase | SY | SY | SY | SY |
| OPG105 | 83326 | C | T | DNA-dependent RNA polymerase | NSY | NSY | NSY | NSY |
| OPG105 | 84587 | C | T | DNA-dependent RNA polymerase | SY | SY | SY | SY |
| OPG145 | 124130 | G | A | DNA helicase | NSY | NSY | NSY | NSY |
| OPG188 | 162243 | G | A | Schlafen | SY | SY | SY | SY |
| OPG188 | 162331 | C | T | Schlafen | SY | SY | SY | SY |
| OPG210 | 183519 | C | T | B22R family serpin | NSY | NSY | NSY | NSY |
| OPG210 | 186578 | G | A | B22R family serpin | NSY | NSY | NSY | NSY |
| OPG210 | 186933 | A | T | B22R family serpin | NA | SY | SY | SY |
- —Conselho Nacional de Desenvolvimento Científico e Tecnológico (CNPq)
- —Fundação de Amparo à Pesquisa do Estado de Minas Gerais (FAPEMIG)
- —Coordenação de Aperfeiçoamento de Pessoal de Nível Superior (CAPES)
- —Instituto Nacional de Ciência e Tecnologia em Poxvírus (INCT-Pox)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPoxvirus research and outbreaks · SARS-CoV-2 and COVID-19 Research · Bacillus and Francisella bacterial research
1. Introduction
The monkeypox virus (MPXV) was first described in 1958 (Copenhagen, Denmark) associated with captive cynomolgus monkeys (Macaca fascicularis) presenting with a smallpox-like infection [1]. The first human MPXV infection (mpox) was reported years later in the Democratic Republic of Congo in a 9-month-old patient diagnosed by the World Health Organization (WHO) Smallpox Reference Center (Moscow, Russia) [1,2]. Mpox symptoms are similar to smallpox but are typically mild, including fever, body chills, swollen lymph nodes, exhaustion, headache, adenopathy, and vesiculopustular rashes. The infection lasts from 3 to 17 days and, in severe cases, may lead to death [3].
Current viral taxonomy classifies poxviruses as belonging to the Nucleocytoviricota phylum, the Pokkesviricetes class, Chitovirales order, family Poxviridae. The monkeypox viruses have been classified into two genetic clades: clade I, prevalent in Central Africa (Congo Basin), and clade II, endemic to West Africa—based on geographic origin and endemicity. These clades are subdivided into lineages (clade I lineages 1–5 and clade IIa and IIb lineages) with distinct lethality rates [4,5,6]. Phylogenetically, the Clade IIb MPXV shares 99.6% nucleotide identity with Clade I and 99.7% nucleotide identity with Clade IIa [7]. MPXV genome consists of a double-stranded DNA strand ranging in size from 196 to 211 kilobases (Kbp), containing the information required to encode about 200 genes [8]. Interestingly, there seem to be some differences among genomic features when comparing both MPXV clades. For example, clade I is more uniform in terms of genome size than clade II. Additionally, one clade has genes that the other does not [9].
Historically, this zoonotic virus has been reported endemically in African countries since its discovery, with rare cases of mpox outside the continent until 2022, either caused by clade I, clade IIa, or clade IIb [7]. As of September 2025, the ongoing global epidemic of mpox that began in 2022 registers more than 162,000 confirmed cases in 140 countries. Of all the non-endemic regions where MPXV has circulated since May of 2022, the region of the Americas is the most affected, with the United States of America and Brazil being the countries with the most confirmed mpox reported cases (35,590 and 14,328 reported cases, respectively) (2022–2025) [3]. Moreover, the 2022 global mpox outbreak is linked to the emergence of the clade IIb MPXV viruses, which has also been previously associated with mpox outbreaks in Nigeria, the United Kingdom, Israel, and Sin39gapore [7,10,11].
Nevertheless, mpox has been considered a neglected disease due to its restricted geographical distribution until its global emergence in 2022. Given the proportions of the 2022 ongoing mpox multicontinental outbreak, MPXV has been studied through many distinct approaches that aimed to better understand various aspects of its biology. As usual, genomic surveillance has played a key role in our understanding of the diversity of MPXV following the 2022 mpox outbreak. However, it is worth noting that, since the beginning of the outbreak in 2022, most of the sequenced viral isolates have provided genomic sequences that reflect the characteristics of the sample as a whole. Therefore, further information about the genomic variation within the sample (i.e., intra-sample genomic diversity) remains poorly explored, creating a gap in our understanding of the MPXV genomic diversity that has yet to be filled.
In this context, the phenotypic characterization of MPXV lysis plaques in vitro provides insight into the variation in the virus within the same sample. As has been demonstrated for other orthopoxviruses (OPVs), such as the Vaccinia virus (VACV), OPV species can co-circulate as distinct genetic groups or viral populations that exhibit significant phenotypic differences, which may even contribute to viral evolution rate by genome recombination and viral diversity [12,13]. In light of the ongoing efforts to improve our understanding of MPXV biology following the unusual global outbreak in 2022, here we explore phenotypic and genomic features of MPXV clones obtained from a single exanthematous lesion. Our results suggest that different MPXV variants can be found in a single lesion, contributing to our understanding on MPXV IIb clade evolution.
2. Methods
2.1. Sample Description
The MPXV sample used in this study was obtained from a single genital lesion of a male patient admitted to a public health system hospital in Brazil in June 2022 [14]. The molecular diagnosis was performed by the Ezequiel Dias Foundation (FUNED), the reference laboratory for diagnosis of infectious diseases in Minas Gerais State, Brazil (Ethics Committee approval, CAAE: 62702222.6.0000.9507, SISGEN-AF47BB2) Part of the sample was submitted to DNA extraction, followed by PCR targeting MPXV, as described by CDC [15]. The molecular diagnosis by PCR confirmed that the patient was infected with MPXV. After PCR confirmation, 15 µL of the vesicular content of the lesion was sent to the Virus Laboratory at the Federal University of Minas Gerais (UFMG), Brazil, where it was processed for virus isolation and clarification. The sample was manipulated in a BSL-3 laboratory, following all institutional biosafety protocols. Five hundred microlitres of Dulbecco’s modified essential medium supplemented with 1% fetal calf serum was added to the vesicular content, homogenized, and then inoculated into a monolayer of Vero cells in a T75 culture flask for virus clarification. Vero cells were used for this stage of the viral manipulation process, as Vero cells are more permissive of OPV in vitro infection. Plaque-forming units (PFUs) were observed at 48 h post-infection (h.p.i.) under incubation at 37 °C and 5% CO_2_. Cells were collected by physical detachment using a sterile cell scraper, and a portion of this extract was re-inoculated into a new monolayer of Vero cells for confirmation and observation of PFU.
2.2. Plaque Phenotype Characterization Assay and Viral Clone Isolation
To identify possible phenotypic variation between different MPXV lysis plaques, three consecutive passages of infection, PFU observation, and collection were performed on BSC-40. Similarly to Vero, the BSC-40 is an excellent cell lineage for in vitro OPV infections. However, it is a better platform for evaluating the lysis plaque phenotype due to its slightly reduced size, which allows for more defined plaques. For phenotypic characterization of MPXV lysis plaques and clone isolation, BSC-40 cells at 95% confluence were used. For each infection, the virus sample was serially diluted in minimal essential medium (MEM) containing 0% fetal calf serum, after which the plates were incubated at 37 °C and 5% CO_2_ for one hour to allow adsorption. After adsorption, the virus inoculum was completely removed, and a semi-solid medium of MEM supplemented with 1% fetal calf serum and 0.5% agarose was carefully added to each well. The cells were then incubated at 37 °C and 5% CO_2_, and the PFUs were carefully observed using a Sunny Instruments SOP TOP ICX41 microscope at 40× and 100× magnification. The PFUs were selected after 48 h of incubation. The PFUs were selected based on characteristics such as approximate size, shape, and peripheral cellular aspects, as well as the presence or absence of cells and cellular debris within the PFU. The selected PFUs were carefully collected using 10 µL filtered pipette tips positioned vertically over the PFU and stored at −80 °C in MEM containing 10% fetal calf serum to maximize viral particle stability. This procedure was repeated for each of the three consecutive infections.
For the first of the three passages, we have used the clarified viral sample after serial dilution. PFU collected from the first infection were labeled, serially diluted, and used for the second infection. Similarly, PFU collected from the second infection were labeled, serially diluted, and used for the third and last infection. By the end of the three-course infection assay, the final PFUs were collected and determined as final viral clones. At last, MPXV clones were expanded into T75 flasks containing BSC-40 monolayer at 90% confluency and MEM 1% fetal calf serum under incubation at 37 °C and 5% CO_2_ until observation of cytopathic effect and formation of plaques. Cell monolayers were washed with PBS buffer, and collected with a sterile cell-scraper, and both pellet and supernatant were stored in cryotubes at −80 °C.
2.3. Comet-Forming Assay
We conducted a comet-forming assay in order to evaluate viral dissemination of MPXV clones through EEV liberation. For this assay, 6-well plates containing BSC-40 monolayers at 95% confluency were infected with each MPXV clone serially diluted in MEM 0% fetal calf serum, followed by adsorption at 37 °C and 5% CO_2_ for an hour. VACV-WR was included as a positive control for comet comparison. After the adsorption period, 2.5 mL MEM supplemented with 1% fetal calf serum was added per well, and plates were incubated at a slight incline for 72 h at 37 °C and 5% CO_2_. Subsequently, the cell monolayers were fixed with 3.7% formaldehyde and stained with 1% crystal violet solution to visualize plaques and any comet visualization. The stained monolayers were examined macroscopically and over a transilluminator to assess plaque morphology.
2.4. Viral DNA Extraction
Viral DNA was extracted from 200 µL of supernatant from each MPXV clone, previously expanded in BSC-40 cells and harvested as described in the Plaque phenotype characterization assay and viral clone isolation section. The extraction was performed using the High Pure Viral Nucleic Acid Kit (Roche 14781400) (Roche Diagnostics GmbH, Mannheim, Germany) following the manufacturer’s instructions. Briefly, samples were incubated with Binding Buffer, poly(A), and proteinase K at 72 °C for 10 min to lyse viral particles and degrade nucleases. The resulting lysates were then loaded onto the kit’s spin columns to facilitate nucleic acid binding to the silica membrane through a series of centrifugation and washing steps using the supplied Inhibitor Removal Buffer and Wash Buffer. Finally, the purified DNA was eluted in 50 µL of Elution Buffer. The extracted DNA samples were immediately stored at −20 °C for subsequent sequencing.
2.5. Viral Genome Sequencing, Gene Annotation, and SNV Calling
Viral DNA was amplified by MPV-specific multiplex PCR using tiling primers and Q5 High-Fidelity DNA Polymerase (New England Biolabs, Ipswich, MA, USA) as previously described by Guimarães et al. (2023), and sequenced on an Illumina MiSeq with normalized libraries loaded onto a 300-cycle according to the manufacturer’s instructions (Illumina, San Diego, CA, USA) [14]. Raw reads were trimmed with Trimmomatic (v 0.36.6) using the SLIDINGWINDOW filter [16]. Genome assemblies were generated with SPAdes v4.2.0 via Galaxy (tool spades/4.2.0+galaxy0), and reads and contigs were then mapped with Minimap2 v2.24 to the Clade IIb reference (NC_063383) [17,18,19]. Consensus sequences were aligned with MAFFT v7.450, visualized in Geneious Prime (v2024.1.2), and subsequently manually curated [20,21]. Assemblies are available at SRA under accession PX448464, PX448465 and PX448466.
The assembled genome for the isolates was aligned against the Zaire reference using MAFFT. CDS calling was performed using Prodigalv2.6.3 and the predicted ORFs were annotated following the workflow described by Carvalho et al. (2024) [22,23]. Briefly, we used Blastp (e-value < 10^−5^) against the NCBI nr database, followed by protein domain search using HHpred [24,25,26]. In case of results discordance, we used Interproscan and compiled the results to achieve the final gene annotation [27]. The genome and a gene identity comparison was performed using data matrixes from minimap2-2.30 for genomes and FastANI v1.34 for genes, that was then used as inputs for heatmaps, using the package pheatmap (kolde, 2025—https://github.com/raivokolde/pheatmap (accessed on 12 September 2025)) in R version 4.4.3 (R Core Team, 2025) [17,28,29].
To compare conservation among the genomic sequences, we used DigAllign 2.0, using the reference genomes for Clade I MPXV strain Zaire 1979-005 (GenBank accession: HM172544), and clade II MPXV 2022 (GenBank accession: NC_063383) reference viruses [30]. The order was chosen to better represent their similarity. SNV calling was performed using an in-house script available at https://github.com/iza-mcac/2025-10-Pox-Variant_calling (accessed on 12 September 2025), and the resulting variants were filtered to include only those associated with genes in Monkeypox virus [31]. In summary, we used Biostrings and tidyverse packages in R, where variants were annotated based on their position relative to coding regions in the reference genome. Custom functions classified SNVs as synonymous, non-synonymous, or stop-gained based on codon changes, while indels were categorized as frameshift or non-frameshift. Annotated variants were reshaped into a wide-format table for comparison across viral clones.
Additional analysis of the original sample sequenced by Guimarães et al. (2023) has been conducted in consideration of the inherent possibility of identifying in vitro-derived SNVs [14]. The raw reads obtained from the sequencing of the original clinical sample were aligned to the MPXV clade II reference genome (NC_063383.1). Each clonal SNV position identified previously was used to find and quantify by extraction the number of reads supporting the reference base and the alternate observed base on the original genome. The alternative allele frequency count was considered a parameter for clonal SNV validation as proven represented within the original sample.
2.6. Phylogenetic Reconstruction
A dataset of 284 genomes was retrieved from GISAID (accessed on 10 October 2025) [32]. After re-evaluating sequences and metadata, we excluded entries with anomalous insertions or excess mutations, yielding 248 public genomes plus 3 clonal genomes, alongside the MPV RefSeqs for Clade I (NC_003310) and Clade II (NC_063383). Genomes were aligned with MAFFT v7.450, and the alignment was trimmed with ClipKIT v0.1.0 in smart-gappy model [21,33]. Maximum-likelihood tree was inferred with IQ-TREE 2 v2.4.0 using 1000 bootstrap replicates [34]. The best-fit substitution model was selected by ModelFinder [35]. Trees were visualized and edited in iTOL v7 [36].
2.7. Structural Comparisons
Structural modeling was performed using SwissModel web server and AlphaFold2, via ColabFold v1.5.5, using pdb100 as a database for templates [37,38]. The models generated by AlphaFold were used in FoldSeek to search for homologous structures [39]. Predicted domains were characterized using InterProScan 106.0 and pHMMER 3.3.2 web servers [27,40]. The Gibbs free energy change values for the mutated amino acids were calculated using the MAESTRO 1.2.35 web server [41]. Physico-chemical parameters were measured using Ramachandran plots, generated in RamplotR online tool (https://bioit.shinyapps.io/RamplotR/ (accessed on 12 September 2025)), and ProtParam [42].
3. Results
3.1. MPXV Lysis Plaque Phenotype Variation and Clone Selection
In this study, we have evaluated MPXV lysis plaque phenotypic variation through three passages of infection and harvest of selected plaque forming units (PFU) depending on different plaque aspects such as size, shape, presence or absence of stressed cells on the plaque periphery and interior. Initially, following the first infection with the 2022 MPXV sample, we observed considerable variation in plaque size, although all MPXV lysis plaques were considered small. This was particularly evident when MPXV plaques were compared to VACV-Western Reserve (VACV-WR) plaques, the latter of which were used as a reference point in this study since VACV is well-characterized in the poxvirus literature. After careful PFU evaluation and selection, a total of ten potential clones were collected for the second infection. However, despite selecting ten distinct potential MPXV clones, most of the PFU variation was considerably reduced after the second and third infections rounds, as we had anticipated, consistent with the expected effects of clonal purification, which separates co-existing phenotypes clones. Following an in-depth analysis of all the potential MPXV clones, we successfully obtained three MPXV clones from the 2022 outbreak in Brazil.
The three MPXV clones obtained exhibited an irregular format, maintaining this characteristic from the early infection. We observed that the plaque size for each clone was roughly the same when comparing plaques from the same clone after 48 h of infection. Precisely measuring lysis plaques was not our primary objective, given the irregular nature of MPXV plaques, but the length of a total of ten plaques per final isolated clone were randomly measured and the average plaque size was calculated. However, when we compared the plaques from the three MPXV clones, we observed slight size variations, with clone 2 presenting the smallest plaques with 273.45 µm average length, followed by clone 1 with 362.34 µm, and clone 3 with 493.72 µm, approximately. All plaques presented stressed cells at the periphery and stressed cells or cell debris within (Figure 1). Moreover, in an attempt to observe more pronounced variation in plaque size among the final isolated clones, we also evaluated the plaque phenotypes after 72 h of infection. However, the longer incubation period did not result in larger lysis plaques forming, so we maintained a 48 h incubation period.
Overall, we observed that the Brazilian MPXV isolate from the 2022 outbreak produced irregularly shaped, small lysis plaques in BSC-40 cells incubated with agar-enriched minimal essential medium (MEM) after 48 h of infection. The Brazilian MPXV isolate exhibited phenotypic variation, enabling the selection and isolation of distinct clones, which were characterized in subsequent sections of this study.
3.2. MPXV Clones Comet-Forming Tendencies
To evaluate the Brazilian 2022 outbreak MPXV isolate clone dissemination through the liberation of the extracellular enveloped virus (EEV), we have performed a comet-forming assay. For comparison purposes, we included VACV-WR as a positive control, since this virus is known to release large quantities of EEVs, producing large, well-defined comets [43,44,45,46]. In our analysis of the comet-forming assay, none of the three Brazilian isolated MPXV clones formed comets, in contrast to the VACV-WR positive control (Figure 2). When the fixed and stained BSC-40 monolayers were observed over a transilluminator, we observed all MPXV clones formed small plaques that did not present any projections, as would be expected if the clones had formed comets.
An important aspect of the Brazilian 2022 MPXV outbreak isolate’s inability to form comets is that this same sample had its multiplication cycle analyzed using an ultrastructural approach, which revealed the absence of EEV morphotypes [47]. In our previous work, transmission electron microscopy of MPXV-infected VERO cells enabled us to study the virus multiplication cycle in detail, including the morphogenesis of new particles. While MPXV morphogenesis remained consistent with what is documented in the poxvirus literature, we have mainly observed the formation of the intracellular mature virus (IMV) morphotype [47]. This suggests a probable low production of the intermediate morphotype, the immature enveloped virus (IEV), and consequently, low or no production of EEV.
As we know, the production and liberation of EEV is a notable feature of the multiplication cycle of many poxviruses. EEV liberation is considered one of the most important aspects of poxvirus biology regarding long-distance cell-to-cell dissemination within tissue and evasion of the host immune system, both of which correlate with virus virulence [44,45,46]. Like VACV and other poxviruses, MPXV can produce and release EEV morphotype particles, which induce the formation of comets [45,48]. Distinct MPXV strains of both clades I and II can form comets since this characteristic is well-conserved; however, it is worth noticing that the clade IIb.1 presents markedly reduced comet formation in comparison to other MPXV strains [49]. Furthermore, the observed characteristics of the lysis plaques produced by the 2022 MPXV sample in BSC-40, as well as the absence of comet formation, are similar to those previously described by McGrail et al. (2024) when comparing a 2022 MPXV outbreak sample with the USA 2003 strain and the WRAIR 7-61 strain [50]. Overall, the characterization of the 2022 MPXV lysis plaque phenotype and comet formation tendencies in BSC-40 indicates diminished viral growth and reduced dissemination through EEV liberation, which differs from what can be observed in MPXV clade II strains prior to the clade IIb emergence in 2022.
3.3. Genomic Analysis of MPXV Clones
To further explore the differences among the clones of the Brazilian 2022 MPXV outbreak isolate, we sequenced the clones’ genomes and performed an exploratory genomic analysis. The genome assemblies revealed scaffolds of size 197,243 bp for clone 1, 197,209 bp for clone 2, and 197,121 bp for clone 3. Complete sequencing, assembly, and prediction data can be found in Table 1, as well as access codes for the NCBI database, GenBank. During the quality assessment of the assemblies, we observed a high number of degenerate bases in clone 1—accounting for 9.42% of the genome (18,563 bp)—and in clone 3—accounting for 12% of the genome (23,660 bp). The clone 2 genome was found to be the most complete, leading to its interpretation as the closest representation of the actual state of the genome (Table 1). This information is significant because it may have ramifications for further analyses, including coding DNA sequences (CDS) prediction. For the comparative purpose of this study, we opted to utilize all genomes for subsequent analysis.
Following the genome sequencing and assembly, we inferred a phylogeny from MPVX circulating in Brazil during the initial 2022 Clade IIb wave, in order to evaluate inter-clone similarity and place them in context with published sequences. Genomes were aligned (open reading frames (ORFs) OPG001–OPG210) using MAFFT v7.450, then trimmed with ClipKIT v0.1.0, yielding a final alignment of 187,913 nucleotide positions after gap filtering and site trimming. A maximum-likelihood tree was inferred in IQ-TREE 2 with the substitution model General Time Reversible with empirical base frequencies and FreeRate heterogeneity with 5 rate categories (GTR+F+R5), and node support was assessed with 1000 bootstraps.
As expected, the three clones formed a cohesive cluster with within Clade IIb Brazilian B.1 genomes. The clonal sequences formed a single, well-supported clade (Supplementary Figure S1). Notably, clones 1 and 3 were more similar to each other and appeared to harbor more mutations than clone 2, which seemed comparatively conserved relative to other contemporaneous genomes.
To facilitate broader understanding and comparison, we included two reference genomes in the next analysis: one for each MPXV clade. The Zaire 1979-005 strain (GenBank accession number HM172544) was used as a reference for clade I, and the MPXV 2022 strain (GenBank accession number NC_063383) for clade II. Figure 3A shows the comparison between each genomic sequence used in this study, highlighting a relative dissimilarity among them when comparing with the references. As cited before, clones 1 and 3 had a high number of Ns in the sequence, due to the quality of the sequencing, which explains this variation. Even so, it is observed that this variation increases the more phylogenetically distant the comparison pair is, showing that, even with unknown bases, the depth of sequencing generated data for a good assembly. The genome of clone 2, considered the best sequenced, has a high similarity to that of the clade II reference, but we observed a variation of about 2% of the genome, which will be evaluated below.
Additionally, we evaluated the percentage of shared sequence identity between the complete genomes of the MPXV clones and the references (Figure 3B). This result is more consistent with expectations, with genes in clade II being more similar to each other. This is because the analysis only uses homologous sequences present in all genomes, which minimizes the impact of Ns. In general, all genomes exhibit high levels of shared identity, with a minimum of 98.6% between clone 3 and the Zaire strain. The three clones share nearly 99.5% similarity with each other, while interestingly, clone 2 showed the highest percentage of shared identity (99.9%) with the MPXV 2022 reference strain, followed by clones 1 and 3 with 99.4% and 99.3%, respectively.
ORFs were predicted using Prodigal and annotated using blastn, HHpred and InterproScan. The annotation showed that clone 1 contained 209 predicted genes, clone 2 coded for 213 genes, and clone 3 had 194 genes. For comparison, the MPXV Zaire strain has 197 coding sequences, while MPXV clade II has 179. Despite the variation in the number of genes among the clones, which could potentially indicate an artifact of genomic assembly and annotation bias, a more thorough examination of the sequences revealed that the composition does not exhibit significant divergence. As we lacked the necessary transcriptome data to validate or reject the predicted sequences, we elected to utilize them as they were provided by the program.
To further evaluate the MPXV clones’ gene composition, we categorized the annotated genes according to the Nucleo-Cytoplasmic Virus Orthologous Groups (NCVOG) classification [51]. Most of the annotated CDS belonged to the category of “Host–virus interaction,” such as cytokine antagonists and entry/fusion complex proteins, followed by “Virion structure and morphogenesis,” such as membrane proteins associated with different infectious forms and DNA packing ATPase A32-like, and “Other metabolic functions”, including proteins with roles that are not yet defined or with more than one role, such as telomere-binding proteins and glutaredoxins, for example. NCVOG categories with the least annotated CDSs were ‘Signal transduction regulation’, ‘Lipid metabolism’—lipases—and ‘Protein metabolism’, the latter being a metalloendopeptidase found only in clone 2 (Figure 4).
Synteny analysis confirmed the similarity results, with clones 2 and 3 presenting a distinct dissimilarity close to the 14 kb mark of the genome. Single nucleotide variation (SNV) analysis showed a similar result with most of the mutations present in proximity to the 14 kb mark and close to the last portion of the pox genome. Using an in-house script (https://github.com/iza-mcac/2025-10-Pox-Variant_calling (accessed on 12 September 2025)) we called SNVs in all the ORFs associated with the annotated CDS, which found from 54 to 60 SNVs present in each clone (Figure 5). Among these SNVs we found two variants in common between two clones and ten variants that were clone specific (Figure 5B).
To facilitate understanding, the SNVs have been condensed into a select group that exclusively comprises those located within a standard pox ORF, as detailed in Table 2. A subsequent analysis of these mutations reveals a preponderance of synonymous mutations (14/18), followed by non-synonymous mutations (4/18). Among the genes exhibiting the most mutations, DNA-dependent RNA polymerase is the main variation hotspot with six SNVs, followed by the B22R family serpin with three SNVs. The C4L/C10L-like family protein, OPG037-homolog ankyrin repeat domain containing-protein and schlafen had two SNVs each, while EGF-like domain containing-protein, OPG025-homolog ankyrin repeat domain containing-protein and DNA helicase had only one. As indicated by Yadav et al. (2023), the functions of the mutated genes are associated with DNA replication, transcription, and interaction with the host [52]. Considering these associated functions to genes and identified mutations, we have hypothesized that some of the found SNVs could possibly be correlated to the phenotypic plaque variation observed among MPXV clones to some degree.
To support that the SNVs found on MPXV clones are representative of the intra-clonal diversity of the MPXV original sample, we also performed SNV calling on the raw sequencing data of the original sample by Guimarães et al. (2023) and found that 16 of the 18 mutations in Table 2 were present in the original sample [14]. This emphasizes that these mutations did not arise from the experimental passages. Specifically, the C>T mutation at position 83,326 and the G>A mutation at position 124,130—the non-synonymous ones—were observed with counts above 900 in the reads of the original sample). These data show that the expected bottleneck effect of the passages was not a relevant selective factor for the SNVs found.
Utilizing these data, a three-gene subset with the non-synonymous mutations was isolated and subjected to further investigation to ascertain whether these mutations would impact the biology of the clone, employing structural modeling. The first ORF selected was the catalytic domain of DNA-dependent RNA polymerase, which presented a variation in a cytosine in reference to a thymine in the clones in position 83,326, resulting in the substitution of a serine residue, a nonpolar amino acid, to leucine, a polar amino acid. This change had a score of −1 in the BLOSUM62 matrix. To build the tertiary structures, SwissModel presented a good model based on high-quality templates, which led us to use the server service for modeling (Table 3).
The mapping process revealed the location of the variation within the bridge helix (BH) of the cleft (Supplementary Figure S2). This structure, consisting of an alpha helix, functions as a wall that divides the nucleic acid loading channels into a primary channel, which accommodates double-stranded DNA to be transcribed, and a secondary channel, into which free NTPs enter and are accommodated for base pairing [53]. Furthermore, evidence suggests that BH plays a role in a transcription checkpoint function, as demonstrated by the ability of a threonine residue to serve as a probe, indicating the stability of the nascent RNA molecule. This process can reveal potential pairing errors, which in turn can result in backtracking [54]. Our in silico analyses did not indicate major changes in the structure of the clone molecules, with no new atomic interactions or variation in the volume of the mutation site (Supplementary Figure S3). The predicted ΔΔG for this mutation was calculated as −0.109 Kcal, indicating that this variant stabilizes the structure. Furthermore, a thorough review of the existing literature revealed the absence of any functions, other than structural ones, for the initial serine. However, we cannot rule out the possibility that this mutation may impact the function of the molecule. RNA polymerase is an enzyme that plays an essential role in the cycle of these viruses, so mutations so close to relevant sites that alter the polarity of the region may generate functional changes, albeit not very significant ones. As observed, these clones appear to generate smaller plaques, so it could be inferred that this mutation has a slight influence on this characteristic. Nevertheless, further investigation is necessary to ascertain the extent to which this mutation impacts the MPXV clone multiplication cycle, if any.
The subsequent ORF was related to DNA helicase, wherein a guanine in the reference genome was substituted with an adenine in the clones, resulting in the replacement of glutamate, an acidic amino acid, with lysine, a basic amino acid. This substitution has a score of 1 in the BLOSUM62 matrix. As template-dependent methods did not generate good models, we used AlphaFold2, via ColabFold, to infer the structures (Table 4). DNA helicase has been hypothesized to play a role in RNA transcription termination and DNA repair, though this remains to be substantiated [55,56]. Considering the mutation found here, it was established that this variation was not located in the functional domain of the molecule (Supplementary Figure S4). Consequently, it was not possible to ascertain whether there were functional changes. Ramachandran plots (Supplementary Figure S5) did not show major differences, as well as ProtParam indexes. In addition, electrostatics analyses performed in PyMol also did not present changes, showing that sites of putative interaction maintain similar. However, the position was in close proximity to the 5′ end of this domain, not only in secondary structure (61 amino acids distant) but also spatially (14.8 Å) and has a predicted ΔΔG calculated as 0.775 kcal, indicating that this variant destabilizes the structure. Therefore, it is plausible to hypothesize that these mutant molecules may exhibit deficiencies in activity in comparison to the naive molecule, as evidenced by their thermodynamic variations.
Finally, the ORF related to the B22R family serpin had two single nucleotide variations, one from cytosine to thymine and the other from guanine to adenine. The first substitution resulted in the replacement of proline with serine in position 722, both of which are uncharged polar. The second substitution involved the replacement of methionine with isoleucine, both of which are also nonpolar. The scores for both substitutions are 4 in the BLOSUM62 matrix. B22 proteins have been demonstrated to play a role in the inactivation of host T cells, a process that is crucial for virulence. Consequently, mutants lacking this gene have been shown to exhibit an attenuated infection [57]. However, attempts to reconstruct the molecule using computational methods have generated models with low reliability, which has prevented the utilization of any model for analyses involving catalytic sites and interaction pockets.
The integration of these results yielded no compelling evidence to suggest a potential phenotypic consequence, such the ones cited before. The mutation in RNA polymerase appears to cause a molecular imbalance; however, further analysis is required to indicate causality. Mutations in B22R could not be analyzed at the structural level because the models generated were not reliable. As demonstrated by Delamonica et al. (2023), the APOBEC-3 enzyme, which is part of the innate immune system against viruses, causes C > T and G > A mutations, thereby exerting pressure on the emergence of these mutations in MPXV [58]. It was demonstrated that epitopes against the OPG105 and OPG220 genes were identified in various orthopoxvirus clades, indicating them as possible targets for immune response [59,60]. In this sense, it is not possible to conclude that the 2022 outbreak was primarily driven by vaccine escape from mutations in these two genes. However, our data underscores the prevalence of this variability, thereby classifying these regions as mutation hotspots.
4. Conclusions
Herein, we have evaluated the biological diversity of an MPXV sample from 2022 early outbreak through phenotypic and genomic characterization. The three distinct selected MPXV clones exhibited lysis plaques of irregular shape and reduced size in BSC-40 after 48 h of incubation. Compared to the VACV-WR strain, the MPXV clones did not produce comets when evaluated for EEV liberation, indicating a low rate of EEV liberation. Regarding the phenotypic characterization of MPXV lysis plaques, our results suggest a low level of viral spread in vitro, which is consistent with recent descriptions of the 2022 MPXV IIb strain in the literature. Nevertheless, the usage of alternative cell lineages may present different results for in vitro phenotypic characterization of MPXV.
Regarding the genomic characteristics of the clones, we have found that the genes did not deviate in function from the reference. However, additional sequences were identified in the strains under investigation. Subsequent analysis of the composition of each gene, in addition to a review of the extant literature, demonstrated that the majority of these extra sequences are, in fact, genes that have lost their function, generating truncated transcripts [61]. These sequences were found in variable regions and were associated with functions related to virus–host interaction.
Nevertheless, a high level of SNVs was identified, which is analogous to that observed when comparing different MPXV lineages from various global regions during the 2022 outbreak [62]. In addition to our data, the frequency of allele variation in the raw reads of the original sample supports the idea that clonal SNVs are well represented within the sample, thus demonstrating allelic heterogeneity within a single MPXV-derived lesion. These variations were identified in genes that have previously been documented as being highly variable, as they are known to be targeted by APOBEC-3. Our findings are consistent with those reported by Vauhkonen et al. (2023), which indicate that substantial variations in MPXV occur within the host, with samples from the same lesion exhibiting viral clones that exhibit significant variations [63]. The findings of both studies underscore the correlation between these mutations and the activity of APOBEC-3, as the enzyme’s distinctive signature has been identified.
Altogether, the clonal phenotypic and genomic characterization of the Brazilian 2022 MPXV isolate has provided data that supports the current knowledge of MPXV through yet a different approach. Our findings suggest that although conserved, it is possible that distinct MPXV 2022 IIb variants may circulate together and can be found within a single exanthematous lesion. However, to further demonstrate the extent to which subpopulation variants are present and co-circulating after the 2022 MPXV emergence, replicates of these analyses should be done to a more representative dataset of lesion samples. Finally, the results hereby presented contribute to our understanding of the virus’s biology, corroborating previous findings while offering a new perspective on the viral characterization.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1von Magnus P. Andersen E.K. Petersen K.B. Birch-Andersen A. A pox-like disease in Cynomolgus monkeys Acta Pathol. Microbiol. Scand.19594615617610.1111/j.1699-0463.1959.tb 00328.x · doi ↗
- 2Marennikova S.S. 9eluhina T.M. Mal’ceva N.N. Cmi 1kjan K.L. Macevit G.R. Isolation and Properties of the Causal Agent of a New Variola-like Disease (Monkeypox) in Man Bull. World Health Organ.1972465996114340219 PMC 2480798 · pubmed ↗
- 3World Health Organization (WHO) Global Mpox (Monkeypox) Situation Dashboard Available online: https://worldhealthorg.shinyapps.io/mpx_global/(accessed on 15 September 2025)
- 4Happi C. Adetifa I. Mbala P. Njouom R. Nakoune E. Happi A. Ndodo N. Ayansola O. Mboowa G. Bedford T. Urgent Need for a Non-Discriminatory and Non-Stigmatizing Nomenclature for Monkeypox Virus P Lo S Biol.202220 e 300176910.1371/journal.pbio.300176935998195 PMC 9451062 · doi ↗ · pubmed ↗
- 5Bunge E.M. Hoet B. Chen L. Lienert F. Weidenthaler H. Baer L.R. Steffen R. The Changing Epidemiology of Human Monkeypox—A Potential Threat? A Systematic Review P Lo S Negl. Trop. Dis.202216 e 001014110.1371/journal.pntd.001014135148313 PMC 8870502 · doi ↗ · pubmed ↗
- 6Djuicy D.D. Sadeuh-Mba S.A. Bilounga C.N. Yonga M.G. Tchatchueng-Mbougua J.B. Essima G.D. Esso L. Nguidjol I.M.E. Metomb S.F. Chebo C. Concurrent Clade I and Clade II Monkeypox Virus Circulation, Cameroon, 1979–2022 Emerg. Infect. Dis.20243043244310.3201/eid 3003.23086138325363 PMC 10902553 · doi ↗ · pubmed ↗
- 7Desingu P.A. Rubeni T.P. Sundaresan N.R. Evolution of Monkeypox Virus from 2017 to 2022: In the Light of Point Mutations Front. Microbiol.202213103759810.3389/fmicb.2022.103759836590408 PMC 9795006 · doi ↗ · pubmed ↗
- 8Luna N. Muñoz M. Bonilla-Aldana D.K. Patiño L.H. Kasminskaya Y. Paniz-Mondolfi A. Ramírez J.D. Monkeypox Virus (MPXV) Genomics: A Mutational and Phylogenomic Analyses of B.1 Lineages Travel. Med. Infect. Dis.20235210255110.1016/j.tmaid.2023.10255136746267 PMC 9946793 · doi ↗ · pubmed ↗
