Characterizing SARS-CoV-2 mutations in the United States
Rui Wang, Jiahui Chen, Kaifu Gao, Yuta Hozumi, Changchuan Yin, and Guo-Wei Wei

TL;DR
This study analyzes SARS-CoV-2 mutations in the US, revealing four main substrains, their origins, mutation groups, and gender-dependent mutation effects, highlighting increased infectivity and informing containment strategies.
Contribution
It introduces a comprehensive multi-method analysis of US SARS-CoV-2 mutations, identifying distinct substrains, mutation groups, and gender-related mutation effects not previously characterized.
Findings
US SARS-CoV-2 has four main substrains.
Five top mutations originated outside the US, three within the US.
Three US substrains are more infectious.
Abstract
The severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has been mutating since it was first sequenced in early January 2020. The genetic variants have developed into a few distinct clusters with different properties. Since the United States (US) has the highest number of viral infected patients globally, it is essential to understand the US SARS-CoV-2. Using genotyping, sequence-alignment, time-evolution, k-means clustering, protein-folding stability, algebraic topology, and network theory, we reveal that the US SARS-CoV-2 has four substrains and five top US SARS-CoV-2 mutations were first detected in China (2 cases), Singapore (2 cases), and the United Kingdom (1 case). The next three top US SARS-CoV-2 mutations were first detected in the US. These eight top mutations belong to two disconnected groups. The first group consisting of 5 concurrent mutations is prevailing,…
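The abstract names k-means clustering as one of the methods used to separate genomes into substrains. The following is a minimal sketch of that idea, not the authors' pipeline: it assumes each genome is encoded as a binary vector marking which mutation sites are present, and it uses a deterministic farthest-point initialisation for reproducibility. The toy `genomes` data, the function names, and the encoding are all hypothetical and chosen only for illustration.

```python
from typing import List, Sequence, Tuple


def squared_distance(a: Sequence[float], b: Sequence[float]) -> float:
    """Squared Euclidean distance between two equal-length vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans(
    points: List[Sequence[float]], k: int, max_iter: int = 100
) -> Tuple[List[int], List[List[float]]]:
    """Plain Lloyd-style k-means; returns (labels, centroids)."""
    # Assumption: deterministic farthest-point initialisation, so the
    # example gives the same result on every run.
    centroids = [list(points[0])]
    while len(centroids) < k:
        farthest = max(
            points,
            key=lambda p: min(squared_distance(p, c) for c in centroids),
        )
        centroids.append(list(farthest))

    labels = [-1] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the algorithm has converged
        labels = new_labels
        # Update step: move each centroid to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:
                centroids[j] = [sum(col) / len(members) for col in zip(*members)]
    return labels, centroids


# Hypothetical toy data: rows are genomes, columns are mutation sites
# (1 = mutation present, 0 = absent). Not taken from the paper.
genomes = [
    [1, 1, 1, 0, 0, 0],
    [1, 1, 0, 0, 0, 0],
    [1, 1, 1, 0, 0, 0],
    [0, 0, 0, 1, 1, 1],
    [0, 0, 0, 1, 1, 0],
    [0, 0, 0, 0, 1, 1],
]

labels, centroids = kmeans(genomes, k=2)
print(labels)
```

On this toy input the first three genomes fall into one cluster and the last three into another, mirroring how concurrent mutations can define a substrain. A real analysis would need many more genomes, a principled choice of `k`, and possibly a different distance measure for mutation profiles.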
Taxonomy
Topics: SARS-CoV-2 and COVID-19 Research · Computational Drug Discovery Methods · Vaccines and Immunoinformatics Approaches
