On the distributions of restriction sites in human and pangolin sarbecoviruses
Zach Hensel

TL;DR
This study reanalyzed restriction site distributions in SARS-CoV-2 and related viruses, finding no evidence of artificial design and showing their restriction maps are similar to natural bat and pangolin coronaviruses.
Contribution
It challenges prior claims of synthetic origins by providing a comprehensive reanalysis of restriction sites across multiple sarbecoviruses, including recent bat and pangolin viruses.
Findings
Closely related bat coronaviruses share identical restriction sites with SARS-CoV-2.
Pangolin sarbecoviruses also exhibit similar restriction site patterns.
Reanalysis of previous metrics shows no significant difference in restriction site spacing.
Abstract
Since early 2020, several theories have suggested that a distribution of restriction endonuclease recognition sites in the SARS-CoV-2 genome indicates a synthetic origin. The most influential of these, a 2022 preprint by Bruttel et al. claimed: "The BsaI/BsmBI restriction map of SARS-CoV-2 is unlike any wild-type coronavirus, and it is unlikely to evolve from its closest relatives." To test this, I reanalyzed the same 11 contested sites using an expanded set of sarbecovirus genomes, including bat coronaviruses published after the Bruttel et al. preprint. For each site, I identified the bat coronaviruses most closely matching SARS-CoV-2 in the surrounding sequences, excluding the sites themselves. The Bruttel et al. hypothesis predicts that these closely related viruses should differ from SARS-CoV-2 at many of the contested sites if restriction sites had been artificially introduced or…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
