In Silico Genome-Genome Hybridization Values Accurately and Precisely Predict Empirical DNA-DNA Hybridization Values for Classifying Prokaryotes
Paul A. Muller Jr., Slava S. Epstein

TL;DR
This paper presents a computational method that accurately predicts traditional DNA-DNA hybridization values from whole genome sequences, simplifying prokaryotic classification.
Contribution
The authors developed an in silico pipeline based on BLAST comparisons that reliably estimates empirical hybridization values, improving reproducibility and efficiency.
Findings
High accuracy in predicting DNA-DNA hybridization values
Method is precise and consistent with wet lab results
Streamlines prokaryotic genome classification
Abstract
For nearly 50 years microbiologists have been determining prokaryotic genome relatedness by means of nucleic acid reassociation kinetics. These methods, however, are technically challenging, difficult to reproduce, and - given the time and resources it takes to generate a single data-point - not cost effective. In the post genomic era, with the cost of sequencing whole prokaryotic genomes no longer a limiting factor, we believed that computationally predicting the output value from a traditional DNA-DNA hybridization experiment using pair-wise comparisons of whole genome sequences to be of value. While other computational whole-genome classification methods exist, they predict values on widely different scales than DNA-DNA hybridization, introducing yet another metric into the polyphasic approach of defining microbial species. Our goal was to develop an in silico BLAST based pipeline…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Chromosomal and Genetic Variations · Plant Disease Resistance and Genetics
