Evaluation of the Number of Different Genomes on Medium and Identification of Known Genomes Using Composition Spectra Approach
Valery Kirzhner, Zeev Volkovich

TL;DR
This paper develops theoretical algorithms to estimate the number of distinct genomes in a sample and identify known genomes using compositional spectra analysis, enhancing genomic mixture analysis methods.
Contribution
It introduces new theoretical algorithms for estimating genome diversity and detecting known genomes based on compositional spectra analysis.
Findings
Algorithms for genome number estimation derived
Algorithms for known genome detection developed
Theoretical foundations established for spectral analysis approach
Abstract
The article presents the theoretical foundations of the algorithm for calculating the number of different genomes in the medium under study and of two algorithms for determining the presence of a particular (known) genome in this medium. The approach is based on the analysis of the compositional spectra of subsequently sequenced samples of the medium. The theoretical estimations required for the implementation of the algorithms are obtained.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Fractal and DNA sequence analysis · Machine Learning in Bioinformatics
