Mapping the Space of Genomic Signatures
Lila Kari, Kathleen A. Hill, Abu S. Sayem, Rallis Karamichalis,, Nathaniel Bryans, Katelyn Davis, Nikesh S. Dattani

TL;DR
This paper introduces a novel computational approach using graphical DNA sequence representations and image comparison metrics to visualize and analyze genetic relatedness across large genomic datasets, aiding taxonomy and evolutionary studies.
Contribution
The method combines Chaos Game Representation with Structural Dissimilarity Index and Multi-Dimensional Scaling to compare DNA sequences without requiring homology, applicable to diverse and large datasets.
Findings
Genomic signatures can inform taxonomic classification.
The approach effectively visualizes sequence relatedness across taxonomic levels.
Over 5 million mitochondrial genome pairs analyzed, confirming oligomer composition as a taxonomic marker.
Abstract
We propose a computational method to measure and visualize interrelationships among any number of DNA sequences allowing, for example, the examination of hundreds or thousands of complete mitochondrial genomes. An "image distance" is computed for each pair of graphical representations of DNA sequences, and the distances are visualized as a Molecular Distance Map: Each point on the map represents a DNA sequence, and the spatial proximity between any two points reflects the degree of structural similarity between the corresponding sequences. The graphical representation of DNA sequences utilized, Chaos Game Representation (CGR), is genome- and species-specific and can thus act as a genomic signature. Consequently, Molecular Distance Maps could inform species identification, taxonomic classifications and, to a certain extent, evolutionary history. The image distance employed, Structural…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
