An investigation into inter- and intragenomic variations of graphic genomic signatures
Rallis Karamichalis, Lila Kari, Stavros Konstantinidis, Steffen Kopecki

TL;DR
This study shows that Chaos Game Representation patterns of DNA sequences can serve as unique genomic signatures for species classification.
Contribution
The paper confirms CGR patterns are consistent within species and distinct between species, using diverse organisms and multiple distance metrics.
Findings
CGR patterns are preserved within a genome and differ between species.
Several distance metrics outperform Euclidean distance for CGR-based comparisons.
Molecular Distance Maps effectively visualize genomic sequence relationships.
Abstract
Motivated by the general need to identify and classify species based on molecular evidence, genome comparisons have been proposed that are based on measuring mostly Euclidean distances between Chaos Game Representation (CGR) patterns of genomic DNA sequences. We provide, on an extensive dataset and using several different distances, confirmation of the hypothesis that CGR patterns are preserved along a genomic DNA sequence, and are different for DNA sequences originating from genomes of different species. This finding lends support to the theory that CGRs of genomic sequences can act as graphic genomic signatures. In particular, we compare the CGR patterns of over five hundred different 150,000 bp genomic sequences spanning one complete chromosome from each of six organisms, representing all kingdoms of life: H. sapiens (Animalia; chromosome 21), S. cerevisiae (Fungi; chromosome 4), A.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Fractal and DNA sequence analysis · Machine Learning in Bioinformatics
