Genomic Imaging Based on Codongrams and a^2grams
E.A. Bouton, H.M. de Oliveira, R.M. Campello de Souza, N.S., Santos-Magalhaes

TL;DR
This paper presents codongrams and a^2grams as innovative genomic imaging tools that analyze DNA to extract biologically meaningful features, potentially aiding in gene identification and genetic disease diagnosis.
Contribution
It introduces codongrams and a^2grams as novel genomic signal processing tools, offering an alternative to spectrograms for DNA analysis.
Findings
A^2grams reveal amino acid distribution in genomes.
Metgram can identify potential gene start positions.
Method may enable DNA-based medical imaging for genetic diagnosis.
Abstract
This paper introduces new tools for genomic signal processing, which can assist for genomic attribute extracting or describing biologically meaningful features embedded in a DNA. The codongrams and a2grams are offered as an alternative to spectrograms and scalograms. Twenty different a^2grams are defined for a genome, one for each amino acid (valgram is an a^2gram for valine; alagram is an a^2gram for alanine and so on). They provide information about the distribution and occurrence of the investigated amino acid. In particular, the metgram can be used to find out potential start position of genes within a genome. This approach can help implementing a new diagnosis test for genetic diseases by providing a type of DNA-medical imaging.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Machine Learning in Bioinformatics · RNA and protein synthesis mechanisms
