Using Signal Processing in Tandem With Adapted Mixture Models for Classifying Genomic Signals
Saish Jaiswal, Shreya Nema, Hema A Murthy, Manikandan Narayanan

TL;DR
This paper introduces a novel approach combining signal processing and Gaussian mixture models to enhance spectral representation and improve the accuracy of genomic sequence classification into taxonomic groups.
Contribution
The study presents a new technique that integrates signal processing with adapted mixture models for better spectral analysis and classification of genomic sequences.
Findings
Outperforms state-of-the-art methods by 6.06% in accuracy
Effective spectral transformation and projection improve taxonomic distinguishability
Addresses variable-length sequence handling in genomic signal processing
Abstract
Genomic signal processing has been used successfully in bioinformatics to analyze biomolecular sequences and gain varied insights into DNA structure, gene organization, protein binding, sequence evolution, etc. But challenges remain in finding the appropriate spectral representation of a biomolecular sequence, especially when multiple variable-length sequences need to be handled consistently. In this study, we address this challenge in the context of the well-studied problem of classifying genomic sequences into different taxonomic units (strain, phyla, order, etc.). We propose a novel technique that employs signal processing in tandem with Gaussian mixture models to improve the spectral representation of a sequence and subsequently the taxonomic classification accuracies. The sequences are first transformed into spectra, and projected to a subspace, where sequences belonging to…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Machine Learning in Bioinformatics · Genomics and Phylogenetic Studies
