A Markovian genomic concatenation model guided by persymmetric matrices
Andrew G. Hart, M. Sobottka

TL;DR
This paper provides a rigorous mathematical analysis of a Markovian model for bacterial DNA concatenation, including conditions for Markov chain generation and its relation to intra-strand parity.
Contribution
It introduces two formal probabilistic constructions of the model and offers conditions for Markov chain realization, aiding statistical analysis of bacterial genomes.
Findings
Model encompasses Markov chains with intra-strand parity
Provides necessary and sufficient conditions for Markov chain generation
Lays groundwork for algorithms analyzing bacterial DNA sequences
Abstract
The aim of this work is to provide a rigorous mathematical analysis of a stochastic concatenation model presented by Sobottka and Hart (2011) which allows approximation of the first-order stochastic structure in bacterial DNA by means of a stationary Markov chain. Two probabilistic constructions that rigorously formalize the model are presented. Necessary and sufficient conditions for a Markov chain to be generated by the model are given, as well as the theoretical background needed for designing new algorithms for statistical analyses of real bacterial genomes. It is shown that the model encompasses the Markov chains satisfying intra-strand parity, a property observed in most DNA sequences.
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · RNA and protein synthesis mechanisms · Machine Learning in Bioinformatics
