# Investigating some attributes of periodicity in DNA sequences via   semi-Markov modelling

**Authors:** Pavlos Kolias, Alexandra Papadopoulou

arXiv: 1907.03119 · 2019-07-09

## TL;DR

This paper applies semi-Markov models to analyze 3-base periodicity in DNA sequences, aiding in identifying protein-coding regions within genes, and provides theoretical and empirical insights into the periodic patterns.

## Contribution

It introduces a semi-Markov modeling approach for DNA periodicity analysis, offering new analytical forms and demonstrating effectiveness on synthetic and real data.

## Key findings

- Semi-Markov models effectively characterize DNA periodicity.
- Analytical probability forms describe underlying patterns.
- Empirical results validate the model's applicability.

## Abstract

DNA segments and sequences have been studied thoroughly during the past decades. One of the main problems in computational biology is the identification of exon-intron structures inside genes using mathematical techniques. Previous studies have used different methods, such as Fourier analysis and hidden-Markov models, in order to be able to predict which parts of a gene correspond to a protein encoding area. In this paper, a semi-Markov model is applied to 3-base periodic sequences, which characterize the protein-coding regions of the gene. Analytic forms of the related probabilities and the corresponding indexes are provided, which yield a description of the underlying periodic pattern. Last, the previous theoretical results are illustrated with DNA sequences of synthetic and real data.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1907.03119/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/1907.03119/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/1907.03119/full.md

---
Source: https://tomesphere.com/paper/1907.03119