# Base spacing distribution analysis for human genome

**Authors:** Andrzej Z. G\'orski, Monika Piwowar

arXiv: 1904.07860 · 2019-04-18

## TL;DR

This study analyzes the distribution of base spacings in the human genome, revealing both stochastic similarities at short ranges and significant long-range correlations, with notable differences between nucleotide pairs.

## Contribution

It provides a detailed characterization of base spacing distributions in the human genome, highlighting the presence of long-range correlations and base-specific differences.

## Key findings

- Short-range spacing resembles stochastic sequences
- Long-range spacing shows strong correlations
- Significant differences between (A, T) and (C, G) distributions

## Abstract

The distribution of bases spacing in human genome was investigated. An analysis of the frequency of occurrence in the human genome of different sequence lengths flanked by one type of nucleotide was carried out showing that the distribution has no self-similar (fractal) structure. The results nevertheless revealed several characteristic features: (i) the distribution for short-range spacing is quite similar to the purely stochastic sequences; (ii) the distribution for long-range spacing essentially deviates from the random sequence distribution, showing strong long-range correlations; (iii) the differences between (A, T) and (C, G) bases are quite significant; (iv) the spacing distribution displays tiny oscillations.

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/1904.07860/full.md

## Figures

13 figures with captions in the complete paper: https://tomesphere.com/paper/1904.07860/full.md

## References

26 references — full list in the complete paper: https://tomesphere.com/paper/1904.07860/full.md

---
Source: https://tomesphere.com/paper/1904.07860