# Genome Divergence Based on Entropic Segmentation of DNA

**Authors:** Pedro A. Bernaola-Galván, Pedro Carpena, Cristina Gómez-Martín, José L. Oliver

PMC · DOI: 10.3390/e27101019 · Entropy · 2025-09-28

## TL;DR

This paper introduces a new genome signature based on DNA composition patterns to measure species divergence and evolutionary relationships.

## Contribution

A novel genome signature using entropic segmentation and Jensen–Shannon distance to capture phylogenetic signals and species divergence.

## Key findings

- The proposed Segment Compositional Distance (D) correlates with species divergence times.
- The method captures strong phylogenetic signals using genome-wide compositional analysis.
- The approach provides a coarse-grained view of genome evolution relevant to the molecular clock hypothesis.

## Abstract

The concept of a genome signature broadly refers to characteristic patterns in DNA sequences that enable the identification and comparison of species or individuals, often without requiring sequence alignment. Such signatures have applications ranging from forensic identification of individuals to cancer genomics. In comparative genomics and evolutionary biology, genome signatures typically rely on statistical properties of DNA that are species-specific and carry phylogenetic information reflecting evolutionary relationships. We propose a novel genome signature based on the compositional structure of DNA, defined by the distributions of strong/weak, purine/pyrimidine, and keto/amino ratios across DNA segments identified through entropic segmentation. We observe that these ratio distributions are similar among closely related species but differ markedly between distant ones. To quantify these differences, we employ the Jensen–Shannon distance—a symmetric and robust measure of distributional dissimilarity—to define a genome-to-genome distance metric, termed Segment Compositional Distance (D). Our results demonstrate a clear correlation between D and species divergence times, and also that this metric captures a strong phylogenetic signal. Our method employs a genome-wide approach rather than tracking specific mutations; thus, D offers a coarse-grained perspective on genome compositional evolution, contributing to the ongoing discussion surrounding the molecular clock hypothesis.

## Full-text entities

- **Diseases:** cancer (MESH:D009369)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12564409/full.md

## Figures

5 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12564409/full.md

## References

67 references — full list in the complete paper: https://tomesphere.com/paper/PMC12564409/full.md

---
Source: https://tomesphere.com/paper/PMC12564409