Construction of Phylogenetic Relationships Based on 8-mer Spectra Distribution Characteristics of Vertebrate Whole Genome Sequences
Zhenhua Yang, Li Wang, Guojun Liu, Dongsheng Yu, Xiangjun Cui

TL;DR
This paper introduces a method for inferring phylogenetic relationships among vertebrates from the distribution of 8-mer spectra in whole genome sequences.
Contribution
A dual-feature strategy combining class-level and order-level phylogenetic features derived from 8-mer spectra is proposed.
Findings
Class-level features capture macroevolutionary patterns and establish the phylogenetic backbone.
Order-level features enable finer-resolution discrimination at the ordinal level.
Validation across vertebrate genomes confirmed the effectiveness of the dual-feature strategy.
Abstract
Background/Objectives: With advances in sequencing technology, whole genome sequences have become a valuable resource for deciphering species evolution. However, efficiently extracting phylogenetic information from such data remains a major challenge. Traditional multiple sequence alignment methods are computationally intensive and perform poorly for distantly related species, while k-mer analysis offers a new direction for efficiently capturing genomic composition and evolutionary signatures. Methods: Features were extracted from the 8-mer spectra of 16 XYi subsets. Results: This study found that the distribution characteristics of the 8-mer spectra of whole genome sequences are closely related to species evolution. Building on this, we developed a dual-feature strategy for genome-scale phylogenetics. The strategy incorporates two distinct feature types: (a) 186 class-level phylogenetic features…
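To make the idea of a k-mer spectrum concrete, the following is a minimal Python sketch, not the authors' pipeline. It assumes that a "spectrum" means the histogram of how many distinct 8-mers occur a given number of times, and that windows containing ambiguous bases are skipped; neither detail is stated in this summary. The function names `count_kmers` and `kmer_spectrum` are hypothetical, and the construction of the 16 XYi subsets is not reproduced because it is not defined here.

```python
from collections import Counter


def count_kmers(seq, k=8):
    """Count overlapping k-mers with a sliding window.

    Assumption: windows containing any symbol other than A, C, G, T
    (for example N) are skipped rather than counted.
    """
    seq = seq.upper()
    alphabet = set("ACGT")
    counts = Counter()
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if set(kmer) <= alphabet:
            counts[kmer] += 1
    return counts


def kmer_spectrum(counts):
    """Map an occurrence count to the number of distinct k-mers with that count.

    Assumption: this histogram is one plausible reading of "spectrum";
    the paper may define it differently.
    """
    return Counter(counts.values())


if __name__ == "__main__":
    toy = "ACGTACGTACGT"  # toy sequence, not real genomic data
    counts = count_kmers(toy, k=8)
    print(dict(counts))
    print(dict(kmer_spectrum(counts)))
```

On a real genome the same counting would normally be delegated to a dedicated k-mer counter, since a pure-Python loop is far too slow at that scale; the sketch only illustrates the quantity being measured.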
Taxonomy
Topics: Genomics and Phylogenetic Studies · Genome Rearrangement Algorithms · Fractal and DNA sequence analysis
