Exploring the Multifractal Behavior of the Human Genome T2T-CHM13v2.0: Graphical Representations and Cytogenetics
Yuli\'an A. Alvarez-Ballesteros, Mario A. Quiroz-Juarez, Jos\'e L., Del-Rio-Correa, Adrian M. Escobar-Ruiz

TL;DR
This study applies multifractal analysis to the complete human genome using CGR, revealing structural patterns, differences among chromosomes, and comparing two genomic representations with high accuracy.
Contribution
It introduces a comprehensive multifractal analysis of the human genome with novel insights into chromosomal differences and compares two genomic representations for fractal support accuracy.
Findings
Chromosomes 9 and Y show the greatest multifractal differences.
CGR distributions differentiate coding from non-coding regions.
MC representation with twelve-base chains achieves 2% error.
Abstract
In this work, we applied the Chaos Game Representation (CGR) to the complete human genomic sequence T2T-CHM13v2.0, analyzing the entire chromosome assembly and each chromosome separately, including mitochondrial DNA. Multifractal spectra were determined using two types of box-counting coverage, revealing slight variations across most chromosomes. While the geometric support remained consistent, distinct distributions were observed for each chromosome. Chromosomes 9 and Y exhibited the greatest differences in singularity (H\"older exponent), with minor variations in their fractal support. The CGR distributions generally demonstrated an approximate separation between coding and non-coding sections, as well as CpG or GpC islands. A base-by-base analysis of the fractal support of the CGR uncovered characteristic structural bands in chromosome sequences, which align with patterns identified…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsFractal and DNA sequence analysis · Complex Systems and Time Series Analysis · Machine Learning in Bioinformatics
MethodsALIGN
