Graph Based Analysis for Gene Segment Organization In a Scrambled Genome
Mustafa Hajij, Nata\v{s}a Jonoska, Denys Kukushkin, Masahico Saito

TL;DR
This paper introduces a graph-based method combined with topological data analysis to study complex gene segment arrangements in scrambled genomes, revealing intricate interleaving and overlapping patterns.
Contribution
The paper presents a novel approach using directed graphs and topological data analysis to characterize gene segment organization in highly scrambled genomes.
Findings
Identification of star-like graph structures indicating interleaving of gene segments
Detection of segments containing multiple other gene segments within a single gene
Observation of mutual interleaving among up to six genes
Abstract
DNA rearrangement processes recombine gene segments that are organized on the chromosome in a variety of ways. The segments can overlap, interleave or one may be a subsegment of another. We use directed graphs to represent segment organizations on a given locus where contigs containing rearranged segments represent vertices and the edges correspond to the segment relationships. Using graph properties we associate a point in a higher dimensional Euclidean space to each graph such that cluster formations and analysis can be performed with methods from topological data analysis. The method is applied to a recently sequenced model organism \textit{Oxytricha trifallax}, a species of ciliate with highly scrambled genome that undergoes massive rearrangement process after conjugation. The analysis shows some emerging star-like graph structures indicating that segments of a single gene can…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Protist diversity and phylogeny · Microbial Community Ecology and Physiology
