Hidden breakpoints in genome alignments
Birte Kehr, Knut Reinert, Aaron E. Darling

TL;DR
This paper extends the multichromosomal breakpoint median problem to include gene gain and loss, providing a method to estimate hidden breakpoints in genome evolution and analyzing the impact of alignment errors.
Contribution
It introduces a new approach to detect hidden breakpoints considering gene gain/loss and improves understanding of alignment errors in genome comparisons.
Findings
Hidden breakpoints depend on inversion and gene gain/loss rates.
Current aligners introduce significant errors in hidden breakpoint estimation.
Error in hidden breakpoint detection increases with evolutionary distance.
Abstract
During the course of evolution, an organism's genome can undergo changes that affect the large-scale structure of the genome. These changes include gene gain, loss, duplication, chromosome fusion, fission, and rearrangement. When gene gain and loss occurs in addition to other types of rearrangement, breakpoints of rearrangement can exist that are only detectable by comparison of three or more genomes. An arbitrarily large number of these "hidden" breakpoints can exist among genomes that exhibit no rearrangements in pairwise comparisons. We present an extension of the multichromosomal breakpoint median problem to genomes that have undergone gene gain and loss. We then demonstrate that the median distance among three genomes can be used to calculate a lower bound on the number of hidden breakpoints present. We provide an implementation of this calculation including the median distance,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Chromosomal and Genetic Variations · Genome Rearrangement Algorithms
