Bayesian History Reconstruction of Complex Human Gene Clusters on a Phylogeny
Tom\'a\v{s} Vina\v{r}, Bro\v{n}a Brejov\'a, Giltae Song, Adam Siepel

TL;DR
This paper introduces a probabilistic model and MCMC algorithm for reconstructing the evolutionary history of complex gene clusters across multiple species, aiding genomic analysis and understanding of their role in evolution and disease.
Contribution
It presents a novel probabilistic framework and MCMC method for reconstructing gene cluster evolution on a phylogeny, improving analysis of duplicated gene regions.
Findings
Developed a probabilistic model for gene cluster evolution.
Designed an MCMC algorithm for duplication history reconstruction.
Applicable to multi-species genomic data.
Abstract
Clusters of genes that have evolved by repeated segmental duplication present difficult challenges throughout genomic analysis, from sequence assembly to functional analysis. Improved understanding of these clusters is of utmost importance, since they have been shown to be the source of evolutionary innovation, and have been linked to multiple diseases, including HIV and a variety of cancers. Previously, Zhang et al. (2008) developed an algorithm for reconstructing parsimonious evolutionary histories of such gene clusters, using only human genomic sequence data. In this paper, we propose a probabilistic model for the evolution of gene clusters on a phylogeny, and an MCMC algorithm for reconstruction of duplication histories from genomic sequences in multiple species. Several projects are underway to obtain high quality BAC-based assemblies of duplicated clusters in multiple species, and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Algorithms and Data Compression · Genome Rearrangement Algorithms
