Gene Tree Construction and Correction using SuperTree and Reconciliation
Manuel Lafond, C\'edric Chauve, Nadia El-Mabrouk, A\"ida Ouangraoua

TL;DR
This paper introduces new algorithms for constructing and correcting gene trees using supertree and reconciliation methods, improving accuracy in gene tree reconstruction especially for weakly supported duplication nodes.
Contribution
It develops novel dynamic programming and quadratic time algorithms for gene tree correction and construction within the supertree framework, incorporating duplication, speciation, and orthology/paralogy constraints.
Findings
Algorithms effectively correct weakly supported duplication nodes.
Supertree approach improves gene tree accuracy using reconciliation costs.
Source code and simulations demonstrate practical utility.
Abstract
The supertree problem asking for a tree displaying a set of consistent input trees has been largely considered for the reconstruction of species trees. Here, we rather explore this framework for the sake of reconstructing a gene tree from a set of input gene trees on partial data. In this perspective, the phylogenetic tree for the species containing the genes of interest can be used to choose among the many possible compatible "supergenetrees", the most natural criteria being to minimize a reconciliation cost. We develop a variety of algorithmic solutions for the construction and correction of gene trees using the supertree framework. A dynamic programming supertree algorithm for constructing or correcting gene trees, exponential in the number of input trees, is first developed for the less constrained version of the problem. It is then adapted to gene trees with nodes labeled as…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Gene expression and cancer classification · Bioinformatics and Genomic Networks
