Improving Gene Trees without more data
Ashu Gupta

TL;DR
This paper introduces WSB+WQMC, a new pipeline for gene and species tree estimation that improves accuracy in low signal conditions and is statistically consistent under the GTR+MSC model, showing promising results compared to existing methods.
Contribution
The study proposes WSB+WQMC, a novel, statistically consistent pipeline that enhances gene and species tree accuracy, especially with low phylogenetic signal, compared to previous pipelines.
Findings
WSB+WQMC improves accuracy on most datasets with low to high ILS.
It performs better or equally well as WSB+CAML on high ILS datasets.
WQMC is a promising alternative for phylogenetic estimation in challenging conditions.
Abstract
Estimating species and gene trees from sequence data is challenging. Gene tree estimation is often hampered by low phylogenetic signal in alignments, leading to inaccurate trees. Species tree estimation is complicated by incomplete lineage sorting (ILS), where gene histories differ from the species' history. Summary methods like MP-EST, ASTRAL2, and ASTRID infer species trees from gene trees but suffer when gene tree accuracy is low. To address this, the Statistical Binning (SB) and Weighted Statistical Binning (WSB) pipelines were developed to improve gene tree estimation. However, previous studies only tested these pipelines using multi-locus bootstrapping (MLBS), not the BestML approach. This thesis proposes a novel pipeline, WSB+WQMC, which shares design features with the existing WSB+CAML pipeline but has other desirable properties and is statistically consistent under the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Bioinformatics and Genomic Networks · Gene expression and cancer classification
