Coping with Ineffective Overlap in Multilocus Phylogenetics
Ana Serra Silva, Karen Siu-Ting, Christopher J Creevey, Davide Pisani, Mark Wilkinson

TL;DR
The paper introduces a new method to handle missing data in phylogenetics by identifying taxa and loci for targeted sequencing to improve tree stability.
Contribution
The novel approach combines concatabominations with gene-tree jackknifing to identify candidates for additional sequencing.
Findings
The method successfully identifies taxa and loci for targeted sequencing to reduce topological instability.
It performs well even with modest amounts of added data.
Results are compared with a mathematics-based gene sampling approach.
Abstract
Missing data is a long-standing issue in phylogenetic inference, which often results in high levels of taxonomic instability, obscuring otherwise well-supported relationships. Multiple approaches have been developed to deal with the negative effects of ineffective overlap on tree resolution, often by identifying taxa for removal. Here, we repurpose a heuristic method developed to identify unstable taxa in morphological data matrices, concatabominations, and combine it with a novel gene-tree jackknifing on matrix representation of trees to identify candidates for targeted sequencing. Using a multilocus caecilian data set, we illustrate the method’s capacity to identify candidate taxa and loci for additional sequencing, compare the results with those of the mathematics-based gene sampling sufficiency approach, and explore the terrace space associated with the multilocus data set. We show…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Genetic diversity and population structure · Evolution and Paleontology Studies
