# Coping with Ineffective Overlap in Multilocus Phylogenetics

**Authors:** Ana Serra Silva, Karen Siu-Ting, Christopher J Creevey, Davide Pisani, Mark Wilkinson

PMC · DOI: 10.1093/sysbio/syaf044 · 2025-07-03

## TL;DR

The paper introduces a new method to handle missing data in phylogenetics by identifying taxa and loci for targeted sequencing to improve tree stability.

## Contribution

The novel approach combines concatabominations with gene-tree jackknifing to identify candidates for additional sequencing.

## Key findings

- The method successfully identifies taxa and loci for targeted sequencing to reduce topological instability.
- It performs well even with modest amounts of added data.
- Results are compared with a mathematics-based gene sampling approach.

## Abstract

Missing data is a long-standing issue in phylogenetic inference, which often results in high levels of taxonomic instability, obscuring otherwise well-supported relationships. Multiple approaches have been developed to deal with the negative effects of ineffective overlap on tree resolution, often by identifying taxa for removal. Here, we repurpose a heuristic method developed to identify unstable taxa in morphological data matrices, concatabominations, and combine it with a novel gene-tree jackknifing on matrix representation of trees to identify candidates for targeted sequencing. Using a multilocus caecilian data set, we illustrate the method’s capacity to identify candidate taxa and loci for additional sequencing, compare the results with those of the mathematics-based gene sampling sufficiency approach, and explore the terrace space associated with the multilocus data set. We show that our approach yields tractable numbers of loci/taxa for targeted sequencing that successfully mitigate topological instability due to ineffective overlap, even when modest amounts of data are added.

## Full-text entities

- **Genes:** CYTB [NCBI Gene 7670238], COX1 [NCBI Gene 7670228], ND1 [NCBI Gene 7670226], COX2 [NCBI Gene 7670229], ND2 [NCBI Gene 7670227]
- **Chemicals:** H3A (-)
- **Species:** Ichthyophis (genus) [taxon 8452], Syzygium (genus) [taxon 178174], Gallus gallus (bantam, species) [taxon 9031], Gymnopis multiplicata (purple caecilian, species) [taxon 449092], Xenopus laevis (African clawed frog, species) [taxon 8355], Protopterus annectens (West African lungfish, species) [taxon 7888], Andrias davidianus (Chinese giant salamander, species) [taxon 141262], Rhododendron (genus) [taxon 4346], Gymnophiona (caecilians, order) [taxon 8445], Bombina orientalis (Oriental fire-bellied toad, species) [taxon 8346], Dermophis mexicanus (Mexican burrowing caecilian, species) [taxon 118251], Mammalia (mammals, class) [taxon 40674], Hypogeophis montanus (species) [taxon 2116690], Anolis carolinensis (Carolina anole, species) [taxon 28377], Lyciasalamandra atifi (species) [taxon 297010], Leiopelma archeyi (Archey's frog, species) [taxon 118230], Microcaecilia nicefori (species) [taxon 2664915], Mus musculus (house mouse, species) [taxon 10090], Latimeria chalumnae (coelacanth, species) [taxon 7897], Amniota (amniotes, clade) [taxon 32524], Ranunculus (buttercups, genus) [taxon 3445]

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12805666/full.md

---
Source: https://tomesphere.com/paper/PMC12805666