# Phylogeny-aware Simulations Suggest a Low Impact of Unsampled Lineages in the Inference of Gene Flow During Eukaryogenesis

**Authors:** Moisès Bernabeu, Saioa Manzano-Morales, Toni Gabaldón

PMC · DOI: 10.1093/gbe/evaf190 · 2025-10-09

## TL;DR

This study uses simulations to show that unaccounted evolutionary lineages have a small impact on determining the order of gene transfers during eukaryotic evolution.

## Contribution

A novel simulation framework is introduced to assess the impact of unsampled lineages on gene flow inference during eukaryogenesis.

## Key findings

- Most gene acquisitions from ghost lineages are inferred with the correct relative order.
- Certain phylogenetic placements are more prone to producing misleading results from ghost lineages.
- The simulation approach can be adapted to other evolutionary scenarios.

## Abstract

The topologies of gene trees are broadly used to infer horizontal gene transfer events and characterize the potential donor and acceptor partners. Additionally, ratios between branch lengths in the gene tree can inform about the timing of transfers relative to each other. Using this approach, recent studies have proposed a relative chronology of gene acquisitions in the lineage leading to the last eukaryotic common ancestor. However, a recognized caveat of the branch-length ratio method are potential biases due to incomplete taxon sampling resulting in so-called “ghost” lineages. Here, we assessed the effect of ghost lineages on the inference of the relative ordering of gene acquisition events during eukaryogenesis. For this, we used a novel simulation framework that populates a dated Tree of Life with plausible “ghost” lineages and simulates their gene transfers to the lineage leading to last eukaryotic common ancestor. Our simulations suggest that a substantial majority of gene acquisitions from distinct ghost donors are inferred with the correct relative order. However, we identify phylogenetic placements where ghost lineages would be more likely to produce misleading results. Overall, our approach offers valuable guidance for the interpretation of future work on eukaryogenesis, and can be readily adapted to other evolutionary scenarios.

## Full-text entities

- **Genes:** DNAH8 (dynein axonemal heavy chain 8) [NCBI Gene 1769] {aka ATPase, SPGF46, hdhc9}

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12573248/full.md

---
Source: https://tomesphere.com/paper/PMC12573248