# Poplar: a phylogenomics pipeline

**Authors:** Elizabeth Koning, Arjun Subedi, Raga Krishnakumar

PMC · DOI: 10.1093/bioadv/vbaf104 · Bioinformatics Advances · 2025-05-06

## TL;DR

Poplar is a new computational pipeline that simplifies and speeds up the process of generating phylogenomic trees from genomic data.

## Contribution

Poplar introduces a streamlined, automated pipeline for phylogenomic analysis that requires no external databases and supports parallel computing.

## Key findings

- Poplar generates phylogenomic trees that closely match state-of-the-art published trees.
- The pipeline simplifies and accelerates phylogenomic analysis compared to manual methods.
- Poplar supports parallel execution for clusters and cloud computing.

## Abstract

Generating phylogenomic trees from the genomic data is essential in understanding biological systems. Each step of this complex process has received extensive attention and has been significantly streamlined over the years. Given the public availability of data, obtaining genomes for a wide selection of species is straightforward. However, analyzing that data to generate a phylogenomic tree is a multistep process with legitimate scientific and technical challenges, often requiring a significant input from a domain-area scientist.

We present Poplar, a new, streamlined computational pipeline, to address the computational logistical issues that arise when constructing the phylogenomic trees. It provides a framework that runs state-of-the-art software for essential steps in the phylogenomic pipeline, beginning from a genome with or without an annotation, and resulting in a species tree. Running Poplar requires no external databases. In the execution, it enables parallelism for execution for clusters and cloud computing. The trees generated by Poplar match closely with state-of-the-art published trees. The usage and performance of Poplar is far simpler and quicker than manually running a phylogenomic pipeline.

Freely available on GitHub at https://github.com/sandialabs/poplar. Implemented using Python and supported on Linux.

## Full-text entities

- **Diseases:** fungal (MESH:D009181)
- **Species:** Spermatophyta (seed plants, clade) [taxon 58024], Lathyrus oleraceus (garden pea, species) [taxon 3888], Acer negundo (box elder, species) [taxon 4023], Quercus lobata (valley oak, species) [taxon 97700], Populus (poplar, genus) [taxon 3689], Miscanthus lutarioriparius (species) [taxon 422564], Puccinia graminis (wheat stem rust, species) [taxon 5297], Ensete ventricosum (species) [taxon 4639], Severe acute respiratory syndrome coronavirus 2 (no rank) [taxon 2697049], Lactuca saligna (species) [taxon 75948], Cajanus cajan (pigeon pea, species) [taxon 3821], PX clade (clade) [taxon 569578], Lithocarpus litseifolius (species) [taxon 425828], Escherichia coli (E. coli, species) [taxon 562], Prunus dulcis (almond, species) [taxon 3755], Plasmodium falciparum (malaria parasite P. falciparum, species) [taxon 5833], Persea americana (avocado, species) [taxon 3435], Arabidopsis thaliana (mouse-ear cress, species) [taxon 3702], Theobroma cacao (cacao, species) [taxon 3641]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12159734/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12159734/full.md

## References

102 references — full list in the complete paper: https://tomesphere.com/paper/PMC12159734/full.md

---
Source: https://tomesphere.com/paper/PMC12159734