Agalma: an automated phylogenomics workflow
Casey W. Dunn, Mark Howison, and Felipe Zapata

TL;DR
Agalma is an automated workflow that streamlines phylogenomic analyses from raw transcriptome data, improving reproducibility, efficiency, and extensibility of complex phylogenetic studies.
Contribution
It introduces Agalma, a comprehensive tool that automates multiple steps in phylogenomics, integrating data processing, analysis, and reporting in a reproducible framework.
Findings
Automates phylogenomic analysis pipeline from raw data to phylogeny.
Tracks data provenance and computational diagnostics.
Enables reproducible and extendable phylogenomic studies.
Abstract
In the past decade, transcriptome data have become an important component of many phylogenetic studies. Phylogenetic studies now regularly include genes from newly sequenced transcriptomes, as well as publicly available transcriptomes and genomes. Implementing such a phylogenomic study, however, is computationally intensive, requires the coordinated use of many complex software tools, and includes multiple steps for which no published tools exist. Phylogenomic studies have therefore been manual or semiautomated. In addition to taking considerable user time, this makes phylogenomic analyses difficult to reproduce, compare, and extend. In addition, methodological improvements made in the context of one study often cannot be easily applied and evaluated in the context of other studies. We present Agalma, an automated tool that conducts phylogenomic analyses. The user provides raw Illumina…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
