# SwarmGenomics: A Unified Pipeline for Individual‐Based Whole‐Genome Analyses

**Authors:** Aure Kylmänen, Yu‐Chi Chen, Sahar Javaheri Tehrani, Nikolas Vellnow, Justin J. S. Wilcox, Toni I. Gossmann

PMC · DOI: 10.1111/1755-0998.70119 · Molecular Ecology Resources · 2026-03-19

## TL;DR

SwarmGenomics is a user-friendly pipeline for whole-genome analysis that simplifies genetic studies of non-model species.

## Contribution

SwarmGenomics introduces a unified, modular pipeline for individual-based genomic analyses with minimal bioinformatics expertise.

## Key findings

- SwarmGenomics integrates seven modules for various genetic analyses, including heterozygosity and population history.
- The pipeline was successfully applied to the giant panda, revealing genetic diversity and inbreeding patterns.
- SwarmGenomics helps identify transposable element activity and microbial contamination in genomic data.

## Abstract

Advances in sequencing technologies have made whole‐genome data widely accessible, enabling research in population genetics, evolutionary biology, and conservation. However, analysing whole‐genome sequencing (WGS) data remains challenging, often requiring multiple specialised tools and substantial bioinformatics expertise. We present SwarmGenomics, a modular, user‐friendly command‐line pipeline for reference‐based genome assembly and individual‐based genetic analyses. The pipeline integrates seven modules: heterozygosity estimation, runs of homozygosity detection, Pairwise Sequentially Markovian Coalescent (PSMC) analysis, unmapped reads classification, repeat analysis, mitochondrial genome assembly, and nuclear mitochondrial DNA segment (NUMT) identification. Each module can be run independently or as part of a complete workflow. We demonstrate the pipeline's utility with a case study on the giant panda (
Ailuropoda melanoleuca
), revealing insights into genetic diversity, inbreeding history, historical population size changes, transposable element activity, and microbial contamination. SwarmGenomics lowers the entry barrier for genomic analysis of diploid, non‐model species, serving both as a research and teaching tool. The pipeline and documentation are available at https://github.com/AureKylmanen/Swarmgenomics.

## Linked entities

- **Species:** Ailuropoda melanoleuca (taxon 9646)

## Full-text entities

- **Diseases:** infection (MESH:D007239), ANGSD (MESH:D010855), inbreeding depression (MESH:D003866), NUMTs (MESH:C536350)
- **Chemicals:** MitoZ (-)
- **Species:** Acyrthosiphon pisum (pea aphid, species) [taxon 7029], Plasmodium (subgenus) [taxon 418103], Panda (genus) [taxon 212257], Homo sapiens (human, species) [taxon 9606], Viruses (acellular root) [taxon 10239], Ailuropoda melanoleuca (giant panda, species) [taxon 9646]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13000882/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13000882/full.md

## References

104 references — full list in the complete paper: https://tomesphere.com/paper/PMC13000882/full.md

---
Source: https://tomesphere.com/paper/PMC13000882