# QIIME2 pipeline for ITS2-based nemabiome sequencing in veterinary species and the importance of analysis parameters

**Authors:** Jeba R. J. Jesudoss Chelladurai, Theresa A. Quintana, Aloysius Abraham

PMC · DOI: 10.1186/s13071-025-07184-1 · Parasites & Vectors · 2025-12-17

## TL;DR

This paper introduces a QIIME2 pipeline for analyzing nematode communities in veterinary species using ITS2 sequencing, showing it outperforms the R-based DADA2 pipeline in accuracy and reproducibility.

## Contribution

A QIIME2-based pipeline for ITS2 nemabiome analysis is introduced, offering improved taxonomic resolution and reproducibility over the R-based DADA2 pipeline.

## Key findings

- QIIME2 produced more accurate results in simulated datasets with minimal parameter tuning.
- QIIME2's scikit Bayes classifier improved species-level identification and reduced unclassified taxa.
- Pipeline choices significantly affect nemabiome outputs like species detection and diversity metrics.

## Abstract

Deep amplicon sequencing of nematode internal transcribed spacer 2 (ITS2), also referred to as the “nemabiome,” has been increasingly used in veterinary hosts to study gastrointestinal nematodes. While post-sequencing bioinformatic pipelines such as DADA2 and mothur have been optimized, most researchers typically use the DADA2 pipeline in R. For optimal performance, DADA2 needs parameter tuning, which is hard for novices.

In this study, we present an implementation of the DADA2 pipeline within QIIME2 for nemabiome analysis and compare its performance against the commonly used R-based DADA2 pipeline. To evaluate performance against samples with known composition, we generated simulated nemabiome datasets representing canine, ruminant, and equine nematode communities. We also tested the pipelines using publicly available datasets from ten veterinary host species. For both pipelines, we evaluated differences in amplified sequence variant (ASV) generation, taxonomic classification, and diversity metrics. We also tested different Idtaxa parameter settings within the R DADA2 pipeline (classification threshold and bootstrap iterations) to understand its effects on nemabiome outcomes.

While both pipelines showed minor discrepancies in relative abundance estimates, with minimal parameter optimization, QIIME2 outputs were closer to ground truth in simulated datasets. QIIME2 using the scikit Bayes classifier produced fewer unclassified taxa and more consistent species-level identifications compared with R DADA2’s Idtaxa, particularly in complex communities. Community-level differences in beta diversity were primarily driven by differences in taxonomic assignment. Parameter testing revealed that lower classification thresholds in R DADA2 reduced the number of unclassified taxa but increased the risk of misclassification, highlighting the need for careful parameter selection and reporting.

With minimal parameter tuning, QIIME2 outperformed the R pipeline in taxonomic resolution, and improved reproducibility by provenance tracking. Our findings emphasize how bioinformatics pipeline choices impact nemabiome outputs including the number of species detected, ranks of abundant taxa, and alpha and beta diversities. We provide a reproducible and user-friendly QIIME2 workflow suitable for researchers seeking standardized analyses of ITS2 nemabiome data.

The online version contains supplementary material available at 10.1186/s13071-025-07184-1.

## Full-text entities

- **Diseases:** gastrointestinal nematodes (MESH:D009349)
- **Species:** Canis lupus familiaris (dog, subspecies) [taxon 9615], Equus caballus (domestic horse, species) [taxon 9796], Nematoda (nematode, phylum) [taxon 6231]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12822254/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12822254/full.md

## References

1 references — full list in the complete paper: https://tomesphere.com/paper/PMC12822254/full.md

---
Source: https://tomesphere.com/paper/PMC12822254