# MetaBIDx: a new computational approach to bacteria identification in microbiomes

**Authors:** Diem-Trang Pham, Vinhthuy Phan

PMC · DOI: 10.20517/mrr.2024.01 · Microbiome Research Reports · 2024-04-01

## TL;DR

MetaBIDx is a new computational tool that improves the accuracy of identifying bacteria in complex microbiomes, helping with disease diagnosis and outbreak tracing.

## Contribution

MetaBIDx introduces a novel clustering strategy to reduce false positives in bacterial species identification using metagenomic data.

## Key findings

- MetaBIDx outperformed existing tools in precision and F1-score for species prediction.
- Clustering species by approximate genomic coverage significantly reduced false positives.
- The approach can be applied to improve other metagenomic tools.

## Abstract

Objectives: This study introduces MetaBIDx, a computational method designed to enhance species prediction in metagenomic environments. The method addresses the challenge of accurate species identification in complex microbiomes, which is due to the large number of generated reads and the ever-expanding number of bacterial genomes. Bacterial identification is essential for disease diagnosis and tracing outbreaks associated with microbial infections.

Methods: MetaBIDx utilizes a modified Bloom filter for efficient indexing of reference genomes and incorporates a novel strategy for reducing false positives by clustering species based on their genomic coverages by identified reads. The approach was evaluated and compared with several well-established tools across various datasets. Precision, recall, and F1-score were used to quantify the accuracy of species prediction.

Results: MetaBIDx demonstrated superior performance compared to other tools, especially in terms of precision and F1-score. The application of clustering based on approximate coverages significantly improved precision in species identification, effectively minimizing false positives. We further demonstrated that other methods can also benefit from our approach to removing false positives by clustering species based on approximate coverages.

Conclusion: With a novel approach to reducing false positives and the effective use of a modified Bloom filter to index species, MetaBIDx represents an advancement in metagenomic analysis. The findings suggest that the proposed approach could also benefit other metagenomic tools, indicating its potential for broader application in the field. The study lays the groundwork for future improvements in computational efficiency and the expansion of microbial databases.

## Full-text entities

- **Diseases:** microbial infections (MESH:D015163)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11149084/full.md

## References

43 references — full list in the complete paper: https://tomesphere.com/paper/PMC11149084/full.md

---
Source: https://tomesphere.com/paper/PMC11149084