Optimization of Mapping Tools and Investigation of Ribosomal RNA Influence for Data-Driven Gene Expression Analysis in Complex Microbiomes
Ryo Mameda, Hidemasa Bono

TL;DR
This study improves gene expression analysis in complex microbiomes by optimizing mapping tools and addressing the impact of ribosomal RNA on data accuracy.
Contribution
The study introduces optimized mapping strategies and highlights the need to exclude rRNA reads for accurate TPM calculations in microbiome analysis.
Findings
BWA-MEM outperformed Bowtie2 in mapping efficiency for both metagenomic and metatranscriptomic reads.
rRNA contamination in metagenomic contigs leads to overestimated TPM changes in metatranscriptomic data.
Excluding rRNA reads before TPM calculations improves analytical accuracy in microbiome studies.
Abstract
For gene expression analysis in complex microbiomes, utilizing both metagenomic and metatranscriptomic reads from the same sample enables advanced functional analysis. Due to their diversity, metagenomic contigs are often used as reference sequences instead of complete genomes. However, studies optimizing mapping strategies for both read types remain limited. In addition, although transcripts per million (TPM) is commonly used for normalization, few studies have evaluated the influence of ribosomal RNA (rRNA) in metatranscriptomic reads. This study compared Burrows–Wheeler Aligner–Maximal Exact Match (BWA-MEM) and Bowtie2 as mapping tools for metagenomic contigs. Even after optimizing Bowtie2 parameters, BWA-MEM showed higher efficiency in mapping both metagenomic and metatranscriptomic reads. Further analysis revealed that rRNA sequences contaminate predicted protein-coding regions in…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · RNA modifications and cancer · Molecular Biology Techniques and Applications
