# metaFun: An analysis pipeline for metagenomic big data with fast and unified functional searches

**Authors:** Hyeon Gwon Lee, Ju Yeon Song, Jaekyung Yoon, Yusook Chung, Soon-Kyeong Kwon, Jihyun F. Kim

PMC · DOI: 10.1080/19490976.2025.2611544 · Gut Microbes · 2026-01-13

## TL;DR

metaFun is a new pipeline that simplifies and unifies metagenomic analysis, making it more reliable and accessible for studying microbial communities.

## Contribution

metaFun introduces a unified, end-to-end pipeline for metagenomic analysis with optimized workflows and standardized interpretation.

## Key findings

- metaFun integrates multiple analysis steps into a single framework, improving reproducibility and accessibility.
- The pipeline was validated using a colorectal cancer dataset, demonstrating its effectiveness in real-world applications.
- Benchmarking identified optimal configurations for metagenomic analysis, enhancing accuracy and efficiency.

## Abstract

Metagenomic approaches offer unprecedented opportunities to characterize microbial community structure and function, yet several challenges remain unresolved. Inconsistent genome quality impairs reliability of metagenome-assembled genomes, lack of unified taxonomic criteria limits cross-study comparability, and multi-step workflows involving numerous programs and parameters hinder reproducibility and accessibility. We benchmarked existing programs and parameters using simulated metagenomic data to identify optimal configurations. metaFun is an open-source, end-to-end pipeline that integrates quality control, taxonomic profiling, functional profiling, de novo assembly, binning, genome assessment, comparative genomic analysis, pangenome annotation, network analysis, and strain-level microdiversity analysis into a unified framework. Interactive modules support standardized data interpretation and exploratory visualization. The pipeline is implemented with Nextflow and containerized with Apptainer, ensuring environment reproducibility and scalability. Comprehensive documentation is available at https://metafun-doc.readthedocs.io/en/main. The pipeline was validated using a colorectal cancer cohort dataset. By addressing key methodological gaps, metaFun facilitates accessible and reproducible metagenomic analysis for the broader research community.

## Linked entities

- **Diseases:** colorectal cancer (MONDO:0005575)

## Full-text entities

- **Diseases:** colorectal cancer (MESH:D015179)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12818822/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12818822/full.md

## References

114 references — full list in the complete paper: https://tomesphere.com/paper/PMC12818822/full.md

---
Source: https://tomesphere.com/paper/PMC12818822