Benchmarking alignment strategies for Hi-C reads in metagenomic Hi-C data
Yuqiu Wang, Wenxuan Zuo, Jiawei Huang, Fengzhu Sun, Yuxuan Du

TL;DR
This paper compares different methods for aligning Hi-C reads in metagenomic data to determine which ones best support accurate analysis of microbial communities.
Contribution
The study introduces a focused benchmark of Hi-C alignment strategies specifically for metagenomic data, highlighting performance trade-offs between accuracy and efficiency.
Findings
BWA MEM -5SP outperformed other tools in inter-contig read pairs and binning quality across most environments.
Chromap and Minimap2 showed the highest computational efficiency despite lower accuracy in some metrics.
Alignment performance varied significantly across synthetic and real-world metagenomic datasets.
Abstract
Metagenomics combined with High-throughput Chromosome Conformation Capture (Hi-C) provides a powerful approach to study microbial communities by linking genomic content with spatial interactions. Hi-C complements shotgun sequencing by revealing taxonomic composition, functional interactions, and genomic organization within a single sample. However, aligning Hi-C reads to metagenomic contigs is challenging due to variable insert sizes of Hi-C paired-end reads, multi-species complexity, and gaps in assemblies. Although several benchmark studies have evaluated general alignment tools and Hi-C data alignment, none have specifically focused on metagenomic Hi-C data. We evaluated seven alignment strategies commonly used in Hi-C analyses: BWA MEM -5SP, BWA MEM default, BWA aln default, Bowtie2 default, Bowtie2 –very-sensitive-local, Minimap2 default, and Chromap Hi-C default. We benchmarked…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGut microbiota and health · Microbial Community Ecology and Physiology · Genomics and Phylogenetic Studies
