CLMB: deep contrastive learning for robust metagenomic binning
Pengfei Zhang, Zhengyuan Jiang, Yixuan Wang, Yu Li

TL;DR
CLMB introduces a deep contrastive learning approach for metagenomic binning that enhances robustness to noise, leading to significantly improved recovery of microbial genomes from large datasets.
Contribution
This paper presents a novel contrastive learning framework that implicitly handles noise in metagenomic binning, outperforming existing methods in genome reconstruction accuracy.
Findings
Recovered up to 17% more genomes than previous methods
Improved bin refinement results by 8-22 high-quality genomes
Single CLMB outperforms existing binning refiners on benchmark datasets
Abstract
The reconstruction of microbial genomes from large metagenomic datasets is a critical procedure for finding uncultivated microbial populations and defining their microbial functional roles. To achieve that, we need to perform metagenomic binning, clustering the assembled contigs into draft genomes. Despite the existing computational tools, most of them neglect one important property of the metagenomic data, that is, the noise. To further improve the metagenomic binning step and reconstruct better metagenomes, we propose a deep Contrastive Learning framework for Metagenome Binning (CLMB), which can efficiently eliminate the disturbance of noise and produce more stable and robust results. Essentially, instead of denoising the data explicitly, we add simulated noise to the training data and force the deep learning model to produce similar and stable representations for both the noise-free…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsGenomics and Phylogenetic Studies · Gene expression and cancer classification · Molecular Biology Techniques and Applications
MethodsContrastive Learning
