# BiCLUM: Bilateral contrastive learning for unpaired single-cell multi-omics integration

**Authors:** Yin Guo, Izaskun Mallona, Mark D. Robinson, Limin Li

PMC · DOI: 10.1371/journal.pcbi.1013932 · PLOS Computational Biology · 2026-02-03

## TL;DR

BiCLUM is a new method for combining unpaired single-cell multi-omics data, improving integration by aligning both cell and feature levels, leading to better biological insights.

## Contribution

BiCLUM introduces a novel bilateral contrastive learning framework that simultaneously aligns cell and feature levels for unpaired single-cell multi-omics integration.

## Key findings

- BiCLUM outperforms existing methods in integrating RNA+ATAC and RNA+protein datasets.
- BiCLUM embeddings preserve biologically meaningful regulatory relationships between chromatin accessibility and gene expression.
- BiCLUM enables robust downstream analyses like transcription factor activity inference and cell–cell interaction mapping.

## Abstract

The integration of single-cell multi-omics data provides a powerful approach for understanding the complex interplay between different molecular modalities, such as RNA expression, chromatin accessibility and protein abundance, measured through assays like scRNA-seq, scATAC-seq and CITE-seq, at single-cell resolution. However, most existing single-cell technologies focus on individual modalities, limiting a comprehensive understanding of their interconnections. Integrating such diverse and often unpaired datasets remains a challenging task due to unknown cell correspondences across distinct feature spaces and limited insights into cell-type-specific activities in non-scRNA-seq modalities. In this work, we propose BiCLUM, a Bilateral Contrastive Learning approach for Unpaired single-cell Multi-omics integration, which simultaneously enforces cell-level and feature-level alignment across modalities. BiCLUM first transforms one modality, such as scATAC-seq, into the data space of another modality, such as scRNA-seq, using prior genomic knowledge. It then learns cell and gene embeddings simultaneously through a bilateral contrastive learning framework, incorporating both cell-level and feature-level contrastive losses. Across multiple RNA+ATAC and RNA+protein datasets, BiCLUM consistently outperforms or matches existing integration methods in both visualization and quantitative benchmarks. Importantly, BiCLUM embeddings preserve biologically meaningful regulatory relationships between chromatin accessibility and gene expression, as evidenced by significantly higher gene–peak correlations than random controls. Downstream analyses further demonstrate that BiCLUM-derived embeddings facilitate transcription factor activity inference, identification of cell-type-specific marker genes, functional enrichment, and cell–cell interaction mapping. Comprehensive hyperparameter sensitivity and ablation analyses further establish BiCLUM as a robust and interpretable framework that not only achieves effective cross-modal alignment but also retains the underlying regulatory and functional landscape across single-cell modalities.

With the rise of single-cell multi-omics technologies, researchers can now probe multiple molecular layers (e.g., transcriptome and epigenome) within individual cells, enabling a more comprehensive understanding of cellular states and functions. However, integrating unpaired multi-omics data from different modalities poses significant challenges due to batch effects, non-overlapping feature spaces, and the lack of cell-to-cell correspondence. To address these issues, we introduce BiCLUM, a bilateral contrastive learning–based integration framework that innovatively models both cell-level and feature-level dependencies in a unified manner. Instead of relying solely on cell-level matching, BiCLUM constructs mutual nearest neighbor (MNN) pairs at both the cell and feature levels, and employs a bilateral contrastive objective to enforce consistent alignment across modalities. This design enables BiCLUM to capture multi-omic correspondences from two complementary perspectives, enhancing the robustness and biological interpretability of the integration. We evaluate BiCLUM across a diverse set of benchmark datasets and show that it consistently outperforms existing integration methods in terms of cell-type alignment, omics mixing, and downstream trajectory inference. Our work highlights the effectiveness of contrastive learning in resolving modality discrepancies and provides a robust framework for integrative single-cell analysis.

## Full-text entities

- **Genes:** CXCR5 (C-X-C motif chemokine receptor 5) [NCBI Gene 100515679] {aka CD185}, CD4 (CD4 molecule) [NCBI Gene 404704], VCAM1 (vascular cell adhesion molecule 1) [NCBI Gene 7412] {aka CD106, INCAM-100}, ABCB6 (ATP binding cassette subfamily B member 6 (LAN blood group)) [NCBI Gene 10058] {aka ABC, LAN, MTABC3, PRP, umat}
- **Diseases:** tumor (MESH:D009369), BiCLUM (MESH:D007859), infection (MESH:D007239)
- **Chemicals:** GPU (-), steroid hormone (MESH:D013256)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** ENDO — Homo sapiens (Human), Transformed cell line (CVCL_E711), DCT — Mus musculus (Mouse), Transformed cell line (CVCL_B6H8), ICB — Homo sapiens (Human), Ewing sarcoma, Cancer cell line (CVCL_W328)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12904586/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12904586/full.md

## References

71 references — full list in the complete paper: https://tomesphere.com/paper/PMC12904586/full.md

---
Source: https://tomesphere.com/paper/PMC12904586