# ACE: A Versatile Contrastive Learning Framework for Single-cell Mosaic Integration

**Authors:** Xuhua Yan, Jinmiao Chen, Ruiqing Zheng, Min Li

PMC · DOI: 10.1093/gpbjnl/qzaf062 · 2025-08-04

## TL;DR

ACE is a new framework that improves the integration of single-cell multi-omics data by aligning and completing missing modalities across datasets.

## Contribution

ACE introduces two novel strategies, ACE-align and ACE-spec, for mosaic integration of single-cell data using contrastive learning and regression.

## Key findings

- ACE-align uses contrastive learning to align modalities and reveal shared latent representations.
- ACE-spec enhances cellular heterogeneity representation in datasets with incomplete modalities.
- ACE outperforms existing methods in various mosaic integration scenarios.

## Abstract

The integration of single-cell multi-omics datasets is critical for deciphering cellular heterogeneities. Mosaic integration, the most general integration task, poses a greater challenge regarding disparity in modality abundance across datasets. Here, we present Align and CompletE (ACE), a mosaic integration framework that assembles two types of strategies to handle this problem: modality alignment-based strategy (ACE-align) and regression-based strategy (ACE-spec). ACE-align utilizes a novel contrastive learning objective for explicit modality alignment to uncover the shared latent representations behind modalities. ACE-spec combines the modality alignment results and modality-specific representations to construct complete multi-omics representations for all datasets. Extensive experiments across various mosaic integration scenarios demonstrate the superiority of ACE’s two strategies over existing methods. Application of ACE-spec to bi-modal and tri-modal integration scenarios showcases that ACE-spec is able to enhance the representation of cellular heterogeneities for datasets with incomplete modalities. The source code of ACE can be accessed at https://github.com/CSUBioGroup/ACE-main.

## Full-text entities

- **Genes:** S100A8 (S100 calcium binding protein A8) [NCBI Gene 6279] {aka 60B8AG, CAGA, CFAG, CGLA, CP-10, L1Ag}, CD1C (CD1c molecule) [NCBI Gene 911] {aka BDCA1, CD1, R7}, KLRB1 (killer cell lectin like receptor B1) [NCBI Gene 3820] {aka CD161, CLEC5B, NKR, NKR-P1, NKR-P1A, NKRP1A}, OXNAD1 (oxidoreductase NAD binding domain containing 1) [NCBI Gene 92106] {aka RFTN1-AS1}, AP2B1 (adaptor related protein complex 2 subunit beta 1) [NCBI Gene 163] {aka ADTB2, AP105B, AP2-BETA, CLAPB1}, MAP9 (microtubule associated protein 9) [NCBI Gene 79884] {aka ASAP}, PRF1 (perforin 1) [NCBI Gene 5551] {aka HPLH2, P1, PFP}, ITGAX (integrin subunit alpha X) [NCBI Gene 3687] {aka CD11C, SLEB6}, TRBV20OR9-2 (T cell receptor beta variable 20/OR9-2 (non-functional)) [NCBI Gene 6962] {aka CDR3, TCRBV20S2, TCRBV2O, TCRBV2S2O}, CD14 (CD14 molecule) [NCBI Gene 929], IL32 (interleukin 32) [NCBI Gene 9235] {aka IL-32alpha, IL-32beta, IL-32delta, IL-32gamma, NK4, TAIF}, CLEC10A (C-type lectin domain containing 10A) [NCBI Gene 10462] {aka CD301, CLECSF13, CLECSF14, DC-ASGPR, HML, HML2}, CD79A (CD79a molecule) [NCBI Gene 973] {aka IGA, IGAlpha, MB-1, MB1}, ITM2C (integral membrane protein 2C) [NCBI Gene 81618] {aka BRI3, BRICD2C, E25, E25C, ITM3}, IL2RA (interleukin 2 receptor subunit alpha) [NCBI Gene 3559] {aka CD25, IDDM10, IL2R, IMD41, TCGFR, p55}, FCER1A (Fc epsilon receptor Ia) [NCBI Gene 2205] {aka FCE1A, FCERIA, FcERI}, GZMB (granzyme B) [NCBI Gene 3002] {aka C11, CCPI, CGL-1, CGL1, CSP-B, CSPB}, IL3RA (interleukin 3 receptor subunit alpha) [NCBI Gene 3563] {aka CD123, IL-3R-alpha, IL3R, IL3RAY, IL3RX, IL3RY}, KLRD1 (killer cell lectin like receptor D1) [NCBI Gene 3824] {aka CD94}, JUNB (JunB proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3726] {aka AP-1}, LYZ (lysozyme) [NCBI Gene 4069] {aka AMYLD5, LYZF1, LZM}, TIGIT (T cell immunoreceptor with Ig and ITIM domains) [NCBI Gene 201633] {aka VSIG9, VSTM3, WUCAM}, GNLY (granulysin) [NCBI Gene 10578] {aka D2S69E, LAG-2, LAG2, NKG5, TLA519}, HLA-DQA1 (major histocompatibility complex, class II, DQ alpha 1) [NCBI Gene 3117] {aka CELIAC1, DQ-A1, DQA1, HLA-DQA, HLA-DQA1*}, IL7R (interleukin 7 receptor) [NCBI Gene 3575] {aka CD127, CDW127, IL-7R-alpha, IL-7Ralpha, IL7RA, IL7Ralpha}, PRSS57 (serine protease 57) [NCBI Gene 400668] {aka NSP4, PRSSL1, UNQ782}, CDKN1C (cyclin dependent kinase inhibitor 1C) [NCBI Gene 1028] {aka BWCR, BWS, KIP2, WBS, p57, p57Kip2}, CST7 (cystatin F) [NCBI Gene 8530] {aka CMAP}, MS4A1 (membrane spanning 4-domains A1) [NCBI Gene 931] {aka B1, Bp35, CD20, CVID5, FMC7, LEU-16}, FCGR3A (Fc gamma receptor IIIa) [NCBI Gene 2214] {aka CD16-II, CD16A, FCG3, FCGR3, FCRIIIA, FcGRIIIA}, TCF7 (transcription factor 7) [NCBI Gene 6932] {aka TCF-1}, FOXP3 (forkhead box P3) [NCBI Gene 50943] {aka AIID, DIETER, IPEX, JM2, PIDX, XPID}, XCL1 (X-C motif chemokine ligand 1) [NCBI Gene 6375] {aka ATAC, LPTN, LTN, SCM-1, SCM-1a, SCM1}, CD3G (CD3 gamma subunit of T-cell receptor complex) [NCBI Gene 917] {aka CD3-GAMMA, CD3GAMMA, IMD17, T3G}, CD8B (CD8 subunit beta) [NCBI Gene 926] {aka CD8B1, CD8beta, LEU2, LY3, LYT3, Ly-3}, LEF1 (lymphoid enhancer binding factor 1) [NCBI Gene 51176] {aka ECTD1, ECTD17, LEF-1, TCF10, TCF1ALPHA, TCF7L3}, GZMK (granzyme K) [NCBI Gene 3003] {aka GrK, TRYP2}, GZMH (granzyme H) [NCBI Gene 2999] {aka CCP-X, CGL-2, CSP-C, CTLA1, CTSGL2}, CD3E (CD3 epsilon subunit of T-cell receptor complex) [NCBI Gene 916] {aka CD3epsilon, IMD18, T3E, TCRE}, RALGPS2 (Ral GEF with PH domain and SH3 binding motif 2) [NCBI Gene 55103] {aka dJ595C2.1}, TRGC1 (T cell receptor gamma constant 1) [NCBI Gene 6966] {aka C1, TCRG, TCRGC1}, MS4A7 (membrane spanning 4-domains A7) [NCBI Gene 58475] {aka 4SPAN2, CD20L4, CFFM4, MS4A8}, CCL5 (C-C motif chemokine ligand 5) [NCBI Gene 6352] {aka D17S136E, RANTES, SCYA5, SIS-delta, SISd, TCP228}, CCR7 (C-C motif chemokine receptor 7) [NCBI Gene 1236] {aka BLR2, CC-CKR-7, CCR-7, CD197, CDw197, CMKBR7}, CD34 (CD34 molecule) [NCBI Gene 947], S100A9 (S100 calcium binding protein A9) [NCBI Gene 6280] {aka 60B8AG, CAGB, CFAG, CGLB, L1AG, LIAG}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, EGFL7 (EGF like domain multiple 7) [NCBI Gene 51162] {aka NEU1, VE-STATIN, ZNEU1}, SERPINF1 (serpin family F member 1) [NCBI Gene 5176] {aka EPC-1, OI12, OI6, PEDF, PIG35}, GATA2 (GATA binding protein 2) [NCBI Gene 2624] {aka DCML, IMD21, MONOMAC, NFE1B}, HES4 (hes family bHLH transcription factor 4) [NCBI Gene 57801] {aka bHLHb42}, CD3D (CD3 delta subunit of T-cell receptor complex) [NCBI Gene 915] {aka CD3-DELTA, CD3DELTA, IMD19, T3D}, RTKN2 (rhotekin 2) [NCBI Gene 219790] {aka PLEKHK1, bA531F24.1}, TRDC (T cell receptor delta constant) [NCBI Gene 28526] {aka TCRD}
- **Diseases:** sepsis (MESH:D018805), primary immunodeficiency (MESH:D000081207), inflammatory (MESH:D007249), CL (MESH:D007859), ARDS (MESH:D012128), VP (MESH:D046350), MAIT (MESH:D001260), ARI (MESH:D000275)
- **Chemicals:** Li (MESH:D008094), BM-CITE (-), VP (MESH:C038467)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12582371/full.md

---
Source: https://tomesphere.com/paper/PMC12582371