# Modeling combinatorial regulation from single-cell multi-omics provides regulatory units underpinning cell type landscape using cRegulon

**Authors:** Zhanying Feng, Xi Chen, Zhana Duren, Jingxue Xin, Hao Miao, Qiuyue Yuan, Yong Wang, Wing Hung Wong

PMC · DOI: 10.1186/s13059-025-03680-w · Genome Biology · 2025-07-24

## TL;DR

This paper introduces cRegulon, a new method that identifies regulatory modules from single-cell data to better understand how gene regulation shapes different cell types.

## Contribution

cRegulon models combinatorial regulation to infer reusable regulatory modules from single-cell multi-omics data.

## Key findings

- cRegulon outperforms existing methods in identifying transcription factor combinatorial modules.
- The method improves cell type annotation using regulatory units derived from GRNs.

## Abstract

Advances in single-cell technology enable large-scale generation of omics data, promising for clarifying gene regulatory networks governing different cell type/states. Nonetheless, prevailing methods fail to account for universal and reusable regulatory modules in GRNs, which are fundamental underpinnings of cell type landscape. We introduce cRegulon to infer regulatory modules by modeling combinatorial regulation of transcription factors based on diverse GRNs from single-cell multi-omics data. Through benchmarking and applications using simulated datasets and real datasets, cRegulon outperforms existing approaches in identifying TF combinatorial modules as regulatory units and annotating cell types. cRegulon offers new insights and methodology into combinatorial regulation.

The online version contains supplementary material available at 10.1186/s13059-025-03680-w.

## Full-text entities

- **Genes:** TBX5 (T-box transcription factor 5) [NCBI Gene 6910] {aka HOS}, Pou3f2 (POU domain, class 3, transcription factor 2) [NCBI Gene 18992] {aka 9430075J19Rik, A230098E07Rik, Brn-2, Brn2, OTF-7, Otf7}, SOX8 (SRY-box transcription factor 8) [NCBI Gene 30812], CHIA (chitinase acidic) [NCBI Gene 27159] {aka AMCASE, CHIT2, TSA1902}, Gem (GTP binding protein overexpressed in skeletal muscle) [NCBI Gene 14579], JUN (Jun proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 3725] {aka AP-1, AP1, c-Jun, cJUN, p39}, HNF1A (HNF1 homeobox A) [NCBI Gene 6927] {aka HNF-1-alpha, HNF-1A, HNF1, HNF1alpha, IDDM20, LFB1}, MAFA (MAF bZIP transcription factor A) [NCBI Gene 389692] {aka INSDM, RIPE3b1, hMafA}, TWIST1 (twist family bHLH transcription factor 1) [NCBI Gene 7291] {aka ACS3, BPES2, BPES3, CRS, CRS1, CSO}, E2F1 (E2F transcription factor 1) [NCBI Gene 1869] {aka E2F-1, RBAP1, RBBP3, RBP3}, POU5F1 (POU class 5 homeobox 1) [NCBI Gene 5460] {aka OCT3, OCT4, OCT4Borf1, OTF-3, OTF3, OTF4}, Foxa2 (forkhead box A2) [NCBI Gene 15376] {aka HNF3-beta, HNF3beta, Hnf-3b, Hnf3b, Tcf-3b, Tcf3b}, LIF (LIF interleukin 6 family cytokine) [NCBI Gene 3976] {aka CDF, DIA, HILDA, MLPLI}, Hnf4a (hepatic nuclear factor 4, alpha) [NCBI Gene 15378] {aka HNF-4, Hnf4, Hnf4alpha, MODY1, Nr2a1, TCF-14}, F3 (coagulation factor III, tissue factor) [NCBI Gene 2152] {aka CD142, TF, TFA}, IRF8 (interferon regulatory factor 8) [NCBI Gene 3394] {aka H-ICSBP, ICSBP, ICSBP1, IMD32A, IMD32B, IRF-8}, POU3F2 (POU class 3 homeobox 2) [NCBI Gene 5454] {aka BRN2, N-Oct3, OCT7, OTF-7, OTF7, POUF3}, TAL1 (TAL bHLH transcription factor 1, erythroid differentiation factor) [NCBI Gene 6886] {aka SCL, TCL5, bHLHa17, tal-1}, SLC6A1 (solute carrier family 6 member 1) [NCBI Gene 6529] {aka GABATHG, GABATR, GAT1, MAE, hGAT-1}, MYOD1 (myogenic differentiation 1) [NCBI Gene 4654] {aka CMYO17, CMYP17, MYF3, MYOD, MYODRIF, PUM}, HOXD11 (homeobox D11) [NCBI Gene 3237] {aka HOX4, HOX4F}, SCGN (secretagogin, EF-hand calcium binding protein) [NCBI Gene 10590] {aka CALBL, DJ501N12.8, SECRET, SEGN, setagin}, ZBTB12 (zinc finger and BTB domain containing 12) [NCBI Gene 221527] {aka Bat9, C6orf46, D6S59E, G10, NG35}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, FOS (Fos proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 2353] {aka AP-1, C-FOS, p55}, CHGA (chromogranin A) [NCBI Gene 1113] {aka CGA, PHE5, PHES}, NFIC (nuclear factor I C) [NCBI Gene 4782] {aka CTF, CTF5, NF-I, NF-I/C, NF1-C, NFI}, MLX (MAX dimerization protein MLX) [NCBI Gene 6945] {aka MAD7, MXD7, TCFL4, TF4, bHLHd13}, CUX2 (cut like homeobox 2) [NCBI Gene 23316] {aka CDP2, CUTL2, DEE67, EIEE67}, OLIG2 (oligodendrocyte transcription factor 2) [NCBI Gene 10215] {aka BHLHB1, OLIGO2, PRKCBP2, RACK17, bHLHe19}, NFE2 (nuclear factor, erythroid 2) [NCBI Gene 4778] {aka NF-E2, p45}, SOX10 (SRY-box transcription factor 10) [NCBI Gene 6663] {aka DOM, PCWH, SOX-10, WS2E, WS4, WS4C}, Xcl1 (chemokine (C motif) ligand 1) [NCBI Gene 16963] {aka ATAC, LTN, Lptn, SCM-1, SCM-1a, Scyc1}, MYOG (myogenin) [NCBI Gene 4656] {aka MYF4, bHLHc3, myf-4}, SLC6A11 (solute carrier family 6 member 11) [NCBI Gene 6538] {aka GAT-3, GAT3, GAT4}, NFE2L3 (NFE2 like bZIP transcription factor 3) [NCBI Gene 9603] {aka NRF3}, POU4F3 (POU class 4 homeobox 3) [NCBI Gene 5459] {aka BRN3C, DFNA15, DFNA42, DFNA52}, IGKV5-2 (immunoglobulin kappa variable 5-2) [NCBI Gene 28907] {aka B2, IGKV52}, Sox2 (SRY (sex determining region Y)-box 2) [NCBI Gene 20674] {aka Sox-2, lcc, ysb}, Itpr3 (inositol 1,4,5-triphosphate receptor 3) [NCBI Gene 16440] {aka IP3R 3, IP3R-3, Ip3r3, Itpr-3, tf}, FOSL2 (FOS like 2, AP-1 transcription factor subunit) [NCBI Gene 2355] {aka ACED, FRA2}, TRIM63 (tripartite motif containing 63) [NCBI Gene 84676] {aka CMH31, IRF, MURF1, MURF2, RNF28, SMRZ}, FOXL2 (forkhead box L2) [NCBI Gene 668] {aka BPES, BPES1, PFRK, PINTO, POF3}, Olig1 (oligodendrocyte transcription factor 1) [NCBI Gene 50914] {aka Bhlhb6, Olg-1, Oligo1, bHLHe21}, GATA4 (GATA binding protein 4) [NCBI Gene 2626] {aka ASD2, TACHD, TOF, VSD1}, SOX1 (SRY-box transcription factor 1) [NCBI Gene 6656], IGKV7-3 (immunoglobulin kappa variable 7-3 (pseudogene)) [NCBI Gene 28905] {aka B1, IGKV73}, FOXA3 (forkhead box A3) [NCBI Gene 3171] {aka FKHH3, HNF3G, TCF3G}, STAR (steroidogenic acute regulatory protein) [NCBI Gene 6770] {aka STARD1}, FOXA2 (forkhead box A2) [NCBI Gene 3170] {aka HNF-3-beta, HNF3B, TCF3B}, NEUROD1 (neuronal differentiation 1) [NCBI Gene 4760] {aka BETA2, BHF-1, MODY6, NEUROD, T2D, bHLHa3}, Ascl1 (achaete-scute family bHLH transcription factor 1) [NCBI Gene 17172] {aka ASH1, Mash1, bHLHa46}, PAEP (progestagen associated endometrial protein) [NCBI Gene 5047] {aka GD, GdA, GdF, GdS, PAEG, PEP}, MYT1L (myelin transcription factor 1 like) [NCBI Gene 23040] {aka MRD39, NZF1, ZC2H2C2, ZC2HC4B, myT1-L}, HOXB5 (homeobox B5) [NCBI Gene 3215] {aka HHO.C10, HOX2, HOX2A, HU-1, Hox2.1}, GATA1 (GATA binding protein 1) [NCBI Gene 2623] {aka CNSHA9, ERYF1, GATA-1, GF-1, GF1, HAEADA}, TGM6 (transglutaminase 6) [NCBI Gene 343641] {aka SCA35, TG6, TGM3L, TGY, dJ734P14.3}, ZIC1 (Zic family zinc finger 1) [NCBI Gene 7545] {aka BAIDCS, CRS6, ZIC, ZNF201}, SOX21 (SRY-box transcription factor 21) [NCBI Gene 11166] {aka SOX-A, SOX25}, MYF6 (myogenic factor 6) [NCBI Gene 4618] {aka CNM3, MRF4, bHLHc4, myf-6}
- **Diseases:** RA (MESH:D011015), cancer (MESH:D009369), Maturity onset diabetes of the young, type 8 (MESH:C565101), MODY8 (MESH:C565225), mycoplasma (MESH:D009175), erythroleukemia (MESH:D004915), TGs (MESH:C537680), diabetes (MESH:D003920), tumorigenesis (MESH:D063646)
- **Chemicals:** amino acids (MESH:D000596), Streptomycin (MESH:D013307), fatty acid (MESH:D005227), PBS (MESH:D007854), lipid (MESH:D008055), BP (-), Penicillin (MESH:D010406), GlutaMax (MESH:C054122), glucose (MESH:D005947), cholesterol (MESH:D002784), RA (MESH:D014212)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Bos taurus (bovine, species) [taxon 9913], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** S2d — Mus musculus (Mouse), Hybridoma (CVCL_C5HT), mESC — Mus musculus (Mouse), Embryonic stem cell (CVCL_4378), K562 — Homo sapiens (Human), Blast phase chronic myelogenous leukemia, BCR-ABL1 positive, Cancer cell line (CVCL_0004), ESC — Homo sapiens (Human), Embryonic stem cell (CVCL_9771), fibroblasts — Mus musculus (Mouse), Spontaneously immortalized cell line (CVCL_0594), HepG2 — Homo sapiens (Human), Hepatoblastoma, Cancer cell line (CVCL_0027), MEF — Mus musculus (Mouse), Finite cell line (CVCL_9115), C10 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_C7T6), BJ — Homo sapiens (Human), Telomerase immortalized cell line (CVCL_6573), H1 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_HA53), GM12878 — Homo sapiens (Human), Transformed cell line (CVCL_7526)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12291291/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12291291/full.md

## References

8 references — full list in the complete paper: https://tomesphere.com/paper/PMC12291291/full.md

---
Source: https://tomesphere.com/paper/PMC12291291