# Mass spectrometry-based multi-omics analysis elucidates immune microenvironmental characteristics and the risk of distant metastasis in N1c colorectal cancer

**Authors:** Baiwang Zhu, Chenxiao Zheng, Hangjia Xu, Yating Zheng, Yunfei Liu, Peng Li, Haobo Jin, Binbin Ou, Yanyu Chen, Zihan Wang, Qiongying Zhang, Xiaodong Zhang, Yifei Pan

PMC · DOI: 10.3389/fimmu.2026.1590042 · Frontiers in Immunology · 2026-02-17

## TL;DR

This study uses mass spectrometry and machine learning to uncover molecular and immune features of N1c colorectal cancer, identifying genes and proteins linked to metastasis and poor immunotherapy response.

## Contribution

The study introduces a novel machine learning model for predicting prognosis and immunotherapy response in N1c CRC, and identifies PLOD1 as a potential biomarker and therapeutic target.

## Key findings

- N1c CRC samples show increased ECM remodeling and EMT, linked to metastasis and poor prognosis.
- A machine learning model based on 10 TDRGs accurately predicts DFS and immune activity across multiple cohorts.
- PLOD1 expression in fibroblasts correlates with collagen fiber abundance and promotes CRC cell migration and invasion.

## Abstract

Colorectal cancer (CRC) ranks third in global cancer incidence and second in mortality. Tumor deposit (TD), a specific regional spread form, is a crucial independent risk factor for survival and have biological differences from lymph node metastasis (LNM). However, it is underestimated in current staging systems, which results in biased treatment decisions and prognosis evaluation. Moreover, the biological features and distant metastasis patterns of N1c CRC remain largely unknown.

We performed Data-independent Acquisition Mass Spectrometry (DIA-MS) analysis of formalin-fixed, paraffin-embedded (FFPE) samples from 13 T1-T4N1cM1 CRC patients to reveal their molecular characteristics. 9 machine learning algorithms identified 10 TD-related metastasis genes (TDRGs), the multi-cohort validation in 1,582 CRC patients confirmed their role in prognosis. Immune landscape and immunotherapy response were assessed by the CIBERSORT, tumor mutation burden (TMB), Tumor Immune Dysfunction and Exclusion (TIDE) score, Consensus molecular subtype (CMS), immune checkpoint gene expression. scRNA-seq analysis identified Procollagen-Lysine,2-Oxoglutarate 5-Dioxygenase 1 (PLOD1) expression in CRC, Immunohistochemical staining (IHC) and Masson’s trichrome staining were used to assess PLOD1 expression and collagen fiber content in CRC, its role in tumor invasion and migration was elucidated by wound healing and transwell assays.

N1c samples exhibit enhanced extracellular matrix (ECM) remodeling and epithelial mesenchymal transition (EMT). The TDRGs identified by machine learning robustly predicting disease-free survival (DFS) across multiple cohorts (Mean C-index = 0.72) and immune activity. A higher risk score predicted early metastasis and poorer response to immunotherapy, marked by lower level of immune infiltration, higher TIDE scores, lower TMB, and downregulated immune checkpoint genes. scRNA-seq analysis pinpoints highest PLOD1 expression in fibroblasts. Histological analysis of N1c samples demonstrated PLOD1 expression patterns and their significant correlation with stromal collagen fiber abundance (p < 0.01). Wound healing and transwell assays indicating the knockdown of PLOD1 hinders the migration and invasion of CRC DLD-1 cell.

We assessed the protein expression profiles and pathway characteristics of N1c CRC. The model developed based on TDRGs, effectively predicted DFS and immunotherapy response, supporting precision treatment and staging system optimization. PLOD1’s role in ECM remodeling and CRC cell migration and invasion suggests its potential as a prognostic biomarker and therapeutic target.

## Linked entities

- **Genes:** PLOD1 (procollagen-lysine,2-oxoglutarate 5-dioxygenase 1) [NCBI Gene 5351]
- **Diseases:** colorectal cancer (MONDO:0005575), CRC (MONDO:0005575)

## Full-text entities

- **Genes:** TWIST2 (twist family bHLH transcription factor 2) [NCBI Gene 117581] {aka AMS, BBRSAY, DERMO1, FFDD3, SETLSS, bHLHa39}, ABL1 (ABL proto-oncogene 1, non-receptor tyrosine kinase) [NCBI Gene 25] {aka ABL, BCR-ABL, CHDSKM, JTK7, bcr/abl, c-ABL}, KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, TGFB1 (transforming growth factor beta 1) [NCBI Gene 7040] {aka CAEND1, CED, DPD1, IBDIMDE, LAP, TGF-beta1}, SNAI2 (snail family transcriptional repressor 2) [NCBI Gene 6591] {aka SLUG, SLUGH, SLUGH1, SNAIL2, WS2D}, COL3A1 (collagen type III alpha 1 chain) [NCBI Gene 1281] {aka EDS4A, EDSVASC, PMGEDSV}, CTLA4 (cytotoxic T-lymphocyte associated protein 4) [NCBI Gene 1493] {aka ALPS5, CD, CD152, CELIAC3, CTLA-4, GRD4}, ZEB2 (zinc finger E-box binding homeobox 2) [NCBI Gene 9839] {aka HSPC082, SIP-1, SIP1, SMADIP1, ZFHX1B}, TWIST1 (twist family bHLH transcription factor 1) [NCBI Gene 7291] {aka ACS3, BPES2, BPES3, CRS, CRS1, CSO}, TTN (titin) [NCBI Gene 7273] {aka CMD1G, CMH9, CMPD4, CMYO5, CMYP5, EOMFC}, MMP9 (matrix metallopeptidase 9) [NCBI Gene 4318] {aka CLG4B, GELB, MANDP2, MMP-9}, CDH1 (cadherin 1) [NCBI Gene 999] {aka Arc-1, BCDS1, CD324, CDHE, ECAD, LCAM}, LOX (lysyl oxidase) [NCBI Gene 4015] {aka AAT10}, HIF1A (hypoxia inducible factor 1 subunit alpha) [NCBI Gene 3091] {aka HIF-1-alpha, HIF-1A, HIF-1alpha, HIF1, HIF1-ALPHA, MOP1}, DDR1 (discoidin domain receptor tyrosine kinase 1) [NCBI Gene 780] {aka CAK, CD167, DDR, EDDR1, HGK2, MCK10}, CD274 (CD274 molecule) [NCBI Gene 29126] {aka ADMIO5, B7-H, B7H1, PD-L1, PDCD1L1, PDCD1LG1}, SNAI1 (snail family transcriptional repressor 1) [NCBI Gene 6615] {aka SLUGH2, SNA, SNAH, SNAIL, SNAIL1, dJ710H13.1}, PIK3CB (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit beta) [NCBI Gene 5291] {aka P110BETA, PI3K, PI3KBETA, PIK3C1}, GAPDH (glyceraldehyde-3-phosphate dehydrogenase) [NCBI Gene 2597] {aka G3PD, GAPD, HEL-S-162eP}, HSPH1 (heat shock protein family H (Hsp110) member 1) [NCBI Gene 10808] {aka HSP105, HSP105A, HSP105B, NY-CO-25}, MMRN1 (multimerin 1) [NCBI Gene 22915] {aka ECM, EMILIN4, GPIa*, MMRN}, COL6A2 (collagen type VI alpha 2 chain) [NCBI Gene 1292] {aka BTHLM1, BTHLM1B, PP3610, UCMD1, UCMD1B}, FOXC2 (forkhead box C2) [NCBI Gene 2303] {aka FKHL14, LD, MFH-1, MFH1}, NDUFA8 (NADH:ubiquinone oxidoreductase subunit A8) [NCBI Gene 4702] {aka CI-19KD, CI-PGIV, MC1DN37, PGIV}, PLOD1 (procollagen-lysine,2-oxoglutarate 5-dioxygenase 1) [NCBI Gene 5351] {aka EDS6, EDSKCL1, LH, LH1, LLH, PLOD}, SEC31A (SEC31 homolog A, COPII component) [NCBI Gene 22872] {aka ABP125, ABP130, HPBKS, HSPC275, HSPC334, NEDSOSB}, APC (APC regulator of Wnt signaling pathway) [NCBI Gene 324] {aka BTPS2, DESMD, DP2, DP2.5, DP3, GS}, IHH (Indian hedgehog signaling molecule) [NCBI Gene 3549] {aka BDA1, HHG2}, PDCD1 (programmed cell death 1) [NCBI Gene 5133] {aka ADMIO4, AIMTBS, CD279, PD-1, PD1, SLEB2}, ZEB1 (zinc finger E-box binding homeobox 1) [NCBI Gene 6935] {aka AREB6, BZP, DELTAEF1, FECD6, NIL2A, PPCD3}, CXCL14 (C-X-C motif chemokine ligand 14) [NCBI Gene 9547] {aka BMAC, BRAK, KEC, KS1, MIP-2g, MIP2G}, PHGDH (phosphoglycerate dehydrogenase) [NCBI Gene 26227] {aka 3-PGDH, 3PGDH, HEL-S-113, NLS, NLS1, PDG}, FMOD (fibromodulin) [NCBI Gene 2331] {aka FM, SLRR2E}, PLK1 (polo like kinase 1) [NCBI Gene 5347] {aka PLK, STPK13}, HSPA4 (heat shock protein family A (Hsp70) member 4) [NCBI Gene 3308] {aka APG-2, HEL-S-5a, HS24/P52, HSPH2, RY, hsp70}, PTCH1 (patched 1) [NCBI Gene 5727] {aka BCNS, BCNS1, NBCCS, PTC, PTC1, PTCH}, AKT1 (AKT serine/threonine kinase 1) [NCBI Gene 207] {aka AKT, PKB, PKB-ALPHA, PRKBA, RAC, RAC-ALPHA}, MYC (MYC proto-oncogene, bHLH transcription factor) [NCBI Gene 4609] {aka MRTL, MYCC, bHLHe39, c-Myc}, SRC (SRC proto-oncogene, non-receptor tyrosine kinase) [NCBI Gene 6714] {aka ASV, SRC1, THC6, c-SRC, p60-Src}, TENM1 (teneurin transmembrane protein 1) [NCBI Gene 10178] {aka ODZ1, ODZ3, TEN-M1, TEN1, TNM, TNM1}, STAT1 (signal transducer and activator of transcription 1) [NCBI Gene 6772] {aka CANDF7, IMD31A, IMD31B, IMD31C, ISGF-3, STAT91}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, CHDH (choline dehydrogenase) [NCBI Gene 55349], ITGA2 (integrin subunit alpha 2) [NCBI Gene 3673] {aka BR, CD49B, FMAIT3, GPIa, HPA-5, VLA-2}, BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, MMP2 (matrix metallopeptidase 2) [NCBI Gene 4313] {aka CLG4, CLG4A, MMP-2, MMP-II, MONA, TBE-1}, RTCB (RNA 2',3'-cyclic phosphate and 5'-OH ligase) [NCBI Gene 51493] {aka C22orf28, DJ149A16.6, FAAP, HSPC117, TSLIG5}, PRKAB1 (protein kinase AMP-activated non-catalytic subunit beta 1) [NCBI Gene 5564] {aka AMPK, HAMPKb}, NFKB1 (nuclear factor kappa B subunit 1) [NCBI Gene 4790] {aka CVID12, EBP-1, KBF1, NF-kB, NF-kB1, NF-kappa-B1}
- **Diseases:** peritoneal metastasis (MESH:D010538), TIDE (MESH:D007154), urothelial carcinoma (MESH:D014523), DEPs (MESH:D001039), CRC (MESH:D015179), deaths (MESH:D003643), Amoebiasis (MESH:D000562), liver metastases (MESH:D009362), thrombus (MESH:D013927), CCI (MESH:D002292), bladder cancer (MESH:D001749), Dysfunction (MESH:D006331), gastrointestinal cancer (MESH:D005770), breast cancer (MESH:D001943), LNM (MESH:D008207), G/GEJA (MESH:D000230), Cancer (MESH:D009369), TD (MESH:D000079822), Small cell lung cancer (MESH:D055752), cervical cancer (MESH:D002583), Fibrosis (MESH:D005355), T cell Dysfunction (MESH:C536780), colorectal adenocarcinoma (MESH:D003110), CMS (MESH:C535673), COAD (MESH:D029424), hypoxic (MESH:D002534), gastric adenocarcinoma (MESH:D013274)
- **Chemicals:** Fatty acid (MESH:D005227), onvansertib (MESH:C000706408), Dasatinib (MESH:D000069439), hematoxylin (MESH:D006416), penicillin (MESH:D010406), HE (MESH:D006371), Masson A (-), formalin (MESH:D005557), DAB (MESH:C000469), PVDF (MESH:C024865), PBS (MESH:D007854), paraformaldehyde (MESH:C003043), DIA (MESH:C076868), citrate (MESH:D019343), CO2 (MESH:D002245), streptomycin (MESH:D013307), Irinotecan (MESH:D000077146), paclitaxel (MESH:D017239), xylene (MESH:D014992), formic acid (MESH:C030544), methanol (MESH:D000432), C-6-S (MESH:D002809), paraffin (MESH:D010232), BI2536 (MESH:C518477), bevacizumab (MESH:D000068258), Oxaliplatin (MESH:D000077150), ethanol (MESH:D000431), HCl (MESH:D006851), acetic acid (MESH:D019342), SDS (MESH:D012967), Staurosporine (MESH:D019311), 5-Fluorouracil (MESH:D005472)
- **Species:** Homo sapiens (human, species) [taxon 9606], Mus musculus (house mouse, species) [taxon 10090]
- **Cell lines:** DLD-1 — Homo sapiens (Human), Colon adenocarcinoma, Cancer cell line (CVCL_0248), CBP60037 — Homo sapiens (Human), Induced pluripotent stem cell (CVCL_RV79)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12953441/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12953441/full.md

## References

63 references — full list in the complete paper: https://tomesphere.com/paper/PMC12953441/full.md

---
Source: https://tomesphere.com/paper/PMC12953441