# Integrating multi-omics and machine learning to unravel mechanisms of lymph node metastasis in papillary carcinoma with and without thyroiditis

**Authors:** Shuran Chen, Wencan Yang, Yanyan Liu, Angqing Li

PMC · DOI: 10.3389/fimmu.2026.1708330 · Frontiers in Immunology · 2026-02-18

## TL;DR

This study explores how lymph node metastasis differs between two types of thyroid cancer, revealing distinct mechanisms and a 17-gene model to predict metastasis risk.

## Contribution

The study identifies subtype-specific mechanisms of metastasis and introduces a predictive model and a novel causal gene for thyroid cancer.

## Key findings

- PTC-blank is linked to ECM remodeling and fibroblast infiltration, while PTC-thyroiditis involves immune-related pathways.
- A 17-gene model accurately predicts lymph node metastasis risk in PTC patients.
- SHISA5 is identified as a causal gene with potential therapeutic value when bound to acetaminophen.

## Abstract

Papillary thyroid carcinoma (PTC) is the most common thyroid malignancy, with lymph node metastasis (LNM) significantly influencing prognosis. PTC cases were categorized based on the presence or absence of coexisting thyroiditis: those with thyroiditis were designated as PTC-thyroiditis, and those without as PTC-blank. Although both subtypes exhibit distinct clinical and genomic profiles, the mechanisms underlying LNM—particularly the role of extracellular matrix (ECM) remodeling and cancer-associated fibroblasts (CAFs)—remain poorly understood. This study aimed to compare the molecular and cellular heterogeneity of PTC-blank and PTC-thyroiditis, identify key drivers of LNM, and develop a predictive model for clinical use.

We integrated bulk RNA-seq data from the TCGA database and single-cell RNA-seq (scRNA-seq) data from GSE184362 to analyze clinical, genomic, and tumor microenvironment (TME) differences between PTC-blank and PTC-thyroiditis. Differential expression analysis, Gene Set Enrichment Analysis (GSEA), and immune cell infiltration analysis were performed. Fibroblast subpopulations were characterized at single-cell resolution. Machine learning algorithms (LASSO, random forest, KNN) were applied to identify LNM-related genes and construct a predictive model. Mendelian randomization (MR) and molecular docking were used to validate causal genes and potential drug interactions.

PTC-blank exhibited higher T, N, and M stages and increased mutations in BRAF and MUC16 compared to PTC-thyroiditis. LNM in PTC-blank was associated with ECM remodeling and collagen fiber accumulation, associated with a distinct PI16+ fibroblast subcluster with active ECM organization functions. In contrast, LNM in PTC-thyroiditis involved immune-related pathways without significant fibroblast infiltration or ECM changes. A 17-gene predictive model for LNM was developed, with the KNN classifier demonstrating high accuracy. MR analysis identified SHISA5 as a causal risk gene for thyroid cancer, and molecular docking revealed strong binding affinity with acetaminophen, suggesting therapeutic potential.

PTC-blank and PTC-thyroiditis exhibit distinct LNM mechanisms: ECM remodeling and fibroblast infiltration are associated with metastasis in PTC-blank, while immune dysregulation appears to be more prominent in PTC-thyroiditis. The identified 17-gene model offers robust predictive value for LNM risk, and SHISA5 represents a novel causal gene and potential therapeutic target. These findings provide insights into subtype-specific management strategies for PTC patients.

## Linked entities

- **Genes:** BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673], MUC16 (mucin 16, cell surface associated) [NCBI Gene 94025], SHISA5 (shisa family member 5) [NCBI Gene 51246]
- **Chemicals:** acetaminophen (PubChem CID 1983)
- **Diseases:** papillary thyroid carcinoma (MONDO:0005075), thyroiditis (MONDO:0004126)

## Full-text entities

- **Genes:** S100A6 (S100 calcium binding protein A6) [NCBI Gene 6277] {aka 2A9, 5B10, CABP, CACY, PRA, S10A6}, POSTN (periostin) [NCBI Gene 10631] {aka OSF-2, OSF2, PDLPOSTN, PN}, Cd79b (CD79B antigen) [NCBI Gene 15985] {aka B29, Ig-beta, Igb, Igbeta}, TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, Il7r (interleukin 7 receptor) [NCBI Gene 16197] {aka CD127, IL-7Ralpha}, Fcer1g (Fc receptor, IgE, high affinity I, gamma polypeptide) [NCBI Gene 14127] {aka CD23, FcR-gamma, FcR[g], FcRgamma, Fce1g, FcepsilonRI}, BRAF (B-Raf proto-oncogene, serine/threonine kinase) [NCBI Gene 673] {aka B-RAF1, B-raf, BRAF-1, BRAF1, NS7, RAFB1}, NKX2-1 (NK2 homeobox 1) [NCBI Gene 7080] {aka BCH, BHC, NK-2, NKX2.1, NKX2A, NMTC1}, TSHR (thyroid stimulating hormone receptor) [NCBI Gene 7253] {aka CHNG1, LGR3, hTSHR-I}, Chrdl1 (chordin-like 1) [NCBI Gene 83453] {aka CHL, CHL1, Nrln1, VOPT}, Igkc (immunoglobulin kappa constant) [NCBI Gene 16071] {aka Igk-C}, LOX (lysyl oxidase) [NCBI Gene 4015] {aka AAT10}, Tfpi (tissue factor pathway inhibitor) [NCBI Gene 21788] {aka A630013F22Rik, EPI, LACI}, CCDC6 (coiled-coil domain containing 6) [NCBI Gene 8030] {aka D10S170, H4, PTC, PTC1, TPC, TST1}, TENM1 (teneurin transmembrane protein 1) [NCBI Gene 10178] {aka ODZ1, ODZ3, TEN-M1, TEN1, TNM, TNM1}, Lyz1 (lysozyme 1) [NCBI Gene 17110] {aka Lyz, Lyzf3, Lzp-s}, Mgp (matrix Gla protein) [NCBI Gene 17313] {aka Mglap}, Col1a2 (collagen, type I, alpha 2) [NCBI Gene 12843] {aka Col1a-2, Cola-2, Cola2, oim}, BCR (BCR activator of RhoGEF and GTPase) [NCBI Gene 613] {aka ALL, BCR1, CML, D22S11, D22S662, PHL}, IDH1 (isocitrate dehydrogenase (NADP(+)) 1) [NCBI Gene 3417] {aka HEL-216, HEL-S-26, IDCD, IDH, IDP, IDPC}, GLIS3 (GLIS family zinc finger 3) [NCBI Gene 169792] {aka NDH, ZNF515}, Tagln (transgelin) [NCBI Gene 21345] {aka Sm22, Sm22a, Ws310}, NRAS (NRAS proto-oncogene, GTPase) [NCBI Gene 4893] {aka ALPS4, CMNS, N-ras, NCMS, NRAS1, NS6}, TPO (thyroid peroxidase) [NCBI Gene 7173] {aka MSA, TDH2A, TPX}, Mgst1 (microsomal glutathione S-transferase 1) [NCBI Gene 56615] {aka 1500002K10Rik, Gst}, Tyrobp (TYRO protein tyrosine kinase binding protein) [NCBI Gene 22177] {aka DAP12, KARAP, Ly83}, DUOX1 (dual oxidase 1) [NCBI Gene 53905] {aka LNOX1, NOXEF1, THOX1}, SLC26A4 (solute carrier family 26 member 4) [NCBI Gene 5172] {aka DFNB4, EVA, PDS, TDH2B}, Fn1 (fibronectin 1) [NCBI Gene 14268] {aka E330027I09, Fn, Fn-1}, Cldn5 (claudin 5) [NCBI Gene 12741] {aka MBEC1, Tmvcf}, SHISA5 (shisa family member 5) [NCBI Gene 51246] {aka SCOTIN}, SHROOM4 (shroom family member 4) [NCBI Gene 57477] {aka MRXSSDS, SHAP, shrm4}, Col1a1 (collagen, type I, alpha 1) [NCBI Gene 12842] {aka Col1a-1, Cola-1, Cola1, Mov-13, Mov13}, CD46 (CD46 molecule) [NCBI Gene 4179] {aka AHUS2, MCP, MIC10, TLX, TRA2.10}, CD22 (CD22 molecule) [NCBI Gene 933] {aka SIGLEC-2, SIGLEC2}, CSTB (cystatin B) [NCBI Gene 1476] {aka CPI-B, CST6, EPM1, EPM1A, PME, STFB}, FHL1 (four and a half LIM domains 1) [NCBI Gene 2273] {aka FCMSU, FHL-1, FHL1A, FHL1B, FLH1A, KYOT}, Ms4a1 (membrane-spanning 4-domains, subfamily A, member 1) [NCBI Gene 12482] {aka Cd20, Ly-44, Ms4a2}, Rgs5 (regulator of G-protein signaling 5) [NCBI Gene 19737] {aka 1110070A02Rik}, Igfbp7 (insulin-like growth factor binding protein 7) [NCBI Gene 29817] {aka AGM, Fstl2, Mac25}, PI16 (peptidase inhibitor 16) [NCBI Gene 221476] {aka CD364, CRISP9, MSMBBP, PSPBP}, SULF1 (sulfatase 1) [NCBI Gene 23213] {aka SULF-1}, ITGB1 (integrin subunit beta 1) [NCBI Gene 3688] {aka CD29, FNRB, GPIIA, MDF2, MSK12, VLA-BETA}, Cd3d (CD3 antigen, delta polypeptide) [NCBI Gene 12500] {aka T3d}, COL1A1 (collagen type I alpha 1 chain) [NCBI Gene 1277] {aka CAFYD, EDSARTH1, EDSC, OI1, OI2, OI3}, Cd79a (CD79A antigen (immunoglobulin-associated alpha)) [NCBI Gene 12518] {aka Ig-alpha, Iga, Igalpha, Ly-54, Ly54, mb-1}, Timp3 (tissue inhibitor of metalloproteinase 3) [NCBI Gene 21859] {aka Timp-3}, Ramp2 (receptor (calcitonin) activity modifying protein 2) [NCBI Gene 54409], Acta2 (actin alpha 2, smooth muscle, aorta) [NCBI Gene 11475] {aka 0610041G09Rik, Actvs, SMAalpha, SMalphaA, a-SMA, alphaSMA}, S100a13 (S100 calcium binding protein A13) [NCBI Gene 20196], FOXE1 (forkhead box E1) [NCBI Gene 2304] {aka BAMLAZ, FKHL15, FOXE2, HFKH4, HFKL5, NMTC4}, Clu (clusterin) [NCBI Gene 12759] {aka ApoJ, Cli, D14Ucla3, SP-40, Sgp-2, Sgp2}, Cd74 (CD74 antigen (invariant polypeptide of major histocompatibility complex, class II antigen-associated)) [NCBI Gene 16149] {aka CLIP, DHLAG, HLADG, Ia-GAMMA, Ii}, PAX8 (paired box 8) [NCBI Gene 7849] {aka PAX-8}, Pi16 (peptidase inhibitor 16) [NCBI Gene 74116] {aka 1200009H11Rik, Cripi, PI-16}, DIO2 (iodothyronine deiodinase 2) [NCBI Gene 1734] {aka 5DII, D2, DIOII, SELENOY, SelY, TXDI2}, Jun (Jun proto-oncogene, AP-1 transcription factor subunit) [NCBI Gene 16476] {aka AP-1, Junc, c-jun}, TFF3 (trefoil factor 3) [NCBI Gene 7033] {aka ITF, P1B, TFI}, MUC16 (mucin 16, cell surface associated) [NCBI Gene 94025] {aka CA125}, Trac (T cell receptor alpha constant) [NCBI Gene 100101484] {aka Gm16914, Tcra, Tcra-C}
- **Diseases:** TDS (MESH:D013966), papillary carcinoma (MESH:D002291), HT (MESH:D050031), lung, hepatocellular, gastric, renal cell, and colorectal cancers (MESH:D002292), autoimmune disease (MESH:D001327), LNM (MESH:D008207), thyroid dysfunction (MESH:D013959), PTC thyroiditis (MESH:D000077273), Cancer (MESH:D009369), Alzheimer's disease (MESH:D000544), ESCC (MESH:D000077277), N (MESH:C536108), metastasis (MESH:D009362), Inflammatory (MESH:D007249), THCA (MESH:D013964)
- **Chemicals:** ethanol (MESH:D000431), Formalin (MESH:D005557), water (MESH:D014867), paraformaldehyde (MESH:C003043), Picrosirius red (MESH:C009798), xylene (MESH:D014992), BL-RS-19 (-), hydrogen peroxide (MESH:D006861), cisplatin (MESH:D002945), paraffin (MESH:D010232), Acetaminophen (MESH:D000082), sodium citrate (MESH:D000077559)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** BRAFV600E

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12957244/full.md

## Figures

7 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12957244/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12957244/full.md

---
Source: https://tomesphere.com/paper/PMC12957244