# Uncovering potential molecular biomarkers for cancer-associated secondary lymphedema through integrated analyses of RNA-sequencing, machine learning, and clinical data

**Authors:** Hao Dong, Jianliang Miao, Zhong Liu, Yuguang Sun, Peilin Li, Song Xia, Wenbin Shen

PMC · DOI: 10.3389/fonc.2026.1760040 · Frontiers in Oncology · 2026-02-13

## TL;DR

This study identifies three genes as potential biomarkers for cancer-related lymphedema, offering new insights for diagnosis and treatment.

## Contribution

The study introduces novel molecular markers for cancer-associated secondary lymphedema using RNA sequencing and machine learning.

## Key findings

- IL2RG, HOXD10, and TSPAN1 are potential biomarkers for CASL with high diagnostic model performance.
- Immune pathways and T cell/macrophage infiltration are linked to CASL progression.
- Single-cell analysis shows elevated expression of the biomarker genes in T cell subsets.

## Abstract

Cancer-associated secondary lymphedema (CASL) commonly occurs after tumor-related lymph node dissection and radiotherapy. Nevertheless, the mechanisms of CASL remain unclear, and there are no specific molecular markers for its diagnosis and treatment.

In this study, RNA sequencing was performed on adipose tissues from 10 normal controls and 40 patients with CASL. Differentially expressed genes were screened using two machine learning algorithms to identify potential molecular markers for CASL. Subsequently, seven machine learning algorithms were employed to develop predictive models based on the identified markers. The contribution of each feature to the predictive outcomes was interpreted using Shapley additive explanation (SHAP). Immune cell infiltration was profiled through CIBERSORT and MCP-counter algorithms, and single-cell RNA sequencing (scRNA-seq) data were integrated to explore interactions between characteristic genes and immune cell subpopulations. Furthermore, associations between characteristic genes and clinical parameters were also assessed.

IL2RG, HOXD10, and TSPAN1 were identified as potential biomarkers of CASL. Diagnostic models built on these three genes showed excellent performance. Functional enrichment analysis suggested that the dysregulation of cytokine-cytokine receptor interactions and immune pathways underlies the pathological progression of CASL. In addition, immune infiltration analysis indicated that T cell and macrophage infiltration were intricately involved in CASL progression. Intriguingly, single-cell transcriptomic analysis further revealed elevated expression of IL2RG, HOXD10, and TSPAN1 in T cell subsets. Finally, RT-qPCR validated that these genes were expressed at higher levels in CASL tissues than in normal tissues. Moreover, IL2RG expression was strongly associated with clinical parameters.

This study identified IL2RG, HOXD10, and TSPAN1 as novel potential molecular markers for CASL, providing valuable biological evidence for the diagnosis and intervention of CASL.

## Linked entities

- **Genes:** IL2RG (interleukin 2 receptor subunit gamma) [NCBI Gene 3561], HOXD10 (homeobox D10) [NCBI Gene 3236], TSPAN1 (tetraspanin 1) [NCBI Gene 10103]

## Full-text entities

- **Genes:** TP53 (tumor protein p53) [NCBI Gene 7157] {aka BCC7, BMFS5, LFS1, P53, TRP53}, CDH5 (cadherin 5) [NCBI Gene 1003] {aka 7B4, CD144}, TEK (TEK receptor tyrosine kinase) [NCBI Gene 7010] {aka CD202B, GLC3E, TIE-2, TIE2, VMCM, VMCM1}, Il2 (interleukin 2) [NCBI Gene 16183] {aka Il-2}, Cd4 (CD4 antigen) [NCBI Gene 12504] {aka L3T4, Ly-4}, CD3D (CD3 delta subunit of T-cell receptor complex) [NCBI Gene 915] {aka CD3-DELTA, CD3DELTA, IMD19, T3D}, Il7 (interleukin 7) [NCBI Gene 16196] {aka A630026I06Rik, Il-7, hlb368}, NOS3 (nitric oxide synthase 3) [NCBI Gene 4846] {aka EC-NOS, ECNOS, MYMY8, NOSIII, cNOS, eNOS}, KLRB1 (killer cell lectin like receptor B1) [NCBI Gene 3820] {aka CD161, CLEC5B, NKR, NKR-P1, NKR-P1A, NKRP1A}, IL2RG (interleukin 2 receptor subunit gamma) [NCBI Gene 3561] {aka CD132, CIDX, IL-2RG, IMD4, P64, SCIDX}, CLDN5 (claudin 5) [NCBI Gene 7122] {aka AWAL, BEC1, CPETRL1, TMDVCF, TMVCF}, PDGFRA (platelet derived growth factor receptor alpha) [NCBI Gene 5156] {aka CD140A, PDGFR-2, PDGFR2}, Il9 (interleukin 9) [NCBI Gene 16198] {aka Il-9, P40}, Il21 (interleukin 21) [NCBI Gene 60505] {aka IL-21}, CD3G (CD3 gamma subunit of T-cell receptor complex) [NCBI Gene 917] {aka CD3-GAMMA, CD3GAMMA, IMD17, T3G}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, NFKB1 (nuclear factor kappa B subunit 1) [NCBI Gene 4790] {aka CVID12, EBP-1, KBF1, NF-kB, NF-kB1, NF-kappa-B1}, FLT4 (fms related receptor tyrosine kinase 4) [NCBI Gene 2324] {aka CHTD7, FLT-4, FLT41, LMPH1A, LMPHM1, PCL}, DLL4 (delta like canonical Notch ligand 4) [NCBI Gene 54567] {aka AOS6, delta4, hdelta2}, CD68 (CD68 molecule) [NCBI Gene 968] {aka GP110, LAMP4, SCARD1}, ITGAM (integrin subunit alpha M) [NCBI Gene 3684] {aka CD11B, CR3A, HNA-4, MAC-1, MAC1A, MO1A}, KIT (KIT proto-oncogene, receptor tyrosine kinase) [NCBI Gene 3815] {aka C-Kit, CD117, MASTC, PBT, SCFR}, CD8A (CD8 subunit alpha) [NCBI Gene 925] {aka CD8, CD8alpha, IMD116, Leu2, p32}, LY75 (lymphocyte antigen 75) [NCBI Gene 4065] {aka CD205, CLEC13B, DEC-205, GP200-MR6, LY-75}, PECAM1 (platelet and endothelial cell adhesion molecule 1) [NCBI Gene 5175] {aka CD31, CD31/EndoCAM, GPIIA', PECA1, PECAM-1, endoCAM}, Il13 (interleukin 13) [NCBI Gene 16163] {aka Il-13}, C1QA (complement C1q A chain) [NCBI Gene 712] {aka C1QD1}, CLEC4C (C-type lectin domain family 4 member C) [NCBI Gene 170482] {aka BDCA-2, BDCA2, CD303, CLECSF11, CLECSF7, DLEC}, Il10 (interleukin 10) [NCBI Gene 16153] {aka CSIF, If2a, Il-10}, DCN (decorin) [NCBI Gene 1634] {aka CSCD, DSPG2, PG40, PGII, PGS2, SLRR1B}, TPSB2 (tryptase beta 2) [NCBI Gene 64499] {aka TPS2, tryptaseB, tryptaseC}, CTNNB1 (catenin beta 1) [NCBI Gene 1499] {aka CTNNB, EVR7, MRD19, NEDSDV, armadillo}, HOXD10 (homeobox D10) [NCBI Gene 3236] {aka HOX4, HOX4D, HOX4E, Hox-4.4}, Il4 (interleukin 4) [NCBI Gene 16189] {aka BSF-1, Il-4}, KLRD1 (killer cell lectin like receptor D1) [NCBI Gene 3824] {aka CD94}, CD79A (CD79a molecule) [NCBI Gene 973] {aka IGA, IGAlpha, MB-1, MB1}, Tgfb1 (transforming growth factor, beta 1) [NCBI Gene 21803] {aka TGF-beta1, TGFbeta1, Tgfb, Tgfb-1}, Il15 (interleukin 15) [NCBI Gene 16168] {aka IL-15}, NEDD9 (neural precursor cell expressed, developmentally down-regulated 9) [NCBI Gene 4739] {aka CAS-L, CAS2, CASL, CASS2, HEF1}, IGHG2 (immunoglobulin heavy constant gamma 2 (G2m marker)) [NCBI Gene 3501], TSPAN1 (tetraspanin 1) [NCBI Gene 10103] {aka NET1, TM4C, TM4SF}, Nedd9 (neural precursor cell expressed, developmentally down-regulated gene 9) [NCBI Gene 18003] {aka Cas-L, CasL, HEF1, MEF1, Nedd-9, p105}, ITIH2 (inter-alpha-trypsin inhibitor heavy chain 2) [NCBI Gene 3698] {aka H2P, ITI-HC2, SHAP}, VEGFC (vascular endothelial growth factor C) [NCBI Gene 7424] {aka Flt4-L, LMPH1D, LMPHM4, VRP}, Il2rg (interleukin 2 receptor, gamma chain) [NCBI Gene 16186] {aka CD132, [g]c, gamma(c), gc, p64}
- **Diseases:** lymphatic system dysfunction (MESH:D006425), tissue injury (MESH:D017695), staphylococcus aureus infection (MESH:D013203), breast cancer (MESH:D001943), kidney disease (MESH:D007674), endometrial, colorectal, or ovarian cancer (MESH:D018203), ISL (MESH:C000719191), lymphatic disorders (MESH:D008206), heart disease (MESH:D006331), ovarian cancer (MESH:D010051), pulmonary fibrosis (MESH:D011658), hypertrophic cardiomyopathy (MESH:D002312), Lymphedema (MESH:D008209), CASL (MESH:D009369), infection (MESH:D007239), venous insufficiency (MESH:D014689), digestive tract disease (MESH:D004066), swelling (MESH:D004487), congenital defects (MESH:D000013), fibrosis (MESH:D005355), inflammation (MESH:D007249), endometrial cancer (MESH:D016889), cervical cancer (MESH:D002583), pain (MESH:D010146)
- **Chemicals:** poly-T (MESH:D011071), water (MESH:D014867), TRIzol (MESH:C411644), nitrogen (MESH:D009584), PBS (-)
- **Species:** Mus musculus (house mouse, species) [taxon 10090], Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12945810/full.md

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12945810/full.md

## References

40 references — full list in the complete paper: https://tomesphere.com/paper/PMC12945810/full.md

---
Source: https://tomesphere.com/paper/PMC12945810