# Unraveling the PFOS-NSCLC axis: integrated network toxicology, machine learning, and causal inference identify EIF4EBP1 as a key molecular hub

**Authors:** Ting Huang, Huaxin Pang, Jundan Wang, Junhua Guo, Keke Hu, Heran Zhou

PMC · DOI: 10.3389/fpubh.2026.1751125 · Frontiers in Public Health · 2026-02-12

## TL;DR

This study identifies EIF4EBP1 as a key gene linking PFOS exposure to lung cancer, using machine learning and causal analysis to explore how this pollutant may contribute to cancer development.

## Contribution

The study introduces a novel integration of network toxicology, machine learning, and causal inference to identify EIF4EBP1 as a central hub in PFOS-related non-small cell lung cancer.

## Key findings

- EIF4EBP1 is significantly upregulated in both discovery and validation cohorts, with subtype-specific prognostic value in lung cancer.
- Mendelian randomization suggests a causal link between EIF4EBP1 expression and increased risk of lung adenocarcinoma.
- Molecular docking confirms a stable interaction between PFOS and the EIF4EBP1 binding pocket.

## Abstract

Perfluorooctanesulfonic acid (PFOS) is a persistent environmental pollutant with suspected carcinogenic potential; however, the molecular mechanisms driving PFOS-associated non-small cell lung cancer (NSCLC) remain obscure. In particular, the interplay between chemical exposure, oncogenic signaling nodes, and tumor microenvironment (TME) remodeling is poorly defined. This study integrates systems toxicology with multi-omics to elucidate the role of EIF4EBP1 as a mechanistic bridge connecting PFOS exposure to NSCLC pathogenesis.

We synthesized chemical–protein interactions from toxicological databases (ChEMBL, STITCH, and SwissTargetPrediction) and disease-associated genes to map the PFOS–NSCLC intersection. Robust feature selection, utilizing LASSO and SVM-RFE algorithms, was applied to transcriptomic data from the GSE33532 discovery cohort to identify core targets. Key findings were substantiated through external validation in The Cancer Genome Atlas (TCGA) dataset, including differential expression and survival analyses. Causal associations were investigated via two-sample Mendelian randomization (MR), and the immune landscape was characterized using the CIBERSORT algorithm. Molecular docking simulations and an adverse outcome pathway (AOP) framework were further employed to assess mechanistic plausibility.

Network analysis identified 41 shared targets significantly enriched in PPAR signaling and xenobiotic metabolism. Machine learning consensus prioritized EIF4EBP1 as a critical hub gene. EIF4EBP1 was significantly upregulated in both the discovery (AUC = 0.936) and TCGA validation cohorts. Clinical analysis revealed subtype-specific prognostic value, where high EIF4EBP1 expression correlated with poor survival in lung adenocarcinoma (LUAD) but favorable outcomes in squamous cell carcinoma (LUSC). Immunologically, EIF4EBP1 expression tracked with an adaptive immune-skewed profile, characterized by increased plasma cell and activated CD4 + memory T cell infiltration. MR analysis indicated a potential causal effect of genetically predicted EIF4EBP1 expression on increased LUAD risk (OR = 4.196, 95% CI: 1.209–14.565), but not squamous cell carcinoma. Structural docking confirmed a stable, non-covalent interaction between PFOS and the EIF4EBP1 binding pocket (−7.2 kcal/mol).

This study identifies EIF4EBP1 as a putative molecular initiating node linking PFOS exposure to LUAD susceptibility and immune modulation. The constructed AOP framework suggests a mechanism wherein PFOS-mediated translational dysregulation contributes to subtype-specific carcinogenesis. These findings provide a data-driven rationale for risk assessment and warrant further experimental verification in toxicological models.

## Linked entities

- **Genes:** EIF4EBP1 (eukaryotic translation initiation factor 4E binding protein 1) [NCBI Gene 1978]
- **Chemicals:** PFOS (PubChem CID 74483)
- **Diseases:** non-small cell lung cancer (MONDO:0005233), lung adenocarcinoma (MONDO:0005061), squamous cell carcinoma (MONDO:0005096)

## Full-text entities

- **Genes:** AKT1 (AKT serine/threonine kinase 1) [NCBI Gene 207] {aka AKT, PKB, PKB-ALPHA, PRKBA, RAC, RAC-ALPHA}, PPARA (peroxisome proliferator activated receptor alpha) [NCBI Gene 5465] {aka NR1C1, PPAR, PPAR-alpha, PPARalpha, hPPAR}, CD4 (CD4 molecule) [NCBI Gene 920] {aka CD4mut, IMD79, Leu-3, OKT4D, T4}, PIK3CB (phosphatidylinositol-4,5-bisphosphate 3-kinase catalytic subunit beta) [NCBI Gene 5291] {aka P110BETA, PI3K, PI3KBETA, PIK3C1}, PFAS (phosphoribosylformylglycinamidine synthase) [NCBI Gene 5198] {aka FGAMS, FGAR-AT, FGARAT, GATD8, PURL}, EIF4EBP1 (eukaryotic translation initiation factor 4E binding protein 1) [NCBI Gene 1978] {aka 4E-BP1, 4EBP1, BP-1, PHAS-I}
- **Diseases:** inflammatory (MESH:D007249), carcinogenic (MESH:D011230), tumorigenic (MESH:D002471), AOP (MESH:D011248), lung cancer (MESH:D008175), endocrine (MESH:D004700), Cancer (MESH:D009369), adenocarcinoma (MESH:D000230), lung toxicity (MESH:D008171), toxicity (MESH:D064420), PFOS (MESH:D011015), smoking (MESH:D015208), LUAD (MESH:D000077192), carcinogenesis (MESH:D063646), Mendelian Inheritance in Man (MESH:D030342), NSCLC (MESH:D002289), LUSC (MESH:D002294)
- **Chemicals:** Perfluoroalkyl and polyfluoroalkyl substances (-), fatty acids (MESH:D005227), water (MESH:D014867), lipid (MESH:D008055), hydrogen (MESH:D006859), PFOS (MESH:C076994)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** rs28565141, AUC of 0

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12935900/full.md

## Figures

11 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12935900/full.md

## References

34 references — full list in the complete paper: https://tomesphere.com/paper/PMC12935900/full.md

---
Source: https://tomesphere.com/paper/PMC12935900