# Exploring the toxicological mechanisms of Benzo[a]anthracene (BaA) exposure in lung adenocarcinoma (LUAD) via network toxicology, machine learning, and multi-dimensional bioinformatics analysis

**Authors:** Zhiyao Shi, Zhiyong Fang, Qiang Qin, Yu Gao, Xi Yang, Likun Liu, Xixing Wang, Jiafu Li, Jiafu Li, Jiafu Li, Jiafu Li

PMC · DOI: 10.1371/journal.pone.0340116 · PLOS One · 2026-02-11

## TL;DR

This study explores how the carcinogen Benzo[a]anthracene affects lung cancer by using machine learning and bioinformatics to identify key genes and immune responses.

## Contribution

The study introduces a novel integrated framework combining network toxicology and machine learning to identify BaA's mechanisms in LUAD and validate core targets.

## Key findings

- 248 intersection targets were identified, enriched in chemokine and ErbB signaling pathways.
- Five core targets (TNNC1, ABCC3, CRABP2, CXCL12, OLR1) were validated with immune infiltration and expression patterns in LUAD.
- Molecular docking confirmed stable BaA-core target interactions, with strongest binding to CRABP2.

## Abstract

Lung adenocarcinoma (LUAD) is a major lung cancer subtype influenced by environmental factors. Benzo[a]anthracene (BaA), a common Group 2B carcinogen found in pollutants, smoke, and food, shows genotoxic and oncogenic activity; however, its specific mechanisms in LUAD pathogenesis remain unclear and warrant systematic investigation.

This study aims to elucidate the mechanisms of BaA-induced LUAD, identify core targets, validate their expression, immunorelevance and clinical significance, and construct a hypothesis framework for AOP in BaA-exposed LUAD.

We integrated network toxicology, multi-machine learning algorithms (LASSO, SVM-RFE, and Random Forest) and multidimensional bioinformatics analysis. Potential BaA-LUAD intersection targets were collected from public databases and subjected to functional enrichment analysis. Core targets were screened and validated using GEO and TCGA-LUAD (via UALCAN) datasets for differential expression, immune infiltration and prognostic value. Molecular docking and 100 ns molecular dynamics (MD) simulations were applied to evaluate the binding stability between BaA and core targets.

A total of 248 intersection targets were identified, with significant enrichment in chemokine signaling, ErbB signaling, and viral protein–cytokine receptor interaction pathways. Machine learning prioritized five core targets: TNNC1, ABCC3, CRABP2, CXCL12, and OLR1. These genes were consistently dysregulated in LUAD samples across cohorts (p < 0.05) and correlated distinctly with immune cell infiltration: TNNC1 was associated with anti-tumor immunity, while the others linked to immunosuppressive cells. Prognostic analysis showed trends of ABCC3/CRABP2 high-expression and TNNC1/CXCL12/OLR1 low-expression correlating with patient outcomes (p > 0.05). Molecular docking confirmed stable binding between BaA and all core targets, with the strongest affinity for CRABP2 (–8.4 kcal/mol). MD simulations further supported complex stability.

BaA promotes LUAD progression via multi-target regulation and tumor immune microenvironment remodeling. This study offers an integrated computational framework and an AOP-based theoretical foundation for assessing pollutant health risks and informing targeted LUAD interventions.

## Linked entities

- **Genes:** TNNC1 (troponin C1, slow skeletal and cardiac type) [NCBI Gene 7134], ABCC3 (ATP binding cassette subfamily C member 3) [NCBI Gene 8714], CRABP2 (cellular retinoic acid binding protein 2) [NCBI Gene 1382], CXCL12 (C-X-C motif chemokine ligand 12) [NCBI Gene 6387], OLR1 (oxidized low density lipoprotein receptor 1) [NCBI Gene 4973]
- **Chemicals:** Benzo[a]anthracene (PubChem CID 5954)
- **Diseases:** Lung adenocarcinoma (MONDO:0005061)

## Full-text entities

- **Genes:** ABCC3 (ATP binding cassette subfamily C member 3) [NCBI Gene 8714] {aka ABC31, EST90757, MLP2, MOAT-D, MRP3, cMOAT2}, TNNC1 (troponin C1, slow skeletal and cardiac type) [NCBI Gene 7134] {aka CMD1Z, CMH13, TN-C, TNC, TNNC}, CRABP2 (cellular retinoic acid binding protein 2) [NCBI Gene 1382] {aka CRABP-II, RBP6}, OLR1 (oxidized low density lipoprotein receptor 1) [NCBI Gene 4973] {aka CLEC8A, LOX1, LOXIN, SCARE1, SLOX1}, CXCL12 (C-X-C motif chemokine ligand 12) [NCBI Gene 6387] {aka IRH, PBSF, SCYB12, SDF1, TLSF, TPAR1}
- **Diseases:** lung cancer (MESH:D008175), tumor (MESH:D009369), LUAD (MESH:D000077192)
- **Chemicals:** BaA (MESH:C030935)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12893569/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12893569/full.md

## References

54 references — full list in the complete paper: https://tomesphere.com/paper/PMC12893569/full.md

---
Source: https://tomesphere.com/paper/PMC12893569