# An integrated study combining network toxicology machine learning and molecular simulation reveals the molecular mechanisms of permanent hair dyes in breast cancer

**Authors:** Xiaolu Yang, Yilun Li, Tianqi Zhang, Binglu He, Jingyan Wang, Shiyu Zhang, Li Ma

PMC · DOI: 10.1007/s12672-026-04585-1 · Discover Oncology · 2026-02-08

## TL;DR

This study combines machine learning and molecular simulations to uncover how permanent hair dye ingredients may contribute to breast cancer development.

## Contribution

The study introduces an integrated approach using network toxicology and machine learning to identify key molecular targets linking hair dyes to breast cancer.

## Key findings

- Eight key targets (e.g., HSP90AA1, CDK1) were identified as regulators of breast cancer progression linked to hair dye ingredients.
- Disperse Yellow 3 showed the strongest binding affinity to key targets, indicating a strong association with breast cancer risk.
- Machine learning confirmed the prognostic importance of SRC, HSP90AB1, HSP90AA1, and CDK1 in breast cancer.

## Abstract

Permanent hair dyes have been linked to an increased risk of breast cancer (BC), though the underlying mechanisms remain unclear. To address this knowledge gap, our investigation employed an integrated approach combining network toxicology, molecular docking, molecular dynamics simulations, and machine learning to decipher the molecular mechanisms by which permanent hair dyes might promote BC pathogenesis. Five permanent hair dye ingredients classified by IARC as carcinogenic were included in this study: p-phenylenediamine, resorcinol, pyridine, Disperse Yellow 3, and HC Blue No. 2. These chemicals can regulate BC progression through various signaling pathways, with key core targets identified as HSP90AA1, HSP90AB1, ESR1, CDK1, STAT3, MAPK8, HDAC1, and SRC. A machine learning model comprising 128 algorithms confirmed that these eight targets possess strong prognostic predictive capabilities for BC. Subsequent SHAP analysis revealed SRC, HSP90AB1, HSP90AA1 and CDK1 as the key contributors to prognostic prediction, with each being highly expressed in BC and linked to poor clinical prognosis. Notably, among all chemicals screened, Disperse Yellow 3 exhibited the strongest binding affinity to these four key targets, demonstrating the strongest association with BC risk.

The online version contains supplementary material available at 10.1007/s12672-026-04585-1.

## Linked entities

- **Genes:** HSP90AA1 (heat shock protein 90 alpha family class A member 1) [NCBI Gene 3320], HSP90AB1 (heat shock protein 90 alpha family class B member 1) [NCBI Gene 3326], ESR1 (estrogen receptor 1) [NCBI Gene 2099], CDK1 (cyclin dependent kinase 1) [NCBI Gene 983], STAT3 (signal transducer and activator of transcription 3) [NCBI Gene 6774], MAPK8 (mitogen-activated protein kinase 8) [NCBI Gene 5599], HDAC1 (histone deacetylase 1) [NCBI Gene 3065], SRC (SRC proto-oncogene, non-receptor tyrosine kinase) [NCBI Gene 6714]
- **Chemicals:** p-phenylenediamine (PubChem CID 7814), resorcinol (PubChem CID 5054), pyridine (PubChem CID 1049), Disperse Yellow 3 (PubChem CID 17811), HC Blue No. 2 (PubChem CID 36383)
- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** STAT3 (signal transducer and activator of transcription 3) [NCBI Gene 6774] {aka ADMIO, ADMIO1, APRF, HIES}, SRC (SRC proto-oncogene, non-receptor tyrosine kinase) [NCBI Gene 6714] {aka ASV, SRC1, THC6, c-SRC, p60-Src}, HDAC1 (histone deacetylase 1) [NCBI Gene 3065] {aka GON-10, HD1, KDAC1, RPD3, RPD3L1}, MAPK8 (mitogen-activated protein kinase 8) [NCBI Gene 5599] {aka JNK, JNK-46, JNK1, JNK1A2, JNK21B1/2, PRKM8}, HSP90AB1 (heat shock protein 90 alpha family class B member 1) [NCBI Gene 3326] {aka D6S182, HSP84, HSP90B, HSPC2, HSPCB}, HSP90AA1 (heat shock protein 90 alpha family class A member 1) [NCBI Gene 3320] {aka EL52, HEL-S-65p, HSP86, HSP89A, HSP90A, HSP90N}, ESR1 (estrogen receptor 1) [NCBI Gene 2099] {aka ER, ESR, ESRA, ESTRR, Era, NR3A1}, CDK1 (cyclin dependent kinase 1) [NCBI Gene 983] {aka CDC2, CDC28A, P34CDC2}
- **Diseases:** carcinogenic (MESH:D011230), BC (MESH:D001943)
- **Chemicals:** HC Blue No (-), pyridine (MESH:C023666), Disperse Yellow 3 (MESH:C024957), p-phenylenediamine (MESH:C029728), resorcinol (MESH:C031389)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12979728/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12979728/full.md

---
Source: https://tomesphere.com/paper/PMC12979728