# Enhancing ERα-targeted compound efficacy in breast cancer threapy with ExplainableAI and GeneticAlgorithm

**Authors:** Zeonlung Pun, Qiaoyun Xue, Yichi Zhang, Manikkam Rajalakshmi, Manikkam Rajalakshmi, Manikkam Rajalakshmi

PMC · DOI: 10.1371/journal.pone.0319673 · 2025-05-20

## TL;DR

This paper introduces a new method combining AI and genetic algorithms to improve cancer drug effectiveness by optimizing compound properties.

## Contribution

A novel integration of explainable AI and genetic algorithms to enhance ERα-targeted compound efficacy and ADMET properties.

## Key findings

- Selected 50 critical molecular descriptors from 729, improving bioactivity prediction robustness.
- Optimized compounds achieved pIC50 values up to 10.05, surpassing previous peaks in the dataset.

## Abstract

Breast cancer remains a major cause of mortality among women globally, driving the need for advanced therapeutic solutions. This study presents a novel, comprehensive methodology integrating explainable artificial intelligence (AI), machine learning models, and genetic algorithms to enhance the bioactivity and ADMET (Absorption, Distribution, Metabolism, Excretion, and Toxicity) properties of compounds targeting estrogen receptor alpha (ERα). By employing SHAP (SHapley Additive exPlanations) and LassoNet, we identified and refined 50 critical molecular descriptors from an initial set of 729, significantly influencing the prediction of bioactivity. The selected descriptors were systematically validated, bolstering the predictive robustness of our models, which demonstrated a mean coefficient of determination of 77% for bioactivity and high accuracy scores of 90.2%, 93.7%, 89.5%, 87.3%, and 95.8% for absorption, distribution, metabolism, excretion, and toxicity, respectively. Further optimization through genetic algorithms identified candidate compounds with superior bioactivity, achieving pIC50 values as high as 10.05, surpassing the previously observed peak values in the dataset. These results underscore the potential of leveraging advanced machine learning and optimization techniques to accelerate the discovery of effective cancer therapies.

## Linked entities

- **Proteins:** ESR1 (estrogen receptor 1)
- **Diseases:** breast cancer (MONDO:0004989)

## Full-text entities

- **Genes:** ESR1 (estrogen receptor 1) [NCBI Gene 2099] {aka ER, ESR, ESRA, ESTRR, Era, NR3A1}, EREG (epiregulin) [NCBI Gene 2069] {aka EPR, ER, Ep}
- **Diseases:** Breast cancer (MESH:D001943), cancer (MESH:D009369), Toxicity (MESH:D064420)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

50 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12091784/full.md

---
Source: https://tomesphere.com/paper/PMC12091784