Selective Cleaning Enhances Machine Learning Accuracy for Drug Repurposing: Multiscale Discovery of MDM2 Inhibitors
Mohammad Firdaus Akmal, Ming Wah Wong

TL;DR
This study improves drug repurposing for cancer by using a selective cleaning algorithm to boost machine learning accuracy in identifying MDM2 inhibitors.
Contribution
A selective cleaning algorithm is introduced to enhance machine learning accuracy in drug repurposing for MDM2 inhibition.
Findings
Selective cleaning reduced RMSE by 21.6% and achieved R2 = 0.87 in predicting pIC50 values.
Three clinically tested compounds were identified as promising MDM2 inhibitors with high predicted potency and binding affinity.
Quantum mechanical and molecular dynamics simulations confirmed stable interactions of selected compounds with MDM2.
Abstract
Cancer remains one of the most formidable challenges to human health; hence, developing effective treatments is critical for saving lives. An important strategy involves reactivating tumor suppressor genes, particularly p53, by targeting their negative regulator MDM2, which is essential in promoting cell cycle arrest and apoptosis. Leveraging a drug repurposing approach, we screened over 24,000 clinically tested molecules to identify new MDM2 inhibitors. A key innovation of this work is the development and application of a selective cleaning algorithm that systematically filters assay data to mitigate noise and inconsistencies inherent in large-scale bioactivity datasets. This approach significantly improved the predictive accuracy of our machine learning model for pIC50 values, reducing RMSE by 21.6% and achieving state-of-the-art performance (R2 = 0.87)—a substantial improvement over…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Protein Structure and Dynamics
