Mitigating Molecular Aggregation in Drug Discovery with Predictive Insights from Explainable AI
Hunter Sturm, Jonas Teufel, Kaitlin A. Isfeld, Pascal Friederich, Rebecca L. Davis

TL;DR
This paper introduces MEGAN, an explainable AI model that identifies and mitigates colloidal aggregation in molecules, reducing false positives in drug screening and accelerating drug discovery.
Contribution
The work presents a novel xAI approach, MEGAN, for predicting and modifying molecular aggregation properties, improving high-throughput screening accuracy.
Findings
MEGAN accurately classifies aggregating molecules with high precision.
Counterfactuals guide structural modifications to reduce aggregation.
Experimental validation confirms the effectiveness of counterfactual-based design.
Abstract
Herein, we present the application of MEGAN, our explainable AI (xAI) model, for the identification of small colloidally aggregating molecules (SCAMs). This work offers solutions to the long-standing problem of false positives caused by SCAMs in high throughput screening for drug discovery and demonstrates the power of xAI in the classification of molecular properties that are not chemically intuitive based on our current understanding. We leverage xAI insights and molecular counterfactuals to design alternatives to problematic compounds in drug screening libraries. Additionally, we experimentally validate the MEGAN prediction classification for one of the counterfactuals and demonstrate the utility of counterfactuals for altering the aggregation properties of a compound through minor structural modifications. The integration of this method in high-throughput screening approaches will…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Cell Image Analysis Techniques
MethodsCounterfactuals Explanations
