CASTELO: Clustered Atom Subtypes aidEd Lead Optimization -- a combined machine learning and molecular modeling method
Leili Zhang, Giacomo Domeniconi, Chih-Chieh Yang, Seung-gu Kang,, Ruhong Zhou, Guojing Cong

TL;DR
CASTELO combines machine learning and molecular modeling to automate drug lead optimization, reducing time from months or years to days by analyzing molecular dynamics data and clustering substructures.
Contribution
It introduces a novel in silico workflow integrating physics-based simulations, temporal data enhancement, and clustering methods for efficient lead optimization without extensive databases.
Findings
Effective clustering of submolecular structures
Potential hotspots for drug modification identified
Workflow reduces lead optimization time significantly
Abstract
Drug discovery is a multi-stage process that comprises two costly major steps: pre-clinical research and clinical trials. Among its stages, lead optimization easily consumes more than half of the pre-clinical budget. We propose a combined machine learning and molecular modeling approach that automates lead optimization workflow \textit{in silico}. The initial data collection is achieved with physics-based molecular dynamics (MD) simulation. Contact matrices are calculated as the preliminary features extracted from the simulations. To take advantage of the temporal information from the simulations, we enhanced contact matrices data with temporal dynamism representation, which are then modeled with unsupervised convolutional variational autoencoder (CVAE). Finally, conventional clustering method and CVAE-based clustering method are compared with metrics to rank the submolecular structures…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Computational Drug Discovery Methods · Innovative Microfluidic and Catalytic Techniques Innovation
MethodsSolana Customer Service Number +1-833-534-1729
