Drug Discovery with Dynamic Goal-aware Fragments
Seul Lee, Seanie Lee, Kenji Kawaguchi, Sung Ju Hwang

TL;DR
The paper introduces GEAM, a novel molecular generative framework that dynamically extracts, assembles, and modifies goal-aware fragments for drug discovery, improving the discovery process by updating fragment vocabularies based on target properties.
Contribution
GEAM is the first framework to incorporate dynamic, goal-aware fragment extraction, assembly, and modification, enabling more effective and adaptable drug candidate generation.
Findings
GEAM outperforms existing models in discovering drug candidates.
The goal-aware fragment extraction improves property relevance.
Dynamic vocabulary updates enhance exploration capabilities.
Abstract
Fragment-based drug discovery is an effective strategy for discovering drug candidates in the vast chemical space, and has been widely employed in molecular generative models. However, many existing fragment extraction methods in such models do not take the target chemical properties into account or rely on heuristic rules. Additionally, the existing fragment-based generative models cannot update the fragment vocabulary with goal-aware fragments newly discovered during the generation. To this end, we propose a molecular generative framework for drug discovery, named Goal-aware fragment Extraction, Assembly, and Modification (GEAM). GEAM consists of three modules, each responsible for goal-aware fragment extraction, fragment assembly, and fragment modification. The fragment extraction module identifies important fragments contributing to the desired target properties with the information…
Peer Reviews
Decision·ICML 2024 Poster
- The research content of the article is drug discovery, which is an important and cutting-edge field. - The method proposed in the article is simple and effective. The theoretical analysis and proof are clear. - The article provides detailed experimental results.
In the experimental part, the method proposed in the article did not achieve the best results on some data or some indicators.
- The proposed Fragment extraction module stands out as a robust strategy for constructing the fragment vocabulary. It ensures that the generative model prioritizes the most relevant fragments for the target property. Additionally, leveraging molecules generated by the modification module to expand the fragment vocabulary further bolsters this strategy. - The paper is well written and gives a good intuition of each module of GEAM, justifying their choice and their roles within the generative pro
- While the paper introduces GEAM as an innovative framework, where the importance of each module is well understood, it could benefit from a more explicit delineation of the unique contributions of each module, mainly for the assembly and modification modules. - The paper could enhance its clarity by describing the specific advantages of FGIB over existing substructure identification architectures based on GIB theory. - Lack of clear intuition for the loss proposed in Equation 5, focusing on ho
- The proposed method is good motivated. - The experiments are comprehensive. - Overall, the paper is easy to follow.
The main concern is about the novelty. The three techniques, i.e., information bottleneck, soft-actor critic (SAC), and genetic algorithm (GA), are well known methods. Seems the authors just make a pipeline to combine all existing methods together.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Machine Learning in Materials Science · Scientific Computing and Data Management
