TL;DR
This paper introduces a deep inverse reinforcement learning framework for generating chemical compounds, offering a promising alternative to traditional reward-based methods in drug discovery, especially when designing explicit reward functions is challenging.
Contribution
The study presents a novel inverse reinforcement learning approach for compound generation, reducing reliance on reward function engineering in chemical domain applications.
Findings
Inverse reinforcement learning effectively generates chemical compounds.
The method performs well without explicit reward function design.
It offers a transferable reward function for chemical synthesis tasks.
Abstract
The size and quality of chemical libraries to the drug discovery pipeline are crucial for developing new drugs or repurposing existing drugs. Existing techniques such as combinatorial organic synthesis and High-Throughput Screening usually make the process extraordinarily tough and complicated since the search space of synthetically feasible drugs is exorbitantly huge. While reinforcement learning has been mostly exploited in the literature for generating novel compounds, the requirement of designing a reward function that succinctly represents the learning objective could prove daunting in certain complex domains. Generative Adversarial Network-based methods also mostly discard the discriminator after training and could be hard to train. In this study, we propose a framework for training a compound generator and learning a transferable reward function based on the entropy maximization…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
