TL;DR
MolMem introduces a memory-augmented reinforcement learning framework for molecular optimization, significantly improving sample efficiency and success rates in drug discovery tasks with limited oracle evaluations.
Contribution
It proposes a dual-memory system within an RL framework to enhance decision grounding and reuse strategies, advancing molecular optimization methods.
Findings
Achieves 90% success on single-property tasks, 1.5x better than baseline.
Attains 52% success on multi-property tasks with only 500 oracle calls.
Outperforms existing methods in sample efficiency and optimization success.
Abstract
In drug discovery, molecular optimization aims to iteratively refine a lead compound to improve molecular properties while preserving structural similarity to the original molecule. However, each oracle evaluation is expensive, making sample efficiency a key challenge for existing methods under a limited oracle budget. Trial-and-error approaches require many oracle calls, while methods that leverage external knowledge tend to reuse familiar templates and struggle on challenging objectives. A key missing piece is long-term memory that can ground decisions and provide reusable insights for future optimizations. To address this, we present MolMem (\textbf{Mol}ecular optimization with \textbf{Mem}ory), a multi-turn agentic reinforcement learning (RL) framework with a dual-memory system. Specifically, MolMem uses Static Exemplar Memory to retrieve relevant exemplars for cold-start grounding,…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
