Saturn: Sample-efficient Generative Molecular Design using Memory Manipulation
Jeff Guo, Philippe Schwaller

TL;DR
Saturn is a novel generative molecular design method that uses memory manipulation and experience replay to improve sample efficiency, enabling potential direct optimization of high-fidelity oracles in drug discovery.
Contribution
Introducing Saturn, which applies the Mamba architecture and augmented memory techniques to enhance sample efficiency in molecular generation tasks.
Findings
Saturn outperforms 22 models on multi-parameter optimization tasks.
Memory augmentation improves sample efficiency in molecular design.
Potential to directly optimize high-fidelity oracles in drug discovery.
Abstract
Generative molecular design for drug discovery has very recently achieved a wave of experimental validation, with language-based backbones being the most common architectures employed. The most important factor for downstream success is whether an in silico oracle is well correlated with the desired end-point. To this end, current methods use cheaper proxy oracles with higher throughput before evaluating the most promising subset with high-fidelity oracles. The ability to directly optimize high-fidelity oracles would greatly enhance generative design and be expected to improve hit rates. However, current models are not efficient enough to consider such a prospect, exemplifying the sample efficiency problem. In this work, we introduce Saturn, which leverages the Augmented Memory algorithm and demonstrates the first application of the Mamba architecture for generative molecular design. We…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced biosensing and bioanalysis techniques
MethodsExperience Replay
