Improved Off-policy Reinforcement Learning in Biological Sequence Design
Hyeonah Kim, Minsu Kim, Taeyoung Yun, Sanghyeok Choi, Emmanuel Bengio, Alex Hern\'andez-Garc\'ia, Jinkyoo Park

TL;DR
This paper introduces a conservative off-policy reinforcement learning method for biological sequence design that improves robustness and outperforms existing methods across various biological tasks.
Contribution
The paper proposes $oldsymbol{ extdelta}$-Conservative Search, a novel off-policy RL approach that adaptively restricts exploration based on proxy model confidence to enhance robustness in biological sequence design.
Findings
Outperforms existing methods in discovering high-score sequences.
Effectively adapts conservativeness based on proxy uncertainty.
Demonstrates success across DNA, RNA, protein, and peptide design tasks.
Abstract
Designing biological sequences with desired properties is challenging due to vast search spaces and limited evaluation budgets. Although reinforcement learning methods use proxy models for rapid reward evaluation, insufficient training data can cause proxy misspecification on out-of-distribution inputs. To address this, we propose a novel off-policy search, -Conservative Search, that enhances robustness by restricting policy exploration to reliable regions. Starting from high-score offline sequences, we inject noise by randomly masking tokens with probability , then denoise them using our policy. We further adapt based on proxy uncertainty on each data point, aligning the level of conservativeness with model confidence. Experimental results show that our conservative search consistently enhances the off-policy training, outperforming existing machine learning…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Code & Models
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMicrobial Metabolic Engineering and Bioproduction · Viral Infectious Diseases and Gene Expression in Insects · RNA and protein synthesis mechanisms
