Improved Off-policy Reinforcement Learning in Biological Sequence Design

Hyeonah Kim; Minsu Kim; Taeyoung Yun; Sanghyeok Choi; Emmanuel Bengio; Alex Hern\'andez-Garc\'ia; Jinkyoo Park

arXiv:2410.04461·cs.LG·June 18, 2025

Improved Off-policy Reinforcement Learning in Biological Sequence Design

Hyeonah Kim, Minsu Kim, Taeyoung Yun, Sanghyeok Choi, Emmanuel Bengio, Alex Hern\'andez-Garc\'ia, Jinkyoo Park

PDF

Open Access 1 Repo

TL;DR

This paper introduces a conservative off-policy reinforcement learning method for biological sequence design that improves robustness and outperforms existing methods across various biological tasks.

Contribution

The paper proposes $oldsymbol{ extdelta}$-Conservative Search, a novel off-policy RL approach that adaptively restricts exploration based on proxy model confidence to enhance robustness in biological sequence design.

Findings

01

Outperforms existing methods in discovering high-score sequences.

02

Effectively adapts conservativeness based on proxy uncertainty.

03

Demonstrates success across DNA, RNA, protein, and peptide design tasks.

Abstract

Designing biological sequences with desired properties is challenging due to vast search spaces and limited evaluation budgets. Although reinforcement learning methods use proxy models for rapid reward evaluation, insufficient training data can cause proxy misspecification on out-of-distribution inputs. To address this, we propose a novel off-policy search, $δ$ -Conservative Search, that enhances robustness by restricting policy exploration to reliable regions. Starting from high-score offline sequences, we inject noise by randomly masking tokens with probability $δ$ , then denoise them using our policy. We further adapt $δ$ based on proxy uncertainty on each data point, aligning the level of conservativeness with model confidence. Experimental results show that our conservative search consistently enhances the off-policy training, outperforming existing machine learning…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Code & Models

Repositories

hyeonahkimm/delta_cs
pytorchOfficial

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsMicrobial Metabolic Engineering and Bioproduction · Viral Infectious Diseases and Gene Expression in Insects · RNA and protein synthesis mechanisms