Protein-Conditioned Multi-Objective Reinforcement Learning for Full-Length mRNA Design
Zixi Shao, Tao Wang, Yibei Xiao, Tianyi Huang

TL;DR
ProMORNA is a multi-objective reinforcement learning framework that designs full-length mRNA transcripts from protein sequences, optimizing stability, efficiency, and safety, demonstrated on unseen targets.
Contribution
It introduces a novel multi-objective RL method, MO-GRPO, trained on large protein-mRNA data to generate optimized mRNA sequences de novo.
Findings
ProMORNA improves the Pareto frontier for half-life and translation efficiency.
It achieves higher predicted functional scores than state-of-the-art baselines.
The approach is effective on unseen protein targets.
Abstract
Designing therapeutic messenger RNA (mRNA) requires creating full-length transcripts that carefully balance stability, translation efficiency, and immune safety. To address this challenge, we propose ProMORNA, a multi-objective generation framework that produces complete mRNA transcripts \textit{de novo} directly from a target protein sequence. Our approach begins by training a BART-style encoder-decoder model on over 6 million natural protein-mRNA pairs. We then introduce Multi-Objective Group Relative Policy Optimization (MO-GRPO) to simultaneously optimize for various biological objectives in a unified way. As a case study, we evaluated ProMORNA on the widely used firefly luciferase target, excluding it from both our supervised training data and the prompt pool. The results indicate that ProMORNA improves the \textit{in silico} Pareto frontier for predicted half-life and translation…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
