RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design

Meghana Kshirsagar; Allen Nie; Ching-An Cheng; Fanglei Xue; Rahul Dodhia; Juan Lavista Ferres; Kevin K. Yang; Frank DiMaio

arXiv:2604.17175·cs.LG·May 1, 2026

RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design

Meghana Kshirsagar, Allen Nie, Ching-An Cheng, Fanglei Xue, Rahul Dodhia, Juan Lavista Ferres, Kevin K. Yang, Frank DiMaio

PDF

TL;DR

RosettaSearch employs large language models for multi-objective, inference-time optimization in protein sequence design, significantly improving structural fidelity and success rates over existing methods without retraining models.

Contribution

This work introduces a novel inference-time optimization approach using LLMs for protein design, demonstrating systematic improvements and generalization across models and structures.

Findings

01

Achieved 18% to 68% improvements in structural fidelity metrics.

02

Increased design success rate by 2.5 times over baseline.

03

Demonstrated robustness across different LLMs and backbone structures.

Abstract

We introduce RosettaSearch, an inference-time multi-objective optimization approach for backbone conditioned protein sequence design. We use large language models (LLMs) as a generative optimizer within a search algorithm capable of controlled exploration and exploitation, using rewards computed from RosettaFold3, a structure prediction model, under a strict computational budget. In a large-scale evaluation, we apply RosettaSearch to 400 suboptimal sequences generated by LigandMPNN (a state-of-the-art model trained for protein sequence design), recovering high-fidelity designs that LigandMPNN's single-pass decoding fails to produce. RosettaSearch's designs show improvements in structural fidelity metrics ranging between 18% to 68%, translating to a 2.5x improvement in design success rate. We observe that these gains in success rate are robust when RosettaSearch-designed sequences are…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.