RosettaSearch: Multi-Objective Inference-Time Search for Protein Sequence Design
Meghana Kshirsagar, Allen Nie, Ching-An Cheng, Fanglei Xue, Rahul Dodhia, Juan Lavista Ferres, Kevin K. Yang, Frank DiMaio

TL;DR
RosettaSearch employs large language models for multi-objective, inference-time optimization in protein sequence design, significantly improving structural fidelity and success rates over existing methods without retraining models.
Contribution
This work introduces a novel inference-time optimization approach using LLMs for protein design, demonstrating systematic improvements and generalization across models and structures.
Findings
Achieved 18% to 68% improvements in structural fidelity metrics.
Increased design success rate by 2.5 times over baseline.
Demonstrated robustness across different LLMs and backbone structures.
Abstract
We introduce RosettaSearch, an inference-time multi-objective optimization approach for backbone conditioned protein sequence design. We use large language models (LLMs) as a generative optimizer within a search algorithm capable of controlled exploration and exploitation, using rewards computed from RosettaFold3, a structure prediction model, under a strict computational budget. In a large-scale evaluation, we apply RosettaSearch to 400 suboptimal sequences generated by LigandMPNN (a state-of-the-art model trained for protein sequence design), recovering high-fidelity designs that LigandMPNN's single-pass decoding fails to produce. RosettaSearch's designs show improvements in structural fidelity metrics ranging between 18% to 68%, translating to a 2.5x improvement in design success rate. We observe that these gains in success rate are robust when RosettaSearch-designed sequences are…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
