# Stable de novo protein design via joint conformational landscape and sequence optimization

**Authors:** Yehlin Cho, Justas Dauparas, Kotaro Tsuboyama, Gabriel J. Rocklin, Sergey Ovchinnikov

PMC · DOI: 10.1038/s41467-025-66526-w · Nature Communications · 2025-12-24

## TL;DR

This paper introduces a new method for designing stable proteins by jointly optimizing their sequence and structure, leading to better folding stability.

## Contribution

The novel contribution is a joint optimization framework combining sequence-to-structure and structure-to-sequence modeling for improved de novo protein design.

## Key findings

- Joint optimization improves stability prediction and protein design compared to separate methods.
- Sequences from the joint model show more hydrophilic interactions, aiding structural stability.
- Large-scale experiments validate the effectiveness of the joint modeling approach.

## Abstract

Generative protein modeling provides advanced tools for designing diverse protein sequences and structures. However, accurately modeling the conformational landscape and designing sequences remain critical challenges: ensuring that the designed sequence reliably folds into the target structure as its most stable conformation, and optimizing the sequence for a given suboptimal fixed input structure. In this study, we present a systematic analysis of jointly optimizing sequence-to-structure and structure-to-sequence mappings. This approach enables us to find optimal solutions for modeling the conformational landscape. We validate our approach with large-scale protein stability measurements, demonstrating that joint optimization is superior for designing stable proteins using a joint model (TrRosetta and TrMRF) and for achieving high accuracy in stability prediction when jointly modeling (half-masked ESMFold pLDDT + ESM2 Pseudo-likelihood). We further investigate features of sequences generated from the joint model and find that they exhibit higher frequencies of hydrophilic interactions, which may help maintain both secondary structure registry and pairing-features not captured by structure-to-sequence modeling alone.

This study presents a comprehensive modelling framework that jointly optimizes sequence and structure to generate de novo proteins with improved folding stability, providing large-scale experimental benchmarking across multiple computational design methods

## Full-text entities

- **Diseases:** ESM 2 (MESH:D020803)
- **Chemicals:** Cysteine (MESH:D003545), glutamic acid (MESH:D018698), amino acid (MESH:D000596), hydrogen (MESH:D006859), disulfide (MESH:D004220), lysine (MESH:D008239)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12764529/full.md

## Figures

4 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12764529/full.md

## References

12 references — full list in the complete paper: https://tomesphere.com/paper/PMC12764529/full.md

---
Source: https://tomesphere.com/paper/PMC12764529