Relaxed Sequence Sampling for Diverse Protein Design
Joohwan Ko, Aristofanis Rontogiannis, Yih-En Andrew Ban, Axel Elaldi, Nicholas Franklin

TL;DR
Relaxed Sequence Sampling (RSS) is a novel MCMC-based framework that enhances protein design by integrating structural and evolutionary information, achieving greater diversity and designability than previous methods.
Contribution
RSS introduces a continuous logit space sampling method combining gradient guidance with language model jumps, improving diversity and structural plausibility in protein design.
Findings
RSS produces 5× more designable structures.
RSS achieves 2-3× greater structural diversity.
RSS matches baseline computational costs.
Abstract
Protein design using structure prediction models such as AlphaFold2 has shown remarkable success, but existing approaches like relaxed sequence optimization (RSO) rely on single-path gradient descent and ignore sequence-space constraints, limiting diversity and designability. We introduce Relaxed Sequence Sampling (RSS), a Markov chain Monte Carlo (MCMC) framework that integrates structural and evolutionary information for protein design. RSS operates in continuous logit space, combining gradient-guided exploration with protein language model-informed jumps. Its energy function couples AlphaFold2-derived structural objectives with ESM2-derived sequence priors, balancing accuracy and biological plausibility. In an in silico protein binder design task, RSS produces 5 more designable structures and 2-3 greater structural diversity than RSO baselines, at equal computational…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
