Steering Generative Models for Protein Design: Aligning and Conditioning Strategies
Filippo Stocco, Michele Garibbo, Noelia Ferruz

TL;DR
This paper reviews strategies for guiding generative models to produce proteins with specific desired properties, addressing the challenge of exploring low-probability regions in protein design.
Contribution
It categorizes and surveys recent methods for steering generative models in protein design, highlighting approaches that modify or keep fixed the model parameters.
Findings
Strategies include parameter modification and fixed-parameter approaches.
These methods enable targeted protein generation with desired properties.
The review clarifies the landscape of steering techniques in protein generative modeling.
Abstract
Generative artificial intelligence models learn probability distributions from data and produce novel samples that capture the salient properties of their training sets. Proteins are particularly attractive for such approaches given their abundant data and the versatility of their representations, ranging from sequences to structures and functions. This versatility has motivated the rapid development of generative models for protein design, enabling the generation of functional proteins and enzymes with unprecedented success. However, because these models mirror their training distribution, they tend to sample from its most probable modes, while low-probability regions, often encoding valuable properties, remain underexplored. To address this challenge, recent work has proposed strategies for steering generative models toward user-specified properties. In this review, we survey and…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMachine Learning in Materials Science · Protein Structure and Dynamics · Machine Learning in Bioinformatics
