Emergence of stable and fast folding protein structures
D. Thirumalai, D. K. Klimov

TL;DR
This paper proposes that the space of protein structures is sparse compared to sequences, with only a small subset of sequences folding into stable, functional structures, as demonstrated through lattice models.
Contribution
It introduces a physical framework explaining how simple features like low energy and compaction reduce the sequence space to foldable structures, supported by lattice model simulations.
Findings
Number of low energy structures scales as ln N
Sequences with small sigma fold rapidly and are stable over temperature range
Functional constraints further limit the set of biologically competent sequences
Abstract
The number of protein structures is far less than the number of sequences. By imposing simple generic features of proteins (low energy and compaction) on all possible sequences we show that the structure space is sparse compared to the sequence space. Even though the sequence space grows exponentially with N (the number of amino acids) we conjecture that the number of low energy compact structures only scales as ln N. This implies that many sequences must map onto countable number of basins in the structure space. The number of sequences for which a given fold emerges as a native structure is further reduced by the dual requirements of stability and kinetic accessibility. The factor that determines the dual requirement is related to the sequence dependent temperatures, T_\theta (collapse transition temperature) and T_F (folding transition temperature). Sequences, for which \sigma…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
