Benchmarking inverse statistical approaches for protein structure and design with exactly solvable models
Hugo Jacquin, Amy Gilson, Eugene Shakhnovich, Simona Cocco, R\'emi, Monasson

TL;DR
This study benchmarks inverse statistical methods for protein structure prediction using lattice protein models, showing they effectively capture structural and energetic features, including design of new sequences, despite underlying complexities.
Contribution
It demonstrates that inverse statistical approaches can reliably infer protein energetics and structure from sequence data using lattice models, revealing their capabilities and limitations.
Findings
Inferred Potts Hamiltonians reflect native and competing fold energetics.
Potts models enable design of new sequences with desired folds.
Effective couplings depend on both positive and negative design factors.
Abstract
Inverse statistical approaches to determine protein structure and function from Multiple Sequence Alignments (MSA) are emerging as powerful tools in computational biology. However the underlying assumptions of the relationship between the inferred effective Potts Hamiltonian and real protein structure and energetics remain untested so far. Here we use lattice protein model (LP) to benchmark those inverse statistical approaches. We build MSA of highly stable sequences in target LP structures, and infer the effective pairwise Potts Hamiltonians from those MSA. We find that inferred Potts Hamiltonians reproduce many important aspects of 'true' LP structures and energetics. Careful analysis reveals that effective pairwise couplings in inferred Potts Hamiltonians depend not only on the energetics of the native structure but also on competing folds; in particular, the coupling values reflect…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
