# Sampling Conformational Ensembles of Highly Dynamic Proteins via Generative Deep Learning

**Authors:** Talant Ruzmetov, Ta I Hung, Saisri Padmaja Jonnalagedda, Si-han Chen, Parisa Fasihianifard, Zhefeng Guo, Bir Bhanu, Chia-en A. Chang

PMC · DOI: 10.21203/rs.3.rs-4301803/v1 · Research Square · 2024-06-28

## TL;DR

This paper introduces a deep learning model to sample protein conformations, revealing new insights into dynamic proteins like Aβ42.

## Contribution

A novel unsupervised deep learning model, ICoN, is developed to sample conformational ensembles of dynamic proteins.

## Key findings

- ICoN learns physical principles of conformational changes from MD simulations.
- The model generates novel conformations with detailed sidechain and backbone arrangements.
- Synthetic conformations of Aβ42 revealed clusters that align with experimental data.

## Abstract

Proteins are inherently dynamic, and their conformational ensembles are functionally important in biology. Large-scale motions may govern protein structure–function relationship, and numerous transient but stable conformations of intrinsically disordered proteins (IDPs) can play a crucial role in biological function. Investigating conformational ensembles to understand regulations and disease-related aggregations of IDPs is challenging both experimentally and computationally. In this paper first an unsupervised deep learning-based model, termed Internal Coordinate Net (ICoN), is developed that learns the physical principles of conformational changes from molecular dynamics (MD) simulation data. Second, interpolating data points in the learned latent space are selected that rapidly identify novel synthetic conformations with sophisticated and large-scale sidechains and backbone arrangements. Third, with the highly dynamic amyloid-β1–42 (Aβ42) monomer, our deep learning model provided a comprehensive sampling of Aβ42’s conformational landscape. Analysis of these synthetic conformations revealed conformational clusters that can be used to rationalize experimental findings. Additionally, the method can identify novel conformations with important interactions in atomistic details that are not included in the training data. New synthetic conformations showed distinct sidechain rearrangements that are probed by our EPR and amino acid substitution studies. The proposed approach is highly transferable and can be used for any available data for training. The work also demonstrated the ability for deep learning to utilize learned natural atomistic motions in protein conformation sampling.

## Full-text entities

- **Genes:** APP (amyloid beta precursor protein) [NCBI Gene 351] {aka AAA, ABETA, ABPP, AD1, APPI, CTFgamma}

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC11230488/full.md

## Figures

9 figures with captions in the complete paper: https://tomesphere.com/paper/PMC11230488/full.md

## References

74 references — full list in the complete paper: https://tomesphere.com/paper/PMC11230488/full.md

---
Source: https://tomesphere.com/paper/PMC11230488