# Improving Protein Structure Determination by Integrating Ensemble-Driven Molecular Dynamics with Chemical Shift-Based Restraints

**Authors:** Márton Gadanecz, Zsolt Fazekas, Dóra K. Menyhárd, András Perczel

PMC · DOI: 10.1021/acs.jcim.5c02358 · 2026-02-16

## TL;DR

This paper introduces a new method that combines molecular dynamics with chemical shift data to improve the accuracy of protein structure determination.

## Contribution

The novel protocol integrates EDMD with chemical shift-based restraints to refine protein structures more accurately.

## Key findings

- EDMD improved backbone RMSD compared to CS-Rosetta ensembles for 5 benchmark proteins.
- EDMD enhanced fulfillment of NOE-derived distance restraints over CS-Rosetta and unrestrained MD.
- EDMD maintained correct protein–ligand conformations and can refine nonconverged CS-Rosetta results.

## Abstract

We present a protocol for nuclear magnetic resonance
(NMR) chemical
shift-based structure determination that employs ensemble-driven molecular
dynamics (EDMD) for structure refinement. Here, specifically, Chemical-Shift-Rosetta
(CS-Rosetta) was applied, followed by EDMD. EDMD eliminates the need
to predict chemical shifts at every molecular dynamics (MD) step by
defining continuous, differentiable potential energy functions (PEFs)
based on dihedral angle distributions from CS-Rosetta models while
incorporating the measurement temperature. This yielded a thermodynamically
realistic, experiment-based custom force field for each studied system.
We benchmarked EDMD against 5 proteins (13.1–19.2 kDa), focusing
on systems with nonprotein components and demonstrated its consistent
improvement of backbone root-mean-square deviation (RMSD) relative
to known reference structures over the original CS-Rosetta ensemble.
Moreover, EDMD enhanced the fulfillment of NOE-derived (nuclear Overhauser
effect) distance restraints compared to the results of CS-Rosetta
and unrestrained MD simulations. EDMD also improved NOE-RASREC-Rosetta
(resolution-adapted structural recombination Rosetta protocol supplemented
with NOE-based distance restraints) models and maintained the correct
protein–ligand conformations. This approach provides an opportunity
to refine nonconverged CS-Rosetta structure calculations, where the
results would not be interpretable otherwise. EDMD can be generalized
to any ensemble with scoring information, enabling refined exploration
of the φ/ψ phase space and accurate reinsertion of nonprotein
moieties.

## Full-text entities

- **Genes:** Ly96 (lymphocyte antigen 96) [NCBI Gene 448830] {aka MD-2, MD2}, FBP [NCBI Gene 8056], KRAS (KRAS proto-oncogene, GTPase) [NCBI Gene 3845] {aka 'C-K-RAS, C-K-RAS, CFC2, K-RAS2A, K-RAS2B, K-RAS4A}, Fubp1 (far upstream element binding protein 1) [NCBI Gene 654496], Rbp1 (retinol binding protein 1) [NCBI Gene 25056], Ugt1a7c (UDP glucuronosyltransferase 1 family, polypeptide A7C) [NCBI Gene 154516] {aka UDPGT 1-7, Ugt1, Ugt1a7, Ugt1a8}, Kras (KRAS proto-oncogene, GTPase) [NCBI Gene 24525] {aka K-ras, Kras2, c-Ki-ras, p21}, RBP1 (retinol binding protein 1) [NCBI Gene 5947] {aka CRABP-I, CRBP, CRBP1, CRBPI, RBPC, hCRBP1}
- **Diseases:** PDB (MESH:D011488), csdMD (MESH:D019966), SASA (MESH:D010534), RDC (MESH:C535684), EDMD (MESH:D000092242), PEF (MESH:D011502)
- **Chemicals:** NADP (MESH:D009249), amino acid (MESH:D000596), Ile (MESH:D007532), nitrogens (MESH:D009584), ionone (MESH:D045792), Ni (MESH:D009532), isoalloxazine (MESH:C008173), carbon (MESH:D002244), methanol (MESH:D000432), 1H (-), MgCl2 (MESH:D015636), Met (MESH:D008715), sodium (MESH:D012964), oxygens (MESH:D010100), NAP (MESH:C043186), salt (MESH:D012492), phosphorus (MESH:D010758), GDP (MESH:D006153), H (MESH:D006859), flavin (MESH:C024132), NADH (MESH:D009243), PBS (MESH:D007854), 13C (MESH:C000615229), chloride (MESH:D002712), nicotinamide ribose (MESH:C018613), NaN3 (MESH:D019810), nicotinamide (MESH:D009536), polyene (MESH:D011090), FMN (MESH:D005486), D2O (MESH:D017666), CS (MESH:D002586), water (MESH:D014867), retinol (MESH:D014801)
- **Species:** Homo sapiens (human, species) [taxon 9606]
- **Mutations:** G12C, Tyr-142-Trp
- **Cell lines:** MD — Mus musculus (Mouse), Embryonic stem cell (CVCL_IU77)

## Figures

8 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12977052/full.md

---
Source: https://tomesphere.com/paper/PMC12977052