NMR-Derived Salt Bridges in Insulin Analogue: Resolving Artifactual Overbinding in Molecular Dynamics via Charge Scaling
Ngoc Lan Le Nguyen, Jiří Žák, Pavel Jungwirth, Martin Lepšík

TL;DR
This paper uses charge scaling in molecular dynamics simulations to accurately model salt bridges in an insulin analogue, resolving issues of overbinding seen in traditional methods.
Contribution
The novel use of scaled-charge prosECCo75 in MD simulations provides a reliable description of salt bridges in proteins.
Findings
ff19SB induces a non-native salt bridge in insulin analogue simulations.
prosECCo75 provides a biologically reasonable dissociation barrier of 1 kcal mol–1.
CHARMM36m and ff19SB show high salt bridge strengths of up to 4 and 5 kcal mol–1, respectively.
Abstract
Salt bridges are ionic interactions that are of great importance in protein recognition. However, their structural description using X-ray crystallography or NMR may be inconclusive. Classical molecular dynamics (MD) used for the interpretation neglects electronic polarization, which results in artifactual overbinding. Here, we resolve the problem via charge scaling, which accounts for electronic polarization in a mean-field way. We study three salt bridges in insulin analogue. New NMR ensembles are generated via NOE-restrained MD using ff19SB and CHARMM36m force fields and the scaled-charge prosECCo75. Tens of μs of unrestrained MD show in a statistically converged manner that ff19SB induces a non-native salt bridge. This behavior is quantified via umbrella sampling of salt bridge dissociation, which indicates a rather high strength of up to 4 and 5 kcal mol–1 for CHARMM36m and ff19SB,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5- —Univerzita Karlova v Praze10.13039/100007397
- —European Research Council10.13039/501100000781
- —Grantov? Agentura Cesk? Republiky10.13039/501100001824
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy and Quantum Chemical Studies · Advanced NMR Techniques and Applications · Protein Structure and Dynamics
Charge–charge interactions are among the fundamental physical phenomena in chemical processes, playing crucial roles in molecular structure and recognition. Their correct description in MD simulations is essential for accurate modeling of chemical and biological systems and thus advancing material science as well as computer-aided drug design. ?−? ? The commonly used nonpolarizable force fields, however, tend to exaggerate charge–charge interactions due to the absence of additional attenuation caused by electronic polarization. To address this limitation without additional computational costs, the charge scaling method, ?,? introduces electronic polarization effects through a mean-field approach denoted as the Electronic Continuum Correction (ECC).? During the past decade, ECC-based models have been extensively developed and applied to various systems, including ionic solutions, ?−? ? ? ? ? ? ? ? ? protein–ion interactions, ?−? ? ? lipids, ?−? ? solid surfaces and their interfaces with aqueous solutions, ?−? ? ? biological systems, ?−? ? and ionic liquids. ?−? ? Overall, the use of ECC qualitatively improved the results, often aligning the computational predictions quantitatively with the experimental results.
Salt bridges are ubiquitous ion–ion interactions that determine the shapes of biomolecules, their specific recognition, and thus underlie many important processes in molecular biology. ?−? ? ? In proteins, they occur between acidic carboxylates of Glu or Asp side chains or the C-termini and basic Lys, Arg, or His(+) side chains or the N-termini within a cutoff N···O distance of about 3.5 Å. Despite the critical role of salt bridges in biomolecular stability and function, systematic studies quantifying their strength remain limited, with only a few investigations reported in both experimental and computational contexts. The stability of salt bridges is dependent on not only the strength of the noncovalent interaction but also its solvent exposure, hydrophilicity of the surroundings, and conformational entropy. Thus, the two charged groups can form direct contact pairs or solvent-shared pairs or become completely separated. Salt bridges become typically stronger, with stabilization free energies of about 3–5 kcal mol^–1^, when partially buried inside proteins or shielded from the protein surface due to reduced dielectric effects, ?,? while surface-exposed ones are disfavored by increased solvation and conformational entropy, with their strength ranging from 0.2–0.5 kcal mol^–1^. ?−? ? A theoretical study using molecular dynamics simulations with the OPLS-AA force field on a capped Lys–Glu dipeptide in water clusters supports this view. The results show that salt bridge strength decreases with increasing hydrationfrom approximately 3.7 kcal mol^–1^ in a 20-water molecule cluster to around 2.2 kcal mol^–1^ in a 150-water molecule clusterhighlighting the role of solvent screening in modulating electrostatic interactions.? To extrapolate to the bulk solvent, a smaller model, namely, the NH_4_ ^+^ ··· HCOO^–^ ion pair, was calculated, yielding nearly identical results (approximately 3.9 kcal mol^–1^ in a 20-water molecule cluster and 2.4 kcal mol^–1^ in the bulk water). In addition, a comprehensive study of salt bridge interactions across various force fields, including AMBER, CHARMM, and OPLS, was conducted using MD simulations of amino acid analogues (Arg/Asp, Lys/Asp, and His(+)/Asp) in an explicit solvent.? One Arg/Asp capped amino acid dipeptide salt bridge was also simulated. Given the availability of experimentally measured association constant values of the oppositely charged pairs, the simulations revealed that most of the tested force fields overestimated the strength of the salt bridges. The free energies obtained from potential of mean force calculations are about 2–3 kcal mol^–1^ for the amino acid analogues and about 2–4 kcal mol^–1^ for the capped amino acid dipeptide. Overall, previous scarce molecular simulation studies have primarily focused on salt bridge stability of simplified systems such as amino acid analogues or capped dipeptides. An insight into salt bridge behavior at the molecular level within complex and biologically relevant protein environments is still lacking.
To address this issue, we have chosen a structurally very well characterized and relatively small systeminsulin and its analogues.? These peptides consist of two interconnected chains, A and B, which make up, in total, about 50 amino acids. The three possible salt bridges are inferred from the available high-resolution X-ray crystal structures as follows: the N-terminal primary amino group of A1 and the Glu A4 side chain (A1–A4), the C-terminal carboxylate of A21 and the Arg B22 side chain (A21–B22), and the side chains of Glu A17 and Arg B22 (A17–B22). A refined experimental reference is derived from the existing NOE distance restraints.? They serve for restrained MD simulations using the standard ff19SB? and CHARMM36m (C36m),? alongside the recent scaled-charge prosECCo75? protein force fields, to yield extended NMR ensembles. These, in turn, are compared to unrestrained MD performed with the same parameter sets. Further, to quantify the Gibbs free energy required to transition from a contact to a solvent-shared ion pair, umbrella sampling simulations for the two stable salt bridges are carried out.
Four high-resolution crystallographic structures of human insulin and its analogues (resolution of 1 Å or better) were examined for the presence of salt bridges. In the order of decreasing prevalence in all the chains and alternative conformations, A1–A4 was found in 8 cases of occurrences, A21–B22 had 5 occurrences, and A17–B22 only three (Figure and ). Note that in none of the cases, all three salt bridges would be found present at the same time. To go beyond the static view of the frozen crystals, the 30 NMR models of the l-HisB24 analogue (PDB 2M2N ?) were analyzed. The occurrences of the three salt bridges were 23.3%, 60.0%, and 0%, respectively. The discrepancy between these orderings may come from different experimental conditions, but also from the specific force field (AMBER) used for the interpretation of the NMR data.?
To obtain unbiased and expanded NMR ensembles for reference, four replicas of 100 ns NOE-restrained MD simulations were conducted using the three protein force fields, combined with the standard TIP3P ?,? or SPC/E? water models. The stability of the secondary structures was maintained in all of the setups with an average root-mean-square deviation (RMSD) values of 0.6–0.7 Å (). In comparison, four replicas of 10 μs of unrestrained MD simulations gave average RMSDs of the secondary structure backbone twice larger but still reasonably small, about 1.2 to 1.4 Å (). The time evolution of RMSDs suggests that some equilibration occurred in the first 2.5 μs (). Therefore, only the salt bridge occurrences in the last 7.5 μs were taken into account for further analyses.
Figure shows the salt bridge occupancies extracted from all of the MD simulations using the TIP3P water model. Very similar trends were observed for the SPC/E water model (), with some quantitative differences noted below. For the A1–A4 and A21–B22 salt bridges, the occupancies from the NOE-restrained MDs are similar (within 23%) to the unrestrained MD (Figuresab, ). Also, compared to ff19SB and C36m, prosECCo75 always yields lower occupancies, which is understandable given its scaled charges (). While the trends were maintained, there was a quantitative difference for the A21–B22 case using the SPC/E water model, where the drop in occupancy from ff19SB and C36m to prosECCo75 was reduced from about 50% to 20% ().
At a closer inspection of the convergence of these two salt bridges in time, based on comparing the salt bridge occurrences for the individual segments of the production runs, we typically find differences up to 10%, with a few cases reaching 20% (). The differences between the four segments are much smaller in the case of A1–A4 than A21–B22 (Figuresab). This is caused by their different locations within the insulin analogue. The N-terminal A1 and Glu A4 form the stable α-helix1 of the A-chain. In contrast, the C-terminal A21–Arg B22 salt bridge is surface-exposed and flexible, hence the larger standard deviations.
The striking case is the A17–B22 salt bridge which, in the NOE-restrained MD, is very weak using ff19SB (8%) and practically nonexistent in NOE-restrained MD using C36m and prosECCo75 (Figurec, ). Consistently, unrestrained MD with prosECCo75 shows practically no salt bridge formation and with C36m only a very weak one. In sharp contrast, in unrestrained MD, ff19SB predicts this non-native salt bridge at 80–90% occurrence in both water models. This overbinding and related formation of artifactual charged clusters (Glu A17···Arg B22···Asn A21 C-terminus in this case; Figure) calls for caution when using ff19SB for MD or NMR structural interpretation of interactions in proteins involving charged residues.
To inspect the salt bridge strength in more detail, we carried out umbrella sampling simulations to calculate the free energy of dissociation (meaning the transition from the direct contact to the solvent-shared pair) of the two well-defined salt bridges A1-A4 and A21–B22. We note that in agreement with Figurec, A17 frequently made contact with B22 during the simulations using ff19SB but it did not using C36m or prosECCo75. The resulting potentials of mean force (PMF) profiles are presented in Figures and for the TIP3P and SPC/E water models, respectively. The estimated boundaries separating the direct contact and solvent-shared conformations of the salt bridges are shown in . The results described below are for the TIP3P water model, but comparable tendencies are obtained using the SPC/E water model as well; see . For the A1–A4 pair, the free energy profiles computed from ff19SB and C36m force fields closely resemble each other (Figure, upper left panel) with a transition barrier of 2.5–2.9 kcal mol^–1^. prosECCo75, on the other hand, yields a barrier of half the height, roughly 1.3 kcal mol^–1^. Similarly, the A21–B22 pair exhibits high ion pair stabilities using ff19SB and C36m with free energy barriers of approximately 4.9 and 3.1 kcal mol^–1^, respectively (Figure, upper right panel). In contrast, prosECCo75, again in this case, yields the barrier height of one-third or one-half, respectively, with a value around 1.4 kcal mol^–1^. Consistently with previous studies on salt bridge strength, the full-charge ff19SB and C36m force fields tend to overbind, whereas prosECCo75 provides a more realistic stabilization. ?−? ? ? ?,?
Taken together, the description of salt bridges in proteins by standard and scaled-charge force fields was tested in insulin analogues against newly extended NMR ensembles. The standard force fields ff19SB showed a tendency for overbinding in an Arg–Glu salt bridge, thus forming a non-native contact and a charged cluster, observed neither in X-ray nor in the extended NMR ensemble. This behavior was caused by the overpolarization of fixed partial charges.
A simple, yet physically well grounded solution with no computational overhead is the inclusion of electronic polarization effects via charge scaling. ?−? ? The tested recent scaled-charge force field prosECCo75? did not induce the non-native salt bridge, had lower occupancies of the two native salt bridges and the dissociation barrier of 1 kcal mol^–1^, which is biologically more realistic than values up to 4–5 kcal mol^–1^ as observed for standard full charge force fields. Such differences have the potential to critically influence computational studies which use MD for evaluating insulin and related peptide structure and their binding toward the receptor. ?−? ?
In general, overbinding of charged species in nonpolarizable MD is a known issue which can be addressed within the nonpolarizable framework either via a nonbonded fix or via charge scaling within ECC.? The latter approach, which is physically well justified and therefore preferred also in this study, is currently under active development. The employed scaling factors generally range from 0.75–0.85, with a notion that the former may in some cases slightly underestimate the strength of ion pairing. ?,? Other characteristics, such as the charge density or hydrophobicity of the ion, have also bearings on the most appropriate scaling factor.? Finally, new water force fields? compatible with ECC emerge, setting the ground for fully consistent models.
Computational Methods
Experimental Structure Selection
Atomistic structural experimental evidence of salt bridges in human insulin and its analogues comes from X-ray crystallography and nuclear magnetic resonance (NMR) spectroscopy. Four high-quality crystal structures with resolution equal or less than 1 Å are available under PDB codes 1MSO,? 3W7Y,? 5E7W,? and 5HQI.? More than a hundred NMR structures are available in the PDB but some are present in hexamers or solved at acidic pH, which affects their structures. We have thus chosen the NMR structural ensemble of the l-His B24 insulin analogue solved at pH = 8 (PDB 2M2N)? The three salt bridges studied are not affected by the Phe-to-His replacement at the B24 position, which points away from them. The presence of the salt bridges in the listed X-ray structures and 30 models of 2M2N was measured as distances between implicated oxygen and nitrogen atoms in PyMol, version 2.3.0.? or VMD, version 1.9.4a57.?
System Setup
The first model from the 30-structure ensemble of l-HisB24 insulin analogue, PDB 2M2N,? was used as the starting structure and prepared in CHARMM-GUI.? The three disulfide bonds (cysteine pairs A6 and A11, A7 and B7, and A20 and B19) were automatically recognized. The A1, A21, B1, and B30 termini were set as charged.
Two standard (ff19SB? and CHARMM36m, C36m?) and one charge-scaled (prosECCo75?) protein force fields were used, as they represent the state-of-the-art in their categories. The partial charge distributions of the charged amino acids forming the salt bridge were presented in . All the protein force fields were combined with TIP3P (the original version? for ff19SB, the modified one? for C36m and prosECCo75) and SPC/E? water models. For simplicity, the original and modified TIP3P water models are referred to throughout the text as TIP3P. We acknowledge that ff19SB may pair well also with the (computationally more demanding) 4-site OPC water model. Nevertheless, to maintain consistency with CHARMM36m, and prosECCo75 water models which were originally parametrized and validated with the TIP3P water model, we opted in this study to use the TIP3P and SPC/E water models in combination with ff19SB as well. The systems were neutralized in a 150 mM NaCl solution using CHARMM-GUI solution builder tool,? with Na^+^ and Cl^–^ modeled using scaled-charge force fields. A cubic unit cell had sides of 5.3 nm.
MD Details
The MD simulations were carried out using GROMACS 2022.3 and 2022.4? and AMBER20? software packages for unrestrained MD (MD structural ensembles) and Nuclear Overhauser Enhancement (NOE)-restrained MD (NMR structural ensembles), respectively. We have strived to use the same setup for running the simulations in GROMACS and AMBER but due to different options available, timing of individual steps, or hardware requirements, minor deviations between these protocols occurred as detailed below. These pertained mostly to the lengths of equilibration steps, force constants for positional restraints, thermostat, barostat, or bond-constraining algorithms.
The unrestrained MD simulations were subject to minimization and equilibration in the NVT ensemble for 125 ps with harmonically restrained proteins (force constants of 96 kcal mol^–1^ nm^–2^ for backbones, 10 kcal mol^–1^ nm^–2^ for side chains, and 1 kcal mol^–1^ rad^–2^ for dihedrals). In the NOE-restrained MD, the systems were minimized and equilibrated in NVT and NpT ensembles (60 and 60 ps) with 1 fs time step and harmonically restrained proteins (force constant of 100 kcal mol^–1^ nm^–2^). Both production simulations were propagated with a 2 fs time step at constant temperature (300 K) and constant pressure (1 bar) in NpT ensemble. The unrestrained MD were simulated for 10 μs and employed the Nosé-Hoover thermostat ?−? ? and Parrinello–Rahman barostat,? and all the bonds containing hydrogen were constrained with the LINCS algorithm.? The NOE-restrained MD was simulated for 100 ns and used the Langevin thermostat? and Monte Carlo barostat? and bonds containing hydrogen were constrained with the SHAKE algorithm.? Four replicas of production MD were performed by using the force field combinations mentioned above. The Particle Mesh Ewald (PME) method? was applied to calculate long-range electrostatic interactions. The cutoff distance of 1.2 nm was used for both short-range electrostatic and van der Waals interactions.
To study the dynamical properties of the insulin analogue, the occupancies of all salt bridges in the NMR structural ensemble and our MD trajectories were calculated using MDAnalysis ?,? with a cutoff N···O distance of 3.5 Å. The calculated occupancies provide a quantitative measure of the stability and persistence of salt bridge interactions over the simulation time.
Umbrella sampling simulations were conducted to measure the free energy profile for the salt bridge opening, i.e., increasing the C···CZ distance (r C‑CZ) in A21–B22 and the CD ···N distance (r CD‑N) in A1–A4. The sampling windows were spaced 0.3 to 0.5 Å apart, and harmonic umbrella potentials with force constants of 72 and 95 kcal mol^–1^ nm^–2^ were applied for N-terminal A1–Glu A4 and C-terminal A21–Arg B22, respectively. The free energy profiles were obtained using the Weighted Histogram Analysis Method (WHAM)? as implemented in GROMACS.? Finally, free energy profiles are corrected by +2k B Tln(r), which is a volume entropy factor,? and shifted to get the minimum value at zero.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Simonson T.Archontis G.Karplus M.Free energy simulations come of age: Protein-ligand recognition Acc. Chem. Res.20023543043710.1021/ar 010030 m 12069628 · doi ↗ · pubmed ↗
- 2Sheinerman F. B Norel R.Honig B.Electrostatic aspects of protein-protein interactions Curr. Opin. Struct. Biol.20001015315910.1016/S 0959-440X(00)00065-810753808 · doi ↗ · pubmed ↗
- 3Quan X.Liu J.Zhou J.Multiscale modeling and simulations of protein adsorption: progresses and perspectives Curr. Opin. Colloid Interface Sci.201941748510.1016/j.cocis.2018.12.004 · doi ↗
- 4Leontyev I. V.Stuchebrukhov A. A.Electronic continuum model for molecular dynamics simulations J. Chem. Phys.200913008510210.1063/1.306016419256627 PMC 3910273 · doi ↗ · pubmed ↗
- 5Leontyev I. V.Stuchebrukhov A. A.Accounting for electronic polarization in non-polarizable force fields Phys. Chem. Chem. Phys.2011132613262610.1039/c 0cp 01971 b 21212894 · doi ↗ · pubmed ↗
- 6Kirby B. J.Jungwirth P.Charge scaling manifesto: A way of reconciling the inherently macroscopic and microscopic natures of molecular simulations J. Phys. Chem. Lett.2019107531753610.1021/acs.jpclett.9b 0265231743030 · doi ↗ · pubmed ↗
- 7Kohagen M.Lepšík M.Jungwirth P.Calcium binding to calmodulin by molecular dynamics with effective polarization J. Phys. Chem. Lett.201453964396910.1021/jz 502099 g 26276478 · doi ↗ · pubmed ↗
- 8Kohagen M.Mason P. E.Jungwirth P.Accounting for electronic polarization effects in aqueous sodium chloride via molecular dynamics aided by neutron scattering J. Phys. Chem. B 20161201454146010.1021/acs.jpcb.5b 0522126172524 · doi ↗ · pubmed ↗
