How protein hydration depends on amino acid composition, peptide conformation, and force fields
Johanna-Barbara Linse, Tobias M. Fischbach, Jochen S. Hub

TL;DR
This study explores how amino acids and protein structure influence the hydration shell around proteins, impacting biological processes and experimental techniques.
Contribution
The paper introduces an amino acid-specific contrast score derived from SAXS data to quantify hydration effects across different protein types.
Findings
Acidic amino acids generate the strongest hydration contrast with 1–1.5 excess water molecules compared to alanine.
Apolar residues cause water depletion at the protein surface, while hydration is weaker around intrinsically disordered proteins.
Hydration shell effects on the radius of gyration are strongly conformation-dependent for intrinsically disordered proteins.
Abstract
The protein hydration shell is a key mediator of processes such as molecular recognition, protein folding, and proton transfer. How solvent-exposed amino acids shape the hydration shell structure is not well understood. We combine molecular dynamics simulations with explicit-solvent predictions of small-angle x-ray scattering (SAXS) curves to quantify the contributions of all 20 proteinogenic amino acids to the hydration shell of the globular GB3 domain and the intrinsically disordered protein (IDP) XAO. We focus on two quantities encoded by SAXS curves: the hydration shell effect on the radius of gyration and the electron density contrast between protein and solvent. We derive an amino acid-specific contrast score, revealing that acidic residues generate the strongest contrast with 1–1.5 excess water molecules relative to alanine, followed by cationic and polar residues. In contrast,…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Enzyme Structure and Function · Mass Spectrometry Techniques and Applications
Significance
Hydration shells of biomolecules constitute a large fraction of the water in crowded cellular environments and play key roles in biological functions such as enzymatic reactions and conformational transitions. Small-angle x-ray scattering (SAXS) has shown that hydration shells differ in density from bulk water, yet how solvent-exposed amino acids and protein surface geometry shape the hydration shell is not well understood. We combined molecular dynamics simulations with explicit-solvent SAXS predictions to quantify how solvent-exposed chemical moieties and protein geometry drive variations in hydration shell density. Notably, the hydration shell of a globular protein differs markedly from that of an intrinsically disordered protein. Our study offers a comprehensive characterization of protein hydration and informs the interpretation of hydration-sensitive experimental techniques.
Introduction
Proteins in solution are enveloped by a hydration shell, formed through electrostatic and dispersive interactions between water molecules and solvent-exposed protein moieties. The hydration shell actively participates in various biological functions, such as protein folding, molecular recognition, enzyme catalysis, proton transfer, or avoidance of unspecific aggregation, and is thus considered an integral part of proteins (1,2,3,4,5). The structure and dynamics of the hydration shell differ from those of bulk water, as revealed by nuclear magnetic resonance, terahertz spectroscopy, time-dependent fluorescence Stokes shift, inelastic neutron scattering, molecular dynamics (MD) simulations, and several other techniques (6,7,8,9,10,11,12,13,14,15,16). Consequently, the vibrational, rotational, and translational dynamics of water molecules in the hydration shell are slowed down by approximately two- to fivefold. Scattering experiments have revealed that the water density in the hydration shell of many proteins is increased relative to the density of bulk water, with the magnitude of this density increase likely being protein dependent (17,18,19,20). In the crowded cytoplasm of biological cells, up to 70% of water belongs to a hydration shell, indicating that water involved in life is predominantly non-bulk-like (3,21,22).
How the composition and relative arrangement of solvent-exposed amino acids control the properties of the protein hydration shell is not well understood. Small-angle scattering with x-rays and neutrons (SAXS/SANS) of highly charged proteins suggested that solvent-exposed anionic aspartate or glutamate residues increase the hydration shell density more than cationic lysine or arginine residues (20,23), which aligns with the large number of structured water molecules located at anionic residues in protein crystals (24). Furthermore, spectroscopic techniques revealed that the polarity of surfaces influences the properties of water at interfaces. At polar surfaces, water exhibits decreased internal water order and fewer internal hydrogen bonds. At apolar surfaces, in contrast, water exhibits increased internal order and more internal hydrogen bonds, and it may form low-density clathrate structures (25,26,27,28,29,30,31,32,33,34,35). Additionally, water has been shown to form a depletion layer with reduced density at hydrophobic surfaces (36,37,38). However, the quantitative influence of solvent-exposed amino acids or of specific chemical moieties on the hydration shell architecture remains largely unexplored.
Upon modifying solvent-exposed moieties of proteins by mutagenesis, the hydration shell may change via two distinct effects: first, due to altered residue-water interaction energies at fixed protein conformations (see previous paragraph). Second, mutations may shift the conformational ensemble of the protein, thereby replacing protein-water contacts with protein-protein contacts or vice versa. Such effects are especially pronounced in intrinsically disordered proteins (IDPs), as their shallow free energy landscapes are sensitive to mutations (39,40). Nevertheless, surface mutations may also alter the conformation of globular proteins, for example by triggering unfolding or a transition to a molten globule (41,42). By simulating proteins at preselected, fixed conformations, this study focuses on the first effect while noting that, under experimental conditions, both effects often play a role.
We recently validated the protein hydration shell from MD simulations by comparing results from explicit-solvent SAXS/SANS predictions (43,44,45) with consensus experimental data obtained from a worldwide community effort (20,46). SAXS and SANS data reflect the contrast of the, respectively, electron density or neutron scattering length density of the protein relative to bulk solvent, thereby including contributions of the hydration shell. We observed that many but not all combinations of protein force fields and water models accurately reproduce the hydration shell effect on the radius of gyration Rg. We furthermore found that the hydration shell effect on Rg depends on protein size, geometry, and surface composition, suggesting that the effect represents a protein-specific footprint of the hydration shell. In this study, we use MD simulations to quantify the influence of all proteinogenic amino acids on the hydration shell of a globular and an intrinsically disordered protein on two parameters that are encoded by SAXS curves, namely on the Rg and on the overall contrast between solute and solvent. We derive an amino acid-specific contrast score for solvent-exposed proteinogenic residues and show that the hydration shell structure and its effect on SAXS data depends not only on chemical composition but also on peptide conformation and water models. We rationalize the amino acid-specific effects on SAXS curves by analyzing three-dimensional solvent densities as well as radial distribution functions around solvent-exposed amino acid side chains.
Materials and methods
Simulation setup and parameters for the GB3 domain
The initial structure of the GB3 domain was taken from the protein data bank (PDB (47): 1IGD (48)). Ten amino acids on the surface of the GB3 domain were selected and mutated to one of the 21 proteinogenic amino acid, involving two protonation states of histidine, with the software Chimera (49), namely residues 15, 18, 20, 22, 24, 27, 33, 37, 47, and 51 (Fig. 1 A). Hydrogen atoms were added with pdb2gmx. MD simulations of GB3 were carried out with GROMACS, version 2020.3 (50). Interactions of the proteins were described with the following variants of the AMBER03 force field (51): ff03^∗^ (52), ff03w (53), and ff03ws (54). The starting structures were placed in a dodecahedral box, where the distance between the protein and the box edges was at least 2.0 nm, and solvated in TIP3P (55), TIP4P/2005 (56), or TIP4P/2005s (54) water. The simulation systems were neutralized by adding Na^+^ or Cl^−^ ions. After 400 steps of minimization with the steepest decent algorithm, the systems were equilibrated for 100 ps with harmonic position restraints applied to the heavy atoms of the proteins (force constant 1000 kJ mol^−1^ nm^−2^). Subsequently, production runs were started for 50 ns with harmonic position restraints applied to the backbone atoms of the proteins (force constant 2000 kJ mol^−1^ nm^−2^). The equations of motion were integrated using the leapfrog algorithm (57). The temperature was controlled at 298.15 K, using velocity rescaling (τ = 1 ps) (58). The pressure was controlled at 1 bar with the Berendsen barostat (τ = 1 ps) (59) and with the Parrinello-Rahman barostat (τ = 5 ps) (60) during equilibration and production simulations, respectively. The geometry of the water molecules was constrained with the SETTLE algorithm (61), and LINCS (62) was used to constrain all other bond length. A time-step of 2 fs was used. Dispersive interactions and short-range repulsion were described by a Lennard-Jones potential with a cutoff at 1 nm. The pressure and the energy were corrected for missing dispersion corrections beyond the cut-off. Neighbor lists were updated with the Verlet scheme. Coulomb interactions were computed with the smooth particle-mesh Ewald (PME) method (63,64). We used a Fourier spacing of approx. 0.12 nm, which was optimized by the GROMACS mdrun module at the beginning of each simulation.Figure 1. Structures and hydration shells of the GB3 domain and XAO peptide. (A) Cartoon representation of the GB3 domain. Ten surface amino acids shown in purple ball-and-stick representation were mutated into each of the 20 proteinogenic amino acids. (B) Three-dimensional density of the hydration shell around the wild-type GB3 domain (for colors, see colorbar). The first and second hydration layers appear as orange and blue densities, respectively. Red densities indicate a well-defined water position at the protein surface. (C) XAO peptide with four unnatural amino acids at the termini: two 2,4-diaminobutyric acid (Dab) and two ornithin (Orn) shown in pink and white, respectively. (D) Three-dimensional solvent density around the wild-type XAO peptide.
Simulation setup and parameters for the XAO peptide
To obtain an ensemble of XAO, we carried out maximum-entropy ensemble refinement (65) of XAO against experimental SAXS data. To this end, four parallel replicas of XAO simulations were coupled to SAXS data as described in the supporting material. From the ensemble, we obtained 20 frames that reasonably represent the confrontational space adopted by XAO, thus including compact and extended conformations. The simulation systems for the XAO conformations and mutants were set up as described above for the GB3 domain, except that the MD simulations were carried out with the GROMACS 2021.7 (50). Side chains of ornithin (Orn) and 2,4-diaminobutyric acid (Dab) were modeled based on the parameters for lysine by removing either one or two CH2 groups from the lysine topology, respectively. Four residues of the XAO, the two Orn and the two Dab residues, were mutated to one of the 21 proteinogenic amino acids with the software Chimera (49).
Explicit-solvent SAXS calculations
2251 simulation frames from the time interval between 5 and 50 ns from MD simulations of the GB3 domain and 2501 simulation frames from the time interval between 0 and 50 ns from simulations of the XAO peptide were used for SAXS calculations. The SAXS calculations were performed with GROMACS-SWAXS, an in-house modification of GROMACS 2021.7, as also implemented by the web server WAXSiS (43,44,66). The code and tutorials are available at https://cbjh.gitlab.io/gromacs-swaxs-docs/. For more background on explicit-solvent SAXS calculations, we refer to recent reviews (45,67). Explicit-solvent SAXS calculations have previously been successfully compared with experimental data from diverse solutes including proteins (43,68), IDPs (69), protein-detergent complexes (70), or detergent micelles (71,72), suggesting that the predictions are robust and accurate. A spatial envelope was built around all solute frames from the proteins. Solvent atoms inside the envelope contributed to the calculated SAXS curves. The distance between the protein and the envelope surface was at least 9 Å, such that all water atoms of the hydration shell were included. The buffer subtraction was carried out using 2251 simulations frames of a pure-water simulation box, which was simulated for 50 ns and large enough to enclose the envelope. The orientational average was carried out using 200 q-vectors for the GB3 domain and 50 q-vectors for the XAO peptide for each absolute value of q, and the solvent electron density was corrected to the experimental value of 334 e/nm^3^ as described previously (43).
Statistical errors were computed for simulations with the GB3 domain by binning the trajectory into 10 time blocks of 4.5 ns and computing the standard error. In the case of the XAO peptide, statistical errors were calculated from simulations of 20 independent conformations.
Calculation of the hydration shell contrast
The forward scattering intensity of a SAXS curve is given by the square of the contrast between solute (including the hydration shell) and the solvent in number of electrons. Thus, I0 follows by
where is the number of electrons of the protein, V^prot^ the protein volume, the contrast imposed by the hydration shell, and ρsolv the solvent electron density taken as 334 e/nm^3^. Thus, we have
Here, the plus and minus signs correspond to the cases where is positive or negative, respectively. In our implementation, is taken from atomic form factor at zero scattering angle as defined via the Cromer-Mann parameters of the atoms (73). The volume of the solute V^prot^ was defined as the cavity volume calculated with the 3V volume calculator (74) with a grid spacing of 0.16 Å and a probe radius of 1.4 Å corresponding to the van der Waals radius of a water molecule. Volumes were computed as an average over 20 simulations frames, which were randomly rotated before running the 3V software. Statistical uncertainties of volume calculations correspond to 1 SE obtained from the 20 frames. Volumes of GB3 and XAO variants are shown in Fig. S13. The influence of the probe radius and the grid spacing on the volume calculation is analyzed in Fig. S14.
Results
To quantify the effects of amino acid composition on the hydration shell of proteins, we simulated the GB3 domain as a representative for globular proteins (Fig. 1 A and B) and the XAO peptide (75,76) as representative for IDPs (Fig. 1 C and D). The three-dimensional electron densities of solvent around the GB3 domain or around XAO are shown in Fig. 1 B and D, here computed from simulations with position restraints on all heavy atoms leading to spatially well-defined densities from surface-bound water molecules. The densities reveal highly localized water molecules (red densities), the first hydration shell (orange/red densities), as well as the second hydration shell (dark blue densities). A highly shallow third shell is hardly visible in the three-dimensional density representation (Fig. 1 D, cyan density layer). The structure of the hydration shell involving a pronounced first shell, a shallow second, and a highly shallow third shell agrees with many previous MD studies (Ref. (77) and references therein).
The hydration shell of the globular GB3 domain strongly depends on the surface amino acid composition
SAXS experiments of proteins probe the electron density contrast between the protein and the bulk solvent, including the density contrast contributed by the hydration shell. To quantify the effects of different amino acids to the hydration shell, and to relate variations among different amino acids to putative solution scattering experiments, we computed SAXS curves of the wild-type and of 21 mutants of the globular GB3 domain. We used explicit-solvent SAXS calculations (43), as also implemented by the WAXSiS web server (66), thereby accounting for all non-bulk-like solvent molecules of the hydration shell and using an explicit representation of the excluded solvent. In contrast to implicit-solvent SAXS predictions, the method does not require any solvent-related fitting parameters or a choice for a thickness for the hydration shell (45). We selected 10 solvent-exposed residues (Fig. 1 A, pink ball-and-stick representation) and mutated these residues to each of the 20 proteinogenic amino acids while including histidine in the neutral form (δ-nitrogen protonated, His^0^) and in the cationic form (δ- and ϵ-nitrogen protonated, His^+^), resulting in 22 GB3 variants (wild-type and 21 mutants). We performed explicit-solvent MD simulations with restraints on the backbone atoms to maintain all GB3 variants in identical backbone conformation and to prevent unfolding of putatively unstable GB3 mutants such as mutants with many hydrophobic solvent-exposed residues. SAXS curves I(q) computed for the 22 GB3 variants differ (Fig. 2). Since we computed the SAXS curves taking the solvent explicitly into account (43,45,66), the variations among the SAXS curves include effects owing to variations of the hydration shell contrast.Figure 2SAXS curves of the GB3 domain from explicit-solvent SAXS calculations with the TIP4P/2005 water model in combination with the ff03w protein force field (A) for GB3 wild-type or (B–F) for 21 GB3 variants with 10 mutated solvent-exposed amino acids each (for color code and line style, see legends). For clarity, SAXS curves are grouped by the amino acid property (glycine, polar, apolar, cationic, anionic) in (B)–(F). Guinier plots of these curves are shown in Fig. S2.
In this study, we used two quantities that provide a footprint for the hydration shell while being encoded by the SAXS curves: 1) the forward scattering intensity I0 = I(q = 0), which is related to the contrast between protein and solvent; and 2) the radius of gyration Rg, which quantifies the spatial extent of the protein. Focusing first on the former quantity, the forward scattering is given by the square of the total contrast ΔNe in number of electrons between protein and solvent, i.e., . We decomposed the total contrast into contributions from the contrast of the bare protein and the contrast of the hydration shell , via (see Materials and Methods). Fig. 3 A (yellow bars) presents the contrast of the hydration shell for 22 GB3 variants, here computed with the AMBER force field ff03w in conjunction with the TIP4P/2005 water model (53,56), which revealed excellent agreement with experimental SAXS/SANS data in our previous study and may, therefore, be taken as reference force field (20). The contrast is plotted in number of water molecules as given that each water molecule contains 10 electrons. Evidently, the contrast of the hydration shell differs greatly among GB3 variants with different solvent-exposed amino acid types. The hydration shell of wild-type GB3 exhibits a positive contrast of 4.1 water molecules implying the presence of additional 4.1 water molecules in the hydration shell relative to an equivalent volume of bulk water, in line with the well-known densely packed hydration shell documented by SAXS experiments of several proteins (17,18,20). Among the mutated GB3 variants, the variants with additional anionic residues (Asp/Glu) reveal the largest contrast, followed in decreasing order by GB3 variants with cationic (Lys/Arg/His^+^) and polar charge-neutral amino acids (Fig. 3, pink labels at the abscissa). The marked hydration shell imposed by the anionic residues Glu/Asp aligns with previous SAXS/SANS experiments of super-charged variants of green fluorescent protein (78) and of the highly anionic glucose isomerase (20,46). In contrast, GB3 variants with many apolar solvent-exposed residues reveal a negative hydration shell contrast, indicating a water depletion layer in the vicinity of hydrophobic amino acids, in line with reports for other types of hydrophobic surfaces (36,37,38). An exception to the order anionic–cationic–polar–apolar is given by tyrosine, which displays a more negative contrast compared with all other polar amino acids, rationalized by the presence of the apolar six-membered aromatic ring.Figure 3. Analysis of hydration shell contrasts. (A) Contrast of the hydration shell in number of water molecules of GB3 wild-type and 21 GB3 mutants; see labels at abscissa colored by the property of the amino acid: Gly (gray), polar (pink), apolar (green), cationic (orange), and anionic (blue) residues. Contrast values are shown for three combinations of protein force field and water model: ff03w–TIP4P/2005 (yellow), ff03ws–TIP4P/2005s (blue), ff03^∗^–TIP3P (red). (B) Contrast per amino acid for GB3 domain relative to alanine. (C and D) Same analysis as in (A) and (B) for the XAO peptide. Error bars denote 1 SE.
To quantify the hydration shell contrast imposed by individual amino acids, Fig. 3 B presents the contrast per amino acid and relative to alanine as reference. Accordingly, the hydration shells of anionic residues exhibit approximately 1.5 additional water molecules relative to alanine, with glutamate standing out as the amino acid whose hydration shell imposes the largest contrast, indicative for a particularly densely packed hydration shell. The hydration shells of cationic residues contain roughly one additional water molecule relative to alanine, whereas the hydration shells of polar residues contain approximately 0.2–0.5 additional water molecules relative to alanine. Bulky apolar amino acids such as leucine or phenylalanine may contain up to 0.3 fewer water molecules relative to alanine. Tyrosine with its polar hydroxyl group and apolar aromatic ring represents an intermediate case between polar and apolar residues. These values provide an amino acid-resolved hydration layer contrast score for a common globular protein such as GB3, thus quantifying how chemical specificities of proteinogenic amino acids control the density of the protein hydration shell. Below, we further analyze these data to dissect how individual chemical moieties control the hydration shell.
Critically, the overall contrast ΔNe for a given MD simulation is determined unambiguously from the forward scattering calculated using explicit-solvent SAXS calculations. In contrast, the decomposition of ΔNe into contributions from the bare protein and the hydration shell depends on how the protein volume is defined. Assigning a larger volume to the protein leads to a decrease in and a corresponding increase in . This effect is illustrated in Fig. S4 A and C, where protein volumes were computed using three different probe radii: 1.3 Å, 1.4 Å, or 1.5 Å. However, the contrast per amino acid relative to alanine depends only marginally on the volume definition (Fig. S4 B and D), suggesting that ambiguities in our volume calculations cancel out upon comparing different amino acids. Consequently, although the choice of volume definition (see Materials and Methods) affects the absolute contrasts shown in Fig. 3 A and C, it has only a marginal impact on the relative contrasts in Fig. 3 B and D.
Among different water models, effects of amino acid properties on the hydration layer contrast agree qualitatively but differ quantitatively
To test the influence of water models, and to exclude that our key findings are not biased by the choice of the water model, we calculated the contrasts for three different water models, namely TIP3P (55), TIP4P/2005 (56), and TIP4P/2005s (54), in combination with their corresponding AMBER03 force field (51) variant. Among these water models, TIP3P is the most widely used model since the popular CHARMM and AMBER protein force fields have originally been parameterized in conjunction with TIP3P. However, TIP3P shows poor agreement with experimental data as it yields a too low density, a too high diffusion coefficient, and a too high isothermal compressibility (79). TIP4P/2005 reproduces water properties more accurately compared with TIP3P, and it furthermore captures the water density over a wide temperature range (56). TIP4P/2005s takes water-water interactions from TIP4P/2005; however, it implements increased water-protein dispersion interactions with the aim to balance water-water against water-protein interactions in protein simulations with AMBER force fields (54).
Fig. 3 A compares the total hydration shell contrast of 22 GB3 variants for simulations with these three water models (yellow, blue, red bars, see legend). Irrespective of the water model, the contrasts follow the order anionic–cationic–polar–apolar, suggesting that the effects of amino acid classes on the hydration shell contrast agree qualitatively among the water models. This finding is confirmed by comparing the contrast per amino acid relative to alanine (Fig. 3 B). However, the total contrasts shown in Fig. 3 A furthermore reveal considerable quantitative differences among water models. TIP3P yields the lowest contrast for all GB3 variants with approximately four water molecules fewer within the overall GB3 hydration shell compared with TIP4P/2005 (Fig. 3 A, red vs. yellow bars). In contrast, TIP4P/2005s yields an increased contrast for most GB3 variants with typically zero to three additional water molecules in the hydration shell compared with TIP4P/2005 (blue vs. yellow bars), rationalized by the increased protein-water dispersion interactions implemented by TIP4P/2005s (54). More specifically, TIP4P/2005s yields a similar contrast compared with TIP4P/2005 for the GB3 wild-type and for the Ser, His^0^, His^+^, Asp, Glu and an increased contrast for all the other variants. Thereby, the hydration shell contrast by TIP4P/2005s largely exceeds the contrast by TIP3P, in particular for the apolar variants, with up to 7.5 additional water molecules in the hydration shell for the Trp variant.
Hydration shell contrasts controlled by amino acid type and force field impose experimentally accessible variations of the radius of gyration
Analysis of the hydration shell based on I0 and hydration shell contrasts involve two caveats. First, experimental I0 values—or, equivalently, total electron density contrasts—are subject to relatively high uncertainty because obtaining I0 from an experimental SAXS curve would require precise knowledge of the solute concentration. Because the solute concentration is typically only approximately known, quantitative comparisons of I0 between MD simulations and experiments is difficult. In contrast, the radius of gyration Rg obtained by SAXS coupled to size exclusion chromatography (SEC-SAXS) enables Rg measurements with sub-Ångström accuracy (46), thereby enabling quantitative validation of MD simulations against experiments (20). Second, whereas the overall solute contrast is unambiguously defined in explicit-solvent SAXS calculations via , its decomposition into contrast contributions from the protein and hydration shell depends on the definition of the protein volume or, equivalently, on the definition of the dividing surface at the protein-water interface (77). In contrast, computing the hydration shell effect on Rg does not require a definition of the dividing surface.
Thus, as a second indicator for the hydration shell, we analyzed hydration shell effects on the Rg values of 22 GB3 variants. We computed the change of Rg owing to the hydration shell, as given by , where denotes the Rg value obtained by Guinier analysis of the SAXS curve, thereby taking the hydration shell into account, and denotes the Rg value of the bare proteins computed from the atomic coordinates of protein atoms. Fig. 4 A presents ΔRg values for the 22 GB3 variants, whereas Fig. 4 B shows the ΔRg values relative to alanine per mutated amino acid. In line with the contrasts discussed above, the presence of anionic residues (Asp/Glu) imposes by far the largest increase of ΔRg values by ∼1.5 Å and an increase per amino acid relative to alanine by ∼0.1 Å. These results confirm that anionic residues impose a tightly packed hydration shell (20,24,78). In contrast, bulky hydrophobic residues such as Val, Ile, Leu, Phe, or Trp lead to small ΔRg values and a decrease per amino acid relative to alanine by up to ∼0.05 Å, in line with a water depletion layer at the hydrophobic surface. Cationic and many polar residues lead to intermediate and similar ΔRg values, which may be surprising considering that hydration shell contrasts imposed by cationic residues clearly exceed the contrasts imposed by polar residues (see Fig. 3 A and B). The findings might reflect that ΔRg is sensitive to the spatial distributions of contrasts, which may lead to different amino acid-specific effects on ΔRg compared with effects on the total contrast derived from I0.Figure 4. Effects of hydration shell on radii of gyration. (A) Hydration-induced shift in the radius of gyration ΔRg from explicit-solvent SAXS calculation of 22 GB3 variants. ΔRg values are shown for GB3 wild-type (WT) and 21 mutants; see labels at the abscissa colored by the property of the amino acid: Gly (gray), polar (pink), apolar (green), cationic (orange), and anionic (blue) residues. Results are shown for three combinations of protein force field and water model: ff03w–TIP4P/2005 (yellow), ff03ws–TIP4P/2005s (blue), ff03^∗^–TIP3P (red). (B) ΔRg values relative to the alanine mutant and per amino acid. (C and D) Same analysis as in (A) and (B) for the XAO peptide. Error bars denote 1 SE.
Comparing the ΔRg values between simulations with TIP3, TIP4P/2005, and TIP4P/2005s reveals specific properties of water models, although the relative differences among ΔRg values are overall smaller compared with the differences among the contrasts (compare Fig. 4 A and B with Fig. 3 A and B). Whereas the ΔRg values of GB3 wild-type are in excellent agreement among the three water models (Fig. 4 A, left column), TIP3 yields lower ΔRg for many GB3 mutants such as mutants with anionic residues (Asp/Glu), several apolar residues, as well as for several apolar residues such as Thr or Tyr. The largest variations of ΔRg values among water models is found for the bulky apolar residues, where in particular TIP4P/2005s, but also TIP4P/2005, yields by far larger ΔRg values compared with TIP3P. Thus, the increased protein-water dispersion interactions implemented by TIP4P/2005s lead to a partial loss of the water depletion layer at hydrophobic surfaces, with a footprint on the radii of gyration.
Hydration shell of the intrinsically disordered protein XAO exhibits a negative amino acid-dependent contrast
Compared with globular proteins, IDPs exhibit a larger surface/volume ratio, suggesting that—at a given protein density—IDPs exhibit more protein-water contacts and perturb a larger volume of water, rationalizing the tight coupling between IDP and water dynamics (80). Many IDPs carry out their function by partial folding on the surface of other proteins, thereby involving large rearrangements of protein-water and water-water interaction networks. Nevertheless, the hydration shell of IDPs has attracted less attention compared with the hydration shell of globular proteins (81,82). A previous MD study suggested that accurate representation of the hydration shell density by explicit-solvent models is critical for predicting SAXS curves of IDPs since even small variations of the hydration shell density may strongly influence predicted SAXS curves (19,65). MD simulations using TIP4P/2005 or TIP4P/2005s revealed that the water structure in the hydration shell of an IDP is perturbed relative to the bulk as indicated by a loss of tetrahedrality; however, this perturbation has been weaker as compared with the hydration shell of a folded protein (83). How the amino acid composition of an IDP controls the density of its hydration shell has not been systematically addressed.
As a model IDP, we here consider the XAO peptide with the sequence Ace-(diaminobutyric acid)2-(Ala)7-(ornithine)2-NH_2_, whose conformational ensemble has been studied by SAXS as well as by NMR and circular dichroism spectroscopy (75,76). We used maximum-entropy ensemble refinement against SAXS data taken from Ref. (75) to obtain the heterogeneous ensemble of XAO. We randomly selected 20 frames from the ensemble, thereby representing the heterogeneous ensemble, involving compact and expanded XAO conformations (see Materials and Methods and Fig. S1). In follow-up simulations of XAO and its mutants, these conformations were maintained using backbone restraints.
To reveal how the amino acid composition influences the hydration shell of an IDP, we mutated the four terminal XAO residues to 20 proteinogenic amino acids, again including histidine in the neutral and cationic form (21 mutants). For each XAO variant, we carried out 20 MD simulations with backbone restraints to the 20 conformations taken from the XAO ensemble (see above) and computed the SAXS curve using explicit-solvent SAXS calculations (Figs. S3 and S5). Following the analysis described above for the GB3 domain, we obtained the contrast of the hydration shell of the XAO wild-type and 21 mutants (22 variants), using the three water models TIP4P/2005, TIP4P/2005s, and TIP3P. We computed the overall contrast of the hydration shell relative to an equivalent volume of bulk water (Fig. 3 C) as well as the contrast per amino acid relative to alanine (Fig. 3 D).
At variance with the analysis for GB3, we found a negative hydration shell contrast for all 22 XAO variants, implying that the hydration shell of XAO contains fewer water molecules compared with bulk water. Whereas XAO wild-type and anionic mutants reveal a contrast of up to approximately −2 water molecules, apolar variants may reveal contrast of up to −7 water molecules or even fewer (Fig. 3 C). We rationalize the negative contrasts with the high flexibility of XAO, leading to more loosely packed conformations compared with the structure of the globular GB3. Thereby, small voids between the XAO backbone and side chains may exclude water molecules or enable only incomplete packing of water around XAO moieties, leading to a lower water density around XAO compared with GB3.
The effects of amino acid classes on the hydration shells are consistent with the findings for GB3; namely the contrasts follow roughly the order anionic–cationic–polar–apolar (Fig. 3 D). However, contrasts per amino acid differ quantitatively for XAO compared with our results for GB3. For instance, arginine residues imposed smaller contrasts in XAO compared with arginine in GB3 (compare Fig. 3 B with Fig. 3 D), possibly because the hydrophobic C_β, Cγ, and Cδ_ atoms of arginine near the XAO termini are more solvent-exposed compared with arginines at the GB3 surface.
The hydration shell of an IDP may reduce the radius of gyration detected by SAXS
In sharp contrast to the findings for GB3 variants, for which ΔRg values were mostly positive in the range of approximately 0–1.5 Å, ΔRg values for XAO variants with different force fields are mostly negative and may take large absolute values (Fig. 4 C). The radius of gyration detected by SAXS is given by
where Δρ(r) denotes the electron density contrast, and r is the distance from the (contrast-weighted) center of mass. The large negative ΔRg values up to −4 Å may be rationalized by 1) the overall small contrast of XAO (small value in brackets in Eq. (4)), leading to a large impact of the hydration shell contrast on Rg, and 2) hydration shell contributions to the contrast close to the center of mass (at small r), as occurring for extended conformations for which moieties near the center of mass are solvent-exposed. Positive ΔRg values are found only for the anionic mutants for which Asp/Glu residues may impose large contrast near the endpoints of XAO, but also for the XAO wild-type and few polar variants, mostly with the TIP4P/2005 water model (Fig. 4 C, yellow bars). Thus, whereas the hydration shell of globular proteins often leads to an increased Rg detected by SAXS (17,23,43), the hydration shell of IDPs may also lower the Rg value.
Three-dimensional densities around XAO reveal amino acid- and force field-specific hydration shell structures
To rationalize the variations of contrast and Rg values in structural terms, we computed the three-dimensional solvent densities with the TIP4P/2005 water model around extended conformations of four XAO variants, in which the four terminal residues were mutated to serine, leucine, lysine, or glutamate, providing one representative each for a polar, apolar, cationic, or anionic variant (Fig. 5 E–H). In line with the findings from the XAO ensemble (Figs. 3 C and D and 4 C and D), these mutations lead to large variations of the forward scattering I0 (Fig. 5 A), contrast of the hydration shell between approximately −1 and −6 water molecules (Fig. 5 B), Rg values obtained from SAXS between ∼7.5 Å and ∼11.5 Å (Fig. 5 C, blue squares), and ΔRg values between −3 Å and +1 Å (Fig. 5 D). These values align qualitatively with the variations of solvent densities near the endpoints of XAO. For instance, the solvent structure around the anionic glutamate residues reveals several tightly bound water molecules (Fig. 5 H, red spots) and a pronounced second hydration shell. In contrast, the hydration shell around the apolar leucine lacks any structured water molecules, whereas the second hydration shell is blurred out (Fig. 5 F). The serine and lysine variants yield intermediate cases (Fig. 5 E and G). Thus, the variations of contrast and Rg values quantified above are footprints of amino acid-specific hydration shell structures revealed by three-dimensional solvent densities.Figure 5. Comparison of hydration shells of XAO mutants with different types of mutated residues: Ser (polar), Leu (apolar), Lys (cationic), Glu (anionic). (A) Forward scattering I0, (B) hydration shell contrast in number of water molecules, (C) radius of gyration from the bare peptide (black dots) and from Guinier analysis (blue squares), and (D) . Values were calculated from simulations with the TIP4P/2005 water model in combination with the ff03w protein force field. (E–H) Shaded colors show the three-dimensional solvent density maps from 50 ns simulations around the mutants analyzed in (A)–(D) with color code taken from Fig. 1B and D. MD simulations used to compute density maps were carried out with restraints on all heavy atoms, thereby yielding spatially well-defined hydration shells. The density is overlayed with one MD frame, showing the XAO in ball-and-stick representation and water molecules within the envelope as red/white lines. Color code according to Fig. 1B and D. Example PyMOL scripts for visualizing the densities are provided in the supporting material.
The varying solvent structures around different amino acid types are also evident from radial distribution functions (RDFs) of water oxygen atoms with respect to heavy atoms of the side chains of the mutated amino acids. The RDFs for anionic residues reveal a pronounced peak at 2.65 Å, which is not present for any other residue, illustrating their exceptionally tightly bound hydration shell (Figs. 6, blue; S6 D; S7 D). The first peak in the RDFs of cationic residues is much smaller than that of anionic residues and is located at a larger distance (Figs. 6, orange; S6 D; S7 D), rationalizing the weaker hydration shell contrast (Fig. 3). RDFs of polar residues depend strongly on the amino acid species, reflecting the relative abundances of polar and hydrophobic moieties; for instance, water oxygen atoms around serine, and to a lesser degree around threonine, glutamine, or asparagine, show considerable density at 2.8 Å, whereas water is on average located at larger distances from cysteine or tyrosine (Figs. 6, black; S6 A/B; S7 A/B). The water depletion layer at apolar residues is illustrated by RDFs that are nearly zero below 3 Å (Figs. 6, green curve; S6 C; S7 C). Moreover, the RDFs show that the hydration shell depends on the protonation state, as evident from more tightly bound water at doubly protonated histidine compared with singly protonated histidine (Fig. S8). Together, the RDFs demonstrate the highly distinct hydration shell structures around different amino acid types, and they align with the varying contrasts discussed above.Figure 6. Radial distributions functions (RDFs) of water oxygens with respect to side chain heavy atoms of serine, leucine, lysine, or glutamate (see legend). RDFs were computed from simulations of XAO with TIP4P/2005, averaged over 20 XAO conformation. RDFs for other amino acids or computed from simulations of GB3 are shown in Figs. S6 and S7.
The solvent densities around the extended XAO peptide depend not only on amino acid composition but furthermore on the water model, as evident, for instance, from more pronounced solvent structures modeled by TIP4P/2005s compared with TIP3P (Fig. S9). Such force-field-dependent solvent densities rationalize variations of hydration shell contrasts by the ensembles of 22 XAO variants presented in Fig. 3 C and D, which qualitatively align with the findings for GB3 (Fig. 3 A and B). Specifically, TIP3P yields smaller (more negative) hydration shell contrasts relative to TIP4P/2005, whereas the increased protein-water dispersion interactions implemented by TIP4P/2005s yield larger (less negative) contrasts (Fig. 3 C and D, yellow, blue, or red bars). Notably, owing to the small contrast ∫Δρ(r) dr of the overall XAO, ΔRg values are highly sensitive with respect to water model-imposed variations of the hydration shell, leading to large ΔRg variations by up to 2 Å and even more (Fig. 4 C and D, yellow, blue, and red bars; cf. Eq. (4)).
Hydration shell effect on Rg strongly depends on IDP conformation
To test how the peptide conformation influences the hydration shell of XAO, we computed hydration shell contrasts and ΔRg values for 20 representative conformations of the aspartate mutant of XAO (Fig. S10) and analyzed two example conformations—one compact and one extended conformation—in detail in Fig. 7. We found that peptide conformations have only a small effect on the contrast (Fig. 7 A and B) but may impose large variations of ΔRg by up to ∼0.8 Å (Figs. 7 C and D and S10). Notably, ΔRg values neither correlate significantly with Rg values nor with I0, suggesting that subtle details of the peptide and hydration shell geometries determine the ΔRg value.Figure 7. Effects of peptide conformation on the hydration shell. (A) Forward scattering I0, (B) hydration shell contrast, (C) Rg, and (D) ΔRg values of one extended and one compact conformation of the aspartate mutant of XAO obtained with TIP4P/2005 and ff03w. (E and F) Shaded colors show the three-dimensional solvent density maps from 50 ns MD simulations around the XAO mutant analyzed in (A)–(D) with color code taken from Fig. 1B and D. The density is overlayed with one MD frame, showing the XAO as balls/sticks and water molecules within the envelope as red/white lines. MD simulations used to compute density maps were carried out with restraints on all heavy atoms, thereby yielding spatially well-defined hydration shells. (G) Extended (blue/white) and compact conformation (pink/white) of XAO with four terminal residues mutated to aspartate (white).
How solvent-exposed chemical moieties determine the hydration shell contrast
The hydration shell contrasts computed for all proteinogenic amino acids (Fig. 3) enabled us to quantify how chemical modifications of solvent-exposed moieties alter the hydration shell contrast. Table 1 lists 12 chemical modifications, together with the accompanying changes of contrasts in number of electrons, as taken from the GB3 or XAO simulations with TIP4P/2005. Since one water molecule contains 10 electrons, the values may be translated to number of water molecules by dividing by 10. The values are qualitatively consistent among the analysis from GB3 and XAO. However, values differ quantitatively, likely reflecting that the degree to which these moieties are solvent-exposed differs between GB3 and XAO.Table 1. Change of Hydration Shell Contrast in Number of Electrons upon Various Chemical Modifications of Solvent-Exposed Amino AcidsChemical ModificationGB3XAOFromTo 11.8 ± 0.52.2 ± 0.32−1.3 ± 0.4−0.8 ± 0.233.7 ± 0.33.6 ± 0.24−2.2 ± 0.4−2.9 ± 0.254.8 ± 0.42.8 ± 0.260.8 ± 0.4−0.1 ± 0.2710.2 ± 0.45.3 ± 0.2814.2 ± 0.48.1 ± 0.292.3 ± 0.42.2 ± 0.2103.3 ± 0.42.3 ± 0.2112.6 ± 0.4−0.1 ± 0.212−2.2 ± 0.4−2.1 ± 0.1Errors denote 1 SE.
The analysis enables the following conclusions: 1) replacing a hydrogen (H) with a methyl group (CH_3_) decreases the contrast by about one electron (Table 1, row 2). 2) Replacing a methyl (CH_3_) with a hydroxyl group (OH) increases the contrast by about three electrons (rows 1, 2, 10). 3) Replacing carbamoyl group (CONH_2_) as present in Asn or Gln with a carboxyl group (COO ^−^) as present in Asp or Glu leads a marked increase of the contrast between 5 and 14 electrons (rows 7, 8). 4) Extending a side chain by the addition of a CH_2_ group increases the contrast for a polar side chain (e.g., Asp → Glu, row 5) and decreases the contrast for apolar side chains (row 4), rationalized by the fact that longer side chains are more solvent-exposed. In other words, chemical modifications at the tip of longer side chains have a larger effect on the hydration shell contrast compared with modifications at shorter side chains. Together, these values quantify how solvent-exposed chemical moieties modulate the electron density of the hydration shell of globular proteins or IDPs.
Discussion
We quantified the effect of the 20 proteinogenic amino acids on the protein hydration shell density of the GB3 domain and the XAO peptide as representatives for the classes of globular or intrinsically disorderd proteins with focus on two quantities encoded by SAXS curves: 1) the forward scattering intensity I0, which reports on the overall contrast between the protein—including its hydration shell—relative to the buffer; and 2) the radius of gyration Rg, which is modified by the hydration shell relative to the Rg of the bare protein. Across both proteins and three different water models, we observed a consistent trend in hydration shell density at solvent-exposed amino acids: anionic > cationic > polar > apolar residues. Substituting alanine with an anionic residue at the GB3 surface, the hydration shell density increased considerably and contained approximately 1–1.5 additional water molecules relative to the bulk density. Given that 1) water is nearly incompressible and 2) amino acids are in contact with only a few water molecules, these values demonstrate a highly condensed packing of water at anionic amino acids and rationalize the presence of a marked hydration shell around anionic proteins (20,78). In contrast, replacing alanine with bulkier hydrophobic residues led to a decreased hydration shell density, which supports the presence of a water depletion layer previously reported for hydrophobic surfaces (36,37,38).
Although the amino acid-specific effects on the hydration shell are qualitatively consistent between the GB3 domain and the XAO peptide, we also observed distinct differences in their hydration shells and their effects on SAXS curves. The hydration shell contrast was positive for the GB3 wild type and for most polar or charged GB3 mutants (Fig. 3 A), indicating tightly packed water on the protein surface. In contrast, the hydration shell of most XAO variants revealed a negative contrast, indicating the presence of water depletion layers at the XAO surface (Fig. 3 C). Thus, hydration shells of IDPs may differ substantially from hydration shells of globular proteins.
Furthermore, our calculations revealed that the effect of the hydration shell on Rg strongly depends on the conformation of XAO. This finding can be rationalized by the fact that the Rg of XAO is sensitive to the spatial distribution of hydration shell contrast: contrasts located farther from the XAO center of mass have greater effect on Rg compared with contrasts closer the center of mass. Thus, the structure of the hydration shell and its effects on SAXS curves is controlled by an interplay between amino acid composition and peptide conformation.
Computing the hydration shell contrast from I0 values requires a definition of the protein volume or, equivalently, of the protein-water dividing surface. Indeed, different conventions for the dividing surface, for instance based on different Voronoi tessellation schemes, have led to slightly different protein volumes and different estimates for the hydration shell density (77,84,85,86,87). Accordingly, we observed that different choices for computing the dividing surface influence the absolute hydration shell contrasts; however, the choices have only a marginal effect on the contrasts relative to alanine found in this study (Figs. S4 and S14). We speculate that different volume definitions might explain why a previous study found similar water volumes at anionic, cationic, and polar moieties (77), whereas our analysis suggested by far larger contrasts imposed by the anionic Asp/Glu residues. Critically, the pronounced hydration shells by Asp/Glu are confirmed by our ΔRg calculation, which do not require assumptions on the dividing surface. In addition, they align with previous experimental SAXS/SANS studies of super-charged GFP variants and with large ΔRg values found for the highly anionic glucose isomerase (46,68).
We previously found that MD simulations with certain but not all force fields accurately reproduce the difference in Rg between SAXS relative to SANS in D_2_O, suggesting that they capture the hydration shell contrast of several globular proteins (20). To experimentally test our amino acid-specific contrast values reported here, future SAXS/SANS studies of proteins and their mutants would be desirable. Designed ultrastable proteins may be particularly suitable for such studies, as their conformations are relatively insensitive to surface substitutions (88). For IDPs, whose ensembles are strongly amino acid dependent, careful SAXS/SANS measurements in both H_2_O and D_2_O could provide a means to isolate amino acid-specific effects on the hydration shell.
We anticipate that the residue-resolved hydration shell contrast scores derived for all 20 proteinogenic amino acids will be useful for several future developments. Contrast scores may be used to parameterize computationally efficient SAXS curve predictions that account for residue-specific hydration while avoiding the need for explicit-solvent MD simulations for each protein conformation. Thereby, our calculations may bridge the gap between accurate yet computationally expensive explicit-solvent SAXS calculations (43,66,71,89,90,91,92) and simplified implicit-solvent methods that require fitting of the hydration shell against experimental data (93,94). In addition, quantifying residue-specific hydration will be key to understanding how targeted modulation of the water structure by protein-water interactions promotes biomolecular function, for instance in contexts of antifreeze proteins, molecular recognition, or biomolecular phase separation (95,96,97).
Data and code availability
A modified GROMACS version that implements the explicit-solvent SAXS calculations is available at https://gitlab.com/cbjh/gromacs-swaxs.
Acknowledgments
We thank Jan Lipfert for stimulating discussions and for sharing SAXS data of XAO. This study was supported by the 10.13039/501100001659Deutsche Forschungsgemeinschaft (10.13039/501100001659DFG, 10.13039/501100001659German Research Foundation) via grants HU 1971/3-1 and INST 256/539-1.
Author contributions
J.-B.L. and T.M.F. performed research and analyzed data. J.S.H. designed research and contributed software. J.-B.L. and J.S.H. wrote the paper.
Declaration of interests
The authors declare no competing interests.
Declaration of generative AI and AI-assisted technologies in the writing process
J.B.L. and J.S.H. acknowledge the assistance of ChatGPT-4 for providing proofreading support during the writing process.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Bellissent-Funel M.-C.Hassanali A.Garcia A.E.Water Determines the Structure and Dynamics of Proteins Chem. Rev.1162016767376972718699210.1021/acs.chemrev.5b 00664 PMC 7116073 · doi ↗ · pubmed ↗
- 2Levy Y.Onuchic J.N.Water mediation in protein folding and molecular recognition Annu. Rev. Biophys. Biomol. Struct.3520063894151668964210.1146/annurev.biophys.35.040405.102134 · doi ↗ · pubmed ↗
- 3Ball P.Water as an Active Constituent in Cell Biology Chem. Rev.1082008741081809571510.1021/cr 068037 a · doi ↗ · pubmed ↗
- 4Daidone I.Ulmschneider M.B.Smith J.C.Dehydration-driven solvent exposure of hydrophobic surfaces as a driving force in peptide folding Proc. Natl. Acad. Sci. USA 104200715230152351788158510.1073/pnas.0701401104 PMC 2000556 · doi ↗ · pubmed ↗
- 5Fogarty A.C.Laage D.Water Dynamics in Protein Hydration Shells: The Molecular Origins of the Dynamical Perturbation J. Phys. Chem. B 1182014771577292447958510.1021/jp 409805 p PMC 4103960 · doi ↗ · pubmed ↗
- 6Mondal S.Mukherjee S.Bagchi B.Protein Hydration Dynamics: Much Ado about Nothing?J. Phys. Chem. Lett.82017487848822897820110.1021/acs.jpclett.7b 02324 · doi ↗ · pubmed ↗
- 7Fogarty A.C.Duboué-Dijon E.Laage D.Biomolecular hydration dynamics: a jump model perspective Chem. Soc. Rev.422013567256832361268510.1039/c 3cs 60091 b · doi ↗ · pubmed ↗
- 8Bagchi B.Water Dynamics in the Hydration Layer around Proteins and Micelles Chem. Rev.1052005319732191615915010.1021/cr 020661+ · doi ↗ · pubmed ↗
