Quantitative Integration of FRET and Molecular Dynamics for Modeling Flexible Peptides
Danilo Roccatano

TL;DR
This paper combines FRET experiments and molecular dynamics simulations to study the behavior of flexible peptides and validate their structural and kinetic properties.
Contribution
The study introduces an improved force field parameterization for Dbo-labeled aspartic residues, enabling accurate modeling of fluorophore-labeled peptides.
Findings
Simulations reproduce FRET-derived end-to-end distances with less than 5% deviation for (GS)n peptides.
Looping kinetics agree with experimental contact formation rates after viscosity correction.
The work establishes a quantitative link between structural ensembles and FRET experiments.
Abstract
Flexible protein regions, often enriched in glycine- and serine-rich segments, play a central role in biomolecular dynamics and function. The combination of time-resolved fluorescence resonance energy transfer (FRET) spectroscopy and molecular dynamics simulations provides a powerful framework to characterize these motions at atomic resolution. In this work, we investigate the conformational and kinetic properties of Trp-(GS) n -Dbo and Trp-(PP) n -Dbo peptides (n = 0, 1, 2, 3) in aqueous solution using microsecond-scale MD simulations, informed by an improved description of the Dbo-labeled aspartic residue compatible with the G54A7 force field. The simulations quantitatively reproduce experimental end-to-end distances derived from FRET measurements, with deviations below 5% for all (GS) n peptides, and correctly capture the systematic relationship between chain flexibility and…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4
5
6| name | sequence | water molecules |
|---|---|---|
| TrpDbo | Trp-Dbo-NH2 | 2248 |
| (GS) | Trp-(GlySer)-Dbo-NH2 | 2241 |
| (GS)2 | Trp-(GlySer)2-Dbo-NH2 | 2144 |
| (GS)3 | Trp-(GlySer)3-Dbo-NH2 | 3119 |
| (PP) | Trp-(ProPro)-Dbo-NH2 | 2237 |
| (PP)2 | Trp-(ProPro)2-Dbo-NH2 | 2135 |
| (PP)3 | Trp-(ProPro)3-Dbo-NH2 | 3114 |
| peptide |
| SASA (nm2) | % loop | % PP | % coil |
|---|---|---|---|---|---|
| trpDbo | 0.463 (0.001) | 6.61 (0.01) | – | – | 100 |
| (GS) | 0.507 (0.002) | 8.06 (0.01) | 26 | 1 | 73 |
| (GS)2 | 0.537 (0.002) | 8.80 (0.02) | 45 | – | 55 |
| (GS)3 | 0.567 (0.004) | 9.87 (0.04) | 55 | – | 45 |
| (PP) | 0.481 (0.001) | 8.10 (0.01) | 25 | 12 | 63 |
| (PP)2 | 0.618 (0.003) | 10.23 (0.02) | 17 | 46 | 37 |
| (PP)3 | 0.797 (0.002) | 12.60 (0.02) | 13 | 57 | 30 |
| peptide |
|
|
|
| ⟨κ2⟩ |
|---|---|---|---|---|---|
| trpDbo | 0.38 | 0.97 | 0.86 | 0.87 | 0.589 |
| (GS) | 0.84 | 0.92 | 0.80 | 0.90 | 0.631 |
| (GS)2 | 0.87 | 0.95 | 0.82 | 0.97 | 0.673 |
| (GS)3 | 0.82 | 1.02 | 0.88 | 1.01 | 0.619 |
| (PP) | 0.91 | 0.81 | 0.89 | 0.82 | 0.673 |
| (PP)2 | 1.53 | 1.24 | 1.13 | 1.23 | 0.791 |
| (PP)3 | 2.10 | 1.89 | 1.80 | 1.66 | 0.686 |
| peptide | observable | τcorr (ps) |
| α | noise type | cross-corr. |
|---|---|---|---|---|---|---|
| TrpDbo |
| 3820.9 | 0.803 | 1.308 | 1/f | |
|
| 10.5 | 0.994 | 0.029 | white | –0.888 | |
| (GS) |
| 10722.6 | 0.781 | 1.335 | 1/f | |
|
| 1148.3 | 0.863 | 1.144 | 1/f | 1.000 | |
| (GS)2 |
| 6744.8 | 0.762 | 1.326 | 1/f | |
|
| 2194.7 | 0.781 | 1.306 | 1/f | 1.000 | |
| (GS)3 |
| 11071.5 | 0.750 | 1.352 | 1/f | |
|
| 2905.1 | 0.753 | 1.299 | 1/f | 1.000 | |
| (PP) |
| 3647.8 | 0.864 | 1.153 | 1/f | |
|
| 228.7 | 0.951 | 0.848 | 1/f | –0.346 | |
| (PP)2 |
| 47841.7 | 0.834 | 1.196 | 1/f | |
|
| 136.6 | 0.968 | 0.737 | 1/f | 0.002 | |
| (PP)3 |
| 9910.9 | 0.856 | 1.160 | 1/f | |
|
| 120.8 | 0.956 | 0.898 | 1/f | –0.423 |
| peptide |
|
|
|
| errorcorr(%) |
|---|---|---|---|---|---|
| trpDbo | 0.497 | 8.64 ± 0.84 | 21.36 ± 2.08 | 22.0 ± 0.7 | 2.9 |
| (GS) | 0.441 | 5.11 ± 0.48 | 12.64 ± 1.20 | 13.0 ± 0.4 | 2.8 |
| (GS)2 | 0.441 | 7.12 ± 0.86 | 17.60 ± 2.11 | 17.0 ± 0.5 | 3.5 |
| (GS)3 | 0.441 | 8.95 ± 1.34 | 22.13 ± 3.32 | 24.0 ± 0.7 | 7.8 |
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsProtein Structure and Dynamics · Lipid Membrane Structure and Behavior · RNA Interference and Gene Delivery
Introduction
Short peptides provide valuable model systems for probing the conformational dynamics of intrinsically disordered protein regions, such as loops and linkers.? Their reduced complexity enables detailed comparisons between molecular dynamics (MD) simulations? and single-molecule spectroscopic techniques, particularly Förster resonance energy transfer (FRET). ?−? ? ? In such studies, the choice of fluorophores and the accuracy of the underlying force fields are crucial for establishing reliable structure–dynamics relationships. Among several donor–acceptor pairs, the intrinsic fluorophore tryptophan combined with the extrinsic probe 2,3-diazabicyclo[2.2.2]oct-2-ene (DBO) has proven especially useful for characterizing peptides with end-to-end distances in the nanometer range. ?−? ? This pair has been widely applied to study the flexibility of short protein sequences. ?−? ? ? In particular, glycine/serine-rich sequences ?,?−? ? ? ? and polyproline peptides ?−? ? have been investigated in detail, as they serve as reference systems for random coil conformations in aqueous solution. Glycine- and serine-rich fragments, which are common in flexible protein regions, are particularly suitable for exploring how sequence composition influences conformational dynamics. ?−? ? In earlier work, we developed a GROMOS43A1 (G43A1)? based force-field model of a modified Aspartic residue linked to the DBO probe and benchmarked MD simulations against experimental FRET data. ?,?,?,?,? The results showed a good agreement between simulated and FRET-derived end-to-end distances and orientational factors, validating the model. The GS-based peptides predominantly adopted compact, disordered states consistent with polypeptides in poor solvents, ?,?,? while the polyproline consistently reproduced the expected extended state. ?−? ?
This study provided key benchmarks for understanding the conformational and dynamical behavior of short peptides in solution and for quantitatively linking FRET experiments with molecular simulations. Building on this work, the present study extends the analysis to Trp-(GS)_ n _-Dbo peptides (n = 0, 1, 2, 3) and to polyproline peptides of equivalent length. While glycine/serine sequences represent highly flexible, coil-like chains, polyprolines serve as comparatively rigid, rod-like reference systems. This contrast enables a systematic investigation of how sequence-dependent flexibility influences both equilibrium conformational sampling and intramolecular dynamics probed by FRET experiments. Although previous studies demonstrated the feasibility of combining time-resolved FRET measurements with MD simulations, they were largely limited to earlier generations of force-field descriptions. Recent developments in the GROMOS force field, culminating in the G54A7 parameter set with refined backbone and nonbonded interactions, provide a more balanced and physically consistent description of peptide flexibility and intramolecular interactions.? In this context, an improved representation of the Dbo-labeled aspartic residue compatible with G54A7 enables a direct and quantitative comparison between simulation results and experimental observables. The present work combines an improved, G54A7-consistent description of the Dbo-labeled aspartic residue with extensive MD simulations to investigate the structural and dynamical properties of fluorophore-labeled peptides. By comparing flexible glycine/serine-rich chains with rigid polyproline analogues, we assess how sequence-dependent flexibility is reflected in conformational sampling, end-to-end distance distributions, and intrachain dynamics probed by FRET experiments. The simulations reproduce experimental end-to-end distances with high accuracy and capture the systematic relationship between chain flexibility and fluorophore separation. Analysis of looping and contact-formation kinetics further yields quantitative agreement with experimentally measured rates after viscosity correction, consistent with a diffusion-controlled mechanism for intrachain contact formation on experimentally relevant time scales. Through microsecond-scale MD simulations, we characterize the structural ensembles and dynamical behavior of these systems, providing physical insight into how sequence composition governs both conformational sampling and time-dependent behavior in solution, and establishing quantitative benchmarks for the interpretation of FRET experiments on fluorophore-labeled peptides and intrinsically disordered protein segments.
Methods
Fluorophore Force Field and MD Simulations
The C-terminal Dbo residue is derived from an Asp amino acid, in which the side-chain carboxylate is replaced by an amide linkage to an amino-functionalized DBO fluorophore. ?,? In the present work, the G43A1-based model? was updated to ensure full compatibility with the G54A7 force field, ?,? The complete set of revised Dbo parameters is reported in Tables S1–S4 of the Supporting Information. Peptides with sequences Trp–(GS)_ n –Dbo and Trp–(PP) n –Dbo (n = 0–3) were simulated using the SPC water model.? This water model was chosen to preserve the internal consistency of the GROMOS force field. While SPC underestimates the experimental viscosity of water, this limitation primarily affects dynamical time scales and can be addressed through a posteriori viscosity-based corrections, as described below. All peptides were built in an extended conformation. Following the experimental conditions,? the simulations were performed at pH 7; thus, the N-terminus is protonated, and the C-terminus is amidated, giving each peptide a net charge of +1. Each peptide was solvated in a periodic cubic box of side length 5.0 nm, removing water molecules closer than 0.15 nm to any peptide atom. A single chloride ion was added to neutralize the total charge. Table provides a list of all simulated systems along with the total number of water molecules. All systems were energy-minimized using the steepest descent algorithm for at least 500 steps. Temperature was maintained at 300 K using weak coupling (coupling time τ T _ = 0.1 ps), applied separately to the peptide and solvent.? Pressure was controlled at 1 bar using the Berendsen barostat (τ_ P _ = 0.5 ps).? All covalent bonds were constrained using the LINCS algorithm,? and water geometry was constrained using SETTLE.? Simulations were performed using the GROMACS 2023 package, ?,? employing the time-reversible Verlet integrator. A single cutoff of 1.4 nm, as recommended for the GROMOS force field when used with the time-reversible Verlet scheme implemented in GROMACS,? was applied to both Lennard–Jones and Coulomb interactions, and the neighbor list was updated every 20 integration steps. Electrostatics were treated using the generalized reaction field method with a reaction-field dielectric constant of 61,? while the dielectric permittivity within the cutoff was set to 1, in accordance with standard GROMOS practice. The integration time step was 2 fs, and initial velocities were drawn from a Maxwell–Boltzmann distribution at 300 K. All systems were first equilibrated with positional restraints applied to the peptide to allow solvent relaxation, after which unrestrained production simulations of 1 μs were performed.
Analyses of MD Trajectories
Shear Viscosity of the SPC Water Model. The shear viscosity of the SPC water model was determined using nonequilibrium MD (NEMD) simulations following the method of Hess.? A spatially varying external acceleration field of the form was applied along the z-direction, with amplitude A = 0.025 nm ps^–2^. This perturbation induces a stationary velocity profile v _ z _(x), from which the shear viscosity η can be extracted analytically.? The velocity profiles were analyzed to compute the instantaneous inverse viscosity. The average inverse viscosity over the production trajectory was , which corresponds to a shear viscosity of The statistical uncertainty was evaluated via block averaging of the inverse-viscosity time series.? This value, in excellent agreement with previous NEMD determinations, ?,? was used to rescale the end-to-end contact time (see below).
Analysis of the Peptide Conformations. The peptide secondary structure was determined using the DSSP algorithm of Kabsch and Sander.? The solvent-accessible surface area (SASA) and the radius of gyration (R _ g _) were further computed to provide a qualitative description of the overall folding landscape of each peptide in solution. SASA values were obtained using the algorithm of Eisenhaber et al.?
Cluster analysis of the trajectories was performed following the method of Daura et al.,? using 10,000 conformations sampled at 100 ps intervals from each peptide simulation. Structural similarity between conformations was assessed using the positional root-mean-square deviation (RMSD), and clusters were constructed iteratively by identifying the structure with the largest number of neighbors within a cutoff radius, assigning its cluster, and removing its members from further consideration. All peptide non-hydrogen atoms were used in the analysis. The clustering cutoff was selected between 0.2 and 0.35 nm (see Table S5 in the Supporting Information) to produce a comparable number of clusters across all systems. Statistical uncertainties for all average quantities were estimated using the block-averaging method.?
End-to-End Distance and Orientational Factor. To enable comparison with experimental FRET measurements, we extracted both the end-to-end distance (R _ ee _) and the orientational factor (κ^2^).? The end-to-end distance was defined as the distance between the center of the Trp indole fusion bond and the center of the diazo group of the Dbo residue (Figure). The probability distribution of R _ ee _ was modeled using a sum of three skewed Gaussian functions
where A _ j , r j _ ^0^, and σ_ j _ are fitting parameters. This functional form represents a simple approximation of polymer end-to-end distance statistics, and it is commonly used to model FRET-derived distance distributions of flexible peptides. ?,?,? In our analysis, three Gaussian terms (nine parameters) were required to adequately reproduce the simulation-derived R ee distributions. Reducing the model to two Gaussians resulted in visibly poorer fits, particularly in the short-distance region. This region is especially important for comparison with FRET experiments, as it captures donor–acceptor contact configurations where the uncertainties associated with the orientational factor and FRET efficiency are highest.?
Definition of the polar angles θ D , θ A , and θ T and the end-to-end vector R⃗ee used in eq for computing the orientational factor κ2.
The orientational factor κ^2^ was calculated from the relative orientation of the Trp and Dbo transition dipole moments using the standard Förster expression ?,?
where θ_ D _ and θ_ A _ are the angles between each dipole moment and , and θ_ T _ is the dihedral angle between the two dipoles (Figure).
Free energy landscape of SASA versus radius of gyration The free energy landscape in the parameter space of SASA as a function of the radius of gyration (R g) is also calculated. This representation is usually used to provide a reduced representation of the folding landscape sampled during MD simulations by peptides.? The function is computed by evaluating the joint probability distribution of the two parameters on a grid of 50 × 50 points. The number of conformation occurrences at each grid point has been counted. The joint probability function was then converted into free energies using the Boltzmann inversion formula
the ΔF(R g, SASA) represents the change in free energy, where R is the gas constant, and T is the temperature in Kelvin, n(R g, SASA) is the count of conformations at a specific grid point, and N is the total number of analyzed conformations.
Contact Formation Kinetics. Looping rates (end-to-end collision rate constants) were calculated from the time series of the donor–acceptor separation R ee(t) between the Dbo and Trp chromophores. The analysis follows the first-contact-time (FCT) framework used in refs ?,? , where a contact event is defined as the first passage of R ee(t) below a predefined capture distance r, referred to as the sink radius. The sink radius represents the effective spatial range within which donor–acceptor contact (and quenching) can occur, accounting for chromophore size and local conformational fluctuations. Intercontact intervals were obtained from the MD trajectory as
for all i < j and k = 1, ..., n int. Here, n int is the total number of independently detected looping intervals. The mean first-contact time (MFCT) from simulation was then computed using the equation
which gives to the mean time between two successive contact events in a diffusion-limited process.
The experimentally derived FRET kinetics were measured in D_2_O. Because the SPC water model is known to underestimate the shear viscosity of experimental heavy water, ?,?,?,? a direct comparison of simulated and experimental kinetic observables requires a viscosity correction. Under the assumption that diffusive conformational kinetics scale linearly with solvent viscosity, as predicted by Kramers-type theories and commonly applied in peptide looping and contact-formation studies,? all simulated mean first-contact times (MFCTs) and correlation times were rescaled according to
here, η_SPC_ = 0.445 mPa·s is the shear viscosity of SPC water determined from independent nonequilibrium MD simulations (see below), and mPa·s is the experimental viscosity of heavy water at ambient conditions.? This yields a viscosity correction factor
which was applied uniformly to all peptides. Equilibrium structural properties were not affected by this rescaling.
All kinetic analyses were performed using custom Python scripts implementing a first-contact-time detection algorithm. Trajectories were prefiltered using a Savitzky–Golay smoothing routine ?,? to reduce high-frequency noise prior to FCT detection. For each peptide, the optimal sink radius r was determined by minimizing the deviation between corrected MFCTs and their experimental counterparts. Statistical uncertainties were estimated from the variance of the independent intercontact intervals, and agreement with experimental data was assessed by comparing propagated simulation errors with experimental uncertainties.
Time Series Analysis The dynamical behavior of each peptide was also characterized through time series analysis of two observables: the R ee distance, and the Cα–Cα distance (R CαCα) between the terminal residues. All analyses were performed using custom Python scripts based on the NumPy and SciPy scientific libraries. ?,?
Autocorrelation and Correlation Times
The normalized autocorrelation function (ACF) was computed using a fast Fourier transform (FFT) estimator
where x(t) denotes either R ee or R CαCα. The integrated correlation time was evaluated as
with τ_0_ defined as the first zero crossing of the ACF.? Cross-correlations between R ee and R CαCα were also computed to quantify how chain extension is dynamically coupled to terminal backbone separation.?
Hurst Exponent
To characterize temporal correlations and long-range memory effects in peptide conformational dynamics, we evaluated the Hurst exponent H for both R ee and R CαCα time series. The Hurst exponent provides a quantitative measure of persistence in stochastic processes and allows discrimination between purely diffusive fluctuations and dynamics influenced by internal friction or conformational constraints. The exponent H was determined using the classical rescaled-range (R/S) analysis ?,?
where R(n) denotes the range of cumulative deviations and S(n) the corresponding standard deviation within a window of size n. Values of H > 0.5 indicate persistent dynamics, reflecting correlated conformational motions along a rugged free-energy landscape, whereas H < 0.5 corresponds to antipersistent behavior dominated by restoring forces. The limiting case H = 0.5 describes uncorrelated Brownian diffusion. Deviations from Brownian behavior, therefore, provide insight into conformational memory effects that influence the temporal averaging of experimentally derived observables.
Power Spectral Density
To complement the time-domain analysis, the power spectral density (PSD) of the same time series was computed using Welch’s method,? employing Hanning windows with 50% overlap. Spectral analysis enables separation of dynamical contributions occurring on different time scales and provides insight into the hierarchy of motions sampled by the peptides.
The high-frequency region of the PSD was fitted to a power-law form
yielding the spectral exponent α. Different values of α correspond to distinct noise regimes: α ≈ 0 indicates white noise dominated by fast, uncorrelated local motions; α ≈ 1 corresponds to 1/f noise associated with scale-free dynamics and the coupling of multiple time scales; and α ≈ 2 reflects Brownian noise characteristic of slow, diffusive conformational rearrangements.
Together with the Hurst exponent, the PSD analysis provides a consistent description of peptide dynamics across time and frequency domains, linking fast local fluctuations to slower global conformational changes and clarifying their impact on experimentally measured, time-averaged FRET observables.
Results and Discussion
General Structural Features
The DSSP analysis revealed that all peptides are predominantly disordered, with coil structures representing the major secondary-structure component across the series. Only the polyproline peptides displayed appreciable β-strand content, increasing systematically with peptide length and reaching 57% for (PP)3. In contrast, (GS)3 exhibited the highest loop content (55%), consistent with the expected conformational flexibility of Gly/Ser-rich sequences.
We also assessed conformational diversity using the clustering approach described in the Methods section. Figure ? shows the cumulative number of identified clusters as a function of simulation time. In all cases, the curves exhibit clear plateaus, indicating satisfactory sampling of the conformational landscape.? The cluster analysis revealed a marked contrast between the flexible GS peptides and the more structurally restricted polyproline series. For (GS)2 and (GS)3, the three most populated clusters account for approximately 50% of all sampled conformations, as quantified explicitly in Table S5 of the Supporting Information and illustrated by the cumulative population curves in Figure. This distribution reflects a broad and diverse conformational ensemble, consistent with the expectation that short disordered peptides explore a rugged and heterogeneous energy landscape.? In contrast, the polyproline peptides exhibit a much narrower distribution, with the three dominant clusters accounting for more than 70% of the population, as shown in Figure and reported numerically in Table S5 of the Supporting Information.
Cumulative number of clusters as a function of simulation time.
1: Summary of Peptide Simulations
Representative structures of these dominant clusters are shown in Figure. As expected, the GS peptides sample compact conformations due to the increased flexibility imparted by glycine, whereas the polyproline peptides maintain extended architectures dictated by the rigidity of the proline backbone. These trends are consistent with R g reported in Table. Across the series, R g increases monotonically with peptide length, spanning 0.46–0.80 nm. The polyproline peptides, particularly for n ≥ 2, exhibit larger R g and SASA values than their GS analogues, reflecting their more extended conformations. Table S2 in the Supporting Information provides full cluster statistics including ⟨R g⟩, ⟨R ee⟩ and the population relative contributions (PRC) of the three most populated clusters. The GS peptides populate several compact conformational states with similar R g values and moderately distributed cluster populations (27–37% for the most populated cluster). By contrast, the polyproline peptides occupy more extended and conformationally restricted states, characterized by larger R g values and a clearly dominant first cluster (45–57% for PP, PP_2_, and PP_3_).
Representative structures from the first three clusters of each simulation. Glycine residues are shown in cyan, serine in pink, proline in orange, Dbo in red (nitrogen atoms as spheres), and tryptophan in blue.
2: Average Structural Parameters for all Peptides
Figure shows the contour plots of the free energy landscapes ΔF(R g, SASA) for the (GS)_ n _ and (PP)_ n _ peptide sequences. The resulting potentials of mean force (PMFs) display the characteristic comet-shaped profile previously observed for the (GS)3, (GG)3, and (SS)3 peptides.? For the (GS) peptides, the position of the free-energy minimum reflects a preference for compact, disordered conformations and is located at relatively small values of the radius of gyration R g. In contrast, the more rigid polyproline peptides exhibit a systematic shift of the free-energy minimum toward larger R g values as the chain length increases, consistent with their intrinsically extended backbone geometry.
Contour plots of the free energy landscapes ΔF(R g, SASA) derived from simulations of the (GS) n and (PP) n peptides.
End-to-End Distance Distributions and Orientation Factor
Figures and ? illustrate the R ee distance distributions for all peptides. Each figure presents the distributions as gray silhouettes overlaid with multi-Gaussian fits (orange lines). The colored curves correspond to the distributions obtained from the first three conformational clusters, scaled by their relative populations, with their sum shown as dashed red lines. All distributions are multimodal, featuring a main peak at short distances and, depending on the peptide, additional smaller peaks. The positions of these features coincide with the main peaks observed in the first three cluster distributions.
Distribution of R ee distances for all conformations of the (GS) n peptides (gray silhouette) with multi-Gaussian fits (orange line). Cluster-specific distributions are shown in brown, green, and blue, scaled by their relative populations (see Table S2 in the Supporting Information).
As in Figure , but for the (PP) n peptides.
For TrpDbo and (GS)3, two distinct peaks at short and longer distances are clearly visible. For the other (GS) peptides, the peaks at longer distances overlap and are less distinct. The distributions were fitted using triple-exponential functions (see Methods), which provided excellent representations of the data for the GS series, as also noted in previous studies. ?,?,? In contrast, the extended conformations of the polyproline peptides yield distributions with a single dominant peak. Because of this characteristic, even a single exponential function can adequately describe their profiles.
The average R ee distances obtained from the simulations are compared in Table with those reported in previous works using the G43A1 version of the force field ?,? and with the values estimated from experimental data. The comparison highlights the improved performance of the updated G54A7-based model relative to the earlier G43A1 description. For the GS series, the R ee values predicted with G54A7 (0.92–1.02 nm) are in excellent agreement with the experimental data (0.90–1.01 nm), whereas the G43A1 results (0.80–0.88 nm) systematically underestimate chain extension. This improvement is particularly evident for (GS)3, where the G43A1 prediction of 0.88 nm falls below the experimental value (1.01 nm), while G54A7 reproduces it almost quantitatively (1.02 nm). This trend indicates that the refined parameter set provides a more realistic balance of backbone flexibility and fluorophore interactions in disordered peptide ensembles. For the PP peptides, which are intrinsically more rigid, the differences between the two force fields are smaller. In addition to the donor–acceptor distance, Table also reports the terminal Cα–Cα distance R CαCα, which provides a direct measure of backbone end-to-end extension, independent of fluorophore size, linker flexibility, or orientational effects. This quantity serves as a structural reference for interpreting the fluorescence-based R ee values. For the GS peptides, R CαCα remains relatively small (0.82–0.87 nm) and shows only a weak dependence on chain length, consistent with their compact and highly flexible disordered conformations. In contrast, the polyproline peptides exhibit a pronounced and systematic increase in R CαCα with chain length, from 0.91 nm for (PP) to 2.10 nm for (PP)3, reflecting the stiffness of the proline backbone and the tendency to form extended, rod-like conformations. The TrpDbo peptide displays the smallest Cα–Cα separation (0.38 nm), indicative of a highly compact backbone arrangement. Nonetheless, GROMOS54A7 generally produces values closer to the experiment. For example, the R ee for (PP)2 improved from 1.13 nm (G43A1) to 1.24 nm (G54A7), in excellent agreement with the experimental estimate of 1.23 nm. The longest proline chain, (PP)3, was slightly overextended in G54A7 (1.89 nm versus 1.66 nm), while G43A1 slightly underestimated it (1.80 nm). Despite this, both force fields reproduced the overall rigidity and extended nature of proline-rich chains.
3: Calculated and Experimental Parameters for all Peptides
The average orientation factor (κ^2^) distributions further support these structural trends. The GS peptides exhibit relatively broad κ^2^ distributions, consistent with the greater orientational freedom of the Trp-DBO donor–acceptor pair, whereas the PP peptides display narrower profiles reflecting their more restricted conformational sampling. The mean values of ⟨κ^2^⟩ (Table) fall within the range expected for flexible versus conformationally constrained peptide chains. To assess the donor–acceptor relative orientation more quantitatively, we compared the average ⟨κ^2^⟩ values for all peptides. Most peptides exhibit mean orientation factors close to the isotropic limit of 2/3, with deviations of typically 5–10%. The polyproline dimer (PP_2_), which samples particularly restricted backbone geometries, shows a larger deviation (approximately 0.79), consistent with its more rigid conformational landscape.
Overall, these results demonstrate that the refined DBO residue parameters, combined with the GROMOS54A7 backbone description, provide a substantially improved representation of peptide end-to-end distances. The excellent agreement with the FRET benchmarks (particularly for the GS series) underscores the reliability of the updated model for quantitative spectroscopic comparisons.
Time Series Analysis
We also analyzed the R ee and R CαCα time series using Hurst exponents, autocorrelation times, PSD scaling, and the zero-lag cross-correlation between the two observables. The resulting dynamical indicators are summarized in Table, allowing a direct comparison of the dynamical behavior of global chain extension and backbone end-to-end fluctuations.
4: Summary of the Main Dynamical Indicators Extracted from Time-Series Analysis of the Peptide Simulations
The correlation times τ_corr_ (third column of Table) span a wide range, from tens of picoseconds to tens of nanoseconds. For the GS series, both observables display correlation times of comparable order of magnitude, with systematically shorter relaxation times for R CαCα, consistent with concerted backbone motions. In contrast, for Trp–Dbo and the polyproline peptides, R CαCα relaxes significantly faster than R ee, indicating rapid local fluctuations superimposed on slower global rearrangements.
The estimated Hurst exponents H (fourth column of Table) are consistently high (H ≳ 0.75) for both R ee and R CαCα across all peptides, indicating strong long-range persistence and highly correlated dynamics. Such values are characteristic of fractional Brownian motion with pronounced memory effects? and are often associated with rugged and multiscale biomolecular free-energy landscapes.? Notably, the GS peptides exhibit similar H values for both observables, suggesting that backbone fluctuations and global chain extension are governed by the same underlying correlated motion. In contrast, the polyproline peptides show systematically higher H values for R CαCα than for R ee, reflecting the enhanced rigidity of the proline backbone.
The spectral exponents α of the PSD (fifth column of Table) reveal predominantly 1/f (pink) noise behavior for R ee, indicative of scale-free dynamics. In contrast, R CαCα exhibits a broader range of spectral exponents, with values closer to white or weakly correlated noise for Trp–Dbo and the polyproline peptides. This reflects the presence of fast, less correlated local backbone fluctuations in more rigid sequences.
Finally, the cross-correlation coefficients reported in the last column of Table quantify the degree of coupling between R ee and R CαCα. The GS peptides show strong positive correlations, indicating a concerted increase or decrease of both distances. In contrast, Trp–Dbo and the longer polyproline chains exhibit weak or negative correlations, revealing a partial decoupling between local backbone motions and global chain extension.
End-to-End Contact Kinetics
The kinetics of donor–acceptor contact in the peptide series were analyzed using the MFCT. This is defined as the average time it takes for the donor–acceptor distance R ee(t) to first fall below a specified contact radius r (see Methods). Starting from the reference value of 0.45 nm employed in previous studies,? the optimized radii (r opt) fall between 0.44 and 0.50 nm (Table), consistent with the expected spatial proximity of the Dbo and Trp chromophores and in line with our earlier work. ?,?,? To compare simulations with experimental MFCTs measured in D_2_O, all simulated times were multiplied by a viscosity correction factor of f = 2.472. This correction reduces the mean deviation between simulation and experiment from approximately 60% to about 4%, corresponding to an absolute reduction of roughly 56–57% across the peptide set. Statistical uncertainties in the uncorrected MFCTs (0.48–1.34 ns, or 6–15%) propagate to 1.2–3.3 ns (5–15%) after the correction, remaining well within the experimental error of approximately 3%. For all peptides, the corrected MFCTs agree with experiment within the combined uncertainty bounds, with (GS)3 showing marginal but still acceptable overlap due to its larger statistical uncertainty (Table). The optimized sink radii reflect subtle variations in chromophore accessibility and local backbone flexibility. The glycine–serine peptides display the fastest looping kinetics, consistent with their highly flexible and compact conformational ensemble. In contrast, the polyproline series reported previously,? which adopt more extended and conformationally restricted structures, display progressively slower contact formation times. The strong overall agreement between the corrected MFCTs and experimental values supports a diffusion-controlled mechanism for end-to-end contact formation, consistent with theoretical expectations for disordered peptides. ?,?,?,? Taken together, these results demonstrate that solvent-mediated diffusion dominates the kinetics, while peptide-specific structural effects are effectively captured by optimizing the sink radius, enabling accurate and robust kinetic predictions from MD simulations.
5: Optimized donor–acceptor Contact Formation Times with Viscosity Correction and Propagated Uncertainties
More generally, the refined force field and the combined FRET–MD framework presented here are not limited to short model peptides but are applicable to a broad class of flexible biomolecular systems. MD simulations have been extensively used to rationalize the stability and conformational behavior of flexible and designed peptides, where subtle balances between backbone flexibility and side-chain interactions shape the accessible ensembles. ?,? These considerations become even more relevant for intrinsically disordered regions and short protein segments, whose functional properties are governed by heterogeneous and highly dynamic conformational landscapes. Accurately capturing such disordered ensembles remains a central challenge for biomolecular force fields, and quantitative validation against experimental observables is essential for their assessment. ?,? In this respect, the present work extends force-field benchmarking by combining equilibrium and kinetic FRET observables, thereby validating not only average structural properties but also intramolecular diffusion and looping dynamics. More broadly, the integration of FRET-derived information with molecular simulations has emerged as a powerful strategy for resolving heterogeneous conformational ensembles in flexible systems.? The refined DBO force-field model and the analysis framework developed here provide a transferable platform for applying FRET-assisted MD to intrinsically disordered segments, flexible linkers, and small proteins undergoing conformational adaptation or binding-induced rearrangements.
Conclusions
In this work, we conducted microsecond-scale MD simulations of Trp-(GS)_ n _-Dbo peptides (n = 0–3) and their polyproline analogues using the updated G54A7 force field together with newly refined parameters for the Dbo fluorophore. The simulations reveal that Gly/Ser-rich sequences predominantly adopt compact, disordered conformations, whereas proline-based peptides maintain extended, rod-like structures, consistent with their intrinsic rigidity. The incremental extension of the Gly/Ser chains increases the mean end-to-end distance by approximately 0.1 nm per residue, in excellent agreement with FRET experiments. The refined Dbo model significantly improves the quantitative reproduction of experimental donor–acceptor distances compared to the earlier G43A1 formulation, indicating a more accurate balance between peptide flexibility and chromophore interactions. When applied to contact-formation kinetics, the combination of trajectory smoothing, optimized sink radii, and viscosity correction yields mean first-contact times that agree with experiment within 5%. This demonstrates both the precision and the physical consistency of the kinetic model.
Overall, this study validates the improved Dbo force-field description and its seamless integration with G54A7 for the quantitative simulation of time-resolved spectroscopic observables. Beyond reproducing experimental FRET data, the results establish a robust computational framework for probing conformational dynamics and intramolecular diffusion in disordered and semirigid peptide chains, offering a valuable tool for benchmarking and refining biomolecular force fields.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Papaleo E.Saladino G.Lambrughi M.Lindorff-Larsen K.Gervasio F. L.Nussinov R.The role of protein loops and linkers in conformational dynamics and allostery Chem. Rev.20161166391642310.1021/acs.chemrev.5b 0062326889708 · doi ↗ · pubmed ↗
- 2Roccatano D.Computer simulations study of biomolecules in non-aqueous or cosolvent/water mixture solutions Curr. Protein Pept. Sci.2008940742610.2174/13892030878513268618691127 · doi ↗ · pubmed ↗
- 3Haas E.Wilchek M.Katchalskikatzir E.Steinberg I. Z.Distribution of End-to-End Distances of Oligopeptides in Solution as Estimated by Energy-Transfer Proc. Natl. Acad. Sci. U.S.A.1975721807181110.1073/pnas.72.5.18071057171 PMC 432635 · doi ↗ · pubmed ↗
- 4Lakowicz, J. R. Principles of Fluorescence Spectroscopy; Kluwer Academic/Plenum: New York, 1999.
- 5Soranno A.Cabassi F.Orselli M. E.Cellmer T.Gori A.Longhi R.Buscaglia M.Dynamics of Structural Elements of GB 1 beta-Hairpin Revealed by Tryptophan-Cysteine Contact Formation Experiments J. Phys. Chem. B 2018122114681147710.1021/acs.jpcb.8b 0739930215522 · doi ↗ · pubmed ↗
- 6Schuler B.Borgia A.Borgia M. B.Heidarsson P. O.Holmstrom E. D.Nettels D.Sottini A.Single-molecule FRET spectroscopy and the polymer physics of unfolded and intrinsically disordered proteins Annu. Rev. Biophys.20225142545310.1146/annurev-biophys-062215-01091527145874 · doi ↗ · pubmed ↗
- 7Sahoo H.Förster resonance energy transfer spectroscopic nanoruler: Principle and applications J. Photochem. Photobiol., C 201112203010.1016/j.jphotochemrev.2011.05.001 · doi ↗
- 8Sahoo H.Roccatano D.Zacharias M.Nau W. M.Distance Distribution of Short Polypeptides Recovered by Fluorescence Resonance Energy Transfer in the 10 Å Domain J. Am. Chem. Soc.20061288118811910.1021/ja 062293 n 16787059 · doi ↗ · pubmed ↗
