Low‑Q Asymptotic Behavior of the Effective Structure Factor Yields Model-Independent Radius of Interparticle Interaction (R i )
Chelsea E. R. Edwards, Wellington C. Leite, Yun Liu

TL;DR
This paper introduces a new method to determine interparticle interaction radius from low-Q scattering data, similar to Guinier analysis for particle size.
Contribution
A novel model-independent method to extract the radius of interparticle interaction (Ri) from low-Q scattering data is introduced.
Findings
The new method successfully analyzes experimental neutron scattering data on lysozyme solutions.
Ri quantifies deviations in interparticle correlations from ideal gas behavior due to interactions.
Abstract
Guinier analysis has been extensively used in academic and industrial research settings to obtain the model-independent size of a polymer, protein, or colloid in solution from small-angle scattering data. Using the Guinier model, the radius of gyration (R g ) is extracted from the form factor at low Q. Here, we develop an analogous approach for analyzing the effective structure factor data at low Q to extract a model-independent radius of interaction potential, R i . Whereas R g describes how spread out the scattering length density distribution of particles is from their center of mass, R i is an effective root-mean-square distance that quantifies how far the interparticle correlation deviates from its ideal gas configuration due to interactions. We demonstrate this novel analysis method by applying it to experimental small-angle neutron scattering data on lysozyme protein…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4| General Case | Dilute Limit | |
|---|---|---|
| Classical Guinier law: |
| |
| Already assumes | ||
| Radius
of gyration ( |
| |
| Radius of interparticle interaction ( |
|
|
| For weak interactions | ||
| Low- |
|
|
| Modified Guinier
law: L |
|
|
| Nominal Concentration | 2.5 | 5 | 7.5 | 10 | 15 | 20 |
|---|---|---|---|---|---|---|
| Median | 2.46 | 4.92 | 7.40 | 9.95 | 14.14 | 18.59 |
| Mean ( | 2.45 | 4.91 | 7.40 | 9.94 | 14.21 | 18.56 |
| Standard Deviation of | 0.03 | 0.04 | 0.06 | 0.05 | 0.20 | 0.12 |
| Number Density ( | 1.030 | 2.068 | 3.117 | 4.187 | 5.985 | 7.818 |
| Standard
Deviation of | 0.013 | 0.017 | 0.026 | 0.021 | 0.083 | 0.052 |
| Volume % (100ϕ) | 0.168 | 0.336 | 0.507 | 0.681 | 0.974 | 1.272 |
| Standard Deviation of 100ϕ | 0.002 | 0.003 | 0.004 | 0.003 | 0.014 | 0.009 |
- —Division of Materials Research10.13039/100000078
- —National Institute of Standards and Technology10.13039/100000161
- —Oak Ridge National Laboratory10.13039/100006228
- —National Academies of Sciences, Engineering, and Medicine10.13039/100009643
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnzyme Structure and Function · Protein Structure and Dynamics · Electrostatics and Colloid Interactions
Introduction
1
Small-angle X-ray and neutron scattering (SAXS and SANS) methods are convenient and well-established techniques used to measure the relevant angstrom-to-micron length scales across biology and soft matter systems, elucidating structure–property relationships in materials like polymers, proteins, and micelles. The 1D scattering pattern, I(Q), for a solution of relatively monodisperse particles with number density, n, can be generally expressed as, I(Q) = nP(Q)S(Q), ?−? ? where Q is the magnitude of the scattering wave vector, Q; P(Q) is the (single-particle) form factor; and S(Q) the interparticle structure factor. Among many analysis models of scattering data, the Guinier model is among the most well-established,? and is commonly used to obtain the radius of gyration of a typical particle in solution. ?,? Guinier analysis is applicable at low particle concentration or dilute conditions where interparticle interactions are minimal, so S(Q) = 1 is typically assumed and I(Q) ≈ nP(Q) = n|F(Q)|^2^. In the limit of small Q, the “shape function”, F(Q), which is the Fourier transform of the scattering length density (SLD) distribution ρ(r), given by F(Q) = ∫ ρ(r)e ^ i Q·r ^ d r, is approximated with a Taylor expansion of the phase factor e ^ i Q·r ^ to arrive at the Guinier law,
Note that, because all orientations of particles in the solution are equally likely, we use scalar Q for F(Q). Eq 1 enables model-free determination of the radius of gyration, R _ g _, directly from small-angle scattering data, given by
Here, Δρ(r) is the SLD difference between particles and solvent.
In Guinier analysis, R _ g _ ^2^ is defined as the mean squared displacement (MSD) of the particle’s SLD distribution relative to the solvent SLD, i.e. how spread out a particle is from its apparent center of SLD. For homogeneous scattering objects, the value of R _ g _ as measured by scattering is equivalent to the true value of R _ g _, which reflects the mass density distribution about the center of mass of particles. By focusing on the low-Q asymptotic behavior of P(Q), the Guinier model allows us to obtain the size information of a particle without having a priori knowledge of the shape and morphology of particles.
S(Q) is determined by the relative positions of particles in solution, which has a strong impact to the low-Q region when n is large. For systems at equilibrium, S(Q) is a thermodynamic quantity determined by the interparticle potential, V(r). For monodisperse, isotropic particles with isotropic distribution,?
where h(r) is the total correlation function. h(r) = g(r) – 1 represents the deviation of the local particle density from a uniform distribution, or more explicitly it is the difference in probability between a uniform distribution and the real system, where the probabilistic distribution is described by the pair distribution function, g(r). Therefore, measuring I(Q) of concentrated particle solutions has been widely used to probe particle–particle interactions, V(r), in solution.
Information about interparticle potential is traditionally obtained by fitting I(Q) data from concentrated samples, which requires modeling both S(Q) and P(Q) simultaneously. While it is typically straightforward to estimate P(Q) if the solute’s structure details are known or can be obtained from by measuring I(Q) in dilute solution, modeling S(Q) has proven to be challenging. For systems with isotropic interactions, if V(r) is known, S(Q) can be obtained by solving the Ornstein–Zernike (OZ) equation. ?−? ? Typically, this requires a closure relationship, e.g. Percus–Yevick, Hypernetted Chain, etc.,? whose accuracy varies with the interaction potential. Analytically solving the OZ equation is very difficult, with solutions developed for only a few isotropic V(r). ?−? ? ? ? In recent years, numerical methods have been used to solve the OZ equation by fitting experimental data, ?,? and efforts to implement thermodynamic self-consistency have been developed to improve the accuracy of OZ equation solutions. ?,?−? ? However, these approaches are not yet widely used, and all are limited to isotropic V(r). For anisotropic interparticle interactions, evaluating S(Q) faces even steeper challenges. The Reference Interaction Site Model (RISM), which makes an additional approximation when solving the OZ equation, was developed several decades ago to address this issue ?,? and was extended to polymer systems.? However, the broad applicability and widespread accuracy of these approaches across colloidal and protein systems are still not widely investigated.
To bypass the challenge of solving the OZ equation for anisotropic particles, Liu et al. recently introduced the generalized second osmotic virial coefficient, B 22(Q), a function that encodes how net pairwise interactions vary across length scales and satisfies S(Q) = 1 – 2nB 22(Q) under dilute conditions.? It is an extension of the traditional measurement of the second virial coefficient B 22 (or B 22 (Q=0) ), which quantifies the interaction information averaged over distance and orientation between solute–solute pairs in a single parameter. B 22 has been cataloged across numerous protein, polymer, and other colloidal systems, typically using static light scattering measurements, ?−? ? and has been linked to viscosity? and the tendency of proteins to aggregate,? among other applications.? Whereas B 22 is defined at Q = 0, this approach demonstrates its measurement at finite nonzero Q-values using SAXS or SANS. And the analytical theory to calculate B 22(Q) or S(Q) at dilute conditions is developed for any isotropic or anisotropic interaction model and particle shape.?
Still, many experiments and simulations focus on the small Q limit due to the complexity in modeling S(Q) and B 22(Q). Therefore, similar to the Guinier equation that describes low-Q form factor scattering data and returns a shape-independent size parameter (R _ g _), it would be useful to have an equation that describes the asymptotic low-Q behavior of S(Q) and returns a shape-independent interaction parameter.
In this work, we develop a universal theory to model the S(Q) data at low Q. This theory can then be used to model I(Q) at low Q for any sample concentration using a method similar to the traditional Guinier analysis. Our theory introduces a new interaction length scale analogous to R _ g _, which is extracted directly from the experimental small-angle scattering data. This new length scale, R _ i _, defined as
is the MSD of the interparticle correlation, i.e. how far a typical particle deviates from the uniform distribution due to interparticle interactions. For low-concentration systems with weak interactions, R _ i _ is directly related to the interparticle potential, V(r), as . We thus call this term the “radius of interaction potential” (or “interaction radius”). While in some fields B 22 ^1/3^ could also be considered as a thermodynamic length scale related to pair interactions, since B 22 is the system-averaged effective interaction volume of a particle pair, this length scale only reflects the net magnitude of interactions. Conversely, R _ i _ reflects the details of the shape of the interparticle potential when providing a characteristic length scale of interparticle interactions. R _ i _ can thus differentiate between systems with predominantly short-ranged versus long-ranged interparticle interactions, even if the B 22 values are the same.
As a result, the scattering analysis method introduced herein enables for the first time the net spatial extent of the correlation function to be obtained directly from experimental data, using a new parameter that simulation experts can also straightforwardly compute. The approach also provides a quantitative measure of how the apparent experimental R _ g _ is influenced by the particle concentration and interparticle interactions. Analogous to the traditional Guinier analysis on P(Q), our method obtains interaction information in a model-independent way, without a priori knowledge of the interparticle potential or S(Q).
We begin by providing a derivation for our model, including both isotropic and anisotropic particles, as well as noting the modification to the Zimm equation that follows from the derivation. We also demonstrate the application of the method to extract R _ i _ and concentration-independent R _ g _ from example experimental SANS data using a model protein system of lysozyme in histidine buffer. In addition to introducing the model and its implementation, our goal is to demonstrate the utility and easy application of our method to any colloidal system where Guinier analysis might traditionally be applied. Broadly, the work provides a more accurate description of the overall size of biomacromolecules, such as proteins, while also applicable to polymers, lipid-based systems, and other colloids.
Theory
2
The classical Guinier law gives , with R _ g _ (eq) obtained by analysis of the low-Q asymptotic behavior of SAXS/SANS data. Herein, we focus on developing the asymptotic equation describing S(Q) at low Q, from which we introduce a new parameter, R _ i _, as the root-mean-square distance of the interparticle interactions (eq). The key equations from this section are summarized in Table.
1: Summary of Key Equations for Both Isotropic and Anisotropic Particles
Case 1: Spherical Solutes
2.1
For a system with monodisperse particles with isotropic density distribution and isotropic interactions, the structure factor is given by eq. At the limit of Q → 0, the phase factor can be approximated using a Taylor expansion as , which leads to
Because S(Q) is isotropic, we can perform the angular average of Q. Eq can then be simplified by observing that i Q ·∫ h(r)rdr = 0, since g(r) is isotropic, and therefore h(r)r is odd. In addition, ∫ h(r)(Q ·r)^2^dr = Q ^2^ ∫ h(r)r ^2^ cos ^2^(θ)dr, whose average value over the solid angle is . As a result,
where we have introduced the interaction radius, R _ i _ ^2^, defined in eq. Next, we observe that S(0) = 1 + n∫ h(r)dr, so we can write eq equivalently as
Eq can be rewritten by recognizing that it is the expansion of the exponential function in the limit of small Q ^2^, giving
This expression describes the low-Q asymptotic behavior of the structure factor S(Q), analogous to the Guinier law for the form factor P(Q).
Since the scattering intensity is given by I(Q) = nP(Q)S(Q),? we can combine the low-Q expansion of S(Q) (eq) with Guinier law (eq) to obtain a modified Guinier expression that accounts for structure-factor effects at finite concentration as follows:
This equation takes the exact same form as the classical Guinier law; here, R _ g,obs_ is still the observed or apparent radius of gyration as measured by traditional Guinier analysis. However, by accounting for S(Q) in the derivation as shown herein, R _ g,obs_ introduces two new parameters to more accurately describe the colloidal length scales,
where the constant . In this equation, R _ i _ is the radius of interparticle interaction introduced above, and R _ g,0_ is the value of R _ g,obs_ at the infinitely dilute limit (n = 0). In other words, R _ g,0_ is the concentration-independent or “true” value of the radius of gyration as measured by small-angle scattering.
For a general or concentrated particle solution, h(r) is related with the potential of mean force, ω(r), through , where k _ B _ is the Boltzmann constant and T is the absolute temperature.
Dilute Limit
2.1.1
Further simplifications can be made in the dilute limit. At relatively dilute conditions, ω(r) is the pairwise effective interaction potential, V(r). Under these conditions,
If furthermore is small, i.e. the interaction potential is relatively weak, then the exponential term in eq can be approximated with a Taylor series to give
Under these conditions, R _ i _ ^2^ is an approximate measure of the spatial extent or effective range of interparticle interaction as described by the potential V(r). In other words, for dilute solutions with weak potential, R _ i _ directly describes the typical distance over which the interaction potential is significant.
Additionally in the dilute limit, ξ becomes , where B 22 is the second virial coefficient, which at the limit of small n is ξ ≈−nB 22. Therefore, eq can be written as
In other words, the observed R _ g _ from classical Guinier analysis varies linearly from its infinitely dilute value at a rate determined by interparticle interactions. Practically, the two parameters R _ g,0_ and R _ i _ can thus be extracted from a linear fit to R _ g,obs_ data, provided that −nB 22 can be calculated as shown in Section.
Case 2: Anisotropic Solutes
2.2
For anisotropic solutes with isotropic density distribution, we show here that the low-Q asymptotic behavior of the scattering intensity has the same final form as for spherical solutes, with
where “A” simply denotes the anisotropic case. Again, can be also linked to the interaction radius by the following equation
Here, the ′ denotes orientation-dependent averages of the interparticle correlation functions. Otherwise, the anisotropic modified Guinier equation and the dependence of R _ g, obs_ ^ A ^ on the infinitely dilute R _ g _ and radius of interparticle correlation match eqs and ? exactly. Thus, from an experimental perspective, the application and interpretation of the modified Guinier equation for anisotropic and spherical solutes is identical.
We derive eqs and ? as follows. For monodisperse anisotropic particles, the SANS/SAXS pattern can generally be expressed as
where ⟨P(Q, Ω)⟩Ω indicates the average of the orientation-dependent form factor P(Q, Ω) over all particle orientations Ω, and S(Q) is the effective structure factor. ?,? Here and in the following we use Ω to refer to the orientation of a particle in 3D space; Ω_ i _ refers to the orientation of particle i.
Liut et al. have shown recently that S(Q) in eq is given by?
where the orientation-dependent shape function, F(Q, Ω) = ∫ ρ(r, Ω)e ^ i Q·r ^ d r, is defined in terms of ρ(r, Ω), the SLD distribution function of solutes with orientation Ω. (We replace Δρ(r) with ρ(r) to simplify the equations in this section.) Thus, F 1 and complex conjugate F 2 ^*^ describe particles with orientation Ω_1_ and Ω_2_, respectively. The orientation-dependent total correlation function h(r, Ω_1_, Ω_2_) is given by h(r, Ω_1_, Ω_2_) = g(r, Ω_1_, Ω_2_) – 1. Whereas g(r, Ω_1_, Ω_2_) describes the probability of finding a particle at position r with orientation Ω_2_ given that there is a particle at the origin with orientation Ω_1_, h(r, Ω_1_, Ω_2_) describes the excess (h > 0) or deficit (h < 0) in that probability due to interactions, relative to a uniform distribution at the same density. In particular, , where n(r, Ω_1_, Ω_2_) is the conditional number density of particles with separation r and orientation Ω_1_, Ω_2_; and n is still the bulk (positionally and orientationally averaged) number density.
For randomly oriented particles in solution, averaging over Q-orientation does not affect S(Q) = ⟨S(Q)⟩_ Q _, which is described by the expression
This simplifies to
As Q → 0, the phase factor in F _ i (Q, Ω i _) can be approximated using a Taylor expansion as in eq, i.e.,
where r _ i _ indicates distance to particles of orientation Ω_ i _. This expression can be simplified analogously to eq by observing that i Q ·∫ ρ(r, Ω)r d r = 0 as we can define the center of the scattering length to be the origin. So,
Therefore, the averaged integral in eq becomes
which expression can be further simplified to
After Taylor expansion of the phase factor e ^ i Q·r ^ and only keeping the terms up to Q ^2^, this expression is equivalently
The Q-average can be distributed as
which expression can be further simplified using the isotropic average ⟨(Q ·r)^2^⟩_ Q _ = Q ^2^ r ^2^/3 to obtain
Since F(Q = 0) is a constant, i.e. ∫ ρ(r _ i , Ω i )dr _ i _ takes the same value for all particle orientations Ω i , we can define Λ ∫ ρ(r 1, Ω_1)dr 1 = ∫ ρ(r 2, Ω_2_)dr 2. Note that R _ g _ can be written equivalently as (see eq). Plugging in gives
To further simplify eq, the Q- and orientationally averaged form factor, ⟨P(Q, Ω)⟩_Ω,Q _ must also be evaluated with
Note that P(0) = |F(0)|^2^ = Λ^2^. In addition, at a specific Q-value is simply , where the approximation from the Taylor expansion of the exponential is the same as that in the Guinier's law derivation. Therefore, we can rewrite eq as
Next, we plug the simplified expressions for the integral (eq) and the averaged form factor (eq) into the expression for the effective structure factor S(Q) for a solution of anisotropic particles with isotropic density (eq):
which simplifies to
The fraction in the above equation can be further simplified as
which is equivalent to
Defining = ⟨h(r, Ω_1_, Ω_2_)⟩Ω_1,Ω_2_ _, where the double-overline denotes the average over the orientations of particles 1 and 2, then we may define . Similarly, we define , which is essentially the same as eq. Using these definitions, we rewrite the effective structure factor as
In the limit of small Q ^2^, this expression can be approximated by an exponential as
similarly to the spherical particle case (eq). As in the derivation for spherical solutes, we combine the above expression for S(Q) with the classical Guinier law to obtain the structure-factor-corrected Guinier expression for anisotropic solutes:
which leads to eq. Note that this modified Guinier law is exactly the same as that for spherical particles (eq). Since the measured I(Q) will not differentiate between and S(0), the final equations for the anisotropic and spherical cases are identical from an experimental perspective.
Implications for Zimm Analysis
2.3
Given the close relationship between the Zimm equation and the Guinier law, we show here the modified form of the Zimm equation due to the structure factor effects that follows directly from the above equations. To derive the Zimm equation, the Guinier law (eq) can be rewritten as
The Zimm equation assumes dilute conditions and is typically written in terms of the second Virial coefficient B 22(Q = 0) or B 22, i.e., using for n → 0. Thus, eq can be approximated with a Taylor expansion to arrive at the equation:
Note that Zimm’s original 1948 equation,? and also the form used in Lodge and Hiemenz,? can be simply derived from eq (see ESI Section 1).
To derive the modified Zimm equation, we follow this same procedure using our modified Guinier equation (eq). In reciprocal form, it is
Again, S(0)^−1^ and the exponential terms can be approximated using Taylor expansion. We also plug in ξ ≈ −nB 22. The resulting extension of the Zimm equation, which is relevant as Q, n → 0 is given by
As compared to eq, this modified Zimm equation has one additional term related to the interaction radius that also varies with the concentration n.
Expected Magnitude of R
i at Dilute Conditions
2.4
How far might we expect the particle configuration to deviate from the ideal gas configuration at relatively dilute conditions? As a first approximation, consider the hard sphere system, wherein spheres of uniform internal density and diameter σ = 2R HS, where R HS is the radius, interact according to the classical hard sphere potential
In this case, the true hard-sphere radius of gyration is given by . The radius of interparticle interaction is given by , where , which evaluates to . Therefore, in this limiting case, we find that
or R _ i _ ^2^ = 4R _ g _ ^2^. Though this exact equivalence is unlikely to hold in a real system, we may expect similar orders of magnitude for R _ g _ and R _ i _. For systems involving significantly nonideal characteristics, like strongly attractive or sticky interactions and anisotropic or floppy solutes, determining the typical magnitude of R _ i _ will require substantial future experimental characterization across a variety of real systems.
Materials and Methods
3
Materials
3.1
Lysozyme from chicken egg white was obtained as an off-white crystalline powder from MP Biomedicals (<3.5% chloride and activity >23,500 u/mg solid). Sodium chloride (NaCl) was obtained from Sigma-Aldrich (BioXtra: ≥99.5%). l-Histidine (98+%) and l-histidine hydrochloride monohydrate (98%) were obtained from Thermo Scientific Chemicals. Deuterium oxide (99.9% D_2_O) was obtained from Cambridge Isotope Laboratories, Inc.
Solution Preparation
3.2
Histidine buffer was prepared by dissolving 0.3870 g of l-histidine with 0.5242 g of l-histidine hydrochloride monohydrate in 200 mL of D_2_O using a volumetric flask. The buffer pH was measured to be 6.05. Next, NaCl was added to the solution by dissolving 0.8767 g of NaCl in 100 mL of the l-histidine buffer using a volumetric flask. The resulting final buffer consisted of 150 mmol/L NaCl in 25 mmol/L histidine buffer, pH 6 in D_2_O. The buffer was vacuum filtered with a Thermo Fisher Scientific Nalgene RapidFlow 50 mm filter unit.
To prepare the protein solutions, frozen crystalline lysozyme was brought to room temperature, massed on a microbalance (120.0 mg), and 6 mL of buffer was added to reach a nominal concentration of 20 mg/mL. This solution was mixed on a rocker for 30 min at room temperature. A concentration series was prepared by successive dilution of this initial stock solution to obtain nominal concentrations of 15, 10, 7.5, 5, and 2.5 mg/mL lysozyme. The buffer and protein solutions were prepared 4 days prior to the first day of the neutron scattering experiment, and were stored in a closed container away from light and sealed with parafilm, either at 4°C or on ice during transportation to the beamline.
Concentration and Volume Fraction
3.3
The exact mass concentration of lysozyme corresponding to each nominal concentration was measured using the NanoDrop One instrument (Thermoscientific) within 12 h of conducting the corresponding neutron measurement. The default lysozyme measurement settings were used: = 26.4 at 280 nm for a 1% mass fraction lysozyme solution (10 mg/mL). At least 6 measurements were conducted per solution (2 μL each). The resulting average, median, and standard deviation for each concentration are shown in Table.
2: Measured Lysozyme Concentrations [mg/mL], Number Density in Units [1/mL] ∗ 1017, and Volume Percent
Number densities n of lysozyme were calculated using the measured mean mass concentrations, c, from the nanodrop measurements using M _ W _ = 14.3 kDa, its theoretical mass based on the amino acid sequence, and Avogadro’s number. Volume fractions ϕ were calculated as , where ρ is the mass density of the lysozyme protein. Though the spatial average of protein density is typically assumed to equal 1.35 g/cm^3^, it is actually molecular-weight-dependent, with smaller molecular-weight proteins like lysozyme having increased density. Using the empirical equation proposed by Fischer et al.,? we find that ρ ≈ 1.46 g/cm^3^ for lysozyme protein, from which we calculate that the ϕ used herein ranges from 0.0017 to 0.013. See Table for complete results for n, c, and ϕ for each nominal concentration.
Small Angle Neutron Scattering
3.4
Small-angle neutron scattering (SANS) experiments were performed at the Bio-SANS beamline (CG-3)? of the high flux isotope reactor at Oak Ridge National Laboratory (ORNL). Samples were measured in 5 mm quartz cylindrical cuvettes (Hellma), and the sample temperature was maintained at 25°C using a Peltier device. To minimize error due to variations in path length between individual cuvettes, the exact same cuvette was used for both buffer and the sample measurements. SANS patterns were obtained in 60 minutes to provide sufficient signal-to-noise ratio.
SANS data were collected using a three-detector array. The small-angle detector was positioned at 7.0 m from the sample, the mid-range detector at 4.0 m with a 2.7° anticlockwise rotation relative to the direct beam, and the high-angle (curved wing) detector at 1.13 m with a 7.25° rotation from the direct beam. This configuration covered a momentum transfer range of 0.007Å^–1^ < Q < 0.85Å^–1^, where the magnitude of the scattering vector sin with λ the wavelength and θ the scattering angle, using neutrons with a wavelength of 6.44 Å and a relative wavelength spread (Δλ/λ) of 0.123. The Panel Scan feature? of the data acquisition system was used to control the instrument during measurement.
Data reduction was performed using the facility’s data reduction toolkit for small-angle neutron scattering (drt-SANS) software.? The reduction included corrections for instrument background, detector sensitivity, and instrument geometry, followed by circular averaging to obtain 1D scattering profiles. Reference measurements of empty beam, beam center, and buffer backgrounds were measured and applied during data reduction with drt-SANS.
SANS data analysis was conducted on the resulting 1D profiles in scattering intensity. The average I(Q)-value between 0.35–0.45 Å was subtracted as background using a custom Python code. The background-subtracted I(Q) data were fitted using Guinier analysis as described below. Background intensity values and SANS data prior to background subtraction are shown in Figure S1.
Results
4
As shown in Section, the observed radius of gyration R _ g,obs_ from traditional Guinier analysis is dependent on the concentration according to the interparticle interaction potential. In particular, R _ g,obs_ ^2^ varies linearly from its infinitely dilute valuedenoted by R _ g,0_ ^2^ herein, which can also be thought of as a “true” R _ g -valuewith a slope that depends on R _ i _ ^2^, the squared “radius of interparticle interaction” that reflects the interactions between particles in the solution. This dependence is given by eq and holds for both spherical and anisotropic solutes. In this section, we demonstrate the extraction of the parameters R _ g,0 and R _ i _ from experimental small-angle neutron scattering (SANS) data on a model system of lysozyme protein in solution. Buffer conditions, sample preparation, and instrument configuration details are provided in Section.
SANS data for the tested lysozyme solutions after background subtraction are shown in FigureA as a function of nominal concentration. Data are shown after normalization by the protein volume fraction ϕ, calculated from the measured concentration (exact values provided in Table) in FigureB. In general, the normalized data overlay well, reflecting the accuracy of the concentration measurements and that there is only minimal effect of the structure factor on the scattering pattern.
SANS data from solutions of lysozyme at 25 mmol/L histidine and 150 mmol/L NaCl in D2O, pH 6, at various nominal concentrations at 25°C. (A) After background subtraction in log–log scale. (B) Data normalized by the lysozyme volume fraction. Insets show data in lin-log scale. Dotted black lines indicate the Q-range used for subsequent Guinier analysis. Error bars indicate detector counting statistics as 1/N , where N is the number of counts at a given Q bin during the exposure.
The modified Guinier analysis (eqs and ?) was conducted over the range indicated by the black dotted lines in Figure, 0.015 Å < Q < 0.07 Å, using a custom Python code. As nominal lysozyme concentration increases from 2.5 to 20 mg/mL, R _ g,obs_ Q max ranges from 0.88 to 1.01 and the fit value of R _ g,obs_ increases from 12.57 ± 0.34 Å to 14.41 ± 0.06 Å. Exact fit values for R _ g,obs_, I(0), and associated errors are provided in Table S2. Linear fits to the data in ln[I(Q)] versus Q ^2^ used to obtain these values are shown in Figure S2. The results are summarized in Figure. Note that the data at very low Q (shown in Figure insets) were not used in the Guinier fits. The exact Q-range was not found to significantly affect the results of the Guinier analysis; variational analysis is provided in Figure S3. Residual analysis of the Guinier fits over the reported Q-range indicates the absence of significant irreversible aggregation, radiation damage, or other artifacts in the data (Figure S4).
Results of the traditional Guinier fit to the low-Q lysozyme scattering intensity. (A) Radius of gyration. (B) Scattering intensity at Q = 0. Fits are shown for a representative Q-range of 0.015 Å < Q < 0.07 Å; variational analysis is provided in Figure S3. Error bars indicate the 1σ uncertainty of the least-squares Guinier fit value (y-error) and of the measured lysozyme concentration (x-error).
Our goal is to demonstrate the application of eq, in this case for relatively dilute solutions of lysozyme where eq can be used to extract the relevant parameters. This requires replotting the R _ g,obs_ values from the Guinier analysis (Figure) as R _ g,obs_ ^2^ versus –nB 22, where B 22 is the second virial coefficient. Using the I(0) data from the Guinier fits, we fit a line to versus n, where n is the lysozyme number density, as shown in Figure. We compute the second virial coefficient from this fit as , where a is the slope of the fit line and b is the y-intercept. For the example data shown, we obtain B 22 = (−3.80 ± 0.56) × 10^–20^ mL. In other commonly reported units, this value is equivalent to B 22 = –22900 ± 3400 mL/mol or B 22 = (−1.12 ± 0.17) × 10^–4^ mol·mL/g^2^. Using this result, we can also calculate S(0) at each protein concentration via S(0,n) = 1 – 2nB 22. The resulting S(0)-values, shown in Figure S5, indicate a deviation of 5% or less from S(Q) = 1 for all lysozyme solutions tested herein. In general, S(Q) slightly exceeds 1, which along with the small negative value of B 22 is consistent with weak attractive interactions between particles in solution.
Obtaining the second virial coefficient (B 22). A linear fit to I(0)/n data gives B 22 [mL] as B 22 = –a/2b, where n is the lysozyme number density, a is the slope, and b is the y-intercept.
Finally, we use R _ g,obs_ and B 22 to extract the radius of interparticle interaction, R _ i , and the infinitely dilute radius of gyration, R _ g,0, based on eq. Figure shows a plot of R _ g,obs_ ^2^ versus –nB 22 since ξ becomes –nB 22 at dilute conditions, along with the resulting linear fit which yields R _ i _ ^2^ as the slope and R _ g,0_ ^2^ as the y-intercept. We find that R _ i _ ^2^ = 1431 ± 158 Å^2^, from which we obtain R _ i _ = 38 ± 2 Å. In addition, we obtain R _ g,0_ ^2^ = 168 ± 3 Å^2^, corresponding to R _ g,0_ = 13.0 ± 0.1 Å. Thus, for this example data set of lysozyme protein in histidine buffer at pH 6 with added NaCl, we find that R _ i :R _ g,0 ≈ 2.9:1, as compared to the theoretical relationship for hard spheres for which R _ i,HS_ = 2R _ g,HS_ (see Section).
*R
i , the radius of interparticle interaction, and R
g,0, the infinitely-dilute radius of gyration, are obtained from our modified Guinier analysis using this plot. The values of R
g,obs from the classical Guinier fit, along with the zero-angle structure factor S(0) as ξ=12S(0)−1S(0) , or ξ = –nB 22 in the dilute case with B 22 is the zero-angle second virial coefficient as shown here (x-axis), are used to create this plot. The linear fit (dotted blue line) returns the values of R
i
2 and R
g,0 2 as the slope and y-intercept, respectively. The shaded region indicates the 95% confidence interval of the fit line.*
Discussion
5
Above, we showed that the “true” or infinitely dilute R _ g -value (R _ g,0) and the radius of interparticle interaction (R _ i ) can be obtained from the low-Q asymptotic behavior of I(Q). We demonstrated this method using example SANS data on solutions of lysozyme, a small and well-studied prolate protein which is often used as a model protein for small-angle scattering measurements. As demonstrated here, the method is straightforward to apply for systems with relatively weak or intermediate interparticle interaction. Whether R _ i _ is measurable for strong interactions will depend on the capability of a scattering instrument and how strongly the sample scatters. For strongly interacting samples, the concentration range for linearity in R _ g,obs ^2^ is likely quite small, requiring good measurement accuracy over a range of low-concentration measurements. In addition, the method is straightforward and most accurately applied in the case of monodisperse solutes like proteins, but will need modification to obtain R _ i _ for samples with significant polydispersity. For high concentration measurements, note that R _ i _ works for S(Q) at low Q for any concentration and shape of particles and does not require a virial expansion, since it is related to the potential of mean force, ω(r), which includes all correlations (see eq).
The quality of scattering data is very important to obtaining an accurate result for R _ i _ and R _ g,0_. In particular, due to the relatively weak scattering signal for dilute solution measurements, correct background subtraction is extremely important. To eliminate differences in background due to thickness variation between sample holder cell walls, we recommend either using the same cell to conduct buffer and sample measurements as in the data set reported herein, or the use of a flow cell. Still, the forward scattering intensity for the nominal 2.5 mg/mL sample shows a slight downturn likely due to low protein concentration and short data collection time, which may have resulted in fitting signal near the beamstop and account for its deviation from linearity in Figure. Therefore, depending on what concentrations correspond to minimal interparticle interaction for a given sample, future measurements of this type could easily push beyond measurable limits of current neutron scattering instruments, and require careful planning and quality evaluation prior to analysis.
Though the lysozyme proteins have attractive interactions, evidenced by the negative B 22-value, interactions between them are weak so clusters that form as a result of the attractive interactions should be reversible. ?−? ? Thus, the residuals of the Guinier fit are qualitatively flat and randomly distributed about zero (Figure S3), rather than showing a characteristic “smile” shape that typically indicates the presence of irreversible aggregates. Our B 22-value near −1 × 10^–4^ mol·mL/g^2^ is close to literature values in similar buffers and temperatures. At 50 mmol/L sodium acetate and pH 4.5 with 200 mmol/L NaCl in H_2_O, the lysozyme B 22 at 20°C is −0.39 × 10^–4^ mol·mL/g^2^.? Lysozyme at pH 6 in aqueous NaCl solution with total electrolyte concentrations 0.1 and 0.3 M in minimal citrate buffer have B 22-values between −1 × 10^–4^ and −2.5 × 10^–4^ mol·mL/g^2^ at 25°C.? Additionally, prior studies of lysozyme solubility in various buffer conditions similar to those used hereinnamely, 150 mmol/L sodium chloride in 20 mmol/L histidine buffer, pH 6 in D_2_O at 25°Chave found the proteins are fully soluble in pure H_2_O at similar concentrations of sodium chloride and protein at slightly lower pH. ?−? ?
The R _ g _ of lysozyme has been extensively investigated before, and our results herein are consistent with the literature. A recent multinational round-robin study investigating protein size from small-angle scattering found values ranging from R _ g _ = 12.16 ± 0.42 Å to R _ g _ = 13.6 ± 0.33 Å for lysozyme in D_2_O depending on the measurement and calculation method. Batch SANS measurements of hen egg lysozyme protein (150 mmol/L sodium chloride in 50 mmol/L sodium citrate buffer, pH 4.5 in D_2_O) resulted in R _ g,obs_-values in the range of 13–14 Å in Guinier analysis (excluding an outlier above 15 Å), with an average of 13.60 ± 0.33 Å where the error indicates one standard deviation. The averaged round-robin data were collected at a variety of concentrations in the dilute regime at the discretion of the participating beamline scientists; the study did not extrapolate the reported values to infinite dilution.? Despite the different buffer, the reported round-robin average is equivalent within error to our R _ g,obs_ average of 13.67 ± 0.10 Å.
As expected, both averaged Guinier R _ g,obs_ values slightly exceed our extrapolated infinitely-dilute R _ g,0_ value of 13.0 ± 0.1 Å. As shown in eq, for relatively dilute concentrations, the negative B 22(0), orattraction between particles, results in a slight increase of R _ g,obs_ with increasing concentration. On the other hand, positive B 22(0), orrepulsion between particles results in a decrease in R _ g,obs_ as the concentration increases. In both cases, R _ i _ reflects the model-independent net displacement from ideality due to interparticle interaction.
R _ g,0_ is relevant to all the tested lysozyme concentrations, in contrast to the values from Guinier analysis which change with concentration even though these are dilute solutions (i.e., minimal interparticle interactions). Our work highlights that the small deviations from S(Q) = 1 at low-Q, which reflect the interparticle interactions in our lysozyme solutions, do play a role in the observed R _ g,obs_ value. This is true even in the dilute regime: S(Q) deviates above 1 by less than 6% in the tested concentration range (Figure S3). Our method enables a means to quantify the degree of this deviation from ideality by introducing the parameter R _ i _.
However, the modified Guinier analysis method described herein does not necessarily require a dilute solution. In fact, the derivation of eqs and ?, which describe S(Q) at low Q, only requires the assumption that Q → 0 without imposing any constraints on the concentration regime. Further, Guinier analysis requires the use of a dilute solution only because of the assumption that S(Q) = 1. We use dilute solutions for our example data because we expect that regime is likely to be most relevant to applications of the method, due to the direct relation between R _ i _ and V(r) at low concentration (eq).
Since this work introduces the parameter R _ i _ for the first time, no literature values are available to compare either the value of R _ i _ (38 ± 2 Å) or its relative size compared to R _ g,0_ found herein. As a point of reference, in Section we calculated the ratio = 2 for the hard sphere potential, as compared to a value of 2.9 from our experimental lysozyme data. Given that lysozyme is a patchy and prolate solute in a liquid solution, we did not expect an agreement with the hard sphere result, since the hard sphere potential assumes perfectly spherical particles with only contact interactions. Yet, the experimental value of the ratio is still of similar order and magnitude. Since the low-Q scattering data only reflect the particle size, and some interaction information through R _ i , distinguishing spherical and anisotropic particles requires small-angle scattering data beyond the low-Q region, which one could obtain in a model-free way from B 22(Q) data.? More broadly, a deeper understanding of the magnitude of R _ i _ and its relationship to R _ g,0 will require substantial further investigation beyond the scope of this study.
We anticipate that future efforts using the modified Guinier analysis method introduced herein will help establish the typical R _ i _ values across a range of model proteins, polymers, and other colloidal systems of industrial relevance; their relation to B 22 and R _ g,0_; and comparison to simulations data. These endeavors will enable fundamental, model-independent insights about the length scale of the effective interaction potential in these experimental systems for the first time. Moreover, because R _ i _ directly reflects the interparticle interaction potential between colloidal particles, it therefore may be relevant to predicting solution properties that are critical to a wide variety of industrial formulations.
Conclusions
6
In analogy to the Guinier radius of gyration (R _ g ), we introduce a new parameter, the radius of interparticle interaction (R _ i ), which is defined as the mean square displacement (MSD) of the total correlation function h(r). In dilute solutions, R _ i _ is directly the MSD of the interparticle potential V(r). Herein, we present a model-independent extension to traditional Guinier analysis that enables simultaneous determination of both R _ i _ and the “true” or infinitely dilute R _ g _ value (R _ g,0) from the low-Q asymptotic behavior of I(Q). We derive a relationship between R _ i _ ^2^, R _ g,0 ^2^, and the observed radius of gyration obtained by traditional Guinier analysis, R _ g,obs_ ^2^, that holds for both spherical and anisotropic solutes. We also derive the modified or revised Zimm equation that incorporates interaction effects. We demonstrate the application of our modified Guinier analysis method using example SANS data from lysozyme solutions. We find that R _ g,0_ = 13.0 Å, consistent with the reported literature values for lysozyme R _ g _ at finite concentration in similar solvents. For R _ g,obs_, the magnitude of the deviation from R _ g,0_ is governed by R _ i _, which is 38 Å for our example data set, indicating the length scale where interactions are significant in our dilute lysozyme solutions. Here, 2.9, around 50% larger than the calculated value of 2 for the hard sphere potential. The method introduced herein both improves the accuracy in measurement and reporting of R _ g _-values and provides a model-independent parameter R _ i _ that directly reflects colloidal interaction potentials in solution. Therefore, we expect that determining the typical magnitude of R _ i _ across a variety of colloidal systems of various identities, flexibilities, and interaction types and understanding its relation to macroscopic properties like high-concentration viscosity will prove useful to future fundamental and industrial research efforts.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Glatter, G. ; Kratky, O. Small Angle X-Ray Scattering; Academic Press, 1982.
- 2Pedersen J. S.Analysis of small-angle scattering data from colloids and polymer solutions: Modeling and least-squares fitting Adv. Colloid Interface Sci.19977017121010.1016/S 0001-8686(97)00312-6 · doi ↗
- 3Chen S.-H.Small angle neutron scattering studies of the structure and interaction in micellar and microemulsion systems Annu. Rev. Phys. Chem.19863735139910.1146/annurev.pc.37.100186.002031 · doi ↗
- 4Jeffries C. M.Ilavsky J.Martel A.Hinrichs S.Meyer A.Pedersen J. S.Sokolova A. V.Svergun D. I.Small-angle x-ray and neutron scattering Nat. Rev. Methods Primers 202117010.1038/s 43586-021-00064-9 · doi ↗
- 5Guinier A.La diffusion des rayons X sous faibles angles, appliquée à l’étude de fines particules et de suspensions colloïdales CR Hebd Seances Acad. Sci.19382061374
- 6Guinier, A. ; Fournet, G. Small-Angle Scattering of X-Rays; John Wiley & Sons, Inc., 1955.
- 7Hansen, J.-P. ; Mc Donald, I. R. Theory of Simple Liquids, 3rd ed.; Elsevier Academic Press, 2006.
- 8Caccamo C.Integral equation theory description of phase equilibria in classical fluids Phys. Rep.1996274110510.1016/0370-1573(96)00011-7 · doi ↗
