Investigating the Effect of Isoelectric Points on the Gas-Phase Stability of Native-like Proteins Analyzed in Positive- versus Negative-Ion Mode by IMS-MS
Alexis N. Edwards, Madeline G. Bannon, Michael S. Cordes, Elyssia S. Gallagher

TL;DR
This study shows that the gas-phase stability of proteins analyzed by IMS-MS depends on the ionization polarity, with proteins being more stable in the polarity that matches their solution-phase charge.
Contribution
The work reveals that ionization polarity significantly affects the gas-phase stability of native-like proteins in IMS-MS.
Findings
Proteins analyzed in the polarity matching their solution-phase charge are more stable in the gas phase.
Positive and negative ions show significant differences in gas-phase stability despite similar initial folded structures.
The energy required for unfolding is higher in the polarity corresponding to the protein's solution-phase charge.
Abstract
Native ion mobility spectrometry-mass spectrometry (IMS-MS) is routinely used for analysis of folded proteins and protein complexes. For many proteins, the three-dimensional structure is maintained during electrospray ionization (ESI) as the protein transitions to the gas phase, allowing for detailed investigation of the gaseous, ionic protein’s structure and stability. Much of the native IMS-MS research has been conducted in positive-ion mode (+ESI), even when the protein of interest has a net-negative charge in solution at physiological pH. We hypothesize that analyzing a protein in the polarity that is opposite to its solution-phase charge, such as analyzing net-negative proteins by +ESI-MS, disrupts the network of noncovalent-bonding interactions within the protein to a greater extent than using the polarity that matches the protein’s solution-phase charge, resulting in differences…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4| protein | number of subunits | molecular weight (kDa) | disulfide bridges | theoretical pI | theoretical | experimental
pI |
|---|---|---|---|---|---|---|
| ovalbumin | 1 | 44.3 | 1 | 5.19 | –10.1 | 4.5 |
|
| 2 | 23.9 | 2 | 4.04 | –7.99 | n/a |
| β-lactoglobulin | 1 | 18.2 | 2 | 4.97 | –5.89 | 4.8–5.2 |
| α-lactalbumin | 1 | 14.2 | 4 | 5.09 | –5.49 | 4.2–4.6 |
| concanavalin A, con A | 1 | 25.6 | 0 | 6.09 | –3.41 | 4.5–5.5, 6.6–7.1 |
| streptavidin | 4 | 55.0 | 0 | 6.71 | –1.23 | 5.0–6.0 |
| ubiquitin | 1 | 8.56 | 0 | 7.02 | +0.01 | 6.79 |
| wheat germ agglutinin, WGA | 2 | 34.4 | 32 | 9.38 | +5.36 | 8.5 |
| cytochrome | 1 | 12.3 | 0 | 11.30 | +7.40 | 10.0–10.5 |
| lysozyme | 1 | 14.2 | 4 | 11.07 | +8.36 | 11.4 |
| protein |
| initial, folded TW(CCS ± CCSuncert)N2→He (Å2) |
| within uncertainty | final, extended TW(CCS ± CCSuncert)N2→He (Å2) |
| within uncertainty | ΔCCS (Å2) |
|---|---|---|---|---|---|---|---|---|
| ovalbumin | 11– | 3500 ± 110 | 9 | √ | 4000 ± 130 | 9 | √ | 600 ± 170 |
| 11+ | 3400 ± 50 | 9 | 4040 ± 60 | 9 | 610 ± 80 | |||
| GNA | 9– | 2180 ± 60 | 9 | 2540 ± 80 | 9 | 360 ± 100 | ||
| 9+ | 2100 ± 30 | 9 | 2670 ± 40 | 9 | 560 ± 50 | |||
| β-lactoglobulin | 7– | 1640 ± 40 | 9 | √ | 1880 ± 50 | 9 | 240 ± 70 | |
| 7+ | 1640 ± 40 | 9 | 1960 ± 30 | 9 | 320 ± 20 | |||
| α-lactalbumin | 6– | 1380 ± 40 | 9 | √ | 1400 ± 40 | 8 | √ | 20 ± 60 |
| 6+ | 1410 ± 20 | 9 | 1410 ± 20 | 9 | N/A | |||
| concanavalin A | 9– | 2150 ± 70 | 9 | 2840 ± 70 | 9 | 470 ± 120 | ||
| 9+ | 2090 ± 40 | 9 | 2880 ± 50 | 9 | 570 ± 70 | |||
| streptavidin | 13– | 3900 ± 120 | 9 | √ | 4700 ± 150 | 9 | 800 ± 200 | |
| 13+ | 3970 ± 60 | 9 | 4960 ± 70 | 9 | 1000 ± 90 | |||
| ubiquitin | 5– | 960 ± 30 | 9 | √ | 1020 ± 40 | 8 | 60 ± 40 | |
| 5+ | 990 ± 20 | 9 | 1070 ± 20 | 9 | 80 ± 20 | |||
| WGA | 10– | 2680 ± 80 | 9 | 3070 ± 90 | 9 | √ | 390 ± 120 | |
| 10+ | 2560 ± 50 | 9 | 2920 ± 40 | 9 | 370 ± 60 | |||
| cytochrome | 6– | 1240 ± 40 | 9 | √ | 1570 ± 50 | 9 | 330 ± 60 | |
| 6+ | 1260 ± 20 | 9 | 1380 ± 20 | 9 | 130 ± 30 | |||
| lysozyme | 6– | 1390 ± 40 | 9 | √ | 1720 ± 60 | 9 | √ | 330 ± 70 |
| 6+ | 1420 ± 20 | 9 | 1710 ± 20 | 9 | 290 ± 30 |
- —National Institute of General Medical Sciences10.13039/100000057
- —Division of Chemistry10.13039/100000165
- —Welch Foundation10.13039/100000928
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsMass Spectrometry Techniques and Applications · Metabolomics and Mass Spectrometry Studies · Ion-surface interactions and analysis
Introduction
In native mass spectrometry (nMS), the native-like structures of proteins are maintained in the gas phase by preserving the secondary, tertiary, and quaternary structures. ?−? ? ? ? ? ? The preservation of native-like protein structures is primarily accomplished via the use of electrospray ionization (ESI), a soft ionization method that forms a distribution of multiply charged protein ions. These multiply charged ions can then be investigated to characterize stoichiometry, oligomeric composition, and ligand binding. ?−? ?
During ESI, a potential is applied to the sample-containing solution. In an aqueous spray solvent, H_3_O^+^ and OH^–^ are produced in positive-ion mode (+ESI) and negative-ion mode (−ESI), respectively.? The buildup of like-charges within the solution results in the formation of a Taylor Cone at the capillary tip, which ejects charged droplets that travel toward the inlet of the mass spectrometer. Droplets undergo evaporation and fission until complete analyte desolvation, leading to the release of folded, gas-phase protein ions. As the droplets shrink, H_3_O^+^ or OH^–^ can become concentrated, causing the pH of the droplet to decrease or increase, respectively. Shifts in pH can denature proteins. Therefore, to maintain native-like protein structures, nMS is conducted with solutions containing volatile salts, usually ammonium acetate, that provide ionic strength and minimize pH changes compared to spraying from pure water. ?,? Droplet pH in ESI is influenced by flow rate, solvent composition, and applied voltage,? but often approaches values near the pK a of acetate (4.76) or ammonium (9.25) in +ESI or −ESI, ?,? respectively. During ESI, changes in overall protein charge take place via proton transfer at chargeable amino acid residues, resulting in the formation of protonated or deprotonated protein ions with the charged sites being dependent on the relative proton affinities of ammonium, acetate, and chargeable amino acids as proteins near complete desolvation. ?−? ? ? ?
Ion mobility spectrometry (IMS) measures an ion’s mobility through an inert gas under the influence of an electric field. Thus, mobility is directly influenced by the ion’s size, shape, and charge.? A protein’s mobility can be used to estimate its structure by calculating an average momentum-transfer collisional cross section (CCS). In addition to structural characterization, IMS-MS can investigate the gas-phase stability of protein complexes when paired with collision-induced unfolding (CIU). Specifically, in CIU experiments, protein ions are accelerated into a cell filled with inert gas molecules and undergo multiple collisions with higher frequency and energy relative to IMS. This converts kinetic energy to internal, vibrational energy, disrupting noncovalent interactions and inducing unfolding of the protein ions.? Unfolding alters the protein structure, which can then be quantified by monitoring the protein’s mobility during subsequent IMS-MS measurement.? Prior work with mutant proteins has shown that proteins with similar intramolecular interactions have similar CIU fingerprints, while proteins with different intramolecular interactions have different CIU fingerprints.? Therefore, gas-phase protein stability is correlated with intramolecular interactions.
Past native IMS-MS work has primarily been conducted in +ESI given that -ESI has the reputation of being unreliable for native-like protein analysis due to the occurrence of corona discharge,? and charge-state distributions that may not be representative of folded proteins.? In recent years, −ESI has been increasingly used in CIU and IMS-MS, with recent work demonstrating that −ESI-MS is capable of discerning denatured and native-like protein structures. ?,? Prior work showed that charge manipulation of folded proteins, such as charge reduction, to achieve similar absolute charge states in both ion modes, results in CCS values of protein cations that are comparable to their anion counterparts. ?,? This suggests that gas-phase structures are weakly dependent on protein charge and ionization polarity. ?,?,? Yet, Hong et al. showed that, in the absence of charge reduction, the gas-phase stability of two proteins (avidin and β-lactoglobulin) is sensitive to ionization polarity, highlighting that gas-phase stability is not consistent between +ESI and −ESI.?
Herein, we explore the relationship between the solution-phase charge of proteins and their stability as either positive or negative ions in the gas phase. We hypothesize that proteins with a net-negative charge in neutral solution experience greater changes to their intramolecular interactions during +ESI; therefore, becoming less stable in the gas phase when compared to −ESI. We hypothesize that the inverse is true for positively charged proteins analyzed via −ESI. To investigate the effect of ESI charging on the gas-phase structure and stability of intact proteins, 10 proteins with a range of oligomeric states and isoelectric points (pI), which define the pH at which proteins have no net charge in solution (see Table), were subjected to CIU-IMS-MS in both +ESI and −ESI. Here, we show that most proteins have comparable folded CCS in +ESI and −ESI, consistent with prior data. ?,?−? ? Though, several of these proteins unfold to extended structures that are not comparable in +ESI and −ESI. Additionally, we show that many proteins require significantly more energy to unfold in one polarity, which often matched their solution-phase charge at physiological pH. Particularly, proteins with a net-negative charge in solution are shown to be significantly more stable in −ESI than in +ESI. Furthermore, disulfide bridges stabilize proteins in both +ESI and −ESI. Therefore, our data indicates that a protein’s gas-phase stability may be inconsistent across MS polarities, highlighting that the polarity used to perform native IMS-MS experiments is an important consideration in experimental design.
1: Analyzed Proteins and Structural Details
Materials
and Methods
Materials
Lyophilized proteins were from Sigma-Aldrich (St. Louis, MO). Methanol was from Thermo Fisher (Thermo Fisher Scientific Inc., Waltham, MA). All other materials were from VWR (Radnor, PA). Nanopure water was acquired from a Purelab Flex 3 purification system (Elga, Veolia Environment S.A., Paris, France).
Sample Preparation
Folded proteins (see Tables and S1) were prepared in 200 mM ammonium acetate (pH ∼ 7) and desalted using Micro Bio-Spin P-6 gel columns (Bio-Rad, Mississauga, ON). Concanavalin A (con A) was analyzed at 20 μM, while all other proteins were analyzed at 10 μM. Denatured myoglobin, cytochrome c, and ubiquitin (10 μM) were prepared for CCS calibration in 49:49:2 (v/v/v) methanol: water: formic acid for +ESI and in 49:49:2 (v/v/v) methanol: water: ammonium hydroxide for −ESI.
Native
IMS-MS Analysis
Glass capillaries were pulled using a P-1000 micropipette puller (Sutter Instrument Company, Novato, CA). Samples and calibrants were sprayed from pulled capillaries (inner diameter ∼1 μm) and analyzed with a Synapt G2-S High-Definition MS (Waters Corporation, Milford, MA). The trap and transfer cells contained Ar, the helium cell contained He, and the IMS cell contained N_2_. IMS wave velocity was 350 m/s, while IMS wave height was either 7 or 9 V, depending on the protein size (see Table S2). Proteins with larger mass required a wave height of 9 V to ensure that all the ions migrated out of the IMS cell prior to the next ion injection. Mass spectra were collected and signal averaged over 30 s. Additional instrument details can be found in the Supporting Information (Table S2).
To limit systematic error due to repositioning of the ESI tip between runs,? the XYZ-stage position was optimized and kept consistent for all replicates. Approximately 5 μL of sample was loaded into a glass capillary, which was then mounted on the XYZ-stage. Electric potential was delivered to the sample by a platinum wire inserted into the capillary and placed in contact with the solution. The voltage applied to the tip holder ranged from 0.60 to 1.2 kV for native samples, and 0.80 to 1.40 kV for denatured samples. Nine replicates in each ion mode were collected for all proteins with measurements spread across 8 days over 3 weeks. These replicates were conducted in pairs by collecting data in both ion modes consecutively. To accomplish this, a CIU experiment was collected in +ESI before the instrument polarity was changed to −ESI. A second CIU experiment was performed, beginning with −ESI to counteract any changes in protein stability due to pH changes during ESI.? This allowed for use of the same capillary, at the same position in relation to the instrument inlet, for each +ESI/–ESI replicate pair. A new tip was used for each +ESI/–ESI replicate pair. CIU experiments were accomplished by increasing collision voltage (CV) to the trap cell in 5 V steps, from 0 V to a maximum of 120 V.
Data Analysis
Charge states for each protein were calculated using peaks with adjacent m/z. Following the recommendations of Gabelica et al.,? experimental CCS values reported in this manuscript are labeled ^TW^CCS_N2→He_, where the superscript indicates the type of IMS used (traveling-wave, TW) and the subscript indicates that experimental values were measured using N_2_ and then converted to He with calibrant values. ^TW^CCS_N2→He_ values and their corresponding uncertainties were calculated using previously described protocols (see Table S3 for drift tube CCS (^DT^CCS_He_) and Figure S1 for calibration curves). ?,? Independent calibration curves were generated for both ion modes, as well as for both the 7 and 9 V IMS wave heights (totaling four calibration curves). For each protein, the arrival time at each CV was used to determine a corresponding ^TW^(CCS ± CCS_uncert_)N2→He. When the mean arrival time increased, resulting in a significant change to ^TW^CCS_N2→He_ (at the 95% confidence level), the resulting structure was considered a new feature. The change in CCS and its corresponding uncertainty (ΔCCS ± ΔCCS_uncert_) was calculated following eqs and ?, respectively.
Here, CCS_folded,uncert_ and CCS_extended,uncert_, are the propagated uncertainties for the CCS of the folded and extended structures, respectively.
For each protein replicate, CV values were converted to laboratory frame energy (eV) by multiplying CV by the absolute charge of the analyte. The corresponding arrival time was plotted versus eV, yielding CIU fingerprints. CIU fingerprints in this manuscript were generated by averaging all individual replicate CIU fingerprints (n was a minimum of eight replicates). All CIU fingerprints were made in CIUSuite2.? The CV at which 50% of the species unfolded were determined by calculating CIU50? values. CIU50 values for replicate analysis are presented as the average ± standard deviation. To calculate CIU50 values using CIUSuite2, the following parameters were used: minimum feature length (steps) = 2, feature allowed width (drift axis units) = 0.5, and maximum CV gap length (CV steps) = 1. Individual CIU50 values for each replicate analysis were determined to calculate an average CIU50 and a corresponding standard deviation (see Table S4). For some proteins, different values were chosen for the highest CV (e.g., 540 eV versus 480 eV) in +ESI and -ESI as they were the last CV step where the signal-to-noise ratio (S/N) was greater than 3. F tests and t-tests (95% confidence level) were used to determine if ^TW^CCS_N2→He_ and CIU50 were significantly different between ion modes.
PROPKA
PROPKA 3.5.0
?,?,?−? ? was used to determine protein pI and predict solution-phase charge at pH 7. Briefly, PROPKA predicts pK a values of ionizable groups in a protein via a user-defined crystal structure (Table S1). These pK a values are then used to determine the pI for the protein, from which a theoretical charge can be calculated at varying pH.
Molecular Dynamics Simulations
Molecular dynamics (MD) simulations were used to determine the degree to which disulfide bridges stabilize gaseous, ionic protein structures after ESI. The GROMACS 2022.3 package? was used to conduct the MD simulations with the CHARMM36 force field.? To study the gas-phase stability of proteins with disulfide bridges, the model proteins lysozyme (PDB: 1DPX ?) and α-lactalbumin (PDB: 1F6R ?) were used. Additional details can be found in the Supporting Information.
Results
and Discussion
Ten proteins were chosen for analysis in +ESI and −ESI in this work (Table). These proteins can be broken into three groups depending on the net-charge in solution at pH 7: net-negative (with pI < 6.5), net-neutral (with 6.5 ≤ pI ≤ 7.5), and net-positive (with pI > 7.5). Five of the ten proteins have a net-negative charge at pH 7, though four of these proteins (con A, β-lactoglobulin, ovalbumin, and α-lactalbumin) are routinely analyzed via +ESI-IMS-MS. ?,? Additionally, each protein had charge-state distributions in +ESI and −ESI that resulted in at least one overlapping charge state (Figure), allowing for direct comparison of ^TW^CCS_N2→He_ and CIU50 in both ion modes.
*Representative mass spectra in +ESI (top) and −ESI (bottom) for (A) ovalbumin, (B) GNA, (C) β-lactoglobulin, (D) α-lactalbumin, (E) con A, (F) streptavidin, (G) ubiquitin, (H) WGA, (I) cytochrome c, and (J) lysozyme. Proteins that have blue or red charge states indicate additional oligomeric states that were detected. Outlines in red boxes are overlapping charge states that are compared herein. Indicates contaminant peaks.
Folded TWCCSN2→He are often Comparable
for Proteins Ionized in Each Ion Mode
To compare folded structures in +ESI and −ESI, each protein was sprayed under native conditions and analyzed using instrumental parameters that limited unintentional ion activation. When comparing overlapping charge states, seven of the ten proteins (ovalbumin, β-lactoglobulin, α-lactalbumin, streptavidin, ubiquitin, cytochrome c, and lysozyme) exhibit similar ^TW^CCS_N2→He_ for the initial, folded structures (Tables and S4). Two of the three net-positive proteins, both net-neutral proteins, and three of the five net-negative proteins follow this trend. Several papers have compared CCS values of folded proteins in +ESI and −ESI. ?,?,?,? Collectively, these papers concluded that gas-phase structures for proteins are comparable across ion modes when sprayed under native conditions. Observing comparable ^TW^CCS_N2→He_ from +ESI and −ESI suggests that, for these seven proteins, the gas-phase structures are not significantly different as either positive or negative ions, indicating that the noncovalent interactions associated with protein structure are similarly maintained during ESI in both ion modes.
2: Comparison of Folded and Extended TW(CCS ± CCSuncert)N2→He for Proteins Analyzed in +ESI and −ESI
For the three proteins (GNA, con A, and WGA) that did not have comparable, folded CCS, ^TW^CCS_N2→He_ was more compact in +ESI compared to −ESI. At pH 7, GNA and con A have a net-negative charge, while WGA has a net-positive charge (Table). IMS-MS experiments and simulations have shown that amino-acid side chains at the protein surface can collapse onto the protein surface during desolvation, ?,? causing compaction of the protein upon entering the gas phase. We hypothesize that for these three proteins, gas-phase compaction may be more extensive for the positive-protein ions compared to the negative-protein ions. Because these three proteins are lectins, or carbohydrate-binding proteins, the carbohydrate-binding pocket may be influencing the observed structural difference in +ESI and −ESI mode. That is, the greater compaction in +ESI may correlate to the empty carbohydrate-binding pockets causing greater protein compaction in positive-protein ions compared to negative-protein ions.
Extended TWCCSN2→He are Different
in both Polarities for Many Proteins
We subjected proteins to CIU in both +ESI and -ESI to compare the ^TW^CCS_N2→He_ of the extended structures. After unfolding by CIU, four out of the ten proteins had extended ^TW^CCS_N2→He_ that are comparable within experimental uncertainty in +ESI and −ESI (Table), suggesting similar extents of unfolding. In solution, two of these proteins (ovalbumin and α-lactalbumin) are net-negative and two proteins (WGA and lysozyme) are net-positive.
GNA and con A, both net-negative proteins in solution, had folded structures that were significantly smaller in +ESI compared to −ESI. Unlike WGA, the third lectin in this study, both GNA and con A unfold to extended ^TW^CCS_N2→He_ that are different, with the expanded structure being significantly larger in +ESI compared to −ESI (Table). Because the +ESI structures go from being significantly smaller when folded, to significantly larger when extended, the ΔCCS in +ESI is much larger than the ΔCCS observed for −ESI. For example, the 9– ions for GNA have ΔCCS of (360 ± 100) Å^2^ while the 9+ ions have ΔCCS of (560 ± 50) Å^2^. This indicates that the extended structures in +ESI and −ESI are not the same. We hypothesize that for these two lectins, the cationic proteins achieve a higher degree of unfolding, which may also be associated with the carbohydrate-binding sites. In comparison, WGA unfolds to extended ^TW^CCS_N2→He_ that are within experimental uncertainty in +ESI and −ESI. Though WGA is also a lectin, this protein contains more disulfide bridges than GNA and con A (Table). Disulfide bridges will be examined in greater detail in the next section.
Cytochrome c, a protein with a net-positive charge in solution at pH 7, exhibited different extents of unfolding in +ESI and −ESI. The 6- ions experienced two unfolding events, while the 6+ ions experienced a single unfolding event; thus, the 6– ions had three distinct structures (initial folded, intermediate, and final extended), while the 6+ ions had two distinct structures (initial folded and final extended). The 6– ions had an extended ^TW^CCS_N2→He_ of (1570 ± 50) Å^2^ that was significantly larger than the extended ^TW^CCS_N2→He_ of the 6+ ions (1380 ± 20) Å^2^. When comparing the folded and extended structures, the 6– ions experience a ΔCCS of (330 ± 60) Å^2^ while the 6+ ions experience a smaller increase in CCS (ΔCCS) of (130 ± 30) Å^2^. The significant difference in ΔCCS is due to the 6– ions experiencing a second unfolding event compared to the 6+ ions (Table). Interestingly, the extended ^TW^CCS_N2→He_ (1380 ± 20) Å^2^ for the 6+ ion was comparable to the intermediate ^TW^CCS_N2→He_ (1390 ± 40) Å^2^ of the 6– ion. This suggests that the 6+ ions do not unfold to the fully extended structure exhibited by the 6- ions prior to loss of ion signal.
Proteins analyzed by CIU using both +ESI and −ESI exhibited diverse unfolding behaviors. Four of the ten proteins showed similar extended structures across both ionization modes, while the remaining six displayed significant differences. Two net-negative proteins, GNA and con A, had smaller folded structures in +ESI that unfolded to larger extended structures than in −ESI, resulting in greater ΔCCS values, thus suggesting more extensive unfolding for the cations. Cytochrome c, a net-positive protein, exhibited distinct unfolding pathways that suggest the cationic species underwent incomplete unfolding compared to the anion species. Overall, these findings indicate that ionization mode influences the extent of unfolding and the structural transitions observed in CIU experiments.
Proteins
with Disulfide Bridges are Stable in Both Polarities
For both lysozyme and α-lactalbumin, the average ^TW^CCS_N2→He_ were within experimental uncertainty in +ESI and −ESI for both the initial, folded structures and the final, extended structures (Table). However, the 6+ and 6- ions of lysozyme and α-lactalbumin showed a high resistance to unfolding with >90% of the ions retaining their folded ^TW^CCS_N2→He_ values even at high CV (Figure). Replicate data showed similar trends (Figure S2). While CCS values provide a description of gas-phase structures, CIU experiments provide information on the stability of the gas-phase protein structures. Furthermore, the number and pattern of disulfide bridges present in antibodies has been characterized by CIU.? Lysozyme and α-lactalbumin both contain the glycosyl hydrolases family 22 (GH22) domain, which has four disulfide bridges that span ∼94% and ∼93% of the amino acid sequence, respectively (Figure S3). Thus, we hypothesize that rather than effects from protein pI and ionization polarity, the unfolding behavior of these proteins is controlled to a greater extent by the presence of stabilizing disulfide bridges that prevent routine unfolding during collisional activation.
Representative mobiligrams showing arrival times for lysozyme (A, C) and α-lactalbumin (B, D) at low and high CV (blue solid and orange dashed lines, respectively) for +ESI (A, B) and −ESI (C, D). For lysozyme, different values were chosen for the high CV (e.g., 540 eV in +ESI and 480 eV in −ESI) because these were the last analyzed CV where the signal-to-noise ratio (S/N) for the 6 ± ions was greater than 3.
To characterize the stabilizing effect of disulfides, MD simulations were conducted that heat denatured 6+ ions of lysozyme and α-lactalbumin with disulfide bridges either present or removed with the resulting cysteine residues being protonated prior to heating (Figure S4). Structural changes associated with heat denaturation were characterized by taking the root-mean-square deviation (RMSD) of the heated structures relative to initial, crystal structures. Over ten trials, lysozyme ions possessed an average RMSD of (1.6 ± 0.2) nm when disulfides were present which increased to (3.5 ± 0.5) nm when disulfides were removed. Similarly, the RMSD of α-lactalbumin ions increased from (1.8 ± 0.1) nm when disulfides were present to (3.7 ± 0.3) nm when disulfides were removed, suggesting that the disulfides serve as a barrier to unfolding. This is visually represented in Figure S4C,D where simulated structures of heated lysozyme and α-lactalbumin ions with intact disulfides adopt more condensed configurations relative to those with disulfides removed. In essence, the disulfides “tie” disparate regions of protein together, which serves as a barrier to more thorough unfolding even at a maximum simulated temperature of 900 K, explaining the similarity of ^TW^CCS_N2→He_ values in +ESI versus −ESI as well as the limited percentage of ions that undergo a change in ^TW^CCS_N2→He_ upon additional collisional activation.
WGA has 16 disulfide bridges per monomeric subunit (Table). On average, these disulfides encompass seven amino acids each, with ∼78% the total amino acids in WGA being encompassed by at least one disulfide bridge (Figure S3). Compared to lysozyme and α-lactalbumin, the presence of disulfide bridges across small numbers of amino acids enables WGA to unfold with ΔCCS of (370 ± 60) Å^2^ and (390 ± 120) Å^2^ in +ESI and −ESI, respectively (Table). However, in comparison to the other lectins, GNA and con A, we hypothesize that these disulfide bridges in WGA limit the extent of unfolding, resulting in the extended, ^TW^CCS_N2→He_ values being within experimental uncertainty in both ion modes.
To summarize, lysozyme and α-lactalbumin exhibit similar gas-phase behavior in both +ESI and −ESI, with most ions retaining folded structures even under high CV. We hypothesize this resistance to unfolding is largely attributed to their four disulfide bridges, which span over 90% of their sequences and act as stabilizing constraints. MD simulations confirm that removing these disulfides significantly increases structural deviation during heat denaturation, indicating their critical role in maintaining compact configurations. Consequently, the presence of disulfide bridges explains the minimal changes in ^TW^CCS_N2→He_ and the proteins’ high stability under activation.
Proteins Often Require More Energy to Unfold
in the Polarity Matching Their Solution-Phase Charge
Hong et al. reported that avidin and β-lactoglobulin required different amounts of energy to unfold in +ESI and −ESI, resulting in different intermediate species.? We hypothesize that this behavior is related to differences in a protein’s net solution- versus gas-phase charge. Therefore, using the proteins in Table, we quantified the CIU50 to compare the energy required to unfold proteins with varying pI.
Of the analyzed proteins that are negatively charged in solution, four of the five (monomeric ovalbumin, dimeric GNA, monomeric β-lactoglobulin, and monomeric con A) required significantly more energy to unfold in -ESI compared to +ESI (FiguresA, S5, and S6). For example, with ovalbumin the 11+ ions have a CIU50 of (460 ± 30) eV, while the 11– ions have a CIU50 of (560 ± 30) eV. Therefore, the energy required to unfold ovalbumin in +ESI is significantly lower than in −ESI, though the ions exhibited similar folded and extended ^TW^CCS_N2→He_. This indicates that the positive-ions are less stable than the negative-ions because less energy was required to unfold the ions.
Proteins require more energy to unfold in the polarity that results in the smallest change-in-charge (Δz) compared to their solution-phase charge. (A) Most proteins require more energy to unfold in one polarity. Each bar represents the average CIU50 with error bars corresponding to the standard deviation for the listed charge state. Proteins that have a net-negative, neutral, or net-positive charge in solution at pH 7 are in blue, gray, and orange, respectively. A star represents a statistical difference at the 95% CI. (B) Gas-phase proteins experience a larger Δz in the polarity opposite their net-charge in solution at pH 7. The top portion (orange) represents the Δz for +ESI, and the bottom portion (blue) represents the Δz for −ESI. Δz was determined by comparing the charge states analyzed in +ESI and −ESI (in part A) to the theoretical charge in solution at pH 7.0 using PROPKA. The following abbreviations are used: β-lactoglobulin (β-lacto.) and streptavidin (Strept.). (A and B) Proteins are listed from left to right in order of increasing theoretical charge in solution at pH 7.
Ovalbumin, GNA, β-lactoglobulin, and con A all experience a larger change-in-charge (Δz) during +ESI compared to −ESI, due to the proteins having a net-negative charge in solution (Table and FigureB). For example, ovalbumin has a predicted solution-phase charge of −10.1 at pH 7 with a pI of ∼5.19. Thus, the 11+ ions experienced a Δz during ESI of +21.1, while the 11– ions experienced a Δz of −0.900. This is concerning because most proteins, regardless of their net-charge in solution, are routinely analyzed in +ESI.
Tetrameric streptavidin, which has a solution-phase charge of −1.23 at pH 7 due to its pI of ∼6.71, has a near-neutral net charge. The 13 ± ions were routinely observed to have comparable folded ^TW^CCS_N2→He_ that unfold to different extended ^TW^CCS_N2→He_. The 13+ and 13- ions required similar energy, (450 ± 30 eV) and (490 ± 60 eV), respectively, to unfold (FigureA). For this protein, the 13+ ions experience a Δz of +14.2 while the 13– ions experience a Δz of −11.8 (Table and FigureB). The streptavidin Δz values are more comparable than the Δz observed for ovalbumin and the other proteins with a greater net-negative charge in solution. The energy required for streptavidin to unfold in +ESI and −ESI are comparable within experimental uncertainty, suggesting that this protein does not exhibit varying gas-phase stability when analyzed in either ion mode. We hypothesize that proteins with net-neutral charge in solution, which experience similar Δz during +ESI and −ESI, have comparable stability in both ion modes.
We hypothesize that proteins with a net-positive charge in solution, such as WGA and cytochrome c, will be more stable in +ESI, due to the smaller Δz. WGA has a theoretical solution-phase charge of +3.64 due to its pI of ∼9.05. Interestingly, the positive and negative ions have folded ^TW^CCS_N2→He_ that are significantly different, but the extended ^TW^CCS_N2→He_ are comparable within experimental uncertainty. The 10+ ions have a CIU50 of (660 ± 30) eV while the 10– ions have a smaller CIU50 of (533 ± 3) eV (FiguresA, S5 and S6). WGA requires significantly more energy to unfold in +ESI than in −ESI. During the ESI process the 10+ ions of WGA experience a smaller Δz (+6.36) compared to the Δz (−13.6) of the 10– ions (Table and FigureB). In direct contrast to the proteins with a negative or neutral charge in solution, WGA requires significantly more energy to unfold in +ESI compared to −ESI, indicating an increased structural stability in +ESI.
Cytochrome c has a solution-phase charge of +7.40 due to its pI of ∼11.3; this gives the 6+ ions a smaller Δz (−0.40) than the 6- ions (−14.40). Cytochrome c’s positive and negative ions have folded ^TW^CCS_N2→He_ values that were comparable within experimental uncertainty (Table) but have extended ^TW^CCS_N2→He_ that are not comparable within experimental uncertainty. In fact, we see that the 6- ions have two CIU50 values of (111 ± 4) eV and (150 ± 20), for the first and second unfolding events, respectively, while the 6+ ions have a single CIU50 value of (70 ± 20, Figure). The lower CIU50 value for the 6+ ions may initially lead one to think that the +ESI ions require less energy to unfold than the −ESI ions, but paired with the fact that the extended structure for the 6+ ions is comparable to intermediate structure of the 6– ions, we hypothesize that the 6+ ions do not unfold to the fully extended structure exhibited by the 6– ions prior to loss of ion signal (Figures and S6). Taken together, this data suggests that for cytochrome c, which is net-positive in solution, the 6+ ions exhibit stronger gas-phase stability than the 6– ions.
Cytochrome c is more stable in +ESI compared to −ESI. CIU fingerprints of cytochrome c for the 6+ (A) and 6– (B) ions. Cytochrome c has one additional unfolding event in −ESI compared to +ESI. Each fingerprint is an average of all replicates for that ion.
This data shows that there are differences in the gas-phase stability for many proteins when ionized in +ESI versus −ESI. We hypothesize that these shifts in gas-phase stability are correlated to an increase in Δz relative to a protein’s solution charge. Protonation patterns in the gas-phase differ from solution, favoring neutralized rather than ionic forms of chargeable amino acids. However, upon ionization, larger swings in Δz may cause more significant changes to the noncovalent interactions in gas-phase proteins relative to their “native” state. Here, proteins that were analyzed in the polarity opposite to their solution-phase charge exhibited larger Δz and lower stability. Prior work showed that alterations to the electrostatic interactions at protein surfaces resulted in changes in gas-phase stability.? Alternatively, streptavidin had similar Δz and stability in both ion modes. Thus, we hypothesize that the intramolecular interactions for cationic and anionic streptavidin were similar in both ion modes. Furthermore, MD simulations of ESI suggest that the protonation patterns of protein ions are more similar to their biological structures when ionized in the mode that corresponds to their solution-phase charge.? Thus, proteins with larger Δz were expected to have more extensive restructuring of intramolecular interactions compared to the solution-phase structures.
Conclusions
IMS-MS allows for detailed analysis of the structure and stability of native-like proteins in the gas phase. The structure of a protein is dictated by its amino acid sequence and the resulting noncovalent interactions that form the secondary, tertiary, and quaternary structures. Chargeable amino acid residues that are solvent exposed can carry charges leading to the protein having a net-positive, net-negative, or net-neutral charge in solution at physiological pH. Yet, most native IMS-MS analysis is conducted in +ESI, leading to proteins that are net-negative in solution experiencing large swings in their net-charge during the ESI process.
This work highlights that gas-phase stability of native-like proteins varies with ionization polarity. While CCS values for folded structures are generally comparable between +ESI and −ESI, unfolding energetics differnet-negative proteins are more stable in −ESI, whereas net-positive proteins are more stable in +ESI. We hypothesize that these differences are due to changes in the intramolecular interactions within cationic and anionic proteins. Therefore, proteins analyzed in the mode opposite to their solution-phase charge may have different gas-phase structures compared to biological systems. For example, proteins that are net-negatively charged in physiological environments (pI < 6.5) are more likely to represent their biological counterparts as gas-phase anions when analyzed in −ESI compared to +ESI, enabling nMS experiments to more closely characterize the structures present in biological systems.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Lodish, H. Molecular Cell Biology; Freeman, 2001.
- 2Ghosh, A. ; Nandy, A. Chapter 1- Graphical representation and mathematical characterization of protein sequences and applications to viral proteins. In Advances in Protein Chemistry and Structural Biology; Donev, R. , Ed.; Academic Press, 2011; Vol. 83, pp 1–42.10.1016/B 978-0-12-381262-9.00001-XPMC 715026621570664 · doi ↗ · pubmed ↗
- 3Greene L. H.Protein structure networks Brief. Funct. Genomics 201211646947810.1093/bfgp/els 03923042823 · doi ↗ · pubmed ↗
- 4Greene L. H.Higman V. A.Uncovering network systems within protein structures J. Mol. Biol.2003334478179110.1016/j.jmb.2003.08.06114636602 · doi ↗ · pubmed ↗
- 5Schlick, T. Protein Structure Hierarchy. In Molecular Modeling and Simulation: An Interdisciplinary Guide: An Interdisciplinary Guide; Springer, 2010; Vol. 21, pp 105–128.
- 6Bhagavan, N. V. ; Ha, C.-E. Three-Dimensional Structure of Proteins. In Essentials of Medical Biochemistry; Bhagavan, N. V. ; Ha, C.-E. , Eds.; Academic Press, 2011; pp 29–38.
- 7Klotz I. M.Langerman N. R.Darnall D. W.Quaternary structure of proteins Annu. Rev. Biochem.1970391256210.1146/annurev.bi.39.070170.0003254920824 · doi ↗ · pubmed ↗
- 8Blackwell A. E.Dodds E. D.Bandarian V.Wysocki V. H.Revealing the quaternary structure of a heterogeneous noncovalent protein complex through surface-induced dissociation Anal. Chem.20118382862286510.1021/ac 200452 b 21417466 PMC 3343771 · doi ↗ · pubmed ↗
