Comparison of the Human Plasma Peptides from the Fit of Fragmentation Spectra versus Accurate Monoisotopic Precursor Mass
Zhuo Zhen Chen, Jaimie Dufresne, Peter Bowden, John G. Marshall

TL;DR
This study compares two mass spectrometry techniques for identifying plasma peptides by analyzing fragmentation spectra and precursor mass accuracy.
Contribution
The study demonstrates that accurate peptide identification in human plasma can be achieved using MS/MS spectra alone without precise precursor mass measurements.
Findings
Peptide identifications using MS/MS spectra alone showed high agreement between OIT and LIT instruments.
Delta mass plots matched predicted isotope and hydrogen rearrangement distributions, validating MS/MS-based identification.
Over 99.9% overlap in protein gene symbols identified by both instruments confirmed robust and reproducible results.
Abstract
In nature, ionized peptides with heavy isotopes and hydrogen rearrangements show a broad mass distribution with signals at discrete delta mass values from −3 to +5 Da by mass spectrometry (MS). For many peptides, the intensity of the +1 or +2 Da isotope exceeds the signal from the monoisotopic mass. Therefore, there is a need for a method that improves peptide identification from heavy isotopes or hydrogen rearrangements based on the fit of tandem mass spectra. Peptides may be identified using an accurate monoisotopic precursor mass with ≤0.1 Da. However, many peptides with heavy isotopes and H-loss can be identified and enumerated based on the fit of their MS/MS spectra alone in the absence of an accurate precursor monoisotopic mass (i.e., ± 3 Da) using the X!TANDEM MS/MS fitting algorithm. In this study, human plasma samples were analyzed with a highly resolving axially harmonic…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8| OIT_ ± 0.1 Da (N) | OIT ± 3 Da (N) | LIT ± 3 Da equal peptide species (N) | LIT ± 3 Da 99% gene symbol overlap to OIT (N) | ||||
|---|---|---|---|---|---|---|---|
| ALB | 507 | ALB | 2743 | ALB | 2797 | ALB | 16,405 |
| C3 | 334 | C3 | 936 | APOA1 | 1434 | APOA1 | 5259 |
| FGA | 253 | IGH@ | 824 | FGA | 1195 | TF | 4883 |
| CP | 252 | TF | 764 | TF | 1023 | C3 | 3818 |
| APOA1 | 236 | CP | 646 | C3 | 913 | IGH@ | 3665 |
| TF | 216 | APOA1 | 641 | HP | 834 | HP | 3585 |
| SERPINA1 | 203 | FGA | 641 | FGB | 773 | SERPINA1 | 3098 |
| C4B_2 | 192 | HEL-214 | 626 | CP | 730 | FGA | 2766 |
| C4B | 190 | SERPINA1 | 583 | IGH@ | 676 | HEL-214 | 2732 |
| C4A | 187 | IGHG1 | 564 | FGG | 571 | IGHG1 | 2592 |
| IGH@ | 186 | ORM1 | 538 | C4B_2 | 518 | A2M | 2361 |
| APOA4 | 181 | HP | 515 | C4B | 512 | FGB | 2029 |
| FGB | 161 | IGHG2 | 487 | SERPINA1 | 508 | FLJ00385 | 1926 |
| ORM1 | 146 | C4B_2 | 450 | C4A | 487 | IGHG3 | 1926 |
| ITIH4 | 143 | FGB | 447 | HEL-214 | 453 | IGHG4 | 1792 |
| HEL-214 | 141 | C4B | 444 | HPR | 450 | IGL@ | 1783 |
| HP | 141 | C4A | 437 | IGHG1 | 426 | HPR | 1780 |
| A2M | 135 | IGL@ | 432 | APOA4 | 386 | CP | 1750 |
| FGG | 125 | IGK@ | 428 | ITIH2 | 376 | C4B_2 | 1646 |
| AZGP1 | 122 | APOA4 | 425 | A2M | 373 | C4B | 1628 |
| HPX | 118 | HEL-213 | 407 | IGL@ | 373 | IGHG2 | 1607 |
| IGHG1 | 114 | ORM2 | 406 | HPX | 355 | C4A | 1587 |
| IGL@ | 114 | AZGP1 | 395 | A1BG | 351 | APOA4 | 1549 |
| ORM2 | 108 | FLJ00385 | 375 | SNC73 | 333 | FGG | 1542 |
| SERPINC1 | 100 | IGHG3 | 375 | IGHA1 | 330 | HPX | 1521 |
| FLJ00385 | 95 | IGHG4 | 368 | IGHG4 | 316 | IGK@ | 1492 |
| IGHG2 | 94 | A2M | 366 | FLJ00385 | 307 | HEL-213 | 1452 |
| IGHG3 | 94 | ITIH4 | 365 | IGHG3 | 307 | APOA2 | 1397 |
| TTR | 93 | FGG | 337 | APOA2 | 275 | SNC73 | 1300 |
| HPR | 90 | HPX | 323 | CFH | 264 | IGHA1 | 1279 |
| GC | 88 | IGKC | 295 | IgLC-rG | 258 | IGKC | 1265 |
| SERPINA3 | 86 | GC | 279 | ITIH1 | 256 | IgLC-rG | 1254 |
| A1BG | 84 | A1BG | 266 | IGHG2 | 243 | IGLC3 | 1081 |
| HBB | 83 | SERPINC1 | 257 | IGK@ | 243 | IGLC2 | 1079 |
| IGK@ | 82 | HBB | 252 | HEL-213 | 235 | A1BG | 1029 |
| HEL-213 | 81 | SNC73 | 241 | IGHA2 | 227 | ITIH2 | 1021 |
| CLU | 76 | TTR | 232 | IGLC2 | 220 | HBB | 1004 |
| IGHG4 | 74 | IgLC-rG | 230 | IGLC3 | 220 | IGHA2 | 903 |
| CFB | 71 | IGHA1 | 224 | IGKC | 218 | AAT | 891 |
| ITIH2 | 71 | SERPINA3 | 220 | KNG1 | 216 | IGLL5 | 814 |
| APCS | 70 | IGLC3 | 216 | VTN | 212 | ITIH1 | 766 |
| ITIH1 | 62 | HPR | 214 | AHSG | 200 | ORM1 | 763 |
| SNC73 | 62 | IGLC2 | 214 | ITIH4 | 193 | ITIH4 | 755 |
| CFH | 61 | CFH | 204 | AAT | 164 | IGLC6 | 751 |
| HRG | 61 | IGLC7 | 193 | IGLL5 | 162 | AHSG | 740 |
| IGHM | 61 | IGHA2 | 182 | GC | 159 | CFH | 732 |
| HBA1 | 60 | ITIH2 | 181 | IGLC7 | 155 | IGLC7 | 726 |
| HBA2 | 60 | AHSG | 176 | IGLC6 | 151 | GC | 719 |
| KNG1 | 59 | APCS | 174 | HBB | 149 | C1 | 704 |
| AFM | 57 | ITIH1 | 172 | CLU | 135 | IGLC1 | 696 |
- —Natural Sciences and Engineering Research Council of Canada10.13039/501100000038
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Proteomics Techniques and Applications · Protein Structure and Dynamics · Enzyme Structure and Function
Introduction
Natural peptides contain many heavy isotopes^1^ and may undergo hydrogen rearrangements in the gas phase.^2,3^ Thus, only a fraction of peptide ions can be detected by MS/MS based on their monoisotopic mass. Clearly, limiting the analysis to the monoisotopic mass alone may reduce the overall sensitivity of the LC-ESI-MS/MS method for challenging applications like analysis of plasma.^2,3^ The precursor masses of peptides derived from natural sources, for example, human plasma proteins, may have a wide distribution and differ from the monoisotopic mass by many Daltons (e.g., −3 to +5 Da).^2,3^ Analyses of highly purified viral coat proteins,^4^ protein standards,^5,6^ and synthetic peptide standards^7^ provided strong experimental support for the validity of peptide identification from the fit of tandem mass spectra (MS/MS) alone in the absence of an accurate estimate of the precursor peptide mass. Experiments based on results from random protein libraries^8^ that include wide search parameters^9^ and random or “noise” MS/MS spectra^5,6^ provided strong statistical support for the reliable identification of peptides from fit MS/MS spectra alone^10−12^ where confidence builds with replication.^13^ Similarly, statistical methods, for example, the X!TANDEM goodness-of-fit algorithm, provide protein p-values that are direct estimates of the type I error rate from the fit of MS/MS spectra and peptides^10^ that may be corrected to false discovery rate (FDR) q-value using the well-established method of Benjamini and Hochberg.^14^ The FDR q-value agreed with the results of the Monte Carlo simulation^15−18^ which revealed that false positives were randomly distributed across all peptides in the database while true positive results were sharply focused on a subset of peptides with high observation frequencies.^5,6,8^ The MS/MS spectra from natural peptides, including those with heavy isotopes, were reliably identified without accurate peptide mass measurements^19−21^ but did require correction against the “noise” MS/MS analytical and random MS/MS spectra statistical controls to correct the observation frequency of peptides from large proteins or extensive protein families, e.g., titan (TTN), obscurin (OBSC), microtubule-actin cross-linking factor 1 (MACF1), immunoglobulin (Ig), major histocompatibility complex (MHC) and others.^5,6,22,23^ Taken together, it was possible to analyze the MS/MS spectra alone to identify and enumerate many peptides from most proteins.
There is limited direct biophysical evidence indicating that the fit of peptides from MS/MS spectra alone has a low error rate. Powerful and important findings might be drawn from the delta mass distribution obtained from the fit of MS/MS spectra from the OIT to the known peptides with heavy isotopes and hydrogen rearrangements. If the peptides identified by the trihybrid orbital ion trap (OIT) from the fit of MS/MS spectra alone showed the expected delta mass distributions from heavy isotopes and hydrogen rearrangements, these observations would provide strong biophysical support for identification from peptide fragmentation (MS/MS) without an accurate mass. One strategy that might be used to support the validity of peptides identified from MS/MS spectra with an ion trap might be to compare peptides identified from an analysis of the precursor at the monoisotopic mass versus those with delta mass values consistent with heavy isotopes and hydrogen rearrangements.^2,3^ In addition, the fit of MS/MS spectra with the OIT might be compared to MS/MS spectra using a linear quadrupole ion trap (LIT) that showed excellent agreement but that the LIT was more sensitive.^24^ Similarly the LIT provided comparable results but was more sensitive for targeted proteomics.^25^ Comparisons of low-resolution ion traps to high-resolution quadrupole-time-of-flight (Qq-TOF) or Fourier-transform ion cyclotron resonance (FTICR) revealed that results from these instruments were in agreement on high abundance proteins but that the simple ion trap was more sensitive.^26−29^ The use of the LIT resulted in high observation frequencies for plasma proteins that were also detected by the OIT.^23^ In this study, the direct biophysical measurement of delta mass distribution, the agreement of the monoisotopic MS/MS spectra with those of hydrogen rearrangements and heavy isotopes, the concordance of results from the OIT and LIT, the independent statistical analysis of the FDR q-value, and the comparisons to random MS/MS spectra collectively provide an unambiguous demonstration that the peptides and proteins from human plasma may be identified at a low error rate from the fit of MS/MS spectra alone using the OIT or LIT instrument without the need for an accurate measure of monoisotopic mass.
Results
The tryptic peptides from human plasma were analyzed by LC-ESI-MS/MS using the accurate mass of the trihybrid OIT and the MS/MS of the OIT and simple LIT; the results were collected and rendered nonredundant in an SQL Server database for comparison. The MS/MS spectra from the two different instruments and the results obtained from the LC-ESI-MS/MS experimental setups were compared using the X!TANDEM algorithm. The Best Fit Per MS/MS Spectra (BFPS) peptides were selected, and the results were then graphically and statistically analyzed using the R statistical system. The steps to making a highly sensitive analysis of human plasma have been examined (Figure 1): The simple ion trap may identity fully tryptic and nontryptic peptides of blood;^27,30^ the methods to precipitate plasma proteins were compared;^31,32^ partition chromatography^27,33^ performed better than depletion chromatography;^34^ the optimal QA chromatography was compared under different salt elution regimes;^22^ the rigorous X!TANDEM algorithm was validated with noise and random spectra using p-values versus decoy library;^35,36^ purified peptide and proteins standards were used to select the optimal search parameters of ±3 Da precursor and ±0.5 Da fragments to identify and quantify proteins from MS/MS spectra;^5,6^ optimized tryptic digestions conditions with trypsin alone were established to avoid the digestion of albumin.^27^ The SQR Server removed redundant protein assignments selecting the single best fit of MS/MS spectra^37,38^ and the R Statistical system identified some 13,000 proteins from plasma with the rigorous X!TANDEM algorithm after Chi Square analysis against noise and random MS/MS before computing the gene symbol p-value and FDR q-value.^39^ The delta mass distribution of peptides identified by the OIT instrument using MS/MS spectra alone exactly matched the predicted distribution of peptides with hydrogen rearrangements and heavy isotopes. The findings generated using the OIT, which has a much higher mass resolution, were in excellent agreement (99.9%) with those from the sensitive LIT where the two instruments identified similar peptides from an overlapping set of human plasma proteins.
Flowchart of the experiment design to compare the results of OIT versus LIT for the analysis of tryptic peptides from human plasma proteins. Normal human plasma (NHP) was precipitated in ACN and resuspended in 20 mM tris pH 8.85 for protein assays with the addition of urea and ACN prior to digestion. Disposable C18 columns were used to collect peptides for LC-ESI-MS/MS.
Orbital Ion Trap (OIT) Delta Mass Distribution
The fragmentation spectra (524,867 MS/MS) from the OIT were matched to peptides with a wide mass tolerance and the resulting delta mass distribution was plotted. The OIT data revealed clear peaks in the delta mass distribution at −3, −2, −1, 0, +1, +2, +3, +4, and +5 Da (Figure 2A) which exactly matched the distributions predicted for heavy isotopes and hydrogen rearrangements observed in natural peptides^2,3^ and so is powerful biophysical evidence supporting the veracity of the fit of MS/MS spectra independent of the precursor mass. The precursor intensity distribution was Gaussian from E6 to E10 counts and focused at E8 counts (Figure 2B). The delta mass values were highly concentrated at −3 to +3 Da (Figure 2C). In total, the OIT identified 60,000 peptides from ≥9000 protein gene symbols with wide delta mass distribution (Figure 2D,E) that included peptides with the expected heavy isotope and hydrogen rearrangements. The results indicated that the high energy levels required to resonate in the orbital trap generate hydrogen rearrangements.
Distributions of the mass accurate results with heavy isotopes and hydrogen rearrangements of MH ± 3 Da showing the delta mass and intensity results of the human plasma protein data from orbital ion trap (OIT). Panels: (A) The delta mass distribution of tryptic peptides from human plasma proteins; (B) The Gaussian log10 precursor intensity distribution; (C) The distribution of mean precursor intensity over protein accessions; (D) The distribution of mean precursor intensity over protein gene symbols. The results from 524,867 MS/MS peptide identification events (delta mass −3 to +5 Da) are shown.
Orbital Ion Trap (OIT) Monoisotopic Mass Distribution
The fragmentation spectra (524,867 MS/MS) generated by the OIT instrument displayed distinct peaks within the monoisotopic mass distribution from −0.05 to +0.05 Da (Figure 3A). The precursor intensity distribution was Gaussian from E6 to E10 counts and focused at E8 counts (Figure 3B). Approximately 12,000 peptides (Figure 3C) from 400 human plasma proteins (Figure 3D) were identified at or near the monoisotopic mass. The highly accurate agreement between the predicted precursor mass (MH) and the observations from the OIT clearly demonstrate a low error rate for the analysis of monoisotopic peptides by MS/MS spectra alone.
Distributions of the mass accurate data from monoisotopic peptides MH ± 0.1 Da showing the delta mass and intensity results of the human plasma protein data from the OIT. Panels: (A) The distribution of delta mass for tryptic peptides from human plasma proteins; (B) Gaussian log10 monoisotopic precursor intensity distribution; (C) The distribution of mean monoisotopic precursor intensities over protein accessions; (D) The distribution of mean monoisotopic precursor intensity over protein gene symbols. Only the monoisotopic results (delta mass −0.1 to +0.1 Da) from a total of 524,867 MS/MS peptide identification events are shown.
Linear Quadrupole Ion Trap (LIT) Delta Mass Distribution
A roughly equal number of identified peptides were obtained using fragmentation spectra (1,025,423 MS/MS) from the LIT that were matched with a wide precursor mass tolerance and the resulting delta mass distributions were plotted. The MS/MS spectra generated by the LIT instrument displayed a delta mass distribution peak from 0 to +2 Da (Figure 4A) that matched the predicted distribution of natural human plasma peptides.^2,3^ The precursor intensity distribution was Gaussian from E4 to E7 counts and focused on E5 counts (Figure 4B). In order to compare the results from MS/MS spectra to those of accurate mass on the basis of equal identified peptides, the LIT was observed to identify a similar number of peptides (∼60,000) (Cf. Figure 2C vs 4C) from 11,000 protein gene symbols from twice as many MS/MS spectra and at ten times the flow rate. The LIT mass analyzer did not seem to generate many peaks from hydrogen rearrangements with the high frequencies observed in spectra generated by the OIT.
Distributions of the fit of MS/MS spectra in the absence of an accurate mass values (MH ± 3 Da) showing the delta mass and intensity results of the human plasma protein data from the linear quadrupole ion trap (LIT). Panels: (A) The distribution of delta mass for tryptic peptides from human plasma proteins; (B) The Gaussian log10 precursor intensity distribution; (C) The distribution of mean precursor intensity over protein accessions; (D) The distribution of mean precursor intensity over protein gene symbols. The results from 1,025,423 MS/MS peptide identification events are shown.
Observation Frequencies from the Fit of Peptide Fragments
Observation frequency was computed after selecting the BFPS and correcting for noise with blank analytical control and random MS/MS spectra as statistical control. Peptides identified based on accurate precursor mass (±0.1 Da) from spectra generated by the OIT instrument mapped to 300 protein gene symbols with peptide observations n ≥ 3 (Figure 5A). In contrast, accepting the peptides fit from OIT MS/MS spectra with all delta mass values (±3 Da) resulted in the identification of more than 2000 protein gene symbols with n ≥ 3 (Figure 5B). Comparing a similar number of peptide identifications from the LIT based on MS/MS fragmentation alone in the absence of an accurate precursor mass also yielded ≥2000 protein gene symbols (Figure 5C). Increasing the sampling of fragment spectra (3,786,151 MS/MS) from the LIT eventually yielded ≥10,000 protein gene symbols (n ≥ 3) after correction against analytical and statistical controls (Figure 5D). The maximum accepted protein FDR was ≤1% (q ≤ 0.01) false positive rate (i.e. type I error). The OIT monoisotopic mass ±0.01 Da identified 302 proteins gene symbols with n ≥ 3 peptides (q ≤ 0.01), the OIT ± 3 Da identified 2784 true positive proteins n ≥ 3 peptides (q ≤ 0.01)^8^ with protein FDR q-values ≤0.01 from X!TANDEM. Thus, selecting the strict monoisotopic mass results in about a 90% false negative (type II error) rate compare to accepting all OIT MS/MS from ±3 Da.
Observation frequencies of protein gene symbols from monoisotopic mass (±0.05 Da) and all delta mass values from the orbital ion trap (OIT) vs the linear quadrupole ion trap (LIT) with different levels of replication. Panels: (A) Peptide observation frequencies of protein gene symbols from the monoisotopic mass (MH ± 0.1 Da) of the OIT (524,867 MS/MS); (B) Peptide observation frequencies of protein gene symbols from all delta mass values (MH ± 3 Da) of the OIT (524,867 MS/MS); (C) Peptide observation frequencies of protein gene symbols from all delta mass values (MH ± 3 Da) of the LIT where the number of peptides identified equaled that of the OIT (1,025,423 MS/MS); (D) Peptide observation frequencies of protein gene symbols from all delta mass values (MH ± 3 Da) of the LIT where there was 99% overlap between the OIT versus LIT (3,786,151 MS/MS).
Venn Diagram of Protein Gene Symbols
Analysis of the monoisotopic mass from OIT data yielded at total of 436 proteins; these proteins were then compared to the top 436 proteins identified by the OIT by including all heavy isotopes, hydrogen rearrangements, and other masses and to the top 436 proteins identified from data generated by the LIT analyzer. There was strong agreement on 196 protein gene symbols identified by the OIT (monoisotopic alone and including peptides with other masses) and the LIT (Figure 6A). Also, comparing the fit of fragmentation spectra from the OIT versus the LIT from a similar number of peptides revealed a high level of agreement (Figure 6B). However, the LIT is the more sensitive of the two instruments^24^ and spends most of its cycle time sampling peptides that were below the detection by the OIT. Thus, once the LIT was allowed to sample a sufficient number of peptides, the proteins identified by the instrument were in near-perfect overlap with those identified by the OIT (Figure 6C). The proteins identified by the OIT at the monoisotopic mass showed lower peptide MS/MS observation frequencies (Table 1). The OIT ± 0.1 Da agreed with the OIT ± 3 Da and the LIT ± 3 Da that identified the same peptides from the same proteins in the same proportions consistent with previous observations.^24,25^ However, the MS/MS strategy was more sensitive at the low concentration range and identified some peptides from some proteins that were below detection of the high-resolution instrument and confirmed that plasma peptides and proteins may be identified and quantified from MS/MS spectra alone without the need for an accurate measure of the precursor mass. The maximum accepted protein FDR was ≤1% (q ≤ 0.01) false positive rate (type I error). The OIT monoisotopic mass ±0.01 Da identified 302 proteins gene symbols with n ≥ 3 peptides (q ≤ 0.01), the OIT ± 3 Da identified 2784 true positive proteins n ≥ 3 peptides (q ≤ 0.01)^8^ with protein FDR q-values ≤0.01 from X!TANDEM. Thus, selecting the strict monoisotopic mass results in about a 90% false negative (type II error) rate compared to accepting all OIT MS/MS from ±3 Da.
Venn diagrams demonstrate qualitative agreement between the OIT and the LIT at the level of protein gene symbols. Panels: (A) Comparison of an equal number of top-hit proteins. Shown are the top 436 proteins identified from peptides (MH ± 0.01 Da) based on the monoisotopic mass (524,867 MS/MS, in blue), compared to the top 436 proteins of any mass (MH ± 3 Da) from the OIT (in green) and the top 436 proteins from the LIT (in red); (B) Comparison of protein gene symbols from n ≥ 3 peptides from an equal number of peptides identified using OIT vs LIT at any delta mass value (MH ± 3 Da); (C) The top 2726 protein gene symbols identified at any mass value (MH ± 3 Da) with n ≥ 3 peptides by the OIT vs the sampling required to obtain the same coverage from the LIT (3,786,151 MS/MS).
Table 1: Agreement between Results Using the Orbital Ion Trap (OIT) Monoisotopic Mass (OIT mono ±0.1 Da, 524,867 MS/MS) vs OIT and LIT Data That Included Heavy Isotopes, Hydrogen Rearrangements, and Other Masses (OIT ± 3 Da, 524,867 MS/MS; LIT ± 3 Da, 1,025,423 MS/MS)a
Regression of Peptides to Protein Gene Symbols from Monoisotopic
Mass OIT and LIT
There was a direct linear relationship between the log of the number of peptides per protein gene symbol identified by each of the two ion trap instruments, demonstrating both qualitative and quantitative agreement between these data sets. The highly significant quantitative relationship between the proteins that were independently identified and counted by the two instruments confirmed the validity of peptide identification based on MS/MS spectra alone within a mass tolerance of ±0.5 Da. Regression of the monoisotopic OIT protein gene symbol observation frequencies onto those determined by including peaks from heavy isotopes and hydrogen rearrangements and all other masses resulted in an adjusted *R-*squared value of 0.8772 with 433 degrees of freedom (DF) that was highly significant (Figure 7A). Regression of all masses from the OIT onto the LIT data resulted in an adjusted R-squared of 0.6003 with 9581 DF that was also highly significant (Figure 7B). In addition, regression of the OIT monoisotopic mass data onto all results from the LIT resulted in an R-squared value of 0.8028 with 433 DF that was highly significant as well (Figure 7C). The likelihood that all MS/MS spectra generated by each of the two instruments displayed both qualitative and quantitative relationships with the monoisotopic mass data by random chance approached zero. The highly significant qualitative and quantitative agreement between the results of the OIT monoisotopic mass with all other MS/MS spectra also rules out the possibility that analysis of plasma peptides by MS/MS spectra alone had a high error rate. Accepting peptides from the Orbitrap (OIT) at ±3 Da results in more peptides from the same proteins in the same proportion those of the OIT from ±0.1 Da of the monoisotopic that is a powerful biophysical proof of the validity of proteomics [regression p < 2.2 × 10^–16^] and shows good agreement with the LIT ± 3 Da. In agreement with previous results true positive peptides of albumin (ALB) from plasma show the same range of peptide p-values from false positive peptides of Titin (TTN) from random spectra.^5,6,28,35,36,40,41^ In contrast, the observation frequency of true positive albumin from plasma is high while the observation frequency of false positive ALB from random MS/MS spectra is low and thus the classical Monte Carlo strategy^8,15^ efficiently resolves true positive from false positive results based on observation frequency and subsequent protein p-value yet retains excellent sensitivity and so shows great statistical power (Supporting Figures 1–3).
Regression of log-transformed peptide observation frequencies vs protein gene symbols from the LIT onto OIT data. Panels: (A) Regression of all log OIT observations (MH ± 3 Da) onto log of OIT monoisotopic observations (MH ± 0.1 Da) by protein gene symbols (residual standard error: 0.2692 on 433° of freedom (DF), adjusted R-squared: 0.8772, F-statistic: 3102 on 1 and 433 DF, p-value: < 2.2 × 10–16); (B) Regression of all log LIT observations (MH ± 3 Da) onto log OIT observations (MH ± 3 Da) by protein gene symbols (residual standard error: 0.2927 on 9581 DF, adjusted R-squared: 0.6003, F-statistic: 1.439 × 104 on 1 and 9581 DF, p-value: < 2.2 × 10–16); (C) Regression of all log LIT observations (MH ± 3 Da) onto log OIT monoisotopic observations (MH ± 0.1 Da) by protein gene symbols (residual standard error: 0.3897 on 433° of freedom, adjusted R-squared: 0.8028, F-statistic: 1768 on 1 and 433 DF, p-value: < 2.2 × 10–16).
Agreement at the Level of Peptide Species
The peptides identified by OIT and LIT instruments from the common plasma protein, apolipoprotein (APOA1) were compared across the delta mass distribution. The peptides detected by analysis of the monoisotopic mass peak were also identified at peaks from hydrogen rearrangements, heavy isotopes, and other delta mass values (Supporting Table 1). The same peptides with similar proportions were detected using both the OIT and the LIT instruments (Supporting Table 2). The regression of APOA1 peptide observation frequencies determined by the OIT versus the LIT was highly significant (Figure 8A). The delta mass Gaussian density was centered within 0.5 Da between ± and LIT instruments where the OIT distribution was noticeable skewed toward −3 and −2 Da (Figure 8B,C).
Regression and delta mass density of the common APOA1 peptide, LLDNWDSVTSTFSK, from the OIT vs LIT instrument. (A) Regression of all log LIT peptide observations from APOA1 onto log OIT observations (MH ± 3 Da) over the protein APOA1 (residual standard error: 0.4924 on 54 DF, multiple R-squared: 0.3964, F-statistic: 35.46 on 1 and 54 DF, p-value = 0.0000002015); (B) Delta mass density from OIT observations of the most common APOA1 peptide; (C) Delta mass density from LIT observations of the most common APOA1 peptide. Regression of the delta mass values from panel C onto those of panel B resulted in a significant p-value (2.53 × 10–14) and a residual standard error of 0.9096.
Plasma Proteins
The accurate monoisotopic peptides compared with the MS/MS spectra from the OIT or LIT without an accurate precursor mass all agreed on the major protein of plasma that should not have occurred if the fit of MS/MS alone showed a large error in peptide identification (Table 1 & Figure 7). The MS/MS from a single quadrupole with the QqQ functions separated in time returned similar identifications and relative observation frequencies of plasma peptides and proteins compared to those obtained using the trihybrid orbital trap from MS/MS or accurate mass. Similarly, the excellent agreement between the MS/MS versus accurate mass with respect to peptide and protein frequencies provided independent evidence that the estimates of peptide identity and observation frequencies made with the robust and sensitive LIT instrument were reliable. The high level of agreement between the lists of plasma proteins identified independently by the OIT and the LIT instrument (after correction with analytical and statistical controls) underscored the low error rate from the fit of fragment spectra alone in the absence of an accurate monoisotopic mass.
Discussion
The aim of these experiments was to compare the identification of peptides and proteins using accurate precursor mass from the orbitrap (OIT) ± 0.1 Da versus those from the fit of the MS/MS spectra alone from both the OIT ± 3 Da and from the linear quadrupole ion trap (LIT) ± 3 Da. It was critical to first establish that the delta mass distribution of tryptic peptides derived from human plasma proteins using the highly resolving OIT instrument closely agreed with those predicted from heavy isotopes or hydrogen rearrangements. Next it was determined that the set of tryptic peptides that were identified based on monoisotopic mass agreed with those identified across the entire predicted delta mass distribution of the OIT instrument. Finally, it was established that the results of the OIT across the predicted delta mass distribution closely agreed with those from MS/MS spectra alone without an accurate mass estimate from the LIT instrument. The high resolution of the orbital trap was employed to independently confirm that the peptide delta mass distribution matched the predicted heavy isotopes and hydrogen rearrangements within a fraction of a Dalton. Moreover, the peptides and proteins identified from the monoisotopic mass exhibited qualitative and quantitative agreement with those from hydrogen rearrangements, heavy isotopes, and other masses. The human plasma samples recorded by the OIT and the LIT showed excellent agreement with one another at the level of peptides and proteins. The near-perfect agreement (99.9%) of proteins from the OIT versus the LIT unambiguously demonstrated that it was possible to identify peptides from the fit of the MS/MS spectra alone without the need for an accurate precursor mass.^5,6,9^Recent proteomic analysis of plasma with the orbitrap was only able to detect high abundance molecules like AZGP1, B2M, CRP, HP, HPR, ORM, RBP4, SAA and some others.^42−52^ In contrast MS/MS spectra in the absence of an accurate precursor mass identified cellular proteins in blood plasma that were undetectable to the orbitrap.^23^
Delta Mass Distribution of the OIT
The high resolution of the orbital trap revealed that the fit of MS/MS spectra alone correctly identified peptides that matched to the expected delta mass distribution with sharp peaks at the monoisotopic mass as well as at heavy isotopes and hydrogen rearrangements; this information provided powerful biophysical confirmation of the validity of the MS/MS alone for identifying peptides from human plasma proteins. In this study, an analysis of human plasma proteins and their peptides that were identified based on MS/MS fragmentation spectra alone^10^ reliably predicted peptide identity as confirmed by plots of precursor delta mass distribution. Naturally occurring peptides commonly contain heavy isotopes and/or hydrogen rearrangements; the precursor delta mass distribution falls into an integer envelope with expected mass values from −3 to +5 Da.^1−3^ Providing a search tolerance with a wide window (for example, ±3 Da) and then using the high mass resolution of the OIT to determine whether the predicted delta MH values exactly matched the −3, −2, −1, 0, +1, +2, +3, +4, and +5 Da integer distributions expected to result from heavy isotopes and hydrogen rearrangements was a powerful biophysical test of whether the fit of MS/MS spectra alone was sufficient to predict the correct peptide sequence. The OIT provided much greater resolution than the LIT for the measurement of the precursor delta mass values within a small fraction of a Dalton from the monoisotopic or other anticipated integer mass values. Exploiting the high mass resolution of the OIT by searching mass spectrometry data for peptides within ±0.05 Da of the monoisotopic mass leads to a significant loss of sensitivity because most of the data that might be collected from naturally occurring peptides with heavy isotopes, hydrogen rearrangements and other masses will be lost. The hydrogen rearrangement at −1, −2, and −3 as well as the +1 isotope Da has also been apparently observed previously with X!TANDEM and with the Paul trap.^40^
MS/MS Spectra versus Accurate Precursor Mass
The MS and MS/MS spectra from tryptic peptides of human plasma proteins were recorded using the OIT with the precursor ±0.1 or ±3 Da compared to MS/MS from the LIT without an accurate mass estimate. The OIT is a trihybrid, high-vacuum instrument with mass analyzers separated in space^53^ that delivers high mass resolution albeit with less sensitivity than can be achieved with the linear ion trap.^23,24^ In contrast, the more sensitive LIT^23,24^ performs the precursor scan, trapping, fragmentation, and fragment scan (QqQ) functions using the same low vacuum quadrupole with waveforms separated in time.^54^ Previous comparisons of the LIT to the OIT demonstrated that the two instruments showed near-perfect agreement but that the LIT was more sensitive.^24,25^ The experimentally observed peptide MS/MS spectra from the OIT and LIT may be correlated to theoretical fragmentation spectra of the known human peptides using correlation algorithms such as those found in X!TANDEM.^10^ The MS/MS from the LIT without an accurate precursor mass^19−21^ were sufficient to identify a similar set of peptides, proteins, and gene symbols as the OIT. The predicted delta mass values from the LIT showed the expected distribution of isotopes at 0, +1, and +2 Da as observed. We conclude that the fragment masses ±0.5 Da alone were sufficient to identify many peptides from most plasma proteins using OIT and LIT mass spectrometers. However, there is a temporal cost to running X!TANDEM at a wider mass window where the precursor tolerance of ±0.5 Da required 40 min compute the 9 orbital trap files but required 175 min to compute them with a tolerance of 3 ± Da.
MS/MS Goodness-of-Fit and Monte Carlo Random MS/MS Corrections
Computing the goodness-of-fit p-value with X!TANDEM revealed that most proteins identified with three or more BFPS peptides were reliable^8^ after correcting against the analytical control from blank injections and the random MS/MS spectra statistical control^22,23,55^ before generating the FDR correction using the method of Benjamini and Hochberg.^14^ The classical statistical methods of goodness-of-fit^56^ and Monte Carlo correction^8,15^ against random MS/MS controls^5,6,35,36,41^ confirmed the strong statistical confidence in the peptides identified by MS/MS spectra without a monoisotopic mass from the LIT or OIT instruments.^5,6^ The Monte Carlo correction against noise and random MS/MS spectra followed by calculating the fit of MS/MS spectra to yield p-values and FDR q-values provided a quantitative assessment of the low error rate from statistical methods. The highly significant regression results revealed a strong linear relation between the peptides identified from the analysis of the monoisotopic mass with those of all the other masses using the OIT and the LIT unambiguously provided highly significant and quantitative evidence of the validity of fitting peptides by the fit of MS/MS spectra alone without the need for an accurate mass estimate. In contrast, the exact match of the delta mass distribution of heavy isotopes and hydrogen rearrangements in the OIT provided direct biophysical evidence for the validity of peptide identification from fragment spectra alone without selecting a monoisotopic mass. The combination of affinity or partition chromatography followed by random and independent sampling^41^ of MS/MS spectra in the absence of an accurate precursor mass for analysis with SQL Server to remove redundant identifications has revealed the ligand receptors from blood cells^57,58^ plasma peptides of disease populations,^59−62^ revealed the growth factors of fetal serum that were confirmed by cellular add-back experiments,^55^ the humoral proteins confirmed by ELISA^22^ and the biomarkers of COVID-19 that were confirmed by enzyme assays.^23^ Biological discoveries using MS/MS spectra from low-resolution ion traps have been confirmed using Western blot,^30^ ELISA^22,33^ enzyme assays^23^ and cellular add-back experiments^55^ as well as specific drugs, genetic mutants, GFP fusions and silencing RNA.^57,58^ We have previously shown that the simple ion trap agrees with the Qq-TOF but was more sensitive.^27−29^
Agreement at the Level of Peptides and Proteins
Natural peptides show a wide mass distribution;^1−3^ thus, limited sampling to within 0.05 Da of the monoisotopic mass will not provide the highest sensitivity. An analysis across the entire set of peptide delta mass values revealed the peptide species associated with the monoisotopic mass of the OIT were also observed at peaks from heavy isotopes and hydrogen rearrangements and showed qualitative and quantitative agreement with the peptides detected using the LIT instrument. Evidence provided from an analysis of purified peptide or protein standards revealed that many peptides from most proteins can be identified by the MS/MS fragmentation spectra alone^4−7^ and thus that there was no benefit to imposing a narrow precursor mass search window.^9^ Modern 64-bit computers possess more than sufficient power to search the MS/MS spectra over a wider range (±3 Da) so computational efficiency is no longer a major consideration. The validity of fitting peptides by MS/MS alone and the wide natural mass distribution of peptides from −3 to +5 Da provide compelling reasons for the routine adoption of wider search windows which will dramatically increase the analytical power of plasma proteomics by LC-ESI-MS/MS. The strategy of using the best-fit ±3 Da^5,6^ takes into account the existence of naturally occurring heavy isotopes that would present with delta mass values of precisely +1 and +2 Da.^1,2^ Similar changes in delta mass values were observed for potential hydrogen rearrangements of tryptic peptides that may lead to the loss of 1 to 3 Da, as previously reported.^3,40,63^ The most important considerations for rigorous statistical analysis of changes in plasma proteins that may result from different diseases or drug treatments are sensitivity to provide high observation frequencies for confident comparison.^22,23,41,55^ Thus, omitting the strong signals from peptides with heavy isotopes or hydrogen rearrangements represents a large type II error (false negative) that should be avoided to prevent the loss of information of importance to human health. If the fit of MS/MS spectra was false positive then the peptide should have been randomly distributed across the database and the regression should show a flat line with large residuals. Instead, the regression shows the same peptides from the same proteins in the same proportions that could not occur unless and the fit of MS/MS spectra returned the same true positive results as the accurate mass method.
Human Plasma Proteome
The plasma proteome recorded by the OIT and the LIT instruments showed strong qualitative and quantitative agreement with each other as confirmed by regression analysis.^38,39,64,65^ The data indicated that the MS/MS spectra from human plasma protein digests collected with the LIT or the OIT resulted in at least 2726 protein identifications with nearly perfect qualitative (99.9%) and profound quantitative (p ≤ 2 × 10^–16^) agreement. The agreement between the results for known plasma proteins obtained from the independent OIT and LIT instruments also indicated that plasma peptides could be identified and quantified using MS/MS spectra alone. The LIT is a sensitive and robust instrument for the analysis of peptides by LC-ESI-MS/MS where the equations that describe the motion of ions are well understood and their performance can be predicted directly.^54^ The results presented in this study demonstrate that random and independent peptide sampling followed by searching the fragment spectra within ±3 Da for the precursor at a 2+ or 3+ charge state and analyzing fragments from MS/MS spectra within 0.5 Da of the expected theoretical peptide fragmentation pattern are sufficient to identify many peptides from most major plasma proteins with high confidence.^23,41,55^ The results presented here indicate that peptides and proteins in a complex sample like blood plasma can be robustly identified and enumerated using MS/MS spectra in the absence of an accurate precursor ion mass together with a 64-bit computer for classical statistical analysis that delivers a result remarkably similar to those obtained using the accurate precursor mass and has great importance for wide scale application in clinical proteomics.
Conclusions
The exact match of the isotope distribution and hydrogen rearrangement data identified using the orbital trap provided independent biophysical confirmation of the low error rate of the approach using the best fit of the MS/MS spectra for the identification of human plasma peptides and proteins. The peptides and proteins identified at the monoisotopic mass agreed both qualitatively and quantitatively with the heavy isotope and hydrogen rearrangement data. The best fit of the MS/MS spectra from the OIT included the same peptides and proteins as identified using the LIT. In this study, independent lines of evidence from the biophysical agreement of predicted and measured monoisotopic mass, the exact delta mass distribution of hydrogen rearrangements and heavy isotopes, the fit of MS/MS to peptide p-values and FDR corrected q-values, the Monte Carlo correction against noise and 30,000,000 random MS/MS spectra, the qualitative agreement shown by Venn diagram, and the quantitative agreement determined by regression all demonstrated the low error rate of the OIT and excellent agreement with the LIT without specifying an accurate precursor mass. It is important to note that, after calculating the predicted precursor monoisotopic mass from the best-fit of the MS/MS spectra, the delta mass values independently exhibited the predicted peaks at exactly −3, −2, −1, 0, +1, +2, +3, +4, and +5 Da.^3^ The excellent fit to the expected delta mass distribution provides powerful biophysical evidence for the validity of peptide identification from goodness of fit using the MS/MS spectra alone. The delta mass and statistical results demonstrated that many peptides from most plasma proteins can be identified and enumerated using the sensitive fit of MS/MS spectra without an accurate monoisotopic peptide mass.^22,23,55^ In agreement with previous studies,^5,6^ the largest sources of error in proteomics were not from the mass estimates or the instruments employed but were from computational sources that could be corrected by removing redundancy with an SQL Server and correcting with noise analytical and random MS/MS statistical controls in the R statistical system.^5,6^ Here the comparison of the human plasma peptides from the fit of fragmentation spectra provided nearly identical results to the accurate monoisotopic precursor mass but the MS/MS approach was much more sensitive. Taken together this paper demonstrates that combining standard precipitation with convention protein chromatography followed by simple trypsin digestion and MS/MS analysis with robust and sensitive linear ion traps may provide a large plasma proteome with low Type I and Type II error using only classical statistical techniques with the widely available SQL Server and R statistical system that has profound implications for the clinical analysis of blood samples.
Materials and Methods
Materials
The Dionex UltiMate 3000 series ultraperformance liquid chromatography (UPLC), C18 Acclaim PepMap NanoLC column (75 μm ID, 25 cm length C18), and Fusion Lumos (Q-Orbital ion trap-LTQ Tribrid MS) were obtained from Thermo Fisher Scientific (Waltham, MA). The Hewlett-Packard 1100 high-performance liquid chromatography (HPLC) (Santa Clara, CA) was coupled to an LTQ XL linear ion trap mass spectrometer (Thermo Electron Corporation, Madison, WI). The salts, buffers, and Coomassie Blue were obtained from Sigma-Aldrich (St. Louis, MO). The QA resin on a ceramic support was from Bio-Rad (Hercules, CA). HPLC-grade water, ethanol, acetone, and acetonitrile (ACN) were obtained from Caledon Laboratories (Georgetown, Ontario, Canada). The analytical C18 resin was Princeton Sphere 5 μm 300 Å (Princeton Chromatography, Cranbury, NJ). Sequencing grade trypsin was from Promega (Madison, Wisconsin).
Plasma Separation and Analysis
Plasma samples (25 μL) were precipitated with 10 volumes of ACN,^31^ and centrifuged at 12,000 relative centrifugal force (RCF) for 15 min at room temperature and the pellet dried under vacuum. The proteins were resuspended in 250 μL of 20 mM pH 8.85 tris buffer on ice after vortexing and then centrifuged at 12,000 RCF for 5 min; the supernatant was then collected. The intact plasma proteins were collected over 100 μL of Quaternary amine (QA) also called Strong Anion Exchange (SAX) resin or simply Q resin, washed with three column volumes, and eluted with two column volumes (200 μL) of ≤1 M NaCl.^34^ The resulting protein samples (200 μL) were digested in 600 mM urea and 5% ACN in 20 mM tris pH 8.85 with trypsin at 1/100 at 37 °C for ≤16 h, reduced with 2 mM dithiothreitol at 50 °C for 20 min, and digested again with trypsin for 2h.^34^
C18 ZipTips
The plasma peptides were collected over a C18 ZipTip column in 5% acetic acid, washed, and eluted in 2 μL 5% formic acid, and 65% ACN. The eluate was diluted immediately with 18 μL of 5% formic acid for injection via a 20 μL loop.
Linear Quadrupole Ion Trap (LIT)
Analytical HPLC was performed over 90 min with a 5 to 65% ACN gradient on a 150-μm ID column (15 cm) with in-line filter frits as previously described.^22^ The peptides were ionized by nano spray of the solvent gradient at 2 μL/min by a 1:10 split to a flow of ∼200 nL/min with a transfer capillary temperature of 250 °C in a Thermo Electron Corporation LTQ XL ion trap mass spectrometer.^54^ Approximately 5 μg of extracted and purified peptides were injected via a 20 μL sample loop. The electrospray voltage was 1500 v and the instrument was set to sample the 5 most intense ions randomly with respect to time^41^ as they eluted with up to 200 ms to fill the trap to a target 250,000 ions.
Orbital Ion Trap (OIT)
The plasma peptides collected over a C18 ZipTip were eluted in 2 μL of 5% formic acid and 65% ACN and immediately diluted with 18 μL of 5% formic acid for injections via a 20 μL loop. The resulting peptides were analyzed over an Acclaim PepMap Nano LC column (C18, 2 μm particles with a pore size of 100 Å, ID: 0.075 mm × 250 mm) at 30 nL per minute with a 50 min gradient from 5 to 70% acetonitrile for the Thermo orbital ion trap, Fusion Lumos (Q-Orbital ion trap-LTQ Tribrid MS) at the Department of Chemistry, National Taiwan University. The instrument was tested with the manufacturer’s calibration mixture. The electrospray was formed at 2000 V and the ion transfer tube was maintained at 275 °C. The tryptic peptides were identified by MS scans from m/z 350 to m/z 1700 and followed by higher-energy collisional dissociation-MS/MS spectra of the most intense ions at a normalized collision energy of 32%. The MS survey scan was performed every 3 s (AGC target 5e5, maximum injection time of 50 ms) and all ions above the signal threshold were submitted for MS/MS (AGC target 5e4, maximum injection time of 50 ms) in order of decreasing intensity with previously selected ions dynamically excluded for 60s and the isolation width was 1.4 Th
Peptide MS/MS Spectra Correlation Analysis
A physical filter of at least 1000 (E3) intensity counts for peptide parent ions was used to limit type I error.^22^ The MS/MS spectra were fit to human peptides cataloged in the UniParc protein library^66^ as of 2023. The MS/MS spectra were fit to peptides in the absence of an accurate precursor mass by the X!TANDEM algorithm which generates p-values directly from the fit of predicted versus observed MS/MS spectra by the standard statistical method of goodness of fit.^10,12,67−69^ Fully tryptic enzyme specifications that included a charge state of 2+ or 3+, as many as three missed cleavages, and a wide precursor mass tolerance (±3 Da) were used to calculate the peptide [M + H]^+^ and fragments within 0.5 Da.^5,6,40^ Protein counts from the best-fit per MS/MS spectra (BFPS) were computed in the SQL Server with each MS/MS spectra used only once and assigned to the best-fit peptide for the computation of protein (gene symbol) p-values and FDR corrected q-values.^14^ The MS/MS spectra were fit to peptides with fully tryptic X!TANDEM settings.^10^ The X!TANDEM algorithm was set to search parameters of 2+ or 3+ precursor ions from 500 to 2000 m/z that correlated from −3 to +5 Da with fragments ±0.5 Da and a maximal accepted value of p ≤ 0.01.^5,6^ TRYP [RK]|[X] conditions included peptides with modifications such as the oxidation of methionine (M) or tryptophan (W) (15.994915), the addition of +1 at asparagine (N) or glutamine (Q) (0.984016), and hydroxylation or sulfonation of M or W (31.98983) and glycine (G) substitution at Cysteine (C) (57.021464). The addition of a carboxy-terminal hydroxyl (17.002735) or an amino-terminal proton (1.007825) was also included with a maximum protein p-value of p ≤ 0.01. The X!TANDEM algorithm^10^ provided a computed p-value for each peptide match together with an estimate of type I error rate.^5,6,41^
Random MS/MS Control
Random MS/MS spectra were created using a modification of the random MS/MS spectra generator as previously described.^5,6^ The range of precursor masses, the range and number of fragments and their mass distribution were matched to those of the experimental observations. The precursors were z = +2 to z = +3 and ranged from 350 to 2000 m/z (i.e., 700 to 6000 Da). Fragments were restricted to z = +1 with 150 m/z ≤ 2000 Da.
Computational Analysis in SQL and Statistical Analysis with
R
The LC-ESI-MS/MS spectra, including the parent and fragment m/z and intensity values, together with the resulting peptide and protein identifications from the X!TANDEM,^10^ were introduced into an SQL Server database.^39^ The inherent features of the SQL Server system permitted the MS/MS spectra to be assigned to a nonredundant set of BFPSs. The experimental results were collected and corrected with analytical controls that included blank injection runs with HPLC-grade solvents over untreated columns.^5,6,40^ The observation frequencies of peptide species from the experimental recordings were resolved from those of blank injection LC-ESI-MS/MS recordings and random MS/MS spectra by the chi Square test (χ^2^ ≥ 9, p ≤ p0.01). The LC-ESI-MS/MS data were graphically and statistically analyzed by regression with the R statistical system.^41^ The R statistical system was used to plot data density and distribution and to compute the cumulative p-value for protein gene symbols with FDR correction to q-values by the method of Benjamini and Hochberg.^14^ The plasma samples resolved by the LIT were compared to those of the OIT under three conditions, specifically, (1) an equal number of proteins identified by each instrument; (2) a roughly equal number of peptides identified by each instrument, and (3) sufficient sampling with the LIT to achieve nearly complete overlap with the OIT instrument.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1KantnerováK.; Kuhlbusch N.; Juchelka D.; et al. A guide to precise measurements of isotope abundance by ESI-Orbitrap MS. Nat. Protoc. 2024, 19, 2435–2466. 10.1038/s 41596-024-00981-5.38654136 · doi ↗ · pubmed ↗
- 2Popovic Z.; Anderson L. C.; Zhang X.; et al. Analysis of Isotopically Depleted Proteins Derived from Escherichia coli and Caenorhabditis elegans Cell Lines by Liquid Chromatography 21 T Fourier Transform-Ion Cyclotron Resonance Mass Spectrometry. J. Am. Soc. Mass Spectrom. 2023, 34 (2), 137–144. 10.1021/jasms.2c 00242.36656140 · doi ↗ · pubmed ↗
- 3Savitski M. M.; Kjeldsen F.; Nielsen M. L.; et al. Hydrogen rearrangement to and from radical z fragments in electron capture dissociation of peptides. J. Am. Soc. Mass Spectrom. 2007, 18 (1), 113–120. 10.1016/j.jasms.2006.09.008.17059886 · doi ↗ · pubmed ↗
- 4Guzzetta A. W.; Thakur R. A.; Mylchreest I. C. A robust micro-electrospray ionization technique for high-throughput liquid chromatography/mass spectrometry proteomics using a sanded metal needle as an emitter. Rapid Commun. Mass Spectrom. 2002, 16 (21), 2067–2072. 10.1002/rcm.829.12391582 · doi ↗ · pubmed ↗
- 5Dufresne J.; Florentinus-Mefailoski A.; Zhu P. H.; et al. Re-evaluation of the rabbit myosin protein standard used to create the empirical statistical model for decoy library searching. Anal. Biochem. 2018, 560, 39–49. 10.1016/j.ab.2018.08.025.30171831 · doi ↗ · pubmed ↗
- 6Thavarajah T.; Tucholska M.; Zhu P. H.; et al. Re-evaluation of the 18 non-human protein standards used to create the Empirical Statistical Model for Decoy Library Searching. Anal. Biochem. 2020, 599, 11368010.1016/j.ab.2020.113680.32194076 · doi ↗ · pubmed ↗
- 7Zolg D. P.; Wilhelm M.; Schnatbaum K.; et al. Building Proteome Tools based on a complete synthetic human proteome. Nat. Methods 2017, 14 (3), 259–262. 10.1038/nmeth.4153.28135259 PMC 5868332 · doi ↗ · pubmed ↗
- 8Cargile B. J.; Bundy J. L.; Stephenson J. L.Jr. Potential for false positive identifications from large databases through tandem mass spectrometry. J. Proteome Res. 2004, 3 (5), 1082–1085. 10.1021/pr 049946 o.15473699 · doi ↗ · pubmed ↗
