Bottom-Up Proteomics Under Acidic Conditions Using Protease Type XIII From Aspergillus saitoi
Ryota Tomioka, Ayana Tomioka, Kosuke Ogata, Yasushi Ishihama

TL;DR
A new acidic protease from Aspergillus saitoi improves bottom-up proteomics by reducing artifacts and increasing sequence coverage.
Contribution
P13ase's cleavage preferences and acidic digestion conditions are characterized for precision proteomics.
Findings
P13ase cleaves preferentially at leucine, lysine, and arginine residues.
P13ase achieves over 90% sequence coverage at pH 3.5, outperforming trypsin.
P13ase reduces deamidation artifacts due to its acidic optimal pH.
Abstract
Bottom-up proteomics is a powerful technique for comprehensive analysis of proteins by proteolytic cleavage followed by LC-MS/MS to identify the resulting peptides. Trypsin is the gold-standard protease for bottom-up proteomics, though its cleavage specificity limits peptide identification, depending on the protein sequence. In addition, its optimal pH is weakly alkaline, which can cause modification artifacts such as deamidation. We hypothesized that these limitations might be overcome by using protease type XIII (P13ase) from Aspergillus saitoi, which is active at low pH. P13ase has been used for protein structural analysis by hydrogen-deuterium exchange mass spectrometry, but its cleavage preferences have not been clarified. Here, we demonstrated that P13ase exhibits a preference for cleaving the C-terminal sides of leucine (21%), lysine (19%), and arginine (15%) residues, and that…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsAdvanced Proteomics Techniques and Applications · Peptidase Inhibition and Analysis · Protein purification and stability
Protein digestion is an important technique for characterizing proteins. For example, the sequence of an antibody drug can be identified by mass spectrometric analysis of peptides generated by cleaving peptide bonds (1). Indeed, protein digestion is a crucial and essential step not only in such single protein analysis but also in bottom-up proteomics using LC-MS/MS, and stringent control is required to ensure accurate identification and quantification of each protein (2, 3). Trypsin has been widely used as the gold standard protease in proteomics due to its cleavage specificity, high digestion efficiency, and ready availability (4, 5). Trypsin can specifically cleave the C-terminal side of lysine and arginine, but the peptides generated by trypsin are sometimes excessively long or short, making it difficult to identify the entire sequence of proteins of interest (4, 5). Therefore, alternative proteases with different specificities, such as Lys-C, Lys-N, Asp-N, Glu-C, Arg-C, chymotrypsin, and ProAlanase, have been adopted for bottom-up proteomics (6, 7), and combinations of these proteases have been reported to improve coverage compared to trypsin alone. In addition, modifying the sequence of proteases can enhance their functionality. For example, methylating the lysine residues of trypsin has been reported to inhibit self-digestion, and modifying the Arg-C sequence has been shown to improve the cleavage specificity (8, 9). Furthermore, subtilisin, proteinase K, and thermolysin, which have relatively low cleavage specificity, have been proposed as proteases for rapid digestion (10). In addition to these alternative proteases, chemical modifications have been employed to expand the cleavage sites of trypsin (11, 12, 13). However, there is still a need for new methods to improve coverage.
The challenges in trypsin digestion are not limited to the length of the generated peptides. Another issue is the modification artifacts generated due to the optimal pH of trypsin (pH 7–9). Specifically, under these conditions, artifacts are produced by cyclization reactions followed by hydrolysis (14, 15). These artifacts can cause false-positive identifications and distort quantitative analysis, thereby complicating data interpretation. Various attempts have been made to overcome this problem. Liu et al. (15) demonstrated that digestion using Glu-C protease at pH 4.5 in ammonium acetate effectively removed deamidation artifacts. Cao et al. (16) developed a new low-pH peptide mapping protocol combining Lys-C and modified trypsin to enable qualitative and quantitative characterization of succinimidyl intermediates and deamidation products. However, lowering the digestion pH shifts the protease out of its optimal pH range, reducing digestion efficiency. There are also thermostable acidic archaeal proteases that function optimally at pH 2 to 4 and exhibit high digestion efficiency (17, 18), but their cleavage specificity is relatively low, and their artifact suppression efficacy remains unclear.
In trypsin digestion, the relatively long digestion time (overnight) is also an issue. While trypsin-immobilized spin columns can complete digestion in approximately 10 min (19, 20, 21), in-solution digestion is still superior in terms of cost and reproducibility, and methods to achieve short digestion times under in-solution digestion conditions are needed.
P13ase from Aspergillus saitoi digests a wide range of proteins with optimal activity at pH 2.5 to 3.0 (22, 23). It has received much attention in recent years, especially for application in hydrogen-deuterium exchange mass spectrometry, where it is essential to carry out digestion within a short time at low temperature and low pH (24, 25, 26, 27, 28). This protease is an acidic and nonspecific protease, but its cleavage preference is not the same as that of pepsin, enabling the generation of a variety of peptides (26, 28). While P13ase has been increasingly used in hydrogen-deuterium exchange mass spectrometry in recent years, it has hardly been utilized in bottom-up proteomics. One reason for this is that optimal digestion conditions, such as pH, temperature, and digestion time, as well as the enzyme’s cleavage preferences, remain to be systematically evaluated. Understanding these parameters is critical to maximize the utility of type XIII proteases in a bottom-up proteomics workflow.
In this study, we comprehensively evaluated the proteolytic performance of P13ase using HeLa cell lysate as a model system, aiming to maximize the potential of P13ase with state-of-the-art LC-MS/MS analysis and data processing systems. The effects of digestion pH, temperature and reaction time on peptide and protein identification, the cleavage preferences and the extent of artifact formation were examined. Finally, P13ase was compared with trypsin under optimized conditions to establish its potential as a new protease for proteomics research.
Experimental Procedures
Materials
Urea, DTT, iodoacetamide (IAA), Ambic, acetonitrile, acetic acid, sodium hydroxide, hydrochloric acid, formic acid, and TFA were obtained from FUJIFILM Wako. P13ase from A. saitoi was obtained from Sigma-Aldrich (Cat No P2143). Sequencing-grade modified trypsin was obtained from Promega. Polystyrene-divinylbenzene Empore disks were from GL Sciences. Water was purified with a PURELAB Quest system (ELGA LabWater).
Sample Preparation
HeLa cells were cultured in Dulbecco’s modified Eagle’s medium containing 1× antibiotic-antimycotic solution with 10% fetal bovine serum at 37 °C under 5% CO_2_. HeLa cells were pelleted by centrifugation and proteins were denatured with 8 M urea in 50 mM Ambic solution and extracted by sonication. The concentration of proteins was determined by bicinchoninic acid assay. Proteins were reduced with 10 mM DTT in 50 mM Ambic solution at 37 °C for 30 min and carbamidomethylated with 50 mM IAA in 50 mM Ambic solution at 37 °C for 30 min. Proteins were 10-fold diluted with 1% formic acid (pH 1.5, 2.0, 2.5, 3.0, 3.5, and 4.0) for P13ase digestion or 10-fold diluted with 50 mM Ambic solution (pH 7.8) for trypsin digestion. Proteins were digested with P13ase (protein:enzyme = 10:1, w/w) or trypsin (protein:enzyme = 100:1, w/w). The enzyme:protein ratio for P13ase was established based on a previous report (29). Trypsin digestion was performed at 37 °C for 16 h, while the P13ase digestion conditions during the optimization process are described together with the results in the following section. Digests were desalted in reversed-phase StageTips (30, 31) as described previously.
NanoLC-MS/MS
NanoLC-MS/MS was performed on an Ultimate 3000 liquid chromatograph combined with an Orbitrap Exploris 480 mass spectrometer (Thermo Fisher Scientific). The mobile phases for liquid chromatography consisted of solvent A (0.5% acetic acid) and solvent B (0.5% acetic acid and 80% acetonitrile). Peptides were separated on self-pulled needle columns (250 mm length, 100 μm ID) packed with Reprosil-Pur 120 C18-AQ 1.9 μm reversed-phase material (Dr Maisch) at 50 °C in a column oven (Sonation). The injection volume was 5 μl and the flow rate was 400 nl/min. Digested peptides were separated by applying a step gradient of 5% B for 5 min, 5 to 19% B for 55 min, 19 to 29% B for 21 min, 29 to 40% B for 9 min, 40 to 99% B for 0.1 min, and 99% B for 5 min. Peptides were analyzed in the data-dependent acquisition mode. The electrospray voltage was set to 2.4 kV in the positive mode. The mass range of the survey scan was from 375 to 1500 m/z with a resolution of 60,000, 300% normalized automatic gain control target, and auto maximum injection time. The first mass of the product ion scan was set to 120 m/z with a resolution of 15,000, standard automatic gain control target, and auto maximum injection time. The fragmentation was performed by higher-energy collisional dissociation with normalized collision energy of 30%. The dynamic exclusion time was set to 20 s.
Data Analysis and Bioinformatics
Peptides and proteins were identified through automated database searching using MSFragger (32, 33) version 4.1 and Philosopher (34) version 5.1 against the human database (release 2023/09, 20,468 entries) from UniProtKB/Swiss-Prot. Sequences of P13ase (UniProt Accession: Q12567) and trypsin (UniProt Accession: P00761) were used to evaluate self-digestion. For P13ase, the digestion mode was set to nonspecific digestion. For trypsin, the digestion mode was set allowing for up to two missed cleavages. The mass tolerance for precursor ions and fragment ions were set as 20 ppm. Oxidation (M), acetylation (protein N-term), deamidation (N), and NH_3_-loss (N) were allowed as variable modifications. Carbamidomethylation (C) was set as a fixed modification. The false discovery rate filter was set to 0.01 at both the peptide-spectrum match and protein levels. Peptides and proteins were quantified by IonQuant (35). Searches were also performed in MaxQuant (36) version 2.4.2.0 for the analysis of cleavage preference. Identified peptides and proteins identified in this study are listed in supplemental Table S1.
Experimental Design and Statistical Rationale
All of mass spectrometry datasets were performed with three technical replicates in each experimental condition. All values were presented as mean ± standard error. Data visualization was carried out by using Microsoft Excel and R program. GRAVY index was performed by using GRAVY Calculator (https://www.gravy-calculator.de/). For cleavage preference analysis, sequence window surrounds the cleavage sites (±7) obtained from search result from MaxQuant was submitted to iceLogo (37), utilizing protein sequences in search FASTA files of human as backgrounds. The sequence coverages were calculated with Protein Coverage Summarizer (https://github.com/PNNL-Comp-Mass-Spec/protein-coverage-summarizer). Extracted ion chromatograms and mass spectra were analyzed by FreeStyle (Thermo Fisher Scientific). Motif analysis was performed using rmotif-X (38).
Results and Discussion
Effect of pH on P13ase Digestion
To evaluate the activity of P13ase in bottom-up proteomics, we first investigated the effect of pH. Based on previous reports indicating that the optimal pH for P13ase is 2.5 to 3.0 (23), we set the pH range for this study from 1.5 to 4.0. Proteins extracted from HeLa cells were used as samples and digested at 37 °C for 10 min. Triplicate experiments were conducted for each condition, and nanoLC-MS/MS measurements were performed in the data-dependent acquisition mode. Peptide identification was performed using MSFragger in the nonspecific digestion mode. As a control, a sample was digested with trypsin under weakly alkaline conditions at 37 °C for 16 h. The number of identified peptides and cleavage specificity are shown in Figures 1 and 2, respectively.Fig. 1**Effect of digestion pH on the activity of P13ase.A, the number of unique peptides and proteins identified in HeLa cell extract at various pH values (1.5 ∼ 4.0). The bar graphs represent the mean of three replicates, and error bars represent the standard error. B, the length distribution of identified peptides after P13ase digestion at various pH values. C, the distribution of retention time of identified peptides after P13ase digestion at various pH values. P13ase, protease type XIII.Fig. 2Cleavage specificity of P13ase in HeLa cell extract at various pH values.**A, frequencies of P1 amino acids at cleavage sites at various pH values (1.5 ∼ 4.0). The bar graphs represent the mean of three replicates, and error bars represent the standard error. B, cleavage preferences visualized using iceLogo at various pH values (1.5 ∼ 4.0). Nonspecific MaxQuant search was performed to calculate cleavage preferences against the HeLa cell lysate dataset (three replicates combined). The *y-*axis is the percentage difference in frequency for an amino acid in the experimental set and the reference set. P13ase, protease type XIII.
The number of identified peptides tended to increase with increasing pH. At pH 3.5, although the number of identified peptides was lower than in the case of trypsin digestion, 22,793 peptides and 2334 proteins were identified, representing the highest number of unique peptides identified (Fig. 1A). The average length of identified peptides increased with decreasing pH at pH 2.5 or lower, but at pH 2.5 or higher, it remained at an average of approximately 14 amino acid residues. Compared with trypsin-digested peptides, they were approximately two amino acid residues longer (Fig. 1B). In reversed-phase liquid chromatography separation, the retention time range showed a trend toward longer retention times at pH values of 2.5 or lower, but remained nearly constant at pH 2.5 or higher, differing from the peptide length pattern and exhibiting a retention time range similar to that of trypsin-digested peptides (Fig. 1C). The total ion current chromatograms showed that ions with longer retention times were more abundant at lower pH values, suggesting that digestion may not have been sufficient at lower pH (Supplemental Fig. S1).
Using these identified peptides, the cleavage preferences were examined. Because previous reports on cleavage preferences of P13ase focused on the P1 position (26, 27, 28), we first examined the amino acid frequency at the P1 position (Fig. 2A). As the pH was increased, the frequency of trypsin-like cleavage (Lys and Arg) increased, while the frequency of cleavage at acidic (Asp and Glu) and hydrophobic (Leu and Phe) amino acids decreased. Previous studies on Asp cleavage have demonstrated that peptide bonds involving Asp can undergo nonenzymatic hydrolysis under conditions of low pH and elevated temperature (18, 39, 40). However, cyclization followed by cleavage is known to require a pH below 2.1 and temperatures of 108 °C or higher. Therefore, we considered that such reactions would not occur under the conditions examined in our study. In fact, while previous reports have described nonenzymatic cleavage at the N-terminal side of Asp, our data showed no pH-dependent cleavage at this site (data not shown). These findings suggested that the majority of the cleavage observed under our digestion conditions is attributable to the enzymatic activity of P13ase. Further analysis of the effect of pH on the frequency of missed cleavages showed that the percentage of peptides containing missed cleavages decreased at higher pH for Lys and Arg, whereas the decrease in missed cleavages was not significant for Leu, Phe, Asp, and Glu (Supplemental Fig. S2). Considering that the average peptide length at pH 2.5 and above was approximately two amino acid residues longer than tryptic peptides, even though there were other amino acids besides Lys and Arg that could be cleaved, this suggests that there is a contribution from sites other than the P1 position. Thus, we performed motif analysis of the cleavage preferences for 15 amino acid residues (from P8 to P7′) using iceLogo and motif-x (Fig. 2B and Supplemental Table S2). The results showed that not only P1 but also co-occurring sequences were observed, with a trend toward higher frequencies of Ile, Val, and Glu at P2 and Arg, Lys, and Val at P3. On the other hand, the reported hydrophobic amino acid preference at P1' (23, 28) was not confirmed.
Compared to trypsin, P13ase is able to cleave the C-terminal side of Leu in addition to Lys and Arg, while cleavage at Ile does not occur at all. This result suggests that Leu, which cannot be distinguished from Ile by MS, may be unambiguously identified by P13ase digestion. Furthermore, it is known that under alkaline trypsin digestion conditions, Asn and Gln undergo cyclization reactions with the carbonyl groups of peptide bonds to produce succinimide intermediates followed by hydrolysis to produce deamidated peptides. Since the entire P13ase digestion process can be performed at low pH, one would expect these artificial modifications to be reduced. Indeed, the contents of both deamidated peptides and succinimide intermediates were significantly reduced by P13ase digestion compared to trypsin digestion under all pH conditions tested (Supplemental Fig. S3). This result will be discussed in more detail in the following section.
Based on these results, pH 3.5 was selected as the optimum pH for P13ase digestion in bottom-up proteomics.
Effect of Temperature and Time on P13ase Digestion
Next, we examined the digestion time (2 min to 16 h) and temperature (4 °C, 25 °C, 37 °C) at pH 3.5. As expected, higher temperatures reduced the time required to maximize the number of identified peptides. At 37 °C for 60 min, 32,472 peptides and 2721 proteins were identified, representing the highest number of unique peptides and proteins identified (Fig. 3A). Although the number of identified peptides and proteins was lower than trypsin, the number of peptides per identified protein for P13ase (11.6 peptides/protein) was higher than that for trypsin (9.8 peptides/protein). Additionally, the average length of the identified peptides showed a decreasing trend with increasing digestion time and temperature (Fig. 3B and Supplemental Fig. S4). Furthermore, the retention time range was similar to that of trypsin under most conditions, though in the case of 16 h of digestion at 37 °C, the number of peptides with shorter retention times increased (Fig. 3C and Supplemental Fig. S5).Fig. 3**Effect of digestion temperature and time on the activity of P13ase in HeLa cell extract.**A, the number of identified unique peptides and proteins at various digestion temperature and times. The bar graphs represent the mean of three replicates, and error bars represent the standard error. B, the length distribution of identified peptides after various digestion times at 37 °C. C, the distribution of retention times of identified peptides after various digestion times at 37 °C. P13ase, protease type XIII.
When the cleavage preferences at 37 °C were examined, the cleavage preferences at positions P1, P2, and P3 were consistent with the results presented in the previous section for digestion at 60 min or less. However, the cleavage preferences at the P3 position disappeared when digestion was performed at 37 °C for 120 min or longer (Fig. 4A). A similar trend was observed when digestion was performed at 4 °C and 25 °C, and the lower the digestion temperature, the longer the digestion time required for the cleavage preferences to change (Supplemental Fig. S6). The number of motifs extracted by motif analysis was significantly reduced upon digestion for 16 h, suggesting that more nonspecific cleavages occurred (Supplemental Table S3 and S4).Fig. 4**Cleavage specificity of P13ase after digestion of HeLa cell extract for various times at 37 °C.**A, cleavage preferences visualized using iceLogo at 37 °C after digestion by P13ase for various times. Nonspecific MaxQuant search was performed to calculate cleavage preferences against the HeLa cells lysate dataset (three replicates combined). The y-axis is the percentage difference in frequency for an amino acid in the experimental set and the reference set. B, frequencies of P1 amino acids at cleavage sites at 37 °C after digestion by P13ase for various times. The bar graphs represent the mean of three replicates, and error bars represent the standard error. P13ase, protease type XIII.
Comparing the amino acid frequencies at the P1 position, peptides containing Leu, Lys, and Arg showed high frequencies, followed by those containing Asp, Phe, and Glu (Fig. 4B). At 37 °C, the frequency of peptides containing Leu decreased significantly with increasing digestion time (especially after 16 h), while the frequencies of peptides containing His and Pro showed a tendency to increase with increasing digestion time, especially at 16 h. Similar trends were observed at 4 °C and 25 °C, but the changes were more gradual at lower digestion temperatures (Supplemental Fig. S7). When the results at each temperature were compared at the digestion time giving the maximum number of identified peptides, the cleavage preferences and amino acid frequencies at the P1 position were almost the same (Fig. 5, A and B). These results suggest that digestion temperature and time affect the digestion progress but do not significantly change the cleavage preferences. Next, we analyzed the effects of digestion temperature and time on the frequency of missed cleavages. For Lys, Arg, Leu, and Glu at position P1, the percentage of peptides with missed cleavages decreased with increasing digestion temperature and time (Supplemental Fig. S8). On the other hand, the frequency of missed cleavages for Asp and Phe did not decrease significantly.Fig. 5**Cleavage specificity of P13ase for digestion times that maximized the number of peptides identified from HeLa cell extract at each temperature.**A, cleavage preferences visualized using iceLogo digestion time by P13ase for digestion times that maximized the number of peptides identified at each temperature. Nonspecific MaxQuant search was performed to calculate cleavage preferences against the HeLa cells lysate dataset (three replicates combined). The y-axis is the percentage difference in frequency for an amino acid in the experimental set and the reference set. (B) Frequencies of P1 amino acids at P13ase cleavage sites for digestion times that maximized the number of peptides identified at each temperature. The bar graphs represent the mean of three replicates, and error bars represent the standard error. P13ase, protease type XIII.
Next, artifacts of deamidation and succinimidation were evaluated. At all digestion temperatures and times, digestion with P13ase suppressed deamidation and succinimidation compared to trypsin (Supplemental Fig. S9). Peptides generated by self-digestion were also compared to those in the case of trypsin. The trypsin used in this study was methylated at lysine, which is believed to suppress autolysis to some extent. However, despite the protease concentration being higher than that of trypsin, the proportion of autolysis peptides for P13ase was mostly less than one-tenth of that for trypsin (Supplemental Fig. S10). However, it is currently unclear whether P13ase undergoes a change in selectivity due to self-digestion, analogous to the conversion of trypsin to show chymotrypsin-like activity.
Based on the above results, the digestion conditions for P13ase in bottom-up proteomics were selected as pH 3.5, 37 °C, and 60 min.
Characterization of Peptides and Proteins Identified by P13ase Digestion
Peptides and proteins generated by P13ase digestion under conditions optimized for bottom-up proteomics (pH 3.5, 37 °C, 60 min digestion) were characterized. We found that few peptides were commonly identified by P13ase and trypsin. This may be due to differences in the cleavage preferences of the proteases (Supplemental Fig. S11A). The physico-chemical properties including m/z, pI, molecular weight and gravy score for peptides identified by P13ase and trypsin were compared, and no significant difference was found (Supplemental Fig. S11B). The median relative SD of the quantitative reproducibility of the identified unique peptides was 10.5% for P13ase, which is sufficient for general bottom-up proteomics (Supplemental Fig. S11C). For sensitivity comparison, we extracted commonly identified peptides where either shorter fragments or extended sequences were not detected (390 peptides). Based on this subset, the peak intensity of P13ase digestion was 79% on average of that observed with trypsin digestion. On the other hand, most of the proteins identified by P13ase digestion overlapped with those identified by trypsin digestion (Supplemental Fig. S12A). There was no significant difference in the distribution of pI or molecular weight of the proteins identified by trypsin digestion and P13ase digestion (Supplemental Fig. S12B). The quantitative reproducibility of the identified proteins was also evaluated. The median relative standard deviation of quantified proteins with P13ase digestion was 9.6%, which is sufficient for general bottom-up proteomics (Supplemental Fig. S12C). Although the peptides identified by P13ase digestion and trypsin digestion shared few common peptides due to the different cleavage preferences, there was no obvious bias in the properties of the identified peptides and proteins, and quantitative reproducibility was sufficient. These results confirm the suitability of P13ase for application to bottom-up proteomics.
The number of peptides identified per protein was higher in the case of P13ase digestion than trypsin digestion (Supplemental Fig. S12D). Comparison of protein sequence coverage also showed that trypsin gave higher values for proteins with relatively low sequence coverage, but P13ase gave higher values for proteins with relatively high sequence coverage (Supplemental Fig. S12E). In particular, the number of proteins with sequence coverage greater than 90% was 37 in the case of P13ase digestion compared to only two for trypsin digestion. As a representative example, trypsin digestion of histone H2A type 1 (UniProt accession P0C0S8) yielded 46% coverage and failed to identify K/R-rich and K/R-poor regions (Supplemental Fig. S13). This was because the peptides produced by trypsin were either too long or too short to be identified. In contrast, P13ase digestion yielded 95% coverage. The reason for the high coverage was thought to be that P13ase does not necessarily cleave the C-terminal side of all Lys and Arg residues, allowing identification of K/R-rich regions. In addition, P13ase cleaved the C-terminal side of Leu, allowing identification of K/R-poor regions.
Reduction of Modification Artifacts
As mentioned above, the rapid digestion by P13ase under acidic conditions is expected to reduce the deamidation artifact, compared with trypsin digestion. Here, we analyze the results in more detail. First, the number of peptides containing deamidation modifications was less than 7% in P13ase digestion compared to trypsin digestion (Fig. 6A). Of the deamidation sites identified, only 11 were common (Fig. 6B), possibly because the sites identified only with P13ase were too long when contained in tryptic peptides. For quantitative comparison, we focused on one peptide from glyceraldehyde-3-phosphate dehydrogenase as a commonly identified peptide containing the same deamidation site. In trypsin digestion, the peak area indicated that approximately 1% of the peptide was deaminated (Fig. 6C). In contrast, P13ase digestion yielded a peak area for unmodified peptide similar to that of trypsin digestion, but the deamidated peptide was below the limit of quantitation. These results suggest that P13ase, which produces fewer artifacts during digestion, is appropriate for deamidation proteomics. Modifications other than Asn deamidation were similarly examined (Fig. 6D). For both Asn and Gln, P13ase digestion reduced the frequency of deamidation, succinimidation, and pyroglutamination. As for N-terminal cyclization of carbamidomethyl-cysteine, there was little difference between trypsin and P13ase. In contrast, N-terminal cyclization of Glu was less frequent with P13ase, and N-terminal cyclization of Asp was almost absent. These results suggest that P13ase digestion inhibits not only postcyclization hydrolysis of Asn, but also cyclization and postcyclization hydrolysis of other amino acids under acidic conditions. Thus, P13ase is advantageous in that it can be conducted under acidic conditions.Fig. 6**Reduction of artifacts of deamidation and cyclization by P13ase digestion of HeLa cell extract.**A, the number of identified Asn-deamidated peptides generated by trypsin (37 °C, 16 h) or P13ase (37 °C, 60 min) digestion. The bar graphs represent the mean of three replicates, and error bars represent the standard error. B, overlap of identified deamidation sites generated by trypsin and P13ase digestion. C, extracted ion chromatograms (XICs) and mass spectra of typical deamidated and unmodified peptides (glyceraldehyde-3-phosphate dehydrogenase: IISNASCTTNCLAPLAK). XIC and mass spectra were analyzed by FreeStyle (Thermo Fisher Scientific). D, ratios of the number of identified peptides deamidated or cyclized at each amino acid after digestion with trypsin (37 °C, 16 h) or P13ase (37 °C, 60 min). “N-term” in the bar graph means N-terminal modification of the identified peptide, and “internal” means internal modification of the identified peptide. Deamidation (NQ), cyclization (NQ), and peptide N-terminal cyclization (CDE) were added as variable modifications and searched individually. P13ase, protease type XIII.
Acidic conditions also inhibit the rearrangement of disulfide bonds (i.e., disulfide bond scrambling), which may enable the analysis of proteins without the need for alkylation by IAA or other alkylating agents. Since it has been reported that alkylating agents react with amino acids other than Cys, avoiding the use of alkylating agents may enable the capture of the true state of proteins. Additionally, while P13ase has not yet surpassed trypsin in terms of identification efficiency, there is still potential for improving identification and quantification performance through the application of data-independent acquisition and modifications to the open search method. These points will be investigated in future studies.
Conclusion
In this study, we investigated the potential utility of P13ase as an alternative or complementary protease to trypsin in bottom-up proteomics. We evaluated the effects of factors such as pH, temperature, and time of digestion on the cleavage efficiency and selectivity of P13ase, and identified pH 3.5, 37 °C, and 60 min as the optimal conditions for bottom-up proteomics. We also found that P13ase has co-occurrence motifs not only at P1 but also at other positions. Furthermore, we demonstrated that P13ase digestion can be achieved rapidly under low pH conditions, thereby minimizing deamidation and other artifacts. These results not only extend the applicability of P13ase in standard bottom-up proteomics workflows but also suggest its suitability for applications requiring artifact-free proteomics in order to target only intrinsic modifications.
Data Availability
The MS raw data and analysis files have been deposited with the ProteomeXchange Consortium (https://proteomecentral.proteomexchange.org) via the jPOST partner repository (https://jpostdb.org) (41) with the dataset identifier PXD064095. Annotation of fragment ions for MS/MS spectra can be confirmed with PDV (42).
Supplemental data
This article contains supplemental data.
Funding and additional information
This work was supported by JST SPRING (JPMJSP2123 to A. T.), JST ACT-X (JPMJAX2324 to K. O.), 10.13039/501100001691JSPS KAKENHI (23K18185, 23H04924 to Y. I., 24KJ1440 to A. T., and 25K18607 to K. O.), FY 2024 Researcher Exchange Program between 10.13039/501100001691JSPS and ETH to K. O., 10.13039/501100008884Nakatani Foundation to K. O., and the Research Foundation for Pharmaceutical Sciences to K. O.
Conflict of interest
R. T. is an employee of Shionogi & Co, Ltd. The other authors declare no competing interests.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Dick L.W.Jr.Mahon D.Qiu D.Cheng K.-C.Peptide mapping of therapeutic monoclonal antibodies: improvements for increased speed and fewer artifacts J. Chromatogr. B Analyt. Technol. Biomed. Life Sci.877200923023610.1016/j.jchromb.2008.12.00919112052 · doi ↗ · pubmed ↗
- 2Aebersold R.Mann M.Mass-spectrometric exploration of proteome structure and function Nature 53720163473552762964110.1038/nature 19949 · doi ↗ · pubmed ↗
- 3Müller J.B.Geyer P.E.Colaço A.R.Treit P.V.Strauss M.T.Oroshi M.The proteome landscape of the kingdoms of life Nature 58220205925963255545810.1038/s 41586-020-2402-x · doi ↗ · pubmed ↗
- 4Tsiatsiani L.Heck A.J.R.Proteomics beyond trypsin FEBS J.2822015261226262582341010.1111/febs.13287 · doi ↗ · pubmed ↗
- 5Sinitcyn P.Richards A.L.Weatheritt R.J.Brademan D.R.Marx H.Shishkova E.Global detection of human variants and isoforms by deep proteome sequencing Nat. Biotechnol.412023177617863695935210.1038/s 41587-023-01714-x PMC 10713452 · doi ↗ · pubmed ↗
- 6Giansanti P.Tsiatsiani L.Low T.Y.Heck A.J.R.Six alternative proteases for mass spectrometry-based proteomics beyond trypsin Nat. Protoc.11201699310062712395010.1038/nprot.2016.057 · doi ↗ · pubmed ↗
- 7Samodova D.Hosfield C.M.Cramer C.N.Giuli M.V.Cappellini E.Franciosa G.Pro Alanase is an effective alternative to trypsin for proteomics applications and disulfide bond mapping Mol. Cell Proteomics 192020213921573302019010.1074/mcp.TIR 120.002129 PMC 7710147 · doi ↗ · pubmed ↗
- 8Muriithi B.Ippoliti S.Finny A.Addepalli B.Lauber M.Clean and complete protein digestion with an autolysis resistant trypsin for peptide mapping J. Proteome Res.232024522152283939267810.1021/acs.jproteome.4c 00598 PMC 11536465 · doi ↗ · pubmed ↗
