Biochemical and structural characterization of a GNAT superfamily protein acetyltransferase from Helicobacter pylori
Venkatareddy Dadireddy, Amrendra Kumar, Sumith Kumar, Siddhartha P. Sarma, Pranjal Mahanta, Suryanarayanarao Ramakumar, Rao N. Desirazu

TL;DR
This study identifies a protein acetyltransferase in Helicobacter pylori that regulates DNA binding and natural transformation through acetylation.
Contribution
The discovery of the first protein acetyltransferase in H. pylori and its role in acetylating DNA processing proteins.
Findings
HP0935 acetylates DprA at K133, affecting DNA binding and natural transformation.
Crystal structures show conformational changes in HP0935 upon ACO binding.
Molecular dynamics suggest loop dynamics enable HP0935 to accept multiple substrates.
Abstract
Helicobacter pylori (H. pylori), a gastric pathogen with high genetic variability and a unique niche, causes peptic ulcers and gastric cancer. Natural transformation contributes to the genetic variability of H. pylori. To date, protein acetylation and the associated acetyltransferase(s) have not been reported in this bacterium. Here, we report protein acetylation in H. pylori and identify a putative protein acetyltransferase, HP0935, capable of acetylating amino acids and proteins, including DNA processing protein A (DprA), which is involved in natural transformation. HP0935 acetylates residue K133 in DprA, which is important for DNA binding, thus is likely to regulate natural transformation. We determined the crystal structures of HP0935 in its apo form and in complex with acetyl-coenzyme A (ACO) to 2.00 Å and 2.40 Å resolution, respectively. Structural analysis revealed a…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPeptidase Inhibition and Analysis · Helicobacter pylori-related gastroenterology studies · Biochemical and Molecular Research
Protein acetylation is an essential post-translational modification (PTM) that is conserved in all three domains of life. It has several roles in key cellular processes including, gene regulation (1), metabolic pathways (2, 3), protein stability (4), protein-protein interactions (5), enzymatic activity (6), and virulence (7). Essentially, two amino groups, namely the N-terminal (Nα) and lysine side chain (Nε) of proteins, undergo acetylation. Nα-acetylation is irreversible and is catalyzed by N-terminal acetyltransferases (NATs). Conversely, Nε-acetylation is reversible, and the acetylated and unacetylated states are maintained by lysine acetyltransferases (KATs) and deacetylases, respectively. KATs are also called protein acetyltransferases (PATs), especially in bacteria. Protein acetyltransferases have been grouped into three families based on sequence similarity, domain organization, and substrate specificity—GNAT (Gcn5-related N-acetyltransferase), MYST (human MOZ, yeast Ybf2/Sas3, yeast Sas2, and human Tip60), and p300/CBP (human p300 and human CREB-binding protein) (8).
These three families of N-acetyltransferases employ different mechanisms for acetylation. However, the core of the N-acetyltransferases, which binds ACO through its pyrophosphate and pantetheine moieties, remains conserved. ACO's adenine base and acetyl moieties have no significant contribution to its binding to the N-acetyltransferase (9). These properties enable N-acetyltransferases to bind various acyl-coenzyme A as well as coenzyme A (COA) within their structure. Eukaryotes possess all three families, whereas prokaryotes are known to have only GNAT superfamily members. However, GNAT substrates range from small molecules to proteins (10). In bacteria, the number of GNATs encoded by a given genome varies, from one in Borrelia burgdorferi to approximately 72 in Streptomyces coelicolor (11). Although GNAT superfamily members exhibit low sequence similarity, the structure of the GNAT fold remains conserved. Based on the number and organization of domains, prokaryotic GNATs are classified into five types: I–V. Types I to III comprise a regulatory domain alongside the catalytic GNAT domain, type IV comprises only the catalytic domain, and type V comprises an N-terminal and central GNAT domain followed by a C-terminal region. However, it remains unclear whether the central GNAT domain in type V is catalytically active (11).
Helicobacter pylori (H. pylori) is a Gram-negative, microaerophilic, and spiral-shaped bacterium residing in the stomach. H. pylori is the causative agent of peptic ulcers, chronic gastritis, and gastric cancers (12, 13). Over half of the world’s population is infected with H. pylori (14), with up to 90% of infected individuals developing no symptoms of infection (15). H. pylori exhibits significant genetic diversity among strains due to insertions, deletions, point mutations, and phase variation (16).
This genetic diversity enables H. pylori strains to survive in the acidic environment of the human stomach (17). Genetic recombination can occur between different strains within an infected host (18). Natural competence and an efficient recombination system are the key contributors to horizontal gene transfer-mediated genetic diversity in H. pylori. The genetic diversity in the virulence genes and the host context determine the clinical outcome of H. pylori infection (16).
Until recently, PTMs have not been reported in H. pylori. A major virulence factor, CagA, undergoes phosphorylation by host kinase upon secretion into the host gastric epithelium (19). A proteomic study identified the α-subunit of urease with N-terminal acetyl modification (20). A recent study reported how the acetylation of RNase J affects its oligomeric state and consequently its activity (21). Given the ubiquitous nature of protein acetylation, we initially assessed the prevalence of protein acetylation in H. pylori and subsequently identified the relevant protein acetyltransferase, HP0935. A biochemical study showed that HP0935, the only known protein acetyltransferase in H. pylori, can acetylate free lysine amino acid and lysine residues in proteins. Based on the three-dimensional (3D) structure and dynamics, we hypothesize that the flexibility in substrate-binding loops enables HP0935 to accommodate and acetylate multiple protein substrates.
Results and discussion
Acetylation in H. pylori
The high genetic diversity among H. pylori strains may lead to differential expression of proteins and their PTMs. To investigate protein acetylation, whole cell lysates of different H. pylori strains—26695, J99, I-10, San-64, and 93(1)—were analyzed by Western blotting using an anti-acetyl lysine antibody (see Experimental procedures). Protein acetylation in H. pylori was prominent and varied significantly among the strains studied (Fig. S1). In strain 26695, protein acetylation seems well distributed from lower to higher-molecular-weight proteins. Higher-molecular-weight proteins were predominantly acetylated in strains I-10, San-64, and 93(1). However, in strain J99, lower-molecular-weight proteins were primarily acetylated. These observations suggest that the acetylome could depend on the genetic composition of a given strain. Several acetylome studies in other bacteria indicate its role in regulating various biological processes (22, 23, 24, 25). Therefore, protein acetylation in H. pylori suggests potential regulatory roles in H. pylori physiology.
Identification of a putative N-acetyltransferase HP0935 from H. pylori
In prokaryotes, protein acetylation follows two distinct mechanisms: enzymatic acetylation catalyzed by protein acetyltransferases and non-enzymatic acetylation mediated by the highly unstable acetyl phosphate (AcP) and ACO. To identify putative protein acetyltransferase(s), a PSI-BLAST was carried out with the GNAT domain of known protein acetyltransferases from the organisms Salmonella enterica, Saccharomyces cerevisiae, Mycobacterium smegmatis, and Mycobacterium tuberculosis (see Experimental procedures). The search identified HP0935, an N-acetyltransferase domain-containing protein, from H. pylori strain 26695 with a significant expect (E)-value (Table S1). HP0935 shares the highest sequence identity of 27% (over 71 amino acids overlap) with protein acetyltransferases MSMEG_5458 from M. smegmatis. In addition to PSI-BLAST, InterProScan (26) identified two GNAT domain-containing proteins (InterPro ID: IPR000182), in strain 26695. These include UDP-4-amino-4,6-dideoxy-N-acetyl-β-L-altrosamine N-acetyltransferase (PseH, a pseudaminic acid biosynthesis protein) and N-acetyltransferase domain-containing protein, HP0935. Given that PseH is a small-molecule acetyltransferase involved in the biosynthesis of pseudaminic acid that is subsequently used to modify flagellin (27), we hypothesized that HP0935 is a probable protein acetyltransferase. HP0935 is a small (161 amino acids long) single GNAT domain-containing protein belonging to type IV protein acetyltransferases (11). For biochemical and structural characterization, the hp0935 gene was cloned, and the protein was expressed and purified to homogeneity (see Supporting information, Fig. S2A). The protein’s identity was confirmed by peptide mass fingerprinting (Fig. S2B).
Plausible substrate(s) of HP0935
To investigate the acetyltransferase activity of HP0935, thin-layer chromatography (TLC) and autoradiography-based acetylation assays were performed with amino acids lysine, arginine, serine, methionine, and proline as substrates (see Experimental procedures). HP0935 exhibited activity towards lysine, arginine, serine, and methionine but not proline (Fig. 1A). Unlike Salinococcus halodurans lysine Nα-acetyltransferase, which acetylates only basic amino acids (28), HP0935 acetylates both neutral (serine and methionine) and basic amino acids, except proline. To determine the kinetic parameters, initial velocity measurements were conducted using TLC followed by densitometric analysis (see Experimental procedures, Fig. S3). HP0935 exhibited Michaelis-Menten (MM) kinetics, with apparent MM constant (K_M_) values of 31.31 mM for lysine and 21.44 μM for ACO (Fig. 1B). A similar result, a high K_M_ value for lysine compared to ACO, was observed for S. cerevisiae L-lysine N6-acetyltransferase (29).Figure 1**Acetyltransferase activity of HP0935.**A, acetyltransferase activity of HP0935 was probed using lysine, arginine, serine, methionine, and proline as substrates. The acetylation reaction mix was separated using silica gel thin-layer chromatography and detected by autoradiography. ^14^C-ACO (^14^C-labeled acetyl-coenzyme A) was used as an acetyl group donor. Acetylation products are indicated with an asterisk (∗). D: heat-denatured HP0935. B, HP0935 showed Michaelis-Menten kinetics. The kinetic parameters (K_M_ and V_max_) for lysine and ACO were measured at 800 nM HP0935 concentration. Reaction rates were quantified from Figure S3. C, the group-specific (Nα or Nϵ) acetylation in lysine was tested using Nα-acetyl lysine (free ϵ-amino group) and Nϵ-acetyl lysine (free α-amino group). The acetylation reaction mix was separated using silica gel thin-layer chromatography and detected by autoradiography. ^14^C-ACO is used as an acetyl group donor. Acetylation products are indicated with an asterisk (∗). D, heat-denatured HP0935. D, Michaelis-Menten kinetics of HP0935 (800 nM) for substrates Nϵ-acetyl lysine and Nα-acetyl lysine. The error bars represent the SD.
To determine which of the primary amino groups, Nα or Nε, of lysine undergoes acetylation, a similar TLC-based assays were performed using Nε-acetyl lysine (free Nα group) and Nα-acetyl lysine (free Nε group) as substrates. HP0935 acetylated both the substrates, with a greater degree of acetylation observed for Nε-acetyl lysine, suggesting a preference for the Nα-amino group (Fig. 1C). Further assays with Nα-acetyl lysine and Nε-acetyl lysine also showed MM kinetics, revealing a significantly higher K_M_ for Nα-acetyl lysine (124.8 mM) compared to Nε-acetyl lysine (26.71 μM) (Fig. 1D). HP0935’s preference for the Nα-amino group was further confirmed by 1D-proton nuclear magnetic resonance (NMR) spectroscopy. Enzymatically acetylated lysine showed a chemical shift corresponding to that of Nα-acetyl lysine (Fig. S4), indicating HP0935 prefers the Nα-amino group over Nε. Our results show that HP0935 acetylates both Nα and Nε positions of free lysine but preferentially the former, suggesting that HP0935 could function as N-terminal and Nε-lysine protein acetyltransferase. A recent study reported a new family of plastid-localized GNATs with dual-specificity, i.e., N-terminal and Nε-lysine acetylation of proteins (30). These dual-specificity acetyltransferases reveal a new layer of complexity in the machinery controlling protein acetylation.
D-amino acids are essential components of the bacterial peptidoglycan cell wall (31). Interestingly, HP0935 was found to be stereospecific, as D-lysine was not acetylated (Fig. S5A). Notably, the S. cerevisiae D-amino acid N-acetyltransferase, encoded by HPA3 (a putative histone/protein acetyltransferase), is a unique enzyme that acts specifically on D-amino acids but does little on histones (32).
A PSI-BLAST search revealed spermidine-N(1)-acetyltransferase from Escherichia coli, Vibrio cholerae, and Salmonella typhi shares ∼30% sequence identity (∼60% query cover) with HP0935. However, HP0935 exhibited no enzymatic acetylation of spermidine, showing only minimal non-enzymatic activity, thereby ruling out its role as a spermidine-N(1)-acetyltransferase (Fig. S5B). There are reports of GNAT protein acetyltransferases acetylating the Nε-amino group of lysine in proteins and small molecules. Eis, a protein acetyltransferase from M. tuberculosis (Mtb), was found to acetylate aminoglycoside antibiotics, contributing to drug resistance, and also aryl alkylamines like histamine and octopamine (33, 34). The known protein substrates of Eis include a human dual specificity protein phosphatase 16 (DUSP16)/mitogen-activated protein kinase phosphatase-7 (MKP-7) (33), and a nucleoid-associated protein HU of Mtb (35). In another study, Hpa3, a histone acetyltransferase, was shown to acetylate D-amino acids as well as histone H4 (36). Based on these observations, HP0935 is likely a protein acetyltransferase with the ability to acetylate free amino acids.
HP0935 acetylates DprA
Further, acetylation of the Nα-amino group in proteins, i.e., N-terminal acetylation, was investigated using peptides as substrates. Among the known N-terminal acetyltransferases in prokaryotes, RimI, RimJ, and RimL acetylate ribosomal proteins S18, S5, and L7/L12, respectively. Acetylation of EF-Tu (elongation factor-Tu), ribosomal protein S7, and glutamate dehydrogenase by unknown acetyltransferases in various organisms has also been reported (37, 38). Several prokaryotic N-terminal acetyltransferases preferentially acetylate methionine (M), serine (S), alanine (A), or threonine (T) at the protein N-terminus (37). We, therefore, wanted to investigate the Nα-acetylation activity of HP0935 with peptides containing the first amino acid as M, S, or A. No HP0935 activity was observed with any of these peptides (data not shown). However, a detailed investigation of HP0935 for its N-terminal acetyltransferase activity is needed. It is possible that, apart from the first N-terminal amino acid, the structure of a protein substrate might be necessary for HP0935 activity. In parallel, the Nε-acetylation activity of HP0935 was also investigated using protein substrates.
From Figure S1, it was clear that several proteins of H. pylori are prominently acetylated. Our laboratory has been working on structure-function relationships in several H. pylori's proteins are involved in DNA repair, recombination, and natural transformation processes. Natural transformation is a tightly regulated, multistep process influenced by species-specific environmental conditions. It involves a series of proteins that facilitate DNA uptake and integration into the bacterial genome (39, 40, 41). Studies on Streptococcus pneumoniae suggest that DprA and RecA protect internalized ssDNA from bacterial nucleases. Inactivation of the dprA gene abolishes transformation (42). We previously demonstrated that H. pylori DprA facilitates natural transformation by binding to both ssDNA and dsDNA, acting as a physical barrier to nucleases, and stimulating DNA methylation to protect against the restriction-modification (R-M) barrier (43). However, the role of PTMs, such as acetylation, in natural transformation remains unexplored. To investigate this, the protein acetyltransferase activity of HP0935 against DprA (strain J99) as a substrate was probed by Western blotting using anti-acetyl lysine antibodies. Western blot analysis showed that HP0935 acetylates DprA in a concentration dependent mannar. However, minor non-enzymatic acetylation was observed due to ACO (Fig. 2A). To investigate the non-enzymatic acetylation of DprA, the protein was incubated with increasing concentrations of ACO and acetyl-phosphate. It was observed that both ACO and acetyl-phosphate acetylate DprA in a concentration-dependent manner. Further, surface plasmon resonance (SPR) spectroscopy confirmed a direct interaction between DprA and HP0935, with a dissociation constant (K_d_) of 3.28 μM (Fig. 2B). Mass spectrometry analysis of three acetylation reactions, R1 (only DprA), R2 (DprA + ACO), and R3 (DprA + ACO + HP0935), identified 19 out of a total of 24 lysine residues, with K110 and K133 uniquely acetylated in reaction R3 by HP0935 (Supporting Data 1, Fig. 2C). Notably, K133 in strain J99 corresponds to K137 in strain 26695, which interacts with ssDNA. A charge-reversal mutation (K137E) is known to reduce DNA binding activity (44). To assess the functional impact of K133 acetylation, two mutants were generated: K133R (charge-retaining) and K133Q (acetylation-mimicking). Electrophoretic mobility shift assay (EMSA) revealed that K133R retained WT-like DNA binding, whereas K133Q exhibited reduced activity (Fig. 2D). This suggests that K133 acetylation affects DprA’s ability to bind DNA, thereby potentially influencing H. pylori natural transformation. Given that protein acetylation serves regulatory roles in other bacteria, these findings highlight a possible role for acetylation in *H. pylori'*s genetic adaptability and transformation efficiency.Figure 2**Acetylation of DprA by HP0935.**A, acetylation of H. pylori DprA with increasing concentration of GST-tagged HP0935 (1–8 μM). The reactions were separated on SDS-PAGE and acetylated DprA (^Ac^DprA) was probed by western blotting using anti-acetyl lysine antibodies (top panel). Ponceau stained blot showing protein loadings of GST-tagged HP0935 and DprA (bottom panel). B, interaction of HP0935 with DprA shown by surface plasmon resonance (SPR). The sensorgram is shown at different concentrations of HP0935. C, Venn diagram of acetylated lysine residues of DprA detected in acetylation reactions (R1: DprA, R2: DprA + ACO, or R3: DprA + ACO + HP0935) by mass spectrometry. D, DNA binding activity of DprA wild type (WT) and its point mutants (K133R, and K133Q) checked by electrophoretic mobility shift assay (EMSA). A 53-nucleotide single-stranded DNA (ssDNA) was used as a substrate. 0: control (no DprA), asterisk (∗): free ssDNA, and hash (#): nucleoprotein complex.
Overall structure of apo- and ACO-HP0935
HP0935 exists as a monomer under physiological conditions (see Supporting information, Fig. S6, A and B). To gain structural insights, the crystal structures of HP0935 in its apo and ACO-bound forms were determined to be 2.00 Å and 2.40 Å resolution, respectively (see Experimental procedures, Supporting information, Table S2). The crystal structure of HP0935 revealed a conserved GNAT fold, characteristic of the GNAT superfamily (45, 46). The structure of HP0935 is composed of seven β-strands and four α-helices arranged in a topology—β1· α1· α2· β2· β3· β4· α3· β5· α4· β6· β7 (Figs. 3A and S7A). All β-strands form a central, twisted mixed β-sheet splayed between strands β4 and β5, which forms a wedge-shaped cleft for ACO and substrate binding. The four α-helices flank the β-sheet on both sides (Fig. 3A). Strand β4 contains a β-bulge structure that disrupts the β4 strand. This β-bulge structure is observed in many other GNAT family enzymes (45). The β-bulge can form, but not always, an oxyanion hole that stabilizes the tetrahedral reaction intermediate formed during catalysis (47, 48). Another feature of GNAT enzymes, the pyrophosphate-binding loop (P-loop), corresponding to the residues 88-QSQGLG-93 (Q-x-x-G-x-G) in HP0935, is located at a canonical position—a region spanning the c-terminal part of β4-α3 loop and the N-terminus of the α3 helix (Fig. 3A). ACO is bound in a tunnel formed by the α1-α2 loop, β4, β4-α3 loop, α3, β5, β5-α4 loop, and α4 (Fig. 3B). The conformation of ACO and its interactions are typical of other GNAT members (49). While the ACO is bound in a bent (L-shaped) conformation (Figs. 3C and S7B), the pyrophospha forms hydrogen bonds with residute moiety is accommodated by the P-loop, and the pantotheine moietyes on the β4 side of the wedge-shaped cleft (residues F81 and I83), mimicking the hydrogen bonding pattern of a parallel β-sheet structure. The adenine moiety of ACO is stabilized by hydrogen bonding with side chains of residues S89 and N125. The carbonyl oxygen of ACO’s acetyl group forms a hydrogen bond with the backbone amide (NH) of the β-bulge residue F81, which may stabilize the tetrahedral intermediate during the acetylation reaction (Figs. 3C and S8, A and B).Figure 3**Crystal structures of apo- and ACO-HP0935.A, Apo-HP0935 structure showing typical GNAT domain. The secondary structure elements are labeled: α1-α4 and β1-β7 (see Fig. S7A). The mixed β-sheet splayed between the β4 and β5 strands (indicated by a *grey triange*). The P-loop and the β-bulge are shown in pale green and dodger blue colors, respectively. B, ACO-HP0935 structure. ACO (grey sticks) binds in the V-shaped cleft C, interactions between HP0935 and ACO (see Fig. S8). D, conformational changes observed in α1-α2 and β6-β7 loops upon ACO binding. Superposed structures of apo- and ACO-bound HP0935 crystal structures (middle panel). ACO is rendered as spheres. In the ACO-HP0935 crystal structure, the α1-α2 loop undergoes a conformational transition where it is moved toward ACO. The α1-α2 loop conformation in apo-HP0935 structure is called distal as it is away from the ACO-binding cleft, whereas the conformation in ACO-HP0935 structure is called proximal as it is close to ACO. This conformational transition in the α1-α2 loop brings N25 and F29 close to ACO by 6.00 Å (distance between Cγ-atoms of N25) and 9.44 Å (distance between the phenyl ring centroids of F29), respectively, as shown by broken arrows (right panel). The disordered β6-β7 loop (red broken line) attains β6-β7 hairpin structure in ACO-bound structure (left panel). The missing residues of α1-α2 and β6-β7 loops in apo-HP0935 are shown as broken lines (red) in (A and D).
Structural basis for ACO-induced conformational change in loops
Crystallization conditions A and B produced triangular-shaped apo-HP0935 crystals. However, in the presence of ACO, only condition B yielded crystals with an altered, needle-like, morphology (Experimental procedures, Fig. S9). This change in crystal morphology suggested a potential alteration in crystal packing, likely induced by structural changes in HP0935 upon ACO binding. To investigate this, the crystal structures of apo- and ACO-HP0935 were compared. The apo- and ACO-bound HP0935 crystallized in distinct space groups, C2 and P6_1_, respectively. A key observation was the partial disorder in the α1-α2 and complete disorder in the β6-β7 loop region in the apo-HP0935 structure, as evident from missing electron density. In contrast, these regions were well-ordered in the ACO-HP0935 structure, with a significant conformational change in the α1-α2 loop. In apo-HP0935, the α1-α2 loop adopts an extended, distal conformation, positioned away from the ACO-binding cleft. In contrast, in ACO-HP0935, the loop shifts toward ACO, adopting a proximal conformation (Fig. 3D). Despite a low overall Cα RMSD between the two structures (1.95 Å), the α1-α2 loop displayed high residue-wise RMSD values (Fig. S10), with residues N25 and F29 shifting towards ACO, by 6.00 Å and 9.44 Å, respectively (Fig. 3D). This transition constricts the ACO-binding cleft, likely stabilizing ACO and facilitating acetyl transfer. A similar observation has been reported in human Nα-acetyltransferase 60 (Naa60), where an F34A mutation in the α1-α2 loop abolished enzymatic activity (50).
Further, analysis of ACO/COA-bound structures revealed that a few GNAT members exhibited a similar distal-to-proximal loop transition upon ACO/COA binding (Table S3). In the aminoglycoside 6-N-acetyltransferase and the FdhC (sugar N-acetyltransferase), the distal α1-α2 loop is stabilized by crystal contacts in the apo state. However, in the presence of COA, the loop transitions to the proximal conformation and lacks crystal contacts (51, 52). In Rv0802c (acetyl- and succinyl-CoA transferase), the loop remains disordered in the apo state but adopts an ordered proximal conformation upon succinyl coenzyme A binding (53), suggesting ligand-induced stabilization. In HP0935, the α1-α2 loop is involved in crystal contacts in both conformations. In the distal conformation, two to three residues at the C-terminus of the loop are disordered, probably, due to the lack of stabilizing contacts. In contrast, in the proximal conformation, the loop is fully ordered and stabilized by interactions with the β6-β7 hairpin and symmetry-related molecules (Fig. S11, A and B). Given the distinct crystallization space groups and supporting evidence from the literature, it is likely that ACO binding induces the loop transition, which could further be stabilized by crystal packing. The effect of ACO on the distal conformation was further assessed using molecular dynamics simulations. The simulations showed that ACO affects the dynamics of the proximal conformation, and to some extent, the distal state explores a conformational space close to proximal conformation in the presence of ACO (see Supporting information, Fig. S12).
While the α1-α2 loop transition in HP0935 appears to be ACO-induced, the ordering of the β6-β7 loop is likely driven by crystal packing. In apo-HP0935, this region remains disordered, exposed to solvent channels, whereas in ACO-HP0935, it is ordered as a β-hairpin and forms part of a β-sheet alongside β1, β2, and β3 from a symmetry-related molecule. These findings suggest that the α1-α2 loop transition is primarily ACO-induced, possibly contributing to enzymatic activity. Ligand/substrate-induced conformational changes are widely recognized as critical for enzymatic function (54, 55).
Acceptor substrate-binding region of HP0935
A structural similarity search using the DALI server (56) identified HP0935 structural homologs, including putative and small-molecule (57) acetyltransferases, as well as NATs (37, 58) and C-terminal lysine PAT (59). Despite low sequence conservation, structural similarities among GNAT family members are primarily due to the conserved core structure of GNAT fold (46). Structural differences among GNAT family members primarily stem from the α1-α2 loop, β6-β7 loop/hairpin, and the lengths and orientations of α1 and α2 helices—regions crucial for substrate binding (46, 49). These features define the structural and physicochemical properties of the substrate-binding region. However, no closely related structural homologs of HP0935 were found, with respect to its substrate-binding region.
Since HP0935 acetylates peptidyl Nε-lysine (Fig. 2C), its substrate-binding region was compared to those of known prokaryotic PAT structures. PATs targeting acetyl-coenzyme A synthetase (ACS) contain an additional α-helix in the α1-α2 loop, forming a narrow tunnel that accommodates only the peptidyl-lysine side chain (Fig. S13, A–C) (60, 61, 62). A similar architecture is present in C-terminal lysine PAT (Fig. S13D) (59). In the PatA-Acs complex, an extensive interaction interface is observed outside the substrate tunnel (60). Conversely, E. coli YajB has a shorter α1-α2 loop, with parallel α1 and α2 helices forming a narrow binding region (Fig. S13E) (63). Rv0802c, from Mtb, features a wider, cleft-like substrate-binding region within its tetrameric structure (Fig. S13F) (53, 64), whereas Eis from Mtb contains a substrate-binding channel between two domains (Fig. S13G) (33). In contrast, HP0935 exhibits a distinct substrate-binding region formed by an extended α1-α2 loop and β6-β7 hairpin that differs from other bacterial PATs (Fig. S13H).
The substrate-binding α1-α2 and β6-β7 loops undergo substrate-induced conformational changes (65, 66, 67). Root mean square fluctuation (RMSF) analysis of MD simulation data (systems I and II) revealed high flexibility in these loops, with a slight reduction in the presence of ACO (Fig. 4, A and B). These observations suggest that the conformational flexibility of these loops may contribute to the structural adaptability of the substrate-binding region, potentially facilitating HP0935’s ability to interact with diverse substrates. Loop dynamics influence the catalytic efficiency, specificity, and promiscuity of enzymes (66, 68, 69). Given that HP0935 is the only known GNAT protein acetyltransferase present in H. pylori, and the prevalence of protein lysine acetylation in this organism (Fig. S1), we hypothesize that the observed loop flexibility may enable HP0935 to acetylate multiple substrates, potentially influencing bacterial physiology. However, the role of loop dynamics in the broad-substrate specificity of HP0935 remains to be evaluated experimentally.Figure 4**Dynamics of α1-α2 and β6-β7 loops.**A, backbone root mean square fluctuations (RMSF) of HP0935 simulation systems I and II. The vertical filled areas represent the α1-α2 loop (golden yellow) and β6-β7 loop (peach puff). The span of secondary structure elements (β-strand: gray, α-helix: green) along the residue index is shown at the top of the plot. B, representative structures from clustered simulation trajectories showing the widespread conformations of α1-α2 and β6-β7 loops.
Catalytic mechanism
Acetylation of an amino acid or a protein requires deprotonation of the Nα/Nε-amino group before a nucleophilic attack on the acetyl group of ACO. Previous studies have shown that the deprotonation of the amino group is facilitated by a conserved general base residue(s) or a conserved catalytic water molecule, while a general acid protonates the leaving group, COA-S^−^ (45). Structural analysis of HP0935 identified E77 and H115 as potential general base residues and Y127 as a general acid. A conserved water molecule, hydrogen-bonded to L78, H115, and E77, was observed (Fig. 5A).Figure 5**The active site of HP0935.**A, HP0935 contains two putative general base residues (E77 and H115) and a general acid (Y127) at the catalytic site. A conserved water molecule (WAT) is shown for apo- and ACO-HP0935 crystal structures at the catalytic site. This water molecule is coordinated with L78 and H115 backbone atoms, and the E77 side chain by hydrogen bonding. The electron density (2F_o_ - F_c_) map, contoured at 1.5 σ level, is shown for the conserved water molecule. B, acetyltransferase activity of HP0935 wild type (WT) and its variants (E77Q, H115A, and Y127F) was tested with lysine as a substrate. The acetylation reaction was separated by silica gel TLC and imaged by autoradiography. ^14^C-ACO: ^14^C labeled acetyl-coenzyme A, Ac-lysine: acetylated lysine, and D: heat-denatured HP0935.
The role of these putative catalytic residues was investigated using lysine as a substrate and HP0935 point mutants: E77Q, H115A, and Y127F. Circular dichroism confirmed the structural integrity of these point mutants (data not shown). Activity assays showed a slight reduction in activity for the E77Q mutant, while H115A retained WT-like activity. A similar observation was made for E. coli PatZ where an E809 point mutation showed only a 20% reduction in activity (70). In the case of yeast GCN5 acetyltransferase, the general base is indispensable (71). In contrast, Y127F showed negligible activity (Fig. 5B). This suggests that for lysine substrate, the general base is not essential for acetylation, likely due to the following reasons: at assay pH 8.0, approximately 10% of lysine’s Nα-amino group exists in a deprotonated state which serves as a substrate and the equilibrium maintains the deprotonated state of lysine during acetylation. Alternatively, HP0935 could utilize a conserved water molecule for lysine deprotonation. C-terminal lysine protein acetyltransferase does not contain a general base residue at the active site; however, a conserved water molecule present at the active site was proposed to do this job (59).
In summary, we identified and cloned a putative acetyltransferase gene, hp0935. The purified HP0935 protein was active against amino acids and protein substrate. We showed that HP0935 acetylates DprA, in vitro, thus regulating its DNA-binding properties, and hence possibly natural transformation. The 3D structure of HP0935 was determined in the apo- and ACO-bound forms. Molecular dynamics simulation data showed that the loops at the substrate binding region are highly dynamic. The conformational flexibility of these loops is likely to facilitate the acetylation of multiple substrates by an induced fit, conformational selection, or a combination of both mechanisms. In conclusion, this study clearly shows that HP0935 is likely the only protein acetyltransferase that acetylates free amino acids (Nα and Nε) and protein substrates (Nε-lysine), highlighting the robustness of acetylation in the human pathogen H. pylori.
Experimental procedures
Identification of putative N-acetyltransferase
A putative N-acetyltransferase from H. pylori was identified using PSI-BLAST (http://www.ncbi.nlm.nih.gov/BLAST/). The GNAT domain of known protein acetyltransferases from organisms S. enterica, S. cerevisiae, M. smegmatis, and M. tuberculosis were used as queries against the “non-redundant protein sequences (nr)” database. Hits from the first round of searches with an e-value <0.005 were included in the second iteration. Each query sequence underwent six iterations in total.
Cloning and mutagenesis
The gene sequence of hp0935 was procured from the annotated whole-genome sequence of H. pylori 26695 strain from JCVI (J. Craig Venter Institute). The gene was amplified from the genomic DNA of H. pylori 26695 strain by polymerase chain reaction (PCR) using primers HP0935-His-FP and HP0935-His-RF for His-tagged HP0935 and primers HP0935-GST-FP and HP0935-GST-RP for GST-tagged HP0935 (Table S4). The restriction enzyme-digested PCR products were ligated into the pET15b and pGEX_4T2 vectors linearized with the same enzymes, generating the recombinant DNA constructs pET15b-hp0935 and pGEX_4T2-hp**0935, respectively. For crystallization purposes, the hp0935 gene was amplified from the pET15b-hp0935 construct and cloned into pNIC28-Bsa4 plasmid by ligation-independent cloning to generate the pNIC28-hp**0935 construct.
Site-directed mutagenesis, a technique described earlier (72), was used to replace the amino acid of interest in the protein. To replace the amino acid E77 with glutamine, H115 with alanine, and Y127 with phenylalanine in HP0935 protein, sense and antisense primers (Table S4) were used against the template pGEX_4T2-hp**0935 clone for PCR. Q5 polymerase was used for the PCR amplification of the gene. The mutagenic primers introduced a suitable restriction enzyme recognition site. The mutations were further confirmed by DNA sequencing.
Peptide mass fingerprinting
Peptide mass fingerprint analysis of trypsin-treated HP0935 was carried out as described (73). MALDI-MS data were acquired on an ultraflex TOF/TOF spectrometer (Bruker Daltonics and Bremen) equipped with 50 Hz pulsed nitrogen laser (11/4337 nm), operated in positive ion reflection mode using a 90-ns time delay, and a 25 kV accelerating voltage. Samples were prepared by mixing an equal volume of peptide (0.5 ml) with matrices dihydroxybenzoic acid/α-cyano-4-hydroxycinnamic acid saturated in 0.1% trifluoroacetic acid and acetonitrile (1:1 v/v). Masses below 500 m/z were not considered because of interference from the matrix. Using Mascot software, peptide mass fingerprints obtained by MALDI-TOF MS were used to search NCBInr, SWISS-PROT, and MSDB databases (http://www.matrixscience.com). The parameters used for the search were as follows: peptide mass ranged from 800 to 3500 U; modifications were allowed for carboxyamidomethylation of cysteine and oxidation of methionine; one missed cleavage site was allowed; mass accuracy was 1 U; restriction was placed on Bacteria (Eubacteria). Top-scoring hits were analyzed for sequence coverage (>50%) and number of matched peptides.
Acetylation assays
Acetyltransferase activity of WT and mutant HP0935 was assayed with amino acids (methionine, lysine, arginine, or proline), acetylated lysine (Nα-acetyl lysine or Nε-acetyl lysine), and small peptides as substrates. A reaction mixture (10 μl) containing 12 mM substrate, ^14^C-ACO (^14^C-labeled acetyl-coenzyme A), and 2 μM HP0935 in an assay buffer (50 mM Tris-HCl, pH 8.0, 50 mM NaCl) was incubated at 30 °C for 2 h, and the products were resolved by silica gel TLC using butanol, formic acid, and water mixture (5:3:1) as mobile phase. TLC silica plates were dried, exposed overnight to a Phosphor-imaging cassette, and imaged using Fujifilm reader. Acetylation products were quantified by densitometry using IMAGEGAUGE V2.54 software (Fujifilm).
To determine the kinetic parameters for lysine, 800 nM HP0935 was incubated with 50 μM ^14^C-ACO and varying concentrations of lysine (1–50 mM) in an assay buffer (50 mM Tris-HCl, pH 8.0, 50 mM NaCl) at 30 °C for 20 min. The reaction mixture was resolved, and the products were quantified as described above. Similarly, kinetic parameters for ACO were determined similarly at 50 mM lysine and varying [^14^C]-ACO concentrations (5–50 μM). The data were fit to Michaelis-Menten equation.
In vitro acetylation of DprA was carried out in 50 mM Tris-Cl (pH 8.0) buffer containing 50 mM NaCl, 200 μM ACO, and 4 μM DprA at 30 °C for 2 h. The reaction was initiated by the addition of an increasing concentration of HP0935 (1–8 μM). Acetylated reactions were separated on 0.1% SDS - 10% PAGE and transferred onto the nitrocellulose membrane. The membrane was blocked with 5% w/v skimmed milk for 2 h at room temperature followed by incubation in 5% BSA in PBST (phosphate-buffered saline with 0.05% Tween 20) containing anti-acetyl lysine antibody (1:1000 dilution) from Abcam at 4 °C for overnight. The blot was washed with PBST and further incubated in PBST containing HRP-conjugated anti-rabbit antibody (1:10,000 dilution). The blots were then developed with an ECL kit (Merck Life Science) using a LAS3000 image analyzer.
In vitro acetylation of DprA and mass spectrometry analysis
All three in vitro acetylation reactions were set up in a reaction buffer containing 50 mM Tris-Cl (pH 8.0) and 50 mM NaCl: (i) unacetylation control (R1: DprA only), (ii) non-enzymatic acetylation (R2: DprA + 200 μM ACO), and (iii) enzymatic acetylation (R3: DprA + 200 μM ACO + 8 μM HP0935). All reactions were incubated at 30 °C for 2 h. Recombinant His-tagged DprA (R1 – R3) and GST-tagged HP0935 (R3) proteins purified from E. coli were used. DprA from these reactions was purified using Ni-NTA resin. The binding of DprA to Ni-NTA resin was performed in reaction buffer (50 mM Tris-HCl (pH 8.0), 50 mM NaCl and 200 μM ACO). The slurry was washed with the reaction buffer containing 10 mM imidazole. His-tag DprA was eluted from Ni-NTA resin using the reaction buffer containing 300 mM imidazole. Purified protein was dialyzed in 50 mM (NH_4_)HCO_3_ for 8 h at 4 °C. DprA protein was digested with mass spectrometry-grade trypsin (NEB #8101S) at a 1:50 ratio (w/w) for 16 h at 37 °C/300 RPM. The digested peptide was purified using C18 resin Zip/Tip. Purified peptides were analyzed by LC-MS/MS using a Q Exactive Plus mass spectrometer (Thermo Scientific) coupled with a nano-HPLC system. Peptides were separated on a nano-HPLC column over a 100-min run time. The mobile phase consisted of A: 0.1% formic acid in water, and B: 0.1% formic acid in acetonitrile. A gradient of 25% B was achieved over 100 min at a flow rate of 300 nl/min. Mass spectrometry data were acquired with an MS1 resolution of 70,000 and an MS2 resolution of 17,500. Peptide quantification was performed using a Tandem Mass Tag (TMT). Raw mass spectrometry data were processed using Proteome Discoverer 2.1 (Thermo Scientific) to generate peak lists. Peptides were identified against a FASTA database derived from the H. pylori 26695 strain proteome. Search parameters included fixed modification of carbamidomethylation and variable modification of oxidation, with up to two missed cleavages allowed due to trypsin specificity. Precursor and product ion mass tolerances were set to 1.2 Da. A false discovery rate (FDR) of 0.05 was applied to filter peptide identifications, ensuring high-confidence results. The analysis identified 45 unique and razor peptides, reflecting the protein’s acetylation profile. The experiment was conducted with two biological replicates and two technical replicates. The data were provided in Supporting Data 1 (Excel file).
NMR spectroscopy
Samples for NMR spectroscopy were prepared by aliquoting 600 μl of the reaction products (vide supra) and adding 60 μl of D2O (99.9%) as the lock solvent. All NMR spectra were acquired on a Bruker 700 MHz NEO NMR spectrometer equipped with a room temperature triple resonance probe fitted with a single (z-axis only) pulse field gradient accessory. NMR spectra were acquired at 25 °C. One-dimensional NMR spectra were acquired using the excitation sculpting pulse scheme for solvent suppression. A total of 8192 complex points were sampled over a spectral width of 9000 Hz. Spectra were exponentially multiplied with a line broadening parameter of 5 Hz, Fourier transformed, and analyzed using Topspin 4.0.9 software.
Electrophoretic mobility shift assays
The DNA-binding activity of DprA was estimated in 1X TAM reaction buffer (50 mM Tris-Cl pH 7.4, 50 mM sodium acetate, 10 mM magnesium acetate, and 1 mM dithiothreitol (DTT)) containing 1 nM of 32P labeled ssDNA (53Forward, Table. S4). An increasing concentration of DprA was added to start the reaction (100–1000 nM) in a 20 μl reaction volume. The reaction was then incubated at 4 °C for 30 min, and the reaction was resolved on 8% native-PAGE in 1X TAME buffer (6 mM Tris-Cl (pH 7.8), 10 mM sodium acetate, 4 mM magnesium acetate, and 1 mM ethylenediaminetetraacetic acid (EDTA)) at 100 V for 8 h, 4 °C. The gel was vacuum dried at 80 °C for 45 min after transferring onto Whatman 3 mm paper and imaged using Phospho-Imager (Fuji FLA-9000).
Surface plasmon resonance
The protein-protein interaction of DprA and HP0935 was investigated by surface plasmon resonance (Biacore3000 from GE Healthcare Life Science) as described earlier (43). Briefly, the DprA protein was immobilized on the surface of the CM5 sensor chip (GE Healthcare Life Science). Individual proteins’ interaction with the CM5 chip surface resulted in a ∼500 RU/flow cell response signal. One of the flow cells was treated with ethanolamine alone to score the non-specific binding. An interaction study was performed in 50 mM Tris-Cl (pH7.4) buffer at 25 °C with a flow rate of 20 μl/min. Varying concentrations of HP0935 were passed over the flow cell surface, followed by a dissociation period of 300 s. The regeneration of the flow cell surface was performed by 50 mM NaOH. The blank reference flow cell corrected all the protein-protein interaction data for non-specific binding. Each experiment was repeated twice to ensure reproducibility. The standard Langmuir binding equation was a good fit for the association and dissociation phases.
Crystallization
Crystallization trials were carried out with tag-removed HP0935 at 22 °C using the sitting-drop vapor diffusion method. We were able to grow apo-form HP0935 crystals in two different crystallization conditions (condition A: 0.2 M potassium thiocyanate, 0.1 M Bis-Tris propane pH 6.5, and 20% (w/v) polyethylene glycol 3500 and condition B: 0.02 M magnesium chloride hexahydrate, 0.1 M HEPES pH 7.5 and 22% (w/v) poly(acrylic acid sodium salt) 5100) with identical morphology (triangular plate-shaped, Fig. S9, A and B). Final diffraction quality crystals were obtained at 4 °C in a crystallization drop set up with 1 μl of protein (5 mg/ml) solution and 1 μl of reservoir solution in 2 days. For growing cofactor-bound (ACO) crystals, protein at 16 mg/ml concentration was incubated with five-fold molar excesses of ACO on ice for 1 h. Crystallization was carried out at 4 °C similarly as described above. Needle-like crystals were obtained exclusively in condition B (Fig. S9C).
Data collection and structure solution
Anomalous datasets were collected on apo-protein crystals and native datasets were collected on apo- and ACO-bound-protein crystals. To collect anomalous data, crystals were soaked in a reservoir solution containing 400 mM of sodium iodide for 10 min. Both anomalous and native data sets were collected (1° slices covering 360°) at a home source (Bruker Microstar, Cu Kα λ = 1.5418 Å). Diffraction data for ACO-bound protein crystals were collected at ID29 (ESRF, Grenoble) at a wavelength of 1.0723 Å, with 0.1° slices covering 360°. Prior to data collection, all the crystals were cryo-protected with 20% (v/v) ethylene glycol and flash-frozen in liquid nitrogen. All datasets were collected at 100 K. Anomalous and native datasets for apo crystals were processed using Burker’s PROTEUM3 suite, to resolutions of 2.42 Å and 2.00 Å, respectively. The ACO-bound dataset was processed using XDSAPP3.0 (74) to 2.40 Å resolution. Experimental phases were determined from the anomalous dataset using the iodine SAD phasing method with the CRANK2 pipeline (75). A partial model obtained from CRANK2 was built to completion using Buccaneer (76). High-resolution structures of both apo and ACO-bound forms were determined from the native datasets by molecular replacement (MR) method using Phaser-MR (77). Model building was completed by iterative manual building with Coot (78) and refinement with REFMAC5 (79). Model quality was assessed using MolProbity (80).
Miscellaneous
All data plots were generated using GraphPad Prism and Python plotting library, Matplotlib (81). The structures and trajectories were visualized with ChimeraX (82). All artwork was prepared using Inkscape (https://inkscape.org).
Statistical analysis
All enzyme kinetic data analyses, including both linear and non-linear regressions, were performed with GraphPad Prism version 8.0. Lineweaver-Burke plot linear regressions for all enzyme kinetic data have a p-value <0.005. Unless otherwise noted, all enzyme activity data were the average of at least duplicate determinations. The error bars represent the standard deviation (SD).
Data availability
The coordinates for apo-HP0935 and ACO-HP0935 structures have been deposited in the Protein Data Bank under the accession codes 8IYM and 8IYO, respectively.
Supporting information
This article contains supporting information (43, 45, 78, 83, 84, 85, 86, 87, 88, 89, 90, 91, 92, 93, 94).
Conflict of interest
The authors declare that they have no conflicts of interest with the contents of this article.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Verdone L.Agricola E.Caserta M.Di Mauro E.Histone acetylation in gene regulation Brief. Funct. Genomic. Proteomic.520062092211687746710.1093/bfgp/ell 028 · doi ↗ · pubmed ↗
- 2Wang Q.Zhang Y.Yang C.Xiong H.Lin Y.Yao J.(2010) Acetylation of metabolic enzymes coordinates carbon source utilization and metabolic flux Science 32719791004100710.1126/science.1179687 PMC 418314120167787 · doi ↗ · pubmed ↗
- 3Guan K.L.Xiong Y.Regulation of intermediary metabolism by protein acetylation Trends Biochem. Sci.3620111081162093434010.1016/j.tibs.2010.09.003PMC 3038179 · doi ↗ · pubmed ↗
- 4van de Kooij B.de Vries E.Rooswinkel R.W.Janssen G.M.C.Kok F.K.van Veelen P.A.N-terminal acetylation can stabilize proteins independent of their ubiquitination Sci. Rep.1320231133700545910.1038/s 41598-023-32380-3PMC 10067848 · doi ↗ · pubmed ↗
- 5Choudhary C.Kumar C.Gnad F.Nielsen M.L.Rehman M.Walther T.C.Lysine acetylation targets protein complexes and co-regulates major cellular functions Science 32520098348401960886110.1126/science.1175371 · doi ↗ · pubmed ↗
- 6Starai V.J.Escalante-Semerena J.C.Identification of the protein acetyltransferase (Pat) enzyme that acetylates acetyl-Co A synthetase in Salmonella enterica J. Mol. Biol.3402004100510121523696310.1016/j.jmb.2004.05.010 · doi ↗ · pubmed ↗
- 7Ren J.Sang Y.Lu J.Yao Y.F.Protein acetylation and its role in bacterial virulence Trends Microbiol.2520177687792846278910.1016/j.tim.2017.04.001 · doi ↗ · pubmed ↗
- 8Drazic A.Myklebust L.M.Ree R.Arnesen T.The world of protein acetylation Biochim. Biophys. Acta 18642016137214012729653010.1016/j.bbapap.2016.06.007 · doi ↗ · pubmed ↗
