DERA-Catalyzed Chemoenzymatic Access to Nucleobase-Substituted Candidate Statin Precursors
Romina Fernández Varela, Eman Abdelraheem, Lautaro Giaimo, Luciano Cortés, Leticia Lafuente, Ana Laura Valino, Peter-Leon Hagedoorn, Ulf Hanefeld, Adolfo Iribarren, Elizabeth Lewkowicz

TL;DR
Researchers developed a new method using an enzyme to create statin precursors with nucleobase substitutions, opening up possibilities for new drug candidates.
Contribution
A novel chemoenzymatic approach using a DERA variant to synthesize nucleobase-substituted statin precursors is introduced.
Findings
A C49M variant of PaDERA catalyzed aldol additions with nucleobase aldehydes to form lactols.
Nucleobase-derived aldehydes were accepted in DERA-catalyzed reactions for the first time.
The method provides structurally novel statin side-chain precursors for drug development.
Abstract
Aldolases are powerful biocatalysts for the stereoselective formation of carbon–carbon bonds and are widely used in the synthesis of chiral intermediates for pharmaceutical applications. Among them, 2-deoxyribose-5-phosphate aldolase (DERA) has been extensively exploited for the preparation of the conserved side chain of statins. In this work, we report a novel chemoenzymatic approach for the synthesis of nucleobase-substituted lactol products as potential precursors of new statin analogues. A C49M variant of DERA from Pectobacterium atrosepticum (PaDERA C49M) was employed to catalyze sequential aldol additions using aldehyde-functionalized nucleobases as non-natural electrophilic substrates. The formation of nucleobase-containing lactols was confirmed, demonstrating for the first time the acceptance of nucleobase-derived aldehydes in DERA-catalyzed aldol reactions. This strategy…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9
Figure 10
Figure 11
Figure 12
Figure 13
Figure 14
Figure 15- —Universidad Nacional de Quilmes
- —Agencia Nacional de Promoción Científica y Tecnológica, ANPCYT
- —National Council of Science and Technology, CONICET
- —ERACoBiotech
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsEnzyme Catalysis and Immobilization · Enzyme Structure and Function · Microbial Natural Products and Biosynthesis
1. Introduction
Cardiovascular disease remains the leading cause of mortality worldwide, with over 20 million deaths annually [1,2]. Among the established risk factors associated with its global health impact—including low physical activity, obesity, diabetes, smoking and high blood pressure—elevated low-density lipoprotein (LDL) cholesterol represents a major determinant, implicated in approximately 3.8 million deaths [3].
Statins are usually prescribed to reduce the levels of LDL cholesterol through inhibition of the rate-limiting enzyme 3-hydroxy-3-methylglutaryl-coenzyme A (HMG-CoA) reductase involved in the conversion of HMG-CoA to mevalonate, which is an early step in the synthesis of cholesterol [4]. Since the discovery of the first cholesterol biosynthesis inhibitor, Mevastatin, in the 1970s, statin families have expanded rapidly through the development of natural, semisynthetic, and fully synthetic statins (Figure 1) [5]. Although statins are generally well-tolerated and safe, adverse reactions become more significant as the number of users increases. These often include muscle weakness, hepatic damage, renal insufficiency, and the potential to induce diabetes. For this reason, there is growing interest in finding alternatives that minimize these effects [6,7].
The aldol reaction is an important tool in biological systems and in organic chemistry to produce carbon skeletons, with aldolases being the most suitable enzymes to catalyze this reaction in an asymmetric way [8,9]. In particular, 2-deoxyribose-5-phosphate aldolase (DERA) catalyzes the stereoselective aldol addition between two aldehydes via an enamine intermediate, the typical mechanism of Class I aldolases [10]. This enzyme is strictly dependent on acetaldehyde as a nucleophile but can accept a wide range of aldehydes as electrophiles, producing 2,4,6-trideoxyhexoses (Scheme 1). This sequential aldol addition is thermodynamically controlled and stops when a stable intramolecular hemiacetal is formed [11]. The two chiral centers generated, controlled by the enzyme [12], have the appropriate stereochemistry for the cyclic hemiacetal (lactol) to be used as intermediates in the production of Atorvastatin and other related cholesterol-lowering drugs [13,14,15]. However, the application of DERA in large-scale production of these pharmaceutical compounds is limited by some drawbacks, such as low catalytic activity and low tolerance to aldehydes [16]. To overcome this disadvantage, different technologies such as directed evolution, protein engineering, computational design, and immobilization have been applied to improve DERA’s properties [17]. In particular, a number of recombinant DERA variants with increased productivity, from mesophilic [18], psychrophilic [19], and thermophilic organisms [20], have been reported.
The most efficient modification to provide acetaldehyde resistance involves the mutation of a cysteine, non-essential for catalysis (C47 in DERA from E. coli), by a non-nucleophilic amino acid such as methionine, which avoids the formation of a bridge via a Michael addition of crotonaldehyde, resulting from the condensation of two acetaldehyde molecules with the lysine K167 in the active site [16]. In this regard, EcDERA C47M showed no loss of activity after incubation in 300 mM of acetaldehyde for 16 h, in contrast to wild-type EcDERA, which was completely inactivated after 3 h [21]. Similar behavior was observed in DERAs from Lactobacillus brevis [22], Rhodococcus erythropolis strain DSM 311 [23], Thermotoga maritima [24], archaeon Pyrobaculum aerophilum [25] and Staphylococcus aureus [26], among others, after cloning, modification and recombinant expression in E. coli. Previous work in our lab involved DERA from Pectobacterium atrosepticum ATCC 33,260 (PaDERA) [27], which was genetically modified, cloned and expressed in E. coli. In comparison to the wild-type enzyme, PaDERA C49M (which corresponds to C47 in EcDERA) retained 90% of its activity after incubation for 90 min in 300 mM of acetaldehyde [28].
Different 6-substituted 2,4,6-trideoxy-D-erythro hexapyranoses (6-R-TDHP) were reported over time as precursors of the side chain of statins. (3R,5S)-6-chloro-2,4,6-trideoxyhexapyranoside (ClTDHP) was the first biocatalytically synthesized intermediate using diverse DERA variants. This approach represented a great improvement, in chemical and stereochemical purity and cheap achiral starting materials, over its chemical synthesis [29,30,31,32,33]. This biotransformation has been successfully used to prepare the statin side-chain of Rosuvastatin on a manufacturing scale [34]. Besides chlorine, acetoxy [35], azido [36] and N-protected amino [37] groups were reported as TDHP 6-substituents in order to improve productivity. Švarc et al. [34] developed an alternative strategy in which aminophenylacetamide-protected propanal is used as the starting aldehyde of the aminolactol precursor of Atorvastatin. Lactols can then be oxidized to the corresponding lactones, after which further chemical steps attach them to the heterocycle moiety to obtain the different statins [5,38,39].
The conserved 3,5-dihydroxyheptanoic acid moiety present in all statins is responsible for HMG-CoA reductase inhibition through competitive and reversible binding due to its structural similarity to its natural substrate, HMG-CoA. In addition, interactions between the enzyme and the remaining portions of the molecule contribute to binding affinity [40]. While natural statins contain a hexahydronaphthalene core, synthetic statins incorporate diverse heterocyclic structures, such as pyrrole in Atorvastatin, pyrimidine in Rosuvastatin, or quinoline in Pitavastatin [41]. Accordingly, drug development efforts aimed at improving efficacy and tolerability have primarily targeted the heterocyclic scaffold [42].
Unlike statins, eritadenine, an acyclic adenosine analog with hypocholesterolemic activity, does not affect cholesterol production but accelerates cholesterol metabolism in the liver, modifying phospholipid metabolism and promoting the removal of cholesterol from the blood through mechanisms that increase LDL cholesterol uptake [43,44]. This led us to consider the possibility of preparing statins containing a nucleoside base, such as the heterocycle. Therefore, the objective of this work was to generate the side chain precursors for new statin analogues from aldehyde derivatives of nitrogen bases, using PaDERA C49M, recently cloned and overexpressed by us, and whose synthetic activity towards the formation of TDHP has already been reported [45].
2. Materials and Methods
2.1. Chemicals and Microorganisms
Reagents and substrates were purchased from Sigma-Aldrich (St Louis, MO, USA). TLC analysis was performed using silica gel 60 F254 aluminum plates from Merck (Darmstadt, Germany). The culture media components were obtained from Anedra (Buenos Aires, Argentina), Britania (Buenos Aires, Argentina) and Sigma-Aldrich. Solvents were from Sintorgan (Buenos Aires, Argentina) and Biopack (Zárate, Argentina). Escherichia coli DH5α (Invitrogen, Thermo Fisher Scientific, Waltham, MA, USA) and E. coli BL21 (DE3) (ATCC 47092, acid phosphatase-deficient) strains were used in cloning and expression experiments.
2.2. Culture Conditions
Plasmids encoding P. atrosepticum DERA, including the mutation C49M (PaDERA C49M), as described in ref. [28], were transformed into chemically competent E. coli BL21 (DE3) for expression. Each recombinant E. coli strain was grown in LB medium supplemented with kanamycin 30 μg/μL at 37 °C and 200 rpm overnight. The pre-cultures (25 mL) were then used to inoculate a fresh LB medium containing kanamycin 30 μg/μL (250 mL) and grown at 37 °C, 200 rpm until OD_600_ reached 0.6–0.8, at which point the protein expression was induced with isopropyl-β-thiogalactoside (IPTG) at a final concentration of 0.1 mM. The cultures continued growing at 28 °C and 180 rpm overnight. Subsequently, the cells were harvested by centrifugation (5000 rpm, 10 min, 4 °C), washed with 0.1 M potassium phosphate buffer pH 7.0, recentrifuged and used as whole cell biocatalysts (2.5 × 10^9^ cells/mg wet weight) or to continue towards protein purification.
2.3. Enzyme Purification
E. coli (PaDERA C49M) cells were resuspended in 0.1 M potassium phosphate buffer, pH 7, to achieve a final concentration of 30% w/v. The suspension was sonicated at 50% amplitude for four cycles at 4 °C using the Vibra Cell disruptor (VCX130, Sonics, Newtown, CT, USA) or incubated with lysozyme (1 mg/g wet weight of cells) for 24 h at 4 °C and 200 rpm. The suspension was centrifuged at 4 °C and 9000 rpm for 15 min, and the supernatant was conserved as cell-free extract (CFE).
For the enzyme purification by batch-mode affinity chromatography, 20 mL of Ni- Sepharose TM 6 Fast Flow resin (obtained from GE Life Science, Boston, MA, USA) and 20 mL of CFE were incubated for 1 h at 4 °C. After that, the solid phase (resin with bound enzyme) was separated from the supernatant by centrifugation, and the unbound CFE was discarded. The resin was then subjected to sequential washing–centrifugation steps with imidazole solutions at increasing concentrations (5 mM–500 mM). Relevant fractions of the eluted proteins were combined, desalted and concentrated to a final volume of 1 mL by centrifuging at 5700 rpm at 4 °C in a Vivaspin 20 Centrifugal Concentrator (Sartorius, Göttingen, Alemania).
2.4. Activity Assay and Determination of Protein Concentration
The activity of PaDERA C49M was determined by the coupled assay described previously [46], measuring the oxidation of NADH. The assay mixture contained 0.1 M potassium phosphate buffer, pH 7.0, 0.2 mM NADH, 0.4 mM 2-deoxyribose-5-phosphate (DR5P), 4 μL of the mixture α-glycerophosphate dehydrogenase/triosephosphate isomerase (α-GDH-TPI) and 5 μL of purified protein. The reaction was initiated by the addition of purified protein and the subsequent decrease of NADH concentration was monitored at 340 nm. A total of 1 U of protein activity was defined as the amount of enzyme required for cleavage of 1 μmol of DR5P per minute at 28 °C. The protein concentration was determined by the Bradford assay [47]. PaDERA C49M protein concentration: 0.98 mg/mL and PaDERA C49M activity: 28.9 U/mg.
2.5. Substrate Synthesis
*N-*1-(2,2-dimethoxyethyl) thymine (4i), *N-*1-(2,2-dimethoxyethyl) cytosine (4ii), and *N-*9-(2,2-dimethoxyethyl) adenine (4iii) were prepared according to our previous report [48]. Mixtures of thymine (3i), cytosine (3ii) or adenine (3iii) (1 equiv.) and K_2_CO_3_ (2 equiv.) in DMF (10 mL) were stirred at 90 °C in the presence of 2-bromo-1,1-dimethoxy ethane (2 equiv.). After 24 h, the reactions were filtered and the solvent was removed under reduced pressure. The crude mixtures were purified by column chromatography on silica gel (230 mesh), using approximately 40% w/w of crude material relative to the stationary phase. Elution was performed using dichloromethane/methanol as the mobile phase, gradually increasing the polarity. Column progress was monitored by TLC, using dichloromethane/methanol (9:1) as the eluent. Aldehydes (N-1-(2-oxoethyl) thymine (5i), N-1-(2-oxoethyl) cytosine (5ii) or N-9-(2-oxoethyl) adenine (5iii) were obtained after hydrolysis of the corresponding acetals (4i-iii) in HCl (1 N) at 90 °C for 1 h and neutralization with NaOH (2 N). NMR and ESI-MS spectral characteristics are in accordance with the literature [49].
2.6. Whole Cell Biotransformations
2.6.1. Synthesis of 2,4,6-Trideoxy-D-Erythro-Hexapyranose (TDHP, 2a)
The reactions were performed in a 4 mL final volume, 100 mM potassium phosphate buffer, pH 7, carrying 1 M of acetaldehyde and 2.5 × 10^9^ E. coli (PaDERA C49M) cells/mg wet weight as biocatalyst. The reaction mixture was shaken at 28 °C and 200 rpm for 24 h and then centrifuged for 10 min at 5000 rpm. The supernatant was extracted with 2 volumes of ethyl acetate and purified by silica gel column chromatography using ethyl acetate as eluent. The NMR data are in accordance with the literature [45].
2.6.2. Synthesis of 6-Substituted-2,4,6-Trideoxy-D-Erythro-Hexapyranoses (2b-e)
The reactions were performed in a 4 mL final volume, 100 mM potassium phosphate buffer, pH 7, carrying 100 mM of chloroacetaldehyde (1b), propanaldehyde (1c), phenylacetaldehyde (1d) or benzaldehyde (1e) as acceptors, acetaldehyde 200 mM as donor, and 2.5 × 10^9^ E. coli (PaDERA C49M) cells/mg wet weight as biocatalyst. The reaction mixtures were shaken at 28 °C and 200 rpm for 24 h and then centrifuged for 10 min at 5000 rpm. The supernatants were extracted with 2 volumes of ethyl acetate, purified by silica gel column chromatography using ethyl acetate as eluent and analyzed by GC.
2.7. Free-Enzyme Biotransformation
Synthesis of 6-Substituted Lactols (6i-iii)
The reactions were carried out in a final volume of 1 mL of 0.1 M potassium phosphate buffer (pH 7), containing 100 mM (5i, 5ii, 5iii) as the acceptors, 100 mM acetaldehyde as the donor, and 1 mg/mL PaDERA C49M. The reaction mixtures were shaken at 28 °C and 200 rpm for 24 h. Subsequently, the reactions were quenched with acetonitrile and centrifuged at 11,000 rpm for 3 min. The resulting supernatants were extracted three times with one volume of ethyl acetate. Compounds 6i-iii were analyzed by HPLC-UV/MS. Compound 6i was isolated from an 8 mL reaction mixture through extensive extraction with ethyl acetate and subsequently purified by silica gel (230 mesh) column chromatography employing dichloromethane/methanol (9:1) as the mobile phase.
2.8. Analytical Methods
Instrumental
The progress of the reactions and the conversion of substrates 5i–iii were monitored by analytical HPLC–UV using a Gilson chromatograph (321 pump, 156 UV/VIS detector, and 234 series autoinjector; Middleton, WI, USA). Separations were performed on a SiliaChrom SB RP18 column (150 × 4.6 mm, 5 µm; Quebec City, QC, Canada). Water (A) and acetonitrile (B) were used as mobile phases at a flow rate of 0.9 mL·min^−1^. The elution program was as follows: 4 min at 99% A and 1% B; a linear gradient over 3 min to 88% A and 12% B; 5 min at 88% A and 12% B; followed by a linear gradient over 3 min back to 99% A and 1% B.
Gas chromatography (GC) analyses of the biotransformations of compounds 1a–e were performed using a Thermo Scientific Trace 1300 gas chromatograph (Thermo Fisher Scientific) equipped with a TR-5 capillary column (30 m × 0.25 mm × 0.25 μm), using helium as carrier gas (1 mL/min). The injector temperature was set at 200 °C, and the oven temperature program was 50 °C for 5 min, ramped to 250 °C at 10 °C/min, and held at the final temperature for 5 min.
Structural characterization of all isolated compounds was carried out by ^1^H and ^13^C NMR spectroscopy on Bruker Avance II 500 and Avance III 600 spectrometers (Madison, WI, USA), operating at 500/125 MHz and 600 MHz, respectively, using CDCl_3_ as solvent.
High-resolution mass spectra of 6-substituted lactols (6i-iii) were recorded using an ACQUITY UPLC system (Waters, Milford, MA, USA) coupled to an Orbitrap mass spectrometer (Q-Exactive Focus, Thermo Fisher Scientific,). Chromatographic separation was achieved on a reverse-phase ACQUITY UPLC BEH C18 column (1.0 × 100 mm, 1.7 μm; Waters, UK) at room temperature. The mobile phases consisted of water containing 0.1% formic acid (A) and acetonitrile containing 0.1% formic acid (B), using a flow rate of 50 μL/min. The gradient started at 5% B for 15 min, increased to 75% B over 22 min, and then returned to initial conditions for re-equilibration. The mass spectrometer was operated in positive ESI mode, alternating between full scan (50–750 m/z, resolution 70 K) and PRM mode. Data were analyzed using XCalibur 4.1 software.
Enzymatic activity and other spectrophotometric assays were performed using a Shimadzu UV-160A UV–visible spectrophotometer (Shimadzu, Kyoto, Japan).
2.9. 1H and 13C NMR Product Data
All chromatographic and spectral data are available in the Supplementary Materials (SM) file.
2,4,6-trideoxy-D-erythro-hexapyranose (2a) [45]
^1^H NMR (600 MHz, CDCl_3_) δ (ppm) 5.34 (t, J = 4.7 Hz, 1H, α, H-1), 5.22–5.12 (m, 1H, β, H-1), 4.44 (dtd, J = 12.4, 6.3, 2.4 Hz, 1H, α/β, H-5), 4.35 (q, J = 3.1 Hz, 1H, β, H-5), 4.24 (dt, J = 6.4, 3.1 Hz, 1H, α, H-3), 4.10 (qd, J = 5.7, 5.2, 2.0 Hz, 1H, β, H-3), 4.07 (d, J = 6.5 Hz, 1H, α, H-4a), 3.17 (d, J = 6.1 Hz, 1H, β, H-4a), 3.01 (d, J = 6.5 Hz, 1H, α, H-4b), 2.05–1.95 (m, 1H, α/β, H-4b), 1.88–1.76 (m, 2H, α/β, H’s-2a), 1.56 (ddd, J = 14.2, 11.7, 2.7 Hz, 1H, α, H-2b), 1.48 (ddd, J = 14.2, 11.5, 2.9 Hz, 1H, β, H-2b), 1.26 (d, J = 6.3 Hz, 3H, β, H’s-6), 1.24 (d, J = 6.3 Hz, 3H, α, H’s-6).
^13^C NMR (126 MHz, CDCl_3_) δ (ppm): 94.3 (CHOH, C-1), 93.0 (CHOH, C-1′), 68.3 (CH, C-5), 66.8 (CH, C-5′), 65.1 (CHOH, C-3), 64.1 (CHOH, C-3′), 42.8 (CH_2_, C-2), 42.3 (CH_2_, C-2′), 39.9 (CH_2_, C-4), 39.1 (CH_2_, C-4′), 21.4 (CH_3_, C-6), 21.2 (CH_3_, C-6′).
6-chloro-2,4,6-trideoxy-D-erythro-hexapyranose (2b) [15]
^1^H RMN (500 MHz, CDCl_3_): δ (ppm) 5.51 (d, J = 3.5 Hz, 1H, β, H-1), 5.41 (d, J = 3.6 Hz, 1H, α, H-1), 4.53 (dd, J = 9.0, 4.2 Hz, 1H, H-5), 4.32 (m, 1H, H-5’), 4.28–4.18 (m, 1H, H-3’), 3.91 (dq, J = 11.7, 6.2 Hz, 1H, H-3), 3.63–3.55 (m, 4H, H’s-6), 2.22–1.74 (H-4, H-4’, H-2b, H-2’b), 1.55 (td, J = 12.0, 3.6 Hz, 1H, H-2a), 1.47–1.28 (m, 1H, H-2’a).
^13^C NMR (126 MHz, CDCl_3_) δ (ppm): 93.1 (CHOH, C-1), 92.9 (CHOH, C-1′), 68.0 (CH, C-5), 64.6 (CH, C5′), 63.1 (CHOH, C-3), 63.1 (CHOH, C-3′), 47.7 (CH_2_, C-6), 47.2 (CH_2_, C-6′), 39.0 (CH_2_, C-2), 38.3 (CH_2_, C-2′), 35.4 (CH_2_, C-4), 34.8 (CH_2_, C-4′).
6-thyminyl-2,4,6-trideoxy-D-erythro-hexapyranose (6i)
^1^H NMR (500 MHz, DMSO-d6): δ (ppm) 11.25 (d, J = 12.5 Hz, 1H, NH), 7.44 (s, 1H, H-7), 4.85 (d, J = 9.1 Hz, 1H, H-1), 4.08 (s, 1H, H-3), 4.03 (d, J = 8.6 Hz, 1H, H-5), 3.74 (d, J = 3.5 Hz, 1H, H-6a), 3.58 (d, J = 8.4 Hz, 1H, H-6b), 1.75 (s, 3H, H-9), 1.68 (d, J = 14.0 Hz, 1H, H-2b), 1.47 (d, J = 12.7 Hz, 1H, H-4a), 1.34 (t, J = 11.6 Hz, 1H, H-2a), 1.26 (d, J = 11.6 Hz, 1H, H-4b).
^13^C NMR (126 MHz, DMSO-d6): δ (ppm): 164.8 (CO, C-11), 151.4 (CO, C-10), 143.0 (CH, C-7), 108.1 (CH, C-8), 92.1 (CHOH, C-1), 68.3 (CH, C-5), 63.7 (CHOH, C-3), 52.15 (CH_2_, C-6), 40.2 (CH_2_, C-2), 35.4 (CH_2_, C-4), 12.4 (CH_3_, C-9).
m/z found: 257.1124, estimated [C_11_H_17_N_2_O_5_]^+^: 257.2634.
3. Results and Discussion
The chemical synthesis of statins presents several drawbacks, including modest overall yields, the use of large amounts of hazardous reagents, and the generation of considerable chemical waste [50]. Furthermore, the stereoselective construction of the chiral centers in the side chain, the main pharmacophoric motif, is particularly challenging. In contrast, the use of biocatalysts offers advantages such as high regio and stereo selectivity, use of non-polluting raw materials and solvents, and mild reaction conditions, which help to reduce costs and improve sustainability [51]. Among the different biocatalysts used to synthesize the conserved chiral side chain of statins [52], DERA is one of the most studied because it has the ability to simultaneously generate both stereocenters from simple aldehydes [5].
3.1. Expression and Purification of PaDERA C-His AA C49M
Pectobacterium atrosepticum ATCC 33,260 DERA (PaDERA) [27] was cloned and expressed as previously reported [28]. The variant PaDERA C49M, including a C-terminal 6×His tag, a 5-amino-acid spacer, and the C49M mutation, was selected due to its high activity in the presence of acetaldehyde [28,45]. To enable the use of free PaDERA C49M as a biocatalyst, the enzyme was purified from a crude cell-free extract (CFE) obtained by enzymatic or mechanical lysis. Both methods gave comparable enzyme recovery and catalytic activity; however, only enzymatic lysis preserved the expected stereoselectivity. Mechanical disruption led to a marked loss of stereocontrol, likely due to structural perturbations, as reported for other aldolases [53].
Purification was performed by batch-mode affinity chromatography, since column-based purification caused complete loss of enzymatic activity. This behavior is likely linked to the high flexibility of the C-terminal region [54], which may be incompatible with prolonged Ni–NTA binding conditions [55]. Batch-mode purification reduces resin contact time, preserving enzymatic activity and also providing sufficient material for subsequent reactions. After the purification, the enzyme displayed an initial specific activity of 28.9 U/mg; however, activity declined to 17.19 U/mg after 24 h at 4 °C and was completely lost after 48 h. Limited stability of isolated DERAs has been widely reported, and although approaches such as protein engineering and enzyme immobilization have been investigated, improving their stability continues to be challenging [32,56].
3.2. PaDERA C-His AA C49M and Chiral Lactols
The use of whole-cell biocatalysts represents an attractive alternative for industrial processes, not only because of their low production cost, but also because they help mitigate stability issues and circumvent the need for laborious enzyme purification. E. coli (PaDERA C49M*)* was employed as a whole-cell biocatalyst to prepare 2,4,6-trideoxy-D-erythro-hexapyranose (TDHP, 2a) in 99% yield by tandem aldol addition of acetaldehyde even at concentration of 500 mM [45].
To validate the potential of the new mutant as a versatile whole-cell biocatalyst, we performed a DERA-catalyzed aldol addition using acetaldehyde as donor and different acceptor aldehydes, as shown in Table 1. Chloroacetaldehyde (1b) was the first substrate studied and used in DERA-catalyzed reactions. Moreover, it is the most widely industrially employed due to the versatility of chlorine to participate in subsequent reactions to introduce the heterocycle moiety to statins preparation [52,57]. Additionally, propanaldehyde (1c), phenylacetaldehyde (1d) and benzaldehyde (1e) allowed us to explore the acceptance of non-phosphorylated electrophilic substrates by DERA, non-polar, bulky and aromatic respectively. None of the substrates was as good as 1a for PaDERA C49M. As usual, aromatic aldehydes were not substrates for the enzyme [54]. Recently, Rizzo et al. [53], using metagenomic technology for enzyme discovery, reported for the first time a DERA capable of accepting furfural and benzaldehyde with higher activity and stereoselectivity than those achieved with other well-known DERAs with non-natural substrates.
All biotransformations were carried out using experimental conditions previously optimized for the synthesis of 2a [45]. After biotransformations, lactols 2b-e were extracted with ethyl acetate and further purified by silica column chromatography. Due to their instability and the presence of byproducts, the purification was quite problematic. Indeed, given the difficulty of purifying these compounds, most reports that employ them as intermediates utilize the crude product [16]. All compounds were quantified by GC, observing the presence of TDHP as a secondary product, a pattern that was maintained in all aldol biotransformations. Only 2b was isolated and analyzed by ^1^H and ^13^C NMR, which were consistent with previous reports [33,45]. The presence of both anomers was corroborated by the duplication of signals in both spectra and, in particular, by the signals corresponding to the hydrogens bonded to the anomeric carbons of the two anomers at 5.41 ppm and 5.51 ppm of the ^1^H NMR spectrum with a 7:3 (α/β) ratio (Figures S6 and S7).
3.3. Novel Statin Precursors
As previously mentioned, traditional statin synthesis involves the construction of the functionalized chiral side-chain with a reactive group (e.g., chloro, cyano, or acetoxy), which then serves as a building block for introducing the corresponding heterocycle [54]. For over two decades, our laboratory has focused on the synthesis of nucleoside analogues using biocatalytic approaches. In recent years we have explored the use of different aldolases as biocatalysts for the synthesis of acyclic nucleoside analogues employing aldehyde-functionalized nucleobases as key precursors [48,58,59]. In line with these efforts, we implemented a chemoenzymatic strategy aimed at generating novel nucleobase-containing statin candidates. This approach comprises the synthesis of suitably functionalized aldehyde nucleobases, their conversion into the corresponding substituted lactols through PaDERA C49M catalysis, and a final oxidation step affording the target lactone (Scheme 2).
Compounds 5i–iii were obtained by N-alkylation of the corresponding bases 3i–iii, followed by acid hydrolysis of the acetal intermediates 4i–iii, as previously reported [49]. The resulting aldehydes were then added to a phosphate-buffered solution containing E. coli (PaDERA C49M) whole cells and acetaldehyde, which serves as the specific donor. Upon substrate consumption, analysis of the crude reaction mixture and the extracted supernatant revealed several low-abundance, hard-to-purify byproducts, most likely originating from biocatalyst-mediated substrate decomposition.
In light of these results, the crude cell-free extract (CFE) obtained from disrupted E. coli (PaDERA C49M) whole cells was tested as a biocatalyst. This strategy did not lead to improved outcomes, as the small amount of the product formed exhibited degradation after 3 h of reaction. However, the appearance of the product allowed us to determine that aldehydes 5i-iii were substrates of DERA and that, probably due to their structures being similar to natural nucleosides, they could also be substrates for other enzymes present in the CFE. Therefore, the use of free purified PaDERA C49M as a biocatalyst for this biotransformation was necessary.
The enzyme was purified as described above and used immediately to minimize storage instability. The amount of enzyme used (1 mg/mL) was equivalent to the biocatalyst loading employed in the other formats. In order to reduce the formation of TDHP as a by-product, the 1a:5i-iii ratio was adjusted to 1:1. However, the strong preference of 1a as both donor and acceptor substrate made this unavoidable, and TDHP formation was still observed. Additionally, the presence of TDHP further increased the intrinsic difficulty of product purification, as is commonly observed for nucleobase derivatives.
The biotransformation was performed in a buffered solution containing substrates and the enzyme. Reaction progress was monitored by TLC and HPLC, and upon completion, the formation of products 6i-iii was confirmed by HPLC–MS (Figures S20–S22).
As discussed above, the purification of lactol products from reaction mixtures is challenging due to the instability of the hemiacetal moiety, the coexistence of anomeric forms, and the poor solubility of nucleoside bases in organic solvents. In addition, when DERA is employed to catalyze sequential aldol reactions with non-natural substrates, the catalytic efficiency is quite low, which restricts its practical applicability. Consequently, scaling up the reaction was required to obtain sufficient amounts of material for successful purification [28].
Following scale-up, the lactol product 6i was isolated from an 8 mL reaction mixture through an extensive work-up and chromatographic purification. Due to the high polarity of the nucleoside scaffold and the lability of the hemiacetal function, the crude reaction mixture was first subjected to multiple extractions with ethyl acetate. The combined organic phases were dried, filtered, and concentrated under reduced pressure. The resulting residue was purified by column chromatography on silica gel (230 mesh), using approximately 40% w/w of crude material relative to the stationary phase. Elution was performed using dichloromethane/methanol as the mobile phase, gradually increasing the polarity until a final ratio of 90:10 was reached. Column progress was monitored by TLC, using dichloromethane/methanol (9:1) as the eluent, where the desired lactol exhibited an Rf value of 0.20. The product was obtained as a colorless, viscous liquid in 28% yield.
^1^H and ^13^C NMR spectroscopic analyses confirmed the successful isolation of the lactol product and revealed the presence of a mixture of anomeric forms resulting from cyclization of the thymine-substituted lactol (Figures S8–S11). The observed α/β anomeric ratio is consistent with that previously reported for related substituted lactols bearing different R groups [60]. As shown in Figure 2, the ^1^H NMR spectrum of an anomerically enriched mixture clearly displays the signal corresponding to the major anomer, appearing as a doublet with a coupling constant of 9.1 Hz, which is characteristic of the α-anomer. In the expanded region of this part of the spectrum, a significantly weaker signal attributable to the β-anomer can also be observed. Minor impurities attributable to residual TDHP were also detected in the crude material but were effectively removed during chromatographic purification.
The scale up and purification of 6ii and 6iii still remain a challenge, mainly due to the low conversions observed for the corresponding nucleobase-derived aldehydes, which reached 5% for the adenine derivative (6ii) and below 2% for the cytosine analogue (6iii), as determined by HPLC analysis.
4. Conclusions
In this work, a novel chemoenzymatic strategy was developed for the synthesis of nucleobase-substituted lactol products as potential precursors of new statin analogues. The C49M variant of 2-deoxyribose-5-phosphate aldolase from Pectobacterium atrosepticum (PaDERA C49M) was demonstrated to catalyze aldol reactions involving aldehyde-functionalized nucleobases as non-natural electrophilic substrates, thereby expanding the substrate scope of DERA beyond conventional electrophiles and providing access to previously unexplored statin side-chain architectures.
Notably, the methodology presented here introduces a distinct approach to statin synthesis, as the lateral chain is constructed directly from substrates already bearing the heterocyclic moiety. By incorporating the nucleobase at an early stage, this strategy avoids post-aldol coupling steps commonly required in traditional synthetic routes, which often result in longer, less efficient, and lower-yielding synthetic sequences.
While significant progress has been achieved in improving DERA stability, particularly through the use of enzymes from thermophilic organisms or via random mutagenesis, several challenges must still be addressed before DERA can be considered a robust and broadly applicable industrial biocatalyst. These include limitations in catalytic efficiency with non-natural substrates, product stability, and downstream processing.
Looking forward, constant efforts focus on enzyme optimization and reaction engineering to improve productivity and selectivity. In parallel, the lactol oxidation to lactone and the biological evaluation of the newly synthesized nucleobase-containing statin analogues are currently in progress, with the aim of assessing their pharmacological potential and identifying promising candidates for further development.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Laranjo L. Lanas F. Sun M.C. Chen D.A. Hynes L. Imran T.F. Kazi D.S. Kengne A.P. Komiyama M. Kuwabara M. World Heart Federation Roadmap for Secondary Prevention of Cardiovascular Disease: 2023 Update Glob. Heart 202419810.5334/gh.127838273995 PMC 10809857 · doi ↗ · pubmed ↗
- 2Di Cesare M. Mc Ghie D.V. Perel P. Mwangi J. Taylor S. Pervan B. Kabudula C. Narula J. Bixby H. Pineiro D. The Heart of the World Glob. Heart 2024191110.5334/gh.128838273998 PMC 10809869 · doi ↗ · pubmed ↗
- 3Münzel T. Sørensen M. Lelieveld J. Landrigan P.J. Kuntic M. Nieuwenhuijsen M. Miller M.R. Schneider A. Daiber A. A Comprehensive Review/Expert Statement on Environmental Risk Factors of Cardiovascular Disease Cardiovasc. Res.20251211653167810.1093/cvr/cvaf 11940795898 PMC 12477681 · doi ↗ · pubmed ↗
- 4Virani S.S. Ramsey D.J. Westerman D. Kuebeler M.K. Chen L. Akeroyd J.M. Gobbel G.T. Ballantyne C.M. Petersen L.A. Turchin A. Cluster Randomized Trial of a Personalized Clinical Decision Support Intervention to Improve Statin Prescribing in Patients with Atherosclerotic Cardiovascular Disease Circulation 20231471411141310.1161/CIRCULATIONAHA.123.06422636871214 · doi ↗ · pubmed ↗
- 5Tang X.L. Yu J.W. Geng Y.H. Wang J.R. Zheng R.C. Zheng Y.G. From Discovery to Mass Production: A Perspective on Bio-Manufacturing Exemplified by the Development of Statins Engineering 20232413815010.1016/j.eng.2022.04.030 · doi ↗
- 6Zeng W. Deng H. Luo Y. Zhong S. Huang M. Tomlinson B. Advances in Statin Adverse Reactions and the Potential Mechanisms: A Systematic Review J. Adv. Res.20257678179710.1016/j.jare.2024.12.02039681285 PMC 12793736 · doi ↗ · pubmed ↗
- 7Ward N.C. Watts G.F. Eckel R.H. Statin Toxicity: Mechanistic Insights and Clinical Implications Circ. Res.201912432835010.1161/CIRCRESAHA.118.31278231170055 · doi ↗ · pubmed ↗
- 8Kim T. Stogios P.J. Khusnutdinova A.N. Nemr K. Skarina T. Flick R. Joo J.C. Mahadevan R. Savchenko A. Yakunin A.F. Rational Engineering of 2-Deoxyribose-5-Phosphate Aldolases for the Biosynthesis of (R)-1,3-Butanediol J. Biol. Chem.202029559760910.1074/jbc.RA 119.01136331806708 PMC 6956516 · doi ↗ · pubmed ↗
