Engineering Fluoroacetate Dehalogenase by Growth‐Based Selections on Non‐Natural Organofluorides
Suzanne C. Jansen, Pauline van Beers, Clemens Mayer

TL;DR
Scientists engineered an enzyme to break down harmful fluorinated pollutants by selecting bacteria that can grow using these pollutants as a carbon source.
Contribution
A high-throughput growth-based selection system was developed to engineer fluoroacetate dehalogenases for non-natural organofluorides.
Findings
FAcD variants with improved activity and altered substrate specificity were isolated using non-natural organofluorides.
A previously overlooked inhibition pathway was identified for gem-difluoride compounds.
The engineered FAcDs can sustainably degrade contaminating organofluorides.
Abstract
The widespread use of organofluorides in modern society has inadvertently led to the bioaccumulation of harmful pollutants, most prominently per‐ and polyfluorinated alkyl substances (PFAS). In principle, tailored biocatalysts able to cleave C─F bonds represent an attractive strategy to combat this (emerging) environmental crisis. However, Nature is largely impartial to C─F bonds, with fluoroacetate dehalogenases (FAcDs) standing out by catalyzing the hydrolysis of single C─F bonds in fluoroacetate at high turnover rates. To harness its catalytic prowess for non‐natural organofluorides, we designed and applied a robust growth‐based selection strategy for large‐scale FAcD engineering. Specifically, we demonstrate that FAcD‐catalyzed C─F bond cleavage of (natural and) synthetic organofluorides generates metabolizable carbon sources for bacteria, enabling in vivo enrichment of active FAcD…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
FIGURE 1
FIGURE 2
FIGURE 3
FIGURE 4- —NWO10.13039/501100003246
- —ERC10.13039/501100000781
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsPer- and polyfluoroalkyl substances research · Microbial bioremediation and biosurfactants · Pesticide and Herbicide Environmental Studies
Introduction
1
Organofluorides have become indispensable in modern society, as they have found widespread applications as pharmaceuticals, agrochemicals, oil/water‐repelling materials, or surfactants [1, 2, 3, 4]. The unique physiochemical properties and inherent stability of C─F bonds make them ideal to tailor the properties of molecules and materials alike [5, 6]. However, the stability of C─F bonds also has unintended consequences, such as organofluorides persisting and accumulating in the environment [7, 8]. Particularly, the contamination of soil and water by per‐ and polyfluorinated alkyl substances (PFAS) is detrimental to ecosystems and human health [9, 10]. Consequently, sustainable and efficient strategies to degrade PFAS and other organofluoride species are in urgent demand.
Bioremediation with the help of specialized biocatalysts and/or bacteria is an attractive strategy to seek and destroy organofluorides. However, Nature is largely agnostic to C─F bonds. Indeed, only a single natural enzyme class is known to selectively install a C─F bond [11] and only about a dozen natural products have been identified to contain fluorine atoms [12]. Among these compounds, fluoroacetate (FA) is particularly notable, as it is produced by ∼40 plant species as a defense mechanism against herbivores [13, 14]. Specifically, FA functions as a metabolic poison that can enter the citric acid cycle as fluoroacetyl‐CoA, where it is subsequently converted to 2‐fluorocitrate (Figure 1A) [15]. The latter compound is a potent aconitase inhibitor that leads to the accumulation of citrate in the blood and, ultimately, shuts down the citric acid cycle.
The role and degradation of fluoroacetate in nature. (A) The toxicity of fluoroacetate (FA) derives from its ability to enter the citric acid cycle as fluoroacetyl‐CoA. Following condensation with oxaloacetate, a competitive aconitase inhibitor (2‐fluorocitrate) is produced that stalls the citric acid cycle, resulting in citrate accumulation and ATP depletion. (B) Fluoroacetate dehalogenases degrade FA by hydrolyzing the C─F bond to yield glycolate, a proton, and fluoride. (C) The FAcD‐catalyzed defluorination is generally accepted to proceed through an SN2 mechanism, in which Asp105 performs a nucleophilic attack. Concomitant C─F cleavage is facilitated through a specialized fluoride‐binding pocket. The resulting acyl‐enzyme intermediate is cleaved by water (activated by His272) to release glycolate with a subsequent proton transfer regenerating the nucleophile.
In response to the (general) toxicity of FA, soil bacteria and those residing in cattle rumen have evolved fluoroacetate dehalogenases (FAcDs, EC 3.8.1.3), which neutralize this metabolic poison by promoting its hydrolysis to glycolate (Figure 1B) [16, 17, 18, 19]. To do so, FAcDs employ an Asp‐His‐Asp catalytic triad that facilitates C─F bond cleavage in a two‐step mechanism, which is supported by both experimental and computational efforts (Figure 1C) [20, 21, 22, 23, 24, 25]. Aided by a specific fluoride binding pocket, a catalytic aspartate first substitutes the fluoride to yield an acyl‐enzyme intermediate, which, with the help of a histidine, is subsequently hydrolyzed to release glycolate [20, 22, 26, 27].
Somewhat surprisingly, although characterized FAcDs efficiently hydrolyze FA with rates >10 s^−1^, examples to engineer defluorinases to accept alternative non‐natural fluorinated substrates or improve their activity are scarce. Indeed, we could only identify two prior reports, one in which amino acids substitutions were introduced in a FAcD to accept bulkier substrates [28], and another where domain‐swapping unlocked weak defluorination in a chimeric haloacid dehalogenase (HAD) [29]. We ascribe the scarcity of FAcD‐engineering studies to the difficulties associated with assaying enzymatic activity –FAcDs neither act on UV/Vis‐active substrates/products nor feature an assayable cofactor. While proton release can be coupled to a pH indicator as readout [19, 30, 31], and fluoride release can be detected by various metal‐dyes [32, 33], established methods are not readily adaptable to assay thousands of enzyme variants in high‐throughput fashion (for a detailed comparison of available methods see Supporting Table S1).
To overcome this challenge, we present a technically simple growth‐based selection system to engineer FAcDs. Specifically, we demonstrate that (non‐)natural organofluorides can be converted to carbon sources by active defluorinases, enabling us to faithfully elicit improved FAcD variants from diverse populations by serial passaging. Notably, kinetic characterization of selected variants revealed FAcDs with increased activity and/or altered substrate selectivity. In these efforts, we also identified a previously unknown inhibition pathway that prevents the conversion of gem‐difluoride compounds. By having established a straightforward means to enable the large‐scale engineering of FAcDs, we anticipate that our approach will prove useful for tailoring the activities of these defluorinases toward emerging and harmful organofluorides that persist in our environment.
Results and Discussion
2
Evaluating the Substrate Scope of FAcD‐H1
2.1
From the set of previously characterized defluorinases, we chose FAcD‐H1 (= H1) from Delftia acidovorans strain B (formerly Moraxella sp. strain B) for our studies [17, 22]. To evaluate its substrate scope, we produced and purified H1 and incubated it with FA as well as non‐natural substrates featuring an additional methyl group and/or a second C─F bond. To assess defluorination activity, we employed ^19^F‐NMR spectroscopy, which enables us to monitor both fluoride (F^−^) release and substrate consumption in complex mixtures (see Supporting Information for details).
Consistent with prior reports [22], H1 (0.5 µM) rapidly converted FA (10 mM) with an apparent initial rate (v 0,app) of 32 ± 4.5 s^−1^ (Figure 2A and Supporting Table S2). The more sterically hindered substrate 2‐fluoropropionate (FP, racemic) was also readily accepted, albeit with a 17‐fold lower initial rate of 1.9 ± 0.4 s^−1^. Conversely, the defluorination of gem‐difluorinated substrates proved significantly more challenging (Figure 2A and Supporting Table S2): the activity of H1 for 2,2‐difluoroacetate (F_2_A) was ∼9,000‐fold lower than for FA, while for 2,2‐difluoropropionate (F_2_P) no significant fluoride formation could be observed, even after prolonged reaction times (6 days) and in presence of higher enzyme concentrations (20 µM).
Design and validation of a growth‐based selection strategy for FAcD engineering. (A) Initial rates of H1 on four organofluorides. Rates were determined by 19F‐NMR and represent the average of 2–3 measurements from biological duplicates. (B) In growth‐based selections, the enzymatic defluorination of either substrate by H1 yields a suitable carbon source that promotes E. coli growth in minimal media. (C) FP or F2P can enter the citric acid cycle once successfully converted to lactate or pyruvate, respectively. LOX = lactate oxidase. (D) FA or F2A can enter the citric acid cycle via the glyoxylate shunt once successfully converted to glycolate or glyoxylate, respectively. GOX = glycolate oxidase. (E, F) Growth curves (in triplicate) of H1‐ or H1‐H272A‐producing E. coli in presence of 100 mM FP (E) or 100 mM FA (F) as sole carbon source. Activity of H1 enables growth under these conditions, whereas a host producing inactive enzyme variant H1‐H272A cannot proliferate. (G, H) 19F‐NMR spectra of culture supernatants from E‐F, showing the successful conversion of FP (G) or FA (H) to F− for cells producing H1, but no F− release for cells producing H1‐H272A.
Design and Validation of Growth‐Based Selections
2.2
While ^19^F‐NMR provides a convenient means to characterize purified FAcD variants, it lacks the necessary throughput for large‐scale directed evolution campaigns. To allow for the engineering of H1 toward non‐natural organofluorides, we next explored the feasibility of in vivo growth‐based selection strategies, which are easy‐to‐use, low‐cost, and effective to assay millions of enzyme variants in parallel [34, 35]. To create the necessary link between enzymatic activity and host survival, we designed and evaluated simple growth‐based selections, in which the conversion of fluorinated substrates provides the sole carbon source for growth of the host organism, Escherichia coli (Figure 2B). These efforts are inspired by previous reports that have demonstrated that organisms with functional FAcDs can employ FA or FP as sole carbon source [18, 36, 37].
In principle, all four substrates (FA, FP, F_2_A, and F_2_P) yield potential carbon sources upon successful defluorination. Specifically, defluorination of FP and F_2_P by H1 yields lactate and pyruvate, respectively, which are C3 carbon sources that can directly fuel the citric acid cycle (Figure 2C). Conversely, FA and F_2_A give glycolate and glyoxylate, respectively, which are C2 carbon sources that can be metabolized via the glyoxylate shunt (Figure 2D). Additionally, while F_2_A, FP, and F_2_P at concentrations of <100 mM are innocuous compounds for E. coli, FA is inherently toxic and thus adds an additional selective pressure. To validate whether this approach can differentiate between active and inactive defluorinases, we constructed H1‐H272A, in which the essential histidine from the catalytic triad is substituted for an alanine. Using an inactive H1 variant as a negative control is ideal for growth‐based selections, as it mimics the burden bestowed on the host cell upon induction, which also affects fitness. Following production and purification of H1‐H272A, we confirmed that it is produced at levels comparable to the wildtype and it lacks any appreciable activity for any of the four substrates (Supporting Table S2).
Next, we monitored bacterial growth in a defined minimal medium at 30^0^C under varying conditions by measuring the optical density at 600 nm (OD_600_) in a plate reader (see Supporting Information for details). As positive controls, we provided E. coli expressing H1 with 100 mM glycolate, 50 mM lactate, or 50 mM pyruvate as sole carbon sources, which gave a robust growth phenotype for the C3 carbon sources pyruvate and lactate, and a more moderate growth phenotype for the C2 carbon source glycolate (Supporting Figure S1A). Gratifyingly, when substituting lactate for 100 mM FP, cultures producing H1 could proliferate over a period of 48 h (OD_600_ from ≈ 0.5–1), whereas cultures featuring H1‐H272A failed to grow under otherwise identical conditions (Figure 2E). When supplementing minimal media with 100 mM FA as a sole carbon source, H1 producers were again the only ones that could proliferate, but displayed extended lag times (∼36 h) and reached lower cell densities (end OD_600_ ≈ 0.3–0.4, Figure 2F). Together with the subsequent stationary phase, this less robust growth phenotype likely results from both the inherent toxicity of FA and glycolate being a relatively poor carbon source [38]. Lastly, we also attempted to grow E. coli expressing H1‐variants on F_2_A and F_2_P as sole carbon sources. However, these attempts proved fruitless, consistent with the low activities H1 displays for these substrates (Supporting Figure S1B,C).
To further attest that the observed growth phenotypes correlated with enzymatic activity, we subjected cells’ supernatants to ^19^F‐NMR analyses. H1‐producing cells displayed significant conversion of racemic FP (51%, consistent with the high enantioselectivity of FAcDs) and substantial fluoride formation (74%) when grown in FA (Figure 2G,H). Conversely, none of the substrates were converted by bacteria producing H1‐H272A under otherwise identical conditions (Figure 2G,H). For bacteria that failed to grow in the presence of F_2_A or F_2_P as sole carbon source, ^19^F‐NMR did not detect any fluoride release over a period of 72 h (Figure S1).
Combined, these experiments demonstrate that FAcD activity can be linked to host fitness by employing either FA or FP as the sole carbon source. While both approaches are suitable for growth‐based selections, we predominantly employed FP in our engineering efforts described below, as: (1) the growth phenotype provided by the carbon source (lactate) is more pronounced, and (2) engineering FAcD‐H1 for non‐natural and more sterically demanding organofluorides aligns with our goal to, ultimately, degrade persistent pollutants.
Design and Generation of H1‐Libraries
2.3
To apply our growth‐based selections, we constructed H1 libraries by randomizing residues lining the active site of this enzyme. Toward this end, we first constructed an AlphaFold2 [39, 40] model of H1 and then submitted the model to the online webserver HotSpot Wizard [41], which identified Gln245 and Ser147 as targets for randomization (Figure 3A). To probe potential epistatic interactions, we included Met246, whose side chain points toward the active site. Finally, we also chose Trp180, as substitution for smaller residues at this position facilitated turnover of sterically‐demanding α‐fluorocarboxylic acids in a homologous FAcD (RPA1163 from Rhodopseudomonas palustris CGA009) [28].
Selecting better FAcDs. (A) Close‐up of the active site of H1, showing the catalytic triad in purple (Asp105 and His272), the fluoride binding pocket in green (His150, Trp151, and Tyr213), and residues targeted for site‐directed mutagenesis in orange (Ser147, Trp180, Gln245, and Met246). (B) OD600 measurements over time of two consecutive serial passages (pink and purple) for all populations using FP as sole carbon source. Controls (H1 and H1‐H272A) were measured in parallel with Pop‐D1, and are shown in blue and grey, respectively. The same control traces are included in the other populations with lower opacity for comparison. Note that the lag period decreases from the initial to the second passage. (C, D) Treemap charts depicting the enrichment of amino acids across randomized positions in populations after two passages on FP as sole carbon source. On the left, a legend for the color code of the properties of amino acid side chains. (E) OD600 measurements over time showing growth of Pop‐D1 (orange) or H1 (blue) with FA as sole carbon source. (F) Treemap chart depicting the enrichment of amino acids across randomized positions for survivors of Pop‐T1 after prolonged incubation with F2P as sole carbon source.
Based on these considerations, we constructed a total of four libraries using degenerate NNK codons (see Supporting Information for details): three libraries featuring two randomized positions giving Lib‐D1 (Trp180/Gln245), Lib‐D2 (Ser147/Gln245), and Lib‐D3 (Gln245/Met246); and one library in which three positions were randomized, Lib‐T1 (Ser147/Gln245/Met246). Following transformation into a cloning E. coli strain, the number of transformants exceeded the theoretical genetic diversity of each library by a factor of 3.9 – 20 (see Supporting Table S3). Next, we transformed each library into the appropriate expression strain and applied whole‐plasmid sequencing (WPS) of pooled plasmid samples to assess the diversity of populations Pops‐D1‐D3 and Pop‐T1 prior to selection. Mapping obtained reads to the reference plasmid and quantifying the amino acid distribution across randomized positions attested on the overall high degree of amino acid diversity for each position in every library (Supporting Figure S2).
Lastly, to provide an estimate of active H1‐variants in the transformed libraries, we determined the growth characteristics in presence of FP as sole carbon source of 282 individual colonies across all four populations. While for Pop‐D2 more than 50% of all inspected colonies were able to utilize FP as a sole carbon source, Pop‐D1, Pop‐D3, and Pop‐T1 fared significantly worse, with only <10% of all inspected variants being able to grow within 48 h (Supporting Figure S3). These results suggest that the latter three libraries contain a significant number of variants with deleterious mutations, while substitutions at Ser147 & Gln245 (in Pop‐D2) are generally more permissive.
Autonomous Selection of H1‐Variants
2.4
With all populations featuring variants with growth phenotypes, we next performed a series of selection campaigns in which individual populations were challenged to utilize different organofluorides as sole carbon sources. Specifically, we subjected all four populations to two consecutive serial passages in the presence of FP. Monitoring the OD_600_ over time revealed that growth phenotypes for the initial passage required 3–5 days, significantly longer than H1‐producing cells (2 days, Figure 3B). This extended lag period suggests that indeed many cells produced inactive variants that could not proliferate, while the few cells with active variants required longer lag periods to reach detectable cell densities. Interestingly, Pop‐D2 featured a somewhat shorter lag period than the other populations in the initial passage, again suggesting that substitutions of Ser147 and Gln245 are more permissive. Strikingly, all four populations displayed significantly shorter lag periods (1‐2 days) during the following passage, indicative of the successful enrichment of active defluorinases (Figure 3B).
To track the enrichment of individual variants throughout these selection campaigns, we sequenced the remaining genotypic diversity after each passage by performing WPS on pooled plasmids and by Sanger sequencing 4–18 random colonies per passage (Figure 3C,D, Supporting Figure S4 and Supporting Table S4). Gratifyingly, these data attested on the robust enrichment of a small number of variants for all selections and revealed notable trends across populations: (1) Trp180 and Met246 appear critical for defluorination activity, since FP‐based selections almost exclusively enriched variants retaining these wildtype residues after randomization. This result is consistent with the outcome of our plate reader experiments, in which growth phenotypes for populations that targeted one of these residues were scarce (<10% for Pop‐D1, Pop‐D3, and Pop‐T1). (2) Ser147 and Gln245 are more permissive to substitutions, yet small apolar residues appear favorable, such as Ala/Leu(/Ser) for position Ser147 and Ala/Val for Gln245. (3) Growth‐based selections with FP as sole carbon source are robust and do not encourage the enrichment of escape variants. Indeed, only in Pop‐T1 we detected hitchhikers, marked by the excision of the H1 coding sequence in about half of the cells left in the population. These passengers likely originated from a spontaneous recombination event (the host E. coli BL21(DE3) possesses a functional recombination gene, recA), which provides increased fitness by lowering the gene expression / protein production burden.
Based on all sequencing results from these FP‐based selections, we chose nine H1‐variants for further characterization as they (1) were strongly enriched after two passages, (2) were present in populations after the initial passage, or (3) performed well in our initial plate reader experiment (see Supporting Table S5 for a detailed summary). These nine variants are Q245A and Q245G from Pop‐D1; S147L‐Q245W and S147V‐Q245W from Pop‐D2; Q245V, Q245M‐M246L, and Q245F‐M246I from Pop‐D3; and S147L‐Q245V and S147R‐Q245P‐M246L from Pop‐T1. For clarity, shorthand names are used throughout the manuscript that only list the substituted residues (i.e., H1‐A, ‐G, ‐LW, ‐VW, ‐V, ‐ML, ‐FI, ‐LV, and ‐RPL).
Besides FP‐based selections, we also performed a selection campaign utilizing FA as sole carbon source for Pop‐D1. While the H1 wildtype control started to proliferate slowly after approximately 5 days, Pop‐D1 required 10 days to reach a measurable OD_600_ ≈ 0.2 (Figure 3E). Because the growth conditions were highly stringent, we did not perform a second passage but immediately analyzed the remaining diversity in the population by Sanger sequencing. Interestingly, we found that all 6 tested colonies harbored the same variant: Q245R (and W180W, consistent with previous findings) (Supporting Table S4). As we did not identify this H1 variant in any of the populations in the FP‐based selection, we added H1‐R to the panel for further characterization (Supporting Table S5).
Lastly, we performed selection campaigns utilizing F_2_P as potential carbon source for Pop‐D2, Pop‐D3, and Pop‐T1. Unfortunately, none of these populations were able to grow to a measurable OD_600_ even after prolonged incubation (14 days). Curious to see whether cells remained viable after two weeks, we plated a fraction of the selection cultures on LB agar plates. Interestingly, Pop‐T1 gave rise to ∼10‐fold more colonies than Pop‐D3, and about ∼100‐fold more colonies than Pop‐D2 (Supporting Figure S5). Intrigued by this stark difference, we subjected surviving colonies from Pop‐T1 to WPS sequencing, which revealed a marked enrichment of H1‐variants distinct from our previous efforts (Figure 3F). Notably, Met246, which was deemed critical for defluorinase activity on FP, had been substituted with Leu in 48% of reads. Additionally, Ser147 and Gln245 showed enrichment of positively charged residues (Arg, His) rather than small, apolar ones as observed before. While the observed survival phenotype could be unrelated to defluorination activity, we added another three representative H1‐variants of this group for further characterization, namely S147R‐Q245L (H1‐RL), S147R‐Q245H‐M246L (H1‐RHL), and S147P‐Q245L (H1‐PL, Supporting Table S5).
Interestingly, the residual sequence diversity after our selections partially mirrors the amino acid frequency patterns observed in the multiple sequence alignment (MSA) of FAcDs generated by HotSpotWizard. For example, Ser147 and Q245 appear most permissive to substitution, while Trp180 and Met246 exhibit markedly less diversity (Supporting Figure S6). Notably, in the MSA, Q245 is most often substituted for Ala/Val, which are two variants that we identified from our FP selections as well. Conversely, we did neither observe a Trp180Arg substitution, which is present in the MSA, nor found substitutions of Met246 with apolar residues in FP selections at an appreciable level. Overall, our results suggest that the selections recover both evolutionarily conserved and novel adaptive solutions to tune FAcD activity.
Validation of Growth Phenotypes of Selected H1‐Variants
2.5
Prior to the kinetic characterization of all 13 H1‐variants we identified during our selections, we evaluated their growth phenotypes when challenged with the organofluorides of interest. To exclude unintended fitness adaptations resulting from mutations in the vector backbone or within the host's genotype, we cloned the H1‐coding sequences anew and transformed them into fresh E. coli hosts.
From the recorded growth curves on (non‐)natural organofluorides as sole carbon sources (Figure 4A and Supporting Figure S7), a number of interesting trends across the groups of H1‐variants are readily apparent: (1) Hosts producing strongly enriched H1‐variants from FP‐ (or FA‐) based selections (H1‐A, ‐LW, ‐V, ‐LV(, ‐R)) displayed strong growth on FP as carbon source (Figure 4A), attesting on the robustness of our growth‐based selection strategy. (2) H1‐variants chosen from the initial 96‐well plate screen (H1‐G and ‐VW) demonstrated repeated growth on FP. (3) Results for variants that featured substitutions to the conserved Met246 varied, with H1‐FI displaying robust growth on FP, while H1‐ML and ‐RPL largely failed to proliferate. (4) Survivors from F_2_P‐based selections (H1‐RL, ‐PL, and ‐RHL) could not utilize FP, but as all other tested H1‐variants, they also failed to proliferate in presence of F_2_P. Lastly, (5) growth curves on FA and F_2_A did not show consistent proliferation (Supporting Figure S7), a fact that we ascribe to cells possibly residing in stationary phase, excessive stringency by FA, and/or carbon source (= acetate) carryover from the inoculum (see Supporting Discussion for details).
Characterization of selected H1‐variants. (A) Growth properties that H1‐variants bestow on their host when challenged to utilize lactate, FP, or F2P. OD600 values represent the average of 3 measurements from 3 single colonies after 50‐h incubation. The corresponding growth curves can be found in Figure S7. (B) Kinetic parameters of H1‐variants on FA, FP, and F2A. Values represent the average of 2–3 measurements from technical duplicates (except for H1, in which biological duplicates were used). Standard deviations are given. (C) Time‐course analysis of F2A (10 mM) conversion by purified enzymes (1 µM) H1, H1‐ML, and RPA1163 by following F− release over 4 days. (D, E) Deconvoluted masses from UPLC‐MS spectra for H1 (D) or H1‐ML (E) before and during F2A incubation. The observed mass (M) for unmodified FAcDs is marked in black, and three mass species are labeled in green (M‐18), pink (M+56), and orange (M+74). The corresponding raw spectra can be found in Figure S10.
As a proof of concept, we also evaluated the possibility of growing bacteria harboring (in)active FAcDs on solid medium. To do so, we monitored colony formation of cells harboring H1, H1‐A, and H1‐H272A on minimal medium agar plates over 7 days at 30 ^0^C. Notably, cells harboring active FAcDs were able to proliferate on FP as sole carbon source after 4 days of incubation (Supporting Figure S8). The observed growth can be confidently attributed to active defluorination, as E. coli carrying the inactive H1‐H272A variant failed to grow on FP even after extended incubation. Although we favor liquid selections for their higher throughput, the ability of our growth‐based selection to function effectively on both solid and liquid media underscores its versatility and ease of implementation.
Kinetic Characterization of H1‐Variants
2.6
To verify that the observed growth phenotypes correlate with the enrichment of active defluorinases, we produced and purified all 13 H1‐variants (Supporting Figure S9 and Supporting Table S6) and determined their kinetic parameters for organofluoride hydrolysis by ^19^F‐NMR spectroscopy. Gratifyingly, strongly enriched variants from FP‐ (or FA)‐based selections (H1‐A, ‐LW, ‐V, ‐LV (, ‐R)) all demonstrated activity on both FA and FP (Figure 4B and Supporting Table S7). In fact, three of these variants (H1‐A, ‐V, ‐R) exceeded wildtype activity for both substrates, with H1‐A being the most active and displaying an apparent initial rate (v 0,app) of 63 ± 0.82 s^−1^ for FA (2‐fold higher than wildtype) and 3.2 ± 0.21 for FP (1.7‐fold higher than wildtype). To investigate whether enantioselectivity was affected, we incubated the wildtype FAcD and H1‐A with FP for up to 24 hours. We found that for both enzymes, conversion halted at 51% F‐ formation (Supporting Table S8), indicating that enantioselectivity was maintained.
Variants chosen from the initial 96‐well plate screen (H1‐G, H1‐VW) were active on FA and FP but performed (slightly) worse than the wildtype FAcD. Furthermore, the inclusion of variants from the initial serial passage that featured Met246 mutations yielded notable results. Specifically, H1‐FI displayed high catalytic activities with a v 0,app of 38 ± 5.8 s^−1^ for FA (1.2‐fold higher than H1) and 2.1 ± 0.11 s^−1^ for FP (1.1‐fold higher than H1), demonstrating that Met246 is not essential for defluorination activity (consistent with the MSA alignment). Conversely, variants carrying the Met246Leu substitution performed significantly worse: H1‐ML and H1‐RPL retained only 20% and 4% of wildtype activity, respectively, in presence of FA, and just 2% and 1% residual activity with FP as the substrate (Figure 4B and Supporting Table S7). Lastly, variants originating from F_2_P selections overall performed poorly on FA and FP, with the exception of H1‐RL, whose activity on these substrates were comparable to the wildtype FAcD.
To gain insight into the substrate promiscuity for all H1‐variants, we further determined their activity with the gem‐difluorinated substrates F_2_A and F_2_P (Figure 4B and Supporting Table S7). While all variants were able to hydrolyze F_2_A, none of them yielded appreciable conversions for F_2_P (<1%), even at elevated enzyme concentrations (20 µM) and extended reaction times (up to 7 days). Notably, for none of the F_2_A samples we could identify a mono‐fluorinated product by ^19^F‐NMR, indicating that upon breaking the first C─F bond and hydrolysis of the acyl‐enzyme intermediate, the monofluorinated product either undergoes spontaneous defluorination or is rapidly hydrolyzed. Instead, the enzymes seem to undergo catalysis‐dependent inhibition, leading to covalent modification of the H1‐variants (vide infra).
Due this inhibition pathway, FAcD variants were compared based on their turnover numbers (TONs) after 24 hours rather than their apparent initial rates. This analysis revealed a strong correlation: H1‐variants that showed poor activity on mono‐fluorinated substrates often exhibited significantly increased TON when tested with F_2_A (Figure 4B). For example, H1‐ML and ‐RHL displayed TONs of 1,900 ± 161 and 1,140 ± 95, thus exceeding the wildtype enzyme (TON = 307 ± 81) by a factor of 6.1 and 3.7, respectively. Moreover, three of the four variants with the highest F_2_A TONs possess a Met246Leu substitution (Figure 4B), a residue that was highly conserved in selections with mono‐fluorinated substrates (cf. Figure 3D–E). The sole exception in this trend is the generalist H1‐G (featuring a Gln245Gly substitution) which displays high activities for FA (v 0,app = 26 ± 2.1 s^−1^), FP (v 0,app = 1.1 ± 0.17 s^−1^), and F_2_A (TON = 783 ± 21).
Elucidation of F2A‐Dependent Inhibition Pathway of FAcDs
2.7
As noted before, we observed F_2_A‐dependent inhibition of H1‐variants during our kinetic characterization efforts. Notably, while prior works on F_2_A conversion by homologous FAcDs have consistently described low conversions over prolonged incubation periods [21, 42, 43], none of them have pinpointed a catalysis‐dependent inhibition pathway as the reason for this poor performance.
To further examine this inhibition, we monitored F_2_A conversion over four days by periodically quantifying fluoride release from enzymatic assays (see Supporting Information for details). The time course included wildtype H1, the most improved variant H1‐ML, and the well‐characterized FAcD RPA1163, to assess whether F_2_A‐dependent inhibition is a general feature of FAcDs. While H1 reached a plateau at approximately 4% conversion within 24 h, RPA1163 displayed a slow but continuous linear increase in F^−^ release, reaching ∼10% conversion after four days (Figure 4C). In contrast, H1‐ML maintained turnover throughout the entire time course, achieving >40% conversion (although the rate gradually decreased). These observations suggest that H1 undergoes catalysis‐dependent inhibition, which is partially alleviated in H1‐ML. Moreover, the linear F_2_A turnover observed for RPA1163 does not exclude inhibition, although its kinetic behavior differs markedly from that of H1 variants.
To elucidate this inhibition pathway, we incubated H1 with an excess of F_2_A and periodically analyzed samples by mass spectrometry (UPLC‐MS). Following 2‐h incubation, we identified masses of H1 (M) and three distinct species, namely M‐18 Da, M+56 Da, and M+74 Da (Figure 4D and Supporting Figure S10). While the former two species are transient, the M+74 species becomes the dominant species after 24‐h incubation. An analogous MS time course study with RPA1163 revealed the formation of the M+56 Da and M+74 Da species, although a significant fraction of the unmodified enzyme persisted (Supporting Figure S11).
Combined with the findings from the time course, these results suggest that the M+74 Da species represents a covalently modified, inactivated form of FAcDs. Based on the ratios in mass species and degree of inhibition, we surmise that this inhibitory covalent modification is more detrimental to H1 than to RPA1163. One explanation for this difference could lie in the reported half‐site reactivity of RPA1163, where catalysis occurs in a single subunit of the dimer, and binding of a second substrate at an allosteric site inhibits activity. [20, 44, 45, 46] However, other explanations cannot be excluded at this point, as these studies used FA, not F2A, as the substrate and inhibition arose from noncovalent interactions.
To identify potential causes for the increased TONs observed for selected H1‐variants, we again performed MS time‐course studies with H1‐ML, which provides a 6.1‐fold increase in F_2_A conversion when compared to H1 (Figure 4D and Supporting Figure S10). While this improved variant still becomes fully modified after 24 h, we observe a marked decrease in signals corresponding to the transient species M‐18 and M+56 after two hours. Given these observations, higher TONs are likely the result of overall slower inhibition, possibly by virtue of slower activity. While the Met246Leu substitution negatively impacts initial rates for the mono‐fluorinated substrates FA and FP, this setback is likely critical for the beneficial conversion of F_2_A.
In an effort to further pinpoint the residue that is modified in the M+74 species, we digested (F_2_A‐inhibited) H1‐variants using (chymo)trypsin and subjected the resulting peptides to LC‐MS/MS analysis (see Supporting Information for details). Unfortunately, we were unable to detect significant differences between samples of H1‐variants in presence and absence of F_2_A, indicating that the observed modification is not stable throughout the proteomics workflow. Based on prior mechanistic investigations [21, 42, 43], we tentatively assign the three species as follows (Supporting Figure S12): (1) M‐18 corresponds to the fragmentation of the Asp105 acyl‐enzyme intermediate that is formed after the initial defluorination reaction; (2) M+56 is consistent with the formation of an imine intermediate that results from the condensation of glyoxylate and a nucleophilic lysine residue; (3) M+74 matches the mass of an enzyme‐glycolate covalent intermediate (Enz—C_2_H_2_O_3_) rather than an enzyme‐FA intermediate (Enz—C_2_HO_2_F) with a mass difference of M+76.
Docking Studies with Boltz‐2
2.8
To rationalize the observed improvements in FP initial rates and F_2_A TONs, we generated structural models of the most improved variants and performed docking studies with Boltz‐2 [47]. Specifically, we modeled H1, H1‐A (improved for FP), H1‐G (generalist), and H1‐ML (improved for F_2_A) for substrates FA, FP, and F_2_A. Upon inspection of the predictions, we noticed that the carboxylate group instead of the F atom(s) of the substrates is oriented toward the fluoride pocket (Supporting Figure S13 and Supporting Data). While this substrate orientation does not appear to be suitable for nucleophilic attack by Asp105, a similar pose has previously been reported to be the most energetically stable [48]. As such, caution should be exercised when interpreting the resulting Boltz‐2 models.
When comparing models for different FAcD variants and substrates, we could only pinpoint a slight variability the orientation of Trp180 (Supporting Figure S13). However, this variability could result from the moderate confidence in the residue's predicted position, as the predicted local distance difference test (pLDDT) value for Trp180 was approximately 65 across all models (Supporting Table S9). Interestingly, pLDDT values from AlphaFold2 models have been shown to correlate well with flexibility scores from molecular dynamics simulations [49], which aligns well with the equivalent residue in a homologous FAcD having been reported to be highly dynamic and possibly performing a ‘flapping’ motion [20].
Since the substitutions in H1 (Q245M and M246L for H1‐ML, Q245G for H1‐G, and Q245A for H1‐A) are located in a loop close to the α‐helix containing Trp180, further investigation into the interplay between Trp180 and this loop could yield valuable insights in the future.
Conclusion
3
The continuous and widespread use of organofluorides in modern society necessitate sustainable degradation strategies for the harmful, persistent pollutants that arise from fluorinated compounds and materials. While C─F bonds are inherently stable and largely absent from Nature, engineering existing biocatalysts to recognize harmful and persistent pollutants represent an attractive strategy. Toward this goal, FAcDs are particularly promising starting points, as they can hydrolyze C─F bonds in FA multiple times per seconds. However, their engineering toward non‐natural organofluorides has remained virtually unexplored.
In this work, we developed and applied a novel growth‐based selection platform that enables high‐throughput engineering of FAcDs for the degradation of organofluorides. Our cost‐effective and technically simple strategy relies on the growth advantage conferred by active biocatalysts that can utilize fluorinated (non‐)natural substrates as sole carbon source. By using bacterial growth as a proxy for C─F bond cleavage activity, consecutive serial passages elicited active H1‐variants from diverse starting populations. Attesting on the robustness of this strategy, we did not observe escape variants, with only one population partially harboring hitchhikers which had excised the H1 coding sequence to reduce expression burden. Selected H1‐variants displayed up to 2‐fold higher activities for FA and 1.7‐fold increased activity for FP. Moreover, by including survivors on F_2_P and variants that went extinct after an initial passage, we identified H1‐variants with drastically altered substrate selectivity. Notably, large increases in F_2_A turnover required a sharp loss in defluorination activity for FA and FP. A Met246Leu substitution proved key to retard a previously unknown F_2_A‐dependent inhibition pathway, which we characterized by time‐course UPLC‐MS studies. With the degradation of gem‐difluorinated carboxylic acids being a critical steppingstone for the degradation of PFAS and related compounds, this inhibition pathway warrants further investigation in the future.
Lastly, the realization of technically simple, growth‐based selections based on the catabolism of organofluorides provides ample opportunities for the discovery and engineering of FAcDs. For example, it offers a straightforward means to elicit novel defluorinases from PFAS‐contaminated soil samples. Additionally, it is conceivable that other short‐chain (gem‐di)fluorinated molecules can be used as sole carbon sources, enabling the engineering of FAcDs toward more PFAS‐like molecules. Lastly, interfacing this platform with targeted in vivo mutagenesis strategies in the future should allow for the continuous and autonomous exploration of long evolutionary pathways [50]. As such, we anticipate that the operationally simple, high‐throughput selection strategy we developed will prove valuable in making defluorinases fit for the degradation of persistent and harmful organofluorides in our environment.
Conflicts of Interest
The authors declare no conflict of interest.
Supporting information
Supporting File 1: The manuscript is accompanied by a Supporting Information file, which includes Supporting Tables and Figures, a Supporting Discussion, experimental procedures, and sequences of H1‐variants used in the study. Additionally, all Boltz‐2 predictions are included (.json and. cif files), and a supporting document is provided, which contains all raw ^19^F‐NMR spectra for kinetic characterization of H1‐variants, determination of defluorination activity in the supernatant of cultures, and relaxation delay tests.The authors have cited additional references within the Supporting Information [51, 52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 62, 63, 64].
Supporting File 2: anie71294‐sup‐0002‐Data.zip.
Supporting File 3: anie71294‐sup‐0003‐19FNMR_supernatants, kinetics, eetest, d1, 4F.pdf.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Q. Wang , H. Song , and Q. Wang , “Fluorine‐Containing Agrochemicals in the Last Decade and Approaches for Fluorine Incorporation,” Chinese Chemical Letters 33 (2022): 626–642.
- 2M. Inoue , Y. Sumii , and N. Shibata , “Contribution of Organofluorine Compounds to Pharmaceuticals,” ACS Omega 5 (2020): 10633–10640.32455181 10.1021/acsomega.0c 00830 PMC 7240833 · doi ↗ · pubmed ↗
- 3S. M. S. Hussain , A. A. Adewunmi , A. Mahboob , M. Murtaza , X. Zhou , and M. S. Kamal , “Fluorinated Surfactants: A Review on Recent Progress on Synthesis and Oilfield Applications,” Advances in Colloid and Interface Science 303 (2022): 102634.35305443 10.1016/j.cis.2022.102634 · doi ↗ · pubmed ↗
- 4E. Kissa , Fluorinated Surfactants and Repellents, Vol. 97 (CRC Press, 2001).
- 5K. Uneyama , Organofluorine Chemistry, (Blackwell Publishing, 2006).
- 6D. O'Hagan , “Understanding Organofluorine Chemistry. An Introduction to the C─F Bond,” Chemical Society Reviews 37 (2008): 308–319.18197347 10.1039/b 711844 a · doi ↗ · pubmed ↗
- 7F. Xiao , “Emerging Poly‐ and Perfluoroalkyl Substances in the Aquatic Environment: A Review of Current Literature,” Water Research 124 (2017): 482–495.28800519 10.1016/j.watres.2017.07.024 · doi ↗ · pubmed ↗
- 8M. B. Murphy , E. I. H. Loi , K. Y. Kwok , and P. K. S. Lam , in Fluorous Chemistry, I. T. Horváth (Springer Berlin Heidelberg, 2012) 339–363.
