π–π Stacking Determines the Selectivity of Unnatural DNA Base Pairs Even without Polymerase
Zahra Noori, Andreu Bermejo, Josep Maria Bofill, Jordi Poater

TL;DR
This study explains how synthetic DNA base pairs can be selectively incorporated into DNA using quantum chemistry, without needing a polymerase enzyme.
Contribution
The study shows that π–π stacking interactions alone determine the selectivity of unnatural DNA base pairs.
Findings
Stacking energies within DNA helix explain the selectivity of DsPx base pairs.
Electrostatic and dispersion interactions enhance DsPx's affinity compared to other base pairs.
The framework helps design synthetic genetic systems with improved replication fidelity.
Abstract
Expanding the genetic alphabet requires a mechanistic understanding of how synthetic bases are faithfully replicated alongside natural DNA. We present a quantum chemical study reproducing the experimentally observed single-nucleotide incorporation selectivity of Hirao’s unnatural base pairs (UBPs) by the 3′–5′ exonuclease-deficient Klenow fragment of Escherichia coli DNA polymerase I. Our analysis focuses on the highly selective DsPx pair, benchmarking its behavior against canonical Watson–Crick pairs and other UBPs. Strikingly, the observed selectivity emerges without explicitly modeling the polymerase, relying solely on computed stacking energies within the DNA helix. Molecular orbital and energy-decomposition analyses show that both electrostatic and dispersion interactions strengthen DsPx’s affinity more, capturing experimental fidelity trends and explaining its superior performance…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4
5
6
7
8
9
10
11
12
13
14| Δ | Δ | Δ | Δ | Δ | |
|---|---|---|---|---|---|
| DsPx/AT | –14.9 | 23.1 | –7.8 | –4.4 | –25.8 |
| AT/AT | –14.0 | 17.3 | –5.8 | –3.4 | –22.1 |
| DsPx/GC | –16.0 | 20.6 | –7.1 | –4.3 | –25.2 |
| AT/GC | –14.1 | 16.6 | –4.9 | –3.8 | –21.9 |
| Δ | Δ | Δ | Δ | Δ | ||
|---|---|---|---|---|---|---|
| DsPx/GC | DsPx/GC | –16.0 | 20.6 | –7.1 | –4.3 | –25.2 |
| Ds/G | –6.3 | 8.0 | –2.3 | –1.9 | –10.1 | |
| Px/C | –4.9 | 7.6 | –2.2 | –2.0 | –8.3 | |
| Px/G | –2.0 | 1.2 | –0.7 | –0.5 | –2.0 | |
| Ds/C | –3.7 | 4.6 | –2.4 | –1.0 | –4.8 | |
| AT/GC | AT/GC | –14.1 | 16.6 | –4.9 | –3.8 | –21.9 |
| A/G | –7.6 | 7.5 | –3.6 | –1.9 | –9.6 | |
| T/C | –6.5 | 7.8 | –3.2 | –2.4 | –8.7 | |
| T/G | –0.4 | 1.0 | 0.9 | –0.4 | –1.9 | |
| A/C | 0.1 | 1.1 | 1.2 | –0.4 | –1.8 |
- —Ministerio de Ciencia, Innovaci?n y Universidades10.13039/100014440
- —Ministerio de Ciencia, Innovaci?n y Universidades10.13039/100014440
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Nucleic Acid Chemistry · DNA Repair Mechanisms · Cyclization and Aryne Chemistry
Introduction
1
DNA aptamers are short, single-stranded DNA molecules that bind specifically to a variety of targets, including small molecules, proteins, and cells. ?,? They are typically generated through an evolutionary process known as SELEX (Systematic Evolution of Ligands by EXponential enrichment), which involves repeated cycles of selection and polymerase chain reaction amplification (PCR) using DNA libraries with randomized sequences.? Once the aptamer sequences are identified via SELEX, they are synthesized chemically, enabling the production of highly pure DNA aptamers. This method allows for precise sequence control and site-specific chemical modifications, making these newly synthesized DNA aptamers attractive alternatives to antibodies due to their reproducibility and consistency.
Compared to protein-based antibodies, DNA aptamers are inherently more hydrophilic and highly soluble in water, which helps avoid the nonspecific “stickiness” often associated with antibodies. ?,? However, this hydrophilic nature also weakens the hydrophobic interactions essential for strong binding to many protein targets. As a result, conventional aptamers composed solely of the four natural nucleotidesadenine (A), guanine (G), cytosine (C), and thymine (T)often exhibit insufficient affinity for practical applications. ?−? ? To address this limitation, several chemical modification strategies have been explored. ?−? ? One common approach involves attaching hydrophobic groups to the natural DNA bases to enhance target binding. ?−? ?
As a novel strategy to improve binding affinity, a genetic alphabet expansion method was developed by Hirao and co-workers by introducing a highly hydrophobic unnatural base (UB)7-(2-thienyl)imidazo[4,5-b]pyridine (Ds)as a fifth nucleotide.? This base specifically pairs with a modified partner, 2-nitro-4-propynylpyrrole (Px), which is functionalized with a diol group (Figure). The Ds–Px pair forms a third base pair that can be replicated with high fidelity during PCR amplification. ?,? Using this expanded base pair system, the authors developed a modified SELEX technique called ExSELEX (genetic alphabet expansion SELEX), which utilizes DNA libraries incorporating the Ds base. This method successfully yielded XenoAptamersDs-containing aptamers with markedly enhanced affinities for protein targets. ?,?,?
Hirao’s DsPx UB pair development process.
However, where does DsPx come from? In the mid-1990s, Morales and Kool developed hydrophobic analogs of the AT Watson–Crick base pair. In particular, non-hydrogen bonding UB pairs (UBPs), like QF and ZF structures depicted in Figure, are capable of DNA replication. ?,? Next, improvements by Hirao and co-workers led to the QPa pair using pyrrole-2-carboxaldehyde (Pa), which showed better shape complementarity and replication selectivity than QF.? To enhance selectivity and avoid mispairing (e.g., Q with T), the same team developed a new hydrophobic base, Ds, which paired with Pa.? They used γ-amidotriphosphates to suppress mispairing (either DsDs or APa), achieving the high-fidelity PCR amplification of DsPa pairs (99.9% selectivity after 20 cycles). To eliminate reliance on γ-amidotriphosphates, they created improved Pa analogs. Replacing the aldehyde with a nitro group yielded Pn, reducing mispairing with A. And finally, further enhancement led to Px (2-nitro-4-propynylpyrrole), which offered increased hydrophobicity and polymerase interaction, as pointed out above.? In addition, the added propynyl group causes an increase of the stacking with adjacent base pairs, also avoiding DsDs mispairing.
In earlier work by our group, we showed that DNA replication depends critically on π–π stacking interactions and aqueous solvation. ?−? ? ? We specifically highlighted the interplay among hydrogen bonding, solvation, twist angles, and π–π stacking within a template–primer complex and demonstrated how these factors govern the stability of both complementary and mismatched base pairs. Building on that foundation, the study examined the stability of Watson–Crick and UB pairs, as well as the intrinsic ability of the template–primer complex to select the correct complementary base in the absence of DNA polymerase.? To achieve this, we designed DNA model systems that operate independently of the polymerase active site, leveraging the fact that template-assisted chemical primer extension can reach base selectivity levels of up to 97% without enzymatic support. ?,? This strategy allowed us to isolate and probe the detailed interactions between DNA bases without interference from amino acid residues in the active site.
Now, our goal is to reproduce, using quantum chemical methods, the single-nucleotide incorporation efficiency and selectivity of various UB pairs (UBPs) developed by Hirao,? as observed with the 3′–5′ exonuclease-deficient Klenow fragment of Escherichia coli DNA polymerase I. In particular, we focus on those UBPs with the highest reported selectivitynamely, QPa, DsPa, DsPn, and DsPx. ?,?,? We aim to be able to explain, through quantum chemistry, the higher selectivity of DsPx as depicted in Figure 4 of the referred work by Hirao and co-workers.? By using known DNA bases alongside the proposed UB pair DsPx as a starting point, our approach aims to deepen the understanding of how UBPs can be incorporated into DNA helix and the corresponding results can be used in further studies, with emphasis on those on DNA replication.
Computational Methods
2
All calculations were carried out with the Amsterdam Density Functional (ADF) program using dispersion-corrected density functional theory at the ZORA-BLYP-D3(BJ)/TZ2P level of theory (Table S3). ?−? ? ? ? ? The chosen functional has been demonstrated to reliably reproduce hydrogen-bonding geometries and energies for A–T and G–C Watson–Crick base pairs as well as stacked nucleotide arrangements. It also provides accurate interaction and bonding energies for π–π-stacked base systems. Both π-stacking and hydrogen-bonding energetic values obtained with this method closely match high-level CCSD(T) benchmark results. ?,? To mimic the near-planar structure of B-DNA, isolated bases and hydrogen-bonded base pairs were optimized under a C _ s _ symmetry constraint. Prior studies have shown that fully optimized C _ s _-symmetric planar structures and C 1-symmetric nonplanar structures exhibit virtually identical stabilities. The difference in bond energies between relaxed C 1 and C _ s _ conformations is generally within 0.1 kcal mol^–1^, except for the G–A system, which deviates by 0.6 kcal mol^–1^ due to amino-group pyramidalization. ?,?
The effect of solvation in water was simulated by means of the Conductor-like Screening Model of solvation as implemented in ADF. The through-space interaction was analyzed within the framework of quantitative Kohn–Sham molecular orbital theory in combination with a quantitative energy decomposition analysis (EDA) in the gas phase. The interaction energy ΔE int between the tetrazoles and aryl fragments was decomposed into the classical electrostatic attraction ΔV elstat, Pauli repulsion ΔE Pauli between occupied orbitals, stabilizing orbital interactions ΔE oi, and dispersion ΔE disp.? Finally, the electron density distribution is analyzed by using the Voronoi deformation density (VDD) method for atomic charge.? Non-Covalent Interaction (NCI) plots have been computed by means of the Multiwfn software. ?,?
Model
System
2.1
To begin, each nucleotide base was fully optimized without symmetry constraints (C _ s _ symmetry). We then evaluated the affinity between pairs of DNA bases, whether they form canonical Watson–Crick hydrogen bonds or represent non-natural base pairs that do not naturally hydrogen-bond. For this analysis, the individually optimized bases were positioned to favor potential hydrogen-bond formation, after which a geometry optimization was carried out while enforcing the C _ s _ symmetry. The orientation of the hydrogen-bondedor nonbondedpairs, specifically the placement of the terminated glycosidic bond, was chosen to reproduce the experimentally observed α-helical twist of DNA. ?−? ? ? Then, the X–Y base pair bonding energy is computed as ΔE = E(X–Y) – E(X) – E(Y), being both the base pair and the bases alone fully optimized in C _ s _ symmetry.
The second part of this study focuses on π-stacking interactions between base pairs. Stacked dimers were constructed using C _ s _-optimized base pairs. To isolate stacking effects, the sugar–phosphate backbone was removed and replaced with hydrogen atoms, enabling us to study interactions directly between the nucleobases. ?,?,? The stacked arrangement was modeled by fixing the interpair separation at 3.4 Å and imposing a 36° twist angle, mimicking the influence of the backbone while preventing relaxation of the stacked structure (Figure). For this latter rotation, first the base pair is geometrically centered in the xy plane, with the two glycosidic bonds aligned in the x-axis, and next, the 36° rotation is applied in the z-axis around this center. This twist angle is between the two planes arisen from the straight lines connecting these two added H atoms that would give the glycosidic bond of each base. Noticeably, this simplified stacking model, which omits the backbone, has also been employed previously in studies of ring-expanded nucleobases. Prior work by Wetmore and co-workers demonstrated that although including the phosphodiester backbone can slightly alter the stationary points associated with nucleotide deglycosylation, backbone-truncated models with capped glycosidic linkages strike an effective balance between computational accuracy and efficiency. ?−? ? Nonetheless, with the aim to further support our model system, we have also performed the optimization of the DsPx/GC-stacked base pair dimer including the backbone together with two sodium atoms (Figure S4), proving the good agreement with our model system and supporting previous works undertaken with the same methodology. Finally, the X–Y/X′–Y′ stacking energy is computed as ΔE ^stacking^ = E(X–Y/X′–Y′) – E(X–Y) – E(X′–Y′), being the base pairs fully optimized in C _ s _ symmetry, both alone and in the built stacked system. And for the stacked trimer, X″–Y″/X–Y/X′–Y′, its stacking energy is computed as ΔE ^stacking^ = E(X″–Y″/X–Y/X′–Y′) – E(X″–Y″) – E(X–Y) – E(X′–Y′).
Schematic (top) and on-scale atomistic (bottom) representation of the complex of the DsPx base pair stacked to AT (colored in light blue) at a distance of 3.4 Å and with a mutual rotation of 36°. Front view (left) and top view (right) of the stacked base pair dimer.
Results and Discussion
3
The first step is to analyze the bonding energies of the set of base pairs. As a reference, Watson–Crick AT and GC base pairs present total bonding energies of −8.7 and −13.4 kcal mol^–1^, respectively, computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water. The stronger GC is attributed to the presence of three hydrogen bonds, compared to two for AT (Figure and Table S1). ?−? ? ? Noticeably, computed hydrogen bond lengths are very close to those present in Dickerson’s dodecamer. ?,? Next, we have considered recently designed UB pairs (UBPs), together with those formed between Ds or Q with the four DNA bases, with the aim to analyze their selectivity. At difference from WC base pairs, all UBPs under analysis present very weak bonding energies. For instance, in the case of Ds, the bonding energies of the base pairs amount between −2.6 and −1.0 kcal mol^–1^ (Figure), whereas in the case of Q, ΔE amounts between −1.0 and −0.2 kcal mol^–1^ (Figure). Such weaker bonding energies compared to WC energies must be attributed to the lack of hydrogen bonds in these UBPs. All interactions are much longer than 2.0 Å (Figures and ?), thus supporting their weak nature. Importantly, all base pairs have been constrained to C _ s _ symmetry to mimic the planarity in DNA helix, in which the base pairs have been previously proven to present an almost planar geometry.? Nonetheless, we have checked that the maximum energy difference between constrained and fully relaxed bases amounts to be less than 0.2 kcal mol^–1^.
Watson–Crick AT and GC base pairs. Hydrogen bond lengths (in Å) and bonding energies (in kcal mol–1) are also enclosed. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
DNA base pairs with unnatural Ds. Main bond lengths (in Å) and bonding energies (in kcal mol–1) are also enclosed. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
DNA base pairs with unnatural Q. Main bond lengths (in Å) and bonding energies (in kcal mol–1) are also enclosed. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
The aim of this analysis is to check the selectivity of these newly proposed UBPs, i.e., how favorable are either DsPn, DsPa, or DsPx compared to Ds binding any of the DNA bases (DsA, DsC, DsG, or DsT). For such an aim, a model system was built without polymerase (see the Computational Details section). The stacked systems were prepared by separating the base pairs 3.4 Å, which is the average distance in DNA, and rotating them to a twist angle of 36°, which corresponds to the average twist in B-DNA (Figure). In all the systems studied, the backbone was removed, hydrogen atoms have been inserted in its position, and a C _ s _ symmetry was imposed to mimic the nearly planar arrangement observed in B-DNA.
Outstandingly, the UB pair DsPx appears to present the strongest interaction with either Watson–Crick AT or GC (Figure). Thus, the preferred selectivity of the formation of DsPx is also confirmed by its stacking strength to a natural base pair. In particular, DsPx presents stacking energies (ΔE ^stack^) that amount to −11.7 and −12.6 kcal mol^–1^ with either AT or GC, respectively. Its higher affinity is supported by the difference of 1.0 and 2.0 kcal mol^–1^ in ΔE ^stack^ compared to the possible formation of any other base pair with Ds. The preference for DsPx is followed by DsPn (ΔE ^stack^ = −10.4 and −10.6 kcal mol^–1^ with AT and GC, respectively), the UBP from which it was derived, thus also supporting its good performance compared to the previously developed ones. However, its ΔE ^stack^ is closely followed by other possible base pairs. For instance, DsPa follows next (ΔE ^stack^ = −10.3 and −10.3 kcal mol^–1^ with AT and GC, respectively), although it is not being preferred compared to its binding with natural bases. Also noticeably, the stacking energies with either AT or GC present quite similar values, with a maximum difference of 0.82 kcal mol^–1^ in the case of the most favored DsPx when stacked to GC. At this point, it must be pointed out that, despite the fact that we have referred to this preference for DsPx as selectivity, the term affinity is better suited based on the small energy difference.
Stacking interaction (in kcal mol–1) of the considered base pairs (XY) with Ds, and Watson–Crick for comparison, on top of AT or GC base pairs with a twist angle of 36°. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
If the bottom base pair is reversed, i.e., instead of AT or GC, we put TA or CG, the trends are kept (Figure). DsPx interacts the strongest (ΔE ^stack^ = −13.0 and −12.7 kcal mol^–1^ with TA and CG, respectively). The unnatural DsPn (ΔE ^stack^ = −12.0 and −11.9 kcal mol^–1^ with TA and CG, respectively) and DsPa (ΔE ^stack^ = −12.2 and −11.7 kcal mol^–1^ with TA and CG, respectively) follow next. Noticeably, in this case, the formation of UBPs presents a clear preference compared to Ds binding any of the four natural DNA bases.
Stacking interaction (in kcal mol–1) of the considered base pairs (XY) with Ds, and Watson–Crick for comparison, on top of TA or CG base pairs with a twist angle of 36°. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
At difference, when the UBPs are stacked at the bottom of either AT or GC, DsPx is not the preferred one, but other base pairs stack stronger (Figure S1). In particular, DsA presents a stronger stacking energy (−14.4 and −13.5 kcal mol^–1^ with AT and GC, respectively) than that of DsPx (−12.5 and −12.4 kcal mol^–1^ with AT and GC, respectively). However, this is not the case with reversed TA or CG (Figure S2), when again DsPx presents the strongest stacking interaction (−13.9 and −13.1 kcal mol^–1^ with AT and GC, respectively), followed by DsA (−13.3 and −12.4 kcal mol^–1^ with AT and GC, respectively). Thus, based on a model system with two stacked base pairs, being one the UBP and the second a Watson–Crick one, in almost all cases, the preferred affinity of DsPx is determined by its stronger stacking interaction with the Watson–Crick base pair. This is not the case with either previously designed DsPa or DsPn that, despite also presenting a relatively similar ΔE ^stack^ to DsPx, values are very close or even smaller to those base pairs of Ds binding natural DNA bases.
The same analysis above has been applied to QPa, with the aim to check its selectivity compared to Q forming either QA, QC, QG, or QT (Figure). QPa presents the strongest stacking energy with either AT or GC (−9.7 and −9.9 kcal mol^–1^ with AT and GC, respectively). Thus, QPa, which was even designed before DsPx, is preferred over Q binding to any of the natural bases just based on the stacking interaction with Watson–Crick DNA bases.
Stacking interaction (in kcal mol–1) of the considered base pairs (XY) with Q, and Watson–Crick for comparison, on top of AT or GC base pairs with a twist angle of 36°. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
With the aim to get closer to a more realistic DNA helix, we placed the UBPs stacked between two Watson–Crick base pairs (Figure). As the dimer model system above, each base pair is twisted 36° with respect to the next one.
Schematic (left) and on-scale atomistic (right) representation of the complex of DsPx base pair stacked between two AT (colored in light blue) base pairs at a distance of 3.4 Å and with a mutual rotation of 36°.
DsPx once again appears as the UPB that presents the strongest stacking interaction. Whatever the combination of AT and GC is, DsPx stacks stronger (between −24.7 and −25.6 kcal mol^–1^) than any of the other UBPs (Figure). Both previously designed DsPa and DsPn also present strong ΔE ^stack^ energies (between −22.6 and −23.4 kcal mol^–1^). These latter are only weaker than those by DsA (between −23.5 and −24.7 kcal mol^–1^).
Stacking interaction (in kcal mol–1) of the considered base pairs with Ds, located in between Watson–Crick base pairs forming a trimer with a twist angle of 36°. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
Noticeably, DsPx is also preferred if it is reversed, i.e., PxDs, and also placed in the same trimer model system (Figure). ΔE ^stack^ energies are even larger compared to the original orientation (from −26.3 to −27.4 kcal mol^–1^). Then, DsPa directly competes with DsA, closely followed by DsPn.
Stacking interaction (in kcal mol–1) of the considered base pairs with Ds, located in between Watson–Crick base pairs forming a trimer with a twist angle of 36°. For nomenclature, whereas XY refers to DsPx, YX refers to PxDs. Computed at the ZORA-BLYP-D3(BJ)/TZ2P level of theory in water.
Overall, with our model system, we are able to reproduce the observed experimental single-nucleotide incorporation selectivity of Hirao’s UB pairs by the 3,5-exonucleobase-deficient Klenow fragment of E. coli DNA polymerase I. ?,? Noticeably, our model system does not include polymerase; thus, such selectivity is exclusively already reproduced by means of computed stacking energies in the DNA helix, giving rise to stronger affinities.
However, can we obtain further insight regarding the reason for this stronger affinity by the UBP DsPx compared to that of either Watson–Crick AT or GC? Why is such enhanced stacking of DsPx responsible for its better selectivity among the other possible formed base pairs? To address this issue, a quantitative Kohn–Sham molecular orbital analysis? together with an EDA has been performed on two extreme cases discussed above, i.e., DsPx and AT, and each of them stacked on top of both AT and GC.
First, even without the aqueous environment, DsPx stacks stronger than AT on top of either AT (−14.9 vs −14.0 kcal mol^–1^) or GC (−16.0 vs −14.1 kcal mol^–1^, Table). These values are larger than those in water due to the effect of the solvation energy in each base pair compared to the whole dimer. ?−? ? Let us first focus on the dimers stacked with GC. Despite the more unfavorable Pauli repulsion DsPx experiences (20.6 vs 16.6 kcal mol^–1^), it is compensated by more attractive electrostatic (−7.1 vs −4.9 kcal mol^–1^) and dispersion (−25.2 vs −21.9 kcal mol^–1^) interactions and also by slightly more favorable orbital interactions (−4.3 vs −3.8 kcal mol^–1^), thus giving rise to an enhanced stacking interaction compared to AT (Table). We can go further by analyzing the individual interactions in DsPx/GC, i.e., between Ds and G, and between Px and C, and also their cross terms (Table), and compare to those for the AT/GC-stacked dimer. Noticeably, the cross interactions are the ones responsible for the stronger stacking of DsPx compared to AT with GC. In particular, Px/G and Ds/C are stronger than T/G and A/C by 1.6 and 3.8 kcal mol^–1^, respectively. And the reason is found in the more favorable electrostatic interactions between these involved bases (by −1.6 and −3.7 kcal mol^–1^, respectively) but also the more attractive dispersion interaction in the case of Ds/C compared to A/C (by −3.0 kcal mol^–1^, Table). The more favorable, i.e., more attractive, electrostatic interaction in the case of DsPx than AT stacked to GC comes from their different electronic structure, as supported by the VDD charges distribution (Figure), as well as by their molecular electrostatic potential (MEP) isosurfaces (Figures and S3).
1: EDA (in kcal mol–1) of the Stacking Interaction of the DsPx and AT Base Pairs on Top of the AT or GC Base Pairs with a Twist Angle of 36°
2: EDA (in kcal mol–1) of the Stacking Interaction of the DsPx and AT Base Pairs on Top of the GC Base Pair with a Twist Angle of 36°, together with the Individual Contributions of the Bases and the Cross Terms
VDD charge isosurfaces (in e) of the base pairs involved in DsPx/GC and AT/GC dimers. GC is rotated with a twist angle of 36°.
MEP isosurfaces (a.u., electronic density isovalue = 0.03 au) of the base pairs involved in DsPx/GC and AT/GC dimers. GC is rotated with a twist angle of 36°.
The above conclusions are not only restricted to these base pair dimers, but the same can be applied when comparing DsPx to AT stacked to AT (Table S2). In addition, the determinant role of both electrostatic and dispersion interactions is also applied to the rest of the possible UB pairs under analysis, whose values lie in between the two extreme cases we have analyzed in depth above.
Finally, and for completeness, we have also computed the NCI plots for both these latter dimers with the aim to better understand the better dispersion interaction in the case of DsPx stacked to GC (Figure). In the case of the AT/GC dimer, A shows an important noncovalent interaction with G; however, that between T and C is quite weak. At difference, in DsPx/GC, with a stacked base, dimers show such a noncovalent interaction, with a special emphasis on the propynyl group attached in Px interacting with C below. Thus, such observation agrees with the stronger dispersion interaction of this latter dimer compared to the Watson–Crick one.
NCI plot of DsPx/GC and AT/GC dimers (promolecular density isovalue = 0.35 a.u.).
Conclusions
4
We have successfully reproduced, using quantum chemical methods, the experimentally observed single-nucleotide incorporation selectivity of Hirao’s UB pairs (UBPs) by the 3′–5′ exonuclease-deficient Klenow fragment of E. coli DNA polymerase I, originally developed by Hirao and co-workers.? Our analysis focuses on the UBP that exhibits the highest selectivity, Ds–Px, and compares its performance not only with canonical Watson–Crick base pairs but also with other previously designed UBPs. Importantly, our model system excludes the polymerase itself, yet the experimentally observed selectivity emerges solely from the computed stacking energies within the DNA helix. Through quantitative molecular orbital and energy decomposition analyses, we demonstrate the decisive contributions of both electrostatic and dispersion interactions to the computed enhanced affinity of Ds–Px. Ultimately, our approach provides new insights into DNA replicationone of the most fundamental biological processeswhose molecular basis of remarkably high fidelity remains only partially understood.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hirao I.Kimoto M.Yamashige R.Natural versus Artificial Creation of Base Pairs in DNA: Origin of Nucleobases from the Perspectives of Unnatural Base Pair Studies Acc. Chem. Res.2012452055206510.1021/ar 200257 x 22263525 · doi ↗ · pubmed ↗
- 2Kimoto M.Hirao I.Genetic alphabet expansion technology by creating unnatural base pairs Chem. Soc. Rev.2020497602762610.1039/D 0CS 00457 J 33015699 · doi ↗ · pubmed ↗
- 3Kimoto M.Tan H. P.Matsunaga K.-i.Binte Mohd Mislan N. A.Kawai G.Hirao I.Strict Interactions of Fifth Letters, Hydrophobic Unnatural Bases, in Xeno Aptamers with Target Proteins J. Am. Chem. Soc.2023145204322044110.1021/jacs.3c 0612237677157 PMC 10515488 · doi ↗ · pubmed ↗
- 4Ausserwöger H.Schneider M. M.Herling T. W.Arosio P.Invernizzi G.Knowles T. P. J.Lorenzen N.Non-specificity as the sticky problem in therapeutic antibody development Nat. Rev. Chem.2022684486110.1038/s 41570-022-00438-x 37117703 · doi ↗ · pubmed ↗
- 5Kimoto M.Shermane Lim Y. W.Hirao I.Molecular affinity rulers: systematic evaluation of DNA aptamers for their applicabilities in ELISA Nucleic Acids Res.2019478362837410.1093/nar/gkz 68831392985 PMC 6895277 · doi ↗ · pubmed ↗
- 6Qian S.Chang D.He S.Li Y.Aptamers from random sequence space: Accomplishments, gaps and future considerations Anal. Chim. Acta 2022119633951110.1016/j.aca.2022.33951135151405 · doi ↗ · pubmed ↗
- 7Wang T.Chen C.Larcher L. M.Barrero R. A.Veedu R. N.Three decades of nucleic acid aptamer technologies: Lessons learned, progress and opportunities on aptamer development Biotechnol. Adv.201937285010.1016/j.biotechadv.2018.11.00130408510 · doi ↗ · pubmed ↗
- 8Pfeiffer F.Rosenthal M.Siegl J.Ewers J.Mayer G.Customised nucleic acid libraries for enhanced aptamer selection and performance Curr. Opin. Biotechnol.20174811111810.1016/j.copbio.2017.03.02628437710 · doi ↗ · pubmed ↗
