Energy landscapes and dynamics of xylo-nucleic acids
Daniel J. Sharpe, Konstantin Roeder, David J. Wales

TL;DR
This study uses computational energy landscape analysis to explore the structural dynamics of xylo-nucleic acids, revealing their conformational flexibility and potential as molecular switches, contrasting with natural nucleic acids.
Contribution
It introduces the energy landscape framework to analyze xylo-nucleic acids, highlighting their unique frustration and conformational properties compared to natural nucleic acids.
Findings
Xylo-nucleic acids exhibit a flexible, bistable conformational landscape.
A left-handed helix is the global free energy minimum.
Xylo-nucleic acids are more frustrated than natural biomolecules.
Abstract
Artificial analogues of the natural nucleic acids have attracted recent interest as a diverse class of information storage molecules capable of self-replication. In the present study, we use the computational potential energy landscape framework to investigate the structural and dynamical properties of xylo- and deoxyxylo-nucleic acids (XyNA and dXyNA), which are derived from their respective RNA and DNA analogues by an inversion of configuration at a single chiral center in the sugar moiety of the nucleotide unit. The free energy landscapes of an octameric XyNA sequence and its dXyNA analogue demonstrate the existence of a facile conformational transition between a left-handed helix that is the global free energy minimum, and a closely competing ladder-type structure with approximately zero helicity. The separation of the competing conformational ensembles is better-defined for the…
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsDNA and Nucleic Acid Chemistry · Advanced biosensing and bioanalysis techniques · Bacteriophages and microbial interactions
Energy landscapes and dynamics of xylo-nucleic acids
Daniel J. Sharpe
Konstantin Röder
David J. Wales
[
Abstract
Artificial analogues of the natural nucleic acids have attracted recent interest as a diverse class of information storage molecules capable of self-replication. In the present study, we use the computational potential energy landscape framework to investigate the structural and dynamical properties of xylo- and deoxyxylo-nucleic acids (XyNA and dXyNA), which are derived from their respective RNA and DNA analogues by an inversion of configuration at a single chiral center in the sugar moiety of the nucleotide unit. The free energy landscapes of an octameric XyNA sequence and its dXyNA analogue demonstrate the existence of a facile conformational transition between a left-handed helix that is the global free energy minimum, and a closely competing ladder-type structure with approximately zero helicity. The separation of the competing conformational ensembles is better-defined for the dXyNA system, whereas the XyNA analogue is inherently more flexible. The former therefore appear more suitable candidates for a molecular switch. The landscapes differ qualitatively from those reported in previous studies for evolved biomolecules: they are significantly more frustrated, so that XyNAs provide an example of an unnatural system for which the conditions constituting the principle of minimal frustration are, as may be expected, violated.
keywords:
American Chemical Society, LaTeX
University of Cambridge] Department of Chemistry, University of Cambridge, Lensfield Road,
Cambridge CB2 1EW, United Kingdom
\abbreviationsIR,NMR,UV
{tocentry}
Some journals require a graphical entry for the Table of Contents. This should be laid out “print ready” so that the sizing of the text is correct.
Inside the tocentry environment, the font used is Helvetica 8 pt, as required by Journal of the American Chemical Society.
The surrounding frame is 9 cm by 3.5 cm, which is the maximum permitted for Journal of the American Chemical Society graphical table of content entries. The box will not resize if the content is too big: instead it will overflow the edge of the box.
This box and the associated title will always be printed on a separate page at the end of the document.
1 Introduction
Xeno-nucleic acids (XNAs) are a diverse family of nucleic acid structures derived from DNA or RNA by chemical modification of the nucleotide units.1 XNAs have rapidly emerged as having important medical applications,2 for example as aptamers,3, 4 synthetic ribozymes,5 artificial small interfering RNAs (siRNAs) and antisense oligonucleotides for the targeting of microRNAs.6, 7 The resistance of XNAs to endonucleases, a result of the inability of natural enzymes to recognise the modified nucleic acid structures, is a particularly valuable advantage to the use of XNAs in place of natural nucleic acids for therapeutic purposes.8 XNAs are also of current interest in the emergent fields of synthetic biology,9 which demands the development of chemical information storage systems capable of self-replication in vitro and in vivo for artificial life and biological computation, and systems chemistry,10 which requires molecular switches for the control of operations in complex chemical networks. Other potential applications of XNAs include their use as self-assembling nanomaterials, broadening the possible design scope in DNA nanotechnology, chemical sensors and catalysts.11, 12 The study of XNAs is also motivated by the fundamental question of the origins of life, where it is important to understand the factors that led to evolution selecting ribofuranosyl nucleic acids as the genetic biopolymer for the basis of life, and where it remains unknown if an alternative nucleic acid was utilised in hypothetical organisms preceding those based on RNA.5 Despite their importance, there are few reported structures of XNAs, and relatively little is known concerning the structural and dynamical properties of XNAs in atomistic detail.13 In particular, computational studies have been thus far largely limited to molecular dynamics (MD) simulations, which do not overcome the broken ergodicity encountered in biomolecules.
The present study is focused on nucleic acids based on xylose (XyNA) and deoxyxylose (dXyNA), herein referred to collectively as XyNAs, which represent some of the simplest possible perturbations to the chemical structure of natural nucleic acids. Xylose is derived from ribose by a simple inversion of configuration at the C3*′* atom of the sugar moiety (Fig. 1), and likewise deoxyxylose is derived from deoxyribose. Xylose is a thermodynamic product of the formose reaction,14 the most probable prebiotic route of sugar synthesis,15 and so XyNAs represent arguably the most credible possibility of a genetic biopolymer adopted by prebiotic organisms that are speculative precursors to RNA-based life forms.
Thermal denaturation studies have demonstrated that dXyNA:DNA hybrid duplexes exhibit markedly low thermodynamic stability compared to corresponding DNA duplexes,16 whereas dXyNA homoduplexes display thermodynamic stability commensurate with analogous DNA duplexes.17 In this respect, dXyNA exhibits complementary properties to many alternative XNAs, which are able to form a stable duplex through hybridisation with DNA or RNA, with strong discrimination against mismatches.13 While this behaviour precludes the use of XyNAs as an aptamer and for other applications requiring sequence-specific binding, it is an ideal property if XyNAs are to be utilised alongside and independent of natural nucleic acids as an orthogonal information system.
Circular dichroism (CD) studies have shown that XyNA and dXyNA oligomers may adopt a left-handed helical duplex structure or a structure with an apparent lack of helicity, and that the observed structure is dependent on base sequence and sequence length as well as external factors including temperature and salt concentration.18, 19
MD simulations of XyNA and dXyNA duplexes of length 8, 13 and 29 bp (base pairs) have revealed the existence of a helical inversion transition from a right- to a left-handed helical conformation.20, 21 The observed timescale for this transition is of the order of tens of ns. For XyNA duplexes, it was observed that the left-handed helical structure is not stable, but rather that the terminal regions of the duplex undergo oscillatory movements from coiled to uncoiled states that act to repeatedly screw and unscrew the helix, suggesting structural competition between left-handed helical and linear ladder-type structures. The existence of this equilibrium between a pair of competing conformational ensembles suggests XyNAs as a potential molecular switch, and is also a property unique to XyNAs among the known XNAs.13 Other recent MD studies of XyNA duplexes in the presence of a carbon nanotube have demonstrated fast spontaneous unzipping as a consequence of the strained backbone, highlighting the promise of XyNAs with respect to gene delivery for therapeutic purposes.22, 23
In the present study, we use the computational potential energy landscape framework24 to investigate the structural and dynamical properties of XyNA and dXyNA duplexes. The extensive sampling facilitated by discrete path sampling25 (DPS) allows for the calculation of free energies and therefore proper comparison of the relative thermodynamic stabilities of the three major conformations expected, namely left-handed helical, right-handed helical and ladder-type structures. The free energy barriers partitioning these major conformational ensembles determine the applicability of XyNA and dXyNA duplexes as molecular switches, which requires two competing funnels to be separated by a barrier that is surmountable at ambient temperatures. Visualisation of the free energy landscapes will clearly elucidate structural differences between XyNA and dXyNA duplexes, as well as between them and their naturally evolved counterparts. The framework has been successfully applied to a variety of biomolecular systems26 including the B-Z-DNA transition27 and the formation of DNA mini-dumbbells.28
2 Methodology
2.1 System modeling
The xylo- and deoxyxylo-nucleotide monomer units were constructed with the furanose moiety in the C3*′-endo (cf. canonical A-RNA) and C2′-endo conformations, respectively, using the LEAP program of AMBER.29 Atom parameters were given by the parm99 force field30 incorporating the bsc0 correction31 for and backbone torsion angles. The parm99 force field has been applied to yield important insight into the behaviour of a variety of artificial nucleic acid systems,32, 33, 34 and can therefore be justifiably applied to XyNAs as in previous MD studies.20, 21 The bsc0 correction is acknowledged to yield a general improvement in the description of the behaviour of both DNA and RNA, including noncanonical structures thereof, hence its implementation in the present work. It was previously shown that different parameterisations for the glycosidic torsion angle have little effect on the MD trajectories of XyNA sequences,21 therefore no reparameterisation of this dihedral was used. Partial charges were obtained by the two-stage RESP fitting procedure35 at the HF/6-31G level of theory using the ANTECHAMBER program36 of AMBER, with electrostatic potentials calculated using Gaussian03.37 The potential function was correctly symmetrised.38, 39
With the nucleotide units constructed as described, complete right-handed duplexes of the octameric sequence (5*′-3′*)[xG-xU-xG-xU-xA-xC-xA-xC-T] (XyNA1) and the deoxyxylose analogue thereof (dXyNA1) were constructed using the LEAP program based on a template of canonical B-DNA produced with the NAB program40 of AMBER. Ladder-type structures were obtained from the NMR solution structure of the XyNA oligomer (PDB: 2N4J).41 From these initial structures, optimised right-handed helical and ladder-type structures were obtained by basin-hopping global optimisation,42 as described below. From the latter conformation, in turn, a left-handed helical structure was obtained by short timescale explicit solvent MD simulation, performed using the SANDER package43 of AMBER14, and further optimised by basin-hopping global optimisation.
2.2 Exploration of the energy landscapes
Basin-hopping (BH) global optimisation44, 42, 45, 46, 47 to obtain the lowest-energy structures of the right-handed helical, left-handed helical and ladder-type conformational ensembles was achieved using the GMIN program48 interfaced with AMBER12. The BH algorithm uses a Metropolis criterion to sample basins of attraction of a potential energy landscape and thereby locate low-energy configurations. Perturbation moves include concerted rotations of groups about internal axes of the system, in addition to standard atomic displacements. Chirality checks are implemented to ensure that no chiral centres become inverted in the course of the simulation.
The landscapes were further explored using discrete path sampling (DPS) to create kinetic transition networks (KTNs).49, 50, 51, 52, 53 Transition states were located with the doubly-nudged elastic band algorithm54, 55, 56, 57, 58, 59 and converged with hybrid eigenvector-following.60, 61, 62, 63, 64 Local minima were then characterised by approximate steepest-descent paths, using a modified version of the L-BFGS algorithm.65, 66 These calculations used the OPTIM program67 interfaced with AMBER12. Initial optimal alignment of endpoint structures was achieved by a shortest augmenting path alogorithm.68 A Dijkstra-based missing connection algorithm69, 70 was used to construct a priority list of connection attempts based on appropriate edge weights.
After initial sampling initiated from the low energy structures located by BH, the PATHSAMPLE driver program71 was used to conduct further sampling to improve the connectivity of the landscape,72 remove artificial kinetic traps and high energy barriers,73 and shorten path lengths.74
All calculations described above used a generalised Born implicit solvent model with surface area term75, 76 and an effective monovalent salt concentration of 0.1 M maintained using the Debye-Hückel approximation.77
Free energy landscapes at a temperature of 298 K were calculated using the harmonic superposition approximation (HSA).78, 79, 80 A self-consistent recursive regrouping scheme based on a specified free energy barrier threshold was used to classify minima into collective macrostates.81, 82 Free energy pathways were determined by Dijkstra’s shortest path algorithm.69, 83
2.3 Analysis of the energy landscapes
Free energy landscapes are visualised as disconnectivity graphs,84, 85, 86, 87, 72 where each leaf corresponds to a free energy group, and is coloured according to the value of an appropriate order parameter for a representative potential energy minimum of that group. Here, the chosen order parameter is the helical handedness ,88 in the form originally proposed for describing the B-Z transition in DNA duplexes.89 The magnitude of is a measure of the number of turns associated with a helix. Values and are reflective of right- and left-handed double helial turns, respectively, while values indicate approximately zero helicity.
Processing of the data in the KTN was achieved with the CPPTRAJ module90 of AMBER. The CURVES+ program91 was used to extract bp-axis, inter-bp and intra-bp geometric parameters.92 All analyses exclude the terminal base pairs to avoid the influence of end effects. Molecular graphics images were produced using VMD.93
3 Results
3.1 Free energy landscapes
The free energy disconnectivity graphs at 298 K for XyNA1 and dXyNA1 are shown in Figs. 2 and 3, respectively. For both duplexes, the right-handed helical structures are located only in regions of the landscape with high free energy, so that the occupation probability for right-handed helices is very low. The inversion of chirality in the nucleotide units, with respect to natural nucleic acids, seeds an inversion of preferred helical sense to favour a stable left-handed helix, and XyNAs can be thought of as effective two-state systems, with equilibrium between left-handed helical and ladder-type structures.
There are striking differences between the free energy landscapes for XyNA and dXyNA duplexes. Most notably, the ideal left-handed helical state of the dXyNA1 duplex is a relatively well-defined global free energy minimum on the landscape, being ca. 5 kcal mol*-1* more stable than the next lowest energy minimum, which is the most stable ladder-type structure. The free energy barrier for the conversion of the left-handed helical to the ladder-type structure is around 20 kcal mol*-1*, and for the reverse transition is around 12 kcal mol*-1*. By contrast, the free energy barriers separating the corresponding states of the XyNA1 duplex are around 10 kcal mol*-1* in either direction, and their free energy difference is less than 2 kcal mol*-1*. Another immediately apparent difference between the free energy landscapes of the two systems is in the distribution of values for the helical handedness order parameter. For the XyNA duplex, a much broader range of values for is observed, with values ranging from to appearing in the low-energy region of the landscape. In notable contrast, the low free energy region of the landscape for the dXyNA duplex is dominated by left-handed helical structures, the handedness of which takes a somewhat narrow distribution of values ca. . Thus XyNA duplexes are more flexible than their dXyNA analogues. The clear separation of free energy funnels for left-handed helical and ladder-type conformational ensembles in dXyNA makes it a more suitable candidate for use as a molecular switch than XyNA.
The ideal ladder-type structure of the XyNA1 duplex compares favourably with the NMR solution structure.41 As noted in Ref. 19, the ladder-type structures of the XyNA and dXyNA duplexes are stabilised by the interstrand stacking of adjacent bases, which arises due to the strong inclination of bases with respect to the helical axis (Fig. 4). However, the ladder-type structure is predicted to have marginal left-handed helicity, as opposed to the marginal right-handed helicity in the observed structure. The over-stabilisation of left-handed helical states of XyNA within the force field parameterisation implemented in this work is also evidenced by the fact that left-handed helical structure is erroneously predicted to be the global free energy minimum, although the free energy difference from the idealised ladder-type structure is very small. The force field is apparently more accurate in replicating the behaviour of dXyNA duplexes, where there appears to be no such bias, and the ladder-type structures are correctly predicted to have slight right-handed helicity.
The landscapes of both XyNA1 and dXyNA1 duplexes are significantly frustrated, there being many minima of low free energy in competition with the global free energy minimum. This finding is in contrast to the free energy landscape reported in a study of the analogous B-Z transition in CG-rich DNA sequences, where there is strong funnelling to the native B-DNA state. Frustration in the free energy landscapes of XyNAs is likely largely attributable to the geometrical frustration of base pairs that prevents formation of a properly extended linear duplex and instead necessitates that the ladder-type conformation of XyNA duplexes partly bends inwards on itself to maintain optimal Watson-Crick base-pairing of all base pairs, and that drives the continual unwinding and rewinding of the left-handed helix. This geometric frustration may also be relieved by the adoption of noncanonical base pairings at one or both of the duplex termini, which then allows the linear structures to form a properly extended conformation.
Evolved biomolecules obey the principle of minimal frustration.94 The evolutionary process favours energy landscapes having strong bias towards a well-defined native state, and promotes the elimination of kinetic traps on the pathways to this native state. The fact that the energy landscapes of XyNAs are significantly more frustrated than the landscapes of DNA and RNA presents an evolutionary argument for the adoption of DNA and RNA over XyNAs.
3.2 Free energy pathways
Free energy pathways for helical inversion are shown in Fig. 5 for the XyNA1 and dXyNA1 sequences, going from a representative right-handed helical structure to a left-handed helix representing the global free energy minimum. For both duplexes, the pathways are downhill in energy, and the mechanism features only low barriers, leading to fast kinetics. Furthermore, the interconversion of ladder-type and left-handed helical states occurs by helix winding (unwinding) in left- (right-) handed directions, respectively, propagated inwards from one terminus, and not from both termini simultaneously. [Calculate NGT rate consts (for ladder-type - left-handed helical transition), a key experimental observable]
For the XyNA1 duplex, the helical inversion proceeds via a low-energy ladder-type structure, which subsequently evolves to the left-handed helical state via a transition-state ensemble of ‘kinked’ structures. The corresponding pathway for the dXyNA1 duplex is significantly different. In particular, structures with approximately zero helicity represent a high-energy transient state in the early stages of the pathway, and so the mechanism is not mediated by a ladder-type intermediate state, as is observed for the XyNA1 duplex. This again reflects the greater bias towards left-handed helical over ladder-type structures in dXyNA compared to XyNA duplexes. The transition then continues to progress smoothly with respect to handedness, that is, via a more regular helical structure with partial left-handed helicity.
The evolution of backbone dihedral angles and the glycosidic torsion angle along the helical inversion pathways based on potential energy barriers is shown in Fig. 6. The dihedral, with characteristic value ca. ° that effectively defines XyNA and dXyNA duplexes with respect to their natural analogues, remains relatively stable throughout the pathways (note the small scale). The , and backbone dihedrals for the right-handed helical states of XyNA1 and dXyNA1 adopt values similar to those observed in canonical A-RNA and B-DNA, and undergo sign inversion in the course of the transition to left-handed helical states, seeded by the sign inversion of the dihedral with respect to the natural nucleic acid analogues. For the and dihedrals, values in the left- and right-handed helical states are of approximately equal magnitude but opposite sign, in both XyNA and dXyNA. The behaviour of the and dihedrals exhibits less variance along the pathways, though these angles do likewise adopt values of opposite sign to the corresponding angles in canonical A-RNA and B-DNA. The glycosidic torsion angle takes one of two predominant values in XyNAs, ca. ° (anti) or ca. ° (syn), and plays an important role in driving the transition. The evolution of inter-bp, intra-bp and bp-axis geometrical parameters92 along the fastest potential energy pathway for the helical transition of XyNA1 is shown in Fig. 7. There are large-scale changes in the bp-axis inclination angle, and in the communicative parameters of inter-bp twist angle, roll angle and slide distance. Values of the helical rise and helical twist are also diagnostic of each of the three major conformations.
4 Conclusions
The present results indicate that an equilibrium between left-handed helical and ladder-type structures, the conformational transitions between which are facile, exists for both XyNA and dXyNA duplexes. The global free energy minimum left-handed helical structure is more stable with respect to the ladder-type structure for the deoxyxylose analogue, one possible explanation for which is the increased solvation of the C2*′* hydroxyl group, which is present in XyNAs but not dXyNAs, in the ladder-type compared to the helical state. The pair of conformational ensembles are more clearly well-defined for the dXyNA compared to the XyNA system, both with respect to the magnitude of the free energy barrier separating the basins and with respect to the helical handedness order parameter. Therefore the free energy landscapes demonstrate that XyNA duplexes are more flexible than dXyNA duplexes, and hence that the latter appear more promising candidates for use as a molecular switch, and for use as a chemical information storage molecule capable of self-replication, where facile unwinding of a helical structure is undesirable.
The free energy landscapes of both XyNA and dXyNA duplexes are significantly frustrated, highlighting an important factor that may have led evolution to select ribofuranosyl nucleic acids, and not the xylose-based analogues, as the genetic basis for life. The origin of the structural competition evident in these systems is the geometrical frustration that prevents Watson-Crick base pairing without the induction of strain in the nucleic acid backbone that must be relieved by bending, or else by the adoption of noncanonical base pairing modes at one or both of the duplex termini.
Free energy pathways from a disfavoured right-handed helical state to a left-handed helical state that is the global free energy minimum also differ notably between XyNA and dXyNA duplexes. For the latter system, extended linear structures represent an early and high-free energy transition state along the pathway, which then continues to proceed via regular helical structures with a smooth change in the handedness order parameter. For the XyNA system, the transition is mediated by low-energy ladder-type structures, which evolve to left-handed helical structures via a transition state ensemble of irregular ‘kinked’ structures. Left-handed helix winding and unwinding transitions of XyNA and dXyNA duplexes are driven by the highly flexible terminal base pairs. The inversion of the dihedral angle in XyNAs with respect to their natural nucleic acid analogues seeds a direct inversion not only in the overall helical sense but also in the backbone dihedral angles. The glycosidic torsion angle also goes large-scale changes in the course of the transition, as do certain key geometric parameters, most notably the bp-axis inclination angle.
Further studies should focus on the design and application of XyNA and dXyNA duplexes for molecular devices, for example by investigating the sequence and length-dependence of the propensity for helicity, and the response of the equilibrium to environmental conditions. Kinetic studies could be used to compare the barriers for the helical unwinding transition of XyNA and dXyNA duplexes.
{acknowledgement}
DJS gratefully acknowledges the Cambridge Commonwealth, European and International Trust for a PhD scholarship. KR and DJW gratefully acknowledge funding from the Engineering and Physical Sciences Research Council.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Herdewijn and Marlière 2009 Herdewijn, P.; Marlière, P. Chem. Biodivers. 2009 , 6 , 791–808
- 2Wang et al. 2013 Wang, Q.; Chen, L.; Long, Y.; Tian, H.; Wu, J. Theranostics 2013 , 3 , 395–408
- 3Appella 2009 Appella, D. H. Curr. Opin. Chem. Biol. 2009 , 13 , 687–696
- 4Taylor et al. 2014 Taylor, A.; Arangundy-Franklin, S.; Holliger, P. Curr. Opin. Chem. Biol. 2014 , 22 , 79–84
- 5Taylor et al. 2015 Taylor, A.; Pinheiro, V. B.; Smola, M. J.; Morgunov, A. S.; Peak-Chew, S.-Y.; Cozens, C.; Weeks, K. M.; Herdewijn, P.; Holliger, P. Nature 2015 , 518 , 427–430
- 6Morihiro et al. 2017 Morihiro, K.; Kasahara, Y.; Obika, S. Mol. Biosyst. 2017 , 13 , 235–245
- 7Pinheiro and Holliger 2012 Pinheiro, V. B.; Holliger, P. Curr. Opin. Chem. Biol. 2012 , 16 , 245–252
- 8Chaput et al. 2012 Chaput, J. C.; Yu, H.; Zhang, S. Chem. Biol. 2012 , 19 , 1360–1371
