Computational NMR Study of Cryptolepis Alkaloids: Could the Structural Misassignment of Cryptospirolepine Have Been Avoided?
Valentin A. Semenov, Leonid B. Krivdin, Gary E. Martin

TL;DR
This paper shows how computational methods can detect and correct structural errors in complex molecules like cryptospirolepine using NMR data.
Contribution
The study demonstrates that computational NMR methods can effectively identify and correct structural misassignments in complex alkaloids.
Findings
Computational methods achieved high correlation with experimental NMR chemical shifts for thirteen Cryptolepis alkaloids.
The corrected mean absolute error for proton and carbon shifts was 0.13 ppm and 1.5 ppm, respectively.
The study suggests spectral reassignments and predicts missing chemical shifts for some compounds.
Abstract
It is demonstrated that by using modern computational methods, it is possible to identify potential structural misassignments for even molecules as complex as cryptospirolepine. While methods that rely on anisotropic nuclear magnetic resonance (NMR) methods can also be used to offer a powerful orthogonal means of structure elucidation and verification, the present computational study should be feasible in most chemistry departments with access to sophisticated computational facilities. From the overall correlations calculated against experimental 1H and 13C NMR chemical shifts in the series of thirteen Cryptolepis alkaloids, it follows that, generally, they provide a very high level of correlation. The corrected mean absolute error (CMAE) is found to be at the level of 0.13 and 1.5 ppm for, proton and carbon chemical shifts, respectively, which is only about 2 and 1% of their ranges.…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSynthesis and bioactivity of alkaloids · Advanced Materials Characterization Techniques · Synthesis and Reactivity of Heterocycles
1. Introduction
Numerous alkaloids have been isolated and characterized from the West African medicinal plant Cryptolepis sanguinolenta. The first member of the family of alkaloids to be isolated was the indoloquinoine alkaloid cryptolepione [1,2]. The most complex and interesting member of the series is the spirononacycllic alkaloid cryptospirolepine, labeled A in Figure 1, whose structure was first reported in 1993 [3]. However, it was not until a degraded NMR sample of the alkaloid from a sealed NMR tube was examined chromatographically with the two major components isolated and fully characterized, nearly a decade after the first report, that it became obvious that the structure of cryptospirolepine had, indeed, been misassigned [4]. It was evident that one of the two major degradants, labeled C, could not be mechanistically or structurally rationalized from the originally reported structure of A, as shown in Figure 1. The structure of cryptospirolepine was not corrected to the structure labeled D in Figure 1 until a, then newly reported, 1,1-HD-ADEQUATE spectrum was acquired for a submilligram sample of the alkaloid using a 600 MHz spectrometer equipped with a 1.7 mm MicroCryoProbe™ in a 120 h long experiment in a 2015 study [5].
Obviously, accurately calculating the ^13^C NMR chemical shifts in a molecule as complex as cryptospirolepine was well beyond computational capabilities when the structure was first reported in 1993 [3]. The evolution of NMR probe technology and newly developed NMR methods finally, in 2015, allowed the correct structure of cryptospirolepine (D) to be determined (Scheme 1). The structure reported in 2015 has since been unequivocally confirmed through the application of anisotropic NMR techniques in a 2017 report [6], and again more recently using the recently developed i-HMBC technique that allows the unequivocal differentiation of two-bond from three-bond and longer-range heteronuclear correlations on the basis of sub-ppb isotope shifts [7].
Over the more than 70 years since the initial isolation of quindolinone, a number of other members of this intriguing family of indoloquinoline alkaloids have been isolated and characterized. These have included: cryptotackieine (1), a linear indolo[2,3-b]quinoline [8]; cryptolepine (2), an indolo[3,2-b]quinoline [1,2,9]; cryptosanguinolentine (3), an angular isomeric indolo[3,2-c]quinoline [8]; quindolinone (4) [10]; oxidation at the 11-position of the cryptolepine skeleton has been variously reported for afford cryptolepinone (5a) or 11-hydroxycryptolepine (5b) [11,12,13]; a facile N-oxidation product of cryptolepinone (6) has also been reported [14] as has an 11-i-propyl analog (7) [15]. While most members of the Cryptolepis alkaloid family have involved indoloquinoline frameworks, one indolobenzazepine analog, homocryptolepinone (8), has been reported [16]. More complex examples include cryptolepicarboline (9) [17]; quindolinocryptotackiine (10) [18]; the bis-indoloquinoline cryptomisrine (11) [19]; and the initial incorrect (12a) and subsequently corrected (12b) structures of cryptospirolepine [3,5]. Finally, a complex degradation product of cryptospirolepine, 5.5′-dimethyl-5′H-10,11′-biindolo[3,2-b]quinoline-11(5H)one, which is essentially an indoloquinoline dimer, has also been isolated and characterized (13) [4].
2. Results and Discussion
The initial structural misassignment of cryptospirolepine as 12a [3], based on the then available NMR methods and probe technology, which was subsequently corrected to 12b in 2015 [5] (see also Figure 1), coupled with the confusion over whether the structure of 5a or 5b was correct based on the available NMR data, prompted us to embark on the present computational study. There also exists the possibility that an as-yet-undiscovered indolo[2,3-c]quinoline analog of cryptosanguinolentine (3, indolo-[3,2-c]quinoline) may be discovered that could benefit from an in-depth analysis of the proton and carbon chemical shift behavior of existing Cryptolepis alkaloids. In part, we also undertook this study based on the growing numbers of natural product structural reassignments and corrections appearing in the literature.
Recent developments in anisotropic NMR methods, as highlighted in a recent chapter [20] and several pivotal earlier reports [6,21], provide a powerful orthogonal means of confirming or refuting structural proposals, but require undertaking additional experimental studies that may be beyond the abilities of a given investigator’s lab. In contrast, computational methods, such as those embodied in the present study, can be confidently undertaken in most modern chemistry departments, making them a potential alternative to anisotropic NMR studies.
Therefore, in the course of theoretical part of this study, the three-dimensional structures of the natural alkaloids under scrutiny were primarily established. To achieve this goal, a preliminary conformational search was carried out for each of the structures studied. In this case, for compounds 9–13, the existing experimental NOE correlations between the two alkaloid “subunits” were taken into account. In this way, the conformational search was reduced mainly to the detection of exact rotational angles around the subunit linkages. As a result, in the process of a conformational search in the range of about 0–10 kcal/mol, several groups of the low-energy conformers were identified for each of the compounds 1–13.
Next, the geometric parameters of the conformers found were optimized at a higher density functional theory (DFT) level, namely M06-2X/cc-pVTZ//aug-cc-pVTZ. A cc-pVTZ basis set was assigned to the carbon and hydrogen atoms, and aug-cc-pVTZ to the nitrogen and oxygen atoms, the latter for a more accurate definition of the diffuse region in the field of their LEPs. In the next step, for the accurate geometries obtained, the ^1^H and ^13^C shielding constants were calculated and then converted into the corresponding NMR chemical shifts, see the Section 3. Using an array of experimental spectral data and theoretical calculation data, a comprehensive correlation assessment of all considered stereoisomers was carried out, taking into account such statistical descriptors as CMAE, root mean square deviation (RMSD) and Pearson correlation coefficient.
Compound 5a/5b was used as a starting point because it is a convenient model for developing a methodology for the further (and most interesting) computational study of cryptospirolepine 12a/12b. Therefore, the first point of discussion pertains to the interpretation of the NMR data acquired for 5a/5b, not to the validity of the overall structure of the molecule. In the initial report by one of the authors, the structure was assigned as the indoloquinone form, 5a, based on the acquired NMR data [10]. A subsequent synthetic report by Jolie and co-workers [11] reported the structure as 11-hydroxy cryptolepine, 5b. When examining the proton and carbon NMR data for both structures in Figure 2, it is quite apparent that there is significantly better agreement between the calculated and observed ^1^H and ^13^C chemical shift data for the quinolinone structure, 5a, than for the 11-hydoxycyptolepine, 5b, conformer of the molecule. When the graphical plots of the data presented in Figure 2 are considered, the scatter in the proton data for the H-4, H-8, and H-7 resonances for the 11-hydoxy form is quite apparent. The error is also pronounced for the C-9a and C-11 carbon chemical shifts in the 11-hydroxy form (5b), whereas there is excellent congruence between the ^1^H and ^13^C chemical shift data for all of the resonance of quindolinone, 5a. For the ^13^C chemical shift in the C9a resonance, this is not particularly surprising given that there is a C-9a=N-10 bond in the cryptolepine form of the molecule, 5b, that tends to shift that carbon resonance considerably downfield of where it would be observed as the C-9a–N-10 single bond in quindolinone, 5a. In this case, if a computational study such as the present one had been possible when those compounds were synthesized, the misassignment of the isomeric form of the molecule could have been avoided.
The picture presented by the assignment of the structure of cryptospirolepine, 12a, is considerably more complex [3], as shown in Scheme 1. When the original significantly discolored sealed NMR tube containing the sample was cut open and examined chromatographically in 2002 [4], it became apparent from the structural characterization of one of the two major degradants contained in the sample (there were a total of sixteen peaks observed in the HPLC chromatogram of the sample when injected prior to isolation efforts) that the degradant could not have arisen from the structure initially reported for complex alkaloid in 1993 [3]. One of the major degradants isolated and characterized was cryptolepinone, whose structure can, by contrast, be mechanistically derived from the originally proposed structure of cryptospirolepine (12a) [4]. The second major degradant was characterized by NMR spectroscopic methods, in conjunction with a computer-assisted structure elucidation (CASE) study as 5,5′-dimethyl-5′H-10,11-biindolo[3,2*-b*]quinoline-11(5H)-one (13) [4], which cannot be mechanistically rationalized by the originally reported structure of cryptospirolepine (12a).
It took more than a decade and the development of a 600 MHz 1.7 mm MicroCryoProbe™ gradient triple resonance probe in conjunction with the development of homodecoupled (HD) variants of the 1,1- and 1,n-ADEQUATE experiment [4] to be able to correctly characterize the remaining 700 μg sample of the alkaloid in an experiment that consumed 120 h of spectrometer time using a 1.7 mm MicroCryoProbe™-equipped 600 MHz spectrometer, finally establishing the correct structure of the alkaloid as 12b, in which the benzazepine ring was revised to a pyridinone moiety. The structure of 12b has subsequently been orthogonally confirmed using anisotropic NMR methods [6] and the newly developed i-HMBC experiment that can unequivocally differentiate ^2^JCH from ^3^JCH correlations on the basis of isotope shift differences [7].
Figure 3 presents the CMAE and RMSD bar charts for the calculated ^1^H and ^13^C NMR chemical shifts in compounds 1–13. Each individual bar for each compound was obtained by correlating the theoretical and experimental chemical shifts using Equations (2) and (3), see the Section 3. As is readily apparent from the graphical bar data in Figure 3, while the revised structure of cryptospirolepine (12b) gave excellent congruence between the calculated and observed ^1^H and ^13^C chemical shift data, in contrast, the originally reported incorrect structure, 12a, essentially had the worst correlation data of any of the molecules considered in this study.
Referring to Figure 4, it can be readily seen that the C-2, C-8, and C-13 carbon resonances of the azepine ring were significant outliers in the plotted data. Had the present computational study been possible when the alkaloid was originally isolated and characterized, these data would have signaled a problem with the proposed structure that would have, in turn, prompted further experimental investigation.
In an unrelated study using anisotropic NMR methods reported in 2017 in Science [6], one of the authors demonstrated how the indoloquinoline-derived portion of the cryptospirolepine structure was consistent with the reported structure in terms of both the residual dipolar coupling (RDC) and residual chemical shift anisotropy (RCSA), while, in contrast, the indolobenzazepine-derived portion of the molecule in the originally proposed structure (12a) exhibited severe aberration between the calculated and observed data. Once again, had the present computational study been conducted on the original data, it would have been evident that there was a structural misassignment that could have been further investigated that might have averted the mistaken original report of the structure. While the anisotropic NMR study does, indeed, afford a method for the orthogonal assessment of the validity of complex natural product structures, such NMR studies may be beyond the scope of what is possible in some investigators’ laboratories. In contrast, the more recently reported i-HMBC method [7] is no more experimentally challenging to perform than a conventional HMBC experiment, and could also have avoided the structural misassignment.
As for the other Cryptolepis alkaloids studied, it should be noted that, generally, they provide a high level of correlation between theoretically calculated and experimental values of ^1^H and ^13^C NMR chemical shifts, see Figure 3. The average CMAE is 0.05–0.20 ppm for ^1^H and 0.5–3.0 ppm for ^13^C NMR chemical shifts. However, in addition to the described particular features of 5b and 12a, the behavior of cryptolepinone 5(N)-oxide (6) is particularly noteworthy. It is obvious that the (N)-oxide-pyridinone moiety is extremely complex for being described in terms of its stereoelectronic structure. A significant spin density is concentrated on the nitrone nitrogen atom, which is partially donated from oxygen. The excess spin density is then distributed throughout the aromatic system of the rest of the molecule. These electronic effects are extremely difficult to accurately describe within the DFT framework. As a result, significant deviations of calculated NMR chemical shifts from the experiment arise, and this is especially so for carbon nuclei. For a more detailed distribution of these deviations, see Supporting Information—Figure S1.
As a result of the performed calculations for the whole series of 1–13, several missing (unknown) experimental ^13^C NMR chemical shifts were predicted with a high level of confidence, which are provided in Scheme 2.
Besides of the mentioned deviations in 5a/5b, 6 and 12a/12b, there are also a number of established deviations in other compounds, see Scheme 3. Most of them are related to the underestimation of the electron correlation effects at the applied level of approximation, GIAO-DFT/[triple-zeta basis set]. However, based on the performed calculations, a number of reassignments are most likely justified, as exemplified by C-9 and C-11a in 5b.
As a final illustration, Figure 5 shows the overall correlation plots calculated against experimental ^1^H and ^13^C NMR chemical shifts in the complete series of alkaloids, 1–13. A total number of 188 ^1^H and 265 ^13^C NMR chemical shifts were analyzed in this study, which provides a statistically significant set for correlation analysis. As follows from the correlations presented, the CMAE is at the level of 0.13 and 1.5 ppm, respectively, for the ^1^H and ^13^C NMR chemical shifts, which is only about 2 and 1%, respectively, of their studied chemical shift ranges. At that, the RMSD is only 0.17 and 2.1 ppm for ^1^H and ^13^C NMR chemical shifts, respectively.
For more clarity, we present Table 1 and Table 2, providing calculated and experimental ^1^H and ^13^C NMR chemical shifts in compounds 9–13, each containing two alkaloid subunits. These data demonstrate how the variation in chemical environment reflects the values of chemical shifts in particular atoms for the most demanding compounds from the studied series. For more detailed analysis, all calculated ^1^H and ^13^C NMR chemical shifts of 1–13 are collected in Tables S1 and S2, see Supporting Information.
3. Materials and Methods
The initial conformational search of compounds 1–13 was performed using the OPLS3 force field in the liquid phase of particular solvent, employing the MacroModel module implemented in the Schrödinger Maestro 11.5 package (Schrödinger, LLC: New York, NY, USA) [22]. During this search, approximately 10^5^ steps were performed to identify the most probable conformers. Subsequently, the unique conformations of the compounds were identified and subjected to further geometry optimization using the GAUSSIAN 09 program (Gaussian, Inc.: Wallingford, CT, USA) [23] at the M06-2X/cc-pVTZ//aug-cc-pVTZ level [24,25]. The solvation effect was taken into account using the IEF-PCM model developed by Tomasi [26,27]. After the optimization process had reached the potential energy surface minima for each of the studied compounds 1–13, their final geometries were retained and used for further calculation of the shielding constants and corresponding NMR chemical shifts.
Calculations of ^1^H and ^13^C NMR isotropic magnetic shielding constants were carried out at the GIAO-DFT level in the liquid phase of particular solvent using the GAUSSIAN 09 program. In these calculations, we employed the one-parameter hybrid functional PBE0 [28], which was used in combination with Rusakov’s basis set, pecS-2 [29,30]. Further, the calculated NMR shielding constants of all significant conformers found by the conformational search were averaged according to the Boltzmann distribution equation:
where σ_n_^i^ is the shielding constant of nucleus i in conformer n, E_n_ are the Gibbs free energies (J) of each of the significant conformer states n (relative to the lowest energy conformer), R is the molar gas constant (8.31446 J·K^−1^·mol^−1^), and T is the temperature (298 K).
To take systematic errors of calculated chemical shifts into account, we have established correlations between their experimental chemical shifts (x) and isotropic magnetic shielding constants (y), which were further used to find the linear correlation equations of the y = ax + b type. Thus, the resulting shielding constants σ^i^ were converted into the corresponding chemical shifts: the parameters a and b were used for recalculating theoretical shielding constants using the equation δ_calc_ = (σ^i^ − b)/a.
Corrected mean absolute errors (CMAE) were calculated as:
where σ^i^ are the unscaled shielding constants for each of the n nuclei in the molecule, while a and b are the slope and intercept of the linear regression σ^i^ = aδ_exp_ + b.
The root-mean-square deviations (RMSD) were evaluated as:
where δ_exp_ and δ_calc_ are experimental and scaled chemical shifts, respectively, in each of the n nuclei.
4. Conclusions
We have demonstrated through the model theoretical study that, using modern computational methods, it is possible to identify potential structural misassignments, even for molecules as complex as cryptospirolepine (12b). While methods that rely on anisotropic NMR methods, such as RDCs and RCSAs, can also be used [6] to offer a powerful orthogonal means of structure verification, the experimental methodology may not be possible in many laboratories, whereas the present computational study should be feasible in most chemistry departments with access to sophisticated computational facilities.
From the overall correlations calculated against experimental ^1^H and ^13^C NMR chemical shifts in the series of 188 ^1^H and 265 ^13^C NMR chemical shifts of 13 compounds, it follows that, generally, they provide a very high level of correlation. The CMAE is at the level of 0.13 and 1.5 ppm for proton and carbon chemical shifts, respectively, which is only about 2 and 1% of their ranges. At that, the RMSD is only 0.17 and 2.1 ppm for ^1^H and ^13^C NMR chemical shifts, respectively. Based on the performed calculations, some spectral reassignments are suggested, and a number of missing (unknown) experimental chemical shifts are predicted.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Gellert E. Hamet R. Schlittler E. Die Konstitution des Alkaloids Cryptolepin Helv. Chim. Acta 19513464265110.1002/hlca.19510340228 · doi ↗
- 2Dwuma-Badu D. Ayin J.S.K. Fiagbe N.I.Y. Knapp J.E. Schiff P.L.Jr. Slatkin D.J. Constituents of West African medicinal plants. XX: Quindoline from Cryptolepis sanguinolenta J. Pharm. Sci.19786743343410.1002/jps.2600670350641744 · doi ↗ · pubmed ↗
- 3Tackie A.N. Boye G.L. Sharaf M.H.M. Schiff P.L.Jr. Crouch R.C. Spitzer T.D. Johnson R.L. Dunn J. Minick D. Martin G.E. Cryptospirolepine—A Unique Spirononacyclic Alkaloid Isolated from Cryptolepis sanguinolenta J. Nat. Prod.19935665367010.1021/np 50095 a 001 · doi ↗
- 4Martin G.E. Hadden C.E. Russell D.J. Kaluzny B.D. Guido J.E. Duholke W.K. Stiemsma B.A. Thamann T.J. Crouch R.C. Blinov K. Identification of Degradants of a Complex Alkaloid Using NMR Cryoprobe Technology and ACD/Structure Elucidator J. Heterocycl. Chem.2002391241125010.1002/jhet.5570390619 · doi ↗
- 5SauríJ. Bermel W. Buevich A.V. Sharaf M.H.M. Schiff P.L.Jr. Parella T. Williamson R.T. Martin G.E. Improved 1,1- and 1,n-ADEQUATE: Pivotal Experiments for the Structure Revision of Cryptospirolepine Angew. Chem. Int. Ed.201554101601016410.1002/anie.20150254026178441 · doi ↗ · pubmed ↗
- 6Liu Y. SauríJ. Mevers E. Peczuh M.W. Heimstra H. Clardy J. Martin G.E. Williamson R.T. Unequivocal determination of complex molecular structures using anisotropic NMR measurements Science 2017356633310.1126/science.aam 5349 PMC 659629728385960 · doi ↗ · pubmed ↗
- 7Wang Y. Fan A. Cohen R.D. Del Poggetto G. Huang Z. Yang H. Martin G.E. Sherer E.C. Reibarkh M. Wang X. Unequivocal identification of two-bond heteronuclear correlations in natural products at nanomole scale by i-HMBC Nat. Commun.202314184210.1038/s 41467-023-37289-z 37012241 PMC 10070429 · doi ↗ · pubmed ↗
- 8Sharaf M.H.M. Schiff P.L.Jr. Tackie A.N. Martin G.E. Two New Indoloquinoline Alkaloids from Cryptolepis sanguinolenta: Cryptosanguinolentine and Cryptotackiine J. Heterocyclic Chem.19963323924310.1002/jhet.5570330204 · doi ↗
