Comparative Assessment of Statistical and Thermodynamic Prediction Methods for Solvate Formation: A Case Study with Curcumin and Its Derivatives
Julian Ticona-Chambi, Duane Choquesillo-Lazarte, Silvia Lucia Cuffini, Lourdes Infantes

TL;DR
This paper compares methods to predict which solvents form solvates with curcumin derivatives, finding that combining COSMO-RS with hydrogen bond propensity gives the best results.
Contribution
The study introduces a novel combination of COSMO-RS and hydrogen bond propensity for improved prediction of solvate formation.
Findings
Hydrogen bond propensity (HBP) was the best individual predictor for solvate formation.
Combining COSMO-RS with HBP improved predictive accuracy over standalone methods.
New solvated and hydrated forms of curcumin derivatives were identified through crystallization screens.
Abstract
This study compares statistical and thermodynamic methodologies for predicting solvate formation using curcumin (CUR) and its derivatives demethoxycurcumin (DMC) and bisdemethoxycurcumin (BDMC) as models. We evaluated the performance of Statistical Frequency of Interaction for Multicomponent Prediction (SFIMP) and Conductor-like Screening Model for Realistic Solvents (COSMO-RS) methods to identify solvents likely to form solvates. A comprehensive crystallization screen yielded several new solvated and hydrated forms. Our results show that hydrogen bond propensity (HBP) performed best among individual predictors, while COSMO-RS combined with HBP yielded superior predictive accuracy overall. These insights aid rational design and screening of multicomponent solid forms in pharmaceutical development.
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
1
2
3
4
5
6
7
8
9
10
11
12| compounds | solvent-(CUR, DMC, BDMC) interactions | type of interaction | distances (Å) | compounds | solvent-(CUR, DMC, BDMC) interactions | type of interaction | distances (Å) |
|---|---|---|---|---|---|---|---|
| CUR | |||||||
| LADXEX (DOX) | Yes | O4–H···ODOX | 2.676(4) | CUR-DOX | Yes | O3–H···ODOX | a: 2.83(1), b: 2.72(2) |
| O6–H···ODOX | 2.665(4) | C20–H3···ODOX‑a | 3.44(1) | ||||
| C20–H3···ODOX‑b | 3.62 | ||||||
| LADXIB (DOX) | Yes | O4–H···ODOX‑A | 2.666(2) | OJIWOV (CH2Cl2) | It does not present strong HBs but a lot of weak HBs, CH···O and CH···Cl type. These HBs are not well represented in HBP what is reflected in the high values that this molecule obtains in his ranking | ||
| O6–H···ODOX‑B | 2.691(2) | ||||||
| BDMC | |||||||
| BDMC-DOX | Yes | O4–HA···ODOX‑A | 2.62(3) | BDMC-Water-2 | Yes | OWA-H···O1A | 2.780(2) |
| O4–HB···ODOX‑A | 2.77(3) | OWA-H···O2HC | 2.710(2) | ||||
| O3–HB···ODOX‑B | 2.70(2) | O3E-H··· OWA | 2.774(2) | ||||
| O3–HA···ODOX‑C | a: 2.72(4) | O4E-H··· OWA | 2.713(2) | ||||
| b: 2.73(3) | OWB-H···O1B | 2.701(2) | |||||
| XIWDEP (iPrOH) | Yes | O3–H···Oiso‑PROP | 2.640(3) | OWB-H···O2HD | 2.778(2) | ||
| Oiso‑PROP-H···O3H | 2.759(3) | O3A-H··· OWB | 2.772(2) | ||||
| XIWDAL (Acetone) | Yes | O3–H···OAce | 2.761(4) | O4A-H··· OWB | 2.727(2) | ||
| CAce-H3···O2H | 3.536(7) | OWC-H···O1D | 2.702(2) | ||||
| CAce-H3···O4H | 3.328(6) | OWC-H···O2HB | 2.781(2) | ||||
| BUWKUZ (MeOH) | Yes | OMeOH-H···O1 | 2.716 | O3B-H··· OWC | 2.780(2) | ||
| O3–H··· OMeOH | 2.758 | O4B-H··· OWC | 2.716(2) | ||||
| O4–H··· OMeOH | 2.719 | OWD-H···O1C | 2.700(2) | ||||
| BDMC-Water-1 | Yes | Ow-H···O1 | 2.751(1) | OWD-H···O2HA | 2.774(2) | ||
| Ow-H···O1 | 2.801(2) | O3D-H···OWD | 2.785(2) | ||||
| O4–H···Ow | 2.754(2) | O4D-H···OWD | 2.714(2) | ||||
| O3–H···Ow | 2.796(1) | OWE-H···O1E | 2.700(2) | ||||
| GANJAG (Water) | Yes | Ow-H···O1 | 2.735(3) | OWE-H···O2HE | 2.755(2) | ||
| Ow-H···O3 | 2.756(4) | O3C-H···OWE | 2.779(2) | ||||
| O3–H···Ow | 2.742(4) | O4C-H···OWE | 2.720(2) | ||||
| O4–H···Ow | 2.742(4) | ||||||
| DMC | |||||||
| DMC-DOX | Yes | O4–H···ODOX | 2.724(8) | ||||
| O5–H···ODOX | 2.689(8) | ||||||
- —Ministerio de Ciencia, Innovaci?n y Universidades10.13039/100014440
- —Funda??o de Amparo ? Pesquisa do Estado de S?o Paulo10.13039/501100001807
- —Coordena??o de Aperfei?oamento de Pessoal de N?vel Superior10.13039/501100002322
- —Consejo Superior de Investigaciones Cient?ficas10.13039/501100003339
- —European Regional Development Fund10.13039/501100008530
- —Red de Cristalograf?a y Cristalizaci?n ?Factor?a de Cristalizaci?n?NA
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsComputational Drug Discovery Methods · Curcumin's Biomedical Applications · Drug Solubulity and Delivery Systems
Introduction
1
Curcumin (CUR), a natural polyphenol derived from the rhizome of Curcuma longa, has attracted significant attention due to its wide-ranging biological properties, including anti-inflammatory, antioxidant, antimicrobial, and anticancer activities. ?−? ? Recent studies have demonstrated that curcuminoids possess promising potential not only as therapeutic agents but also as functional materials in fields such as photonics, sensors, and drug delivery systems. ?,? However, their application remains limited by poor aqueous solubility, low bioavailability, and physicochemical instability. Strategies such as nanoparticle formulation, cocrystallization, and solvate/hydrate formation have been explored to improve their pharmacokinetic and physicochemical properties.?
From a solid-state chemistry perspective, the formation of solvates and hydrates can dramatically affect a compound’s physical and chemical properties, including solubility, melting point, stability, and polymorphic behavior. ?−? ? In the case of curcuminoids, several solvate and hydrate forms have been reported, particularly for CUR and bisdemethoxycurcumin (BDMC), illustrating their capacity to engage in specific supramolecular interactions with a variety of solvents. ?−? ? ? ? ? ? ? Conversely, demethoxycurcumin (DMC) remains less explored in this context, with no detailed structural reports of its solvated forms to date.
Predicting the formation of solvates remains a challenge in crystal engineering, as it involves understanding the balance of intermolecular forces, conformational flexibility, and thermodynamic stability. Several computational tools have emerged to support rational solid form screening, including statistical models based on experimental data (e.g., Hydrogen Bond Propensity (HBP), Molecular Complementarity (MC), Coordination Values (CV)) and thermodynamic methods such as the COSMO-RS (Conductor-like Screening Model for Realistic Solvents) approach. ?−? ? Notably, Costa et al.? demonstrated the application of these statistical tools in the prediction of cocrystal formation for the antiretroviral drug nevirapine, showcasing their utility in multicomponent solid form screening. While these approaches have shown promise in cocrystal design, their comparative performance in forecasting solvate formation remains underexplored, particularly for structurally related compounds such as curcuminoids.
This study presents a comprehensive investigation of solvate and hydrate formation in curcuminoids, focusing on curcumin (CUR), demethoxycurcumin (DMC), and bisdemethoxycurcumin (BDMC). Several new crystalline forms are described, including previously unreported solvates and hydrates, and notably, the first crystal structure of DMC. These solid forms provide valuable structural insights and enable a systematic evaluation of predictive methodologies. We assess the performance of statistical (Hydrogen Bond Propensity, Molecular Complementarity, Coordination Values) and thermodynamic (COSMO-RS) tools in ranking solvents based on their likelihood to form solvates. By comparing computational predictions with experimental outcomes, this work contributes to the development of more effective strategies for solvate screening in pharmaceutical and materials science applications.
Methodology
2
Crystallization Process
2.1
CUR, DMC, and BDMC (Sigma-Aldrich) were used as received. Analytical-grade solvents were employed throughout. CUR was screened with 41 solvents, while DMC and BDMC were limited to 16 solvents each due to material availability. Recrystallization was performed under varied temperature and pressure conditions to optimize single-crystal growth (Tables S1 and S2). Key solvates were successfully obtained via slow evaporation (from 50 °C to room temperature) for CUR-DOX and DMC-DOX, and via cooling crystallization (heating to 30 °C, rapid cooling in liquid nitrogen and slow evaporation at 4 °C) for BDMC-DOX, BDMC-WATER1, BDMC-WATER-2, all under ambient pressure. Suitable crystals for X-ray diffraction were carefully selected and preserved (e.g., in capillaries or under cryogenic conditions when necessary).
Single Crystal X-ray Diffraction
and Crystal Structure Analyses
2.2
Prior to X-ray data collection, specific precautions were taken to prevent degradation or dissolution of the crystals. The CUR-DOX solvate was unstable at room temperature and darkened upon contact with inert oil. To preserve its integrity, the crystal was placed inside a glass capillary along with a small amount of DOX solvent. In the case of BDMC-DOX, the crystal exhibited rapid dissolution when handled at ambient conditions. Therefore, it was picked up at 4 °C in a cold room using a cryoloop and immediately flash-frozen in liquid nitrogen before mounting on the diffractometer.
Single-crystal X-ray diffraction (SCXRD) data for CUR-DOX, DMC-DOX, and BDMC-DOX were collected using a Bruker APEX II diffractometer equipped with a microsource Cu Kα radiation and a Photon 100 CCD detector. Measurements were performed at room temperature for CUR-DOX, DMC-DOX, and CUR-DMSO, and at 120 K for BDMC-DOX. Diffraction data for BDMC-WATER-1 and BDMC-WATER-2 were collected at 100 K using synchrotron radiation at the ALBA Synchrotron facility (BL13-XALOC beamline).
Diffraction data collected on the APEX II instrument were processed with the APEX software suite,? whereas synchrotron data sets were processed using XDS.? All structures were solved by intrinsic phasing using SHELXT and refined by full-matrix least-squares on F^2^ using SHELXL,? both implemented within the Olex2–1.5 platform.? Non-hydrogen atoms were refined anisotropically. Disordered solvent molecules (1,4-dioxane) were observed in CUR-DOX and BDMC-DOX and modeled using appropriate disorder restraints and constraints.
For BDMC-WATER-2, two possible crystallographic cells were identified [P2_1_/c: 20.255(5) Å, 7.2390(6) Å, 53.895(4) Å, β = 90.034(7)°, Volume = 7902(2) Å^3^; P2_1_/n: 20.254(4) Å, 7.2380(4) Å, 11.5070(10) Å, β = 110.522(8)°, Volume = 1579.9(4) Å^3^]. We retained the former because it accounts for all reflections in reciprocal space and yielded lower R-factors in the refinement (6.6 vs 10.2).
For postrefinement structural analysis, several tools implemented in Mercury 2024.1.0 were employed. These included Full Interaction Maps (FIM), intermolecular potential surface visualization, Hydrate and Solvate Analyzer, and structural overlay comparison. These tools were used to assess hydrogen bonding patterns, molecular packing, and conformational variability across the new solvates and hydrates of CUR, DMC, and BDMC.
Prediction Tools
2.3
Two classes of prediction models were evaluated: The Statistical Analysis of Frequency of Interaction for Multicomponent Prediction (SFIMP)? and the thermodynamic Conductor-like Screening Model for Realistic Solvents (COSMO-RS).? The general flowchart, illustrated in Figure, summarizes these methodologies. A data set of forty-eight polar and nonpolar solvent molecules were classified for each analysis (See Supporting Information Table S1). According to values obtain in each method, a ranking and consensus ranking were established using different criteria, as presented below.
Flowchart summarizing the methodologies used to evaluate the probability of solvate formation.
Statistical
ApproachSFIMP
2.3.1
The Statistical Frequency of Interaction for Multicomponent Prediction (SFIMP) framework integrates three key descriptors: H-Bond Propensity (HBP), ?,? Molecular Complementarity (MC)? and Coordination Values (CV), ?,? which were developed by the Cambridge Crystallographic Data Center (CCDC) and optimized by Costa et al., 2020. ?,? The procedure applied in each technique is described below
HBP
2.3.1.1
The Hydrogen Bond Propensity (HBP) method estimates the likelihood of hydrogen bond formation between donor and acceptor functional groups based on statistical models derived from experimental crystal structures from the Cambridge Structural Database (CSD). The HBP tool evaluates each possible donor–acceptor interaction within a target system by assigning a propensity score (P). This score reflects the probability that a hydrogen bond is formed.
To assess the likelihood of multicomponent formation (e.g., target molecule with a solvent), the HBP* score is computed using the following formula (eq)
This equation compares the strongest heteromeric (target-solvent) interaction with the strongest homomeric (target–target or solvent–solvent) interaction. A positive HBP* score suggests that the heteromeric interaction is more favorable, indicating a higher probability of solvate formation. Conversely, negative values imply a preference for self-association, making solvate formation less likely.
Solvents are then ranked in descending order based on their HBP* values. This ranking reflects their potential to form stable hydrogen-bonded solvates with the target molecule.
MC
2.3.1.2
Molecular Complementarity (MC) is a knowledge-based approach used to assess the geometric and electronic compatibility between two molecules, based on shape and polarity descriptors. It provides a measure of how well a pair of molecules might fit and interact in a solid-state environment, supporting the prediction of cocrystal or solvate formation. This method is particularly useful in multicomponent crystal engineering where shape complementarity and dipolar matching play crucial roles in molecular packing and stabilization.
Multiple conformations of the target compounds and solvents were generated using the Conformation Generator Tool available in Mercury. For each conformation, the molecular complementarity between the target molecule and each solvent was assessed using the Molecular Complementarity Screening Wizard implemented in Mercury. The resulting data table includes five key descriptors for both components: the medium (M), short (S), and long (L) molecular axes, the dipole moment, and the fraction of nitrogen and oxygen atoms (N/O). To compare the complementarity between pairs, these descriptors were normalized using the equation below (eq)
MC Ranking Criteria: A lower MC_ N _ value indicates greater molecular complementarity between the target compound and the solvent. Accordingly, solvents were ranked from lowest to highest MC_ N _ values, with the lowest scores representing the most complementary solvent–target combinations.?
CV
2.3.1.3
The Coordination Values (CV) method is a statistical approach that quantifies a molecule’s ability to act as a hydrogen bond donor or acceptor based on empirical data from crystal structures. It estimates the number of hydrogen bonds each functional group can form, accounting for both electronic and geometric factors that influence intermolecular interactions in the solid state. This method is particularly useful in predicting multicomponent formation by assessing whether the hydrogen bonding landscape of a coformer complements that of a target molecule.
Different conformations of each analyzed molecule (target molecules and solvents) were generated using the Conformation generator tool available in the Mercury software. These conformers were used to evaluate hydrogen bonding capacities by calculating the coordination values of each structure using custom Python scripts based on the CSD Python API. These CV scripts assign a donor coordination value (D target‑molecule, D solvent, D multicomponent) and an acceptor coordination value (A target‑molecule, A solvent, A multicomponent) to each molecule.
The donor and acceptor capacities represent the estimated number of hydrogen bonds each molecule can form as a donor or acceptor. The difference between these capacities was computed as an absolute value (|D–A|), which reflects the balance or imbalance of hydrogen bonding potential.
The degree of coordination mismatchor “comfort”in the multicomponent system (solvate) relative to the pure components was calculated using the following expression (eq). A worked example is provided in the worked-out example section of the Supporting Information.
CV Ranking Criteria: Solvents were ranked based on increasing ΔCV. Lower values indicate a greater likelihood of favorable hydrogen bonding complementarity in the multicomponent form.?
Thermodynamic ApproachCOSMO-RS
2.3.2
The COSMO-RS (COnductor-like Screening MOdel for Real Solvents) approach is a quantum chemistry-based method that predicts the thermodynamic behavior of molecular interactions in solution or solid state. It combines electronic structure calculations with statistical thermodynamics to estimate the excess enthalpy (Hex) of mixing between molecular pairs. This descriptor reflects the strength of interaction between a target molecule and a solvent and can be used to assess the likelihood of solvate formation.
In this study, the COSMOQuick 1.9 software? was employed to perform COSMO-RS calculations. SMILES representations of all target molecules (CUR, DMC, BDMC) and solvents were obtained from the PubChem database (See Supporting Information Table S1). These SMILES were used as input to compute the excess enthalpy of interaction between each target–solvent pair.
The COSMO-RS model operates by placing molecules in a virtual conductor environment and analyzing their surface polarization charge densities (σ-surfaces). Interaction energies are derived from the statistical thermodynamic treatment of surface interactions, which include contributions from hydrogen bonding, electrostatics, and van der Waals forces.
The excess enthalpy of formation (Hex) between each pair reflects the thermodynamic favorability of interaction: more negative Hex values indicate stronger interactions, suggesting a higher potential for solvate formation.
COSMO-RS Ranking Criteria: Solvents were ranked in ascending order of Hex values. Those with the most negative excess enthalpies were considered to have the highest likelihood of forming thermodynamically stable solvates with the target compound.?
Combined Ranking and
Evaluation
2.3.3
Since each predictive method captures different aspects of molecular behaviorsuch as hydrogen bonding propensity, electrostatic complementarity, or donor–acceptor balancecombined rankings were generated to assess their collective predictive power. These Consensus Rankings (CRs) integrate the strengths of multiple descriptors and allow evaluation of synergistic effects between statistical and thermodynamic models. A series of combinations were constructed by summing the rank positions of each solvent across different pairs or groups of methods. The combinations tested included: COSMO–CV, COSMO-MC, COSMO-HBP, CV-MC, CV-HBP, HBP-MC, COSMO–CV-MC; COSMO–CV-HBP, CV-MC-HBP, MC-HBP-COSMO, CV-MC-HBP-COSMO. For each combination, the consensus rank of a solvent was obtained by calculating the arithmetic sum of its rank positions in each individual method. Lower consensus values indicate better agreement across methods regarding that solvent’s ability to form a stable solvate.
To evaluate the performance of both individual and combined prediction methods, we compared their ability to correctly rank solvents that are known (from literature) or were found experimentally (see Section) to form solvates with CUR and BDMC. For each method, the positions of all confirmed solvating solvents in the ranked list were summed and then averaged (eq)
A lower mean rank value implies that the method consistently ranked actual solvating solvents closer to the top of the list, thereby indicating higher predictive performance. This approach enables a direct comparison between the relative accuracies of individual and combined methodologies in identifying suitable solvents for solvate formation.
Results and Discussion
3
Structural and Supramolecular Features of
Curcumin and Its Derivatives in the Solid State
3.1
Structurally, CUR consists of two aromatic rings connected by a conjugated keto–enol linker, containing both phenolic and enolic functional groups capable of engaging in various intermolecular interactions (Figure). Two naturally occurring derivatives, DMC and BDMC, share the same core skeleton but differ in the number of methoxy substituents on the aromatic rings. These subtle structural differences influence their chemical reactivity, biological efficacy, and solid-state behavior. ?,?
Comparison of (a) chemical diagram of CUR molecule with atom labels used in the conformational analysis of CUR, DMC and BDMC, (b) molecular conformations, showing two perpendicular projections with superpositions of the three CUR polymorphs and the two independent BDMC molecules, and (c) Full Interaction Maps (FIMs) highlighting the intermolecular interaction propensities of CUR (polymorph I) and BDMC (molecule in cis conformation) in their crystalline forms.
Regarding single component crystal structures, three polymorphs of CUR have been reported in the Cambridge Structural Database (CSD): polymorphs I, II, and III, with reference codes BINMEQ02, BINMEQ06, and BINMEQ07, respectively. Comparison of these structures reveals notable conformational differences, particularly between polymorph I and the other two forms. In polymorph I, the torsion angle C7–C8–C9–C10 adopts a cis conformation, whereas in polymorphs II and III it exhibits a trans conformation. Both methoxy groups in polymorphs II and III are oriented toward the keto–enol moiety, while in polymorph I the methoxy group in C16 is rotated away from it. Additionally, while in polymorphs II and III the molecules adopt nearly planar conformations, the phenyl ring bonded to carbon C7 is rotated by 44° in polymorph I (Figuresb and ?).
When comparing the crystal structures of polymorphs II and III, both exhibit very similar packing arrangements, although with slight differences in molecular orientation. These differences can be visualized through the overlay of 15 molecules from each structure, where polymorph II is shown in gray (BINMEQ06) and polymorph III in green and red (BINMEQ07) depending the degree of deviation from the reference molecules of the former compound, as illustrated in Figure.
Superposition of 15 molecules from each crystal structure: polymorph II is shown in gray (BINMEQ06), and polymorph III in green and red (BINMEQ07), with color variation indicating the degree of deviation from the reference molecules of polymorph II.
For BDMC, only one crystalline form is currently reported in the Cambridge Structural Database (CSD), under the refcode XIWCUE. The structure contains two symmetry-independent BDMC molecules: one adopts a cis conformation at the C7–C8–C9–C10 torsion, while the other adopts a trans conformation. Neither molecule shows significant rotation of the phenyl rings; however, both exhibit slightly curved conformations along the main molecular axis (Figureb). Interestingly, in the crystal structure of BDMC, the molecules in cis conformation are arranged in well-defined layers, distinct from those formed by the trans conformers. The 2D molecular arrangement (substructure) within the cis layers is isostructural to the packing observed in polymorph I of CUR, Figure.
Superposition of molecular layers composed of cis-conformation BDMC molecules from the crystal structure of XIWCUE (shown in gray) with the equivalent molecular arrangement in curcumin polymorph I (BINMEQ02, shown in green) highlighting their isostructural 2D packing. The unit cell shown corresponds to the crystal structure of XIWCUE, and the views are projections along b axis and ac diagonal of this unit cell.
In the case of DMC, no single-component crystal structures have been reported to date.
FIM analyses of CUR and BDMC suggest that the keto–enolic and phenolic groups are the main hydrogen-bonding sites, mapped as red and blue regions that indicate high-probability positions for external donors and acceptors, respectively (Figurec). In contrast, the methoxy groups exhibit significant steric hindrance, and intramolecular hydrogen bonds between the methoxy oxygen and adjacent phenolic OH reduce the donor ability of these OH groups. The FIMs also display gray regions, highlighting favorable sites for hydrophobic contacts or π–π interactions on either side of the phenyl rings. Given the similarities observed between the FIMs of CUR and BDMC, the interaction landscape of DMC can be readily inferred, with the same hydrogen-bonding hotspots at the keto–enolic and phenolic groups and intermediate features arising from the presence of a single methoxy substituent.
These observations suggest that solvents containing hydrogen-bond donor or acceptor groups, such as ketones, ethers, and amines, are likely to interact with the hydrogen-bonding sites of curcuminoids (hydroxy and keto–enolic groups), potentially promoting solvate formation.
Reported Solvates of Curcumin and Its Derivatives
3.1.1
Crystal structures of CUR solvates with 1,4-dioxane (DOX) (LADXEX, LADXIB),? acetone (ACE) (FIHRUN),? and dichloromethane (DCM) (OJIWOV),? as well as BDMC solvates with isopropanol (ISP) (XIWDEP),? acetone (ACE) (XIWDAL),? methanol (MET) (BUWKUZ),? and water (GANJAG),? have been reported in the Cambridge Structural Database (CSD). In addition, the literature describes evidence of CUR solvates with methyl acetate (MAC)? and water,? and BDMC solvates with DOX,? tetrahydrofuran (THF),? and dimethyl sulfoxide (DMSO);? however, their crystal structures have not yet been determined. No solvate forms have been reported for DMC to date.
All curcuminoid molecules adopt a trans conformation at the C7–C8–C9–C10 dihedral angle in their crystal structures, with the exception of the CUR–DCM solvate, which contains two symmetry-independent CUR molecules, both exhibiting a cis conformation (Figures and ?).
Superposition of curcumin (CUR) molecules from all known crystal structures, including polymorphs I, II, and III (BINMEQ02,06,07), previously reported solvates from the Cambridge Structural Database (CSD), and the new solvate structure obtained in this work (CUR-DOX). Each structure is shown in a different color, as indicated in the accompanying legend.
Superposition of bisdemethoxycurcumin (BDMC) molecules from all available crystal structures, including reported solvates in the Cambridge Structural Database (CSD) and the new solvates identified in this work (BDMC-DOX, BDMC-WATER-1, BDMC-WATER-2). Each structure is represented in a different color, as indicated in the accompanying legend.
New Solvates Forms of
CUR, DMC and BDMC and Comparative Crystal Structure Analysis with Previously Reported Structures
3.1.2
As described in the Methods, 41 solvents were screened for CUR, and 16 for DMC and BDMC due to limited material availability (Table S3). Single crystals of CUR, DMC, and BDMC were successfully obtained using 1,4-dioxane (DOX) as crystallization solvent.
For BDMC, two distinct hydrated forms were isolated from the same recrystallization in tetrahydrofuran (THF), here designated BDMC-WATER-1 and BDMC-WATER-2. A THF solvate of BDMC had previously been reported without structural characterization.? Comparison of the PXRD pattern provided by those authors with the present phases confirms that BDMC-WATER-1 and BDMC-WATER-2 are different crystalline forms. The earlier report therefore most likely corresponds to a distinct hydrate or a genuine THF solvate (Figure S1). The two hydrates crystallized simultaneously in the same vessel as morphologically distinct ribbon-like crystals, one yellow and one orange, each associated with a different structure (Figure S2).
Notably, this work reports the first DMC solvate structure, obtained with DOX (DMC-DOX), representing the first crystal structure of DMC described in the literature. Crystals of CUR (from DMSO) and DMC (from ACN) were also obtained, although only unit cell parameters could be determined as the crystals degraded before full diffraction data sets could be collected.
The molecular structures of the new crystalline forms are presented in Figure, with displacement ellipsoids drawn at the 50% probability level for non-hydrogen atoms. Complete crystallographic and refinement data are provided in Table S4 and have been deposited with the Cambridge Crystallographic Data Centre (CCDC).
Molecular structures of CUR-DOX, DMC-DOX, BDMC-DOX, BDMC-WATER-1, BDMC-WATER-2 showing thermal ellipsoids with 50% probability (except the hydrogen atoms). The disorder of DOX molecule in CUR-DOX and BDMC-DOX was omitted just for clarity purposes.
CUR-DOX
3.1.3
Besides the two previously reported 1,4-dioxane solvates of curcumin (CUR-DOX: LADXEX and LADXIB) in the CSD, we have crystallized a new 1,4-dioxane solvate in the present work. In both reported CUR-DOX solvates, the CUR molecules adopt a trans conformation in their conjugated chain linking the aromatic rings. In contrast, the CUR molecules in the new solvate exhibit a cis conformation, similar to that observed in polymorph I of CUR. Additionally, the phenyl ring attached to carbon C7 is rotated by 52° relative to the plane of the rest of the molecule, and the conformation matches that of CUR in polymorph I, as shown by the perfect overlap (Figure, green and black molecules, respectively).
This structure exhibits a 1:1 molecular ratio of CUR to 1,4-dioxane, with the solvent modeled as disordered. LADXEX also contains one independent molecule in a 1:1 ratio, while LADXIB shows a 1:1.5 ratio. No similarity was found in the molecular packing of the three DOX solvates, except for the common tape motifs observed in both LADXEX and LADXIB (Figurea) but not in CUR-DOX, where the molecules grow through stacking of the planar part of the structure, extending into 2D layers stabilized by O6–H···O1C and C8–H···O1C hydrogen bonds (D···A distances of 2.791(3) Å and 3.736(4) Å respectively), Figureb.
(a) Superposition of molecules from the crystal structures of LADXEX (gray) and LADXIB (green), highlighting the common tape motifs observed in both structures. (b) Tapes in CUR-DOX formed through the stacking of the planar delocalized bonds in the CUR molecule, and 2D structures stabilized by hydrogen bonds.
DMC-DOX
3.1.4
This is the first crystal structure determination of the DMC molecule. In its solvate form (DMC-DOX), DMC adopts a trans conformation, with the methoxy group retained on the keto side of the molecule and oriented toward the keto–enol moiety. The molecule exhibits a highly curved geometry along its main molecular axis.
The crystal structure of DMC reveals tape-like arrangements similar to those observed in LADXEX and LADXIB (Figure). Moreover, these tapes extend into two-dimensional architectures equivalent to those seen in LADXIB (Figure) through weak hydrogen bonds CH···pi at 3.528(9) Å.
Superposition of molecules from the crystal structures of LADXIB (in gray) and DMC-DOX (in green), highlighting the common tape motifs and equivalent two-dimensional arrangements observed.
BDMC-DOX
3.1.5
In the crystal structure of the BDMC–DOX solvate, the asymmetric unit contains two BDMC molecules and three 1,4-dioxane (DOX) molecules. Each BDMC molecule adopts a planar trans conformation and independently forms molecular tapes (Figurea), analogous to those observed in previously reported DOX solvates of CUR derivatives (LADXEX, LADXIB, and DMC–DOX).
(a) Molecular tapes formed by the two independent molecules in BDMC-DOX structure (one shown in dark gray and the other in light gray). (b) Superposition of molecules from the crystal structures of DMC-DOX (gray) and BDMC-DOX (green), highlighting their common tape motifs. (c) Projections along the b axis of BDMC-DOX showing the layered packing with alternating two-dimensional sheets of BDMC and DOX molecules.
In all four DOX solvates, the molecular tapes result from translational repetition along the b-axis. The lengths of the b-axis for each structure are as follows: LADXEX, 5.346 Å; LADXIB, 5.418 Å; DMC–DOX, 5.6502(4) Å; and BDMC–DOX, 5.7592(7) Å. These tapes are stabilized by weak C–H···O hydrogen bonds between the methylene protons of the central aliphatic linker and the oxygen atoms of the keto–enol moieties of adjacent molecules.
The instability of BDMC–DOX crystals, already noted above, can be understood in light of their layered architecture. The structure consists of alternating two-dimensional sheets of BDMC and DOX molecules, with one of the three independent DOX molecules forming strong hydrogen bonds that bridge adjacent BDMC layers, Figurec and Table S3. Nevertheless, the extended segregation of solvent into continuous layers reduces the overall robustness of the packing, making the crystals particularly sensitive to ambient conditions.
BDMC-WATER
3.1.6
Two hydrated forms of BDMC were crystallized in this work, both with a 1:1 stoichiometric ratio of BDMC to water. BDMC-WATER-1 contains one molecule in the asymmetric unit (Z′ = 1), whereas BDMC-WATER-2 contains five independent molecules (Z′ = 5). In both cases, the BDMC molecules adopt a trans conformation along the central linker, with no significant differences in molecular geometry. A previously reported hydrate of BDMC (CSD refcode GANJAG) also shows a 1:1 stoichiometry and Z′ = 1. This structure and BDMC-WATER-1 are isostructural at the level of molecular layers (Figure), although the stacking between layers differs significantly.
(a) Superposition of molecules from the crystal structures of GANJAG (gray, hydrate) and BDMC-Water-1 (green), showing their common 2D structural motifs. (b) Projections along the a (top) and b (bottom) axes of the crystal structure of BDMC-Water-2, with each independent molecule shown in a different color.
Interestingly, a comparable layered arrangement is also found in the methanol solvate of BDMC (BUWKUZ). Moreover, BDMC–methanol (BUWKUZ) and BDMC–water (GANJAG) are 3D isostructural (Figure), displaying nearly identical superpositions of layers. The key difference is that in BUWKUZ the layers are distorted relative to each other, owing to the larger size of methanol molecules, which increases the separation between BDMC layers. Hydrogen-bonding patterns reflect this substitution effect. In GANJAG, each water molecule participates in four hydrogen bonds, acting twice as donor and twice as acceptor. When water is replaced by methanol in BUWKUZ, three of these interactions are retained: the hydroxyl group of methanol accepts two hydrogen bonds and donates one (Table S3). The second donor interaction, however, is lost, precisely along the direction in which the structure becomes elongated.
Superposition of molecules from the crystal structures of BUWKUZ (gray, methanol solvate) and GANJAG (green, hydrate), showing their 3D isostructurality. The circled regions highlight the solvent sites, where accommodation of the bulkier MeOH molecules induces a relative displacement of adjacent layers.
The crystal structure of BDMC-Water-2 differs from any of the structures solved to date, Figureb.
Analyses of Statistical
and Thermodynamic Prediction Methods Using for Obtaining Solvates
3.2
The prediction results for obtaining CUR, DMC and BDMC solvates using statistical and thermodynamic methods are presented in summary Table and in more detail in Tables S5–S7 of the Supporting Information. In the crystallization results, new solvates and hydrates were obtained for CUR, BDMC and DMC. These new solvates, CUR-DOX, BDMC-DOX, DMC-DOX, BDMC-WATER1, BDMC-WATER2 (with determined crystal structure) and CUR-DMSO and DMC-ACN (with unit cell parameter information) were included in the evaluation of the prediction methods, as well as the solvates already described in the literature.
1: Ranking and Consensus Ranking Positions of New Solvates Prepared in This Work and in the Literature Reported for Curcumin (CUR), Bisdemethoxycurcumin (BDMC), and Demethoxycurcumin (DMC), Obtained through Statistical (MC, CV, HBP) and Thermodynamic (COSMO) Methods
Since CUR and BDMC exhibit a larger number of solvates, the analysis of the prediction methods was first carried out for these molecules, followed by the assessment of DMC. Based on the evaluation of each individual and combined method, the HBP approach gave the best results for solvate prediction, as summarized in Table S8. This outcome indicates that hydrogen bonding is the predominant interaction driving solvate formation in CUR and BDMC. This is further corroborated by Table, which shows that nearly all solvent molecules form strong hydrogen bonds with the hydroxy and keto–enolic groups of CUR, DMC, or BDMC. The only exception is the CUR–dichloromethane solvate, which does not exhibit strong hydrogen bonds but instead multiple weak interactions of the C–H···O and C–H···Cl types. These weaker contacts are not well captured by the HBP model, which explains the anomalously low ranking obtained for this solvate.
2: Hydrogen Bonds between Solvent Molecules and the Host Structures, with Distances Given between the Non-Hydrogen Atoms Involved (a and b Denote Molecules in a Disorder Model; Subscripts A, B, C, D and E Indicate Independent Molecules in the Structures)
When analyzing the CV method, which evaluates the acceptor and donor capacity of the atoms of the functional group of each molecule, its performance was affected by the accessibility of these functional groups. Although several conformations of both the target molecule and the solvent (when was applicable) were generated in an attempt to improve accessibility between the molecules, the prediction values did not improve. Regarding the COSMO method, it was observed that its combination with HBP provided better prediction results compared to the individual method.
The worst performing solvate prediction method was MC, both as a single technique and in combination with other techniques. It was observed that the positions of CUR and BDMC solvates appeared at the bottom of the ranking, suggesting that solvates would not form. The differences in shape, size and molecular polarity of the molecules and accessibility between the molecules could have had a significant impact on the evaluation of solvate formation. To improve accessibility between molecules, different conformations were generated for all molecules; however, the prediction results showed no improvement.
For DMC, no solvates had been previously reported. In this work, two new solvates were obtained: one with a fully solved crystal structure and another characterized only by unit cell parameters. As observed for CUR and BDMC, the HBP method provided the most accurate ranking for these new solvates when compared with the other prediction methods. Since the MC approach proved unsuitable for solvate prediction in CUR and BDMC, it was not applied to DMC.
Conclusion
4
This study provides a systematic comparison between statistical and thermodynamic methods for predicting solvate formation in CUR and its derivatives DMC and BDMC. Through an extensive crystallization screen, several new solvates and hydrates were identified, including the first crystal structure of DMC, as well as novel BDMC hydrates and additional CUR and BDMC solvates.
Evaluation of predictive methodologies demonstrated that the Hydrogen Bond Propensity (HBP) method outperformed other statistical approaches, consistently ranking experimental solvates near the top of its predictions. The COSMO-RS thermodynamic model, while less accurate on its own, showed enhanced performance when combined with HBP, highlighting the benefit of hybrid prediction strategies. In contrast, the Molecular Complementarity (MC) method proved unsuitable for solvate screening, as it often misranked known solvates.
The findings also emphasize the critical role of hydrogen bonding as the primary driving force in curcuminoid solvate formation. Nearly all experimentally obtained solvates involved strong hydrogen-bond interactions with solvent molecules, with the exception of CUR–dichloromethane, which was stabilized instead by multiple weaker contacts. Such cases underscore the need for predictive models to better capture the contribution of weaker interactions.
Overall, this work not only expands the structural knowledge of curcuminoid solvates but also benchmarks the relative strengths and weaknesses of current predictive methods. The combined HBP–COSMO approach emerges as the most reliable strategy, offering valuable guidance for rational solvate screening in pharmaceuticals and functional materials.
Supplementary Material
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Hewlings S. J.Kalman D. S.Curcumin: A Review of Its Effects on Human Health Foods 20176109210.3390/foods 610009229065496 PMC 5664031 · doi ↗ · pubmed ↗
- 2Nelson K. M.Dahlin J. L.Bisson J.Graham J.Pauli G. F.Walters M. A.The Essential Medicinal Chemistry of Curcumin J. Med. Chem.20176051620163710.1021/acs.jmedchem.6b 0097528074653 PMC 5346970 · doi ↗ · pubmed ↗
- 3Gupta S. C.Patchva S.Aggarwal B. B.Therapeutic Roles of Curcumin: Lessons Learned from Clinical Trials AAPS J.201315119521810.1208/s 12248-012-9432-823143785 PMC 3535097 · doi ↗ · pubmed ↗
- 4Prasad S.Tyagi A. K.Aggarwal B. B.Recent Developments in Delivery, Bioavailability, Absorption and Metabolism of Curcumin: The Golden Pigment from Golden Spice Cancer Res. Treat.201446121810.4143/crt.2014.46.1.224520218 PMC 3918523 · doi ↗ · pubmed ↗
- 5Li S.Yuan W.Deng G.Wang P.Yang P.Aggarwal B. B.Chemical Composition and Product Quality Control of Turmeric (Curcuma Longa L.)Pharm. Crops 201151285410.2174/2210290601102010028 · doi ↗
- 6Lee W. H.Loo C. Y.Traini D.Young P. M.Nano- and Micro-Based Inhaled Drug Delivery Systems for Targeting Alveolar Macrophages Expert Opin. Drug Delivery 20151261009102610.1517/17425247.2015.103950925912721 · doi ↗ · pubmed ↗
- 7Pudipeddi M.Serajuddin A. T. M.Trends in Solubility of Polymorphs J. Pharm. Sci.200594592993910.1002/jps.2030215770643 · doi ↗ · pubmed ↗
- 8Gillon A. L.Feeder N.Davey R. J.Storey R.Hydration in Molecular Crystals - A Cambridge Structural Database Analysis Cryst. Growth Des.20033566367310.1021/cg 034088 e · doi ↗
