The Bioprospecting of Bixa orellana L. for the Selection of Characters with Biological Activity
Luz A. Guerrero-Lagunes, Lucero M. Ruiz-Posadas, Jorge Cadena-Iñiguez, Ramón Marcos Soto-Hernández, Carlos H. Avendaño-Arrazate, Juan F. Aguirre-Medina, Celeste Soto-Mendoza, Juan F. Aguirre-Cadena

TL;DR
This study identifies specific traits in Bixa orellana plants that produce bioactive compounds with anticancer and antibacterial properties.
Contribution
The paper introduces a novel approach combining cladistics and multivariate analysis to identify bioprospective traits in Bixa orellana.
Findings
Phenotypes from India, Brazil, and Yucatán show anticancer activity against multiple cell lines.
These phenotypes also exhibit antibacterial effects against Staphylococcus aureus and other bacteria.
Biochemical compounds like geranylgeraniol, ellagic acid, and carotenoids are linked to biological activity.
Abstract
A meta-analysis of 28 sources of information was conducted, considering different variables in Bixa orellana, with the aim of identifying bioprospective variables. Variables were approached, such as the organ of extraction and extraction method, with 63 biochemical classes and 20 for biological activity, and their states were codified. The statistical analysis was developed through a cladistics analysis using the WinClada version1.00.08 84,85 software and the explicative accumulated variance was determined through a descriptive multivariate analysis and multiple correspondence analysis (MCA). The tree obtained showed the phenotype Africa1 as the one closest to the basal state. After Africa1, nine clades are derived and the phenotypes Colombia3 and Colombia5 were the most evolved. The analyses demonstrated that in B. orellana L., the phenotypes from India, Brazil, and Yucatán present…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsNatural Products and Biological Research · Psidium guajava Extracts and Applications · Phytochemical Studies and Bioactivities
1. Introduction
Bixa orellana L. (Bixaceae), known as achiote or annatto, is native to Central and South America and is grown in some tropical countries of the world, such as Peru, Mexico, Brazil, Colombia, Ecuador, Indonesia, India, Kenya, and Eastern Africa [1].
The plant is a shrub with simple ovate leaves and compound flowers, each consisting of five petals that may be white, violet, or pink. Its red, triangular seeds are enclosed in capsules covered with trichomes, as depicted in Figure 1. A morphological analysis of individuals from various origins has revealed significant phenotypic variability, manifested in traits such as leaf color and morphology, flower color, fruit shape, the presence of trichomes on the fruit, and seed count. These traits have been key in identifying and differentiating certain genotypes [2,3].
The seeds and leaves of B. orellana L. have been used since ancient times, both in culinary applications and traditional medicine. In the former, they are used to impart color and flavor, while in the latter, they are employed to treat a variety of ailments, including constipation, fever, heartburn, asthma, scabies, ulcers, and diarrhea [4]. The dyeing properties of the seeds, primarily attributed to the compounds bixin and norbixin, are currently exploited by several industries, including the food, textile, chemical, pharmaceutical, and cosmetics industries [5].
A range of bioactive compounds, including tannins, flavonoids, phenolic compounds, terpenoids, alkaloids, saponins, and anthraquinones, have been identified in the seeds and leaves [6,7,8]. The presence of bixin, norbixin, and geranylgeraniol in the seeds imparts biological activity with potential therapeutic applications [9,10,11].
The carotenoids, apocarotenoids, terpenes, terpenoids, sterols, and aliphatic compounds are the main compounds that are found in every part of this plant, for which a wide range of pharmacological activities have been researched [12]. Their biological activity has been demonstrated for the control of bacteria and fungi [10,13]. The antioxidant activity has been demonstrated by various studies [8,10,14], also displaying anticancer activity in cell lines of medical interest [9,11,15,16]; therefore, it has been included among nutraceutical foods. Because of its broad biological activity, B. orellana L. is a source for the development of new drugs with pharmacological activity, so there is the possibility of identifying morphological and phytochemical variables with a bioprospective approach, under the premise that the bioprospective meta-analysis facilitates the identification of the phenotype, its character, or outstanding phytochemical variable, as well as the state of the character, specifying the statistical validity and reducing possible contradictions in the literature.
2. Materials and Methods
An analysis of the studies published in the Scopus, Science Direct, Scifinder, Springer, and Google Scholar databases was carried out, using the search terms achiote, B. orellana L., phytochemicals, pharmacology, cancer, biological activity, antibacterial activity, and anticancer, as well as cytotoxic and antioxidant activity. From this, n = 56 results were identified, and when the criteria of plant organ and biological activity identified in each publication were applied, the sample was reduced to n = 28. All the studies included were studies that addressed the phytochemical characterization and biological activity of extracts from B. orellana L. (Table 1).
The studies included were conducted in Africa (n = 1), United States (n = 1), the Philippines (n = 1), Ecuador (n = 1), Bangladesh (n = 1), South Korea (n = 1), Nigeria (n = 2), Yucatán—Mexico (n = 2), Colombia (n = 5), India (n = 5), and Brazil (n = 6). The last three led the phytochemical and biological activity research of B. orellana L. The information was recorded in a database, codifying the variables and their different states (Table 2), made up by the following: organ of the plant used, extraction methods, biochemical classes, groups of compounds, phenols and phenolic acids, flavonoids, tannins, monoterpenes, sesquiterpenes, diterpenes, triterpenes, tetraterpenes, alkaloids, cyanogenic glucosides, and antimicrobial and anticancer activity.
Statistical Analysis
Cladistic and statistical analyses were conducted based on the presence or absence of variables and their respective states (Table 2). Regarding biological and anticancer activity, the minimum inhibitory concentrations (MIC) of B. orellana L. extracts against Pseudomonas aeruginosa, Esherichia coli, Staphylococcus aureus, Salmonella sp., and Candida albicans are presented. The MIC obtained were 50 to 500, 50 to 1024, 50 to 1000, 1000 and 50 to 140 μg/mL, respectively. In terms of inhibition zones generated by the same extracts against these species, the reported values ranged from 13.00 to 100.00, 11.00 to 90.00, 11.00 to 100.00, 18.00 and 13.00 to 100.00 mm [1,10,12,15,16,18,19,20,22,27,29].
Furthermore, the anticancer activity of B*. orellana* L. extracts, assessed by their ability to reduce cell viability to 50% in various cancer cell lines, was observed at the following minimum concentrations: 100 µg/mL for liver cancer (HepG2), 3.9 µg/mL for glioblastoma multiforme (U251), 3.1 µg/mL for breast cancer (MCF-7), 37.2 µg/mL for cervical cancer (HeLa), 3.3 µg/mL for lung cancer (NCI-H460 and A549), 2.8 µg/mL for prostate cancer (PC-3), and 3.3 µg/mL for colon cancer (HT-29) [4,11,25,27,28].
Several authors [1,17,20] have demonstrated the antioxidant activity of B*. orellana*, reporting free-radical-scavenging DPPH IC_50_ values of approximately 1 to 15 mg/mL for seeds and 3.2 to 10 mg/mL for leaves.
The statistical analysis was developed with two approaches. The first through a cladistics analysis that incorporates the approach of Popper’s critical rationalism through the refutation of phylogenetic hypotheses examined under a parsimonious principle [31,32]; and through non-parametric statistics using the WinClada version 1.00.08 84,85 software (free license) [33], with the Bootstrap/Jackknife resampling methods, approaching the genotypes as populations through a random simulation until generating a parsimonious cladogram [34]. This analysis defines the stability of the clades and identifies the state of the outstanding variables. The analysis was repeated 1000 times, creating values such as support indices, consistency, and reliability in the cladograms [35]. The systematic reviews carried out in the meta-analysis were directed towards the information disseminated, to reanalyze it with approaches adapted to the present research [36]. It must be clarified that the criteria selected were those with complete, traceable data and reproducible results, to avoid biases in the study [37].
The second approach was to determine the explicative accumulated variance, the statistical weight of each variable, and its state through a descriptive multivariate analysis and multiple correspondence analysis (MCA), with the FactoMineR and factoextra [38] libraries with the Rstudio statistical package [39].
Following the phylogenetic and biochemical characterization of *B. *orellana L. phenotypes, a multivariate analysis using Principal Component Analysis (PCA) was performed to identify the key variables driving the differentiation of the chemical profile based on the extraction organ and the method applied. This analysis helped reduce the data complexity and group the biochemical, morphological, and functional traits into representative dimensions. The PCA applied to the 28 genotypes of B. orellana L. enabled the identification of the most significant variables responsible for the differentiation of the biochemical and functional profiles.
3. Results and Discussion
Figure 2 presents the general cladogram that indicates the distribution of the B. orellana L. phenotypes analyzed in function of the characters organ of extraction, method, biochemical class, and biological activity (Table 2). In total, 12 trees were obtained to create a consensus tree. This tree showed 149 steps or changes, a cladogram consistency index of 50%, and a retention index that reflects the percentage of characters that retain and conserve a change in taxa of 64%. The bioprospective meta-analysis presented in this paper aids in the identification of key phenotypic traits, variables, and their respective statuses, offering statistical validation while minimizing potential contradictions in the existing literature.
The parsimonious distribution of the phenotypes of B. orellana L. (Figure 2) is not indicative of a strict genealogical relationship, since there are no morphological and genetic characters; however, it helps to understand the adaptive specialization [40,41] of plants in the face of the differences in environmental conditions unlike those in their habitat. In general, reproductive isolation, selective pressure, and the lack of variability create unique survival characters reflected in the content and diversity of secondary metabolites [42].
The biochemical and biological activity variables showed that the phenotype with origin from Africa1 was located as the closest one to the basal state, hypothetically indicating due to the variables analyzed that it could have greater similarity with a phenotype from the original habitat (Figure 1).
Nine clades derive from Africa1. The first formed by Nigeria2 and India3, phenotypes closer to the root, which share the presence of alkaloids in leaves as plesiomorphic characters. Even when the publications do not record the time of introduction to Nigeria and India, it is presumed that they could have had some reproductive isolation, absence of variability, and agroclimatic conditions different from their geographic origin (Central and South America). Various authors mention how reproductive isolation and the absence of biological variation in some organisms promote unique characters that can be used in different sectors of society, such as in the case of enzymes responsible for producing secondary metabolites with biological effects of medical, agricultural, or industrial interest [43].
The second clade derives from India3, formed by the phenotypes USA1, India4, Brazil2, and Brazil3, which make up an independent evolutionary route characterized by sharing the seed as an organ of extraction, which is a derived state. The USA1 phenotype shares the presence of geranylgeraniol (apomorphic state) with the rest of the phenotypes from this group, highlighting that it has the derived characters cis-norbixin and trans-norbixin, which are related with the biological activity against Staphylococcus aureus (MIC 50 a 100 μg/mL), also a derived state. Brazil2 and Brazil3 are sibling phenotypes, and presumed to be those of greatest “evolved” specialization from this group, characterized by the presence of flavonoids, which is classified as an ancestral state. When it comes to apomorphic states, the presence of terpenes, ocimene, spathulenol, isoledene, and bergamotene stands out, as well as bixin and norbixin. This clade is categorized based on the organ of extraction and the presence of carotenoids within it. The phenotype Brazil3 shows anticancer activity, reducing cellular lines by 50% against the cell lines U251, MCF-7, NCI-H460, PC-3, and HT-29 from 3.9, 3.1, 3.3, and 3.3 μg/mL of B. orellana L. extract, and presents as a plesiomorphic state (ancient or primitive character). The distinction of the anticancer activity in a phenotype that is in the origin center of the species proves that the specific conditions of this place favored the presence of the compounds mentioned and contributed to the anticancer activity. However, when agroclimatic conditions change outside its place of origin, it loses chemical variability.
In the case of Ecuador1, it forms an independent clade and shows an evolutionary divergence, possibly due to reproductive isolation. A plesiomorphic state stands out in the group, which is an extraction method comprising the vapor sweeping of leaves, different from the rest of the genotypes, while the presence of ocimene, pinene, germacrene, farnesol and caryophyllene, as well as the activity against Staphylococcus aureus, are new characters or derived states (apomorphic). In this group, the extraction method marked the difference in the compounds detected.
The six remaining clades derive from Ecuador1. India5, Colombia1, and Philippines1 form a group characterized by the new or derived characters represented by the farnesol compounds, saponins, and carotenoids. The phenotypes from Colombia and the Philippines present three ancestral characters constituted by anthocyanins and polyprenol, as well as stigmaesterol and sitoesterol. Although both phenotypes do not have a geographic grouping, it is evident in the group of compounds, which indicates a possible displacement of the phenotypes from the center of origin towards the Philippines and India, where the original compounds could be conserved, or there was an influence of similar agroecological conditions that impacted the production of these secondary metabolites.
Also, Ecuador1 is derived from the group constituted by India1, Brazil1, Colombia2, Brazil4, and Colombia4. It should be mentioned that the phenotypes from Brazil are again those that present the highest number of plesiomorphic states (kaempherol, granatin, neostrictinin, procyanidines, and ellagic acid, as well as antioxidant activity). When it comes to apomorphic characters, the presence of saponins, tannic acid, and anthraquinones stands out, which are present in the phenotypes from Colombia and India. In this group, the geographic grouping is clear regarding the states that highlight the ancestral characteristics of the phenotypes from Brazil, the zone registered as the origin center of B. orellana.
Brazil4 is associated with anti-inflammatory activity and can be linked to the presence of ellagic acid, a finding that is in agreement with the authors of another study [44], who determined that this compound acts as chemo-protector against different types of cancer and shows strong antiproliferative activity against colon, lung, and prostate cancer cells.
From Colombia4, four subgroups are derived, an independent one formed by Nigeria3, Philippines2, and Yucatán1, which are characterized by presenting three plesiomorphic states represented by tannins, alkaloids, and atropines. From this group, the phenotype located in Yucatán is the most evolved, which is proven by the derived states present, such as the presence of saponins and the hepatoprotective activity. This evolution could be due to the agroclimatic characteristics or the manipulation of the crops in the zone, since in Yucatán there are commercial crops of B. orellana L. that have been genetically improved to reach higher seed production, which can be a factor that impacts the production of secondary metabolites [45].
The other sibling arm of Colombia4 groups, on the one hand, is Yucatán2, India6, India2, and Indonesia1, which, despite not having a geographic grouping, was characterized by the presence of the highest number of plesiomorphic states, among which germacrene, elemene, caryophyllene, and squalene stand out, as well as chemo-preventive activity. The anticancer activity of B. orellana L. extract, which reduces cell viability by 50%, was observed in the HeLa, A549, and MCF-7 cell lines at concentrations of 37.2, 2.8, and 3.1 µg/mL, respectively. This a simplesiomorphic state, since it also presents in Brazil3, which is a genotype close to the root.
The derived character has to do with the presence of geranylgeraniol, carotenoids, bixin, and norbixin, in addition to the activity against P. aeruginosa, E. coli, and S. aureus. The biological and anticancer activity is determined by the variety of phytochemical compounds present in B. orellana L. and by the capacity of geranylgeraniol to induce apoptosis in A549 cells [29,46,47]. In this group, it can be inferred that there was a flow of plants from Yucatán towards India and Indonesia and the antiproliferative activity was conserved.
The last two groups derive from the branch coming from Colombia4. The node formed by Nigeria1 and Bangladesh share the apomorphic states represented by saponins and the biological activity against E. coli (MIC of 50 to 1024 μg/mL). Only an ancestral state is present (tomentosin).
The phenotypes SouthKorea1, Colombia3, and Colombia5 are the last group and share four apomorphic states integrated by carotenoids, cis-norbixin, trans-norbixin, and bixin. In addition, they present butein, catechins, and chlorogenic acid, as well as cytotoxic activity, as ancestral characters. The phenotypes located in Colombia were considered the most evolved, compared to Africa1, Nigeria2, and India3, whose evolution can be due to pressure processes, such as manipulation, edaphoclimatic conditions, or the genetic flow between genotypes. It should be highlighted that the activity found in the phenotypes present in this bioprospective study is consistent with that found by other authors [1,48,49], who determined that the tannins, quinones, and terpenoids have biological activity; in addition, lipophylic flavonoids can be disruptive for the cell membrane [49].
Table 3 shows the apomorphic characters present in the phenotypes studied, observing sinapomorphic characters (shared characters) among the phenotypes USA1, India2, and India4, such as geranylgeraniol, while India4 and India5 share farnesol; Brazil2 and India5, steroids; Colombia2 and Colombia4, tannic acid; Colombia4, India6, and Bangladesh1, saponins; India5 and Indonesia1, carotenoids; Brazil2 and Indonesia1, bixin and norbixin; Brazil3 and Ecuador1, ocimene; USA1 and Ecuador1, activity against S. aureus; and India6 and Colombia3 activity against P. aeruginosa. On the other hand, cis-norbixin, trans-norbixin, spathulenol, isoledene, bergamotene, germacrene, farnesol, and caryophyllene are autopomorphic states (unique characters) because they are present in a single taxon or genotype.
Table 4 also shows that apomorphic characters (new or derived) are related to the antimicrobial activity against P. aeruginosa, E. coli, S. aureus, and biochemical class, primarily the carotenoids bixin, norbixin, 9′-cis-norbixin, transnorbixin, saponins, and monoterpenes. The plesiomorphic characters are more closely related to the hepatoprotective, chemo-preventive and anticancer activity against the cell lines MCF-7, NCI-H460, PC-3, HT-29, HeLa and A549, as well as the presence of flavonoids (naringenin, kaempherol, anthocyanins, procyanidines, ellagic acid), triterpenes (stigamesterol and sitoesterol), tannins (granatin, neostrictinin), sesquiterpenes (elemene, caryophyllene), and coumarins. It stands out that some compounds from B. orellana in this bioprospective analysis act on microorganisms that can cause public health problems, highlighting characters for a possible program for genetic improvement.
It is important to highlight that the non-detection of a compound does not mean it is absent, since it could have remained undetected because of the plant organ used, extraction method, or the seasonal time of sample harvesting. For future studies, our proposal is to elucidate the “absent” compounds to secure the grouping; something to remember is that since this is a meta-analysis, comparative biases between taxa can occur due to the various methods of sampling, extraction, and analysis. Over-studied and under-studied taxa can bring biases to the analysis.
Multivariate Analysis
The multivariate analysis allowed us to identify the variables that explain the highest explicative variance, in addition to exploring the correlations and reducing the dimension of the analysis with new indices [50]. It was determined that in four principal components (PCs), the accumulated value is 86.31% (Table 5).
As shown in Table 6, the extraction organ had a distinct impact on the biochemical variability of the phenotypes. Leaves exhibited a higher factor loading in PC1 (0.40600), indicating that this tissue concentrates a significant proportion of metabolites relevant to the differentiation of chemical profiles. In contrast, seeds contributed to a lesser extent, with a lower loading in PC2 (0.09092), suggesting that while their chemical composition is unique, their impact on the overall variability is less influential.
The extraction method revealed differences in the recovery of bioactive compounds. Ethanol extraction showed a significant loading in PC1 (0.07360), suggesting higher efficiency in recovering key metabolites, particularly carotenoids and flavonoids.
Biochemical classes exhibited clear differentiation among phenotypes. Terpenoids were particularly prominent, with loadings in both PC1 (0.18442) and PC2 (0.20710), highlighting their structural role in the chemical variability of B. orellana L. Principal component analysis also revealed the influence of secondary metabolites of B. orellana L., showing a clear differentiation between phenotypes in the profiles of bioactive compounds, as outlined in the table. Flavonoids stood out with a significant factor loading in CP1 (0.29988), reflecting their strong association with the observed chemical variability, indicating that they play a key role in phenotype differentiation. On the other hand, tannins, with a prominent contribution in CP3 (0.20160), serve a differentiating role in a secondary dimension of biochemical variability. Terpenoids, with notable loadings in CP1 (0.18442) and CP2 (0.20710), further confirm their significant impact on the chemical composition of phenotypes, underscoring their contribution to the observed chemical variability.
The biological activity of B. orellana L. was assessed in terms of its anticancer and antimicrobial potential, with a focus on its relationship to the chemical profiles of the phenotypes. The results show a strong correlation between antiproliferative activity and the MCF-7 (0.47323) and HeLa (0.45618) cell lines in CP1, indicating that these cell lines are key differentiators between phenotypes. This association suggests that certain phenotypes exhibit a stronger antiproliferative effect, highlighting their potential for future pharmacological research.
Regarding antimicrobial activity, the relationship with E. coli (0.21822) and P*. aeruginosa* (0.16510) in CP2 and CP1, respectively, indicates a moderate association between antimicrobial activity and the biochemical differentiation of phenotypes. These findings suggest that B. orellana L. phenotypes could have potential for antimicrobial applications, particularly against these bacteria. In contrast, the activity against C. albicans showed less influence on the overall data structure, suggesting its impact on the observed variability is less significant.
Overall, it is observed that anticancer activity is influenced by the presence of bixin and norbixin; however, the results also suggest that these compounds are not the sole determinants. Synergistic interactions with flavonoids and alkaloids may also be contributing to the observed activity, emphasizing the complexity of the mechanisms involved.
4. Conclusions
There is scientific evidence for the use of B. orellana L. as an agent with anticancer activity, primarily against the cell lines U251, MCF-7, HeLa, NCI-H460, PC-3, A549, and HT-29, as well as biological activity against S. aureus, E. coli, and P. aeruginosa. The antimicrobial and anticancer activity is related primarily to biochemical compounds such as geranylgeraniol, ellagic acid, carotenoids (bixin and norbixin), naringenin, and alkaloids. The conditions of reproductive isolation of the phenotypes from Brazil, Yucatán, India, and Indonesia provided the ideal agroclimatic conditions to produce compounds with biological activity, because they produce those metabolites. This analysis can be used as reference for additional studies, genetic improvement programs, and the revaluation of the species.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Karmakar A.U. Sultana S. Nishi S. Nath Biswas N. Hossain L. Sheikh S. Antioxidant, analgesic, antimicrobial, and anthelmintic activity of the dried seeds of Bixa orellana (L)Int. J. Pharm.20188150163
- 2Quiñones B.X. Yunda M.C. El achiote Bixa orellana L. como posible alternativa productiva para el Departamento del Meta Rev. Sist. Prod. Agroecol.20145142173
- 3Arias-Pérez I.M. De Dios-Durán F.M. Caracterización morfológica de una muestra local de Bixa orellana L., en Tabasco, México Rev. Agroproduct.201369197
- 4Vidusha A. Gayatri Devi R. Selvaraj J. Cytotoxic effects of Bixa orellana bark extracts on human cell line (cell line HEPG 2)J. Pharm. Negat. Results 2022131811181610.47750/pnr.2022.13.S 06.238 · doi ↗
- 5Perecin M.B. Bovi O.A. Maia N.B. Pesquisa com plantas aromáticas, medicinais corantes: O papel do Instituto Agronómico O Agron.2002542124
- 6Dike I.P. Ibojo O.O. Omonhinmin D.F. Phytochemical and proximate analysis of foliage and seed of Bixa orellana Linn Int. J. Pharm. Sci. Rev. Res.201636247251
- 7López C.P. Sumalapao D.E.P. Villarante N.R. Hepatoprotective activity of aqueous and ethanolic Bixa orellana L. leaf extracts against carbon tetrachloride-induced hepatotoxicity Natl. J. Physiol. Pharmacol.2017797297610.5455/njppp.2017.7.0412011052017 · doi ↗
- 8Zarza-García A.L. Sauri-Duch E. Raddatz-Mota D. Cuevas-Glory L.F. Pinzón-López L.L. Rivera-Cabrera F. Mendoza-Espinoza J.A. Pharmacological, phytochemical and morphological study of three Mayan accessions of Bixa orellana L. leaves Emir. J. Food Agric.201729163169
