# Predicting Bioactive Compounds in Arbutus unedo L. Leaves Using Machine Learning: Influence of Extraction Technique, Solvent Type, and Geographical Location

**Authors:** Jasmina Lapić, Anica Bebek Markovinović, Nikolina Račić, Lana Vujanić, Marko Kostić, Dušan Rakić, Senka Djaković, Danijela Bursać Kovačević

PMC · DOI: 10.3390/foods15060993 · Foods · 2026-03-11

## TL;DR

This study uses machine learning to predict bioactive compounds in Arbutus unedo leaves, finding that solvent type and extraction method significantly affect compound recovery.

## Contribution

The novel use of machine learning models to predict bioactive compound recovery based on extraction parameters and geographical origin.

## Key findings

- 70% ethanol yielded the highest levels of bioactives and antioxidant capacity.
- Leaves from Vis had higher total phenolics and condensed tannins than those from Mali Lošinj.
- Gradient Boosting and Decision Tree models achieved high accuracy (R2 > 0.91) in predicting bioactive compounds.

## Abstract

This study investigates the effects of extraction technique, solvent type, and geographical origin on the recovery of bioactive compounds from Arbutus unedo L. leaves collected from two Croatian islands (Vis and Mali Lošinj) and extracted using conventional, Soxhlet, and ultrasound-assisted extraction (UAE) with green solvents (distilled water, 70% ethanol, and ethyl acetate). Extracts were purified and characterized by thin-layer chromatography, column chromatography, and FTIR spectroscopy. Total phenols, hydroxycinnamic acids, flavonols, condensed tannins, and antioxidant capacity were quantified spectrophotometrically. Solvent type had the greatest influence, with 70% ethanol yielding the highest levels of bioactives and antioxidant capacity. Geographical origin significantly affected total phenolics and condensed tannins, with leaves from Vis outperforming those from Mali Lošinj. UAE was slightly more efficient than conventional and Soxhlet methods, particularly for thermolabile phenolics. Machine learning algorithms were applied as exploratory tools, using total phenols as a proxy variable to estimate selected bioactive compounds and antioxidant capacity based on extraction parameters. Decision Tree and Gradient Boosting models showed high goodness of fit within the experimental dataset (R2 > 0.91). These results support the potential of green extraction strategies combined with data-driven screening for the valorization of A. unedo leaf extracts, while highlighting the need for further validation prior to industrial application.

## Linked entities

- **Chemicals:** 70% ethanol (PubChem CID 702), ethyl acetate (PubChem CID 8857), distilled water (PubChem CID 962)
- **Species:** Mus musculus (taxon 10090)

## Full-text entities

- **Chemicals:** phenolics (-), hydroxycinnamic acids (MESH:D003373), phenols (MESH:D010636), ethanol (MESH:D000431), condensed tannins (MESH:D044945), ethyl acetate (MESH:C007650), flavonols (MESH:D044948), water (MESH:D014867)
- **Species:** Arbutus unedo (strawberry tree, species) [taxon 84005]

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC13024744/full.md

## Figures

6 figures with captions in the complete paper: https://tomesphere.com/paper/PMC13024744/full.md

## References

46 references — full list in the complete paper: https://tomesphere.com/paper/PMC13024744/full.md

---
Source: https://tomesphere.com/paper/PMC13024744