# Origin Identification of Scutellariae radix Based on Multidimensional Quality Indicators and Machine Learning Algorithms

**Authors:** Xiao-Lu Liu, Tong Zhu, Ming-Yue Zhang, Jun-Xuan Yang, Hua Li, Bin Yang

PMC · DOI: 10.3390/molecules31040680 · 2026-02-15

## TL;DR

This study develops a method to identify the origin of Scutellariae radix using quality indicators and machine learning, finding that Random Forest performs best with limited data.

## Contribution

Proposes a novel origin identification method for Scutellariae radix using multidimensional quality indicators and machine learning, with a focus on Random Forest's performance in small-sample settings.

## Key findings

- Random Forest achieved 75% test accuracy in origin classification, outperforming other models.
- Significant differences in baicalin, wogonoside, and chromaticity values were found among origins.
- Neural networks like RBF and BP showed lower accuracy and recall compared to ensemble methods.

## Abstract

This study aims to establish an origin identification method for Scutellariae radix that integrates multidimensional quality indicators and machine learning algorithms, enabling accurate and rapid traceability of Scutellariae radix medicinal materials from four production areas: Hebei (HB), Shanxi (SX), Shaanxi (SAX), and Chengde (CD). The study collected a total of 43 batches of Scutellariae radix samples from the aforementioned origins. It systematically measured 12 key quality indicators covering flavonoids, physicochemical parameters, chromaticity values, and biological activity. These specifically include four flavonoid components: baicalin, wogonoside, baicalein, and wogonin; three physicochemical parameters: moisture content, ash content, and alcohol-soluble extract; four chromaticity values: L*, a*, b*, and ΔE; and in vitro anti-inflammatory activity (IC50 value for NO clearance). On the basis of these parameters, in this study there were five machine learning models constructed based on the following algorithms and methods: Random Forest (RF), Extreme Learning Machine (ELM), Backpropagation Neural Network (BP), and Radial Basis Function Neural Network (RBF). A comparative analysis was conducted to evaluate the origin identification performance of each model. The results indicate significant differences (p < 0.05) in the contents of baicalin, wogonoside, L*, a*, b*, ΔE, and alcohol-soluble extract among Scutellariae radix from different origins. The comparative analysis of four machine learning models reveals that RF outperforms ELM, BP, and RBF in multiclass classification, achieving a test accuracy of 75% and consistent precision, recall, and F1-score of 79.17%. In contrast, the three neural networks attain only 66.67% test accuracy, with RBF showing high precision but low recall, ELM delivering moderate performance, and BP performing poorly. These results underscore the strength of ensemble methods like RF in small-sample settings, where they mitigate overfitting and enhance generalization, whereas neural networks struggle with limited data. We therefore recommend RF for deployment under current data constraints and suggest future work should focus on data expansion, especially for under-performing classes, along with hyperparameter tuning to further improve classification.

## Linked entities

- **Chemicals:** baicalin (PubChem CID 64982), wogonoside (PubChem CID 3084961), baicalein (PubChem CID 5281605), wogonin (PubChem CID 5281703)

## Full-text entities

- **Diseases:** ELM (MESH:D007859), injury to (MESH:D014947), inflammatory (MESH:D007249)
- **Chemicals:** ethanol (MESH:D000431), NO (MESH:D009569), aglycone (MESH:C458179), Wogonoside (MESH:C473995), water (MESH:D014867), Acetonitrile (MESH:C032159), Formic acid (MESH:C030544), methanol (MESH:D000432), Baicalein (MESH:C006680), glucose (MESH:D005947), DMSO (MESH:D004121), Flavonoid (MESH:D005419), Baicalin (MESH:C038044), glycoside (MESH:D006027), Alcohol (MESH:D000438), LPS (MESH:D008070), CO2 (MESH:D002245), Dexamethasone (MESH:D003907), amino acids (MESH:D000596), Dulbecco's Modified Eagle Medium (-), Wogonin (MESH:C085514), NO (MESH:D009614)
- **Species:** Scutellaria baicalensis (Baikal skullcap, species) [taxon 65409], Homo sapiens (human, species) [taxon 9606]
- **Cell lines:** RAW 264.7 — Mus musculus (Mouse), Mouse leukemia, Cancer cell line (CVCL_0493)

## Figures

3 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12943610/full.md

---
Source: https://tomesphere.com/paper/PMC12943610