# Adulteration Detection of Multi-Species Vegetable Oils in Camellia Oil Using SICRIT-HRMS and Machine Learning Methods

**Authors:** Mei Wang, Ting Liu, Han Liao, Xian-Biao Liu, Qi Zou, Hao-Cheng Liu, Xiao-Yin Wang

PMC · DOI: 10.3390/foods15030434 · Foods · 2026-01-24

## TL;DR

This study developed a fast and accurate method using SICRIT-HRMS and machine learning to detect and quantify adulteration of camellia oil with multiple vegetable oils.

## Contribution

The novel approach combines SICRIT-HRMS with machine learning to detect and quantify multi-species oil adulteration in camellia oil.

## Key findings

- SICRIT-HRMS effectively characterized volatile profiles of pure and adulterated camellia oil samples.
- Machine learning models achieved high accuracy in detecting and quantifying adulteration levels.
- PCA-CNN model showed optimal performance in predicting adulteration levels with high determination coefficients and low error rates.

## Abstract

We aimed to establish a rapid and precise method for identifying and quantifying multi-species vegetable oil (corn oil, olive oil (OLO), soybean oil, and sunflower oil (SUO)) adulterations in camellia oil (CAO), using soft ionization by chemical reaction in transfer–high-resolution mass spectrometry (SICRIT-HRMS) and machine learning methods. The results showed that SICRIT-HRMS could effectively characterize the volatile profiles of pure and adulterated CAO samples, including binary, ternary, quaternary, and quinary adulteration systems. The low m/z region (especially 100–300) exhibited importance to oil classification in multiple feature-selection methods. For qualitative detection, binary classification models based on convolutional neural networks (CNN), Random Forest (RF), and gradient boosting trees (GBT) algorithms showed high accuracies (98.70–100.00%) for identifying CAO adulteration under no dimensionality reduction (NON), principal component analysis (PCA), and uniform manifold approximation and projection (UMAP) strategies. The RF algorithm exhibited relatively high accuracy (96.25–99.45%) in multiclass classification. Moreover, the five models, including CNN, RF, support vector machines (SVM), logistic regression (LR), and GBT, exhibited different performances in distinguishing pure and adulterated CAO. Among 1093 blind oil samples, under NON, PCA, and UMAP: 10, 5, and 67 samples were misclassified by CNN model; 6, 7, and 41 samples were misclassified by RF model; 8, 9, and 82 samples were misclassified by SVM model; 17, 18, and 78 samples were misclassified by LR model; 7, 9, and 43 samples were misclassified by GBT model. For quantitative prediction, the PCA-CNN model performed optimally in predicting adulteration levels in CAO, especially with respect to OLO and SUO, exhibiting a high coefficient of determination for calibration (RC2, 0.9664–0.9974) and coefficient of determination for prediction (Rp2, 0.9599–0.9963) values, low root mean square error of calibration (RMSEC, 0.9–5.3%) and root mean square error of prediction (RMSEP, 1.1–5.8%) values, and RPD (5.0–16.3) values greater than 3.0. These results indicate that SICRIT-HRMS combined with machine learning can rapidly and accurately identify and quantify multi-species vegetable oil adulterations in CAO, which provides a reference for developing non-targeted and high-throughput detection methods in edible oil authenticity.

## Full-text entities

- **Chemicals:** vegetable oil (MESH:D010938), CAO (-), corn oil (MESH:D003314), OLO (MESH:D000069463), oil (MESH:D009821)

## Full text

_Full body text omitted from this summary view._ Fetch the complete paper as Markdown: https://tomesphere.com/paper/PMC12896747/full.md

## Figures

10 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12896747/full.md

## References

61 references — full list in the complete paper: https://tomesphere.com/paper/PMC12896747/full.md

---
Source: https://tomesphere.com/paper/PMC12896747