Lavender hydrosol analysis using UV spectroscopy data and partial least squares regression
Sára Preiner, Bálint Levente Tarcsay, Dóra Pethő, Norbert Miskolczi

TL;DR
This study uses UV spectroscopy and machine learning to estimate the composition of lavender hydrosol, a byproduct of essential oil production.
Contribution
A PLS regression model is developed to predict hydrosol composition from UV–Vis spectra with high accuracy.
Findings
UV–Vis spectra combined with PLS regression can accurately predict hydrosol composition changes.
The PLS model achieved an R2 score above 0.95 during cross-validation.
GC–MS confirmed the composition of extracted lavender essential oil components.
Abstract
The aim of our work was to estimate the composition of hydrosol produced as a byproduct of lavender steam distillation using UV–Vis spectrophotometry in the 200–600 nm wavelength range through a machine learning algorithm. The dissolved components of lavender essential oil (EO) from lavender hydrosol samples were extracted via liquid-liquid extraction, using three different solvents (pentane, heptane and diethyl ether). The UV–Vis absorbance spectra of the extracts were recorded and the composition analyzed using GC–MS. The composition data obtained allowed for the calculation of changes within the quantities of different EO components in the samples. The partial least squares regression technique (PLS) was utilized to establish a connection between changes in the composition of the hydrosol and the changes in the UV–Vis spectra. After optimization the established PLS model showed an…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsWater Quality Monitoring and Analysis · Spectroscopy and Chemometric Analyses · Advanced Chemical Sensor Technologies
Specifications tableSubject area:Chemical EngineeringMore specific subject area:Composition detection using soft sensorsName of your method:Lavender hydrosol analysis using UV–Vis spectroscopy and partial least squares regressionName and reference of original method:–Resource availability:–
Background
In this work a method is developed to infer composition changes within lavender hydrosol based on UV–Vis spectra and partial least squares (PLS) regression. Essential oil (EO) isolated from lavender is one of the most used aromatic products which can be found in food, pharmaceuticals and cosmetic industries just like in aromatherapy due to its pleasant fragrance and calming effect on the nervous system [1,2].
Its extensive usage can be attributed to the chemical composition of the EO. These oils can contain more than one hundred components [3]. Linalool and linalyl acetate are the two main components of lavender EO but eucalyptol, α-terpineol, terpinen-4-ol, lavandulol and lavandulyl acetate are also important compounds. The amount of these components depends on the lavender type and extraction technique [4,5].
In industrial applications steam distillation is the most commonly used method to extract EO [6,7]. Through steam distillation of lavender, huge amounts of aqueous solution remain as industrial byproducts called hydrosol or floral water. Several studies investigated the composition and effect of the hydrosols which can widely be used in different sectors of the pharmaceutical, food and cosmetics industry [8]. Therefore, it is required to develop methods to effectively estimate the composition of hydrosols with a minimal number of experimental procedures to minimize the costs of analysis. In this work the authors developed a method to calculate the changes in EO component mass of lavender hydrosol using UV–Vis analysis and partial least squares regression (PLS).
The investigation of the hydrosols in terms of composition and different properties were implemented by liquid-liquid extraction and gas chromatography analysis of the extract [9,10], while biological properties, application possibilities, aromatic, antioxidant and antimicrobial effects of lavender aromatic water were determined [11,12].
The aim of our work was to develop a faster and cheaper method to quantify the main components of lavender hydrosol using soft sensors. UV–Vis spectrophotometry was used to define the changes in the main components of the floral water. The volatile components of the hydrosol were separated by liquid -liquid extraction. The original hydrosols and the extracts were analyzed with UV–Vis spectrophotometry. To specify the main components the extracts were analyzed by GC–MS and compared to the spectra of the samples.
Subsequently the partial least squares regression (PLS) was used to establish a link between changes in the composition of the hydrosol (based on GC–MS) and the changes in the UV–Vis spectra [13]. Using the PLS regression as a soft sensor the GC–MS step to determine the composition changes within the hydrosol could be eliminated and the composition changes could be directly inferred from changes in the UV–Vis spectra of hydrosol.
The high accuracy PLS model developed ( during five-fold cross-validation) enabled the elimination of the use of GC–MS and thus simplified the composition evaluation process [14].
Method details
Sample collection and analysis
Various types of lavender hydrosols were investigated to establish the composition estimation method. Industrial samples were collected in the city of Tihany (46.90891°N 17.87923°E), which belongs to the main lavender essential oil producing region of Hungary. Lavender plants were collected from different production areas there to perform laboratory scale experiments. The plant samples were dried until constant mass and were chopped before the steam distillation measurements to uniform size distribution. The measured sample (170 g) and the steam flowrate was the same in case of every experiment. After the steam distillation methods, the lavender hydrosol samples were collected and analyzed by UV–Vis spectroscopy and liquid-liquid extraction with different extractants (pentane, diethyl ether, heptane). All in all, 22 (S1-S22) samples were produced and evaluated.
Our goal was to define the main EO components and their concentrations within the hydrosol. The original samples were tenfold diluted. 0.1 ml of the sample was measured using an automatic pipette which was transferred to a 10 cm^3^ standard flask and filled to mark. The cuvette was filled with the examined samples and their spectra was taken with Thermo Scientific Genesys 150 UV–Vis spectrophotometer. The blank solutions were distilled water for the hydrosol samples and pentane, diethyl ether and heptane for the extracts. The spectra were taken between 200 and 600 nm range wavelengths. The main differences were seen in the UV domain between 200 and 400 nm. The spectrum of different hydrosols and distilled water (DW) can be seen on Fig. 1.Fig. 1UV–Vis spectrum of distilled water (DW) and lavender hydrosols of samples S19-S22.Fig. 1
The spectra were corrected with the absorbance of distilled water. The absorbances were read at the maximum change of the spectra which were at wavelength 278 nm. Changes can be seen in the spectrum at the given wavelength so it can be inferred that every sample contains different amounts of the EO components.
To find out which are the main components of the hydrosols, a given volume of the hydrosol (50 cm^3^) was extracted with different solvents. Three different solvents, heptane, pentane and diethyl‑ether were used as extractants for the measurements. The liquid-liquid extraction was performed in a separation funnel, the contact between the two phases lasted 10 min every time. In every case 50 cm^3^ of the hydrosol sample was extracted with 10 cm^3^ solvent. The extractions were carried out in five stages for the heptane solvent and in one stage for pentane and diethyl‑ether as for the latter two spectra did not show significant changes after the first extraction step. For the heptane extraction fresh solvent was used in every stage. The extracts were also diluted tenfold or more if the extracted amount was too high and the spectrum was not relevant.
Measurement samples S1-S5 and S6-S10 were performed with industrial hydrosol from the year 2024 and 2023 respectively. Each sample composition was evaluated during one of the stages of five-stage heptane extraction method. The same industrial hydrosols were used for S11-S12 and S17-S18, while the hydrosols from laboratory experiments (with lavender plants from different producing areas) were used for S13-S16 and S19-S22. These latter samples were extracted with pentane and diethyl‑ether respectively.
To monitor the extractable components, the extracts and the raffinates were analyzed with UV–Vis as well. The spectra of the extracts (S1-S5) in five stages can be seen on Fig. 2 while the raffinate spectra are shown in Fig. 3.Fig. 2. Spectra of the heptane extracts in every stage.Fig. 2. Fig. 3Spectra of the original hydrosol and the raffinates in every stage.Fig. 3
Fig. 2 shows how the amount of extracted components changes in the solvent through the stages. After the first extraction the maximum absorbance value of the spectrum was around 0.4 while after the second extraction it did not reach the 0.2 absorbance and it continuously decreased in every stage. The same information can be read about Fig. 3. The absorbance difference of the original and once extracted hydrosol is around 0.1 at 277 nm, while there are no such big differences between the last four extracted hydrosols.
The extracts were collected in every stage and their composition was analyzed by GC–MS.
Agilent model 5975C gas chromatograph was used to perform the analysis. The GC separation was achieved on an HP5 type column (30.0 m × 0.25 mm; film thickness 0.25 µm). The injector and detector temperatures were set to 250 °C. 0.1 µL of the sample was injected on the column. The column was programmed at 10 °C/min heating rate from 50 °C to 250 °C. The samples were not diluted through the examination and in this way 9 main components were identified. These are α-pinene, β-trans-ocimene, β-cis-ocimene, linalool, octene-1-ol, camphor, terpinen-4-ol, hexyl butyrate and lavandulyl acetate. The weight percentage composition (m/m %) of the extracted oil in case of different extractants can be seen in Table 1, which shows that linalool, terpinene-4-ol and hexyl butyrate are the main components.Table 1. Composition of the extracted oil from the same hydrosol in case of different extractants (S6, S12, S18, first stage).Table 1. ComponentComposition (m/m %)PentaneDiethyl etherHeptaneα-pinene2.281.840.4β-trans-ocimene6.335.494.7β-cis-ocimene1.521.170linalool43.8041.6539.66octen-1-ol2.642.792.65camphor3.753.714.28terpinen-4-ol16.1415.0418.36hexyl butyrate6.706.7011.48lavandulyl acetate0.370.530.33
Table 1 shows the extracted oil composition but to calculate the exact amount of these components in the hydrosols, the amount of the solvent in the extract should not be ignored. The information in Table 2 was used to calculate the amount of different types of components in the extract.Table 2. Composition of the extract from the same hydrosol in case of different extractants (S6, S12, S18, first stage).Table 2. ComponentComposition (m/m %)PentaneDiethyl etherHeptanesolvent99.28397.40499.490α-pinene0.0080.0330.001β-trans-ocimene0.0100.0400.024β-cis-ocimene0.0060.0260.001linalool0.2990.9700.212octen-1-ol0.0070.0250.014camphor0.0150.0630.020terpinen-4-ol0.1520.4910.102hexyl butyrate0.0320.1160.056lavandulyl acetate0.0050.0250.002other0.1830.8070.078
After establishing the composition changes during each step of the liquid-liquid extraction process partial least squares (PLS) regression was used to establish a connection between changes in the UV–Vis spectra, the type of employed solvent and the changes in the mass of lavender EO components within the hydrosol. For a given solvent the change of mass ( ) within the water for a specific EO component was calculated using Eq. (1).
In Eq. (1). refers to the mass fraction of the i th component calculated as per the results of Table 2. The variable is the density of the extractant used for the specific sample and is the volume of the employed extractant.
The changes within the mass of the EO components within the hydrosol were calculated for each sample and different extractants. The results are shown in Table 3, where DEE refers to the solvent diethyl ether.Table 3. Changes within the mass of EO components in hydrosol during the use of different solvents.Table 3. SolventSample numberα-pinene (g)β-trans-ocimene (g)β-cis-ocimene (g)linalool (g)octen-1-ol (g)camphor (g)terpinen-4-ol (g)hexyl butyrate (g)lavandulyl acetate (g)HeptaneS10.0780.0840.0890.0920.0930.0930.0900.0860.080HeptaneS20.0080.0090.0090.0100.0110.0120.0130.0140.016HeptaneS30.0070.0070.0070.0080.0080.0090.0090.0100.011HeptaneS40.0020.0020.0020.0020.0030.0030.0030.0030.003HeptaneS50.0030.0030.0030.0030.0030.0030.0030.0040.004HeptaneS60.0700.0760.0820.0860.0890.0890.0870.0830.079HeptaneS70.0150.0160.0170.0190.0200.0220.0240.0250.027HeptaneS80.0040.0040.0040.0040.0050.0050.0050.0060.006HeptaneS90.0050.0050.0050.0050.0060.0060.0060.0070.007HeptaneS100.0020.0020.0020.0020.0020.0020.0020.0020.003PentaneS110.0400.0420.0440.0460.0480.0490.0500.0520.053PentaneS120.0190.0210.0220.0200.0200.0200.0180.0170.015PentaneS130.0140.0150.0170.0160.0170.0160.0160.0160.015PentaneS140.0390.0420.0440.0460.0480.0490.0500.0520.053PentaneS150.0230.0260.0280.0300.0320.0350.0360.0380.039PentaneS160.0090.0110.0130.0110.0120.0130.0130.0120.011DEES170.0260.0290.0310.0330.0360.0390.0410.0430.045DEES180.0240.0270.0300.0320.0350.0380.0400.0420.044DEES190.0370.0400.0430.0440.0470.0500.0510.0530.053DEES200.0450.0470.0510.0520.0550.0570.0590.0610.062DEES210.0350.0380.0410.0430.0460.0490.0510.0530.055DEES220.0250.0270.0290.0280.0300.0310.0310.0310.031
Using the 22 obtained samples the PLS regression model was established. The type of solvent and the changes in the UV–Vis spectra were used as input variables while the changes in EO component mass were used as predicted variables.
During the dimensionality reduction step the optimal number of latent variables
(PLS components) was evaluated using five-fold cross-validation. The and RMSPE (root mean square percentage error) metrics were utilized to evaluate the fit of the model to the data. The model fit as a function of PLS component number in terms of RMSPE can be seen in Fig. 4. The figure shows that after a certain point adding more latent variables to the PLS model its fit to the test data does not increase significantly. Therefore, an optimal PLS component number of 15 was chosen during further investigation, at this point the root mean square percentage error of the fit was 0.004 or 0.4 %.Fig. 4. Average [-] of the PLS model to training and test data as a function of PLS component number during five-fold cross-validation.Fig. 4
In case of each predicted EO component the cross-validated scores are displayed individually on Fig. 5. The results showcase that for each EO component the model was able to accurately predict the change of component mass within the hydrosol based on the change in the spectra.Fig. 5. Average scores during the five-fold cross-validation for each individual EO component.Fig. 5
The variable importance among the predictor variables was observed for each utilized wavelength and scored. The variable importance scores are displayed in Fig. 6. The results indicate that wavelengths between 200 and 450 nm are most significant for determining the composition changes, higher wavelength spectra may be omitted during future use of the model.Fig. 6. Variable importance of different UV–Vis wavelengths for determining changes in hydrosol EO component masses.Fig. 6
Method validation
Using the optimized PLS model the mass change in the EO components within the water was predicted for each solvent during the extraction process based on the change in the UV–Vis spectra. The predicted and measured mass changes in the EO components during the five-fold cross-validation process using the optimized model are shown in Fig. 7.Fig. 7Fit of the optimized PLS model to the test data during five-fold cross-validation.Fig. 7
As the score indicated the model has high accuracy in predicting the changes in individual EO components within the hydrosol during the extraction process for the different solvents. Greater deviations can only be observed in the case of lavandulyl acetate and hexyl butyrate being 0.02 g in magnitude.
To evaluate the appropriate nature of the PLS model for the data the distribution of prediction errors for all component mass changes during the cross-validation process were evaluated as well. These errors show an approximately normal distribution indicating that the model error is random in nature and not due to neglected terms during modeling. The distribution of the error terms can be seen in Fig. 8.Fig. 8. Distribution of the PLS model errors during the five-fold cross-validation process.Fig. 8
The results indicate that the PLS model is adequate in replacing the GC–MS step for estimating the composition changes within lavender hydrosol during fluid extraction. The model may be used for industrial purposes to evaluate the composition of lavender hydrosol derived from steam as well as hydrodistillation using UV–Vis spectroscopy without a need for more in-depth analytical techniques for composition determination.
Limitations
The model applies to the solvents used during the measurements. In case of different solvent usage, additional data must be provided to the model.
Ethics statements
This work did not involve human subjects, animal experiments or data from social networks
CRediT author statement
Sára Preiner: Conceptualization, Methodology, Investigation, Writing - Original draft preparation, Bálint Levente Tarcsay: Conceptualization, Data curation, Visualization, Writing - Original draft preparation, Dóra Pethő: Supervision, Writing- Reviewing and Editing, Norbert Miskolczi: Supervision, Writing - Reviewing and Editing,
Declaration of competing interest
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Silva G.L.Luft C.Lunardelli A.Amaral R.H.Melo D.A.Donadio M.V.Oliveira J.R.Antioxidant, analgesic and anti-inflammatory effects of lavender essential oil Anais da Academia Brasileira de Ciências 872015139714082624715210.1590/0001-3765201520150056 · doi ↗ · pubmed ↗
- 2Wesołowska A.Jadczak D.Zyburtowicz K.Influence of distillation time and distillation apparatus on the chemical composition and quality of mill. Essential oil Polish J. Chem. Technol.25420233643
- 3Pokajewicz K.BiałońM.Svydenko L.Fedin R.Hudz N.Chemical composition of the essential oil of the new cultivars of Lavandula angustifolia Mill. Bred in Ukraine Molecules 2618202156813457715210.3390/molecules 26185681 PMC 8465323 · doi ↗ · pubmed ↗
- 4Gök S.B.Erdoğdu Y.Chemical composition and antimicrobial activity of essential oils from six lavender (Lavandula angustifolia Mill.) cultivars Plant, Soil Environ.7022024
- 5Babaee Khalajee M.Jaimand K.Mozaffari S.Mirshokraie S.A.Comparative study on essential oils of Lavandula officinalis L. from three different sites with different methods of distillation J. Medic. Plant. By-Product 6120175358
- 6Stratakos A.C.Koidis A.Methods for extracting essential oils Essential Oils in Food preservation, Flavor and Safety 2016 Academic Press 3138
- 7Périno-Issartier S.Ginies C.Cravotto G.Chemat F.A comparison of essential oils obtained from lavandin via different extraction processes: ultrasound, microwave, turbohydrodistillation, steam and hydrodistillation J. Chromatogr. A 1305201341472389054510.1016/j.chroma.2013.07.024 · doi ↗ · pubmed ↗
- 8Politi M.Menghini L.Conti B.Bedini S.Farina P.Cioni P.L.De Leo M.Reconsidering hydrosols as main products of aromatic plants manufactory: the lavandin (Lavandula× intermedia) case study in Tuscany Molecules 259202022253239738510.3390/molecules 25092225 PMC 7249177 · doi ↗ · pubmed ↗
