Raman spectroscopy as an alternative approach for prediction of silicate mineral content in sedimentary rocks
Zuzana Pěgřimočová, Michal Ritz

TL;DR
Raman spectroscopy is shown as a fast and non-destructive method to estimate silicate mineral content in sedimentary rocks.
Contribution
A novel application of Raman spectroscopy with PLSR models for rapid silicate mineral prediction in claystones and clay shales.
Findings
PLSR models achieved correlation coefficients > 0.91 for calibration and > 0.84 for validation samples.
RMSEP values ranged from 1.8 to 4.9 wt%, indicating good predictive accuracy.
The method is suitable for on-site mineral content estimation using mobile Raman instruments.
Abstract
This work presents Raman spectroscopy as an alternative approach for rapid prediction of silicate mineral content in sedimentary rocks. Raman spectra of clay shale and claystone samples were measured by dispersive Raman spectrometer using the excitation laser wavelength of 633 nm. Due to the presence of strong fluorescence and overlapping bands in Raman spectra of claystones and clay shales, partial least squares regression (PLSR) was employed as a multivariate calibration method for semi-quantitative estimation of silicate mineral content. The PLSR models were developed for four predominant silicate minerals of claystones and clay shales – chlorite, muscovite, quartz and albite. The correlation coefficients for calibration and validation samples were > 0.91 and > 0.84, respectively. The root mean square error of prediction (RMSEP) ranged from 1.8 to 4.9 wt %. The developed PLSR models…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3- —The Ministry of Education, Youth and Sports of the Czech Republic and Co-founded by the European Union
- —The Ministry of Education, Youth and Sports of the Czech Republic
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSpectroscopy Techniques in Biomedical and Chemical Research · Soil Geostatistics and Mapping · Spectroscopy and Chemometric Analyses
Introduction
Knowledge of mineral composition (type and amount of minerals) of rocks is essential for determining their physical and mechanical properties and subsequent applications. Moreover, quantitative mineral composition of sedimentary rocks provides information about processes like weathering, sedimentation, transportation and diagenesis^1^. Rock analysis can be performed by several techniques. These techniques include X-ray fluorescence spectrometry (XRF)^2–4^, X-ray diffraction (XRD)^1,3,5^, electron microscopy (EM)^6–8^, inductively coupled plasma-mass spectrometry (ICP-MS)^9–11^, Fourier-transform infrared (FTIR) spectroscopy^2,3,12^, Raman spectroscopy^13–15^, Mössbauer spectroscopy (for iron-bearing minerals)^16,17^ and laser-induced breakdown spectroscopy (LIBS)^18,19^.
The most powerful and widely used technique for the quantification of mineral composition of rocks is X-ray diffraction^5,20^. Nonetheless, this technique has some limitations, including laborious and time-consuming process of powdering the sample (particle-size fractionation and milling), complex interpretation of diffractograms and its unavailability for amorphous materials^21,22^. Additionally, the quantification of e.g. clay minerals remains challenging due to the high degree of structural diversity, structural disorder, various chemical composition and preferred orientation which may lead to quantitative errors^5^. As an alternative, methods of vibrational spectroscopy (FTIR and Raman spectroscopy) can be advantageously utilized for quantification of mineral composition of rocks. These methods enable non-destructive and rapid measurements with little or no sample preparation. Moreover, a very small amount of sample (a few milligrams) is sufficient for analysis^22–24^. The use of Raman spectroscopy has many advantages over much more utilized infrared spectroscopy, including its capability to measure moist samples since water is very weak Raman scatterer and the possibility to be used on-site^23–25^. Nowadays, Raman spectroscopy offers a variety of mobile instruments. These instruments include portable Raman spectrometers weighing usually around 10 kg, smaller handheld spectrometers, or even pocket-size spectrometers weighing less than 1 kg^26^. In comparison with laboratory spectrometers, mobile Raman spectrometers provide information about studied rocks in a short amount of time, but with a lower spectral resolution^27^.
The interpretation of vibrational spectra of polymineral rocks can be challenging due to the presence of strongly overlapping bands^22,28,29^. Moreover, in case of Raman spectroscopy, fluorescence can be induced while using short-wavelength excitation lasers, resulting in strong fluorescence band (which is generally several orders of magnitude larger than Raman signal) in Raman spectrum^30,31^. This fluorescence band can often cause a loss of spectral information by overlapping the Raman signal. In these cases, the use of multivariate calibration methods seems to be the solution^32^ as they do not work with single characteristic spectral band but the whole spectrum^33^.
The objective of multivariate calibration is to establish a relationship between a desired property of a sample (e.g., concentration) and its spectrum^34^. In the field of vibrational spectroscopy, partial least squares (PLS) regression is frequently used as a multivariate calibration method^35–39^. This multivariate analysis method was originally developed at the end of 1960 s by Herman Wold within the field of econometrics and later adapted for use in chemistry by Svante Wold and Harald Martens^40^. The PLS is used to establish a relationship between data matrix X (independent variables) and vector y (dependent variables). The independent variables describe the response of the instrument (FTIR or Raman spectra), while the dependent variables describe the required property (e.g. mineral content obtained using reference method). PLS can reduce the original spectral data into a smaller number of latent variables called PLS factors, which are able to describe maximum covariance between variables X (spectral data) and Y (mineral content)^36,41–43^. A mathematical and much more detailed description of the PLS modeling can be found in the relevant publications^40,44,45^.
The development of a multivariate calibration model generally consists of several steps. First, representative calibration samples must be selected and their properties determined (e.g. mineral composition using reference method). The vibrational (infrared or Raman) spectra of these calibration samples are collected. Next, the calibration model is calculated and validated using a set of validation samples or cross-validation. Finally, the calibration model is applied to analyze unknown samples^42^.
The multivariate calibration methods (mainly PLS regression) were used for the quantification of mineral composition of rocks using both FTIR spectroscopy^12,22,29,38,43,46–48^ and Raman spectroscopy^28,49,50^. However, there is a lack of studies that focus on the quantification of mineral composition of rocks using Raman spectroscopy in more complex mixtures, where the spectral bands are strongly overlapping and are affected by fluorescence. The utilization of Raman spectroscopy for quantitative purposes may offer greater potential than FTIR spectroscopy, as it is available for on-site analysis.
The objective of the present study is to investigate the potential of Raman spectroscopy in conjunction with partial least squares (PLS) regression as a rapid method for predicting the major mineral content of sedimentary rocks. Fifty-two powdered samples of clay shales and claystones were employed as calibration and validation sets to build PLSR models for prediction of chlorite, muscovite, quartz and albite content. The predictive ability of developed PLSR models was tested via analyzing six control samples. The same control samples were analyzed to test the repeatability of measurements. In terms of accuracy, the utilization of Raman spectroscopy and PLSR cannot compete with XRD analysis as it depends on the accuracy of the XRD results. However, this approach can be used for rapid and non-destructive on-site estimation of silicate mineral content in claystones and clay shales. Moreover, this approach is suitable for the analysis of samples available in very small amounts (a few milligrams). To the best of our knowledge, this is the first time when Raman spectroscopy in conjunction with partial least squares (PLS) regression has been employed to predict the silicate mineral content using sedimentary rocks as calibration and validation samples.
Methods
Collection of claystone and clay shale samples
Fifty-eight pulverized samples of clay shales (forty samples) and claystones (eighteen samples) were obtained from the collection of Institute of Geonics of the Czech Academy of Sciences. The clay shale samples originated from Kyjovice layers (Lower Carboniferous period of the Moravian-Silesian area). The claystone samples were collected from the lower-to-middle Cretaceous strata in the Silesian unit of the Moravian-Silesian Beskydy Mountains (Mazak and Godula formations, Štramberk area, Hradiště and Lhota formations). The X-ray diffraction (XRD) revealed that quartz, muscovite, chlorite and albite were the most abundant minerals in all the samples. The samples of claystones from Štramberk area also revealed the prevalence of calcite. Some of the samples contained dolomite, kaolinite, ankerite, pyrite, hematite, siderite, orthoclase and rutile in minor quantities. However, most of the samples fell below the detection limit for these minor minerals. For these reasons, the research was focused on the estimation of chlorite, muscovite, quartz and albite content in the samples of claystones and clay shales. The same samples have also been used in previous studies focused on the quantification of chlorite, muscovite, quartz and albite using the DRIFT and KBr pellet techniques of FTIR spectroscopy and the partial least squares regression (PLSR)^12,46^.
Of the total number of fifty-eight samples, fifty-two samples were used for calibration and validation (XRD results are in Supplementary Information Table S1) and six samples were used as control samples (C1-C6; XRD results are in Supplementary Information Table S2). Control samples were selected in such a way that they cover the entire concentration range of analyzed minerals. To construct the PLSR model for prediction of chlorite content, thirty calibration samples (in range from 2.8 to 25.5 wt %) and nine validation samples were employed. For construction of PLSR model for prediction of muscovite content, thirty-eight calibration samples (in range from 10.5 to 56.0 wt %) and eight validation samples were employed. For construction of PLSR model for prediction of quartz content, thirty-five calibration samples (in range from 13.5 to 64.3 wt %) and eight validation samples were employed. Finally, for construction of PLSR model for prediction of albite content, twenty-six calibration samples (in range from 2.3 to 27.0 wt %) and six validation samples were employed.
X-ray powder diffraction (XRD) analysis
Knowledge of mineral composition of the claystone and clay shale samples was important for development of multivariate calibration models and for determination of accuracy. In the case of this study, X-ray powder diffraction analysis was utilized for these purposes. XRD analysis has been previously conducted in the context of earlier studies^12,46^. XRD measurements were carried out by diffractometer ID3003 (Rich. Seifert-FPM, Germany) using the Co lamp (heated at conditions: 40 kV, 35 mA), Fe filter, 2θ/θ goniometer geometry, step size 0.05° in 2θ range and 3 s measurement time per step. For measurement and evaluation (qualitative and quantitative), the manufacturer’s RayfleX (RayfleX scanX, RayfleX Analyze and RayfleX Autoquan) software (version 2.289) was used. The content of minerals was quantified from diffraction data by Rietveld technique^51^.
Raman measurements
Raman spectra of powdered claystones and clay shales were collected using dispersive Raman spectrometer DXR SmartRaman (Thermo Scientific, USA) equipped with 633 nm He-Ne laser, 1200 lines/mm grating, CCD detector and 180° reflective sampling accessory. The excitation laser power was set to 7 mW. The aperture, number of exposures and exposure time were as follows: 50 μm slit, 500 times and 1s. The spot size of laser was 10 μm in diameter. The spectra were obtained in the spectral range of 1900–50 cm^− 1^. Prior to measurement, the spectrometer was calibrated using polystyrene and silicon as calibration standards. Raman spectra were recorded and processed using the OMNIC 9 software (version 9.11.706, Thermo Fisher Scientific Inc., USA; https://www.thermofisher.com/). A single Raman spectrum was recorded for each sample and therefore no spectral averaging was performed. As a treatment, the fluorescence correction was used (6th order polynomial).
Development of PLS models
To predict the content of silicate minerals in claystones and clay shales, PLSR models were developed using the TQ Analyst 9 software (version 9.3.107, Thermo Fisher Scientific Inc., USA; https://www.thermofisher.com/) and imported into OriginPro 8 (version 8.0725, OriginLab Corporation, USA; https://www.originlab.com/) software. The PLSR analysis was performed on the acquired Raman spectra without normalization. The initial step involved the use of calibration and validation sets (samples of claystones and clay shales with known composition) to build PLSR model and evaluate performance of calibration. The validation samples were selected in a manner that ensured coverage of the concentration range of the analyzed minerals. The mineral content of calibration and validation samples was determined by XRD analysis (used as a reference method) as described in Sect. X-ray powder diffraction (XRD) analysis. The quantitative data were then assigned to the corresponding Raman spectra of calibration and validation samples. Finally, the spectral region corresponding to Raman shift of approximately 1150–100 cm^− 1^ was chosen for analysis due to the presence of bands characteristic for the studied minerals^52–54^.
To optimize the method, the outlier spectra were assessed using two diagnostic tools (both provided by the TQ Analyst software) – Spectrum Outlier (utilizing the Chauvenet’s test for outlier detection) and Principal Component Scores (PC Scores). The PLSR models were optimized by excluding spectra of samples that were affected by high level of fluorescence emission, and whose content of the target mineral was below the detection limit of the reference method (XRD). The optimal number of factors was chosen based on the PRESS (Predicted Residual Sums of Squares) diagnostic. The PRESS diagnostic is based on the relationship between the root mean square error of cross-validation (RMSECV) and number of PLS factors. An insufficient or excessive number of PLS factors can lead to suboptimal prediction because all the variances in the data may not be represented or in the second case the model begins to include noise^38^. The selected number of factors for calibration was subsequently verified by the Loading spectra diagnostic (which displays a loading spectrum for each factor).
The developed PLSR models were characterized by the following parameters: correlation coefficient (R), root mean square error of calibration (RMSEC) and root mean square error of prediction (RMSEP). The aforementioned parameters provided information on the accuracy of the method by establishing a comparison between predicted and reference values. The smaller the RMSE value, the more accurate the PLSR model is.
The calibration error of the PLSR model, expressed as the root mean square error of calibration (RMSEC), was calculated by Eq. (1).
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:RMSEC=\:\sqrt{\frac{\sum\:_{i=1}^{n}{({c}_{i,pred}-{c}_{i,ref})}^{2}}{n}}$$\end{document}Where ci, pred is the mineral content of the i^th^ calibration sample predicted from the PLS model, ci, ref is the mineral content of the i^th^ calibration sample obtained by reference method (in case of this study, the Rietveld method of XRD) and n is the number of calibration samples.
The validation error of the PLSR model, expressed as the root mean square error of prediction (RMSEP), was calculated by Eq. (2).
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\:RMSEP=\:\sqrt{\frac{\sum\:_{i=1}^{n}{({c}_{iv,\:pred}-{c}_{iv,\:ref})}^{2}}{m}}$$\end{document}Where civ, pred is the mineral content of the i^th^ validation sample predicted from the PLS model, civ, ref is the mineral content of the i^th^ validation sample obtained by reference method (in case of this study, the Rietveld method of XRD) and m is the number of used validation samples. The calculations of these parameters were performed by the TQ Analyst software.
Results and discussion
Spectral features of minerals
The obtained Raman spectra contained a relatively high level of fluorescence and therefore, it would be appropriate to make measurements using a laser with wavelength of 780–1064 nm^30^. In this case, however, several expected spectral features were absent from the spectra due to the low excitation energy. For this reason, PLSR models were created from the spectra obtained using a laser with excitation wavelength of 633 nm. The Raman spectra of selected sedimentary rock samples collected using a laser with excitation wavelength of 633 nm are shown in Fig. 1.
Fig. 1. Raman spectra of selected sedimentary rock samples used in this study (after fluorescence correction).
As mentioned in Sect. Collection of claystone and clay shale samples, the studied samples were composed (based on the XRD analysis; see Table S1 and Table S2) mainly of silicate minerals such as chlorite, muscovite, quartz and albite. Other minerals (calcite, dolomite, kaolinite, ankerite, pyrite, hematite, siderite, orthoclase and rutile) were present in very small quantities and were detected only in limited number of samples. The presence of silicate minerals was also confirmed by Raman analysis, which revealed characteristic bands of chlorite (∼102 cm^− 1^, ∼359 cm^− 1^ and ∼611 cm^− 1^)^52^, muscovite (∼266 cm^− 1^, ∼411 cm^− 1^, ∼705 cm^− 1^ and ∼760 cm^− 1^)^53^, quartz (∼207 cm^− 1^ and ∼465 cm^− 1^)^50,52^ and albite (∼111 cm^− 1^, ∼166 cm^− 1^, ∼211 cm^− 1^, ∼250 cm^− 1^, ∼294 cm^− 1^, ∼482 cm^− 1^, and ∼511 cm^− 1^)^54^. Several characteristic Raman bands of these silicate minerals were found to overlap (e.g. bands characteristic for albite ∼211 cm^− 1^ and ∼250 cm^− 1^ were overlapped by ∼207 cm^− 1^ of quartz). However, the presence of several silicate mineral Raman bands in the spectra of claystones and clay shales was not always guaranteed, since they were present in some samples in small quantities and the phenomenon of fluorescence led to the loss of spectral information. Apart from Raman bands of silicate minerals, several samples exhibited Raman bands of dolomite (∼1098 cm^− 1^) and calcite (∼1088 cm^− 1^)^55^. The presence of other minerals was not detected using Raman spectroscopy. The D (∼1340 cm^− 1^), G (∼1580 cm^− 1^) and D2 (∼1620 cm^− 1^) bands, which are characteristic of carbonaceous materials^37^, were also frequently observed in the Raman spectra of studied samples.
Performance of PLS1 models
The multivariate calibration models for content prediction of silicate minerals (chlorite, muscovite, quartz and albite) in claystones and clay shales were developed using the partial least squares (PLS) regression. The PLS can be categorized into two algorithms: PLS1 (standard algorithm) and PLS2. The difference between the two algorithms lies in the number of y variables that are used for PLS modelling. The PLS1 algorithm uses only one y variable, whereas the PLS2 algorithm is capable of modelling multiple y variables simultaneously. In practice this means that the PLS1 is employed for the quantification of each component individually, whereas the PLS2 for the quantification of all components simultaneously. The use of PLS2 algorithm is faster, however the number of optimum factors is calculated for all components even if the optimal number of factors may vary for each component. This fact in many cases leads to worse results^12,44,45,56^. For this reason, PLS1 algorithm was employed in the present study.
The developed PLSR models for each mineral are shown in Fig. 2. Statistical parameters characterizing the performance of the developed PLSR models, are listed in Table 1. PLSR models developed for predicting silicate mineral composition of sedimentary rocks showed good performances. The range of correlation coefficient for calibration set (R) was from 0.9152 to 0.9790, the range of correlation coefficient for validation set (R_P_) was from 0.8479 to 0.9642. RMSEC values varied between 1.4 and 4.4 wt % and RMSEP values between 1.8 and 4.9 wt %. In most cases, the RMSE values in this study were lower than those obtained from PLSR models based on FTIR spectra of claystones and clay shales from previous study. The RMSE values for albite were only slightly greater^46^. Optimal number of factors varied between 5 and 9 based on the minimum value of RMSECV, ranging from 2.6 to 7.0 wt %. As an example, the relationship between the RMSECV and number of PLS factors for calibration model of chlorite is demonstrated on Fig. 3. In this case, the minimum value of RMSECV was achieved at 7 PLS factors.
Fig. 2. Plots of predicted (Raman spectroscopy) vs. actual (Rietveld method of XRD analysis) content for each mineral in claystones and clay shales. Calibration standards are represented by the black points and validation standards by the red points.
Table 1. Parameters of developed PLSR models for semi-quantitative estimation of silicate mineral content based on Raman spectra.Mineral R
R P RMSEC(wt %)RMSEP(wt %)RMSECV(wt %)Number of factorsChlorite0.97900.96421.41.82.67Muscovite0.92060.90614.44.97.09Quartz0.92650.91003.64.86.46Albite0.91520.84792.53.24.15
Fig. 3. The relationship between the root mean square error of cross-validation (RMSECV) and number of PLS factors for calibration model of chlorite.
Analysis of control samples (accuracy)
The accuracy of the four PLSR models to predict mineral content in unknown samples was tested using six control samples (C1-C6). The Raman spectra of those control samples were inserted into TQ Analyst software which then calculated their mineral content based on the developed PLSR models. The results of the studied silicate mineral content in the control samples predicted by PLSR models and the studied silicate mineral content in the same control samples obtained by XRD are given in Table 2. Uncertainties of mineral content predicted by PLSR models correspond to RMSEP values^22^.
Table 2XRD determined vs. predicted mineral content for claystones and clay shales.SampleChloriteMuscoviteQuartzAlbiteXRDPLSXRDPLSXRDPLSXRDPLSC0117.6 ± 1.816.1 ± 1.832.4 ± 1.733.8 ± 4.930.1 ± 1.234.7 ± 4.819.8 ± 1.118.9 ± 3.2C0218.0 ± 2.517.5 ± 1.825.4 ± 2.429.8 ± 4.936.9 ± 1.834.5 ± 4.814.7 ± 1.414.6 ± 3.2C0322.8 ± 2.619.8 ± 1.834.0 ± 2.831.5 ± 4.932.4 ± 1.833.1 ± 4.810.9 ± 1.611.6 ± 3.2C045.5 ± 2.81.3 ± 1.814.4 ± 4.28.0 ± 4.927.3 ± 2.019.0 ± 4.8< 1.0N/AC0522.5 ± 4.821.2 ± 1.845.9 ± 4.832.2 ± 4.925.9 ± 2.732.3 ± 4.83.0 ± 2.08.9 ± 3.2C0621.6 ± 2.419.6 ± 1.831.9 ± 2.431.5 ± 4.934.0 ± 1.632.7 ± 4.812.6 ± 1.412.6 ± 3.2
To determine the difference between the quantity obtained using the reference method and the quantity obtained from chemometric calibration models, the following were calculated:
- Relative error (RE)
- 2)Standard error of prediction (SEP)
In these equations, ci, pred is the mineral content value of the i^th^ control sample predicted from the PLSR model, ci, ref is the mineral content value of the i^th^ control sample obtained by the reference method (the Rietveld method of XRD analysis) and n is the number of control samples. It should be noted that the accuracy of the results obtained from developed PLSR models is inherently limited and depends on the accuracy of the reference method (in case of this study, the Rietveld method of XRD analysis). Therefore, the accuracy of the results obtained from PLSR models cannot be higher than the accuracy of the reference method.
The accuracy expressed by the relative error (RE) for each control sample is shown in Table 3. Some developed PLSR models demonstrated a lack of efficacy in describing lower content values due to a low number of calibration samples describing lower content values. A comparable phenomenon was also observed with high content values of muscovite. Consequently, samples with very low mineral content often exhibited extremely high RE values resulting in distortion of values of mean RE. For example, in case of albite, the difference (absolute error) between the value obtained by XRD (3.0 wt %) and value predicted by PLSR model (8.9 wt %) for sample C05 was 5.9 wt % and RE was 196.7%. In case of muscovite, the content value obtained by XRD was 45.9 wt % and the content value obtained from PLSR model was 32.2 wt %, this difference was even greater (13.7 wt %) but RE value was only 29.9%. The fact that RE for lower values increases rapidly is evident from the mathematical formulation. For chlorite, muscovite and quartz the mean RE was below 20%. For albite, the mean RE value was ~ 40%.
Table 3. Relative error (RE) values calculated for each mineral and control sample.SamplesRE (%)ChloriteMuscoviteQuartzAlbiteC018.54.315.34.6C022.817.36.50.7C0313.27.42.26.4C0476.444.430.4N/AC055.829.924.7196.7C069.31.33.80.0 Mean 19.317.413.841.7
The accuracy expressed by standard error of prediction (SEP) ranged from 2.4 wt % to 6.5 wt %. The best SEP value had PLSR model for the prediction of chlorite content with 2.4 wt %. In contrast, the highest value of SEP (6.5 wt %) was observed by the PLSR model of muscovite due to a large difference between the muscovite content predicted by PLSR model and muscovite content obtained using XRD in sample C05.
A comparison of the obtained results of accuracy with the previous study^46^ which was focused on the quantification of the same minerals in the same type of samples using combination of FTIR spectroscopy (DRIFT and KBr pellet techniques) and PLSR, is presented in Table 4. The standard error of prediction (SEP) values obtained in this study for chlorite, muscovite and quartz were found to be either almost identical or lower (in case of muscovite) compared to those reported in previous study^46^. In the case of albite, the SEP value was found to be higher than the SEP value reported in FTIR study^46^. The mean relative errors (RE) of chlorite, muscovite, quartz and albite were either similar or greater to those obtained by FTIR spectroscopy. This is mainly because the PLSR models developed using infrared spectra (both DRIFT and pellet technique) were more accurate (mostly in case of albite) in describing lower mineral content values. However, except for low mineral content, the accuracy of the results obtained using Raman spectroscopy was comparable to those obtained using infrared spectroscopy^46^.
Table 4. Comparison of accuracy of PLSR models based on Raman (current study) and FTIR spectra.TechniqueChloriteMuscoviteQuartzAlbiteSEP(wt %)RE(%)SEP(wt %)RE(%)SEP(wt %)RE(%)SEP(wt %)RE(%)Raman spectroscopy2.419.36.517.44.813.82.741.7FTIR spectroscopy (DRIFT)^46^2.420.710.312.94.910.51.719.4FTIR spectroscopy (KBr pellet)^46^2.111.519.113.94.212.61.69.1
Analysis of control samples (repeatability)
The precision of the developed PLSR models was demonstrated on the same samples that were used for accuracy testing (C01-C06). Each sample was measured ten times (10 portions of the re-homogenized same sample) under the same condition of measurement and on the same day. Measured spectra were then used for prediction of chlorite, muscovite, quartz and albite content using developed PLSR models. A relative standard deviation (RSD) was used for estimation of repeatability of measurement. The results of testing repeatability are presented in Table 5. For most of the control samples and PLSR models, the repeatability, expressed as RSD, was less than 10%, indicating that the samples were relatively homogeneous. The highest RSD values were exhibited by sample C04, particularly in the prediction of chlorite content due to its low content leading to the higher RSD values. The Raman spectrum of control sample C04 was of considerably lower quality as it was strongly affected by high level of fluorescent emission. The high RSD values may therefore indicate inappropriateness of Raman spectra of highly fluorescent samples for prediction of mineral composition using PLSR.
Table 5. Repeatability of mineral content prediction in claystones and clay shales using PLSR.Predicted mineral content (wt %)MineralSample12345678910RSD(%)ChloriteC0112.815.916.517.215.014.316.713.415.015.39.4C0215.813.217.715.717.815.416.217.014.716.48.7C0319.719.916.519.818.918.519.018.516.517.96.7C04−0.2−23.70.7−5.0−4.4−2.0−13.0−4.0−7.0−7.0−109.4C0521.221.221.221.321.121.221.221.421.321.20.4C0620.320.120.120.120.420.020.419.720.420.01.3MuscoviteC0124.128.629.030.627.927.231.526.931.430.38.1C0229.723.930.329.731.729.128.829.727.828.67.1C0331.733.128.332.031.330.431.430.026.829.46.2C0437.211.941.931.829.031.832.434.835.335.324.7C0532.732.632.732.832.733.033.133.133.233.10.7C0632.432.332.432.232.632.232.432.032.732.30.6QuartzC0137.935.035.234.635.437.934.837.736.935.23.8C0235.638.735.336.134.436.735.034.736.935.03.6C0334.134.035.633.434.834.734.035.236.834.92.8C0428.22.530.521.222.518.522.521.523.522.035.1C0531.631.631.731.531.631.631.631.431.631.60.2C0632.632.732.632.532.332.632.632.932.332.40.6AlbiteC0131.924.223.020.327.528.521.930.522.624.515.3C0215.021.713.811.912.515.818.315.215.016.918.3C0311.711.714.512.810.212.711.79.714.314.513.8C04N/AN/AN/AN/AN/AN/AN/AN/AN/AN/AN/AC0510.210.410.59.910.49.89.79.89.49.93.6C0610.210.410.511.110.610.911.211.910.610.84.5
Conclusions
In this paper, Raman spectroscopy was employed in combination with partial least squares regression (PLSR) for semi-quantitative estimation of silicate mineral (chlorite, muscovite, quartz and albite) content in sedimentary rocks. Samples of claystones and clay shales with known composition (determined by XRD) were utilized as calibration and validation samples in the process of PLSR models development. The correlation coefficients for calibration and validation samples were > 0.91 and > 0.84, respectively. RMSEC values varied between 1.4 and 4.4 wt % and RMSEP values between 1.8 and 4.9 wt %. The developed PLSR models showed relatively good predictive capabilities for control samples. However, some developed PLSR models demonstrated a lack of efficacy in describing lower and higher content values due to a low number of calibration samples representing lower or higher content values. The accuracy was expressed as a standard error of prediction (SEP) which ranged from 2.4 wt % to 6.5 wt % and as a mean relative error (RE) which was < 20% for chlorite, muscovite and quartz and ~ 40% for albite. The repeatability of measurements expressed as relative standard deviation (RSD) was < 20%. The results obtained using Raman spectroscopy were comparable to those obtained using FTIR spectroscopy^46^. In terms of accuracy, the utilization of Raman spectroscopy and PLSR presented in this study cannot compete with XRD analysis (as it depends on the accuracy of the XRD results). However, this approach has the potential to be used for rapid and non-destructive on-site estimation of chlorite, muscovite, quartz and albite content in samples of clay shales and claystones. Moreover, this approach is also suitable for the analysis of samples that are available in very small amounts.
Supplementary Information
Below is the link to the electronic supplementary material.
Supplementary Material 1
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Kumar, A. & Lingfa, P. Sodium bentonite and kaolin clays: Comparative study on their FT-IR, XRF, and XRD. Mater. Today: Proc. 22, 737–742. 10.1016/j.matpr.2019.10.037 (2020).
- 2Pan, M. W., Benner, R. E. & Smith, L. M. Continuous lasers for Raman spectrometry in Handbook of Vibrational Spectroscopy (eds Chalmers, J. M. & Griffiths, P. R.) 490–506 (Wiley, 2002).
- 3Esbensen, K. H. Multivariate Data analysis - In Practice: an Introduction To Multivariate Data analysis and Experimental Design Fifth edn (CAMO, 2006).
- 4Brereton, R. G. Calibration. In Applied Chemometrics for Scientists. 193–220 (Wiley, 2007).
