A decade-long seamless-continuity daily L-band soil moisture product derived from SMOS observations since 2010
Yu Bai, Li Jia, Tianjie Zhao, Zhiqing Peng, Jingyao Zheng, Chaolei Zheng, Ping Tang, Jiancheng Shi

TL;DR
This paper presents a new method to create a continuous, 10-year global soil moisture dataset from satellite data, improving long-term climate and hydrology studies.
Contribution
A novel automated gap-filling method (DCT-PLS) that generates a seamless, decade-long soil moisture product without external data.
Findings
The method achieved high fidelity with R > 0.9 and RMSE < 0.04 m³/m³ in synthetic gap reconstruction.
The product matched in-situ measurements with R > 0.7 and unbiased RMSE = 0.057 m³/m³.
The dataset preserved spatial patterns and seasonal amplitudes globally without boundary artifacts.
Abstract
Soil moisture (SM) is a fundamental state variable in the Earth system that governs water, energy, and carbon exchanges between land and atmosphere. Current satellite-derived SM products, such as SMOS (Soil Moisture and Ocean Salinity) SM product, exhibit pronounced spatial and temporal discontinuities caused by orbital coverage, radio frequency interference, and retrieval failures, which severely limit their utility to long-term eco-hydrological studies. This study introduces a fully automated gap-filling method that combines Discrete Cosine Transformation with Partial Least Squares (DCT-PLS). The method exploits the intrinsic spatiotemporal coherence of the original SMOS Multi-Temporal and Multi-Angular (MTMA) product without relying on any external ancillary data. It was applied to the entire SMOS MTMA archive to generate a seamless-continuity product (MTMA-SC_SM) at 25 km and daily…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 10
Figure 11
Figure 1
Figure 2
Figure 3
Figure 4
Figure 5
Figure 6
Figure 7
Figure 8
Figure 9- —the National Natural Science Foundation of China (Grant No. 42090014)
Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsSoil Moisture and Remote Sensing · Climate change and permafrost · Synthetic Aperture Radar (SAR) Applications and Techniques
Background & Summary
Soil moisture is a key variable in the global hydrological cycle system and plays a crucial role in research on global energy cycling and environmental change^1–6^. Microwave remote sensing has become an important means of monitoring global surface soil moisture due to its advantages of being all-day, all-weather and sensitive to the nuances of soil moisture changes^1,7–9^. This technology has been embraced for its ability to penetrate vegetation and detect even subtle changes in soil moisture, with the L-band in particular being esteemed for its monitoring capabilities^10–12^.
At present, the satellite-derived global daily soil moisture products mainly come from active and passive satellite sensors, such as ASCAT (Advanced Scatterometer), AMSR-E/2 (Advanced Microwave Scanning Radiometer-EOS/2)^13,14^, FengYun-3 MWRI (Microwave Radiation Imager)^15^, SMOS (Soil Moisture and Ocean Salinity)^16,17^, and SMAP (Soil Moisture Active and Passive)^18–21^. However, due to satellite orbit coverage issues, potential satellite malfunction, radio frequency interference (RFI), and challenges associated with retrieval algorithms (the error accumulation in the soil moisture retrievals during the decoupling of vegetation and soil information via retrieval algorithms, or the non-convergence of the solution results caused by the nonlinear retrieval algorithms), there are gaps in these daily global soil moisture products. These data voids can limit the application of soil moisture products for modelling global climate change and land surface processes^22–26^, especially for daily temporal analyses. For example, global (or large-scale) evapotranspiration estimation or spatial boundary capture of saturated-unsaturated zone for hydrological models (such as SWAT model) require spatiotemporal continuous soil moisture data as input. However, the presence of gaps may affect the simulation process. Gaps can lead to the fragmentation of drought range estimated based on soil moisture data, making it difficult to accurately determine the continuous distribution of drought, which in turn affects the estimation of drought areas. In addition, assessing the expansion or reversal trend of desertified areas also requires spatiotemporal continuous soil moisture data, and gaps in soil moisture data can lead to ‘breakpoints’ in desertification trends, resulting in misjudgments.
To tackle the issue of data gaps in satellite-derived daily soil moisture products, researchers have undertaken extensive investigations into the reconstruction of missing data. Several types of reconstruction methods have been explored, each with its unique focus and application. (1) One such category is the spatial-based reconstruction methods^27,28^. These methods encompass a range of techniques including the nearest neighbor interpolation, inverse distance weighting interpolation^29^, the Tyson polygon method^30,31^, spline method, and the kriging method^32^. These techniques are particularly adept at utilizing the spatial correlation present within the data, employing the relationships between nearby valid measurements to infer values for missing data points. However, these spatial methods inherently prioritize spatial relationships over temporal dynamics. This focus, although beneficial for highlighting spatial patterns, may result in a less pronounced representation of temporal variations. In scenarios where the landscape is highly variable or where there is a significant absence of spatial data, the performance of these methods might not fully meet the expectations set by more homogenous conditions. (2) The temporal methods capitalize on the intrinsic dynamics of time series data to perform reconstructions^33–35^, including a variety of techniques such as the linear interpolation^36^, spline interpolation^37^, the time series harmonic analysis through Fourier transform^38^, and singular spectrum analysis in an iterative reconstruction framework^39^. These temporal methods stand out for their ability to capture the evolving patterns within time series data. However, the efficacy of these methods is particularly sensitive to the selection of the time interval for analysis. When intervals are too short, the method may not adequately capture the necessary temporal correlations, potentially leading to less effective reconstructions. Conversely, when the time interval is overly long, the surface parameters can undergo substantial variations, thereby compromising the reliability of reconstruction results^33^. (3) Spatiotemporal methods, relying on the synthesis of spatial and temporal dimensions, integrate one-dimensional temporal dynamics with two-dimensional spatial patterns to offer a multifaceted approach to addressing data gaps^40,41^. Although Borak et al.^42^ and Pede et al.^43^ have confirmed the superior capability of spatiotemporal information in reconstructing missing data compared to single-dimension (temporal or spatial) approaches, it is essential to acknowledge that the incorporation of auxiliary data in these spatiotemporal-based approaches introduces uncertainties that can impact the reconstruction results^44^. (4) Machine learning has introduced a transformative approach to soil moisture data reconstruction, with methodologies such as neural networks, random forest, deep learning network and decision trees being utilized to create long-term, continuous datasets from AMSR-E/2^24,45,46^, SMAP^47^, and FengYun-3^48^ sensors. While these models excel at capturing complex patterns, their performance is inherently linked to the quality of the training data and the associated feature parameters. As a result, the inherent variability in these parameters must be carefully considered to ensure the robustness of such models^49^.
Overall, the current research predominantly focuses on the reconstruction of global seamless daily soil moisture products at high frequencies (such as the C/X bands for AMSR-E/2 sensors)^24,45,46^, while limited attention is given to the reconstruction of soil moisture products at lower frequencies (L-band). Compared to high-frequency (C/X-band) microwaves, low-frequency (L-band) microwaves can penetrate sparse and moderately dense vegetation to obtain soil information at a depth of 0~5 cm^9,50^. The L-band is considered the best band for obtaining surface soil moisture due to its higher sensitivity to surface soil moisture^11,50^. Zheng et al. found that the overall behavior of L-band soil moisture products was better than that of high-frequency (the C/X bands for AMSR-E/2 and FengYun-3 sensors) soil moisture products^11^. As the vegetation becomes denser, the performance of the low-frequency (L-band) soil moisture product is superior to that of the high-frequency products^50^. Moreover, to mitigate the influence of external auxiliary information and effectively exploit the inherent spatiotemporal characteristics of the dataset, the Discrete Cosine Transformation-Partial Least Squares (DCT-PLS) method^51^ is used to reconstruct missing data in this study. The DCT-PLS method belongs to a spatiotemporal method (as described in category 3 of traditional reconstruction methods mentioned above). However, compared to current spatiotemporal methods, the DCT-PLS method is independent of auxiliary data and relies on the spatiotemporal characteristics of the dataset itself as a basis to fill gaps^51^. In this study, we applied the DCT-PLS method to reconstruct missing data in soil moisture products generated by the world’s first SMOS L-band sensor, resulting in a comprehensive, global spatiotemporal seamless daily soil moisture product (2010 to 2020) at L-band.
Methods
The DCT-PLS method
Maximizing the utilization of temporal continuity and spatial correlation inherent in the dataset itself, while minimizing errors introduced by auxiliary information, is a crucial strategy for achieving seamless spatio-temporal filling. One feasible approach for this is the utilization of the DCT-PLS method^51^, which leverages the one-dimensional time series continuity and two-dimensional spatial consistency information of the original soil moisture product to reconstruct missing data regions. The objective of this study is to employ the DCT-PLS method to fill the gaps of SMOS L-band soil moisture products, and then to obtain long-term global spatiotemporal seamless daily soil moisture products, thereby enhancing the usability of this soil moisture product for spatio-temporal characteristics analysis and the assessment of global climate change in specific regions.
The DCT-PLS method mainly uses the partial least square method to reconstruct missing data:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$F(\hat{y})={RSS}+s\cdot P(\hat{y})={{\rm{||}}\hat{y}-y{\rm{||}}}^{2}+s\cdot P(\hat{y})$$\end{document}where, ||·|| represents the Euclid norm, RSS represents the residual sum of squares and is used to measure the fidelity of the smoothing results, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{y}$$\end{document} represents the smoothed soil moisture data, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P\left(\hat{y}\right)$$\end{document} represents the roughness (smoothness) of the data without smoothing, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s$$\end{document} represents the positive real number used to control the smoothness and the smoothness degree of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{y}$$\end{document} increases with the increase of the parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s$$\end{document} . The primary objective of the DCT-PLS method is to achieve the smoothing of the original data \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y$$\end{document} by determining the optimal solution of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{y}$$\end{document} .
Based on second-order difference, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P\left(\hat{y}\right)$$\end{document} can be expressed by:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$P\left(\hat{y}\right)={{\rm{||}}D\cdot \hat{y}{\rm{||}}}^{2}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D=\left(\begin{array}{crccccc}-1 & 1 & & & & & \\ 1 & -2 & 1 & & & & \\ & 1 & -2 & & & & \\ & & & \ddots & & & \\ & & & & -2 & 1 & \\ & & & & 1 & -2 & 1\\ & & & & & 1 & -1\end{array}\right)$$\end{document}where, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D$$\end{document} represents the tridiagonal matrix. Given that the pixel distances in the soil moisture products generated using the MTMA method are equal, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D$$\end{document} can be represented by Eq. (3). By substituting Eq. (2) into Eq. (1), the linear relationship between the original soil moisture data and the smoothed soil moisture data can be obtained:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$y=\left({I}_{n}+s\cdot {D}^{T}\cdot D\right)\cdot \hat{y}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D=U\cdot \varLambda \cdot {U}^{-1}$$\end{document}where, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${I}_{n}$$\end{document} represents the diagonal matrix of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$n\times n$$\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${D}^{T}$$\end{document} is the transposition of the matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D$$\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$U$$\end{document} is the unitary matrix and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${U}^{-1}={U}^{T}$$\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$U\cdot {U}^{T}={I}_{n}$$\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varLambda $$\end{document} ( \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\varLambda ={diag}\left({\lambda }_{1},{\lambda }_{2},\ldots ,{\lambda }_{n}\right)$$\end{document} , \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\lambda }_{i}=-2+2\cdot \cos \,(\left(i-1\right)\cdot \pi /n)$$\end{document} ) is the diagonal matrix containing the eigenvalue of matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D$$\end{document} . Equation (5) is obtained by eigen-decomposition of matrix \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$D$$\end{document} .
Equation (4) can be rewritten as:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{y}=U\cdot {\left({I}_{n}+s\cdot {\varLambda }^{2}\right)}^{-1}\cdot {U}^{T}\cdot y=U\cdot \varGamma \cdot {U}^{T}\cdot y$$\end{document}where, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$U$$\end{document} and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${U}^{T}$$\end{document} are the discrete cosine transform (DCT) matrix and the inverse matrix of the discrete cosine transform (IDCT) matrix of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$n\times n$$\end{document} respectively, Γ is the diagonal matrix containing the parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s$$\end{document} . When smoothing one-dimensional time series data, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varGamma }_{i}={\left[1+s\cdot {\left(2-2\cdot \cos (\left(i-1\right)\cdot \pi /n)\right)}^{2}\right]}^{-1}$$\end{document} , and when smoothing three-dimensional spatiotemporal data, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\varGamma }_{{i}_{1}{,i}_{2},{i}_{3}}={[1+s\cdot (\mathop{\sum }\limits_{j=1}^{3}{\left(2-\cos (\left({i}_{j}-1\right)\cdot \pi /{n}_{j})\right)}^{2})]}^{-1}$$\end{document} . Equation (6) also can be rewritten as:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\hat{y}={IDCT}\left(\varGamma \cdot {DCT}\left(y\right)\right)$$\end{document}According to Eq. (7), the reconstruction process of missing soil moisture using the DCT-PLS method involves performing DCT and IDCT transformations sequentially on the original soil moisture data. The performance of reconstruction results is primarily influenced by the parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s$$\end{document} . The parameter \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$s$$\end{document} is typically determined using the generalized cross-validation method, which involves iterative optimization to minimize the smoothing error^51^.
Evaluation by generating artificial gaps
This study assesses the performance of the DCT-PLS method in spatiotemporal reconstruction of soil moisture through simulation experiments with artificially generated gaps.
- time-series validationThe performance of the DCT-PLS method in reconstructing simulated missing time-series soil moisture is assessed based on in-situ data from 22 ground soil moisture observation networks in this study. For each soil moisture network, one-third of the in-situ data are randomly selected and replaced with ‘NaN’ values (i.e., as gaps). The DCT-PLS method is then used to reconstruct the soil moisture values at these locations. It should be noted that one-third of the randomly selected in-situ data from each soil moisture network were used as the ‘true value’ to assess the reconstruction results.
- Simulated missing-region validation
In addition to assessing the time series consistency between the soil moisture reconstructed by the DCT-PLS method and the in-situ data, it is also necessary to assess the DCT-PLS method’s capability in reconstructing spatially continuous soil moisture. To further demonstrate this, the study evaluated the performance of the DCT-PLS method in reconstructing soil moisture over regions with missing data.
Figure 1 shows an example of the five simulated missing data regions selected based on the original SMOS MTMA daily soil moisture product. The values within these regions are replaced with ‘NaN’. The soil moisture values in these areas were then reconstructed using the DCT-PLS method. This study compares the reconstructed values with the original values to evaluate the fidelity of the DCT-PLS method in reconstructing spatial continuity information.Fig. 1. Global spatial distribution of original soil moisture with five simulated missing data areas in 1 June 2017.
Accuracy metrics
In-situ validation is the most reliable method for evaluating the applicability of the DCT-PLS method^52–54^. To quantitatively evaluate the performance of the SMOS MTMA-SC_SM products obtained using the DCT-PLS method, five validation indices are used to compare in-situ soil moisture and reconstructed soil moisture in this study. These indices include the mean absolute error (MAE), the unbiased root mean squared error (ubRMSE), root mean squared error (RMSE), the Bias, and the correlation coefficient (R). The equations for calculating these five indices are as follows:
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${MAE}=\frac{\mathop{\sum }\limits_{i=1}^{n}|{{SM}}_{i}^{{est}}-{{SM}}_{i}^{{obs}}|}{n}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$R=\frac{\mathop{\sum }\limits_{i=1}^{n}\left({{SM}}_{i}^{{est}}-E\left[{{SM}}^{{est}}\right]\right)\cdot \left({{SM}}_{i}^{{obs}}-E\left[{{SM}}^{{obs}}\right]\right)}{\sqrt{{\mathop{\sum }\limits_{i=1}^{n}\left({{SM}}_{i}^{{est}}-E\left[{{SM}}^{{est}}\right]\right)}^{2}\cdot {\mathop{\sum }\limits_{i=1}^{n}\left({{SM}}_{i}^{{obs}}-E\left[{{SM}}^{{obs}}\right]\right)}^{2}}}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${RMSE}=\sqrt{E\left[{\left({{SM}}^{{est}}-{{SM}}^{{obs}}\right)}^{2}\right]}$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Bias}=E\left[{{SM}}^{{est}}\right]-E\left[{{SM}}^{{obs}}\right]$$\end{document} \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ubRMSE}=\sqrt{{{RMSE}}^{2}-{{Bias}}^{2}}$$\end{document}where, E [.] represents the expectation or linear averaging operator, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{SM}}^{{est}}$$\end{document} represents the estimated soil moisture, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{SM}}^{{obs}}$$\end{document} represents the in-situ soil moisture.
SMOS MTMA soil moisture products
In this study, soil moisture products (referred to as MTMA_SM) from June 2010 to December 2020 are obtained using the MTMA method^17,55^ based on SMOS H-pol multi-angular brightness temperature data. The MTMA_SM product exhibits a temporal resolution of one day and a spatial resolution of 25 km. Compared to SMOS Level 2 (L2)/Level 3 (L3)^34,56^ and SMOS-IC^16,57^ products, the global daily land coverage of MTMA_SM products is higher. Figure 1 presents the global spatial distribution of MTMA_SM products on 1 June, 2017. However, due to the impact of satellite orbit, satellite malfunction, radio frequency interference (RFI), and ineffective retrievals caused by the uncertainty of the soil moisture retrieval algorithm itself, there is a significant phenomenon of missing data in MTMA_SM daily products on the Earth’s land surface (Fig. 1). Taking 2017 as an example, the average land surface coverage rate of MTMA_SM daily products ranges from 39.42% to 50.92% by statistics.
In addition, Fig. 2 also illustrates the global spatial distribution of coverage rate of soil moisture retrievals for each pixel on the global land surface in 2017. It is evident that the coverage rate of soil moisture retrieval results for each pixel varies significantly with latitude, with the higher coverage rate in high latitude areas due to shorter satellite revisit periods, and lower coverage rate in low latitude areas due to longer satellite revisit periods. Moreover, the global average land surface coverage rate of the MTMA_SM daily products was calculated to be 45.98%. It is clear that the spatiotemporal discontinuity of the SMOS MTMA_SM product makes it unsuitable for subsequent spatiotemporal feature analysis.Fig. 2. Spatial distribution of coverage rate of soil moisture retrievals in each pixel in 2017.
In-situ soil moisture measurements
In this study, in-situ soil moisture data are collected from 22 soil moisture observation networks (a total of 398 sites) from the International Soil Moisture Network (ISMN, https://ismn.meaemaearth/en/) and the Long-Term Agroecosystem Research (LTAR) network to quantitatively evaluate the performance of the reconstructed soil moisture. Figure 3 presents the spatial distribution of all ground validation networks used in this study. These soil moisture networks are distributed across five continents (Asia, Europe, Africa, North America, Oceania) in different climatic regions and cover the period from 2010 to 2019. More details of each soil moisture network, including name, land cover type, measuring depth, etc., are given in Table 1. The MTMA_SM products are mainly derived using the SMOS brightness temperature data at 6:00 A.M. Consequently, the corresponding in-situ soil moisture data were extracted at the same time to ensure consistency. The in-situ measurements are primarily used to evaluate the reliability of the DCT-PLS method and the accuracy of the reconstructed soil moisture.Fig. 3. Spatial distribution of the location of the ground soil moisture network.Table 1. The information of in-situ soil moisture observation networks used for validation.Network/SitesCountryLand CoverTotal sitesMeasuring depthReferencesAMMA-CATCH (Benin)Beninsavannas405 cm^60^AMMA-CATCH (Niger)Nigergrasslands305 cm^60^BNZ-LTERAmericawoody savannas1005 cm^61^FMIFinlandwoody savannas2205 cm^62^HOBEDenmarkcroplands3205 cm^63^MySMNetMalaysiawoody savannas and evergreen broadleaf forest705 cm^64^RISMA (Manitoba)Canadacroplands and cropland/natural vegetation mosaics905 cm^65^RISMA (Saskatchewan)Canadacroplands and cropland/natural vegetation mosaics405 cm^65^SMN-SDRChinagrasslands3403 cm^11,66^TERENOGermanycroplands, urban areas, buildings, and mixed forests405 cm^67^Reynolds CreekAmericagrasslands and croplands1805 cm^68^REMEDHUSSpaincroplands2005 cm^69^YancoAustraliagrasslands and croplands1205 cm^70^KyeambaAustraliacroplands705 cm^70^South ForkAmericacroplands2005 cm^71^Fort CobbAmericagrasslands and croplands1405 cm^72,73^Little RiverAmericacroplands and woody savannas3305 cm^74,75^Little WashitaAmericagrasslands1905 cm^72,73^Walnut GulchAmericashrub open2905 cm^76^AliChinabare area2005 cm^77^NaquChinagrasslands5705 cm^78^MaquChinagrasslands2005 cm^79^
Data Records
The SMOS MTMA-SC global daily soil moisture dataset (2010 to 2020)^58^ with a spatial resolution of 25 km (projection type: the EASE-GRID 2.0) is located at 10.11888/Terre.tpdc.303001. All data are stored in NetCDF format, and each file contains variables including soil moisture (SM, m^3^/m^3^), latitude (lat, °), and longitude (lon, °). The file name is “MTMA-SC_SMOS_25KM_YYYYMMDD_V01.nc”, where “MTMA” represents the Multi-Temporal and Multi-Angular method, “SC” represents the seamless-continuity product, “SMOS” represents the Soil Moisture and Ocean Salinity, “YYYY” represents the year, “MM” stands for the month, “DD” stands for the day, “25KM” stands for the spatial resolution of 25 km, and “V01” stands for the version.
Technical Validation
Performance assessment of the DCT-PLS method
The following analyses are presented to support the technical reliability of the DCT-PLS method.
Experiment with simulated missing data in time series
This section focuses on evaluating the time series reconstruction capability of the DCT-PLS method using in-situ soil moisture data from 22 soil moisture observation networks.
The time series of in-situ and reconstructed soil moisture for 10 soil moisture observation networks (Naqu, Maqu, Yanco, Kyeamba, Little River, Little Washita, South Fork, Fort Cobb, REMEDHUS, and Walnut Gulch) from 1 January 2010 to 31 December 2019 are shown in Fig. 4a–j. The displayed results in Fig. 4a–j correspond to the actual observation periods for each respective ground soil moisture network. In Fig. 4a–j, it was found that both the original and reconstructed time series soil moisture can clearly reveal the interannual change characteristics of soil moisture over most of the soil moisture networks. The reconstructed soil moisture, obtained using the DCT-PLS method, has a good temporal consistency with the in-situ soil moisture and effectively captures the dynamic changes in soil moisture. Relatively low soil moisture values occur mainly in winter, while relatively high soil moisture values occur mainly in rainy summer.Fig. 4. Comparison between the time series of in-situ soil moisture and reconstructed soil moisture in 2010 to 2019 at different ground validation networks (the pink hollow diamond represents the in-situ soil moisture, and the blue hollow circle represents the reconstructed soil moisture based on the DCT-PLS method).
The accuracy metrics of the reconstructed soil moisture at 22 ground soil moisture networks are listed in Table 2. In Table 2, it can be found that the correlation between the reconstructed soil moisture and the in-situ data exceeds 0.9 at all the soil moisture networks, especially in the Naqu, Maqu, and Reynolds Creeks networks where the R even exceeds 0.99. However, in certain soil moisture networks (such as the Little River and Fort Cobb networks), where soil moisture is significantly influenced by irrigation or rainfall, the consistency between reconstructed and in-situ soil moisture is slightly lower. This indicates that the soil moisture reconstructed by the DCT-PLS method may not respond effectively to sudden rainfall or irrigation events. In addition, the RMSE of the reconstructed soil moisture is less than 0.03 m^3^/m^3^ at all soil moisture networks, and the MAE is even less than 0.02 m^3^/m^3^, which is lower than the expected accuracy (0.04 m^3^/m^3^) of the SMOS satellite. According to the accuracy indicators of combining data from all the networks together in Table 2, the overall R, RMSE and MAE of the reconstructed soil moisture are 0.963, 0.015 m^3^/m^3^, and 0.008 m^3^/m^3^ respectively, indicating that the overall consistency between the reconstructed soil moisture using the DCT-PLS method and the measured soil moisture data is very well.Table 2. Quantitative evaluation results of reconstructed soil moisture based on ground soil moisture networks.Network/SitesRRMSE(m^3^/m^3^)MAE(m^3^/m^3^)NumberREMEDHUS0.9850.0090.0051086Yanco0.9640.0200.0101030Kyeamba0.9510.0290.0131001South Fork0.9520.0200.012634Fort Cobb0.9240.0210.0101123Little River0.9110.0180.0101041Little Washita0.9440.0190.0091124Walnut Gulch0.9670.0080.0041109Maqu0.9900.0160.010786Naqu0.9960.0080.005748Ali0.9790.0050.002715Benin0.9600.0210.011584Niger0.9210.0130.005608BNZ-LTER0.9950.0060.003366FMI0.9630.0100.006852HOBE0.9760.0100.0071120MySMNet0.9300.0210.014193RISMA (Manitoba)0.9660.0190.011438RISMA (Saskatchewan)0.9600.0180.008437SMN-SDR0.9750.0110.006159TERENO0.9850.0130.007924Reynolds Creek0.9910.0090.005974All networks0.9630.0150.00817052
In general, compared with the time series in-situ soil moisture from 2010 to 2019 at all soil moisture networks, the soil moisture reconstructed based on the DCT-PLS method can stably respond to the temporal variation characteristics of soil moisture, which is crucial for the analysis of time series or seasonal changes of soil moisture in specific regions. The aforementioned simulation experiment to reconstruct missing soil moisture data in time series based on in-situ data has demonstrated the robustness and reliability of soil moisture reconstruction using the DCT-PLS method.
Experiment with simulated missing data regions
In this study, five simulated missing data regions were selected globally using the original SMOS MTMA soil moisture product based on June 1, 2017 (Fig. 5a). The reliability of the DCT-PLS method in reconstructing spatially continuous soil moisture capability was then evaluated by comparing the reconstructed soil moisture with the original MTMA_SM data.Fig. 5. Global spatial distribution of original soil moisture and reconstructed soil moisture with five simulated missing data areas in 1 June 2017: (a) Global spatial distribution of original soil moisture with five simulated missing data areas (square area) and (b) global spatial distribution of reconstructed soil moisture.
In Fig. 5a,b, the reconstructed soil moisture exhibits continuous spatial texture within the five circular simulated missing data regions, respectively. Furthermore, the reconstructed soil moisture within these five circular areas also showed spatial continuity with the surrounding original soil moisture. To further illustrate the spatial distribution of the reconstructed soil moisture in the simulated missing data areas, the five selected simulated missing data regions was enlarged in Fig. 6, and compared the values in these areas with the corresponding original soil moisture. According to the first and second columns of Fig. 6, it becomes evident that the spatial distribution of the reconstructed soil moisture is closely consistent with that of the original soil moisture. This result indicates that the reconstructed soil moisture effectively reproduces the continuous characteristics of soil moisture spatial changes within the simulated missing data regions.Fig. 6. Spatial distribution and scatters of original soil moisture and reconstructed soil moisture of five simulated missing data areas in 1 June 2017: the first column represents the spatial distribution of original soil moisture products with five simulated missing data areas, the second column represents the spatial distribution of reconstructed soil moisture, and the third column represents the scatters of original soil moisture and reconstructed soil moisture of five simulated missing data areas.
This study used R, RMSE, and MAE metrics to quantitatively evaluate the accuracy of the reconstructed soil moisture in simulated missing data areas. In Fig. 6a–e, it is evident that the scatter distribution of the reconstructed MTMA-SC_SM of the selected five simulated missing data regions and the corresponding original MTMA_SM closely aligns with the 1:1 line. Moreover, the correlation coefficients between the two all exceed 0.93, providing substantial evidence of the favorable consistency between the MTMA-SC_SM and the MTMA_SM. Furthermore, the RMSE and MAE of the MTMA-SC_SM of the five simulated missing data areas range from 0.0068 to 0.032 m^3^/m^3^ and 0.0052 to 0.023 m^3^/m^3^, respectively, indicating minimal deviations from the MTMA_SM, which meets the expected accuracy (0.04 m^3^/m^3^) of the SMOS satellite. Overall, the MTMA-SC_SM not only showed a high degree of agreement with the spatial distribution characteristics of the MTMA_SM in the simulated missing data areas, but also showed negligible deviations, confirming the ability of the DCT-PLS method to reconstruct spatial continuity information.
Assessment of SMOS MTMA-SC_SM products
In the past section, this study confirmed the reliability of the DCT-PLS method. We then applied the DCT-PLS method to the SMOS MTMA_SM product, producing SMOS MTMA-SC_SM products spanning almost ten years. This section focuses on evaluating the spatio-temporal performance of the reconstructed results of the gaps in the MTMA_SM product.
The spatial patterns of the reconstructed product
Figure 7 displayed the global spatial distribution of the MTMA_SM and MTMA-SC_SM products from 1 to 3 June 2017. It should be noted that the MTMA method did not produce soil moisture data for certain landcover types such as water bodies, snow and ice, permanent wetlands and urban and built-up lands. Consequently, the soil moisture for these landcover types were not reconstructed using the DCT-PLS method in this study.Fig. 7. Global spatial distribution of original soil moisture and reconstructed soil moisture from 1 to 3 June, 2017 (the first column is the original soil moisture product, and the second column is the reconstructed soil moisture product).
From the spatial perspective, the reconstructed regions of the MTMA-SC_SM product exhibit continuous spatial information with their adjacent original-valued regions (Fig. 7b,d,f). Particularly, there are no apparent boundary effects around the high- and low-valued regions, such as in the Sahel region of Africa, Australia and southern South America. Moreover, the spatial distribution of the MTMA-SC_SM product generally reflects the spatial patterns of soil moisture in different climatic regions.
From the temporal perspective, the MTMA-SC_SM product utilises the sequential time-series soil moisture information. The time-series daily MTMA-SC_SM product results (Fig. 7b,d,f) are highly similar and correlative, but there are also some variations and differences between each other.
Therefore, this study concludes that the MTMA-SC_SM product not only exhibits continuity of spatial information, implying that the spatial information within the reconstructed data region and between it and adjacent original-valued region is continuous, but also maintains the overall consistency of temporal information on spatial distribution characteristics between daily-scale soil moisture products.
Figure 8 presents the global spatial distribution of coverage rate of soil moisture retrievals for each pixel on the global land surface for MTMA_SM and MTMA-SC_SM products from 2015 to 2017. It is evident that the coverage rate of MTMA-SC_SM soil moisture product (Fig. 8b) has increased to 100% compared to MTMA_SM product (Fig. 8a), demonstrating the effectiveness of the DCT-PLS method in improving spatiotemporal coverage.Fig. 8. Spatial distribution of coverage rate of soil moisture retrievals in each pixel from 2015 to 2017: (a) MTMA_SM product and (b) MTMA-SC_SM product.
The global average soil moisture and the seasonal amplitude (2015–2017) of the original MTMA_SM and reconstructed MTMA-SC_SM products are shown in Fig. 9 and Fig. 10, respectively. The spatial distribution of the MTMA_SM and MTMA-SC_SM products exhibits similarity and generally follows the distribution pattern of the climatic regions (Fig. 9). Seasonal amplitude (SA) is defined as the difference between the values at 95% and 5% percentiles of the data after applying a 45-day moving-window-average^13,59^. Both the MTMA_SM and MTMA-SC_SM products also show agreement in the spatial distribution of seasonal amplitude in Fig. 10a,b, with relatively high amplitude values over the Indian Peninsula, the northwest of Europe, the Sahel region, which are affected by the monsoon precipitation, and relatively low amplitude values in the tropical forests of Central Africa and the Amazon areas. The similarity in seasonal amplitude between the MTMA_SM and MTMA-SC_SM products indicates that the reconstruction results do not distort the seasonal characteristics of soil moisture, while reasonably ensuring that the spatial and temporal characteristics of soil moisture changes. The above results further confirm the satisfactory performance of the MTMA-SC_SM product, with no significant deviation in the global distribution pattern compared to the original MTMA_SM product.Fig. 9. The spatial distribution of global average soil moisture of original soil moisture (a) and reconstructed soil moisture (b) from 2015 to 2017.Fig. 10. Global spatial distribution of seasonal amplitude of original soil moisture (a) and reconstructed soil moisture (b) from 2015 to 2017.
In-situ validation
This study also quantitatively evaluated the MTMA_SM, MTMA-SC_SM, and MTMA-SC_SM-Recon (a subset of the MTMA-SC_SM product containing only the gap-filled values) products using in-situ measurements from soil moisture networks. MTMA-SC_SM is a seamless-continuity product consisting of MTMA_SM and MTMA-SC_SM-Recon. The MTMA-SC_SM-Recon product includes both intra- and inter-swath gap-filled values, with the latter being the dominant component, as illustrated by the predominance of inter-swath gaps in Fig. 7, whereas intra-swath gaps are not prominent. We present the statistical metrics of the MTMA_SM, MTMA-SC_SM-Recon, and MTMA-SC_SM, respectively.
The time series MTMA-SC_SM-Recon and MTMA_SM products at 10 soil moisture networks (Naqu, Maqu, Yanco, Kyeamba, Little River, Little Washita, South Fork, Fort Cobb, REMEDHUS, and Walnut Gulch) from 1 January 2010 to 31 December 2019 are shown in Fig. 11. The MTMA-SC_SM-Recon results exhibit a characteristic of continuous changes in soil moisture over time at most validation networks. Furthermore, there is also the back-and-forth consistency between the values of the MTMA_SM and the values of the temporally adjacent MTMA-SC_SM-Recon, i.e., if the values of the MTMA_SM product are high or low, its temporally adjacent values of the MTMA-SC_SM-Recon will also be high or low, such as in the Naqu and Maqu soil moisture networks. However, the MTMA-SC_SM-Recon product may not capture changes in soil moisture related to rainfall and agricultural irrigation in a timely manner, such as in the Kyeamba (Fig. 11f), South Fork (Fig. 11i) and Yanco (Fig. 11j) soil moisture networks. Despite this, the time series validation for the MTMA-SC_SM-Recon results also demonstrate the applicability of the DCT-PLS method used in this study.Fig. 11. Comparison between the time series of MTMA_SM and MTMA-SC_SM-Recon in 2010 to 2019 at different ground validation networks.
The accuracy metrics of the MTMA_SM, the MTMA-SC_SM-Recon and the MTMA-SC_SM products based on in-situ soil moisture data from 22 validation networks are calculated. With the exception of the MySMNet and Ali networks, the R values of the MTMA_SM, the MTMA-SC_SM-Recon and the MTMA-SC_SM products exceed 0.55 at all validation networks (Supplementary Table S1), indicating that the MTMA_SM and MTMA-SC_SM products are in good agreement with the in-situ data. Notably, the MTMA-SC_SM product demonstrates the highest R at the Maqu and BNZ-LTER networks, exceeding 0.9 (Supplementary Table S1). Conversely, the MTMA-SC_SM product has the lowest R value in the MySMNet soil moisture network, which can be attributed to the influence of the dense vegetation (the vegetation type in the MySMNet network is evergreen broadleaf forest). For dense vegetation, the uncertainty of the MTMA_SM product obtained by the MTMA method has been discussed by Bai^17^, and then this uncertainty is also inevitably transferred to the MTMA-SC_SM product. The ubRMSE of the MTMA_SM, MTMA-SC_SM-Recon and MTMA-SC_SM products range from 0.022 to 0.057 m^3^/m^3^, 0.0025 to 0.061 m^3^/m^3^, and 0.024 to 0.058 m^3^/m^3^ (Supplementary Table S1), respectively, indicating close proximity. In addition, the overall accuracy metrics of combining data from all the networks have also been calculated for the MTMA_SM, MTMA-SC_SM-Recon and MTMA-SC_SM products. The comprehensive evaluation yielded the following overall metrics for MTMA_SM, MTMA-SC_SM-Recon and MTMA-SC_SM: R (0.733, 0.742, 0.746), ubRMSE (0.057, 0.059, 0.057) m^3^/m^3^, Bias (-0.039, -0.034, -0.035) m^3^/m^3^, and RMSE (0.069, 0.068, 0.067) m^3^/m^3^ (Supplementary Table S1), suggesting minimal differences in accuracy between the three, with a remarkably high level of consistency. On the whole, the MTMA-SC_SM product agrees well with in-situ soil moisture and maintains a level of accuracy comparable to the MTMA_SM product. These validation results confirm the reliability and availability of the MTMA-SC_SM product.
Usage Notes
This study begins by introducing the principle of the DCT-PLS method and subsequently assesses its reliability for soil moisture reconstruction. Finally, the DCT-PLS method is implemented on the MTMA_SM product, and the resulting reconstructions are subjected to validation. The reliability of the DCT-PLS method for soil moisture reconstruction is demonstrated by the simulation experiments involving time series and spatial region reconstruction with missing data. And through spatiotemporal validation for the MTMA-SC_SM product, it was found that: the MTMA-SC_SM product had a good consistency with the in-situ soil moisture data (overall R > 0.7 and overall ubRMSE = 0.057 m^3^/m^3^), and the difference of each accuracy indicator between the MTMA_SM product and the MTMA-SC_SM product was less than 0.01; the spatial distribution of soil moisture in the gap-filled data area demonstrated continuity with the adjacent original-valued areas, without significant boundary effects.
In addition, there is room for improvement in the reconstructed soil moisture obtained by the DCT-PLS method. Soil moisture data reconstructed using the DCT-PLS method did not respond adequately to unexpected events (such as rainfall or irrigation events), and how to consider these effects in the DCT-PLS method should be further improved in subsequent research.
Supplementary information
Supplementary Table
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1Shi, J. et al. Airborne and Spaceborne Passive Microwave Measurements of Soil Moisture. 2, 71-105(2019).
- 2Yao, P. et al. A long term global daily soil moisture dataset derived from AMSR-E and AMSR 2 (2002–2019). 8, 143(2021).10.1038/s 41597-021-00925-8PMC 816018634045448 · doi ↗ · pubmed ↗
- 3Yao, P. et al. A global daily soil moisture dataset derived from Chinese Feng Yun Microwave Radiation Imager (MWRI)(2010–2019). 10, 133 (2023).10.1038/s 41597-023-02007-3PMC 1001496636918527 · doi ↗ · pubmed ↗
- 4Zhao, J., Lange, H., & Meissner, H. Gap-filling continuously-measured soil respiration data: A highlight of time- series-based methods. Agricultural and Forest Meteorology. 285-286(2020).
- 5Wei, Z. et al. Bridging spatio-temporal discontinuities in global soil moisture mapping by coupling physics in deep learning. 313, 114371(2024).
- 6Bai, Y. et al. SMOS gap-free daily soil moisture product (2010-2020). National Tibetan Plateau/Third Pole Environment Data Center, 10.11888/Terre.tpdc.303001 (2025).
- 7Kang, C.S., Kanniah, K.D., & Kerr, Y.H. Calibration of SMOS Soil Moisture Retrieval Algorithm: A Case of Tropical Site in Malaysia. IEEE Transactions on Geoscience and Remote Sensing. 1-13(2019).
- 8Zhao, T. et al. In-situ measurement data set (2019) of the soil moisture and temperature wireless sensor network within the Shandian River Basin. National Tibetan Plateau Data Center (2021).
