A satellite-derived dataset on vegetation phenology across Central Asia from 2001 to 2023
Chao Ding

TL;DR
This paper introduces a satellite-based dataset tracking vegetation growth patterns in Central Asia from 2001 to 2023, aiming to improve understanding of dryland ecosystem changes.
Contribution
A novel satellite-derived dataset for Central Asian vegetation phenology using MODIS data and advanced smoothing techniques.
Findings
The dataset includes seven LSP metrics categorized into timing, NIRv values, and vegetation growth state.
The stationary wavelet transform with Biorthogonal 1.1 wavelet improved smoothing of NIRv time series.
The dataset supports analysis of dryland ecosystem responses to climate and human activities.
Abstract
Satellite-observed land surface phenology (LSP) data have helped us better understand terrestrial ecosystem dynamics at large scales. However, uncertainties remain in comprehending LSP variations in Central Asian drylands. In this article, an LSP dataset covering Central Asia (45–100°E, 33–57°N) is introduced. This LSP dataset was produced based on Moderate Resolution Imaging Spectroradiometer (MODIS) 0.05-degree daily reflectance and land cover data. The phenological dynamics of drylands were tracked using the seasonal profiles of near-infrared reflectance of vegetation (NIRv). NIRv time series processing involved the following steps: identifying low-quality observations, smoothing the NIRv time series, and retrieving LSP metrics. In the smoothing step, a median filter was first applied to reduce spikes, after which the stationary wavelet transform (SWT) was used to smooth the NIRv…
Genes, proteins, chemicals, diseases, species, mutations and cell lines named across the full text — each resolved to its canonical identifier and authoritative record.
Click any figure to enlarge with its caption.
Figure 1
Figure 2
Figure 3Peer Reviews
No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.
Videos
No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.
Taxonomy
TopicsRemote Sensing in Agriculture · Plant Water Relations and Carbon Dynamics · Urban Heat Island Mitigation
Specifications TableSubjectEnvironmental ScienceSpecific subject areaGlobal change, terrestrial ecosystem dynamics, remote sensing in ecologyData formatAnalyzedType of dataImageData collectionThis land surface phenology (LSP) dataset was generated using Earth observation satellite data, namely Moderate Resolution Imaging Spectroradiometer (MODIS) surface reflectance and land cover data. The collected time-series satellite images were processed pixel-by-pixel through the following steps: (1) identifying low-quality observations; (2) smoothing the time series; and (3) retrieving LSP metrics. Processing was implemented using Python and MATLAB codes.Data source locationThe spatial coverage of the LSP dataset is 45–100°E, 33–57°N.The MODIS reflectance and land cover data used to generate this LSP dataset can be downloaded from https://search.earthdata.nasa.gov/searchData accessibilityRepository name: Mendeley dataData identification number: 10.17632/frvnt3jj4w.1Direct URL to data:https://data.mendeley.com/datasets/frvnt3jj4w/1
Value of the Data
1
- •The LSP dataset presented herein can be utilized to examine the dynamics of drylands and their responses to climate variation/change and human activities across Central Asia.
- •The spatio-temporal dynamics of the LSP metrics are valuable for evaluating ecosystem functions in Central Asia.
- •This dataset supports studies concerning the United Nations Sustainable Development Goal (SDG) 15 [1], which is related to ecosystem productivity, biodiversity, and land degradation.
Background
2
Information on land surface phenology (LSP) can facilitate our understanding of land surface dynamics [2,[3]](#bib0003). Knowledge of LSP has been significantly enhanced by the rapid growth of time-series remote sensing data [3]. Nonetheless, a substantial knowledge gap regarding LSP in Central Asia, a region characterized by extensive drylands, remains [3,[4]](#bib0004). The degradation of ecosystem functions and services in drylands is a barrier to sustainable development in Central Asia [5]. Data on LSP are important for comprehending the dynamics of several key ecological indicators related to SDG 15 [6]. Although global LSP products are available, information on LSP in Central Asia remains insufficient, partially due to widespread missing data for sparse vegetation and uncertainties in modeling complex vegetation index (VI) time series [7]. In this context, I developed an LSP dataset specific to the drylands in Central Asia [8] by utilizing satellite-derived images of the near-infrared reflectance of vegetation (NIRv) [9], considering the low vegetation cover and complexity of satellite VI time series in drylands.
Data Description
3
The spatio-temporal characteristics of the LSP dataset for Central Asia (LSPCA) are outlined in Table 1. These LSP metrics were categorized into the following three groups: (1) timing of key phenological events, (2) NIRv values essential for detecting the phenological events throughout the growing season, and (3) NIRv value linked to vegetation growth states during the growing season (Table 2). Additionally, other phenological metrics, such as the length of the vegetation growing season (LOS), can be calculated using the provided LSP metrics.Table 1. Spatio-temporal characteristics of the land surface phenology dataset for Central Asia.Table 1. Geographic coverage45–100°E, 33–57°NSpatial resolution0.05°Temporal extent2001–2023Temporal resolutionYearlySpatial coordinate systemWGS 84 geographic coordinate systemTable 2Characteristics of the land surface phenology metrics.Table 2LSP groupLSP metricDefinitionValue domainTiming of key phenological eventsSOSStart of the vegetation growing season-59 to 257POSMaximum peak of the vegetation growing season32 to 258EOSEnd of the vegetation growing season33 to 365NIRv values essential for detecting the phenological eventsNIRv_PEAKMaximum peak in the NIRv time series150 to 6000 (actual value: 0.015 to 0.6)NIRv_MINLMinimum of the NIRv time series before the POS0 to 6000 (0 to 0.6)NIRv_MINRMinimum of the NIRv time series after the POS0 to 6000 (0 to 0.6)NIRv value linked to vegetation growth stateNIRv_MEANMean of the NIRv time series from the SOS to the EOS0 to 6000 (0 to 0.6)
The images of the LSP metrics are included in the compressed file ‘LSPCA.zip’. Each LSP metric is stored in an individual subfolder, and the name of the subfolder is the same as that of the LSP metric. Each subfolder contains 23 images. The name of an image denotes the LSP metric and the corresponding year. For example, ‘SOS_2022.tif’ indicates the SOS image for 2022.
The values of POS, SOS, and EOS fall within specific valid ranges, as outlined in Table 2. The POS is within the range of day of year (DOY) 32 to 258. In the southern regions of Central Asia, the SOS may occur during the last two months of the year. For example, the SOS of a pixel has been recorded on 12/9/2021. In this case, the year of the SOS is also identified as 2022, given that the peak NIRv of this growing season occurs in 2022 [10]. Consequently, the SOS value is negative (-22 for SOS on 12/9/2021). The spatial pattern of the SOS in 2022, as depicted in Fig. 1, illustrates numerous instances of negative SOS values. The yearly identifiers for all other LSP metrics are likewise assigned according to the POS [10]. This process allows users to track an entire growing season across two calendar years [10]. Notably, NIRv_MEAN represents the mean NIRv value from the SOS to the EOS, rather than the mean value over a calendar year. Fig. 2 shows the spatial distribution of NIRv_MEAN during the growing season identified as 2022. Additionally, the values of the masked pixels are set at -20,000.Fig. 1. Spatial distribution of the SOS in 2022 derived from the land surface phenology dataset for Central Asia.Fig 1. Fig. 2Spatial distribution of the NIRv_MEAN in 2022 derived from the land surface phenology dataset for Central Asia.Fig 2
Experimental Design, Materials and Methods
4
The MODIS MCD43C4 V061 0.05° daily reflectance data [11] for the period of 2000–2023 and the MCD12C1 V061 land cover data [12] for 2001 were obtained from https://search.earthdata.nasa.gov/search. The NIRv was calculated to track the vegetation seasonality dynamics (Eq. (1)).
where and are the surface reflectance values of the near-infrared (Band 2) and red (Band 1) bands of the MODIS sensor, respectively.
NIRv is suitable for observing vegetation dynamics in drylands with a low vegetation cover [9,[13]](#bib0013). A previous study reported that NIRv can be used to effectively estimate the gross primary productivity of vegetation in sparse drylands [13].
The following steps were conducted to process the NIRv time series of a land pixel (as identified by the land cover data): identifying low-quality observations, smoothing the NIRv time series, and retrieving the LSP metrics from the smoothed NIRv time series.
Identifying Low-Quality Observations
4.1
Observations with a Bidirectional Reflectance Distribution Function (BRDF) quality value greater than 3 were considered to be low quality [14]. Additionally, data with NIRv < 0 and NIRv > 0.6 were identified as low-quality data. The corresponding data points were excluded from the NIRv time series. The normalized difference snow index (NDSI) [15] was employed to detect the snow-affected NIRv values for which the NDSI > 0.1 [16]. The 5^th^ percentile of high-quality NIRv values observed within a five-year temporal window was used to replace the snow-affected observations in the NIRv time series [10]. Any remaining data gaps in the NIRv time series were filled using linear interpolation.
Smoothing the NIRv Time Series
4.2
Droughts can lead to complex seasonal NIRv profiles of drylands in Central Asia. Here, I applied a median filter to detect spikes in the NIRv time series [17]. If the absolute difference between the raw and the filtered NIRv values (11-day temporal window) was greater than 2.5 times the standard deviation of the raw NIRv values within a 21-day window, the raw NIRv value was replaced with the filtered value. The stationary wavelet transform (SWT), a local filtering method, was then applied to further reduce the noise in the NIRv time series [18,[19]](#bib0019). SWT was performed on the NIRv time series at a decomposition level of 5 using the Biorthogonal 1.1 wavelet. The SWT decomposed the NIRv time series into an approximation signal and five detailed signals. Typically, the detailed signals contain information on high-frequency fluctuations in the NIRv time series, which may be considered noise [19]. Here, all detailed signals were eliminated to obtain a smooth NIRv time series [20]. An example of the final NIRv time series reconstructed through the two aforementioned steps is shown in Fig. 3.Fig. 3. An example of the reconstructed near-infrared reflectance of vegetation (NIRv) time series.Fig 3
Retrieving the LSP Metrics
4.3
The LSP metrics were estimated annually within a 14-month temporal window. For example, when retrieving the LSP metrics for 2005, NIRv data from 11/2/2004 to 12/31/2005 were used. A single growing season was considered for the LSPCA dataset. The maximum NIRv peak (NIRv_PEAK) and the corresponding date (POS) were initially detected. POS was assumed to occur prior to DOY 258. The following steps were performed if the detected NIRv_PEAK exceeded 0.015. First, the seasonal minimum NIRv values before and after POS were detected. Then, the seasonal amplitudes of the NIRv for the vegetation green-up (AMP_G) and senescence (AMP_S) phases within the temporal window were calculated. Finally, the thresholds set to 20% of AMP_G and AMP_S were applied to retrieve the SOS and EOS, respectively [7]. If the detected POS occurred prior to DOY 32, all LSP metrics for the corresponding growing season were masked, as the NIRv time series for that growing season may be abnormal.
Limitations
In vegetated drylands, droughts can result in extremely low vegetation productivity and abnormal seasonal NIRv profiles. Consequently, cases of missing data can occur in the 23-year time series of an LSP metric, particularly for desert vegetation pixels. In such cases, a long-term trend analysis of the LSP metrics for a specific pixel cannot be carried out.
Ethics Statement
The author confirms that the current work does not involve human subjects, animal experiments, or any data collected from social media platforms.
CRediT authorship contribution statement
Chao Ding: Conceptualization, Methodology, Software, Resources, Data curation, Visualization, Writing – original draft.
The reference list from the paper itself. Each links out to its DOI / PubMed record.
- 1United Nations, Transforming our world: the 2030 Agenda for sustainable development, 2015. https://sdgs.un.org/2030 agenda.
- 2Henebry G.M.de Beurs K.M.Remote sensing of land surface phenology: a prospectus Schwartz M.D.Phenology: An Integrative Environmental Science 2013 Springer Netherlands Dordrecht 38541110.1007/978-94-007-6925-0_21 · doi ↗
- 3Caparros-Santiago J.A.Rodriguez-Galiano V.Dash J.Land surface phenology as indicator of global terrestrial ecosystem dynamics: a systematic review ISPRS J. Photogramm. Remote Sens.171202133034710.1016/j.isprsjprs.2020.11.019 · doi ↗
- 4Wu L.Ma X.Dou X.Zhu J.Zhao C.Impacts of climate change on vegetation phenology and net primary productivity in arid Central Asia Sci. Total Environ.796202114905510.1016/j.scitotenv.2021.14905534328878 · doi ↗ · pubmed ↗
- 5Chen J.John R.Yuan J.Mack E.A.Groisman P.Allington G.Wu J.Fan P.De Beurs K.M.Karnieli A.Gutman G.Kappas M.Dong G.Zhao F.Ouyang Z.Pearson A.L.Şat B.Graham N.A.Shao C.Graham A.K.Henebry G.M.Xue Z.Amartuvshin A.Qu L.Park H.Xin X.Chen J.Tian L.Knight C.Kussainova M.Li F.Fürst C.Qi J.Sustainability challenges for the social-environmental systems across the Asian Drylands belt Environ. Res. Lett.17202202300110.1088/1748-9326/ac 472f · doi ↗
- 6Misra G.Cawkwell F.Wingler A.Status of phenological research using sentinel-2 data: a review Remote Sens.122020276010.3390/rs 12172760 · doi ↗
- 7Xie Q.Cleverly J.Moore C.E.Ding Y.Hall C.C.Ma X.Brown L.A.Wang C.Beringer J.Prober S.M.Macfarlane C.Meyer W.S.Yin G.Huete A.Land surface phenology retrievals for arid and semi-arid ecosystems ISPRS J. Photogramm. Remote Sens.185202212914510.1016/j.isprsjprs.2022.01.017 · doi ↗
- 8Ding C.A land surface phenology dataset for Central Asia derived from MODIS NI Rv time series Mendeley Data 202410.17632/frvnt 3jj 4w.1 · doi ↗
