# A hybrid approach for regionalization of precipitation based on maximal discrete wavelet transform and growing neural gas network clustering

**Authors:** Xu Tao, Ma Ben, He Cao Yin Xuan, Ali Arshaghi

PMC · DOI: 10.1038/s41598-025-24400-1 · 2025-11-18

## TL;DR

This study uses a new method combining wavelet analysis and neural clustering to better understand and map precipitation patterns in China over 45 years.

## Contribution

A novel hybrid approach integrating MODWT and GNG clustering for improved precipitation regionalization.

## Key findings

- The hybrid model achieved a silhouette coefficient of 0.68, indicating effective clustering performance.
- Northern and northwestern China showed higher precipitation variability compared to southern regions.
- The MODWT-based approach outperformed traditional clustering without wavelet preprocessing.

## Abstract

Understanding the spatiotemporal variability of precipitation is critical for effective water resource planning, particularly in regions with diverse climatic zones such as China. This study presents a hybrid methodology combining the Maximal Overlap Discrete Wavelet Transform (MODWT) and the Growing Neural Gas (GNG) clustering algorithm to regionalize precipitation patterns using monthly data from 123 synoptic stations over a 45-year period (1980–2024). MODWT was applied to decompose the precipitation time series into five frequency-based sub-series (W1–W5 and V5), capturing variability across 2- to 32-month cycles. Shannon entropy was calculated for each sub-series, generating a comprehensive feature set that reflects the temporal complexity at each station. These entropy features were subsequently used as input for the GNG algorithm, which identified 12 homogeneous precipitation clusters. The clustering performance was quantitatively assessed using the silhouette coefficient (SC), where the proposed model achieved a maximum SC value of 0.68, indicating strong inter-cluster separation and intra-cluster compactness. In comparison, clustering performed without MODWT-based preprocessing yielded a lower SC value of 0.56, highlighting the effectiveness of the hybrid approach. Spatial analysis revealed that northern and northwestern China exhibited the highest precipitation variability, particularly in the W3 (8-month) and V5 (trend) components, while southern and southeastern regions demonstrated more stable patterns. The results underscore the value of integrating multiscale temporal analysis with neural-based clustering for robust and interpretable regionalization of precipitation. This framework offers substantial potential for informing water resource management, climate adaptation policies, and infrastructure development under future hydroclimatic uncertainty.

## Full-text entities

- **Genes:** SERPINE2 (serpin family E member 2) [NCBI Gene 5270] {aka GDN, GDNPF, PI-7, PI7, PN-1, PN1}, GRHL3 (grainyhead like transcription factor 3) [NCBI Gene 57822] {aka SOM, TFCP2L4, VWS2}, APP (amyloid beta precursor protein) [NCBI Gene 351] {aka AAA, ABETA, ABPP, AD1, APPI, CTFgamma}, USB1 (U6 snRNA biogenesis phosphodiesterase 1) [NCBI Gene 79650] {aka C16orf57, HVSL1, Mpn1, PN, hMpn1, hUsb1}
- **Diseases:** drought (MESH:C536747), flood (MESH:C565009), N-PN (MESH:C565820), MODWT (MESH:D021922)
- **Chemicals:** NAO (-)
- **Species:** Homo sapiens (human, species) [taxon 9606]

## Figures

18 figures with captions in the complete paper: https://tomesphere.com/paper/PMC12627621/full.md

---
Source: https://tomesphere.com/paper/PMC12627621