Clustering Methods for Identifying and Modelling Areas with Similar Temperature Variations

Edoardo Otranto

arXiv:2601.21495·stat.AP·January 30, 2026

Clustering Methods for Identifying and Modelling Areas with Similar Temperature Variations

Edoardo Otranto

PDF

Open Access

TL;DR

This study introduces a data-driven clustering approach combined with Space-Time AutoRegressive models to better understand and predict global temperature variations, outperforming traditional methods.

Contribution

It presents a novel combination of clustering and STAR models using statistical similarity measures, enhancing temperature variation modelling accuracy.

Findings

01

Distance-based STAR models outperform classical models.

02

Hamming distance-based STAR model achieves highest predictive accuracy.

03

Statistical similarity improves global temperature modelling.

Abstract

This paper proposes a novel data-driven approach for identifying and modelling areas with similar temperature variations throufigureh clustering and Space-Time AutoRegressive (STAR) models. Using annual temperature data from 168 countries (1901-2022), we apply three clustering methods based on (i) warming rates, (ii) annual temperature variations, and (iii) persistence of variation signs, using Euclidean and Hamming distances. These clusters are then employed to construct alternative spatial weight matrices for STAR models. Empirical results show that distance-based STAR models outperform classical contiguity-based ones, both in-sample and out-of-sample, with the Hamming distance-based STAR model achieving the best predictive accuracy. The study demonstrates that using statistical similarity rather than geographical proximity improves the modelling of global temperature dynamics,…

Peer Reviews

No public reviews on file for this paper yet. If you reviewed it on a platform where reviews are public (OpenReview, ICLR, NeurIPS, ICML), you can paste yours below so the community can read it here.

Videos

No videos yet. Explain this paper in a talk, walkthrough, or lecture? Add one.

Taxonomy

TopicsSpatial and Panel Data Analysis · Data-Driven Disease Surveillance · Human Mobility and Location-Based Analysis