Score: 1

Clustering Methods for Identifying and Modelling Areas with Similar Temperature Variations

Published: January 29, 2026 | arXiv ID: 2601.21495v1

By: Edoardo Otranto

Potential Business Impact:

Predicts future global warming patterns more accurately.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

This paper proposes a novel data-driven approach for identifying and modelling areas with similar temperature variations throufigureh clustering and Space-Time AutoRegressive (STAR) models. Using annual temperature data from 168 countries (1901-2022), we apply three clustering methods based on (i) warming rates, (ii) annual temperature variations, and (iii) persistence of variation signs, using Euclidean and Hamming distances. These clusters are then employed to construct alternative spatial weight matrices for STAR models. Empirical results show that distance-based STAR models outperform classical contiguity-based ones, both in-sample and out-of-sample, with the Hamming distance-based STAR model achieving the best predictive accuracy. The study demonstrates that using statistical similarity rather than geographical proximity improves the modelling of global temperature dynamics, suggesting broader applicability to other environmental and socioeconomic datasets.

Country of Origin
🇮🇹 Italy

Repos / Data Links

Page Count
22 pages

Category
Statistics:
Applications