Clustering country-level all-cause mortality data: a review
By: Pedro Menezes de Araujo, Isobel Claire Gormley, Thomas Brendan Murphy
Potential Business Impact:
Finds patterns in country death rates.
Mortality data are relevant to demography, public health, and actuarial science. Whilst clustering is increasingly used to explore patterns in such data, no study has reviewed its application to country-level all-cause mortality. This review therefore summarises recent work and addresses key questions: why clustering is used, which mortality data are analysed, which methods are most common, and what main findings emerge. To address these questions, we examine studies applying clustering to country-level all-cause mortality, focusing on mortality indices, data sources, and methodological choices, and we replicate some approaches using Human Mortality Database (HMD) data. Our analysis reveals that clustering is mainly motivated by forecasting and by studying convergence and inequality. Most studies use HMD data from developed countries and rely on k-means, hierarchical, or functional clustering. Main findings include a persistent East-West European division across applications, with clustering generally improving forecast accuracy over single-country models. Overall, this review highlights the methodological range in the literature, summarises clustering results, and identifies gaps, such as the limited evaluation of clustering quality and the underuse of data from countries outside the high-income world.
Similar Papers
Latent Spatial Heterogeneity in U.S. Cancer Mortality: A Multi-Site Clustering and Spatial Autocorrelation Analysis
Applications
Finds where cancer kills most to help stop it.
Bayesian local clustering of age-period mortality surfaces across multiple countries
Applications
Finds hidden patterns in how people die.
Modeling smooth and localized mortality patterns across age, time, and space to uncover small-area inequalities
Applications
**Predicts deaths in small areas accurately.**