Score: 0

A comparison between initialization strategies for the infinite hidden Markov model

Published: December 3, 2025 | arXiv ID: 2512.03777v1

By: Federico P. Cortese, Luca Rossini

Potential Business Impact:

Helps computers find hidden patterns in changing data.

Business Areas:
A/B Testing Data and Analytics

Infinite hidden Markov models provide a flexible framework for modelling time series with structural changes and complex dynamics, without requiring the number of latent states to be specified in advance. This flexibility is achieved through the hierarchical Dirichlet process prior, while efficient Bayesian inference is enabled by the beam sampler, which combines dynamic programming with slice sampling to truncate the infinite state space adaptively. Despite extensive methodological developments, the role of initialization in this framework has received limited attention. This study addresses this gap by systematically evaluating initialization strategies commonly used for finite hidden Markov models and assessing their suitability in the infinite setting. Results from both simulated and real datasets show that distance-based clustering initializations consistently outperform model-based and uniform alternatives, the latter being the most widely adopted in the existing literature.

Country of Origin
🇮🇹 Italy

Page Count
31 pages

Category
Statistics:
Methodology