On the Identifiability of Regime-Switching Models with Multi-Lag Dependencies
By: Carles Balsells-Rodas , Toshiko Matsui , Pedro A. M. Mediano and more
Potential Business Impact:
Helps understand complex systems by finding hidden patterns.
Identifiability is central to the interpretability of deep latent variable models, ensuring parameterisations are uniquely determined by the data-generating distribution. However, it remains underexplored for deep regime-switching time series. We develop a general theoretical framework for multi-lag Regime-Switching Models (RSMs), encompassing Markov Switching Models (MSMs) and Switching Dynamical Systems (SDSs). For MSMs, we formulate the model as a temporally structured finite mixture and prove identifiability of both the number of regimes and the multi-lag transitions in a nonlinear-Gaussian setting. For SDSs, we establish identifiability of the latent variables up to permutation and scaling via temporal structure, which in turn yields conditions for identifiability of regime-dependent latent causal graphs (up to regime/node permutations). Our results hold in a fully unsupervised setting through architectural and noise assumptions that are directly enforceable via neural network design. We complement the theory with a flexible variational estimator that satisfies the assumptions and validate the results on synthetic benchmarks. Across real-world datasets from neuroscience, finance, and climate, identifiability leads to more trustworthy interpretability analysis, which is crucial for scientific discovery.
Similar Papers
Frequentist forecasting in regime-switching models with extended Hamilton filter
Methodology
Helps predict when students might quit math.
Unsupervised learning of multiscale switching dynamical system models from multimodal neural data
Machine Learning (CS)
Helps brain implants understand thoughts better.
On the importance of structural identifiability for machine learning with partially observed dynamical systems
Machine Learning (CS)
Helps computers learn from messy, limited data.