MMM: Clustering Multivariate Longitudinal Mixed-type Data
By: Francesco Amato, Julien Jacques
Potential Business Impact:
Groups mixed data by time and type.
Multivariate longitudinal data of mixed-type are increasingly collected in many science domains. However, algorithms to cluster this kind of data remain scarce, due to the challenge to simultaneously model the within- and between-time dependence structures for multivariate data of mixed kind. We introduce the Mixture of Mixed-Matrices (MMM) model: reorganizing the data in a three-way structure and assuming that the non-continuous variables are observations of underlying latent continuous variables, the model relies on a mixture of matrix-variate normal distributions to perform clustering in the latent dimension. The MMM model is thus able to handle continuous, ordinal, binary, nominal and count data and to concurrently model the heterogeneity, the association among the responses and the temporal dependence structure in a parsimonious way and without assuming conditional independence. The inference is carried out through an MCMC-EM algorithm, which is detailed. An evaluation of the model through synthetic data shows its inference abilities. A real-world application on financial data is presented.
Similar Papers
SiamMM: A Mixture Model Perspective on Deep Unsupervised Learning
Machine Learning (CS)
Finds hidden patterns in data, improving learning.
Clustering Approaches for Mixed-Type Data: A Comparative Study
Machine Learning (Stat)
Finds patterns in mixed-type data.
Multivariate longitudinal modeling of cross-sectional and lagged associations between a continuous time-varying endogenous covariate and a non-Gaussian outcome
Methodology
Helps doctors understand disease changes better.