Repulsive mixtures via the sparsity-inducing partition prior
By: Alexander Mozdzen , Timothy Wertz , Maria De Iorio and more
Potential Business Impact:
Finds fewer, stronger groups in data.
We introduce a novel prior distribution for modelling the weights in mixture models based on a generalisation of the Dirichlet distribution, the Selberg Dirichlet distribution. This distribution contains a repulsive term, which naturally penalises values that lie close to each other on the simplex, thus encouraging few dominating clusters. The repulsive behaviour induces additional sparsity on the number of components. We refer to this construction as sparsity-inducing partition (SIP) prior. By highlighting differences with the conventional Dirichlet distribution, we present relevant properties of the SIP prior and demonstrate their implications across a variety of mixture models, including finite mixtures with a fixed or random number of components, as well as repulsive mixtures. We propose an efficient posterior sampling algorithm and validate our model through an extensive simulation study as well as an application to a biomedical dataset describing children's Body Mass Index and eating behaviour.
Similar Papers
Repulsive Mixture Model with Projection Determinantal Point Process
Methodology
Finds clear groups in messy data.
Bayesian Wasserstein Repulsive Gaussian Mixture Models
Methodology
Finds groups of things that are clearly different.
New sampling approaches for Shrinkage Inverse-Wishart distribution
Methodology
Makes computer math faster and more reliable.