Repulsive mixtures via the sparsity-inducing partition prior

Published: September 30, 2025 | arXiv ID: 2509.25860v1

By: Alexander Mozdzen, Timothy Wertz, Maria De Iorio and more

Potential Business Impact:

Finds fewer, stronger groups in data.

Business Areas:
A/B Testing, Data and Analytics

We introduce a novel prior distribution for modelling the weights in mixture models based on a generalisation of the Dirichlet distribution, the Selberg Dirichlet distribution. This distribution contains a repulsive term, which naturally penalises values that lie close to each other on the simplex, thus encouraging few dominating clusters. The repulsive behaviour induces additional sparsity on the number of components. We refer to this construction as the sparsity-inducing partition (SIP) prior. By highlighting differences with the conventional Dirichlet distribution, we present relevant properties of the SIP prior and demonstrate their implications across a variety of mixture models, including finite mixtures with a fixed or random number of components, as well as repulsive mixtures. We propose an efficient posterior sampling algorithm and validate our model through an extensive simulation study as well as an application to a biomedical dataset describing children's Body Mass Index and eating behaviour.
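
To make the repulsive idea concrete, the following is a minimal sketch, not the authors' implementation. The abstract does not state the density, so the sketch assumes a Selberg-type form: a symmetric Dirichlet kernel multiplied by pairwise terms |w_i - w_j|^(2*gamma). The function name and the parameters `alpha` and `gamma` are hypothetical, and the normalising constant is omitted.

```python
import numpy as np


def log_repulsive_dirichlet_unnorm(w, alpha=1.0, gamma=1.0):
    """Unnormalised log-density of an ASSUMED repulsive Dirichlet-type prior.

    Assumed form (not taken from the paper):
        p(w) proportional to prod_i w_i^(alpha - 1) * prod_{i<j} |w_i - w_j|^(2 * gamma)
    for w on the probability simplex.
    """
    w = np.asarray(w, dtype=float)
    # Reject points outside the open simplex.
    if np.any(w <= 0) or not np.isclose(w.sum(), 1.0):
        return -np.inf

    # Standard symmetric Dirichlet kernel.
    dirichlet_part = (alpha - 1.0) * np.sum(np.log(w))
    if gamma == 0:
        # No repulsion: reduces to the Dirichlet kernel.
        return dirichlet_part

    # Pairwise repulsion: weights that coincide get zero density.
    i, j = np.triu_indices(len(w), k=1)
    diffs = np.abs(w[i] - w[j])
    if np.any(diffs == 0):
        return -np.inf
    return dirichlet_part + 2.0 * gamma * np.sum(np.log(diffs))


balanced = log_repulsive_dirichlet_unnorm([1 / 3, 1 / 3, 1 / 3])
near_balanced = log_repulsive_dirichlet_unnorm([0.4, 0.35, 0.25])
dominated = log_repulsive_dirichlet_unnorm([0.7, 0.2, 0.1])
```

Under this assumed form, perfectly balanced weights receive zero density and weight vectors with a few dominating entries score higher than near-balanced ones, which matches the behaviour the abstract describes. Setting `gamma=0` recovers the ordinary Dirichlet kernel.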

Page Count
25 pages

Category
Statistics: Methodology