Uncertainty Quantification in Bayesian Clustering
By: Garritt L. Page , Andrés F. Barrientos , David B. Dahl and more
Potential Business Impact:
Shows how sure we are about groups of things.
Bayesian clustering methods have the widely touted advantage of providing a probabilistic characterization of uncertainty in clustering through the posterior distribution. An amazing variety of priors and likelihoods have been proposed for clustering in a broad array of settings. There is also a rich literature on Markov chain Monte Carlo (MCMC) algorithms for sampling from posterior clustering distributions. However, there is relatively little work on summarizing the posterior uncertainty. The complexity of the partition space corresponding to different clusterings makes this problem challenging. We propose a post-processing procedure for any Bayesian clustering model with posterior samples that generates a credible set that is easy to use, fast to compute, and intuitive to interpret. We also provide new measures of clustering uncertainty and show how to compute cluster-specific parameter estimates and credible regions that accumulate a desired posterior probability without having to condition on a partition estimate or employ label-switching techniques. We illustrate our procedure through several applications.
Similar Papers
Understanding uncertainty in Bayesian cluster analysis
Computation
Shows many possible ways to group data.
Conformalized Bayesian Inference, with Applications to Random Partition Models
Methodology
Makes complex computer guesses easier to understand.
Calibrating Bayesian Inference
Methodology
Makes sure computer guesses are always right.