Bayesian Multivariate Density-Density Regression
By: Khai Nguyen, Yang Ni, Peter Mueller
Potential Business Impact:
Finds hidden patterns in complex data.
We introduce a novel and scalable Bayesian framework for multivariate-density-density regression (DDR), designed to model relationships between multivariate distributions. Our approach addresses the critical issue of distributions residing in spaces of differing dimensions. We utilize a generalized Bayes framework, circumventing the need for a fully specified likelihood by employing the sliced Wasserstein distance to measure the discrepancy between fitted and observed distributions. This choice not only handles high-dimensional data and varying sample sizes efficiently but also facilitates a Metropolis-adjusted Langevin algorithm (MALA) for posterior inference. Furthermore, we establish the posterior consistency of our generalized Bayesian approach, ensuring that the posterior distribution concentrates around the true parameters as the sample size increases. Through simulations and application to a population-scale single-cell dataset, we show that Bayesian DDR provides robust fits, superior predictive performance compared to traditional methods, and valuable insights into complex biological interactions.
Similar Papers
Robust and Scalable Variational Bayes
Machine Learning (Stat)
Cleans messy data for better computer learning.
Sensitivity-Aware Density Estimation in Multiple Dimensions
Machine Learning (CS)
Makes computers better at guessing patterns in messy data.
Savage-Dickey density ratio estimation with normalizing flows for Bayesian model comparison
Cosmology and Nongalactic Astrophysics
Helps scientists pick the best science idea.