On the Choice of Model Space Priors and Multiplicity Control in Bayesian Variable Selection: An Application to Streaming Logistic Regression
By: Joyee Ghosh
Bayesian variable selection (BVS) depends critically on the specification of a prior distribution over the model space, particularly for controlling sparsity and multiplicity. This paper examines the practical consequences of different model space priors for BVS in logistic regression, with an emphasis on streaming data settings. We review some popular and well-known Beta--Binomial priors alongside the recently proposed matryoshka doll (MD) prior. We introduce a simple approximation to the MD prior that yields independent inclusion indicators and is convenient for scalable inference. Using BIC-based approximations to marginal likelihoods, we compare the effect of different model space priors on posterior inclusion probabilities and coefficient estimation at intermediate and final stages of the data stream via simulation studies. Overall, the results indicate that no single model space prior uniformly dominates across scenarios, and that the recently proposed MD prior provides a useful additional option that occupies an intermediate position between commonly used Beta--Binomial priors with differing degrees of sparsity.
Similar Papers
Logistic regression models: practical induced prior specification
Methodology
Makes computer models guess better about hidden things.
Bayesian inference with Besov-Laplace priors for spatially inhomogeneous binary classification surfaces
Statistics Theory
Helps computers learn from messy data better.
An Online Algorithm for Bayesian Variable Selection in Logistic Regression Models With Streaming Data
Methodology
Lets computers learn from new data faster.