Scalable and robust regression models for continuous proportional data
By: Changwoo J. Lee , Benjamin K. Dahl , Otso Ovaskainen and more
Potential Business Impact:
Makes regression analysis of proportion data (values between 0 and 1) more robust to model misspecification and faster to compute.
Beta regression is used routinely for continuous proportional data, but it often encounters practical issues such as a lack of robustness of regression parameter estimates to misspecification of the beta distribution. We develop an improved class of generalized linear models starting with the continuous binomial (cobin) distribution and further extending to dispersion mixtures of cobin distributions (micobin). The proposed cobin regression and micobin regression models have attractive robustness, computation, and flexibility properties. A key innovation is the Kolmogorov-Gamma data augmentation scheme, which facilitates Gibbs sampling for Bayesian computation, including in hierarchical cases involving nested, longitudinal, or spatial data. We demonstrate robustness, ability to handle responses exactly at the boundary (0 or 1), and computational efficiency relative to beta regression in simulation experiments and through analysis of the benthic macroinvertebrate multimetric index of US lakes using lake watershed covariates.
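To illustrate the boundary issue mentioned in the abstract, the sketch below evaluates the log-density of the continuous Bernoulli distribution, a known exponential-family distribution on [0, 1] with density exp(θy − B(θ)) and B(θ) = log((e^θ − 1)/θ). Treating this as a stand-in for the simplest member of the cobin family is an assumption made here for illustration; the abstract does not give the cobin density, and the function names are hypothetical. The point is only that such a density stays finite at y = 0 and y = 1, whereas the beta log-density involves log(y) and log(1 − y), which diverge there.

```python
import math


def cb_log_normalizer(theta: float) -> float:
    """B(theta) = log((exp(theta) - 1) / theta), with B(0) = 0 by continuity."""
    if abs(theta) < 1e-6:
        # Second-order Taylor expansion around 0 avoids the 0/0 form.
        return theta / 2.0 + theta * theta / 24.0
    if theta > 0:
        # log(e^theta - 1) = theta + log(1 - e^(-theta)), stable for large theta.
        return theta + math.log(-math.expm1(-theta)) - math.log(theta)
    # For theta < 0: (e^theta - 1) / theta = (1 - e^theta) / (-theta).
    return math.log(-math.expm1(theta)) - math.log(-theta)


def cb_logpdf(y: float, theta: float) -> float:
    """Continuous Bernoulli log-density on the closed interval [0, 1]."""
    if not 0.0 <= y <= 1.0:
        return -math.inf
    return theta * y - cb_log_normalizer(theta)


if __name__ == "__main__":
    theta = 1.5  # illustrative natural parameter, not taken from the paper
    for y in (0.0, 0.25, 1.0):
        print(y, cb_logpdf(y, theta))  # finite, including at y = 0 and y = 1
```

In a regression setting θ would be tied to covariates through a linear predictor; that step, and the dispersion parameter of the cobin and micobin models, are not shown here.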