Score: 0

Scalable and robust regression models for continuous proportional data

Published: April 21, 2025 | arXiv ID: 2504.15269v1

By: Changwoo J. Lee , Benjamin K. Dahl , Otso Ovaskainen and more

Potential Business Impact:

Makes data analysis more reliable and accurate.

Business Areas:
A/B Testing Data and Analytics

Beta regression is used routinely for continuous proportional data, but it often encounters practical issues such as a lack of robustness of regression parameter estimates to misspecification of the beta distribution. We develop an improved class of generalized linear models starting with the continuous binomial (cobin) distribution and further extending to dispersion mixtures of cobin distributions (micobin). The proposed cobin regression and micobin regression models have attractive robustness, computation, and flexibility properties. A key innovation is the Kolmogorov-Gamma data augmentation scheme, which facilitates Gibbs sampling for Bayesian computation, including in hierarchical cases involving nested, longitudinal, or spatial data. We demonstrate robustness, ability to handle responses exactly at the boundary (0 or 1), and computational efficiency relative to beta regression in simulation experiments and through analysis of the benthic macroinvertebrate multimetric index of US lakes using lake watershed covariates.

Country of Origin
🇺🇸 United States

Page Count
47 pages

Category
Statistics:
Methodology