Conformalized Regression for Continuous Bounded Outcomes
By: Zhanli Wu, Fabrizio Leisen, F. Javier Rubio
Potential Business Impact:
Gives prediction intervals with reliable coverage for rates and proportions.
Regression problems with bounded continuous outcomes frequently arise in real-world statistical and machine learning applications, such as the analysis of rates and proportions. A central challenge in this setting is predicting a response associated with a new covariate value. Most of the existing statistical and machine learning literature has focused either on point prediction of bounded outcomes or on interval prediction based on asymptotic approximations. We develop conformal prediction intervals for bounded outcomes based on transformation models and beta regression. We introduce tailored non-conformity measures based on residuals that are aligned with the underlying models, and account for the inherent heteroscedasticity in regression settings with bounded outcomes. We present a theoretical result on asymptotic marginal and conditional validity in the context of full conformal prediction, which remains valid under model misspecification. For split conformal prediction, we provide an empirical coverage analysis based on a comprehensive simulation study. The simulation study demonstrates that both methods provide valid finite-sample predictive coverage, including settings with model misspecification. Finally, we demonstrate the practical performance of the proposed conformal prediction intervals on real data and compare them with bootstrap-based alternatives.
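To make the split conformal idea concrete, here is a minimal sketch for an outcome in (0, 1). It is not the authors' implementation. It assumes a simple transformation model (ordinary least squares on the logit of the response) and uses the absolute residual on the logit scale as the non-conformity score; the paper's tailored scores and its beta regression variant may differ. The function names `logit`, `expit`, and `split_conformal_logit`, and the simulated data, are hypothetical and chosen only for illustration.

```python
import numpy as np


def logit(p):
    """Map values in (0, 1) to the real line."""
    p = np.asarray(p, dtype=float)
    return np.log(p / (1.0 - p))


def expit(z):
    """Inverse of logit: map real values back to (0, 1)."""
    z = np.asarray(z, dtype=float)
    return 1.0 / (1.0 + np.exp(-z))


def _design(x):
    """Design matrix with an intercept column (assumed linear predictor)."""
    x = np.asarray(x, dtype=float)
    x = x.reshape(len(x), -1)
    return np.column_stack([np.ones(len(x)), x])


def split_conformal_logit(x_train, y_train, x_cal, y_cal, x_new, alpha=0.1):
    """Split conformal intervals for y in (0, 1) via a logit-linear model.

    Assumption: least squares on logit(y) stands in for the working model.
    """
    # 1. Fit the working model on the training split only.
    beta, *_ = np.linalg.lstsq(_design(x_train), logit(y_train), rcond=None)

    # 2. Non-conformity scores on the calibration split (logit scale).
    scores = np.abs(logit(y_cal) - _design(x_cal) @ beta)

    # 3. Conformal quantile: the ceil((n + 1)(1 - alpha))-th smallest score.
    n = len(scores)
    k = int(np.ceil((n + 1) * (1.0 - alpha)))
    if k > n:
        raise ValueError("calibration set too small for this alpha")
    q = np.sort(scores)[k - 1]

    # 4. Interval on the logit scale, mapped back so it stays inside (0, 1).
    centre = _design(x_new) @ beta
    return expit(centre - q), expit(centre + q)


# Simulated example (hypothetical data, not from the paper).
rng = np.random.default_rng(0)
n_total = 600
x = rng.uniform(-2.0, 2.0, size=n_total)
y = expit(0.5 + 1.0 * x + rng.normal(scale=0.7, size=n_total))

x_train, y_train = x[:300], y[:300]
x_cal, y_cal = x[300:500], y[300:500]
x_test, y_test = x[500:], y[500:]

lower, upper = split_conformal_logit(
    x_train, y_train, x_cal, y_cal, x_test, alpha=0.1
)
coverage = np.mean((y_test >= lower) & (y_test <= upper))
print(f"empirical coverage on held-out points: {coverage:.2f}")
```

Because the interval has constant width on the logit scale, its width on the original scale changes with the fitted value and shrinks near 0 and 1. This is one simple way a transformation can reflect the heteroscedasticity of bounded outcomes, and the back-transformed endpoints can never leave (0, 1).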
Similar Papers
Conformal prediction of future insurance claims in the regression problem
Machine Learning (Stat)
Makes insurance predictions more trustworthy and reliable.
Extreme Conformal Prediction: Reliable Intervals for High-Impact Events
Methodology
Predicts rare events with high certainty.
A Unified Comparative Study with Generalized Conformity Scores for Multi-Output Conformal Regression
Machine Learning (Stat)
Helps computers guess many things at once.