Integrating Complex Covariate Transformations in Generalized Additive Models
By: Claudia Collarin , Matteo Fasiolo , Yannig Goude and more
Potential Business Impact:
Makes computer models learn better from data.
Transformations of covariates are widely used in applied statistics to improve interpretability and to satisfy assumptions required for valid inference. More broadly, feature engineering encompasses a wider set of practices aimed at enhancing predictive performance, and is typically performed as part of a data pre-processing step. In contrast, this paper integrates a substantial component of the feature engineering process directly into the modelling stage. This is achieved by introducing a novel general framework for embedding interpretable covariate transformations within multi-parameter Generalised Additive Models (GAMs). Our framework accommodates any sufficiently differentiable scalar-valued transformation of potentially high-dimensional and complex covariates. These transformations are treated as integral model components, with their parameters estimated jointly with regression coefficients via maximum a posteriori (MAP) methods, and joint uncertainty quantified via approximate Bayesian techniques. Smoothing parameters are selected in an empirical Bayes framework using a Laplace approximation to the marginal likelihood, supported by efficient computation based on implicit differentiation methods. We demonstrate the flexibility and practical value of the proposed methodology through applications to forecasting electricity net-demand in Great Britain and to modelling house prices in London. The proposed methods are implemented by the gamFactory R package, available at https://github.com/mfasiolo/gamFactory.
Similar Papers
Adding structure to generalized additive models, with applications in ecology
Methodology
Helps scientists understand nature with complex data.
Adding structure to generalized additive models, with applications in ecology
Methodology
Lets computers understand complex data better.
Graphical Transformation Models
Methodology
Finds hidden patterns in complicated data.