Scalable Computations for Generalized Mixed Effects Models with Crossed Random Effects Using Krylov Subspace Methods
By: Pascal Kündig, Fabio Sigrist
Potential Business Impact:
Makes complex math problems solve much faster.
Mixed effects models are widely used for modeling data with hierarchically grouped structures and high-cardinality categorical predictor variables. However, for high-dimensional crossed random effects, current standard computations relying on Cholesky decompositions can become prohibitively slow. In this work, we present novel Krylov subspace-based methods that address several existing computational bottlenecks. Among other things, we theoretically analyze and empirically evaluate various preconditioners for the conjugate gradient and stochastic Lanczos quadrature methods, derive new convergence results, and develop computationally efficient methods for calculating predictive variances. Extensive experiments using simulated and real-world data sets show that our proposed methods scale much better than Cholesky-based computations, for instance, achieving a runtime reduction of approximately two orders of magnitudes for both estimation and prediction. Moreover, our software implementation is up to 10'000 times faster and more stable than state-of-the-art implementations such as lme4 and glmmTMB when using default settings. Our methods are implemented in the free C++ software library GPBoost with high-level Python and R packages.
Similar Papers
Scalable Fitting Methods for Multivariate Gaussian Additive Models with Covariate-dependent Covariance Matrices
Computation
Helps computers understand complex data patterns better.
Randomized Krylov methods for inverse problems
Numerical Analysis
Cleans up blurry pictures and earthquake maps.
Scalable Gaussian Processes with Latent Kronecker Structure
Machine Learning (CS)
Lets computers learn from huge, messy data.