On the Distributed Estimation for Scalar-on-Function Regression Models
By: Peilun He, Han Lin Shang, Nan Zou
Potential Business Impact:
Helps computers analyze big data faster.
This paper proposes distributed estimation procedures for three scalar-on-function regression models: the functional linear model (FLM), the functional non-parametric model (FNPM), and the functional partial linear model (FPLM). The framework addresses two key challenges in functional data analysis, namely the high computational cost of large samples and limitations on sharing raw data across institutions. Monte Carlo simulations show that the distributed estimators substantially reduce computation time while preserving high estimation and prediction accuracy for all three models. When block sizes become too small, the FPLM exhibits overfitting, leading to narrower prediction intervals and reduced empirical coverage probability. An example of an empirical study using the \textit{tecator} dataset further supports these findings.
Similar Papers
Estimation and inference of high-dimensional partially linear regression models with latent factors
Methodology
Finds hidden patterns in complex data.
On function-on-function linear quantile regression
Methodology
Helps understand complex data patterns better.
Penalized spatial function-on-function regression
Methodology
Improves weather forecasts by understanding nearby data.