
Random Subset Averaging

Published: December 27, 2025 | arXiv ID: 2512.22472v1

By: Wenhao Cui, Jie Hu

We propose a new ensemble prediction method, Random Subset Averaging (RSA), tailored for settings with many covariates, particularly in the presence of strong correlations. RSA constructs candidate models via a binomial random subset strategy and aggregates their predictions through a two-round weighting scheme, resulting in a structure analogous to a two-layer neural network. All tuning parameters are selected via cross-validation, requiring no prior knowledge of covariate relevance. We establish the asymptotic optimality of RSA under general conditions, allowing the first-round weights to be data-dependent, and demonstrate that RSA achieves a lower finite-sample risk bound under orthogonal design. Simulation studies show that RSA consistently delivers superior and stable predictive performance across a wide range of sample sizes, dimensional settings, sparsity levels, and correlation structures, outperforming conventional model selection and ensemble learning methods. An empirical application to financial return forecasting further illustrates its practical utility.
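The abstract does not give the details of the algorithm, so the following is only a minimal illustrative sketch of the general idea, not the authors' method. It assumes that a "binomial random subset" means each covariate enters a candidate model independently with a fixed probability, that candidate models are ordinary least squares fits without an intercept, that the first round averages models within a group with equal weights, and that the second round fits least-squares weights across groups on a held-out set. The names `draw_subsets`, `rsa_sketch`, `incl_prob`, `n_groups`, and `models_per_group` are hypothetical.

```python
import numpy as np


def draw_subsets(n_features, n_models, incl_prob, rng):
    """Draw boolean masks; each covariate is included independently
    with probability incl_prob (assumed reading of 'binomial subset')."""
    masks = rng.random((n_models, n_features)) < incl_prob
    for mask in masks:
        # Guard against an empty candidate model by forcing one covariate in.
        if not mask.any():
            mask[rng.integers(n_features)] = True
    return masks


def fit_ols(X, y):
    """Least-squares coefficients (no intercept, for brevity)."""
    coef, *_ = np.linalg.lstsq(X, y, rcond=None)
    return coef


def rsa_sketch(X_train, y_train, X_val, y_val, X_new,
               n_groups=5, models_per_group=20, incl_prob=0.3, seed=0):
    """Two-round averaging sketch (both weighting rules are assumptions)."""
    rng = np.random.default_rng(seed)
    n_features = X_train.shape[1]
    val_cols, new_cols = [], []
    for _ in range(n_groups):
        masks = draw_subsets(n_features, models_per_group, incl_prob, rng)
        val_sum = np.zeros(X_val.shape[0])
        new_sum = np.zeros(X_new.shape[0])
        for mask in masks:
            coef = fit_ols(X_train[:, mask], y_train)
            val_sum += X_val[:, mask] @ coef
            new_sum += X_new[:, mask] @ coef
        # Round 1 (assumed): equal weights within each group of models.
        val_cols.append(val_sum / models_per_group)
        new_cols.append(new_sum / models_per_group)
    V = np.column_stack(val_cols)
    N = np.column_stack(new_cols)
    # Round 2 (assumed): least-squares weights across groups on held-out data.
    weights = fit_ols(V, y_val)
    return N @ weights, weights
```

In this sketch, `incl_prob`, `n_groups`, and `models_per_group` play the role of tuning parameters; the abstract states that RSA selects all of its tuning parameters by cross-validation, which would replace the fixed defaults and the single train/validation split used here.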

High-Dimensional Model Averaging via Cross-Validation

Statistics Theory

Helps computers pick the best answers from many guesses.

10 Jun 2025 2

87%

Collective Wisdom: Policy Averaging with an Application to the Newsvendor Problem

Applications

Makes smart choices better and more trustworthy.

22 Mar 2025 1

87%

Spatial weights matrix selection and model averaging for multivariate spatial autoregressive models

Methodology

Finds how online friends influence what you post.

7 Sep 2025 0

