Inference for Forecasting Accuracy: Pooled versus Individual Estimators in High-dimensional Panel Data

Published: December 17, 2025 | arXiv ID: 2512.15592v1

By: Tim Kutta, Martin Schumann, Holger Dette

Panels with large time $(T)$ and cross-sectional $(N)$ dimensions are a key data structure in social sciences and other fields. A central question in panel data analysis is whether to pool data across individuals or to estimate separate models. Pooled estimators typically have lower variance but may suffer from bias, creating a fundamental trade-off for optimal estimation. We develop a new inference method to compare the forecasting performance of pooled and individual estimators. Specifically, we propose a confidence interval for the difference between their forecasting errors and establish its asymptotic validity. Our theory allows for complex temporal and cross-sectional dependence in the model errors and covers scenarios where $N$ can be much larger than $T$, including the independent case under the classical condition $N/T^2 \to 0$. The finite-sample properties of the proposed method are examined in an extensive simulation study.
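The bias-variance trade-off described in the abstract can be illustrated with a small simulation. The sketch below is not the authors' method and does not construct their confidence interval; it only compares the one-step-ahead squared forecast errors of a pooled and an individual least-squares slope. The data-generating model $y_{it} = \beta_i x_{it} + \varepsilon_{it}$ with independent standard normal regressors and errors, the function name `simulate_forecast_errors`, and the `heterogeneity` parameter are all assumptions made for illustration.

```python
import random


def simulate_forecast_errors(n_units, n_periods, heterogeneity, seed=0):
    """Return (pooled_mse, individual_mse) for one-step-ahead forecasts.

    Assumed toy model: y_it = beta_i * x_it + eps_it, where
    beta_i = 1 + heterogeneity * z_i and x, eps, z are i.i.d. N(0, 1).
    """
    rng = random.Random(seed)
    pooled_num = 0.0
    pooled_den = 0.0
    units = []
    for _ in range(n_units):
        beta_i = 1.0 + heterogeneity * rng.gauss(0.0, 1.0)
        # n_periods training observations plus one held-out period
        xs = [rng.gauss(0.0, 1.0) for _ in range(n_periods + 1)]
        ys = [beta_i * x + rng.gauss(0.0, 1.0) for x in xs]
        sxy = sum(x * y for x, y in zip(xs[:-1], ys[:-1]))
        sxx = sum(x * x for x in xs[:-1])
        pooled_num += sxy
        pooled_den += sxx
        # individual slope estimate, held-out regressor and outcome
        units.append((sxy / sxx, xs[-1], ys[-1]))

    # one common slope estimated from all units together
    beta_pooled = pooled_num / pooled_den
    pooled_mse = sum((y - beta_pooled * x) ** 2 for _, x, y in units) / n_units
    individual_mse = sum((y - b * x) ** 2 for b, x, y in units) / n_units
    return pooled_mse, individual_mse


if __name__ == "__main__":
    # Homogeneous slopes, short panel: pooling has no bias and less variance.
    print(simulate_forecast_errors(2000, 5, heterogeneity=0.0, seed=1))
    # Heterogeneous slopes, longer panel: pooling is biased for each unit.
    print(simulate_forecast_errors(2000, 50, heterogeneity=2.0, seed=1))
```

In this toy setting, pooling tends to forecast better when the slopes are identical and $T$ is small, while individual estimation tends to win once the slopes differ substantially. A point comparison like this one carries no measure of uncertainty, which is the gap the proposed confidence interval is meant to address.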

Training and Testing with Multiple Splits: A Central Limit Theorem for Split-Sample Estimators

Econometrics

Improves computer learning by using data smarter.

7 Nov 2025 0

86%

Robust Inference Methods for Latent Group Panel Models under Possible Group Non-Separation

Econometrics

Finds hidden patterns in data to make better predictions.

23 Nov 2025 1

86%

Learning Across Experiments and Time: Tackling Heterogeneity in A/B Testing

Methodology

Makes online tests give truer results sooner.

26 Nov 2025 0
