Calibrated Principal Component Regression
By: Yixuan Florence Wu, Yilun Zhu, Lei Cao, and more
Potential Business Impact:
Improves predictions when data has many more features than examples.
We propose a new method for statistical inference in generalized linear models. In the overparameterized regime, Principal Component Regression (PCR) reduces variance by projecting high-dimensional data onto a low-dimensional principal subspace before fitting. However, PCR incurs truncation bias whenever the true regression vector has mass outside the retained principal components (PCs). To mitigate this bias, we propose Calibrated Principal Component Regression (CPCR), which first learns a low-variance prior in the PC subspace and then calibrates the model in the original feature space via a centered Tikhonov step. CPCR leverages cross-fitting and controls the truncation bias by softening PCR's hard cutoff. Theoretically, we calculate the out-of-sample risk in the random matrix regime, showing that CPCR outperforms standard PCR when the regression signal has non-negligible components in low-variance directions. Empirically, CPCR consistently improves prediction across multiple overparameterized problems. The results highlight CPCR's stability and flexibility in modern overparameterized settings.
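To make the two-step structure concrete, here is a minimal sketch of the CPCR idea for the linear (squared-loss) special case, based only on the abstract's description: a PCR fit on one fold supplies a low-variance prior, and a ridge step centered at that prior (centered Tikhonov) on the other fold softens PCR's hard cutoff. The fold split, the number of retained components `k`, and the ridge strength `lam` are illustrative assumptions, not the paper's actual tuning choices.

```python
# Sketch of the CPCR idea (linear case), assuming a simple two-fold cross-fit.
import numpy as np

def cpcr_fit(X, y, k=10, lam=1.0, seed=0):
    """Step 1: PCR prior on fold A. Step 2: centered Tikhonov on fold B."""
    rng = np.random.default_rng(seed)
    n, p = X.shape
    idx = rng.permutation(n)
    a, b = idx[: n // 2], idx[n // 2:]            # cross-fitting split (assumed)

    # Step 1: PCR on fold A -> low-variance prior beta0 in the top-k PC subspace.
    Xa, ya = X[a], y[a]
    _, _, Vt = np.linalg.svd(Xa, full_matrices=False)
    Vk = Vt[:k].T                                  # top-k principal directions
    gamma = np.linalg.lstsq(Xa @ Vk, ya, rcond=None)[0]
    beta0 = Vk @ gamma                             # prior mapped back to feature space

    # Step 2: ridge regression on fold B shrinking toward beta0 instead of 0,
    # i.e. minimize ||y - X beta||^2 + lam * ||beta - beta0||^2.
    Xb, yb = X[b], y[b]
    beta = np.linalg.solve(Xb.T @ Xb + lam * np.eye(p), Xb.T @ yb + lam * beta0)
    return beta
```

Because the penalty is centered at the PCR prior rather than at zero, directions discarded by PCR are shrunk rather than zeroed out, which is one way to read the abstract's "softening PCR's hard cutoff."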
Similar Papers
Estimating the true number of principal components under the random design
Econometrics
Finds the best way to simplify complex data.
Identifying Neural Signatures from fMRI using Hybrid Principal Components Regression
Machine Learning (Stat)
Reads thoughts better by focusing on important brain signals.
Highly robust factored principal component analysis for matrix-valued outlier accommodation and explainable detection via matrix minimum covariance determinant
Methodology
Finds bad data points in complex pictures.