Canonical correlation regression with noisy data
By: Isaac Meza, Rahul Singh
We study instrumental variable regression in data rich environments. The goal is to estimate a linear model from many noisy covariates and many noisy instruments. Our key assumption is that true covariates and true instruments are repetitive, though possibly different in nature; they each reflect a few underlying factors, however those underlying factors may be misaligned. We analyze a family of estimators based on two stage least squares with spectral regularization: canonical correlations between covariates and instruments are learned in the first stage, which are used as regressors in the second stage. As a theoretical contribution, we derive upper and lower bounds on estimation error, proving optimality of the method with noisy data. As a practical contribution, we provide guidance on which types of spectral regularization to use in different regimes.
Similar Papers
Nonparametric Instrumental Variable Regression with Observed Covariates
Machine Learning (Stat)
Helps understand cause and effect better.
Possibilistic Instrumental Variable Regression
Methodology
Finds truth even with bad information.
Demystifying Spectral Feature Learning for Instrumental Variable Regression
Machine Learning (Stat)
Finds hidden causes of effects in data.