Score: 3

It's Hard to Be Normal: The Impact of Noise on Structure-agnostic Estimation

Published: July 3, 2025 | arXiv ID: 2507.02275v2

By: Jikai Jin, Lester Mackey, Vasilis Syrgkanis

BigTech Affiliations: Microsoft Stanford University

Potential Business Impact:

Makes computer predictions more accurate with messy data.

Business Areas:
A/B Testing Data and Analytics

Structure-agnostic causal inference studies how well one can estimate a treatment effect given black-box machine learning estimates of nuisance functions (like the impact of confounders on treatment and outcomes). Here, we find that the answer depends in a surprising way on the distribution of the treatment noise. Focusing on the partially linear model of \citet{robinson1988root}, we first show that the widely adopted double machine learning (DML) estimator is minimax rate-optimal for Gaussian treatment noise, resolving an open problem of \citet{mackey2018orthogonal}. Meanwhile, for independent non-Gaussian treatment noise, we show that DML is always suboptimal by constructing new practical procedures with higher-order robustness to nuisance errors. These \emph{ACE} procedures use structure-agnostic cumulant estimators to achieve $r$-th order insensitivity to nuisance errors whenever the $(r+1)$-st treatment cumulant is non-zero. We complement these core results with novel minimax guarantees for binary treatments in the partially linear model. Finally, using synthetic demand estimation experiments, we demonstrate the practical benefits of our higher-order robust estimators.

Country of Origin
πŸ‡ΊπŸ‡Έ United States

Repos / Data Links

Page Count
63 pages

Category
Statistics:
Machine Learning (Stat)