The Bias-Variance Tradeoff in Data-Driven Optimization: A Local Misspecification Perspective
By: Haixiang Lan , Luofeng Liao , Adam N. Elmachtoub and more
Potential Business Impact:
Improves computer learning by balancing guessing and certainty.
Data-driven stochastic optimization is ubiquitous in machine learning and operational decision-making problems. Sample average approximation (SAA) and model-based approaches such as estimate-then-optimize (ETO) or integrated estimation-optimization (IEO) are all popular, with model-based approaches being able to circumvent some of the issues with SAA in complex context-dependent problems. Yet the relative performance of these methods is poorly understood, with most results confined to the dichotomous cases of the model-based approach being either well-specified or misspecified. We develop the first results that allow for a more granular analysis of the relative performance of these methods under a local misspecification setting, which models the scenario where the model-based approach is nearly well-specified. By leveraging tools from contiguity theory in statistics, we show that there is a bias-variance tradeoff between SAA, IEO, and ETO under local misspecification, and that the relative importance of the bias and the variance depends on the degree of local misspecification. Moreover, we derive explicit expressions for the decision bias, which allows us to characterize (un)impactful misspecification directions, and provide further geometric understanding of the variance.
Similar Papers
Dissecting the Impact of Model Misspecification in Data-driven Optimization
Machine Learning (CS)
Improves computer decisions when learning from imperfect data.
When three experiments are better than two: Avoiding intractable correlated aleatoric uncertainty by leveraging a novel bias--variance tradeoff
Machine Learning (CS)
Helps computers learn faster with noisy data.
Accounting for Underspecification in Statistical Claims of Model Superiority
Machine Learning (CS)
Makes AI in medicine more trustworthy and reliable.