Improving Sensitivity in A/B Tests: Integrating CUPED with Trimmed Mean Techniques
By: Kevin Charette, Tristan Boudreault
Potential Business Impact:
Finds real results faster in online tests.
Accurate estimation of treatment effects in online A/B testing is challenging with zero-inflated and skewed metrics. Traditional tests, like Welch's t-test, often lack sensitivity with heavy-tailed data due to their reliance on means, as opposed to e.g., percentiles. The Controlled Experiments Using Pre-experiment Data (CUPED) technique improves sensitivity by reducing variance, yet that variance reduction is insufficient for highly skewed metrics. Alternatively, Yuen's t-test uses trimmed means to robustly handle outliers and skewness. This paper introduces a method that combines the variance reduction of CUPED with the robustness of Yuen's t-test to enhance hypothesis testing sensitivity. Our novel approach integrates trimmed data in a principled manner, offering a framework that balances variance reduction with robust location measures. We demonstrate improved detection of significant effects with smaller sample sizes, enabling quicker experimental decisions without sacrificing statistical power. This work broadens the utility of controlled experiments in environments characterized by highly skewed or high-variance data.
Similar Papers
Beyond Basic A/B testing: Improving Statistical Efficiency for Business Growth
Methodology
Improves website tests for better business results.
Beyond Normality: Reliable A/B Testing with Non-Gaussian Data
Machine Learning (Stat)
Fixes online tests to make better choices.
Multi-Metric Adaptive Experimental Design under Fixed Budget with Validation
Machine Learning (CS)
Finds best website changes faster and more reliably.