Does Rerandomization Help Beyond Covariate Adjustment? A Review and Guide for Theory and Practice
By: Antônio Carlos Herling Ribeiro Junior, Zach Branson
Rerandomization is a modern experimental design technique that repeatedly randomizes treatment assignments until covariates are deemed balanced between treatment groups. This enhances the precision and coherence of causal effect estimators, mitigates false discoveries from p-hacking, and increases statistical power. Recent work suggests that balancing covariates via rerandomization does not alter the asymptotic precision of covariate-adjusted estimators, thereby making it unclear whether rerandomization is worthwhile if adjusted estimators are used. However, these results have two key caveats. First, these results are asymptotic, leaving finite sample performance unknown. Second, these results focus on precision, while other potential benefits, such as increased coherence among flexible estimators, remain understudied. Hence, in this paper we provide three main contributions: (i) a comprehensive review of the rerandomization literature, covering historical foundations, theoretical developments, and recent methodological advancements, (ii) an extensive simulation study examining finite-sample performance, and (iii) a practical guide for practitioners. Our study compares precision, coherence, power, and coverage of various estimators under rerandomization versus complete randomization. We find rerandomization to be a complementary design strategy that enhances the precision, robustness, and reliability of causal effect estimators, especially for smaller sample sizes.
Similar Papers
Rerandomization for covariate balance mitigates p-hacking in regression adjustment
Statistics Theory
Stops cheating in science tests.
Regression adjustment in covariate-adaptive randomized experiments with missing covariates
Methodology
Fixes missing data in medical tests for better results.
Design-based finite-sample analysis for regression adjustment
Statistics Theory
Makes study results more accurate, even with lots of data.