Score: 0

Machine learning to optimize precision in the analysis of randomized trials: A journey in pre-specified, yet data-adaptive learning

Published: December 15, 2025 | arXiv ID: 2512.13610v1

By: Laura B. Balzer, Mark J. van der Laan, Maya L. Petersen

Covariate adjustment is an approach to improve the precision of trial analyses by adjusting for baseline variables that are prognostic of the primary endpoint. Motivated by the SEARCH Universal HIV Test-and-Treat Trial (2013-2017), we tell our story of developing, evaluating, and implementing a machine learning-based approach for covariate adjustment. We provide the rationale for as well as the practical concerns with such an approach for estimating marginal effects. Using schematics, we illustrate our procedure: targeted machine learning estimation (TMLE) with Adaptive Pre-specification. Briefly, sample-splitting is used to data-adaptively select the combination of estimators of the outcome regression (i.e., the conditional expectation of the outcome given the trial arm and covariates) and known propensity score (i.e., the conditional probability of being randomized to the intervention given the covariates) that minimizes the cross-validated variance estimate and, thereby, maximizes empirical efficiency. We discuss our approach for evaluating finite sample performance with parametric and plasmode simulations, pre-specifying the Statistical Analysis Plan, and unblinding in real-time on video conference with our colleagues from around the world. We present the results from applying our approach in the primary, pre-specified analysis of 8 recently published trials (2022-2024). We conclude with practical recommendations and an invitation to implement our approach in the primary analysis of your next trial.

Conditional cross-fitting for unbiased machine-learning-assisted covariate adjustment in randomized experiments

Methodology

Makes study results more accurate with less data.

21 Aug 2025 0

89%

A Unified Approach to Covariate Adjustment for Survival Endpoints in Randomized Clinical Trials

Methodology

Makes medical studies more accurate with patient info.

8 May 2025 0

89%

Regression adjustment in covariate-adaptive randomized experiments with missing covariates

Methodology

Fixes missing data in medical tests for better results.

13 Aug 2025 0

View PDF Login to Bookmark

Machine learning to optimize precision in the analysis of randomized trials: A journey in pre-specified, yet data-adaptive learning

Technical Abstract

Conditional cross-fitting for unbiased machine-learning-assisted covariate adjustment in randomized experiments

A Unified Approach to Covariate Adjustment for Survival Endpoints in Randomized Clinical Trials

Regression adjustment in covariate-adaptive randomized experiments with missing covariates