Subgroup Identification and Individualized Treatment Policies: A Tutorial on the Hybrid Two-Stage Workflow
By: Nan Miles Xi, Xin Huang, Lin Wang
Potential Business Impact:
Finds best medicine for each person.
Patients in clinical studies often exhibit heterogeneous treatment effect (HTE). Classical subgroup analyses provide inferential tools to test for effect modification, while modern machine learning methods estimate the Conditional Average Treatment Effect (CATE) to enable individual level prediction. Each paradigm has limitations: inference focused approaches may sacrifice predictive utility, and prediction focused approaches often lack statistical guarantees. We present a hybrid two-stage workflow that integrates these perspectives. Stage 1 applies statistical inference to test whether credible treatment effect heterogeneity exists with the protection against spurious findings. Stage 2 translates heterogeneity evidence into individualized treatment policies, evaluated by cross fitted doubly robust (DR) metrics with Neyman-Pearson (NP) constraints on harm. We illustrate the workflow with working examples based on simulated data and a real ACTG 175 HIV trial. This tutorial provides practical implementation checklists and discusses links to sponsor oriented HTE workflows, offering a transparent and auditable pathway from heterogeneity assessment to individualized treatment policies.
Similar Papers
Causal Clustering for Conditional Average Treatment Effects Estimation and Subgroup Discovery
Machine Learning (Stat)
Finds groups who benefit most from treatments.
Assumption-Lean Differential Variance Inference for Heterogeneous Treatment Effect Detection
Methodology
Tests if treatments work the same for everyone.
Assumption-Lean Differential Variance Inference for Heterogeneous Treatment Effect Detection
Methodology
Tests if treatments work the same for everyone.