Score: 0

Model selection with uncertainty in estimating optimal dynamic treatment regimes

Published: December 5, 2025 | arXiv ID: 2512.05695v1

By: Chunyu Wang, Brian Tom

Optimal dynamic treatment regimes (DTRs), as a key part of precision medicine, have progressively gained more attention recently. To inform clinical decision making, interpretable and parsimonious models for contrast functions are preferred, raising concerns about undue misspecification. It is therefore important to properly evaluate the performance of candidate interpretable models and select the one that best approximates the unknown contrast function. Moreover, since a DTR usually involves multiple decision points, an inaccurate approximation at a later decision point affects its estimation at an earlier decision point when a backward induction algorithm is applied. This paper aims to perform model selection for contrast functions in the context of learning optimal DTRs from observed data. Note that the relative performance of candidate models may heavily depend on the sample size when, for example, the comparison is made between parametric and tree-based models. Therefore, instead of investigating the limiting behavior of each candidate model and developing methods to select asymptotically the `correct' one, we focus on the finite sample performance of each model and attempt to perform model selection under a given sample size. To this end, we adopt the counterfactual cross-validation metric and propose a novel method to estimate the variance of the metric. Supplementing the cross-validation metric with its estimated variance allows us to characterize the uncertainty in model selection under a given sample size and facilitates hypothesis testing associated with a preferred model structure.

Evaluating and Learning Optimal Dynamic Treatment Regimes under Truncation by Death

Machine Learning (Stat)

Helps doctors choose best treatments for sick patients.

8 Oct 2025 0

87%

Bayesian Machine Learning for Estimating Optimal Dynamic Treatment Regimes with Ordinal Outcomes

Methodology

Helps doctors pick best treatments for patients.

8 Mar 2025 1

87%

Optimal real-time dynamic treatment regimes with application to oxytocin use in preventing postpartum hemorrhage

Methodology

Helps doctors give medicine at the right time.

28 Apr 2025 0

View PDF Login to Bookmark

Model selection with uncertainty in estimating optimal dynamic treatment regimes

Technical Abstract

Evaluating and Learning Optimal Dynamic Treatment Regimes under Truncation by Death

Bayesian Machine Learning for Estimating Optimal Dynamic Treatment Regimes with Ordinal Outcomes

Optimal real-time dynamic treatment regimes with application to oxytocin use in preventing postpartum hemorrhage