Causal-Policy Forest for End-to-End Policy Learning
By: Masahiro Kato
This study proposes an end-to-end algorithm for policy learning in causal inference. We observe data consisting of covariates, treatment assignments, and outcomes, where only the outcome corresponding to the assigned treatment is observed. The goal of policy learning is to use the observed data to train a policy, a function that recommends an optimal treatment for each individual, so as to maximize the policy value. In this study, we first show that maximizing the policy value is equivalent to minimizing the mean squared error for the conditional average treatment effect (CATE) under $\{-1, 1\}$-restricted regression models. Based on this finding, we modify the causal forest, an end-to-end CATE estimation algorithm, for policy learning. We refer to our algorithm as the causal-policy forest. Our algorithm has three advantages. First, it is a simple modification of an existing, widely used CATE estimation method and therefore helps bridge the gap between policy learning and CATE estimation in practice. Second, while existing studies typically estimate nuisance parameters for policy learning as a separate task, our algorithm trains the policy in a more end-to-end manner. Third, as in standard decision trees and random forests, the model can be trained efficiently, avoiding computational intractability.
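To make the stated equivalence concrete, the following is a minimal sketch of the standard argument; the notation is introduced here rather than taken from the abstract and may differ from the paper's: potential outcomes $Y(1)$ and $Y(0)$ for the two treatment arms, CATE $\tau(x) = \mathbb{E}[Y(1) - Y(0) \mid X = x]$, and a policy $\pi \colon \mathcal{X} \to \{-1, 1\}$ with $\pi(x) = 1$ meaning "treat".

\begin{align*}
V(\pi) &= \mathbb{E}\!\left[\tfrac{1+\pi(X)}{2}\, Y(1) + \tfrac{1-\pi(X)}{2}\, Y(0)\right]
        = \tfrac{1}{2}\,\mathbb{E}[Y(1) + Y(0)] + \tfrac{1}{2}\,\mathbb{E}[\pi(X)\,\tau(X)], \\
\mathbb{E}\!\left[\bigl(\tau(X) - \pi(X)\bigr)^{2}\right]
       &= \mathbb{E}[\tau(X)^{2}] + 1 - 2\,\mathbb{E}[\pi(X)\,\tau(X)]
        \qquad \text{since } \pi(X)^{2} = 1.
\end{align*}

The only term that depends on $\pi$ in either expression is $\mathbb{E}[\pi(X)\,\tau(X)]$, so maximizing the policy value $V(\pi)$ and minimizing the mean squared error over $\{-1, 1\}$-valued models select the same policy, $\pi(x) = \mathrm{sign}(\tau(x))$.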