Score: 0

Identifying counterfactual probabilities using bivariate distributions and uplift modeling

Published: December 9, 2025 | arXiv ID: 2512.08805v1

By: Théo Verhelst, Gianluca Bontempi

Potential Business Impact:

Finds out if a sale *really* made a customer buy.

Business Areas:
A/B Testing Data and Analytics

Uplift modeling estimates the causal effect of an intervention as the difference between potential outcomes under treatment and control, whereas counterfactual identification aims to recover the joint distribution of these potential outcomes (e.g., "Would this customer still have churned had we given them a marketing offer?"). This joint counterfactual distribution provides richer information than the uplift but is harder to estimate. However, the two approaches are synergistic: uplift models can be leveraged for counterfactual estimation. We propose a counterfactual estimator that fits a bivariate beta distribution to predicted uplift scores, yielding posterior distributions over counterfactual outcomes. Our approach requires no causal assumptions beyond those of uplift modeling. Simulations show the efficacy of the approach, which can be applied, for example, to the problem of customer churn in telecom, where it reveals insights unavailable to standard ML or uplift models alone.

Page Count
7 pages

Category
Computer Science:
Machine Learning (CS)