Identifying counterfactual probabilities using bivariate distributions and uplift modeling
By: Théo Verhelst, Gianluca Bontempi
Potential Business Impact:
Finds out if a sale *really* made a customer buy.
Uplift modeling estimates the causal effect of an intervention as the difference between potential outcomes under treatment and control, whereas counterfactual identification aims to recover the joint distribution of these potential outcomes (e.g., "Would this customer still have churned had we given them a marketing offer?"). This joint counterfactual distribution provides richer information than the uplift but is harder to estimate. However, the two approaches are synergistic: uplift models can be leveraged for counterfactual estimation. We propose a counterfactual estimator that fits a bivariate beta distribution to predicted uplift scores, yielding posterior distributions over counterfactual outcomes. Our approach requires no causal assumptions beyond those of uplift modeling. Simulations show the efficacy of the approach, which can be applied, for example, to the problem of customer churn in telecom, where it reveals insights unavailable to standard ML or uplift models alone.
Similar Papers
CID: Measuring Feature Importance Through Counterfactual Distributions
Machine Learning (CS)
Shows why computers make certain choices.
Bounding Causal Effects and Counterfactuals
Machine Learning (CS)
Finds best answers when data is incomplete.
LeapFactual: Reliable Visual Counterfactual Explanation Using Conditional Flow Matching
Machine Learning (CS)
Shows how to change answers to be correct.