Score: 1

Bayesian Causal Effect Estimation for Categorical Data using Staged Tree Models

Published: November 5, 2025 | arXiv ID: 2511.03399v1

By: Andrea Cremaschi, Manuele Leonelli, Gherardo Varando

Potential Business Impact:

Finds what truly causes health problems.

Business Areas:
A/B Testing Data and Analytics

We propose a fully Bayesian approach for causal inference with multivariate categorical data based on staged tree models, a class of probabilistic graphical models capable of representing asymmetric and context-specific dependencies. To account for uncertainty in both structure and parameters, we introduce a flexible family of prior distributions over staged trees. These include product partition models to encourage parsimony, a novel distance-based prior to promote interpretable dependence patterns, and an extension that incorporates continuous covariates into the learning process. Posterior inference is achieved via a tailored Markov Chain Monte Carlo algorithm with split-and-merge moves, yielding posterior samples of staged trees from which average treatment effects and uncertainty measures are derived. Posterior summaries and uncertainty measures are obtained via techniques from the Bayesian nonparametrics literature. Two case studies on electronic fetal monitoring and cesarean delivery and on anthracycline therapy and cardiac dysfunction in breast cancer illustrate the methods.

Repos / Data Links

Page Count
41 pages

Category
Statistics:
Methodology