Foundation Models for Causal Inference via Prior-Data Fitted Networks
By: Yuchen Ma, Dennis Frauen, Emil Javurek and more
Potential Business Impact:
Helps computers estimate cause and effect from tabular data with a single pre-trained model.
Prior-data fitted networks (PFNs) have recently been proposed as a promising way to train tabular foundation models. PFNs are transformers that are pre-trained on synthetic data generated from a prespecified prior distribution and that enable Bayesian inference through in-context learning. In this paper, we introduce CausalFM, a comprehensive framework for training PFN-based foundation models in various causal inference settings. First, we formalize the construction of Bayesian priors for causal inference based on structural causal models (SCMs) in a principled way and derive necessary criteria for the validity of such priors. Building on this, we propose a novel family of prior distributions using causality-inspired Bayesian neural networks that enable CausalFM to perform Bayesian causal inference in various settings, including back-door, front-door, and instrumental variable adjustment. Finally, we instantiate CausalFM and explicitly train a foundation model for estimating conditional average treatment effects (CATEs) using back-door adjustment. We show that CausalFM performs competitively for CATE estimation using various synthetic and semi-synthetic benchmarks. In sum, our framework can be used as a general recipe to train foundation models for various causal inference settings. In contrast to the current state-of-the-art in causal inference, CausalFM offers a novel paradigm with the potential to fundamentally change how practitioners perform causal inference in medicine, economics, and other disciplines.
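To make the idea of pre-training on synthetic data from a causal prior more concrete, the following is a minimal sketch of how one synthetic dataset for the back-door setting could be sampled together with its ground-truth CATE. It is an illustration under stated assumptions, not the prior proposed in the paper: the function name `sample_backdoor_dataset`, the linear-plus-nonlinearity mechanisms, and the noise scale are all hypothetical stand-ins for the causality-inspired Bayesian neural networks the abstract describes. Each call draws fresh mechanism parameters, so each dataset corresponds to one SCM drawn from the toy prior.

```python
import numpy as np

def sample_backdoor_dataset(n, d, rng):
    """Draw one synthetic dataset from a toy random SCM (hypothetical prior).

    Assumed causal graph: X -> A, X -> Y, A -> Y, with all confounders X
    observed, so the CATE is identifiable by back-door adjustment.
    """
    # Random mechanism parameters: one draw corresponds to one SCM.
    w_prop = rng.normal(size=d)   # covariates -> treatment propensity
    w_y0 = rng.normal(size=d)     # covariates -> baseline outcome
    w_tau = rng.normal(size=d)    # covariates -> treatment effect

    x = rng.normal(size=(n, d))                     # observed confounders X
    propensity = 1.0 / (1.0 + np.exp(-x @ w_prop))  # P(A = 1 | X)
    a = rng.binomial(1, propensity)                 # binary treatment A
    y0 = np.tanh(x @ w_y0)                          # mean of Y under A = 0
    tau = np.sin(x @ w_tau)                         # true CATE: E[Y(1) - Y(0) | X]
    y = y0 + a * tau + 0.1 * rng.normal(size=n)     # observed (factual) outcome
    return x, a, y, tau

# Usage: one dataset with 200 units and 3 covariates.
rng = np.random.default_rng(0)
x, a, y, tau = sample_backdoor_dataset(200, 3, rng)
```

In a PFN-style setup, many such datasets would be generated, the observed triples (x, a, y) would be given to the transformer as context, and the known `tau` values would serve as training targets. How the actual model consumes the context and what loss it uses are not specified in the abstract, so they are left out of this sketch.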
Similar Papers
Do-PFN: In-Context Learning for Causal Effect Estimation
Machine Learning (CS)
Finds cause and effect without knowing all the rules.
CausalPFN: Amortized Causal Effect Estimation via In-Context Learning
Machine Learning (CS)
Finds what causes what from data automatically.
FairPFN: A Tabular Foundation Model for Causal Fairness
Machine Learning (CS)
Fixes unfair computer decisions without knowing why.