SCMD: A Kernel-Based Distance for Structural Causal Models to Quantify Transferability Across Environments
By: Théotime Le Goff, Émilie Devijver
Potential Business Impact:
Measures how well AI works in new places.
Out-of-distribution generalization is key to building models that remain reliable across diverse environments. Recent causality-based methods address this challenge by learning invariant causal relationships in the underlying data-generating process. Yet, measuring how causal structures differ across environments, and the resulting generalization difficulty, remains difficult. To tackle this challenge, we propose the Structural Causal Model Distance (SCMD), a principled metric that quantifies discrepancies between two SCMs by combining (i) kernel-based distances for nonparametric comparison of distributions and (ii) pairwise interventional comparisons to capture differences in causal effects. We show that SCMD is a proper metric and provide a consistent estimator with theoretical guarantees. Experiments on synthetic and real-world datasets demonstrate that SCMD effectively captures both structural and distributional differences between SCMs, providing a practical tool to assess causal transferability and generalization difficulty. Given two joint distributions P 1 (V j ) and P 2 (V j ), the Maximum Mean Discrepancy (MMD, Gretton et al. which defines a metric between distributions for characteristic kernels (Fukumizu et al., 2007). Kernel conditional mean embeddings (Park and Muandet, 2020) represent conditional expectation operators in an RKHS H Vj , which allows us to define the Maximum Conditional Mean Discrepancy between two
Similar Papers
A Practical Introduction to Kernel Discrepancies: MMD, HSIC & KSD
Machine Learning (Stat)
Measures how different two groups of data are.
Kernel-Based Evaluation of Conditional Biological Sequence Models
Machine Learning (Stat)
Helps scientists check if computer models understand biology.
Fast and Scalable Score-Based Kernel Calibration Tests
Machine Learning (Stat)
Checks if computer predictions are trustworthy.