Score: 0

SCMD: A Kernel-Based Distance for Structural Causal Models to Quantify Transferability Across Environments

Published: October 23, 2025 | arXiv ID: 2510.20481v1

By: Théotime Le Goff, Émilie Devijver

Potential Business Impact:

Measures how well AI works in new places.

Business Areas:

Simulation Software

Out-of-distribution generalization is key to building models that remain reliable across diverse environments. Recent causality-based methods address this challenge by learning invariant causal relationships in the underlying data-generating process. Yet, measuring how causal structures differ across environments, and the resulting generalization difficulty, remains difficult. To tackle this challenge, we propose the Structural Causal Model Distance (SCMD), a principled metric that quantifies discrepancies between two SCMs by combining (i) kernel-based distances for nonparametric comparison of distributions and (ii) pairwise interventional comparisons to capture differences in causal effects. We show that SCMD is a proper metric and provide a consistent estimator with theoretical guarantees. Experiments on synthetic and real-world datasets demonstrate that SCMD effectively captures both structural and distributional differences between SCMs, providing a practical tool to assess causal transferability and generalization difficulty. Given two joint distributions P 1 (V j ) and P 2 (V j ), the Maximum Mean Discrepancy (MMD, Gretton et al. which defines a metric between distributions for characteristic kernels (Fukumizu et al., 2007). Kernel conditional mean embeddings (Park and Muandet, 2020) represent conditional expectation operators in an RKHS H Vj , which allows us to define the Maximum Conditional Mean Discrepancy between two