On the identifiability of causal graphs with multiple environments
By: Francesco Montagna
Potential Business Impact:
Finds cause-and-effect relationships using different data.
Causal discovery from i.i.d. observational data is known to be generally ill-posed. We demonstrate that if we have access to the distribution of a structural causal model, and additional data from only two environments that sufficiently differ in the noise statistics, the unique causal graph is identifiable. Notably, this is the first result in the literature that guarantees the entire causal graph recovery with a constant number of environments and arbitrary nonlinear mechanisms. Our only constraint is the Gaussianity of the noise terms; however, we propose potential ways to relax this requirement. Of interest on its own, we expand on the well-known duality between independent component analysis (ICA) and causal discovery; recent advancements have shown that nonlinear ICA can be solved from multiple environments, at least as many as the number of sources: we show that the same can be achieved for causal discovery while having access to much less auxiliary information.
Similar Papers
Causal Effect Identification in Heterogeneous Environments from Higher-Order Moments
Artificial Intelligence
Finds hidden causes even when data changes.
Rethinking Causal Discovery Through the Lens of Exchangeability
Machine Learning (CS)
Finds hidden causes in data better.
Environment Inference for Learning Generalizable Dynamical System
Machine Learning (CS)
Finds hidden patterns in data without knowing the environment.