Score: 1

A Fast Kernel-based Conditional Independence test with Application to Causal Discovery

Published: May 16, 2025 | arXiv ID: 2505.11085v1

By: Oliver Schacht, Biwei Huang

Potential Business Impact:

Makes finding causes in big data much faster.

Business Areas:
A/B Testing Data and Analytics

Kernel-based conditional independence (KCI) testing is a powerful nonparametric method commonly employed in causal discovery tasks. Despite its flexibility and statistical reliability, cubic computational complexity limits its application to large datasets. To address this computational bottleneck, we propose \textit{FastKCI}, a scalable and parallelizable kernel-based conditional independence test that utilizes a mixture-of-experts approach inspired by embarrassingly parallel inference techniques for Gaussian processes. By partitioning the dataset based on a Gaussian mixture model over the conditioning variables, FastKCI conducts local KCI tests in parallel, aggregating the results using an importance-weighted sampling scheme. Experiments on synthetic datasets and benchmarks on real-world production data validate that FastKCI maintains the statistical power of the original KCI test while achieving substantial computational speedups. FastKCI thus represents a practical and efficient solution for conditional independence testing in causal inference on large-scale data.

Country of Origin
πŸ‡ΊπŸ‡Έ πŸ‡©πŸ‡ͺ United States, Germany

Page Count
13 pages

Category
Computer Science:
Machine Learning (CS)