Score: 0

Cluster-Dags as Powerful Background Knowledge For Causal Discovery

Published: December 10, 2025 | arXiv ID: 2512.10032v1

By: Jan Marco Ruiz de Vargas, Kirtan Padh, Niki Kilbertus

Potential Business Impact:

Finds what causes what, even with lots of data.

Business Areas:
Predictive Analytics Artificial Intelligence, Data and Analytics, Software

Finding cause-effect relationships is of key importance in science. Causal discovery aims to recover a graph from data that succinctly describes these cause-effect relationships. However, current methods face several challenges, especially when dealing with high-dimensional data and complex dependencies. Incorporating prior knowledge about the system can aid causal discovery. In this work, we leverage Cluster-DAGs as a prior knowledge framework to warm-start causal discovery. We show that Cluster-DAGs offer greater flexibility than existing approaches based on tiered background knowledge and introduce two modified constraint-based algorithms, Cluster-PC and Cluster-FCI, for causal discovery in the fully and partially observed setting, respectively. Empirical evaluation on simulated data demonstrates that Cluster-PC and Cluster-FCI outperform their respective baselines without prior knowledge.

Country of Origin
🇩🇪 Germany

Page Count
23 pages

Category
Computer Science:
Machine Learning (CS)