Density Ratio-based Causal Discovery from Bivariate Continuous-Discrete Data
By: Takashi Nicholas Maeda, Shohei Shimizu, Hidetoshi Matsui
Potential Business Impact:
Finds cause when one thing is a number, another is a choice.
This paper proposes a causal discovery method for mixed bivariate data consisting of one continuous and one discrete variable. Existing constraint-based approaches are ineffective in the bivariate setting, as they rely on conditional independence tests that are not suited to bivariate data. Score-based methods either impose strong distributional assumptions or face challenges in fairly comparing causal directions between variables of different types, due to differences in their information content. We introduce a novel approach that determines causal direction by analyzing the monotonicity of the conditional density ratio of the continuous variable, conditioned on different values of the discrete variable. Our theoretical analysis shows that the conditional density ratio exhibits monotonicity when the continuous variable causes the discrete variable, but not in the reverse direction. This property provides a principled basis for comparing causal directions between variables of different types, free from strong distributional assumptions and bias arising from differences in their information content. We demonstrate its effectiveness through experiments on both synthetic and real-world datasets, showing superior accuracy compared to existing methods.
Similar Papers
Testing Conditional Independence via Density Ratio Regression
Methodology
Finds hidden connections in messy data.
Density Ratio-based Proxy Causal Learning Without Density Ratios
Machine Learning (CS)
Finds hidden causes even with missing information.
Estimating Unbounded Density Ratios: Applications in Error Control under Covariate Shift
Machine Learning (Stat)
Makes computer learning better with tricky data.