Causally Fair Node Classification on Non-IID Graph Data
By: Yucong Dai, Lu Zhang, Yaowei Hu and more
Potential Business Impact:
Makes computer predictions fairer for everyone.
Fair machine learning seeks to identify and mitigate biases in predictions against disadvantaged populations characterized by demographic attributes such as race and gender. Recently, a few works have extended fairness to graph data, such as social networks, but most of them neglect the causal relationships among data instances. Fairness-aware ML algorithms typically assume Independent and Identically Distributed (IID) data; this paper tackles the overlooked non-IID, graph-based setting, where data instances are interconnected and these connections influence the outcomes of fairness interventions. We base our research on the Network Structural Causal Model (NSCM) framework and posit two main assumptions, Decomposability and Graph Independence, which enable the computation of interventional distributions in non-IID settings using the $do$-calculus. Building on these assumptions, we develop the Message Passing Variational Autoencoder for Causal Inference (MPVA), which estimates interventional distributions and uses those estimates to perform causally fair node classification. Empirical evaluations on semi-synthetic and real-world datasets demonstrate that MPVA outperforms conventional methods by effectively approximating interventional distributions and mitigating bias. These findings underscore the potential of causality-based fairness in complex ML applications and set the stage for further research into relaxing the initial assumptions.
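To make the notion of an interventional distribution on interconnected data more concrete, here is a minimal toy sketch in Python. It is not MPVA and not the paper's NSCM: the structural equation, the weights `own_w` and `nbr_w`, the threshold, and the function names are all hypothetical, chosen only for illustration. Each node's outcome depends on its own sensitive attribute and on the mean sensitive attribute of its neighbors, so an intervention on the sensitive attribute propagates along edges. The sketch estimates the gap between the positive-prediction rates under do(S=1) and do(S=0) by simulation.

```python
import random


def simulate_outcomes(adjacency, do_s, seed, own_w=0.8, nbr_w=0.5, threshold=0.5):
    """Simulate binary outcomes on a toy network SCM under do(S = do_s).

    Hypothetical structural equation (an assumption, not from the paper):
        Y_i = 1 if own_w * S_i + nbr_w * mean(S_j for neighbors j) + U_i > threshold
    The intervention sets every node's sensitive attribute to do_s.
    """
    rng = random.Random(seed)  # same seed -> same exogenous noise U_i
    n = len(adjacency)
    s = [do_s] * n
    outcomes = []
    for i in range(n):
        nbrs = adjacency[i]
        nbr_mean = sum(s[j] for j in nbrs) / len(nbrs) if nbrs else 0.0
        noise = rng.gauss(0.0, 1.0)
        score = own_w * s[i] + nbr_w * nbr_mean + noise
        outcomes.append(1 if score > threshold else 0)
    return outcomes


def interventional_gap(adjacency, n_draws=200, **weights):
    """Average of P(Y=1 | do(S=1)) - P(Y=1 | do(S=0)) over noise draws."""
    total = 0.0
    for seed in range(n_draws):
        y1 = simulate_outcomes(adjacency, 1, seed, **weights)
        y0 = simulate_outcomes(adjacency, 0, seed, **weights)
        total += sum(y1) / len(y1) - sum(y0) / len(y0)
    return total / n_draws


# A 4-node ring: node i is linked to its two neighbors.
ring = [[1, 3], [0, 2], [1, 3], [0, 2]]
gap = interventional_gap(ring)
no_effect = interventional_gap(ring, own_w=0.0, nbr_w=0.0)
print(f"gap with causal influence: {gap:.3f}")
print(f"gap without causal influence: {no_effect:.3f}")
```

In this toy model, a gap near zero would indicate that the sensitive attribute has no causal influence on the outcome, either directly or through neighbors. Setting `own_w=0.0` while keeping `nbr_w` positive still yields a nonzero gap, which is the kind of neighbor-mediated effect an IID analysis would miss.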
Similar Papers
Causal representation learning from network data
Machine Learning (CS)
Finds hidden causes in complex systems.
Improving Fairness in Graph Neural Networks via Counterfactual Debiasing
Machine Learning (CS)
Makes computer predictions fairer by adding fake data.
Testing Individual Fairness in Graph Neural Networks
Machine Learning (CS)
Makes AI fair for everyone, not just groups.