Score: 0

Causal Inference, Biomarker Discovery, Graph Neural Network, Feature Selection

Published: November 17, 2025 | arXiv ID: 2511.13295v1

By: Chaowang Lan , Jingxin Wu , Yulong Yuan and more

Potential Business Impact:

Finds better disease clues in our genes.

Business Areas:
Bioinformatics Biotechnology, Data and Analytics, Science and Engineering

Biomarker discovery from high-throughput transcriptomic data is crucial for advancing precision medicine. However, existing methods often neglect gene-gene regulatory relationships and lack stability across datasets, leading to conflation of spurious correlations with genuine causal effects. To address these issues, we develop a causal graph neural network (Causal-GNN) method that integrates causal inference with multi-layer graph neural networks (GNNs). The key innovation is the incorporation of causal effect estimation for identifying stable biomarkers, coupled with a GNN-based propensity scoring mechanism that leverages cross-gene regulatory networks. Experimental results demonstrate that our method achieves consistently high predictive accuracy across four distinct datasets and four independent classifiers. Moreover, it enables the identification of more stable biomarkers compared to traditional methods. Our work provides a robust, efficient, and biologically interpretable tool for biomarker discovery, demonstrating strong potential for broad application across medical disciplines.

Page Count
17 pages

Category
Quantitative Biology:
Quantitative Methods